The circular law. Lewis Memorial Lecture / DIMACS minicourse March 19, Terence Tao (UCLA)

Similar documents
Random regular digraphs: singularity and spectrum

Inhomogeneous circular laws for random matrices with non-identically distributed entries

Random Matrices: Invertibility, Structure, and Applications

Least singular value of random matrices. Lewis Memorial Lecture / DIMACS minicourse March 18, Terence Tao (UCLA)

Random matrices: Distribution of the least singular value (via Property Testing)

Small Ball Probability, Arithmetic Structure and Random Matrices

Exponential tail inequalities for eigenvalues of random matrices

Invertibility of random matrices

Operator-Valued Free Probability Theory and Block Random Matrices. Roland Speicher Queen s University Kingston

Preface to the Second Edition...vii Preface to the First Edition... ix

Concentration Inequalities for Random Matrices

FREE PROBABILITY THEORY

1 Intro to RMT (Gene)

Eigenvalue variance bounds for Wigner and covariance random matrices

Comparison Method in Random Matrix Theory

Lecture I: Asymptotics for large GUE random matrices

Upper Bound for Intermediate Singular Values of Random Sub-Gaussian Matrices 1

Local law of addition of random matrices

Random Toeplitz Matrices

arxiv: v1 [math-ph] 19 Oct 2018

Eigenvalues and Singular Values of Random Matrices: A Tutorial Introduction

, then the ESD of the matrix converges to µ cir almost surely as n tends to.

The Matrix Dyson Equation in random matrix theory

arxiv: v5 [math.na] 16 Nov 2017

Spectral Continuity Properties of Graph Laplacians

Lectures 6 7 : Marchenko-Pastur Law

The norm of polynomials in large random matrices

Assessing the dependence of high-dimensional time series via sample autocovariances and correlations

The uniform uncertainty principle and compressed sensing Harmonic analysis and related topics, Seville December 5, 2008

AN ELEMENTARY PROOF OF THE SPECTRAL RADIUS FORMULA FOR MATRICES

arxiv: v3 [math.pr] 8 Oct 2017

DISTRIBUTION OF EIGENVALUES OF REAL SYMMETRIC PALINDROMIC TOEPLITZ MATRICES AND CIRCULANT MATRICES

Non white sample covariance matrices.

Random matrices: A Survey. Van H. Vu. Department of Mathematics Rutgers University

On singular values distribution of a matrix large auto-covariance in the ultra-dimensional regime. Title

arxiv: v2 [math.fa] 7 Apr 2010

Selfadjoint Polynomials in Independent Random Matrices. Roland Speicher Universität des Saarlandes Saarbrücken

Random Matrix Theory Lecture 1 Introduction, Ensembles and Basic Laws. Symeon Chatzinotas February 11, 2013 Luxembourg

THE EIGENVALUES AND EIGENVECTORS OF FINITE, LOW RANK PERTURBATIONS OF LARGE RANDOM MATRICES

Fluctuations from the Semicircle Law Lecture 4

Extreme eigenvalues of Erdős-Rényi random graphs

PCA with random noise. Van Ha Vu. Department of Mathematics Yale University

RANDOM MATRICES: TAIL BOUNDS FOR GAPS BETWEEN EIGENVALUES. 1. Introduction

arxiv: v1 [math.pr] 22 May 2008

Strong Convergence of the Empirical Distribution of Eigenvalues of Large Dimensional Random Matrices

Local Kesten McKay law for random regular graphs

Determinantal point processes and random matrix theory in a nutshell

Random Matrices: Beyond Wigner and Marchenko-Pastur Laws

Anti-concentration Inequalities

Convergence of spectral measures and eigenvalue rigidity

Semicircle law on short scales and delocalization for Wigner random matrices

RANDOM MATRICES: LAW OF THE DETERMINANT

Math 108b: Notes on the Spectral Theorem

Local semicircle law, Wegner estimate and level repulsion for Wigner random matrices

Lecture 3. Random Fourier measurements

Invertibility of symmetric random matrices

III. Quantum ergodicity on graphs, perspectives

Large sample covariance matrices and the T 2 statistic

1 Tridiagonal matrices

EIGENVECTORS OF RANDOM MATRICES OF SYMMETRIC ENTRY DISTRIBUTIONS. 1. introduction

1 Math 241A-B Homework Problem List for F2015 and W2016

The dichotomy between structure and randomness. International Congress of Mathematicians, Aug Terence Tao (UCLA)

arxiv: v4 [math.pr] 13 Mar 2012

Concentration inequalities: basics and some new challenges

Random Matrix: From Wigner to Quantum Chaos

arxiv: v3 [math-ph] 21 Jun 2012

arxiv: v2 [math.pr] 16 Aug 2014

Markov operators, classical orthogonal polynomial ensembles, and random matrices

The largest eigenvalues of the sample covariance matrix. in the heavy-tail case

On the distinguishability of random quantum states

On the concentration of eigenvalues of random symmetric matrices

Random Matrix Theory and its Applications to Econometrics

22m:033 Notes: 3.1 Introduction to Determinants

Freeness and the Transpose

A note on a Marčenko-Pastur type theorem for time series. Jianfeng. Yao

BALANCING GAUSSIAN VECTORS. 1. Introduction

Lecture 7: Positive Semidefinite Matrices

Further Mathematical Methods (Linear Algebra) 2002

From the mesoscopic to microscopic scale in random matrix theory

THE FORM SUM AND THE FRIEDRICHS EXTENSION OF SCHRÖDINGER-TYPE OPERATORS ON RIEMANNIAN MANIFOLDS

1 Introduction. 2 Measure theoretic definitions

CHAPTER 8: Matrices and Determinants

Painlevé Representations for Distribution Functions for Next-Largest, Next-Next-Largest, etc., Eigenvalues of GOE, GUE and GSE

Random Fermionic Systems

Chapter 2. Square matrices

Definition (T -invariant subspace) Example. Example

Lecture 8 : Eigenvalues and Eigenvectors

Compound matrices and some classical inequalities

Universality for random matrices and log-gases

THE SPARSE CIRCULAR LAW UNDER MINIMAL ASSUMPTIONS

A Note on the Central Limit Theorem for the Eigenvalue Counting Function of Wigner and Covariance Matrices

1 Lyapunov theory of stability

A Remark on Hypercontractivity and Tail Inequalities for the Largest Eigenvalues of Random Matrices

Numerical Range of non-hermitian random Ginibre matrices and the Dvoretzky theorem

RANDOM MATRICES: OVERCROWDING ESTIMATES FOR THE SPECTRUM. 1. introduction

LUCK S THEOREM ALEX WRIGHT

Review (Probability & Linear Algebra)

Math 102, Winter Final Exam Review. Chapter 1. Matrices and Gaussian Elimination

Szemerédi s regularity lemma revisited. Lewis Memorial Lecture March 14, Terence Tao (UCLA)

ON THE CONVERGENCE OF THE NEAREST NEIGHBOUR EIGENVALUE SPACING DISTRIBUTION FOR ORTHOGONAL AND SYMPLECTIC ENSEMBLES

Transcription:

The circular law Lewis Memorial Lecture / DIMACS minicourse March 19, 2008 Terence Tao (UCLA) 1

Eigenvalue distributions Let M = (a ij ) 1 i n;1 j n be a square matrix. Then one has n (generalised) eigenvalues λ 1,..., λ n C. We are interested in the case when the a ij are iid random variables (e.g. Gaussian or Bernoulli). We would also like to make as few moment assumptions as possible (ideally, just second moment). We would like to understand the asymptotic distribution of the spectrum {λ 1,..., λ n } in the limit n. (Note that there is no natural ordering of the n eigenvalues.) 2

The first task is to get a rough idea of how large the eigenvalues are. If the a ij are bounded (or at least have bounded second moment), then tr(mm ) = 1 i,j n a ij 2 = n j=1 σ2 j has size O(n 2 ) w.h.p. (law of large numbers). Thus we expect the singular values σ j to be of magnitude O( n) on average. It turns out that there is a linear algebra inequality n λ j 2 j=1 which then implies that the eigenvalues λ j also have magnitude O( n) (at least in a root mean square sense). 2 j=1 σ 2 j 3

Proof of linear algebra inequality: By the Jordan normal form, we can write M = AUA 1 where U is upper triangular. By the QR factorisation, we can write A = QR where Q is orthogonal and R is upper triangular. We conclude that M = QV Q 1 where V is upper triangular. The eigenvalues λ j are the diagonal entries of V. Meanwhile, the sum 2 j=1 σ2 j = tr(mm ) = tr(v V ) is the sum of squares of all the elements of V (including the ones above the diagonal). The claim follows. 4

If one assumes fourth moment control on the a ij, one can obtain σ 1 = O(n 1/2 ) with high probability. From the linear algebra inequality sup λ j M = σ 1 1 j n we thus obtain λ j = O(n 1/2 ) for all j (not just most j). But the previous argument is more general (it only needs second moment), and for the task of controlling limiting eigenvalue distributions, it is acceptable to lose control of a minority of the eigenvalues. 5

Since the eigenvalues are expected to have size O( n), it is natural to introduce the normalised spectrum { 1 n λ 1,..., 1 n λ n }. We then define the empirical spectral distribution (ESD) µ n : R 2 [0, 1] by the formula µ n (s, t) := 1 1 #{1 j n : Re( λ n ) s; Im( 1 λ n ) t}. n n n This is a sequence (in n) of (random) functions on the plane. Question: Does the sequence µ n converge to a limit as n? If so, what is this limit? 6

Let µ : R 2 [0, 1] be a (deterministic) function. Definition (Weak convergence) We say that µ n converges weakly to µ if for every s, t R, µ n (s, t) converges to µ(s, t) in probability, i.e. lim P( µ n(s, t) µ(s, t) ε) = 0 for all ε. n Definition (Strong convergence) We say that µ n converges strongly to µ if, with probability 1, the sequence µ n converges uniformly to µ. 7

To oversimplify, weak convergence is asking for P( µ n (s, t) µ(s, t) ε) = o(1) while strong convergence is essentially asking for n=1 P( µ n (s, t) µ(s, t) ε) < (cf. the Borel-Cantelli lemma). Thus one needs quantitative tail bounds (better than O(1/n)) for strong convergence, but only qualitative tail bounds o(1) for weak convergence. [The uniformity in s, t is relatively cheap to attain, due to the monotonicity properties of µ n.] 8

Wigner s semicircular law The situation is very well understood in the self-adjoint case, in which the matrix entries a ij are only iid for i j (and extended to i > j by self-adjointness). In this case, of course, the spectrum is real (and µ n (s, t) is really just a function of s. 9

The famous semi-circular law of Wigner then asserts that µ n converges strongly to the semicircular distribution µ(s) = s 1 dµ, where µ := (1 2π s2 /4) 1/2 + ds, at least in the case of the Gaussian ensemble. It can be extended to the case when a ij are independent for i j, have mean zero, unit variance, and uniformly controlled second moment (thus E a ij 2 1 aij >K 0 as K uniformly in i, j). 10

The semicircular law is proven by the moment method, using the identity s m dµ n (s) = 1 n n j=1 λ m j = 1 n tr(( 1 n M) m ) for m = 0, 1, 2,.... The key step is to show that s m dµ n (s) s m dµ(s) w.h.p. 11

When one only has second moment control, a preliminary truncation argument to force a ij to be bounded is also needed. But this is not difficult, because the spectrum is stable in the self-adjoint case; if M is a self-adjoint perturbation (or truncation) of a self-adjoint matrix M, then the spectrum of M is close to that of M, as can be seen for instance from the Weyl eigenvalue inequalities. 12

The circular law In the non self-adjoint case, there is an analogous law, in which the semi-circular measure dµ = 1 (1 2π s2 /4) 1/2 + is replaced by the circular measure dµ = 1 1 π s 2 +t 2 1 dsdt. Conjecture (circular law). If the a ij are iid with zero mean and unit variance, then µ n converges weakly or strongly to µ. This conjecture is strongly supported by numerics. It is not completely resolved, but there are many partial results. 13

For the Gaussian ensemble, the strong circular law was established in (Mehta, 1967), using an explicit formula for the joint distribution. In (Girko, 1984), Girko proposed a method to establish the circular law in general, but certain tail bounds on least singular values were needed to make the method rigorous. In (Bai, 1997), this was accomplished, proving the strong circular law for continuous complex distributions a ij with bounded sixth moment; this was relaxed to (2 + η) th moment for any η > 0 in (Bai-Silverstein, 2006). Several alternate proofs of the strong and weak circular laws under similar assumptions were also given in (Girko, 2004). 14

In the above arguments, the continuity of the a ij was needed to bound certain least singular values. But with the recent progress on these bounds in the discrete case (Rudelson, 2006; T.-Vu, 2007; Rudelson-Vershynin, 2007; T.-Vu, 2008), as discussed in the previous lecture, it became possible to remove this continuity assumption. 15

In (Götze-Tikhomirov, 2007) the weak circular law for discrete subgaussian ensembles was proven (using the least singular tail bounds of Rudelson); a similar result with the subgaussian hypothesis relaxed to (4 + η) th moment was described in (Girko, 2004). The strong circular law under a fourth moment assumption was established in (Pan-Zhou, 2007), and relaxed to (2 + η) th moment in (T.-Vu, 2008). An alternate proof of the weak circular law under (2 + η) th moment assumption was given in (Götze-Tikhomirov, 2008). Thus we are only an η away from fully resolving the circular law! 16

A new difficulty There is a major new difficulty in the non-selfadjoint case, which manifests itself in two related ways: The spectrum can be very unstable with respect to perturbations. (In particular, truncation is now dangerous.) One cannot always control the spectrum effectively just from knowing a few moments. [Note that the first few moments are very stable with respect to perturbations.] 17

For instance, consider the matrix 0 1 0... 0 0 0 1... 0 M :=....... 0 0 0... 1 0 0 0... 0 18

and the exponentially tiny perturbation 0 1 0... 0 0 0 1... 0 M :=....... 0 0 0... 1 2 n 0 0... 0. 19

The matrix M has characteristic polynomial det(m λi) = ( λ) n and so the spectrum is {0} (with multiplicity n). But the perturbation M has characteristic polynomial det(m λi) = ( λ) n ( 2) n, and so the spectrum is { 1 2 e2πij/n : j = 1,..., n}. An exponentially small change in the matrix has caused the spectrum to move by 1! Note also that trm k = tr(m ) k for all k = 0,..., n 1; the moment method cannot distinguish M from M until the n th moment, which is too difficult to compute in practice. 20

The instability of the spectrum near a complex number λ is closely related to the operator norm of the resolvent (M λi) 1, or equivalently to the least singular value of M λi. In other words, spectral instability is caused by the presence of pseudospectrum. Indeed, if (M λi) 1 is bounded, then a standard Neumann series argument shows that M λi remains invertible for small perturbations M of M, and so the spectrum cannot come close to λ. This already indicates that the least singular values of M λi will play an important role in the argument. 21

Girko s method As the moment method is now unavailable, one must establish the circular law by other means. The key breakthrough was made by Girko in 1984. Instead of moments trm k = n j=1 λk j, the idea was to study the Stieltjes transform dµn (s, t) s n (z) := s + it z = 1 n 1 1 n n λ j z. This is a meromorphic function with simple poles at each eigenvalue. In principle, control on s n leads to control on µ n, and vice versa. j=1 22

Because s n is meromorphic, it suffices to understand the real part s nr. For this, one has the fundamental identity s nr (s + it) = 1 n = d ds = d ds 1 n 1 n n Re j=1 1 1 n λ j s it n 1 log λ j s it n j=1 log det( 1 n M (s + it)i). Thus, in principle, it suffices to understand the quantity 1 n log det( 1 n M (s + it)i). (In practice, one controls the Fourier transform of this quantity.) 23

One can express 1 log det( 1 n n M (s + it)i) in terms of 1 the eigenvalues of n M (s + it)i and thus of M, but this leads us right back to where we started. But instead, we can express this quantity in terms of the singular values σ j ( 1 n M (s + it)i) of 1 n M (s + it)i: 1 n log det( 1 n M (s+it)i) = 1 n n j=1 log σ j ( 1 n M (s+it)i). 24

The reason why this is useful is because, in contrast to the eigenvalues, the distribution of the singular values of 1 n M (s + it)i can be efficiently controlled (in principle, at least) by the moment method: 1 n n j=1 σ j ( 1 n M (s + it)i) 2k = 1 n tr((( 1 M (s + it)i) ( 1 M (s + it)i)) k ). n n For instance, when s + it = 0 one obtains a Marchenko-Pastur law. One can also hope to control the determinant det( 1 n M (s + it)i) by a variety of other methods 25

(e.g. base height, cofactor expansion, etc.). 26

There is a remaining difficulty, which is that the function log : (0, + ) R is singular at 0 and thus cannot be uniformly approximated by polynomials. (The singularity at + is easily dealt with, using upper bounds on M.) To get around this, Girko made the approximation log σ j ( 1 M (s+it)i) log ε 2 + σ j ( 1 M (s + it)i) 2 n n for some ε > 0. The function log ε 2 + x 2 is now smooth at 0 and the contribution of the RHS can now be dealt with a variety of methods (after many computations). 27

The remaining difficulty is to show that the approximation gives an asymptotically accurate control on in the limit ε 0. For this it becomes necessary to establish lower tail bounds on the least singular value of 1 n M (s + it)i. Fortunately, with recent results we know that this singular value is usually bounded below by n O(1). This allows one to take ε = n O(1) in the above approximation, which eventually costs some factors of log 1 log n in ε various estimates (which, ultimately, cause one to require 2 + η moment rather than second moment). 28

Because of the instability of the spectrum, truncation methods are unavailable (until one has regularised the problem and reduced matters to controlling singular values instead of eigenvalues). So the case of random variables with few moment bounds is genuinely more difficult than those of higher moment bounds. In particular, once one loses fourth moment bounds, one can no longer assume that the largest singular value σ 1 of 1 n M (s + it)i is bounded by O(1); only polynomial bounds O(n O(1) ) are available. This causes some technical difficulties in this regime. 29

The circular law results have been extended to the case in which the entries are independent but not identical (though one always needs mean zero and unit variance), provided one has a (slightly technical) uniform control of second-moment type (which is called κ-controllability in (T.-Vu, 2008)). The methods also extend to sparser models in which only n 1+ε of the n 2 coefficients are non-zero (and adjusting the normalisation accordingly); see (Götze-Tikhomirov, 2007, T.-Vu, 2008). The case of non-zero mean is not well understood at present; the circular law here needs to be replaced by a different law, but it is not clear what the new law should be. 30