Uniqueness of the Gaussian Orthogonal Ensemble

José Ángel Sánchez Gómez (jose.sanchez@cimat.mx), Victor Amaya Carvajal (victor.amaya@cimat.mx)

December, 2017

arXiv:1901.0957v1 [math.PR] 6 Jan 2019

Abstract

It is not hard to prove that a random Wigner matrix $X$ belonging to the Gaussian Orthogonal Ensemble (GOE) is invariant under orthogonal conjugations. Our aim is to prove the converse: if $X$ is a random symmetric matrix that is invariant under orthogonal transformations, then $X$ belongs to the GOE. We will do this using nothing more than the characteristic function.

1 Introduction

We give a proof of the characterization of the matrices that belong to the Gaussian Orthogonal Ensemble. This work was done as a project for the course on Random Matrices at the Centro de Investigación en Matemáticas, A.C. (CIMAT).

2 Notation

We denote by $M_{n \times m}(F)$ the set of matrices of size $n \times m$ over the field $F$. The matrix $I_{d \times d}$ denotes the identity matrix of $M_{d \times d}(F)$. We denote by $O(d)$ the set of orthogonal matrices of $M_{d \times d}(F)$. This means that if $O \in O(n)$ then $O^T O = O O^T = I_{n \times n}$. A symmetric matrix $X$ is a square matrix such that $X^T = X$. We write $S_d$ for the set of real symmetric $d \times d$ matrices, and $X \sim Y$ when $X$ and $Y$ have the same distribution.

3 Background

We define the characteristic function of a random variable $R$ to be the function $\varphi_R : \mathbb{R} \to \mathbb{C}$ given by

$$\varphi_R(t) = E\left[e^{itR}\right], \quad t \in \mathbb{R}.$$

In particular, given a random variable $R \sim N(\mu, \sigma^2)$, its characteristic function is given by

$$\varphi_R(t) = E\left[e^{itR}\right] = e^{i\mu t - \frac{1}{2}\sigma^2 t^2}.$$

Another important fact to remember is that if the characteristic functions of two random variables coincide at every point, then their distributions are the same. In other words, if $\varphi_X(t) = \varphi_Y(t)$ for every $t \in \mathbb{R}$, then $X \sim Y$.

Given a symmetric random matrix $X \in S_d$, its characteristic function $C_X : S_d \to \mathbb{C}$ is defined by

$$C_X(M) = E\left[\exp\{i\,\mathrm{Tr}(X^T M)\}\right], \quad M \in S_d.$$

Recall that the trace is invariant under cyclic permutations. From this, if $A \in M_{d \times d}(\mathbb{R})$, then for all $M \in S_d$,

$$C_X(A^T M A) = E\left[\exp\{i\,\mathrm{Tr}(X^T A^T M A)\}\right] = E\left[\exp\{i\,\mathrm{Tr}(A X^T A^T M)\}\right] = E\left[\exp\{i\,\mathrm{Tr}([A X A^T]^T M)\}\right] = C_{A X A^T}(M).$$

In particular, if $X$ is invariant under orthogonal conjugations, that is, $O X O^T \sim X$ for every $O \in O(d)$, then $C_X(O^T M O) = C_X(M)$ for every $O \in O(d)$ and $M \in S_d$.
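As an illustration of the conjugation identity, the following worked case takes $d = 2$ and the permutation matrix that swaps the two coordinates. The entry names $m_{11}, m_{12}, m_{22}$ and the matrix $P$ are labels chosen here for the example and are not part of the original text; it is a sketch of one instance, not a general proof.

```latex
% Worked instance of C_X(P^T M P) = C_{P X P^T}(M) for d = 2.
% Assumed labels: M has entries m_{11}, m_{12}, m_{22}; P swaps coordinates.
\begin{align*}
X &= \begin{pmatrix} X_{11} & X_{12} \\ X_{12} & X_{22} \end{pmatrix}, \quad
M = \begin{pmatrix} m_{11} & m_{12} \\ m_{12} & m_{22} \end{pmatrix}, \quad
P = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \\
P^T M P &= \begin{pmatrix} m_{22} & m_{12} \\ m_{12} & m_{11} \end{pmatrix}, \quad
P X P^T = \begin{pmatrix} X_{22} & X_{12} \\ X_{12} & X_{11} \end{pmatrix}, \\
\mathrm{Tr}\bigl(X^T (P^T M P)\bigr)
  &= m_{22} X_{11} + 2 m_{12} X_{12} + m_{11} X_{22}, \\
\mathrm{Tr}\bigl((P X P^T)^T M\bigr)
  &= m_{11} X_{22} + 2 m_{12} X_{12} + m_{22} X_{11}.
\end{align*}
```

The two traces agree term by term, so the two characteristic functions coincide, as the general identity predicts.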
Notice that, if $A, B \in S_d$,

$$\mathrm{Tr}(A^T B) = \sum_{j=1}^{d} (A^T B)_{jj} = \sum_{j,k=1}^{d} (A^T)_{jk} B_{kj} = \sum_{j,k=1}^{d} A_{kj} B_{kj}.$$

Furthermore, since $A$ and $B$ are symmetric,

$$\mathrm{Tr}(A^T B) = \sum_{j=1}^{d} A_{jj} B_{jj} + 2 \sum_{1 \le k < j \le d} A_{kj} B_{kj}.$$

From this, note that

$$C_X(M) = E\left[\exp\{i\,\mathrm{Tr}(X^T M)\}\right] = E\left[\exp\Big\{ i \sum_{j=1}^{d} M_{jj} X_{jj} + 2i \sum_{1 \le k < j \le d} M_{kj} X_{kj} \Big\}\right].$$

Furthermore, if the entries of the matrix $X$ are independent, we get a concise formula for the characteristic function in terms of the characteristic functions of the individual entries:

$$C_X(M) = \prod_{j=1}^{d} E\left[\exp\{i M_{jj} X_{jj}\}\right] \prod_{1 \le k < j \le d} E\left[\exp\{2i M_{kj} X_{kj}\}\right].$$

It will be useful to keep this formula in mind when computing the value of the characteristic function on particular symmetric matrices.

4 The problem

Our aim is to prove the following theorem using nothing more than the characteristic function of a random matrix.

Theorem 1. Let $X$ be a random symmetric $d \times d$ matrix that is invariant under orthogonal transformations. Suppose that all the entries of $X$ (on and above the diagonal) are independent with finite variance. Then there exist a matrix $Y$ belonging to the Gaussian Orthogonal Ensemble and real numbers $\mu \in \mathbb{R}$, $\sigma \ge 0$ such that $X \sim \mu I_{d \times d} + \sigma Y$.

We will divide our proof into three steps:

1. Show that the entries on the diagonal share the same distribution, and that all the entries outside the main diagonal share the same distribution.
2. Show that it is enough to prove the result for $2 \times 2$ matrices.
3. Prove that the characteristic functions of the entries of the matrix $X$ correspond to normally distributed random variables.

Proof: Let us prove each step.

First step: Let us define $A_t^{(k,j)}$ as the matrix that has the value $t$ in the positions $(k,j)$ and $(j,k)$ and zeros everywhere else, for $1 \le k < j \le d$ and $t \in \mathbb{R}$. Evaluating the characteristic function of $X$ at $A_t^{(k,j)}$ we obtain

$$C_X\big(A_t^{(k,j)}\big) = E\left[\exp\big\{i\,\mathrm{Tr}\big((A_t^{(k,j)})^T X\big)\big\}\right] = E\left[\exp\{2itX_{kj}\}\right] = \varphi_{X_{kj}}(2t).$$

Now, let $k \ne j$ and $l \ne g$. There exists a permutation matrix $P \in O(d)$ such that, for all $t \in \mathbb{R}$, $P^T A_t^{(k,j)} P = A_t^{(l,g)}$. From the invariance of $X$ with respect to orthogonal conjugations:
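$$\varphi_{X_{kj}}(2t) = C_X\big(A_t^{(k,j)}\big) = C_X\big(P^T A_t^{(k,j)} P\big) = C_X\big(A_t^{(l,g)}\big) = \varphi_{X_{lg}}(2t), \quad t \in \mathbb{R}.$$

It follows that $X_{kj} \sim X_{lg}$. This means that all entries outside the main diagonal have the same distribution. In an analogous way, by conjugating with a diagonal orthogonal matrix whose entries are $\pm 1$, we can prove that if $1 \le j < k \le d$, then $X_{kj} \sim -X_{kj}$. This implies that the distribution of the entries outside the diagonal is symmetric. Since all entries of the matrix have finite second moment, it follows that $E(X_{kj}) = 0$.

Now, let $A_t^{j} = \mathrm{diag}(t e_j)$, i.e. the matrix whose only non-zero entry is $t$ in the position $(j,j)$, for $1 \le j \le d$ and $t \in \mathbb{R}$. From this,

$$C_X\big(A_t^{j}\big) = E\left[\exp\big\{i\,\mathrm{Tr}\big((A_t^{j})^T X\big)\big\}\right] = E\left[\exp\{itX_{jj}\}\right] = \varphi_{X_{jj}}(t),$$

for all $1 \le j \le d$ and $t \in \mathbb{R}$. Now, given $j \ne k$, there exists a permutation matrix $Q \in O(d)$ such that $Q^T A_t^{j} Q = A_t^{k}$. From this,

$$\varphi_{X_{jj}}(t) = C_X\big(A_t^{j}\big) = C_X\big(Q^T A_t^{j} Q\big) = C_X\big(A_t^{k}\big) = \varphi_{X_{kk}}(t), \quad t \in \mathbb{R}.$$

Thus $X_{jj} \sim X_{kk}$ for all $j \ne k$. Hence all entries on the diagonal have the same distribution.

Second step: In the previous step we proved that all entries on the diagonal have the same distribution, and that the entries outside the diagonal also have the same distribution. From this, it is possible to find the distribution of all entries of the matrix $X$ by finding the distribution of the entries of the $2 \times 2$ submatrix

$$\tilde{X} = \begin{pmatrix} X_{11} & X_{12} \\ X_{12} & X_{22} \end{pmatrix}.$$

It can be proved that $\tilde{X}$ is invariant with respect to rotations in $\mathbb{R}^2$. For this, let $\theta \in \mathbb{R}$ and

$$\tilde{Q}_\theta = \begin{pmatrix} Q_\theta & 0 \\ 0 & I_{(d-2) \times (d-2)} \end{pmatrix} \in M_{d \times d}(\mathbb{R}), \quad \text{where} \quad Q_\theta = \begin{pmatrix} \cos(\theta) & \sin(\theta) \\ -\sin(\theta) & \cos(\theta) \end{pmatrix} \in M_{2 \times 2}(\mathbb{R}).$$

The matrix $\tilde{Q}_\theta$ is an orthogonal matrix which rotates the first two coordinates and fixes the rest. Let $M \in S_2$, where

$$M = \begin{pmatrix} a & b \\ b & d \end{pmatrix} \in S_2.$$

We can embed $M$ into $S_d$ by filling all entries outside the first $2 \times 2$ submatrix with zeros:

$$\tilde{M} = \begin{pmatrix} M & 0 \\ 0 & 0 \end{pmatrix} \in S_d.$$

We observe that

$$C_X(\tilde{M}) = E\left[\exp\{i\,\mathrm{Tr}(\tilde{M}^T X)\}\right] = E\left[\exp\{i(aX_{11} + 2bX_{12} + dX_{22})\}\right] = C_{\tilde{X}}(M).$$

Evaluating the characteristic function of $X$ on $\tilde{M}$, using the orthogonal invariance, and considering that all entries of $\tilde{M}$ outside the first $2 \times 2$ block are zero, we get

$$C_{\tilde{X}}(M) = C_X(\tilde{M}) = C_X\big(\tilde{Q}_\theta^T \tilde{M} \tilde{Q}_\theta\big) = C_{\tilde{X}}\big(Q_\theta^T M Q_\theta\big).$$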
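The orthogonality of the block rotation, and the fact that conjugating the embedded matrix only acts on its $2 \times 2$ block, can be checked directly. The computation below is a sketch that assumes the sign convention $Q_\theta = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix}$ (the opposite convention works the same way) and writes $\tilde{Q}_\theta$, $\tilde{M}$ for the $d \times d$ block matrices; these labels are chosen here for illustration.

```latex
% Assumed convention: Q_theta = [[cos, sin], [-sin, cos]];
% \tilde{Q}_theta = diag(Q_theta, I_{d-2}), \tilde{M} = diag(M, 0).
\begin{align*}
Q_\theta^T Q_\theta
&= \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}
   \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix}
 = \begin{pmatrix} \cos^2\theta + \sin^2\theta & 0 \\ 0 & \sin^2\theta + \cos^2\theta \end{pmatrix}
 = I_{2 \times 2}, \\
\tilde{Q}_\theta^T \tilde{Q}_\theta
&= \begin{pmatrix} Q_\theta^T Q_\theta & 0 \\ 0 & I_{(d-2) \times (d-2)} \end{pmatrix}
 = I_{d \times d}, \\
\tilde{Q}_\theta^T \tilde{M} \tilde{Q}_\theta
&= \begin{pmatrix} Q_\theta^T & 0 \\ 0 & I_{(d-2) \times (d-2)} \end{pmatrix}
   \begin{pmatrix} M & 0 \\ 0 & 0 \end{pmatrix}
   \begin{pmatrix} Q_\theta & 0 \\ 0 & I_{(d-2) \times (d-2)} \end{pmatrix}
 = \begin{pmatrix} Q_\theta^T M Q_\theta & 0 \\ 0 & 0 \end{pmatrix}.
\end{align*}
```

The last line is what allows the characteristic function of the full matrix at the conjugated embedding to be read as the characteristic function of the $2 \times 2$ submatrix at $Q_\theta^T M Q_\theta$.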
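Third step: Since we have reduced the problem to the case of $2 \times 2$ matrices, we keep $M$ and $Q_\theta$ as defined in the previous step, and we write $\tilde{X}$ for the upper-left $2 \times 2$ submatrix of $X$. We take $M_\theta = Q_\theta^T M Q_\theta$, a symmetric matrix. Name the entries of $M_\theta$ as follows:

$$M_\theta = \begin{pmatrix} A & B \\ B & D \end{pmatrix}.$$

We can compute explicitly the values of $A, B, D$ as functions of the entries of $M$. We get the following:

$$A = \frac{a+d}{2} + \frac{a-d}{2}\cos(2\theta) - b\sin(2\theta),$$
$$B = \frac{a-d}{2}\sin(2\theta) + b\cos(2\theta),$$
$$D = \frac{a+d}{2} - \frac{a-d}{2}\cos(2\theta) + b\sin(2\theta).$$

These expressions are valid for every $\theta \in \mathbb{R}$. Differentiating them with respect to $\theta$, we get

$$\frac{dA}{d\theta} = -2B, \quad \frac{dB}{d\theta} = A - D, \quad \frac{dD}{d\theta} = 2B. \tag{1}$$

Now, since $\tilde{X}$ is invariant under rotations, we have that

$$C_{\tilde{X}}(M_\theta) = C_{\tilde{X}}(M).$$

Computing the characteristic function on both matrices, and using the independence of the entries, we obtain

$$\varphi_1(a)\,\varphi_2(2b)\,\varphi_1(d) = \varphi_1(A)\,\varphi_2(2B)\,\varphi_1(D), \tag{2}$$

where $\varphi_1$ is the characteristic function of the random variable $X_{11}$ (which is the same as that of $X_{22}$, since they have the same distribution) and $\varphi_2$ is the characteristic function of $X_{12}$. Since the entries of $X$ have finite variance, $\varphi_1$ and $\varphi_2$ are twice continuously differentiable, and $\varphi_j'(0) = iE(X_{1j})$ for $j = 1, 2$. Differentiating equation (2) with respect to $\theta$ (the left-hand side does not depend on $\theta$), we get

$$0 = \varphi_1'(A)\,\varphi_2(2B)\,\varphi_1(D)\frac{dA}{d\theta} + 2\,\varphi_1(A)\,\varphi_2'(2B)\,\varphi_1(D)\frac{dB}{d\theta} + \varphi_1(A)\,\varphi_2(2B)\,\varphi_1'(D)\frac{dD}{d\theta}. \tag{3}$$

Note that since $\varphi_1(0) = \varphi_2(0) = 1$ and both functions are continuous at zero, there exists an open neighborhood of zero where these functions do not vanish. So we can rewrite equation (3) in the following way:

$$0 = \frac{\varphi_1'(A)}{\varphi_1(A)}\,\varphi_1(A)\varphi_2(2B)\varphi_1(D)\frac{dA}{d\theta} + 2\,\frac{\varphi_2'(2B)}{\varphi_2(2B)}\,\varphi_1(A)\varphi_2(2B)\varphi_1(D)\frac{dB}{d\theta} + \frac{\varphi_1'(D)}{\varphi_1(D)}\,\varphi_1(A)\varphi_2(2B)\varphi_1(D)\frac{dD}{d\theta}.$$

Now, replacing the values we found in (1) and dividing by $2B(A-D)\,\varphi_1(A)\varphi_2(2B)\varphi_1(D)$ (for those values with $B \ne 0$, $A \ne D$), we get that

$$\frac{1}{B}\,\frac{\varphi_2'(2B)}{\varphi_2(2B)} = \frac{1}{A-D}\left[\frac{\varphi_1'(A)}{\varphi_1(A)} - \frac{\varphi_1'(D)}{\varphi_1(D)}\right]. \tag{4}$$

We observe that the left-hand side of (4) depends only on $B$, whereas the right-hand side depends only on $A$ and $D$. From this observation we can conclude that both sides of (4) are equal to a constant.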
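As a sanity check on the separated equation, one can verify that Gaussian characteristic functions make both sides equal to the same constant. The computation below is a sketch under the assumption that $X_{11} \sim N(\mu, 2\sigma^2)$ and $X_{12} \sim N(0, \sigma^2)$; the parameters $\mu$ and $\sigma$ are illustrative here, and the check does not replace the argument that these are the only solutions.

```latex
% Assumed candidates: phi_1 for N(mu, 2 sigma^2), phi_2 for N(0, sigma^2).
\begin{align*}
\varphi_1(x) &= e^{i\mu x - \sigma^2 x^2},
& \frac{\varphi_1'(x)}{\varphi_1(x)} &= i\mu - 2\sigma^2 x, \\
\varphi_2(x) &= e^{-\frac{1}{2}\sigma^2 x^2},
& \frac{\varphi_2'(x)}{\varphi_2(x)} &= -\sigma^2 x, \\
\frac{1}{B}\,\frac{\varphi_2'(2B)}{\varphi_2(2B)}
  &= \frac{1}{B}\,(-2\sigma^2 B) = -2\sigma^2, \\
\frac{1}{A-D}\left[\frac{\varphi_1'(A)}{\varphi_1(A)} - \frac{\varphi_1'(D)}{\varphi_1(D)}\right]
  &= \frac{(i\mu - 2\sigma^2 A) - (i\mu - 2\sigma^2 D)}{A-D} = -2\sigma^2.
\end{align*}
```

Both sides equal $-2\sigma^2$ for every admissible $A$, $B$, $D$, which is consistent with the common value being a constant that does not depend on the entries of $M_\theta$.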
Then there exists a constant $k$ such that

$$\frac{1}{B}\,\frac{\varphi_2'(2B)}{\varphi_2(2B)} = -k.$$

Observe that, since $\varphi_2$ is the characteristic function of a symmetric random variable, $\varphi_2$ is real-valued. This implies $k \in \mathbb{R}$. Now, let us take $x = 2B$. We have that

$$\varphi_2'(x) = -\frac{kx}{2}\,\varphi_2(x).$$

This is an elementary real-valued ODE whose solution is given by

$$\varphi_2(x) = B_0\, e^{-\frac{k x^2}{4}},$$
for some $B_0 \in \mathbb{R}$. Now, as $\varphi_2$ is a characteristic function, $\varphi_2(0) = 1$. This gives $B_0 = 1$, so we have

$$\varphi_2(x) = e^{-\frac{k x^2}{4}}.$$

At first, we have this in a neighborhood of $0 \in \mathbb{R}$, but we can extend this solution to all of $\mathbb{R}$. Observe that if $k < 0$ then $\varphi_2(t) \to \infty$ when $t \to \infty$. This leads to a contradiction, since characteristic functions are bounded. Hence $k \ge 0$. If $k > 0$, then $\varphi_2$ is the characteristic function of a normal random variable with distribution $N(0, k/2)$. If $k = 0$, then $\varphi_2$ is the characteristic function of the degenerate distribution concentrated at $0$.

On the other hand, taking $D = 0$ in equation (4) and using that both sides are equal to $-k$, it follows that

$$\frac{1}{A}\left[\frac{\varphi_1'(A)}{\varphi_1(A)} - \frac{\varphi_1'(0)}{\varphi_1(0)}\right] = -k.$$

We know that $\varphi_1(0) = 1$ and that $\varphi_1'(0) = i\mu$, where $\mu = E(X_{11})$. So,

$$\frac{\varphi_1'(A)}{\varphi_1(A)} = -Ak + i\mu \quad \Longrightarrow \quad \varphi_1'(A) = (-Ak + i\mu)\,\varphi_1(A).$$

The solution to this ODE is

$$\varphi_1(x) = e^{i\mu x - \frac{k x^2}{2}},$$

which is nothing more than the characteristic function of a random variable with distribution $N(\mu, k)$. Since $k$ is non-negative, we can write it as $k = 2\sigma^2$. So we can write the characteristic functions in the following way:

$$\varphi_1(x) = e^{i\mu x - \sigma^2 x^2}, \qquad \varphi_2(x) = e^{-\frac{1}{2}\sigma^2 x^2},$$

which means that $X_{11} \sim N(\mu, 2\sigma^2)$ and $X_{12} \sim N(0, \sigma^2)$. With this, we have proved that $X_{jj} \sim N(\mu, 2\sigma^2)$ and $X_{kj} \sim N(0, \sigma^2)$ for $k \ne j$. Now, if we take a GOE matrix $Y$, i.e., $Y_{ii} \sim N(0, 2)$ and $Y_{jk} \sim N(0, 1)$ for every $j \ne k$, we conclude that

$$X \sim \sigma Y + \mu I_d.$$

Conversely, such a matrix $X$ is invariant under orthogonal conjugations.

5 Conclusions

There are important remarks worth stressing about this proof. First, we find it interesting that the proof unfolds using only elementary properties of characteristic functions, differential equations and linear algebra. Second, it yields a generalization of the standard result, which is usually proved for matrices that have zero-mean entries. This proof allows us to study general matrices that are invariant with respect to orthogonal conjugations. The independence of the entries is key for this proof.
One example of a distribution that is invariant under orthogonal conjugation is the Haar distribution on $O(d)$. This distribution does not have entry-wise independence, since the entries of an orthogonal matrix satisfy relations among them. In terms of this work, it is key to be able to write the characteristic function of the matrix as the product of the characteristic functions of its entries.

One possible extension of this project is to find the distribution of Wishart matrices by applying the same characteristic-function approach.