A probabilistic proof of Perron s theorem arxiv: v1 [math.pr] 16 Jan 2018

Similar documents
Markov Chains, Random Walks on Graphs, and the Laplacian

Markov Chains and Stochastic Sampling

Perron Frobenius Theory

A Perron-type theorem on the principal eigenvalue of nonsymmetric elliptic operators

642:550, Summer 2004, Supplement 6 The Perron-Frobenius Theorem. Summer 2004

Markov Chains, Stochastic Processes, and Matrix Decompositions

Detailed Proof of The PerronFrobenius Theorem

CHUN-HUA GUO. Key words. matrix equations, minimal nonnegative solution, Markov chains, cyclic reduction, iterative methods, convergence rate

Partitions and Algebraic Structures

In particular, if A is a square matrix and λ is one of its eigenvalues, then we can find a non-zero column vector X with

SOMEWHAT STOCHASTIC MATRICES

Topic 1: Matrix diagonalization

Eigenvalues in Applications

4. Ergodicity and mixing

Matrix functions that preserve the strong Perron- Frobenius property

Course 311: Michaelmas Term 2005 Part III: Topics in Commutative Algebra

Invariant measures for iterated function systems

Affine iterations on nonnegative vectors

Section 1.7: Properties of the Leslie Matrix

ON ALLEE EFFECTS IN STRUCTURED POPULATIONS

Boolean Inner-Product Spaces and Boolean Matrices

Bare-bones outline of eigenvalue theory and the Jordan canonical form

Applied Mathematics Letters. Comparison theorems for a subclass of proper splittings of matrices

INTRODUCTION TO MARKOV CHAINS AND MARKOV CHAIN MIXING

Markov Chains. As part of Interdisciplinary Mathematical Modeling, By Warren Weckesser Copyright c 2006.

CHAPTER III THE PROOF OF INEQUALITIES

A NEW EFFECTIVE PRECONDITIONED METHOD FOR L-MATRICES

CAAM 335 Matrix Analysis

Language Acquisition and Parameters: Part II

P i [B k ] = lim. n=1 p(n) ii <. n=1. V i :=

Finite-Horizon Statistics for Markov chains

Lecture 15 Perron-Frobenius Theory

On the mathematical background of Google PageRank algorithm

Asymptotic Counting Theorems for Primitive. Juggling Patterns

Irregular Birth-Death process: stationarity and quasi-stationarity

STOCHASTIC PROCESSES Basic notions

Kernels of Directed Graph Laplacians. J. S. Caughman and J.J.P. Veerman

On the convergence of weighted-average consensus

INTRODUCTION TO FURSTENBERG S 2 3 CONJECTURE

Intertwining of Markov processes

FOR PISOT NUMBERS β. 1. Introduction This paper concerns the set(s) Λ = Λ(β,D) of real numbers with representations x = dim H (Λ) =,

Eigenvalue comparisons in graph theory

Elementary Operations and Matrices

1.3 Convergence of Regular Markov Chains

Perron eigenvector of the Tsetlin matrix

NORMS ON SPACE OF MATRICES

An Alternative Proof of Primitivity of Indecomposable Nonnegative Matrices with a Positive Trace

Section 3.9. Matrix Norm

MATH36001 Perron Frobenius Theory 2015

HONORS LINEAR ALGEBRA (MATH V 2020) SPRING 2013

Weighted Sums of Orthogonal Polynomials Related to Birth-Death Processes with Killing

We describe the generalization of Hazan s algorithm for symmetric programming

Valuations. 6.1 Definitions. Chapter 6

Spectral Properties of Matrix Polynomials in the Max Algebra

SEMI-INNER PRODUCTS AND THE NUMERICAL RADIUS OF BOUNDED LINEAR OPERATORS IN HILBERT SPACES

Note that in the example in Lecture 1, the state Home is recurrent (and even absorbing), but all other states are transient. f ii (n) f ii = n=1 < +

Pseudo Sylow numbers

Some Results Concerning Uniqueness of Triangle Sequences

arxiv:math/ v2 [math.nt] 18 Jun 1999

Recovery Based on Kolmogorov Complexity in Underdetermined Systems of Linear Equations

Math 304 Handout: Linear algebra, graphs, and networks.

Generalized Fibonacci Numbers and Blackwell s Renewal Theorem

Edexcel GCE A Level Maths Further Maths 3 Matrices.

Two Characterizations of Matrices with the Perron-Frobenius Property

Operations On Networks Of Discrete And Generalized Conductors

Measurable Choice Functions

Necessary and sufficient conditions for strong R-positivity

Vladimir Kirichenko and Makar Plakhotnyk

arxiv: v1 [math.co] 20 Sep 2014

18.175: Lecture 30 Markov chains

T.8. Perron-Frobenius theory of positive matrices From: H.R. Thieme, Mathematics in Population Biology, Princeton University Press, Princeton 2003

YOUNG TABLEAUX AND THE REPRESENTATIONS OF THE SYMMETRIC GROUP

Geometric Mapping Properties of Semipositive Matrices

arxiv: v3 [math.oa] 7 May 2016

Linear Algebra Practice Problems

Math 443 Differential Geometry Spring Handout 3: Bilinear and Quadratic Forms This handout should be read just before Chapter 4 of the textbook.

APPENDIX A. Background Mathematics. A.1 Linear Algebra. Vector algebra. Let x denote the n-dimensional column vector with components x 1 x 2.

Linear-fractional branching processes with countably many types

MATH 56A: STOCHASTIC PROCESSES CHAPTER 1

IRREDUCIBLE REPRESENTATIONS OF SEMISIMPLE LIE ALGEBRAS. Contents

Lecture: Local Spectral Methods (1 of 4)

EIGENVALUES IN LINEAR ALGEBRA *

The Perron Frobenius theorem and the Hilbert metric

The Kemeny constant of a Markov chain

Let (Ω, F) be a measureable space. A filtration in discrete time is a sequence of. F s F t

THE POINT SPECTRUM OF FROBENIUS-PERRON AND KOOPMAN OPERATORS

Measures and Measure Spaces

process on the hierarchical group

Consensus of Information Under Dynamically Changing Interaction Topologies

Nonnegative and spectral matrix theory Lecture notes

NOTES ON THE PERRON-FROBENIUS THEORY OF NONNEGATIVE MATRICES

Theory and Applications of Stochastic Systems Lecture Exponential Martingale for Random Walk

UNDERSTANDING THE DIAGONALIZATION PROBLEM. Roy Skjelnes. 1.- Linear Maps 1.1. Linear maps. A map T : R n R m is a linear map if

Generalized Numerical Radius Inequalities for Operator Matrices

(U) =, if 0 U, 1 U, (U) = X, if 0 U, and 1 U. (U) = E, if 0 U, but 1 U. (U) = X \ E if 0 U, but 1 U. n=1 A n, then A M.

Lecture 2: September 8

The SIS and SIR stochastic epidemic models revisited

On the minimal free resolution of a monomial ideal.

Invertibility and stability. Irreducibly diagonally dominant. Invertibility and stability, stronger result. Reducible matrices

Counting Matrices Over a Finite Field With All Eigenvalues in the Field

Transcription:

A probabilistic proof of Perron s theorem arxiv:80.05252v [math.pr] 6 Jan 208 Raphaël Cerf DMA, École Normale Supérieure January 7, 208 Abstract Joseba Dalmau CMAP, Ecole Polytechnique We present an alternative proof of Perron s theorem, which is probabilistic in nature. It rests on the representation of the Perron eigenvector as a functional of the trajectory of an auxiliary Markov chain. In 907, Oskar Perron proved the following theorem. Theorem Let A be a square matrix with posivite entries. Then the matrix A admits a positive eigenvalue λ such that: i to λ is associated an eigenvector µ whose components are all positive; ii if α is another eigenvalue of A, possibly complex, then α < λ; iii any other eigenvector associated to λ is a multiple of µ. This theorem was subsequently generalized by Frobenius in his work on non negative matrices in 92, leading to the so called Perron Frobenius theorem [5]. A myriad of mathematical models involve non negative matrices and their powers, thereby calling for the use of the Perron Frobenius theorem. Mathematicians have developped generalizations in several directions, notably in infinite dimensions for infinite matrices [6], for non negative kernels in arbitrary spaces [] and a whole Perron Frobenius theory has emerged. Hawkins wrote an historical account on the initial developpement of this theory [3]. MacCluer [4] describes several applications of Perron s theorem and review the different proofs that have been found over the years. The original proof of Perron rested on an induction over the size the matrix. A few years later Perron found a proof involving the resolvent of the matrix. A nowadays popular proof, which is found in most textbooks, is due to Wielandt and it rests on a miraculous max min functional. We present here an alternative proof of Perron s theorem, which is probabilistic in nature. It rests on an auxiliary Markov chain, and the representation of the Perron eigenvector as a functional of the trajectory of

this Markov chain [2]. This formula generalizes the well known formula for the invariant probability measure of a finite state Markov chain. To ease the exposition, we restrict ourselves to the Perron theorem, and we work with matrices whose entries are all positive. However our proof can be readily extended to primitive matrices, thereby yielding the classical Perron Frobenius theorem. Our proof might seem lengthy compared to other proofs, yet it is completely self contained and it requires only classical results of basic algebra and power series. We introduce next some notation in order to define the auxiliary Markov chain. Let d be a positive integer. Throughout the text, we consider a square matrix A Ai,j i,j d of size d d with positive entries. For i {,...,d}, we denote by Si the sum of the entries on the i th row of A, i.e., d i {,...,d} Si Ai,j, j and we create a new matrix M Mi,j i,j d by setting i,j {,...,d} Mi,j Ai,j Si Obviously, the sum ofeach row of M is now equal to one, i.e., M is stochastic, and we think of it as the transition matrix of a Markov chain. So, let X n n N be a Markovchain with state space {,...,d} and transition matrix M. Let us fix i {,...,d}. We denote by E i the expectation of the Markov chain issued from i and we introduce the time τ i of the first return of the chain to i, defined by τ i inf { n : X n i }. Finally, we define a function φ i by setting λ 0 φ i λ E i λ τi τ i SX n. The quantity in the expectation is non negative, so the function φ i is well defined and it might take infinite values. Proposition 2 The function φ i is continuous, decreasing on R + and lim φ iλ +, λ 0 lim φ iλ 0. λ + Proof. In fact, the function φ i can be written as a power series in the variable /λ, as follows:. 2

φ i λ k λ k k i,...,i k i k λ ke i {τik} SX n k SiSi Si k P X i,...,x k i k,x k i λ k i,...,i k i SiMi,i Si k Mi k,i k λ k i,...,i k i Ai,i Ai k,i. Since A has positive entries, the series contains non vanishing terms, and this implies that φ i is decreasing and tends to as λ goes to 0. Let R be the radius of the convergence circle of this series. From classical results on powers series, we know that φ i λ is continuous for λ > R. To prove that φ i is continuous, we have to show that φ i R +. Let B be the matrix obtained from A by removing the i th row and the i th column and let γ,...,γ d be its eigenvalues possibly complex, arranged so that γ γ d. Let m respectively M be the minimum respectively the maximum of the entries of A. For any k, we have i,...,i k i Ai,i Ai k,i m2 M i,...,i k i Ai,i 2 Ai k,i m2 M tracebk m2 γ k + +γd k. M Althoughthe eigenvaluesγ,...,γ d mightbe complexnumbers, the trace of B k is a positive real number. We can also a prove a similar inequality in the reverse direction, and we conclude that the power series defining φ i converges if and only if the series k λ k γ k + +γk d converges. This is certainly the case if λ > γ, therefore R γ. Let us define, for n, S n λ n k λ k γ k + +γk d. We shall rely on the following result on geometric series. 3

Lemma 3 Let z be a complex number such that z. Then lim z + +z n { 0 if z, n n if z. Proof. For z, the result is obvious. For z, we compute z + +z n z zn+ n n z, and we observe that this quantity goes to 0 when n goes to. Lemma 3 implies that, for λ a complex number such that λ γ, lim n n S nλ card { j : j d, λ γ j }. This implies in particular that Sn γ goes to with n. Observing that Sn γ Sn γ, we conclude that φ i γ lim n S n γ +. Therefore R γ and moreover φ i R +. Proposition 2 implies that φ i is one to one from ]R,+ [ onto ]0,+ [, thus there exists a unique positive real number λ i such that φ i λ i. The next result is the key to our proof of the Perron Frobenius theorem. We define a vector µ i by setting j {,...,d} τi µ i j E i {Xnj}λ n i n k0 SX k. Theorem 4 The value λ i is an eigenvalue of A and the vector µ i is an associated left eigenvector whose components are all positive and finite. Proof. Let us note E i,τ i,λ i,µ i simply by E,τ,λ,µ. Let us compute d µjaj, k j j n 0 d µjsjmj, k j d E {τ>n} λ n n SX t {Xnj}fjMj,k d E {τ>n} λ n n j n 0 SX t {Xnj} {Xn+k} 4

E λe τ {Xn+k}λ n n SX t τ n {Xnk}λ n n SX t. Suppose that k i. Then the term in the last sum vanishes for n 0 or n τ, and we obtain d µjaj,k λµk. j For k i, we obtain, noticing that µi, d µjaj,i λe λ τ τ j SX t λφ i λ λµi. Thus we have proved that µa λµ. Since µi, these equations imply that µ,...,µd are all positive and finite. Proposition 5 Let α be an eigenvalue of A, possibly complex, and let ν be an associated left eigenvector. Let i {,...,d} be such that νi 0. Either ν and µ i are proportional in which case α λ i or α < λ i. Proof. Let α,ν and i be as in the statement of the proposition. We suppose that α 0, otherwise there is nothing to prove. Let ν be an associated left eigenvector. We have k {,...,d} νk α d νjaj, k. j Let us focus on the equation for k i. We divide by νi which is assumed to be non zero and we isolate the term j i in the sum to obtain α Ai,i+ α j i νj νi Aj,i. We expand νj in the above equation as a sum, and we get α Ai,i+ νj α 2 νi Aj,jAj,i j j i 5

α Ai,i+ α 2 Ai,jAj,i+ α 2 j i Iterating n times this procedure, we get α Ai,i+ + α n+ + α n+ i,...,i n i i 0,i,...,i n i j i j i νj νi Aj,jAj,i. Ai,i Ai,i 2 Ai n,i νi 0 νi Ai 0,i Ai,i 2 Ai n,i. If φ i α +, then it follows from proposition 2 and the defintion of λ i that α < λ i and we are done. From now onwards, we suppose that φ i α < +. In the proof of proposition 2, we worked out a power series expansion of φ i. The convergence of this series at α implies in particular that the general term of this series goes to 0, hence lim n α n+ i,...,i n i Ai,i Ai,i 2 Ai n,i 0. Let m respectively M be the minimum respectively the maximum of the entries of A. For any i 0 i, we have Ai 0,i Ai,i 2 Ai n,i i,...,i n i It follows that, for any n, α n+ i 0,i,...,i n i Mdmax j d νj m νi M m i,...,i n i νi 0 νi Ai 0,i Ai,i 2 Ai n,i α n+ i,...,i n i and we conclude from the previous inequality that lim n α n+ i 0,i,...,i n i We send now n to in the identity and we get α Ai,i+ + n α n+ Ai,i Ai,i 2 Ai n,i. Ai,i Ai,i 2 Ai n,i νi 0 νi Ai 0,i Ai,i 2 Ai n,i 0. i,...,i n i Ai,i Ai,i 2 Ai n,i. 6

Recall that α might be complex. Taking the modulus, we conclude that φ i α, andsince φ i is decreasing, then α λ i. It remainsto examine the case α λ i. We suppose that the eigenvector ν associated to α is normalizedso that νi. We denote by ν the vectorwhose coordinates are the modulus of the coordinates of ν, i.e., ν j νj for j d. Since νa αν and the entries of A are positive, then k {,...,d} ν k λ d ν jaj, k. Starting from this inequality, we proceed as previously, that is, we isolate the term corresponding to j i in the sum, we bound from above the term νj for j i, we iterate the procedure n times. We check that the ultimate term goes to 0 when we send n to, as we get the inequality k {,...,d} For k {,...,d}, we have λ ν k d νjaj, k j j ν k µ i k. d ν jaj, k. It follows that d µi k ν k Ak,i λ µi νi 0. k This equation implies that µ i ν and that all the intermediate inequalities were in fact equalities. Since all the entries of A are positive and νi, then necessarily all the components of ν are non negative real numbers and ν µ i and α λ. The λ i s are positive eigenvalues of A, the eigenvectors µ i have positive coordinates, thus proposition 5 readily implies the following result. Corollary 6 The values λ,...,λ d are all equal. Their common value λ is a simple eigenvalue of A. The eigenvectors µ,...,µ d are proportional. Finally, we normalize these eigenvectors by imposing that the sum of the components is equal to, thereby getting a probability distribution. Corollary 7 The left Perron Frobenius eigenvector µ of A is given by i {,...,d} µi τi E i 7 j n λ n. SX t

This formula already proved in [2] is a generalization of the classical formula for the invariant probability measure of a Markov chain. Indeed, in the particular case where A is stochastic, S is constant equal to, λ is also equal to, the formula of the corollary becomes the well known formula i {,...,d} µi E i τ i. References [] Krishna B. Athreya and Peter Ney. A renewal approach to the Perron- Frobenius theory of nonnegative kernels on general state spaces. Math. Z., 794:507 529, 982. [2] Raphaël Cerf and Joseba Dalmau. A markov chain representation of the normalized perronfrobenius eigenvector. Electron. Commun. Probab., 22:6 pp., 207. [3] Thomas Hawkins. Continued fractions and the origins of the Perron- Frobenius theorem. Arch. Hist. Exact Sci., 626:655 77, 2008. [4] C. R. MacCluer. The many proofs and applications of Perron s theorem. SIAM Rev., 423:487 498, 2000. [5] E. Seneta. Non-negative matrices and Markov chains. Springer Series in Statistics. Springer, New York, 2006. Revised reprint of the second 98 edition [Springer-Verlag, New York; MR079544]. [6] D. Vere-Jones. Ergodic properties of nonnegative matrices. I. Pacific J. Math., 22:36 386, 967.