eigenvalues, markov matrices, and the power method

Similar documents
Graph Models The PageRank Algorithm

A Note on Google s PageRank

Application. Stochastic Matrices and PageRank

Introduction to Search Engine Technology Introduction to Link Structure Analysis. Ronny Lempel Yahoo Labs, Haifa

Link Analysis. Leonid E. Zhukov

0.1 Naive formulation of PageRank

ECEN 689 Special Topics in Data Science for Communications Networks

Uncertainty and Randomization

Eigenvalues and Eigenvectors

Computational Economics and Finance

How works. or How linear algebra powers the search engine. M. Ram Murty, FRSC Queen s Research Chair Queen s University

IR: Information Retrieval

Lab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018

Link Analysis Information Retrieval and Data Mining. Prof. Matteo Matteucci

Data Mining and Matrices

Node Centrality and Ranking on Networks

Online Social Networks and Media. Link Analysis and Web Search

1998: enter Link Analysis

Google PageRank. Francesco Ricci Faculty of Computer Science Free University of Bozen-Bolzano

Web Ranking. Classification (manual, automatic) Link Analysis (today s lesson)

Algebraic Representation of Networks

Link Analysis Ranking

Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University.

Link Mining PageRank. From Stanford C246

Google Page Rank Project Linear Algebra Summer 2012

DATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS

Data Mining Recitation Notes Week 3

MATH36001 Perron Frobenius Theory 2015

Spectral radius, symmetric and positive matrices

CS 277: Data Mining. Mining Web Link Structure. CS 277: Data Mining Lectures Analyzing Web Link Structure Padhraic Smyth, UC Irvine

Calculating Web Page Authority Using the PageRank Algorithm

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Web Structure Mining Nodes, Links and Influence

How does Google rank webpages?

The Google Markov Chain: convergence speed and eigenvalues

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

Slides based on those in:

Link Analysis. Reference: Introduction to Information Retrieval by C. Manning, P. Raghavan, H. Schutze

Math 304 Handout: Linear algebra, graphs, and networks.

Online Social Networks and Media. Link Analysis and Web Search

1 Last time: least-squares problems

Data and Algorithms of the Web

Lecture 7 Mathematics behind Internet Search

Numerical Methods I: Eigenvalues and eigenvectors

No class on Thursday, October 1. No office hours on Tuesday, September 29 and Thursday, October 1.

Finite-Horizon Statistics for Markov chains

CS246: Mining Massive Datasets Jure Leskovec, Stanford University.

Affine iterations on nonnegative vectors

Lecture 10. Lecturer: Aleksander Mądry Scribes: Mani Bastani Parizi and Christos Kalaitzis

PageRank. Ryan Tibshirani /36-662: Data Mining. January Optional reading: ESL 14.10

Krylov Subspace Methods to Calculate PageRank

Linear Algebra in Actuarial Science: Slides to the lecture

TMA Calculus 3. Lecture 21, April 3. Toke Meier Carlsen Norwegian University of Science and Technology Spring 2013

Perron Frobenius Theory

Econ Slides from Lecture 7

Chapter 5. Eigenvalues and Eigenvectors

Solving Homogeneous Systems with Sub-matrices

MATH 304 Linear Algebra Lecture 20: The Gram-Schmidt process (continued). Eigenvalues and eigenvectors.

As it is not necessarily possible to satisfy this equation, we just ask for a solution to the more general equation

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides

Math 240 Calculus III

On the mathematical background of Google PageRank algorithm

Stochastic processes. MAS275 Probability Modelling. Introduction and Markov chains. Continuous time. Markov property

Robust PageRank: Stationary Distribution on a Growing Network Structure

Thanks to Jure Leskovec, Stanford and Panayiotis Tsaparas, Univ. of Ioannina for slides

Link Analysis. Stony Brook University CSE545, Fall 2016

Algebraic Multigrid Preconditioners for Computing Stationary Distributions of Markov Processes

ORIE 6300 Mathematical Programming I August 25, Recitation 1

Applications to network analysis: Eigenvector centrality indices Lecture notes

CS6220: DATA MINING TECHNIQUES

A linear model for a ranking problem

THE EIGENVALUE PROBLEM

THEORY OF SEARCH ENGINES. CONTENTS 1. INTRODUCTION 1 2. RANKING OF PAGES 2 3. TWO EXAMPLES 4 4. CONCLUSION 5 References 5

DIAGONALIZATION. In order to see the implications of this definition, let us consider the following example Example 1. Consider the matrix

Pr[positive test virus] Pr[virus] Pr[positive test] = Pr[positive test] = Pr[positive test]

Lecture 3 Eigenvalues and Eigenvectors

MATH3200, Lecture 31: Applications of Eigenvectors. Markov Chains and Chemical Reaction Systems

Daily Update. Math 290: Elementary Linear Algebra Fall 2018

Eigenvalues and Eigenvectors

Chapter 10. Finite-State Markov Chains. Introductory Example: Googling Markov Chains

Introduction to Information Retrieval

REVIEW FOR EXAM II. The exam covers sections , the part of 3.7 on Markov chains, and

MultiRank and HAR for Ranking Multi-relational Data, Transition Probability Tensors, and Multi-Stochastic Tensors

googling it: how google ranks search results Courtney R. Gibbons October 17, 2017

Markov Chains and Spectral Clustering

UCSD ECE269 Handout #18 Prof. Young-Han Kim Monday, March 19, Final Examination (Total: 130 points)

IMPORTANT DEFINITIONS AND THEOREMS REFERENCE SHEET

Spectral Graph Theory and You: Matrix Tree Theorem and Centrality Metrics

Google and Biosequence searches with Markov Chains

Updating PageRank. Amy Langville Carl Meyer

4. Linear transformations as a vector space 17

A A x i x j i j (i, j) (j, i) Let. Compute the value of for and

ON THE LIMITING PROBABILITY DISTRIBUTION OF A TRANSITION PROBABILITY TENSOR

CS 246 Review of Linear Algebra 01/17/19

Markov Chains, Stochastic Processes, and Matrix Decompositions

Math 443/543 Graph Theory Notes 5: Graphs as matrices, spectral graph theory, and PageRank

COMPSCI 514: Algorithms for Data Science

Lecture 12: Link Analysis for Web Retrieval

Cheat Sheet for MATH461

Transcription:

eigenvalues, markov matrices, and the power method Slides by Olson. Some taken loosely from Jeff Jauregui, Some from Semeraro L. Olson Department of Computer Science University of Illinois at Urbana-Champaign 1

objectives Create a stochastic matrix (or Markov matrix) that represents the probability of moving from one state to the next Establish properties of the Markov Matrix Find the steady state of a stochastic matrix Relate the steady state to an eigenvecture Find important eigenvectors with the Power Method 2

random transitions Given a system of states, we want to model the transition from state to state over time. Let n be the number of states So at time k the system is represented by x k R n. x (i) k Definition is the probability of being in state i at time k A probability vector is a vector of positive entries that sum to 1.0. 3

markov chains Definition A Markov matrix is a square matrix M with columns that are probability vectors. So the entries of M are positive and the column sums are 1.0. Definition A Markov Chain is a sequence of probability vectors x 0, x 1,..., x k,... such that x k+1 = Mx k for some Markov Matrix M 4

markov chains Does a steady-state exist? Does a steady state depend on the initial state? Will x k+1 be a probability vector if x k is a probability vector? Is the steady state unique? 5

markov theory Theorem Let M be a Markov Matrix. Then there is a vector x 0 such that Mx = x. Proof? M T is singular. Why? So there is an x such that M T x = x or so that (M T I)x = 0 Thus M I is singular. Why? 6

goal Find x = Ax and the elements of x are the probability vector (Basketball Ranking, Google Page Rank, etc). 7

power method Suppose that A is n n and that the eigenvalues are ordered: λ 1 > λ 2 λ 3 λ n Assuming A is nonsingular, we have a linearly independent set of v i such that Av i = λ i v i. Goal Computing the value of the largest (in magnitude) eigenvalue, λ 1. 8

power method Take a guess at the associated eigenvector, x 0. We know x (0) = c 1 v 1 + + c n v n Since the guess was random, start with all c j = 1: x (0) = v 1 + + v n Then compute x (1) = Ax (0) x (2) = Ax (1) x (3) = Ax (2). x (k+1) = Ax (k) 9

power method Or x (k) = A k x (0). Or x (k) = A k x (0) = A k v 1 + + A k v n = λ k 1v 1 +... λ k nv n And this can be written as ( x (k) = λ k 1 v 1 + So as k, we are left with ( λ2 λ 1 ) k v 2 + + ( λn λ 1 ) k v n ) x (k) λ k v 1 10

the power method (with normalization) 1 for k = 1 to kmax 2 y = Ax 3 r = φ(y)/φ(x) 4 x = y/ y often φ(x) = x 1 is sufficient r is an estimate of the eigenvalue; x the eigenvector 11

inverse power method We now want to find the smallest eigenvalue Av = λv A 1 v = 1 λ v So apply power method to A 1 (assuming a distinct smallest eigenvalue) x (k+1) = A 1 x (k) Easier with A = LU Update RHS and backsolve with U: Ux (k+1) = L 1 x (k) 12

theory Theorem Perron-Frobenius If M is a Markov matrix with positive entries, then M has a unique steady-state vector x. Theorem Perron-Frobenius Corollary Given an initial state x 0, then x k = M k x 0 converges to x. 13

pagerank Example Problem: Consider n linked webpages. Rank them. Let x 1,..., x n 0 represent importance A link to a page increases the perceived importance of a webpage Example Try n = 4. page 1: 2,3,4 page 2: 3,4 page 3: 1 page 4: 1,3 14

page rank First attempt Let x k be the number of links to page k Problem: a link from an important page like The NY Times has no more weight than lukeo.cs.illinois.edu 15

page rank Second attempt Let x k be the sum of importance scores of all pages that link to page k Problem: a webpage has more influence simply by having more outgoing links Problem: the linear system is trivial (oops!) 16

page rank Third attempt (Brin/Page 90s) Let n j be the number of outgoing links on page j Let x k = x j n j j linking to k The influence of a page is its importance. It is split evenly to the pages it links to. Example Let A be an n n matrix as { 1/nj if page j links to page i A ij = 0 otherwise 17

page rank Sum of column j is n j /n j = 1, so A is a Markov Matrix Problem: does not guarantee a unique x s.t. Ax = x Brin-Page: Use instead A 0.85A + 0.15 Still a Markov Matrix Now has all positive entries Guarantees a unique solution 18

page rank A 0.85A + 0.15 What does this mean though? This defines a stochastic process: PageRank can be thought of as a model of user behavior. We assume there is a random surfer who is given a web page at random and keeps clicking on links, never hitting bakc, but eventually gets bored and starts on another random page. So a surfer clicks on a link on the current page with probability 0.85 and opens a random page with probability 0.15. PageRank is the probability that the random user will end up on that page 19