Four graph partitioning algorithms. Fan Chung University of California, San Diego

Size: px
Start display at page:

Download "Four graph partitioning algorithms. Fan Chung University of California, San Diego"

Transcription

1 Four graph partitioning algorithms Fan Chung University of California, San Diego

2 History of graph partitioning NP-hard approximation algorithms Spectral method, Fiedler 73, Folklore Multicommunity flow, Leighton+Rao 88 Semidefinite programming, xxxxxxxxxxxxxarora+rao+vazirani 04 Expander flow, Arora+Hazan+Kale 04 Single commodity flows, xxxxxxxxxxkhandekar+rao+vazirani 06

3 traditional applications of graph partition algorithms: Divide-and-conquer algorithms Circuit layout & designs Parallel computing Hierarchical clusterings Bioinformatics

4 Applications of partitioning algorithms for massive graphs Web search identify communities locate hot spots trace targets combat link spam epidemics

5

6 Motivations Outline of the talk Conductance and Cheeger s inequality Four graph paritioning algorithms xxby using: eigenvectors random walks PageRank heat kernel Local graph algorithms Future directions

7 Two types of cuts: Vertex cut edge cut How good is the cut?

8 e(s,v-s) Vol S e(s,v-s) S Vol S = Σ v ε S deg(v) S = Σ 1 v ε S V-S S

9 The Cheeger constant for graphs The Cheeger constant h Φ G G = ess (, ) min S min( vol S, vol S ) The volume of S is vol( S) = dx h G and its variations are sometimes called Φ G x S conductance, isoperimetric number,

10 The Cheeger inequality The Cheeger constant Φh G = ess (, ) min S min( volsvols, ) The Cheeger inequality 2Φ λ G λ : the first nontrivial eigenvalue of the xx(normalized) Laplacian. Φ 2 G 2

11 The spectrum of a graph Adjacency matrix Many ways to define the spectrum of a graph. How are the eigenvalues related to properties of graphs?

12 The spectrum of a graph Adjacency matrix Combinatorial Laplacian L= D A diagonal degree matrix Normalized Laplacian Random walks Rate of convergence adjacency matrix

13 The spectrum of a graph Discrete Laplace operator 1 Δ f ( x) = ( f ( x) f( y)) d Lxy (, ) = not symmetric in general x y x 1 if x = y { 1 if x y and x y d x Normalized Laplacian symmetric normalized with eigenvalues L( x, y ) = { 1 if x = y 1 if x y and x y d d x 0= λ0 λ1 λn 1 2 = λ y

14 Can you hear the shape of a network? λ dictates many properties of a graph. connectivity diameter isoperimetry z(bottlenecks) How good is the cut by using the eigenvalue λ?

15 Finding a cut by a sweep Using a sweep by the eigenvector, can reduce the exponential number of choices of subsets to a linear number.

16 Finding a cut by a sweep Using a sweep by the eigenvector, can reduce the exponential number of choices of subsets to a linear number. Still, there is a lower bound guarantee by using the Cheeger inequality. 2Φ λ Φ 2 2

17 Partitioning algorithm The Cheeger inequality Using eigenvector f, the Cheeger inequality can be stated as α Φ 2Φ λ where λ is the first non-trivial eigenvalue of the Laplacian and α is the minimum Cheeger ratio in a sweep using the eigenvector. f

18 Eigenvalue problem for n x n matrix:. n 30 billion websites Hard to compute eigenvalues Even harder to compute eigenvectors

19 In the old days, compute for a given (whole) graph. In reality, can only afford to compute locally. (Access to a (huge) graph, e.g., for a vertex v, find its neighbors. Bounded number of access.)

20 Finding a cut by a sweep Using a sweep by the eigenvector can reduce the exponential number of choices of subsets to a linear number. Using a local sweep by random walks, PageRank and its variations can further reduce the a linear number of choices to a specified finite number of sizes.

21 Four one-sweep graph partitioning algorithms graph spectral method spectral partition algorithm random walks PageRank local partition algorithms heat kernel

22 4 Partitioning algorithm 4 Cheeger inequalities graph spectral method random walks PageRank Fiedler 73, Cheeger, 60 s Mihail 89 Lovasz, Simonovits, 90, 93 Spielman, Teng, 04 Andersen, Chung, Lang, 06 heat kernel Chung, PNAS, 08.

23 Graph partitioning Local graph partitioning Courtesy of Reid Andersen

24 What is a local graph partitioning algorithm? A local graph partitioning algorithm finds a small cut near the given seed(s) with running time depending only on the size of the output.

25 The definition of PageRank given by Brin and Page is based on random walks.

26 Partitioning Computing PageRank History of computing Pagerank Brin+Page 98 Personalized PageRank, Haveliwala 03 Computing personalized PageRank, xxxxxxxxxxxxxxxxxxxxxxxxjeh+widom 03 xxxxxxxxxxxxxxxxxxxxxxxx Berkhin 06

27 Random walks in a graph. G : a graph P : transition probability matrix Puv (, ) = 1 d u if u v, 0 otherwise. d u := the degree of u. A lazy walk: W = I + P 2

28 Original definition of PageRank A (bored) surfer either surf a random webpage with probability α or surf a linked webpage with probability 1- α α : the jumping constant p = α (,,..., ) (1 α n n n ) pw

29 Definition of personalized PageRank Two equivalent ways to define PageRank pr(α,s) (1) p = αs+ (1 α) pw s: the seed as a row vector α : the jumping constant s

30 Definition of PageRank Two equivalent ways to define PageRank p=pr(α,s) (1) p = αs+ (1 α) pw (2) p s = ( 1, 1,..., 1 n n n) t t = α (1 α) ( sw ) t= 0 s = some seed, e.g., (1,0,...,0) the (original) PageRank personalized PageRank (Organize the random walks by a scalar α.)

31 Partitioning algorithm using random walks Mihail 89, Lovász+Simonovits, 90, 93 k vol( S) W ( u, S) π ( S) 1 du β 2 k 8 k Leads to a Cheeger inequality: β 2Φ λ 8logn β G Φ 8logn 2 2 G where is the minimum Cheeger ratio over sweeps by using a lazy walk of k steps from every vertex for an appropriate range of k.

32 Algorithmic aspects of PageRank Fast approximation algorithm for x personalized PageRank greedy type algorithm, linear complexity Can use the jumping constant to approximate PageRank with a support of the desired size. Errors can be effectively bounded.

33 Approximate the pagerank vector : pr( α, s) = p+ pr( α, r) Approximate pagerank Residue vector

34 Partitioning algorithm using PageRank Using the PageRank vector with seed as a subset S vol( S) vol( G)/4, and a Cheeger inequality can be obtained : γ ΦS 8logs γ u Φ 2 2 u u 8log where is the minimum Cheeger ratio over sweeps by using personalized PageRank with a random seed in S. The volume of the set of such u is > vol(s)/4. s

35 A partitioning algorithm using PageRank Algorithm(φ,s,b): Compute ε-approximate Pagerank p=pr(α,s) with α=0.1/(φ 2 b), ε=2 -b /b. One sweep algorithm using p for finding cuts with conductance < φ. Performance analysis: If s is in a set S with conductance Φ>φ 2 log s, with constant probability, the algorithm outputs a cut C with condutance < φ, of size 1 order s and vol( C S) > vol( S). (Improving previous bounds by a factor of φlog s. ) 4

36 Courtesy of Reid Andersen.

37 Courtesy of Reid Andersen

38

39 Kevin Lang 2007

40 4 Partitioning algorithm 4 Cheeger inequalities graph spectral method random walks PageRank Fiedler 73, Cheeger, 60 s Mihail 89 Lovasz, Simonovits, 90, 93 Spielman, Teng, 04 Andersen, Chung, Lang, 06 heat kernel Chung, PNAS, 08.

41 PageRank versus heat kernel, s k k (1 ) ( sw ) ρts, k = 0 pα = α α t = e k = 0 s ( tw ) k! Geometric sum Exponential sum k

42 PageRank versus heat kernel p k k α, s = α (1 α) ( sw ) ρts, k = 0 t = e k = 0 s ( tw ) k! Geometric sum Exponential sum k p = α + (1 α) pw ρ = ρ( I W) t recurrence Heat equation

43 Definition of heat kernel t t t Ht = e I + tw + W + + W + 2 k! t( I W) = e = e tl 2 k 2 k (......) 2 k t 2 k t k = I tl+ L ( 1) L k! Ht = ( I W) Ht t ρ = sh ts, t

44 Partitioning algorithm using the heat kernel Theorem: vol( S) 2 tκtu, /4 ρtu, ( S) π( S) e du where κ tu, is the minimum Cheeger ratio over sweeps by using heat kernel pagerank over all u in S.

45 Partitioning algorithm using the heat kernel Theorem: vol( S) 2 tκtu, /4 ρtu, ( S) π( S) e du where κ tu, is the minimum Cheeger ratio over sweeps by using heat kernel pagerank over all u in S. Theorem: For vol S 2/3 ( ) vol( G), ρ, ( S ) π th ( S ) e S ts. (Improving the previous PageRank lower bound 1-t h S.)

46 Theorem: ( S), ( S ) ( S S ts ) (1 ( S )) e π ρ π π ht/1 ( ) Sketch of a proof: Consider Show Then Ft ( ) = log( ρ ( S) π ( S)) 2 Ft () 0 2 t ts, ΦS Ft () F(0) = t t 1 π S ( ) Solve and get /1 ( ( )), ( S ) ( S ht S S ts ) (1 ( S )) e π ρ π π

47 Random walks versus heat kernel How fast is the convergence to the stationary distribution? For what k, can one have fw k π? Choose t to satisfy the required property.

48 Partitioning algorithm using the heat kernel Using the upper and lower bounds, a Cheeger inequality can be obtained : 2 2 κ S ΦS ΦS λs 8 8 where λ S is the Dirichlet eigenvalue of the Laplacian, and κ S is the minimum Cheeger ratio over sweeps by using heat kernel with seeds S for appropriate t.

49 Dirichlet eigenvalues for a subset S V λ S = inf f u v ( f( u) f( v)) w f( w) 2 d w 2 S V over all f satisfying the Dirichlet boundary condition: f() v = 0 for all v S.

50 Partitioning algorithm using the heat kernel Using the upper and lower bounds, a Cheeger inequality can be obtained : 2 2 κ S ΦS ΦS λs 8 8 where λ S is the Dirichlet eigenvalue of the Laplacian, and κ S is the minimum Cheeger ratio over sweeps by using heat kernel with seeds S for appropriate t.

51 Partitioning algorithm using the heat kernel Using the upper and lower bounds, a Cheeger inequality can be obtained : 2 2 κ S ΦS ΦS λs 8 8 where λ S is the Dirichlet eigenvalue of the Laplacian, and κ S is the minimum Cheeger ratio over sweeps by using heat kernel with seeds S for appropriate t.

52 Partitioning algorithm using the heat kernel Using the upper and lower bounds, a Cheeger inequality can be obtained : 2 2 κu Φu ΦS λs 8logs 8logs where λ S is the Dirichlet eigenvalue of the Laplacian, and κ u is the minimum Cheeger ratio over sweeps by using heat kernel with a random seed in S. The volume of the set of such u is > vol(s)/4.

53 Courtesy of Reid Andersen

54 Future directions: Use PageRank and the heat kernel xxpagerank to shed light on: The geometry of graphs? Solving combinatorial problems, such xxas covering, packing, matching, etc. Graph drawing, visualization Metric embedding

55 Courtesy of Reid Andersen

The heat kernel as the pagerank of a graph

The heat kernel as the pagerank of a graph Classification: Physical Sciences, Mathematics The heat kernel as the pagerank of a graph by Fan Chung Department of Mathematics University of California at San Diego La Jolla, CA 9093-011 Corresponding

More information

A local graph partitioning algorithm using heat kernel pagerank

A local graph partitioning algorithm using heat kernel pagerank A local graph partitioning algorithm using heat kernel pagerank Fan Chung University of California at an Diego, La Jolla CA 92093, UA, fan@ucsd.edu, http://www.math.ucsd.edu/~fan/ Abstract. We give an

More information

Graph Partitioning Using Random Walks

Graph Partitioning Using Random Walks Graph Partitioning Using Random Walks A Convex Optimization Perspective Lorenzo Orecchia Computer Science Why Spectral Algorithms for Graph Problems in practice? Simple to implement Can exploit very efficient

More information

LOCAL CHEEGER INEQUALITIES AND SPARSE CUTS

LOCAL CHEEGER INEQUALITIES AND SPARSE CUTS LOCAL CHEEGER INEQUALITIES AND SPARSE CUTS CAELAN GARRETT 18.434 FINAL PROJECT CAELAN@MIT.EDU Abstract. When computing on massive graphs, it is not tractable to iterate over the full graph. Local graph

More information

Diffusion and random walks on graphs

Diffusion and random walks on graphs Diffusion and random walks on graphs Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Structural

More information

Lecture: Local Spectral Methods (2 of 4) 19 Computing spectral ranking with the push procedure

Lecture: Local Spectral Methods (2 of 4) 19 Computing spectral ranking with the push procedure Stat260/CS294: Spectral Graph Methods Lecture 19-04/02/2015 Lecture: Local Spectral Methods (2 of 4) Lecturer: Michael Mahoney Scribe: Michael Mahoney Warning: these notes are still very rough. They provide

More information

Detecting Sharp Drops in PageRank and a Simplified Local Partitioning Algorithm

Detecting Sharp Drops in PageRank and a Simplified Local Partitioning Algorithm Detecting Sharp Drops in PageRank and a Simplified Local Partitioning Algorithm Reid Andersen and Fan Chung University of California at San Diego, La Jolla CA 92093, USA, fan@ucsd.edu, http://www.math.ucsd.edu/~fan/,

More information

Algorithmic Primitives for Network Analysis: Through the Lens of the Laplacian Paradigm

Algorithmic Primitives for Network Analysis: Through the Lens of the Laplacian Paradigm Algorithmic Primitives for Network Analysis: Through the Lens of the Laplacian Paradigm Shang-Hua Teng Computer Science, Viterbi School of Engineering USC Massive Data and Massive Graphs 500 billions web

More information

Spectral Graph Theory and its Applications. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity

Spectral Graph Theory and its Applications. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity Spectral Graph Theory and its Applications Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity Outline Adjacency matrix and Laplacian Intuition, spectral graph drawing

More information

Cutting Graphs, Personal PageRank and Spilling Paint

Cutting Graphs, Personal PageRank and Spilling Paint Graphs and Networks Lecture 11 Cutting Graphs, Personal PageRank and Spilling Paint Daniel A. Spielman October 3, 2013 11.1 Disclaimer These notes are not necessarily an accurate representation of what

More information

Personal PageRank and Spilling Paint

Personal PageRank and Spilling Paint Graphs and Networks Lecture 11 Personal PageRank and Spilling Paint Daniel A. Spielman October 7, 2010 11.1 Overview These lecture notes are not complete. The paint spilling metaphor is due to Berkhin

More information

Random Walks and Evolving Sets: Faster Convergences and Limitations

Random Walks and Evolving Sets: Faster Convergences and Limitations Random Walks and Evolving Sets: Faster Convergences and Limitations Siu On Chan 1 Tsz Chiu Kwok 2 Lap Chi Lau 2 1 The Chinese University of Hong Kong 2 University of Waterloo Jan 18, 2017 1/23 Finding

More information

Spectral Graph Theory and You: Matrix Tree Theorem and Centrality Metrics

Spectral Graph Theory and You: Matrix Tree Theorem and Centrality Metrics Spectral Graph Theory and You: and Centrality Metrics Jonathan Gootenberg March 11, 2013 1 / 19 Outline of Topics 1 Motivation Basics of Spectral Graph Theory Understanding the characteristic polynomial

More information

Facebook Friends! and Matrix Functions

Facebook Friends! and Matrix Functions Facebook Friends! and Matrix Functions! Graduate Research Day Joint with David F. Gleich, (Purdue), supported by" NSF CAREER 1149756-CCF Kyle Kloster! Purdue University! Network Analysis Use linear algebra

More information

Spectral and Electrical Graph Theory. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity

Spectral and Electrical Graph Theory. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity Spectral and Electrical Graph Theory Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity Outline Spectral Graph Theory: Understand graphs through eigenvectors and

More information

Vectorized Laplacians for dealing with high-dimensional data sets

Vectorized Laplacians for dealing with high-dimensional data sets Vectorized Laplacians for dealing with high-dimensional data sets Fan Chung Graham University of California, San Diego The Power Grid Graph drawn by Derrick Bezanson 2010 Vectorized Laplacians ----- Graph

More information

A Local Spectral Method for Graphs: With Applications to Improving Graph Partitions and Exploring Data Graphs Locally

A Local Spectral Method for Graphs: With Applications to Improving Graph Partitions and Exploring Data Graphs Locally Journal of Machine Learning Research 13 (2012) 2339-2365 Submitted 11/12; Published 9/12 A Local Spectral Method for Graphs: With Applications to Improving Graph Partitions and Exploring Data Graphs Locally

More information

Data Analysis and Manifold Learning Lecture 7: Spectral Clustering

Data Analysis and Manifold Learning Lecture 7: Spectral Clustering Data Analysis and Manifold Learning Lecture 7: Spectral Clustering Radu Horaud INRIA Grenoble Rhone-Alpes, France Radu.Horaud@inrialpes.fr http://perception.inrialpes.fr/ Outline of Lecture 7 What is spectral

More information

Lecture: Local Spectral Methods (3 of 4) 20 An optimization perspective on local spectral methods

Lecture: Local Spectral Methods (3 of 4) 20 An optimization perspective on local spectral methods Stat260/CS294: Spectral Graph Methods Lecture 20-04/07/205 Lecture: Local Spectral Methods (3 of 4) Lecturer: Michael Mahoney Scribe: Michael Mahoney Warning: these notes are still very rough. They provide

More information

Machine Learning for Data Science (CS4786) Lecture 11

Machine Learning for Data Science (CS4786) Lecture 11 Machine Learning for Data Science (CS4786) Lecture 11 Spectral clustering Course Webpage : http://www.cs.cornell.edu/courses/cs4786/2016sp/ ANNOUNCEMENT 1 Assignment P1 the Diagnostic assignment 1 will

More information

Lecture 14: Random Walks, Local Graph Clustering, Linear Programming

Lecture 14: Random Walks, Local Graph Clustering, Linear Programming CSE 521: Design and Analysis of Algorithms I Winter 2017 Lecture 14: Random Walks, Local Graph Clustering, Linear Programming Lecturer: Shayan Oveis Gharan 3/01/17 Scribe: Laura Vonessen Disclaimer: These

More information

Data Analysis and Manifold Learning Lecture 3: Graphs, Graph Matrices, and Graph Embeddings

Data Analysis and Manifold Learning Lecture 3: Graphs, Graph Matrices, and Graph Embeddings Data Analysis and Manifold Learning Lecture 3: Graphs, Graph Matrices, and Graph Embeddings Radu Horaud INRIA Grenoble Rhone-Alpes, France Radu.Horaud@inrialpes.fr http://perception.inrialpes.fr/ Outline

More information

Graph Partitioning Algorithms and Laplacian Eigenvalues

Graph Partitioning Algorithms and Laplacian Eigenvalues Graph Partitioning Algorithms and Laplacian Eigenvalues Luca Trevisan Stanford Based on work with Tsz Chiu Kwok, Lap Chi Lau, James Lee, Yin Tat Lee, and Shayan Oveis Gharan spectral graph theory Use linear

More information

Spectral Analysis of Random Walks

Spectral Analysis of Random Walks Graphs and Networks Lecture 9 Spectral Analysis of Random Walks Daniel A. Spielman September 26, 2013 9.1 Disclaimer These notes are not necessarily an accurate representation of what happened in class.

More information

Spectral Graph Theory Lecture 2. The Laplacian. Daniel A. Spielman September 4, x T M x. ψ i = arg min

Spectral Graph Theory Lecture 2. The Laplacian. Daniel A. Spielman September 4, x T M x. ψ i = arg min Spectral Graph Theory Lecture 2 The Laplacian Daniel A. Spielman September 4, 2015 Disclaimer These notes are not necessarily an accurate representation of what happened in class. The notes written before

More information

Locally-biased analytics

Locally-biased analytics Locally-biased analytics You have BIG data and want to analyze a small part of it: Solution 1: Cut out small part and use traditional methods Challenge: cutting out may be difficult a priori Solution 2:

More information

1 Matrix notation and preliminaries from spectral graph theory

1 Matrix notation and preliminaries from spectral graph theory Graph clustering (or community detection or graph partitioning) is one of the most studied problems in network analysis. One reason for this is that there are a variety of ways to define a cluster or community.

More information

Lecture 13: Spectral Graph Theory

Lecture 13: Spectral Graph Theory CSE 521: Design and Analysis of Algorithms I Winter 2017 Lecture 13: Spectral Graph Theory Lecturer: Shayan Oveis Gharan 11/14/18 Disclaimer: These notes have not been subjected to the usual scrutiny reserved

More information

Networks and Their Spectra

Networks and Their Spectra Networks and Their Spectra Victor Amelkin University of California, Santa Barbara Department of Computer Science victor@cs.ucsb.edu December 4, 2017 1 / 18 Introduction Networks (= graphs) are everywhere.

More information

Lecture: Local Spectral Methods (1 of 4)

Lecture: Local Spectral Methods (1 of 4) Stat260/CS294: Spectral Graph Methods Lecture 18-03/31/2015 Lecture: Local Spectral Methods (1 of 4) Lecturer: Michael Mahoney Scribe: Michael Mahoney Warning: these notes are still very rough. They provide

More information

Algorithms, Graph Theory, and Linear Equa9ons in Laplacians. Daniel A. Spielman Yale University

Algorithms, Graph Theory, and Linear Equa9ons in Laplacians. Daniel A. Spielman Yale University Algorithms, Graph Theory, and Linear Equa9ons in Laplacians Daniel A. Spielman Yale University Shang- Hua Teng Carter Fussell TPS David Harbater UPenn Pavel Cur9s Todd Knoblock CTY Serge Lang 1927-2005

More information

Eigenvalue Problems Computation and Applications

Eigenvalue Problems Computation and Applications Eigenvalue ProblemsComputation and Applications p. 1/36 Eigenvalue Problems Computation and Applications Che-Rung Lee cherung@gmail.com National Tsing Hua University Eigenvalue ProblemsComputation and

More information

Lab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018

Lab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018 Lab 8: Measuring Graph Centrality - PageRank Monday, November 5 CompSci 531, Fall 2018 Outline Measuring Graph Centrality: Motivation Random Walks, Markov Chains, and Stationarity Distributions Google

More information

Algorithms, Graph Theory, and the Solu7on of Laplacian Linear Equa7ons. Daniel A. Spielman Yale University

Algorithms, Graph Theory, and the Solu7on of Laplacian Linear Equa7ons. Daniel A. Spielman Yale University Algorithms, Graph Theory, and the Solu7on of Laplacian Linear Equa7ons Daniel A. Spielman Yale University Rutgers, Dec 6, 2011 Outline Linear Systems in Laplacian Matrices What? Why? Classic ways to solve

More information

Laplacian and Random Walks on Graphs

Laplacian and Random Walks on Graphs Laplacian and Random Walks on Graphs Linyuan Lu University of South Carolina Selected Topics on Spectral Graph Theory (II) Nankai University, Tianjin, May 22, 2014 Five talks Selected Topics on Spectral

More information

CMPSCI 791BB: Advanced ML: Laplacian Learning

CMPSCI 791BB: Advanced ML: Laplacian Learning CMPSCI 791BB: Advanced ML: Laplacian Learning Sridhar Mahadevan Outline! Spectral graph operators! Combinatorial graph Laplacian! Normalized graph Laplacian! Random walks! Machine learning on graphs! Clustering!

More information

Using Local Spectral Methods in Theory and in Practice

Using Local Spectral Methods in Theory and in Practice Using Local Spectral Methods in Theory and in Practice Michael W. Mahoney ICSI and Dept of Statistics, UC Berkeley ( For more info, see: http: // www. stat. berkeley. edu/ ~ mmahoney or Google on Michael

More information

A local algorithm for finding dense bipartite-like subgraphs

A local algorithm for finding dense bipartite-like subgraphs A local algorithm for finding dense bipartite-like subgraphs Pan Peng 1,2 1 State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences 2 School of Information Science

More information

Heat Kernel Based Community Detection

Heat Kernel Based Community Detection Heat Kernel Based Community Detection Joint with David F. Gleich, (Purdue), supported by" NSF CAREER 1149756-CCF Kyle Kloster! Purdue University! Local Community Detection Given seed(s) S in G, find a

More information

Link Analysis Ranking

Link Analysis Ranking Link Analysis Ranking How do search engines decide how to rank your query results? Guess why Google ranks the query results the way it does How would you do it? Naïve ranking of query results Given query

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO. Local and Distributed Computation for Large Graphs

UNIVERSITY OF CALIFORNIA, SAN DIEGO. Local and Distributed Computation for Large Graphs UNIVERSITY OF CALIFORNIA, SAN DIEGO Local and Distributed Computation for Large Graphs A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Computer

More information

Finding central nodes in large networks

Finding central nodes in large networks Finding central nodes in large networks Nelly Litvak University of Twente Eindhoven University of Technology, The Netherlands Woudschoten Conference 2017 Complex networks Networks: Internet, WWW, social

More information

Parallel Local Graph Clustering

Parallel Local Graph Clustering Parallel Local Graph Clustering Kimon Fountoulakis, joint work with J. Shun, X. Cheng, F. Roosta-Khorasani, M. Mahoney, D. Gleich University of California Berkeley and Purdue University Based on J. Shun,

More information

ECEN 689 Special Topics in Data Science for Communications Networks

ECEN 689 Special Topics in Data Science for Communications Networks ECEN 689 Special Topics in Data Science for Communications Networks Nick Duffield Department of Electrical & Computer Engineering Texas A&M University Lecture 8 Random Walks, Matrices and PageRank Graphs

More information

Laplacian Matrices of Graphs: Spectral and Electrical Theory

Laplacian Matrices of Graphs: Spectral and Electrical Theory Laplacian Matrices of Graphs: Spectral and Electrical Theory Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale University Toronto, Sep. 28, 2 Outline Introduction to graphs

More information

Spectral Graph Theory

Spectral Graph Theory Spectral raph Theory to appear in Handbook of Linear Algebra, second edition, CCR Press Steve Butler Fan Chung There are many different ways to associate a matrix with a graph (an introduction of which

More information

Convergence of Random Walks

Convergence of Random Walks Graphs and Networks Lecture 9 Convergence of Random Walks Daniel A. Spielman September 30, 010 9.1 Overview We begin by reviewing the basics of spectral theory. We then apply this theory to show that lazy

More information

The Push Algorithm for Spectral Ranking

The Push Algorithm for Spectral Ranking The Push Algorithm for Spectral Ranking Paolo Boldi Sebastiano Vigna March 8, 204 Abstract The push algorithm was proposed first by Jeh and Widom [6] in the context of personalized PageRank computations

More information

Lecture 12: Introduction to Spectral Graph Theory, Cheeger s inequality

Lecture 12: Introduction to Spectral Graph Theory, Cheeger s inequality CSE 521: Design and Analysis of Algorithms I Spring 2016 Lecture 12: Introduction to Spectral Graph Theory, Cheeger s inequality Lecturer: Shayan Oveis Gharan May 4th Scribe: Gabriel Cadamuro Disclaimer:

More information

PageRank. Ryan Tibshirani /36-662: Data Mining. January Optional reading: ESL 14.10

PageRank. Ryan Tibshirani /36-662: Data Mining. January Optional reading: ESL 14.10 PageRank Ryan Tibshirani 36-462/36-662: Data Mining January 24 2012 Optional reading: ESL 14.10 1 Information retrieval with the web Last time we learned about information retrieval. We learned how to

More information

Conductance, the Normalized Laplacian, and Cheeger s Inequality

Conductance, the Normalized Laplacian, and Cheeger s Inequality Spectral Graph Theory Lecture 6 Conductance, the Normalized Laplacian, and Cheeger s Inequality Daniel A. Spielman September 17, 2012 6.1 About these notes These notes are not necessarily an accurate representation

More information

Spectral Graph Theory

Spectral Graph Theory Spectral Graph Theory Aaron Mishtal April 27, 2016 1 / 36 Outline Overview Linear Algebra Primer History Theory Applications Open Problems Homework Problems References 2 / 36 Outline Overview Linear Algebra

More information

CS168: The Modern Algorithmic Toolbox Lectures #11 and #12: Spectral Graph Theory

CS168: The Modern Algorithmic Toolbox Lectures #11 and #12: Spectral Graph Theory CS168: The Modern Algorithmic Toolbox Lectures #11 and #12: Spectral Graph Theory Tim Roughgarden & Gregory Valiant May 2, 2016 Spectral graph theory is the powerful and beautiful theory that arises from

More information

CS6220: DATA MINING TECHNIQUES

CS6220: DATA MINING TECHNIQUES CS6220: DATA MINING TECHNIQUES Mining Graph/Network Data Instructor: Yizhou Sun yzsun@ccs.neu.edu March 16, 2016 Methods to Learn Classification Clustering Frequent Pattern Mining Matrix Data Decision

More information

Certifying the Global Optimality of Graph Cuts via Semidefinite Programming: A Theoretic Guarantee for Spectral Clustering

Certifying the Global Optimality of Graph Cuts via Semidefinite Programming: A Theoretic Guarantee for Spectral Clustering Certifying the Global Optimality of Graph Cuts via Semidefinite Programming: A Theoretic Guarantee for Spectral Clustering Shuyang Ling Courant Institute of Mathematical Sciences, NYU Aug 13, 2018 Joint

More information

Eugene Wigner [4]: in the natural sciences, Communications in Pure and Applied Mathematics, XIII, (1960), 1 14.

Eugene Wigner [4]: in the natural sciences, Communications in Pure and Applied Mathematics, XIII, (1960), 1 14. Introduction Eugene Wigner [4]: The miracle of the appropriateness of the language of mathematics for the formulation of the laws of physics is a wonderful gift which we neither understand nor deserve.

More information

Regularized Laplacian Estimation and Fast Eigenvector Approximation

Regularized Laplacian Estimation and Fast Eigenvector Approximation Regularized Laplacian Estimation and Fast Eigenvector Approximation Patrick O. Perry Information, Operations, and Management Sciences NYU Stern School of Business New York, NY 1001 pperry@stern.nyu.edu

More information

Linear algebra and applications to graphs Part 1

Linear algebra and applications to graphs Part 1 Linear algebra and applications to graphs Part 1 Written up by Mikhail Belkin and Moon Duchin Instructor: Laszlo Babai June 17, 2001 1 Basic Linear Algebra Exercise 1.1 Let V and W be linear subspaces

More information

Data Analysis and Manifold Learning Lecture 9: Diffusion on Manifolds and on Graphs

Data Analysis and Manifold Learning Lecture 9: Diffusion on Manifolds and on Graphs Data Analysis and Manifold Learning Lecture 9: Diffusion on Manifolds and on Graphs Radu Horaud INRIA Grenoble Rhone-Alpes, France Radu.Horaud@inrialpes.fr http://perception.inrialpes.fr/ Outline of Lecture

More information

0.1 Naive formulation of PageRank

0.1 Naive formulation of PageRank PageRank is a ranking system designed to find the best pages on the web. A webpage is considered good if it is endorsed (i.e. linked to) by other good webpages. The more webpages link to it, and the more

More information

Online Social Networks and Media. Link Analysis and Web Search

Online Social Networks and Media. Link Analysis and Web Search Online Social Networks and Media Link Analysis and Web Search How to Organize the Web First try: Human curated Web directories Yahoo, DMOZ, LookSmart How to organize the web Second try: Web Search Information

More information

1998: enter Link Analysis

1998: enter Link Analysis 1998: enter Link Analysis uses hyperlink structure to focus the relevant set combine traditional IR score with popularity score Page and Brin 1998 Kleinberg Web Information Retrieval IR before the Web

More information

eigenvalues, markov matrices, and the power method

eigenvalues, markov matrices, and the power method eigenvalues, markov matrices, and the power method Slides by Olson. Some taken loosely from Jeff Jauregui, Some from Semeraro L. Olson Department of Computer Science University of Illinois at Urbana-Champaign

More information

Multi-commodity allocation for dynamic demands using PageRank vectors

Multi-commodity allocation for dynamic demands using PageRank vectors Multi-commodity allocation for dynamic demands using PageRank vectors Fan Chung, Paul Horn, and Jacob Hughes University of California, San Diego Harvard University Abstract. We consider a variant of the

More information

Local properties of PageRank and graph limits. Nelly Litvak University of Twente Eindhoven University of Technology, The Netherlands MIPT 2018

Local properties of PageRank and graph limits. Nelly Litvak University of Twente Eindhoven University of Technology, The Netherlands MIPT 2018 Local properties of PageRank and graph limits Nelly Litvak University of Twente Eindhoven University of Technology, The Netherlands MIPT 2018 Centrality in networks Network as a graph G = (V, E) Centrality

More information

Spectral radius, symmetric and positive matrices

Spectral radius, symmetric and positive matrices Spectral radius, symmetric and positive matrices Zdeněk Dvořák April 28, 2016 1 Spectral radius Definition 1. The spectral radius of a square matrix A is ρ(a) = max{ λ : λ is an eigenvalue of A}. For an

More information

Analytically tractable processes on networks

Analytically tractable processes on networks University of California San Diego CERTH, 25 May 2011 Outline Motivation 1 Motivation Networks Random walk and Consensus Epidemic models Spreading processes on networks 2 Networks Motivation Networks Random

More information

Today. Next lecture. (Ch 14) Markov chains and hidden Markov models

Today. Next lecture. (Ch 14) Markov chains and hidden Markov models Today (Ch 14) Markov chains and hidden Markov models Graphical representation Transition probability matrix Propagating state distributions The stationary distribution Next lecture (Ch 14) Markov chains

More information

Beyond Scalar Affinities for Network Analysis or Vector Diffusion Maps and the Connection Laplacian

Beyond Scalar Affinities for Network Analysis or Vector Diffusion Maps and the Connection Laplacian Beyond Scalar Affinities for Network Analysis or Vector Diffusion Maps and the Connection Laplacian Amit Singer Princeton University Department of Mathematics and Program in Applied and Computational Mathematics

More information

Combinatorics of optimal designs

Combinatorics of optimal designs Combinatorics of optimal designs R A Bailey and Peter J Cameron pjcameron@qmulacuk British Combinatorial Conference, St Andrews, July 2009 Mathematicians and statisticians There is a very famous joke about

More information

An Algorithmist s Toolkit September 10, Lecture 1

An Algorithmist s Toolkit September 10, Lecture 1 18.409 An Algorithmist s Toolkit September 10, 2009 Lecture 1 Lecturer: Jonathan Kelner Scribe: Jesse Geneson (2009) 1 Overview The class s goals, requirements, and policies were introduced, and topics

More information

Introducing the Laplacian

Introducing the Laplacian Introducing the Laplacian Prepared by: Steven Butler October 3, 005 A look ahead to weighted edges To this point we have looked at random walks where at each vertex we have equal probability of moving

More information

Mining of Massive Datasets Jure Leskovec, AnandRajaraman, Jeff Ullman Stanford University

Mining of Massive Datasets Jure Leskovec, AnandRajaraman, Jeff Ullman Stanford University Note to other teachers and users of these slides: We would be delighted if you found this our material useful in giving your own lectures. Feel free to use these slides verbatim, or to modify them to fit

More information

Spectral Clustering. Spectral Clustering? Two Moons Data. Spectral Clustering Algorithm: Bipartioning. Spectral methods

Spectral Clustering. Spectral Clustering? Two Moons Data. Spectral Clustering Algorithm: Bipartioning. Spectral methods Spectral Clustering Seungjin Choi Department of Computer Science POSTECH, Korea seungjin@postech.ac.kr 1 Spectral methods Spectral Clustering? Methods using eigenvectors of some matrices Involve eigen-decomposition

More information

eigenvalue bounds and metric uniformization Punya Biswal & James R. Lee University of Washington Satish Rao U. C. Berkeley

eigenvalue bounds and metric uniformization Punya Biswal & James R. Lee University of Washington Satish Rao U. C. Berkeley eigenvalue bounds and metric uniformization Punya Biswal & James R. Lee University of Washington Satish Rao U. C. Berkeley separators in planar graphs Lipton and Tarjan (1980) showed that every n-vertex

More information

CS6220: DATA MINING TECHNIQUES

CS6220: DATA MINING TECHNIQUES CS6220: DATA MINING TECHNIQUES Mining Graph/Network Data Instructor: Yizhou Sun yzsun@ccs.neu.edu November 16, 2015 Methods to Learn Classification Clustering Frequent Pattern Mining Matrix Data Decision

More information

Markov Chains, Random Walks on Graphs, and the Laplacian

Markov Chains, Random Walks on Graphs, and the Laplacian Markov Chains, Random Walks on Graphs, and the Laplacian CMPSCI 791BB: Advanced ML Sridhar Mahadevan Random Walks! There is significant interest in the problem of random walks! Markov chain analysis! Computer

More information

Spectral Graph Theory Lecture 3. Fundamental Graphs. Daniel A. Spielman September 5, 2018

Spectral Graph Theory Lecture 3. Fundamental Graphs. Daniel A. Spielman September 5, 2018 Spectral Graph Theory Lecture 3 Fundamental Graphs Daniel A. Spielman September 5, 2018 3.1 Overview We will bound and derive the eigenvalues of the Laplacian matrices of some fundamental graphs, including

More information

Solving Local Linear Systems with Boundary Conditions Using Heat Kernel Pagerank

Solving Local Linear Systems with Boundary Conditions Using Heat Kernel Pagerank olving Local Linear ystems with Boundary Conditions Using Heat Kernel Pagerank Fan Chung and Olivia impson Department of Computer cience and Engineering, University of California, an Diego La Jolla, CA

More information

Data Mining Recitation Notes Week 3

Data Mining Recitation Notes Week 3 Data Mining Recitation Notes Week 3 Jack Rae January 28, 2013 1 Information Retrieval Given a set of documents, pull the (k) most similar document(s) to a given query. 1.1 Setup Say we have D documents

More information

MAE 298, Lecture 8 Feb 4, Web search and decentralized search on small-worlds

MAE 298, Lecture 8 Feb 4, Web search and decentralized search on small-worlds MAE 298, Lecture 8 Feb 4, 2008 Web search and decentralized search on small-worlds Search for information Assume some resource of interest is stored at the vertices of a network: Web pages Files in a file-sharing

More information

Clustering compiled by Alvin Wan from Professor Benjamin Recht s lecture, Samaneh s discussion

Clustering compiled by Alvin Wan from Professor Benjamin Recht s lecture, Samaneh s discussion Clustering compiled by Alvin Wan from Professor Benjamin Recht s lecture, Samaneh s discussion 1 Overview With clustering, we have several key motivations: archetypes (factor analysis) segmentation hierarchy

More information

Introduction to Search Engine Technology Introduction to Link Structure Analysis. Ronny Lempel Yahoo Labs, Haifa

Introduction to Search Engine Technology Introduction to Link Structure Analysis. Ronny Lempel Yahoo Labs, Haifa Introduction to Search Engine Technology Introduction to Link Structure Analysis Ronny Lempel Yahoo Labs, Haifa Outline Anchor-text indexing Mathematical Background Motivation for link structure analysis

More information

Graph Models The PageRank Algorithm

Graph Models The PageRank Algorithm Graph Models The PageRank Algorithm Anna-Karin Tornberg Mathematical Models, Analysis and Simulation Fall semester, 2013 The PageRank Algorithm I Invented by Larry Page and Sergey Brin around 1998 and

More information

Mathematical Properties & Analysis of Google s PageRank

Mathematical Properties & Analysis of Google s PageRank Mathematical Properties & Analysis of Google s PageRank Ilse Ipsen North Carolina State University, USA Joint work with Rebecca M. Wills Cedya p.1 PageRank An objective measure of the citation importance

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu 2/7/2012 Jure Leskovec, Stanford C246: Mining Massive Datasets 2 Web pages are not equally important www.joe-schmoe.com

More information

PSRGs via Random Walks on Graphs

PSRGs via Random Walks on Graphs Spectral Graph Theory Lecture 11 PSRGs via Random Walks on Graphs Daniel A. Spielman October 3, 2012 11.1 Overview There has been a lot of work on the design of Pseudo-Random Number Generators (PSRGs)

More information

Testing Cluster Structure of Graphs. Artur Czumaj

Testing Cluster Structure of Graphs. Artur Czumaj Testing Cluster Structure of Graphs Artur Czumaj DIMAP and Department of Computer Science University of Warwick Joint work with Pan Peng and Christian Sohler (TU Dortmund) Dealing with BigData in Graphs

More information

CS249: ADVANCED DATA MINING

CS249: ADVANCED DATA MINING CS249: ADVANCED DATA MINING Graph and Network Instructor: Yizhou Sun yzsun@cs.ucla.edu May 31, 2017 Methods Learnt Classification Clustering Vector Data Text Data Recommender System Decision Tree; Naïve

More information

Network Essence: PageRank Completion and Centrality-Conforming Markov Chains

Network Essence: PageRank Completion and Centrality-Conforming Markov Chains Network Essence: PageRank Completion and Centrality-Conforming Markov Chains Shang-Hua Teng Computer Science and Mathematics USC November 29, 2016 Abstract Jiří Matoušek (1963-2015) had many breakthrough

More information

Kristina Lerman USC Information Sciences Institute

Kristina Lerman USC Information Sciences Institute Rethinking Network Structure Kristina Lerman USC Information Sciences Institute Università della Svizzera Italiana, December 16, 2011 Measuring network structure Central nodes Community structure Strength

More information

arxiv: v1 [math.co] 27 Jul 2012

arxiv: v1 [math.co] 27 Jul 2012 Harnack inequalities for graphs with non-negative Ricci curvature arxiv:1207.6612v1 [math.co] 27 Jul 2012 Fan Chung University of California, San Diego S.-T. Yau Harvard University May 2, 2014 Abstract

More information

Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University.

Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University. Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University http://www.mmds.org #1: C4.5 Decision Tree - Classification (61 votes) #2: K-Means - Clustering

More information

Dirichlet PageRank and Ranking Algorithms Based on Trust and Distrust

Dirichlet PageRank and Ranking Algorithms Based on Trust and Distrust Dirichlet PageRank and Ranking Algorithms Based on Trust and Distrust Fan Chung, Alexander Tsiatas, and Wensong Xu Department of Computer Science and Engineering University of California, San Diego {fan,atsiatas,w4xu}@cs.ucsd.edu

More information

STA141C: Big Data & High Performance Statistical Computing

STA141C: Big Data & High Performance Statistical Computing STA141C: Big Data & High Performance Statistical Computing Lecture 12: Graph Clustering Cho-Jui Hsieh UC Davis May 29, 2018 Graph Clustering Given a graph G = (V, E, W ) V : nodes {v 1,, v n } E: edges

More information

Algebraic Representation of Networks

Algebraic Representation of Networks Algebraic Representation of Networks 0 1 2 1 1 0 0 1 2 0 0 1 1 1 1 1 Hiroki Sayama sayama@binghamton.edu Describing networks with matrices (1) Adjacency matrix A matrix with rows and columns labeled by

More information

DATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS

DATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS DATA MINING LECTURE 3 Link Analysis Ranking PageRank -- Random walks HITS How to organize the web First try: Manually curated Web Directories How to organize the web Second try: Web Search Information

More information

Communities Via Laplacian Matrices. Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices

Communities Via Laplacian Matrices. Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices Communities Via Laplacian Matrices Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices The Laplacian Approach As with betweenness approach, we want to divide a social graph into

More information

Finding normalized and modularity cuts by spectral clustering. Ljubjana 2010, October

Finding normalized and modularity cuts by spectral clustering. Ljubjana 2010, October Finding normalized and modularity cuts by spectral clustering Marianna Bolla Institute of Mathematics Budapest University of Technology and Economics marib@math.bme.hu Ljubjana 2010, October Outline Find

More information

Ranking and sparsifying a connection graph

Ranking and sparsifying a connection graph Ranking and sparsifying a connection graph Fan Chung 1, and Wenbo Zhao 1 Department of Mathematics Department of Computer Science and Engineering University of California, San Diego La Jolla, CA 9093,

More information