Four graph partitioning algorithms. Fan Chung University of California, San Diego
|
|
- Myron Reeves
- 5 years ago
- Views:
Transcription
1 Four graph partitioning algorithms Fan Chung University of California, San Diego
2 History of graph partitioning NP-hard approximation algorithms Spectral method, Fiedler 73, Folklore Multicommunity flow, Leighton+Rao 88 Semidefinite programming, xxxxxxxxxxxxxarora+rao+vazirani 04 Expander flow, Arora+Hazan+Kale 04 Single commodity flows, xxxxxxxxxxkhandekar+rao+vazirani 06
3 traditional applications of graph partition algorithms: Divide-and-conquer algorithms Circuit layout & designs Parallel computing Hierarchical clusterings Bioinformatics
4 Applications of partitioning algorithms for massive graphs Web search identify communities locate hot spots trace targets combat link spam epidemics
5
6 Motivations Outline of the talk Conductance and Cheeger s inequality Four graph paritioning algorithms xxby using: eigenvectors random walks PageRank heat kernel Local graph algorithms Future directions
7 Two types of cuts: Vertex cut edge cut How good is the cut?
8 e(s,v-s) Vol S e(s,v-s) S Vol S = Σ v ε S deg(v) S = Σ 1 v ε S V-S S
9 The Cheeger constant for graphs The Cheeger constant h Φ G G = ess (, ) min S min( vol S, vol S ) The volume of S is vol( S) = dx h G and its variations are sometimes called Φ G x S conductance, isoperimetric number,
10 The Cheeger inequality The Cheeger constant Φh G = ess (, ) min S min( volsvols, ) The Cheeger inequality 2Φ λ G λ : the first nontrivial eigenvalue of the xx(normalized) Laplacian. Φ 2 G 2
11 The spectrum of a graph Adjacency matrix Many ways to define the spectrum of a graph. How are the eigenvalues related to properties of graphs?
12 The spectrum of a graph Adjacency matrix Combinatorial Laplacian L= D A diagonal degree matrix Normalized Laplacian Random walks Rate of convergence adjacency matrix
13 The spectrum of a graph Discrete Laplace operator 1 Δ f ( x) = ( f ( x) f( y)) d Lxy (, ) = not symmetric in general x y x 1 if x = y { 1 if x y and x y d x Normalized Laplacian symmetric normalized with eigenvalues L( x, y ) = { 1 if x = y 1 if x y and x y d d x 0= λ0 λ1 λn 1 2 = λ y
14 Can you hear the shape of a network? λ dictates many properties of a graph. connectivity diameter isoperimetry z(bottlenecks) How good is the cut by using the eigenvalue λ?
15 Finding a cut by a sweep Using a sweep by the eigenvector, can reduce the exponential number of choices of subsets to a linear number.
16 Finding a cut by a sweep Using a sweep by the eigenvector, can reduce the exponential number of choices of subsets to a linear number. Still, there is a lower bound guarantee by using the Cheeger inequality. 2Φ λ Φ 2 2
17 Partitioning algorithm The Cheeger inequality Using eigenvector f, the Cheeger inequality can be stated as α Φ 2Φ λ where λ is the first non-trivial eigenvalue of the Laplacian and α is the minimum Cheeger ratio in a sweep using the eigenvector. f
18 Eigenvalue problem for n x n matrix:. n 30 billion websites Hard to compute eigenvalues Even harder to compute eigenvectors
19 In the old days, compute for a given (whole) graph. In reality, can only afford to compute locally. (Access to a (huge) graph, e.g., for a vertex v, find its neighbors. Bounded number of access.)
20 Finding a cut by a sweep Using a sweep by the eigenvector can reduce the exponential number of choices of subsets to a linear number. Using a local sweep by random walks, PageRank and its variations can further reduce the a linear number of choices to a specified finite number of sizes.
21 Four one-sweep graph partitioning algorithms graph spectral method spectral partition algorithm random walks PageRank local partition algorithms heat kernel
22 4 Partitioning algorithm 4 Cheeger inequalities graph spectral method random walks PageRank Fiedler 73, Cheeger, 60 s Mihail 89 Lovasz, Simonovits, 90, 93 Spielman, Teng, 04 Andersen, Chung, Lang, 06 heat kernel Chung, PNAS, 08.
23 Graph partitioning Local graph partitioning Courtesy of Reid Andersen
24 What is a local graph partitioning algorithm? A local graph partitioning algorithm finds a small cut near the given seed(s) with running time depending only on the size of the output.
25 The definition of PageRank given by Brin and Page is based on random walks.
26 Partitioning Computing PageRank History of computing Pagerank Brin+Page 98 Personalized PageRank, Haveliwala 03 Computing personalized PageRank, xxxxxxxxxxxxxxxxxxxxxxxxjeh+widom 03 xxxxxxxxxxxxxxxxxxxxxxxx Berkhin 06
27 Random walks in a graph. G : a graph P : transition probability matrix Puv (, ) = 1 d u if u v, 0 otherwise. d u := the degree of u. A lazy walk: W = I + P 2
28 Original definition of PageRank A (bored) surfer either surf a random webpage with probability α or surf a linked webpage with probability 1- α α : the jumping constant p = α (,,..., ) (1 α n n n ) pw
29 Definition of personalized PageRank Two equivalent ways to define PageRank pr(α,s) (1) p = αs+ (1 α) pw s: the seed as a row vector α : the jumping constant s
30 Definition of PageRank Two equivalent ways to define PageRank p=pr(α,s) (1) p = αs+ (1 α) pw (2) p s = ( 1, 1,..., 1 n n n) t t = α (1 α) ( sw ) t= 0 s = some seed, e.g., (1,0,...,0) the (original) PageRank personalized PageRank (Organize the random walks by a scalar α.)
31 Partitioning algorithm using random walks Mihail 89, Lovász+Simonovits, 90, 93 k vol( S) W ( u, S) π ( S) 1 du β 2 k 8 k Leads to a Cheeger inequality: β 2Φ λ 8logn β G Φ 8logn 2 2 G where is the minimum Cheeger ratio over sweeps by using a lazy walk of k steps from every vertex for an appropriate range of k.
32 Algorithmic aspects of PageRank Fast approximation algorithm for x personalized PageRank greedy type algorithm, linear complexity Can use the jumping constant to approximate PageRank with a support of the desired size. Errors can be effectively bounded.
33 Approximate the pagerank vector : pr( α, s) = p+ pr( α, r) Approximate pagerank Residue vector
34 Partitioning algorithm using PageRank Using the PageRank vector with seed as a subset S vol( S) vol( G)/4, and a Cheeger inequality can be obtained : γ ΦS 8logs γ u Φ 2 2 u u 8log where is the minimum Cheeger ratio over sweeps by using personalized PageRank with a random seed in S. The volume of the set of such u is > vol(s)/4. s
35 A partitioning algorithm using PageRank Algorithm(φ,s,b): Compute ε-approximate Pagerank p=pr(α,s) with α=0.1/(φ 2 b), ε=2 -b /b. One sweep algorithm using p for finding cuts with conductance < φ. Performance analysis: If s is in a set S with conductance Φ>φ 2 log s, with constant probability, the algorithm outputs a cut C with condutance < φ, of size 1 order s and vol( C S) > vol( S). (Improving previous bounds by a factor of φlog s. ) 4
36 Courtesy of Reid Andersen.
37 Courtesy of Reid Andersen
38
39 Kevin Lang 2007
40 4 Partitioning algorithm 4 Cheeger inequalities graph spectral method random walks PageRank Fiedler 73, Cheeger, 60 s Mihail 89 Lovasz, Simonovits, 90, 93 Spielman, Teng, 04 Andersen, Chung, Lang, 06 heat kernel Chung, PNAS, 08.
41 PageRank versus heat kernel, s k k (1 ) ( sw ) ρts, k = 0 pα = α α t = e k = 0 s ( tw ) k! Geometric sum Exponential sum k
42 PageRank versus heat kernel p k k α, s = α (1 α) ( sw ) ρts, k = 0 t = e k = 0 s ( tw ) k! Geometric sum Exponential sum k p = α + (1 α) pw ρ = ρ( I W) t recurrence Heat equation
43 Definition of heat kernel t t t Ht = e I + tw + W + + W + 2 k! t( I W) = e = e tl 2 k 2 k (......) 2 k t 2 k t k = I tl+ L ( 1) L k! Ht = ( I W) Ht t ρ = sh ts, t
44 Partitioning algorithm using the heat kernel Theorem: vol( S) 2 tκtu, /4 ρtu, ( S) π( S) e du where κ tu, is the minimum Cheeger ratio over sweeps by using heat kernel pagerank over all u in S.
45 Partitioning algorithm using the heat kernel Theorem: vol( S) 2 tκtu, /4 ρtu, ( S) π( S) e du where κ tu, is the minimum Cheeger ratio over sweeps by using heat kernel pagerank over all u in S. Theorem: For vol S 2/3 ( ) vol( G), ρ, ( S ) π th ( S ) e S ts. (Improving the previous PageRank lower bound 1-t h S.)
46 Theorem: ( S), ( S ) ( S S ts ) (1 ( S )) e π ρ π π ht/1 ( ) Sketch of a proof: Consider Show Then Ft ( ) = log( ρ ( S) π ( S)) 2 Ft () 0 2 t ts, ΦS Ft () F(0) = t t 1 π S ( ) Solve and get /1 ( ( )), ( S ) ( S ht S S ts ) (1 ( S )) e π ρ π π
47 Random walks versus heat kernel How fast is the convergence to the stationary distribution? For what k, can one have fw k π? Choose t to satisfy the required property.
48 Partitioning algorithm using the heat kernel Using the upper and lower bounds, a Cheeger inequality can be obtained : 2 2 κ S ΦS ΦS λs 8 8 where λ S is the Dirichlet eigenvalue of the Laplacian, and κ S is the minimum Cheeger ratio over sweeps by using heat kernel with seeds S for appropriate t.
49 Dirichlet eigenvalues for a subset S V λ S = inf f u v ( f( u) f( v)) w f( w) 2 d w 2 S V over all f satisfying the Dirichlet boundary condition: f() v = 0 for all v S.
50 Partitioning algorithm using the heat kernel Using the upper and lower bounds, a Cheeger inequality can be obtained : 2 2 κ S ΦS ΦS λs 8 8 where λ S is the Dirichlet eigenvalue of the Laplacian, and κ S is the minimum Cheeger ratio over sweeps by using heat kernel with seeds S for appropriate t.
51 Partitioning algorithm using the heat kernel Using the upper and lower bounds, a Cheeger inequality can be obtained : 2 2 κ S ΦS ΦS λs 8 8 where λ S is the Dirichlet eigenvalue of the Laplacian, and κ S is the minimum Cheeger ratio over sweeps by using heat kernel with seeds S for appropriate t.
52 Partitioning algorithm using the heat kernel Using the upper and lower bounds, a Cheeger inequality can be obtained : 2 2 κu Φu ΦS λs 8logs 8logs where λ S is the Dirichlet eigenvalue of the Laplacian, and κ u is the minimum Cheeger ratio over sweeps by using heat kernel with a random seed in S. The volume of the set of such u is > vol(s)/4.
53 Courtesy of Reid Andersen
54 Future directions: Use PageRank and the heat kernel xxpagerank to shed light on: The geometry of graphs? Solving combinatorial problems, such xxas covering, packing, matching, etc. Graph drawing, visualization Metric embedding
55 Courtesy of Reid Andersen
The heat kernel as the pagerank of a graph
Classification: Physical Sciences, Mathematics The heat kernel as the pagerank of a graph by Fan Chung Department of Mathematics University of California at San Diego La Jolla, CA 9093-011 Corresponding
More informationA local graph partitioning algorithm using heat kernel pagerank
A local graph partitioning algorithm using heat kernel pagerank Fan Chung University of California at an Diego, La Jolla CA 92093, UA, fan@ucsd.edu, http://www.math.ucsd.edu/~fan/ Abstract. We give an
More informationGraph Partitioning Using Random Walks
Graph Partitioning Using Random Walks A Convex Optimization Perspective Lorenzo Orecchia Computer Science Why Spectral Algorithms for Graph Problems in practice? Simple to implement Can exploit very efficient
More informationLOCAL CHEEGER INEQUALITIES AND SPARSE CUTS
LOCAL CHEEGER INEQUALITIES AND SPARSE CUTS CAELAN GARRETT 18.434 FINAL PROJECT CAELAN@MIT.EDU Abstract. When computing on massive graphs, it is not tractable to iterate over the full graph. Local graph
More informationDiffusion and random walks on graphs
Diffusion and random walks on graphs Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Structural
More informationLecture: Local Spectral Methods (2 of 4) 19 Computing spectral ranking with the push procedure
Stat260/CS294: Spectral Graph Methods Lecture 19-04/02/2015 Lecture: Local Spectral Methods (2 of 4) Lecturer: Michael Mahoney Scribe: Michael Mahoney Warning: these notes are still very rough. They provide
More informationDetecting Sharp Drops in PageRank and a Simplified Local Partitioning Algorithm
Detecting Sharp Drops in PageRank and a Simplified Local Partitioning Algorithm Reid Andersen and Fan Chung University of California at San Diego, La Jolla CA 92093, USA, fan@ucsd.edu, http://www.math.ucsd.edu/~fan/,
More informationAlgorithmic Primitives for Network Analysis: Through the Lens of the Laplacian Paradigm
Algorithmic Primitives for Network Analysis: Through the Lens of the Laplacian Paradigm Shang-Hua Teng Computer Science, Viterbi School of Engineering USC Massive Data and Massive Graphs 500 billions web
More informationSpectral Graph Theory and its Applications. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity
Spectral Graph Theory and its Applications Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity Outline Adjacency matrix and Laplacian Intuition, spectral graph drawing
More informationCutting Graphs, Personal PageRank and Spilling Paint
Graphs and Networks Lecture 11 Cutting Graphs, Personal PageRank and Spilling Paint Daniel A. Spielman October 3, 2013 11.1 Disclaimer These notes are not necessarily an accurate representation of what
More informationPersonal PageRank and Spilling Paint
Graphs and Networks Lecture 11 Personal PageRank and Spilling Paint Daniel A. Spielman October 7, 2010 11.1 Overview These lecture notes are not complete. The paint spilling metaphor is due to Berkhin
More informationRandom Walks and Evolving Sets: Faster Convergences and Limitations
Random Walks and Evolving Sets: Faster Convergences and Limitations Siu On Chan 1 Tsz Chiu Kwok 2 Lap Chi Lau 2 1 The Chinese University of Hong Kong 2 University of Waterloo Jan 18, 2017 1/23 Finding
More informationSpectral Graph Theory and You: Matrix Tree Theorem and Centrality Metrics
Spectral Graph Theory and You: and Centrality Metrics Jonathan Gootenberg March 11, 2013 1 / 19 Outline of Topics 1 Motivation Basics of Spectral Graph Theory Understanding the characteristic polynomial
More informationFacebook Friends! and Matrix Functions
Facebook Friends! and Matrix Functions! Graduate Research Day Joint with David F. Gleich, (Purdue), supported by" NSF CAREER 1149756-CCF Kyle Kloster! Purdue University! Network Analysis Use linear algebra
More informationSpectral and Electrical Graph Theory. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity
Spectral and Electrical Graph Theory Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity Outline Spectral Graph Theory: Understand graphs through eigenvectors and
More informationVectorized Laplacians for dealing with high-dimensional data sets
Vectorized Laplacians for dealing with high-dimensional data sets Fan Chung Graham University of California, San Diego The Power Grid Graph drawn by Derrick Bezanson 2010 Vectorized Laplacians ----- Graph
More informationA Local Spectral Method for Graphs: With Applications to Improving Graph Partitions and Exploring Data Graphs Locally
Journal of Machine Learning Research 13 (2012) 2339-2365 Submitted 11/12; Published 9/12 A Local Spectral Method for Graphs: With Applications to Improving Graph Partitions and Exploring Data Graphs Locally
More informationData Analysis and Manifold Learning Lecture 7: Spectral Clustering
Data Analysis and Manifold Learning Lecture 7: Spectral Clustering Radu Horaud INRIA Grenoble Rhone-Alpes, France Radu.Horaud@inrialpes.fr http://perception.inrialpes.fr/ Outline of Lecture 7 What is spectral
More informationLecture: Local Spectral Methods (3 of 4) 20 An optimization perspective on local spectral methods
Stat260/CS294: Spectral Graph Methods Lecture 20-04/07/205 Lecture: Local Spectral Methods (3 of 4) Lecturer: Michael Mahoney Scribe: Michael Mahoney Warning: these notes are still very rough. They provide
More informationMachine Learning for Data Science (CS4786) Lecture 11
Machine Learning for Data Science (CS4786) Lecture 11 Spectral clustering Course Webpage : http://www.cs.cornell.edu/courses/cs4786/2016sp/ ANNOUNCEMENT 1 Assignment P1 the Diagnostic assignment 1 will
More informationLecture 14: Random Walks, Local Graph Clustering, Linear Programming
CSE 521: Design and Analysis of Algorithms I Winter 2017 Lecture 14: Random Walks, Local Graph Clustering, Linear Programming Lecturer: Shayan Oveis Gharan 3/01/17 Scribe: Laura Vonessen Disclaimer: These
More informationData Analysis and Manifold Learning Lecture 3: Graphs, Graph Matrices, and Graph Embeddings
Data Analysis and Manifold Learning Lecture 3: Graphs, Graph Matrices, and Graph Embeddings Radu Horaud INRIA Grenoble Rhone-Alpes, France Radu.Horaud@inrialpes.fr http://perception.inrialpes.fr/ Outline
More informationGraph Partitioning Algorithms and Laplacian Eigenvalues
Graph Partitioning Algorithms and Laplacian Eigenvalues Luca Trevisan Stanford Based on work with Tsz Chiu Kwok, Lap Chi Lau, James Lee, Yin Tat Lee, and Shayan Oveis Gharan spectral graph theory Use linear
More informationSpectral Analysis of Random Walks
Graphs and Networks Lecture 9 Spectral Analysis of Random Walks Daniel A. Spielman September 26, 2013 9.1 Disclaimer These notes are not necessarily an accurate representation of what happened in class.
More informationSpectral Graph Theory Lecture 2. The Laplacian. Daniel A. Spielman September 4, x T M x. ψ i = arg min
Spectral Graph Theory Lecture 2 The Laplacian Daniel A. Spielman September 4, 2015 Disclaimer These notes are not necessarily an accurate representation of what happened in class. The notes written before
More informationLocally-biased analytics
Locally-biased analytics You have BIG data and want to analyze a small part of it: Solution 1: Cut out small part and use traditional methods Challenge: cutting out may be difficult a priori Solution 2:
More information1 Matrix notation and preliminaries from spectral graph theory
Graph clustering (or community detection or graph partitioning) is one of the most studied problems in network analysis. One reason for this is that there are a variety of ways to define a cluster or community.
More informationLecture 13: Spectral Graph Theory
CSE 521: Design and Analysis of Algorithms I Winter 2017 Lecture 13: Spectral Graph Theory Lecturer: Shayan Oveis Gharan 11/14/18 Disclaimer: These notes have not been subjected to the usual scrutiny reserved
More informationNetworks and Their Spectra
Networks and Their Spectra Victor Amelkin University of California, Santa Barbara Department of Computer Science victor@cs.ucsb.edu December 4, 2017 1 / 18 Introduction Networks (= graphs) are everywhere.
More informationLecture: Local Spectral Methods (1 of 4)
Stat260/CS294: Spectral Graph Methods Lecture 18-03/31/2015 Lecture: Local Spectral Methods (1 of 4) Lecturer: Michael Mahoney Scribe: Michael Mahoney Warning: these notes are still very rough. They provide
More informationAlgorithms, Graph Theory, and Linear Equa9ons in Laplacians. Daniel A. Spielman Yale University
Algorithms, Graph Theory, and Linear Equa9ons in Laplacians Daniel A. Spielman Yale University Shang- Hua Teng Carter Fussell TPS David Harbater UPenn Pavel Cur9s Todd Knoblock CTY Serge Lang 1927-2005
More informationEigenvalue Problems Computation and Applications
Eigenvalue ProblemsComputation and Applications p. 1/36 Eigenvalue Problems Computation and Applications Che-Rung Lee cherung@gmail.com National Tsing Hua University Eigenvalue ProblemsComputation and
More informationLab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018
Lab 8: Measuring Graph Centrality - PageRank Monday, November 5 CompSci 531, Fall 2018 Outline Measuring Graph Centrality: Motivation Random Walks, Markov Chains, and Stationarity Distributions Google
More informationAlgorithms, Graph Theory, and the Solu7on of Laplacian Linear Equa7ons. Daniel A. Spielman Yale University
Algorithms, Graph Theory, and the Solu7on of Laplacian Linear Equa7ons Daniel A. Spielman Yale University Rutgers, Dec 6, 2011 Outline Linear Systems in Laplacian Matrices What? Why? Classic ways to solve
More informationLaplacian and Random Walks on Graphs
Laplacian and Random Walks on Graphs Linyuan Lu University of South Carolina Selected Topics on Spectral Graph Theory (II) Nankai University, Tianjin, May 22, 2014 Five talks Selected Topics on Spectral
More informationCMPSCI 791BB: Advanced ML: Laplacian Learning
CMPSCI 791BB: Advanced ML: Laplacian Learning Sridhar Mahadevan Outline! Spectral graph operators! Combinatorial graph Laplacian! Normalized graph Laplacian! Random walks! Machine learning on graphs! Clustering!
More informationUsing Local Spectral Methods in Theory and in Practice
Using Local Spectral Methods in Theory and in Practice Michael W. Mahoney ICSI and Dept of Statistics, UC Berkeley ( For more info, see: http: // www. stat. berkeley. edu/ ~ mmahoney or Google on Michael
More informationA local algorithm for finding dense bipartite-like subgraphs
A local algorithm for finding dense bipartite-like subgraphs Pan Peng 1,2 1 State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences 2 School of Information Science
More informationHeat Kernel Based Community Detection
Heat Kernel Based Community Detection Joint with David F. Gleich, (Purdue), supported by" NSF CAREER 1149756-CCF Kyle Kloster! Purdue University! Local Community Detection Given seed(s) S in G, find a
More informationLink Analysis Ranking
Link Analysis Ranking How do search engines decide how to rank your query results? Guess why Google ranks the query results the way it does How would you do it? Naïve ranking of query results Given query
More informationUNIVERSITY OF CALIFORNIA, SAN DIEGO. Local and Distributed Computation for Large Graphs
UNIVERSITY OF CALIFORNIA, SAN DIEGO Local and Distributed Computation for Large Graphs A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Computer
More informationFinding central nodes in large networks
Finding central nodes in large networks Nelly Litvak University of Twente Eindhoven University of Technology, The Netherlands Woudschoten Conference 2017 Complex networks Networks: Internet, WWW, social
More informationParallel Local Graph Clustering
Parallel Local Graph Clustering Kimon Fountoulakis, joint work with J. Shun, X. Cheng, F. Roosta-Khorasani, M. Mahoney, D. Gleich University of California Berkeley and Purdue University Based on J. Shun,
More informationECEN 689 Special Topics in Data Science for Communications Networks
ECEN 689 Special Topics in Data Science for Communications Networks Nick Duffield Department of Electrical & Computer Engineering Texas A&M University Lecture 8 Random Walks, Matrices and PageRank Graphs
More informationLaplacian Matrices of Graphs: Spectral and Electrical Theory
Laplacian Matrices of Graphs: Spectral and Electrical Theory Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale University Toronto, Sep. 28, 2 Outline Introduction to graphs
More informationSpectral Graph Theory
Spectral raph Theory to appear in Handbook of Linear Algebra, second edition, CCR Press Steve Butler Fan Chung There are many different ways to associate a matrix with a graph (an introduction of which
More informationConvergence of Random Walks
Graphs and Networks Lecture 9 Convergence of Random Walks Daniel A. Spielman September 30, 010 9.1 Overview We begin by reviewing the basics of spectral theory. We then apply this theory to show that lazy
More informationThe Push Algorithm for Spectral Ranking
The Push Algorithm for Spectral Ranking Paolo Boldi Sebastiano Vigna March 8, 204 Abstract The push algorithm was proposed first by Jeh and Widom [6] in the context of personalized PageRank computations
More informationLecture 12: Introduction to Spectral Graph Theory, Cheeger s inequality
CSE 521: Design and Analysis of Algorithms I Spring 2016 Lecture 12: Introduction to Spectral Graph Theory, Cheeger s inequality Lecturer: Shayan Oveis Gharan May 4th Scribe: Gabriel Cadamuro Disclaimer:
More informationPageRank. Ryan Tibshirani /36-662: Data Mining. January Optional reading: ESL 14.10
PageRank Ryan Tibshirani 36-462/36-662: Data Mining January 24 2012 Optional reading: ESL 14.10 1 Information retrieval with the web Last time we learned about information retrieval. We learned how to
More informationConductance, the Normalized Laplacian, and Cheeger s Inequality
Spectral Graph Theory Lecture 6 Conductance, the Normalized Laplacian, and Cheeger s Inequality Daniel A. Spielman September 17, 2012 6.1 About these notes These notes are not necessarily an accurate representation
More informationSpectral Graph Theory
Spectral Graph Theory Aaron Mishtal April 27, 2016 1 / 36 Outline Overview Linear Algebra Primer History Theory Applications Open Problems Homework Problems References 2 / 36 Outline Overview Linear Algebra
More informationCS168: The Modern Algorithmic Toolbox Lectures #11 and #12: Spectral Graph Theory
CS168: The Modern Algorithmic Toolbox Lectures #11 and #12: Spectral Graph Theory Tim Roughgarden & Gregory Valiant May 2, 2016 Spectral graph theory is the powerful and beautiful theory that arises from
More informationCS6220: DATA MINING TECHNIQUES
CS6220: DATA MINING TECHNIQUES Mining Graph/Network Data Instructor: Yizhou Sun yzsun@ccs.neu.edu March 16, 2016 Methods to Learn Classification Clustering Frequent Pattern Mining Matrix Data Decision
More informationCertifying the Global Optimality of Graph Cuts via Semidefinite Programming: A Theoretic Guarantee for Spectral Clustering
Certifying the Global Optimality of Graph Cuts via Semidefinite Programming: A Theoretic Guarantee for Spectral Clustering Shuyang Ling Courant Institute of Mathematical Sciences, NYU Aug 13, 2018 Joint
More informationEugene Wigner [4]: in the natural sciences, Communications in Pure and Applied Mathematics, XIII, (1960), 1 14.
Introduction Eugene Wigner [4]: The miracle of the appropriateness of the language of mathematics for the formulation of the laws of physics is a wonderful gift which we neither understand nor deserve.
More informationRegularized Laplacian Estimation and Fast Eigenvector Approximation
Regularized Laplacian Estimation and Fast Eigenvector Approximation Patrick O. Perry Information, Operations, and Management Sciences NYU Stern School of Business New York, NY 1001 pperry@stern.nyu.edu
More informationLinear algebra and applications to graphs Part 1
Linear algebra and applications to graphs Part 1 Written up by Mikhail Belkin and Moon Duchin Instructor: Laszlo Babai June 17, 2001 1 Basic Linear Algebra Exercise 1.1 Let V and W be linear subspaces
More informationData Analysis and Manifold Learning Lecture 9: Diffusion on Manifolds and on Graphs
Data Analysis and Manifold Learning Lecture 9: Diffusion on Manifolds and on Graphs Radu Horaud INRIA Grenoble Rhone-Alpes, France Radu.Horaud@inrialpes.fr http://perception.inrialpes.fr/ Outline of Lecture
More information0.1 Naive formulation of PageRank
PageRank is a ranking system designed to find the best pages on the web. A webpage is considered good if it is endorsed (i.e. linked to) by other good webpages. The more webpages link to it, and the more
More informationOnline Social Networks and Media. Link Analysis and Web Search
Online Social Networks and Media Link Analysis and Web Search How to Organize the Web First try: Human curated Web directories Yahoo, DMOZ, LookSmart How to organize the web Second try: Web Search Information
More information1998: enter Link Analysis
1998: enter Link Analysis uses hyperlink structure to focus the relevant set combine traditional IR score with popularity score Page and Brin 1998 Kleinberg Web Information Retrieval IR before the Web
More informationeigenvalues, markov matrices, and the power method
eigenvalues, markov matrices, and the power method Slides by Olson. Some taken loosely from Jeff Jauregui, Some from Semeraro L. Olson Department of Computer Science University of Illinois at Urbana-Champaign
More informationMulti-commodity allocation for dynamic demands using PageRank vectors
Multi-commodity allocation for dynamic demands using PageRank vectors Fan Chung, Paul Horn, and Jacob Hughes University of California, San Diego Harvard University Abstract. We consider a variant of the
More informationLocal properties of PageRank and graph limits. Nelly Litvak University of Twente Eindhoven University of Technology, The Netherlands MIPT 2018
Local properties of PageRank and graph limits Nelly Litvak University of Twente Eindhoven University of Technology, The Netherlands MIPT 2018 Centrality in networks Network as a graph G = (V, E) Centrality
More informationSpectral radius, symmetric and positive matrices
Spectral radius, symmetric and positive matrices Zdeněk Dvořák April 28, 2016 1 Spectral radius Definition 1. The spectral radius of a square matrix A is ρ(a) = max{ λ : λ is an eigenvalue of A}. For an
More informationAnalytically tractable processes on networks
University of California San Diego CERTH, 25 May 2011 Outline Motivation 1 Motivation Networks Random walk and Consensus Epidemic models Spreading processes on networks 2 Networks Motivation Networks Random
More informationToday. Next lecture. (Ch 14) Markov chains and hidden Markov models
Today (Ch 14) Markov chains and hidden Markov models Graphical representation Transition probability matrix Propagating state distributions The stationary distribution Next lecture (Ch 14) Markov chains
More informationBeyond Scalar Affinities for Network Analysis or Vector Diffusion Maps and the Connection Laplacian
Beyond Scalar Affinities for Network Analysis or Vector Diffusion Maps and the Connection Laplacian Amit Singer Princeton University Department of Mathematics and Program in Applied and Computational Mathematics
More informationCombinatorics of optimal designs
Combinatorics of optimal designs R A Bailey and Peter J Cameron pjcameron@qmulacuk British Combinatorial Conference, St Andrews, July 2009 Mathematicians and statisticians There is a very famous joke about
More informationAn Algorithmist s Toolkit September 10, Lecture 1
18.409 An Algorithmist s Toolkit September 10, 2009 Lecture 1 Lecturer: Jonathan Kelner Scribe: Jesse Geneson (2009) 1 Overview The class s goals, requirements, and policies were introduced, and topics
More informationIntroducing the Laplacian
Introducing the Laplacian Prepared by: Steven Butler October 3, 005 A look ahead to weighted edges To this point we have looked at random walks where at each vertex we have equal probability of moving
More informationMining of Massive Datasets Jure Leskovec, AnandRajaraman, Jeff Ullman Stanford University
Note to other teachers and users of these slides: We would be delighted if you found this our material useful in giving your own lectures. Feel free to use these slides verbatim, or to modify them to fit
More informationSpectral Clustering. Spectral Clustering? Two Moons Data. Spectral Clustering Algorithm: Bipartioning. Spectral methods
Spectral Clustering Seungjin Choi Department of Computer Science POSTECH, Korea seungjin@postech.ac.kr 1 Spectral methods Spectral Clustering? Methods using eigenvectors of some matrices Involve eigen-decomposition
More informationeigenvalue bounds and metric uniformization Punya Biswal & James R. Lee University of Washington Satish Rao U. C. Berkeley
eigenvalue bounds and metric uniformization Punya Biswal & James R. Lee University of Washington Satish Rao U. C. Berkeley separators in planar graphs Lipton and Tarjan (1980) showed that every n-vertex
More informationCS6220: DATA MINING TECHNIQUES
CS6220: DATA MINING TECHNIQUES Mining Graph/Network Data Instructor: Yizhou Sun yzsun@ccs.neu.edu November 16, 2015 Methods to Learn Classification Clustering Frequent Pattern Mining Matrix Data Decision
More informationMarkov Chains, Random Walks on Graphs, and the Laplacian
Markov Chains, Random Walks on Graphs, and the Laplacian CMPSCI 791BB: Advanced ML Sridhar Mahadevan Random Walks! There is significant interest in the problem of random walks! Markov chain analysis! Computer
More informationSpectral Graph Theory Lecture 3. Fundamental Graphs. Daniel A. Spielman September 5, 2018
Spectral Graph Theory Lecture 3 Fundamental Graphs Daniel A. Spielman September 5, 2018 3.1 Overview We will bound and derive the eigenvalues of the Laplacian matrices of some fundamental graphs, including
More informationSolving Local Linear Systems with Boundary Conditions Using Heat Kernel Pagerank
olving Local Linear ystems with Boundary Conditions Using Heat Kernel Pagerank Fan Chung and Olivia impson Department of Computer cience and Engineering, University of California, an Diego La Jolla, CA
More informationData Mining Recitation Notes Week 3
Data Mining Recitation Notes Week 3 Jack Rae January 28, 2013 1 Information Retrieval Given a set of documents, pull the (k) most similar document(s) to a given query. 1.1 Setup Say we have D documents
More informationMAE 298, Lecture 8 Feb 4, Web search and decentralized search on small-worlds
MAE 298, Lecture 8 Feb 4, 2008 Web search and decentralized search on small-worlds Search for information Assume some resource of interest is stored at the vertices of a network: Web pages Files in a file-sharing
More informationClustering compiled by Alvin Wan from Professor Benjamin Recht s lecture, Samaneh s discussion
Clustering compiled by Alvin Wan from Professor Benjamin Recht s lecture, Samaneh s discussion 1 Overview With clustering, we have several key motivations: archetypes (factor analysis) segmentation hierarchy
More informationIntroduction to Search Engine Technology Introduction to Link Structure Analysis. Ronny Lempel Yahoo Labs, Haifa
Introduction to Search Engine Technology Introduction to Link Structure Analysis Ronny Lempel Yahoo Labs, Haifa Outline Anchor-text indexing Mathematical Background Motivation for link structure analysis
More informationGraph Models The PageRank Algorithm
Graph Models The PageRank Algorithm Anna-Karin Tornberg Mathematical Models, Analysis and Simulation Fall semester, 2013 The PageRank Algorithm I Invented by Larry Page and Sergey Brin around 1998 and
More informationMathematical Properties & Analysis of Google s PageRank
Mathematical Properties & Analysis of Google s PageRank Ilse Ipsen North Carolina State University, USA Joint work with Rebecca M. Wills Cedya p.1 PageRank An objective measure of the citation importance
More informationCS246: Mining Massive Datasets Jure Leskovec, Stanford University
CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu 2/7/2012 Jure Leskovec, Stanford C246: Mining Massive Datasets 2 Web pages are not equally important www.joe-schmoe.com
More informationPSRGs via Random Walks on Graphs
Spectral Graph Theory Lecture 11 PSRGs via Random Walks on Graphs Daniel A. Spielman October 3, 2012 11.1 Overview There has been a lot of work on the design of Pseudo-Random Number Generators (PSRGs)
More informationTesting Cluster Structure of Graphs. Artur Czumaj
Testing Cluster Structure of Graphs Artur Czumaj DIMAP and Department of Computer Science University of Warwick Joint work with Pan Peng and Christian Sohler (TU Dortmund) Dealing with BigData in Graphs
More informationCS249: ADVANCED DATA MINING
CS249: ADVANCED DATA MINING Graph and Network Instructor: Yizhou Sun yzsun@cs.ucla.edu May 31, 2017 Methods Learnt Classification Clustering Vector Data Text Data Recommender System Decision Tree; Naïve
More informationNetwork Essence: PageRank Completion and Centrality-Conforming Markov Chains
Network Essence: PageRank Completion and Centrality-Conforming Markov Chains Shang-Hua Teng Computer Science and Mathematics USC November 29, 2016 Abstract Jiří Matoušek (1963-2015) had many breakthrough
More informationKristina Lerman USC Information Sciences Institute
Rethinking Network Structure Kristina Lerman USC Information Sciences Institute Università della Svizzera Italiana, December 16, 2011 Measuring network structure Central nodes Community structure Strength
More informationarxiv: v1 [math.co] 27 Jul 2012
Harnack inequalities for graphs with non-negative Ricci curvature arxiv:1207.6612v1 [math.co] 27 Jul 2012 Fan Chung University of California, San Diego S.-T. Yau Harvard University May 2, 2014 Abstract
More informationSlide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University.
Slide source: Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University http://www.mmds.org #1: C4.5 Decision Tree - Classification (61 votes) #2: K-Means - Clustering
More informationDirichlet PageRank and Ranking Algorithms Based on Trust and Distrust
Dirichlet PageRank and Ranking Algorithms Based on Trust and Distrust Fan Chung, Alexander Tsiatas, and Wensong Xu Department of Computer Science and Engineering University of California, San Diego {fan,atsiatas,w4xu}@cs.ucsd.edu
More informationSTA141C: Big Data & High Performance Statistical Computing
STA141C: Big Data & High Performance Statistical Computing Lecture 12: Graph Clustering Cho-Jui Hsieh UC Davis May 29, 2018 Graph Clustering Given a graph G = (V, E, W ) V : nodes {v 1,, v n } E: edges
More informationAlgebraic Representation of Networks
Algebraic Representation of Networks 0 1 2 1 1 0 0 1 2 0 0 1 1 1 1 1 Hiroki Sayama sayama@binghamton.edu Describing networks with matrices (1) Adjacency matrix A matrix with rows and columns labeled by
More informationDATA MINING LECTURE 13. Link Analysis Ranking PageRank -- Random walks HITS
DATA MINING LECTURE 3 Link Analysis Ranking PageRank -- Random walks HITS How to organize the web First try: Manually curated Web Directories How to organize the web Second try: Web Search Information
More informationCommunities Via Laplacian Matrices. Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices
Communities Via Laplacian Matrices Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices The Laplacian Approach As with betweenness approach, we want to divide a social graph into
More informationFinding normalized and modularity cuts by spectral clustering. Ljubjana 2010, October
Finding normalized and modularity cuts by spectral clustering Marianna Bolla Institute of Mathematics Budapest University of Technology and Economics marib@math.bme.hu Ljubjana 2010, October Outline Find
More informationRanking and sparsifying a connection graph
Ranking and sparsifying a connection graph Fan Chung 1, and Wenbo Zhao 1 Department of Mathematics Department of Computer Science and Engineering University of California, San Diego La Jolla, CA 9093,
More information