Spectral Graph Theory and its Applications. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity

Similar documents
Spectral and Electrical Graph Theory. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity

Laplacian Matrices of Graphs: Spectral and Electrical Theory

Data Analysis and Manifold Learning Lecture 7: Spectral Clustering

Algorithmic Primitives for Network Analysis: Through the Lens of the Laplacian Paradigm

Graphs, Vectors, and Matrices Daniel A. Spielman Yale University. AMS Josiah Willard Gibbs Lecture January 6, 2016

Graph Partitioning Algorithms and Laplacian Eigenvalues

Lecture 1. 1 if (i, j) E a i,j = 0 otherwise, l i,j = d i if i = j, and

Dissertation Defense

Data Analysis and Manifold Learning Lecture 3: Graphs, Graph Matrices, and Graph Embeddings

Diffusion and random walks on graphs

Lecture 13: Spectral Graph Theory

An Algorithmist s Toolkit September 10, Lecture 1

Spectral Graph Theory Lecture 2. The Laplacian. Daniel A. Spielman September 4, x T M x. ψ i = arg min

Spectra of Adjacency and Laplacian Matrices

Algorithms, Graph Theory, and the Solu7on of Laplacian Linear Equa7ons. Daniel A. Spielman Yale University

1 Matrix notation and preliminaries from spectral graph theory

Graph Partitioning Using Random Walks

Four graph partitioning algorithms. Fan Chung University of California, San Diego

Algebraic Representation of Networks

Bruce Hendrickson Discrete Algorithms & Math Dept. Sandia National Labs Albuquerque, New Mexico Also, CS Department, UNM

Lecture: Local Spectral Methods (1 of 4)

CS168: The Modern Algorithmic Toolbox Lectures #11 and #12: Spectral Graph Theory

Clustering Algorithms for Random and Pseudo-random Structures

Lecture 14: Random Walks, Local Graph Clustering, Linear Programming

Introduction. 1.1 First Things. 1.2 Introduction. 1.3 Mechanics. Spectral Graph Theory Lecture 1. Daniel A. Spielman August 29, 2018

Introduction to Spectral Graph Theory and Graph Clustering

Lecture 24: Element-wise Sampling of Graphs and Linear Equation Solving. 22 Element-wise Sampling of Graphs and Linear Equation Solving

SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices)

1 Matrix notation and preliminaries from spectral graph theory

Finding normalized and modularity cuts by spectral clustering. Ljubjana 2010, October

Linear algebra and applications to graphs Part 1

Graph Sparsification I : Effective Resistance Sampling

Fiedler s Theorems on Nodal Domains

Certifying the Global Optimality of Graph Cuts via Semidefinite Programming: A Theoretic Guarantee for Spectral Clustering

Algorithms, Graph Theory, and Linear Equa9ons in Laplacians. Daniel A. Spielman Yale University

Lecture Introduction. 2 Brief Recap of Lecture 10. CS-621 Theory Gems October 24, 2012

Lecture 14: SVD, Power method, and Planted Graph problems (+ eigenvalues of random matrices) Lecturer: Sanjeev Arora

Mining of Massive Datasets Jure Leskovec, AnandRajaraman, Jeff Ullman Stanford University

Lecture 12: Introduction to Spectral Graph Theory, Cheeger s inequality

Networks and Their Spectra

Lecture 12 : Graph Laplacians and Cheeger s Inequality

Spectral Graph Theory Lecture 3. Fundamental Graphs. Daniel A. Spielman September 5, 2018

Data Mining and Analysis: Fundamental Concepts and Algorithms

Spectral Clustering on Handwritten Digits Database

Markov Chains and Spectral Clustering

Cutting Graphs, Personal PageRank and Spilling Paint

Spectral Algorithms I. Slides based on Spectral Mesh Processing Siggraph 2010 course

Fitting a Graph to Vector Data. Samuel I. Daitch (Yale) Jonathan A. Kelner (MIT) Daniel A. Spielman (Yale)

Fiedler s Theorems on Nodal Domains

Markov Chains, Random Walks on Graphs, and the Laplacian

Spectral clustering. Two ideal clusters, with two points each. Spectral clustering algorithms

Lecture: Local Spectral Methods (2 of 4) 19 Computing spectral ranking with the push procedure

Spectral Theory of Unsigned and Signed Graphs Applications to Graph Clustering. Some Slides

Network Embedding as Matrix Factorization: Unifying DeepWalk, LINE, PTE, and node2vec

Facebook Friends! and Matrix Functions

ORIE 4741: Learning with Big Messy Data. Spectral Graph Theory

Graph Sparsifiers: A Survey

THE HIDDEN CONVEXITY OF SPECTRAL CLUSTERING

EECS 275 Matrix Computation

Spectral Clustering. Guokun Lai 2016/10

Rings, Paths, and Paley Graphs

ORIE 6334 Spectral Graph Theory September 22, Lecture 11

Spectral Analysis of Random Walks

A New Spectral Technique Using Normalized Adjacency Matrices for Graph Matching 1

Testing Cluster Structure of Graphs. Artur Czumaj

ECS231: Spectral Partitioning. Based on Berkeley s CS267 lecture on graph partition

Random Matrices: Invertibility, Structure, and Applications

Eugene Wigner [4]: in the natural sciences, Communications in Pure and Applied Mathematics, XIII, (1960), 1 14.

Contents. Preface for the Instructor. Preface for the Student. xvii. Acknowledgments. 1 Vector Spaces 1 1.A R n and C n 2

Spectral Processing. Misha Kazhdan

Embedding and Spectrum of Graphs

Spectral Generative Models for Graphs

Specral Graph Theory and its Applications September 7, Lecture 2 L G1,2 = 1 1.

Locally-biased analytics

Algebraic Constructions of Graphs

Rings, Paths, and Cayley Graphs

EIGENVECTOR NORMS MATTER IN SPECTRAL GRAPH THEORY

Lecture: Local Spectral Methods (3 of 4) 20 An optimization perspective on local spectral methods

U.C. Berkeley Better-than-Worst-Case Analysis Handout 3 Luca Trevisan May 24, 2018

Lab 8: Measuring Graph Centrality - PageRank. Monday, November 5 CompSci 531, Fall 2018

A Statistical Look at Spectral Graph Analysis. Deep Mukhopadhyay

A sequence of triangle-free pseudorandom graphs

Lecture 10. Lecturer: Aleksander Mądry Scribes: Mani Bastani Parizi and Christos Kalaitzis

Spectral Partitioning of Random Graphs

Random Walks on Graphs. One Concrete Example of a random walk Motivation applications

Towards Practically-Efficient Spectral Sparsification of Graphs. Zhuo Feng

CMPSCI 791BB: Advanced ML: Laplacian Learning

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Spectral Partitiong in a Stochastic Block Model

Lecture 13 Spectral Graph Algorithms

Graph Clustering Algorithms

Lecture 8: Spectral Graph Theory III

Spectral Techniques for Clustering

Conductance, the Normalized Laplacian, and Cheeger s Inequality

Beyond Scalar Affinities for Network Analysis or Vector Diffusion Maps and the Connection Laplacian

Math 4153 Exam 3 Review. The syllabus for Exam 3 is Chapter 6 (pages ), Chapter 7 through page 137, and Chapter 8 through page 182 in Axler.

Spectral Clustering. Zitao Liu

Ma/CS 6b Class 23: Eigenvalues in Regular Graphs

Spectral Theory of Unsigned and Signed Graphs Applications to Graph Clustering: a Survey

ON THE QUALITY OF SPECTRAL SEPARATORS

Transcription:

Spectral Graph Theory and its Applications Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity

Outline Adjacency matrix and Laplacian Intuition, spectral graph drawing Physical intuition Isomorphism testing Random walks Graph Partitioning and clustering Distributions of eigenvalues and compression Computation

What I m Skipping Matrix-tree theorem. Most of algebraic graph theory. Special graphs (e.g. Cayley graphs). Connections to codes and designs. Lots of work by theorists. Expanders.

The Adjacency Matrix 1 2 3 4 λ is eigenvalue and v is eigenvector if Think of, or even better Symmetric -> n real eigenvalues and real eigenvectors form orthonormal basis

Example -1-0.618 0.618 1 1 2 3 4

Example -1-0.618 0.618 1 1 2 3 4-0.618 0.382 1

Example: invariant under re-labeling -1-0.618 0.618 1 1 2 3 4

Example: invariant under re-labeling -1-0.618 0.618 1 2 1 3 4

Operators and Quadratic Forms View of A as an operator: View of A as quadratic form: if and then

Laplacian: natural quadratic form on graphs where D is diagonal matrix of degrees 1 2 3 4

Laplacian: fast facts so, zero is an eigenvalue If k connected components, Fiedler ( 73) called algebraic connectivity of a graph The further from 0, the more connected.

Embedding graph in line (Hall 70) map minimize trivial solution: So, require Solution Atkins, Boman, Hendrickson 97: Gives correct embedding for graphs like

Courant-Fischer definition of eigvals/vecs (here )

Embedding graph in plane (Hall 70) map minimize trivial solution: So, require degenerate solution: Also require Solution up to rotation

A Graph

Drawing of the graph using v 2, v 3 Plot vertex i at

Spectral drawing of Streets in Rome

Spectral drawing of Erdos graph: edge between co-authors of papers

Dodecahedron Best embedded by first three eigenvectors

Spectral graph drawing: Tutte justification Condition for eigenvector Gives for all i λ small says x(i) near average of neighbors Tutte 63: If fix outside face, and let every other vertex be average of neighbors, get planar embedding of planar graph.

Tutte 63 embedding of a graph. Fix outside face. Edges -> springs. Vertex at center of mass of nbrs. 3-connected -> get planar embedding

Fundamental modes: string with fixed ends

Fundamental modes: string with free ends

Eigenvectors of path graph 1: 2: 3: 4: 17:

Drawing of the graph using v 3, v 4 0.06 0.04 0.02 0-0.02-0.04-0.06-0.06-0.04-0.02 0 0.02 0.04 0.06 Plot vertex i at

Spectral graph coloring from high eigenvectors Embedding of dodecahedron by 19 th and 20 th eigvecs. Coloring 3-colorable random graphs [Alon-Kahale 97]

Spectral graph drawing: FEM justification If apply finite element method to solve Laplace s equation in the plane with a Delaunay triangulation Would get graph Laplacian, but with some weights on edges Fundamental solutions are x and y coordinates (see Strang s Introduction to Applied Mathematics)

Isomorphism testing 1. different eigenvalues -> non-isomorphic 2. If each vertex distinct in spectral embedding, just need to line up embeddings. Each eigvec determined up to sign.

Isomorphism testing λ 2 = λ 3, eigvecs determined up to rotation

Isomorphism testing Distinguish by norm in embedding λ 2 = λ 3, eigvecs determined up to rotation

Isomorphism testing: difficulties 1. Many vertices can map to same place in spectral embedding, if only use few eigenvectors. 2. If λ i has a high dimensional space, eigvecs only determined up to basis rotations Ex.: Strongly regular graphs with only 3 eigenvalues, of multiplicities 1, (n-1)/2 and (n-1)/2 3. Some pairs have an exponential number of isomorphisms.

Isomorphism testing: success [Babai-Grigoryev-Mount 82] If each eigenvalue has multiplicity O(1), can test in polynomial time. Ideas: Partition vertices into classes by norms in embeddings. Refine partitions using other partitions. Use vertex classes to split eigenspaces. Use computational group theory to fuse information, and produce description of all isomorphisms.

Random Walks

Random walks and PageRank Adjacency matrix of directed graph: Walk transition matrix: Walk distribution at time t: PageRank vector p: Eigenvector of Eigenvalue 1

Random walks and PageRank PageRank vector p: Linear algebra issues: W is not symmetric, not similar to symmetric, does not necessarily have n eigenvalues If no nodes of out-degree 0, Perron-Frobenius Theorem: Guarantees a unique, positive eigevec p of eigenvalue 1. Is there a theoretically interesting spectral theory?

Kleinberg and the singular vectors Consider eigenvectors of largest eigenvalue of and Are left and right singular values of A. Always exist. Usually, a more useful theory than eigenvectors, when not symmetric. (see Strang s Intro. to Linear Algebra)

Random walks on Undirected Graphs Trivial PageRank Vector: Not symmetric, but similiar to symmetrized walk matrix W and S have same eigvals,

Random walk converges at rate 1/1-λ n-1 For lazy random walk (stay put with prob ½): Where π is the stable distribution For symmetric S

Normalized Laplacian [Chung] If consider 1-λ n-1 should look at Relationship to cuts:

Cheeger s Inequality (Jerrum-Sinclair 89) (Alon-Milman 85, Diaconis-Stroock 91)

Cheeger s Inequality (Jerrum-Sinclair 89) (Alon-Milman 85, Diaconis-Stroock 91) Can find the cut by looking at for some t

Only need approximate eigenvector (Mihail 89) Can find the cut by looking at for some t Guarantee Lanczos era.

Normalized Cut Alternative definition of conductance [Lovasz 96 (?)] This way, is a relaxation [see Hagen-Kahng 92]. Equivalent to Normalized Cut [Shi-Malik 00]

Spectral Image Segmentation (Shi-Malik 00) 100 200 300 400 500 600 700 800 900 200 400 600 800 1000 1200

Spectral Image Segmentation (Shi-Malik 00) 2 4 6 8 10 12 2 4 6 8 10 12 14 16

Spectral Image Segmentation (Shi-Malik 00) 2 4 6 8 10 12 2 4 6 8 10 12 14 16

Spectral Image Segmentation (Shi-Malik 00) 2 4 6 8 10 12 2 4 6 8 10 12 14 16

Spectral Image Segmentation (Shi-Malik 00) 2 4 6 8 10 12 2 4 6 8 10 12 14 16 edge weight

The second eigenvector 50 100 150 200 50 100 150 200 250 300

Second Eigenvector s sparsest cut 50 100 150 200 50 100 150 200 250 300

Third Eigenvector 50 50 100 100 150 150 200 200 50 100 150 200 250 300 50 100 150 200 250 300

Fourth Eigenvector 50 50 100 100 150 150 200 200 50 100 150 200 250 300 50 100 150 200 250 300

Perspective on Spectral Image Segmentation Ignoring a lot we know about images. On non-image data, gives good intuition. Can we fuse with what we know about images? Generally, can we fuse with other knowledge? What about better cut algorithms?

Improvement by Miller and Tolliver 06 50 50 100 100 150 150 200 200 50 100 150 200 250 300 50 100 150 200 250 300

Improvement by Miller and Tolliver 06 Idea: re-weight (i,j) by Actually, re-weight by Prove: as iterate λ 2 > 0, get 2 components

One approach to fusing: Dirichlet Eigenvalues Fixing boundary values to zero [Chung-Langlands 96] Fixing boundary values to non-zero. [Grady 06] Dominant mode by solving linear equation: computing electrical flow in resistor network

Analysis of Spectral Partitioning Finite Element Meshes (eigvals right) [S-Teng 07] Planted partitions (eigvecs right) [McSherry 01] p q } A Prob A-A edge = p B-B edge = p A-B edge = q q p } B q < p } } A B

Other planted problems Finding cn 1/2 clique in random graph [Alon-Krivelevich-Sudakov 98] Color random sparse k-colorable graph [Alon-Kahale 97] Asymmetric block structure (LSI and HITS) [Azar-Fiat-Karlin-McSherry-Saia 01] Partitioning with widely varying degrees [Dasgupta-Hopcroft-McSherry 04]

Planted problem analysis Sampled A as perturbation of p q q p Small perturbations don t change eigenvalues too much. Eigenvectors stable too, if well-separated from others. Understand eigenvalues of random matrices [Furedi-Komlos 81, Alon-Krivelevich-Vu 01, Vu 05]

Distribution of eigenvalues of Random Graphs Histogram of eigvals of random 40-by-40 adjacency matrices 12000 10000 8000 6000 4000 2000 0-10 -5 0 5 10 15 20 25

Distribution of eigenvalues of Random Graphs Histogram of eigvals of random 40-by-40 adjacency matrices 12000 10000 8000 6000 4000 2000 0-10 -5 0 5 10 15 20 25 Predicted curve: Wigner s Semi-Circle Law

Eigenvalue distributions Eigenvalues of walk matrix of 50-by-50 grid graph 1 0.8 0.6 0.4 0.2 0-0.2-0.4-0.6-0.8-1 0 500 1000 1500 2000 2500 Number greater than 1-ε proportional to ε

Eigenvalue distributions Eigenvalues of walk matrix of 50-by-50 grid graph 1 0.8 0.6 0.4 0.2 All these go to zero when take powers 0-0.2-0.4-0.6-0.8-1 0 500 1000 1500 2000 2500 Number greater than 1-ε proportional to ε

Compression of powers of graphs [Coifman, Lafon, Lee, Maggioni, Nadler, Warner, Zucker 05] If most eigenvalues of A and W bounded from 1. Most eigenvalues of A t very small. Can approximate A t by low-rank matrix. Build wavelets bases on graphs. Solve linear equations and compute eigenvectors. Make rigorous by taking graph from discretization of manifold

Discretizing Manifold edge weight

Eigenvalue distributions Eigenvalues of path graph on 10k nodes 1 0.8 0.6 0.4 0.2 0-0.2-0.4-0.6-0.8-1 0 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 Number greater than 1-ε proportional to

Theorem: Eigenvalue distributions Theorem: If bounded degree, number eigenvalues greater than 1-ε is Proof: can choose vertices to collapse so that conductance becomes at least (like adding an expander on those nodes). New graph has all eigvals at most 1-ε in abs value. Is rank change, so by Courant-Fischer

Eigenvalue distributions of planar graphs? For planar graphs, Colan de Verdiere s results imply How big must the gap be? Must other gaps exist?

Computation If λ exactly an eigenvalue, eigvecs = Null(A λi) Not rational, so only approximate If λ close to just one eigenvalue λ i If λ i close to λ i+1 is like λ i = λ i+1 v i and v i+1 can rotate with each other

General Symmetric Matrices Locate any eigval in time 1. Orthogonal similarity transform to tri-diagonal in time by elimination algorithm. 2. Given tri-diagonal matrix, count number eigenvalues in any interval in time 3. Do binary search to locate eigenvalue Locate eigenvector: steps on tri-diagonal, time to map back to A

Largest eigenvectors by power method Apply A to random vector r: In iters, expect x such that Using Lanczos, expect iters (better polynomial)

Smallest eigenvectors by inverse power method Apply L -1 to random vector r orthogonal to In iters, expect x such that Compute in time [STeng04] if planar, in time [Koutis-Miller 06]

Sparsification Key to fast computation. Replace A by sparse B for which Generalized eigenvalues provide notion of approximation in graphs.

Questions Cheeger s inequality for other physical problems? How to incorporate other data into spectral methods? Make multilevel coarsening rigorous. What can we do with boundary conditions? What about generalized eigenvalue problems?