ORIE 4741: Learning with Big Messy Data. Spectral Graph Theory

Similar documents
CS168: The Modern Algorithmic Toolbox Lectures #11 and #12: Spectral Graph Theory

Data Analysis and Manifold Learning Lecture 3: Graphs, Graph Matrices, and Graph Embeddings

An Algorithmist s Toolkit September 10, Lecture 1

Lecture 13: Spectral Graph Theory

1 Matrix notation and preliminaries from spectral graph theory

Spectral Graph Theory Lecture 2. The Laplacian. Daniel A. Spielman September 4, x T M x. ψ i = arg min

Spectra of Adjacency and Laplacian Matrices

This section is an introduction to the basic themes of the course.

Markov Chains, Random Walks on Graphs, and the Laplacian

Lecture 1: From Data to Graphs, Weighted Graphs and Graph Laplacian

ORIE 6334 Spectral Graph Theory September 22, Lecture 11

Graph fundamentals. Matrices associated with a graph

Rings, Paths, and Paley Graphs

Communities Via Laplacian Matrices. Degree, Adjacency, and Laplacian Matrices Eigenvectors of Laplacian Matrices

ORIE 6334 Spectral Graph Theory September 8, Lecture 6. In order to do the first proof, we need to use the following fact.

Lecture 1: Graphs, Adjacency Matrices, Graph Laplacian

1 Matrix notation and preliminaries from spectral graph theory

Lecture 14: Random Walks, Local Graph Clustering, Linear Programming

Spectral Graph Theory and You: Matrix Tree Theorem and Centrality Metrics

Markov Chains and Spectral Clustering

Lecture 13 Spectral Graph Algorithms

Data Analysis and Manifold Learning Lecture 7: Spectral Clustering

Spectral Clustering. Zitao Liu

18.312: Algebraic Combinatorics Lionel Levine. Lecture 19

Fiedler s Theorems on Nodal Domains

A New Space for Comparing Graphs

Algebraic Constructions of Graphs

Diffusion and random walks on graphs

Spectral Graph Theory and its Applications. Daniel A. Spielman Dept. of Computer Science Program in Applied Mathematics Yale Unviersity

Graphs, Vectors, and Matrices Daniel A. Spielman Yale University. AMS Josiah Willard Gibbs Lecture January 6, 2016

Spectral Clustering. Guokun Lai 2016/10

Rings, Paths, and Cayley Graphs

Permutations and Combinations

ORIE 6334 Spectral Graph Theory October 13, Lecture 15

Lecture 1. 1 if (i, j) E a i,j = 0 otherwise, l i,j = d i if i = j, and

Linear algebra and applications to graphs Part 1

Communities, Spectral Clustering, and Random Walks

U.C. Berkeley CS294: Spectral Methods and Expanders Handout 11 Luca Trevisan February 29, 2016

Lecture 7. 1 Normalized Adjacency and Laplacian Matrices

MATH 829: Introduction to Data Mining and Analysis Clustering II

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

An Introduction to Spectral Graph Theory

Strongly Regular Graphs, part 1

Math 304 Handout: Linear algebra, graphs, and networks.

Spectral Graph Theory

Lecture 12 : Graph Laplacians and Cheeger s Inequality

CSI 445/660 Part 6 (Centrality Measures for Networks) 6 1 / 68

Networks and Their Spectra

Class President: A Network Approach to Popularity. Due July 18, 2014

Convergence of Random Walks

1 T 1 = where 1 is the all-ones vector. For the upper bound, let v 1 be the eigenvector corresponding. u:(u,v) E v 1(u)

Mining of Massive Datasets Jure Leskovec, AnandRajaraman, Jeff Ullman Stanford University

Fiedler s Theorems on Nodal Domains

Lecture: Modeling graphs with electrical networks

Effective Resistance and Schur Complements

Lecture 10. Lecturer: Aleksander Mądry Scribes: Mani Bastani Parizi and Christos Kalaitzis

Laplacian Matrices of Graphs: Spectral and Electrical Theory

Spectral Theory of Unsigned and Signed Graphs Applications to Graph Clustering. Some Slides

Market Connectivity. Anup Rao Paul G. Allen School of Computer Science University of Washington

COMPSCI 514: Algorithms for Data Science

Web Structure Mining Nodes, Links and Influence

Spectral Analysis of Random Walks

Networks as vectors of their motif frequencies and 2-norm distance as a measure of similarity

Community Detection. fundamental limits & efficient algorithms. Laurent Massoulié, Inria

A New Spectral Technique Using Normalized Adjacency Matrices for Graph Matching 1

MLCC Clustering. Lorenzo Rosasco UNIGE-MIT-IIT

Beyond Scalar Affinities for Network Analysis or Vector Diffusion Maps and the Connection Laplacian

Learning from Sensor Data: Set II. Behnaam Aazhang J.S. Abercombie Professor Electrical and Computer Engineering Rice University

17.1 Directed Graphs, Undirected Graphs, Incidence Matrices, Adjacency Matrices, Weighted Graphs

Spectral Graph Theory Lecture 3. Fundamental Graphs. Daniel A. Spielman September 5, 2018

The Simplest Construction of Expanders

CMPSCI 791BB: Advanced ML: Laplacian Learning

Math 443/543 Graph Theory Notes 5: Graphs as matrices, spectral graph theory, and PageRank

Capacity Releasing Diffusion for Speed and Locality

EE 224 Lab: Graph Signal Processing

Recitation 8: Graphs and Adjacency Matrices

A Random Dot Product Model for Weighted Networks arxiv: v1 [stat.ap] 8 Nov 2016

Machine Learning for Data Science (CS4786) Lecture 11

The Strong Largeur d Arborescence

CMPSCI 250: Introduction to Computation. Lecture #11: Equivalence Relations David Mix Barrington 27 September 2013

Quadrature for the Finite Free Convolution

arxiv: v1 [cs.it] 26 Sep 2018

A Statistical Look at Spectral Graph Analysis. Deep Mukhopadhyay

STA141C: Big Data & High Performance Statistical Computing

Algebraic Representation of Networks

Data-dependent representations: Laplacian Eigenmaps

Graphs in Machine Learning

Computer Vision Group Prof. Daniel Cremers. 14. Clustering

Spectral Generative Models for Graphs

Predicting Graph Labels using Perceptron. Shuang Song

Eigenvectors Via Graph Theory

Lecture #14: NP-Completeness (Chapter 34 Old Edition Chapter 36) Discussion here is from the old edition.

Facebook Friends! and Matrix Functions

Lecture Introduction. 2 Brief Recap of Lecture 10. CS-621 Theory Gems October 24, 2012

8.1 Concentration inequality for Gaussian random matrix (cont d)

Finite Frames and Graph Theoretical Uncertainty Principles

Unsupervised Learning Techniques Class 07, 1 March 2006 Andrea Caponnetto

Lecture 24: Element-wise Sampling of Graphs and Linear Equation Solving. 22 Element-wise Sampling of Graphs and Linear Equation Solving

Lecture 18: More NP-Complete Problems

Random Sampling of Bandlimited Signals on Graphs

Transcription:

ORIE 4741: Learning with Big Messy Data Spectral Graph Theory Mika Sumida Operations Research and Information Engineering Cornell September 15, 2017 1 / 32

Outline Graph Theory Spectral Graph Theory Laplacian regularizer Spectral Embedding 2 / 32

What is a graph? A graph is a collection of nodes that are connected by a set of lines or arrows models systems where objects have some pairwise relationship with each other 3 / 32

What is a graph? A graph is a collection of nodes that are connected by a set of lines or arrows models systems where objects have some pairwise relationship with each other Q: What are some examples of graphs in real life? 3 / 32

Examples of graphs : Social network Nodes are users Edges could be Facebook friendships, LinkedIn connections (undirected) Instagram and Twitter follows (directed) 4 / 32

Examples of graphs : Transportation network Subway systems, freight networks Roads, bridges, and highway systems 5 / 32

Examples of graphs : Collaboration graphs Hollywood graph Academic collaborations 6 / 32

Formal definition of graphs A graph, G = (V, E), is made up of a Vertex set V = {v 1,..., v n } and an Edge set E = {e ij } We say an edge e ij connects vertices v i and v j. 7 / 32

Formal definition of graphs A graph, G = (V, E), is made up of a Vertex set V = {v 1,..., v n } and an Edge set E = {e ij } We say an edge e ij connects vertices v i and v j. Two basic types of graphs: Undirected graphs: edges are sets e ij = {v i, v j } Directed graphs: edges are ordered e ij = (v i, v j ) 7 / 32

Formal definition of graphs A graph, G = (V, E), is made up of a Vertex set V = {v 1,..., v n } and an Edge set E = {e ij } We say an edge e ij connects vertices v i and v j. Two basic types of graphs: Undirected graphs: edges are sets e ij = {v i, v j } Directed graphs: edges are ordered e ij = (v i, v j ) The degree of a vertex, d(v) = # of edges incident to v 8 / 32

Formal definition of graphs A graph, G = (V, E), is made up of a Vertex set V = {v 1,..., v n } and an Edge set E = {e ij } We say an edge e ij connects vertices v i and v j. Two basic types of graphs: Undirected graphs: edges are sets e ij = {v i, v j } Directed graphs: edges are ordered e ij = (v i, v j ) The degree of a vertex, d(v) = # of edges incident to v G is connected if there is a path between every two vertices in the graph. 8 / 32

Common graphs Path graph 9 / 32

Common graphs Path graph Complete graph 9 / 32

Common graphs Path graph Complete graph Star graph 9 / 32

Common graphs Path graph Complete graph Star graph Cycle 9 / 32

Graph Theory After modeling our system as a graph, we can ask about maximum degree : finding influential people and celebrities finding complete subgraphs (cliques) : detecting communities minimum cut : sever a communication network into pieces shortest paths : routing cars in transportation network 10 / 32

Outline Graph Theory Spectral Graph Theory Laplacian regularizer Spectral Embedding 11 / 32

Spectral Graph Theory Spectral graph theory studies properties of graphs through the eigenvalues (spectra) and eigenvectors of associated graph matrices Eigenvalues of the Laplacian matrix characterize the connectivity of a graph Approximation algorithms for Max Cut Spectral clustering 12 / 32

Adjacency Matrix Encode connections in a graph in a matrix like a spreadsheet Adjacency matrix is a V V matrix with entries: { 1 if there is an edge e ij A(i, j) = 0 otherwise 13 / 32

Adjacency Matrix Encode connections in a graph in a matrix like a spreadsheet Adjacency matrix is a V V matrix with entries: { 1 if there is an edge e ij A(i, j) = 0 otherwise Example: 2 4 1 3 5 0 1 1 1 0 1 0 1 0 0 1 1 0 1 0 1 0 1 0 1 0 0 0 1 0 13 / 32

Degree Matrix Degree matrix is a V V matrix with entries: { d(i) if i = j D(i, j) = 0 otherwise 14 / 32

Degree Matrix Degree matrix is a V V matrix with entries: { d(i) if i = j D(i, j) = 0 otherwise Example: 2 4 1 3 5 3 0 0 0 0 0 2 0 0 0 0 0 3 0 0 0 0 0 3 0 0 0 0 0 1 14 / 32

Laplacian Matrix Laplacian matrix, L = D - A d(i) if i = j L(i, j) = 1 if there is an edge e ij 0 otherwise Example: 2 4 1 3 5 3 1 1 1 0 1 2 1 0 0 1 1 3 1 0 1 0 1 3 1 0 0 0 1 1 15 / 32

Laplacians of common graphs Path graph 16 / 32

Laplacians of common graphs Path graph 1 1 0 0 L P4 = 1 2 1 0 0 1 2 1 0 0 1 1 Complete graph 16 / 32

Laplacians of common graphs Path graph 1 1 0 0 L P4 = 1 2 1 0 0 1 2 1 0 0 1 1 Complete graph 3 1 1 1 L K4 = 1 3 1 1 1 1 3 1 1 1 1 3 16 / 32

Laplacians of common graphs Star graph 17 / 32

Laplacians of common graphs Star graph 3 1 1 1 L S4 = 1 1 0 0 1 0 1 0 1 0 0 1 Cycle 17 / 32

Laplacians of common graphs Star graph 3 1 1 1 L S4 = 1 1 0 0 1 0 1 0 1 0 0 1 Cycle 2 1 0 1 L C4 = 1 2 1 0 0 1 2 1 1 0 1 2 17 / 32

Matrices as operators Can think of matrices as functions operating on vectors M 1 M1 T v M : R n R n where Mv = M 2 v = M2 T v M n Mn T v What happens if we apply the Laplacian to a vector v? 18 / 32

Laplacian operator Think of v as a distribution of weights or values on the vertices of G. (Lv)(i) = L T i v n = L(i, j)v(j) j=1 = d(i)v(i) + n j=1,j i = d(i)v(i) = j:{i,j} E j:{i,j} E L(i, j)v(j) v(j) ( ) v(i) v(j) 19 / 32

Quadratic Form of the Laplacian The quadratic form associated with a matrix M is the function f : R n R where f (v) = v T Mv 20 / 32

Quadratic Form of the Laplacian The quadratic form associated with a matrix M is the function For the Laplacian matrix, n v T Lv = v(i) (Lv)(i) = i=1 f : R n R where f (v) = v T Mv n v(i) i=1 = {i,j} E = {i,j} E = {i,j} E j:{i,j} E ( ) v(i) v(j) ( ) v(i) v(i) v(j) v(i) 2 2v(i)v(j) + v(j) 2 ( ) 2 v(i) v(j) ( ) + v(j) v(j) v(i) 20 / 32

Laplacian operator (Lv)(i) = v T Lv = {i,j} E {i,j} E and ( ) v(i) v(j) ( ) 2 v(i) v(j) Laplacian operators measure the smoothness of v across the edges of G If v = c 1, Lv = 0 = 0 is an eigenvalue of L with associated eigenvector 1 v T Lv 0 so L is positive semi-definite 21 / 32

Properties of the Laplacian matrix L, D, and A are all real symmetric matrices Spectral Theorem: An n n real symmetric matrix has n real eigenvalues with n real eigenvectors that form an orthonormal basis 22 / 32

Properties of the Laplacian matrix L, D, and A are all real symmetric matrices Spectral Theorem: An n n real symmetric matrix has n real eigenvalues with n real eigenvectors that form an orthonormal basis L is positive semi-definite = All eigenvalues of L are non-negative 22 / 32

Properties of the Laplacian matrix L, D, and A are all real symmetric matrices Spectral Theorem: An n n real symmetric matrix has n real eigenvalues with n real eigenvectors that form an orthonormal basis L is positive semi-definite = All eigenvalues of L are non-negative L has eigenvalues 0 = λ 1 λ 2 λ n 22 / 32

Outline Graph Theory Spectral Graph Theory Laplacian regularizer Spectral Embedding 23 / 32

Smooth regularizer d 1 r(w) = (w i+1 w i ) 2 = Sw 2 = w T S T Sw i=1 where S R (d 1) d is the first order difference operator 1 j = i S(i, j) = 1 j = i + 1 0 otherwise 24 / 32

Smoothed least squares problem Why smooth? minimize n (y i w T x i ) 2 + λ Sw 2 i=1 can couple coefficients of adjacent features allow model to change over space or time example: different years in tax data 25 / 32

Smoothed least squares problem Why smooth? minimize n (y i w T x i ) 2 + λ Sw 2 i=1 can couple coefficients of adjacent features allow model to change over space or time example: different years in tax data Can couple any pair of model coefficients, not just (i, i + 1)! 25 / 32

A closer look at the first order difference operator 1 1 0 0 0 0 1 2 1 0 0 0 S T 0 1 2 0 0 0 S = 0 0 0 2 1 0 0 0 0 1 2 1 0 0 0 0 1 1 26 / 32

A closer look at the first order difference operator 1 1 0 0 0 0 1 2 1 0 0 0 S T 0 1 2 0 0 0 S = 0 0 0 2 1 0 0 0 0 1 2 1 0 0 0 0 1 1 Laplacian of the path graph = L G(Path) Sw 2 = w T L G(Path) w 26 / 32

Laplacian regularizer Suppose we have a graph G that encodes relationships between features Product recommendation: Features are whether a customer bought a certain product. Graph has edges between similar products Geographic features: Features are states of residence. Graph has an edge if two states share a border Time series: Features are based on years. Graph has edges between consecutive years (y, y + 1). Add in weaker time dependencies by having edges (y, y + 2) with smaller weights. Laplacian regularizer smooths coefficients over edges of the graph G. 27 / 32

Laplacian Regularized Least Squares Laplacian regularizer smooths coefficients over edges of the graph G. minimize n (y i w T x i ) 2 + w T L G w i=1 = n (y i w T x i ) 2 + i=1 {i,j} E ( ) 2 w(i) w(j) 28 / 32

Outline Graph Theory Spectral Graph Theory Laplacian regularizer Spectral Embedding 29 / 32

Drawing graphs with eigenvectors Laplacian quadratic form measures difference between values of a vector across edges For eigenvector of L, v T Lv = λ Eigenvector of L with small eigenvalue has similar v-values on adjacent vertices Laplacian eigenvectors of low eigenvalue can be used to embed graph into 2-D or 3-D 30 / 32

Spectral Embedding Demo https://github.com/orie4741/demos/ spectralgraphtheory.ipynb 31 / 32

References Daniel Spielman s Spectral Graph Theory notes : http://www.cs.yale.edu/homes/spielman/561/ David Williamson ORIE 6334 notes : https://people.orie.cornell.edu/dpw/orie6334/ index.html 32 / 32