Ground states for exponential random graphs

Similar documents
Phase transitions in the edge-triangle exponential random graph model

Asymptotics and Extremal Properties of the Edge-Triangle Exponential Random Graph Model

arxiv: v2 [math.pr] 4 Jan 2018

The large deviation principle for the Erdős-Rényi random graph

EMERGENT STRUCTURES IN LARGE NETWORKS

arxiv: v3 [math.pr] 6 Apr 2011

ESTIMATING AND UNDERSTANDING EXPONENTIAL RANDOM GRAPH MODELS

Szemerédi s regularity lemma revisited. Lewis Memorial Lecture March 14, Terence Tao (UCLA)

Graph Limits: Some Open Problems

Multipodal Structure and Phase Transitions in Large Constrained Graphs

Parameter estimators of sparse random intersection graphs with thinned communities

Characterizing extremal limits

Graph limits Graph convergence Approximate asymptotic properties of large graphs Extremal combinatorics/computer science : flag algebra method, proper

Stein s Method for concentration inequalities

Consistency Under Sampling of Exponential Random Graph Models

Graph Detection and Estimation Theory

A LARGE DEVIATION PRINCIPLE FOR THE ERDŐS RÉNYI UNIFORM RANDOM GRAPH

Statistical Inference on Large Contingency Tables: Convergence, Testability, Stability. COMPSTAT 2010 Paris, August 23, 2010

Combinatorial Types of Tropical Eigenvector

arxiv: v3 [math.pr] 19 Apr 2018

arxiv: v3 [math.pr] 18 Aug 2017

The number of edge colorings with no monochromatic cliques

arxiv: v2 [math.pr] 22 Oct 2018

Statistical Physics on Sparse Random Graphs: Mathematical Perspective

Bayesian nonparametric models of sparse and exchangeable random graphs

Random Geometric Graphs

The regularity method and blow-up lemmas for sparse graphs

arxiv: v2 [math.st] 21 Nov 2017

arxiv: v1 [math.pr] 14 Jul 2016

Undecidability of Linear Inequalities Between Graph Homomorphism Densities

A Graphon-based Framework for Modeling Large Networks

BIRTHDAY PROBLEM, MONOCHROMATIC SUBGRAPHS & THE SECOND MOMENT PHENOMENON / 23

Network models: random graphs

Spectra of adjacency matrices of random geometric graphs

Elusive problems in extremal graph theory

Clustering means geometry in sparse graphs. Dmitri Krioukov Northeastern University Workshop on Big Graphs UCSD, San Diego, CA, January 2016

Limits, Continuity, and the Derivative

Applications of the Sparse Regularity Lemma

Limits at Infinity. Horizontal Asymptotes. Definition (Limits at Infinity) Horizontal Asymptotes

1 Mechanistic and generative models of network structure

Induction, sequences, limits and continuity

Hard-Core Model on Random Graphs

Large Deviations and Random Graphs

The typical structure of sparse K r+1 -free graphs

Applications of Eigenvalues in Extremal Graph Theory

Sparse halves in dense triangle-free graphs

Lecture 7: The Subgraph Isomorphism Problem

7.32/7.81J/8.591J: Systems Biology. Fall Exam #1

Extensions of a theorem of Erdős on nonhamiltonian graphs

The β-model for random hypergraphs

Hyperbolic metric spaces and their applications to large complex real world networks II

A Note On Large Deviation Theory and Beyond

5. Density evolution. Density evolution 5-1

RELATIVE N-TH NON-COMMUTING GRAPHS OF FINITE GROUPS. Communicated by Ali Reza Ashrafi. 1. Introduction

CH 2: Limits and Derivatives

On the Logarithmic Calculus and Sidorenko s Conjecture

An exact Turán result for tripartite 3-graphs

Applications of the Lopsided Lovász Local Lemma Regarding Hypergraphs

Probabilistic Foundations of Statistical Network Analysis Chapter 3: Network sampling

Random Networks. Complex Networks CSYS/MATH 303, Spring, Prof. Peter Dodds

SZEMERÉDI S REGULARITY LEMMA 165

Independent Transversals in r-partite Graphs

Lecture 4: Graph Limits and Graphons

Random Networks. Complex Networks, CSYS/MATH 303, Spring, Prof. Peter Dodds

Random Geometric Graphs

A Variational Principle for Permutations

McGill University Math 354: Honors Analysis 3

Undirected Graphical Models

Cutwidth and degeneracy of graphs

The Algorithmic Aspects of the Regularity Lemma

3.2 Configuration model

Improper Integrals. Goals:

Generalized Exponential Random Graph Models: Inference for Weighted Graphs

Networks: Lectures 9 & 10 Random graphs

The regularity method and blow-up lemmas for sparse graphs

Graphical Model Inference with Perfect Graphs

Almost giant clusters for percolation on large trees

Tree Decomposition of Graphs

5.3 METABOLIC NETWORKS 193. P (x i P a (x i )) (5.30) i=1

Theory and Methods for the Analysis of Social Networks

Tiling on multipartite graphs

Adventures in random graphs: Models, structures and algorithms

Algebra II Unit Breakdown (Curriculum Map Outline)

Imprimitive symmetric graphs with cyclic blocks

AP Calculus Testbank (Chapter 9) (Mr. Surowski)

Data Mining and Analysis: Fundamental Concepts and Algorithms

Topic 3 Outline. What is a Limit? Calculating Limits Infinite Limits Limits at Infinity Continuity. 1 Limits and Continuity

EQ: What are limits, and how do we find them? Finite limits as x ± Horizontal Asymptote. Example Horizontal Asymptote

Induced Turán numbers

The Zarankiewicz problem in 3-partite graphs

On the 3-local profiles of graphs

Math 116 Second Midterm March 20, 2013

arxiv: v2 [stat.ot] 10 Nov 2011

An Erdős-Stone Type Conjecture for Graphic Sequences

Lecture 06 01/31/ Proofs for emergence of giant component

Multipodal Structure and Phase Transitions in Large Constrained Graphs

THE ENTROPY FUNCTION OF AN INVARIANT MEASURE

Solving the MWT. Recall the ILP for the MWT. We can obtain a solution to the MWT problem by solving the following ILP:

Information Dimension

Proceedings of the 2014 Federated Conference on Computer Science and Information Systems pp

Transcription:

Ground states for exponential random graphs Mei Yin Department of Mathematics, University of Denver March 5, 2018

We propose a perturbative method to estimate the normalization constant in exponential random graph models as the weighting parameters approach infinity. As an application, we give evidence of discontinuity in natural parametrization along the critical directions of the edge-triangle model. Joint work with Rajinder Mavi.

Outline Exponential random graphs; Graph limit theory The edge-triangle model; Perturbation analysis

Erdős-Rényi graph G(n, ρ): n vertices; include edges independently with probability ρ. Empirical study of network structure shows that transitivity is the outstanding feature that differentiates observed data from a pattern of random ties. Modeling transitivity (or lack thereof) in a way that makes statistical inference feasible however has proved to be rather difficult. One direction is using exponential random graph models. They are particularly useful when one wants to construct models that resemble observed networks as closely as possible, but without going into detail of the specific process underlying network formation.

Probability space: The set G n of all simple graphs G n on n vertices. Probability mass function: ( ) P β n(g n ) = exp n 2 (β 1 t(h 1, G n ) +... + β k t(h k, G n ) ψn β ). β 1,..., β k are real parameters and H 1,..., H k are pre-chosen finite simple graphs. Each H i has vertex set V (H i ) = {1,..., k i } and edge set E(H i ). By convention, we take H 1 to be a single edge. Graph homomorphism hom(h i, G n ) is a random vertex map V (H i ) V (G n ) that is edge-preserving. Homomorphism density t(h i, G n ) = hom(h i,g n) V (G n) V (H i ). Normalization constant: ψ β n = 1 n 2 log G n G n exp ( n 2 (β 1 t(h 1, G n ) +... + β k t(h k, G n )) ).

β i = 0 for i 2: ( ) P β n(g n ) = exp n 2 (β 1 t(h 1, G n ) ψn β ) ( ) = exp 2β 1 E(G n ) n 2 ψn β. Erdős-Rényi graph G(n, ρ), P ρ n(g n ) = ρ E(Gn) (1 ρ)( n 2 ) E(G n). Include edges independently with probability ρ = e 2β 1 /(1 + e 2β 1 ). exp(n 2 ψn β ) = ( ) ( ) n 1 2 exp (2β 1 E(G n ) ) =. 1 ρ G n G n

What happens with general β i? Problem: Graphs with different numbers of vertices belong to different probability spaces! Solution: Theory of graph limits (graphons)! (Lovász and coauthors; earlier work of Aldous and Hoover) Graphon space W is the space of all symmetric measurable functions h(x, y) from [0, 1] 2 into [0, 1]. The interval [0, 1] represents a continuum of vertices, and h(x, y) denotes the probability of putting an edge between x and y.

What happens with general β i? Problem: Graphs with different numbers of vertices belong to different probability spaces! Solution: Theory of graph limits (graphons)! (Lovász and coauthors; earlier work of Aldous and Hoover) Graphon space W is the space of all symmetric measurable functions h(x, y) from [0, 1] 2 into [0, 1]. The interval [0, 1] represents a continuum of vertices, and h(x, y) denotes the probability of putting an edge between x and y.

Example: Erdős-Rényi graph G(n, ρ), h(x, y) = ρ. Example: Any G n G n, { 1, if ( nx, ny ) is an edge in Gn ; h(x, y) = 0, otherwise.

Large deviation and Concentration of measure: ( ) ψ β = lim n ψβ n = max h W β 1 t(h 1, h) +... + β k t(h k, h) I (h)dxdy [0,1] 2, where: t(h i, h) = [0,1] k i (i,j) E(H i ) h(x i, x j )dx 1...dx ki, and I : [0, 1] R is the function I (u) = 1 2 u log u + 1 (1 u) log(1 u). 2

Let F be the set of maximizers. G n lies close to F with high probability for large n. β 2,..., β k 0: G n behaves like the Erdős-Rényi graph G(n, u ), where u [0, 1] maximizes β 1 u +... + β k u E(Hk) 1 2 u log u 1 (1 u) log(1 u). 2 (Chatterjee and Varadhan; Chatterjee and Diaconis; Häggström and Jonasson; Bhamidi, Bresler, and Sly)

Take H 1 a single edge and H 2 a triangle. Fix the edge parameter β 1. Let the triangle parameter β 2 vary from 0 to. Then ψ β 1,β 2 loses its analyticity at at most one value of β 2. (Radin and Y) Gas/liquid transition: Critical point is ( 1 2 log 2 3 4, 9 16 ).

The line β 1 = β 2 is of particular importance. The edge-triangle model transitions from an Erdős-Rényi type almost complete graph (β 1 > β 2 ) to an Erdős-Rényi type almost empty graph (β 1 β 2 ). (Y) 3 Region I: nearly complete graphs 2 u * ~ 1-e -2(- 1 +3-2 ) - 2 u * ~ e 2-1 1 Region II: sparse graphs 0-3 -2-1 0 1 2 3-1

Feasible edge-triangle densities.

Upper bound: complete subgraph on e 1/2 n vertices. Lower bound for e 1/2: complete bipartite graph with 1 2e fraction of edges randomly deleted. Lower bound for e 1/2: complicated scallop curves where boundary points are complete multipartite graphs. (Razborov and others)

Take β 1 = aβ 2 + b. Fix a and b. Let n and then let β 2. G n exhibits quantized behavior. (Y, Rinaldo, and Fadnavis; related work in Handcock; Rinaldo, Fienberg, and Zhou) -1.0-0.5 0.0 0.5 1.0-1.0-0.5 0.0 0.5 1.0 The infinite polytope.

Problem: Along the critical directions o k, a typical graph sampled from the edge-triangle model may behave like either a Turán graphon with k + 1 classes or a Turán graphon with k + 2 classes, with no clear preference. Solution: Refined perturbation approximation! (Mavi and Y)

Take a finite simple graph H with vertex set V (H) and edge set E(H). For each (r, s) E(H) and each pair of points x r, x s [0, 1], define H,r,s f (x r, x s ) = f (x r, x s ) dx v. [0,1] V (H)\{r,s} (r,s ) E(H) (r,s ) (r,s) v V (H) v r,s This may be identified with the homomorphism density t(h, h), if we remove edge (r, s) from H and do not integrate out the associated vertices x r and x s.

For (x, y) [0, 1], define H f (x, y) = (r,s) E(H) H,r,s f (x, y). This corresponds to the total homomorphism density generated after all possible ways of removing one edge from H.

Assumption: H f = αχ A + βχ B, where A and B are complement sets. Example: Let K k be a complete graph on k vertices. Denote f K k+1 by T k (Turán graphon with k + 1 classes) and χ [0,1] 2 f K k+1 by D k. Then T k and D k are indicator functions associated with sets of measure k/(k + 1) and 1/(k + 1), respectively. K2 T k 1. 3(k 1) K3 T k = 3 T k (x, z)t k (y, z)dz = [0,1] k + 1 T k + 3k k + 1 D k.

1.0 t 0.8 0.6 E-R curve 0.4 tangent cones 0.2 0.2 0.4 0.6 0.8 1.0 e Region of attainable edge (e) and triangle (t) densities for graphons revisited. The tangent cones at complete bipartite graphon (Turán graphon with 2 classes) and complete tripartite graphon (Turán graphon with 3 classes) are displayed.

The (internal) tangent cone C R (v k ), consisting of graphons of the type X h = at k + bd k. Taking a = 1, b = 0 gives the Turán graphon with k + 1 classes, whose edge-triangle densities are v k = (e k, t k ). Here g k (e k 0) and g k (e k + 0) indicate the left and right hand side derivatives of the lower boundary curve g k at e k, respectively.

Let h be the true maximizing graphon for the edge-triangle model: H 1 = K 2 and H 2 = K 3. T k and h become arbitrarily close as β. Consider the averaged graphon perturbation X h = at k + bd k, viewed as a flattened out version of h. We may justify that for suitably chosen values of a and b, t(h i, X h ) gives a better asymptotic approximation for t(h i, h) than t(h i, T k ), hence removing the ambiguity.

A simulated realization of the exponential random graph model on 60 nodes with edges and triangles as sufficient statistics, where β = (10, 7.5) diverges along the critical direction o 1 = (1, 0.75). The simulated graph displays bipartite feature with edge density 0.557.

Thanks For Having Me Back!:)