A LARGE DEVIATION PRINCIPLE FOR THE ERDŐS RÉNYI UNIFORM RANDOM GRAPH

Similar documents
The large deviation principle for the Erdős-Rényi random graph

Graph Limits: Some Open Problems

EMERGENT STRUCTURES IN LARGE NETWORKS

arxiv: v2 [math.pr] 4 Jan 2018

Multipodal Structure and Phase Transitions in Large Constrained Graphs

Szemerédi s Lemma for the Analyst

Course 212: Academic Year Section 1: Metric Spaces

arxiv: v3 [math.pr] 19 Apr 2018

arxiv: v3 [math.pr] 18 Aug 2017

Gärtner-Ellis Theorem and applications.

Compact orbit spaces in Hilbert spaces and limits of edge-colouring models

arxiv: v2 [math.pr] 22 Oct 2018

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

On Semicontinuity of Convex-valued Multifunctions and Cesari s Property (Q)

Maths 212: Homework Solutions

arxiv: v2 [math.co] 11 Mar 2008

REVIEW OF ESSENTIAL MATH 346 TOPICS

INDISTINGUISHABILITY OF ABSOLUTELY CONTINUOUS AND SINGULAR DISTRIBUTIONS

Contents: 1. Minimization. 2. The theorem of Lions-Stampacchia for variational inequalities. 3. Γ -Convergence. 4. Duality mapping.

Metric Space Topology (Spring 2016) Selected Homework Solutions. HW1 Q1.2. Suppose that d is a metric on a set X. Prove that the inequality d(x, y)

g 2 (x) (1/3)M 1 = (1/3)(2/3)M.

Metric Spaces and Topology

The small ball property in Banach spaces (quantitative results)

Statistical Inference on Large Contingency Tables: Convergence, Testability, Stability. COMPSTAT 2010 Paris, August 23, 2010

Topological properties

Analysis Finite and Infinite Sets The Real Numbers The Cantor Set

Stanford Mathematics Department Math 205A Lecture Supplement #4 Borel Regular & Radon Measures

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

1 Topology Definition of a topology Basis (Base) of a topology The subspace topology & the product topology on X Y 3

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi

MA651 Topology. Lecture 10. Metric Spaces.

Statistical Physics on Sparse Random Graphs: Mathematical Perspective

SPACES ENDOWED WITH A GRAPH AND APPLICATIONS. Mina Dinarvand. 1. Introduction

On Kusuoka Representation of Law Invariant Risk Measures

Math 4317 : Real Analysis I Mid-Term Exam 1 25 September 2012

Large Deviations and Random Graphs

A note on the convex infimum convolution inequality

A LOCALIZATION PROPERTY AT THE BOUNDARY FOR MONGE-AMPERE EQUATION

Ground states for exponential random graphs

Chapter 3 Continuous Functions

Notions such as convergent sequence and Cauchy sequence make sense for any metric space. Convergent Sequences are Cauchy

CHROMATIC ROOTS AND LIMITS OF DENSE GRAPHS

MATH 51H Section 4. October 16, Recall what it means for a function between metric spaces to be continuous:

Introduction and Preliminaries

First Order Convergence and Roots

Topology. Xiaolong Han. Department of Mathematics, California State University, Northridge, CA 91330, USA address:

Lecture Notes in Advanced Calculus 1 (80315) Raz Kupferman Institute of Mathematics The Hebrew University

AN EXPLORATION OF THE METRIZABILITY OF TOPOLOGICAL SPACES

1/12/05: sec 3.1 and my article: How good is the Lebesgue measure?, Math. Intelligencer 11(2) (1989),

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

Recall that if X is a compact metric space, C(X), the space of continuous (real-valued) functions on X, is a Banach space with the norm

Set, functions and Euclidean space. Seungjin Han

arxiv: v2 [math.ag] 24 Jun 2015

arxiv: v1 [math.pr] 24 Sep 2009

Mathematics for Economists

Partial cubes: structures, characterizations, and constructions

AN EFFECTIVE METRIC ON C(H, K) WITH NORMAL STRUCTURE. Mona Nabiei (Received 23 June, 2015)

n [ F (b j ) F (a j ) ], n j=1(a j, b j ] E (4.1)

Spectral Gap and Concentration for Some Spherically Symmetric Probability Measures

Continued fractions for complex numbers and values of binary quadratic forms

LARGE DEVIATIONS OF TYPICAL LINEAR FUNCTIONALS ON A CONVEX BODY WITH UNCONDITIONAL BASIS. S. G. Bobkov and F. L. Nazarov. September 25, 2011

Integral Jensen inequality

Notes 6 : First and second moment methods

Chapter 8. P-adic numbers. 8.1 Absolute values

BERNOULLI ACTIONS AND INFINITE ENTROPY

3 Measurable Functions

Chapter 2 Metric Spaces

Continuity. Matt Rosenzweig

NOTES ON DIOPHANTINE APPROXIMATION

Lecture 4 Lebesgue spaces and inequalities

Real Analysis Notes. Thomas Goller

Elusive problems in extremal graph theory

Szemerédi s regularity lemma revisited. Lewis Memorial Lecture March 14, Terence Tao (UCLA)

Continuity of convex functions in normed spaces

Convex Analysis and Economic Theory AY Elementary properties of convex functions

DYNAMICAL PROPERTIES OF MONOTONE DENDRITE MAPS

General Theory of Large Deviations

SHARP BOUNDARY TRACE INEQUALITIES. 1. Introduction

Real Analysis, 2nd Edition, G.B.Folland Elements of Functional Analysis

MATH 426, TOPOLOGY. p 1.

CHAPTER 9. Embedding theorems

ASYMPTOTICALLY NONEXPANSIVE MAPPINGS IN MODULAR FUNCTION SPACES ABSTRACT

Math 117: Topology of the Real Numbers

PERTURBATION THEORY FOR NONLINEAR DIRICHLET PROBLEMS

Subdifferential representation of convex functions: refinements and applications

Overview of normed linear spaces

BOUNDARY VALUE PROBLEMS ON A HALF SIERPINSKI GASKET

On the threshold for k-regular subgraphs of random graphs

Supplementary Notes for W. Rudin: Principles of Mathematical Analysis

MATHS 730 FC Lecture Notes March 5, Introduction

Measure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond

4 Countability axioms

Chapter 4. Measure Theory. 1. Measure Spaces

Invariant measures for iterated function systems

(convex combination!). Use convexity of f and multiply by the common denominator to get. Interchanging the role of x and y, we obtain that f is ( 2M ε

AMSI 2013: MEASURE THEORY Handout 4 Measurable and Borel Functions

MATH 4200 HW: PROBLEM SET FOUR: METRIC SPACES

Szemerédi s Lemma for the Analyst

1.1. MEASURES AND INTEGRALS

Empirical Processes: General Weak Convergence Theory

Transcription:

A LARGE DEVIATION PRINCIPLE FOR THE ERDŐS RÉNYI UNIFORM RANDOM GRAPH AMIR DEMBO AND EYAL LUBETZKY Abstract. Starting with the large deviation principle (ldp) for the Erdős Rényi binomial random graph G(n, p) (edge indicators are i.i.d.), due to Chatterjee and Varadhan (2011), we derive the ldp for the uniform random graph G(n, m) (the uniform distribution over graphs with n vertices and m edges), at suitable m = m n. Applying the latter ldp we find that tail decays for subgraph counts in G(n, m n) are controlled by variational problems, which up to a constant shift, coincide with those studied by Kenyon et al. and Radin et al. in the context of constrained random graphs, e.g., the edge/triangle model. 1. Introduction The Erdős Rényi binomial random graph model G(n, p) is the graph on n vertices where each edge is present independently with probability p; the uniform random graph G(n, m) is the uniform distribution over graphs with n vertices and exactly m edges. Let W be the space of all bounded measurable functions f : [0, 1] 2 R that are symmetric (f(x, y) = f(y, x) for all x, y [0, 1]). Let W 0 W denote all graphons, that is, symmetric measurable functions [0, 1] 2 [0, 1] (these generalize finite graphs; see (1.). The cut-norm of W W is given by W := sup W (x, y) dxdy = sup W (x, y)u(x)v(y) dxdy, S T [0,1] 2 S,T [0,1] u,v : [0,1] [0,1] (by linearity of the integral it suffices to consider 0, 1-valued u, v, hence the equality). For any measure-preserving map σ : [0, 1] [0, 1] and W W, let W σ W denote the graphon W σ (x, y) = W (σ(x), σ(y)). The cut-distance on W is then defined as δ (W 1, W 2 ) := inf σ W 1 W σ 2, with the infimum taken over all measure-preserving bijections σ on [0, 1]. It yields the pseudo-metric space (W 0, δ ), which is elevated into a genuine metric space ( W 0, δ ) upon taking the quotient w.r.t. the equivalence relation W 1 W 2 iff δ (W 1, W 2 ) = 0. In what may be viewed as a topological version of Szemerédi s regularity lemma, Lovász and Szegedy [11] showed that the metric space ( W 0, δ ) is compact. For a finite simple graph H = (V (H), E(H)) with V (H) = 1,..., k, its subgraph density in W W 0 is t H (W ) := W (x i, x j ) dx 1 dx k, [0,1] k (i,j) E(H) with the map W t H (W ) being Lipschitz-continuous in ( W 0, δ ) (see [2, Thm 3.7]). Define I p : [0, 1] R by I p (x) := x 2 log x p + 1 x 2 log 1 x 1 p for p (0, 1) and x [0, 1], (1.1) 2010 Mathematics Subject Classification. 05C80, 60F10. Key words and phrases. Large deviations, Erdős Rényi graphs, constrained random graphs. 1

2 AMIR DEMBO AND EYAL LUBETZKY and extend I p to W 0 via I p (W ) := [0,1] 2 I p (W (x, y)) dxdy for W W 0. As I p is convex on [0, 1], it is lower-semicontinuous on W 0 w.r.t. the cut-metric topology ([4, Lem. 2.1]). In the context of the space of graphons W 0, a simple graph G with vertices 1,..., n can be represented by W G (x, y) = 1 if ( nx, ny ) is an edge of G, 0 otherwise. (1. For two graphs G and H let hom(h, G) count the number of homomorphisms from H to G (i.e., maps V (H) V (G) that carry edges to edges). Let t H (G) := V (G) V (H) hom(h, G) = t H (W G ). A sequence of graphs G n n 1 is said to converge if the sequence of subgraph densities t H (G n ) converges for every fixed finite simple graph H. It was shown in [11] that for any such convergent graph sequence there is a limit object W W 0 such that t H (G n ) t H (W ) for every fixed H. Conversely, any W W 0 arises as a limit of a convergent graph sequence. It was shown in [2] that a sequence of graphs G n n 1 converges if and only if the sequence of graphons W Gn W 0 converges in W 0 w.r.t. δ A random graph G n G(n, p) corresponds to a random point W Gn W 0 inducing a probability distribution P(G n ) on W 0 supported on a finite set of points (n-vertex graphs) and G n W for the constant graphon W p a.s. for every fixed 0 < p < 1. Chatterjee and Varadhan [4] showed that, for 0 < p < 1 fixed, the random graph G(n, p) obeys a large deviation principle (ldp) in ( W 0, δ ) with the rate function I p ( ). Further denote W 1 = W (x, y) dxdy, and considering the restricted spaces W 0 := W W 0 : W 1 = p and W 0 = W W 0 : W 1 = p, here we deduce the analogous statement for the random graph G(n, m), the uniform distribution over all graphs with n vertices and exactly m edges, with a rate function J p ( ) restricted to W 0. As we later conclude, the variational formulas of this ldp for G(n, m), addressing such random graph structure conditioned on a large deviation, coincide with those studied earlier by Kenyon et al. and Radin et al. (cf., [7 9]). Theorem 1.1. Fix 0 < p < 1 and let m n N be such that m n / ( n p as n. Let G n G(n, m n ). Then the sequence P(G n ) obeys the ldp in the space ( W 0, δ ) with the good rate function J p, where J p (W ) = I p (W ) if W W 0 and is otherwise. That is, for any closed set F W 0, and for any open U W 0, lim sup n 2 log P(G n F ) inf J p(w ), W F lim inf n 2 log P(G n U) inf W U J p(w ).

LARGE DEVIATIONS IN UNIFORM RANDOM GRAPHS 3 Define and further let φ H (p, r) := inf ψ H (p, r) := inf I p (W ): W W 0, t H (W ) r I p (W ): W W 0, t H (W ) r (1.3) (1.4) (with I p having compact level sets in ( W 0, δ ) and t H ( ) continuous on ( W 0, δ ), the infimums in (1.3),(1.4) are attained whenever the relevant set of graphons is nonempty). For any r t H we relate the equivalent form of (1.4) (see Corollary 1., given by ψ H (p, r) = inf I p (W ): W W 0, t H (W ) = r, (1.5) to the following variational problem that has been extensively studied (e.g., [7 9, 14]) in constrained random graphs such as the edge/triangle model (where H is a triangle): F H (p, r) := sup h e (W ): W W 0, t H (W ) = r, (1.6) where h e (x) = 1 2 (x log x + (1 x) log(1 x)) is the (natural base) entropy function. As I p (x) = h e (x) x 1 x 2 log p 2 log(1 p) and W 1 = p throughout W 0, we see that both variational problems for F H and ψ H have the same set of optimizers, and F H (p, r) = ψ H (p, r) + h e. As a main application of their ldp, Chatterjee and Varadhan [4] showed that the large deviation rate function for subgraph counts in G(n, p) for any fixed 0 < p < 1 and graph H reduces to the variational problem (1.3). Namely, if G n G(n, p) then lim n 2 log P (t H (G n ) r) = φ H (p, r) for every fixed p, r (0, 1) and H, and, on the event t H (G n ) r, the graph G n is typically close to a minimizer of (1.3). Theorem 1.1 implies the analogous statement for the random graph G(n, m n ) w.r.t. the variational problem (1.4) (similar statements hold for lower tails of subgraph counts both in case of G(n, p) and that of G(n, m n )). Corollary 1.2. Fixing a subgraph H and 0 < p < 1, let r H (t H, 1] denote the largest r for which the collection of graphons in (1.4) is nonempty. (a) The lsc function r ψ H (p, r) is zero on [0, t H ] and finite, strictly increasing on [t H, r H ]. The nonempty set F of minimizers of (1.4) is a single point W p for r t H and F coincides for any r [t H, r H ] with the minimizers of (1.5). (b) For any m n N such that m n / ( n p as n and any right-continuity point r [0, r H ) of t ψ H (p, t), the random graph G n G(n, m n ) satisfies lim n 2 log P (t H (G n ) r) = ψ H (p, r). (1.7) (c) For any (p, r) as in part (b), and every ε > 0 there is C = C(H, ε, p, r) > 0 so that for all n large enough P ( δ (G n, F ) ε t H (G n ) r ) e Cn2. (1.8)

4 AMIR DEMBO AND EYAL LUBETZKY Remark 1.3. Since the function r ψ H (p, r) is monotone, it is continuous a.e.; however, the identity (1.7) may fail when ψ H (p, ) is discontinuous at r. For example, at r = r H the lhs of (1.7) equals whenever m n / ( n p slowly enough. Remark 1.4. The analog of (1.7) in the sparse regime (with edge density p n = o(1)) has been established in [3] in terms of a discrete variational problem in lieu of (1.3), valid when n c H p n 1 for some c H > 0 (see also [6], improving the range of p n, and [?, 1, 12, 13] for analyses of these variational problems in the sparse/dense regimes); In contrast with the delicate regime p n = n c, such results in the range p n (log n) c of G(n, p) are a straightforward consequence of the weak regularity lemma (cf. [13, 5]), and further extend to G(n, m n ), where the discrete variational problem features an extra constraint on the number of edges (see Proposition 3.. Consider (p, r) in the setting of Corollary 1.2. The studies of the variational problem for F H given in (1.6) were motivated by the question of estimating the number of graphs with prescribed edge and H-densities, via the following relation: F H (p, r) = lim δ 0 lim 1 δ log H n2 n,p,r where Hn,p,r δ = G n : E(G n ) / ( n p δ, t H (G n ) r δ (This follows by general principles from the ldp of [4] for G(n, p); see Proposition 2.1(a), or [14, Thm 3.1] for the derivation in the special case of the edge/triangle model). Corollary 1.2 allows us, roughly speaking, to interchange the order of these two limits; for instance, for any right-continuity point r t H of t ψ H (p, t) (which holds a.e.), the same variational problem in (1.6) also satisfies 1 F H (p, r) = lim n 2 log H n,m n,r where H n,m,r = G n : E(G n ) = m, t H (G n ) r.. (1.9) (Indeed, ψ H (p, r) = lim n 2 log P (t H (G(n, m n )) r), and this log-probability is then translated to log H n,mn,r by adding n 2 log ( ( n ) m n he = F H (p, r)+ψ H (p, r).) For the various results (as well as numerical simulations for the many problems related to (1.6) that remain open), the reader is referred to [7 9] and the references therein. Recall that the law of G(n, m n ) can be represented as that of a random graph G n from the model G(n, p), conditional on E(G n ) = m n. While our choice of m n in Theorem 1.1 is rather typical for G(n, p) when n 1, any ldp and in particular the ldp of [4], deals only with open and closed sets. The challenge in deriving Theorem 1.1 is thus in handling the point conditioning. To this end, we provide in Section 2 a general result (Proposition 2.1) for deriving a conditional ldp, which we then combine in 3.1 with a combinatorial coupling, and thereby prove Theorem 1.1. Building on the latter, 3.2 provides the proof of Corollary 1.2, whereas 3.3 is devoted to the analog of (1.7) for G(n, m n ) in the range m n n 2 (log n) c H (see Proposition 3..

LARGE DEVIATIONS IN UNIFORM RANDOM GRAPHS 5 2. Conditional LDP The ldp for G(n, m) is obtained by the next result, whose proof mimics that of [5, Theorem 4.2.16] about exponential approximations (see [5, Definition 4.2.14]). Proposition 2.1. Suppose Borel probability measures µ n on a metric space (X, d) satisfy the ldp with rate a n 0 and good rate function I( ). Fix a metric space (S, ρ), a continuous map f : (X, d) (S, ρ) and s S. For every η > 0, let Z η n denote radnom variables of the law ν η n := µ n ( B o f,s,η ), (2.1) where B f,s,η := x X : ρ(s, f(x)) η, Bf,s,η o := x X : ρ(s, f(x)) < η. (a) If lim n log µ n (B o f,s,η ) = 0 for every η > 0 fixed, (2. then for the good rate function I(x), f(x) = s J 0 (x) :=, otherwise and any open U X and closed F X, lim inf η 0 lim sup η 0 lim inf a n log ν η n(u) inf J 0(x), (2.3) x U lim sup a n log νn(f η ) inf J 0(x). (2.4) x F (b) Suppose (2. holds and that Z η n form an exponentially good approximation of variables Z n ν n ; i.e., for any δ > 0, there exist couplings P n,η of (Z n, Z η n) so that lim lim sup a n log P n,η (d(z n, Z η η 0 n) > δ) =. (2.5) Then ν n satisfy the ldp with rate a n 0 and the good rate function J 0 ( ). Proof. We first deduce from (2. that for every η > 0, open U X and closed F X, where J η (x) := Indeed, for any Borel set A and η > 0, lim inf n log ν η n(u) inf η o (x), x U (2.6) lim sup a n log νn(f η ) inf η(x), x F (2.7) I(x), x Bf,s,η I(x), x B, otherwise, J η o o (x) := f,s,η, otherwise. µ n (A B o f,s,η ) νη n(a) µ n(a B f,s,η ) µ n (B o f,s,η ).

6 AMIR DEMBO AND EYAL LUBETZKY Hence, for any open set U, we deduce from the ldp for µ n that lim inf a n log ν η n(u) lim inf a n log µ n (U B o f,s,η ) Similarly, for any closed set F it follows from (2. that lim sup a n log ν η n(f ) lim sup inf x U B o f,s,η I(x) = inf x U J o η (x). a n log µ n (F B f,s,η ) inf I(x) = inf J η(x). x F B f,s,η x F (a). In the lower bound (2.6) one obviously can use J 0 ( ) J o η ( ), yielding (2.3). Moreover, we get the bound (2.4) out of (2.7), upon showing that for any closed F X, inf J 0(y) lim inf inf J η(y) := α. (2.8) y F η 0 y F To this end, it suffices to consider only α <, in which case J ηl (y l ) α+l 1 for some η l 0 and y l F. As y l is contained in the compact level set x : I(x) α + 1, it has a limit point y F. Since J ηl (y l ) = I(y l ) α it follows from the lsc of x I(x) that I(y ) α. Passing to the convergent sub-sequence ρ(f(y l ), f(y )) 0. Further, recall that ρ(s, f(y l )) η l 0, hence by the triangle inequality ρ(s, f(y )) = 0. Consequently, J 0 (y ) = I(y ) α yielding (2.8) and completing the proof of part (a). (b). Clearly, J η is a good rate function (namely, of compact level sets x : J η (x) α = x : I(x) α B f,s,η ), and J η J o η J 0. If J o η J η then (2.7) (2.6) form the ldp for ν η n with the good rate function J η. While in general this may not be the case, assuming hereafter that (2.5) holds and proceeding as in [5, (4.2.20)], we get from (2.6) that ν n satisfies the ldp lower bound with the rate function J(y) := sup lim inf inf Jη o (z), δ>0 η 0 z B y,δ where B y,δ = z X : d(y, z) < δ (see [5, (4.2.17)], noting that no ldp upper bound for ν η n is needed here). Since y B y,δ for any δ > 0, we have that J 0 (y) = lim η 0 J o η (y) J(y) and consequently ν n trivially satisfies the ldp lower bound also with respect to the good rate function J 0. Now, precisely as in the proof of [5, Theorem 4.2.16(b)], we get from (2.5) and (2.7) that the corresponding ldp upper bound holds for ν n, thanks to (2.8) (see [5, (4.2.18)]), thereby completing the proof of part (b) of Prop. 2.1. 3. LDP for the uniform random graph 3.1. Proof of Theorem 1.1. Let µ n be the law of G(n, p), which obeys the ldp with good rate function I p ( ) on ( W 0, δ ) and speed n 2, and let ν n denote the law of G(n, m n ). We shall apply Proposition 2.1(b) for S = R and s = p, with f denoting the L 1 -norm on graphons (edge density): f(w ) := W 1 = W (x, y) dxdy.

LARGE DEVIATIONS IN UNIFORM RANDOM GRAPHS 7 With these choices, the role of Z n will be assumed by G n G(n, m n ), whereas those of the random variables Zn η will be assumed by the binomial random graph G(n, p) conditioned on having between 1 2 (p η)n2 and 1 2 (p + η)n2 edges: G η n ( G(n, p) ) B o p,η, where B o p,η = G : 2 E(G) (p η, p + η) (3.1) n 2 Note that p n := 2m n /n 2 (p η, p + η) for all n n 0 (η). We couple (G n, G η n) so that for such n, detereministically, E(G n ) E(G η n) < ηn 2 (3. (here S T denotes symmetric difference). This is achieved by the following procedure: (i) Draw G n G(n, m n ). (ii) Independently of G n draw E n Bin( ( n, p) and Mn ( E n 2E n /n 2 p < η ). Let D n = M n m n and obtain G η n from G n as follows: [shortage] if D n 0: add a uniformly chosen subset of D n edges missing from G n. [surplus] if D n 0: delete a uniformly chosen subset of D n edges from G n. Since D n < ηn 2 this guarantees (3. and has G n ν n ; the additional fact that G η n νn η is seen by noting that, if G G(n, p) then E(G) Bin( ( n, p), and on the event that G has M edges, these are uniformly distributed (i.e., the conditional distribution is G(n, M)). We proceed to show that such G η n form an exponentially good approximation of G n. Indeed, from the identity t i=1 a i t i=1 b i = t j=1 ( i<j a i)(a j b j )( i>j b i) and the definition of t H ( ), we find that for any H of t edges and graphs G, G on n vertices, th (G) t H (G ) t WG W G 1 = 2t n 2 E(G) E(G ) (3.3) (see also [10, Lemma 10.22]). Next, fixing δ > 0, we set k(δ) N large enough so that δ > 22/ log 2 k (for example, k = 2 (25/δ)2 ), and recall the following result: Theorem ([2, Thm. 2.7(b)]). If k 1 and the graphs G, G are such that for every simple graph H on k vertices t H (G) t H (G ) 3 k2, then δ (W G, W G ) 22/ log 2 k. To utilize this relation, set η 0 (δ) = k 2 3 k2 noting that if graphs G, G on n vertices satisfy E(G) E(G ) < ηn 2 for some η η 0, then t H (G) t H (G ) < 2 ( ) k 2 η0 < 3 k2 for every graph H on k vertices, and so by the preceding δ (G, G ) < δ. In particular, from (3. we deduce that for every η η 0 and all n n 0 (η), P (δ (G n, G η n) > δ) = 0 holds under the above coupling of (G n, G η n), thereby implying (2.5). Finally, Noting that Bp,η o of (3.1) is the event 2E n /n 2 p < η (with E n Bin( ( n, p) under µ n ), we deduce from the lln that µ n (Bp,η) o 1. In particular, for any η > 0 one has that n 2 log µ n (Bp,η) o 0, thereby verifying (2. for the case at hand.

8 AMIR DEMBO AND EYAL LUBETZKY W 0 and oth- 3.2. Proof of Corollary 1.2. (a). Recalling that J p (W ) = I p (W ) on erwise J p (W ) =, we express (1.4) as for the closed set of graphons Γ r := ψ H (p, r) = W W 0 inf J p (W ), W Γ r : t H (W ) r, (3.4) denoting by Γ =r the closed subset of graphons with t H (W ) = r. The unique global minimizer of J p ( ) over W 0 is W p. With W Γ =th, it follows that ψ H (p, r) = 0 on [0, t H ]. Next, for any r (t H, r H ], the good rate function J p ( ) is finite on the nonempty set Γ r W 0, hence ψ H(p, r) = α is finite and positive, with the infimum in (1.4) attained at the nonempty compact set F = Γ r W W 0 : J p (W ) α. (3.5) Fixing such r and W r F, consider the map W r (λ) := λw r + (1 λ)w from [0, 1] to W 0. Thanks to the continuity of λ t H(W r (λ)) on [0, 1], there exists for any r [t H, t H (W r )) some λ = λ (r ) [0, 1) such that t H (W r (λ )) = r. Hence, due to the convexity of J p ( ), ψ H (p, r ) J p (W r (λ )) λ J p (W r )) = λ α < α := ψ H (p, r). We have shown that ψ H (p, r ) < ψ H (p, r) for all r [t H, t H (W r )). Recalling that t H (W r ) r, it follows that ψ H (p, ) is strictly increasing on [t H, r H ] and further, that necessarily t H (W r ) = r for any W r F. That is, the collection F of minimizers of (1.4) then consists of only the minimizers of (1.5). Next, if ψ H (p, r ) α < for all r < r then there exist a pre-compact collection W r, r < r in (δ, W 0 ), with J p (W r ) α and t H (W r ) r. By the continuity of t H ( ) and the lsc of J p ( ), it follows that t H (W r ) r and J p (W r ) α for any limit point W r of W r as r r. Consequently ψ H (p, r) α as well, establishing the stated left-continuity of ψ H (p, ) on [0, r H ]. Finally, recall that an increasing function, finite on [0, r H ] and infinite otherwise, is lsc iff it is left continuous on [0, r H ]. (b). Considering the ldp bounds of Theorem 1.1 for the closed set Γ r and its open subset Γ >r := Γ r \ Γ =r we deduce that lim ψ H(p, r ) = inf J p (W ) lim inf r r W Γ >r n 2 log P (t H (G n ) > r) lim sup n 2 log P (t H (G n ) r) ψ H (p, r). By the assumed right-continuity of t ψ H (p, t) at r [0, r H ), the preceding inequalities must all hold with equality, resulting with (1.7). (c). Proceeding to prove (1.8), we fix (p, r) as in part (b). Further fixing ε > 0, let B W,ε := W W 0 : δ (W, W ) < ε

LARGE DEVIATIONS IN UNIFORM RANDOM GRAPHS 9 denote open cut-metric balls and consider the closed subset of Γ r, Γ r,ε := Γ r (B W,ε) c. (3.6) W F In view of (1.7) and the fact that it suffices for (1.8) to show that δ (G n, F ) ε, t H (G n ) r = W Gn Γ r,ε, lim sup n 2 log P (W Gn Γ r,ε ) < α. By the ldp upper-bound of Theorem 1.1, this in turn follows upon showing that inf J p (W ) α (3.7) W Γ r,ε contradicts the definition of F. Indeed, J p ( ) has compact level sets, so if (3.7) holds then J p (W r ) α for some W r Γ r,ε. Recall (3.5) that in particular W r F, hence (3.6) implies that δ (W r, W r ) ε > 0, yielding the desired contradiction. 3.3. Sparse uniform random graphs. In this section we show that, as was the case in G(n, p), the analog of (1.7), giving the asymptotic rate function for G(n, m) in the sparse regime m n = n 2 / log c n for a suitably small c > 0, can be derived in a straightforward manner from the weak regularity lemma. Indeed, the proof below follows essentially the same short argument used for G(n, p) in [13, Prop. 5.1]. Definition 3.1 (Discrete variational problem for upper tails). Let H be a graph with κ edges, and let b > 1. Denote the set of weighted undirected graphs on n vertices by G n = (a ij ) 1 i j n : 0 a ij 1, a ij = a ji, a ii = 0 for all i, j, and extend the definition of the graphon WĜ in (1. to a weighted graph Ĝ G n by replacing the weight 1 corresponding to an edge ( nx, ny ) by the weight a nx, ny. Taking m n ( n 2 ) and pn = m n / ( n, the variational problem for Gn G(n, m n ) is I pn (WĜ) : Ĝ G n, t H (WĜ) b p κ n, a ij = p n. ij ψ H (n, m n, b) := inf Remark. When p n p for some fixed 0 < p < 1, and r [p κ, r H ] is a right-continuity point of t ψ H (p, t) (whence (1.7) holds), one has ψ H (p, r) = lim ψh (n, m n, rp κ ) (e.g., rescale a sequence Ĝn of minimizers for ψ H (n, m n, rp κ + ε) by p/p n ; conversely, for a minimizer W for ψ H (p, r), one can take a sequence G n with W Gn W ). Proposition 3.2. Fix H be a graph with κ edges, fix b > 1 and for m n N let G n G(n, m) and p n = m n / ( n. For every ε > 0 there exists some K < such that, if p n (log n) 1/(2κ) K and n is sufficiently large then ψ H (n, m n, b) ε 1 n 2 log P(t H(G n ) b p κ n) ψ H (n, m n, b ε) + ε.

10 AMIR DEMBO AND EYAL LUBETZKY In particular, if m n N is such that p n (log n) 1/(2κ) and lim ψh (n, m n, t) exists and is continuous in some neighborhood of t = b, then lim 1 n 2 log P (t H(G n ) b p κ n) = lim ψ H (n, m n, b). The following simple lemma, whose analog for upper tails in G(n, p) (addressing only the event E 1 below) was phrased in [13, Lemma 5.2] for triangle counts in G(n, p), is an immediate consequence of the independence of distinct edges and Cramér s Theorem. Lemma 3.3. Fix ε > 0 and suppose n is sufficiently large. Let V 1,..., V s be a partition of 1,..., n, let Ĝ = (a ij) G s be such that a ij = p = m/ ( n, and define E 1 (G) = i,j a ij >p where d G (X, Y ) = # d G (V i, V j ) a ij, E 2 (G) = i,j a ij <p (x,y) X Y :xy E(G) X Y. Then G n G(n, m) has d G (V i, V j ) a ij, I p (WĜ) ε 1 n 2 log P (E 1(G) E 2 (G)) I p (WĜ) + ε. (3.8) Proof. Let G n G(n, p), and recall that d G n (V i, V j ) V i V j Bin( V i V j, p) and d G n (V i, V i ) ( V i ) ( 2 Bin( Vi ) 2, p), with these variables being mutually independent, thus 1 n 2 log P ( E 1 (G n) E 2 (G n) ) 1 n 2 V i V j I p (a ij ) 1 ( Vi ) n 2 2 Ip (a ii ) i<j = I p (WĜ) + O(n 2 ). Next, since P(G n ) = P(G n E(G n) = m), it follows that log P (E 1 (G n ) E 2 (G n )) log P ( E 1 (G n) E 2 (G n) ) log P(Bin( ( n, p) = m). For N = ( n, by definition p = m/n and so P(Bin(N, p) = m) 1/ 2πp(1 p)n provided that N is large enough, and the result follows. Combining the weak regularity lemma (see, e.g., [10, Lemma 9.3]) with the counting lemma for graphons (cf., e.g., [10, Lemma 10.23]) implies the following. Lemma 3.4. Let ε > 0 and set M = 4 1/ε2. For every graph G there is a partition V 1,..., V s of its vertices, for some s M, such that the weighted graph Ĝ G s in which a ij = d G (V i, V j ) satisfies that, for every graph H with κ edges, t H (G) t H (Ĝ) κε. Proof of Proposition 3.2. By Lemma 3.4, if G n has t H (G n ) bp κ n and E(G n ) = m then there exists a partition V 1,..., V s of its vertices, for some s M, such that the corresponding weighted graph Ĝ satisfies t H(WĜ) bp κ n κε and WĜ = p n (note that the edge density is invariant under the partition). We may round each of the densities a ij of Ĝ up to a multiple of ε (only increasing t H), with the effect of potentially i

LARGE DEVIATIONS IN UNIFORM RANDOM GRAPHS 11 increasing the edge density to at most p n + ε. By rescaling we then arrive at Ĝ such that W G 1 = p n and t H (WĜ ) bpκ n κε (1 + ε) κ bpκ n ε provided that ε/p κ n is small enough, which will indeed be the case by our assumption on p n. Applying Lemma 3.3, along with a union bound on the partition (at most M n possibilities) and the rounded a ij s (at most (1/ε) M 2 possibilities, the dominant factor), gives the required result, as the hypothesis that p n (log n) 1/2κ is large enough guarantees that this union bound amounts to a multiplicative factor of at most exp(ε n 2 ). Acknowledgment. A.D. was supported in part by NSF grant DMS-1613091 and E.L. was supported in part by NSF grant DMS-1513403. References [1] B. B. Bhattacharya, S. Ganguly, E. Lubetzky, and Y. Zhao. Upper tails and independence polynomials in random graphs. Adv. Math., 319:313 347, 2017. [2] C. Borgs, J. T. Chayes, L. Lovász, V. T. Sós, and K. Vesztergombi. Convergent sequences of dense graphs. I. Subgraph frequencies, metric properties and testing. Adv. Math., 219(6):1801 1851, 2008. [3] S. Chatterjee and A. Dembo. Nonlinear large deviations. Adv. Math., 299:396 450, 2016. [4] S. Chatterjee and S. R. S. Varadhan. The large deviation principle for the Erdős-Rényi random graph. European J. Combin., 32(7):1000 1017, 2011. [5] A. Dembo and O. Zeitouni. Large deviations techniques and applications, volume 38 of Stochastic Modelling and Applied Probability. Springer-Verlag, Berlin, 2010. Corrected reprint of the second (1998) edition. [6] R. Eldan. Gaussian-width gradient complexity, reverse log-sobolev inequalities and nonlinear large deviations. Preprint arxiv:1612.04346. [7] R. Kenyon, C. Radin, K. Ren, and L. Sadun. Multipodal structure and phase transitions in large constrained graphs. J. Stat. Phys., 168(:233 258, 2017. [8] R. Kenyon, C. Radin, K. Ren, and L. Sadun. The phases of large networks with edge and triangle constraints. J. Phys. A, 50(43):435001, 22, 2017. [9] R. Kenyon, C. Radin, K. Ren, and L. Sadun. Bipodal structure in oversaturated random graphs. Int. Math. Res. Not. (IMRN), 2018(4):1009 1044, 2018. [10] L. Lovász. Large networks and graph limits, volume 60 of American Mathematical Society Colloquium Publications. American Mathematical Society, Providence, RI, 2012. [11] L. Lovász and B. Szegedy. Limits of dense graph sequences. J. Combin. Theory Ser. B, 96(6):933 957, 2006. [12] E. Lubetzky and Y. Zhao. On replica symmetry of large deviations in random graphs. Random Structures Algorithms, 47(1):109 146, 2015. [13] E. Lubetzky and Y. Zhao. On the variational problem for upper tails in sparse random graphs. Random Structures Algorithms, 50(3):420 436, 2017. [14] C. Radin and L. Sadun. Phase transitions in a complex network. J. Phys. A, 46(30):305002, 12, 2013. [15] Y. Zhao. On the lower tail variational problem for random graphs. Combin. Probab. Comput., 26(:301 320, 2017.

12 AMIR DEMBO AND EYAL LUBETZKY Amir Dembo Department of Mathematics, Stanford University, Sloan Hall, Stanford, CA 94305, USA. E-mail address: amir@math.stanford.edu Eyal Lubetzky Courant Institute, New York University, 251 Mercer Street, New York, NY 10012, USA. E-mail address: eyal@courant.nyu.edu