arxiv: v2 [math.pr] 22 Oct 2018

Size: px
Start display at page:

Download "arxiv: v2 [math.pr] 22 Oct 2018"

Transcription

1 Breaking of ensemble equivalence for perturbed Erdős-Rényi random graphs arxiv: v [math.pr] Oct 08 F. den Hollander M. Mandjes A. Roccaverde 3 N.J. Starreveld 4 October 3, 08 Abstract In a previous paper we analysed a simple undirected random graph subject to constraints on the total number of edges and the total number of triangles. We considered the dense regime in which the number of edges per vertex is proportional to the number of vertices. We showed that, as soon as the constraints are frustrated, i.e., do not lie on the Erdős-Rényi line, there is breaking of ensemble equivalence, in the sense that the specific relative entropy per edge of the microcanonical ensemble with respect to the canonical ensemble is strictly positive in the limit as the number of vertices tends to infinity. In the present paper we analyse what happens near the Erdős-Rényi line. It turns out that the way in which the specific relative entropy tends to zero depends on whether the total number of triangles is slightly larger or slightly smaller than typical. We investigate what the constrained random graph looks like asymptotically in the microcanonical ensemble. MSC 00: 05C80, 60K35, 8B0. Key words: Erdős-Rényi random graph, Gibbs ensembles, relative entropy, graphon, breaking of ensemble equivalence. Acknowledgement: The research in this paper was supported through NWO Gravitation Grant NETWORKS Introduction In this paper we analyse random graphs that are subject to constraints. Statistical physics prescribes what probability distribution on the set of graphs we should choose when we want to model a given type of constraint []. Two important choices are: () The microcanonical ensemble, where the constraints are hard (i.e., are satisfied by each individual graph). () The canonical ensemble, where the constraints are soft (i.e., hold as ensemble averages, while individual graphs may violate the constraints). Mathematical Institute, Leiden University, P.O. Box 95, 300 RA Leiden, The Netherlands denholla@math.leidenuniv.nl Korteweg de-vries Institute, University of Amsterdam, P.O. Box 9448, 090 GE Amsterdam, The Netherlands m.r.h.mandjes@uva.nl 3 Mathematical Institute, Leiden University, P.O. Box 95, 300 RA Leiden, The Netherlands a.roccaverde@math.leidenuniv.nl 4 Korteweg de-vries Institute, University of Amsterdam, P.O. Box 9448, 090 GE Amsterdam, The Netherlands n.j.starreveld@uva.nl

2 For random graphs that are large but finite, the two ensembles are obviously different and, in fact, represent different empirical situations. Each ensemble represents the unique probability distribution with maximal entropy respecting the constraints. In the limit as the size of the graph diverges, the two ensembles are traditionally assumed to become equivalent as a result of the expected vanishing of the fluctuations of the soft constraints, i.e., the soft constraints are expected to behave asymptotically like hard constraints. This assumption of ensemble equivalence is one of the corner stones of statistical physics, but it does not hold in general (see [3] for more background). In a series of papers the question of possible breaking of ensemble equivalence was investigated for various choices of the constraints, including the degree sequence and the total number of edges, wedges and triangles. Both the sparse regime (where the number of edges per vertex remains bounded) and the dense regime (where the number of edges per vertex is of the order of the number of vertices) have been considered. The effect of community structure on ensemble equivalence has been investigated as well. Relevant references are [3], [4], [5], [30] and [3]. In [5] we considered a random graph subject to constraints on the total number of edges and the total number of triangles, in the dense regime. With the help of large deviation theory for graphons, we derived a variational formula for s = lim n n s n, where n is the number of vertices and s n is the relative entropy of the microcanonical ensemble with respect to the canonical ensemble. We found that s > 0 when the constraints are frustrated. In the present paper we analyse the behaviour of s when the constraints are close to but different from those of the Erdős-Rényi random graph, and we identify what the constrained random graph looks like asymptotically in the microcanonical ensemble. It turns out that the behaviour changes when the total number of triangles is larger, respectively, smaller than that of the Erdős-Rényi random graph with a given total number of edges. While breaking of ensemble equivalence is a relatively new concept in the theory of random graphs, there are many studies on the asymptotic structure of random graphs. In the pioneering work [9], followed by [], the large deviation principle for dense Erdős-Rényi random graphs is proven and the asymptotic structure of constrained Erdős-Rényi random graphs is described as the solution of a variational problem. In the past few years significant progress has been made regarding sparse random graphs as well. We refer the reader to [8], [0], [3] and [36]. Two other random graph models that have been extensively studied are the exponential random graph model and the constrained exponential random graph model. Exponential random graphs, which are related to the canonical ensemble we consider in this paper, have been analysed in [], [3], [7], [7], [9], [33] and [35]. In [3] Constrained exponential random graphs have been analysed in [], [7], [] and [34]. In [7], [8], [4] and [8]. The asymptotic structure of graphs drawn from the microcanonical ensemble is investigated for various values of the constraints on the edge density and the triangle density. In [8] the authors study the behavior of random graphs with edge and triangle densities close to the Erdős-Rényi curve. They manage to identify the scaling behavior close to the curve by proving a bound on the entropy function. We extend their results and find the exact behavior of constrained random graphs close to the Erdős-Rényi curve. The same question has been dealt with in [4] for a constraint on the edge and triangle density close to the lower boundary curve of the admissibility region. In [8] the authors manage to determine, through extensive simulations, regions where phase transitions in the structure of constrained random graphs occur. The remainder of this paper is organised as follows. In Section we define the two ensembles, give the definition of equivalence of ensembles in the dense regime, and recall some basic facts about graphons. In Section.4 we recall the variational representation of s derived in [5] when the constraints are on the total numbers of subgraphs drawn from a finite collection of subgraphs. We also recall the analysis of s in [5] for the special case where the subgraphs are the edges and the triangles. In Section 3 we state our main theorems. Proofs are given in Sections?? and 5. Definitions and preliminaries In Section. we give the formal definition of the two ensembles we are interested in and give our definition of equivalence of ensembles in the dense regime. In Section. we recall some basic facts

3 about graphons, in Section.3 we present some basic properties of the canonical ensemble and in Section.4 we give a variational characterisation of ensemble equivalence proven in [5].. Microcanonical ensemble, canonical ensemble, relative entropy For n N, let G n denote the set of all (n ) simple undirected graphs with n vertices. Any graph G Gn can be represented by a symmetric n n matrix with elements { h G if there is an edge between vertex i and vertex j, (i, j) := (.) 0 otherwise. Let C denote a vector-valued function on G n. We choose a specific vector C, which we assume to be graphical, i.e., realisable by at least one graph in G n. For this C the microcanonical ensemble is the probability distribution P mic on G n with hard constraint C defined as P mic (G) := { /Ω C, if C(G) = C, 0, otherwise, G G n, (.) where Ω C := {G G n : C(G) = C } (.3) is the number of graphs that realise C. The canonical ensemble P can is the unique probability distribution on G n that maximises the entropy S n (P) := G G n P(G) log P(G) (.4) subject to the soft constraint C = C, where This gives the formula [6] with C := G G n C(G) P(G). (.5) P can (G) := θ Z( θ ) eh(, C(G)), G G n, (.6) H( θ, C(G)) := θ C(G), Z( θ ) := G G n e θ C(G), (.7) denoting the Hamiltonian and the partition function, respectively. In (.6) (.7) the parameter θ, which is a real-valued vector whose size is equal to the number of constraints, must be set to the unique value that realises C = C. As a Lagrange multiplier, θ always exists, but uniqueness is non-trivial. In the sequel we will only consider examples where the gradients of the constraints in (.5) are linearly independent vectors. Consequently, the Hessian matrix of the entropy of the canonical ensemble in (.6) is a positive-definite matrix, which implies uniqueness. The relative entropy of P mic with respect to P can is defined as S n (P mic P can ) := P mic (G) log P mic(g) P can (G). (.8) G G n For any G, G G n, P can (G ) = P can (G ) whenever C(G ) = C(G ), i.e., the canonical probability is the same for all graphs with the same value of the constraint. We may therefore rewrite (.8) as S n (P mic P can ) = log P mic(g ) P can (G ), (.9) 3

4 where G is any graph in G n such that C(G ) = C (recall that we assumed that C is realisable by at least one graph in G n ). All the quantities above depend on n. In order not to burden the notation, we exhibit this n-dependence only in the symbols G n and S n (P mic P can ). When we pass to the limit n, we need to specify how C(G), C and θ are chosen to depend on n. We refer the reader to [5] where this issue has been discussed in detail. Definition. In the dense regime, if then P mic and P can are said to be equivalent. s := lim n S n(p mic P can ) = 0, (.0) Remark. In [3], which was concerned with the sparse regime, the relative entropy was divided by n (the number of vertices). In the dense regime, however, it is appropriate to divide by n (the order of the number of edges).. Graphons There is a natural way to embed a simple graph on n vertices in a space of functions called graphons. Let W be the space of functions h: [0, ] [0, ] such that h(x, y) = h(y, x) for all (x, y) [0, ]. A finite simple graph G on n vertices can be represented as a graphon h G W in a natural way as (see Fig. ) { h G if there is an edge between vertex nx and vertex ny, (x, y) := (.) 0 otherwise. 3 4 y h G (x,y) =, on h G (x,y) = 0, else x Figure : An example of a graph G and its graphon representation h G. The space of graphons W is endowed with the cut distance d (h, h ) := sup dx dy [h (x, y) h (x, y)], h, h W. (.) S,T [0,] S T On W there is a natural equivalence relation. Let Σ be the space of measure-preserving bijections σ : [0, ] [0, ]. Then h (x, y) h (x, y) if h (x, y) = h (σx, σy) for some σ Σ. This equivalence relation yields the quotient space ( W, δ ), where δ is the metric defined by δ ( h, h ) := inf σ,σ Σ d (h σ, hσ ), h, h W. (.3) As noted above, we suppress the n-dependence. Thus, by G we denote any simple graph on n vertices, by h G its image in the graphon space W, and by h G its image in the quotient space W. For a more detailed description of the structure of the space ( W, δ ) we refer the reader to [4], [5], []. In the 4

5 sequel we will deal with constraints on the edge and triangle density. In the space W the edge density and the triangle density of a graphon h are defined by T (h) = dx dx h(x, x ), [0,] T (h) = dx dx dx 3 h(x, x )h(x, x 3 )h(x 3, x ). [0,] 3 (.4) For an element h of the quotient space W we define the edge and triangle density by T ( h) = T (h), and T ( h) = T (h), where h is any representative element of the equivalence class h..3 Subgraph counts Label the simple graphs in any order, e.g., F is an edge, F is a wedge, F 3 is triangle, etc. Let C k (G) denote the number of subgraphs F k in G. In the dense regime, C k (G) grows like n V k, where V k = V (F k ) is the number of vertices in F k. For m N, consider the following scaled vector-valued function on G n : ( ) m ( ) m p(fk )C k (G) C(G) := n V = n p(fk )C k (G) k k= n V. (.5) k k= The term p(f k ) counts the edge-preserving permutations of the vertices of F k, i.e., p(f ) = for an edge, p(f ) = for a wedge, p(f 3 ) = 6 for a triangle, etc. The term C k (G)/n V k represents a subgraph density in the graph G. The additional n guarantees that the full vector scales like n, the scaling of the large deviation principle for graphons in the Erdős-Rényi random graph derived in [9]. For a simple graph F k, let hom(f k, G) be the number of homomorphisms from F k to G, and define the homomorphism density as t(f k, G) := hom(f k, G) = p(f k)c k (G), (.6) n V k which does not distinguish between permutations of the vertices. Hence the Hamiltonian becomes H( θ, T m (G)) = n θ k t(f k, G) = n ( θ T (G)), G G n, (.7) k= n V k where T (G) := (t(f k, G)) m k=. (.8) The canonical ensemble with parameter θ thus takes the form [ ] P can (G θ ) := e θ n T (G) ψn( θ ), G G n, (.9) where ψ n replaces the partition function Z( θ): ψ n ( θ) := n log G G n e n ( θ T (G)) = n log Z( θ). (.0) In the sequel we take θ equal to a specific value θ, so as to meet the soft constraint, i.e., T = G G n T (G) Pcan (G) = T. (.) The canonical probability then becomes P can (G) = P can (G θ ). (.) 5

6 Both the constraint T and the Lagrange multiplier θ in general depend on n, i.e., T = T n and θ = θ n. We consider constraints that converge when we pass to the limit n, i.e., Consequently, we expect that lim T n = T. (.3) n lim θ n = θ. (.4) n Throughout the sequel we assume that (.4) holds. If convergence fails, then we may still consider subsequential convergence. The subtleties concerning (.4) are discussed in [5, Appendix A]..4 Variational characterisation of ensemble equivalence The expression in (.7) can be written in terms of graphons as H( θ, T m (G)) = n θ k t(f k, h G ). (.5) With this scaling the hard constraint T has the interpretation of the density of an observable quantity in G, and defines a subspace of the quotient space W, which we denote by W, and which consists of all graphons that meet the hard constraint, i.e., k= W := { h W : T (h) = T }. (.6) The soft constraint in the canonical ensemble becomes T = T (recall (.5)). Recall that for n N we write θ for θ n. In order to characterise the asymptotic behavior of the two ensembles, the entropy function of a Bernoulli random variable is essential. For u [0, ] we define I(u) := u log u + ( u) log( u). (.7) We extend the domain of this function to the graphon space W by defining I(h) := dx dy I(h(x, y)) (.8) [0,] (with the convention that 0 log 0 = 0). On the quotient space ( W, δ ) we define I( h) = I(h), where h is any element of the equivalence class h. In order to keep the notation minimal we use I( ) for both (.7) and (.8). Depending on the argument of the function it will be clear which of the two is considered. The key result in [5] is the following variational formula for s. Theorem.3 [5] Subject to (.), (.3) and (.4), with lim n n S n (P mic P can ) =: s (.9) [ s = sup θ T [ ( h) I( h)] sup θ T ] ( h) I( h). (.30) h W h W Theorem.3 and the compactness of W give us a variational characterisation of ensemble equivalence: s = 0 if and only if at least one of the maximisers of θ T ( h) I( h) in W also lies in W W. Equivalently, s = 0 when at least one the maximisers of θ T ( h) I( h) satisfies the hard constraint. Theorem.3 allows us to identify examples where ensemble equivalence holds (s = 0) or is broken (s > 0). In [5] a detailed analysis was given for the special case where the constraint is on the total number of edges and the total number of triangles. The analysis in [5] relied on the large deviation principle for dense Erdős-Rényi random graphs established in [9]. The function defined in (.7) plays a crucial role and is related to the rate function of the large deviation principle. 6

7 Theorem.4 [5] For the edge-triangle model, s = 0 when T = T 3, 0 < T and T = 0, while s > 0 when T T 3 and T 8, T T 3, 0 < T and 0 < T < 8, (T, T ) lies on the scallopy curve in Fig.. Here, T, T are in fact the limits T,, T, in (.3), but in order to keep the notation light we now also suppress the index. (0,) (,) s = 0 s > 0 s > 0 triangle density T T = T 3 T = T 3 s =? (0, 8 ) (0,0) (,0) edge density T T = T (T ) (,0) Figure : The admissible edge-triangle density region is the region on and between the blue curves [8]. Theorem.4 is illustrated in Fig.. The region on and between the blue curves corresponds to the choices of (T, T ) that are graphical, i.e., there exists a graph with edge density T and triangle density T. The red curves represent ensemble equivalence, the blue curves and the grey region represent breaking of ensemble equivalence, while in the white region between the red curve and the lower blue curve we do not know what happens. Breaking of ensemble equivalence arises from frustration between the values of T and T. The lower blue curve, called the scallopy curve, consist of infinitely many pieces labelled by l N\{}. The l-th piece corresponds to T ( l l, l l+ ] and a T that is a computable but non-trivial function of T (see [5], [7], [8]). The structure of the graphs drawn from the microcanonical ensemble was determined in [5] and [8]: the vertex set can be partitioned into l subsets, the first l subsets have size c l n, the last subset has size between c l n and c l n, where [ ] c l := l+ + l+ l T [ l+, l ). (.3) The graph has the form of a complete l-partite graph, with some additional edges on the last subset 7

8 that create no triangles within that last subset. The optimal graphons have the form, k < l: x < kc l < y or y < kc l < x, gl (x, y) := p l, (l )c l < x < [ + (l )c l] < y or (l )c l < y < [ + (l )c] < x, 0, otherwise, (.3) where p l = 4c l( lc l ) (0, ]. (.33) ( (l )c l ) Figure 3 plots c l and p l as a function of T for l N. An illustration of the graphon limit of such a graph is given below in Figure Figure 3: For l N: c l (left) and p l (right) as a function of T. y (l ) c l g l (x,y) = p l on gl (x,y) = on 4 c l gl (x,y) = 0 else 3 c l c l c l c l c l 3 c l 4 c l (l ) c l x Figure 4: Graphon g l, for l N and T ( l l, l l+ ]. 8

9 3 Theorems In this section we present our results, which address the following two issues: In Theorems we identify the scaling behaviour of s for fixed T and T T 3, respectively, T T 3. It turns out that the way in which s tends to zero differs in the two cases. In Propositions we characterise some possible asymptotic structures of random graphs drawn from the microcanonical ensemble when the hard constraint is on the edge and triangle density. Our results indicate that the structure of the graphs differs for T T 3, respectively, T T 3. In the sequel we make the following two assumptions: Assumption Fix the edge density T (0, ) and consider the triangle density T 3 + ɛ, for some ɛ either positive or negative. For this pair of constraints we consider the Lagrange multipliers θ (ɛ) := (θ(ɛ), θ(ɛ)) as defined in Section. (see also the follow up discussion in Section.3). Then, for ɛ sufficiently small, we have the representation sup h W [ θ (ɛ)t ( h) + θ (ɛ)t ( h) I( h) where θ := θ (0), γ = θ (0) and γ = θ (0). ] = θ T I(T ) + (γ T + γ T 3 )ɛ + O(ɛ ), (3.) In Section 4. we show that Assumption is true when T [, ). For T (0, ) we can prove (3.4) and (3.5) below, but with replacing the equality. If Assumption is true, then we again obtain (3.4) and (3.5) with equality. If it fails, then we have strict inequality. Assumption Fix the edge density T (0, ) and consider the triangle density T 3 + ɛ, for some ɛ either positive or negative. For this pair of constraints we consider the microcanonical entropy J(ɛ) := sup{ I( h) : h W, T ( h) = T, T ( h) = T 3 + ɛ}. (3.) Then for ɛ sufficiently small the solution of (3.), denoted by h ɛ, has the following form h ɛ = T + g ɛ, where g ɛ = g I I + g (I J) (J I) + g J J, (3.3) with g, g, g [ T, T ] and I, J [0, ]. Assumption is based on the following intuitive argument. Suppose we want to maximise the microcanonical entropy among all piecewise constant graphons. Then we expect the entropy to decrease when we add more structure, i.e. more steps, in the graphon. A piecewise constant graphon with m steps corresponds to a random graph where the vertices are divided into m groups, and within each group we make an ER random graph with some probability. We expect that the microcanonical entropy will decrease as m increases. This statement is also supported by extensive numerical experiments performed in [9]. The methodology we rely on in order to analyse the variational problem in (3.) does not always identify the exact optimal graphon. It identifies a candidate optimal graphon, which is sufficient in some cases, for the scaling behaviour of the relative entropy s. We call these graphons balance optimal. Informally speaking, a balance optimal graphon is obtained when solving the optimisation problem in (3.) in a smaller class of graphons than the whole class of graphons that satisfy the hard constraint. This is the class of graphons satisfying the conditions in Assumption such that the values g, g, g all correspond to contributions of the same order. The precise definition of a balance optimal graphon is given in Section 5. We want to investigate in this chapter whether the global maximizer of (3.) lies in this smaller class of graphons. We show that balance optimisers have specific structural properties. 9

10 But, for the case of a perturbation upwards, the unique optimal graphon does not lie in this class, and this happens because λ(i) gets very small as ɛ 0 while g stays constant. We refer the reader to [0]. For the case of a perturbation downwards the exact structure of the unique optimal graphon is still not known: the only results we are aware of come from an extensive numerical study [7]. From this numerical study it seems that, at least for T (0, T ), with T 0.44, the unique global optimiser is indeed a balance optimal graphon. In this chapter we investigate this question further by identifying the balance optimal graphons and comparing them with the results established numerically in [7]. Balance optimal graphons are candidate optimisers of J(ɛ). In what follows, because all the graphons we derive are balance optimal graphons, we simply speak of optimal graphons. When at some point a clear distinction is needed we say so. Another important feature is that balance optimal graphons are in general not unique. In the following sections we construct various balance optimal graphons, showing the different structures that can emerge. The variational problem J(ɛ) in (3.) has been solved in [0] for the case T > T 3, while the case T < T 3 still remains unsolved. In this chapter we consider only a small perturbation around the typical values, but the advantage of our method is that it is simpler and yields more intuition about the way the constraint is attained. Moreover, it also applies for the case ɛ < 0, which has not been rigorously analysed before. In [7] the authors identify the maximizers of the microcanonical entropy numerically. The optimal graphons obtained numerically in [7] agree structurally with the balance optimal graphons that we find. Theorem 3. For T (0, ) and T, Theorem 3. For T (0, ], lim ɛ s (T, T 3 + 3T ɛ) = ɛ 0 6 T T log T (0, ). (3.4) lim sup ɛ 0 ɛ /3 s (T, T 3 T 3 ɛ) 4 T T (0, ). (3.5) Theorem 3.3 For T (, ), lim sup ɛ /3 s (T, T 3 T 3 ɛ) f(t, T ) (0, ), (3.6) ɛ 0 where T ( T, 0) is the unique point where the function x f(t, x), defined by attains its global minimum. f(t, x) := T I(T + x) I(T ) I (T )x x, x ( T, 0), (3.7) We illustrate these results in Figure 5. In the left panel we plot the limits in the right-hand side of (3.5) (3.6) as a function of T. In the right panel we plot s (T, T 3 + ɛ) as a function of ɛ, for ɛ sufficiently small, and for four different values of T. Remark 3.4 We believe, and there is numerical evidence in [7], that the results in (3.5) and (3.6) hold with equality and that the corresponding limits exist. Remark 3.5 From [8, Theorem.] we also have that for some constant c = c(ɛ) > 0. s (T, T 3 T 3 ɛ) cɛ /3, (3.8) In Propositions below we identify the structure of balance optimal graphons corresponding to the perturbed constraints in the microcanonical ensemble in the limit as n. 0

11 * * * * Figure 5: Limit of scaled s as a function of ɛ for ɛ sufficiently small. Proposition 3.6 When the ER-line is approached from above, a balance optimal graphon is given by h = T + ɛ g + O(ɛ) (global perturbation) (3.9) with g given by, (x, y) [0, ], g (x, y) = 0, (x, y) [0, ] (, ] (, ] [0, ],, (x, y) (, ]. (3.0) It is important to mention that the balance optimal graphon determined in Proposition 3.6 is not unique in the sense that there are multiple graphons that yield the same entropy value. From Proposition 3.6 we also see it is possible that the class of balance optimisers does not contain the actual unique optimiser of J(ɛ). For this pair of constraints and from [0] we have that the actual unique optimiser, denoted by h ɛ, is given by h, (x, y) [0, λɛ], h ɛ (x, y) = T + h ɛ, (x, y) [0, λɛ] (λɛ, ] (λɛ, ] [0, λɛ], (3.) T + h ɛ, (x, y) (λɛ, ], where λ := ( T, h ) := h, h := T. (3.) The term h solves the equation I (h ) = 3I ( T ) and is constant as ɛ 0. For details on this issue we refer to [0]. As mentioned above, balance optimal graphons have the structural property that g, g, g all contribute equally to the constraint. This is not the case for the graphon in (3.0), because only g and g contribute to the constraint, to leading order. The exact computations are provided in Section 5. From (3.0) and (3.) we observe that balance optimal graphons can have structures very different from the optimal graphons. Proposition 3.7 When the ER-line is approached from below and T (0, ], a balance optimal graphon is given by h = T + ɛ /3 g + O(ɛ /3 ) (global perturbation) (3.3)

12 with g given by T, (x, y) [0, ], g (x, y) = T, (x, y) [0, ] (, ] (, ] [0, ], T, (x, y) (, ]. This g is not unique, in the sense that there are multiple graphons that are balance optimal. (3.4) Proposition 3.8 When the ER-line is approached from below and T (, ), the unique balance optimal graphon is given by h = T + gɛ (local perturbation) (3.5) with g ɛ defined by gɛ (x, y) := with T ( T, 0) defined in Theorem 3.3. T ɛ T /3, (x, y) [0, T T ɛ/3 ] T ɛ /3, (x, y) [0, T T ɛ/3 ] [ T T ɛ/3, ] or (x, y) [ T T ɛ/3, ] [0, T T ɛ/3 ], T, (x, y) [ T T ɛ/3, ], (3.6) In conclusion, Theorems say that at a fixed density of the edges it is less costly in terms of relative entropy to increase the density of triangles than to decrease it. The ER-line represents a crossover in the cost (see Figure 5, right panel). Above the ER-line the cost is linear in the distance, below the ER-line the cost is proportional to the 3-power of the distance. Propositions show that the optimal perturbation of the ER-graphon is global above the ER-line and below the ER-line when the edge density is less than and local below the ER-line when the edge density is larger than. 4 Proofs of Theorems In this section we prove Theorems Along the way we use the results given in Propositions , which we prove in Section Proof of Theorem 3. For ease of notation we drop the superscript from the constraint on the edge density and write T instead of T. Let T (ɛ) = T, T (ɛ) = T 3 + 3T ɛ. (4.) The factor 3T appearing in front of the ɛ is put in for convenience. We know that for every pair of graphical constraints (T (ɛ), T (ɛ)) there exists a unique pair of Lagrange multipliers (θ (ɛ), θ (ɛ)) corresponding to these constraints. For an elaborate discussion on this issue we refer the reader to [?]. By considering the Taylor expansion of the Lagrange multipliers (θ (ɛ), θ (ɛ)) around ɛ = 0, we obtain where θ (ɛ) = θ + γ ɛ + Γ ɛ + O(ɛ 3 ), θ (ɛ) = γ ɛ + Γ ɛ + O(ɛ 3 ), (4.) θ (0) = θ = I (T ), γ = θ (0), Γ = θ (0), θ (0) = 0, γ = θ (0), Γ = θ (0). (4.3) We denote the two terms in the expression for s in (.30) by I, I, i.e., s = sup h W [ θ T ( h) I( h) ] sup h W [ θ T ( h) I( h) ] = I I, (4.4) and we let s (ɛ) denote the relative entropy corresponding to the perturbed constraints. We distinguish between the cases T [, ) and T (0, ).

13 Case I T [, ): From [?, Section 5], if T [, ) and T [ 8, ), then the corresponding Lagrange multipliers (θ, θ ) are both non-negative. Hence from [7, Theorem 4.] we have that [ ] [ I := sup θ (ɛ)t ( h) + θ (ɛ)t ( h) I( h) = sup θ (ɛ)u + θ (ɛ)u I(u) ], (4.5) h W 0 u and, consequently, I = sup 0<u< [ θ (ɛ)u + θ (ɛ)u 3 I(u) ] = θ (ɛ)u (ɛ) + θ (ɛ)u (ɛ) 3 I(u (ɛ)). (4.6) The optimiser u (ɛ) corresponding to the perturbed multipliers θ (ɛ) and θ (ɛ) is analytic in ɛ, as shown in [9]. Therefore, a Taylor expansion around ɛ = 0 gives where δ = u (0) and = u (0). Hence I can be written as Moreover, u (ɛ) = T + δɛ + ɛ + O(ɛ 3 ), (4.7) I = θ T I(T ) + (γ T + γ T 3 )ɛ + O(ɛ ). (4.8) I = [ θ + γ ɛ + Γ ɛ + O(ɛ 3 ) ] T + [ γ ɛ + Γ ɛ + O(ɛ 3 ) ] (T 3 + 3T ɛ) inf h W ɛ I( h) (4.9) where Consequently, =θ T + γ T ɛ + Γ T ɛ + T 3 γ ɛ + Γ T 3 ɛ + 3T γ ɛ J (ɛ) + O(ɛ 3 ), J (ɛ) := h () ɛ inf I( h), W ɛ := { h W : T ( h) = T, T ( h) = T 3 + 3T ɛ}. (4.0) h W ɛ s (T, T 3 + 3T ɛ) = J (ɛ) I(T ) + O(ɛ ). (4.) Denote by one of the, possibly multiple, balance optimisers of the variational problem J (ɛ). () From Proposition 3.6 we know that, for ɛ sufficiently small, any graphon in the equivalence class h ɛ, denoted by h () ɛ, has the form h () ɛ = T + ɛg + O(ɛ) where the graphon g was defined in (3.0). By considering the Taylor expansion of the function I around ɛ = 0, we get I(h () ɛ ) = I(T ) + I (T ) ɛ dx dy g (x, y) [0,] + I (T ) ɛ dx dy g (x, y) + o(ɛ) [0,] = I(T ) + I (T ) ɛ dx dy g (x, y) + o(ɛ) [0,] = I(T ) + I (T )ɛ + o(ɛ) = I(T ) + T ( T ) ɛ + o(ɛ). But, from (3.), a straightforward computation of the entropy of h ɛ shows that (4.) J 6 (ɛ) = I(T ) + T T log T ɛ + o(ɛ). (4.3) Hence we obtain that the global optimiser is not a balance optimiser and that s (T, T 3 + 3T ɛ) = 6 T T log T ɛ + o(ɛ). (4.4) 3

14 Case II T (0, ): Consider the term I := sup h W [ θ (ɛ)t ( h) + θ (ɛ)t ( h) I( h) ], as above. If Assumption applies, then this case is proved in the same way as Case I. Otherwise, consider the following lower bound [ ] [ θ (ɛ)t ( h) + θ (ɛ)t ( h) I( h) sup θ (ɛ)u + θ (ɛ)u 3 I(u) ]. (4.5) 0 u sup h W The arguments used in Case I after (4.6) apply, and the result in (4.) is obtained with an inequality instead of an equality. 4. Proof of Theorem 3. In this section we omit the computations that are similar to those in the proof of Theorem 3. in Section??. Let T (ɛ) = T, T (ɛ) = T 3 T 3 ɛ. (4.6) The factor T 3 appearing in front of the ɛ is put in for convenience in the computations. The perturbed Lagrange multipliers are where θ (ɛ) = θ + γ ɛ + Γ ɛ + O(ɛ 3 ), θ (ɛ) = γ ɛ + Γ ɛ + O(ɛ 3 ), (4.7) θ = I (T ), γ = θ (0), Γ = θ (0) γ = θ (0), Γ = θ (0). (4.8) We denote the two terms in the expression for s in (.30) by I, I, i.e., s = I I, and let s (ɛ) denote the perturbed relative entropy. The computations for I are similar as before, because the exact form of the constraint does not affect the expansions in (4.7) and (4.8). For I, on the other hand, we have I = θ T + γ T ɛ + Γ T ɛ + T 3 γ ɛ + Γ T 3 ɛ T 3 γ ɛ J (ɛ) = θ T + γ T ɛ + T 3 γ ɛ J (ɛ) + (4.9) O(ɛ ), where Consequently, J (ɛ) := inf I( h), W ɛ := { h W : T ( h) = T, T ( h) = T h W 3 T 3 ɛ}. (4.0) ɛ s (T, T T 3 ɛ) = J (ɛ) I(T ) + O(ɛ ). (4.) Denote by h ɛ one of the, possibly multiple, optimisers of the variational problem J (ɛ). From Proposition 3.7 we know that, for T (0, ], a balance optimal graphon in the equivalence class h ɛ, denoted by h ɛ for simplicity in the notation, has the form h ɛ = T + ɛ /3 g + O(ɛ /3 ) (4.) with g given by Hence which gives T, (x, y) [0, ], g (x, y) = T, (x, y) [0, ] (, ] (, ] [0, ], T, (x, y) (, ]. (4.3) J (ɛ) I(T ) + T I (T )ɛ /3 I(T ) + 4 T ɛ /3, (4.4) T s (T, T T 3 ɛ) 4 T ɛ /3 + o(ɛ /3 ). (4.5) T 4

15 4.3 Proof of Theorem 3.3 The computations leading to the expression for the relative entropy in the right-hand side of (3.6) are similar as those in Section 4., and we omit them. Hence we have where, for T (, ), s (T, T T 3 ɛ) = J (ɛ) I(T ) + O(ɛ ), (4.6) J (ɛ) := inf I( h), W ɛ := { h W : T ( h) = T, T ( h) = T h W 3 T 3 ɛ}. (4.7) ɛ Denote by h ɛ one of the, possibly multiple, optimisers of the variational problem J (ɛ). From Proposition 3.8 we know that, for T (, ), a balance optimal graphon in the equivalence class h ɛ, denoted by h ɛ for simplicity in the notation, has the form h ɛ = T + g ɛ (4.8) with g ɛ given by gɛ (x, y) := T ɛ T /3, (x, y) [0, T T ɛ/3 ] T ɛ /3, (x, y) [0, T T ɛ/3 ] [ T T ɛ/3, ] or (x, y) [ T T ɛ/3, ] [0, T T ɛ/3 ], T, (x, y) [ T T ɛ/3, ]. (4.9) The term T ( T, 0) is defined in Theorem 3.3. Hence we have s (T, T T 3 ɛ) f(t, T )ɛ /3 + o(ɛ /3 ), (4.30) where T ( T, 0) is the unique point where the global minimum of the function x f(t, x), defined by f(t, x) := T I(T + x) I(T ) I (T )x x, x ( T, 0). (4.3) We need to show that, for every T (0, ) and for every x ( T, 0), f(t, x) > 0 or equivalently that I(T + x) I(T ) I (T )x > 0. (4.3) From the mean-value theorem we have that there exists ξ (T +x, T ) such that I (T +x) I(T ) = I (ξ)x. Hence we have that f(t, x) = (I (ξ) I (T ))x > 0, (4.33) which follows because I is an increasing function, x ( T, 0) and ξ (T + x, T ). More detailed arguments are provided in the following section. 5 Proofs of Propositions In this section we prove Propositions In Section 5. we prove Proposition 3.6 and in Section 5. we prove Propositions 3.7 and 3.8. The proof of Proposition 3.8 is similar to the proof of Proposition 3.7, only computations are different. In Section 4 the following variational problems were encountered: () For T (0, ), () For T (0, ], J (ɛ) = inf { I( h): h W, T ( h) = T, T ( h) = T 3 + 3T ɛ }. (5.) J (ɛ) = inf { I( h): h W, T ( h) = T, T ( h) = T 3 T 3 ɛ }. (5.) 5

16 (3) For T (, ), J (ɛ) = inf { I( h): h W, T ( h) = T, T ( h) = T 3 T 3 ɛ }. (5.3) In order to prove Propositions , we need to analyse these three variational problems, for ɛ sufficiently small, which is the objective of this section. The variational formula in (5.) has been rigorously analysed in [0], and hence we study the variational formulas in (5.) and (5.3), under the assumption that the optimiser lies in the class of balance optimal graphons. We remind the reader that we suppose Assumption to be true. We analyse the variational formulas with the help of a perturbation argument. In particular, we show that the balance optimal perturbations are those given in (3.9), (3.3) and (3.5), respectively. The results in Propositions 3.7 and 3.8 follow directly from the following two lemmas. Lemma 5. Let T (0, ]. For ɛ > 0 consider the variational formula for J (ɛ) given in (5.). Then, for ɛ sufficiently small, T J (ɛ) I(T ) + ɛ /3 + o(ɛ /3 ). (5.4) 4 T Lemma 5. Let T (, ). For ɛ > 0 consider the variational formula for J (ɛ) given in (5.3). Then, for ɛ sufficiently small, J (ɛ) I(T ) + f(t, T )ɛ /3 + o(ɛ /3 ), (5.5) where f(t, x), x ( T, 0), and T were defined in Theorem 3.3. Remark 5.3 As argued in Remark 3.4, we believe, and there is numerical evidence in [7], that the results in (5.4) and (5.5) hold with equality. In what follows we use the notation f(ɛ) g(ɛ), for two functions f, g, when f(ɛ) g(ɛ) converges to a positive constant, as ɛ Proof of Proposition 3.6 In this section we prove Proposition 3.6 given that Assumption holds. In order to find the optimal perturbation when the ER-line is approached from above, we need to solve J (ɛ) in (5.). The following construction shows intuitively why balance optimal perturbations have the form given in (3.9). Consider an inhomogeneous ER-random graph on n vertices. We split the vertices of the graph into two parts of equal size, i.e. of size n/. In one part we connect two vertices with probability T + ɛ, in the other part we connect two vertices with probability T ɛ, and we connect vertices lying in different parts with probability T. This graph has expected edge density equal to ( n ) ( T ( n ) + (T + ɛ)( n ) + (T ɛ)( n Similarly, the expexted triangle density is equal to (( n ) ( n (T + 3) ɛ) 3 + n ( n ) ( n T T + )(T ) ɛ) = T 3 n 4 + 3T n ɛ T 3 + 3T ɛ, )) = T. (5.6) for n large. Below when we speak of optimal perturbation we mean balance optimal. In the proof below we will see that the optimal perturbation is indeed given by the graphon counterpart of the 6

17 inhomogeneous ER-random graph described above. We now proceed to the technical details of the proof. We consider the variational formula J (ɛ), with ɛ > 0, given in (5.). We denote by one of the, possibly multiple, optimisers of J (ɛ). For simplicity in the notation, in what follows we work with a representative element, denoted by h ɛ, of the equivalence class h ɛ. We write the optimiser h ɛ in the form h ɛ = T + H ɛ for some bounded symmetric function H ɛ defined on the unit square [0, ] and taking values in R. This term will be called the perturbation term. The optimiser h ɛ has to satisfy the conditions on the edge and triangle densities, i.e., T (h ɛ ) = T, T (h ɛ ) = T 3 + 3T ɛ. (5.7) Hence the perturbation term H ɛ needs to satisfy the constraints (G ): dx dy H ɛ (x, y) = 0 (5.8) [0,] and (G ): 3T [0,] 3 dx dy dz H ɛ (x, y) H ɛ (y, z) + dx dy dz H ɛ (x, y) H ɛ (y, z) H ɛ (z, x) = 3T ɛ. [0,] 3 In what follows we prove the result stated in Proposition 3.6, i.e., the optimal perturbation is a threestep function and is of the order ɛ. In Assumption it is stated that it suffices to restrict to graphons that can be written in the form T + H ɛ (), where H ɛ () is a bounded symmetric function defined on [0, ], taking three non-zero values. In what follows, for simplicity in the computations and without loss of generality, we suppose that the optimal graphon has the form h ɛ (5.9) Then (G ) above becomes H () ɛ = g I I + g (I J) (J I) + g J J. (5.0) and the two integrals in (G ) become λ(i) g + λ(i)( λ(i))g + ( λ(i)) g = 0, (5.) [0,] 3 dx dy dz H ɛ (x, y) H ɛ (y, z) = λ(i) 3 g + λ(i) ( λ(i))g g + λ(i)( λ(i)) g g + λ(i)( λ(i))g + ( λ(i)) g (5.) and [0,] 3 dx dy dz H ɛ (x, y) H ɛ (y, z) = λ(i) 3 g + λ(i) ( λ(i))g g + λ(i)( λ(i)) g g + λ(i)( λ(i))g + ( λ(i)) g, (5.3) and a similar expression can be computed for the second integral in (G ). We now give the formal definition of a balance optimal graphon: Definition 5.4 For T (0, ), a graphon T + h ɛ, ɛ > 0, is called balanced if it has the structure given in (5.0) and the terms λ(i) g, λ(i)( λ(i))g and ( λ(i)) g are all of the same order when ɛ is sufficiently small. 7

18 Definition 5.5 For ɛ > 0 a graphon h ɛ is called balance optimal if it solves the following optimisation problem: J bal (ɛ) := inf{i( h), h W, h is balanced, T ( h) = T, T ( h) = T 3 + 3T ɛ}. (5.4) It is straightforward to observe that, for ɛ > 0, J bal (ɛ) J(ɛ). (5.5) In what follows we essentially determine J bal (ɛ) for ɛ sufficiently small. We distinguish two cases, first g = 0 and then g 0. Case g = 0: The values of g + and g are such so that T + H ɛ () satisfies the conditions in (5.8) and (5.9). We proceed with the condition in (5.9). A standard computation yields dx dy dz H ɛ () (x, y) H ɛ () (y, z) = λ(i) 3 g+ + λ(j) 3 g (5.6) [0,] 3 and [0,] 3 dx dy dz H () ɛ (x, y) H () ɛ (y, z) H () ɛ (z, x) = λ(i) 3 g λ(j) 3 g 3. (5.7) From (5.8) we obtain the first order condition Using the condition in (5.8), we get that (5.9) equals λ(i) g + + λ(j) g = 0. (5.8) g λ(j) 3 3T λ(i) (λ(j) + λ(i)) λ(j) 3 g3 λ(i) 3 (λ(i)3 λ(j) 3 ) = 3T ɛ + o(ɛ). (5.9) There are multiple ways in which the condition in (5.9) can be met. We show that the lowest possible value of the function I is attained when g + ɛ, g ɛ and λ(i), λ(j) are constant. To that end we distinguish the following cases: (I) (II) g λ(j) 3 3T (λ(j) + λ(i)) ɛ, λ(i) g3 which splits into three sub-cases: (Ia) (Ib) (Ic) (d) g + ɛ /, g ɛ /, g + ɛ /+δ/3, g ɛ / δ, g + ɛ / 3δ, g ɛ /+δ, λ(j) 3 λ(i) 3 (λ(i)3 λ(j) 3 ) = o(ɛ), (5.0) λ(j) λ(i). (5.) λ(j) 3 λ(i) ɛδ, δ (0, ). (5.) λ(j) 3 λ(i) ɛ δ, δ (0, 6 ). (5.3) g + ɛ /3, g = ḡ ( T, 0), λ(j) ɛ /3. (5.4) g 3T λ(j) 3 λ(i) (λ(j) + λ(i)) ɛ+δ, g λ(i) ɛ δ, δ > 0. (5.5) A simple calculation shows that, in all five cases above, λ(i) + λ(j) and λ(i) 3 λ(j) 3, and hence we can omit these two factors from the analysis below. In what follows we exclude cases (Ib), (Ic) and (II) one by one by comparing them to graphons of the type given in case (Ia). 8

19 Case (Ib): We show that, for ɛ > 0 sufficiently small, graphons having the structure indicated in (Ia) yield smaller values of the function I than graphons with the structure in (Ib). We consider two graphons, denoted by T + g and T + ĝ, where g is as in Case (Ia) and ĝ is as in Case (Ib). Before giving the technical details of the proof, we present a heuristic argument why I(T + g ) < I(T + ĝ ). In what follows we will denote by B(p) a Bernoulli random variable with parameter p. The function I(x), x [0, ], defined in (??) represents the entropy of a B(x) random variable with parameter x. On the graphon space the function I(h), h W, defined in (??), can be seen as the expectation of the entropy of a Bernoulli random variable with a random parameter (the expectation is with respect to the random parameter), i.e., B(h(X, Y )) with (X, Y ) a uniformly distributed random variable on [0, ]. For h W we have I(h) = dx dy [ I(h(x, y))] = E[ I(h(X, Y ))]. (5.6) [0,] Hence we have the following equivalence I(T + g ) < I(T + ĝ ) E[ I(T + g (X, Y ))] > E[ I(T + ĝ (X, Y ))], (5.7) where (X, Y ) is a uniformly distributed random vector on [0, ]. Instead of working with entropy, it is intuitively simpler to work with the relative entropy with respect to the random variable B( ). The relative entropy is defined by Note that I (x) := x log x + ( x) log x, x [0, ]. (5.8) E[ I(T + g (X, Y ))] > E[ I(T + ĝ (X, Y ))] E[I (T + g (X, Y ))] < E[I (T + ĝ (X, Y ))]. (5.9) We first give an intuitive argument and afterwards prove that E[I (T + g (X, Y ))] < E[I (T + ĝ (X, Y ))]. (5.30) We distinguish between the cases T (0, ] and T (, ). The case T (0, ] follows by using similar arguments as in case T (, ). We treat in detail only the case T (, ). The relative entropy of a random variable with respect to B( ) is zero if and only if that random variable is equal to B( ). So, in order to compare the relative entropies in (5.30), we need to see how close the Bernoulli random variables with random parameters T + g (X, Y ) and T + ĝ (X, Y ) are to B( ). We are considering the case T >. Hence the random variables B(T + g (X, Y )) and B(T + ĝ (X, Y )) will be close to B( ) when the random parameters T + g (X, Y ) and T + ĝ (X, Y ) are close to. This is the case when g (X, Y ) and ĝ (X, Y ) are negative. These events occur with probabilities P(T + g (X, Y ) < T ) = P(g (X, Y ) < 0) = P(g (X, Y ) = g ) = λ(j), (5.3) because of the properties of the graphon in Case (Ia). Similarly, we have that P(T + ĝ (X, Y ) < T ) = P(ĝ (X, Y ) < 0) = P(ĝ (X, Y ) = g ) = λ(ĵ) ɛ 4δ/3, (5.3) for some δ (0, ], because of the properties of the graphon in Case (Ib). Hence we see that the random variable B(T + g (X, Y )) is closer to the random variable B( ) with much higher probability than the random variable B(T + ĝ (X, Y )). We can see this by computing the corresponding expectations, E(g (X, Y ) g (X, Y ) = g ) P(g (X, Y ) = g ) = g P(g (X, Y ) = g ) ɛ /, (5.33) 9

20 while E(ĝ (X, Y ) ĝ (X, Y ) = ĝ ) P(ĝ (X, Y ) = ĝ ) = ĝ P(ĝ (X, Y ) = ĝ ) ɛ / δ ɛ 4δ/3 = ɛ /+δ/3. In what follows we complete this argument by adding the technical details. We work out the expressions in the left-hand and right-hand sides of (5.30). The expression in the right-hand side of (5.30) can be written as E[I (T + g (X, Y ))] = LI (T + g + ) + KI (T + g ) + ( L K)I (T ), (5.34) for some constants L := P(g (X, Y ) = g + ) and K = P(g (X, Y ) = g ) independent of ɛ. Similarly, E[I (T + ĝ (X, Y ))]=λ(î) I (T + ĝ + ) + ɛ 4δ/3 I (T + ĝ ) + ( λ(î) ɛ 4δ/3 )I (T ), (5.35) where λ(î) = P(ĝ (X, Y ) = ĝ + ) and P(ĝ (X, Y ) = ĝ ) ɛ 4δ/3. Moreover, we recall that from the properties of the graphons in Case (Ia) and Case (Ib) we get g + ɛ, g ɛ, ĝ + ɛ /+δ/3, ĝ ɛ / δ, δ (0, ]. (5.36) Hence, for T (, ] and ɛ sufficiently small, because of (5.36), we obtain the following inequalities: I (T + g + ) > I (T + ĝ + ) > I (T + g ) > I (T + ĝ ). (5.37) Using a Taylor expansion of the function I around T and the first order conditions Lg + + Kg = 0 and λ(î) ĝ + + λ(ĵ) ĝ = 0, (5.38) we observe that (5.34) and (5.35) read E[I (T + g (X, Y ))] = I (T ) + I (T )(Lg+ + Kg ) + o ( g+ + g ) and (5.39) E[I (T + ĝ (X, Y ))] = I (T ) + I (T )(λ(î )ĝ+ + λ(ĵ) ĝ ) ( ) + o λ(î )ĝ+ + λ(ĵ) ĝ. (5.40) Using (5.36), we observe that Lg + + Kg ɛ and Hence, for ɛ sufficiently small, [ ] E I (T + g (X, Y )) which proves (5.30). λ(î )ĝ + + λ(ĵ) ĝ ɛ +δ/3 + ɛ 4/3δ ɛ δ ɛ δ/3. (5.4) [ ] < E I (T + ĝ (X, Y )), (5.4) Similar arguments can be used for the case T (0, ) to show that graphons having the structure as in Case (Ic), yield larger values of I for ɛ sufficiently small. We omit the details. Case (d): In this case we have that the optimal graphon is constant on a subset of the unit square with a size tending to zero as ɛ 0. Such a graphon yields I(T + g ) = λ(i) I(T + g + ) + ( λ(i))( λ(j))i(t ) + λ(j) I(T + g ) = λ(i) (I(T ) + I (T )g + + o(ɛ /3 )) + ( λ(i))( λ(j))i(t ) + λ(j) I(T + g ) = I(T ) λ(j) I(T ) λ(j) ḡi (T ) + λ(j) I(T + ḡ) = I(T ) + ɛ /3 (I(T + ḡ) ḡi (T ) I(T )) + o(ɛ /3 ). (5.43) 0

21 The second equality follows by considering a Taylor expansion around ɛ = 0 in the terms that go to zero as ɛ 0, i.e, g +. In the third equality we use (5.8). What remains is to show that I(T + ḡ) I(T ) ḡi (T ) > 0, (5.44) for ḡ ( T, 0). From the mean-value theorem we have that I(T + ḡ) I(T ) = I (ξ)ḡ for some ξ (T + ḡ, T ). Since ḡ < 0 and I is a convex function, i.e. I is an increasing function, we have that I (ξ) < I (T ). This proves the claim above. From (5.43) we observe that graphons having the form as in Case (d) yield larger values of I, for ɛ sufficiently small, than graphons as in Case (a). Case (II): This case is simpler to exclude than the ones above. Indeed, suppose that (5.5) holds. Then either λ(i) should become small or g should become large. But g ɛ δ is not possible because g should stay bounded in ( T, 0) as ɛ 0. Hence the only possibility is λ(i) ɛ η and g ɛ ζ for some η, ζ such that ζ η = δ, because of the second condition in (5.5). From the first condition in (5.5) we have that ζ η = + δ. Solving these two equations we obtain that η = 3 + δ and ζ = 3 + δ. From (5.8) we then get that g + ɛ δ, which is not possible because g + should stay bounded in (0, T ) as ɛ 0. At this point we summarise our findings. We considered the variational formula J (ɛ) given in (5.) and we assumed that we can restrict ourselves to piece-wise constant graphons (see Assumption ) subject to the constraints in (5.8) and (5.9). Afterwards, without loss of generality, we restricted ourselves to an even smaller class of graphons, those of the form g = g + I I + g J J (5.45) for some g + > 0, g < 0 and I, J [0, ] with λ(i) +λ(j). At the end of this section we elaborate on the case g 0. More specifically, we have shown that the optimal perturbation satisfies g + ɛ /, g ɛ / and λ(i), λ(j). Hence the solution to J (ɛ) has the form T + g ɛ + o(ɛ), where g = g + L L + g K K, for some g + > 0, g < 0, L, K (0, ) independent of ɛ, is a symmetric function defined on [0, ]. From the constraints (5.8) and (5.9) we have that g + L = g K and L 3 g + + K 3 g =. A simple calculation shows that I(T + g ɛ ) = I(T ) + I (T )(L g + + K g ) ɛ + I (T )(L g + + K g )ɛ + o(ɛ) = I(T ) + I (T )(L g + + K g )ɛ + o(ɛ). Hence, in order to find the optimal graphon we need to solve the following optimisation problem: min ( L g+ + K g ) (5.46) This is equivalent to such that L + K, g + L + g K = 0, L 3 g + + K 3 g =. ( min K + L ) K + L such that L + K. (5.47) From a standard computation we find that the optimal K, L should satisfy K + L =. Hence we need to minimize L+L L( L). This function is convex in L (0, ) and attains a unique minimum at the point L =. Having computed L, K we find g + = g =, and so the optimal solution to J (ɛ), for ɛ sufficiently small, is the graphon T + ɛ, if (x, y) [0, ], h ɛ (x, y) = T, if (x, y) [0, ] (, ] or (, ] [0, ], (5.48) T ɛ, if (x, y) (, ]. A standard computation shows that T (h ɛ ) = T and T (h ɛ ) = T 3 + 3T ɛ.

22 Case g 0: By following similar arguments as for the case g = 0, we can show that the optimal values of g, g, g, K and L can be retrieved by solving the following optimisation problem: min ( L g + K g + LKg ) such that L + K =, L g + K g + LKg = 0, L 3 g + K 3 g + L Kg g + LK g g + LKg =. (5.49) Suppose first that L = K =. Then we have the following optimisation problem ( min 4 g + g + g) such that g + g + g = 0, g + g + g g + g g + g = 8. Introducing Lagrange multipliers, we obtain the solution g = 0 and g = g =. For arbitrary L, K, substituting g = ( L L g + L ) L g into (5.49), and differentiating the Lagrangian with respect to g, we obtain g = 0. We observe at this point that this argument holds only for the case where g, g and g go to zero as ɛ 0. This is not the case for the actual optimal graphon in (3.). Case g 0 and g = 0: From (5.44) we observe that g = 0 yields an equality. Hence in this case the microcanonical entropy will be of the order ɛ instead of ɛ /3. From the first-order constraint in (5.8) we obtain g = λ ( λ) g, (5.50) where λ := λ(i). Then the second order constraint reads g λ( λ) = ɛ. (5.5) 4 ( λ) λ Following similar arguments as before, we can show that the case g ɛ δ, λ ɛ /3 δ/3, g ɛ /3+δ/ is not optimal. The case g or g are constant, independently of ɛ, is also not optimal, since if one of them is constant then the entropy cost will be ɛ /3 instead of ɛ. A standard computation yields I(T + g ) = I(T ) + ( I (T ) + 4 λ ) ɛ + o(ɛ), (5.5) λ while for the graphon defined in (5.48) we have I(h ɛ ) = I(T ) + I (T )ɛ + o(ɛ). (5.53) Hence we see that I(T + g ) > I(T + h ɛ ) if and only if λ is constant and independent of ɛ. If λ ɛ δ, then further analysis is needed in order to establish the optimal graphon. In any case, the graphon h ɛ is balance optimal, as desired.

The large deviation principle for the Erdős-Rényi random graph

The large deviation principle for the Erdős-Rényi random graph The large deviation principle for the Erdős-Rényi random graph (Courant Institute, NYU) joint work with S. R. S. Varadhan Main objective: how to count graphs with a given property Only consider finite

More information

EMERGENT STRUCTURES IN LARGE NETWORKS

EMERGENT STRUCTURES IN LARGE NETWORKS Applied Probability Trust (29 November 2012) EMERGENT STRUCTURES IN LARGE NETWORKS DAVID ARISTOFF, University of Minnesota CHARLES RADIN, The University of Texas at Austin Abstract We consider a large

More information

A LARGE DEVIATION PRINCIPLE FOR THE ERDŐS RÉNYI UNIFORM RANDOM GRAPH

A LARGE DEVIATION PRINCIPLE FOR THE ERDŐS RÉNYI UNIFORM RANDOM GRAPH A LARGE DEVIATION PRINCIPLE FOR THE ERDŐS RÉNYI UNIFORM RANDOM GRAPH AMIR DEMBO AND EYAL LUBETZKY Abstract. Starting with the large deviation principle (ldp) for the Erdős Rényi binomial random graph G(n,

More information

MATH 426, TOPOLOGY. p 1.

MATH 426, TOPOLOGY. p 1. MATH 426, TOPOLOGY THE p-norms In this document we assume an extended real line, where is an element greater than all real numbers; the interval notation [1, ] will be used to mean [1, ) { }. 1. THE p

More information

Mini-Course on Limits and Sequences. Peter Kwadwo Asante. B.S., Kwame Nkrumah University of Science and Technology, Ghana, 2014 A REPORT

Mini-Course on Limits and Sequences. Peter Kwadwo Asante. B.S., Kwame Nkrumah University of Science and Technology, Ghana, 2014 A REPORT Mini-Course on Limits and Sequences by Peter Kwadwo Asante B.S., Kwame Nkrumah University of Science and Technology, Ghana, 204 A REPORT submitted in partial fulfillment of the requirements for the degree

More information

Course 212: Academic Year Section 1: Metric Spaces

Course 212: Academic Year Section 1: Metric Spaces Course 212: Academic Year 1991-2 Section 1: Metric Spaces D. R. Wilkins Contents 1 Metric Spaces 3 1.1 Distance Functions and Metric Spaces............. 3 1.2 Convergence and Continuity in Metric Spaces.........

More information

Ground states for exponential random graphs

Ground states for exponential random graphs Ground states for exponential random graphs Mei Yin Department of Mathematics, University of Denver March 5, 2018 We propose a perturbative method to estimate the normalization constant in exponential

More information

Constrained Optimization and Lagrangian Duality

Constrained Optimization and Lagrangian Duality CIS 520: Machine Learning Oct 02, 2017 Constrained Optimization and Lagrangian Duality Lecturer: Shivani Agarwal Disclaimer: These notes are designed to be a supplement to the lecture. They may or may

More information

Partial cubes: structures, characterizations, and constructions

Partial cubes: structures, characterizations, and constructions Partial cubes: structures, characterizations, and constructions Sergei Ovchinnikov San Francisco State University, Mathematics Department, 1600 Holloway Ave., San Francisco, CA 94132 Abstract Partial cubes

More information

1 The Observability Canonical Form

1 The Observability Canonical Form NONLINEAR OBSERVERS AND SEPARATION PRINCIPLE 1 The Observability Canonical Form In this Chapter we discuss the design of observers for nonlinear systems modelled by equations of the form ẋ = f(x, u) (1)

More information

The Least Degraded and the Least Upgraded Channel with respect to a Channel Family

The Least Degraded and the Least Upgraded Channel with respect to a Channel Family The Least Degraded and the Least Upgraded Channel with respect to a Channel Family Wei Liu, S. Hamed Hassani, and Rüdiger Urbanke School of Computer and Communication Sciences EPFL, Switzerland Email:

More information

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi Real Analysis Math 3AH Rudin, Chapter # Dominique Abdi.. If r is rational (r 0) and x is irrational, prove that r + x and rx are irrational. Solution. Assume the contrary, that r+x and rx are rational.

More information

Hard-Core Model on Random Graphs

Hard-Core Model on Random Graphs Hard-Core Model on Random Graphs Antar Bandyopadhyay Theoretical Statistics and Mathematics Unit Seminar Theoretical Statistics and Mathematics Unit Indian Statistical Institute, New Delhi Centre New Delhi,

More information

arxiv: v2 [math.pr] 4 Jan 2018

arxiv: v2 [math.pr] 4 Jan 2018 Ground states for exponential random graphs Rajinder Mavi 1, a 2, b and Mei Yin 1 Department of Mathematics Michigan State University 619 Red Cedar Road arxiv:1706.09084v2 [math.pr] 4 Jan 2018 C212 Wells

More information

Continued fractions for complex numbers and values of binary quadratic forms

Continued fractions for complex numbers and values of binary quadratic forms arxiv:110.3754v1 [math.nt] 18 Feb 011 Continued fractions for complex numbers and values of binary quadratic forms S.G. Dani and Arnaldo Nogueira February 1, 011 Abstract We describe various properties

More information

A mathematical model for a copolymer in an emulsion

A mathematical model for a copolymer in an emulsion J Math Chem (2010) 48:83 94 DOI 10.1007/s10910-009-9564-y ORIGINAL PAPER A mathematical model for a copolymer in an emulsion F. den Hollander N. Pétrélis Received: 3 June 2007 / Accepted: 22 April 2009

More information

Math 118B Solutions. Charles Martin. March 6, d i (x i, y i ) + d i (y i, z i ) = d(x, y) + d(y, z). i=1

Math 118B Solutions. Charles Martin. March 6, d i (x i, y i ) + d i (y i, z i ) = d(x, y) + d(y, z). i=1 Math 8B Solutions Charles Martin March 6, Homework Problems. Let (X i, d i ), i n, be finitely many metric spaces. Construct a metric on the product space X = X X n. Proof. Denote points in X as x = (x,

More information

Analysis II - few selective results

Analysis II - few selective results Analysis II - few selective results Michael Ruzhansky December 15, 2008 1 Analysis on the real line 1.1 Chapter: Functions continuous on a closed interval 1.1.1 Intermediate Value Theorem (IVT) Theorem

More information

Math 4317 : Real Analysis I Mid-Term Exam 1 25 September 2012

Math 4317 : Real Analysis I Mid-Term Exam 1 25 September 2012 Instructions: Answer all of the problems. Math 4317 : Real Analysis I Mid-Term Exam 1 25 September 2012 Definitions (2 points each) 1. State the definition of a metric space. A metric space (X, d) is set

More information

Empirical Processes: General Weak Convergence Theory

Empirical Processes: General Weak Convergence Theory Empirical Processes: General Weak Convergence Theory Moulinath Banerjee May 18, 2010 1 Extended Weak Convergence The lack of measurability of the empirical process with respect to the sigma-field generated

More information

Topological properties

Topological properties CHAPTER 4 Topological properties 1. Connectedness Definitions and examples Basic properties Connected components Connected versus path connected, again 2. Compactness Definition and first examples Topological

More information

The L 3 (4) near octagon

The L 3 (4) near octagon The L 3 (4) near octagon A. Bishnoi and B. De Bruyn October 8, 206 Abstract In recent work we constructed two new near octagons, one related to the finite simple group G 2 (4) and another one as a sub-near-octagon

More information

Graph Limits: Some Open Problems

Graph Limits: Some Open Problems Graph Limits: Some Open Problems 1 Introduction Here are some questions from the open problems session that was held during the AIM Workshop Graph and Hypergraph Limits, Palo Alto, August 15-19, 2011.

More information

1 Lyapunov theory of stability

1 Lyapunov theory of stability M.Kawski, APM 581 Diff Equns Intro to Lyapunov theory. November 15, 29 1 1 Lyapunov theory of stability Introduction. Lyapunov s second (or direct) method provides tools for studying (asymptotic) stability

More information

Lecture Notes in Advanced Calculus 1 (80315) Raz Kupferman Institute of Mathematics The Hebrew University

Lecture Notes in Advanced Calculus 1 (80315) Raz Kupferman Institute of Mathematics The Hebrew University Lecture Notes in Advanced Calculus 1 (80315) Raz Kupferman Institute of Mathematics The Hebrew University February 7, 2007 2 Contents 1 Metric Spaces 1 1.1 Basic definitions...........................

More information

1 Directional Derivatives and Differentiability

1 Directional Derivatives and Differentiability Wednesday, January 18, 2012 1 Directional Derivatives and Differentiability Let E R N, let f : E R and let x 0 E. Given a direction v R N, let L be the line through x 0 in the direction v, that is, L :=

More information

Trace Class Operators and Lidskii s Theorem

Trace Class Operators and Lidskii s Theorem Trace Class Operators and Lidskii s Theorem Tom Phelan Semester 2 2009 1 Introduction The purpose of this paper is to provide the reader with a self-contained derivation of the celebrated Lidskii Trace

More information

Automorphism groups of wreath product digraphs

Automorphism groups of wreath product digraphs Automorphism groups of wreath product digraphs Edward Dobson Department of Mathematics and Statistics Mississippi State University PO Drawer MA Mississippi State, MS 39762 USA dobson@math.msstate.edu Joy

More information

Stein s Method for concentration inequalities

Stein s Method for concentration inequalities Number of Triangles in Erdős-Rényi random graph, UC Berkeley joint work with Sourav Chatterjee, UC Berkeley Cornell Probability Summer School July 7, 2009 Number of triangles in Erdős-Rényi random graph

More information

Second Order Optimality Conditions for Constrained Nonlinear Programming

Second Order Optimality Conditions for Constrained Nonlinear Programming Second Order Optimality Conditions for Constrained Nonlinear Programming Lecture 10, Continuous Optimisation Oxford University Computing Laboratory, HT 2006 Notes by Dr Raphael Hauser (hauser@comlab.ox.ac.uk)

More information

Graph Theory. Thomas Bloom. February 6, 2015

Graph Theory. Thomas Bloom. February 6, 2015 Graph Theory Thomas Bloom February 6, 2015 1 Lecture 1 Introduction A graph (for the purposes of these lectures) is a finite set of vertices, some of which are connected by a single edge. Most importantly,

More information

Metric spaces and metrizability

Metric spaces and metrizability 1 Motivation Metric spaces and metrizability By this point in the course, this section should not need much in the way of motivation. From the very beginning, we have talked about R n usual and how relatively

More information

September Math Course: First Order Derivative

September Math Course: First Order Derivative September Math Course: First Order Derivative Arina Nikandrova Functions Function y = f (x), where x is either be a scalar or a vector of several variables (x,..., x n ), can be thought of as a rule which

More information

Maths 212: Homework Solutions

Maths 212: Homework Solutions Maths 212: Homework Solutions 1. The definition of A ensures that x π for all x A, so π is an upper bound of A. To show it is the least upper bound, suppose x < π and consider two cases. If x < 1, then

More information

arxiv: v2 [math.ap] 28 Nov 2016

arxiv: v2 [math.ap] 28 Nov 2016 ONE-DIMENSIONAL SAIONARY MEAN-FIELD GAMES WIH LOCAL COUPLING DIOGO A. GOMES, LEVON NURBEKYAN, AND MARIANA PRAZERES arxiv:1611.8161v [math.ap] 8 Nov 16 Abstract. A standard assumption in mean-field game

More information

Topological properties of Z p and Q p and Euclidean models

Topological properties of Z p and Q p and Euclidean models Topological properties of Z p and Q p and Euclidean models Samuel Trautwein, Esther Röder, Giorgio Barozzi November 3, 20 Topology of Q p vs Topology of R Both R and Q p are normed fields and complete

More information

Exercise Solutions to Functional Analysis

Exercise Solutions to Functional Analysis Exercise Solutions to Functional Analysis Note: References refer to M. Schechter, Principles of Functional Analysis Exersize that. Let φ,..., φ n be an orthonormal set in a Hilbert space H. Show n f n

More information

Probability and Measure

Probability and Measure Part II Year 2018 2017 2016 2015 2014 2013 2012 2011 2010 2009 2008 2007 2006 2005 2018 84 Paper 4, Section II 26J Let (X, A) be a measurable space. Let T : X X be a measurable map, and µ a probability

More information

Szemerédi s regularity lemma revisited. Lewis Memorial Lecture March 14, Terence Tao (UCLA)

Szemerédi s regularity lemma revisited. Lewis Memorial Lecture March 14, Terence Tao (UCLA) Szemerédi s regularity lemma revisited Lewis Memorial Lecture March 14, 2008 Terence Tao (UCLA) 1 Finding models of large dense graphs Suppose we are given a large dense graph G = (V, E), where V is a

More information

Lecture 7: Semidefinite programming

Lecture 7: Semidefinite programming CS 766/QIC 820 Theory of Quantum Information (Fall 2011) Lecture 7: Semidefinite programming This lecture is on semidefinite programming, which is a powerful technique from both an analytic and computational

More information

The number of edge colorings with no monochromatic cliques

The number of edge colorings with no monochromatic cliques The number of edge colorings with no monochromatic cliques Noga Alon József Balogh Peter Keevash Benny Sudaov Abstract Let F n, r, ) denote the maximum possible number of distinct edge-colorings of a simple

More information

Convex Optimization Notes

Convex Optimization Notes Convex Optimization Notes Jonathan Siegel January 2017 1 Convex Analysis This section is devoted to the study of convex functions f : B R {+ } and convex sets U B, for B a Banach space. The case of B =

More information

Analysis Finite and Infinite Sets The Real Numbers The Cantor Set

Analysis Finite and Infinite Sets The Real Numbers The Cantor Set Analysis Finite and Infinite Sets Definition. An initial segment is {n N n n 0 }. Definition. A finite set can be put into one-to-one correspondence with an initial segment. The empty set is also considered

More information

THE STONE-WEIERSTRASS THEOREM AND ITS APPLICATIONS TO L 2 SPACES

THE STONE-WEIERSTRASS THEOREM AND ITS APPLICATIONS TO L 2 SPACES THE STONE-WEIERSTRASS THEOREM AND ITS APPLICATIONS TO L 2 SPACES PHILIP GADDY Abstract. Throughout the course of this paper, we will first prove the Stone- Weierstrass Theroem, after providing some initial

More information

Cauchy s Theorem (rigorous) In this lecture, we will study a rigorous proof of Cauchy s Theorem. We start by considering the case of a triangle.

Cauchy s Theorem (rigorous) In this lecture, we will study a rigorous proof of Cauchy s Theorem. We start by considering the case of a triangle. Cauchy s Theorem (rigorous) In this lecture, we will study a rigorous proof of Cauchy s Theorem. We start by considering the case of a triangle. Given a certain complex-valued analytic function f(z), for

More information

Asymptotically optimal induced universal graphs

Asymptotically optimal induced universal graphs Asymptotically optimal induced universal graphs Noga Alon Abstract We prove that the minimum number of vertices of a graph that contains every graph on vertices as an induced subgraph is (1+o(1))2 ( 1)/2.

More information

Chapter 4: Asymptotic Properties of the MLE

Chapter 4: Asymptotic Properties of the MLE Chapter 4: Asymptotic Properties of the MLE Daniel O. Scharfstein 09/19/13 1 / 1 Maximum Likelihood Maximum likelihood is the most powerful tool for estimation. In this part of the course, we will consider

More information

p-adic Analysis Compared to Real Lecture 1

p-adic Analysis Compared to Real Lecture 1 p-adic Analysis Compared to Real Lecture 1 Felix Hensel, Waltraud Lederle, Simone Montemezzani October 12, 2011 1 Normed Fields & non-archimedean Norms Definition 1.1. A metric on a non-empty set X is

More information

NOTES ON EXISTENCE AND UNIQUENESS THEOREMS FOR ODES

NOTES ON EXISTENCE AND UNIQUENESS THEOREMS FOR ODES NOTES ON EXISTENCE AND UNIQUENESS THEOREMS FOR ODES JONATHAN LUK These notes discuss theorems on the existence, uniqueness and extension of solutions for ODEs. None of these results are original. The proofs

More information

Classical regularity conditions

Classical regularity conditions Chapter 3 Classical regularity conditions Preliminary draft. Please do not distribute. The results from classical asymptotic theory typically require assumptions of pointwise differentiability of a criterion

More information

Contents Ordered Fields... 2 Ordered sets and fields... 2 Construction of the Reals 1: Dedekind Cuts... 2 Metric Spaces... 3

Contents Ordered Fields... 2 Ordered sets and fields... 2 Construction of the Reals 1: Dedekind Cuts... 2 Metric Spaces... 3 Analysis Math Notes Study Guide Real Analysis Contents Ordered Fields 2 Ordered sets and fields 2 Construction of the Reals 1: Dedekind Cuts 2 Metric Spaces 3 Metric Spaces 3 Definitions 4 Separability

More information

Eigenvalues and Eigenfunctions of the Laplacian

Eigenvalues and Eigenfunctions of the Laplacian The Waterloo Mathematics Review 23 Eigenvalues and Eigenfunctions of the Laplacian Mihai Nica University of Waterloo mcnica@uwaterloo.ca Abstract: The problem of determining the eigenvalues and eigenvectors

More information

McGill University Math 354: Honors Analysis 3

McGill University Math 354: Honors Analysis 3 Practice problems McGill University Math 354: Honors Analysis 3 not for credit Problem 1. Determine whether the family of F = {f n } functions f n (x) = x n is uniformly equicontinuous. 1st Solution: The

More information

Problem Set 5: Solutions Math 201A: Fall 2016

Problem Set 5: Solutions Math 201A: Fall 2016 Problem Set 5: s Math 21A: Fall 216 Problem 1. Define f : [1, ) [1, ) by f(x) = x + 1/x. Show that f(x) f(y) < x y for all x, y [1, ) with x y, but f has no fixed point. Why doesn t this example contradict

More information

Asymptotically optimal induced universal graphs

Asymptotically optimal induced universal graphs Asymptotically optimal induced universal graphs Noga Alon Abstract We prove that the minimum number of vertices of a graph that contains every graph on vertices as an induced subgraph is (1 + o(1))2 (

More information

We denote the space of distributions on Ω by D ( Ω) 2.

We denote the space of distributions on Ω by D ( Ω) 2. Sep. 1 0, 008 Distributions Distributions are generalized functions. Some familiarity with the theory of distributions helps understanding of various function spaces which play important roles in the study

More information

Introduction to Real Analysis Alternative Chapter 1

Introduction to Real Analysis Alternative Chapter 1 Christopher Heil Introduction to Real Analysis Alternative Chapter 1 A Primer on Norms and Banach Spaces Last Updated: March 10, 2018 c 2018 by Christopher Heil Chapter 1 A Primer on Norms and Banach Spaces

More information

Subhypergraph counts in extremal and random hypergraphs and the fractional q-independence

Subhypergraph counts in extremal and random hypergraphs and the fractional q-independence Subhypergraph counts in extremal and random hypergraphs and the fractional q-independence Andrzej Dudek adudek@emory.edu Andrzej Ruciński rucinski@amu.edu.pl June 21, 2008 Joanna Polcyn joaska@amu.edu.pl

More information

Measure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond

Measure Theory on Topological Spaces. Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond Measure Theory on Topological Spaces Course: Prof. Tony Dorlas 2010 Typset: Cathal Ormond May 22, 2011 Contents 1 Introduction 2 1.1 The Riemann Integral........................................ 2 1.2 Measurable..............................................

More information

1 Topology Definition of a topology Basis (Base) of a topology The subspace topology & the product topology on X Y 3

1 Topology Definition of a topology Basis (Base) of a topology The subspace topology & the product topology on X Y 3 Index Page 1 Topology 2 1.1 Definition of a topology 2 1.2 Basis (Base) of a topology 2 1.3 The subspace topology & the product topology on X Y 3 1.4 Basic topology concepts: limit points, closed sets,

More information

4 Countability axioms

4 Countability axioms 4 COUNTABILITY AXIOMS 4 Countability axioms Definition 4.1. Let X be a topological space X is said to be first countable if for any x X, there is a countable basis for the neighborhoods of x. X is said

More information

Estimates for probabilities of independent events and infinite series

Estimates for probabilities of independent events and infinite series Estimates for probabilities of independent events and infinite series Jürgen Grahl and Shahar evo September 9, 06 arxiv:609.0894v [math.pr] 8 Sep 06 Abstract This paper deals with finite or infinite sequences

More information

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9 MAT 570 REAL ANALYSIS LECTURE NOTES PROFESSOR: JOHN QUIGG SEMESTER: FALL 204 Contents. Sets 2 2. Functions 5 3. Countability 7 4. Axiom of choice 8 5. Equivalence relations 9 6. Real numbers 9 7. Extended

More information

Metric Spaces and Topology

Metric Spaces and Topology Chapter 2 Metric Spaces and Topology From an engineering perspective, the most important way to construct a topology on a set is to define the topology in terms of a metric on the set. This approach underlies

More information

On the mean connected induced subgraph order of cographs

On the mean connected induced subgraph order of cographs AUSTRALASIAN JOURNAL OF COMBINATORICS Volume 71(1) (018), Pages 161 183 On the mean connected induced subgraph order of cographs Matthew E Kroeker Lucas Mol Ortrud R Oellermann University of Winnipeg Winnipeg,

More information

Notions such as convergent sequence and Cauchy sequence make sense for any metric space. Convergent Sequences are Cauchy

Notions such as convergent sequence and Cauchy sequence make sense for any metric space. Convergent Sequences are Cauchy Banach Spaces These notes provide an introduction to Banach spaces, which are complete normed vector spaces. For the purposes of these notes, all vector spaces are assumed to be over the real numbers.

More information

The expansion of random regular graphs

The expansion of random regular graphs The expansion of random regular graphs David Ellis Introduction Our aim is now to show that for any d 3, almost all d-regular graphs on {1, 2,..., n} have edge-expansion ratio at least c d d (if nd is

More information

7: FOURIER SERIES STEVEN HEILMAN

7: FOURIER SERIES STEVEN HEILMAN 7: FOURIER SERIES STEVE HEILMA Contents 1. Review 1 2. Introduction 1 3. Periodic Functions 2 4. Inner Products on Periodic Functions 3 5. Trigonometric Polynomials 5 6. Periodic Convolutions 7 7. Fourier

More information

Quasi-randomness is determined by the distribution of copies of a fixed graph in equicardinal large sets

Quasi-randomness is determined by the distribution of copies of a fixed graph in equicardinal large sets Quasi-randomness is determined by the distribution of copies of a fixed graph in equicardinal large sets Raphael Yuster 1 Department of Mathematics, University of Haifa, Haifa, Israel raphy@math.haifa.ac.il

More information

Convexity in R n. The following lemma will be needed in a while. Lemma 1 Let x E, u R n. If τ I(x, u), τ 0, define. f(x + τu) f(x). τ.

Convexity in R n. The following lemma will be needed in a while. Lemma 1 Let x E, u R n. If τ I(x, u), τ 0, define. f(x + τu) f(x). τ. Convexity in R n Let E be a convex subset of R n. A function f : E (, ] is convex iff f(tx + (1 t)y) (1 t)f(x) + tf(y) x, y E, t [0, 1]. A similar definition holds in any vector space. A topology is needed

More information

MATH 205C: STATIONARY PHASE LEMMA

MATH 205C: STATIONARY PHASE LEMMA MATH 205C: STATIONARY PHASE LEMMA For ω, consider an integral of the form I(ω) = e iωf(x) u(x) dx, where u Cc (R n ) complex valued, with support in a compact set K, and f C (R n ) real valued. Thus, I(ω)

More information

i. v = 0 if and only if v 0. iii. v + w v + w. (This is the Triangle Inequality.)

i. v = 0 if and only if v 0. iii. v + w v + w. (This is the Triangle Inequality.) Definition 5.5.1. A (real) normed vector space is a real vector space V, equipped with a function called a norm, denoted by, provided that for all v and w in V and for all α R the real number v 0, and

More information

Solutions Final Exam May. 14, 2014

Solutions Final Exam May. 14, 2014 Solutions Final Exam May. 14, 2014 1. Determine whether the following statements are true or false. Justify your answer (i.e., prove the claim, derive a contradiction or give a counter-example). (a) (10

More information

LECTURE-13 : GENERALIZED CAUCHY S THEOREM

LECTURE-13 : GENERALIZED CAUCHY S THEOREM LECTURE-3 : GENERALIZED CAUCHY S THEOREM VED V. DATAR The aim of this lecture to prove a general form of Cauchy s theorem applicable to multiply connected domains. We end with computations of some real

More information

221A Lecture Notes Convergence of Perturbation Theory

221A Lecture Notes Convergence of Perturbation Theory A Lecture Notes Convergence of Perturbation Theory Asymptotic Series An asymptotic series in a parameter ɛ of a function is given in a power series f(ɛ) = f n ɛ n () n=0 where the series actually does

More information

Part III. 10 Topological Space Basics. Topological Spaces

Part III. 10 Topological Space Basics. Topological Spaces Part III 10 Topological Space Basics Topological Spaces Using the metric space results above as motivation we will axiomatize the notion of being an open set to more general settings. Definition 10.1.

More information

Asymptotics and Extremal Properties of the Edge-Triangle Exponential Random Graph Model

Asymptotics and Extremal Properties of the Edge-Triangle Exponential Random Graph Model Asymptotics and Extremal Properties of the Edge-Triangle Exponential Random Graph Model Alessandro Rinaldo Carnegie Mellon University joint work with Mei Yin, Sukhada Fadnavis, Stephen E. Fienberg and

More information

We are going to discuss what it means for a sequence to converge in three stages: First, we define what it means for a sequence to converge to zero

We are going to discuss what it means for a sequence to converge in three stages: First, we define what it means for a sequence to converge to zero Chapter Limits of Sequences Calculus Student: lim s n = 0 means the s n are getting closer and closer to zero but never gets there. Instructor: ARGHHHHH! Exercise. Think of a better response for the instructor.

More information

From now on, we will represent a metric space with (X, d). Here are some examples: i=1 (x i y i ) p ) 1 p, p 1.

From now on, we will represent a metric space with (X, d). Here are some examples: i=1 (x i y i ) p ) 1 p, p 1. Chapter 1 Metric spaces 1.1 Metric and convergence We will begin with some basic concepts. Definition 1.1. (Metric space) Metric space is a set X, with a metric satisfying: 1. d(x, y) 0, d(x, y) = 0 x

More information

Hilbert spaces. 1. Cauchy-Schwarz-Bunyakowsky inequality

Hilbert spaces. 1. Cauchy-Schwarz-Bunyakowsky inequality (October 29, 2016) Hilbert spaces Paul Garrett garrett@math.umn.edu http://www.math.umn.edu/ garrett/ [This document is http://www.math.umn.edu/ garrett/m/fun/notes 2016-17/03 hsp.pdf] Hilbert spaces are

More information

Packing and Covering Dense Graphs

Packing and Covering Dense Graphs Packing and Covering Dense Graphs Noga Alon Yair Caro Raphael Yuster Abstract Let d be a positive integer. A graph G is called d-divisible if d divides the degree of each vertex of G. G is called nowhere

More information

MACROSCOPIC VARIABLES, THERMAL EQUILIBRIUM. Contents AND BOLTZMANN ENTROPY. 1 Macroscopic Variables 3. 2 Local quantities and Hydrodynamics fields 4

MACROSCOPIC VARIABLES, THERMAL EQUILIBRIUM. Contents AND BOLTZMANN ENTROPY. 1 Macroscopic Variables 3. 2 Local quantities and Hydrodynamics fields 4 MACROSCOPIC VARIABLES, THERMAL EQUILIBRIUM AND BOLTZMANN ENTROPY Contents 1 Macroscopic Variables 3 2 Local quantities and Hydrodynamics fields 4 3 Coarse-graining 6 4 Thermal equilibrium 9 5 Two systems

More information

An introduction to Mathematical Theory of Control

An introduction to Mathematical Theory of Control An introduction to Mathematical Theory of Control Vasile Staicu University of Aveiro UNICA, May 2018 Vasile Staicu (University of Aveiro) An introduction to Mathematical Theory of Control UNICA, May 2018

More information

arxiv: v2 [math.co] 11 Mar 2008

arxiv: v2 [math.co] 11 Mar 2008 Testing properties of graphs and functions arxiv:0803.1248v2 [math.co] 11 Mar 2008 Contents László Lovász Institute of Mathematics, Eötvös Loránd University, Budapest and Balázs Szegedy Department of Mathematics,

More information

Edge-disjoint induced subgraphs with given minimum degree

Edge-disjoint induced subgraphs with given minimum degree Edge-disjoint induced subgraphs with given minimum degree Raphael Yuster Department of Mathematics University of Haifa Haifa 31905, Israel raphy@math.haifa.ac.il Submitted: Nov 9, 01; Accepted: Feb 5,

More information

Lecture 4: Completion of a Metric Space

Lecture 4: Completion of a Metric Space 15 Lecture 4: Completion of a Metric Space Closure vs. Completeness. Recall the statement of Lemma??(b): A subspace M of a metric space X is closed if and only if every convergent sequence {x n } X satisfying

More information

7 About Egorov s and Lusin s theorems

7 About Egorov s and Lusin s theorems Tel Aviv University, 2013 Measure and category 62 7 About Egorov s and Lusin s theorems 7a About Severini-Egorov theorem.......... 62 7b About Lusin s theorem............... 64 7c About measurable functions............

More information

14 : Theory of Variational Inference: Inner and Outer Approximation

14 : Theory of Variational Inference: Inner and Outer Approximation 10-708: Probabilistic Graphical Models 10-708, Spring 2014 14 : Theory of Variational Inference: Inner and Outer Approximation Lecturer: Eric P. Xing Scribes: Yu-Hsin Kuo, Amos Ng 1 Introduction Last lecture

More information

Notes on uniform convergence

Notes on uniform convergence Notes on uniform convergence Erik Wahlén erik.wahlen@math.lu.se January 17, 2012 1 Numerical sequences We begin by recalling some properties of numerical sequences. By a numerical sequence we simply mean

More information

Math 209B Homework 2

Math 209B Homework 2 Math 29B Homework 2 Edward Burkard Note: All vector spaces are over the field F = R or C 4.6. Two Compactness Theorems. 4. Point Set Topology Exercise 6 The product of countably many sequentally compact

More information

An Application of First-Order Logic to a Problem in Combinatorics 1

An Application of First-Order Logic to a Problem in Combinatorics 1 An Application of First-Order Logic to a Problem in Combinatorics 1 I. The Combinatorial Background. A. Families of objects. In combinatorics, one frequently encounters a set of objects in which a), each

More information

Set, functions and Euclidean space. Seungjin Han

Set, functions and Euclidean space. Seungjin Han Set, functions and Euclidean space Seungjin Han September, 2018 1 Some Basics LOGIC A is necessary for B : If B holds, then A holds. B A A B is the contraposition of B A. A is sufficient for B: If A holds,

More information

Tools from Lebesgue integration

Tools from Lebesgue integration Tools from Lebesgue integration E.P. van den Ban Fall 2005 Introduction In these notes we describe some of the basic tools from the theory of Lebesgue integration. Definitions and results will be given

More information

LINEAR EQUATIONS WITH UNKNOWNS FROM A MULTIPLICATIVE GROUP IN A FUNCTION FIELD. To Professor Wolfgang Schmidt on his 75th birthday

LINEAR EQUATIONS WITH UNKNOWNS FROM A MULTIPLICATIVE GROUP IN A FUNCTION FIELD. To Professor Wolfgang Schmidt on his 75th birthday LINEAR EQUATIONS WITH UNKNOWNS FROM A MULTIPLICATIVE GROUP IN A FUNCTION FIELD JAN-HENDRIK EVERTSE AND UMBERTO ZANNIER To Professor Wolfgang Schmidt on his 75th birthday 1. Introduction Let K be a field

More information

Least Sparsity of p-norm based Optimization Problems with p > 1

Least Sparsity of p-norm based Optimization Problems with p > 1 Least Sparsity of p-norm based Optimization Problems with p > Jinglai Shen and Seyedahmad Mousavi Original version: July, 07; Revision: February, 08 Abstract Motivated by l p -optimization arising from

More information

Discrete Mathematics

Discrete Mathematics Discrete Mathematics 309 (009) 3811 380 Contents lists available at ScienceDirect Discrete Mathematics journal homepage: www.elsevier.com/locate/disc Disjoint hamiltonian cycles in bipartite graphs Michael

More information

Math 61CM - Solutions to homework 6

Math 61CM - Solutions to homework 6 Math 61CM - Solutions to homework 6 Cédric De Groote November 5 th, 2018 Problem 1: (i) Give an example of a metric space X such that not all Cauchy sequences in X are convergent. (ii) Let X be a metric

More information

The Liapunov Method for Determining Stability (DRAFT)

The Liapunov Method for Determining Stability (DRAFT) 44 The Liapunov Method for Determining Stability (DRAFT) 44.1 The Liapunov Method, Naively Developed In the last chapter, we discussed describing trajectories of a 2 2 autonomous system x = F(x) as level

More information

Separation in General Normed Vector Spaces 1

Separation in General Normed Vector Spaces 1 John Nachbar Washington University March 12, 2016 Separation in General Normed Vector Spaces 1 1 Introduction Recall the Basic Separation Theorem for convex sets in R N. Theorem 1. Let A R N be non-empty,

More information

The Completion of a Metric Space

The Completion of a Metric Space The Completion of a Metric Space Let (X, d) be a metric space. The goal of these notes is to construct a complete metric space which contains X as a subspace and which is the smallest space with respect

More information