Hölder-type inequalities and their applications to concentration and correlation bounds

Similar documents
Optimization Problems in Multiple Subtree Graphs

arxiv: v1 [math.co] 25 Apr 2016

A spectral Turán theorem

Balanced Partitions of Vector Sequences

Computing Laboratory A GAME-BASED ABSTRACTION-REFINEMENT FRAMEWORK FOR MARKOV DECISION PROCESSES

On diamond-free subposets of the Boolean lattice

arxiv: v3 [math.co] 20 Sep 2016

arxiv: v1 [math.pr] 25 Jun 2015

DECOMPOSING 4-REGULAR GRAPHS INTO TRIANGLE-FREE 2-FACTORS

Asymptotic Normality of an Entropy Estimator with Exponentially Decaying Bias

Chromatic characterization of biclique cover. Denis Cornaz. Jean Fonlupt. Equipe Combinatoire, UFR 921, Universite Pierre et Marie Curie, 4 place

An Explicit Lower Bound of 5n o(n) for Boolean Circuits

different formulas, depending on whether or not the vector is in two dimensions or three dimensions.

Patterns of Non-Simple Continued Fractions

Course Notes. Part IV. Probabilistic Combinatorics. Algorithms

Scalar multiplication and algebraic direction of a vector

An extension of the Moser-Tardos algorithmic local lemma

arxiv: v1 [cs.dm] 21 Oct 2017

Isoperimetric problems

Bc. Dominik Lachman. Bruhat-Tits buildings

Network Flow Problems Luis Goddyn, Math 408

Properly colored Hamilton cycles in edge colored complete graphs

Towards Universal Cover Decoding

Constructive bounds for a Ramsey-type problem

arxiv: v2 [math.co] 12 Jul 2009

Lecture J. 10 Counting subgraphs Kirchhoff s Matrix-Tree Theorem.

arxiv: v3 [math.co] 10 Mar 2018

The expansion of random regular graphs

Section 1.7. Linear Independence

Polynomial-Solvability of N P-class Problems

Independent Transversals in r-partite Graphs

The concentration of the chromatic number of random graphs

On the Linear Threshold Model for Diffusion of Innovations in Multiplex Social Networks

arxiv: v1 [math.co] 10 Jun 2018

arxiv: v1 [math.co] 4 Jan 2018

Matchings in hypergraphs of large minimum degree

Vertex colorings of graphs without short odd cycles

On (a, d)-vertex-antimagic total labeling of Harary graphs

An asymptotically tight bound on the adaptable chromatic number

Lecture 1. 1 Overview. 2 Maximum Flow. COMPSCI 532: Design and Analysis of Algorithms August 26, 2015

Induced subgraphs with many repeated degrees

THE MINIMUM MATCHING ENERGY OF BICYCLIC GRAPHS WITH GIVEN GIRTH

Online Companion to Pricing Services Subject to Congestion: Charge Per-Use Fees or Sell Subscriptions?

G 2(X) X SOLVABILITY OF GRAPH INEQUALITIES. 1. A simple graph inequality. Consider the diagram in Figure 1.

Lectures on Spectral Graph Theory. Fan R. K. Chung

Assignment 2 : Probabilistic Methods

Lecture 4: Inequalities and Asymptotic Estimates

Subhypergraph counts in extremal and random hypergraphs and the fractional q-independence

THE NUMBER OF INDEPENDENT DOMINATING SETS OF LABELED TREES. Changwoo Lee. 1. Introduction

LECTURE 2: CROSS PRODUCTS, MULTILINEARITY, AND AREAS OF PARALLELOGRAMS

Sharp threshold functions for random intersection graphs via a coupling method.

Edge-disjoint induced subgraphs with given minimum degree

ACYCLIC WEAK CONVEX DOMINATION IN GRAPHS

arxiv: v1 [math.co] 2 Dec 2013

Improved Bounds on the L(2, 1)-Number of Direct and Strong Products of Graphs

arxiv: v1 [stat.ml] 15 Feb 2018

INTEGERS WITH A DIVISOR IN (y, 2y]

Sampling Binary Contingency Tables with a Greedy Start

A Regularization Framework for Learning from Graph Data

Integer Parameter Synthesis for Real-time Systems

Dynamic Vehicle Routing with Moving Demands Part II: High speed demands or low arrival rates

The adjacent vertex-distinguishing total chromatic number of 1-tree

Szemerédi s regularity lemma revisited. Lewis Memorial Lecture March 14, Terence Tao (UCLA)

< k 2n. 2 1 (n 2). + (1 p) s) N (n < 1

The Budgeted Minimum Cost Flow Problem with Unit Upgrading Cost

of L(G), the square of the line graph of G 1. The strong chromatic index of G, sχ 0 (G) is the least integer k such that G has a strong edge-colouring

Probabilistic Proofs of Existence of Rare Events. Noga Alon

List coloring hypergraphs

Measure and Conquer: A Simple O( n ) Independent Set Algorithm

4. A Physical Model for an Electron with Angular Momentum. An Electron in a Bohr Orbit. The Quantum Magnet Resulting from Orbital Motion.

Decomposition of random graphs into complete bipartite graphs

Accuracy Bounds for Belief Propagation

Dynamic Vehicle Routing with Moving Demands Part II: High speed demands or low arrival rates

ECE534, Spring 2018: Solutions for Problem Set #4 Due Friday April 6, 2018

On a correlation inequality of Farr

Notes 6 : First and second moment methods

Out-colourings of Digraphs

The number of edge colorings with no monochromatic cliques

Least singular value of random matrices. Lewis Memorial Lecture / DIMACS minicourse March 18, Terence Tao (UCLA)

Distributed Weighted Vertex Cover via Maximal Matchings

An estimate for the probability of dependent events

Size and degree anti-ramsey numbers

Fractional and circular 1-defective colorings of outerplanar graphs

The optimal pebbling number of the complete m-ary tree

ON THE STRUCTURE OF ORIENTED GRAPHS AND DIGRAPHS WITH FORBIDDEN TOURNAMENTS OR CYCLES

Large topological cliques in graphs without a 4-cycle

Induced subgraphs of Ramsey graphs with many distinct degrees

A Simple Algorithm for Clustering Mixtures of Discrete Distributions

The Geometry and Topology of Groups

P(c y) = W n (y c)p(c)

Two-sided bounds for L p -norms of combinations of products of independent random variables

Min-Wise independent linear permutations

PRODUCTS IN CONDITIONAL EXTREME VALUE MODEL

Math 144 Activity #9 Introduction to Vectors

Roberto s Notes on Linear Algebra Chapter 1: Geometric vectors Section 8. The dot product

Approximate Sunflowers

An Algorithmic Proof of the Lopsided Lovász Local Lemma (simplified and condensed into lecture notes)

Simple Stochastic Games with Few Random Vertices Are Easy to Solve

STAT 7032 Probability Spring Wlodek Bryc

International Journal of Scientific & Engineering Research, Volume 7, Issue 11, November-2016 ISSN Nullity of Expanded Smith graphs

Transcription:

Hölder-type inequalities and their applications to concentration and correlation bounds Christos Pelekis * Jan Ramon Yuyi Wang Noember 29, 2016 Abstract Let Y, V, be real-alued random ariables haing a dependency graph G = V, E) We show that [ ] E Y { ]} E [Y b b, where is the b-fold chromatic number of G This inequality may be seen as a dependency-graph analogue of a generalised Hölder inequality, due to Helmut Finner Additionally, we proide applications of the aformentioned Hölder-type inequalities to concentration and correlation bounds for sums of weakly dependent random ariables whose dependencies can be described in terms of graphs or hypergraphs Keywords: fractional chromatic number; Finner s inequality; Janson s inequality; dependency graph; hypergraphs 1 Introduction and related work The main purpose of this article is to illustrate that certain Hölder-type inequalities can be employed in order to obtain concentration and correlation bounds for sums of weakly dependent random ariables whose dependencies are described in terms of graphs, or hypergraphs Before being more precise, let us begin with some notation and definitions that will be fixed throughout the text * Department of Computer Science, KU Leuen, Celestijnenlaan 200A, 3001, Belgium, Email: pelekischr@gmailcom Department of Computer Science, KU Leuen, Celestijnenlaan 200A, 3001, Belgium, Email: JanRamon@cskuleuenbe Department of Computer Science, KU Leuen, Celestijnenlaan 200A, 3001, Belgium, Email: yuyiwang920@gmailcom 1

A hypergraph H is a pair V, E) where V is a finite set and E is a family of subsets of V The set V is called the ertex set of H and the set E is called the edge set of H; the elements of E are called hyperedges or just edges The cardinality of the ertex set will be denoted by V and the cardinality of the edge set by E A hypergraph is called k-uniform if eery edge from E has cardinality k A 2-uniform hypergraph is a graph The degree of a ertex V is defined as the number of edges that contain A hypergraph will be called d- regular if eery ertex has degree d A subset V V is called independent if it does not contain any edge from E A fractional matching of a hypergraph, H = V, E), is a function φ : E [0, 1] such that e: e φe) 1, for all ertices V The fractional matching number of H, denoted ν H), is defined as max φ φe) where the maximum runs oer all fractional matchings of H The fractional chromatic number of a graph G is defined in the following way A b-fold colouring of G is an assignment of sets of size b to the ertices of the graph in such a way that adjacent ertices hae disjoint sets A graph is a : b)- colourable if it has a b-fold colouring using a different colours The least a for which the graph is a : b)-colourable is the b-fold chromatic number of the graph, denoted G) The fractional chromatic number of a graph G is defined as χ χ G) = inf b G) b b Here and later, P[ ] and E[ ] will denote probability and expectation, respectiely Let us also recall Hölder s inequality Let Ω, A, P) be a probability space Let A be a finite set and let Y a, a A, be random ariables from Ω into R Suppose that w a, a A are non-negatie weights such that a A w a 1 and each Y a has finite 1 w a -moment, ie, [ E Y 1/wa a ] < +, for all a A Hölder s inequality asserts that [ ] E Y a a A [ E a A Y 1/wa a This is a classic result see [2]) In this article we shall be interested in applications of Hölder-type inequalities to concentration and correlation bounds for sums of weakly dependent random ariables We focus on two particular types of dependencies between random ariables The first one is described in terms of a hypergraph Definition 1 hypergraph-correlated random ariables) Let H = V, E) be a hypergraph Suppose that {Y e } is a collection of real-alued random ariables, indexed by the edge set of H, which satisfy the following: there exist independent random ariables {X } indexed by the ertex set V such that, for eery edge e E, we hae Y e = f e X ; e) is a function that depends only on the random ariables X with e We will refer to the aforementioned random ariables {Y e } as H-correlated, or simply as hypergraph-correlated, when there is no confusion about the underlying hypergraph As an example of hypergraph-correlated random ariables the reader may think of the indicators of the triangles in an Erdős-Rényi random graph model Gn, p) Recall that ] wa 2

such a model generates a random graph, G, on n labelled ertices by joining pairs of ertices, independently, with probability p 0, 1) To see that the indicators of the triangles in G are hypergraph-correlated, let H G be the hypergraph whose ertex set, V, has an element for eery potential) edge in G and whose edge set, E, has a hyperedge for eery triplet of edges in G that form potential) triangles Let X, V, be independent Bernoulli Berp) random ariables For eery triangle T in G, let E T be the corresponding hyperedge of H G Now notice that the indicator of each triangle, T, in G can be written as a product E T X More examples of hypergraph-correlated random ariables can be found in Section 4 Hypergraph-correlated random ariables are encountered in the theory of random graphs and machine learning see [9, 11, 17, 19] and references therein); a quintessential question that motiates their study asks for upper bounds on the probability that a random graph G Gn, p) contains more triangles than expected Notice that the indicators of the triangles in G Gn, p) are not mutually independent and therefore the standard Chernoff-Hoeffding bounds do not apply Another type of dependency structure that plays a key role in probabilistic combinatorics and related areas inoles the notion of a dependency graph see also [1, 12]) Definition 2 Graph-dependent random ariables) A dependency graph for the random ariables {Y }, indexed by a finite set V, is any loopless graph, G = V, E), whose ertex set V is the index set of the random ariables and whose edge set is such that if V V and i V is not incident to any ertex of V, then Y is mutually independent of the random ariables Y for which V We will refer to random ariables {Y } haing a dependency graph G as G-dependent or as graph-dependent A difference between the notions of graph-dependent and hypergraph-correlated random ariables is illustrated in Example 22 below An example of graph-dependent random ariables is the indicators of the triangles in a random graph G Gn, p) Let us remark that the preious notions of weakly dependent random ariables arise in the study of the, so-called, Loász Local Lemma In this setting one is dealing with a finite) number of eents in some probability space, and is interested in lower bounds on the probability that none of the eents will occur The eents need not be independent and their indicators are assumed to be graph-dependent, or hypergraph-correlated see [6, 16]) In this article we focus on upper bounds on the probability that none of the eents will occur Notice that if {Y e } are hypergraph-correlated random ariables then they induce a dependency graph whose ertex set is E and with edges joining any two sets e, e E such that e e Hence a set of hypergraph-correlated random ariables is graphdependent The reader may wonder whether the conerse holds true We will show, using a particular generalisation of Hölder s inequality, that this is not the case see Example 22 below) and so the aforementioned notions of dependencies between random ariables are not equialent 3

In the present paper we proide a Hölder-type inequality for graph-dependent random ariables Our result can be seen as a dependency-graph analogue of the following theorem, due to Helmut Finner Theorem 11 Finner [7]) Let H = V, E) be a hypergraph and let {Y e } be H-correlated random ariables If φ : E [0, 1] is a fractional matching of H then [ ] E Y e { [ ]} φe) E Ye 1/φe) Notice that, [ by applying the preious result to the random ariables Z e = Ye φe), one ] concludes E Y e φe) {E [Y e]} φe) See [7] for a proof of Theorem 11 which is based on Fubini s theorem and Hölder s inequality Alternatiely, see [17] for a proof that uses the concaity of the weighted geometric mean and Jensen s inequality In other words, Theorem 11 proides a Hölder-type inequality for hypergraph-correlated random ariables which is formalised in terms of a fractional matching of the underlying hypergraph In this article we illustrate how certain Hölder-type inequalities yield concentration and correlation bounds for sums of hypergraph-correlated random ariables as well as for sums of graph-dependent random ariables In that regard, our contribution is two-fold On the one hand, we show that a particular graph-dependent analogue of Theorem 11 yields a new proof of the following result, due to Sante Janson, which proides an estimate on the probability that the sum of graph-dependent random ariables is significantly larger than its mean Theorem 12 Janson [11]) Let {Y } be [0, 1]-alued random ariables haing a dependency graph G = V, E) Set q := 1 V E [ Y ] If t = nq + ε) for some ε > 0, then ) P Y t exp 2ε2 V χ, where χ = χ G) is the fractional chromatic number of G See [11, Theorem 21] for a proof of this result which is based on breaking up the sum into a particular linear combination of sums of independent random ariables On the other hand, we show that Finner s inequality applies to the following problem, that is interesting on its own Problem 13 Fix a hypergraph H = V, E) and let I be a random subset of V formed by including ertex V in I with probability p 0, 1), independently of other ertices What is an upper bound on the probability that I is independent? 4

Here and later, gien a set of parameters in 0, 1), say p = {p }, indexed by the ertex set of a hypergraph H, we will denote by πp, H) the probability that I is independent Let us remark that Problem 13 has attracted the attention of seeral authors and appears to be related to a ariety of topics see [4, 8, 13, 15, 19] and references therein) A particular line of research is motiated by question about independent sets and subgraph counting in random graphs In this context, Problem 13 has been considered by Janson et al [13], Krieleich et al [15] and Wolfoitz [19] It is obsered in [15] that when H is k-uniform, d-regular and p = p for all V, an exponential estimate on πp, H) can be obtained using the so-called Janson s correlation inequality Additionally, it is shown in [15, Section 5] that under certain mild additional assumptions the bound proided by Janson s inequality can be improed to )) p E πp, H) exp Ω 1 p)kd See [15] for a precise formulation of the mild additional assumptions and a proof of this result which is based on a martingale-type concentration inequality The remaining part of our paper is organised as follows In the Section 2 we collect our main results In particular, we proide a Hölder-type inequality for graph-dependent random ariables and we apply this result in order to deduce upper bounds on the probability that a sum of graph-dependent random ariables is significantly larger than its mean We also show that Finner s inequality yields an upper bound on πp, H) The proofs of our main results are deferred to Section 3 Finally, in Section 4, we illustrate that Finner s inequality can be seen as an alternatie to the, well-known, Janson s correlation inequality Our paper ends with Section 5 in which we briefly sketch how one can apply similar ideas to yet another class of weakly dependent random ariables 2 Main results In this section we collect our main results Our first result proides an analogue of Theorem 11 for random ariables haing a dependency graph G The corresponding Höldertype inequality is formalised in terms of the b-fold chromatic number of G and reads as follows Theorem 21 Let {Y } be real-alued random ariables haing a dependency graph G = V, E) Then, for eery b-fold colouring of G using := G) colours, we hae [ ] E Y { [ E Y b ]} b 5

We proe Theorem 21 in Section 3 The proof employs the concaity of the weighted geometric mean and the definition of b-fold chromatic number Recall that Finner s inequality applies to hypergraph-correlated random ariables and that such an ensemble of random ariables induces a dependency graph Howeer, as is shown in the following example, there exist graph-dependent random ariables which are not hypergraph-correlated and therefore Theorem 21 proides a Hölder-type inequality can be applied to more general ensembles of weakly dependent random ariables Example 22 R s that are graph-dependent but not hypergraph-correlated) Let G be a cycle-graph on 5 ertices { 1,, 5 } such that i is incident to i+1, for i {1, 2, 3, 4} and 5 is incident to 1 Let Y = Y 1,, Y 5 ) be a ector of Bernoulli 0/1 random ariables whose distribution is defined as follows The ector Y takes the alue 0, 0, 0, 0, 0) with probability 1 2 2 p)1 p) 2, the alue 1, 1, 1, 1, 1) with probability p2 +p 3 2, the alues 0, 0, 0, 1, 1), 0, 0, 1, 1, 0), 0, 1, 1, 0, 0), 1, 1, 0, 0, 0), 1, 0, 0, 0, 1) with probability p1 p)2 2, the alues 0, 0, 1, 1, 1), 0, 1, 1, 1, 0), 1, 1, 1, 0, 0), 1, 1, 0, 0, 1), 1, 0, 0, 1, 1) with probability p2 p 3 2 and the remaining alues with probability 0 Elementary, though quite tedious, calculations show that E[Y j ] = p, for j = 1,, 5 and that G is a dependency graph for {Y j } 5 j=1 Now assume that {Y j} 5 j=1 are H-correlated, for some hypergraph H = V, E) Notice that the cardinality of E must be equal to 5 If e i E is the edge corresponding to the random ariable Y i, i = 1,, 5, then the fact that {Y i } 5 i=1 hae G as a depencency graph implies that e i e i+2) mod 5 =, for i {1, 2, 3, 4, 5} This means that the fractional matching number of H is at least 25 and therefore Theorem 11 implies that P [Y = 1, 1, 1, 1, 1)] p 25 Howeer, the arithmetic geometric means inequality implies p2 +p 3 2 > p 25 and therefore the random ariables {Y j } 5 j=1 cannot be hypergraph-correlated We also show that Theorem 21, combined with standard techniques based on exponential moments, yields, one the one hand, a new proof of Theorem 12 and, on the other hand, the following Bennett-type inequality for graph-dependent random ariables Theorem 23 Let {Y } be random ariables haing a dependency graph G = V, E) For eery V let σ 2 := Var Y ) and assume further that Y 1 and E[Y ] = 0 Set S = σ2 and fix t > 0 Then P Y t where ψx) = 1 + x) ln1 + x) x exp S χ G) ψ )) t, S 6

Let us remark that Theorem 23 improes on [11, Theorem 23] which contains the same bound but with ψ ) 4t 5S instead of our ψ t ) S Note that we assume a one-sided bound on each Y The proof of Theorem 23 is based on Theorem 21 and can be found in Section 3 Notice also that Theorems 21 and 23 are concerned with graph-dependent random ariables Our final result concerns hypergraph-correlated indicators In Section 3 we show that Finner s inequality yields the following upper bound on the probability that a random subset of the ertex set of a hypergraph is independent Theorem 24 Let H = V, E) be a hypergraph and p = {p } be a set of parameters from 0, 1) Then πp, H) e 1 e p ) φe), where φ : E [0, 1] is a fractional matching of H In particular, if the hypergraph H is k-uniform, d-regular and p = p, for all V, then πp, H) 1 p k) ν H) exp p k E ), d where ν H) is the fractional matching number of H Notice that the first bound in the second statement of Theorem 24 has a monotonicity property, in the sense that if H 1 is a superhypergraph of H 2 then 1 p k) ν H 1 ) 1 p k ) ν H 2 ) In Section 4 we illustrate that Theorem 24 may be seen as an alternatie to Janson s correlation inequality Let us end this section by noting that Theorem 11, combined with standard exponentialmoment ideas, yields concentration inequalities for sums of hypergraph-correlated random ariables This has been reported in prior work and so we only proide the statement without proof In [17] one can find a proof of the following result Theorem 25 Ramon et al [17]) Let H = V, E) be a hypergraph and assume that {Y e } are H-correlated random ariables Assume further that Y e [0, 1], for all e E, and that E [Y e ] = p e, for some p e 0, 1) Let φ : E [0, 1] be a fractional matching of H and set Φ = e φe), p = 1 E e E [Y e] If t is a real number from the interal Φp, Φ) such that t = Φp + ε), then [ ] P φe)y e t exp 2Φε 2) In particular, if d is the maximum degree of H and φe) = 1 d, for all e E, then Theorem 7

25 yields the bound P Y e t e exp 2 E ) d ε2, for t = E p + ε) This inequality has also been obtained in Gainsky et al [9] using entropy ideas As is mentioned in the introduction, the indicators, say {I T } T, of the triangles in an Erdős- Rényi random graph are both graph-dependent and hypergraph-correlated; their dependency graph is the graph which is induced from the random ariables {I T } T when they are iewed as hypergraph-correlated This means that both Theorem 12 and Theorem 25 proide an upper bound on the probability that G Gn, p) contains more triangles than expected We inite the reader to erify that Theorem 25 proides a better bound An intuitie explanation as to why Theorem 25 gies a better bound is that, while considering the induced dependency-graph of a set of hypergraph-correlated random ariables, one loses information This behaiour has also been reported in the context of Loász Local Lemma see Kolipaka et al [14]) 3 Proofs In this section we proe our main results We begin with Theorem 21, whose proof is based on the concaity of the weighted geometric mean Lemma 31 Let β = β 1,, β k ) be a ector of non-negatie real numbers such that k i=1 β i = 1 Then the function g : R k R defined by gt) = k i=1 tβ i i is concae Proof This is easily erified by showing that the Hessian matrix is positie definite See [5], or [17] for details We are now ready to proe Theorem 21 Proof of Theorem 21 We show that [ ] E {Y } b {E [Y ]} b The theorem follows by applying this inequality to the random ariables Z = Y For eery colour i = 1,, let I i be the set consisting of the ertices that are coloured with b 8

colour i Note that each I i is an independent subset of V and eery ertex V appears in exactly b independent sets I i Therefore, [ ] E {Y } b = E {Y } i=1 I i = E 1 i=1 I i Y Lemma 31 and Jensen s inequality combined with the obseration that the random ariables {Y } Ii are mutually independent yield E i=1 I i Y 1 E Y Ii i=1 1 = i=1 1 I i {E [Y ]} Now, using again the fact that each ertex appears in exactly b sets I i, we conclude and the result follows i=1 I i {E [Y ]} 1 = {E [Y ]} b Recall the following, well-known, result whose proof is included for the sake of completeness Lemma 32 Let X be a random ariable that takes alues on the interal [0, 1] Suppose that E[X] = p, for some p 0, 1), and let B be a Bernoulli 0/1 random ariable such that E[B] = p If f : [0, 1] R is a conex function, then E [fx)] E [fb)] 1 Proof Gien an outcome from the random ariable X, define the random ariable B X that takes the alues 0 and 1 with probability 1 X and X, respectiely It is easy to see that E [B X ] = p and so B X has the same distribution as B Now Jensen s inequality implies E [fx)] = E [ f E [ B X X ])] E [ f B X X )] = E [fb X )], as required We now proceed with the proof of Theorem 12 9

Proof of Theorem 12 Fix h > 0 and let q = E [Y ], for V Using Marko s inequality and Theorem 21 we estimate P Y t e ht E [e h ] Y [ ] = e ht E e hy e ht { [ χb )]} b E exp b hy For V let B be a Bernoulli 0/1 random ariable of mean q Lemma 32 implies e ht { [ χb )]} b E exp b hy { [ e ht χb )]} b E exp b hb = e { } ht 1 q ) + q e b b h Using the weighted arithmetic-geometric means inequality we conclude e ht { } { 1 q ) + q e b b h χ b e ht 1 )} b V 1 q ) + q e χ b b h V = e ht { 1 q + qe b h } b V If we minimise the last expression with respect to h > 0 we get that h must satisfy e h b = t1 q) q V t) and therefore, since t = V q + ε), we conclude P Y t { ) q q+ε ) } b 1 q 1 q+ε) V = e b V Dq+ε q), q + ε 1 q + ε) where Dq + ε q) is the Kullback-Leibler distance between q + ε and q Finally, using the standard estimate Dq + ε q) 2ε 2, we deduce P Y t e b V 2ε χ 2 b and the result follows upon minimising the last expression with respect to b The proof of Theorem 23 is similar 10

Proof of Theorem 23 Fix h > 0 to be determined later As in the proof of Theorem 12, Marko s inequality and Theorem 21 yield P Y t e ht { [ χb )]} b E exp b hy Using an inequality proed in [11] see inequality 37), page 240), we hae [ χb )] E exp b hy exp σg 2 χb )) b h, where ga) := e a 1 a Summarising, we hae shown b P Y t exp ht + e hχb/b b ) ) h S Now choose h = b ln 1 + t ) S to deduce b P Y t exp t S b χ b = exp S b ψ t S 1 + t ) ln 1 + t )) S S )) The result follows upon minimising the last expression with respect to b We end this section with the proof of Theorem 24 Proof of Theorem 24 Let X, V, be indicators of the eent I For each e E set Y e = e B Clearly, the random ariables {Y e } are H-correlated Now look at the probability P [ e Y e = 0] Notice that if all Y e are equal to zero, then eery edge e E contains a ertex,, such that B = 0 and ice ersa This implies that if e Y e = 0 then the set of ertices,, for which B = 1 is an independent subset of V and ice ersa Therefore [ ] [ ] P Y e = 0 = E 1 Y e ) = πp, H) From Theorem 11 we deduce [ ] [ ] E 1 Y e ) E 1 Y e ) φe) E [1 Y e ]) φe) 11

and the first statement follows To proe the second statement notice that E [Y e ] = p k, for all e E, and therefore [ ] E 1 Y e ) 1 p k) e φe) The result follows by maximising the exponent on the right hand side oer all fractional matchings of H In the next section we proide seeral applications of Theorem 24 to the theory of random graphs 4 Applications In this section we discuss comparisons between Finner s and Janson s inequality Janson s inequality see Janson [10] and Janson et al [12, Chapter 2]) is a well known result that proides upper estimates on the probability that a sum of dependent indicators is equal to zero It is described in terms of the dependency graph corresponding to the indicators More precisely, let {B } be indicators haing a dependency graph G Set µ = E [ B ] and = e={u,} G E [B ub ] Janson s inequality asserts that [ ] { ) } P B = 0 min e µ+, exp 1 E[B ]) 1 max E[B ] This inequality has been proen to be ery useful in the study of the Erdős-Rényi random graph model Gn, p) For G Gn, p) let us denote by T G the number of triangles in G A typical application of Janson s inequality proides the estimate P [G Gn, p) is triangle-free] 1 p 3 ) n 3) exp 21 p 3 ) where = 6 n 4) p 5 In this section we juxtapose the preious bound with the bound proided by Finner s inequality Proposition 41 Let G Gn, p) be an Erdős-Rényi random graph and denote by T G the number of triangles in G Then P [T G = 0] 1 p 3) n 2 1 n 3) Proof We apply Theorem 24 Define a hypergraph H = V, E) as follows Let i, i = 1,, n 2), be an enumeration of all potential) edges in G and consider 1,, n 2) as 12 ),

the ertex set V of the hypergraph H Let B i, i = 1,, n 2) be independent Bernoulli Berp) random ariables, corresponding to the edges of G, and let E i, i = 1, n 3), be an enumeration of all triplets of edges in G that form potential) triangles in G Define E = {E 1,, E n 3) } to be the edge set of H and let Z i be the indicator of triangle E i ; thus T G = i Z i Now form a subset I of V by picking each ertex, independently, with probability p Then the probability that I is independent equals P [T G = 0] and, in order to apply Theorem 24, we hae to find a fractional matching of H Since eery ertex of H belongs to n 2 edges in E = {E 1,, E n }, we obtain a fractional matching, φ ) of H 3) by setting φe i ) = 1 n 2, for i = 1,, n 3) The result follows Notice that the bound obtained from Janson s inequality is smaller than the bound gien by Proposition 41 for alues of p that are close to 0, but the bound of Proposition 41 does better for large alues of p Similar estimates can be obtained for the probability that a graph G Gn, p) contains no k-clique, for k 3 The details are left to the reader We now proceed with one more application of Finner s inequality Let G Gn, p) be a random graph on n labelled ertices Fix two ertices, say u and What is an upper bound on the probability that there is no path of length k between u and? A path of length k is a sequence of edges { 0, 1 }, { 1, 2 },, { k 2, k 1 }, { k 1, k } such that i j, for i j We assume k 3, otherwise the problem is easy Let us remark that it may be cumbersome to apply Janson s inequality to the preious question By contrast, Theorem 24 reduces the problem to a counting one More explicitly, let {P i } i be an enumeration of all potential) paths of length k between u and Clearly, there are n 2 k 1) k 1)! such paths Define the hypergraph H = V, E) as follows The ertices of H correspond to the potential) edges of G and the edges of H correspond to the sets of edges in G which form a path of length k between u and Hence the probability that there is no path of length k between u and equals πp, H) In order to apply Theorem 24 we hae to find a fractional matching of H and, in order to do so, it is enough to find an upper bound on the maximum degree of H To this end, fix an edge, e = {x, y}, in G In case one of the ertices x or y is equal to either u or, then there are n 3 k 2) k 2)! paths of length k from u to that pass through edge e If none of the ertices x, y is equal to u or, then we count the paths as follows We first create a path, P k 2, of length k 2 from u to that does not pass through any of the points x, y and then we place the edge e = {x, y} in one of k 2 aailable edges in the path P k 2 Since there are two ways of placing the edge e in each slot of P k 2 it follows that the number of paths from u to that go through edge e is equal to 2k 2) n 4 k 3) k 3)! If k n 1)/2 then the later quantity is smaller than n 3 ) k 2 k 2)!, otherwise it is larger than n 3 k 2) k 2)! Therefore, if k n 1)/2, the fractional matching number of H is at least n 2 k 1) k 1)! = n 2 If k > n 1)/2 then the n 3 k 2) k 2)! fractional matching number of H is at least n 2)n 3) 2k 2) We hae thus proen the following 13

Proposition 42 Let G Gn, p) Fix two ertices u, in G and a positie integer k 3 Let E be the eent there is no path of length k between u and If k n 1)/2 then P [E] 1 p k) n 2 If k > n 1)/2, then P [E] 1 p k) n 2)n 3) 2k 2) We end this section with an estimate on the probability that a G Gn, p) contains no ertex of fixed degree Proposition 43 Let G Gn, p) and fix a positie integer d {0, 1,, n 1} Then the probability that there is no ertex in G whose degree equals d is less than or equal to 1 ) n n 1 )p d 1 p) n 1 d d Proof This is yet another application of Theorem 24, so we sketch it Let 1,, n be an enumeration of the ertices of G Let the hypergraph H = V, E) be defined as follows The ertex set V corresponds to the potential) edges of G The edge set E = {E 1,, E n } corresponds to the ertices of G That is, for i = 1,, n the edge E i contains those u V for which the corresponding edges of G are incident to ertex i The result follows from the fact that E = n and the maximum degree of H is equal to 2 2 5 Remarks As mentioned in Janson [11], there exist collections of weakly dependent random ariables that do not hae a dependency graph The dependencies between such collections of random ariables can occasionally be described using an independence system Recall that an independence system is a pair A = V, I) where V is a finite set and I is a collection of subsets of V called the independent sets) with the following properties see [3]): - The empty set is independent, ie, I Alternatiely, at least one subset of V is independent, ie, I ) - Eery subset of an independent set is independent, ie, for each A A A, if A I then A I This is sometimes called the hereditary property 14

Gien a set of random ariables {Y }, we say that their joint distribution is described with an independence system, say A = V, I), if for eery A I the random ariables {Y a } a A are mutually independent Let us remark that this definition includes the case of k-wise independent random ariables see [1, Chapter 16], or [18]) Notice that if {Y } are random ariables whose joint distribution is described with an independence system A = V, I) then {} I, for all V It is easy to see that if the random ariables {Y } hae a dependency graph then their joint distribution is described with an independence system Howeer, the conerse need not be true see [11]) In a similar way as in Section 1, one may define the fractional chromatic number of an independent system as follows A b-fold colouring of an independence system A = V, I) is a function λ : I Z + such that A: A λa) = b, for all V The b-fold chromatic number of A is defined as A) := inf λ A I λa), where the infimum is oer all b-fold colourings, λ ), of A = V, I) Finally, the fractional chromatic number of A is χ χ A) := inf b A) b b With these concepts by hand, one can proe a corresponding Hölder-type inequality using a similar argument as in Theorem 21 As a consequence one can obtain tail bounds similar to Theorem 12 and Theorem 23, the only difference being that the fractional chromatic number of the dependency graph, χ G), is replaced with the fractional chromatic number of the independence system, χ A) We leae the details to the reader Acknowledgements The authors are supported by ERC Starting Grant 240186 MiGraNT, Mining Graphs and Networks: a Theory-based approach We are grateful to an anonymous referee for seeral suggestions that improed the presentation of the paper References [1] N Alon and J Spencer The Probabilistic Method J Wiley and Sons, New York, 3rd edition, 2008 [2] V I Bogache Measure Theory, olume I Springer, 2007 [3] J A Bondy and U S R Murty Graph Theory Graduate Texts in Mathematics 244, Springer, 2008 [4] R Boppana and J Spencer A useful elementary correlation inequality Journal of Combinatorial Theory, Series A, 50:305 307, 1982 [5] S Boyd and L Vandenberghe Conex Optimization Cambridge Uniersity Press, Cambridge UK, 2004 15

[6] P Erdős and L Loász Problems and results on 3-chromatic hypergraphs and some related questions In In A Hajnal, R Rado, and V T Sós, eds Infinite and Finite Sets to Paul Erd?s on his 60th birthday), pages 609 627 North-Holland, 1975 [7] H Finner A generalization of Hölder s inequality and some probability inequalities The Annals of Probability, 204):1893 1901, 1992 [8] D Galin and Y Zhao The number of independent sets in a graph with small maximum degree Graphs and Combinatorics, 27:177 186, 2011 [9] D Gainsky, S Loett, M Saks, S Sriniasan, and A Tail Bound for Read-k Families of Functions Random Structures & Algorithms, 47:99 108, 2015 [10] S Janson New ersions of Suen s correlation inequality Random Structures & Algorithms, 13:467 483, 1998 [11] S Janson Large Deiations for Sums of Partly Dependent Random Variables Random Structures & Algorithms, 24:234 248, 2004 [12] S Janson, T Luczak, and A Rusiński Random Graphs Wiley, New York, 2000 [13] S Janson, L Warnke, and The Lower Tail: Poisson Approximation Reisited Random Structures & Algorithms 2015 [14] K Kolipaka and M Szegedy Moser and Tardos meet Loász In STOC 11 Proceedings of the forty-third annual ACM symposium on Theory of computing, pages 235 244, 2011 [15] M Krieleich, B Sudako, V H Vu, and N C Wormald On the probability of independent sets in random graphs Random Structures & Algorithms, 22:1 14, 2003 [16] R A Moser and G Tardos A constructie proof of the general Loász local lemma Journal of the ACM, 572), 2010 [17] J Ramon, Y Wang, and Z Guo Learning from networked examples Journal of Machine Learning Research to appear) arxi:140526002, 2015 [18] J P Schmidt, A Siegel, and A Sriniasan Chernoff-Hoeffding bounds for applications with limited independence SIAM Journal on Discrete Mathematics, 82):223 250, 1995 [19] G Wolfoitz A concentration result with application to subgraph count Random Structures & Algorithms, 40:254 267, 2012 16