Cliques in random graphs

Similar documents
Lecture 7: February 6

252 P. ERDÖS [December sequence of integers then for some m, g(m) >_ 1. Theorem 1 would follow from u,(n) = 0(n/(logn) 1/2 ). THEOREM 2. u 2 <<(n) < c

The concentration of the chromatic number of random graphs

Independent Transversals in r-partite Graphs

Random Graphs. EECS 126 (UC Berkeley) Spring 2019

ON THE ERDOS-STONE THEOREM

Hadwiger's Conjecture is True for Almost Every Graph

Note on Vertex-Disjoint Cycles

Disjoint Subgraphs in Sparse Graphs 1

A = A U. U [n] P(A U ). n 1. 2 k(n k). k. k=1

The Turán number of sparse spanning graphs

ANOTHER CLASS OF INVERTIBLE OPERATORS WITHOUT SQUARE ROOTS

220 B. BOLLOBÁS, P. ERDŐS AND M. SIMONOVITS then every G(n, m) contains a K,,, 1 (t), where a log n t - d log 2 (1 Jc) (2) (b) Given an integer d > 1

AN IMPROVED ERROR TERM FOR MINIMUM H-DECOMPOSITIONS OF GRAPHS. 1. Introduction We study edge decompositions of a graph G into disjoint copies of

contains a canonical graph.

ASYMPTOTIC DISTRIBUTION OF THE MAXIMUM CUMULATIVE SUM OF INDEPENDENT RANDOM VARIABLES

Reconstructing subgraph-counting graph polynomials of increasing families of graphs

176 5 t h Fl oo r. 337 P o ly me r Ma te ri al s

CLIQUES IN THE UNION OF GRAPHS

The expansion of random regular graphs

1981] 209 Let A have property P 2 then [7(n) is the number of primes not exceeding r.] (1) Tr(n) + c l n 2/3 (log n) -2 < max k < ir(n) + c 2 n 2/3 (l

< k 2n. 2 1 (n 2). + (1 p) s) N (n < 1

2. Linear recursions, different cases We discern amongst 5 different cases with respect to the behaviour of the series C(x) (see (. 3)). Let R be the

RELATIVE N-TH NON-COMMUTING GRAPHS OF FINITE GROUPS. Communicated by Ali Reza Ashrafi. 1. Introduction

PRZEMYSLAW M ATULA (LUBLIN]

The Degree of the Splitting Field of a Random Polynomial over a Finite Field

Minimal Paths and Cycles in Set Systems

Graphs with Large Variance

P i [B k ] = lim. n=1 p(n) ii <. n=1. V i :=

Czechoslovak Mathematical Journal

Nonnegative k-sums, fractional covers, and probability of small deviations

Matchings in hypergraphs of large minimum degree

Bounds for pairs in partitions of graphs

The Degree of the Splitting Field of a Random Polynomial over a Finite Field

1 5 4 PAUL ERDŐS of at most f (m) edges, then x(g(n)) < 3 (or, in a slightly weaker form, its chromatic number is bounded) It is surprising that we ha

Bounds on the generalised acyclic chromatic numbers of bounded degree graphs

Lecture 4: Phase Transition in Random Graphs

Generating all subsets of a finite set with disjoint unions

THE PRESERVATION OF CONVERGENCE OF MEASURABLE FUNCTIONS UNDER COMPOSITION

Decomposing oriented graphs into transitive tournaments

Small subgraphs of random regular graphs

Induced Saturation of Graphs

Graphs with large maximum degree containing no odd cycles of a given length

Chapter 1. Measure Spaces. 1.1 Algebras and σ algebras of sets Notation and preliminaries

arxiv: v1 [math.co] 24 Apr 2014

Testing Equality in Communication Graphs

NOTE ON A THEOREM OF KAKUTANI

Chapter 4. Measure Theory. 1. Measure Spaces

How many randomly colored edges make a randomly colored dense graph rainbow hamiltonian or rainbow connected?

On the number of cycles in a graph with restricted cycle lengths

Notes 6 : First and second moment methods

and 1 limn g k (n)/n 2 (1/2) - 2[ for k > 4, where [x] denotes the integer part of x. In other work [4], [7] the authors discuss the maximum number of

Acyclic subgraphs with high chromatic number

SOME REMARKS ON NILPOTENT GROUPS WITH ROOTS

Random Graphs. 7.1 Introduction

6o) by the requirement that radii map linearly onto radii. Then c1? = g,1 (D") is. [0, 1 ]), where h* is a C'-diffeomorphism except on (Sni, 1/2).

arxiv: v2 [math.nt] 4 Nov 2012

ON THE EVOLUTION OF ISLANDS

A simple branching process approach to the phase transition in G n,p

Coloring Uniform Hypergraphs with Few Colors*

Graphs with maximal Hosoya index and minimal Merrifield-Simmons index

Applications of the Lopsided Lovász Local Lemma Regarding Hypergraphs

The Singapore Copyright Act applies to the use of this document.

SUMS PROBLEM COMPETITION, 2000

Theorem (Special Case of Ramsey s Theorem) R(k, l) is finite. Furthermore, it satisfies,

Packing of Rigid Spanning Subgraphs and Spanning Trees

(1) N(bi, n) < cid ;-

Graphs and Combinatorics

Constructive bounds for a Ramsey-type problem

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Yunhi Cho and Young-One Kim

Jeong-Hyun Kang Department of Mathematics, University of West Georgia, Carrollton, GA

arxiv: v1 [math.ac] 7 Feb 2009

Decomposition of random graphs into complete bipartite graphs

F. R. K. CHUNG AND R. L. GRAHAM. Bell Laboratories, Mupray Hill, New Jersey Communicated by the Managing Editors

The Reduction of Graph Families Closed under Contraction

94 P. ERDÖS AND Á. RÉNYI 1. Generalization of the Borel-Cantelli lemma Let [X, 1,, P] be a probability space in the sense of KOLMOGOROV [6], i. e. X a

Numerical Sequences and Series

A Hilton-Milner-type theorem and an intersection conjecture for signed sets

Minimal Decompositions of Hypergraphs into Mutually Isomorphic Subhypergraphs

Course 311: Michaelmas Term 2005 Part III: Topics in Commutative Algebra

Subhypergraph counts in extremal and random hypergraphs and the fractional q-independence

ON JAMES' QUASI-REFLEXIVE BANACH SPACE

THE ASYMPTOTIC NUMBER OF TRIPLE SYSTEMS NOT CONTAINING A FIXED ONE

Integral closure of powers of the graded maximal ideal in a monomial ring

4 Sums of Independent Random Variables

Connectivity of addable graph classes

Packing and decomposition of graphs with trees

The Borel-Cantelli Group

Multi-colouring the Mycielskian of Graphs

On the Distribution of Multiplicative Translates of Sets of Residues (mod p)

SIZE-RAMSEY NUMBERS OF CYCLES VERSUS A PATH

T h e C S E T I P r o j e c t

ALMOST SURE CONVERGENCE OF THE BARTLETT ESTIMATOR

Decomposition of random graphs into complete bipartite graphs

MATH 117 LECTURE NOTES

IMA Preprint Series # 2385

The cycle polynomial of a permutation group

Decompositions of graphs into cycles with chords

2 ERDOS AND NATHANSON k > 2. Then there is a partition of the set of positive kth powers In k I n > } = A, u A Z such that Waring's problem holds inde

Transcription:

Math. Proe. Camb. Phil. Soc. (1976), 80, 419 MPCPS 80-41 Printed in Great Britain 419 Cliques in random graphs By B. BOLLOBAS AND P. ERDŐS University of Cambridge (Received 17 September 1975) 1. Introduction. Let 0 < p < 1 be fixed and denote by G a random graph with point set N, the set of natural numbers, such that each edge occurs with probability p, independently of all other edges. In other words the random variables ei5, 1 < i < j, defined by _ 1 if (i, j) is an edge of G, et ' - 0 if (i, j) is not an edge of G, are independent r.v.'s with P(e i, = 1) = p, P(eij = 0) = 1-p. Denote by G.n the subgraph of G spanned by the points 1, 2,..., n. These random graphs G, Gn will be investigated throughout the note. As in (1), denote by Kr a complete graph with r points and denote by k,(h) the number of K,'s in a graph H. A maximal complete subgraph is called a clique. In (1) one of us estimated the minimum of k,(h) provided H has as points and m edges. In this note we shall look at the random variables the number of Kr's in Gn,, and Yr = Y(n, r) = k,(g,,), X n = max {r : k,(g,,) > 0}, the maximal size of a clique in Gn. Random graphs of a slightly different kind were investigated in detail by Erdös and Rényi(2). In (4) Matula showed numerical evidence that X,,, has a strong peak around 2 log n/log (1/p). Grimmett and McDiarmid(3) proved that as n -~- oo X./log n -,- 2/log (1/p) with probability one. Independently and earlier Matula(5) proved a considerably finer result about the peak of X n. In particular he proved that, as n -* 00, Xn takes one of at most two values depending on n with probability tending to 1. The main aim of this note is to prove various results about the distribution of X n. We shall also investigate the existence of infinite complete graphs in G. Finally we prove how many colours are likely to be used by a certain colouring algorithm. 2. Cliques infinite graphs. To simplify the notations we shall put b = 1/p. Note first that the probability that a given set of r points spans a complete subgraph of G is p('). Consequently the expectation of Yr = Y(n, r) is Er = E(1 ;) = E(n, r) = (')p. (, (1) 27-2

420 B. BOLLOBÁS AND P. ERDŐS Let d = d(n) be the positive real number for which It is easily checked that d(n) = 2log, n-2logb logb n+2log b (1e) + 1+0(1) = 2 log, n + 0(logb logo n) = lo log b + 0009b logo n). Choose an e, 0 < e < 2. Given a natural number r > 2 let nr be the maximal natural number for which E(nr, r) < r-(1 +E) and let n;, be the minimal natural number for which n cd) (d)p, = i. It is easily checked that n,'.-nr < 3logrbir (2) r and nr = bir+o(bir). (3) Thus, with at most finitely many exceptions, one has and nr < n r < nr+h (nr-nr)í(nr+i-n;,) < 4(bi-1) - lr -1 logr lim (nr+2 - nr+1)/(nr+l - nr) = bi r- THEOREM 1. For a.e. graph G there is a constant c = c(g) such that if n 'r < n g nr+1 for some r > c then X.(G) = r. Proof. Let 0 < q < 1 be an arbitrary constant. We shall consider the random variables Yr = Y(n, r) for log log n (i+r1)logb n < r < 3 logb and large values of n. Note first that the second moment of I ;, is the sum of the probabilities of ordered pairs of K,'s. The probability of two Kr's with l points in common is p2(p-c;) since G must contain 2 `2) - 12) given edges. As one can choose n ('- ) (t) (r- 1 )

Cliques in random graphs 421 ordered pairs of sets of r points with l points in common (since n > 2r), the second moment of Y,, is (n-r~ E(Y,) a=q r J (4) ll r-l (ef. Matula(4». As the variance of YT is and so,2/e2 = = Z I (n ~lpz(g r) 111 lr - 4y, _ 0'z(Yr) = 2Y ( n) ) r) r ll l r _ l p2(' ) (b(< - 1) (r) n-r l r-l (b'')-1) - F (5} r=z lnj a=2 l J r Routine calculations show that, if n is sufficiently large and 3 5 l - r-1, then Consequently F < F3 + Fr- 1. U,/E2 < F2 +Fr +r(f3 +Fr-I) Since this gives < 2n2 (b-1)+er +r 1 6n(b 3-1)+ rér -i~. br > ni+n and r < l3b log n, g yr/e; < ( J(b-1) r4n-2 +Er 1 ) { 1 +r]) < (b-1)r4n -z+2er 1. ( 6) Inequality (6) gives, in fact, the right order of magnitude of o-t/et since (5) implies immediately G,' /E 2 > T2 + F, = ~(b- i) r4n-2 +Er 1 (1-p~' ) We shall use inequality (6) only to conclude P(Yr = 0) < /E ; < br 4 n -z+2er 1. ( 7 ) In particular, by the choice of n ;, P(Y(n', r) = 0) < br4(n;.)-2+ 2E(n,., r) -1 On the other hand, < 3r-~l+e) P(Y(nr+l,r+1) > 0) < E(n,.+l,r+1) < r-<l+e), Consequently, for a fixed r P(3n, n, < n - nr+1, X n + r) < 4r-(1 }e). (8)

422 B. BOLLOBAS AND P. ERDŐS W As E r -( 1 +e) < oo, the Borel-Cantelli lemma implies that for a.e. graph G with the i exception of finitely many is one has X.(G) = r This completes the proof of the theorem. for all n, nr' < n < nr+i Let e = 3. Then the choice of the numbers nr,nr and the definition of d(n) imply easily (cf. inequalities (2) and (3)) that and if r is sufficiently large then d(nr ) < r < d(nr ) max {r - d(nr), d(nr ) - r} < 7 log r < 2 log log n r+i 2 log b r log b log nr+i Thus inequality (8) implies the following extension of the result of Matula(s). then COROLLARY 1. (i) For a.e. graph G there is a constant c = c(g) such that if n > c(g) ~d(n)-2 loglogn \ Xn < [d(n) +2 loglogn L log n log b log n log b (ü) If r is sufficiently large and n r < n < nr+l then P { d(n) - 2lolog log g n log 71 b] < X n < [ d(n) + 2.1109log b]' L 09 n log > 1-1 Or-4. d n, n,. < n < n,,+,) Remark. Note that the upper and lower bounds on X in Corollary 1 differ by at most 1 if n is large and for most values of n they simply coincide. Let us estimate now how steep a peak X n has got near d(n). More precisely, we shall estimate P(X,n < r(n)) and P(X, > r'(n)) for certain functions r(n), r'(n) with r(n) < d(n) < r'(n). The expectation gives a trivial but fairly good bound for the second probability. As for r > d(n) one has E(n, r) < nd(n)-r P(Xn > r'(n)) < nd(n)-r(n) whenever r' (n) > d(n). Furthermore, if 0 < d(n) - r(n) is bounded, K > 0 is a constant and n is sufficiently large, then E(n, r) > Knd~n>-r(n>. Consequently it follows from (7) that if 0 < 8 < 2, 0 < c and n is sufficiently large (depending on p, 8 and c) then Our next result extends this inequality. P(X. < d(n) -S) < an-8. ( 9) THEOREM 2. (i) Let 0 < e, 0 < r(n) < d(n), r(n) oo and put t = t(n) _ [d(n)-a-r(n)] -1. Then P(X n < r(n)) < n-[b ig l if n is sufficiently large.

(ü)let0<e<8<1.then if n is sufficiently large. Cliques in random graphs 423 P(X, - (1-4)d(n)) < n-2 Proof. (i) Put s = [bit] and choose subsets V V of {i,..., n} such that I V n V i and I V I -> n/s (i < i, j -< n, i + j). Then d(lvl) > d(n) - le- 2oloogs > d(n)-2e-t -> r(n)+i-2e. Thus for large n the probability that the subgraph spanned by li does not contain a K, with l > r(n) is less than n_1. As these subgraphs are independent of each other, P(X. < r(n)) < n-8 if n is sufficiently large. (ü) Put r = [(1-8)d(n)+ 1] and let q be a prime between ne and 2ne. Put Q = q 2 +q+ 1, m = [nlq]. Divide {l, 2,..., n) into Q classes, Cl,..., CQ, each having m or m+ 1 elements. Consider the sets Cl,..., CQ as the points of a finite projective geometry. If e is a line of this projective geometry, let Ge be the subgraph of G with point set V = U Ci and with all C,ee those edges of G that join points belonging to different classes. It is clear that almost every r-tuple of V is such that no two points belong to the same class, since Furthermore, mr- \ g r 1 / (, VrI, ). r < d(ivj)-2. Consequently inequality (7) implies that the probability of G e not containing a K, is less than n-1. As e runs over the set of lines of the projective geometry the subgraphs Ge are independent since they have been chosen independently of the existence of edges. Therefore P(G,,, does not contain a K,) < n -Q < n-" as claimed. Remark. Up to now we have investigated the maximal order of a clique. Let us see now which natural numbers are likely to occur as orders of cliques (maximal complete subgraphs). We know that cliques of order essentially greater than d(n) are unlikely to occur. It turns out that cliques of order roughly less than 2d(n) are also unlikely to occur but every other value is likely to be the order of a clique. The probability that r given points span a clique of G n is clearly (1-pr )n-r Thus if z, = Z,(G,,,) denotes the number of cliques of order r in G,, then the expectation of Z, is p ( E (Zr) _ (1- p'r - p('). 0 %~.

424 B. BOLLOBÁS AND P. ERDŐS Denote by J(n) the minimal value of r > 2 for which the right hand side is 1. One can prove rather sharp results analogous to Theorem 1 stating that the orders of cliques occurring are almost exactly the numbers between J(n) - Jd(n) and d(n), but we shall formulate only the following very weak form of the possible results. Given e > 0 a.e. graph G is such that whenever n is sufficiently large and (1+e)logb g < r < (2-E) log gb ' Gn contains a clique of order r, but G n does not contain a clique of order less than or greater than log n (1-e} log b' (2+e) log n logb' 3. Infinite complete subgraphs. We denote by K(x,, x2,...) ( resp. K(x1,.... x,)) the infinite (resp. finite) complete graph with vertex set {x1, x2,...} ( resp. {x1,..., x,,}). We shall always suppose that 1 < x 1 < x 2 <.... We would like to determine the infimum c o of those positive constants c for which a.e. G contains a K(x l, x,_.) such that xn < ell for every n, n(g). Corollary 1 implies that c o -> bl. At the first sight c o = z does not seem to be impossible since a.e. G is such that for every sufficiently large n it contains a K(xl,..., xn ) satisfying xn < b(1+e)n. However, it turns out that a sequence cannot be continued with such a density and, in fact, c a = b. We have the following more precise result. THEOREM 3. (i) Given e > 0 a.e. graph G is such that for every K(x 1,...) C G x > ón(1-e) holds for infinitely many n. (ü) Given e > 0 a.e. G contains a K(x,,...) such that for every sufficiently large n. x n < bn(l+e) Proof. (i) Let 1 < xl < x 2 <... < xn < bn( 1-c). Then the probability that there is a point xn+, < b(n+1)u-e) joined to every xi, 1 < i < n, is less than b(n+1) (1-e)pn = bl-(n+1) e Thus the probability that G contains a K(xl,..., xn ) satisfying is less than xk < bk(1- e) (k = n N) bn N Pn,1V = rj b 1-ke (n n+1 n (There are at most `n 1 sequences 1 < x1 <... < xn < bn( 1-e). Clearly Pn,N -> 0 as N -+ oo, so the assertion follows.

Cliques in random graphs 425. (ü) Let Pn be the probability that G contains a K(xl,..., x n) with xn < bna+e). We know that Pn 1 Given 1 < x1 <... < xn < bn( 1 ) let us estimate the probability Qn+1 that there is a point xn+l, xn < xn+l < b(n}1)(1-(x), which is joined to every xi, 1 < i S n. There are [b(n+l) (1+e)-x.] > [b(n+1)(1+0-bn(l+e)] > b(n+1)(1+0 = B n+1 independent choices for xn+1, where 0 < y < e and n is sufficiently large. The probability that a point is not joined to each of {x1,..., xn} is 1- pn. Consequently 1 -Qn+1 < (1- pn ) B,, < e--b-). Therefore the probability that G contains a K(x xn ), xn < bn( l+e), which can be extended to a K(xl, x2,...) by choosing first xn+1 < b(n+l) (l+e), then xn+2 < b(n+2) (1+E) etc., is at least T7 Rn =Pn 11Qk n+1 As Rn -> 1 (n --> oo), the proof is complete. 4. Colouring by the greedy algorithm. Given a graph G with points 1, 2,..., the greedy algorithm (see (3)) colours G with colours c 1, c 2,... as follows. Suppose the points 1,..., n have already been coloured. Then the algorithm colours n+ 1 with colour c, where j is the maximal integer such that for each i < j the point n+ 1 is joined to a point xi S n with colour c i. In other words the algorithm colours the point n+ 1 with the colour having the minimal possible index. Denote by ~n =.1 (G) = x(g n) the number of colours used by this algorithm to colour G n. Out next result extends a theorem of Grimmer and McDiarmid(3) stating that n [(Iogn)Jn] -+ log i1q in mean, where q = 1-p. An immediate corollary of our result is that xj(log n)/n] log 11q in any mean (with a given rate of convergence) and almost surely. As usual, {x} denotes the least integer not less than x. TnEOaEnz 4. (i) Let 0 < y < z be fixed and let u(n) 3 y -i be an arbitrary function. If n is sufficiently large then P log n -11 <logs/q(1+u(n) (log ifq)l(log n) - 1) < n -Yu 2 ( n)+ l ~nn J (ü) Let 3 -< v(n) < log n(log log n)-1 and put t(n) = 1- v(n) log log n(log n) -1, c(n) _ Phen for every sufficiently large n we have p(~n > c(n)) < nlog 1Jq ~t(n) log n)' Proof. (i) Let Mj be the probability that G n has at least k = k(n) = lóáiq +u(n ) (log n)i (log ifq)- 1 = kl (n)+k 2(n) = k 1 +k2 Points of colour cf. It suffices to show that if n is sufficiently large then 117 < n -Y112 '

426 B. BOLLOBés AND P. ERDÖs since then the probability that there is a colour class with at least k(n) points is at most nnyu2 = n-yu'+1 Grimmett and McDiarmid showed ((3), p. 321), that [k(n)1-1 -A1I < rl (1-(1-qi)n). i--1 Since k2 oo as n oo we may suppose that k2-5k2+6 > 2yk2. Then taking into account that qk' = n-1, qk, = n uz [k-11.i 1j -< ri nqi _< nkx-2q' (ka-2x2k1+kz 3) i=[k,+11 = gi(ká-bk$+6) < qyk' = n-yul(n) as required. (ü) Let us estimate the probability that the greedy algorithm has to use more than c(n) colours to colour the first n points. The probability that this happens when ki points have colour i, i < c(n), is exactly c(n) i=1 {1-qki) (1-gnle(n))c(n) C(n) since ki -< n- 1. 1 Thus the probability that more than c(n) colours have to be used to colour the points 1,..., n is less than Sn = n(1-gnlc(n))c(n) Now (1 - gnlo(n))c(n) < e--c(n) (1(q)-n1 c n) and log (c(n)(1fq)-c(n)) > (v(n)-2) log logn if n is sufficiently large. Consequently S < e109n-{109n)v(10-1 < e-(10971)ví1)-2 n for n sufficiently large, completing the proof. 5. Final remarks. (i) Colouring random graphs. It is very likely that the greedy algorithm uses twice as many colours as necessary and, in fact, x(g,) log n z log 11q for a.e. graph G. (x(gn) denotes the chromatic number of Gn.) One would have a good chance of proving this if the bound n-n" in Theorem 2 (ü) could be replaced by e~, n for some positive constant ce, depending on e. (ü) Hypergraphs. Let k > 2 be a natural number and consider random k-hypergraphs (k-graphs) on N such that the probability of a given set of k points forming a k-tuple of the graph is p, independently of the existence of other k-tuples. The proofs of our edge graph results can easily be modified to give corresponding results about random k-graphs. Let us mention one or two of these results. As before, denote by X" the maximal order of a complete subgraph of Gn.

Cliques in random graphs 427 The expectation of the number of complete graphs of order r is clearly (n) p(, P r Let dk.(n) > 1 be the minimal value for which this is equal to 1. It is easily seen that d1jn) k! log n 11(k-1) (log lip Then, corresponding to Corollary 1, we have the following result. For every e > 0 lim P([dk (n) - e] -< Xn -< [d,(n)+e]) = 1. n- Denote by,,,,(g) the number of colours used by the greedy algorithm to colour G.n. Then 1k (logn)11(k-1) ((k-1) loo, W, n 1 in any mean. It is expected that in general the greedy algorithm uses kil(k-1 ) times as many colours as necessary. REFERENCES (1) BOLLOBAS, B. On complete subgraphs of different orders. 117ath. Proc. Cambridge Philos. Soc. 79 (1976), 19-24. (2) ERDŐS, P. and RANYi, A. On the evolution of random graphs. Publ. Math. Inst. Hung. Acad. Sei. 5 A (1960), 17-61. (3) GRIMMETT, G. R. and McDiARrnD, C. J. H. On colouring random graphs. Math. Proc. Cambridge Philos. Soc. 77 (1975), 313-324. (4) AIATuLA, D. W. On the complete subgraphs of a random graph. Combinatory Mathematics and its Applications (Chapel Hill, N.C., 1970), 356-369. (5) MATULA, D. W. The employee party problem. (To appear.)