Lecture 2 Long paths in random graphs

Similar documents
Lecture 2. The Lovász Local Lemma

An Introduction to Randomized Algorithms

Disjoint Systems. Abstract

Application to Random Graphs

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

Edge Disjoint Hamilton Cycles

7.1 Convergence of sequences of random variables

Problem Set 2 Solutions

The random version of Dvoretzky s theorem in l n

Lecture 2: Concentration Bounds

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.

7.1 Convergence of sequences of random variables

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Math 104: Homework 2 solutions

A note on log-concave random graphs

Fundamental Theorem of Algebra. Yvonne Lai March 2010

Large holes in quasi-random graphs

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 2 9/9/2013. Large Deviations for i.i.d. Random Variables

Math 216A Notes, Week 5

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Notes for Lecture 11

The natural exponential function

Lecture 3: August 31

On Algorithm for the Minimum Spanning Trees Problem with Diameter Bounded Below

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

LONG SNAKES IN POWERS OF THE COMPLETE GRAPH WITH AN ODD NUMBER OF VERTICES

On two Hamilton cycle problems in random graphs

Lecture 11: Pseudorandom functions

Statistics 511 Additional Materials

Few remarks on Ramsey-Turán-type problems Benny Sudakov Λ Abstract Let H be a fixed forbidden graph and let f be a function of n. Denote by RT n; H; f

HOMEWORK 2 SOLUTIONS

Independence number of graphs with a prescribed number of cliques

Math 155 (Lecture 3)

CS322: Network Analysis. Problem Set 2 - Fall 2009

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 3 9/11/2013. Large deviations Theory. Cramér s Theorem

It is often useful to approximate complicated functions using simpler ones. We consider the task of approximating a function by a polynomial.

The multiplicative structure of finite field and a construction of LRC

On matchings in hypergraphs

w (1) ˆx w (1) x (1) /ρ and w (2) ˆx w (2) x (2) /ρ.

This section is optional.

Seunghee Ye Ma 8: Week 5 Oct 28

MA131 - Analysis 1. Workbook 3 Sequences II

Optimally Sparse SVMs

On Random Line Segments in the Unit Square

Self-normalized deviation inequalities with application to t-statistic

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013

Integrable Functions. { f n } is called a determining sequence for f. If f is integrable with respect to, then f d does exist as a finite real number

If a subset E of R contains no open interval, is it of zero measure? For instance, is the set of irrationals in [0, 1] is of measure zero?

Machine Learning Theory Tübingen University, WS 2016/2017 Lecture 12

An analog of the arithmetic triangle obtained by replacing the products by the least common multiples

Random Walks on Discrete and Continuous Circles. by Jeffrey S. Rosenthal School of Mathematics, University of Minnesota, Minneapolis, MN, U.S.A.

Advanced Stochastic Processes.

Math 2784 (or 2794W) University of Connecticut

Disjoint set (Union-Find)

Law of the sum of Bernoulli random variables

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Week 5-6: The Binomial Coefficients

Lecture 7: Properties of Random Samples

Frequentist Inference

Confidence Intervals for the Population Proportion p

# fixed points of g. Tree to string. Repeatedly select the leaf with the smallest label, write down the label of its neighbour and remove the leaf.

Solutions to selected exercise of Randomized Algorithms

The standard deviation of the mean

Distribution of Random Samples & Limit theorems

Math 140A Elementary Analysis Homework Questions 3-1

Math 220A Fall 2007 Homework #2. Will Garner A

6.3 Testing Series With Positive Terms

Lecture 10 October Minimaxity and least favorable prior sequences

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 6 9/23/2013. Brownian motion. Introduction

A Proof of Birkhoff s Ergodic Theorem

4 The Sperner property.

Chapter 5. Inequalities. 5.1 The Markov and Chebyshev inequalities

On a Smarandache problem concerning the prime gaps

TR/46 OCTOBER THE ZEROS OF PARTIAL SUMS OF A MACLAURIN EXPANSION A. TALBOT

Estimation for Complete Data

Sequences and Limits

Basics of Probability Theory (for Theory of Computation courses)

Shannon s noiseless coding theorem

3. Sequences. 3.1 Basic definitions

lim za n n = z lim a n n.

Series III. Chapter Alternating Series

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

Fall 2013 MTH431/531 Real analysis Section Notes

MA131 - Analysis 1. Workbook 2 Sequences I

Math 299 Supplement: Real Analysis Nov 2013

1. By using truth tables prove that, for all statements P and Q, the statement

Lecture 14: Graph Entropy

Lecture 16: Monotone Formula Lower Bounds via Graph Entropy. 2 Monotone Formula Lower Bounds via Graph Entropy

Lecture 9: Hierarchy Theorems

How to Maximize a Function without Really Trying

STAT Homework 1 - Solutions

Math 451: Euclidean and Non-Euclidean Geometry MWF 3pm, Gasson 204 Homework 3 Solutions

Spectral Partitioning in the Planted Partition Model

MDIV. Multiple divisor functions

Notes 27 : Brownian motion: path properties

Lecture 12: November 13, 2018

Approximations and more PMFs and PDFs

Randomized Algorithms I, Spring 2018, Department of Computer Science, University of Helsinki Homework 1: Solutions (Discussed January 25, 2018)

SOME GENERALIZATIONS OF OLIVIER S THEOREM

Assignment 5: Solutions

Transcription:

Lecture Log paths i radom graphs 1 Itroductio I this lecture we treat the appearace of log paths ad cycles i sparse radom graphs. will wor with the probability space G(, p) of biomial radom graphs, aalogous results for the sister model G(, m) ca be either prove usig very similar argumets, or derived usig available equivalece statemets betwee the two models. Our goal here is two-fold: we first prove that already i the super-critical regime p = 1+ɛ, the radom graph G(, p) cotais typically a path of legth liear i ; the we prove that i the regime p = C, the radom graph G(, p) has typically a path, coverig the proportio of vertices tedig to 1 as the costat C icreases. We will ivoe the approaches ad results developed i Lecture 1 (the Depth First Search Algorithm ad its cosequeces) to achieve both of these goals, ad i fact i a rather short ad elegat way. We Liearly log paths i the supercritical regime I their groudbreaig paper [] from 1960, Paul Erdős ad Alfréd Réyi made the followig fudametal discovery: the radom graph G(, p) udergoes a remarable phase trasitio aroud the edge probability p() = 1 1 ɛ. For ay costat ɛ > 0, if p =, the G(, p) has whp all coected compoets of size at most logarithmic i, while for p = 1+ɛ whp a coected compoet of liear size, usually called the giat compoet, emerges i G(, p) (they also showed that whp there is a uique liear sized compoet). The Erdős-Réyi paper, which lauched the moder theory of radom graphs, has had eormous ifluece o the developmet of the field. We will be able to derive both parts of this result very soo. Although for the super-critical case p = 1+ɛ the result of Erdős ad Réyi shows a typical existece of a liear sized coected compoet, it does ow imply that a logest path i such a radom graph is whp liearly log. This was established some 0 years later by Ajtai, Komlós ad Szemerédi [1]. I this sectio we preset a (relatively) simple proof of their result. We will ot attempt to achieve the best possible absolute costats, aimig rather for simplicity. Our treatmet follows closely that of []. 1

The most fudametal idea of the proof is to ru the DFS algorithm o a radom graph G G(, p), costructig the graph o the fly, as the algorithm progresses. We first fix the order σ o V (G) = [] to be the idetity permutatio. Whe the DFS algorithm is fed with a sequece of i.i.d. Beroulli(p) radom variables X = (X i ) N i=1, so that is gets its i-th query aswered positively if X i = 1 ad aswered egatively otherwise, the so obtaied graph is clearly distributed accordig to G(, p). Thus, studyig the compoet structure of G ca be reduced to studyig the properties of the radom sequece X. This is a very useful tric, as it allows to flatte the radom graph by replacig a iheretly two-dimesioal structure (a graph) by a oe-dimesioal oe (a sequece of bits). I particular, observe crucially that as log as T, every positive aswer to a query results i a vertex beig moved from T to U, ad thus after t queries ad assumig T still, we have S U t i=1 X i. (The last iequality is strict i fact as the first vertex of each coected compoet is moved from T to U for free, i.e., without eed to get a positive aswer to a query.) O the other had, sice the additio of every vertex, but the first oe i a coected compoet, to U is caused by a positive aswer to a query, we have at time t: U 1 + t i=1 X i. The probabilistic part of our argumet is provided by the followig quite simple lemma. Lemma.1 Let ɛ > 0 be a small eough costat. Cosider the sequece X = (X i ) N i=1 Beroulli radom variables with parameter p. of i.i.d. 1. Let p = 1 ɛ. Let = 7 l. The whp there is o iterval of legth i [N], i which at ɛ least of the radom variables X i tae value 1.. Let p = 1+ɛ. Let N 0 = ɛ. The whp N0 i=1 X i ɛ(1+ɛ) /. Proof 1) For a give iterval I of legth i [N], the sum i I X i is distributed biomially with parameters ad p. Applyig the Cheroff boud to the upper tail of B(, p), ad the the uio boud, we see that the probability of the existece of a iterval violatig the assertio of the lemma is at most for small eough ɛ > 0. (N + 1)P r[b(, p) ] < e ɛ (1 ɛ) < e ɛ (1 ɛ) 7 ɛ l = o(1), ) The sum N 0 i=1 X i is distributed biomially with parameters N 0 ad p. Hece, its expectatio is N 0 p = ɛ p = ɛ(1+ɛ), ad its stadard deviatio is of order. Applyig the Chebyshev iequality, we get the required estimate. Now we are ready to formulate ad to prove the result of this sectio. Theorem. Let ɛ > 0 be a small eough costat. Let G G(, p). 1. Let p = 1 ɛ. The whp all coected compoets of G are of size at most 7 ɛ l.. Let p = 1+ɛ. The whp G cotais a path of legth at least ɛ.

I both cases, we ru the DFS algorithm o G G(, p), ad assume that the sequece X = (X i ) N i=1 of radom variables, defiig the radom graph G G(, p) ad guidig the DFS algorithm, satisfies the correspodig part of Lemma.1. Proof 1) Assume to the cotrary that G cotais a coected compoet C with more tha = 7 ɛ l vertices. Let us loo at the epoch of the DFS whe C was created. Cosider the momet iside this epoch whe the algorithm has foud the ( + 1)-st vertex of C ad is about to move it to U. Deote S = S C at that momet. The S U =, ad thus the algorithm got exactly positive aswers to its queries to radom variables X i durig the epoch, with each positive aswer beig resposible for revealig a ew vertex of C, after the first vertex of C was put ito U i the begiig of the epoch. At that momet durig the epoch oly pairs of edges touchig S U have bee queried, ad the umber of such pairs is therefore at most ( ) + ( ) <. It thus follows that the sequece X cotais a iterval of legth at most with at least 1 s iside a cotradictio to Property 1 of Lemma.1. ) Assume that the sequece X satisfies Property of Lemma.1. We claim that after the first N 0 = ɛ queries of the DFS algorithm, the set U cotais at least ɛ vertices (with the cotets of U formig a path of desired legth at that momet). Observe first that S < at time N 0. Ideed, if S, the let us loo at a momet t where S = (such a momet surely exists as vertices flow to S oe by oe). At that momet U 1 + t i=1 X i < by Property of Lemma.1. The T = S U, ad the algorithm has examied all S T 9 > N 0 pairs betwee S ad T (ad foud them to be o-edges) a cotradictio. Let us retur to time N 0. If S < ad U < ɛ the, we have T. This meas i particular that the algorithm is still revealig the coected compoets of G, ad each positive aswer it got resulted i movig a vertex from T to U (some of these vertices may have already moved further from U to S). By Property of Lemma.1 the umber of positive aswers at that poit is at least ɛ(1+ɛ) /. ( Hece we have ) S U ɛ(1+ɛ) /. If U ɛ, the S ɛ + ɛ 10 /. All S T S S ɛ pairs betwee S ad T have bee probed by the algorithm (ad aswered i the egative). We thus get: ɛ ( ) ( ) ( = N 0 S S ɛ ɛ + ɛ 10 / ɛ ) ɛ + / = ɛ + ɛ 0 O(ɛ ) > ɛ (we used the assumptio S < ), ad this is obviously a cotradictio, completig the proof. Let us discuss the obtaied result ad its proof. First, give the probable existece of a log path i G(, p), that of a log cycle is just oe short step further. Ideed, we ca use sprilig as follows. Let p = 1+ɛ for small ɛ > 0. Write 1 p = (1 p 1)(1 p ) with p = ɛ ; thus, most of. Let ow G G(, p), G 1 G(, p 1 ), G G(, p ), we ca the probability p goes ito p 1 1+ɛ/ represet G = G 1 G. By Theorem., G 1 whp cotais a liearly log path P. Now, the edges

of G ca be used to close most of P ito a cycle there is whp a edge of G betwee the first ad the last / (say) vertices of P. The depedecies o ɛ i both parts of Theorem. are of the correct order of magitude for p = 1 ɛ a largest coected compoet of G(, p) is ow to be whp of size Θ(ɛ ) log while for p = 1+ɛ a logest cycle of G(, p) is whp of legth Θ(ɛ ). Observe that usig a Cheroff-type boud for the tales of the biomial radom variable istead of the Chebyshev iequality would allow to claim i the secod part of Lemma.1 that the sum N0 i=1 X i is close to ɛ(1+ɛ) with probability expoetially close to 1. This would show i tur, employig the argumet of Theorem., that G(, p) with p = 1+ɛ cotais a path of legth liear i with expoetially high probability, amely, with probability 1 exp{ c(ɛ)}. As we have metioed i Lecture 1, the DFS algorithm is applicable equally well to directed graphs. Hece essetially the same argumet as above, with obvious mior chages, ca be applied to the model D(, p) of radom digraphs. It the yields the followig theorem: Theorem. Let p = 1+ɛ, for ɛ > 0 costat. directed path ad a directed cycle of legth Θ(ɛ ). The the radom digraph D(, p) has whp a This recovers the result of Karp [4]. Nearly spaig paths Cosider ow the regime p = C, where C is a (large) costat. Our goal is to prove that whp i G(, p), the legth of a logest path approaches as C teds to ifiity. This too is a classical result due to Ajtai, Komlós ad Szemerédi [1], ad idepedetly due to Feradez de la Vega []. It is fairly amusig to see how easily it ca be derived usig the DFS-based tools we developed i Lecture 1. Theorem.1 For every ɛ > 0 be a small eough costat there exists C = C(ɛ) > 0 such that the followig is true. Let G G(, p), where p = C. The whp G cotais a path of legth at least (1 ɛ). Proof Let = ɛ. By Propositio. from Lecture 1 it suffices to show that G G(, p) cotais whp a edge betwee every pair of disjoit subsets of size of V (G). For a give pair of disjoit sets S, T of size S = T =, the probability that G cotais o edges betwee S ad T is exactly (1 p) (all pairs betwee S ad T come out o-edges i G). Usig the uio boud, we obtai that the probability of the existece of a pair violatig this requiremet is at most ( )( ) ( ) ( e ) [ (e ) ] (1 p) < (1 p) < e p( 1) < e C( 1). Recallig the value of ad taig C = l(/ɛ ) ɛ guaratees that the above estimate vaishes (i fact expoetially fast i ), thus establishig the claim. 4

A similar statemet holds for the probability space D(, p) of radom directed graphs, with a essetially idetical proof. Refereces [1] M. Ajtai, J. Komlós ad E. Szemerédi, The logest path i a radom graph, Combiatorica 1 (1981), 1 1. [] P. Erdős ad A. Réyi, O the evolutio of radom graphs, Publ. Math. Ist. Hugar. Acad. Sci. (1960), 17 61. [] W. Feradez de la Vega, Log paths i radom graphs, Studia Sci. Math. Hugar. 14 (1979), 40. [4] R. Karp, The trasitive closure of a radom digraph, Radom Structures ad Algorithms 1 (1990), 7 9. [] M. Krivelevich ad B. Sudaov, The phase trasitio i radom graphs a simple proof, Radom Structures ad Algorithms, to appear.