Distribution of subgraphs of random regular graphs

Similar documents
MATH 5707 HOMEWORK 4 SOLUTIONS 2. 2 i 2p i E(X i ) + E(Xi 2 ) ä i=1. i=1

Maximizing the number of nonnegative subsets

More metrics on cartesian products

arxiv: v1 [math.co] 1 Mar 2014

Finding Dense Subgraphs in G(n, 1/2)

Graph Reconstruction by Permutations

Complete subgraphs in multipartite graphs

Problem Set 9 Solutions

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

TAIL BOUNDS FOR SUMS OF GEOMETRIC AND EXPONENTIAL VARIABLES

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

Math 426: Probability MWF 1pm, Gasson 310 Homework 4 Selected Solutions

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 12 10/21/2013. Martingale Concentration Inequalities and Applications

The L(2, 1)-Labeling on -Product of Graphs

n ). This is tight for all admissible values of t, k and n. k t + + n t

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

Stanford University CS359G: Graph Partitioning and Expanders Handout 4 Luca Trevisan January 13, 2011

Lecture Space-Bounded Derandomization

Anti-van der Waerden numbers of 3-term arithmetic progressions.

Min Cut, Fast Cut, Polynomial Identities

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Credit Card Pricing and Impact of Adverse Selection

Goodness of fit and Wilks theorem

An Introduction to Morita Theory

DONALD M. DAVIS. 1. Main result

The lower and upper bounds on Perron root of nonnegative irreducible matrices

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Canonical transformations

Volume 18 Figure 1. Notation 1. Notation 2. Observation 1. Remark 1. Remark 2. Remark 3. Remark 4. Remark 5. Remark 6. Theorem A [2]. Theorem B [2].

Eigenvalues of Random Graphs

FINITELY-GENERATED MODULES OVER A PRINCIPAL IDEAL DOMAIN

Notes on Frequency Estimation in Data Streams

Edge Isoperimetric Inequalities

U.C. Berkeley CS294: Spectral Methods and Expanders Handout 8 Luca Trevisan February 17, 2016

The Order Relation and Trace Inequalities for. Hermitian Operators

Caps and Colouring Steiner Triple Systems

Remarks on the Properties of a Quasi-Fibonacci-like Polynomial Sequence

Every planar graph is 4-colourable a proof without computer

NP-Completeness : Proofs

Limited Dependent Variables

NUMERICAL DIFFERENTIATION

BOUNDEDNESS OF THE RIESZ TRANSFORM WITH MATRIX A 2 WEIGHTS

Linear, affine, and convex sets and hulls In the sequel, unless otherwise specified, X will denote a real vector space.

Lecture 4: Universal Hash Functions/Streaming Cont d

Lecture 4 Hypothesis Testing

CHAPTER 14 GENERAL PERTURBATION THEORY

Self-complementing permutations of k-uniform hypergraphs

Beyond Zudilin s Conjectured q-analog of Schmidt s problem

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE

Lecture Notes on Linear Regression

Matrix Approximation via Sampling, Subspace Embedding. 1 Solving Linear Systems Using SVD

FACTORIZATION IN KRULL MONOIDS WITH INFINITE CLASS GROUP

HMMT February 2016 February 20, 2016

U.C. Berkeley CS294: Beyond Worst-Case Analysis Handout 6 Luca Trevisan September 12, 2017

Exercises of Chapter 2

Stanford University CS254: Computational Complexity Notes 7 Luca Trevisan January 29, Notes for Lecture 7

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

k(k 1)(k 2)(p 2) 6(p d.

6. Stochastic processes (2)

arxiv: v3 [cs.dm] 7 Jul 2012

6. Stochastic processes (2)

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family

MATH 281A: Homework #6

Prof. Dr. I. Nasser Phys 630, T Aug-15 One_dimensional_Ising_Model

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

Joint Statistical Meetings - Biopharmaceutical Section

Expected Value and Variance

Character Degrees of Extensions of PSL 2 (q) and SL 2 (q)

THE CHINESE REMAINDER THEOREM. We should thank the Chinese for their wonderful remainder theorem. Glenn Stevens

Homework Notes Week 7

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

ISSN: ISO 9001:2008 Certified International Journal of Engineering and Innovative Technology (IJEIT) Volume 3, Issue 1, July 2013

Stanford University Graph Partitioning and Expanders Handout 3 Luca Trevisan May 8, 2013

Difference Equations

Randić Energy and Randić Estrada Index of a Graph

REGULAR POSITIVE TERNARY QUADRATIC FORMS. 1. Introduction

Economics 101. Lecture 4 - Equilibrium and Efficiency

Module 9. Lecture 6. Duality in Assignment Problems

Computing MLE Bias Empirically

Partitions and compositions over finite fields

Homework Assignment 3 Due in class, Thursday October 15

General viscosity iterative method for a sequence of quasi-nonexpansive mappings

Research Article Green s Theorem for Sign Data

Calculation of time complexity (3%)

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

The optimal delay of the second test is therefore approximately 210 hours earlier than =2.

First day August 1, Problems and Solutions

Discrete Mathematics. Laplacian spectral characterization of some graphs obtained by product operation

A Simple Research of Divisor Graphs

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Introduction to Vapor/Liquid Equilibrium, part 2. Raoult s Law:

20. Mon, Oct. 13 What we have done so far corresponds roughly to Chapters 2 & 3 of Lee. Now we turn to Chapter 4. The first idea is connectedness.

Chat eld, C. and A.J.Collins, Introduction to multivariate analysis. Chapman & Hall, 1980

Case A. P k = Ni ( 2L i k 1 ) + (# big cells) 10d 2 P k.

Negative Binomial Regression

Supplement to Clustering with Statistical Error Control

Math 217 Fall 2013 Homework 2 Solutions

MATH 241B FUNCTIONAL ANALYSIS - NOTES EXAMPLES OF C ALGEBRAS

First Year Examination Department of Statistics, University of Florida

Transcription:

Dstrbuton of subgraphs of random regular graphs Zhcheng Gao Faculty of Busness Admnstraton Unversty of Macau Macau Chna zcgao@umac.mo N. C. Wormald Department of Combnatorcs and Optmzaton Unversty of Waterloo Waterloo ON Canada N2L 3G1 nwormald@uwaterloo.ca 1 Introducton The asymptotc dstrbuton of small subgraphs of a random graph has been bascally worked out (see Rucńsk [5] for example). But for random regular graphs, the man technques for provng, for nstance, asymptotc normalty, do not seem to be usable. One very recent result n ths drecton s to be found n [3], where swtchngs were appled to cycle counts. The am of the present note s to show that another very recent method of provng asymptotc normalty, gven by the authors n [1], can easly be appled to ths problem. In partcular, t requres consderably less work than usng swtchngs. The applcaton s, however, not drect, n the sense that the result obtaned s very weak f the random varable countng copes of a subgraph s examned drectly. We obtan a much stronger result by consderng solated copes of a subgraph. To be specfc, we nvestgate the probablty space G n,d of unformly dstrbuted random d-regular graphs on n vertces (whch we assume to be {1, 2,..., n}). Asymptotcs are for n, and here d s not fxed but may vary wth n (though for all our results there s an upper bound on the growth of d, at least mplctly). As usual, we mpose the restrcton that for the asymptotcs, the odd values of n are omtted n the case of odd d. Research supported by NSERC and Unversty of Macau Research supported by the Canada Research Chars program and NSERC. 1

We use µ(g) and ν(g) for the numbers of edges and vertces of a graph G respectvely. A graph G s strctly balanced f µ(g) ν(g) > µ(g 1) ν(g 1 ) (1.1) for all nontrval proper subgraphs G 1 of G. A standard example: every connected regular graph s strctly balanced. Throughout ths paper, [x] m denotng the fallng factoral: x(x 1) (x m + 1). We wll use the followng from [1] to deduce asymptotc normalty. Theorem 1 Let s n > µ 1 n and σ n = µ n + µ 2 ns n, (1.2) where 0 < µ n. Suppose that µ n = o(σ 3 n), (1.3) and a sequence {X n } of nonnegatve random varables satsfes ( k E[X n ] k µ k 2 ) s n n exp 2 (1.4) unformly for all ntegers k n the range cµ n /σ n k c µ n /σ n for some constants c > c > 0. Then (X n µ n )/σ n tends n dstrbuton to the standard normal as n. We also use McKay [4, Theorem 2.10], n the form of the followng smpler specal case stated n [3]. Here, G denotes a random element of G n,d, E denotes the edge set, and K n s the complete graph on n vertces (the same vertex set as G). Theorem 2 For any d and n such that G n,d 0, let J E(K n ). Then, wth j the number of edges n J ncdent wth vertex, (a) f J + 2d 2 nd/2 then P(J E(G)) (b) f 2 J + 4d(d + 1) nd/2 then nk=1 [d] jk 2 J [nd/2 2d 2 ] J ; P(J E(G)) nk=1 [d] jk 2 J [nd/2 1] J ( ) J n 2d 2. n + 2d Corollary 1 Provded d J = o(n), the hypotheses of Theorem 2 mply that P(J E(G)) = nk=1 [d] jk (nd) J ( 1 + O ( (d J /n) 2 )). 2

2 Dstrbuton of number of copes of a graph Throughout ths secton, G denotes a random graph n G n,d. Let H be a fxed strctly balanced graph wth maxmum vertex degree d. Let p and q be the number of vertces and edges of H. A copy of H n G s a subgraph of G whch s somorphc to H. The use of Theorem 1 calls for computng hgh moments of a random varable. It turns out that the random varable countng copes of H has badly behaved moments and consequently does not produce a very useful result. Instead we consder a related random varable whose behavour s more easly analysed. We say that a subgraph of a graph G somorphc to H s an solated copy of H f t shares no edges wth any other subgraph of G somorphc to H. Let X H be the random varable whch s the number of solated copes of H n a random d-regular graph. Let a denote the order of the automorphsm group of H. Set µ = P(H 1 G)[n] p /a (2.1) where H 1 s a fxed copy of H on the vertex set {1, 2,..., V (H)}. The probablty that any gven copy of H n K n occurs n G s equal to P(H 1 G), and there are [n] p /a such copes. Hence, µ s the expected number of copes of H n G, solated or not. By Corollary 1, for d = o(n) (notng that q s fxed), where f = Θ(g) f f = O(g) and g = O(f). Also, let µ = Θ(n p q d q ) (2.2) r = r(n, d, H) = P(H 1 s not solated H 1 G). (2.3) Fx a proper subgraph F of H 1 contanng at least one edge, and consder the probablty that G contans not only H 1 but also the edges of a subgraph H 2 = H wth H1 H 2 = F. Agan usng Corollary 1, ths (uncondtonal) probablty s Θ(n p p(f ) (d/n) 2q q(f ) ) for d = o(n). Snce there s a bounded number of such subgraphs F, we have for d = o(n), r = Θ(r H (n, d)) where r H (n, d) = n p p(f ) ( d n) q q(f ), (2.4) p(f ) = V (F ), q(f ) = E(F ), and F s a subgraph of H whch maxmses n q(f ) p(f ) d q(f ) subject to 1 q(f ) < q. (See [2, Secton 3.2] for a related dscusson n the settng of random graphs wthout the regularty condton.) Theorem 3 Defne µ and r as n (2.1) and (2.3). Suppose that µ, µ = o(n), µ = o(n 2 /d 2 ) and r = O(1/ µ). Then (X H µe r )/σ tends n dstrbuton to the standard normal as n, where σ 2 = µe r. Note 1 If r = o(1/ µ) then the mean and varance of the asymptotc dstrbuton can both be taken as µ. Moreover, the proof of the theorem then smplfes consderably. However, by ncludng the case r 1/ µ we hghlght why the method does not easly extend. 3

Note 2 The dstrbuton result n [3], whch s only for cycles, does not extend to the full range of d covered by Theorem 3. (It does however apply to non-fxed subgraphs, a modfcaton whch could also be done easly usng the technques of the present paper.) One could presumably extend the methods used n [3] to obtan dstrbuton results for all strctly balanced subgraphs, but ths s not as economcal as the method n the present paper, and we beleve that the range of d obtaned would not be any greater than that n Theorem 3. Note 3 Dstrbuton results for subgraph counts n the other common models of random graphs apply for wder ranges of densty of the parent graph than expressed n Theorem 3. Gven the much greater accessablty of those models due to edge ndependence, ths s not very surprsng. Proof of Theorem 3 We compute the k th factoral moment E[X H ] k, for k = O( µ), k. Note that ( ) E[X H ] k = P A J (2.5) J 1,...,J k E(K n) where A J denotes the event that J E(G) and forms an solated copy of H. (For k = 1, ths dffers from (2.1) because the copes here are solated.) To fnd the number of nonzero summands contrbutng n (2.5), for whch a prerequste s that the J are parwse dsjont, consder placng k ordered copes of H on the vertces of the complete graph. Snce k = O( µ) = o( n), the number of ways of dong ths, where each copy s placed ndependently (gnorng possble overlaps) s asymptotc to n kp a k, (2.6) and we get the same expresson f we nsst that the copes have dsjont vertex sets (so after j copes have been placed there are n pj vertces to choose from). Thus, by sandwchng, ths s also asymptotcally the number of ways of choosng edge-dsjont copes, as requred for solated copes, and almost all these placements are parwse vertex-dsjont. Clearly ( ) P A J P(J 1 J k E(G)) (P(J 1 G)) k (1 + o(1)) usng Corollary 1 and notng that the assumpton µ = o(n 2 /d 2 )) mples the requred bound on d J. So we have from (2.5) that ( ) n kp E[X H ] k = o (P(J a k 1 G)) k + = o ( ) n kp a k ( ) P A J ( ) A J J 1,...,J k E(Kn) J vertex-dsjont (P(J 1 G)) k + nkp a k P (2.7) for any partcular choce of J 1,..., J k E(K n ) whch nduce vertex-dsjont copes of H n K n. Lettng B denote the condtonal probablty that these sets nduce solated 4

copes n G, gven that they are subsets of the edge set of G, we have ( ) P A J = B P(J 1,..., J k E(G)) B(P(J 1 G)) k (2.8) usng Corollary 1 agan. For one of the copes to be nonsolated, t must share an edge wth some other copy of H, and we may use the same machnery to compute the factoral moments of the number Y of for whch J nduces a nonsolated copy n G, condtonal upon J 1,..., J k E(G). We have kr = O(1) (2.9) snce r µ = O(1), and hence also, snce µ, r = o(1). (2.10) Thus r = O(r 2 ) + H 2 P(H 1 H 2 G) P(H 1 G) where the sum s over copes H 2 of H n K n whch share at least one edge wth H 1, and the r 2 term bounds the overcountng n ncluson-excluson. Wth ths observaton t s easy to compute the jth factoral moment E[Y ] j of Y. There are [k] j ways to choose a j-subset of J 1,..., J k each of whch nduce a copy of H to be nonsolated, and the probablty that all the requred edges are present n G s, agan usng Corollary 1, asymptotc to r j P(J 1,..., J k E(G)) r j (P(J 1 G)) k. The probablty that all these edges are n G, condtonal upon the event that J 1,..., J k E(G), s thus r j, and so E[Y ] j (kr) j. Thus by (2.9) and the method of moments (the usual one, that s), P(Y = 0) e kr. Thus B e kr, and we have from (2.7) and (2.8) that E[X H ] k nkp a k e kr (P(J 1 G)) k. By (2.1) and the fact that k = o( n), ths mples E[X H ] k (µe r ) k and thus by Theorem 1 wth s n = 0 the dstrbuton of X H s asymptotcally normal wth mean and varance µe r by (2.10). Example: Cycles Consder the graph H = C t, the cycle of length t 3. Here by (2.2) µ = Θ(d t ), so we requre d. Also t s easy to check that r H (n, d) = Θ(µ/(nd)), the maxmum n (2.4) occurrng for F = K 2. Thus the range of µ s bounded at the maxmum end by µ 3/2 = O(nd), µ = o(n) and µ = o(n 2 /d 2 ). By consderng the mpled upper bounds on d, we see that the frst s strctest, and thus the number of solated copes of H s 5

asymptotcally normally dstrbuted provded d and d = O(n 2/(3t 2) ). Ths can be compared wth the result n [3], for whch the bound s d = o(n 1/(2t 1) ). Example: Complete graphs Consder the graph H = K t, where t 3. Here by (2.2) µ = Θ(d t(t 1)/2 n t(t 3)/2 ). (2.11) Also r H (n, d) = max 2 s<t d t(t 1)/2 s(s 1)/2 n t(t 3)/2+s(s 3)/2, and consderng d = n α, the maxmum occurs (as wth cycles) at s = 2, and so r = Θ(µ/(nd)). Thus (as wth cycles), the range of µ s bounded at the maxmum end by µ 3/2 = O(nd), µ = o(n) and µ = o(n 2 /d 2 ). By consderng the mpled upper bounds on d, t s straghtforward to verfy that the frst gves the strctest bound for t = 3, and the last does for t 4. These mply the upper bounds d = O(n 2/7 ) n the case t = 3, and d = o(n (t(t 3)/2+2)/(t(t 1)/2+2) ),.e. d = o(n 1 2t/(t2 t+4) ), for t 4. Therefore, the number of solated copes of H s asymptotcally normally dstrbuted provded ths upper bound on d holds and the expresson n (2.11) tends to. Fnally, we may conclude somethng about the dstrbuton of the total number of copes of H, solated or not. Denote ths number by ˆX H. Corollary 2 Suppose that µ, µ = o(n), µ = o(n 2 /d 2 ) and r = o(1/ µ). Then ( ˆX H µ)/ µ tends n dstrbuton to the standard normal as n. Proof: The expected number of nonsolated copes s O(µr). So we may conclude that the total number of copes of H s asymptotcally normal provded µr = o( µ),.e. r µ = o(1). Ths s an assumpton of the corollary whch s stronger than the correspondng one n the theorem. 3 Concludng remarks For the dstrbuton results obtaned n Theorem 3, the mean and varance are asymptotcally equal. Ths means that t could equvalently be stated as gvng asymptotcally Posson dstrbuton. It would be nterestng to know the range of the degree d for whch the subgraph count remans asymptotcally Posson. Theorem 1 can potentally be used to deduce asymptotc normalty outsde the Posson range (as for nstance the prevous applcatons n [1]); one challenge s to fnd a way to apply t for such d n the present context. Another challenge s to fnd a way to apply any of the other methods of deducng asymptotc normalty to sgnfcantly hgher values of d than we do here. One possblty s to use swtchngs rather than standard ncluson-excluson to extend the range of d for whch the nonsolated copes may be treated n the proof of Theorem 3. However, the extra effort may not pay very bg dvdends. 6

References [1] Z. Gao and N.C. Wormald, Asymptotc normalty determned by hgh moments, and submap counts of random maps, Probablty Theory and Related Felds (to appear). [2] S. Janson, T. Luczak and A. Rucńsk, Random graphs, Wley, New York, 2000. [3] B.D. McKay, N.C. Wormald and B. Wysocka, Short cycles n random regular graphs, Electronc Journal of Combnatorcs (submtted). [4] B. D. McKay, Subgraphs of random graphs wth specfed degrees, Congressus Numerantum 33 (1981) 213 223. [5] A. Rucńsk, When are small subgraphs of a random graph normally dstrbuted? Probablty Theory and Related Felds 78 (1988), 1 10. 7