r"o' HEWLETT A Large Deviation Principle for the Order ofa Random Permutation ~lj PACKARD lip Laboratories Bristol

Similar documents
The characteristic polynomial of a random permutation matrix

The Characteristic Polynomial of a Random Permutation Matrix

ON THE DENSITY OF SOME SEQUENCES OF INTEGERS P. ERDOS

252 P. ERDÖS [December sequence of integers then for some m, g(m) >_ 1. Theorem 1 would follow from u,(n) = 0(n/(logn) 1/2 ). THEOREM 2. u 2 <<(n) < c

The characteristic polynomial of a random permutation matrix

An inverse of Sanov s theorem

The Degree of the Splitting Field of a Random Polynomial over a Finite Field

Splitting Fields for Characteristic Polynomials of Matrices with Entries in a Finite Field.

1981] 209 Let A have property P 2 then [7(n) is the number of primes not exceeding r.] (1) Tr(n) + c l n 2/3 (log n) -2 < max k < ir(n) + c 2 n 2/3 (l

A log-scale limit theorem for one-dimensional random walks in random environments

THE METHOD OF CONDITIONAL PROBABILITIES: DERANDOMIZING THE PROBABILISTIC METHOD

Data Structures and Algorithms. Asymptotic notation

On Arithmetic Properties of Bell Numbers, Delannoy Numbers and Schröder Numbers

Large Deviations Techniques and Applications

O N P O S I T I V E N U M B E R S n F O R W H I C H Q(n) D I V I D E S F n

SOME REMARKS ON NUMBER THEORY BY P. ERDŐS 1. Let ABSTRACT This note contains some disconnected minor remarks on number theory. (1) Iz j I=1, 1<j<co be

RANDOM MATRICES: LAW OF THE DETERMINANT

A diametric theorem for edges. R. Ahlswede and L.H. Khachatrian

Arithmetic progressions in sumsets

A Note on the Approximation of Perpetuities

81 1. Nearly foamy years ago I made the following conjecture : Let 1 5 al <... < ak s n ; 1 5 b l <... < b1 S n be two sequences of integers. Assume t

A NEW PROOF OF ROTH S THEOREM ON ARITHMETIC PROGRESSIONS

Notes on the second moment method, Erdős multiplication tables

P(I -ni < an for all n > in) = 1 - Pm# 1

A prolific construction of strongly regular graphs with the n-e.c. property

= 1 2 x (x 1) + 1 {x} (1 {x}). [t] dt = 1 x (x 1) + O (1), [t] dt = 1 2 x2 + O (x), (where the error is not now zero when x is an integer.

CONNECTIONS BETWEEN BERNOULLI STRINGS AND RANDOM PERMUTATIONS

(n = 0, 1, 2,... ). (2)

103 Some problems and results in elementary number theory. By P. ERDÖS in Aberdeen (Scotland). Throughout this paper c, c,... denote absolute constant

1 Sequences of events and their limits

CONVERGENCE OF INFINITE EXPONENTIALS GENNADY BACHMAN. In this paper we give two tests of convergence for an

1 4 which satisfies (2) identically in u for at least one value of the complex variable z then s c g.l.b. I ~m-1 y~~z cn, lar m- = 0 ( co < r, s < oo)

Probabilistic Analysis of a Parallel Algorithm for Finding Maximal lndependen t Sets

Analysis of Algorithms I: Asymptotic Notation, Induction, and MergeSort

Random regular digraphs: singularity and spectrum

BALANCING GAUSSIAN VECTORS. 1. Introduction

) = nlog b ( m) ( m) log b ( ) ( ) = log a b ( ) Algebra 2 (1) Semester 2. Exponents and Logarithmic Functions

Products of ratios of consecutive integers

Krzysztof Burdzy University of Washington. = X(Y (t)), t 0}

PUTNAM PROBLEMS SEQUENCES, SERIES AND RECURRENCES. Notes

Conflict-Free Colorings of Rectangles Ranges

SOME TAUBERIAN THEOREMS RELATED TO COIN TOSSING

On Gauss sums and the evaluation of Stechkin s constant

ON THE STRANGE DOMAIN OF ATTRACTION TO GENERALIZED DICKMAN DISTRIBUTIONS FOR SUMS OF INDEPENDENT RANDOM VARIABLES

THE LINDEBERG-FELLER CENTRAL LIMIT THEOREM VIA ZERO BIAS TRANSFORMATION

Math 3012 Applied Combinatorics Lecture 5

Gärtner-Ellis Theorem and applications.

Constructions in Ramsey theory

Cycle lengths in sparse graphs

The large deviation principle for the Erdős-Rényi random graph

ON THE STRANGE DOMAIN OF ATTRACTION TO GENERALIZED DICKMAN DISTRIBUTIONS FOR SUMS OF INDEPENDENT RANDOM VARIABLES

Graphs with maximal Hosoya index and minimal Merrifield-Simmons index

Mod-φ convergence I: examples and probabilistic estimates

SOME RESULTS AND PROBLEMS IN PROBABILISTIC NUMBER THEORY

Notes 6 : First and second moment methods

Analysis of Algorithm Efficiency. Dr. Yingwu Zhu

MODERATE DEVIATIONS IN POISSON APPROXIMATION: A FIRST ATTEMPT

The Degree of the Splitting Field of a Random Polynomial over a Finite Field

Random matrices: A Survey. Van H. Vu. Department of Mathematics Rutgers University

A PROOF OF ROTH S THEOREM ON ARITHMETIC PROGRESSIONS

AN ESTIMATE FOR THE LENGTH OF AN ARITHMETIC PROGRESSION THE PRODUCT OF WHOSE TERMS IS ALMOST SQUARE 1. INTRODUCTION

On Subsets with Cardinalities of Intersections Divisible by a Fixed Integer

Estimates for probabilities of independent events and infinite series

On prime factors of integers which are sums or shifted products. by C.L. Stewart (Waterloo)

Bounds for pairs in partitions of graphs

The maximum of the characteristic polynomial for a random permutation matrix

The largest eigenvalues of the sample covariance matrix. in the heavy-tail case

The symmetry in the martingale inequality

Algorithm. Executing the Max algorithm. Algorithm and Growth of Functions Benchaporn Jantarakongkul. (algorithm) ก ก. : ก {a i }=a 1,,a n a i N,

Continued Fraction Digit Averages and Maclaurin s Inequalities

arxiv: v4 [math.nt] 8 May 2014

THE CONVEX HULL OF THE PRIME NUMBER GRAPH

THE GROUP OF UNITS OF SOME FINITE LOCAL RINGS I

NOTE ON A THEOREM OF KAKUTANI

THE ORDER OF ELEMENTS IN SYLOW p-subgroups OF THE SYMMETRIC GROUP

Self-normalized laws of the iterated logarithm

Numerical Sequences and Series

MATH 802: ENUMERATIVE COMBINATORICS ASSIGNMENT 2

ON EXPLICIT FORMULAE AND LINEAR RECURRENT SEQUENCES. 1. Introduction

On large deviations for combinatorial sums

Properly forking formulas in Urysohn spaces

arxiv: v2 [math.nt] 4 Nov 2012

RANDOM WALKS WHOSE CONCAVE MAJORANTS OFTEN HAVE FEW FACES

Decomposition of random graphs into complete bipartite graphs

A Criterion for the Compound Poisson Distribution to be Maximum Entropy

On Certain Sums of Stirling Numbers with Binomial Coefficients

Superconcentration inequalities for centered Gaussian stationnary processes

arxiv: v1 [math.co] 22 May 2014

F-Expansions of rationals

#A87 INTEGERS 18 (2018) A NOTE ON FIBONACCI NUMBERS OF EVEN INDEX

New lower bounds for hypergraph Ramsey numbers

A large-deviation principle for Dirichlet posteriors

1 i<j k (g ih j g j h i ) 0.

ON THE COMPLETE CONVERGENCE FOR WEIGHTED SUMS OF DEPENDENT RANDOM VARIABLES UNDER CONDITION OF WEIGHTED INTEGRABILITY

The Annals of Probability, Vol. 6, No. 3. (Jun., 1978), pp

ASYMPTOTIC DISTRIBUTION OF THE MAXIMUM CUMULATIVE SUM OF INDEPENDENT RANDOM VARIABLES

Math 259: Introduction to Analytic Number Theory More about the Gamma function

THE POISSON DIRICHLET DISTRIBUTION AND ITS RELATIVES REVISITED

NORMAL NUMBERS AND UNIFORM DISTRIBUTION (WEEKS 1-3) OPEN PROBLEMS IN NUMBER THEORY SPRING 2018, TEL AVIV UNIVERSITY

Annealed Brownian motion in a heavy tailed Poissonian potential

Transcription:

r"o' HEWLETT ~lj PACKARD A Large Deviation Principle for the Order ofa Random Permutation Neil 0'Connell Basic Research Institute in the Mathematical Sciences lip Laboratories Bristol HPL-BRIM:S-96-18 July, 1996 We obtain a large deviation principle for the scaled logarithm of the order of a random permutation of a large number of objects and give an explicit expression for the convex dual ofilie rate function. Internal Accession Date Only

1 Introduction Let Un be a random permutation of n objects, and denote by O(u n ) the order of Un' In 1967, Erdos and Turan [7] proved that, as n -+ 00, the sequence log O(un) - ~ log2 n 1/2 ( ~ log3 n ) converges in distribution to a standard Gaussian random variable. Their original proof was direct; since then, as one might expect, several probabilistic proofs have appeared. In particular, Arratia and Tavare [3] give a proof based on the Feller coupling, which we will describe later. Somewhat surprisingly, a corresponding large deviation principle (LDP) does not seem to have appeared in the literature. In this paper we prove such an LDP and give an explicit expression for the convex dual of the rate function. To be more specific, we will show that for any Borel set B, where and li lip (logo(u n ) B). f A*( ) m -- og E = - III X, n-+oo log n log2 n :reb A*(x) = sup[ox - A(O)] 8 e 28-1 A(O) = 20 - l. To prove this, we will also make use of the Feller coupling, which provides a means of comparing log O(un) with the more accessible n L Zj logj, j=l where the Zj'S are independent Poisson random variables with respective means I/j. It turns out that, when properly normalised, these sequences are exponentially equivalent. The result can thus be obtained by a routine application of the Gartner-Ellis theorem to the latter sequence. The outline of this paper is as follows. In section 2 we describe the Feller coupling, which forms the basis of our approach, and record the relevant estimates. We will state and prove the main result in section 3. In section 4 we discuss some more sophisticated LDP's, which incorporate cycle lengths and allow one to consider certain biased permutations. 2

2 The Feller coupling This coupling was introduced by Feller [8] in 1945, to establish a central limit theorem for the number of cycles in a random permutation. The idea is that one can construct a random permutation of n objects using n independent Bernoulli random variables, with the property that the times between successive 1's in the sequence correspond to the cycle-lengths in the permutation. Let XI,X 2, be a sequence of Bernoulli random variables with P(Xj = 1) =1- P(Xj = 0) = I/j. For each n, we construct a permutation Un recursively using the sequence Start with 1 in the first cycle. If X n = 1,' close the cycle and start a new one, with the next available integer (2, in the first instance). If X n = 0, choose one of the n - 1 remaining integers at random and place to the right of 1 in the same cycle. Continuing in this way produces a permutation Un E Sn. Remarkably, the distribution of Un is uniform on Sn. Moreover, if Cj denotes the number of cycles oflength j in the permutation Un, then where 00 Zjm = L Xi(1- X i+1) (1 - Xi+j-l)Xi+j i=m+l is the number of j-spacings in the sequence X after time m, and Zj = Zjo. Another remarkable fact is that the Z/s are independent Poisson random variables with respective means Iii. Now for any se,quence a, define On(a) = lcm{i: 1 ~ i ~ n, aj> O} ~ Pn(a) = II i ai i=l Then O(u n ) = On(cn), and we have the following estimates. For any pair of sequences a and b, n 3

where la - bl = I:i=llaj - bd. It therefore follows from (1) that logon(z) - (Yn + 1)logn ~ logo(a n ) ~ logon(z) +logn, (2) where Yn = 'L,'J=l Zjn. We also have [3): The first summation is over all prime powers less than, or equal to, n. 3 The large deviation principle We say a sequence of random variables X n satisfies the LDP with rate an and rate function I if, for any Borel set B, - inf.i(x) ~ liminfa~11ogp(xne B):S; limsupa~11ogp(xn E B) ~ - in{i(x). xeb n n xeb Theorem 1 The sequence log O(a n )/log2 n satisfies the LDP with rate log n and rate function given by the convex dual of e 28-1 A(O) = 20-1. Recalling the estimates (2) and (3), the statement of Theorem 1 follows from the next three lemmas. Lemma 1 For any c > 0, lim sup-11 logp(yn > clogn) = -'00. n ogn Proof. Set N-mn = I:i=-m+l X j and observe that where 00 An = L Nlen,(k+l)nI(Nlen,(Ie+l)n > 1), 1e=1 Bn = L I(Nlen,(k+l)n > O,N(Ie+l)n,(1e+2)n > 0), Ie>O even 4

and Cn = 2: I(Nkn,(k+l)n > 0, N(k+l)n.(k+2)n > 0). k>o odd Note that An' Bn and Cn are each sums of independent Bernoulli random variables, so we can use the decomposition c c c. P(Yn > clogn) ~ P(An > 3"logn) + P(Bn > 3" log n) + P(Cn > 3"logn), apply Chebyshev's inequality (with the convex function x t-+ ee:r) to each term, then consider the normalised logarithmic limit and let 8 -+ 00 for the r~~t. 0 Lemma 2 For any c > 0, Proof. First we write lim sup-1 log P(Dn > clog 2 n) = -00. n ogn Dn=Sn-f(n)+Rn where Sn = Lj~n a(j)zj, a(j) =I:"'/i logp, fen) = I:"'~n logp and Rn = L I ( "'~n L Zj =0) logp. j~n:p'lj (L". Ii denotes the sum over all prime powers that divide j.) Set g(n) = Lj<n logj. For any 8 > 0 we have It follows that, for). > 1/8, log Eee(Sn-f(n»/logn ~ -g(n). 1 c -1-log P(Sn - fen) + ).g(n)logn > -21og2 n) ~ -8c/2+ (8), - l}g(n}/logn ogn as n -+ 00. Also, -1 logp(r n - ).g(n)logn > ':log2 n) ogn 2 1 < 1-logP(Rn > ).g(n)logn) ogn 5 -+ -00,

1 < -AO + --logeebrn/g(n) logn < -AO +-1 logeexp (olo(gn) L I(Zp. = 0)) ogn 9 n,< p n = -AO + _1_ L log (exp[-l/ps +olog n/g(n)] +1- e- 1 / P ') logn,< p _n < -AO +-11 L (exp[-l/ps + Ologn/g(n)] - e- 1 / P ') ogn,< P _n -AO +-11 (exp[-l/ps +Ologn/g(n)] - 1) L e-1/p' ogn,< P _n < -AO + On/g(n) 1 + Ologn/g(n) - -AO as n - 00. Now let 0-00 for the result. o Lemma 3 The sequence log Pn(Z)flog2 n satisfies the LDP with rate log n and rate function given by the convex dual of e 2B - 1 A(O) = 20-1. Proof. This is a routine application of the Gartner-Ellis Theorem (see, for example, [6]), by which it suffices to check that lim-1 10gEexp(010gPn(Z)flogn) = A(O). n ogn This limit can be evaluated by noting that 10gPn(Z) = L Zjlogj. ;<n o 4 Spin-off's If K(CT n ) denotes the number of cycles in the random permutation, that is, n K(CTn ) = L Cj, ;=1 6

then it follows from (2.1) and Lemmas 3.2 and 3.3 that the pair (K(a n )/log n,log O(a n )/log2 n) is exponentially equivalent (at rate log n) to the pair ( L Zijlogn,L Zi logjjlog2 n). i~n i~n The Gartner-Ellis Theorem can thus be applied to the latter sequence to yield an LDP for the former, with rate log n, and rate function given by the convex dual of e 2t ~ 1 A 2 (s, t) = e S - l. 2t We can also deduce an LDP the logarithm of the order of a biased random permutation, the law of which is given by the 'tilted' measure P.(a) ex esk(u). An LDP for the sequence K(an)jlogn, under the law P s, can either be deduced from the above or proved directly using well-known generating functions for Stirling numbers of the first kind; this LDP also has rate log n and the rate function given by X f-+ S - X +xlog(x/s). Note that this is a Poisson rate function, and so this LDP is consistent with (but not implied by) the Poisson approximation results in the papers of Arratia, Barbour, Stark and Tavare [1, 2,4, 5]. For related work on the number of components in a random combinatorial structure see Hwang [9] and Maxwell [10], and references therein. References [1] R.A. Arratia, A.D. Barbour and S. Tavare. "Poisson process approximations for the Ewens sampling formula," Ann. Prob. 2(3), pp. 519-535, 1992. [2] R.A. Arratia, Dudley Stark and S. Tavare. "Total variation asymptoties for Poisson process approximations of logarithmic combinatorial assemblies," Ann. Prob. 23(3), pp. 1347-1388, 1995. 7

[3] R.A. Arratia and S. Tavan~. "Limit theorems for combinatorial structures via discrete process approximations," Rand. Struct. Alg. 3, pp. 321-345, 1992. [4] R.A. Arratia and S. Tavan~. "The cycle structure ofrandom permutations," Ann. Prob. 20(3), pp. 1567-1591,1992. '" [5] R.A. Arratia and S. Tavare. "Independent process approximations for random combinatorial structures," Adv.,Math. 104(1), pp. 90-154, 1994. [6] Amir Dembo and Ofer Zeitouni. Large Deviations Techniques and Applications. Jones and Bartlett, 1993. [7] P. Erdos and P. Turan. "On some problems of a statistical group theory, TIl," Acta Math. Acad. Sci. Hungar. 18, pp. 309-320, 1967. [8] W. Feller. "The fundamental limit theorems in probability," Bull. Arner. Math. Soc. 51, pp. 800-832, 1945. [9] Hsien-Kuei Hwang. "Theoremes limites pour les structures combinatoires at les fonctions arithmetiques," PhD Thesis, Ecole Polytechnique, 1994. [10] Mark M. Maxwell. "A note on rational structures and their asymptoties," preprint. 8