arxiv: v1 [math.co] 17 Dec 2007

Similar documents
Lectures 2 3 : Wigner s semicircle law

Asymptotically optimal induced universal graphs

Lectures 2 3 : Wigner s semicircle law

k-protected VERTICES IN BINARY SEARCH TREES

Asymptotically optimal induced universal graphs

Notes 6 : First and second moment methods

Mod-φ convergence II: dependency graphs

Asymptotics for minimal overlapping patterns for generalized Euler permutations, standard tableaux of rectangular shapes, and column strict arrays

Containment restrictions

WAITING FOR A BAT TO FLY BY (IN POLYNOMIAL TIME)

Spring 2012 Math 541B Exam 1

arxiv: v1 [math.co] 8 Feb 2014

arxiv: v1 [math.co] 29 Apr 2013

Small subgraphs of random regular graphs

Limits at Infinity. Horizontal Asymptotes. Definition (Limits at Infinity) Horizontal Asymptotes

On Descents in Standard Young Tableaux arxiv:math/ v1 [math.co] 3 Jul 2000

6.1 Polynomial Functions

Bipartite decomposition of random graphs

Induced subgraphs with many repeated degrees

Convergence Rates for Generalized Descents

Lecture 7: February 6

Bichain graphs: geometric model and universal graphs

Maximizing the descent statistic

arxiv:math/ v1 [math.co] 10 Nov 1998


The subword complexity of a class of infinite binary words

Welsh s problem on the number of bases of matroids

On the number of cycles in a graph with restricted cycle lengths

Enumeration Schemes for Words Avoiding Permutations

Discrete Mathematics. Kishore Kothapalli

On the Sensitivity of Cyclically-Invariant Boolean Functions

Poisson approximations

The concentration of the chromatic number of random graphs

Introduction to Probability

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

6 Permutations Very little of this section comes from PJE.

14.1 Finding frequent elements in stream

arxiv: v2 [math.co] 20 Jun 2018

6.207/14.15: Networks Lecture 3: Erdös-Renyi graphs and Branching processes

A Graph Polynomial Approach to Primitivity

Eigenvalues of Random Matrices over Finite Fields

A prolific construction of strongly regular graphs with the n-e.c. property

CPSC 536N: Randomized Algorithms Term 2. Lecture 9

DISJOINT PAIRS OF SETS AND INCIDENCE MATRICES

On a Balanced Property of Compositions

Math 115 Spring 11 Written Homework 10 Solutions

On a refinement of Wilf-equivalence for permutations

The number of Euler tours of random directed graphs

A dyadic endomorphism which is Bernoulli but not standard

Latin squares: Equivalents and equivalence

arxiv: v3 [math.ac] 29 Aug 2018

Transversal and cotransversal matroids via their representations.

arxiv: v2 [math.co] 17 May 2017

Department of Mathematics, University of Wisconsin-Madison Math 114 Worksheet Sections (4.1),

LINEAR ALGEBRA BOOT CAMP WEEK 1: THE BASICS

Some Results Concerning Uniqueness of Triangle Sequences

ABSTRACT. Department of Mathematics. interesting results. A graph on n vertices is represented by a polynomial in n

Generating Functions. STAT253/317 Winter 2013 Lecture 8. More Properties of Generating Functions Random Walk w/ Reflective Boundary at 0

A Primer on the Functional Equation

SO x is a cubed root of t

The Advantage Testing Foundation Solutions

arxiv: v1 [math.co] 19 Aug 2016

Nonnegative k-sums, fractional covers, and probability of small deviations

SUB-EXPONENTIALLY MANY 3-COLORINGS OF TRIANGLE-FREE PLANAR GRAPHS

Lecture Notes 1: Vector spaces

Wieland drift for triangular fully packed loop configurations

ENUMERATION BY KERNEL POSITIONS

On non-hamiltonian circulant digraphs of outdegree three

Maximum union-free subfamilies

arxiv: v1 [math.pr] 11 Dec 2017

Vertex colorings of graphs without short odd cycles

Decomposing oriented graphs into transitive tournaments

Mod-φ convergence I: examples and probabilistic estimates

PUTNAM 2016 SOLUTIONS. Problem A1. Find the smallest positive integer j such that for every polynomial p(x) with integer coefficients

Lines, parabolas, distances and inequalities an enrichment class

Linear Classification: Linear Programming

1 Basic Combinatorics

Short cycles in random regular graphs

GENERALIZED CANTOR SETS AND SETS OF SUMS OF CONVERGENT ALTERNATING SERIES

New lower bounds for hypergraph Ramsey numbers

Lecture 4: Probability and Discrete Random Variables

Graphs with Large Variance

Bounds on parameters of minimally non-linear patterns

CS5314 Randomized Algorithms. Lecture 18: Probabilistic Method (De-randomization, Sample-and-Modify)

MATH SOLUTIONS TO PRACTICE MIDTERM LECTURE 1, SUMMER Given vector spaces V and W, V W is the vector space given by

THE LECTURE HALL PARALLELEPIPED

ASYMPTOTIC ENUMERATION OF PERMUTATIONS AVOIDING GENERALIZED PATTERNS

Minimal Paths and Cycles in Set Systems

Lecture Introduction. 2 Brief Recap of Lecture 10. CS-621 Theory Gems October 24, 2012

arxiv: v1 [math.co] 6 Jun 2014

A DETERMINANT PROPERTY OF CATALAN NUMBERS. 1. Introduction

Definitions. Notations. Injective, Surjective and Bijective. Divides. Cartesian Product. Relations. Equivalence Relations

Cauchy-Schwartz inequality and geometric incidence problems. Misha Rudnev

Support weight enumerators and coset weight distributions of isodual codes

Exchangeable pairs, switchings, and random regular graphs

On the Average Complexity of Brzozowski s Algorithm for Deterministic Automata with a Small Number of Final States

Self-dual interval orders and row-fishburn matrices

Canonical lossless state-space systems: staircase forms and the Schur algorithm

Asymptotic Statistics-III. Changliang Zou

Transcription:

arxiv:07.79v [math.co] 7 Dec 007 The copies of any permutation pattern are asymptotically normal Milós Bóna Department of Mathematics University of Florida Gainesville FL 36-805 bona@math.ufl.edu Abstract We prove that the number of copies of any given permutation pattern q has an asymptotically normal distribution in random permutations. Introduction The classic definition of pattern avoidance for permutations is as follows. Let p p p p n be a permutation, let < n, and let q q q q be another permutation. We say that p contains q as a pattern if there exists a subsequence i < i < < i n so that for all indices j and r, the inequality q j < q r holds if and only if the inequality p ij < p ir holds. If p does not contain q, then we say that p avoids q. In other words, p contains q if p has a subsequence of entries, not necessarily in consecutive positions, which relate to each other the same way as the entries of q do. In a recent survey paper [] on the monotone permutation pattern, we have shown that if X n is the random variable counting copies of that pattern in a randomly selected permutation of length n, then as n goes to infinity, X n converges (in distribution to a normal distribution. When we say random permutation, we mean that each permutation of length n is selected with probability /n!. In this paper, we will generalize that result for any permutation pattern q, and the variable X n,q counting the copies of q in permutations of length Partially supported by an NSA Young Investigator Award.

n. The proof is very similar to the monotone case; just some details have to be modified. The result is a far-reaching generalization of the classic results (see [3] for more references that descents and inversions of random permutations are asymptotically normal. As a byproduct, we will see how close Var(X n,q and Var(X n, are to each other, for any pattern q of length. The Proof of Our Theorem. Bacground and Definitions We need to introduce some notation for transforms of the random variable Z. Let Z Z E(Z, let Z Z/ Var(Z, and let Z n N(0, mean that Z n converges in distribution to the standard normal variable. Definition Let {Y n,,,,n n } be an array of random variables. We say that a graph G is a dependency graph for {Y n,,,n n } if the following two conditions are satisfied:. There exists a bijection between the random variables Y n, and the vertices of G, and. If V and V are two disjoint sets of vertices of G so that no edge of G has one endpoint in V and another one in V, then the corresponding sets of random variables are independent. Note that the dependency graph of a family of variables is not unique. Indeed if G is a dependency graph for a family and G is not a complete graph, then we can get other dependency graphs for the family by simply adding new edges to G. Now we are in position to state Janson s theorem, the famous Janson dependency criterion. Theorem [4] Let Y n, be an array of random variables such that for all n, and for all,,,n n, the inequality Y n, A n holds for some real number A n, and that the maximum degree of a dependency graph of {Y n,,,,n n } is n. Set Y n N n Y n, and σn Var(Y n. If there is a natural number m so that ( m N n m An n 0, ( σ n

as n goes to infinity, then Ỹ n N(0,.. Verifying the Conditions of Janson s Criterion Let q be a fixed pattern of length. As q is fixed for the rest of this paper, we will mar our variables X n instead of X n,q, in order to avoid excessive indexing. Let us order the ( n subwords of length of the permutation p p p n linearly in some way. For i ( n, let Xn,i be the indicator random variable of the event that in a randomly selected permutation of length n, the ith subword of length in the permutation p p p p n is a q-pattern. We will now verify that the family of the X n,i satisfies all conditions of the Janson Dependency Criterion. First, X n,i for all i and all n, since the X n,i are indicator random variables. So we can set A n. Second, N n ( n, the total number of subwords of length in p. Third, if a b, then X a and X b are independent unless the corresponding subwords intersect. For that, the bth subword must intersect the ath subword in j entries, for some j. For a fixed ath subword, the number of ways that can happen is ( n j j( j ( n ( n, where we used the well-nown Vandermonde identity to compute the sum. Therefore, n ( n ( n. ( In particular, note that ( provides an upper bound for n in terms of a polynomial function of n that is of degree since terms of degree will cancel. There remains the tas of finding a lower bound for σ n that we can then use in applying Theorem. Let X n ( n i X n,i. We will show the following. Proposition There exists a positive constant c so that for all n, the inequality Var(X n cn holds. 3

Proof: By linearity of expectation, we have Var(X n E(Xn (E(X n (3 ( n ( n E E (4 ( n E X n,i X n,i i i X n,i i i ( n E(X n,i (5 i,i E(X n,i X n,i i,i E(X n,i E(X n,i. (6 Let I (resp. I denote the -element subword of p indexed by i, (resp. i. Clearly, it suffices to show that I I E(X n,i X n,i i,i E(X n,i E(X n,i cn, (7 since the left-hand side of (7 is obtained from the (6 by removing the sum of some positive terms, that is, the sum of all E(X n,i X n,i where I I >. As E(X n,i /! for each i, the sum with negative sign in (6 is i,i E(X n,i E(X n,i ( n!, which is a polynomial function in n, of degree and of leading coefficient!. As far as the summands in (6 with a positive sign go, most of them 4 are also equal to!. More precisely, E(X n,i X n,i! when I and I are disjoint, and that happens for ( ( n n ordered pairs (i,i of indices. The sum of these summands is d n ( n ( n!, (8 which is again a polynomial function in n, of degree and with leading coefficient. So summands of degree will cancel out in (6. (We will! 4 see in the next paragraph that the summands we have not yet considered add up to a polynomial of degree. 4

In fact, considering the two types of summands we studied in (6 and (8, we see that they add up to ( n ( n! ( n! n ( (! 4 + O(n (9 n! 4 + O(n. (0 Next we loo at ordered pairs of indices (i,i so that the corresponding subwords I and I intersect in exactly one entry, the entry x. Let us restrict our attention to the special case when I and I both form q-patterns, and x is the ath smallest entry in I and the bth smallest entry in I. Given q, the pair (a,b describes the location of x in I and in I as well. Let I (resp. I denote the set of a positions in I (resp. b positions in I which must contain entries smaller than x given that I (resp. I forms a q-pattern. Similarly, let I (resp. I denote the set of a positions in I (resp. b positions I which must contain entries larger than x given that I (resp. I forms a q-pattern. Example Let q 354, and let us say that I and I both form q- patterns, and they intersect in one entry x that is the third smallest entry in I and the fourth smallest entry in I (so a 3, and b 4. Then x is the leftmost entry of I and the next-to-last entry of I. Furthermore, the third and fifth positions of I form I and the second and fourth positions of I form I. Similarly, the first, third, and fifth positions of I form I and the second position of I forms I. Let q a (resp. q b be the pattern obtained from q by removing its ath smallest (resp. bth smallest entry. Note that X i X i if and only if all of the following independent events hold.. In the ( -element set of entries that belong to I I, the entry x is the (a+b th smallest. This happens with probability /(.. The a + b entries in positions belonging to I I must all be smaller than the a b entries in positions belonging to I I. This happens with probability ( a+b. 3. the subword I is a pattern that is isomorphic to the pattern formed by the a smallest entries of q, 5

the subword I is a pattern that is isomorphic to the pattern formed by the b smallest entries of q, the subword I is a pattern that is isomorphic to the pattern formed by the a largest entries of q, and the subword I is a pattern that is isomorphic to the pattern formed by the b largest entries of q. This happens with probability (a!(b!( a!( b!. Therefore, if I I, then P(X i X i ( ( ( a+b (a!(b!( a!( b! ( ( a + b a b (!. ( a a How many such ordered pairs (I,I are there? There are ( n choices for the underlying set I I. Once that choice is made, the a+b st smallest entry of I I will be x. Then the number of choices for the set of entries other than x that will be part of I is ( a+b ( a b a a. Therefore, summing over all a and b and recalling (, p n I I P(X i X i I I E(X i X i (3 ( n ( a + b ( a b. (4 (! a a a,b The expression we just obtained is a polynomial of degree, in the variable n. We claim that its leading coefficient is larger than /! 4. If we can show that, the proposition will be proved since (0 shows that the summands not included in (3 contribute about n to the left-hand! 4 side of (7. Recall that by the Cauchy-Schwarz inequality, if t,t,,t m are nonnegative real numbers, then ( m i t i m m t i, (5 i where equality holds if and only if all the t i are equal. 6

Let us apply this inequality with the numbers ( a+b ( a b a a playing the role of the t i, where a and b range from to. We get that a,b ( ( a + b ( a b a,b > a a ( a+b a ( a b a. (6 We will use Vandermonde s identity to compute the right-hand side. To that end, we first compute the sum of summands with a fixed h a+b. We obtain a,b ( ( a + b a b a a h a h ( h a ( ( ( ( h (7 a (8. (9 Substituting the last expression into the right-hand side of (6 yields a,b ( a + b ( a b a a Therefore, (3 and (0 imply that p n > ( n (! > ( ( (. (. (0 As we pointed out after (3, p n is a polynomial of degree in the variable n. The last displayed inequality shows that its leading coefficient is larger than (! (! (! 4! 4 as claimed. Comparing this with (0 completes the proof of our Proposition. We can now return to the application of Theorem to our variables X n,i. By Proposition, there is an absolute constant C so that σ n > Cn 0.5 for 7

all n. So ( will be satisfied if we show that there exists a positive integer m so that ( n (dn m (n +0.5 m < dn 0.5m 0. Clearly, any positive integer m is a good choice. So we have proved the following theorem. Theorem Let q be a fixed permutation pattern of length, and let X n be the random variable counting occurrences of q in permutations of length n. Then X n N(0,. In other words, X n is asymptotically normal. The following Corollary shows how close the variances of the numbers of copies of two given patterns are to each other. Corollary For any pattern q of length, we have where c V ar(x n,q c n + O(n, (! a,b ( a + b ( a b a a! 4. We point out that this does not mean that V ar(x n,q does not depend on q. It does, and it is easy to verify that Var(X 4,3 Var(X 4,3. However, it is only the terms of degree at most of Var(X n,q that depend on q. Proof: Note that in the proof of Theorem, we have not used anything about the pattern q apart from its length. Our claim then follows from comparing (0 and (4. References [] M. Bóna, Generalized Descents and Normality, submitted. [] M. Bóna, On Three Notions of Monotone Subsequences, submitted. [3] J. Fulman, Stein s Method and Non-reversible Marov Chains. Stein s method: expository lectures and applications, 69 77, IMS Lecture Notes Monogr. Ser., 46, Inst. Math. Statist., Beachwood, OH, 004. [4] S. Janson, Normal convergence by higher semi-invariants with applications to sums of dependent random variables and random graphs. Ann. Prob. 6 (988, no., 305-3. 8