On the number of subtrees on the fringe of random trees (partly joint with Huilan Chang)

Similar documents
Applications of Analytic Combinatorics in Mathematical Biology (joint with H. Chang, M. Drmota, E. Y. Jin, and Y.-W. Lee)

The Subtree Size Profile of Plane-oriented Recursive Trees

The Subtree Size Profile of Bucket Recursive Trees

The Subtree Size Profile of Plane-oriented Recursive Trees

MAXIMAL CLADES IN RANDOM BINARY SEARCH TREES

arxiv: v1 [q-bio.pe] 13 Aug 2015

Approximate Counting via the Poisson-Laplace-Mellin Method (joint with Chung-Kuei Lee and Helmut Prodinger)

Dependence between Path Lengths and Size in Random Trees (joint with H.-H. Chern, H.-K. Hwang and R. Neininger)

On joint subtree distributions under two evolutionary models

Asymptotic and Exact Poissonized Variance in the Analysis of Random Digital Trees (joint with Hsien-Kuei Hwang and Vytas Zacharovas)

Metric Diophantine Approximation for Formal Laurent Series over Finite Fields

LIMIT LAWS FOR SUMS OF FUNCTIONS OF SUBTREES OF RANDOM BINARY SEARCH TREES

Random trees and branching processes

Combinatorial/probabilistic analysis of a class of search-tree functionals

k-protected VERTICES IN BINARY SEARCH TREES

Asymptotic distribution of two-protected nodes in ternary search trees

Lecture 18 : Ewens sampling formula

The range of tree-indexed random walk

arxiv: v1 [math.pr] 21 Mar 2014

ANALYSIS OF SOME TREE STATISTICS

Dynamics of the evolving Bolthausen-Sznitman coalescent. by Jason Schweinsberg University of California at San Diego.

PROTECTED NODES AND FRINGE SUBTREES IN SOME RANDOM TREES

Analysis of Approximate Quickselect and Related Problems

14 Branching processes

Workshop III: Evolutionary Genomics

Hard-Core Model on Random Graphs

Combinatorial Enumeration. Jason Z. Gao Carleton University, Ottawa, Canada

Standard paths in another composition poset

STA 6857 Estimation ( 3.6)

Lecture 7: Dynamic Programming I: Optimal BSTs

Almost sure asymptotics for the random binary search tree

Cutting edges at random in large recursive trees

Discrete Distributions Chapter 6

Self-normalized Cramér-Type Large Deviations for Independent Random Variables

Example continued. Math 425 Intro to Probability Lecture 37. Example continued. Example

A REFINED ENUMERATION OF p-ary LABELED TREES

The nested Kingman coalescent: speed of coming down from infinity. by Jason Schweinsberg (University of California at San Diego)

Frequency Spectra and Inference in Population Genetics

FRINGE TREES, CRUMP MODE JAGERS BRANCHING PROCESSES AND m-ary SEARCH TREES

Chapter 4. Continuous Random Variables

The Moments of the Profile in Random Binary Digital Trees

Distribution of the Number of Encryptions in Revocation Schemes for Stateless Receivers

The number of accessible paths in the hypercube

Protected nodes and fringe subtrees in some random trees

V. Adamchik 1. Recurrences. Victor Adamchik Fall of 2005

Fundamental Algorithms

Partial match queries in random quadtrees

Cookie Monster Meets the Fibonacci Numbers. Mmmmmm Theorems!

1 Circuit Complexity. CS 6743 Lecture 15 1 Fall Definitions

5. Analytic Combinatorics

Maximum Agreement Subtrees

On a class of stochastic differential equations in a financial network model

CENTRAL LIMIT THEOREMS FOR ADDITIVE TREE PARAMETERS

Almost giant clusters for percolation on large trees

LIMITING DISTRIBUTIONS FOR THE NUMBER OF INVERSIONS IN LABELLED TREE FAMILIES

Forty years of tree enumeration

Applications of Discrete Mathematics to the Analysis of Algorithms

arxiv: v1 [math.co] 12 Apr 2018

The variance for partial match retrievals in k-dimensional bucket digital trees

Cookie Monster Meets the Fibonacci Numbers. Mmmmmm Theorems!

Speeding up the Dreyfus-Wagner Algorithm for minimum Steiner trees

Math 180A. Lecture 16 Friday May 7 th. Expectation. Recall the three main probability density functions so far (1) Uniform (2) Exponential.

Section 27. The Central Limit Theorem. Po-Ning Chen, Professor. Institute of Communications Engineering. National Chiao Tung University

GENERATING FUNCTIONS AND CENTRAL LIMIT THEOREMS

Chapters 3.2 Discrete distributions

THE SIMPLE URN PROCESS AND THE STOCHASTIC APPROXIMATION OF ITS BEHAVIOR

arxiv: v1 [math.co] 1 Aug 2018

Navigation on a Poisson point process. Nicolas Bonichon Jean-François Marckert (LaBRI - Bordeaux):

The Moran Process as a Markov Chain on Leaf-labeled Trees

Problem. Problem Given a dictionary and a word. Which page (if any) contains the given word? 3 / 26

Reckhow s Theorem. Yuval Filmus. November 2010

Review Sheet on Convergence of Series MATH 141H

arxiv: v1 [q-bio.pe] 1 Jun 2014

Susceptible-Infective-Removed Epidemics and Erdős-Rényi random

Discordance of Species Trees with Their Most Likely Gene Trees: The Case of Five Taxa

IEOR 6711: Stochastic Models I Fall 2012, Professor Whitt, Thursday, October 4 Renewal Theory: Renewal Reward Processes

Search Trees and Stirling Numbers

Erdős-Renyi random graphs basics

Lecture 5: Expectation

Extremal Statistics on Non-Crossing Configurations

Conditional distributions (discrete case)

Week 5: Quicksort, Lower bound, Greedy

Some Node Degree Properties of Series-Parallel Graphs Evolving Under a Stochastic Growth Model

STAT/MATH 395 A - PROBABILITY II UW Winter Quarter Moment functions. x r p X (x) (1) E[X r ] = x r f X (x) dx (2) (x E[X]) r p X (x) (3)

Binary Search Trees. Lecture 29 Section Robb T. Koether. Hampden-Sydney College. Fri, Apr 8, 2016

Hidden Markov Models Part 2: Algorithms

Homework 6: Solutions Sid Banerjee Problem 1: (The Flajolet-Martin Counter) ORIE 4520: Stochastics at Scale Fall 2015

Chapter 11. Min Cut Min Cut Problem Definition Some Definitions. By Sariel Har-Peled, December 10, Version: 1.

ECE302 Exam 2 Version A April 21, You must show ALL of your work for full credit. Please leave fractions as fractions, but simplify them, etc.

From trees to seeds: on the inference of the seed from large trees in the uniform attachment model

RANDOM CUTTING AND RECORDS IN DETERMINISTIC AND RANDOM TREES

Suppose that you have three coins. Coin A is fair, coin B shows heads with probability 0.6 and coin C shows heads with probability 0.8.

The space requirement of m-ary search trees: distributional asymptotics for m 27

CS 220: Discrete Structures and their Applications. Mathematical Induction in zybooks

Part IA Probability. Definitions. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

On finding a minimum spanning tree in a network with random weights

1 Inverse Transform Method and some alternative algorithms

Everything You Always Wanted to Know about Quicksort, but Were Afraid to Ask. Marianne Durand

The Convergence Rate for the Normal Approximation of Extreme Sums

THE DEGREE DISTRIBUTION OF RANDOM PLANAR GRAPHS

Transcription:

On the number of subtrees on the fringe of random trees (partly joint with Huilan Chang) Michael Fuchs Institute of Applied Mathematics National Chiao Tung University Hsinchu, Taiwan April 16th, 2008 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 1 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 7 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 7 6 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 1 7 6 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 1 7 6 8 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 1 7 6 8 5 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 1 7 3 6 8 5 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 1 7 3 6 8 2 5 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 X 8,1 = 3 1 7 3 6 8 2 5 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 X 8,1 = 3 X 8,2 = 2 1 7 3 6 8 2 5 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 1 7 X 8,1 = 3 X 8,2 = 2 X 8,3 = 1 3 6 8 2 5 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 1 7 X 8,1 = 3 X 8,2 = 2 X 8,3 = 1 X 8,4 = 1 3 6 8 2 5 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 1 7 3 6 8 X 8,1 = 3 X 8,2 = 2 X 8,3 = 1 X 8,4 = 1 X 8,5 = 0 2 5 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 1 7 3 6 8 X 8,1 = 3 X 8,2 = 2 X 8,3 = 1 X 8,4 = 1 X 8,5 = 0 X 8,6 = 0 2 5 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 1 7 3 6 8 X 8,1 = 3 X 8,2 = 2 X 8,3 = 1 X 8,4 = 1 X 8,5 = 0 X 8,6 = 0 X 8,7 = 0 2 5 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

The number of subtrees X n,k =number of subtrees of size k on the fringe of random binary search trees of size n. Example: Input: 4, 7, 6, 1, 8, 5, 3, 2 4 1 7 3 6 8 2 5 X 8,1 = 3 X 8,2 = 2 X 8,3 = 1 X 8,4 = 1 X 8,5 = 0 X 8,6 = 0 X 8,7 = 0 X 8,8 = 1 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 2 / 24

Mean value and variance X n,k satisfies d X n,k = XIn,k + Xn 1 I, n,k where X k,k = 1, X In,k and Xn 1 I n,k are conditionally independent given I n, and I n = Unif{0,..., n 1}. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 3 / 24

Mean value and variance X n,k satisfies X n,k d = XIn,k + X n 1 I n,k, where X k,k = 1, X In,k and Xn 1 I n,k are conditionally independent given I n, and I n = Unif{0,..., n 1}. This yields and for n > 2k + 1. µ n,k := E(X n,k ) = σ 2 n,k := Var(X n,k) = 2(n + 1), (n > k), (k + 1)(k + 2) 2k(4k 2 + 5k 3)(n + 1) (k + 1)(k + 2) 2 (2k + 1)(2k + 3) Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 3 / 24

Some previous results Aldous (1991): Weak law of large numbers X n,k µ n,k 1 in probability. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 4 / 24

Some previous results Aldous (1991): Weak law of large numbers X n,k µ n,k 1 in probability. Devroye (1991): Central limit theorem X n,k µ n,k σ n,k d N (0, 1). Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 4 / 24

Some previous results Aldous (1991): Weak law of large numbers X n,k µ n,k 1 in probability. Devroye (1991): Central limit theorem X n,k µ n,k σ n,k Flajolet, Gourdon, Martinez (1997): d N (0, 1). Central limit theorem with optimal Berry-Esseen bound and LLT Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 4 / 24

Some previous results Aldous (1991): Weak law of large numbers X n,k µ n,k 1 in probability. Devroye (1991): Central limit theorem X n,k µ n,k σ n,k Flajolet, Gourdon, Martinez (1997): d N (0, 1). Central limit theorem with optimal Berry-Esseen bound and LLT All the above results are for fixed k. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 4 / 24

Results for k = k n Theorem (Feng, Mahmoud, Panholzer (2008)) (i) (Normal range) Let k = o ( n) and k as n. Then, X n,k µ n,k 2n/k 2 d N (0, 1). (ii) (Poisson range) Let k c n as n. Then, X n,k d Poisson(2c 2 ). (iii) (Degenerate range) Let k < n and n = o(k) as n. Then, L X 1 n,k 0. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 5 / 24

Why are we interested in X n,k? X n,k is a new kind of profile of a tree. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 6 / 24

Why are we interested in X n,k? X n,k is a new kind of profile of a tree. The phase change from normal to Poisson is a universal phenomenon expected to hold for many classes of random trees. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 6 / 24

Why are we interested in X n,k? X n,k is a new kind of profile of a tree. The phase change from normal to Poisson is a universal phenomenon expected to hold for many classes of random trees. The methods for proving phase change results might be applicable to other parameters which are expected to exhibit the same phase change behavior as well. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 6 / 24

Why are we interested in X n,k? X n,k is a new kind of profile of a tree. The phase change from normal to Poisson is a universal phenomenon expected to hold for many classes of random trees. The methods for proving phase change results might be applicable to other parameters which are expected to exhibit the same phase change behavior as well. X n,k is related to parameters arising in genetics. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 6 / 24

Yule generated random genealogical trees Example: 5 4 3 2 time 1 1 2 3 4 5 6 genes Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 7 / 24

Yule generated random genealogical trees Example: Random model: At every time point, two yellow nodes uniformly coalescent. 1 2 3 4 5 6 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 7 / 24

Yule generated random genealogical trees Example: Random model: At every time point, two yellow nodes uniformly coalescent. 1 1 2 3 4 5 6 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 7 / 24

Yule generated random genealogical trees Example: Random model: 2 At every time point, two yellow nodes uniformly coalescent. 1 1 2 3 4 5 6 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 7 / 24

Yule generated random genealogical trees Example: Random model: 3 2 At every time point, two yellow nodes uniformly coalescent. 1 1 2 3 4 5 6 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 7 / 24

Yule generated random genealogical trees Example: Random model: 3 4 2 At every time point, two yellow nodes uniformly coalescent. 1 1 2 3 4 5 6 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 7 / 24

Yule generated random genealogical trees Example: 5 Random model: 3 4 2 At every time point, two yellow nodes uniformly coalescent. 1 1 2 3 4 5 6 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 7 / 24

Yule generated random genealogical trees Example: 5 Random model: 3 4 2 At every time point, two yellow nodes uniformly coalescent. 1 1 2 3 4 5 6 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 7 / 24

Yule generated random genealogical trees Example: 5 Random model: 3 4 2 At every time point, two yellow nodes uniformly coalescent. 1 1 2 3 4 5 6 Same model as random binary search tree model! Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 7 / 24

Shape parameters of genealogical trees k-pronged nodes (Rosenberg 2006): Nodes with an induced subtree with k 1 internal nodes. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 8 / 24

Shape parameters of genealogical trees k-pronged nodes (Rosenberg 2006): Nodes with an induced subtree with k 1 internal nodes. k-caterpillars (Rosenberg 2006): Correspond to nodes whose induced subtree has k 1 internal nodes all of them with out-degree either 0 or 1. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 8 / 24

Shape parameters of genealogical trees k-pronged nodes (Rosenberg 2006): Nodes with an induced subtree with k 1 internal nodes. k-caterpillars (Rosenberg 2006): Correspond to nodes whose induced subtree has k 1 internal nodes all of them with out-degree either 0 or 1. Nodes with minimal clade size k (Blum and François (2005)): If k 3, then they are internal nodes with induced subtree of size k 1 and either an empty right subtree or empty left subtree. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 8 / 24

Counting pattern in random binary search trees Consider X n,k with d X n,k = XIn,k + Xn 1 I, n,k where X k,k = Bernoulli(p k ), X In,k and Xn 1 I n,k are conditionally independent given I n, and I n = Unif{0,..., n 1}. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 9 / 24

Counting pattern in random binary search trees Consider X n,k with d X n,k = XIn,k + Xn 1 I, n,k where X k,k = Bernoulli(p k ), X In,k and Xn 1 I n,k are conditionally independent given I n, and I n = Unif{0,..., n 1}. Then, p k shape parameter 1 # of k + 1-pronged nodes 2/k # of nodes with minimal clade size k + 1 2 k 1 /k! # of k + 1 caterpillars Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 9 / 24

Underlying recurrence and solution All (centered or non-centered) moments satisfy a n,k = 2 n 1 a j,k + b n,k, n j=0 where a k,k is given and a n,k = 0 for n < k. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 10 / 24

Underlying recurrence and solution All (centered or non-centered) moments satisfy a n,k = 2 n 1 a j,k + b n,k, n j=0 where a k,k is given and a n,k = 0 for n < k. We have a n,k = where n > k. 2(n + 1) (k + 1)(k + 2) a k,k + 2(n + 1) k<j<n b j,k (j + 1)(j + 2) + b n,k, Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 10 / 24

Mean value and variance We have and E(X n,k ) = 2(n + 1) (k + 1)(k + 2) p k, (n > k), Var(X n,k ) = 2p k(4k 3 + 16k 2 + 19k + 6 (11k 2 + 22k + 6)p k )(n + 1) (k + 1)(k + 2) 2 (2k + 1)(2k + 3) for n > 2k + 1. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 11 / 24

Mean value and variance We have and E(X n,k ) = 2(n + 1) (k + 1)(k + 2) p k, (n > k), Var(X n,k ) = 2p k(4k 3 + 16k 2 + 19k + 6 (11k 2 + 22k + 6)p k )(n + 1) (k + 1)(k + 2) 2 (2k + 1)(2k + 3) for n > 2k + 1. Note that E(X n,k ) Var(X n,k ) 2p k k 2 n for n > 2k + 1 and k as n. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 11 / 24

Higher moments Denote by A (m) n,k := E(X n,k E(X n,k )) m. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 12 / 24

Higher moments Denote by Then, where A (m) n,k := E(X n,k E(X n,k )) m. A (m) n,k = 2 n 1 n j=0 A (m) j,k + B(m) n,k, B (m) n,k := i 1 +i 2 +i 3 =m 0 i 1,i 2 <m ( m ) n 1 1 i 1, i 2, i 3 n j=0 A (i 1) j,k A(i 2) n 1 j,k i 3 n,j,k and n,j,k = E(X j,k ) + E(X n 1 j,k ) E(X n,k ). Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 12 / 24

Normal range Proposition Uniformly for n, k, m 1 and n > k ( A (m) n,k = O max { 2p k n k 2, ( ) }) 2pk n m/2. k 2 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 13 / 24

Normal range Proposition Uniformly for n, k, m 1 and n > k ( A (m) n,k = O max { 2p k n k 2, ( ) }) 2pk n m/2. k 2 Proposition For E(X n,k ) as n, ( (2pk ) ) n m 1/2 = o A (2m 1) n,k k 2, A (2m) n,k ( ) 2pk n m g m k 2, where g m = (2m)!/(2 m m!). Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 13 / 24

Poisson range Consider Ā (m) n,k = E(X n,k(x n,k 1) (X n,k m + 1)). Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 14 / 24

Poisson range Consider Ā (m) n,k = E(X n,k(x n,k 1) (X n,k m + 1)). Then, similarly as before: Proposition (i) Uniformly for n, k, m 1 and n > k { ( ) Ā (m) n,k (max = O 2pk n k 2, 2pk n m }) k 2. (ii) For E(X n,k ) c and k < n as n, Ā (m) n,k cm. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 14 / 24

The phase change Theorem (i) (Normal range) Let E(X n,k ) and k as n. Then, X n,k E(X n,k ) 2pk n/k 2 d N (0, 1). (ii) (Poisson range) Let E(X n,k ) c > 0 and k < n as n. Then, X n,k d Poisson(c). (iii) (Degenerate range) Let E(X n,k ) 0 as n. Then, L X 1 n,k 0. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 15 / 24

A comparison of the phase change For k-caterpillars, we have Note that either So, there is no Poisson range. E(X n,k ) = 2k 1 n (k + 2)!. E(X n,k ) or E(X n,k ) 0. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 16 / 24

A comparison of the phase change For k-caterpillars, we have Note that either So, there is no Poisson range. E(X n,k ) = 2k 1 n (k + 2)!. E(X n,k ) or E(X n,k ) 0. shape parameter location phase change k-pronged nodes n normal - poisson - degenerate 3 minimal clade size k n normal - poisson - degenerate k-caterpillars ln n/(ln ln n) normal - degenerate Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 16 / 24

Refined results (for # of subtrees) Define and ( ) φ n,k (y) = e σ2 n,k y2 /2 E e (X n,k µ n,k )y. n,k = dm φ n,k (y) dy m. y=0 φ (m) Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 17 / 24

Refined results (for # of subtrees) Define and ( ) φ n,k (y) = e σ2 n,k y2 /2 E e (X n,k µ n,k )y. n,k = dm φ n,k (y) dy m. y=0 φ (m) Proposition Uniformly for n, k 1 and m 0 { φ (m) n ( n ) } m/3 n,k m!am max k 2, k 2 for a suitable constant A. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 17 / 24

Berry-Esseen bound and LLT for the normal range Theorem (Rate of convergency) For 1 k = o( n) as n, ( ) sup P X n,k µ n,k < x Φ(x) σ n,k = O x R ( k n ). Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 18 / 24

Berry-Esseen bound and LLT for the normal range Theorem (Rate of convergency) For 1 k = o( n) as n, ( ) sup P X n,k µ n,k < x Φ(x) σ n,k = O x R ( k n ). Theorem (LLT) For 1 k = o( n) as n, ( P (X n,k = µ n,k + xσ n,k ) = e x2 /2 (1 + O (1 + x 3 ) k )), 2πσn,k n uniformly in x = o(n 1/6 /k 1/3 ). Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 18 / 24

LLT for the Poisson range Define and φ n,k (y) = e µ n,k(y 1) E ( y X n,k ). n,k = dm φn,k (y) dy m. y=1 φ (m) Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 19 / 24

LLT for the Poisson range Define and φ n,k (y) = e µ n,k(y 1) E ( y X n,k ). n,k = dm φn,k (y) dy m. y=1 φ (m) Proposition Uniformly for n > k and m 0 for a suitable constant A. (m) φ ( n,k n ) m/2 m!am k 3 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 19 / 24

Poisson approximation Theorem (LLT) For k < n and n, P (X n,k = l) = e µ (µ n,k n,k) l l! uniformly in l. ( n ) + O k 3 Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 20 / 24

Poisson approximation Theorem (LLT) For k < n and n, P (X n,k = l) = e µ (µ n,k n,k) l l! uniformly in l. ( n ) + O k 3 Theorem (Poisson approximation) Let k < n and k as n. Then, d T V (X n,k, Poisson(µ n,k )) 0. Remark: A rate can be given as well. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 20 / 24

Other types of random trees Random recursive trees Non-plane, labelled trees with every label sequence from the root to a leave increasing; random model is the uniform model. Methods works as well (with minor modifications) and similar results can be proved. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 21 / 24

Other types of random trees Random recursive trees Non-plane, labelled trees with every label sequence from the root to a leave increasing; random model is the uniform model. Methods works as well (with minor modifications) and similar results can be proved. Plane-oriented recursive trees (PORTs) Plane, labelled trees with every label sequence from the root to a leave increasing; random model is the uniform model. Method works as well, but details more involved. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 21 / 24

Mean value and variance of PORTs We have, µ n,k := E(X n,k ) = 2n 1 4k 2 1, (n > k). Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 22 / 24

Mean value and variance of PORTs We have, µ n,k := E(X n,k ) = 2n 1 4k 2 1, (n > k). Moreover, for fixed k as n, where Var(X n,k ) c k n, c k = 8k2 4k 8 ((2k 3)!!) 2 (4k 2 1) 2 ((k 1)!) 2 4 k 1 k(2k + 1), and, for k < n and k as n, E(X n,k ) Var(X n,k ) n 2k 2. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 22 / 24

The phase change Theorem (i) (Normal range) Let k = o ( n) and k as n. Then, X n,k µ n,k n/(2k 2 ) d N (0, 1). (ii) (Poisson range) Let k c n as n. Then, X n,k d Poisson((2c 2 ) 1 ). (iii) (Degenerate range) Let k < n and n = o(k) as n. Then, L X 1 n,k 0. Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 23 / 24

More results and future research Parameters of genealogical trees under different random models Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 24 / 24

More results and future research Parameters of genealogical trees under different random models Universality of the phase change for the number of subtrees Very simple classes of increasing trees and more general classes of increasing trees (polynomial varieties, mobile trees, etc.) Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 24 / 24

More results and future research Parameters of genealogical trees under different random models Universality of the phase change for the number of subtrees Very simple classes of increasing trees and more general classes of increasing trees (polynomial varieties, mobile trees, etc.) Phase change results for the number of nodes with out-degree k Important in computer science. A phase change from normal to degenerate is expected (no Poisson range). Michael Fuchs (NCTU) Subtree Sizes of Random Trees April 16th, 2008 24 / 24