Dependence between Path Lengths and Size in Random Trees (joint with H.-H. Chern, H.-K. Hwang and R. Neininger)

Size: px
Start display at page:

Download "Dependence between Path Lengths and Size in Random Trees (joint with H.-H. Chern, H.-K. Hwang and R. Neininger)"

Transcription

1 Dependence between Path Lengths and Size in Random Trees (joint with H.-H. Chern, H.-K. Hwang and R. Neininger) Michael Fuchs Institute of Applied Mathematics National Chiao Tung University Hsinchu, Taiwan Kraków, July 4, 2016 Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

2 Random m-ary Search Trees Proposed by Muntz and Uzgalis in Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

3 Random m-ary Search Trees Proposed by Muntz and Uzgalis in Input: 6, 2, 4, 8, 7, 1, 5, 3, 10, 9 Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

4 Random m-ary Search Trees Proposed by Muntz and Uzgalis in Input: 6, 2, 4, 8, 7, 1, 5, 3, 10, 9 2, 6 1 4, 5 7, 8 3 9, 10 m = 3 Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

5 Random m-ary Search Trees Proposed by Muntz and Uzgalis in Input: 6, 2, 4, 8, 7, 1, 5, 3, 10, 9 2, 6 2, 4, 6 1 4, 5 7, , 8, 9 3 9, 10 m = 3 m = 4 10 Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

6 Random m-ary Search Trees Proposed by Muntz and Uzgalis in Input: 6, 2, 4, 8, 7, 1, 5, 3, 10, 9 2, 6 2, 4, 6 1 4, 5 7, , 8, 9 3 9, 10 m = 3 m = 4 10 If permutations are equally likely random m-ary search trees Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

7 Size, KPL, and NPL Size (or Storage Requirement) Number of nodes holding keys. Only random if m 3. S n = size of a random m-ary search tree built from n keys. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

8 Size, KPL, and NPL Size (or Storage Requirement) Number of nodes holding keys. Only random if m 3. S n = size of a random m-ary search tree built from n keys. Key Path Length (KPL) Sum of all key-distances to the root. K n = KPL of a random m-ary search tree built from n keys. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

9 Size, KPL, and NPL Size (or Storage Requirement) Number of nodes holding keys. Only random if m 3. S n = size of a random m-ary search tree built from n keys. Key Path Length (KPL) Sum of all key-distances to the root. K n = KPL of a random m-ary search tree built from n keys. Node Path Length (NPL) Sum of all node-distances to the root. N n = NPL of a random m-ary search tree built from n keys. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

10 Size: Mean Knuth (1973): where E(S n ) φn, φ := and H m are the Harmonic numbers. 1 2(H m 1) Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

11 Size: Mean Knuth (1973): where E(S n ) φn, φ := and H m are the Harmonic numbers. Mahmoud and Pittel (1989): 1 2(H m 1) E(S n ) = φ(n + 1) 1 m 1 + O(nα 1 ), where α is the real part of the second largest zero of Λ(z) = z(z + 1) (z + m 2) m!. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

12 Size: Phase Change for Variance Mahmoud and Pittel (1989): { C S n, if m 26; Var(S n ) F 1 (β log n)n 2α 2, if m 27, where λ = α + iβ is the second largest zero of Λ(z). Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

13 Size: Phase Change for Variance Mahmoud and Pittel (1989): { C S n, if m 26; Var(S n ) F 1 (β log n)n 2α 2, if m 27, where λ = α + iβ is the second largest zero of Λ(z). Here, F 1 (z) is the periodic function ( F 1 (z) = 2 A 2 Γ(λ) ( A 2 e 2iz + 2R Γ(λ) 2 with A = 1/(λ(λ 1) 0 j m 2 m!(m 1) Γ(λ) 2 ) Γ(2α + m 2) m!γ(2α 1) ( 1 + m!(m 1)Γ(λ) 2 Γ(2λ + m 2) m!γ(2λ 1) 1 j+λ ). )) Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

14 Size: Phase Change for Limit Law Theorem (Mahmoud & Pittel (1989); Lew & Mahmoud (1994)) For 3 m 26, S n E(S n ) Var(Sn ) d N(0, 1), where N(0, 1) is the standard normal distribution. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

15 Size: Phase Change for Limit Law Theorem (Mahmoud & Pittel (1989); Lew & Mahmoud (1994)) For 3 m 26, S n E(S n ) Var(Sn ) d N(0, 1), where N(0, 1) is the standard normal distribution. Theorem (Chern & Hwang (2001)) For m 27, S n E(S n ) Var(Sn ) does not converge to a fixed limit law. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

16 KPL: Moments Mahmoud (1986): E(K n ) = 2φn log n + c 1 n + o(n), where c 1 is an explicitly computable constant. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

17 KPL: Moments Mahmoud (1986): E(K n ) = 2φn log n + c 1 n + o(n), where c 1 is an explicitly computable constant. Mahmoud (1992): where with H (2) m = 1 j m 1/j2. Var(K n ) C K n 2, ( ) C K = 4φ 2 (m + 1)H m (2) 2 π2 m 1 6 Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

18 KPL: Moments Mahmoud (1986): E(K n ) = 2φn log n + c 1 n + o(n), where c 1 is an explicitly computable constant. Mahmoud (1992): where with H (2) m = 1 j m 1/j2. Var(K n ) C K n 2, ( ) C K = 4φ 2 (m + 1)H m (2) 2 π2 m 1 6 So, no phase change here for the variance! Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

19 KPL: Limit Law Theorem (Neininger & Rüschendorf (1999)) We have, K n E(K n ) n where K is the unique solution of X d = 1 r m V r X (r) + 2φ with X (r) an independent copy of X and d K, 1 r m V r = U (r) U (r 1), V r log V r where U (r) is the r-th order statistic of m i.i.d. uniform RVs. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

20 Node Path Length (NPL) N n = sum of all node-distances in an m-search tree built from n keys. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

21 Node Path Length (NPL) N n = sum of all node-distances in an m-search tree built from n keys. Broutin and Holmgren (2012): E(N n ) = 2φ 2 n log n + c 2 n + o(n), where c 2 is an explicitly computable constant. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

22 Node Path Length (NPL) N n = sum of all node-distances in an m-search tree built from n keys. Broutin and Holmgren (2012): E(N n ) = 2φ 2 n log n + c 2 n + o(n), where c 2 is an explicitly computable constant. We have, { d (1) S n = S I S (m) N n d = N (1) I 1 I m + 1, + + N (m) I m + S (1) I S (m) I m. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

23 Node Path Length (NPL) N n = sum of all node-distances in an m-search tree built from n keys. Broutin and Holmgren (2012): E(N n ) = 2φ 2 n log n + c 2 n + o(n), where c 2 is an explicitly computable constant. We have, { d (1) S n = S I S (m) N n d = N (1) I 1 I m + 1, + + N (m) I m + S (1) I S (m) I m. So, one expects a strong positive dependence between S n and N n! Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

24 Size and NPL: Correlation (i) Theorem (Chern, F., Hwang, Neininger) We have, Cov(S n, N n ) { C R n log n, if 3 m 13; φf 2 (β log n)n α, if m 14, where C R is a constant and F 2 (z) is a periodic function. Moreover, Var(N n ) φ 2 C K n 2., Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

25 Size and NPL: Correlation (i) Theorem (Chern, F., Hwang, Neininger) We have, Cov(S n, N n ) { C R n log n, if 3 m 13; φf 2 (β log n)n α, if m 14, where C R is a constant and F 2 (z) is a periodic function. Moreover, Var(N n ) φ 2 C K n 2., Thus (!), 0, if 3 m 26; ρ(s n, N n ) F 2(β log n), if m 27. CK F 1 (β log n) Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

26 Size and NPL: Correlation (ii) Periodic function of ρ(s n, N n ) for m = 27, 54,..., 270. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

27 Pearson s Correlation Coefficient Pearson: for RVs X and Y ρ(x, Y ) = Cov(X, Y ) Var(X)Var(Y ). Measures linear dependence between X and Y! Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

28 Pearson s Correlation Coefficient Pearson: for RVs X and Y ρ(x, Y ) = Cov(X, Y ) Var(X)Var(Y ). Measures linear dependence between X and Y! Refined correlation measures: Distance correlation, Brownian covariance, mutual information, total correlation, dual total correlation, etc. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

29 Pearson s Correlation Coefficient Pearson: for RVs X and Y ρ(x, Y ) = Cov(X, Y ) Var(X)Var(Y ). Measures linear dependence between X and Y! Refined correlation measures: Distance correlation, Brownian covariance, mutual information, total correlation, dual total correlation, etc. Question: Can our counterintuitive result for ρ(s n, N n ) be ascribed to the weakness of Pearson s correlation coefficient? Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

30 Pearson s Correlation Coefficient Pearson: for RVs X and Y ρ(x, Y ) = Cov(X, Y ) Var(X)Var(Y ). Measures linear dependence between X and Y! Refined correlation measures: Distance correlation, Brownian covariance, mutual information, total correlation, dual total correlation, etc. Question: Can our counterintuitive result for ρ(s n, N n ) be ascribed to the weakness of Pearson s correlation coefficient? NO! Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

31 Size and NPL: Limit Law for 3 m 26 Theorem (Chern, F., Hwang, Neininger) Consider Then, Q n = (S n, N n ). Cov(Q n ) 1/2( ) ( ) d Q n E(Q n ) N, C 1/2 K K, where N has a standard normal distribution. Moreover, N and K are independent! Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

32 Size and NPL: Limit Law for 3 m 26 Theorem (Chern, F., Hwang, Neininger) Consider Then, Q n = (S n, N n ). Cov(Q n ) 1/2( ) ( ) d Q n E(Q n ) N, C 1/2 K K, where N has a standard normal distribution. Moreover, N and K are independent! Thus, asymptotic independence for 3 m 26 is also observed in the bivariate limit law! Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

33 Size and NPL: Limit Law for m 27 Theorem (Chern, F., Hwang, Neininger) Consider Then, Y n = ( Sn φn n α 1, N ) n E(N n ). n l 2 (Y n, (R(n iβ Λ), φk)) 0, where l 2 is the minimal L 2 -metric and Λ is the unique solution of W d = 1 r m with W (r) independent copies of W. Vr λ 1 W (r) Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

34 Size and KPL Same results hold for size and KPL, e.g., { 0, if 3 m 26; ρ(s n, K n ) ρ(s n, N n ), if m 27. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

35 Size and KPL Same results hold for size and KPL, e.g., { 0, if 3 m 26; ρ(s n, K n ) ρ(s n, N n ), if m 27. Theorem (Chern, F., Hwang, Neininger) We have ρ(k n, N n ) 1 and N n φk n (E(N n φk n )) 2 = o(n). In particular, ( Kn E(K n ) n, N ) n E(N n ) d (K, φk). n Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

36 Trie Proposed by René de la Briandais in Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

37 Trie Proposed by René de la Briandais in Name from the word data retrieval (suggested by Fredkin). Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

38 Trie Proposed by René de la Briandais in Name from the word data retrieval (suggested by Fredkin). Example: Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

39 Trie Proposed by René de la Briandais in Name from the word data retrieval (suggested by Fredkin). Example: Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

40 Trie Proposed by René de la Briandais in Name from the word data retrieval (suggested by Fredkin). Example: Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

41 Trie Proposed by René de la Briandais in Name from the word data retrieval (suggested by Fredkin). Example: Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

42 Trie Proposed by René de la Briandais in Name from the word data retrieval (suggested by Fredkin). Example: Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

43 Trie Proposed by René de la Briandais in Name from the word data retrieval (suggested by Fredkin). Example: Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

44 Notation and Random Model Two types of nodes: Internal nodes: only used for branching; External nodes: nodes which hold data. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

45 Notation and Random Model Two types of nodes: Internal nodes: only used for branching; External nodes: nodes which hold data. Random Model: Bits are independent Bernoulli random variables with mean p. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

46 Notation and Random Model Two types of nodes: Internal nodes: only used for branching; External nodes: nodes which hold data. Random Model: Bits are independent Bernoulli random variables with mean p. p = 1/2: symmetric trie; p 1/2: asymmetric trie. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

47 Notation and Random Model Two types of nodes: Internal nodes: only used for branching; External nodes: nodes which hold data. Random Model: Bits are independent Bernoulli random variables with mean p. p = 1/2: symmetric trie; p 1/2: asymmetric trie. Question: correlation between size and path-length in random tries? Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

48 Size, EPL, and IPL Size Number of internal nodes. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

49 Size, EPL, and IPL Size Number of internal nodes. External Path Length (EPL) Sum of all distances between external nodes and root. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

50 Size, EPL, and IPL Size Number of internal nodes. External Path Length (EPL) Sum of all distances between external nodes and root. Internal Path Length (IPL) Sum of all distances between internal nodes and root. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

51 Size, EPL, and IPL Size Number of internal nodes. External Path Length (EPL) Sum of all distances between external nodes and root. Internal Path Length (IPL) Sum of all distances between internal nodes and root. We again use S n, K n, N n and EPL KPL; IPL NPL. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

52 Some Notation We use the following notation: Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

53 Some Notation We use the following notation: Entropy: h = p log p q log q; Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

54 Some Notation We use the following notation: Entropy: h = p log p q log q; If log p/ log q Q, then log p log q = r, gcd(r, l) = 1. l Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

55 Some Notation We use the following notation: Entropy: h = p log p q log q; If log p/ log q Q, then log p log q = r, gcd(r, l) = 1. l and χ k = 2rkπi log p, (k Z). Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

56 Some Notation We use the following notation: Entropy: h = p log p q log q; If log p/ log q Q, then log p log q = r, gcd(r, l) = 1. l and For a sequence g k : χ k = 2rkπi log p, (k Z). F [g](z) = { k Z g kz χ k, if log p/ log q Q; g 0, if log p/ log q Q. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

57 Means and Variances 1 Shape parameters n (mean) 1 n (variance) Size S n F [ ](n) F [g (1) ](n) NPL N n E(S n) KPL K n Depth D n log n h n log n h + F [ ](n) E(D n ) = E(Kn) n pq log 2 p q h 2 Var(S n) n log n h (log n) 2 h 2 Var(D n ) = Var(Kn) n + F [g (3) ](n) + O(1) Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

58 Means and Variances 1 Shape parameters n (mean) 1 n (variance) Size S n F [ ](n) F [g (1) ](n) NPL N n E(S n) KPL K n Depth D n log n h n log n h + F [ ](n) E(D n ) = E(Kn) n pq log 2 p q h 2 Var(S n) n log n h (log n) 2 h 2 Var(D n ) = Var(Kn) n + F [g (3) ](n) + O(1) Note that and for p q N n S n log n h Var(D n ) Var(K n). n Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

59 Main Approaches Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

60 Main Approaches Rice Method Exercise 54 in Section of Knuth s book. Developed into a systematic tool by Flajolet and Sedgewick. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

61 Main Approaches Rice Method Exercise 54 in Section of Knuth s book. Developed into a systematic tool by Flajolet and Sedgewick. Two-stage Approach Introduced by Jacquet and Régnier. Further developed by Jacquet and Szpankowski. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

62 Main Approaches Rice Method Exercise 54 in Section of Knuth s book. Developed into a systematic tool by Flajolet and Sedgewick. Two-stage Approach Introduced by Jacquet and Régnier. Further developed by Jacquet and Szpankowski. Poisson Variance and Covariance F., Hwang and Zacharovas. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

63 Main Approaches Rice Method Exercise 54 in Section of Knuth s book. Developed into a systematic tool by Flajolet and Sedgewick. Two-stage Approach Introduced by Jacquet and Régnier. Further developed by Jacquet and Szpankowski. Poisson Variance and Covariance F., Hwang and Zacharovas. Other Approaches Elementary (Schachinger), probabilistic (Devroye, Janson, etc.) Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

64 Size: Variance Theorem (Régnier & Jacquet (1989); Kirschenhofer & Prodinger (1991); F., Hwang, Zacharovas (2014)) We have, where Var(S n ) F [g (1) ](n)n, g (1) k = χ ( kγ( 1 + χ k ) 1 χ ) k + 3 h 2 1+χ k 1 h 2 Γ(χ j + 1)Γ(χ k j + 1) j Z 2 ( 1) j (j χ k )Γ(j + χ k ) ( p j+1 + q j+1) h (j 1)!(j + 1)(1 p j+1 q j+1. ) j 1 Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

65 Size and NPL: Variances and Covariance Theorem (F., Hwang, Zacharovas (2014)) We have, and and Var(S n ) F [g (1) ](n)n, Cov(S n, N n ) F [g (1) ](n) n log n h Var(N n ) F [g (1) ](n) n log2 n h 2. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

66 Size and NPL: Variances and Covariance Theorem (F., Hwang, Zacharovas (2014)) We have, Var(S n ) F [g (1) ](n)n, and and Corollary We have, Cov(S n, N n ) F [g (1) ](n) n log n h Var(N n ) F [g (1) ](n) n log2 n h 2. ρ(s n, N n ) 1. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

67 Size and NPL: Limit Law Theorem (F. & Lee (2015)) We have, ( ) S n E(S n ) Var(Sn ), N n E(N n ) d N (0, E 2 ), Var(Nn ) where E 2 is the 2 2 unit matrix E 2 = ( ) Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

68 Size and NPL: Limit Law Theorem (F. & Lee (2015)) We have, ( ) S n E(S n ) Var(Sn ), N n E(N n ) d N (0, E 2 ), Var(Nn ) where E 2 is the 2 2 unit matrix E 2 = ( ) NPL was also investigated by Nguyen-The (2004) in his PhD-thesis, but his result is incorrect. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

69 Size and NPL: Limit Law Theorem (F. & Lee (2015)) We have, ( ) S n E(S n ) Var(Sn ), N n E(N n ) d N (0, E 2 ), Var(Nn ) where E 2 is the 2 2 unit matrix E 2 = ( ) NPL was also investigated by Nguyen-The (2004) in his PhD-thesis, but his result is incorrect. Question: same result for size and KPL? Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

70 Size and KPL: Covariance Theorem (F. & Hwang) We have, Cov(S n, K n ) F [g (2) ](n)n, where g (2) k = Γ(χ ( k) 1 χ k + 2 ) h 2 χ k+1 1 h 2 Γ(χ k j + 1)(χ j 1)Γ(χ j ) j Z\{0} Γ(χ k + 1) ( h 2 γ ψ(χ k + 1) p log2 p + q log 2 q ) 2h + 1 ( 1) j (2j 2 2j (2j 1)χ k )Γ(j 1χ k )(p j + q j ) h j!(1 p j q j. ) j 2 Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

71 Correlation Coefficient p = q = 1/2: ρ(s n, K n ) Cov(S n,k n) Var(Sn)(Var(K n)+1.046) Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

72 Size and KPL: Correlation Coefficient Theorem (F. & Hwang) We have, ρ(s n, K n ) { 0, if p q; F (n), if p = q, where F (n) = F [g (2) ](n) F [g (1) ](n)f [g (3) ](n) is a periodic function with average value = and amplitude Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

73 Size and KPL: Correlation Coefficient Theorem (F. & Hwang) We have, ρ(s n, K n ) { 0, if p q; F (n), if p = q, where F (n) = F [g (2) ](n) F [g (1) ](n)f [g (3) ](n) is a periodic function with average value = and amplitude Question: is now this behavior due to the weakness of Pearson s correlation coefficient? Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

74 Size and KPL: Correlation Coefficient Theorem (F. & Hwang) We have, ρ(s n, K n ) { 0, if p q; F (n), if p = q, where F (n) = F [g (2) ](n) F [g (1) ](n)f [g (3) ](n) is a periodic function with average value = and amplitude Question: is now this behavior due to the weakness of Pearson s correlation coefficient? Again NO! Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

75 Size and KPL: Limit Laws Theorem (F. & Hwang) p q: ( ) S n E(S n ) Var(Sn ), K n E(K n ) d N (0, I 2 ), Var(Kn ) where I 2 is the 2 2 identity matrix. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

76 Size and KPL: Limit Laws Theorem (F. & Hwang) p q: ( ) S n E(S n ) Var(Sn ), K n E(K n ) d N (0, I 2 ), Var(Kn ) where I 2 is the 2 2 identity matrix. p = q: ( ) Σn 1/2 d S n E(S n ), K n E(K n ) N 2 (0, I 2 ), where Σ n is the (asymptotic) covariance matrix: ( F [g Σ n := n (1) ](n) F [g (2) ) ](n) F [g (2) ](n) F [g (3). ](n) Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

77 Joint Distribution of S n and K n p = 0.1 p = 0.2 p = 0.3 K n S n K n Sn K n Sn p = 0.4 p = 0.5 p = 0.6 K n S n K n Sn K n S n Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

78 Summary trees { ρ(s n, K n ) ρ(s n, N n ) p q : 0 tries 1 p = { q : periodic m-ary 3 m 26 : 0 search trees m 27 : periodic Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

79 Summary trees { ρ(s n, K n ) ρ(s n, N n ) p q : 0 tries 1 p = { q : periodic m-ary 3 m 26 : 0 search trees m 27 : periodic Similar results for fringe-balanced binary search trees and quadtrees: H.-H. Chern, M. Fuchs, H.-K. Hwang, R. Neininger. Dependence and phase changes in random m-ary search trees, arxiv: Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

80 Summary trees { ρ(s n, K n ) ρ(s n, N n ) p q : 0 tries 1 p = { q : periodic m-ary 3 m 26 : 0 search trees m 27 : periodic Similar results for fringe-balanced binary search trees and quadtrees: H.-H. Chern, M. Fuchs, H.-K. Hwang, R. Neininger. Dependence and phase changes in random m-ary search trees, arxiv: Similar results for m-ary tries, m-ary PATRICIA tries and bucket digital search trees. Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

81 Summary trees { ρ(s n, K n ) ρ(s n, N n ) p q : 0 tries 1 p = { q : periodic m-ary 3 m 26 : 0 search trees m 27 : periodic Similar results for fringe-balanced binary search trees and quadtrees: H.-H. Chern, M. Fuchs, H.-K. Hwang, R. Neininger. Dependence and phase changes in random m-ary search trees, arxiv: Similar results for m-ary tries, m-ary PATRICIA tries and bucket digital search trees. Better explanation of our results? Michael Fuchs (NCTU) Dependence in Random Trees Kraków, July 4, / 30

Asymptotic and Exact Poissonized Variance in the Analysis of Random Digital Trees (joint with Hsien-Kuei Hwang and Vytas Zacharovas)

Asymptotic and Exact Poissonized Variance in the Analysis of Random Digital Trees (joint with Hsien-Kuei Hwang and Vytas Zacharovas) Asymptotic and Exact Poissonized Variance in the Analysis of Random Digital Trees (joint with Hsien-Kuei Hwang and Vytas Zacharovas) Michael Fuchs Institute of Applied Mathematics National Chiao Tung University

More information

Approximate Counting via the Poisson-Laplace-Mellin Method (joint with Chung-Kuei Lee and Helmut Prodinger)

Approximate Counting via the Poisson-Laplace-Mellin Method (joint with Chung-Kuei Lee and Helmut Prodinger) Approximate Counting via the Poisson-Laplace-Mellin Method (joint with Chung-Kuei Lee and Helmut Prodinger) Michael Fuchs Department of Applied Mathematics National Chiao Tung University Hsinchu, Taiwan

More information

The Subtree Size Profile of Plane-oriented Recursive Trees

The Subtree Size Profile of Plane-oriented Recursive Trees The Subtree Size Profile of Plane-oriented Recursive Trees Michael Fuchs Department of Applied Mathematics National Chiao Tung University Hsinchu, Taiwan ANALCO11, January 22nd, 2011 Michael Fuchs (NCTU)

More information

EXPECTED WORST-CASE PARTIAL MATCH IN RANDOM QUADTRIES

EXPECTED WORST-CASE PARTIAL MATCH IN RANDOM QUADTRIES EXPECTED WORST-CASE PARTIAL MATCH IN RANDOM QUADTRIES Luc Devroye School of Computer Science McGill University Montreal, Canada H3A 2K6 luc@csmcgillca Carlos Zamora-Cura Instituto de Matemáticas Universidad

More information

On the number of subtrees on the fringe of random trees (partly joint with Huilan Chang)

On the number of subtrees on the fringe of random trees (partly joint with Huilan Chang) On the number of subtrees on the fringe of random trees (partly joint with Huilan Chang) Michael Fuchs Institute of Applied Mathematics National Chiao Tung University Hsinchu, Taiwan April 16th, 2008 Michael

More information

The Moments of the Profile in Random Binary Digital Trees

The Moments of the Profile in Random Binary Digital Trees Journal of mathematics and computer science 6(2013)176-190 The Moments of the Profile in Random Binary Digital Trees Ramin Kazemi and Saeid Delavar Department of Statistics, Imam Khomeini International

More information

PROBABILISTIC BEHAVIOR OF ASYMMETRIC LEVEL COMPRESSED TRIES

PROBABILISTIC BEHAVIOR OF ASYMMETRIC LEVEL COMPRESSED TRIES PROBABILISTIC BEAVIOR OF ASYMMETRIC LEVEL COMPRESSED TRIES Luc Devroye Wojcieh Szpankowski School of Computer Science Department of Computer Sciences McGill University Purdue University 3450 University

More information

LAWS OF LARGE NUMBERS AND TAIL INEQUALITIES FOR RANDOM TRIES AND PATRICIA TREES

LAWS OF LARGE NUMBERS AND TAIL INEQUALITIES FOR RANDOM TRIES AND PATRICIA TREES LAWS OF LARGE NUMBERS AND TAIL INEQUALITIES FOR RANDOM TRIES AND PATRICIA TREES Luc Devroye School of Computer Science McGill University Montreal, Canada H3A 2K6 luc@csmcgillca June 25, 2001 Abstract We

More information

Multiple Choice Tries and Distributed Hash Tables

Multiple Choice Tries and Distributed Hash Tables Multiple Choice Tries and Distributed Hash Tables Luc Devroye and Gabor Lugosi and Gahyun Park and W. Szpankowski January 3, 2007 McGill University, Montreal, Canada U. Pompeu Fabra, Barcelona, Spain U.

More information

Laws of large numbers and tail inequalities for random tries and PATRICIA trees

Laws of large numbers and tail inequalities for random tries and PATRICIA trees Journal of Computational and Applied Mathematics 142 2002 27 37 www.elsevier.com/locate/cam Laws of large numbers and tail inequalities for random tries and PATRICIA trees Luc Devroye 1 School of Computer

More information

Applications of Analytic Combinatorics in Mathematical Biology (joint with H. Chang, M. Drmota, E. Y. Jin, and Y.-W. Lee)

Applications of Analytic Combinatorics in Mathematical Biology (joint with H. Chang, M. Drmota, E. Y. Jin, and Y.-W. Lee) Applications of Analytic Combinatorics in Mathematical Biology (joint with H. Chang, M. Drmota, E. Y. Jin, and Y.-W. Lee) Michael Fuchs Department of Applied Mathematics National Chiao Tung University

More information

The space requirement of m-ary search trees: distributional asymptotics for m 27

The space requirement of m-ary search trees: distributional asymptotics for m 27 1 The space requirement of m-ary search trees: distributional asymptotics for m 7 James Allen Fill 1 and Nevin Kapur 1 Applied Mathematics and Statistics, The Johns Hopkins University, 3400 N. Charles

More information

ROSENA R.X. DU AND HELMUT PRODINGER. Dedicated to Philippe Flajolet ( )

ROSENA R.X. DU AND HELMUT PRODINGER. Dedicated to Philippe Flajolet ( ) ON PROTECTED NODES IN DIGITAL SEARCH TREES ROSENA R.X. DU AND HELMUT PRODINGER Dedicated to Philippe Flajolet (98 2) Abstract. Recently, 2-protected nodes were studied in the context of ordered trees and

More information

Combinatorial/probabilistic analysis of a class of search-tree functionals

Combinatorial/probabilistic analysis of a class of search-tree functionals Combinatorial/probabilistic analysis of a class of search-tree functionals Jim Fill jimfill@jhu.edu http://www.mts.jhu.edu/ fill/ Mathematical Sciences The Johns Hopkins University Joint Statistical Meetings;

More information

RENEWAL THEORY IN ANALYSIS OF TRIES AND STRINGS: EXTENDED ABSTRACT

RENEWAL THEORY IN ANALYSIS OF TRIES AND STRINGS: EXTENDED ABSTRACT RENEWAL THEORY IN ANALYSIS OF TRIES AND STRINGS: EXTENDED ABSTRACT SVANTE JANSON Abstract. We give a survey of a number of simple applications of renewal theory to problems on random strings, in particular

More information

The Subtree Size Profile of Bucket Recursive Trees

The Subtree Size Profile of Bucket Recursive Trees Iranian Journal of Mathematical Sciences and Informatics Vol., No. (206, pp - DOI: 0.7508/ijmsi.206.0.00 The Subtree Size Profile of Bucket Recursive Trees Ramin Kazemi Department of Statistics, Imam Khomeini

More information

The variance for partial match retrievals in k-dimensional bucket digital trees

The variance for partial match retrievals in k-dimensional bucket digital trees The variance for partial match retrievals in k-dimensional ucket digital trees Michael FUCHS Department of Applied Mathematics National Chiao Tung University January 12, 21 Astract The variance of partial

More information

Forty years of tree enumeration

Forty years of tree enumeration October 24, 206 Canada John Moon (Alberta) Counting labelled trees Canadian mathematical monographs What is our subject? Asymptotic enumeration or Probabilistic combinatorics? What is our subject? Asymptotic

More information

Asymptotic distribution of two-protected nodes in ternary search trees

Asymptotic distribution of two-protected nodes in ternary search trees Asymptotic distribution of two-protected nodes in ternary search trees Cecilia Holmgren Svante Janson March 2, 204; revised October 5, 204 Abstract We study protected nodes in m-ary search trees, by putting

More information

AVERAGE-CASE ANALYSIS OF COUSINS IN m-ary TRIES

AVERAGE-CASE ANALYSIS OF COUSINS IN m-ary TRIES J. Appl. Prob. 45, 888 9 (28) Printed in England Applied Probability Trust 28 AVERAGE-CASE ANALYSIS OF COUSINS IN m-ary TRIES HOSAM M. MAHMOUD, The George Washington University MARK DANIEL WARD, Purdue

More information

arxiv: v1 [math.pr] 21 Mar 2014

arxiv: v1 [math.pr] 21 Mar 2014 Asymptotic distribution of two-protected nodes in ternary search trees Cecilia Holmgren Svante Janson March 2, 24 arxiv:4.557v [math.pr] 2 Mar 24 Abstract We study protected nodes in m-ary search trees,

More information

Asymptotic variance of random symmetric digital search trees

Asymptotic variance of random symmetric digital search trees Discrete Mathematics and Theoretical Computer Science DMTCS vol. :,, 3 66 Asymptotic variance of random symmetric digital search trees Hsien-Kuei Hwang, Michael Fuchs and Vytas Zacharovas Institute of

More information

From the Discrete to the Continuous, and Back... Philippe Flajolet INRIA, France

From the Discrete to the Continuous, and Back... Philippe Flajolet INRIA, France From the Discrete to the Continuous, and Back... Philippe Flajolet INRIA, France 1 Discrete structures in... Combinatorial mathematics Computer science: data structures & algorithms Information and communication

More information

Partial match queries in random quadtrees

Partial match queries in random quadtrees Partial match queries in random quadtrees Nicolas Broutin nicolas.broutin@inria.fr Projet Algorithms INRIA Rocquencourt 78153 Le Chesnay France Ralph Neininger and Henning Sulzbach {neiningr, sulzbach}@math.uni-frankfurt.de

More information

An Analysis of the Height of Tries with Random Weights on the Edges

An Analysis of the Height of Tries with Random Weights on the Edges An Analysis of the Height of Tries with Random Weights on the Edges N. Broutin L. Devroye September 0, 2007 Abstract We analyze the weighted height of random tries built from independent strings of i.i.d.

More information

Lecture 3 - Expectation, inequalities and laws of large numbers

Lecture 3 - Expectation, inequalities and laws of large numbers Lecture 3 - Expectation, inequalities and laws of large numbers Jan Bouda FI MU April 19, 2009 Jan Bouda (FI MU) Lecture 3 - Expectation, inequalities and laws of large numbersapril 19, 2009 1 / 67 Part

More information

On Buffon Machines & Numbers

On Buffon Machines & Numbers On Buffon Machines & Numbers Philippe Flajolet, Maryse Pelletier, Michèle Soria AofA 09, Fréjus --- June 2009 [INRIA-Rocquencourt & LIP6, Paris] 1 1733: Countess Buffon drops her knitting kit on the floor.

More information

Compact Suffix Trees Resemble Patricia Tries: Limiting Distribution of Depth

Compact Suffix Trees Resemble Patricia Tries: Limiting Distribution of Depth Purdue University Purdue e-pubs Department of Computer Science Technical Reports Department of Computer Science 1992 Compact Suffix Trees Resemble Patricia Tries: Limiting Distribution of Depth Philippe

More information

Digital Trees and Memoryless Sources: from Arithmetics to Analysis

Digital Trees and Memoryless Sources: from Arithmetics to Analysis Digital Trees and Memoryless Sources: from Arithmetics to Analysis Philippe Flajolet, Mathieu Roux, Brigitte Vallée AofA 2010, Wien 1 What is a digital tree, aka TRIE? = a data structure for dynamic dictionaries

More information

Partial Fillup and Search Time in LC Tries

Partial Fillup and Search Time in LC Tries Partial Fillup and Search Time in LC Tries August 17, 2006 Svante Janson Wociech Szpankowski Department of Mathematics Department of Computer Science Uppsala University, P.O. Box 480 Purdue University

More information

Singularity Analysis: A Perspective

Singularity Analysis: A Perspective MSRI, Berkeley, June 2004 Singularity Analysis: A Perspective Philippe Flajolet (Inria, France) Analysis of Algorithms Average-Case, Probabilistic Properties of Random Structures? Counting and asymptotics

More information

Partial Match Queries in Relaxed Multidimensional Search Trees 1

Partial Match Queries in Relaxed Multidimensional Search Trees 1 Algorithmica 2001 29: 181 204 DOI: 10.1007/s004530010065 Algorithmica 2001 Springer-Verlag New York Inc. Partial Match Queries in Relaxed Multidimensional Search Trees 1 C. Martínez, 2 A. Panholzer, 3

More information

arxiv:math.pr/ v1 17 May 2004

arxiv:math.pr/ v1 17 May 2004 Probabilistic Analysis for Randomized Game Tree Evaluation Tämur Ali Khan and Ralph Neininger arxiv:math.pr/0405322 v1 17 May 2004 ABSTRACT: We give a probabilistic analysis for the randomized game tree

More information

LIMIT LAWS FOR SUMS OF FUNCTIONS OF SUBTREES OF RANDOM BINARY SEARCH TREES

LIMIT LAWS FOR SUMS OF FUNCTIONS OF SUBTREES OF RANDOM BINARY SEARCH TREES LIMIT LAWS FOR SUMS OF FUNCTIONS OF SUBTREES OF RANDOM BINARY SEARCH TREES Luc Devroye luc@cs.mcgill.ca Abstract. We consider sums of functions of subtrees of a random binary search tree, and obtain general

More information

Stationary independent increments. 1. Random changes of the form X t+h X t fixed h > 0 are called increments of the process.

Stationary independent increments. 1. Random changes of the form X t+h X t fixed h > 0 are called increments of the process. Stationary independent increments 1. Random changes of the form X t+h X t fixed h > 0 are called increments of the process. 2. If each set of increments, corresponding to non-overlapping collection of

More information

Analytic Information Theory: From Shannon to Knuth and Back. Knuth80: Piteaa, Sweden, 2018 Dedicated to Don E. Knuth

Analytic Information Theory: From Shannon to Knuth and Back. Knuth80: Piteaa, Sweden, 2018 Dedicated to Don E. Knuth Analytic Information Theory: From Shannon to Knuth and Back Wojciech Szpankowski Center for Science of Information Purdue University January 7, 2018 Knuth80: Piteaa, Sweden, 2018 Dedicated to Don E. Knuth

More information

Chapter 5 continued. Chapter 5 sections

Chapter 5 continued. Chapter 5 sections Chapter 5 sections Discrete univariate distributions: 5.2 Bernoulli and Binomial distributions Just skim 5.3 Hypergeometric distributions 5.4 Poisson distributions Just skim 5.5 Negative Binomial distributions

More information

Phase changes in random m-ary search trees and generalized quicksort

Phase changes in random m-ary search trees and generalized quicksort Phase changes in random m-ary search trees and generalized quicksort Hua-Huai Chern Department of Mathematics and Science Education Taipei Municipal Teacher College Taipei Taiwan Hsien-Kuei Hwang 2 Institute

More information

MAXIMAL CLADES IN RANDOM BINARY SEARCH TREES

MAXIMAL CLADES IN RANDOM BINARY SEARCH TREES MAXIMAL CLADES IN RANDOM BINARY SEARCH TREES SVANTE JANSON Abstract. We study maximal clades in random phylogenetic trees with the Yule Harding model or, equivalently, in binary search trees. We use probabilistic

More information

Analytic Pattern Matching: From DNA to Twitter. AxA Workshop, Venice, 2016 Dedicated to Alberto Apostolico

Analytic Pattern Matching: From DNA to Twitter. AxA Workshop, Venice, 2016 Dedicated to Alberto Apostolico Analytic Pattern Matching: From DNA to Twitter Wojciech Szpankowski Purdue University W. Lafayette, IN 47907 June 19, 2016 AxA Workshop, Venice, 2016 Dedicated to Alberto Apostolico Joint work with Philippe

More information

Everything You Always Wanted to Know about Quicksort, but Were Afraid to Ask. Marianne Durand

Everything You Always Wanted to Know about Quicksort, but Were Afraid to Ask. Marianne Durand Algorithms Seminar 200 2002, F. Chyzak (ed., INRIA, (2003, pp. 57 62. Available online at the URL http://algo.inria.fr/seminars/. Everything You Always Wanted to Know about Quicksort, but Were Afraid to

More information

50 years of Linear Probing Hashing

50 years of Linear Probing Hashing 50 years of Linear Probing Hashing Alfredo Viola Universidad de la República, Uruguay AofA 2013 Dedicated to Philippe Flajolet The problem. The table has m places to hash (park) from 0 to m 1, and n elements

More information

arxiv:cs/ v1 [cs.ds] 6 Oct 2005

arxiv:cs/ v1 [cs.ds] 6 Oct 2005 Partial Fillup and Search Time in LC Tries December 27, 2017 arxiv:cs/0510017v1 [cs.ds] 6 Oct 2005 Svante Janson Wociech Szpankowski Department of Mathematics Department of Computer Science Uppsala University,

More information

Lecture 2: Repetition of probability theory and statistics

Lecture 2: Repetition of probability theory and statistics Algorithms for Uncertainty Quantification SS8, IN2345 Tobias Neckel Scientific Computing in Computer Science TUM Lecture 2: Repetition of probability theory and statistics Concept of Building Block: Prerequisites:

More information

Topic 2: Probability & Distributions. Road Map Probability & Distributions. ECO220Y5Y: Quantitative Methods in Economics. Dr.

Topic 2: Probability & Distributions. Road Map Probability & Distributions. ECO220Y5Y: Quantitative Methods in Economics. Dr. Topic 2: Probability & Distributions ECO220Y5Y: Quantitative Methods in Economics Dr. Nick Zammit University of Toronto Department of Economics Room KN3272 n.zammit utoronto.ca November 21, 2017 Dr. Nick

More information

Searching with Dice A survey on randomized data structures

Searching with Dice A survey on randomized data structures Searching with Dice A survey on randomized data structures Conrado Martínez Univ. Politècnica de Catalunya, Spain May 11th, 2010 Journée inaugurale de l'équipe Combinatoire, algorithmique et interactions

More information

On joint subtree distributions under two evolutionary models

On joint subtree distributions under two evolutionary models Noname manuscript No. (will be inserted by the editor) On joint subtree distributions under two evolutionary models Taoyang Wu Kwok Pui Choi November 5, 2015 Abstract In population and evolutionary biology,

More information

Some Node Degree Properties of Series-Parallel Graphs Evolving Under a Stochastic Growth Model

Some Node Degree Properties of Series-Parallel Graphs Evolving Under a Stochastic Growth Model Some Node Degree Properties of Series-Parallel Graphs Evolving Under a Stochastic Growth Model Hosam M. Mahmoud The George Washington University Washington, DC 20052, USA 24th International Meeting on

More information

The Subtree Size Profile of Plane-oriented Recursive Trees

The Subtree Size Profile of Plane-oriented Recursive Trees The Subtree Size Profile of Plane-oriente Recursive Trees Michael FUCHS Department of Applie Mathematics National Chiao Tung University Hsinchu, 3, Taiwan Email: mfuchs@math.nctu.eu.tw Abstract In this

More information

Metric Diophantine Approximation for Formal Laurent Series over Finite Fields

Metric Diophantine Approximation for Formal Laurent Series over Finite Fields Metric Diophantine Approximation for Formal Laurent Series over Finite Fields Michael Fuchs Department of Applied Mathematics National Chiao Tung University Hsinchu, Taiwan Fq9, July 16th, 2009 Michael

More information

k-protected VERTICES IN BINARY SEARCH TREES

k-protected VERTICES IN BINARY SEARCH TREES k-protected VERTICES IN BINARY SEARCH TREES MIKLÓS BÓNA Abstract. We show that for every k, the probability that a randomly selected vertex of a random binary search tree on n nodes is at distance k from

More information

MAS113 Introduction to Probability and Statistics

MAS113 Introduction to Probability and Statistics MAS113 Introduction to Probability and Statistics School of Mathematics and Statistics, University of Sheffield 2018 19 Identically distributed Suppose we have n random variables X 1, X 2,..., X n. Identically

More information

Random trees and branching processes

Random trees and branching processes Random trees and branching processes Svante Janson IMS Medallion Lecture 12 th Vilnius Conference and 2018 IMS Annual Meeting Vilnius, 5 July, 2018 Part I. Galton Watson trees Let ξ be a random variable

More information

PROTECTED NODES AND FRINGE SUBTREES IN SOME RANDOM TREES

PROTECTED NODES AND FRINGE SUBTREES IN SOME RANDOM TREES PROTECTED NODES AND FRINGE SUBTREES IN SOME RANDOM TREES LUC DEVROYE AND SVANTE JANSON Abstract. We study protected nodes in various classes of random rooted trees by putting them in the general context

More information

Stat 134 Fall 2011: Notes on generating functions

Stat 134 Fall 2011: Notes on generating functions Stat 3 Fall 0: Notes on generating functions Michael Lugo October, 0 Definitions Given a random variable X which always takes on a positive integer value, we define the probability generating function

More information

arxiv: v1 [math.co] 1 Aug 2018

arxiv: v1 [math.co] 1 Aug 2018 REDUCING SIMPLY GENERATED TREES BY ITERATIVE LEAF CUTTING BENJAMIN HACKL, CLEMENS HEUBERGER, AND STEPHAN WAGNER arxiv:1808.00363v1 [math.co] 1 Aug 2018 ABSTRACT. We consider a procedure to reduce simply

More information

Tom Salisbury

Tom Salisbury MATH 2030 3.00MW Elementary Probability Course Notes Part V: Independence of Random Variables, Law of Large Numbers, Central Limit Theorem, Poisson distribution Geometric & Exponential distributions Tom

More information

Enumerating connected labelled graphs

Enumerating connected labelled graphs Enumerating connected labelled graphs Keith Briggs Keith.Briggs@bt.com more.btexact.com/people/briggsk2/ 2004 August 17 conn-lab-graphs.tex typeset 2004 August 17 10:40 in pdflatex on a linux system Enumerating

More information

6 The normal distribution, the central limit theorem and random samples

6 The normal distribution, the central limit theorem and random samples 6 The normal distribution, the central limit theorem and random samples 6.1 The normal distribution We mentioned the normal (or Gaussian) distribution in Chapter 4. It has density f X (x) = 1 σ 1 2π e

More information

Analytic Depoissonization and Its Applications

Analytic Depoissonization and Its Applications Analytic Depoissonization and Its Applications W. Szpankowski Department of Computer Science Purdue University W. Lafayette, IN 47907 April 29, 2008 AofA and IT logos This research is supported by NSF

More information

Problem Solving. Correlation and Covariance. Yi Lu. Problem Solving. Yi Lu ECE 313 2/51

Problem Solving. Correlation and Covariance. Yi Lu. Problem Solving. Yi Lu ECE 313 2/51 Yi Lu Correlation and Covariance Yi Lu ECE 313 2/51 Definition Let X and Y be random variables with finite second moments. the correlation: E[XY ] Yi Lu ECE 313 3/51 Definition Let X and Y be random variables

More information

ANALYSIS OF SOME TREE STATISTICS

ANALYSIS OF SOME TREE STATISTICS ANALYSIS OF SOME TREE STATISTICS ALOIS PANHOLZER Abstract. In this survey we present results and the used proof techniques concerning the analysis of several parameters in frequently used tree families.

More information

MA 575 Linear Models: Cedric E. Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2

MA 575 Linear Models: Cedric E. Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2 MA 575 Linear Models: Cedric E Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2 1 Revision: Probability Theory 11 Random Variables A real-valued random variable is

More information

Mathematical Statistics

Mathematical Statistics Mathematical Statistics Chapter Three. Point Estimation 3.4 Uniformly Minimum Variance Unbiased Estimator(UMVUE) Criteria for Best Estimators MSE Criterion Let F = {p(x; θ) : θ Θ} be a parametric distribution

More information

For a stochastic process {Y t : t = 0, ±1, ±2, ±3, }, the mean function is defined by (2.2.1) ± 2..., γ t,

For a stochastic process {Y t : t = 0, ±1, ±2, ±3, }, the mean function is defined by (2.2.1) ± 2..., γ t, CHAPTER 2 FUNDAMENTAL CONCEPTS This chapter describes the fundamental concepts in the theory of time series models. In particular, we introduce the concepts of stochastic processes, mean and covariance

More information

Probability and Statistics

Probability and Statistics Probability and Statistics 1 Contents some stochastic processes Stationary Stochastic Processes 2 4. Some Stochastic Processes 4.1 Bernoulli process 4.2 Binomial process 4.3 Sine wave process 4.4 Random-telegraph

More information

Lecture 2: Review of Probability

Lecture 2: Review of Probability Lecture 2: Review of Probability Zheng Tian Contents 1 Random Variables and Probability Distributions 2 1.1 Defining probabilities and random variables..................... 2 1.2 Probability distributions................................

More information

Chapter 4 : Expectation and Moments

Chapter 4 : Expectation and Moments ECE5: Analysis of Random Signals Fall 06 Chapter 4 : Expectation and Moments Dr. Salim El Rouayheb Scribe: Serge Kas Hanna, Lu Liu Expected Value of a Random Variable Definition. The expected or average

More information

1 Exercises for lecture 1

1 Exercises for lecture 1 1 Exercises for lecture 1 Exercise 1 a) Show that if F is symmetric with respect to µ, and E( X )

More information

Deterministic edge-weights in increasing tree families.

Deterministic edge-weights in increasing tree families. Deterministic edge-weights in increasing tree families. Markus Kuba and Stephan Wagner Institut für Diskrete Mathematik und Geometrie Technische Universität Wien Wiedner Hauptstr. 8-0/04, 040 Wien, Austria

More information

Compositions and samples of geometric random variables with constrained multiplicities

Compositions and samples of geometric random variables with constrained multiplicities FPSAC 2010, San Francisco, USA DMTCS proc. AN, 2010, 317 328 Compositions and samples of geometric random variables with constrained multiplicities Margaret Archibald 1 and Arnold Knopfmacher 2 and Toufik

More information

We introduce methods that are useful in:

We introduce methods that are useful in: Instructor: Shengyu Zhang Content Derived Distributions Covariance and Correlation Conditional Expectation and Variance Revisited Transforms Sum of a Random Number of Independent Random Variables more

More information

BT Research at Martlesham, Suffolk

BT Research at Martlesham, Suffolk Connectivity of random graphs Keith Briggs Keith.Briggs@bt.com research.btexact.com/teralab/keithbriggs.html Alan Turing Institute 2004 November 19 1330 typeset 2004 November 17 10:55 in pdflatex on a

More information

RANDOM RECURSIVE TREES AND PREFERENTIAL ATTACHMENT TREES ARE RANDOM SPLIT TREES

RANDOM RECURSIVE TREES AND PREFERENTIAL ATTACHMENT TREES ARE RANDOM SPLIT TREES RANDOM RECURSIVE TREES AND PREFERENTIAL ATTACHMENT TREES ARE RANDOM SPLIT TREES SVANTE JANSON Abstract. We consider linear preferential attachment trees, and show that they can be regarded as random split

More information

MULTIPLE CHOICE TRIES AND DISTRIBUTED HASH TABLES

MULTIPLE CHOICE TRIES AND DISTRIBUTED HASH TABLES MULTIPLE CHOICE TRIES AND DISTRIBUTED HASH TABLES Luc Devroye Gábor Lugosi Gahyun Park and Wojciech Szpankowski School of Computer Science ICREA and Department of Economics Department of Computer Sciences

More information

arxiv: v1 [q-bio.pe] 13 Aug 2015

arxiv: v1 [q-bio.pe] 13 Aug 2015 Noname manuscript No. (will be inserted by the editor) On joint subtree distributions under two evolutionary models Taoyang Wu Kwok Pui Choi arxiv:1508.03139v1 [q-bio.pe] 13 Aug 2015 August 14, 2015 Abstract

More information

Variations. ECE 6540, Lecture 02 Multivariate Random Variables & Linear Algebra

Variations. ECE 6540, Lecture 02 Multivariate Random Variables & Linear Algebra Variations ECE 6540, Lecture 02 Multivariate Random Variables & Linear Algebra Last Time Probability Density Functions Normal Distribution Expectation / Expectation of a function Independence Uncorrelated

More information

The Height of List-tries and TST

The Height of List-tries and TST Discrete Mathematics an Theoretical Computer Science (subm.), by the authors, 1 rev The Height of List-tries an TST N. Broutin 1 an L. Devroye 1 1 School of Computer Science, McGill University, 3480 University

More information

Asymmetric Rényi Problem

Asymmetric Rényi Problem Asymmetric Rényi Problem July 7, 2015 Abram Magner and Michael Drmota and Wojciech Szpankowski Abstract In 1960 Rényi in his Michigan State University lectures asked for the number of random queries necessary

More information

PROBABILITY AND INFORMATION THEORY. Dr. Gjergji Kasneci Introduction to Information Retrieval WS

PROBABILITY AND INFORMATION THEORY. Dr. Gjergji Kasneci Introduction to Information Retrieval WS PROBABILITY AND INFORMATION THEORY Dr. Gjergji Kasneci Introduction to Information Retrieval WS 2012-13 1 Outline Intro Basics of probability and information theory Probability space Rules of probability

More information

Recall that in order to prove Theorem 8.8, we argued that under certain regularity conditions, the following facts are true under H 0 : 1 n

Recall that in order to prove Theorem 8.8, we argued that under certain regularity conditions, the following facts are true under H 0 : 1 n Chapter 9 Hypothesis Testing 9.1 Wald, Rao, and Likelihood Ratio Tests Suppose we wish to test H 0 : θ = θ 0 against H 1 : θ θ 0. The likelihood-based results of Chapter 8 give rise to several possible

More information

Protected nodes and fringe subtrees in some random trees

Protected nodes and fringe subtrees in some random trees Electron. Commun. Probab. 19 (214), no. 6, 1 1. DOI: 1.1214/ECP.v19-348 ISSN: 183-589X ELECTRONIC COMMUNICATIONS in PROBABILITY Protected nodes and fringe subtrees in some random trees Luc Devroye Svante

More information

Additive tree functionals with small toll functions and subtrees of random trees

Additive tree functionals with small toll functions and subtrees of random trees Additive tree functionals with small toll functions and subtrees of random trees Stephan Wagner To cite this version: Stephan Wagner. Additive tree functionals with small toll functions and subtrees of

More information

Lecture 11. Multivariate Normal theory

Lecture 11. Multivariate Normal theory 10. Lecture 11. Multivariate Normal theory Lecture 11. Multivariate Normal theory 1 (1 1) 11. Multivariate Normal theory 11.1. Properties of means and covariances of vectors Properties of means and covariances

More information

COLLECTED WORKS OF PHILIPPE FLAJOLET

COLLECTED WORKS OF PHILIPPE FLAJOLET COLLECTED WORKS OF PHILIPPE FLAJOLET Editorial Committee: HSIEN-KUEI HWANG Institute of Statistical Science Academia Sinica Taipei 115 Taiwan ROBERT SEDGEWICK Department of Computer Science Princeton University

More information

Section 9.1. Expected Values of Sums

Section 9.1. Expected Values of Sums Section 9.1 Expected Values of Sums Theorem 9.1 For any set of random variables X 1,..., X n, the sum W n = X 1 + + X n has expected value E [W n ] = E [X 1 ] + E [X 2 ] + + E [X n ]. Proof: Theorem 9.1

More information

b. ( ) ( ) ( ) ( ) ( ) 5. Independence: Two events (A & B) are independent if one of the conditions listed below is satisfied; ( ) ( ) ( )

b. ( ) ( ) ( ) ( ) ( ) 5. Independence: Two events (A & B) are independent if one of the conditions listed below is satisfied; ( ) ( ) ( ) 1. Set a. b. 2. Definitions a. Random Experiment: An experiment that can result in different outcomes, even though it is performed under the same conditions and in the same manner. b. Sample Space: This

More information

Notes by Zvi Rosen. Thanks to Alyssa Palfreyman for supplements.

Notes by Zvi Rosen. Thanks to Alyssa Palfreyman for supplements. Lecture: Hélène Barcelo Analytic Combinatorics ECCO 202, Bogotá Notes by Zvi Rosen. Thanks to Alyssa Palfreyman for supplements.. Tuesday, June 2, 202 Combinatorics is the study of finite structures that

More information

CS37300 Class Notes. Jennifer Neville, Sebastian Moreno, Bruno Ribeiro

CS37300 Class Notes. Jennifer Neville, Sebastian Moreno, Bruno Ribeiro CS37300 Class Notes Jennifer Neville, Sebastian Moreno, Bruno Ribeiro 2 Background on Probability and Statistics These are basic definitions, concepts, and equations that should have been covered in your

More information

On the number of heaps and the cost of heap construction

On the number of heaps and the cost of heap construction On the number of heaps and the cost of heap construction Hsien-Kuei Hwang Institute of Statistical Science Academia Sinica Taipei 5 Taiwan Jean-Marc Steyaert LIX École Polytechnique 928 Palaiseau France

More information

Probability and Stochastic Processes

Probability and Stochastic Processes Probability and Stochastic Processes A Friendly Introduction Electrical and Computer Engineers Third Edition Roy D. Yates Rutgers, The State University of New Jersey David J. Goodman New York University

More information

Nodal Sets of High-Energy Arithmetic Random Waves

Nodal Sets of High-Energy Arithmetic Random Waves Nodal Sets of High-Energy Arithmetic Random Waves Maurizia Rossi UR Mathématiques, Université du Luxembourg Probabilistic Methods in Spectral Geometry and PDE Montréal August 22-26, 2016 M. Rossi (Université

More information

the electronic journal of combinatorics 4 (997), #R7 2 (unbiased) coin tossing process by Prodinger [9] (cf. also []), who provided the rst non-trivia

the electronic journal of combinatorics 4 (997), #R7 2 (unbiased) coin tossing process by Prodinger [9] (cf. also []), who provided the rst non-trivia ANALYSIS OF AN ASYMMETRIC LEADER ELECTION ALGORITHM Svante Janson Wojciech Szpankowski Department of Mathematics Department of Computer Science Uppsala University Purdue University 756 Uppsala W. Lafayette,

More information

The Contraction Method on C([0, 1]) and Donsker s Theorem

The Contraction Method on C([0, 1]) and Donsker s Theorem The Contraction Method on C([0, 1]) and Donsker s Theorem Henning Sulzbach J. W. Goethe-Universität Frankfurt a. M. YEP VII Probability, random trees and algorithms Eindhoven, March 12, 2010 joint work

More information

Today we will prove one result from probability that will be useful in several statistical tests. ... B1 B2 Br. Figure 23.1:

Today we will prove one result from probability that will be useful in several statistical tests. ... B1 B2 Br. Figure 23.1: Lecture 23 23. Pearson s theorem. Today we will prove one result from probability that will be useful in several statistical tests. Let us consider r boxes B,..., B r as in figure 23.... B B2 Br Figure

More information

A Note on the Approximation of Perpetuities

A Note on the Approximation of Perpetuities Discrete Mathematics and Theoretical Computer Science (subm.), by the authors, rev A Note on the Approximation of Perpetuities Margarete Knape and Ralph Neininger Department for Mathematics and Computer

More information

Mod-φ convergence I: examples and probabilistic estimates

Mod-φ convergence I: examples and probabilistic estimates Mod-φ convergence I: examples and probabilistic estimates Valentin Féray (joint work with Pierre-Loïc Méliot and Ashkan Nikeghbali) Institut für Mathematik, Universität Zürich Summer school in Villa Volpi,

More information

On finding a minimum spanning tree in a network with random weights

On finding a minimum spanning tree in a network with random weights On finding a minimum spanning tree in a network with random weights Colin McDiarmid Department of Statistics University of Oxford Harold S. Stone IBM T.J. Watson Center 30 August 1996 Theodore Johnson

More information

Monte-Carlo MMD-MA, Université Paris-Dauphine. Xiaolu Tan

Monte-Carlo MMD-MA, Université Paris-Dauphine. Xiaolu Tan Monte-Carlo MMD-MA, Université Paris-Dauphine Xiaolu Tan tan@ceremade.dauphine.fr Septembre 2015 Contents 1 Introduction 1 1.1 The principle.................................. 1 1.2 The error analysis

More information

7. The Multivariate Normal Distribution

7. The Multivariate Normal Distribution of 5 7/6/2009 5:56 AM Virtual Laboratories > 5. Special Distributions > 2 3 4 5 6 7 8 9 0 2 3 4 5 7. The Multivariate Normal Distribution The Bivariate Normal Distribution Definition Suppose that U and

More information