A Multivariate Normal Law for Turing s Formulae

Similar documents
A Multivariate Normal Law for Turing s Formulae

3.1 Random variables

ON INDEPENDENT SETS IN PURELY ATOMIC PROBABILITY SPACES WITH GEOMETRIC DISTRIBUTION. 1. Introduction. 1 r r. r k for every set E A, E \ {0},

On the Poisson Approximation to the Negative Hypergeometric Distribution

Multiple Criteria Secretary Problem: A New Approach

Pearson s Chi-Square Test Modifications for Comparison of Unweighted and Weighted Histograms and Two Weighted Histograms

Journal of Inequalities in Pure and Applied Mathematics

Hypothesis Test and Confidence Interval for the Negative Binomial Distribution via Coincidence: A Case for Rare Events

KOEBE DOMAINS FOR THE CLASSES OF FUNCTIONS WITH RANGES INCLUDED IN GIVEN SETS

Introduction to Mathematical Statistics Robert V. Hogg Joeseph McKean Allen T. Craig Seventh Edition

arxiv: v1 [math.co] 4 May 2017

arxiv: v1 [math.nt] 12 May 2017

Goodness-of-fit for composite hypotheses.

Results on the Commutative Neutrix Convolution Product Involving the Logarithmic Integral li(

Numerical approximation to ζ(2n+1)

Solution to HW 3, Ma 1a Fall 2016

PROBLEM SET #1 SOLUTIONS by Robert A. DiStasio Jr.

What Form of Gravitation Ensures Weakened Kepler s Third Law?

Measure Estimates of Nodal Sets of Polyharmonic Functions

Asymptotically Lacunary Statistical Equivalent Sequence Spaces Defined by Ideal Convergence and an Orlicz Function

GROWTH ESTIMATES THROUGH SCALING FOR QUASILINEAR PARTIAL DIFFERENTIAL EQUATIONS

6 PROBABILITY GENERATING FUNCTIONS

Unobserved Correlation in Ascending Auctions: Example And Extensions

THE NUMBER OF TWO CONSECUTIVE SUCCESSES IN A HOPPE-PÓLYA URN

Quasi-Randomness and the Distribution of Copies of a Fixed Graph

ON LACUNARY INVARIANT SEQUENCE SPACES DEFINED BY A SEQUENCE OF MODULUS FUNCTIONS

New problems in universal algebraic geometry illustrated by boolean equations

LET a random variable x follows the two - parameter

Semicanonical basis generators of the cluster algebra of type A (1)

SOME GENERAL NUMERICAL RADIUS INEQUALITIES FOR THE OFF-DIAGONAL PARTS OF 2 2 OPERATOR MATRICES

On the Quasi-inverse of a Non-square Matrix: An Infinite Solution

On the ratio of maximum and minimum degree in maximal intersecting families

JENSEN S INEQUALITY FOR DISTRIBUTIONS POSSESSING HIGHER MOMENTS, WITH APPLICATION TO SHARP BOUNDS FOR LAPLACE-STIELTJES TRANSFORMS

Do Managers Do Good With Other People s Money? Online Appendix

On the ratio of maximum and minimum degree in maximal intersecting families

Surveillance Points in High Dimensional Spaces

Lecture 8 - Gauss s Law

On decompositions of complete multipartite graphs into the union of two even cycles

6 Matrix Concentration Bounds

Lecture 28: Convergence of Random Variables and Related Theorems

Multiple Experts with Binary Features

Central Coverage Bayes Prediction Intervals for the Generalized Pareto Distribution

Asymptotic Normality in Monte Carlo Integration

Brief summary of functional analysis APPM 5440 Fall 2014 Applied Analysis

SUFFICIENT CONDITIONS FOR MAXIMALLY EDGE-CONNECTED AND SUPER-EDGE-CONNECTED GRAPHS DEPENDING ON THE CLIQUE NUMBER

Math 124B February 02, 2012

Method for Approximating Irrational Numbers

ANA BERRIZBEITIA, LUIS A. MEDINA, ALEXANDER C. MOLL, VICTOR H. MOLL, AND LAINE NOBLE

Math 301: The Erdős-Stone-Simonovitz Theorem and Extremal Numbers for Bipartite Graphs

NOTE. Some New Bounds for Cover-Free Families

A Bijective Approach to the Permutational Power of a Priority Queue

Fractional Zero Forcing via Three-color Forcing Games

Estimation of the Correlation Coefficient for a Bivariate Normal Distribution with Missing Data

ON THE INVERSE SIGNED TOTAL DOMINATION NUMBER IN GRAPHS. D.A. Mojdeh and B. Samadi

556: MATHEMATICAL STATISTICS I

Lacunary I-Convergent Sequences

Stanford University CS259Q: Quantum Computing Handout 8 Luca Trevisan October 18, 2012

STUDY OF SOLUTIONS OF LOGARITHMIC ORDER TO HIGHER ORDER LINEAR DIFFERENTIAL-DIFFERENCE EQUATIONS WITH COEFFICIENTS HAVING THE SAME LOGARITHMIC ORDER

1. Review of Probability.

On Computing Optimal (Q, r) Replenishment Policies under Quantity Discounts

Relating Branching Program Size and. Formula Size over the Full Binary Basis. FB Informatik, LS II, Univ. Dortmund, Dortmund, Germany

On the global uniform asymptotic stability of time-varying dynamical systems

1) (A B) = A B ( ) 2) A B = A. i) A A = φ i j. ii) Additional Important Properties of Sets. De Morgan s Theorems :

Chapter 3: Theory of Modular Arithmetic 38

YINGHUA DONG AND GENNADY SAMORODNITSKY

EM Boundary Value Problems

JANOWSKI STARLIKE LOG-HARMONIC UNIVALENT FUNCTIONS

Control Chart Analysis of E k /M/1 Queueing Model

Hydroelastic Analysis of a 1900 TEU Container Ship Using Finite Element and Boundary Element Methods

On a quantity that is analogous to potential and a theorem that relates to it

Probablistically Checkable Proofs

Temporal-Difference Learning

The Chromatic Villainy of Complete Multipartite Graphs

k. s k=1 Part of the significance of the Riemann zeta-function stems from Theorem 9.2. If s > 1 then 1 p s

4/18/2005. Statistical Learning Theory

Functions Defined on Fuzzy Real Numbers According to Zadeh s Extension

arxiv: v1 [math.nt] 28 Oct 2017

SOME SOLVABILITY THEOREMS FOR NONLINEAR EQUATIONS

Gradient-based Neural Network for Online Solution of Lyapunov Matrix Equation with Li Activation Function

1 Notes on Order Statistics

Mathematical Model of Magnetometric Resistivity. Sounding for a Conductive Host. with a Bulge Overburden

Absorption Rate into a Small Sphere for a Diffusing Particle Confined in a Large Sphere

arxiv: v1 [math.co] 6 Mar 2008

Liquid gas interface under hydrostatic pressure

Random Variables and Probability Distribution Random Variable

Research Article On Alzer and Qiu s Conjecture for Complete Elliptic Integral and Inverse Hyperbolic Tangent Function

2. Electrostatics. Dr. Rakhesh Singh Kshetrimayum 8/11/ Electromagnetic Field Theory by R. S. Kshetrimayum

Failure Probability of 2-within-Consecutive-(2, 2)-out-of-(n, m): F System for Special Values of m

A Backward Identification Problem for an Axis-Symmetric Fractional Diffusion Equation

Bayesian Analysis of Topp-Leone Distribution under Different Loss Functions and Different Priors

The Erdős-Hajnal conjecture for rainbow triangles

q i i=1 p i ln p i Another measure, which proves a useful benchmark in our analysis, is the chi squared divergence of p, q, which is defined by

Web-based Supplementary Materials for. Controlling False Discoveries in Multidimensional Directional Decisions, with

Chapter 3 Optical Systems with Annular Pupils

Weighted least-squares estimators of parametric functions of the regression coefficients under a general linear model

On the integration of the equations of hydrodynamics

Localization of Eigenvalues in Small Specified Regions of Complex Plane by State Feedback Matrix

Vanishing lines in generalized Adams spectral sequences are generic

Identification of the degradation of railway ballast under a concrete sleeper

arxiv: v1 [math.ca] 12 Mar 2015

Transcription:

A Multivaiate Nomal Law fo Tuing s Fomulae Zhiyi Zhang Depatment of Mathematics and Statistics Univesity of Noth Caolina at Chalotte Chalotte, NC 28223 Abstact This pape establishes a sufficient condition fo Tuing s fomulae of vaious odes to have asymptotic multivaiate nomality. 1 Intoduction. Conside a multinomial distibution with its countably infinite numbe of pescibed categoies indexed by K = {; = 1, } and its categoy pobabilities denoted by {p }, satisfying 0 < p < 1 fo all and p = 1, whee the sum without index is ove all as is obseved in the subsequent text unless othewise stated. Let the categoy counts in an iid sample of size n fom the undelying population be denoted by {X ; 1} and its obseved values by {x ; 1}. Fo a given sample, thee ae at most n non-zeo x s. Let, fo evey intege, 1 n, N = 1 X =], T = ) n n 1 ) 1 N = n + 1 N, and π 1 = p 1 X = 1]. N and π 1 may be thought of as, espectively, the numbe of categoies in the population that ae epesented exactly times in the sample and the total pobability associated with all the categoies that ae epesented exactly 1 times in the sample. T may be thought of as an estimato of π 1. T is also nown as Tuing s fomula of the th ode intoduced by Good 1953). AMS 2000 Subject Classifications. Pimay 62f10, 62F12, 62G05, 62G20; seconday 62F15. Keywods and phases. High ode Tuing s fomulae, asymptotic multivaiate nomality. Reseach patially suppoted by NSF Gants DMS 1004769 1

Nomality of Tuing s Fomulae 2 Pehaps the most inteesting case among all Tuing s fomulae of diffeent odes is T 1, nown as just Tuing s fomula, as an estimato of π 0. Given a sample, π 0 epesents the total pobability associated with categoies not obseved in the sample, which is also the pobability that the next obsevation will belong to a categoy peviously unseen. Since the multinomial model is essentially nonpaametic, the fact that something could be said about the total pobability associated with unobseved categoies is somewhat anti-intuitive. The statistical popeties of Tuing s fomula wee lagely unnown until Robbins 1968) gave an intepetation in tems of bias. Anothe fifteen yeas would pass befoe Esty 1983) gave a sufficient condition fo the asymptotic nomality of T 1 π 0. In ecent yeas, eseach on Tuing s fomula has been evitalized. Zhang and Huang 2007) gave anothe intepetation of Tuing s fomula and poposed an impoved vesion of the fomula which essentially eliminated all the bias of Tuing s oiginal fomula. Zhang and Huang 2008) gave a sufficient condition fo the nomality of Tuing s fomula which suppots a non-empty class of fixed distibutions. Zhang and Zhang 2009) gave a necessay and sufficient condition fo the nomality of Tuing s fomula. Howeve all the wos thus fa ae on Tuing s fomula of the fist ode. Pio to this pape, the distibutional popeties of high ode Tuing s fomulae ae unnown. Fo any fixed intege R 1, let T R = T 1 π 0,, T R π R 1 ). The objective of this pape is to establish the asymptotic multivaiate nomality of T R unde cetain conditions. Towad that end, the fist step is to establish the asymptotic nomality of T π 1, fo a fixed, 1 R, i.e., to show, unde cetain conditions, fo some gn), gn)t π 1 ) L N0, σ 2 ) whee σ 2 is a function of {p }. The esult is deived in Section 3. The esults on the multivaiate nomality of T R ae deived in Section 4.

Nomality of Tuing s Fomulae 3 2 Peliminay Results. Let K 1 = {1} and K 2 = {2, }. Fo any K = K 1 K 2, let p x = 1, f x) = /n + 1) x =, 0 0 x 2 o x + 1, 2.1) and Z = f X ). The objective is to deive the asymptotic behavio of Zgn), whee gn) is a function of n satisfying gn) = On 1 2δ ) 2.2) fo some δ 0, 1/4), in tems of the limit of its chaacteistic function, EexpisZgn))]. Let Z = Z 1 + Z 2, whee Z 1 = K 1 f X ) and Z 2 = K 2 f X ). Lemma 2.1 below is a well-nown fact and Lemma 2.2 is due to Batlett 1938). Lemma 2.1 Let {X } be the counts of obsevations in categoy, = 1, 2,, in an iid sample unde the multinomial model with pobability distibution {p }. Then P X = x ; = 1, ) = P Y = x ; = 1, Y = n) whee {Y } ae independent Poisson andom vaiables with mean np. Lemma 2.2 Let U, V ) be a two-dimensional andom vecto with U intege valued. Then EexpivV U = n)) = 2πP U = n)) 1 π π EexpiuU n) + ivv )]du. Thus EexpisZgn))) is 1 π 2πP Y = n)) E exp iu )] Y np ) + iszgn) du. π By Stiling s fomula, 2πn) 1/2 P Y = n) 1. Theefoe it suffices to evaluate the limit of H n s) = n 2π π π Eexpiu Y np ) + iszgn))]du,

Nomality of Tuing s Fomulae 4 o letting t = un 1/2, H n s) = 1 + 1 t <π n] Eexpin) 1/2 t Y np ) + iszgn))]dt. 2.3) 2π Let h n = 1 t < π n]eexpin) 1/2 t Y np ) + iszgn)] h n1 = 1 t < π n]eexpin) 1/2 ty 1 np 1 ) + isz 1 gn)] 2.4) h n2 = 1 t < π n]eexpin) 1/2 t K 2 Y np ) + isz 2 gn)], H n s) = 1 + h n dt = 1 + h n1 h n2 dt. 2.5) 2π 2π The fist tas is to allow the limit opeato to change place with the integal opeato, i.e., to show lim H n s) = 1 2π lim hn dt whee lim = lim n as is obseved elsewhee in the subsequent text. The ey element to suppot this exchange is lim h n1 dt = lim h n1 dt, 2.6) whee h n1 = 1 t π n] {exp itn 1/2 p 1 ) expnp 1 e itn 1/2 1)] +2np 1 ) 1 exp np 1 )/ 1)! + np 1 ) exp np 1 )/!] } is an uppe bound fo h n1 and hence, since h n2 1 implies h n h n1, an uppe bound fo h n. The poof of 2.6) is given by Zhang and Huang 2008) fo a special case of = 1, howeve the poof is also valid fo any 1. By 2.6) and the extended dominated convegence theoem of Patt 1960), the following lemma is established. Lemma 2.3 Let h n and H n be as defined in 2.3) and 2.5) espectively. Then lim H n = 1 2π lim h n dt.

Nomality of Tuing s Fomulae 5 Fo each, it can be veified that, letting B = exp itp n 1/2 )expnp expitn 1/2 ) 1))] C = exp itp n 1/2 )expisp gn)) 1) expit 1)n 1/2 ) exp np ) np ) 1 1)! D = exp itp n 1/2 )exp isn + 1) 1 gn)) 1] expitn 1/2 ) exp np ) np )!, and E = C + D, h n = B + E ) fo all t 0 ± π n. The objective is to evaluate lim B + E ). The following two lemmas ae given by Esty 1983) whee is equality in the limit as is obseved elsewhee in the subsequent text. Lemma 2.4 Let {β } and {ϵ } be two sequences of complex numbes, and M n be a sequence of subsets of K, indexed by n. If 1. M n β β, 2. M n ϵ ) ϵ, 3. β 1 unifomly, 4. ϵ 0 unifomly, 5. thee exists a constants, δ 1 such that, M n β 1 δ 1, and 6. thee exists a constants, δ 2 such that, M n ϵ δ 2 ; then whee β and ϵ may also depend on n. M n β + ϵ ) βe ϵ Lemma 2.5 Fo all K, B = exp t 2 /2)p + Ot 3 p n 1/2 )). The next lemma includes thee tivial but useful facts.

Nomality of Tuing s Fomulae 6 Lemma 2.6 1. Fo any complex numbe x satisfying x < 1, ln1 + x) x 1 x. ) 2. Fo any eal numbe x 0, 1), 1 x exp x 1 x. 3. Fo any eal numbe x 0, 1/2), 1 1 x < 1 + 2x. Poof. 1) By Taylo s fomula, ln1 + x) = j=1 1) j+1 x j /j j=1 x j = x /1 x ). 2) The function y = 1 1+t et is stictly inceasing ove 0, ), and has value 1 at t = 0. Theefoe 1 1+t et 1 fo t 0, ). The desied inequality follows the change of vaiable x = t/1 + t). 3) The poof is tivial. Conside a patition of the index set K = I II whee I = {; p /n 1 δ } and II = {; p > /n 1 δ } whee δ = δ/r + 1) and δ is as in 2.2). Lemma 2.7 a) II E 0; and b) II B + E )/ II B 1. Poof. a) II E 2 II e np np ) 1) / 1)! + e np np ) /!]. Since the deivative of e np np) 1) / 1)! + e np np) /!] with espect to p is negative fo all p /n, 1] and theefoe fo all p /n 1 δ, 1]), e np np ) 1) / 1)! + e np np ) /!] attains its maximum at p = /n 1 δ, fo evey II, with value e nδ On δ ). The total numbe of indices in II is less o equal to n 1 δ /. Theefoe II E 2n 1 δ /]e nδ On δ )] = 2/)e nδ O n 1+ 1)δ ) 0. b) By Lemma 2.5, B is bounded away fom zeo, and by the fact that lim E = 0 and hence lim E / B = 0), and by applying the fist pat of Lemma 2.6 with x = E /B, one has ln II B + E )/ II B ] = ) ) II 1 ln + E B II ln 1 + E B ) E II B E = O II E ) 0.

Nomality of Tuing s Fomulae 7 The following is a sufficient condition unde which many of the subsequent esults ae established. Condition 2.1 As n, 1. n 2 g 2 n)p e np c 0, 2. n 1 g 2 n)p +1 e np c +1 0, and 3. c + c +1 > 0. Lemma 2.8 Unde Condition 2.1, all the conditions of Lemma 2.4 ae satisfied with M n = I, β = B, β = B = lim B, ϵ = E, and ϵ = E = lim E. The poof of Lemma 2.8 is given in Appendix. Lemma 2.4 and Lemma 2.8 give immediately the following coollay. Coollay 2.1 Unde Condition 2.1, I B + E ) I B exp I E ). Lemma 2.9 Unde Condition 2.1, B + E ) Be E, whee B = lim B and E = lim E. Poof. B + E ) = I B + E ) II B + E ) I B + E ) II B by Lemma 2.7) I B exp I E ) II B by Lemma 2.8) B exp E ) by Lemma 2.7). 3 Univaiate Nomality. Theoem 3.1 Let gn) be as in 2.2). Unde Condition 2.1, gn)t π 1 ) L N 0, c +1 + c 1)! ).

Nomality of Tuing s Fomulae 8 Poof. Since lim B = e t2 2, by a) of Lemma 2.7 and 5.2), s2 lim E = 2 1)! ) 1 lim H n = 2π e t2 2 dt e lim n 1 g 2 n)p +1 e np + lim g 2 n)n p n + 1) 2 e np ] s2 2 1)! lim n 1 g 2 n)p +1 e np g 2 n)n p + lim n +1) 2 e np ], = e s2 2 c+1 1)! + c 1)! ]. Conside the following condition: Condition 3.1 As n, 1. g2 n) EN n 2 ) c! 0, 2. g2 n) EN n 2 +1 ) c +1 +1)! 0, and 3. c + c +1 > 0. Lemma 3.1 Condition 2.1 and Condition 3.1 ae equivalent. The poof of Lemma 3.1 is given in Appendix. Lemma 3.1 allows a e-statement of Theoem 3.1: Theoem 3.2 If thee exists a gn) satisfying 2.2) and Condition 3.1, then nt π 1 ) 2 EN ) + + 1)EN +1 ) L N0, 1). Theoem 3.3 If thee exists a gn) satisfying 2.2) and Condition 3.1, then nt π 1 ) 2 N + + 1)N +1 L N0, 1). The poof of Theoem 3.3 is given in Appendix. It may be of inteest to note that the esults of Theoems 3.2 and 3.3 equie no futhe nowledge of gn), i.e., the nowledge of δ, othe than its existence.

Nomality of Tuing s Fomulae 9 4 Multivaiate Nomality. Fo evey K, any two constants a and b, and any two positive integes 1 and 2, let f x) in 2.1) be edefined as f x) = ap x = 1 1, a 1 /n 1 + 1) x = 1 2 1, bp x = 2 1 1, b 2 /n 2 + 1) x = 2, bp a 1 /n 1 + 1) x = 1 = 2 1, 0 elsewhee, 4.1) and Z = f X ). The objective is to evaluate lim H n s) = 2π) 1/2 lim h n dt whee H n s) and h n have the same foms as in 2.4) and 2.5) but with f x) edefined in 4.1). Two sepaate cases ae to be consideed: 1 < 2 1 and 1 = 2 1. Let B = exp itp n 1/2 )expnp expitn 1/2 ) 1))] C = exp itp n 1/2 )expisap gn)) 1) expit 1 1)n 1/2 ) np ) 1 1 1 1)! e np D = exp itp n 1/2 )exp isa 1 n 1 + 1) 1 gn)) 1] expit 1 n 1/2 ) np ) 1 1! e np F = exp itp n 1/2 )expisbp gn)) 1) expit 2 1)n 1/2 ) np ) 2 1 2 1)! e np G = exp itp n 1/2 )exp isb 2 n 2 + 1) 1 gn)) 1] expit 1 n 1/2 ) np ) 2 2! e np A = exp itp n 1/2 ){expisgn)bp a n 1 +1 ) 1]} expit 1n 1/2 ) np ) 1 1! e np. If 1 < 2 1, let E = C + D + F + G. If 1 = 2 1, let E = C + A + G. It can be veified that, in eithe case, h n = B + E ) fo all t 0 ± π n. The objective is to evaluate lim B + E ). 1 Condition 4.1 As n, 1. g2 n) n 2 EN 1 ) c 1 1! 0, 2. g2 n) n 2 EN 1 +1) c 1 +1 1 +1)! 0,

Nomality of Tuing s Fomulae 10 3. c 1 + c 1 +1 > 0, 4. g2 n) n 2 EN 2 ) c 2 2! 0, 5. g2 n) EN n 2 2 +1) c 2 +1 2 +1)! 0, and 6. c 2 + c 2 +1 > 0. Lemma 4.1 Fo any two constants, a and b satisfying a 2 + b 2 > 0, assuming that 1 < 2 1 and that Condition 4.1 holds, then gn)at 1 π 1 1) + bt 2 π 2 1)] L N0, σ 2 ) whee σ 2 = a 2 c 1 +1+ 1 c 1 1 1)! + b 2 c 2 +1+ 2 c 2 2 1)!. The poof of Lemma 4.1 is staight fowad in light of the agument that led to Theoem 3.1. Lemma 4.2 Fo any two constants, a and b satisfying a 2 + b 2 > 0, assuming that 1 = 2 1 and that Condition 4.1 holds, then gn)at 1 π 1 1) + bt 2 π 2 1)] L N0, σ 2 ) whee σ 2 = a 2 c 1 +1+ 1 c 1 1 1)! 2ab c 2 1 1)! + b2 c 2 +1+ 2 c 2 2 1)!. The poof of Lemma 4.2 is also staight fowad in light of the agument that led to Theoem 3.1, but with an additional non-vanishing tem in the limit. Let σ 2 = 2 EN ) + + 1)EN +1 ), ρ n) = + 1)EN +1 )/σ σ +1 ), ρ = lim ρ n), ˆσ 2 = 2 N + + 1)N +1, and ˆρ = ˆρ n) = + 1)N +1 / ˆσ 2 ˆσ +1 2. Coollay 4.1 Assume that 1 < 2 1 and that Condition 4.1 holds, then n T 1 π 1 1)/σ 1, T 2 π 2 1)/σ 2 ] L MV N 0, I 2 2 ).

Nomality of Tuing s Fomulae 11 Coollay 4.2 Assume that 1 = 2 1 and that Condition 4.1 holds, then n T 1 π 1 1)/σ 1, T 2 π 2 1)/σ 2 ] L 1 ρ1 MV N 0, ρ 1 1 )). Rema 4.1 Coollaies 4.1 and 4.2 suggest that, in {nt π 1 )/σ ]; = 1,, R}, any two enties ae asymptotically independent unless they ae immediate neighbos in the seies. Theoem 4.1 Fo any positive intege R, if Condition 3.1 holds fo evey, 1 R, then n T 1 π 0 )/σ 1,, T R π R 1 )/σ R ] L MV N0, Σ) whee Σ = a i,j ) is a R R covaiance matix with all the diagonal elements being a, = 1 fo = 1,, R, the supe-diagonal and the sub-diagonal elements being a,+1 = a +1, = ρ fo = 1,, R 1, and all the othe off-diagonal elements being zeos. Let ˆΣ be the esulting matix of Σ with ρ eplaced by ˆρ n) fo all. Let ˆΣ 1 denote the invese of ˆΣ and ˆΣ 1/2 denote any R R matix satisfying ˆΣ 1 = ˆΣ 1/2 ˆΣ 1/2. Theoem 4.2 Fo any positive intege R, if Condition 3.1 holds fo evey, 1 R, then nˆσ 1/2 T 1 π 0 )/ˆσ 1,, T R π R 1 )/ˆσ R ] L MV N0, I R R ). An inteesting special case of discete distibution is that of {p } following a discete powe law, as nown as a Paeto law, in the tail, i.e., p = C λ 4.2) fo all > d whee C > 0 and λ > 1 ae unnown paametes descibing the tail of the pobability distibution beyond an unnown positive intege d. This patially paametic pobability model is subsequently efeed to as the tail model. Suppose it is of inteest to estimate C and λ. An estimation pocedue is poposed in this section.

Nomality of Tuing s Fomulae 12 Lemma 4.3 Unde the model in 4.2), Condition 4.1 holds. Poof. Letting δ = 4λ) 1 in 2.2), it can be veified that n 2 g 2 n) p e np c > 0 fo evey intege > 0. Coollay 4.3 Unde the model in 4.2), the esults of both Theoems 4.1 and 4.2 hold. 5 Appendix. 5.1 Poof of Lemma 2.8. All six conditions in Lemma 2.4 need to be checed. 3) is tue because and p and p / n ae unifomly bounded by Fo 1), since I p 0, B = exp t 2 /2)p ) expot 3 / n)p ))), n and 1 δ nn 1 δ espectively. B = exp t 2 /2) I I p ) expot 3 / n) I p ))) 1. Fo 4), it suffices to show that C and D espectively convege to zeo unifomly. Fist fo all I, exp itp n) 1 unifomly since p n n gn)n δ = On 1/2+δ ) 0 unifomly. Second, expit 1)n 1/2 ) 0 and expitn 1/2 ) 0 unifomly. Thid, exp np ) 1 unifomly. By Taylo s expansion and fo sufficiently lage n, expisp gn)) 1] np ) 1 1)! = isgn)p s2 g 2 n)p 2 2! O s 3 g 3 n)p 3 ) ) np ) 1 1)! = isn 1 gn)p 1)! s2 n 1 g 2 n)p +1 2! 1)! isn 1 gn)p 1)! + s 2 n 1 g 2 n)p +1 2! 1)! O s 3 n 1 g 3 n)p +2 ) O + s 3 n 1 g 3 n)p +2 ) s 1)! n 2δ+ R+1 δ + s2 +1 2! 1)! +1 n 4δ+ R+1 δ +2 6δ+ + O n δ) R+1 0 unifomly.

Nomality of Tuing s Fomulae 13 Similaly, it is easily checed that exp isn + 1) 1 gn)) 1 ] )) np )! = isgn) n +1 s2 2 g 2 n) + O s 3 3 g 3 n) np ) 2!n +1) 2 3!n +1) 3! = isgn)n p!n +1) s2 2 g 2 n)n p 2!!n +1) 2 ) s + O 3 g 3 n)n p n +1) 3 ) isgn)n p!n +1) + s2 2 g 2 n)n p + 2!!n +1) 2 O s 3 g 3 n)n p n +1) 3 s n 1)! n +1 n 2δ+ R+1 δ + s2 +2 2!! n 2 n 4δ+ n +1) 2 R+1 δ + O n 3 n 6δ+ n +1) 3 R+1 δ) 0 unifomly. Theefoe E 0 unifomly. Fo 2) and 6), E = e np exp itp n) expit 1)n 1/2 isn ) 1 gn)p )] O s 3 n 1 g 3 n)p +2 1)! s2 n 1 g 2 n)p +1 2! 1)! )] +e np exp itp n) expitn 1/2 ) isgn)n p!n +1) s2 2 g 2 n)n p s + O 3 g 3 n)n p 2!!n +1) 2 n +1) 3 = e np exp itp n) expit 1)n 1/2 isn ) 1 gn)p )] O s 3 n 1 g 3 n)p +2 1)! s2 n 1 g 2 n)p +1 2! 1)! +e np exp itp n) expitn 1/2 ) isgn)n 1 p 1)! is 1)gn)n 1 p 1)!n +1) s2 2 g 2 n)n p 2!!n +1) 2 )] s +O 3 g 3 n)n p n +1) 3 { = e np e itp n e it 1)n 1/2 isn 1 gn)p 1)! s2 n 1 g 2 n)p +1 2! 1)! O s 3 n 1 g 3 n)p +2 ) ) + 1 + it n t2 it3 2!n O ) isgn)n 1 p 3!n 3/2 1)! is 1)gn)n 1 p 1)!n +1) s2 2 g 2 n)n p 2!!n +1) 2 )]} s +O 3 g 3 n)n p n +1) 3

Nomality of Tuing s Fomulae 14 { = e np e itp n e it 1)n 1/2 is 1)gn)n 1 p 1)!n +1) s2 n 1 g 2 n)p +1 2! 1)! s2 2 g 2 n)n p 2!!n +1) 2 ) ) s +O 3 g 3 n)n p O s n +1) 3 n 1 g 3 n)p +2 3 ) + it n t2 it3 2!n O ) isgn)n 1 p 3!n 3/2 1)! is 1)gn)n 1 p 1)!n +1) s2 2 g 2 n)n p 2!!n +1) 2 )]} s +O 3 g 3 n)n p. n +1) 3 5.1) Noting the unifom convegence of e itp n e it 1)n 1/2 1 and Condition 2.1, it can be checed that all tems in 5.1) vanish unde lim I, except possibly the fist thee tems within the culy bacets, i.e., lim I E = lim I { e np is 1)gn)n 1 p 1)!n +1) s2 n 1 g 2 n)p +1 2! 1)! ]} s2 2 g 2 n)n p 2!!n +1) 2 = is 1) 1)! lim gn)n 1 p I n +1) e np s2 2! 1)! lim I n 1 g 2 n)p +1 e np s2 2 2!! lim g 2 n)n p I e n +1) np. 2 Condition 2.1 guaantees the existence of the second and the thid tems above, and the existence of the thid tem implies that the fist tem is zeo. Theefoe 2) is checed and lim E = s2 lim n 1 g 2 n)p +1 e np + lim g 2 n)n p 2 1)! n + 1) 2 e np. 5.2) I I I The convegence of I E and hence of I E guaantees 6). ) Fo 5), since B = exp t2 2 p + Ot 3 p n 1/2 ) and t2 2 p + Ot 3 p n 1/2 ) 0 unifomly, B 1 t2 2 p + Ot 3 p n 1/2 ) ) t 2 1 t2 2 p + Ot 3 p n 1/2 ) O 2 p + t 3 p n 1/2 and hence ) t 2 B 1 O p ) + t3 p < Ot 2 + t 3 ). 2 I I n I

Nomality of Tuing s Fomulae 15 5.2 Poof of Lemma 3.1. Conside the patition of K = I II. Since pe np has a negative deivative with espect to p on inteval 1/n, 1] and hence on /n 1 δ, 1] fo lage n, pe np attains its maximum at p = /n 1 δ. Theefoe noting that thee ae at most n δ / indices in II, 0 g2 n) n ) n 2 II p 1 p ) n g2 n) n n 2 g2 n) n 2 n ) e II n 1 δ e n n 1 δ ) g2 n) n 2 ) II p e n )p g2 n) n 2 n ) e n δ n ) n 1 δ e n 1 δ n ) e II p e np and Thus = g2 n) n n 2 ) e n δ n1 δ e nδ 0. ) lim g2 n) n 2 EN ) = lim g2 n) n n 2 p 1 p ) n 5.3) I lim n 2 g 2 n) p exp np ) = lim n 2 g 2 n) I p exp np ). 5.4) On the othe hand, g 2 n) n n 2 ) I p 1 p ) n g2 n) n n 2 ) I p e n )p g2 n) n ) n 2 exp supi p ) I p e np. Futhemoe, applying 2) and 3) of Lemma 2.6 in the fist and the thid steps below espectively leads to g 2 n) n 2 n ) I p 1 p ) n g2 n) n ) n 2 I p exp n )p ) 1 p g2 n) n ) n 2 I p exp ) np 1 sup I p g2 n) n ) n 2 I exp 2nsup I p ) 2 )p e np. Noting the fact that lim exp sup I p ) = 1 and lim exp 2nsup I p ) 2 ) = 1 unifomly by the definition of I, lim g2 n) n 2 ) n p 1 p ) n = lim g2 n) n 2 I ) n p e np, I and hence, by 5.3) and 5.4) and by the fact that n ) n /!, the equivalence of the fist pats of Condition 2.1 and Condition 3.1 is established: lim g2 n) n 2 EN 1 ) = 1/!) lim n 2 g 2 n) p exp np ). The equivalence of the second pats can be established similaly.

Nomality of Tuing s Fomulae 16 5.3 Poof of Theoem 3.3. Based on Theoem 3.2, it suffices to show that the vaiances of appoach zeo as n inceases to infinity. ĉ =!g2 n) n 2 N and ĉ +1 = + 1)!g2 n) n 2 N +1 V aĉ ) =!)2 g 4 n) n 4 V an ) =!)2 g 4 n) { n 4 EN 2 ) EN )] 2}. 5.5) EN 2 ) = EN ) + j EN ) 2 = n) p 1 p ) n ] 2 n!!!n 2)! p p j 1 p p j ) n 2 = n) 2 j p p j 1 p ) n 1 p j ) n + n) 2 p2 By the fist pat of Condition 3.1,!)2 g 4 n) n 4 EN ) 0 since g 2 /n 2 0. Theefoe lim!)2 g 4 n) n 4 EN 2 ) EN ) 2] lim g4 n) n 4 = lim g4 n) n 4 j j + lim g4 n) 1 p ) 2n 2. n! n 2)! p p j 1 p p j ) n 2 n!)2 n )!] 2 j p p j 1 p ) n 1 p j ) n ] n! n 2)! p p j 1 p p j ) n 2 n! n 2)! j p p j 1 p ) n 1 p j ) n ] n! n 4 n 2)! The second tem above is bounded by lim g4 n) n! n 4 n 2)! = lim = lim ] n!)2 n )!] 2 j p p j 1 p ) n 1 p j ) n ]. n! n 2)! n! n 2)! = c ) 2! lim ] n!)2 n )!] 2 j p p j 1 p ) n 1 p j ) n ] n!)2 n )!] 2 ] n ) 2 g 2 n) n ) 2 n 2 p 1 p ) n ] 2 ] n ) ] n!)2 2 g 2 n) 2 n )!] 2 EN n 2 ) n! n 2)! ] n ) n!)2 2 n )!] 2 = 0.

Nomality of Tuing s Fomulae 17 Noting 1 p j p ) n 2 1 p j p + p j p ) n 2 = 1 p j )1 p )] n 2, and theefoe lim!)2 g 4 n) n 4 EN 2 ) EN ) 2] lim g4 n) n! n 4 n 2)! j p p j 1 p j)1 p )] n 2 j p p j 1 p ) n 1 p j ) n ] { = lim g4 n) n! } n 4 n 2)! j p p j 1 p j)1 p )] n 2 1 1 p ) 1 p j ) ] { lim g4 n) n! } n 4 n 2)! j p p j 1 p j)1 p )] n 2 {1 1 p + p j )] }. Noting 1 1 x) 2 1)x fo all x 0, 1], On the othe hand, lim!)2 g 4 n) n 4 EN 2 ) EN ) 2] { lim g4 n) n!2 1) n 4 n 2)! j p p j 1 p j)1 p )] n 2 p + p j )} { = 2 lim g4 n) n!2 1) n 4 n 2)! j p+1 p j 1 p j)1 p )] n 2}. j p+1 p j 1 p ) n 2 1 p j ) n 2 = j,p p j + ) j,p >p j p +1 p j 1 p ) n 2 1 p j ) n 2 j,p p j p p+1 j 1 p ) n 1 p j ) n 3 + j,p >p j p +1 p j 1 p ) n 3 1 p j ) n 2 j p p+1 j 1 p ) n 1 p j ) n 3 2 p 1 p ) n j p+1 j 1 p j ) n 3 = 2 n) 1 EN ) j p+1 j 1 p j ) n 3. Noting that p 1 p) n 3 attains its maximum at p = /n 2) and hence p 1 p) n 3 /n 2), j p+1 p j 1 p ) n 2 1 p j ) n 2 2 n 1 ) n 2) EN ). Finally lim!)2 g 4 n) n 4 EN 2 ) EN ) 2] 4 lim g4 n) n!2 1) n 4 n 2)! n )n 2) EN ) = 4 2 1)c lim g 2 n) n 2 n )! n 2)!n 2) ] = 0. The consistency of ĉ follows. The consistency of ĉ +1 can also be similaly poved.

Nomality of Tuing s Fomulae 18 Refeences 1] Batlett, M.S. 1938), The chaacteistic function of a conditional statistic, Jounal of the London Mathematical Society, 13, pp. 62-67. 2] Esty, W.W. 1983), A nomal limit law fo a nonpaametic estimato of the coveage of a andom sample, The Annals of Statist., 11, pp. 905-912. 3] Good, I.J.1953), The population fequencies of species and the estimation of population paametes, Biometia, 40, pp. 237-264. 4] Patt, J.W. 1960),On intechanging limits and integals, Ann. Math. Stat. 31, pp.74-77. 5] Robbins, H.E. 1968), Estimating the total pobability of the unobseved outcomes of an expeiment, Annals of Mathematical Statistics, 39, pp. 256-257. 6] Zhang, Z. and Huang, H. 2007), Tuing s Fomula Revisited, Jounal of Quantitative Linguistics, Vol.4, No.2, pp. 222-241, 2007. 7] Zhang, Z. and Huang, H. 2008), A sufficient nomality condition fo Tuing s fomula, Jounal of Nonpaametic Statistics, Vol.20, No. 5. pp. 431-446. 8] Zhang, C.-H. and Zhang, Z. 2009), Asymptotic nomality of a nonpaametic estimato of sample coveage, The Annals of Statist., Vol. 37, No. 5A, pp. 2582-2595.