MAXIMUM VARIANCE OF ORDER STATISTICS

Similar documents
Bounds on expectation of order statistics from a nite population

If g is also continuous and strictly increasing on J, we may apply the strictly increasing inverse function g 1 to this inequality to get

arxiv: v1 [stat.me] 28 Mar 2011

Estimation of parametric functions in Downton s bivariate exponential distribution

Random Variables. Random variables. A numerically valued map X of an outcome ω from a sample space Ω to the real line R

A Note on the Strong Law of Large Numbers

MOMENT CONVERGENCE RATES OF LIL FOR NEGATIVELY ASSOCIATED SEQUENCES

Non Uniform Bounds on Geometric Approximation Via Stein s Method and w-functions

PM functions, their characteristic intervals and iterative roots

ON CHARACTERIZATION OF POWER SERIES DISTRIBUTIONS BY A MARGINAL DISTRIBUTION AND A REGRESSION FUNCTION

EXPLICIT MULTIVARIATE BOUNDS OF CHEBYSHEV TYPE

Lecture 1 Measure concentration

Density estimators for the convolution of discrete and continuous random variables

A NOTE ON A DISTRIBUTION OF WEIGHTED SUMS OF I.I.D. RAYLEIGH RANDOM VARIABLES

Yunhi Cho and Young-One Kim

EVALUATIONS OF EXPECTED GENERALIZED ORDER STATISTICS IN VARIOUS SCALE UNITS

Some New Results on the Beta Skew-Normal Distribution

n! (k 1)!(n k)! = F (X) U(0, 1). (x, y) = n(n 1) ( F (y) F (x) ) n 2

1.12 Multivariate Random Variables

Complete moment convergence of weighted sums for processes under asymptotically almost negatively associated assumptions

Stochastic Design Criteria in Linear Models

THE STRONG LAW OF LARGE NUMBERS

Math Circles: Number Theory III

Static Problem Set 2 Solutions

4. We accept without proofs that the following functions are differentiable: (e x ) = e x, sin x = cos x, cos x = sin x, log (x) = 1 sin x

Gaussian vectors and central limit theorem

Lines, parabolas, distances and inequalities an enrichment class

Asymptotic Tail Probabilities of Sums of Dependent Subexponential Random Variables

MAS223 Statistical Inference and Modelling Exercises

Probability Background

Exam P Review Sheet. for a > 0. ln(a) i=0 ari = a. (1 r) 2. (Note that the A i s form a partition)

MATH 220 solution to homework 4

CHARACTERIZATIONS OF UNIFORM AND EXPONENTIAL DISTRIBUTIONS VIA MOMENTS OF THE kth RECORD VALUES RANDOMLY INDEXED

P (A G) dp G P (A G)

A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS

A CHARACTERIZATION OF STRICT LOCAL MINIMIZERS OF ORDER ONE FOR STATIC MINMAX PROBLEMS IN THE PARAMETRIC CONSTRAINT CASE

164 Final Solutions 1

ADMISSIBILITY OF UNBIASED TESTS FOR A COMPOSITE HYPOTHESIS WITH A RESTRICTED ALTERNATIVE

Probability and Measure

A NOTE ON THE COMPLETE MOMENT CONVERGENCE FOR ARRAYS OF B-VALUED RANDOM VARIABLES

STA2603/205/1/2014 /2014. ry II. Tutorial letter 205/1/

Formulas for probability theory and linear models SF2941

Joint Distributions. (a) Scalar multiplication: k = c d. (b) Product of two matrices: c d. (c) The transpose of a matrix:

HW1 solutions. 1. α Ef(x) β, where Ef(x) is the expected value of f(x), i.e., Ef(x) = n. i=1 p if(a i ). (The function f : R R is given.

1) The line has a slope of ) The line passes through (2, 11) and. 6) r(x) = x + 4. From memory match each equation with its graph.

EC9A0: Pre-sessional Advanced Mathematics Course. Lecture Notes: Unconstrained Optimisation By Pablo F. Beker 1

IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES

Paul Schrimpf. October 18, UBC Economics 526. Unconstrained optimization. Paul Schrimpf. Notation and definitions. First order conditions

CHARACTERIZATIONS OF POWER DISTRIBUTIONS VIA MOMENTS OF ORDER STATISTICS AND RECORD VALUES

Mathematical Preliminaries

NORMAL CHARACTERIZATION BY ZERO CORRELATIONS

ENEE 621 SPRING 2016 DETECTION AND ESTIMATION THEORY THE PARAMETER ESTIMATION PROBLEM

A Pointwise Approximation of Generalized Binomial by Poisson Distribution

Lecture 1: August 28

CONSISTENT ESTIMATION OF LOCATION REGION

AN OCCUPATION TIME THEOREM FOR A CLASS OF STOCHASTIC PROCESSES^)

Random convolution of O-exponential distributions

Convergence rates in weighted L 1 spaces of kernel density estimators for linear processes

Measure-theoretic probability

ON A FUNCTIONAL EQUATION ASSOCIATED WITH THE TRAPEZOIDAL RULE - II

PMC-optimal nonparametric quantile estimator. Ryszard Zieliński Institute of Mathematics Polish Academy of Sciences

Math 421, Homework #9 Solutions

Part II Probability and Measure

Selected Exercises on Expectations and Some Probability Inequalities

Weak convergence in Probability Theory A summer excursion! Day 3

Stochastic Comparisons of Order Statistics from Generalized Normal Distributions

Extreme Values in FGM Random Sequences

Convergence Concepts of Random Variables and Functions

arxiv: v1 [math.st] 26 Jun 2011

7 Convergence in R d and in Metric Spaces

Chapter 6. Characteristic functions

Review: mostly probability and some statistics

Complete Moment Convergence for Sung s Type Weighted Sums of ρ -Mixing Random Variables

3. Convex functions. basic properties and examples. operations that preserve convexity. the conjugate function. quasiconvex functions

Analytic Geometry and Calculus I Exam 1 Practice Problems Solutions 2/19/7

Additive functionals of infinite-variance moving averages. Wei Biao Wu The University of Chicago TECHNICAL REPORT NO. 535

Han-Ying Liang, Dong-Xia Zhang, and Jong-Il Baek

GLOBAL ATTRACTIVITY IN A NONLINEAR DIFFERENCE EQUATION

ECE 650 Lecture 4. Intro to Estimation Theory Random Vectors. ECE 650 D. Van Alphen 1

Wald for non-stopping times: The rewards of impatient prophets

SUPPLEMENTARY MATERIAL TO IRONING WITHOUT CONTROL

Asymptotic behavior for sums of non-identically distributed random variables

d(x n, x) d(x n, x nk ) + d(x nk, x) where we chose any fixed k > N

A REMARK ON ROBUSTNESS AND WEAK CONTINUITY OF M-ESTEVtATORS

Part IA Probability. Theorems. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

8.7 Taylor s Inequality Math 2300 Section 005 Calculus II. f(x) = ln(1 + x) f(0) = 0

arxiv: v2 [math.st] 20 Feb 2013

("-1/' .. f/ L) I LOCAL BOUNDEDNESS OF NONLINEAR, MONOTONE OPERA TORS. R. T. Rockafellar. MICHIGAN MATHEMATICAL vol. 16 (1969) pp.

1 Probability Model. 1.1 Types of models to be discussed in the course

1 Acceptance-Rejection Method

n! (k 1)!(n k)! = F (X) U(0, 1). (x, y) = n(n 1) ( F (y) F (x) ) n 2

EXACT CONVERGENCE RATE AND LEADING TERM IN THE CENTRAL LIMIT THEOREM FOR U-STATISTICS

Linear and non-linear programming

Nonparametric hypothesis tests and permutation tests

Chapter 7. Extremal Problems. 7.1 Extrema and Local Extrema

Wittmann Type Strong Laws of Large Numbers for Blockwise m-negatively Associated Random Variables

On Characteristic Properties of the Uniform Distribution

Topological properties

S(x) Section 1.5 infinite Limits. lim f(x)=-m x --," -3 + Jkx) - x ~

Some Results on b-orthogonality in 2-Normed Linear Spaces

Transcription:

Ann. Inst. Statist. Math. Vol. 47, No. 1, 185-193 (1995) MAXIMUM VARIANCE OF ORDER STATISTICS NICKOS PAPADATOS Department of Mathematics, Section of Statistics and Operations Research, University of Athens, Panepistemiopolis, 15784 Athens, Greece (Received June 30, 1993; revised May 17, 1994) Abstract. Yang (1982, Bull. Inst. Math. Acad. Sinica, 10(2), 197-204) proved that the variance of the sample median cannot exceed the population variance. In this paper, the upper bound for the variance of order statistics is derived, and it is shown that this is attained by Bernoulli variates only. The proof is based on Hoeffding's identity for the covariance. Key words and phrases: Hoeffding's identity. Variance bounds, order statistics, Bernoulli variates, I. Introduction Let XI: n ~ X2: n ~_... <_ Xn: n be the order statistics corresponding to n iid rv's X1,..., Xn with df F(x) and finite variance 0.2. The purpose of this paper is to find the maximum variance of the k-th order statistic Xk:n given a fixed population variance 0.2. Several authors have studied non-parametric bounds for the moments of order statistics. The earliest results in that direction are by Plackett (1947) and Moriguti (1951), generalized by Hartley and David (1954) and Gumbel (1954). More general results are obtained by a method due to Moriguti (1953). Furthermore, moment inequalities for order statistics are given by Sugiura (1962), David and Groeneveld (1982), Terrell (1983), Sz@kely and Mdri (1985), Papathanasiou (1990), Balakrishnan (1990) and Gajek and Gather (1991). Yang (1982) proved that, (i) for n = 2k -- i, Var(Xk..n) _~ 0.2 (ii) for n -~ 2k, Var[(Xk:n + Xk+l:n)/2] ~_ 0.2, (iii) for k # (n+ 1)/2, there exists a continuous df F(x) such that Var(Xk:n) 0.2" Recently Lin and Huang (1989) observed that equality is attained in (i) by the symmetric Bernoulli variate. Here it will be shown that (1.1) Var(Xk:n) < 0.~(k). 0.2 1 < k < n, 185

186 NICKOS PAPADATOS where <r2(k) in a constant depending only on k and n. Equality in (1.1) is attained if and only if 1 < k < n and X is a two-valued (Bernoulli) distribution: P[X = x2] = p = l - P[X = xl], xl < x2, where the parameter p = p~(k) depends on k and n only. Cases k = 1 and k = n lead to strict inequality, but for every e > 0, there exists a df F(x) such that Var(Xl:n) > a2n(1 ) cr 2 - e for k = 1 and similarly for k = n. The key role to the derivation of these upper bounds is played by Hoeffding's identity for the covariance (see (3.2)) in combination with the unimodal property of the special function t(x) (see notations (2.1)), proved in Lemma 2.1(iv). Unfortunately, the a~ (k)-values, given by: (1.2) cry(k)= sup ~{Ix(k'n+l-k)'(1-Ix(k'n+l-k))~ o<x<l L x-(i--- J do not have a simple form; Ix (a, b) denotes, as usual, the incomplete beta function: 1 ua_l( 1 _ u)b_ldu, Ix(a, b) - B(a, b) 0<x<l. However, tables and figures for a2(k) are obtained for various n and k. 2. An auxiliary lemma Let Xk:n be the k-th order statistic based on a sample of size n > 2, from an arbitrary df F(x) with finite variance a2. Then (c.f., for example, David (1981), p. 34) Var(Xk:n) exists. We are interested in finding the maximum variance of Xk:n among all df's F having variance a 2. Let us use the notations: c(x) : Ix(k, n + 1 - k), g(x) = a'(x), (2.1) tl(x)- G(x) t2(y)- 1- a(y) x 1-y t(x,y)=tl(x)'t2(y) and t(x)=t(x,x), O<x<y<l. The following lemma is required for the main result. LEMMA 2.1. Let 1 < k < n. Then, there exist unique numbers Pl = pl(k, n), P2 -- p2( k, n) satisfying k-1 0<pl< <p2<1 n--1 such that, for 0 < x < y < 1: (i) tl(x) = G(x)/x strictly increases in (0, p2) and strictly decreases in (p2, 1) and similarly t2(y) = (1 - G(y))/(1 - y) strictly increases in (0, Pl) and strictly decreases in (pl, 1).

MAXIMUM VARIANCE OF ORDER STATISTICS 187 (ii) If x k Pl or y < P2, then G(x). (1 -G(y)) =t(x,y) <max{t(x),t(y)}. x(1 -y) (iii) If x < Pl and y > P2, then t(z,y) <t(pl,p2) <max{t(pl),t(p~)}. (iv) There exists a unique Xo = Xo(k,n) C (Pl, P2) such that the function a(~).o-a(~)) t(x) strictly increases in (0, Xo) and strictly decreases in (Xo, 1) x(1--x) -- PROOF. (i) Obviously ti(x ) = (xg(x)-g(x))/x 2. The function xg(x)-g(x) has derivative xg'(x) which is positive if x < (k - 1)/(n - 1) and negative if x > (k-1)/(n-1). Since limx~o+[xg(x)-g(x)] = O, limx~l_[xg(x)-g(x)] = -1 we conclude that the equation g(x) -a(x) = o, o < x < 1, has a unique root p2 = p2(k, n) C ((k - 1)/(n - 1), 1). Hence xg(x) - G(x) > 0 for x E (0, p2) and xg(x) - G(x) < 0 for x E (f12, 1) and the proof is complete for tl (x). Similarly for t2 (y), there exists a unique pl = p~ (k, n) satisfying 0<pl<-- k-1 n--1 such that t2(y) strictly increases in (0, Pl) and strictly decreases in (D1, 1). Note that Pl is the unique point in (0, 1) satisfying 1 - G(y) = (1 - y)g(y). (ii) Let x < y. If Pl < x, we have t(x, y) = tl(x)- t (y) < tl(x), t (x) = t(x). Similarly if y <_ P2, t(x, y) < t(y) and the proof is complete. (iii) If x < Pl and y > p~, we have t(x,y) = tl(x), t2(y) < tl(pl)" t2(p2) = t(pl,p2). The second inequality follows from (ii). (iv) Obviously limx~0+ t(x) = limz~l_ t(x) = O. Furthermore, it is clear from (i) that the function t(x) strictly increases in (0, Pl] and strictly decreases in [P2, 1). It suffices to study t(x) in (Pl, P2). It is easy to verify that for 0 < x < 1, x2g2(x)-xg(x)g(x) -x2g'(x)g(x) > O. Indeed, the function ng(x) - xg(x) strictly increases (because (ng(x) - xg(x) )' = (n - k)g(x)/(1 - x) > 0), so that rig(x) - xg(x) > limx_.o+[ng(x) - xg(x)] = O. Hence, the function x(1 - x)g(x ) - (k - nx)g(x) increases also (because (x(1 -

c(~)) x) 188 NICKOS PAPADATOS x)g(x) - (k - nx)g(x))' = ng(x) - xg(x) > 0 by the above argument). Therefore, x(1 -x)g(x)- (k- nx)g(x) > limx~o+[x(1- x)g(x) -(k- nx)g(x)] = 0, so that xb2(~) - xg(~)g(x) - ~%'(x)g(~) xg(x) r ~ -- 177 [x[ * - x)g(x) - (k - nx)g(x)] > 0, 0 < x < 1. But, for 0 < x < P2, G(x) < xg(x) (see (i)), so that (2.2) ~%~(~) - G~(x) - xu9'(x)g(~) > o, o < x < p2. We observe that -1 ~x) thus, from (2.2), (log c(a:--~--!)) '' < 0, 0 < x < p2, that is, h(x) is strictly log-concave in (0, P2). By the same arguments it is proved that t2 is strictly log-concave in (Pl, 1). Hence t(x) = h(x). t2(x) is a strictly log-concave function in (Pl, P2), and the proof is complete. (2.3) Remark. The unique point Xo = xo(k, n) satisfies the equation g(x)(1-2g(x)) 1-2x G(x)(1 - ~(1 - and obviously (2.4) C(xo)(1-O(xo)) = sup [ c(~)(1-g(x))] x0(1 - Xo) O<x<l Z ~) J" 3. Main result DEFINITION 3.1. relation We define the maximum variance function ~r~2(k) by the (see (1,2)). ~#(k) = sup L ~(1 o<x<l ~) J' 1 <k<n, n--2,3,... Clearly, a~(k) is a function of k for n _> 2 fixed. Moreover, from (2.4) ~(k) = e(xo)(1 - G(xo)) xo0-xo) =t(xo), l<k<~,

MAXIMUM VARIANCE OF ORDER STATISTICS 189 2 while an(l) = a2n(n) = n. We can now state the main result of THEOREM 3.1. (Maximum variance of order statistics) (3.1) Var(Xk:n) < ~r~(k)~r 2, and equality is attained if and only if 1 < k < n and F(x) is a Bernoulli distribution with probability of success 1 - xo(k, n) (xo(k, n) is defined in nemma 2.1(iv)). Cases k = 1 and k = n yield strict inequality, though (3.1) yields the best upper bound for Var(Xl:n) and Var(Xn:n). PROOF. Suppose 1 < k < n. Hoeffding's identity for the covariance of X, Y is given by the relation (for a proof see Lehmann (1966), Lemma 2) (3.2) Cov(X, Y)= f+ EH(x,y)- where H(x, y) is the bivariate df of (X, Y)'. Hence (3.3) Var(X) = 2 f/ F(x)(1 - F(y))dydx = ~2 x<_y and similarly Var(Xk:~) = 2 f/g(f(x))(1 x<_y - G(F(y)))dydx (the last relation follows from (3.3) and the fact that G(F(x)) is just the df of From Lemma 2.1 we conclude that for all x < y, (3.4) Xo(1 - xo)g(f(x))(1 - G(F(y))) <_ G(xo)(1 - G(Xo))F(x)(1 - F(y)) and equality is attained if and only if either F(x)(1 -F(y)) = 0 or F(x) = F(y) = X0. Integrating (3.4) over S = {-oo < x _< y < +oc} we have the desired result. In order to hold (3.1) as an equality, it is necessary and sufficient that xo(1 - xo)g(f(x) )(1 - G(F(y) ) ) = G(xo)(1 - G(xo) )F(x)(1 - F(y) ) or, equivalently a.e. in S F(x)(1 - F(y)) = 0 or F(x) = F(y) = Xo a.e. in S.

190 NICKOS PAPADATOS Thus, the only nondegenerate (with variance 0-2) df which attains equality is of the form: 0 ifx < Xl F(x;xl,x2)= xo if xl <x<x2 1 if x2 _< x where x2 = xl + 0./v/Xo(1 - Xo), xl e N, and the proof is complete for 1 < k < n. For k = n we have n Var(X~:~) <_ E Var(Xi:n) + 2 E E Cov(Xi:~,Xj:,~) : no- 2, i=l l<i<j<_n and obviously equality is attained iff 0-2 = 0. Therefore, for nondegenerate F, Var(Xm~) < no- 2 = 0-2(n)0-2 and this bound is the best possible (for example take F to be a two-valued distribution with probability of success close to zero). Case k = 1 is similar to k = n and is omitted. Note that 0-2(k) is symmetric about (n + 1)/2: = 0. (n + l-k), decreases for k < (n + 1)/2 and increases for k > (n + 1)/2 taking the values 0-2(1) = 0-2(n) = n, 0-2n((n + 1)/2) = 1 (see Fig. 1). Remark. Figure 1 identifies the upper bound for each order statistic separately. Note that these upper bounds can never be achieved simultaneously. For a given (fixed) distribution from a lot of distribution families, the variances of order statistics have a bell-shaped curve, in contrast to the possible misunderstanding that the U-shaped curve, presented by Fig. 1, may creates. Note that for k > (n + 1)/2, the values 0-2n(k ) and Xo(k, n) can be calculated from Table 1 using the relations: 0.~(k)=a2(n+ l-k), xo(k,n) = l- xo(n + l-k,n) and that p = 1 - xo(k, n) is the parameter of the Bernoulli rv which maximizes Var(X~:n)/Var(X). For example, if n = 10, k = 2 we find from Table 1: hence for any df we have 0-20(2) = 2.1608, 1 - xo(2, 10) = 0.8958 = p Var(X2:10) <_ 2.1608.0.2

MAXIMUM VARIANCE OF ORDER STATISTICS 191 + V 0 v b

192 NICKOS PAPADATOS o~(k) n=5 - - n = 7 - - n=lo i L I n Fig. 1. The function ~2 (k) sup{var(xk:n)/var(x)}. with equality if and only if X1, X2,..., Xlo are 10 independent copies of the rv X with (3.5) P[X = x2] = p = 1 - P[X = xl], Xl < x2. The same inequality is true for X9:10, but equality is attained if and only if X is as in (3.5) with Xl > x2. Throughout this paper, k was assumed to be an integer in {1, 2,..., n}. However, all the results continue to hold also for non-integer k. In this case, Xk:n denotes the k-th intermediate order statistic as defined by Papadatos (1994). Values of a~(k) and xo(k, n) for integer and non-integer k are given in Table 1 and Fig. 1. Acknowledgements The author would like to thank Professors T. Cacoullos, V. Papathanasiou and E. Kounias for their suggestions and helpful comments. Thanks are also due to an anonymous referee for his helpful informations about Fig. 1. REFERENCES Balakrishnan, N. (1990). Improving the Hartley-David-Gumbel bound for the mean of extreme order statistics, Statist. Probab. Lett., 9, 291-294. David, H. (1981). Order Statistics, 2nd ed., Wiley, New York.

MAXIMUM VARIANCE OF ORDER STATISTICS 193 David, H. and Groeneveld, R. (1982). Measures of local variation in a distribution: expected length of spacing and variances of order statistics, Biornetrika, 69, 227-232. Gajek, L. and Gather, U. (199!). Moment inequalities for order statistics with applications to characterizations of distributions~ Metrika, 38, 357-367. Gumbel, E. (1954). The maxima of the mean largest value and of the range, Ann. Math. Statist., 25, 76-84. Hartley, H. and David, H. (1954). Universal bounds for mean range and extreme observation, Ann. Math. Statist., 25, 85-99. Lehmann, E. L. (1966). Some concepts of dependence, Ann. Math. Statist., 37, 1137-1153. Lin, G. and Huang, J. (1989). Variances of sample medians, Statist. Probab. Left., 8, 143-146. Moriguti, S. (1951). Extremal property of extreme value distributions, Ann. Math. Statist., 22, 523-536. Moriguti, S. (1953). A modification of Schwarz's inequality with applications to distributions, Ann. Math. Statist., 24, 107-113. Papadatos, N. (1994). Intermediate order statistics with applications to nonparametric estimation, Statist. Probab. Lett. (to appear). Papathanasiou, V. (1990). Some characterizations of distributions based on order statistics, Statist. Probab. Lett., 9, 145-147. Plackett, R. (1947). Limits of the ratio of mean range to standard deviation, Biometrika, 34, 120-122. Sugiura, N. (1962). On the orthogonal inverse expansion with an application to the moments of order statistics, Osaka Math. J., 14, 253-263. Sz~kely, G. and Mdri, T. (1985). An extremal property of rectangular distributions, Statist. Probab. Lett., 3, 107-109. Terrell, G. (1983). A characterization of rectangular distributions, Ann. Probab., 11, 823-826. Yang, H. (1982). On the variances of median and some other order statistics, Bull. Inst. Math. Acad. Sinica, 10(2), 197-204.