-meeehheii. H~llEI. AD-Aho 476 WISCONSIN UNIV MADISON MATHEMATICS RESEARCH CENTER F/G 12/1

Similar documents
S 7ITERATIVELY REWEIGHTED LEAST SQUARES - ENCYCLOPEDIA ENTRY.(U) FEB 82 D B RUBIN DAAG29-80-C-O0N1 UNCLASSIFIED MRC-TSR-2328

UN MADISON MATHEMSTICS RESEARCH CENTER A P SUMS UNCLASSIFIE FER4 MONSRR2648 DAAG29-0C04 RDr C91 04

A STUDY OF SEQUENTIAL PROCEDURES FOR ESTIMATING. J. Carroll* Md Robert Smith. University of North Carolina at Chapel Hill.

(U) WISCONSIN UN! V-VIADISON MATHEMATICS RESEARCH CENTER J M VANDEN-BROECK JAN 84 MRC-TSR-2631 DAAG29-89-C-894i

A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR

A Generalized Class of Double Sampling Estimators Using Auxiliary Information on an Attribute and an Auxiliary Variable

ii19a la. Ill/lI II"' chart TEST RESOLUTION MICROCOPY NATIONAL BUREAU OF VANDARDS-1963-A

*jfflfllflfflfl AD-A 'ON THE EQUIDECOMPOSABILT OF A REGULAR TRIANGLE AND A -SQUARE OF

A0A TEXAS UNIV AT AUSTIN DEPT OF CHEMISTRY F/G 7/5 PHOTOASSISTED WATER-GAS SHIFT REACTION ON PLATINIZED T7TANIAI T--ETCIUI

'N7 7 DA4 5 AU) WISCONSIN UNIV MADISON MATHEMATICS RESEARCH CENTER NDA4 5 DIRECT APPROACH TO THE VILLARCEAU CIRCLES OF AJORUS /

Volume 6 Water Surface Profiles

A CHAIN RATIO EXPONENTIAL TYPE ESTIMATOR IN TWO- PHASE SAMPLING USING AUXILIARY INFORMATION

.IEEEEEEIIEII. fflllffo. I ll, 7 AO-AO CALIFORNIA UN IV BERKELEY OPERATIONS RESEARCH CENTER FIG 12/1

INSTRUMENTAL-VARIABLE CALIBRATION ESTIMATION IN SURVEY SAMPLING

Solution to the take home exam for ECON 3150/4150

THE COVARIANCE MATRIX OF NORMAL ORDER STATISTICS. BY C. S. DAVIS and M. A. STEPHENS TECHNICAL REPORT NO. 14 FEBRUARY 21, 1978

,AD-R A MEASURE OF VARIABILITY BASED ON THE HARMONIC MEAN AND i/i ITS USE IN APPROXIMATIONS(U) CITY COIL NEW YORK DEPT OF MATHEMATICS M BROWN

A MARTINGALE INEQUALITY FOR THE SQUARE AND MAXIMAL FUNCTIONS LOUIS H.Y. CHEN TECHNICAL REPORT NO. 31 MARCH 15, 1979

RO-AI55614? CONFIDENCE INTERYALS USIlNG THE REGENERTIYE METHOD FOR 1/1 SIMULATION OUTPUT.. (U) STANFORD UNJY CA DEPT OF 1"'7 1OPERTIONS RESEARCH P W

RD-Ai6i 457 LIMITING PERFORMANCE OF NONLINEAR SYSTEMS WITH APPLICATIONS TO HELICOPTER (U) VIRGINIA UNIV CHARLOTTESVILLE DEPT OF MECHANICAL AND

On Efficiency of Midzuno-Sen Strategy under Two-phase Sampling

AD-AISS "3 IMPACT OF DETECTOR LOCATION AND ORIENTATION ERRORS ON in THE PERPENDICULAR METHOD AN INTRODUCTION(U) JET PROPULSION LAB PASADENA CA 03 OCT

VAD-Ri POURING FLOU(U) WISCONSIN UNIV-MADISON MATHEMATICS /

;E (" DataEntered,) 1.GOVT ACCESSION NO.

Exponential Estimators for Population Mean Using the Transformed Auxiliary Variables

RD-R14i 390 A LIMIT THEOREM ON CHARACTERISTIC FUNCTIONS VIA RN ini EXTREMAL PRINCIPLE(U) TEXAS UNIV AT AUSTIN CENTER FOR CYBERNETIC STUDIES A BEN-TAL

A General Class of Estimators of Population Median Using Two Auxiliary Variables in Double Sampling

AD-A ROCHESTER UNIV NY GRADUATE SCHOOL OF MANAGEMENT F/B 12/1. j FEB 82.J KEILSON F C-0003 UNCLASSIFIED AFGL-TR NL

Technical Note

VARIANCE ESTIMATION FOR COMBINED RATIO ESTIMATOR

Eu... p UNCLASSIFIED ARO-i72672-EG DAAO2-8-C-i42 F/G 21/2 NL. D-A INVESTIGATION OF THE KINETICS OF CATALYZED COMBUSTION i/i

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Research Article An Improved Class of Chain Ratio-Product Type Estimators in Two-Phase Sampling Using Two Auxiliary Variables

ADl-AO PUDUE UMIV LAFAYETTE IN DEPT OF CHEMISTRY F/6 7/4

SaL1a. fa/ SLECTE '1~AIR FORCE GECOPNYSICS LABORATORY AIR FORCE SYSTEMS COMMAND AFGL-TH

A comparative study of ordinary cross-validation, r-fold cross-validation and the repeated learning-testing methods

Improved General Class of Ratio Type Estimators

MICROCOPY RESOLUTION TEST CHART NATIOAL BUREAU OF STANDARDS-1963-A. *9. ~ ~ ~ MCOCP REOUTO TEST CHA.<:~ RTrz ~

79 1,2 jj8 STATISTICAL PROPERTIES OF ALLOCATION AVERAGES DDC. Research Memorandum C-, Behavioral Science Research Laboratory ,*., U S.

Matrix multiplication: a group-theoretic approach

Economics 620, Lecture 2: Regression Mechanics (Simple Regression)

UNCLASSIFIED MRC-TSR-2253 NL

MAS223 Statistical Inference and Modelling Exercises

Research Article A Generalized Class of Exponential Type Estimators for Population Mean under Systematic Sampling Using Two Auxiliary Variables

* alllllllllli ADAI 3 OT4NMHOSTNI DALLAS TEX DEPT OF STATISTICS F/6 12/1

MULTIVARIATE BIRKHOFF-LAGRANGE INTERPOLATION SCHEMES AND CARTESIAN SETS OF NODES. 1. Introduction

Chapter 8: Estimation 1

(.immmmmmmmm. r~a -mhhhhhhhh. !i s,! i, o. Ehhhhhhhhhh. - l

AD OUTDOOR SCULP:URES(U) WISCONSIN UNIY-NADISON / NATHdEMRTICS RESEARCH CENTER I J SCHOENBERG JAN 86 UNCLSSIFIEV R-S-90OR2-DC94 F/C 13/1 NL

AIR FORCE RESEARCH LABORATORY Directed Energy Directorate 3550 Aberdeen Ave SE AIR FORCE MATERIEL COMMAND KIRTLAND AIR FORCE BASE, NM

A note on a difference type estimator for population mean under two phase sampling design

AVERAGE AND STANDARD DEVIATION

EXPONENTIAL CHAIN DUAL TO RATIO AND REGRESSION TYPE ESTIMATORS OF POPULATION MEAN IN TWO-PHASE SAMPLING

Linearized optimal power flow

Statistics GIDP Ph.D. Qualifying Exam Theory Jan 11, 2016, 9:00am-1:00pm

Improvement in Estimating the Finite Population Mean Under Maximum and Minimum Values in Double Sampling Scheme

V DD. M 1 M 2 V i2. V o2 R 1 R 2 C C

A FNITE DFERENCE SUD OTHESTRETCHINGAND BREAK-UP OF FIAMENTS OF. (U) WISCONSIN UNIV MAD SON MATHEMATICS RESEARCH CENTER P MARKOW CU ET AL JAN R4

U-.-._ AD-A TWO STOCHASTIC MODE.ING PROBLEMS IN COMMUJNICATIONS AND NAVIG6ATON I/J U) VIRGINIA P0 TECHNIC INS A ND STE BLACKSBURG UNIV

Estimation Tasks. Short Course on Image Quality. Matthew A. Kupinski. Introduction

Estimation of Finite Population Variance Under Systematic Sampling Using Auxiliary Information

8.6 Partial Fraction Decomposition

IEEE". UNIY-NILUAUKEE J SEDER 25 APR 85 AFOSR-TR i RFOSR UNCLASSIFIED F/G 12/1 NL

Altitude measurement for model rocketry

THE SIGNAL ESTIMATOR LIMIT SETTING METHOD

Fleet Maintenance Simulation With Insufficient Data

I North * ~ Mathematics Research Center University of Wisconsin- Madisonl 510 Walnut Street Madison. Wisconsin 53706

i of 6. PERFORMING ORG. REPORT NUMBER Approved for public Dtatibuft Unlimited

Improved ratio-type estimators using maximum and minimum values under simple random sampling scheme

1.0~ O. 1*1.8.2IIII II5 uuu11~ I 1 I6 MICROCOPY RESOLUTION TEST CHART NATIONAL BUREAU OF STANDARDS 1963-A

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

UNBIASED ESTIMATE FOR b-value OF MAGNITUDE FREQUENCY

Improvements in Modeling Radiant Emission from the Interaction Between Spacecraft Emanations and the Residual Atmosphere in LEO

A Chain Regression Estimator in Two Phase Sampling Using Multi-auxiliary Information

AD-At AN OUTPUT MATCHING APPROACH TO MULTIVARIABLE LINEAR Il 0001 UNCLASSIFIED AFOSR-TR AFOSR U/ 14/2 NL 1

NORMAL CHARACTERIZATION BY ZERO CORRELATIONS

REGRESSION WITH CORRELATED ERRORS C.A. GLASBEY

A70-Ai ARE MASS EXTINCTIONS REALLY PERIOOIC7(J) CALIFORNIA t/l UNIV BERKELEY OPERATIONS RESEARCH CENTER S M ROSS OCT 86 ORC-86-i9 RFOSR-86-8i53

IW IIIIL2 5-_6. -, ~IIIl IIII'nll MICROCOPY RESOLUTION TEST CHART NAI IONAL BUIREAU O( STANDARDS 1963 A

f (1 0.5)/n Z =

Estimation of Parameters

THE OBSERVED HAZARD AND MULTICONPONENT SYSTEMS. (U) JAN 80 M BROWN- S M R OSS N0001R77-C I~OREE80-1 he UNCLASSIFIED CA-

EEEJ AA CHICAGO UNIV ILL DEPT OF CHEMISTRY F/6 7/4 ION-ION NEUTRALIZATION PROCESSES AND THEIR CONSEQUENCES. (U)

N _IC.25 N/PS. Bill~ 11 2 WA' ,' mm_

I I ! MICROCOPY RESOLUTION TEST CHART NATIONAL BUREAU OF STANDARDS-1963-A

Inferring from data. Theory of estimators

Some Modified Unbiased Estimators of Population Mean

, i, ~. '~~~* a F- WI W U V U S S S S S S It

Parametric Models of NIR Transmission and Reflectivity Spectra for Dyed Fabrics

PRE-LEAVING CERTIFICATE EXAMINATION, 2010

Combining data from two independent surveys: model-assisted approach

Journal of Inequalities in Pure and Applied Mathematics

Sanjay Chaudhuri Department of Statistics and Applied Probability, National University of Singapore

QUADRATIC FORMS. A mathematical vignette. Ed Barbeau, University of Toronto

DTIC. glic FILE COPY. Lfl. Mathematics Research Center University of Wisconsin-Madison * 610 Walnut Street Madison, Wisconsin

Knü. ADOAOll 835 ON PACKING SQUARES WITH EQUAL SQUARES. P. Erdos, et al. Stanford University. Prepared for:

BEST LINEAR UNBIASED ESTIMATORS OF POPULATION MEAN ON CURRENT OCCASION IN TWO-OCCASION ROTATION PATTERNS

Estimators for the binomial distribution that dominate the MLE in terms of Kullback Leibler risk

ESTIMATION OF FINITE POPULATION MEAN USING KNOWN CORRELATION COEFFICIENT BETWEEN AUXILIARY CHARACTERS

Unbiased Estimation. Binomial problem shows general phenomenon. An estimator can be good for some values of θ and bad for others.

Transcription:

AD-Aho 476 WISCONSIN UNIV MADISON MATHEMATICS RESEARCH CENTER F/G 12/1 -meeehheii ESTIMATION OF VARIANCE OF THE RATIO ESTIMATOR.(U] AUG 81 C WU DAA629"80"C-0O0R UNCLASSIFIED MRC-TSR-2267 NL H~llEI II

111.0 L ~ 1111s 132 I.2 'l~ll11.1 IIIIL 1. 111111.6 "* MICROCOPY RESOLUTION TEST CHART NA1I(NA HURIAL M >IANDAIA IS lqt( A

MRC Technical Summary Report * 2267 ESTIMATION OF VARIANCE OF THE RATIO ESTIMATOR Q'ien-Fu Wu Mathematics Research Center University of Wisconsin-Madison 610 Walnut Street Madison, Wisconsin 53706 Auust 1981 **fceived July 20, 1981) 0. C) LUJ Approved for public release Distribution unlimited Sponsored by U. S. Army Research Office and National Science Foundation P. 0. Box 12211 Washinton, DC 20550 Research Trianle Park I ANorth Carolina 27709 "~~ ~~~ 820203 056........... -0_............ \ --

UNIVERSITY OF WISCONSIN-MADISON MATHEMATICS RESEARCH CENTER ESTIMATION OF VARIANCE OF THE RATIO ESTIMATOR Chien-Pu WU Technical Summary Report #2267 Auust 1 981 ABSTRACT A eneral class of estimators of the variance of the ratio estimator is considered, which includes two standard estimators V 0 and 2 and approximates another estimator V muqested by Royall and Eberhardt (1975).>Asymptotic expansions for the variances and biases of the proposed estimators are obtained. Based on this w-obbaok optimal ": 0 " - I variance estimator L7 the class* and comparejthe relative merits of three estimators V0, V I and V without any model assumption. Under a simple reression model a more definite comparison of V 0, vi and V 2 is made in terms of variance and bias. AMS (MOS) Subject Classification: 62D05 Key Words: Asymptotic expansion, Bias, Optimal variance estimator, Ratio estimator, Superpopulation, Variance Work Unit Number 4 - Statistics and Probability Department of Statistics, University of Wisconsin, Madison, Wisconsin 53706. Sponsored by the United States Army under Contract No. DAAG29-80-C-0041. This material is based upon work supported by the National Science Foundation under Grant No. CS-7901846. F - - - ---- - - - -~..----- -.- -,.:

SIGNIFICANCE AND EXPLANATION In estimatin the population mean of a character y, we often make use of an auxiliary covariate x whose information is more readily available and is positively correlated with y. One commonly used estimator in survey samplin is the ratio estimator (y-sample mean)(xpopulation mean)/(x-sample mean). To assess the variability of the estimator, we need an estimator of its variance. Several variance estimators have been compared under the assumption that the finite population itself is a random sample from an infinite superpopulation that is described by a linear model. Such an assumption may not be realistic in practice and usually is hard to verify. We propose a class of variance estimators, which includes or approximates several existin variance estimators in the literature. We then find the asymptotic variance and bias of these estimators and determine the optimal estimators for minimizin variance or bias. No superpopulation model is assumed. If we do assume a reression model over the finite population, stron optimality results are obtained and more definite comparisons of estimators are made. > / /. i, /!, I,7,, - I I.- I. The responsibility for the wordin and views expressed in this descriptive summary lies with MRC, and not with the author of this report. ka

ESTIMATION OF VARIANCE OF THE RATIO ESTIMATOR Chien-Fu WU* 1. Introduction Suppose a population consists of N distinct units with values (yi,x), x i > 0, i1,...,n. Denote the population means of Yi and xi by Y and X. To estimate f, it is customary to take a simple random A sample of size n and use the ratio estimator YR = y X/x, where y and x are respectively the sample means of Yi and xi. The mean A square error and variance of YR are each approximated by (Cochran, 1977, p. 155) n= N-1 1 N -- Y x i 2(), 111 where f = n/n is the samplin fraction. Two commonly used estimators of V are n V n n1 " (yi - rxi) 2 (2) and V 2 n 1-2 - rxq * (3) x i=1 where r y /Vx. The asymptotic consistency of V 0 and V 2 and the asymptotic normality of YR were riorously established in Scott and Wu (1981). Althouh the oriinal motivation for Ve V2 /X 2 as a variance Department of Statistics, University of Wisconsin, Madison, Wisconsin 53706. Sponsored by the United States Army under Contract No. DAAG29-80-0041. This material is based upon work supported by the National Science Foundation under Grant No. MCS-7901846.

estimator of the ratio R = f/x is the unavailability of X, it is not clear whether V 2 is worse than v 0 (Cochran, 1977, p. 155). Rao and Rao (1971) studied the small-sample properties of V 0 and V 2 by assumin that the sample is a random sample directly from an infinite superpopulation that can be described by a simple linear reression model of y on x and that x has a amma distribution. The nature of random samplin from a finite population is not taken into full account in their formulation as distinuished from the usual superpopulation approach. Royall and Eberhardt (1975) noted the bias of V 0 when the finite population is a random sample from a superpopulation in which Yi = Ox + Ci ' (4) where ei are independent with mean zero and variance a x i. They suested the simple modification c~ 2 o (1 -, x where C x is the x-sample coefficient of variation and x c is the mean of the N-n units not in the sample. They also noted that V H V 2 for lare n and N >> n, thus justifyin the use of V 2 from a superpopulation viewpoint. The estimator V 2 was previously recommended by Hajek (1958). The estimator V is &-unbiased under H model (4) with t = 1, and remains approximately C-unbiased when the variance of C i in model (4) is not proportional to x i. An empirical study of V was reported in Royall and Cumberland (1978). All these H authors assume that the actual population satisfies a hypothetical infinite population model. It is desirable to have a model-free -2- -

comparison of these estimators. As Royall and Cumberland (1978, p. 334) pointed out, the conventional theory in sample surveys does not provide any comparison on the relative merits of V 0 and V 2 a One of the purposes of this paper is to provide such a model-free comparison of V and V... 0 A We consider a more eneral class of variance estimators V - x 0 which includes V 0, V 2 and a new estimator 1-f X x)2 I n --n-1 (Ywhich ~~ iseqa=t1v (yi rx) which is equal to (V V2/2. Since the numerator of VH is a linear combination of v I and V 2, it can be adequately approximated by some V In 12 we obtain the leadin terms of the mean square error and variance of V which happens to be a quadratic function in. The optimal variance estimator is then obtained by minimizin this quadratic function. The optimal denoted opt is equal to the population reression coefficient of z/z over x/x, i - 1,...,N, where zi, 2 defined in (12), depends on the *residual" ei - yi - Rxi and ei. Therefore amon v 0, V I and V 2, V 0 is the best if opt 0.51 V I the best if 0.5 4 opt C 1.5; and V 2 the best if opt ) 1.5. By further assumin the superpopulation model (4) we show the optimality of V amon V under t - 0 and the optimality of V 1 amon V 0 A under t = 1. Note that the ratio estimator YR is the best linear unbiased estimator of f under model (4) with t - I (Brewer, 19631 Royall, 1970). If the ratio estimator is adopted with this optimality property in mind, then accordin to our result, one ouht to use v 1 as the estimate of variance. Under model (4) with t 1, it is also -3- ei.-, WN PV

shown in 12 that opt ) 1 with the implication that V 1 and V 2 are better than V 0. Therefore our study justifies the use of V and V 2 in practice when one believes that model (4) with t ) I adequately describes the population. Under further distributional assumptions on x, opt is determined as a function of t. In 13 we obtain the leadin terms of the bias of V. We then ompare V 0, V and V 2 in terms of their biases under model (4) with eneral variance pattern t. By further assumin that x has a amma distribution, we show that, amon V 0, V I and V 2, V 0 is the least biased for t ) 1.5 or t 4 0.6, V 1 the least biased for 0.6 4 t 4 6/7 and V 2 the least biased for 6/7 < t C 1.5. 2. Variance of V 2. 1. Asymptotic expansions Wle need the followin results for asymptotic expansions r- -. 0 - - 87x +, (6) (_) =-(ax) + o(.). 1-(6x) + q(q+l) (d)2 + o() (7) n 2 2 r+s (_x-)r(- ) s - O(n 2 ) if r+s is even r+s+l 2 - O(n ) if r+s is odd - rs+ (8) (-x - X)(y - Y)(z-) o(n - 2 ), (9) where x - (x-x)/x, z is the sample mean of character z from the same simple random sample as x and y, Z is the orrespondin jwvpjlation mean, o(n "1 ) is of stochastic order n- 1, ; - n-1(91 +...+ e n ) and el = yi " Rxi is the residual of yj to the -4-7,-"

line connectin (7,Y) and the oriin. Note e1 + N - 0. Formulas (6) and (7) follow easily from (8), and formulas (8), (9) can be riorously justified as in David and Sukhatme (1974). Usin (6), (8) and (9), we expand N = I 1f.nI 2 1xie n I N 0 fl l e 2 Ne 2 n 1 x e N 1 ii e i i x I n N 'n I n ii ( 8 )N-2 1N 2)+0 2 1 N +,l F" (10) n 2 where 1 2i SNxe Note that - N (e 2 e 02) and E V = V + O(n-2. From (7), (8) and (11), V =-f (x)z + 0(-) (13) n 2 n and the mean square error and variance of V are 9 3S -- ~ 2 2 0 1) var( 2). - ) 3 {S 2 2 n z S + 2 zx 2 S}+0(-j) T X n 4 (14) where S and Szx are the population variance of z and the population ovariance of z and x, respectively. The bias square of V is of lower order than the variance and will be studied in 43. 2.2 Optimal choice of V and comparison of V0'V1'V2 The optimal variance estimator is now obtained by minimizin expression (14) with respect to q, the optimal denoted opt bein -5-

=t x opt S2 2 ' x which is the population reression coefficient of z i/z over xi/x# i -,.o.,no Therefore, if n is lare and computational cost is not a problem, we propose the optimal estimator VA, where is a sample 9 analoue of opt For estimation of the population mean Srivastava (1967) suested a similar estimator y(x/x). Das and Tripathi (1978) considered estimators of a similar type for the finite population variance of y. In both papers the optimal for minimizin the asymptotic mean square errors were found. In practice we may not want to compute and will choose the variance estimate amon v 0, V I and V 2 Since the leadin terms in var(v ) are quadratic in, we conclude that, amon the three, V 0 0 is the best if opt > 0.5, V the best if o.5 < opt ( 1.5 and V 2 the best if-opt 1.5. To further relate our estimator V to the usual ratio estimator and the more eneral estimator proposed by Srivastava (1967), we approximate Yi - rxi by Yi - RXi - ei and the variance estimation problem is now reduced to estimatin the population men D=N (e 1.. e.~) by the sample mean V 0 e +.+e) or by the ratio estimator V, = V X/x, or by the unfamiliar V = V (/x)2. The usual comparison of the sample mean and the ratio estimator (Cochran, 1977, 16.6) and the more eneral comparison in Srivastava (1967) would suest that V is less efficient than V 0 1 and V 2 if the population reression coefficient of e i/ over x /A is reater than 1/2 - Because of the error introduced in the approximation yi - rx 1 e, the exact condition (15) involves the 2 less intuitive zi rather than e i, Some readers may prefer the precedin interpretation in terms of the reression of residual square -6 ~, -.j;

2I ei over x i. Under model (4) Sxz = &S w 2, hre is the 2 population covariance of x i and e, so that z i can indeed be 2 replaced by e2. Here & denotes expectation with respect to model. To ain further insiht, we now assume the superpopulation model (4) with variance proportional to x t, For most of the computations involvin model (4), it is important to note that, under (4), R = + 0(N-2 ), ei = Yi-xi=C +(N-1/2) -It t -2 if N (x +...+ x )/X is bounded. The that minimizes 9(var(v )} under (4) is, up to a term of order N - 1, (EN xi ENxi)(E N xi) 1 N 2 t" (16) I (x X) (EI xt) We have,= 0 for t = 0 and,= 1 for t =1, which is stated as a proposition. Proposition 1. Under model (4) with t = 0 (or I), V 0 (or V I ) is the optimal estimator of V amon V. A We shall point out that YR is the best linear unbiased estimator of Y under model (4) with t = I (Brewer, 19631 Royall, 1970). Therefore the ratio-type estimator vi for variance should be used in A situations where the ratio estimator YR is optimal for estimatin the 2 mean. On the other hand, t - 0 implies that e2 and xi are not correlated and the optimal variance estimator v 0 does not incorporate information on x. For t ),1 we have E x t+ E x > x x 2 which implies, ; 1 for t ) 1. Its implication as to the choice of estimators is states as follows. Proposition 2. Under model (4) with t ) 1, the optimal. I and V1, V 2 are b th better 'tan V 0 for estimatin V. -7- f. 4.

When t y( 0 or 1, there seems to be no clear-cut comparison of 0' V 1 V 2. We now assume a distribution on x to facilitate such a comparison. The optimal becomes : Cov(xt x)e(x) (17) E(x t ) var(x) where expectation is taken with respect to the distribution of x. When x has a amma distribution with two parameters, ** = t irrespective of the values of the parameters. When x has a beta distribution on 10,M], M > 0, with parameters p and q, pq > 0, ** - t(p+q+1)/(p+q+t). Note that ** 1 t for t > 1 and ** > t for t C 1. In particular, when x has a uniform distribution, ** = 3t/(t+2). When x has a lonormal distribution with parameters 8 and Y, i.e., Y + 8 lo x is standard normal, then ** - (w2t-l)/(w2_1), where w = exp(1/26 2 ). Note that * ) t for t > I and., C t for t C I in this case. Some selected ** values as function of t are iven in Table 1. Table 1. Optimal ** as function of t in model (4) values of t distribution of x 0 0.5 1.0 1.5 2.0 2.5 3.0 amma 0 0.5 1.0 1.5 2.0 2.5 3.0 uniform 0 0.6 1.0 1.29 1.5 1.67 1.8 p+q-2 beta 0 0.55 1.0 1.38 1.71 2.0 2.25 p+q-5 lonormal 0 0.47 1.0 1.60 2.28 3.06 3.93 8-2 lonormal 0 0.38 1.0 2.03 3.72 6.51 11.11 8-1 -8-

Rao and Rao (1971) compared the stability of V 0 and V 2 under an infinite population reression model and amma distribution on x. They reported that V 0 is more stable than V 2 for t - 0 or 1 and V 2 is more stable than V 0 for t - 2. Their results are consistent with ours. 3. Bias of V 3.1. Asymptotic expansions Multiplyin (7) to (10) and usin (8) and (9) to collect terms of order n -3, we obtain V --- n1 Y - n2 1 xe +(2+2) xe + 2N Nii -2 N) ex2~ + (+ ) (r 2N + -2N i n 1 2 N 1 3 1 I n From Theorem 2.3 of Cochran (1977), the bias of V is (f)2 1 2+2 N N N 2 bias(v )-- U 1)(1-L)2 1 e 2 x 2 + 2(+1)(I Xe n(n-1) N-i 4-2 NX '2 1 i 1ii1 1 i 1 (+1)(-2) W-2 N2 N N 2) 1 2 e, - (+2) I X1 ie + ( 11 1 n Since the bias square of V is of O(n- 4 ), smaller than var(v ), 9 9 one would not choose V based on its bias for lare samples. But in 9 practice the variance estimators can be seriously biased in small samples (Rao, 1968). The bias can then be reduced by subtractin the s&nple analoue of expression (18) from the estimate. -9-

3.2. Least biased V and co!mparison of V 0 V 1 V 2 Without further assumptions on the population, there is no clearcut comparison on the biases of v 0, V 1 and V 2 * By assumin model (4), we have 9{bias(V (i) I 2 { ++2 X t n(n-1) N-I -2 2 1 1 t - - 2 - (+2) T x i x i 1 + O( ) * (19) 2 NX 1 1 1 i i 0 n3) Based on (19), the least biased V for t = 0 or 1 are easily found. Proposition 3. Under model (4) with t = 0, V 1 / is the least biased estimator of V amon V model (4) with t = 1, v 2 and the bias is of order O(n2) Under and VI are the least biased estimators of V amon V and are the only ones amon V with bias of order O(n -3 ). For t / 0 or 1 expression (19) does not have a very nice form. To ain further insiht we assume that x has a amma distribution with shape parameter a > 0. Then, from.(19), the bias of V is Cf( 2++2)/2 - (+2)t}, where C is a positive constant independent of. From this we conclude that, amon V 0, Vl, V 2, (i) V is the least biased for t > 1.5 or t ( 0.6, (ii) V the least 0 1 biased for 0.6 4 t 4 6/7 and (iii) V 2 the least biased for 6/7 C t C 1.5. Rao and Rao (1971) compared the biases of V 0 and V 2 under an infinite population reression model and amma distribution on x. They reported that v 2 is less biased than V 0 for 0 ( t 4 1.5 and V 0-10- ; -NO1

is less biased than V 2 for t - 2. Their results are aain in ood accord with ours. Of course the definition of "bias" here is in terms of estimatin the approximate variance V of the ratio estimator, not its true mean square error. From a Monte Carlo study by Rao (1968) on some natural populations, the percent underestimate in V of the true nean square error is between 10 and 15 percent for n - 4,6,8,12. For a summary of results see Cochran (1977, p. 164). Therefore, if V underestimates the true mean square error, we shall prefer a variance estimator with a small amount of positive Nbias". The only impact of this observation on the previous comparison is for 0 4 t 4 1.0, where the biases of v 0, V 1 can be either positive or neative. Since bias(v ) bias(v I ) or bias (V 0 ) for 0 4 t 4 1.0, V 2 may be preferred on this round. The research was supported by a rant from the National Science Foundation. The comments of the referee are appreciated. -4 A.-- -- - h.

REFERENCES Brewer, K. W. R. (1963). Ratio estimation in finite populations: some results deducible from the assumption of an underlyin stochastic process. AustralL..i.T)arn. Statist. 5, 93-105. Cochran, W. G. (1977). Samplin Techniques, 3rd edition, New York: Wiley. Das, A. K. and Tripathi, T. P. (1978). Use of auxiliary information in estimatin the finite population variance. Sankya, Ser. C 139-148. David, I. P. and Sukhatme, B. V. (1974). On the bias and mean square 466. Hajek, J. (1958). On the theory of ratio estimates. Aplikace Mathematiky 1, 384-398. Rao, J. N. K. (1968). Some small sample results in ratio and reression estimation. J. Indian Statist. Assoc. 6, 160-168. Rao, P. S. R. S. and Rao, J. N. K. (1971). Small sample results for ratio estimators. Biometrika R, 625-630. Royall, R. M. (1970). On finite population samplin theory under certain linear reression models. Biometrika 57, 377-387. Royall, R. M. and Cumberland, W. G. (1978). An empirical study of prediction theory in finite population samplin: simple random samplin and the ratio estimator. In Survey Samplin and Measurements, ed. N. K. Namboodiri, pp. 293-309. New Yorks Academic Press. Royall, R. M. and Eberhardt, K. R. (1975). Variance estimates for the ratio estimator, Sankya, Ser. C 37, 43-52. error of the ratio estimator. J. Am. Statist. Assoc. t9, 464- -12- - ",''+. a... +

Scott, A. J. and Wup C. F. (1981). on the asymptotic distribution of ratio and reression estimators.* J.* Am.* Statist.* Assoc. 76 8 102. Srivastava, S. K. (1967). An estimator usin auxiliary information in sample surveys. Calcutta Statist. Assoc. Bulletin ~,121-132. CFW/jVS -13

SECURITY CLASSIFICATION OF THIS PACE t4 l*.n Des *E.'nred) REPORT DOCUMENTATION PAGE EFOXF READ INSTRUCTIONS COII'lEITINC FORM 1. REPORT NUMBLR GOVT ACCESSION NO. 3. RECIPIENT'S CATALOG NUMULR #2267N9A l0l- 4. TITLE (and Subtile) is. TYPE Of REPORT & PERIOD COVERED ESTIMATION OF VARIANCE OF THE RATIO ESTIMATOR Summary Report - no specific reportin period 6. PERFORMING ORG. REPORT NUMBER 7. AUTHOR(.) S. CONTRACT OR GRANT NUMBER(*) Chien-Fu Wu DAAGZ9-80-C-0041 MCS-7901846 S. PERFORMING ORGANIZATION NAME AND ADDRESS 10. PROGRAM ELEMENT. PROJECT. TASK AREA & WORK UNIT NUMBERS Mathematics Research Center, University of Work Unit Number 4-610 Walnut Street Wisconsin Statistics & Probability Madison, Wisconsin 53706 II. CONTROLLING OFFICE NAME AND ADDRESS 12. REPORT DATE Auust 1981 (see Item 18 below) 13 NUMBER OF PAGES 13 4. MONITORING %GENCY NAME & ADDRESS(I difterent from Controllin Office) IS. SECURITY CLASS. (of tise repott) 1Sa. UNCLASSIFIED DECL ASSI FICATION/ DOWN GRADING SCHEOULE 16. DISTRIBUTION STATEMENT (of thl Report) Approved for public release; distribution unlimited. I?. DISTRIBUTION STATEMENT (of the abatract entered In Stock 20. It different from Report) 1S. SUPPLEMENTARY NOTES U. S. Army Research Office and National Science Foundation P. 0. Box 12211 Washinton, DC 20550 Research Trianle Park North Carolina 27709 IS. KEY WORDS (Continue on rovewri de it neceeeary and identify by block number) AsyiIptotic expansion, Bias, Optimal variance estimator, Ratio estimator, Superpopilat ion, Variance 20. ABSTRACT (Continue an rversve l4e It neoooey and identify by block nuiber) A eneral class of estimators of the variance of the ratio estimator is considered, which includes two standard estimators v 0 and V2 and approximates another estimator vh suested by Royall and Eberhardt (1975). Asymptotic expansions for the variances and biases of the proposed estimators are obtained. Based on this we obtain optimal variance estimator in the class and compare the relative merits of three estimators v 0, v 1 and u 2 without any model assumption Under z, rimple reression model a more definite comparison of v 0, v, and '2 is made in terms of variance and bias. DD I JAN a 1473 EDITION OF INOV*S IS OBSOLETE UNCLASSIFIED SiCURITY CLASSIFICATION OF THIS PAGE (011en Date Enter** -- -- ~-------.--------.---. Ai -:Z.