BOOTSTRAPPING SAMPLE QUANTILES BASED ON COMPLEX SURVEY DATA UNDER HOT DECK IMPUTATION

Similar documents
Bahadur representations for bootstrap quantiles 1

TWO-WAY CONTINGENCY TABLES UNDER CONDITIONAL HOT DECK IMPUTATION

Asymptotic Normality under Two-Phase Sampling Designs

ASYMPTOTIC NORMALITY UNDER TWO-PHASE SAMPLING DESIGNS

Monte Carlo Study on the Successive Difference Replication Method for Non-Linear Statistics

REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES

EMPIRICAL EDGEWORTH EXPANSION FOR FINITE POPULATION STATISTICS. I. M. Bloznelis. April Introduction

Bootstrap. Director of Center for Astrostatistics. G. Jogesh Babu. Penn State University babu.

A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR

Bootstrap inference for the finite population total under complex sampling designs

IEOR E4703: Monte-Carlo Simulation

Introduction to Survey Data Analysis

Workpackage 5 Resampling Methods for Variance Estimation. Deliverable 5.1

ORTHOGONAL DECOMPOSITION OF FINITE POPULATION STATISTICS AND ITS APPLICATIONS TO DISTRIBUTIONAL ASYMPTOTICS. and Informatics, 2 Bielefeld University

Empirical Likelihood Methods for Sample Survey Data: An Overview

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES

VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA

Chapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70

University of California San Diego and Stanford University and

Weight calibration and the survey bootstrap

J.N.K. Rao, Carleton University Department of Mathematics & Statistics, Carleton University, Ottawa, Canada

Stat 710: Mathematical Statistics Lecture 31

A note on multiple imputation for general purpose estimation

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

The Nonparametric Bootstrap

ESTIMATION OF CONFIDENCE INTERVALS FOR QUANTILES IN A FINITE POPULATION

Model Assisted Survey Sampling

Data Integration for Big Data Analysis for finite population inference

ESTIMATION OF DISTRIBUTION FUNCTION AND QUANTILES USING THE MODEL-CALIBRATED PSEUDO EMPIRICAL LIKELIHOOD METHOD

On the Uniform Asymptotic Validity of Subsampling and the Bootstrap

Jong-Min Kim* and Jon E. Anderson. Statistics Discipline Division of Science and Mathematics University of Minnesota at Morris

Jong-Min Kim* and Jon E. Anderson. Statistics Discipline Division of Science and Mathematics University of Minnesota at Morris

Imputation for Missing Data under PPSWR Sampling

large number of i.i.d. observations from P. For concreteness, suppose

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data

36. Multisample U-statistics and jointly distributed U-statistics Lehmann 6.1

Two-phase sampling approach to fractional hot deck imputation

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Chapter 10. U-statistics Statistical Functionals and V-Statistics

Miscellanea A note on multiple imputation under complex sampling

Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions

arxiv:math/ v1 [math.st] 23 Jun 2004

Bootstrap, Jackknife and other resampling methods

11. Bootstrap Methods

Lecture 13: Subsampling vs Bootstrap. Dimitris N. Politis, Joseph P. Romano, Michael Wolf

ON THE UNIFORM ASYMPTOTIC VALIDITY OF SUBSAMPLING AND THE BOOTSTRAP. Joseph P. Romano Azeem M. Shaikh

Confidence Intervals in Ridge Regression using Jackknife and Bootstrap Methods

6. Fractional Imputation in Survey Sampling

Nonresponse weighting adjustment using estimated response probability

EXTENDED TAPERED BLOCK BOOTSTRAP

EFFICIENCY OF MODEL-ASSISTED REGRESSION ESTIMATORS IN SAMPLE SURVEYS

A decision theoretic approach to Imputation in finite population sampling

An Overview of the Pros and Cons of Linearization versus Replication in Establishment Surveys

Reliable Inference in Conditions of Extreme Events. Adriana Cornea

Asymptotic inference for a nonstationary double ar(1) model

Inference for Identifiable Parameters in Partially Identified Econometric Models

Inference For High Dimensional M-estimates. Fixed Design Results

Shu Yang and Jae Kwang Kim. Harvard University and Iowa State University

Investigating the Use of Stratified Percentile Ranked Set Sampling Method for Estimating the Population Mean

Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses

Fast and robust bootstrap for LTS

Economics 583: Econometric Theory I A Primer on Asymptotics

A Resampling Method on Pivotal Estimating Functions

A CENTRAL LIMIT THEOREM FOR NESTED OR SLICED LATIN HYPERCUBE DESIGNS

Eric Shou Stat 598B / CSE 598D METHODS FOR MICRODATA PROTECTION

Smooth nonparametric estimation of a quantile function under right censoring using beta kernels

ECE531 Lecture 10b: Maximum Likelihood Estimation

Chapter 4. Replication Variance Estimation. J. Kim, W. Fuller (ISU) Chapter 4 7/31/11 1 / 28

Lecture 8 Inequality Testing and Moment Inequality Models

A Comparison of Robust Estimators Based on Two Types of Trimming

A JACKKNIFE VARIANCE ESTIMATOR FOR SELF-WEIGHTED TWO-STAGE SAMPLES

The Bootstrap: Theory and Applications. Biing-Shen Kuo National Chengchi University

Preliminaries The bootstrap Bias reduction Hypothesis tests Regression Confidence intervals Time series Final remark. Bootstrap inference

Obtaining Critical Values for Test of Markov Regime Switching

STAT 512 sp 2018 Summary Sheet

ECON 3150/4150, Spring term Lecture 6

Lecture 16: Sample quantiles and their asymptotic properties

Fractional Imputation in Survey Sampling: A Comparative Review

Mean estimation with calibration techniques in presence of missing data

Non-parametric bootstrap mean squared error estimation for M-quantile estimates of small area means, quantiles and poverty indicators

Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½

Combining data from two independent surveys: model-assisted approach

REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY

Fractional hot deck imputation

Better Bootstrap Confidence Intervals

The bootstrap. Patrick Breheny. December 6. The empirical distribution function The bootstrap

Can we do statistical inference in a non-asymptotic way? 1

Test of the Correlation Coefficient in Bivariate Normal Populations Using Ranked Set Sampling

SUPPLEMENT TO PARAMETRIC OR NONPARAMETRIC? A PARAMETRICNESS INDEX FOR MODEL SELECTION. University of Minnesota

COMPLEX SAMPLING DESIGNS FOR THE CUSTOMER SATISFACTION INDEX ESTIMATION

Bayesian Nonparametric Point Estimation Under a Conjugate Prior

Optimal Jackknife for Unit Root Models

On the bias of the multiple-imputation variance estimator in survey sampling

Journal of the American Statistical Association, Vol. 89, No (Dec., 1994), pp

Model Selection, Estimation, and Bootstrap Smoothing. Bradley Efron Stanford University

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design

Robustness to Parametric Assumptions in Missing Data Models

Preliminaries The bootstrap Bias reduction Hypothesis tests Regression Confidence intervals Time series Final remark. Bootstrap inference

Small Sample Corrections for LTS and MCD

Transcription:

Statistica Sinica 8(998), 07-085 BOOTSTRAPPING SAMPLE QUANTILES BASED ON COMPLEX SURVEY DATA UNDER HOT DECK IMPUTATION Jun Shao and Yinzhong Chen University of Wisconsin-Madison Abstract: The bootstrap method works for both smooth and nonsmooth statistics, and replaces theoretical derivations by routine computations. With survey data sampled using a stratified multistage sampling design, the consistency of the bootstrap variance estimators and bootstrap confidence intervals was established for smooth statistics such as functions of sample means (Rao and Wu (988)). However, similar results are not available for nonsmooth statistics such as the sample quantiles and the sample low income proportion. We consider a more complicated situation where the data set contains nonrespondents imputed using a random hot deck method. We establish the consistency of the bootstrap procedures for the sample quantiles and the sample low income proportion. Some empirical results are also presented. Key words and phrases: Imputation classes, low income proportion, stratified multistage sampling.. Introduction Variance estimation and confidence intervals are the main research focuses in survey problems. Popular methods include the linearization-substitution, the jackknife, the balanced repeated replication, and the bootstrap, which can be applied to problems in which the parameter of interest is a smooth function of population totals. When the parameter of interest is a population quantile or other nonsmooth parameters such as the low income proportion (to be described later), the jackknife is not applicable; a counterpart of the linearizationsubstitution method is Woodruff s method (Woodruff (952)), but Kovar, Rao and Wu (988) and Sitter (992) found that Woodruff s method had poor empirical performance when the population was stratified using a concomitant variable highly correlated with the variable of interest. The balanced repeated replication method works well for quantiles (e.g., Shao and Wu (992), Shao and Rao (994), Shao (996)), but it requires the construction of balanced subsamples, which can be very difficult if we have a stratified sample with many strata and very different stratum sizes. The bootstrap method works for both smooth and nonsmooth statistics so that one can use a single method for estimating variances and setting

072 JUN SHAO AND YINZHONG CHEN confidence intervals based on various statistics. The bootstrap requires a large amount of routine computation, which is usually not a serious problem with the fast computers we have nowadays. Asymptotic validity of the bootstrap was shown by Rao and Wu (988) for the case of smooth functions of population totals. Similar results for quantile problems are expected, but have not been rigorously established. While studying properties of the bootstrap for quantile problems, we go a step further; that is, we consider the situation where the data set contains missing values imputed by using a random hot deck method. Most surveys involve nonrespondents and random hot deck imputation is commonly used to compensate for missing data. If we apply the bootstrap by treating the imputed values as if they are true values, then the bootstrap variance estimator has a serious negative bias when the proportion of nonrespondents is appreciable. A correct bootstrap is obtained by imputing the bootstrap samples in the same way as imputing the original data set (Efron (994), Shao and Sitter (996)). After describing (in Section 2) the sampling design, the imputation procedure, the survey estimators, and the bootstrap procedure, we establish the asymptotic validity of the bootstrap estimators in Section 3. Some empirical results are presented in Section 4. 2. Sampling Design, Imputation, and the Bootstrap The following commonly used stratified multistage sampling design is considered. The population P under consideration has been stratified into L strata with N h clusters in the hth stratum. From the hth stratum, n h 2clusters are selected with replacement (or so treated) using some probability sampling plan, independently across the strata. Within the (h, i)th selected cluster, n hi ultimate units are sampled according to some sampling methods, i =,...,n h, h =,...,L. We do not need to specify the number of stages and the sampling methods used after the first-stage sampling. Associated with the kth ultimate unit in the ith sampled cluster of stratum h is a characteristic y hik and a survey weight w hik, k =,...,n hi,i =,...,n h,h =,...,L. The survey weights are constructed so that G(x) = w hik I M {yhik }(x) () is unbiased for the population distribution F (x) = M S y hik P I {yhik }(x), (2) where M is the total number of ultimate units in population P, S is the summation over S, the set of indices (h, i, k) in the sample, and I C (x) is the indicator

BOOTSTRAPPING QUANTILES FOR SURVEY DATA 073 function of the set C. Note that M may be unknown in many survey problems and G in () is not necessarily a distribution function. Hence, in the case of no missing data, a customary estimate of F in (2) is the empirical distribution = G/G( ) = S w hij I {yhik }/ S w hik. Suppose that some values y hik in the sample are missing and that there are V disjoint subsets S v of S such that S = S S V and units in S v respond independently with the same probability, v =,...,V. That is, within S v,we have a uniform response mechanism. The S v are called imputation classes and are constructed using auxiliary variables observed for every sampled unit; e.g., avariablex hik taking the same value when (h, i, k) S v (Schenker and Welsh (988), Section 4). Imputation is carried out within the imputation classes; that is, a missing value with (h, i, k) S v is imputed based on the respondents with indices in S v. For a concise presentation we assume V = throughout the paper. The extensions of the results to the case of any fixed V 2 are straightforward. There are two types of commonly used imputation methods: deterministic imputation (e.g., the mean imputation, the ratio imputation, and the regression imputation) and random (hot deck) imputation (e.g., Schenker and Welsh (988), Rao and Shao (992)). Deterministic imputation, however, does not directly produce valid estimates of population quantiles, since it does not preserve the distribution of item values. We shall, therefore, concentrate on the following random hot deck imputation. Let S R and S M be subsets of S containing the indices of respondents and nonrespondents, respectively. The random hot deck method imputes missing values by a random sample selecting (with replacement) y hik with probability w hik / S R w hik for each (h, i, k) S R. Based on the imputed data set, the empirical distribution is ( I ) = w hik I {yhik } + w hik I {y I hik } / w hik, (3) S R S M S where yhik I denotes the imputed value for y hik, (h, i, k) S M. In studies of income shares or wealth distributions, an important class of population characteristics is θ = F (p) =inf{x : F (x) p}, thepth quantile of F, p (0, ). Another important parameter is the proportion of low income economic families. Let µ = F ( 2 ) be the population median family income. Then, the population low income proportion can be defined as ρ = F ( 2 µ), ( 2 µ is called the poverty line; see Wolfson and Evans (990)). Customary survey estimates of θ (with a fixed p) andρ are the sample pth quantile and the sample low income proportion defined by ˆθ I =( I ) (p) and ˆρ I = I ( 2 ˆµI ), (4)

074 JUN SHAO AND YINZHONG CHEN respectively, where ˆµ I = ˆθ I with p = 2, the sample median. Under some conditions, ˆθ I and ˆρ I are shown to be asymptotically normal (Chen and Shao (996)). We now consider the bootstrap with no missing data. Since some of the n h may not be large, the original bootstrap (Efron (979)) has to be modified. We focus on McCarthy and Snowden s (985) with replacement bootstrap method, which is a special case of Rao and Wu s (988) rescaling bootstrap. Let {y hi : i =,...,n h } be a simple random sample with replacement from {y hi : i =,...,n h }, h =,...,L, independently across the strata, where y hi = {y hik : k =,...,n hi }. Define = S w hik I {yhik } / S w hik,wherey hik is the kth component of y hi, w hik is n h/(n h ) times the survey weight associated with yhik,ands is the index set for the bootstrap sample. The bootstrap variance estimator for ˆθ = (p) isvar (ˆθ ), where ˆθ =( ) (p) andvar is the variance taken under the bootstrap distribution conditional on y hik,(h, i, k) S. A level 2α bootstrap confidence interval for θ is C =[ˆθ +d α, ˆθ +d α ], where d a is the ( a)th quantile of the bootstrap distribution of ˆθ ˆθ, conditional on y hik,(h, i, k) S. Monte Carlo approximations are used if Var (ˆθ )orc has no closed form. When there are imputed data, however, treating yhik I as if they are true values and applying the previously described bootstrap do not lead to consistent bootstrap variance estimators and correct bootstrap confidence intervals. The bootstrap procedure has to be modified to take into account the effect of missing data and imputation. Shao and Sitter (996) proposed a bootstrap method which can be described as follows. Assume that the data set carries identification flags a hik indicating whether a unit is a respondent; that is, a hik =if(h, i, k) S R and a hik =0if(h, i, k) S M.. Draw a simple random sample {y hi : i =,...,n h } with replacement from {y hi : i =,...,n h }, h =,...,L, independently across the strata, where missing values in y hi are imputed by yhik I. 2. Let a hik be the response indicator associated with y hik, S M = {(h, i, k) :a hik = 0}, andsr = {(h, i, k) :a hik =}. Then impute the missing value y hik, (h, i, k) SM,byy hik selected with replacement from {y hik :(h, i, k) S R } with probability whik / SR w hik, independently for (h, i, k) S M. 3. Define ( = S R w hik I {y hik } + S M w hik I {y hik } )/ S w hik, (5) ˆθ =( ) (p), and ˆρ = ( 2 ˆµ ), which are bootstrap analogues of I, ˆθ I,andˆρ I, respectively. The bootstrap variance estimators for ˆθ I and ˆρ I are Var (ˆθ )andvar (ˆρ ), respectively,

BOOTSTRAPPING QUANTILES FOR SURVEY DATA 075 and bootstrap percentile confidence intervals for θ and ρ are C =[ˆθ I + d α, ˆθ I + d α ] and C =[ˆρ I + d α, ˆρI + d α ], d a are ( a)th quantiles of the bootstrap distri- respectively, where d a and butions of ˆθ ˆθ I and ˆρ ˆρ I, respectively. The key feature in this bootstrap method is step 2; that is, the bootstrap data yhik are imputed in the same way as the original data set. If there is no missing data, then step 2 can be omitted and this bootstrap reduces to that in McCarthy and Snowden (985). 3. Consistency of the Bootstrap We now show the consistency of the bootstrap estimators. For this purpose, assume that the finite population P is a member of a sequence of finite populations {P : =, 2,...}. Therefore, the values L, M, N h, n h, n hi, w hik, y hik,and yhik I depend on, but, for simplicity of notation, the population index for these values will be suppressed. Also, F, θ, µ, ρ, G,, I, ˆθ I,ˆµ I,andˆρ I depend on and will be denoted by F, θ, µ, ρ, G,, I, ˆθ,ˆµ I I,andˆρ I, respectively. All limiting processes will be understood to be as. We always assume that the sequence {θ : =, 2,...} is bounded and that the total number of selected first-stage clusters n = L h= n h is large; that is, n as. We first state a lemma whose proof is given in the Appendix. Lemma. Assume that (A) max h,i,k n hi w hik /M = O(n ); (A2) There is a sequence of functions {f : =, 2,...} such that 0 < inf f (θ ) sup f (θ ) < and for any δ = O(n /2 ), [ F (θ + δ ) F (θ ) ] lim f (θ ) =0. δ Then sup H (x) = o p (n /2 ) x θ cn /2 for any constant c>0, where H (x) = (x) (θ ) I (x)+ I (θ ). (6) The following result provides a Bahadur representation for the bootstrap sample quantiles. Theorem. Assume (A) and (A2). Then ˆθ = ˆθ I + I(θ ) (θ ) + o p (n /2 ) (7) f (θ )

076 JUN SHAO AND YINZHONG CHEN and sup K (x) K (x) = o p (), (8) x where K is the distribution of n(ˆθ I θ ) and K is the bootstrap distribution of n(ˆθ ˆθ ), I conditional on S R and y hik, (h, i, k) S R. Proof. Define ζ (t) = n[f (θ +tn /2 ) (θ +tn /2 )]/f (θ )andη (t) = n[f (θ + tn /2 ) (ˆθ )]/f (θ ). ByLemmaandLemmainChenand Shao (996), ζ (0) ζ (t) = n[h (θ + tn /2 )+H (θ + tn /2 )]/f (θ )=o p (), where H (x) = I (x) I (θ ) F (x)+f (θ ). From (A2), Also, n F (θ ) (ˆθ ) f (θ ) Hence η(t) t = o p () and n [ˆθ n F (θ + tn /2 ) F (θ ) f (θ ) θ F (θ ) (θ ) ] f (θ ) t. n [ f (θ ) M + G ( ) max h,i,k whik ] = O p (n /2 ). M = n(ˆθ θ ) ζ (0) = o p(). (9) Then result (7) follows from (9) and the Bahadur representation for ˆθ I (Theorem 2 in Chen and Shao (996)). Result (8) can be shown using result (7) and the bootstrap central limit theorem (Bickel and Freedman (984)). The next result is for bootstrapping the sample low income proportion. Theorem 2. Assume that (A) holds and that (A2) holds when θ = µ and θ = 2 µ.then ˆρ ˆρ I = ( 2 µ ) I ( 2 µ )+ f( 2 µ) 2f (µ ) [ I (µ ) (µ )] + o p (n /2 ) (0) and sup K (x) K (x) = o p (), () x where K is the distribution of n(ˆρ I ρ ) and K is the bootstrap distribution of n(ˆρ ˆρ I ). Proof. Result () can be obtained from result (0). Hence we only need to show (0). From Lemma, U = ( 2 ˆµ ) ( 2 µ ) I ( 2 ˆµ )+ I ( 2 µ )=o p (n /2 ).

BOOTSTRAPPING QUANTILES FOR SURVEY DATA 077 Using (A2), Lemma, Theorem, and Theorem 3 in Chen and Shao (996), we have ˆρ ˆρ I = = = = ( 2 ˆµ ) I ( 2 ˆµI ) ( 2 µ ) I( 2 µ )+ I( 2 ˆµ ) I( 2 ˆµI )+U ( 2 µ ) I ( 2 µ )+F ( 2 ˆµ ) F ( 2 ˆµI )+o p (n /2 ) ( 2 µ ) I( 2 µ )+ f( 2 µ) 2f [ I (µ ) (µ ) (µ )] + o p (n /2 ). From Theorems and 2, the bootstrap confidence intervals C and C are asymptotically correct, i.e., P {θ C } 2α and P {ρ C } 2α. The consistency of the bootstrap variance estimators is established in the next two theorems. Their proofs are given in the Appendix. Theorem 3. Assume (A) and (A2 )(A2)holds with any δ = O(n /2 log n). Assume further that n +ɛ L n h h= i= ( E max y hik ɛ) 0 (2) k=,...,n hi for some ɛ>0and that lim inf nσ 2(θ ) > 0, where σ 2 (x) is the asymptotic variance of I (x). Then n[var (ˆθ ) σ 2 (θ )/f 2 (θ )] = o p (). (3) Remark. () Condition (A2 ) is slightly stronger than (A2). Using (A2 )and the same arguments in the proof of Lemma as in Chen and Shao (996), we can show that sup x θ cn /2 R (x) R (θ ) F (x)+f (θ ) = o p (n /2 ) (4) log n for any c>0. (2) Condition (2) is a very weak moment condition since ɛ can be any positive number. Without this condition, the bootstrap variance estimator for ˆθ I may be inconsistent (Ghosh, Parr, Singh and Babu (984)). Theorem 4. Assume that (A) holds and that (A2 ) holds with θ = 2 µ and θ = µ.then n[var (ˆρ ) γ2 ]=o p(), (5) where γ 2 is the asymptotic variance of ˆρ I given by γ 2 = σ2 ( 2 µ )+σ 2 (µ ) [ f( 2 µ) 2f (µ ) ] 2 σ ( 2 µ,µ ) f( 2 µ) f (µ ), (6)

078 JUN SHAO AND YINZHONG CHEN and σ (x, y) is the asymptotic covariance between I (x) and I (y). 4. A Simulation Study We present the results from a simulation study examining the performance of the bootstrap percentile confidence interval and the bootstrap variance estimator in a problem where ˆθ I =( I ) ( 2 ), the sample median. For comparison, we also included Woodruff s method in the simulation study. We considered a population with L = 32 strata. In the hth stratum, the i.i.d. y-values of the population were generated according to y hi N(Ȳh,σh 2), i =,...,N h, where the population parameters N h, Ȳh, andσ h are listed in Table. Table. Population parameters and n h h N h Ȳ h σ h n h h N h Ȳ h σ h n h 38 3.7 6.7 3 7 34 9.2 4.5 2 2 38 3.0 6.5 3 8 34 9.0 4.6 2 3 38 2.5 6.4 3 9 34 9.8 4.4 2 4 38 2.0 6.6 3 20 34 8.6 4. 2 5 38 2.3 6. 3 2 34 8.3 4.9 2 6 38.7 6.8 3 22 22 8.2 4.6 2 7 38.4 6.3 3 23 22 8.0 4.3 2 8 38.2 6.4 3 24 22 7.9 4.7 2 9 38.0 5.5 3 25 22 7.8 3. 2 0 38 0.8 5.6 3 26 22 7.5 3.9 2 38 0.6 5.9 3 27 22 7.2 3.7 2 2 34 0.3 5.3 2 28 22 7.0 3.6 2 3 34 0. 5.4 2 29 22 6.7 3.4 2 4 34 9.7 5.8 2 30 22 6.4 3.2 2 5 34 9.5 4.8 2 3 22 6. 3.5 2 6 34 9.4 4.7 2 32 22 6.0 3.7 2 After the population was generated, a simple random sample of size n h was drawn from stratum h, independently across the 32 strata. Thus, the sampling design is a stratified one stage simple random sampling. The sample sizes n h are also listed in Table. The respondents {y hi, (h, i) S R } were obtained by assuming that the sampled units responded with equal probability p. The missing values {y hi, (h, i) S M } were imputed by taking an i.i.d. sample from {y hi, (h, i) S R },withselection probability w hi / A r w hi for y hi,(h, i) S R, where the survey weight w hi = w h = N h /n h in this special case. The bootstrap percentile confidence interval (for the population median) and the bootstrap variance estimator (for the sample median ˆθ I ) were computed

BOOTSTRAPPING QUANTILES FOR SURVEY DATA 079 according to the procedure described in Section 2. Monte Carlo approximations of size,000 were used in computing quantities such as d a and Var (ˆθ ). The Woodruff s confidence interval is [( I ) ( 2.96ˆσ(ˆθ I )), ( I ) ( 2 +.96ˆσ(ˆθ I ))], where, for any fixed x, ˆσ 2 (x) is the adjusted jackknife variance estimator (Rao and Shao (992)) for the statistic I (x). Woodruff s variance estimator is given by [ ( I ) ( 2 +.96ˆσ (ˆθ I )) ( I ) ( 2.96ˆσ (ˆθ I )) ] 2. 2.96 The simulation size was 0,000. All the computations were done in UNIX at the Department of Statistics, University of Wisconsin-Madison, using IMSL subroutines GENNOR, IGNUIN and GENUNF for random number generations. Table 2. Results for the sample median RB (%) MSE Prob (%) Length p (%) True Var BM WM BM WM BM WM BM WM 00 0.69.4 6.3 0.32 0.37 92.6 93.6 3.9 3.0 90 0.86 0.4 2. 0.4 0.48 92.9 96.0 3.54 3.57 80.03 2.3 9.2 0.53 0.63 93. 96.8 3.9 4.03 70.20 5.9 25.9 0.67 0.80 93.3 97.4 4.28 4.47 60.47 4.7 25.9 0.84 0.98 93. 97.9 4.70 4.94 50.77 8.0 28.8.0.27 93.8 97.7 5.2 5.45 40 2.9.4 32.0.44.67 94.6 97.7 5.88 6.0 BM: the bootstrap method WM: Woodruff s method Table 2 lists, for some values of the response probability p, the true variance of ˆθ I (approximated by the sample variance of the 0,000 simulated values of ˆθ I ), the empirical relative bias (RB) of the bootstrap or Woodruff s variance estimator, which is defined as (the average of simulated variance estimates the true variance)/ the true variance, the mean square error (MSE) of the bootstrap or Woodruff s variance estimator, and the coverage probability and length of 95% bootstrap or Woodruff s confidence interval. The following is a summary of the simulation results.. Variance estimation. The bootstrap variance estimator performs well, but Woodruff s variance estimator has a large relative bias (except in the case of no missing data) which results in a large MSE. In the case of no missing data, the poor performance of Woodruff s variance estimator was noticed and explained by Kovar, Rao and Wu (988) and Sitter (992). In our simulation

080 JUN SHAO AND YINZHONG CHEN study, Woodruff s variance estimator behaves well when there is no missing data, but performs poorly when there are imputed data; the relative bias of Woodruff s variance estimator increases as the response rate decreases. 2. Confidence interval. In the case of no missing data, both confidence intervals have coverage probability about 93%; Woodruff s interval is slightly better and has slightly shorter length. When there are imputed data, Woodruff s interval is slightly longer than the bootstrap percentile confidence interval (which is related to the large positive bias of Woodruff s variance estimator); the coverage probability of the bootstrap percentile confidence interval is around 93-94%, whereas the coverage probability of Woodruff s interval is around 96-98%. The coverage error of Woodruff s interval is not serious in this simulation study, but the problem could be much worse if the coverage probability of Woodruff s interval were 95% in the case of no missing data or if the bias of Woodruff s variance estimator were negative instead of positive. Acknowledgement The research was partially supported by National Sciences Foundation Grant DMS-9504425 and National Security Agency Grant MDA904-96--0066. Appendix Proof of Lemma. The proof is very similar to that of Lemma in Chen and Shao (996). Let E be the expectation under the bootstrap imputation. Then E ( )=G R /q R = R, where G R = M whik I {yhik }, S R q R = G R ( ), and R = G R /q R. Define H (x) = (x) (θ ) R (x)+ R (θ ) and H R (x) =G R (x) G R (θ ) q R [ R (x) R (θ )]. Then E [H R (x)] = 0. Following the proof of Lemma in Chen and Shao (996), we can show that sup H x θ cn /2 (x)+h R (x)/q R = o p (n /2 )

BOOTSTRAPPING QUANTILES FOR SURVEY DATA 08 and sup H I (x) = o p(n /2 ). x θ cn /2 Hence, the result follows from H (x) =H (x)+h R (x)/q R H(x). I ProofofTheorem3. The proof parallels that of Theorem in Ghosh et al. (984), but some modifications are needed to take into account the stratified multistage sampling and the imputation. From (8) and ˆθ I θ = O p (n /2 ), it suffices to show that E n(ˆθ θ ) 2+δ =(2+δ) 0 t +δ P { n ˆθ θ >t}dt = O p () (7) for a δ>0, where E and P are the expectation and probability with respect to bootstrap sampling. Note that ˆθ ɛ n +ɛ n +ɛ max h,i,k y hik ɛ L n h n +ɛ h= i= max y hik ɛ = o p () k=,...,n hi under condition (2). Hence P {P { n ˆθ θ b n } =}, where b n = n /2+(+ɛ)/ɛ. Therefore, (7) follows from bn From (A), there is a constant c > 0 such that { P n sup x t +δ P { n ˆθ θ >t}dt = O p (). (8) } Var [G R (x) q R R (x)] c. Also, (A) implies the existence of a constant c 2 >0 such that sup max h,i,k w hik /M c 2 2.Letc ɛ = 2 [ 2 +(+ɛ)/ɛ](2 + δ)max{c,c 2 } and a n = c 0 cɛ log n, wherec 0 is a constant satisfying inf f (θ ) > 4p. By (4), R c 0 a n n /2 [ R (θ + a n n /2 ) R (θ ) F (θ + a n n /2 )+F (θ )] = o p (); by (A2 ), lim inf a n n/2 [F (θ + a n n /2 ) F (θ )] > 4p R c 0 and a n n/2 [ R (θ ) p] =o p (). Hence P { R (θ +a n n /2 ) p 4p R c 0 a nn /2 }. Thus, in the rest of the proof we may assume n sup x Var [G R (x) q R R (x)] c (9) and R (θ + a n n /2 ) p 4p R c 0 a nn /2. (20)

082 JUN SHAO AND YINZHONG CHEN Define B = {q R R ( R )(θ + a n n /2 ) c 0 a nn /2 } {q p R /2}. Using Bernstein s and Hoeffding s inequalities, together with (9), (A), (A2 ) and the choice of c ɛ, P (B )=O p(b (2+δ) n ). By (20), on the complement of the set B, q R Hence, by Hoeffding s inequality again, P { n(ˆθ [ (θ +a n n /2 ) p] c 0 a nn /2. θ ) a n } = P {p (θ + a n n /2 )} exp{ 2n[q R R ( (θ + a n n /2 ) p)] 2 /c 2 } + I B exp{ 2c ɛ log n/c 2 } + I B, where P is the probability with respect to the bootstrap imputation. Therefore, bn a n t +δ P { n(ˆθ Similarly, it can be shown that θ ) >t}dt = E bn a n t +δ P { n(ˆθ θ ) >t}dt E [b 2+δ n P { n(ˆθ θ ) >a n }] b 2+δ n [exp{ 2c ɛ log n/c 2 } + P (B)] = O p (). (2) bn a n Let t = θ + tn /2.Then an t +δ P { n(ˆθ θ ) >t}dt = t +δ P { n(ˆθ θ ) < t}dt = O p (). (22) an an an O p (n 2 ) t +δ P {p t +δ P { F (t ) t +δ (t )}dt [F (t ) p] 4 E F (t ) an t +δ t 4 dt, n 2 (t ) F (t ) p}dt (t ) 4 dt where the last inequality follows from (A2 )andthefactthat(a)and(a2 ) imply sup E F (t ) (t ) 4 = O p (n 2 ). (23) t a n

BOOTSTRAPPING QUANTILES FOR SURVEY DATA 083 By choosing δ (0, ), we obtain an t +δ P { n(ˆθ θ ) >t}dt = O p (). (24) Similarly, an t +δ P { n(ˆθ θ ) < t}dt = O p (). (25) Hence (8) follows from (2)-(25). This completes the proof. Proof of Theorem 4. From () and ˆρ I ρ = O p (n /2 ), we only need to show E n(ˆρ ρ ) 2+δ = O p () (26) for a δ>0. From (A2 ), there is a constant c>0such that n F ( 2 ˆµ ) ρ t implies n ˆµ µ ct for all and 0 <t log n. Notingthat0 F, we obtain E n[f ( 2 ˆµ ) ρ ] 2+δ 2n =(2+δ) t +δ P { n F ( 2 ˆµ ) ρ ) t}dt 0 (2n) +δ/2 P { n F ( 2 ˆµ ) ρ ) log n} by the proof of Theorem 3. Similarly, E n[ˆρ log n +(2+δ) 0 P { n F ( 2 ˆµ ) ρ ) t}dt (2n) +δ/2 P { n ˆµ µ c log n} log n +(2 + δ) P { n ˆµ µ ct}dt 0 = O p () (27) F ( 2 ˆµ )] 2+δ n +δ/2 P { n ˆµ µ log n} + E n[ˆρ F ( 2 ˆµ )] 2+δ I { n ˆµ µ log n} = O p () + E sup n[ (x) F (x)] 2+δ, (28) x D where D = {x : x 2 µ n /2 log n}. Hence (26) follows from (27), (28) and E sup n[ (x) F (x)] 2+δ = O p (). (29) x D Since E n[ ( 2 µ ) ρ ] 2+δ = O p (), (29) follows from (4) and E sup x D n[h (x)] 2+δ = O p (), (30)

084 JUN SHAO AND YINZHONG CHEN where H (x) is defined by (6) with θ = 2 µ. From the proof of Lemma in the Appendix, (30) follows from E max nh R (η l ) 2+δ = O p () (3) n l n and E max nh (η l ) 2+δ = O p (), (32) n l n where η l = 2 µ + n 3/2 log nl, l = n,...,n, H R (x) andh (x) are defined in the proof of Lemma. Using Bernstein s inequality, we obtain that E max nh R (η l ) 2+δ n l n { =(2+δ) t +δ P max nh R n l n 4(2 + δ)n = o p (). 0 0 t +δ exp } (η l ) t dt { t 2 } O p (n /2 )( dt log n + t) Therefore, (3) holds. (32) can be similarly shown. The proof is completed. References Bickel, P. J. and Freedman, D. A. (984). Asymptotic normality and the bootstrap in stratified sampling. Ann. Statist. 2, 470-482. Chen, Y. and Shao, J. (996). Inference with complex survey data imputed by hot deck when nonrespondents are nonidentifiable. Technical Report, Department of Statistics, University of Wisconsin-Madison. Efron, B. (979). Bootstrap methods: Another look at the jackknife. Ann. Statist. 7, -26. Efron, B. (994). Missing data, imputation, and the bootstrap. J. Amer. Statist. Assoc. 89, 463-479. Ghosh, J. K. (97). A new proof of the Bahadur representation of quantiles and an application. Ann. Math. Statist. 42, 957-96. Ghosh, M., Parr, W. C., Singh, K. and Babu, G. J. (984). A note on bootstrapping the sample median. Ann. Statist. 2, 30-35. Kovar, J. G., Rao, J. N. K. and Wu, C. F. J. (988). Bootstrap and other methods to measure errors in survey estimates. Canad. J. Statist. 6, Supplement, 25-45. McCarthy, P. J. and Snowden, C. B. (985). The bootstrap and finite population sampling. Vital and Health Statistics, 2-95, Public Health Service Publication 85-369, U.S. Government Printing Office, Washington, DC. Rao, J. N. K. and Shao, J. (992). Jackknife variance estimation with survey data under hot deck imputation. Biometrika 79, 8-822. Rao, J. N. K. and Wu, C. F. J. (988). Resampling inference with complex survey data. J. Amer. Statist. Assoc. 83, 23-24. Schenker, N. and Welsh, A. H. (988). Asymptotic results for multiple imputation. Ann. Statist. 6, 550-566.

BOOTSTRAPPING QUANTILES FOR SURVEY DATA 085 Shao, J. (996). Resampling methods in sample surveys (with discussions). Statist. 27, 203-254. Shao, J. and Rao, J. N. K. (994). Standard errors for low income proportions estimated from stratified multistage samples. Sankhya Ser. B, Special Volume 55, 393-44. Shao, J. and Sitter, R. R. (996). Bootstrap for imputed survey data. J. Amer. Statist. Assoc. 9, 278-288. Shao J. and Wu, C. F. J. (992). Asymptotic properties of the balanced repeated replication method for sample quantiles. Ann. Statist. 20, 57-593. Sitter, R. R. (992). A resampling procedure for complex survey data. J. Amer. Statist. Assoc. 87, 755-765. Wolfson, M. C. and Evans, J. M. (990). Statistics Canada low income cut-offs, methodological concerns and possibilities. Discussion Paper, Statistics Canada. Woodruff, R. S. (952). Confidence intervals for medians and other position measures. J. Amer. Statist. Assoc. 47, 635-646. Department of Statistics, University of Wisconsin-Madison, 20 West Dayton Street, Madison WI 53706-685, U.S.A. E-mail: shao@stat.wisc.edu (Received June 996; accepted June 997)