Jong-Min Kim* and Jon E. Anderson. Statistics Discipline Division of Science and Mathematics University of Minnesota at Morris

Size: px
Start display at page:

Download "Jong-Min Kim* and Jon E. Anderson. Statistics Discipline Division of Science and Mathematics University of Minnesota at Morris"

Transcription

1 Jackknife Variance Estimation for Two Samples after Imputation under Two-Phase Sampling Jong-Min Kim* and Jon E. Anderson Statistics Discipline Division of Science and Mathematics University of Minnesota at Morris 2004 JSM, August 11 1

2 Outline Jackknife Background and Definition Ratio Estimator in Simple Random Sampling Jackknife variance estimators for two samples Simulation study Ratio Estimator in Stratified Random Sampling Jackknife variance estimators for two samples Simulation study Conclusions 2004 JSM, August 11 2

3 Jackknife Background Introduced by Quenouille (1949, 1956) as a method to reduce bias Popularized by Tukey (1958) who used it for variances and CIs Arvesen (1969) was the first to propose two-sample jackknife estimator 2004 JSM, August 11 3

4 Jackknife Definition Let ˆθ be an estimate Let ˆθ( j) be an estimator of the same form with observation j deleted The jackknife estimate of the variance of ˆθ is n 1 n n 2 [ˆθ( j) ˆθ] j= JSM, August 11 4

5 Naive Jackknife Variance Estimator for Two Samples, Two-Phase, Simple Random Sampling Two, first-phase simple random samples s 1 of size n 1 and s 2 of size n 2 are taken without replacement from a population of N elements. Simple random subsamples s 1 of size n 1 and s 2 of size n 2 are taken without replacement from s 1 and s 2. Ratio estimators of Y, are y r1 = (y 1 /x 1 )x 1 = ˆR 1 x 1, y r2 = (y 2 /x 2 )x 2 = ˆR 2 x 2, where x 1 and x 2 are the means for the first-phase samples s 1 and s 2 and (x 1, y 1 ) and (x 2, y 2 ) are the means for the second-phase samples s 1 and s JSM, August 11 5

6 Naive Jackknife Variance Estimator for Two Samples, SRS Define y rk ( j) = [y k ( j)/x k ( j)]x k( j), for all j s k, k = 1, 2, where (n k x k x j )/(n k 1), if (j s k ), x k ( j) = x k, if (j s k s k), y k ( j) = (n k y k y j )/(n k 1), if (j s k ), y k, if (j s k s k). and x k ( j) = (n k x k x j)/(n k 1) for all j s k JSM, August 11 6

7 Naive Jackknife Variance Estimator for Two Samples, SRS Apply the usual jackknife method for y rk ( j) to get v Jrk = n k 1 n k j s k [y rk ( j) y rk ] 2. The jackknife variance estimator is a weighted average of two estimators, given by v Jr = 1v n Jr1 + n 2v Jr2 n 1 + n 2 = n 1 1 n 1 + [y r1 ( j) y r1 ] 2 + n 2 1 n 2 n 1 + n 2 j s 1 l s 2 [y r2 ( l) y r2 ] JSM, August 11 7

8 Adjusted Jackknife Variance Estimator for Two Samples, Two-Phase, Simple Random Sampling Let s consider an adjusted jackknife variance estimator using a data imputation framework. In one of our samples, k = 1, 2 we define our estimator of Y as y ki = (1/n k ) i s yi. k If the observation is part of the second phase sample, s k, y i = y i, because y i is observed. If the value of y is not directly observed because it is part of s k s k, the value is obtained through ratio imputation as y i = (y k/x k )x i JSM, August 11 8

9 Adjusted Jackknife Variance Estimator for Two Samples, Two-Phase, Simple Random Sampling Rao and Sitter (1995) proposed the following device { ẑ ki ( j) = yki yk ( j) + x k ( j) x ki y } k x ki, x k for sample k = 1, 2. Under this formulation, ẑ ki ( j) = y ki = (y k/x k )x ki for j s k s k in sample k = 1, 2, and ẑ ki ( j) = (y k ( j)/x k ( j))x ki for j s k in sample k = 1, JSM, August 11 9

10 Define the adjusted estimator, y a ki( j) = 1 n k 1 n k i=1 ẑ ki ( j). Define the jackknife variance estimator for sample k, v Jrk = n k 1 n k j s k [y a ki( j) y ki ] 2, where y ki = y kr under ratio imputation. The jackknife variance estimator based on adjusted imputed estimators y ki, k = 1, 2 is a weighted average of two estimators, given by vjr a = 1v n Jr1 + n 2v Jr2 n 1 + n 2 = n 1 1 n 1 + [y a 1I( j) y 1I ] 2 + n 2 1 n 2 n 1 + n 2 j s 1 l s 2 [y a 2I( l) y 2I ] JSM, August 11 10

11 Simulation Study Design: Simple Random Sampling Population size is Pop Y is related to Pop X, Y = 0.8 X + ɛ Y X 2004 JSM, August 11 11

12 Simulation: Simple Random Sampling One Sample Jacknife Variance for Ratio Estimator J. Variance *SecondPhase 3*SecondPhase Second Phase Sample Size 2004 JSM, August 11 12

13 Simulation: Simple Random Sampling Two Sample Jackknife Variance for Ratio Estimator J. Variance *SecondPhase 3*SecondPhase Second Phase Sample Size 2004 JSM, August 11 13

14 Simulation: Simple Random Sampling Comparison of One vs Two Sample Estimators St. Dev of Ests One Sample St. Dev Two Sample St. Dev Second Phase Sample Size 2004 JSM, August 11 14

15 Adjusted Jackknife Variance Estimator for Two Samples, Two-Phase, Stratified Random Sampling Suppose the population of N units consists of L strata such that the h-th stratum consists of N h units and L h=1 N h = N. Suppose that an auxiliary variable, x, closely related to an item y is observed on all sample units, s hk in sample k = 1, 2 for stratum h. Ratio imputation uses y hki = (y hk/x hk )x hki for i s hk s hk where y hk and x hk are the means of y and x for the respondents s hk in stratum h JSM, August 11 15

16 Under ratio imputation, y ki = L W h y hki = h L W h (y hk /x hk )x hk, h where x hk is the x mean for the full sample s hk h. from stratum Also, y hki ( hkj) = [y hk( hkj)/x hk ( hkj)]x hki, under ratio imputation when hkjth respondent is deleted, where y hk ( hkj) = [n hk y hk y hj ] /(n hk 1) and x hk ( hkj) = [n hk x hk x hj ] /(n hk 1), for k = 1, JSM, August 11 16

17 Define ẑ hki ( hkj) = y hki + { yhk ( hkj) x hk ( hkj) x hki y } hk x hki, x hk for sample k = 1, 2, stratum h. Under this formulation, ẑ hki ( hkj) = y hki = (y hk/x hk )x hki for hkj s hk s hk in sample k = 1, 2, and ẑ hki ( hkj) = (y hk ( hkj)/x hk ( hkj))x hki for hkj s hk in sample k = 1, 2, stratum h. Define the adjusted estimator, y a hki( hkj) = 1 n hk 1 n hk i=1 ẑ hki ( hkj). Using these values, the jackknife variance estimator is given by v Jr (y ki ) = L h=1 n hk 1 n hk n hk j=1 [y a ki( hkj) y ki ] 2, 2004 JSM, August 11 17

18 Adjusted Jackknife Variance Estimator for Two Samples,Two-Phase, Stratified Random Sampling Noting that y a ki ( hkj) y ki = W h [y a hki ( hkj) y hki], where y a hki ( hkj) is the adjusted imputed estimator of the hth stratum mean Y h when hkjth sample unit is deleted, we get v Jr (y ki ) = = L Whv 2 Jr (y hki ) h=1 L h=1 W 2 h n hk 1 n hk n hk j=1 (y a hki( hkj) y hki ) 2, for k = 1, JSM, August 11 18

19 The jackknife variance estimator is a weighted average of two estimators, given by v Jrs = 1v n Jr (y 1I ) + n 2v Jr (y 2I ) n 1 + n 2 n L 1 = n 1 + W 2 n h1 1 h n 2 n h=1 h1 n L 2 + n 1 + W 2 n h2 1 h n 2 n h2 h=1 n h1 j=1 n h2 j=1 where n 1 = 2 h=1 n h1 and n 2 = 2 h=1 n h2. (y a h1i( h1j) y h1i ) 2 (y a h2i( h2j) y h2i ) JSM, August 11 19

20 Simulation Study Design: Stratified Random Sampling Population size is N = Pop Y is related to Pop X, Y = 0.8 X + ɛ Three strata, X < 90, 90 X 110, 110 < X. Stratum 1 size = N 1 = 1633 s.t. W 1 = N 1 N = Stratum 2 size = N 2 = 6805 s.t. W 2 = N 2 N = Stratum 3 size = N 3 = 1562 s.t. W 3 = N 3 N = JSM, August 11 20

21 Simulation Study Design: Stratified Random Sampling Stratum=3 Population Y Values Stratum=1 Stratum= Population X Values 2004 JSM, August 11 21

22 Simulation: Stratified Random Sampling One Sample Jackknife Variance, Stratified Sampling J. Variance *SecondPhase 3*SecondPhase Second Phase Sample Size 2004 JSM, August 11 22

23 Simulation: Stratified Random Sampling Two Sample Jackknife Variance, Stratified Sampling J. Variance *SecondPhase 3*SecondPhase Second Phase Sample Size 2004 JSM, August 11 23

24 Simulation: Stratified Random Sampling Comparison of One and Two Samples, Stratified RS St. Dev of Ests One Sample St. Dev Two Sample St. Dev Second Phase Sample Size 2004 JSM, August 11 24

25 Simulation: Stratified Random Sampling Comparison of Complete and Missing Data St. Dev of Ests Complete Data Missing Data Case Second Phase Sample Size Stratified RS, First SS= 2*SecondPhase 2004 JSM, August 11 25

26 Calibration Approach to Jackknife Variance Estimation Three major advantages of calibration approach in Survey Sampling Leads to consistent estimates Provides an important class of techniques for the efficient combination of data sources. Has computational advantage for estimates. Apply Tracy et al. (2003) calibration in Stratified and Double Sampling to Jackknife Variance Estimator 2004 JSM, August 11 26

27 Conclusions and Future Study Jackknife variance estimator for two samples has less SD (variation) than Jackknife variance estimator for one sample. Adjusted Jackknife variance estimator for two samples, Stratified RS, is more efficient than the Adjusted Jackknife variance estimator for two samples, SRS. As the sample sizes increase, the adjusted Jackknife variance estimator for two-sample is shown to be consistent. Future: Apply Jackknife variance estimator for two samples to Stratified Multistage Sampling JSM, August 11 27

Jong-Min Kim* and Jon E. Anderson. Statistics Discipline Division of Science and Mathematics University of Minnesota at Morris

Jong-Min Kim* and Jon E. Anderson. Statistics Discipline Division of Science and Mathematics University of Minnesota at Morris Jackknife Variance Estimation of the Regression and Calibration Estimator for Two 2-Phase Samples Jong-Min Kim* and Jon E. Anderson jongmink@morris.umn.edu Statistics Discipline Division of Science and

More information

Imputation for Missing Data under PPSWR Sampling

Imputation for Missing Data under PPSWR Sampling July 5, 2010 Beijing Imputation for Missing Data under PPSWR Sampling Guohua Zou Academy of Mathematics and Systems Science Chinese Academy of Sciences 1 23 () Outline () Imputation method under PPSWR

More information

An Overview of the Pros and Cons of Linearization versus Replication in Establishment Surveys

An Overview of the Pros and Cons of Linearization versus Replication in Establishment Surveys An Overview of the Pros and Cons of Linearization versus Replication in Establishment Surveys Richard Valliant University of Michigan and Joint Program in Survey Methodology University of Maryland 1 Introduction

More information

Data Integration for Big Data Analysis for finite population inference

Data Integration for Big Data Analysis for finite population inference for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation

More information

REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES

REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES Statistica Sinica 8(1998), 1153-1164 REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES Wayne A. Fuller Iowa State University Abstract: The estimation of the variance of the regression estimator for

More information

Fractional Imputation in Survey Sampling: A Comparative Review

Fractional Imputation in Survey Sampling: A Comparative Review Fractional Imputation in Survey Sampling: A Comparative Review Shu Yang Jae-Kwang Kim Iowa State University Joint Statistical Meetings, August 2015 Outline Introduction Fractional imputation Features Numerical

More information

VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA

VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA Submitted to the Annals of Applied Statistics VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA By Jae Kwang Kim, Wayne A. Fuller and William R. Bell Iowa State University

More information

Nonresponse weighting adjustment using estimated response probability

Nonresponse weighting adjustment using estimated response probability Nonresponse weighting adjustment using estimated response probability Jae-kwang Kim Yonsei University, Seoul, Korea December 26, 2006 Introduction Nonresponse Unit nonresponse Item nonresponse Basic strategy

More information

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design 1 / 32 Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design Changbao Wu Department of Statistics and Actuarial Science University of Waterloo (Joint work with Min Chen and Mary

More information

Introduction to Survey Data Analysis

Introduction to Survey Data Analysis Introduction to Survey Data Analysis JULY 2011 Afsaneh Yazdani Preface Learning from Data Four-step process by which we can learn from data: 1. Defining the Problem 2. Collecting the Data 3. Summarizing

More information

6. Fractional Imputation in Survey Sampling

6. Fractional Imputation in Survey Sampling 6. Fractional Imputation in Survey Sampling 1 Introduction Consider a finite population of N units identified by a set of indices U = {1, 2,, N} with N known. Associated with each unit i in the population

More information

REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY

REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY J.D. Opsomer, W.A. Fuller and X. Li Iowa State University, Ames, IA 50011, USA 1. Introduction Replication methods are often used in

More information

TWO-WAY CONTINGENCY TABLES UNDER CONDITIONAL HOT DECK IMPUTATION

TWO-WAY CONTINGENCY TABLES UNDER CONDITIONAL HOT DECK IMPUTATION Statistica Sinica 13(2003), 613-623 TWO-WAY CONTINGENCY TABLES UNDER CONDITIONAL HOT DECK IMPUTATION Hansheng Wang and Jun Shao Peking University and University of Wisconsin Abstract: We consider the estimation

More information

Chapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70

Chapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Chapter 5: Models used in conjunction with sampling J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Nonresponse Unit Nonresponse: weight adjustment Item Nonresponse:

More information

BOOTSTRAPPING SAMPLE QUANTILES BASED ON COMPLEX SURVEY DATA UNDER HOT DECK IMPUTATION

BOOTSTRAPPING SAMPLE QUANTILES BASED ON COMPLEX SURVEY DATA UNDER HOT DECK IMPUTATION Statistica Sinica 8(998), 07-085 BOOTSTRAPPING SAMPLE QUANTILES BASED ON COMPLEX SURVEY DATA UNDER HOT DECK IMPUTATION Jun Shao and Yinzhong Chen University of Wisconsin-Madison Abstract: The bootstrap

More information

Workpackage 5 Resampling Methods for Variance Estimation. Deliverable 5.1

Workpackage 5 Resampling Methods for Variance Estimation. Deliverable 5.1 Workpackage 5 Resampling Methods for Variance Estimation Deliverable 5.1 2004 II List of contributors: Anthony C. Davison and Sylvain Sardy, EPFL. Main responsibility: Sylvain Sardy, EPFL. IST 2000 26057

More information

A decision theoretic approach to Imputation in finite population sampling

A decision theoretic approach to Imputation in finite population sampling A decision theoretic approach to Imputation in finite population sampling Glen Meeden School of Statistics University of Minnesota Minneapolis, MN 55455 August 1997 Revised May and November 1999 To appear

More information

Chapter 4. Replication Variance Estimation. J. Kim, W. Fuller (ISU) Chapter 4 7/31/11 1 / 28

Chapter 4. Replication Variance Estimation. J. Kim, W. Fuller (ISU) Chapter 4 7/31/11 1 / 28 Chapter 4 Replication Variance Estimation J. Kim, W. Fuller (ISU) Chapter 4 7/31/11 1 / 28 Jackknife Variance Estimation Create a new sample by deleting one observation n 1 n n ( x (k) x) 2 = x (k) = n

More information

Combining data from two independent surveys: model-assisted approach

Combining data from two independent surveys: model-assisted approach Combining data from two independent surveys: model-assisted approach Jae Kwang Kim 1 Iowa State University January 20, 2012 1 Joint work with J.N.K. Rao, Carleton University Reference Kim, J.K. and Rao,

More information

Two-phase sampling approach to fractional hot deck imputation

Two-phase sampling approach to fractional hot deck imputation Two-phase sampling approach to fractional hot deck imputation Jongho Im 1, Jae-Kwang Kim 1 and Wayne A. Fuller 1 Abstract Hot deck imputation is popular for handling item nonresponse in survey sampling.

More information

Model Assisted Survey Sampling

Model Assisted Survey Sampling Carl-Erik Sarndal Jan Wretman Bengt Swensson Model Assisted Survey Sampling Springer Preface v PARTI Principles of Estimation for Finite Populations and Important Sampling Designs CHAPTER 1 Survey Sampling

More information

Successive Difference Replication Variance Estimation in Two-Phase Sampling

Successive Difference Replication Variance Estimation in Two-Phase Sampling Successive Difference Replication Variance Estimation in Two-Phase Sampling Jean D. Opsomer Colorado State University Michael White US Census Bureau F. Jay Breidt Colorado State University Yao Li Colorado

More information

EFFICIENT REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLING

EFFICIENT REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLING Statistica Sinica 13(2003), 641-653 EFFICIENT REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLING J. K. Kim and R. R. Sitter Hankuk University of Foreign Studies and Simon Fraser University Abstract:

More information

Singh, S. (2013). A dual problem of calibration of design weights. Statistics: A Journal of Theoretical and Applied Statistics 47 (3),

Singh, S. (2013). A dual problem of calibration of design weights. Statistics: A Journal of Theoretical and Applied Statistics 47 (3), Selected Publications: Sarjinder Singh Singh, S. (2013). A dual problem of calibration of design weights. Statistics: A Journal of Theoretical and Applied Statistics 47 (3), 566-574. Singh, S. (2012).

More information

A JACKKNIFE VARIANCE ESTIMATOR FOR SELF-WEIGHTED TWO-STAGE SAMPLES

A JACKKNIFE VARIANCE ESTIMATOR FOR SELF-WEIGHTED TWO-STAGE SAMPLES Statistica Sinica 23 (2013), 595-613 doi:http://dx.doi.org/10.5705/ss.2011.263 A JACKKNFE VARANCE ESTMATOR FOR SELF-WEGHTED TWO-STAGE SAMPLES Emilio L. Escobar and Yves G. Berger TAM and University of

More information

Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse

Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse Nanhua Zhang Division of Biostatistics & Epidemiology Cincinnati Children s Hospital Medical Center (Joint work

More information

Songklanakarin Journal of Science and Technology SJST R3 LAWSON

Songklanakarin Journal of Science and Technology SJST R3 LAWSON Songklanakarin Journal of Science and Technology SJST-0-00.R LAWSON Ratio Estimators of Population Means Using Quartile Function of Auxiliary Variable in Double Sampling Journal: Songklanakarin Journal

More information

Asymptotic Normality under Two-Phase Sampling Designs

Asymptotic Normality under Two-Phase Sampling Designs Asymptotic Normality under Two-Phase Sampling Designs Jiahua Chen and J. N. K. Rao University of Waterloo and University of Carleton Abstract Large sample properties of statistical inferences in the context

More information

Chapter 8: Estimation 1

Chapter 8: Estimation 1 Chapter 8: Estimation 1 Jae-Kwang Kim Iowa State University Fall, 2014 Kim (ISU) Ch. 8: Estimation 1 Fall, 2014 1 / 33 Introduction 1 Introduction 2 Ratio estimation 3 Regression estimator Kim (ISU) Ch.

More information

Taking into account sampling design in DAD. Population SAMPLING DESIGN AND DAD

Taking into account sampling design in DAD. Population SAMPLING DESIGN AND DAD Taking into account sampling design in DAD SAMPLING DESIGN AND DAD With version 4.2 and higher of DAD, the Sampling Design (SD) of the database can be specified in order to calculate the correct asymptotic

More information

Sampling from Finite Populations Jill M. Montaquila and Graham Kalton Westat 1600 Research Blvd., Rockville, MD 20850, U.S.A.

Sampling from Finite Populations Jill M. Montaquila and Graham Kalton Westat 1600 Research Blvd., Rockville, MD 20850, U.S.A. Sampling from Finite Populations Jill M. Montaquila and Graham Kalton Westat 1600 Research Blvd., Rockville, MD 20850, U.S.A. Keywords: Survey sampling, finite populations, simple random sampling, systematic

More information

Sampling Techniques. Esra Akdeniz. February 9th, 2016

Sampling Techniques. Esra Akdeniz. February 9th, 2016 Sampling Techniques Esra Akdeniz February 9th, 2016 HOW TO DO RESEARCH? Question. Literature research. Hypothesis. Collect data. Analyze data. Interpret and present results. HOW TO DO RESEARCH? Collect

More information

You are allowed 3? sheets of notes and a calculator.

You are allowed 3? sheets of notes and a calculator. Exam 1 is Wed Sept You are allowed 3? sheets of notes and a calculator The exam covers survey sampling umbers refer to types of problems on exam A population is the entire set of (potential) measurements

More information

J.N.K. Rao, Carleton University Department of Mathematics & Statistics, Carleton University, Ottawa, Canada

J.N.K. Rao, Carleton University Department of Mathematics & Statistics, Carleton University, Ottawa, Canada JACKKNIFE VARIANCE ESTIMATION WITH IMPUTED SURVEY DATA J.N.K. Rao, Carleton University Department of Mathematics & Statistics, Carleton University, Ottawa, Canada KEY WORDS" Adusted imputed values, item

More information

It should be unbiased, or approximately unbiased. Variance of the variance estimator should be small. That is, the variance estimator is stable.

It should be unbiased, or approximately unbiased. Variance of the variance estimator should be small. That is, the variance estimator is stable. Chapter 10 Variace Estimatio 10.1 Itroductio Variace estimatio is a importat practical problem i survey samplig. Variace estimates are used i two purposes. Oe is the aalytic purpose such as costructig

More information

A note on multiple imputation for general purpose estimation

A note on multiple imputation for general purpose estimation A note on multiple imputation for general purpose estimation Shu Yang Jae Kwang Kim SSC meeting June 16, 2015 Shu Yang, Jae Kwang Kim Multiple Imputation June 16, 2015 1 / 32 Introduction Basic Setup Assume

More information

Resampling Variance Estimation in Surveys with Missing Data

Resampling Variance Estimation in Surveys with Missing Data Journal of Official Statistics, Vol. 23, No. 3, 2007, pp. 371 386 Resampling Variance Estimation in Surveys with Missing Data A.C. Davison 1 and S. Sardy 2 We discuss variance estimation by resampling

More information

Weighting Missing Data Coding and Data Preparation Wrap-up Preview of Next Time. Data Management

Weighting Missing Data Coding and Data Preparation Wrap-up Preview of Next Time. Data Management Data Management Department of Political Science and Government Aarhus University November 24, 2014 Data Management Weighting Handling missing data Categorizing missing data types Imputation Summary measures

More information

Estimation of change in a rotation panel design

Estimation of change in a rotation panel design Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS028) p.4520 Estimation of change in a rotation panel design Andersson, Claes Statistics Sweden S-701 89 Örebro, Sweden

More information

COMPLEX SAMPLING DESIGNS FOR THE CUSTOMER SATISFACTION INDEX ESTIMATION

COMPLEX SAMPLING DESIGNS FOR THE CUSTOMER SATISFACTION INDEX ESTIMATION STATISTICA, anno LXVII, n. 3, 2007 COMPLEX SAMPLING DESIGNS FOR THE CUSTOMER SATISFACTION INDEX ESTIMATION. INTRODUCTION Customer satisfaction (CS) is a central concept in marketing and is adopted as an

More information

Deriving indicators from representative samples for the ESF

Deriving indicators from representative samples for the ESF Deriving indicators from representative samples for the ESF Brussels, June 17, 2014 Ralf Münnich and Stefan Zins Lisa Borsi and Jan-Philipp Kolb GESIS Mannheim and University of Trier Outline 1 Choosing

More information

Large n normal approximations (Central Limit Theorem). xbar ~ N[mu, sigma 2 / n] (sketch a normal with mean mu and sd = sigma / root(n)).

Large n normal approximations (Central Limit Theorem). xbar ~ N[mu, sigma 2 / n] (sketch a normal with mean mu and sd = sigma / root(n)). 1. 5-1, 5-2, 5-3 Large n normal approximations (Central Limit Theorem). xbar ~ N[mu, sigma 2 / n] (sketch a normal with mean mu and sd = sigma / root(n)). phat ~ N[p, pq / n] (sketch a normal with mean

More information

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction

More information

A Design-Sensitive Approach to Fitting Regression Models With Complex Survey Data

A Design-Sensitive Approach to Fitting Regression Models With Complex Survey Data A Design-Sensitive Approach to Fitting Regression Models With Complex Survey Data Phillip S. Kott RI International Rockville, MD Introduction: What Does Fitting a Regression Model with Survey Data Mean?

More information

Robustness to Parametric Assumptions in Missing Data Models

Robustness to Parametric Assumptions in Missing Data Models Robustness to Parametric Assumptions in Missing Data Models Bryan Graham NYU Keisuke Hirano University of Arizona April 2011 Motivation Motivation We consider the classic missing data problem. In practice

More information

Mean estimation with calibration techniques in presence of missing data

Mean estimation with calibration techniques in presence of missing data Computational Statistics & Data Analysis 50 2006 3263 3277 www.elsevier.com/locate/csda Mean estimation with calibration techniues in presence of missing data M. Rueda a,, S. Martínez b, H. Martínez c,

More information

Monte Carlo Study on the Successive Difference Replication Method for Non-Linear Statistics

Monte Carlo Study on the Successive Difference Replication Method for Non-Linear Statistics Monte Carlo Study on the Successive Difference Replication Method for Non-Linear Statistics Amang S. Sukasih, Mathematica Policy Research, Inc. Donsig Jang, Mathematica Policy Research, Inc. Amang S. Sukasih,

More information

A comparison of weighted estimators for the population mean. Ye Yang Weighting in surveys group

A comparison of weighted estimators for the population mean. Ye Yang Weighting in surveys group A comparison of weighted estimators for the population mean Ye Yang Weighting in surveys group Motivation Survey sample in which auxiliary variables are known for the population and an outcome variable

More information

Biostat 2065 Analysis of Incomplete Data

Biostat 2065 Analysis of Incomplete Data Biostat 2065 Analysis of Incomplete Data Gong Tang Dept of Biostatistics University of Pittsburgh September 13 & 15, 2005 1. Complete-case analysis (I) Complete-case analysis refers to analysis based on

More information

Combining Non-probability and Probability Survey Samples Through Mass Imputation

Combining Non-probability and Probability Survey Samples Through Mass Imputation Combining Non-probability and Probability Survey Samples Through Mass Imputation Jae-Kwang Kim 1 Iowa State University & KAIST October 27, 2018 1 Joint work with Seho Park, Yilin Chen, and Changbao Wu

More information

ESTIMATION OF FINITE POPULATION MEAN USING KNOWN CORRELATION COEFFICIENT BETWEEN AUXILIARY CHARACTERS

ESTIMATION OF FINITE POPULATION MEAN USING KNOWN CORRELATION COEFFICIENT BETWEEN AUXILIARY CHARACTERS STATISTICA, anno LXV, n. 4, 005 ESTIMATION OF FINITE POPULATION MEAN USING KNOWN CORRELATION COEFFICIENT BETWEEN AUXILIAR CHARACTERS. INTRODUCTION Let U { U, U,..., U N } be a finite population of N units.

More information

ASYMPTOTIC NORMALITY UNDER TWO-PHASE SAMPLING DESIGNS

ASYMPTOTIC NORMALITY UNDER TWO-PHASE SAMPLING DESIGNS Statistica Sinica 17(2007), 1047-1064 ASYMPTOTIC NORMALITY UNDER TWO-PHASE SAMPLING DESIGNS Jiahua Chen and J. N. K. Rao University of British Columbia and Carleton University Abstract: Large sample properties

More information

Handling Missing Data on Asymmetric Distribution

Handling Missing Data on Asymmetric Distribution International Matematical Forum, Vol. 8, 03, no. 4, 53-65 Handling Missing Data on Asymmetric Distribution Amad M. H. Al-Kazale Department of Matematics, Faculty of Science Al-albayt University, Al-Mafraq-Jordan

More information

Optimal Calibration Estimators Under Two-Phase Sampling

Optimal Calibration Estimators Under Two-Phase Sampling Journal of Of cial Statistics, Vol. 19, No. 2, 2003, pp. 119±131 Optimal Calibration Estimators Under Two-Phase Sampling Changbao Wu 1 and Ying Luan 2 Optimal calibration estimators require in general

More information

of being selected and varying such probability across strata under optimal allocation leads to increased accuracy.

of being selected and varying such probability across strata under optimal allocation leads to increased accuracy. 5 Sampling with Unequal Probabilities Simple random sampling and systematic sampling are schemes where every unit in the population has the same chance of being selected We will now consider unequal probability

More information

Estimation of Parameters and Variance

Estimation of Parameters and Variance Estimation of Parameters and Variance Dr. A.C. Kulshreshtha U.N. Statistical Institute for Asia and the Pacific (SIAP) Second RAP Regional Workshop on Building Training Resources for Improving Agricultural

More information

The Effect of Multiple Weighting Steps on Variance Estimation

The Effect of Multiple Weighting Steps on Variance Estimation Journal of Official Statistics, Vol. 20, No. 1, 2004, pp. 1 18 The Effect of Multiple Weighting Steps on Variance Estimation Richard Valliant 1 Multiple weight adjustments are common in surveys to account

More information

New estimation methodology for the Norwegian Labour Force Survey

New estimation methodology for the Norwegian Labour Force Survey Notater Documents 2018/16 Melike Oguz-Alper New estimation methodology for the Norwegian Labour Force Survey Documents 2018/16 Melike Oguz Alper New estimation methodology for the Norwegian Labour Force

More information

Variance Estimation for Calibration to Estimated Control Totals

Variance Estimation for Calibration to Estimated Control Totals Variance Estimation for Calibration to Estimated Control Totals Siyu Qing Coauthor with Michael D. Larsen Associate Professor of Statistics Tuesday, 11/05/2013 2 Outline A. Background B. Calibration Technique

More information

In Praise of the Listwise-Deletion Method (Perhaps with Reweighting)

In Praise of the Listwise-Deletion Method (Perhaps with Reweighting) In Praise of the Listwise-Deletion Method (Perhaps with Reweighting) Phillip S. Kott RTI International NISS Worshop on the Analysis of Complex Survey Data With Missing Item Values October 17, 2014 1 RTI

More information

IE 361 Module 4. Modeling Measurement. Reading: Section 2.1 Statistical Methods for Quality Assurance. ISU and Analytics Iowa LLC

IE 361 Module 4. Modeling Measurement. Reading: Section 2.1 Statistical Methods for Quality Assurance. ISU and Analytics Iowa LLC IE 361 Module 4 Modeling Measurement Reading: Section 2.1 Statistical Methods for Quality Assurance ISU and Analytics Iowa LLC (ISU and Analytics Iowa LLC) IE 361 Module 4 1 / 12 Using Probability to Describe

More information

Sampling Weights. Pierre Foy

Sampling Weights. Pierre Foy Sampling Weights Pierre Foy 11 Sampling Weights Pierre Foy 189 11.1 Overview The basic sample design used in TIMSS 1999 was a two-stage stratified cluster design, with schools as the first stage and classrooms

More information

CPT Section D Quantitative Aptitude Chapter 15. Prof. Bharat Koshti

CPT Section D Quantitative Aptitude Chapter 15. Prof. Bharat Koshti CPT Section D Quantitative Aptitude Chapter 15 Prof. Bharat Koshti Different Methods of Sampling 1. Probability Sampling Methods 2. Non-Probabilistic Sampling Methods 3. Mixed Sampling Methods (1) Probability

More information

agilis D1. Define Estimation Procedures European Commission Eurostat/B1, Eurostat/F1 Contract No

agilis D1. Define Estimation Procedures European Commission Eurostat/B1, Eurostat/F1 Contract No Informatics European Commission Eurostat/B1, Eurostat/F1 Contract No. 611.211.5-212.426 Development of methods and scenarios for an integrated system of D1. Define Estimation Procedures October 213 (Contract

More information

BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition, International Publication,

BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition, International Publication, STATISTICS IN TRANSITION-new series, August 2011 223 STATISTICS IN TRANSITION-new series, August 2011 Vol. 12, No. 1, pp. 223 230 BOOK REVIEW Sampling: Design and Analysis. Sharon L. Lohr. 2nd Edition,

More information

Statistics 135: Fall 2004 Final Exam

Statistics 135: Fall 2004 Final Exam Name: SID#: Statistics 135: Fall 2004 Final Exam There are 10 problems and the number of points for each is shown in parentheses. There is a normal table at the end. Show your work. 1. The designer of

More information

SAS/STAT 14.2 User s Guide. Introduction to Survey Sampling and Analysis Procedures

SAS/STAT 14.2 User s Guide. Introduction to Survey Sampling and Analysis Procedures SAS/STAT 14.2 User s Guide Introduction to Survey Sampling and Analysis Procedures This document is an individual chapter from SAS/STAT 14.2 User s Guide. The correct bibliographic citation for this manual

More information

Survey Sample Methods

Survey Sample Methods Survey Sample Methods p. 1/54 Survey Sample Methods Evaluators Toolbox Refreshment Abhik Roy & Kristin Hobson abhik.r.roy@wmich.edu & kristin.a.hobson@wmich.edu Western Michigan University AEA Evaluation

More information

Multidimensional Control Totals for Poststratified Weights

Multidimensional Control Totals for Poststratified Weights Multidimensional Control Totals for Poststratified Weights Darryl V. Creel and Mansour Fahimi Joint Statistical Meetings Minneapolis, MN August 7-11, 2005 RTI International is a trade name of Research

More information

On the bias of the multiple-imputation variance estimator in survey sampling

On the bias of the multiple-imputation variance estimator in survey sampling J. R. Statist. Soc. B (2006) 68, Part 3, pp. 509 521 On the bias of the multiple-imputation variance estimator in survey sampling Jae Kwang Kim, Yonsei University, Seoul, Korea J. Michael Brick, Westat,

More information

5.3 LINEARIZATION METHOD. Linearization Method for a Nonlinear Estimator

5.3 LINEARIZATION METHOD. Linearization Method for a Nonlinear Estimator Linearization Method 141 properties that cover the most common types of complex sampling designs nonlinear estimators Approximative variance estimators can be used for variance estimation of a nonlinear

More information

Weight calibration and the survey bootstrap

Weight calibration and the survey bootstrap Weight and the survey Department of Statistics University of Missouri-Columbia March 7, 2011 Motivating questions 1 Why are the large scale samples always so complex? 2 Why do I need to use weights? 3

More information

New Developments in Nonresponse Adjustment Methods

New Developments in Nonresponse Adjustment Methods New Developments in Nonresponse Adjustment Methods Fannie Cobben January 23, 2009 1 Introduction In this paper, we describe two relatively new techniques to adjust for (unit) nonresponse bias: The sample

More information

arxiv:math/ v1 [math.st] 23 Jun 2004

arxiv:math/ v1 [math.st] 23 Jun 2004 The Annals of Statistics 2004, Vol. 32, No. 2, 766 783 DOI: 10.1214/009053604000000175 c Institute of Mathematical Statistics, 2004 arxiv:math/0406453v1 [math.st] 23 Jun 2004 FINITE SAMPLE PROPERTIES OF

More information

Research Methods in Environmental Science

Research Methods in Environmental Science Research Methods in Environmental Science Module 5: Research Methods in the Physical Sciences Research Methods in the Physical Sciences Obviously, the exact techniques and methods you will use will vary

More information

Model-assisted Estimation of Forest Resources with Generalized Additive Models

Model-assisted Estimation of Forest Resources with Generalized Additive Models Model-assisted Estimation of Forest Resources with Generalized Additive Models Jean Opsomer, Jay Breidt, Gretchen Moisen, Göran Kauermann August 9, 2006 1 Outline 1. Forest surveys 2. Sampling from spatial

More information

Jakarta, Indonesia,29 Sep-10 October 2014.

Jakarta, Indonesia,29 Sep-10 October 2014. Regional Training Course on Sampling Methods for Producing Core Data Items for Agricultural and Rural Statistics Jakarta, Indonesia,29 Sep-0 October 204. LEARNING OBJECTIVES At the end of this session

More information

New Method to Estimate Missing Data by Using the Asymmetrical Winsorized Mean in a Time Series

New Method to Estimate Missing Data by Using the Asymmetrical Winsorized Mean in a Time Series Applied Mathematical Sciences, Vol. 3, 2009, no. 35, 1715-1726 New Method to Estimate Missing Data by Using the Asymmetrical Winsorized Mean in a Time Series Ahmad Mahir R. and A. M. H. Al-Khazaleh 1 School

More information

in Survey Sampling Petr Novák, Václav Kosina Czech Statistical Office Using the Superpopulation Model for Imputations and Variance

in Survey Sampling Petr Novák, Václav Kosina Czech Statistical Office Using the Superpopulation Model for Imputations and Variance Using the Superpopulation Model for Imputations and Variance Computation in Survey Sampling Czech Statistical Office Introduction Situation Let us have a population of N units: n sampled (sam) and N-n

More information

On the Use of Compromised Imputation for Missing data using Factor-Type Estimators

On the Use of Compromised Imputation for Missing data using Factor-Type Estimators J. Stat. Appl. Pro. Lett. 2, No. 2, 105-113 (2015) 105 Journal of Statistics Applications & Probability Letters An International Journal http://dx.doi.org/10.12785/jsapl/020202 On the Use of Compromised

More information

A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR

A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR Statistica Sinica 8(1998), 1165-1173 A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR Phillip S. Kott National Agricultural Statistics Service Abstract:

More information

EFFICIENCY OF MODEL-ASSISTED REGRESSION ESTIMATORS IN SAMPLE SURVEYS

EFFICIENCY OF MODEL-ASSISTED REGRESSION ESTIMATORS IN SAMPLE SURVEYS Statistica Sinica 24 2014, 395-414 doi:ttp://dx.doi.org/10.5705/ss.2012.064 EFFICIENCY OF MODEL-ASSISTED REGRESSION ESTIMATORS IN SAMPLE SURVEYS Jun Sao 1,2 and Seng Wang 3 1 East Cina Normal University,

More information

Sampling Concepts. IUFRO-SPDC Snowbird, UT September 29 Oct 3, 2014 Drs. Rolfe Leary and John A. Kershaw, Jr.

Sampling Concepts. IUFRO-SPDC Snowbird, UT September 29 Oct 3, 2014 Drs. Rolfe Leary and John A. Kershaw, Jr. Sampling Concepts IUFRO-SPDC Snowbird, UT September 9 Oct 3, 014 Drs. Rolfe Leary and John A. Kershaw, Jr. Sampling Concepts Simple Sampling Strategies: Random Sampling Systematic Sampling Stratified Sampling

More information

Sampling: What you don t know can hurt you. Juan Muñoz

Sampling: What you don t know can hurt you. Juan Muñoz Sampling: What you don t know can hurt you Juan Muñoz Outline of presentation Basic concepts Scientific Sampling Simple Random Sampling Sampling Errors and Confidence Intervals Sampling error and sample

More information

Estimation for two-phase designs: semiparametric models and Z theorems

Estimation for two-phase designs: semiparametric models and Z theorems Estimation for two-phase designs:semiparametric models and Z theorems p. 1/27 Estimation for two-phase designs: semiparametric models and Z theorems Jon A. Wellner University of Washington Estimation for

More information

A Family of Estimators for Estimating The Population Mean in Stratified Sampling

A Family of Estimators for Estimating The Population Mean in Stratified Sampling Rajesh Singh, Mukesh Kumar, Sachin Malik, Manoj K. Chaudhary Department of Statistics, B.H.U., Varanasi (U.P.)-India Florentin Smarandache Department of Mathematics, University of New Mexico, Gallup, USA

More information

What is Survey Weighting? Chris Skinner University of Southampton

What is Survey Weighting? Chris Skinner University of Southampton What is Survey Weighting? Chris Skinner University of Southampton 1 Outline 1. Introduction 2. (Unresolved) Issues 3. Further reading etc. 2 Sampling 3 Representation 4 out of 8 1 out of 10 4 Weights 8/4

More information

Estimating Unemployment for Small Areas in Navarra, Spain

Estimating Unemployment for Small Areas in Navarra, Spain Estimating Unemployment for Small Areas in Navarra, Spain Ugarte, M.D., Militino, A.F., an Goicoa, T. Departamento e Estaística e Investigación Operativa, Universia Pública e Navarra Campus e Arrosaía,

More information

Propensity score adjusted method for missing data

Propensity score adjusted method for missing data Graduate Theses and Dissertations Graduate College 2013 Propensity score adjusted method for missing data Minsun Kim Riddles Iowa State University Follow this and additional works at: http://lib.dr.iastate.edu/etd

More information

VARIANCE ESTIMATION FOR COMBINED RATIO ESTIMATOR

VARIANCE ESTIMATION FOR COMBINED RATIO ESTIMATOR Sankyā : Te Indian Journal of Statistics 1995, Volume 57, Series B, Pt. 1, pp. 85-92 VARIANCE ESTIMATION FOR COMBINED RATIO ESTIMATOR By SANJAY KUMAR SAXENA Central Soil and Water Conservation Researc

More information

Efficient estimators for adaptive two-stage sequential sampling

Efficient estimators for adaptive two-stage sequential sampling 0.8Copyedited by: AA 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 Journal of Statistical Computation and

More information

Missing Covariate Data in Matched Case-Control Studies

Missing Covariate Data in Matched Case-Control Studies Missing Covariate Data in Matched Case-Control Studies Department of Statistics North Carolina State University Paul Rathouz Dept. of Health Studies U. of Chicago prathouz@health.bsd.uchicago.edu with

More information

Part 7: Glossary Overview

Part 7: Glossary Overview Part 7: Glossary Overview In this Part This Part covers the following topic Topic See Page 7-1-1 Introduction This section provides an alphabetical list of all the terms used in a STEPS surveillance with

More information

Investigating the Use of Stratified Percentile Ranked Set Sampling Method for Estimating the Population Mean

Investigating the Use of Stratified Percentile Ranked Set Sampling Method for Estimating the Population Mean royecciones Journal of Mathematics Vol. 30, N o 3, pp. 351-368, December 011. Universidad Católica del Norte Antofagasta - Chile Investigating the Use of Stratified ercentile Ranked Set Sampling Method

More information

Sociology 6Z03 Review I

Sociology 6Z03 Review I Sociology 6Z03 Review I John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review I Fall 2016 1 / 19 Outline: Review I Introduction Displaying Distributions Describing

More information

SAMPLING- Method of Psychology. By- Mrs Neelam Rathee, Dept of Psychology. PGGCG-11, Chandigarh.

SAMPLING- Method of Psychology. By- Mrs Neelam Rathee, Dept of Psychology. PGGCG-11, Chandigarh. By- Mrs Neelam Rathee, Dept of 2 Sampling is that part of statistical practice concerned with the selection of a subset of individual observations within a population of individuals intended to yield some

More information

Sampling techniques for big data analysis in finite population inference

Sampling techniques for big data analysis in finite population inference Statistics Preprints Statistics 1-29-2018 Sampling techniques for big data analysis in finite population inference Jae Kwang Kim Iowa State University, jkim@iastate.edu Zhonglei Wang Iowa State University,

More information

'LVNXVVLRQVSDSLHUH 'LVFXVVLRQ3DSHUV

'LVNXVVLRQVSDSLHUH 'LVFXVVLRQ3DSHUV 'LVNXVVLRQVSDSLHUH 'LVFXVVLRQ3DSHUV 'LVFXVVLRQ3DSHU1R 'HULYDWLRQRIGHVLJQZHLJKWV7KHFDVHRIWKH *HUPDQ6RFLR(FRQRPLF3DQHO*62(3 E\ 0DUWLQ6SLHVV %HUOLQ)HEUXDU\ *HUPDQ6RFLR(FRQRPLF3DQHO6WXG\*HUPDQ,QVWLWXWHIRU(FRQRPLF5HVHDUFK.RHQLJLQ/XLVH6WU

More information

A measurement error model approach to small area estimation

A measurement error model approach to small area estimation A measurement error model approach to small area estimation Jae-kwang Kim 1 Spring, 2015 1 Joint work with Seunghwan Park and Seoyoung Kim Ouline Introduction Basic Theory Application to Korean LFS Discussion

More information

Advising on Research Methods: A consultant's companion. Herman J. Ader Gideon J. Mellenbergh with contributions by David J. Hand

Advising on Research Methods: A consultant's companion. Herman J. Ader Gideon J. Mellenbergh with contributions by David J. Hand Advising on Research Methods: A consultant's companion Herman J. Ader Gideon J. Mellenbergh with contributions by David J. Hand Contents Preface 13 I Preliminaries 19 1 Giving advice on research methods

More information