Jong-Min Kim* and Jon E. Anderson. Statistics Discipline Division of Science and Mathematics University of Minnesota at Morris
|
|
- William Preston Gray
- 5 years ago
- Views:
Transcription
1 Jackknife Variance Estimation of the Regression and Calibration Estimator for Two 2-Phase Samples Jong-Min Kim* and Jon E. Anderson Statistics Discipline Division of Science and Mathematics University of Minnesota at Morris 2005 ENAR, March 21 1
2 Outline Jackknife Background and Definitions Regression Estimator in Simple Random Sampling Regression Estimator in Stratified Random Sampling Calibration Estimation in Stratified Random Sampling Conclusions 2005 ENAR, March 21 2
3 Jackknife Background Introduced by Quenouille (1949, 1956) as a method to reduce bias Popularized by Tukey (1958) who used it for variances and CIs Arvesen (1969) was the first to propose two-sample jackknife estimator 2005 ENAR, March 21 3
4 Jackknife Definition Let ˆθ be an estimate Let ˆθ( j) be an estimator of the same form with observation j deleted The jackknife estimate of the variance of ˆθ is n 1 n n 2 [ˆθ( j) ˆθ] j= ENAR, March 21 4
5 Main Idea 2005 ENAR, March 21 5
6 Jackknife Variance Estimation of the Regression Estimator for Two Samples, Two-Phase, Simple Random Sampling Two, first-phase simple random samples s 1 of size n 1 and s 2 of size n 2 are taken without replacement from a population of N elements. Simple random subsamples s 1 of size n 1 and s 2 of size n 2 are taken without replacement from s 1 and s ENAR, March 21 6
7 The simple linear regression estimators for two samples, two-phase sampling are and y lr1 = y 1 + ˆβ 1 (x 1 x 1 ), y lr2 = y 2 + ˆβ 2 (x 2 x 2 ), where x 1 and x 2 are the means for first stage samples s 1 and s 2, x 1 and x 2 are the means for second-stage samples s 1 and s 2, and y lr1 and y lr2 are the means for second-stage samples s 1 and s 2, ˆβ 1 = ˆσ xy /ˆσ x 2 is the least squares regression coefficient of y on x based on second-stage sample s 1, and ˆβ 2 = ˆσ xy /ˆσ x 2 is the least squares regression coefficient of y 2 on x 2 based on second-stage sample s ENAR, March 21 7
8 We obtain a jackknife variance estimator for y lrk by recalculating y lrk with the jth element removed for each j s k then using the variance of these n k jackknife values, y lrk ( j). Clearly, deleting unit j will affect x k, y k and ˆβ k only if j s k and not if j s k s k, while it will affect x k for all j s k. Define y lrk ( j) = y k ( j) + ˆβ k ( j) [ ] x k( j) x k ( j), for all j s k, where x k ( j) = (n k x k x j)/(n k 1) for all j s k, (n k x k x j )/(n k 1), if (j s k ), x k ( j) = x k, if (j s k s k), 2005 ENAR, March 21 8
9 y k ( j) = (n k y k y j )/(n k 1), if (j s k ), y k, if (j s k s k), ˆβ k ( j) = ˆβ k ˆβ k, (x j x k )d j (n k 1)ˆσ 2 xk (1 k j ), if (j s k), if (j s k s k). where d j = y j y k ˆβ k (x j x k ) and k j = 1/n k + (x j x k ) 2 / [ (n k 1)ˆσ 2 xk]. Now apply the usual jackknife method to y rlk ( j) to get v Jlrk = n k 1 n k j s k [y lrk ( j) y lrk ] ENAR, March 21 9
10 Rao and Sitter approach to Jackknife Variance Estimator for Two Samples, SRS To use convenient one-phase sample variance formulae, Rao and Sitter (1995) proposed a similar device to facilitate computations for ratio imputation. For regression imputation we define, ẑ ki ( j) = y ki + {y ki( j) y ki}, for sample k = 1, 2, and yki ( j) is defined as, y ki( j) = y k ( j) + ˆβ k ( j)(x ki x k ( j)), so that ẑ ki ( j) = y ki for j s k s k in sample k = 1, 2, and ẑ ki ( j) = y ki ( j) for j s k in sample k = 1, ENAR, March 21 10
11 We also define the adjusted estimator, y a ki( j) = 1 n k 1 n k i=1 ẑ ki ( j), and this helps define the jackknife variance estimator for sample k, v Jlrk = n k 1 n k j s k [y a ki( j) y ki ] 2, where y ki = y klr under regression imputation. The jackknife variance estimator based on adjusted imputed estimators y ki, k = 1, 2 is a weighted average of two estimators, given by vjlr a = 1v n Jlr1 + n 2v Jlr2 n 1 + n 2 = n 1 1 n 1 + [y a 1I( j) y 1I ] 2 + n 2 1 n 2 n 1 + n 2 j s 1 l s 2 [y a 2I( l) y 2I ] ENAR, March 21 11
12 Simulation Study Design: Simple Random Sampling Population size is Pop Y is related to Pop X, Y = 0.8 X + ɛ Y X 2005 ENAR, March 21 12
13 Simulation: Simple Random Sampling One Sample Jackknife Variance, Simple Random Sampling J. Variance First Phase=2000 First Phase= Second Phase Sample Size 2005 ENAR, March 21 13
14 Simulation: Simple Random Sampling Two Sample Jackknife Variance, Simple Random Sampling J. Variance First Phase=1000 First Phase= Second Phase Sample Size 2005 ENAR, March 21 14
15 Simulation: One vs. Two SRS Comparison Mean Jackknife Variance, SRS J. Variance Two Samples One Sample Second Phase Sample Size 2005 ENAR, March 21 15
16 Simulation: One vs. Two SRS Comparison SD Jackknife Variance, SRS J. Variance Two Samples One Sample Second Phase Sample Size 2005 ENAR, March 21 16
17 Simulation: Missing and Complete Comparison SRS Two Sample Jackknife Variance, Simple Random Sampling J. Variance Complete Data Missing Data Second Phase Sample Size 2005 ENAR, March 21 17
18 Jackknife Variance Estimator for Two Samples, Two-Phase, Stratified Random Sampling Assume that x is observed on all sample units, s hk, for sample k = 1, 2 in stratum h. Simple linear regression imputation uses y hki = y hk + ˆβ hk (x hki x hk ) for i s hk s hk where y hk and x hk are the means of y and x for the respondents in group s hk in stratum h. ˆβ hk is the ordinary least squares regression based on the respondents, s hk in stratum h ENAR, March 21 18
19 The imputed values y hki are best predictors of unobserved y hki under the following superpopulation model ξ: E ξ (y hki ) = α hk + β hk x hki, V ξ (y hki ) = σ 2 h, cov ξ (y hki, y hkj ) = 0, for i j provided that the model also holds for the respondents s hk. Under regression imputation, y ki = W h y hki = h h W h [y hk + ˆβ ] hk (x hk x hk ), for k = 1, 2. It is readily seen that y hki ( hkj) = y hk( hkj) + ˆβ hk ( hkj) [x hki x hk ( hkj)] under regression imputation when the hkjth respondent is deleted, where ˆβ hk ( hkj) is the least squares regression coefficient when the hkjth respondent is deleted ENAR, March 21 19
20 Rao and Sitter Approach to Jackknife Variance Estimator for two samples, Stratified Random Sampling To use convenient one-phase sample variance formulae, Rao and Sitter (1995) proposed a similar device to facilitate computations for ratio imputation. For regression imputation we define, ẑ hki ( hkj) = y hki + {y hki( hkj) y hki}, so that ẑ hki ( hkj) = y hki for hkj s hk s hk in sample k = 1, 2, and ẑ hki ( hkj) = y hki ( hkj) for hkj s hk. in sample k = 1, 2, stratum h. We also define the adjusted estimator, y a hki( hkj) = 1 n hk 1 n hk i=1 ẑ hki ( hkj), 2005 ENAR, March 21 20
21 Using these values, the jackknife variance estimator of y ki is given by v Jlr (y ki ) = n hk 1 n hk n hk j=1 [y a ki( hkj) y ki ] 2. Noting that y a ki ( hkj) y ki = W h [y a hki ( hkj) y hki], where y a hki ( hkj) is the adjusted imputed estimator of the hth stratum mean Y h when hkjth sample unit is deleted, we get v Jlr (y ki ) = = Whv 2 Jlr (y hki ) W 2 h n hk 1 n hk n hk j=1 (y a hki( hkj) y hki ) ENAR, March 21 21
22 Simulation Study Design: Stratified Random Sampling Population size is N = Pop Y is related to Pop X, Y = 0.8 X + ɛ Three strata, X < 90, 90 X 110, 110 < X. Stratum 1 size = N 1 = 1633 s.t. W 1 = N 1 N = Stratum 2 size = N 2 = 6805 s.t. W 2 = N 2 N = Stratum 3 size = N 3 = 1562 s.t. W 3 = N 3 N = ENAR, March 21 22
23 Simulation Study Design: Stratified Random Sampling Stratum=3 Population Y Values Stratum=1 Stratum= Population X Values 2005 ENAR, March 21 23
24 Simulation: Stratified Random Sampling One Sample Jackknife Variance, Stratified Random Samplin J. Variance First Phase=2000 First Phase= Second Phase Sample Size 2005 ENAR, March 21 24
25 Simulation: Stratified Random Sampling Two Sample Jackknife Variance, Stratified Random Samplin J. Variance First Phase=1000 First Phase= Second Phase Sample Size 2005 ENAR, March 21 25
26 Calibration Approach to Jackknife Variance Estimation Three major advantages of calibration approach in Survey Sampling Leads to consistent estimates Provides an important class of techniques for the efficient combination of data sources. Has computational advantage for estimates. Apply Tracy et al. (2003) calibration in Stratified and Double Sampling to Jackknife Variance Estimator 2005 ENAR, March 21 26
27 Calibration Approach to Jackknife Variance Estimation We apply calibration estimation with ratio imputation in stratified random sampling. Suppose the population of N units consists of L strata such that the h-th stratum consists of N h units and L N h = N. Suppose that an auxiliary variable, x, closely related to an item y is observed on all sample units, s hk in sample k = 1, 2 for stratum h. Ratio imputation uses y hki = (y hk/x hk )x hki for i s hk s hk, and equals y hki when it is observed in sample s hk. Note that y hk and x hk are the means of y and x for the respondents s hk in stratum h ENAR, March 21 27
28 Under ratio imputation presented by Särndal (1992), y ki = W h y hki = W h (y hk /x hk )x hk, where x hk is the x mean for the full sample s hk h. from stratum Let ˆσ 2 hk (x) = (n xh 1) 1 n kh i=1 (x hki x hk ) be the variance of x in the first sample s hk from stratum h. The variance of x in the subsample, s hk, stratum h is ˆσ hk 2 (x) = (n hk 1) 1 i s hk (x hki x hk ) 2. Also, let ˆσ 2 hk (y) = (n hk 1) 1 i s hk (y hki y hk ) 2 be the variance of the target characteristic respondents in the subsample, s hk, from stratum h ENAR, March 21 28
29 We are considering the jackknife variance estimator of y I using calibration estimation. Let s define where Wh distance y ki = Wh y hki = Wh (y hk /x hk )x hk, are the calibrated weights such that the chi-square (W h W h) 2 W h Q h, is minimized subject to constraints given below, and the Q h are predefined weights used to obtain to different estimators. The above distance is minimized subject to these constraints Wh x hk = W h x hk, 2005 ENAR, March 21 29
30 and Wh ˆσ hk 2 = W hˆσ 2 hk (x), where W h = N h /N are known stratum weights. Then we get calibrated weights as W h = W h + {W h Q h x hk A} /C + { W h Q hˆσ 2 hk(x)b } /C, where A = W h x hk x hk W h Q h ˆσ hk 4 (x) W h ˆσ 2 hk (x) ˆσ 2 hk (x) L W h Q h x hk ˆσ hk 2 (x), B = W h ˆσ 2 hk (x) ˆσ 2 hk (x) L W h Q h x 2 hk W h x hk x hk 2 C = W h Q h x 2 hk W h Q h ˆσ hk 4 (x) W h Q h x hk ˆσ hk 2 (x) W h Q h x hk ˆσ hk 2 (x), 2005 ENAR, March 21 30
31 A calibrated estimator of y ki is given by y ki = W h (y hk /x hk )x hk [ + ˆβ L ) x1 W h (x ] hk x hk + ˆβ x2 [ L W h ( ˆσ 2 hk(x) ˆσ 2 hk (x)) ], where ( L ) ˆβ x1 = ( L {W h Q h x hk y hk A} /C, ) ˆβ x2 = W h Q hˆσ 2 hk(x)(y hk /x hk )x hkb/c ENAR, March 21 31
32 It is readily seen that y hki ( hkj) = [y hk( hkj)/x hk ( hkj)]x hki, under ratio imputation when hkjth respondent is deleted, y ki ( hkj) = W h (y hk ( hkj)/x hk ( hkj))x hk( hkj) + ˆβ x1 ( hkj) + ˆβ x2 ( hkj) [ L [ L ) W h (x ] hk ( hkj) x hk( hkj) W h ( ˆσ 2 hk( hkj)(x) ˆσ 2 hx ( hkj)) ], where x hk ( hkj) = (n hk x hk x hkj)/(n hk 1) for all j s hk y hk ( hkj) = [n hk y hk y hkj ] /(n hk 1), x hk ( hkj) = [n hk x hk x hkj ] /(n hk 1), 2005 ENAR, March 21 32
33 ˆβ x1 ( hkj) = W h Q h x hk ( hkj)y hk ( hkj)a( hkj) /C( hkj), ˆβ x2 ( hkj) = W h Q h ˆσ hk 2 ( hkj)(x)(y hk ( hkj)/x hk ( hkj))x hk ( hkj)b( hkj)/c( hkj), and ˆσ 2 hk ( hkj)(x) = (n hk 1) 1 n hk i=1 (x hki x hk ( hkj))2, and ˆσ 2 hk = (n hk 1) 1 i s (x hki x hk ( hkj)), where hk A( hkj) = W h x L hk ( hkj) x hk ( hkj) W h Q h ˆσ hk 4 ( hkj)(x) W h ˆσ 2 hk ( hkj)(x) ˆσ hk 2 L ( hkj)(x) W h Q h x hk ( hkj)ˆσ hk 2 ( hkj), B( hkj) = W h ˆσ 2 hk ( hkj)(x) ˆσ hk 2 L ( hkj) W h Q h x 2 hk ( hkj)(x) W h x L hk ( hkj) x hk ( hkj) W h Q h x hk ( hkj)ˆσ hk 2 ( hkj), C( hkj) = W h Q h x 2 hk ( hkj) W h Q h ˆσ hk 4 ( hkj)(x) 2 W h Q h x hk ( hkj)ˆσ hk 2 ( hkj)(x), 2005 ENAR, March 21 33
34 Using these values, the jackknife variance estimator of y ki is given by v Jcr (y ki ) = n hk 1 n hk n hk j=1 [y a ki( hkj) y ki ] 2. Noting that y a ki ( hkj) y ki = W h [y a hki ( hkj) y hki], where y a hki ( hkj) is the adjusted imputed estimator of the hth stratum mean Y h when hkjth sample unit is deleted, we get v Jcr (y ki ) = = Whv 2 Jcr (y hki ) W 2 h n hk 1 n hk n hk j=1 (y a hki( hkj) y hki ) ENAR, March 21 34
35 The jackknife variance estimator is a weighted average of two estimators, given by v Jcr = n 1v Jcr (y 1I ) + n 2v Jcr (y 2I ) n 1 + n 2 n 1 = n 1 + W 2 n h1 1 h n 2 n h1 n 2 + n 1 + W 2 n h2 1 h n 2 n h2 n h1 j=1 n h2 j=1 where n 1 = 2 n h1 and n 2 = 2 n h2. (y a h1i( h1j) y h1i ) 2 (y a h2i( h2j) y h2i ) ENAR, March 21 35
36 Conclusions Simulation shows greater benefit from increasing the first phase sample size compared to increasing the second phase sample size. Jackknife variance estimator for two samples has only a slightly smaller mean than Jackknife variance estimator for one sample. Jackknife variance estimator for two samples has less SD (variation) than Jackknife variance estimator for one sample. Future: Apply Jackknife variance estimator for two samples to Stratified Multistage Sampling ENAR, March 21 36
Jong-Min Kim* and Jon E. Anderson. Statistics Discipline Division of Science and Mathematics University of Minnesota at Morris
Jackknife Variance Estimation for Two Samples after Imputation under Two-Phase Sampling Jong-Min Kim* and Jon E. Anderson jongmink@mrs.umn.edu Statistics Discipline Division of Science and Mathematics
More informationImputation for Missing Data under PPSWR Sampling
July 5, 2010 Beijing Imputation for Missing Data under PPSWR Sampling Guohua Zou Academy of Mathematics and Systems Science Chinese Academy of Sciences 1 23 () Outline () Imputation method under PPSWR
More informationAn Overview of the Pros and Cons of Linearization versus Replication in Establishment Surveys
An Overview of the Pros and Cons of Linearization versus Replication in Establishment Surveys Richard Valliant University of Michigan and Joint Program in Survey Methodology University of Maryland 1 Introduction
More informationChapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70
Chapter 5: Models used in conjunction with sampling J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Nonresponse Unit Nonresponse: weight adjustment Item Nonresponse:
More informationREPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES
Statistica Sinica 8(1998), 1153-1164 REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES Wayne A. Fuller Iowa State University Abstract: The estimation of the variance of the regression estimator for
More informationTWO-WAY CONTINGENCY TABLES UNDER CONDITIONAL HOT DECK IMPUTATION
Statistica Sinica 13(2003), 613-623 TWO-WAY CONTINGENCY TABLES UNDER CONDITIONAL HOT DECK IMPUTATION Hansheng Wang and Jun Shao Peking University and University of Wisconsin Abstract: We consider the estimation
More informationNonresponse weighting adjustment using estimated response probability
Nonresponse weighting adjustment using estimated response probability Jae-kwang Kim Yonsei University, Seoul, Korea December 26, 2006 Introduction Nonresponse Unit nonresponse Item nonresponse Basic strategy
More informationWorkpackage 5 Resampling Methods for Variance Estimation. Deliverable 5.1
Workpackage 5 Resampling Methods for Variance Estimation Deliverable 5.1 2004 II List of contributors: Anthony C. Davison and Sylvain Sardy, EPFL. Main responsibility: Sylvain Sardy, EPFL. IST 2000 26057
More informationBOOTSTRAPPING SAMPLE QUANTILES BASED ON COMPLEX SURVEY DATA UNDER HOT DECK IMPUTATION
Statistica Sinica 8(998), 07-085 BOOTSTRAPPING SAMPLE QUANTILES BASED ON COMPLEX SURVEY DATA UNDER HOT DECK IMPUTATION Jun Shao and Yinzhong Chen University of Wisconsin-Madison Abstract: The bootstrap
More informationData Integration for Big Data Analysis for finite population inference
for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation
More informationEmpirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design
1 / 32 Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design Changbao Wu Department of Statistics and Actuarial Science University of Waterloo (Joint work with Min Chen and Mary
More informationREPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY
REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY J.D. Opsomer, W.A. Fuller and X. Li Iowa State University, Ames, IA 50011, USA 1. Introduction Replication methods are often used in
More informationIntroduction to Survey Data Analysis
Introduction to Survey Data Analysis JULY 2011 Afsaneh Yazdani Preface Learning from Data Four-step process by which we can learn from data: 1. Defining the Problem 2. Collecting the Data 3. Summarizing
More informationChapter 2. Section Section 2.9. J. Kim (ISU) Chapter 2 1 / 26. Design-optimal estimator under stratified random sampling
Chapter 2 Section 2.4 - Section 2.9 J. Kim (ISU) Chapter 2 1 / 26 2.4 Regression and stratification Design-optimal estimator under stratified random sampling where (Ŝxxh, Ŝxyh) ˆβ opt = ( x st, ȳ st )
More informationModel Assisted Survey Sampling
Carl-Erik Sarndal Jan Wretman Bengt Swensson Model Assisted Survey Sampling Springer Preface v PARTI Principles of Estimation for Finite Populations and Important Sampling Designs CHAPTER 1 Survey Sampling
More informationSongklanakarin Journal of Science and Technology SJST R3 LAWSON
Songklanakarin Journal of Science and Technology SJST-0-00.R LAWSON Ratio Estimators of Population Means Using Quartile Function of Auxiliary Variable in Double Sampling Journal: Songklanakarin Journal
More informationA decision theoretic approach to Imputation in finite population sampling
A decision theoretic approach to Imputation in finite population sampling Glen Meeden School of Statistics University of Minnesota Minneapolis, MN 55455 August 1997 Revised May and November 1999 To appear
More informationFractional Imputation in Survey Sampling: A Comparative Review
Fractional Imputation in Survey Sampling: A Comparative Review Shu Yang Jae-Kwang Kim Iowa State University Joint Statistical Meetings, August 2015 Outline Introduction Fractional imputation Features Numerical
More information6. Fractional Imputation in Survey Sampling
6. Fractional Imputation in Survey Sampling 1 Introduction Consider a finite population of N units identified by a set of indices U = {1, 2,, N} with N known. Associated with each unit i in the population
More informationEFFICIENT REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLING
Statistica Sinica 13(2003), 641-653 EFFICIENT REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLING J. K. Kim and R. R. Sitter Hankuk University of Foreign Studies and Simon Fraser University Abstract:
More informationSimple and Multiple Linear Regression
Sta. 113 Chapter 12 and 13 of Devore March 12, 2010 Table of contents 1 Simple Linear Regression 2 Model Simple Linear Regression A simple linear regression model is given by Y = β 0 + β 1 x + ɛ where
More informationIntroduction to Survey Data Integration
Introduction to Survey Data Integration Jae-Kwang Kim Iowa State University May 20, 2014 Outline 1 Introduction 2 Survey Integration Examples 3 Basic Theory for Survey Integration 4 NASS application 5
More informationCombining data from two independent surveys: model-assisted approach
Combining data from two independent surveys: model-assisted approach Jae Kwang Kim 1 Iowa State University January 20, 2012 1 Joint work with J.N.K. Rao, Carleton University Reference Kim, J.K. and Rao,
More informationMean estimation with calibration techniques in presence of missing data
Computational Statistics & Data Analysis 50 2006 3263 3277 www.elsevier.com/locate/csda Mean estimation with calibration techniues in presence of missing data M. Rueda a,, S. Martínez b, H. Martínez c,
More informationAsymptotic Normality under Two-Phase Sampling Designs
Asymptotic Normality under Two-Phase Sampling Designs Jiahua Chen and J. N. K. Rao University of Waterloo and University of Carleton Abstract Large sample properties of statistical inferences in the context
More informationA JACKKNIFE VARIANCE ESTIMATOR FOR SELF-WEIGHTED TWO-STAGE SAMPLES
Statistica Sinica 23 (2013), 595-613 doi:http://dx.doi.org/10.5705/ss.2011.263 A JACKKNFE VARANCE ESTMATOR FOR SELF-WEGHTED TWO-STAGE SAMPLES Emilio L. Escobar and Yves G. Berger TAM and University of
More informationin Survey Sampling Petr Novák, Václav Kosina Czech Statistical Office Using the Superpopulation Model for Imputations and Variance
Using the Superpopulation Model for Imputations and Variance Computation in Survey Sampling Czech Statistical Office Introduction Situation Let us have a population of N units: n sampled (sam) and N-n
More informationCombining Non-probability and Probability Survey Samples Through Mass Imputation
Combining Non-probability and Probability Survey Samples Through Mass Imputation Jae-Kwang Kim 1 Iowa State University & KAIST October 27, 2018 1 Joint work with Seho Park, Yilin Chen, and Changbao Wu
More informationSuccessive Difference Replication Variance Estimation in Two-Phase Sampling
Successive Difference Replication Variance Estimation in Two-Phase Sampling Jean D. Opsomer Colorado State University Michael White US Census Bureau F. Jay Breidt Colorado State University Yao Li Colorado
More informationOn the bias of the multiple-imputation variance estimator in survey sampling
J. R. Statist. Soc. B (2006) 68, Part 3, pp. 509 521 On the bias of the multiple-imputation variance estimator in survey sampling Jae Kwang Kim, Yonsei University, Seoul, Korea J. Michael Brick, Westat,
More informationINSTRUMENTAL-VARIABLE CALIBRATION ESTIMATION IN SURVEY SAMPLING
Statistica Sinica 24 (2014), 1001-1015 doi:http://dx.doi.org/10.5705/ss.2013.038 INSTRUMENTAL-VARIABLE CALIBRATION ESTIMATION IN SURVEY SAMPLING Seunghwan Park and Jae Kwang Kim Seoul National Univeristy
More informationComments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/21/03) Ed Stanek
Comments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/2/03) Ed Stanek Here are comments on the Draft Manuscript. They are all suggestions that
More informationResampling Variance Estimation in Surveys with Missing Data
Journal of Official Statistics, Vol. 23, No. 3, 2007, pp. 371 386 Resampling Variance Estimation in Surveys with Missing Data A.C. Davison 1 and S. Sardy 2 We discuss variance estimation by resampling
More informationLinear Models and Estimation by Least Squares
Linear Models and Estimation by Least Squares Jin-Lung Lin 1 Introduction Causal relation investigation lies in the heart of economics. Effect (Dependent variable) cause (Independent variable) Example:
More informationVARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA
Submitted to the Annals of Applied Statistics VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA By Jae Kwang Kim, Wayne A. Fuller and William R. Bell Iowa State University
More informationUNIVERSITY OF MASSACHUSETTS. Department of Mathematics and Statistics. Basic Exam - Applied Statistics. Tuesday, January 17, 2017
UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Tuesday, January 17, 2017 Work all problems 60 points are needed to pass at the Masters Level and 75
More informationBootstrap, Jackknife and other resampling methods
Bootstrap, Jackknife and other resampling methods Part VI: Cross-validation Rozenn Dahyot Room 128, Department of Statistics Trinity College Dublin, Ireland dahyot@mee.tcd.ie 2005 R. Dahyot (TCD) 453 Modern
More informationA measurement error model approach to small area estimation
A measurement error model approach to small area estimation Jae-kwang Kim 1 Spring, 2015 1 Joint work with Seunghwan Park and Seoyoung Kim Ouline Introduction Basic Theory Application to Korean LFS Discussion
More informationSampling from Finite Populations Jill M. Montaquila and Graham Kalton Westat 1600 Research Blvd., Rockville, MD 20850, U.S.A.
Sampling from Finite Populations Jill M. Montaquila and Graham Kalton Westat 1600 Research Blvd., Rockville, MD 20850, U.S.A. Keywords: Survey sampling, finite populations, simple random sampling, systematic
More informationLecture 4: Heteroskedasticity
Lecture 4: Heteroskedasticity Econometric Methods Warsaw School of Economics (4) Heteroskedasticity 1 / 24 Outline 1 What is heteroskedasticity? 2 Testing for heteroskedasticity White Goldfeld-Quandt Breusch-Pagan
More informationPropensity score adjusted method for missing data
Graduate Theses and Dissertations Graduate College 2013 Propensity score adjusted method for missing data Minsun Kim Riddles Iowa State University Follow this and additional works at: http://lib.dr.iastate.edu/etd
More informationChapter 4. Replication Variance Estimation. J. Kim, W. Fuller (ISU) Chapter 4 7/31/11 1 / 28
Chapter 4 Replication Variance Estimation J. Kim, W. Fuller (ISU) Chapter 4 7/31/11 1 / 28 Jackknife Variance Estimation Create a new sample by deleting one observation n 1 n n ( x (k) x) 2 = x (k) = n
More informationNew estimation methodology for the Norwegian Labour Force Survey
Notater Documents 2018/16 Melike Oguz-Alper New estimation methodology for the Norwegian Labour Force Survey Documents 2018/16 Melike Oguz Alper New estimation methodology for the Norwegian Labour Force
More information5.3 LINEARIZATION METHOD. Linearization Method for a Nonlinear Estimator
Linearization Method 141 properties that cover the most common types of complex sampling designs nonlinear estimators Approximative variance estimators can be used for variance estimation of a nonlinear
More informationNonrespondent subsample multiple imputation in two-phase random sampling for nonresponse
Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse Nanhua Zhang Division of Biostatistics & Epidemiology Cincinnati Children s Hospital Medical Center (Joint work
More informationGraybill Conference Poster Session Introductions
Graybill Conference Poster Session Introductions 2013 Graybill Conference in Modern Survey Statistics Colorado State University Fort Collins, CO June 10, 2013 Small Area Estimation with Incomplete Auxiliary
More informationMonte Carlo Study on the Successive Difference Replication Method for Non-Linear Statistics
Monte Carlo Study on the Successive Difference Replication Method for Non-Linear Statistics Amang S. Sukasih, Mathematica Policy Research, Inc. Donsig Jang, Mathematica Policy Research, Inc. Amang S. Sukasih,
More informationModel-assisted Estimation of Forest Resources with Generalized Additive Models
Model-assisted Estimation of Forest Resources with Generalized Additive Models Jean Opsomer, Jay Breidt, Gretchen Moisen, Göran Kauermann August 9, 2006 1 Outline 1. Forest surveys 2. Sampling from spatial
More informationSampling Techniques. Esra Akdeniz. February 9th, 2016
Sampling Techniques Esra Akdeniz February 9th, 2016 HOW TO DO RESEARCH? Question. Literature research. Hypothesis. Collect data. Analyze data. Interpret and present results. HOW TO DO RESEARCH? Collect
More informationTwo-phase sampling approach to fractional hot deck imputation
Two-phase sampling approach to fractional hot deck imputation Jongho Im 1, Jae-Kwang Kim 1 and Wayne A. Fuller 1 Abstract Hot deck imputation is popular for handling item nonresponse in survey sampling.
More informationAdvanced Econometrics
Based on the textbook by Verbeek: A Guide to Modern Econometrics Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna May 16, 2013 Outline Univariate
More informationEFFICIENCY OF MODEL-ASSISTED REGRESSION ESTIMATORS IN SAMPLE SURVEYS
Statistica Sinica 24 2014, 395-414 doi:ttp://dx.doi.org/10.5705/ss.2012.064 EFFICIENCY OF MODEL-ASSISTED REGRESSION ESTIMATORS IN SAMPLE SURVEYS Jun Sao 1,2 and Seng Wang 3 1 East Cina Normal University,
More informationStatistics 135: Fall 2004 Final Exam
Name: SID#: Statistics 135: Fall 2004 Final Exam There are 10 problems and the number of points for each is shown in parentheses. There is a normal table at the end. Show your work. 1. The designer of
More informationDeriving indicators from representative samples for the ESF
Deriving indicators from representative samples for the ESF Brussels, June 17, 2014 Ralf Münnich and Stefan Zins Lisa Borsi and Jan-Philipp Kolb GESIS Mannheim and University of Trier Outline 1 Choosing
More informationMODEL-ASSISTED AND MODEL-CALIBRATED ESTIMATION FOR CLASS FREQUENCIES WITH ORDINAL OUTCOMES
MODEL-ASSISTED AND MODEL-CALIBRATED ESTIMATION FOR CLASS FREQUENCIES WITH ORDINAL OUTCOMES Authors: Maria del Mar Rueda Department of Statistics and Operational Research, University of Granada, Spain (mrueda@ugr.es)
More informationLinear models and their mathematical foundations: Simple linear regression
Linear models and their mathematical foundations: Simple linear regression Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/21 Introduction
More informationASYMPTOTIC NORMALITY UNDER TWO-PHASE SAMPLING DESIGNS
Statistica Sinica 17(2007), 1047-1064 ASYMPTOTIC NORMALITY UNDER TWO-PHASE SAMPLING DESIGNS Jiahua Chen and J. N. K. Rao University of British Columbia and Carleton University Abstract: Large sample properties
More informationApplied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013
Applied Regression Chapter 2 Simple Linear Regression Hongcheng Li April, 6, 2013 Outline 1 Introduction of simple linear regression 2 Scatter plot 3 Simple linear regression model 4 Test of Hypothesis
More informationAdvanced Topics in Survey Sampling
Advanced Topics in Survey Sampling Jae-Kwang Kim Wayne A Fuller Pushpal Mukhopadhyay Department of Statistics Iowa State University World Statistics Congress Short Course July 23-24, 2015 Kim & Fuller
More informationEconomics Division University of Southampton Southampton SO17 1BJ, UK. Title Overlapping Sub-sampling and invariance to initial conditions
Economics Division University of Southampton Southampton SO17 1BJ, UK Discussion Papers in Economics and Econometrics Title Overlapping Sub-sampling and invariance to initial conditions By Maria Kyriacou
More informationEstimation for two-phase designs: semiparametric models and Z theorems
Estimation for two-phase designs:semiparametric models and Z theorems p. 1/27 Estimation for two-phase designs: semiparametric models and Z theorems Jon A. Wellner University of Washington Estimation for
More informationOne-stage dose-response meta-analysis
One-stage dose-response meta-analysis Nicola Orsini, Alessio Crippa Biostatistics Team Department of Public Health Sciences Karolinska Institutet http://ki.se/en/phs/biostatistics-team 2017 Nordic and
More informationRegression: Lecture 2
Regression: Lecture 2 Niels Richard Hansen April 26, 2012 Contents 1 Linear regression and least squares estimation 1 1.1 Distributional results................................ 3 2 Non-linear effects and
More informationBiostat 2065 Analysis of Incomplete Data
Biostat 2065 Analysis of Incomplete Data Gong Tang Dept of Biostatistics University of Pittsburgh September 13 & 15, 2005 1. Complete-case analysis (I) Complete-case analysis refers to analysis based on
More informationA MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR
Statistica Sinica 8(1998), 1165-1173 A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR Phillip S. Kott National Agricultural Statistics Service Abstract:
More informationThe Effect of Multiple Weighting Steps on Variance Estimation
Journal of Official Statistics, Vol. 20, No. 1, 2004, pp. 1 18 The Effect of Multiple Weighting Steps on Variance Estimation Richard Valliant 1 Multiple weight adjustments are common in surveys to account
More informationarxiv:math/ v1 [math.st] 23 Jun 2004
The Annals of Statistics 2004, Vol. 32, No. 2, 766 783 DOI: 10.1214/009053604000000175 c Institute of Mathematical Statistics, 2004 arxiv:math/0406453v1 [math.st] 23 Jun 2004 FINITE SAMPLE PROPERTIES OF
More informationSingh, S. (2013). A dual problem of calibration of design weights. Statistics: A Journal of Theoretical and Applied Statistics 47 (3),
Selected Publications: Sarjinder Singh Singh, S. (2013). A dual problem of calibration of design weights. Statistics: A Journal of Theoretical and Applied Statistics 47 (3), 566-574. Singh, S. (2012).
More informationJ.N.K. Rao, Carleton University Department of Mathematics & Statistics, Carleton University, Ottawa, Canada
JACKKNIFE VARIANCE ESTIMATION WITH IMPUTED SURVEY DATA J.N.K. Rao, Carleton University Department of Mathematics & Statistics, Carleton University, Ottawa, Canada KEY WORDS" Adusted imputed values, item
More informationSAS/STAT 14.2 User s Guide. Introduction to Survey Sampling and Analysis Procedures
SAS/STAT 14.2 User s Guide Introduction to Survey Sampling and Analysis Procedures This document is an individual chapter from SAS/STAT 14.2 User s Guide. The correct bibliographic citation for this manual
More informationEcon 2120: Section 2
Econ 2120: Section 2 Part I - Linear Predictor Loose Ends Ashesh Rambachan Fall 2018 Outline Big Picture Matrix Version of the Linear Predictor and Least Squares Fit Linear Predictor Least Squares Omitted
More informationFractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling
Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction
More informationCalibration estimation using exponential tilting in sample surveys
Calibration estimation using exponential tilting in sample surveys Jae Kwang Kim February 23, 2010 Abstract We consider the problem of parameter estimation with auxiliary information, where the auxiliary
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationChapter 8: Estimation 1
Chapter 8: Estimation 1 Jae-Kwang Kim Iowa State University Fall, 2014 Kim (ISU) Ch. 8: Estimation 1 Fall, 2014 1 / 33 Introduction 1 Introduction 2 Ratio estimation 3 Regression estimator Kim (ISU) Ch.
More informationHigh-dimensional regression modeling
High-dimensional regression modeling David Causeur Department of Statistics and Computer Science Agrocampus Ouest IRMAR CNRS UMR 6625 http://www.agrocampus-ouest.fr/math/causeur/ Course objectives Making
More informationYou are allowed 3? sheets of notes and a calculator.
Exam 1 is Wed Sept You are allowed 3? sheets of notes and a calculator The exam covers survey sampling umbers refer to types of problems on exam A population is the entire set of (potential) measurements
More informationA comparison of weighted estimators for the population mean. Ye Yang Weighting in surveys group
A comparison of weighted estimators for the population mean Ye Yang Weighting in surveys group Motivation Survey sample in which auxiliary variables are known for the population and an outcome variable
More informationOptimal Calibration Estimators Under Two-Phase Sampling
Journal of Of cial Statistics, Vol. 19, No. 2, 2003, pp. 119±131 Optimal Calibration Estimators Under Two-Phase Sampling Changbao Wu 1 and Ying Luan 2 Optimal calibration estimators require in general
More informationAn Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data
An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data Jae-Kwang Kim 1 Iowa State University June 28, 2012 1 Joint work with Dr. Ming Zhou (when he was a PhD student at ISU)
More informationWeighting Missing Data Coding and Data Preparation Wrap-up Preview of Next Time. Data Management
Data Management Department of Political Science and Government Aarhus University November 24, 2014 Data Management Weighting Handling missing data Categorizing missing data types Imputation Summary measures
More informationBayesian Linear Models
Bayesian Linear Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics, School of Public
More informationSTAT 512 sp 2018 Summary Sheet
STAT 5 sp 08 Summary Sheet Karl B. Gregory Spring 08. Transformations of a random variable Let X be a rv with support X and let g be a function mapping X to Y with inverse mapping g (A = {x X : g(x A}
More informationRobustness to Parametric Assumptions in Missing Data Models
Robustness to Parametric Assumptions in Missing Data Models Bryan Graham NYU Keisuke Hirano University of Arizona April 2011 Motivation Motivation We consider the classic missing data problem. In practice
More informationSimple design-efficient calibration estimators for rejective and high-entropy sampling
Biometrika (202), 99,, pp. 6 C 202 Biometrika Trust Printed in Great Britain Advance Access publication on 3 July 202 Simple design-efficient calibration estimators for rejective and high-entropy sampling
More informationagilis D1. Define Estimation Procedures European Commission Eurostat/B1, Eurostat/F1 Contract No
Informatics European Commission Eurostat/B1, Eurostat/F1 Contract No. 611.211.5-212.426 Development of methods and scenarios for an integrated system of D1. Define Estimation Procedures October 213 (Contract
More informationCausal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions
Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census
More informationIn Praise of the Listwise-Deletion Method (Perhaps with Reweighting)
In Praise of the Listwise-Deletion Method (Perhaps with Reweighting) Phillip S. Kott RTI International NISS Worshop on the Analysis of Complex Survey Data With Missing Item Values October 17, 2014 1 RTI
More informationRegression Analysis. Regression: Methodology for studying the relationship among two or more variables
Regression Analysis Regression: Methodology for studying the relationship among two or more variables Two major aims: Determine an appropriate model for the relationship between the variables Predict the
More informationNonparametric Regression Estimation of Finite Population Totals under Two-Stage Sampling
Nonparametric Regression Estimation of Finite Population Totals under Two-Stage Sampling Ji-Yeon Kim Iowa State University F. Jay Breidt Colorado State University Jean D. Opsomer Colorado State University
More informationChapter 4: Constrained estimators and tests in the multiple linear regression model (Part III)
Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III) Florian Pelgrin HEC September-December 2010 Florian Pelgrin (HEC) Constrained estimators September-December
More informationProperties of the least squares estimates
Properties of the least squares estimates 2019-01-18 Warmup Let a and b be scalar constants, and X be a scalar random variable. Fill in the blanks E ax + b) = Var ax + b) = Goal Recall that the least squares
More informationTaking into account sampling design in DAD. Population SAMPLING DESIGN AND DAD
Taking into account sampling design in DAD SAMPLING DESIGN AND DAD With version 4.2 and higher of DAD, the Sampling Design (SD) of the database can be specified in order to calculate the correct asymptotic
More informationProbability and Statistics Notes
Probability and Statistics Notes Chapter Seven Jesse Crawford Department of Mathematics Tarleton State University Spring 2011 (Tarleton State University) Chapter Seven Notes Spring 2011 1 / 42 Outline
More informationSensitivity of GLS estimators in random effects models
of GLS estimators in random effects models Andrey L. Vasnev (University of Sydney) Tokyo, August 4, 2009 1 / 19 Plan Plan Simulation studies and estimators 2 / 19 Simulation studies Plan Simulation studies
More informationMultidimensional Control Totals for Poststratified Weights
Multidimensional Control Totals for Poststratified Weights Darryl V. Creel and Mansour Fahimi Joint Statistical Meetings Minneapolis, MN August 7-11, 2005 RTI International is a trade name of Research
More informationA Design-Sensitive Approach to Fitting Regression Models With Complex Survey Data
A Design-Sensitive Approach to Fitting Regression Models With Complex Survey Data Phillip S. Kott RI International Rockville, MD Introduction: What Does Fitting a Regression Model with Survey Data Mean?
More informationWeight calibration and the survey bootstrap
Weight and the survey Department of Statistics University of Missouri-Columbia March 7, 2011 Motivating questions 1 Why are the large scale samples always so complex? 2 Why do I need to use weights? 3
More informationAssociation studies and regression
Association studies and regression CM226: Machine Learning for Bioinformatics. Fall 2016 Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar Association studies and regression 1 / 104 Administration
More informationSelection on Observables: Propensity Score Matching.
Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017
More information