Implications of Ignoring the Uncertainty in Control Totals for Generalized Regression Estimators. Calibration Estimators

Size: px
Start display at page:

Download "Implications of Ignoring the Uncertainty in Control Totals for Generalized Regression Estimators. Calibration Estimators"

Transcription

1 Implications of Ignoring the Uncertainty in Control Totals for Generalized Regression Estimators Jill A. Dever, RTI Richard Valliant, JPSM & ISR is a trade name of Research Triangle Institute. Calibration Estimators Calibrate auxiliary estimates (X) to corresponding population control totals (Deville & Särndal 1992; Särndal, Swensson, & Wretman 1992) Can produce efficient estimates (Särndal & Lundström 2005; Kott 2006) Reduce errors linked to sampling frame and nonresponse Reduce variation in weights Regression approach (Sarndal 2007) = working prediction model (Kott 2006) to closely represent the relationship between y and X Specification of the model is important for precision 1

2 The GREG, tˆ Generalized Regression (Calibration) Estimators ( ˆ ) = tˆ + t t Bˆ y R A y x A x A 1 = 1 ( ˆ ) d d + t t x x x y k A s l sa x A x l l l k k k Working prediction model: 2 ( ) = x B, ( ) = E y V a r y ε k k ε k σ k Note: When x k is substituted for y k, then by definition t ˆ A = x t x GREG vs. Estimated Control (EC) GREG t ˆ ( ˆ ) ˆ = t ˆ + t t B GREG: y R A y x A x A t ˆ ( ˆ ˆ ) ˆ = t ˆ + t t B EC-GREG: y R A y B x A x A w h t e= r x e ; ˆ ; = t d x x jj AU xk ks k and tˆ = B x j s j j B w x A 2

3 Scope of the Research Analytic survey (requires calibration) stratified multi-stage design, PSUs selected WR negligible nonresponse Bernoulli distribution used for random undercoverage error in frame Benchmark survey (estimated controls) independent from analytic survey probability-based sampling design Estimated-Control (EC) Calibration with Generalized Regression Estimation calculated from complete "population" with varying levels of precision Theory Theoretical and Empirical Evaluation Asymptotic bias and variance developed Simulation Study (Poststratified Estimator) Population = Subset of 2003 NHIS (N=21,664) Poststrata = Age group (8) by gender 4,000 frames = Coverage rates vary by poststrata Sensitivity analysis: Two outcome variables (y k ) Size of analytic survey sample Relative size of the benchmark survey 3

4 Biast ( ˆ yecr ) Theoretical Bias 4 components Unbiased if all conditions are met: 1) Model residuals are not correlated to coverage propensities 2) Same populations for analytic and benchmark surveys 3) Average model residual is zero 4) Unbiased estimates from the reference survey Empirical Bias for Estimate Total is Lower with EC Poststratification Adjustment RelBias* EC Poststratification 0.2 % None % * Hispanic persons who delayed medical care because of cost 4

5 EC-GREG Linearization Variance Estimator var ( ˆ yecr ) t = var ( t ˆ yr ) + f + f ( benchmark covariance matrix ) ( coverage propensity, model residuals ) EC-GREG Jackknife Variance Estimator tˆ = t + B x B x adjustment where adjustment is defined by: Fuller method [ECF2] = eigenvalue decomposition of benchmark covariance matrix (Fuller 1998) Nadimpalli et al. method [ECNJC] = controls chosen from N(0,1) with diagonal of covariance matrix (NJC 2004) Multi.Normal method [ECMV] = controls chosen from MV distribution with covariance matrix (Dever & Valliant 2010) 5

6 Average Percent Relative Bias Naive ECNJCm Other (ECTS, ECF2m, ECMV) Empirical Percent Bias of Variance Relative to MSE* * Hispanic persons only Relative Size of Benchmark Survey to Analytic Survey Summary of Research to Date Theoretical Empirical Variance* Estimators ~ Unbiased? Ratios Totals Naïve Œ Œ Œ Linearization Fuller NJC - N(0,1) Ø or Œ - Œ MV Normal Ævariation * Variation in subdomain results warrants additional research. 6

7 Future Research Thresholds Traditional variance estimators are acceptable Benchmark estimates too imprecise for use Variance adjustment for negative bias Improve precision for subdomains Constrained EC calibration Effects of nonresponse and measurement errors Two-phase designs with mix of controls More Information? Jill A. Dever 7

Model Assisted Survey Sampling

Model Assisted Survey Sampling Carl-Erik Sarndal Jan Wretman Bengt Swensson Model Assisted Survey Sampling Springer Preface v PARTI Principles of Estimation for Finite Populations and Important Sampling Designs CHAPTER 1 Survey Sampling

More information

Variance Estimation for Calibration to Estimated Control Totals

Variance Estimation for Calibration to Estimated Control Totals Variance Estimation for Calibration to Estimated Control Totals Siyu Qing Coauthor with Michael D. Larsen Associate Professor of Statistics Tuesday, 11/05/2013 2 Outline A. Background B. Calibration Technique

More information

The Effect of Multiple Weighting Steps on Variance Estimation

The Effect of Multiple Weighting Steps on Variance Estimation Journal of Official Statistics, Vol. 20, No. 1, 2004, pp. 1 18 The Effect of Multiple Weighting Steps on Variance Estimation Richard Valliant 1 Multiple weight adjustments are common in surveys to account

More information

Multidimensional Control Totals for Poststratified Weights

Multidimensional Control Totals for Poststratified Weights Multidimensional Control Totals for Poststratified Weights Darryl V. Creel and Mansour Fahimi Joint Statistical Meetings Minneapolis, MN August 7-11, 2005 RTI International is a trade name of Research

More information

In Praise of the Listwise-Deletion Method (Perhaps with Reweighting)

In Praise of the Listwise-Deletion Method (Perhaps with Reweighting) In Praise of the Listwise-Deletion Method (Perhaps with Reweighting) Phillip S. Kott RTI International NISS Worshop on the Analysis of Complex Survey Data With Missing Item Values October 17, 2014 1 RTI

More information

An Overview of the Pros and Cons of Linearization versus Replication in Establishment Surveys

An Overview of the Pros and Cons of Linearization versus Replication in Establishment Surveys An Overview of the Pros and Cons of Linearization versus Replication in Establishment Surveys Richard Valliant University of Michigan and Joint Program in Survey Methodology University of Maryland 1 Introduction

More information

Finite Population Sampling and Inference

Finite Population Sampling and Inference Finite Population Sampling and Inference A Prediction Approach RICHARD VALLIANT ALAN H. DORFMAN RICHARD M. ROYALL A Wiley-Interscience Publication JOHN WILEY & SONS, INC. New York Chichester Weinheim Brisbane

More information

Comments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/21/03) Ed Stanek

Comments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/21/03) Ed Stanek Comments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/2/03) Ed Stanek Here are comments on the Draft Manuscript. They are all suggestions that

More information

REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY

REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY REPLICATION VARIANCE ESTIMATION FOR THE NATIONAL RESOURCES INVENTORY J.D. Opsomer, W.A. Fuller and X. Li Iowa State University, Ames, IA 50011, USA 1. Introduction Replication methods are often used in

More information

Estimation from Purposive Samples with the Aid of Probability Supplements but without Data on the Study Variable

Estimation from Purposive Samples with the Aid of Probability Supplements but without Data on the Study Variable Estimation from Purposive Samples with the Aid of Probability Supplements but without Data on the Study Variable A.C. Singh,, V. Beresovsky, and C. Ye Survey and Data Sciences, American nstitutes for Research,

More information

NONLINEAR CALIBRATION. 1 Introduction. 2 Calibrated estimator of total. Abstract

NONLINEAR CALIBRATION. 1 Introduction. 2 Calibrated estimator of total.   Abstract NONLINEAR CALIBRATION 1 Alesandras Pliusas 1 Statistics Lithuania, Institute of Mathematics and Informatics, Lithuania e-mail: Pliusas@tl.mii.lt Abstract The definition of a calibrated estimator of the

More information

A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR

A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR Statistica Sinica 8(1998), 1165-1173 A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR Phillip S. Kott National Agricultural Statistics Service Abstract:

More information

REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES

REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES Statistica Sinica 8(1998), 1153-1164 REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLES Wayne A. Fuller Iowa State University Abstract: The estimation of the variance of the regression estimator for

More information

Regression Diagnostics for Survey Data

Regression Diagnostics for Survey Data Regression Diagnostics for Survey Data Richard Valliant Joint Program in Survey Methodology, University of Maryland and University of Michigan USA Jianzhu Li (Westat), Dan Liao (JPSM) 1 Introduction Topics

More information

Random permutation models with auxiliary variables. Design-based random permutation models with auxiliary information. Wenjun Li

Random permutation models with auxiliary variables. Design-based random permutation models with auxiliary information. Wenjun Li Running heads: Random permutation models with auxiliar variables Design-based random permutation models with auxiliar information Wenjun Li Division of Preventive and Behavioral Medicine Universit of Massachusetts

More information

Developing Calibration Weights and Variance Estimates for a Survey of Drug-Related Emergency-Room Visits

Developing Calibration Weights and Variance Estimates for a Survey of Drug-Related Emergency-Room Visits Developing Calibration Weights and Variance Estimates for a Survey of Drug-Related Emergency-Room Visits Phillip S. Kott RTI International Charles D. Day SAMHSA RTI International is a trade name of Research

More information

BIAS-ROBUSTNESS AND EFFICIENCY OF MODEL-BASED INFERENCE IN SURVEY SAMPLING

BIAS-ROBUSTNESS AND EFFICIENCY OF MODEL-BASED INFERENCE IN SURVEY SAMPLING Statistica Sinica 22 (2012), 777-794 doi:http://dx.doi.org/10.5705/ss.2010.238 BIAS-ROBUSTNESS AND EFFICIENCY OF MODEL-BASED INFERENCE IN SURVEY SAMPLING Desislava Nedyalova and Yves Tillé University of

More information

SAS/STAT 13.2 User s Guide. Introduction to Survey Sampling and Analysis Procedures

SAS/STAT 13.2 User s Guide. Introduction to Survey Sampling and Analysis Procedures SAS/STAT 13.2 User s Guide Introduction to Survey Sampling and Analysis Procedures This document is an individual chapter from SAS/STAT 13.2 User s Guide. The correct bibliographic citation for the complete

More information

Weight calibration and the survey bootstrap

Weight calibration and the survey bootstrap Weight and the survey Department of Statistics University of Missouri-Columbia March 7, 2011 Motivating questions 1 Why are the large scale samples always so complex? 2 Why do I need to use weights? 3

More information

SAS/STAT 14.2 User s Guide. Introduction to Survey Sampling and Analysis Procedures

SAS/STAT 14.2 User s Guide. Introduction to Survey Sampling and Analysis Procedures SAS/STAT 14.2 User s Guide Introduction to Survey Sampling and Analysis Procedures This document is an individual chapter from SAS/STAT 14.2 User s Guide. The correct bibliographic citation for this manual

More information

Sampling from Finite Populations Jill M. Montaquila and Graham Kalton Westat 1600 Research Blvd., Rockville, MD 20850, U.S.A.

Sampling from Finite Populations Jill M. Montaquila and Graham Kalton Westat 1600 Research Blvd., Rockville, MD 20850, U.S.A. Sampling from Finite Populations Jill M. Montaquila and Graham Kalton Westat 1600 Research Blvd., Rockville, MD 20850, U.S.A. Keywords: Survey sampling, finite populations, simple random sampling, systematic

More information

arxiv: v2 [math.st] 20 Jun 2014

arxiv: v2 [math.st] 20 Jun 2014 A solution in small area estimation problems Andrius Čiginas and Tomas Rudys Vilnius University Institute of Mathematics and Informatics, LT-08663 Vilnius, Lithuania arxiv:1306.2814v2 [math.st] 20 Jun

More information

Research Article Ratio Type Exponential Estimator for the Estimation of Finite Population Variance under Two-stage Sampling

Research Article Ratio Type Exponential Estimator for the Estimation of Finite Population Variance under Two-stage Sampling Research Journal of Applied Sciences, Engineering and Technology 7(19): 4095-4099, 2014 DOI:10.19026/rjaset.7.772 ISSN: 2040-7459; e-issn: 2040-7467 2014 Maxwell Scientific Publication Corp. Submitted:

More information

Formalizing the Concepts: Simple Random Sampling. Juan Muñoz Kristen Himelein March 2012

Formalizing the Concepts: Simple Random Sampling. Juan Muñoz Kristen Himelein March 2012 Formalizing the Concepts: Simple Random Sampling Juan Muñoz Kristen Himelein March 2012 Purpose of sampling To study a portion of the population through observations at the level of the units selected,

More information

Supplement-Sample Integration for Prediction of Remainder for Enhanced GREG

Supplement-Sample Integration for Prediction of Remainder for Enhanced GREG Supplement-Sample Integration for Prediction of Remainder for Enhanced GREG Abstract Avinash C. Singh Division of Survey and Data Sciences American Institutes for Research, Rockville, MD 20852 asingh@air.org

More information

INSTRUMENTAL-VARIABLE CALIBRATION ESTIMATION IN SURVEY SAMPLING

INSTRUMENTAL-VARIABLE CALIBRATION ESTIMATION IN SURVEY SAMPLING Statistica Sinica 24 (2014), 1001-1015 doi:http://dx.doi.org/10.5705/ss.2013.038 INSTRUMENTAL-VARIABLE CALIBRATION ESTIMATION IN SURVEY SAMPLING Seunghwan Park and Jae Kwang Kim Seoul National Univeristy

More information

Cross-sectional variance estimation for the French Labour Force Survey

Cross-sectional variance estimation for the French Labour Force Survey Survey Research Methods (007 Vol., o., pp. 75-83 ISS 864-336 http://www.surveymethods.org c European Survey Research Association Cross-sectional variance estimation for the French Labour Force Survey Pascal

More information

A global optimisation approach to range-restricted survey calibration

A global optimisation approach to range-restricted survey calibration DOI 10.1007/s11222-017-9739-5 A global optimisation approach to range-restricted survey calibration Ferran Espuny-Pujol 1 Karyn Morrissey 2 Paul Williamson 3 Received: 7 September 2016 / Accepted: 2 March

More information

Estimation Techniques in the German Labor Force Survey (LFS)

Estimation Techniques in the German Labor Force Survey (LFS) Estimation Techniques in the German Labor Force Survey (LFS) Dr. Kai Lorentz Federal Statistical Office of Germany Group C1 - Mathematical and Statistical Methods Email: kai.lorentz@destatis.de Federal

More information

Formalizing the Concepts: Simple Random Sampling. Juan Muñoz Kristen Himelein March 2013

Formalizing the Concepts: Simple Random Sampling. Juan Muñoz Kristen Himelein March 2013 Formalizing the Concepts: Simple Random Sampling Juan Muñoz Kristen Himelein March 2013 Purpose of sampling To study a portion of the population through observations at the level of the units selected,

More information

Estimation of change in a rotation panel design

Estimation of change in a rotation panel design Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS028) p.4520 Estimation of change in a rotation panel design Andersson, Claes Statistics Sweden S-701 89 Örebro, Sweden

More information

Successive Difference Replication Variance Estimation in Two-Phase Sampling

Successive Difference Replication Variance Estimation in Two-Phase Sampling Successive Difference Replication Variance Estimation in Two-Phase Sampling Jean D. Opsomer Colorado State University Michael White US Census Bureau F. Jay Breidt Colorado State University Yao Li Colorado

More information

Combining data from two independent surveys: model-assisted approach

Combining data from two independent surveys: model-assisted approach Combining data from two independent surveys: model-assisted approach Jae Kwang Kim 1 Iowa State University January 20, 2012 1 Joint work with J.N.K. Rao, Carleton University Reference Kim, J.K. and Rao,

More information

Propensity score adjusted method for missing data

Propensity score adjusted method for missing data Graduate Theses and Dissertations Graduate College 2013 Propensity score adjusted method for missing data Minsun Kim Riddles Iowa State University Follow this and additional works at: http://lib.dr.iastate.edu/etd

More information

SAS/STAT 13.1 User s Guide. Introduction to Survey Sampling and Analysis Procedures

SAS/STAT 13.1 User s Guide. Introduction to Survey Sampling and Analysis Procedures SAS/STAT 13.1 User s Guide Introduction to Survey Sampling and Analysis Procedures This document is an individual chapter from SAS/STAT 13.1 User s Guide. The correct bibliographic citation for the complete

More information

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census

More information

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall 1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Dept. of Biostatistics, Duke University Medical Joint work

More information

EFFICIENT REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLING

EFFICIENT REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLING Statistica Sinica 13(2003), 641-653 EFFICIENT REPLICATION VARIANCE ESTIMATION FOR TWO-PHASE SAMPLING J. K. Kim and R. R. Sitter Hankuk University of Foreign Studies and Simon Fraser University Abstract:

More information

F. Jay Breidt Colorado State University

F. Jay Breidt Colorado State University Model-assisted survey regression estimation with the lasso 1 F. Jay Breidt Colorado State University Opening Workshop on Computational Methods in Social Sciences SAMSI August 2013 This research was supported

More information

No is the Easiest Answer: Using Calibration to Assess Nonignorable Nonresponse in the 2002 Census of Agriculture

No is the Easiest Answer: Using Calibration to Assess Nonignorable Nonresponse in the 2002 Census of Agriculture No is the Easiest Answer: Using Calibration to Assess Nonignorable Nonresponse in the 2002 Census of Agriculture Phillip S. Kott National Agricultural Statistics Service Key words: Weighting class, Calibration,

More information

Small domain estimation using probability and non-probability survey data

Small domain estimation using probability and non-probability survey data Small domain estimation using probability and non-probability survey data Adrijo Chakraborty and N Ganesh March 2018, FCSM Outline Introduction to the Problem Approaches for Combining Probability and Non-Probability

More information

Graybill Conference Poster Session Introductions

Graybill Conference Poster Session Introductions Graybill Conference Poster Session Introductions 2013 Graybill Conference in Modern Survey Statistics Colorado State University Fort Collins, CO June 10, 2013 Small Area Estimation with Incomplete Auxiliary

More information

Estimation of Parameters and Variance

Estimation of Parameters and Variance Estimation of Parameters and Variance Dr. A.C. Kulshreshtha U.N. Statistical Institute for Asia and the Pacific (SIAP) Second RAP Regional Workshop on Building Training Resources for Improving Agricultural

More information

Simple Linear Regression for the Climate Data

Simple Linear Regression for the Climate Data Prediction Prediction Interval Temperature 0.2 0.0 0.2 0.4 0.6 0.8 320 340 360 380 CO 2 Simple Linear Regression for the Climate Data What do we do with the data? y i = Temperature of i th Year x i =CO

More information

Comparing Group Means When Nonresponse Rates Differ

Comparing Group Means When Nonresponse Rates Differ UNF Digital Commons UNF Theses and Dissertations Student Scholarship 2015 Comparing Group Means When Nonresponse Rates Differ Gabriela M. Stegmann University of North Florida Suggested Citation Stegmann,

More information

A Model-Over-Design Integration for Estimation from Purposive Supplements to Probability Samples

A Model-Over-Design Integration for Estimation from Purposive Supplements to Probability Samples A Model-Over-Design Integration for Estimation from Purposive Supplements to Probability Samples Avinash C. Singh, NORC at the University of Chicago, Chicago, IL 60603 singh-avi@norc.org Abstract For purposive

More information

Plausible Values for Latent Variables Using Mplus

Plausible Values for Latent Variables Using Mplus Plausible Values for Latent Variables Using Mplus Tihomir Asparouhov and Bengt Muthén August 21, 2010 1 1 Introduction Plausible values are imputed values for latent variables. All latent variables can

More information

A Design-Sensitive Approach to Fitting Regression Models With Complex Survey Data

A Design-Sensitive Approach to Fitting Regression Models With Complex Survey Data A Design-Sensitive Approach to Fitting Regression Models With Complex Survey Data Phillip S. Kott RI International Rockville, MD Introduction: What Does Fitting a Regression Model with Survey Data Mean?

More information

Effects of Cluster Sizes on Variance Components in Two-Stage Sampling

Effects of Cluster Sizes on Variance Components in Two-Stage Sampling Journal of Official Statistics, Vol. 31, No. 4, 2015, pp. 763 782, http://dx.doi.org/10.1515/jos-2015-0044 Effects of Cluster Sizes on Variance Components in Two-Stage Sampling Richard Valliant 1, Jill

More information

A comparison of stratified simple random sampling and sampling with probability proportional to size

A comparison of stratified simple random sampling and sampling with probability proportional to size A comparison of stratified simple random sampling and sampling with probability proportional to size Edgar Bueno Dan Hedlin Per Gösta Andersson 1 Introduction When planning the sampling strategy (i.e.

More information

On Auxiliary Variables and Models in Estimation in Surveys with Nonresponse

On Auxiliary Variables and Models in Estimation in Surveys with Nonresponse On Auxiliary Variables and Models in Estimation in Surveys with Nonresponse June 10, 2016 Bernardo João Rota 1,3) and Thomas Laitila 1,2) 1) Department of Statistics, Örebro University, 701 82 Örebro,

More information

Random-effects Models for Smoothing Poststrati cation Weights

Random-effects Models for Smoothing Poststrati cation Weights Journal of Of cial Statistics, Vol. 14, No. 1, 1998, pp. 61±78 Random-effects Models for Smoothing Poststrati cation Weights Laura C. Lazzeroni 1 and Roderick J.A. Little 2 Poststrati cation is a common

More information

Incorporating Level of Effort Paradata in Nonresponse Adjustments. Paul Biemer RTI International University of North Carolina Chapel Hill

Incorporating Level of Effort Paradata in Nonresponse Adjustments. Paul Biemer RTI International University of North Carolina Chapel Hill Incorporating Level of Effort Paradata in Nonresponse Adjustments Paul Biemer RTI International University of North Carolina Chapel Hill Acknowledgements Patrick Chen, RTI International Kevin Wang, RTI

More information

A comparison of stratified simple random sampling and sampling with probability proportional to size

A comparison of stratified simple random sampling and sampling with probability proportional to size A comparison of stratified simple random sampling and sampling with probability proportional to size Edgar Bueno Dan Hedlin Per Gösta Andersson Department of Statistics Stockholm University Introduction

More information

Data Integration for Big Data Analysis for finite population inference

Data Integration for Big Data Analysis for finite population inference for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation

More information

Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse

Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse Nanhua Zhang Division of Biostatistics & Epidemiology Cincinnati Children s Hospital Medical Center (Joint work

More information

Incorporating published univariable associations in diagnostic and prognostic modeling

Incorporating published univariable associations in diagnostic and prognostic modeling Incorporating published univariable associations in diagnostic and prognostic modeling Thomas Debray Julius Center for Health Sciences and Primary Care University Medical Center Utrecht The Netherlands

More information

Deriving indicators from representative samples for the ESF

Deriving indicators from representative samples for the ESF Deriving indicators from representative samples for the ESF Brussels, June 17, 2014 Ralf Münnich and Stefan Zins Lisa Borsi and Jan-Philipp Kolb GESIS Mannheim and University of Trier Outline 1 Choosing

More information

THE GENERALIZED EXPONENTIAL MODEL FOR SAMPLING WEIGHT CALIBRATION FOR EXTREME VALUES, NONRESPONSE, AND POSTSTRATIFICATION

THE GENERALIZED EXPONENTIAL MODEL FOR SAMPLING WEIGHT CALIBRATION FOR EXTREME VALUES, NONRESPONSE, AND POSTSTRATIFICATION THE GENERALIZED EXPONENTIAL MODEL FOR SAMPLING WEIGHT CALIBRATION FOR EXTREME VALUES, NONRESPONSE, AND POSTSTRATIFICATION R. E. Folsom, Jr. and A.C. Singh, Research Triangle Institute R.E. Folsom, RTI,

More information

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction

More information

Development of methodology for the estimate of variance of annual net changes for LFS-based indicators

Development of methodology for the estimate of variance of annual net changes for LFS-based indicators Development of methodology for the estimate of variance of annual net changes for LFS-based indicators Deliverable 1 - Short document with derivation of the methodology (FINAL) Contract number: Subject:

More information

The Effective Use of Complete Auxiliary Information From Survey Data

The Effective Use of Complete Auxiliary Information From Survey Data The Effective Use of Complete Auxiliary Information From Survey Data by Changbao Wu B.S., Anhui Laodong University, China, 1982 M.S. Diploma, East China Normal University, 1986 a thesis submitted in partial

More information

A comparison of weighted estimators for the population mean. Ye Yang Weighting in surveys group

A comparison of weighted estimators for the population mean. Ye Yang Weighting in surveys group A comparison of weighted estimators for the population mean Ye Yang Weighting in surveys group Motivation Survey sample in which auxiliary variables are known for the population and an outcome variable

More information

Temi di discussione. Accounting for sampling design in the SHIW. (Working papers) April by Ivan Faiella. Number

Temi di discussione. Accounting for sampling design in the SHIW. (Working papers) April by Ivan Faiella. Number Temi di discussione (Working papers) Accounting for sampling design in the SHIW by Ivan Faiella April 2008 Number 662 The purpose of the Temi di discussione series is to promote the circulation of working

More information

Sampling techniques for big data analysis in finite population inference

Sampling techniques for big data analysis in finite population inference Statistics Preprints Statistics 1-29-2018 Sampling techniques for big data analysis in finite population inference Jae Kwang Kim Iowa State University, jkim@iastate.edu Zhonglei Wang Iowa State University,

More information

Calibration estimation in survey sampling

Calibration estimation in survey sampling Calibration estimation in survey sampling Jae Kwang Kim Mingue Park September 8, 2009 Abstract Calibration estimation, where the sampling weights are adjusted to make certain estimators match known population

More information

Statistica Sinica Preprint No: SS R2

Statistica Sinica Preprint No: SS R2 Statistica Sinica Preprint No: SS-13-244R2 Title Examining some aspects of balanced sampling in surveys Manuscript ID SS-13-244R2 URL http://www.stat.sinica.edu.tw/statistica/ DOI 10.5705/ss.2013.244 Complete

More information

STK4900/ Lecture 3. Program

STK4900/ Lecture 3. Program STK4900/9900 - Lecture 3 Program 1. Multiple regression: Data structure and basic questions 2. The multiple linear regression model 3. Categorical predictors 4. Planned experiments and observational studies

More information

VARIANCE ESTIMATION FOR TWO-PHASE STRATIFIED SAMPLING

VARIANCE ESTIMATION FOR TWO-PHASE STRATIFIED SAMPLING VARIACE ESTIMATIO FOR TWO-PHASE STRATIFIED SAMPLIG David A. Binder, Colin Babyak, Marie Brodeur, Michel Hidiroglou Wisner Jocelyn (Statistics Canada) Business Survey Methods Division, Statistics Canada,

More information

Division of Preventative and Behavioral Medicine. University of Massachusetts Medical School, Worcester, MA 01655

Division of Preventative and Behavioral Medicine. University of Massachusetts Medical School, Worcester, MA 01655 USE OF AUXLARY FORMATO A DESG-BASED RADOM PERMUTATO MODEL Wenjun Li Division of Preventative Behavioral Medicine Universit of Massachusetts Medical School, Worcester, MA 0655 Edward J. Stanek Department

More information

Telescope Matching: A Flexible Approach to Estimating Direct Effects

Telescope Matching: A Flexible Approach to Estimating Direct Effects Telescope Matching: A Flexible Approach to Estimating Direct Effects Matthew Blackwell and Anton Strezhnev International Methods Colloquium October 12, 2018 direct effect direct effect effect of treatment

More information

THE THEORY AND PRACTICE OF MAXIMAL BREWER SELECTION WITH POISSON PRN SAMPLING

THE THEORY AND PRACTICE OF MAXIMAL BREWER SELECTION WITH POISSON PRN SAMPLING THE THEORY AND PRACTICE OF MAXIMAL BREWER SELECTION WITH POISSON PRN SAMPLING Phillip S. Kott And Jeffrey T. Bailey, National Agricultural Statistics Service Phillip S. Kott, NASS, Room 305, 351 Old Lee

More information

Chapter 2. Section Section 2.9. J. Kim (ISU) Chapter 2 1 / 26. Design-optimal estimator under stratified random sampling

Chapter 2. Section Section 2.9. J. Kim (ISU) Chapter 2 1 / 26. Design-optimal estimator under stratified random sampling Chapter 2 Section 2.4 - Section 2.9 J. Kim (ISU) Chapter 2 1 / 26 2.4 Regression and stratification Design-optimal estimator under stratified random sampling where (Ŝxxh, Ŝxyh) ˆβ opt = ( x st, ȳ st )

More information

An Introduction to Causal Analysis on Observational Data using Propensity Scores

An Introduction to Causal Analysis on Observational Data using Propensity Scores An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut

More information

Calibration of Weights in Surveys with Nonresponse and Frame Imperfections

Calibration of Weights in Surveys with Nonresponse and Frame Imperfections Calibration of Weights in Surveys with Nonresponse and Frame Imperfections A course presented at Eustat Bilbao, Basque Country January 26-27, 2009 by Sixten Lundström and Carl-Eri Särndal Statistics Sweden

More information

Linear Regression Models. Based on Chapter 3 of Hastie, Tibshirani and Friedman

Linear Regression Models. Based on Chapter 3 of Hastie, Tibshirani and Friedman Linear Regression Models Based on Chapter 3 of Hastie, ibshirani and Friedman Linear Regression Models Here the X s might be: p f ( X = " + " 0 j= 1 X j Raw predictor variables (continuous or coded-categorical

More information

Propensity Score Methods for Causal Inference

Propensity Score Methods for Causal Inference John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good

More information

The Impact of Measurement Error on Propensity Score Analysis: An Empirical Investigation of Fallible Covariates

The Impact of Measurement Error on Propensity Score Analysis: An Empirical Investigation of Fallible Covariates The Impact of Measurement Error on Propensity Score Analysis: An Empirical Investigation of Fallible Covariates Eun Sook Kim, Patricia Rodríguez de Gil, Jeffrey D. Kromrey, Rheta E. Lanehart, Aarti Bellara,

More information

On Accuracy of Calibration Estimators Supported by Auxiliary Variables from Past Periods Based on Simulation Analyses

On Accuracy of Calibration Estimators Supported by Auxiliary Variables from Past Periods Based on Simulation Analyses Folia Oeconomica Acta Universitatis Lodziensis ISSN 0208-6018 e-issn 2353-7663 4(330) 2017 DOI: http://dx.doi.org/10.18778/0208-6018.330.03 omasz Stachurski University of Economics in Katowice, Faculty

More information

Rerandomization to Balance Covariates

Rerandomization to Balance Covariates Rerandomization to Balance Covariates Kari Lock Morgan Department of Statistics Penn State University Joint work with Don Rubin University of Minnesota Biostatistics 4/27/16 The Gold Standard Randomized

More information

Applied Econometrics (QEM)

Applied Econometrics (QEM) Applied Econometrics (QEM) The Simple Linear Regression Model based on Prinicples of Econometrics Jakub Mućk Department of Quantitative Economics Jakub Mućk Applied Econometrics (QEM) Meeting #2 The Simple

More information

Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator

Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator by Emmanuel Flachaire Eurequa, University Paris I Panthéon-Sorbonne December 2001 Abstract Recent results of Cribari-Neto and Zarkos

More information

A Unified Theory of Empirical Likelihood Confidence Intervals for Survey Data with Unequal Probabilities and Non Negligible Sampling Fractions

A Unified Theory of Empirical Likelihood Confidence Intervals for Survey Data with Unequal Probabilities and Non Negligible Sampling Fractions A Unified Theory of Empirical Likelihood Confidence Intervals for Survey Data with Unequal Probabilities and Non Negligible Sampling Fractions Y.G. Berger O. De La Riva Torres Abstract We propose a new

More information

Admissible Estimation of a Finite Population Total under PPS Sampling

Admissible Estimation of a Finite Population Total under PPS Sampling Research Journal of Mathematical and Statistical Sciences E-ISSN 2320-6047 Admissible Estimation of a Finite Population Total under PPS Sampling Abstract P.A. Patel 1* and Shradha Bhatt 2 1 Department

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression Simple linear regression tries to fit a simple line between two variables Y and X. If X is linearly related to Y this explains some of the variability in Y. In most cases, there

More information

Linear regression methods

Linear regression methods Linear regression methods Most of our intuition about statistical methods stem from linear regression. For observations i = 1,..., n, the model is Y i = p X ij β j + ε i, j=1 where Y i is the response

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 6: Bias and variance (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 49 Our plan today We saw in last lecture that model scoring methods seem to be trading off two different

More information

Task. Variance Estimation in EU-SILC Survey. Gini coefficient. Estimates of Gini Index

Task. Variance Estimation in EU-SILC Survey. Gini coefficient. Estimates of Gini Index Variance Estimation in EU-SILC Survey MārtiĦš Liberts Central Statistical Bureau of Latvia Task To estimate sampling error for Gini coefficient estimated from social sample surveys (EU-SILC) Estimation

More information

VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA

VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA Submitted to the Annals of Applied Statistics VARIANCE ESTIMATION FOR NEAREST NEIGHBOR IMPUTATION FOR U.S. CENSUS LONG FORM DATA By Jae Kwang Kim, Wayne A. Fuller and William R. Bell Iowa State University

More information

Re-estimating Weights for IPUMS-Greece Samples. Dr. Stefanos G. Giakoumatos Technological Educational Institute of Kalamatas

Re-estimating Weights for IPUMS-Greece Samples. Dr. Stefanos G. Giakoumatos Technological Educational Institute of Kalamatas Re-estimating Weights for IPUMS-Greece Samples Dr. Stefanos G. Giaoumatos Technological Educational Institute of Kalamatas Introduction Weights are commonly used to survey data file in order to mae the

More information

Generalized Pseudo Empirical Likelihood Inferences for Complex Surveys

Generalized Pseudo Empirical Likelihood Inferences for Complex Surveys The Canadian Journal of Statistics Vol.??, No.?,????, Pages???-??? La revue canadienne de statistique Generalized Pseudo Empirical Likelihood Inferences for Complex Surveys Zhiqiang TAN 1 and Changbao

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information

The 2010 Morris Hansen Lecture Dealing with Survey Nonresponse in Data Collection, in Estimation

The 2010 Morris Hansen Lecture Dealing with Survey Nonresponse in Data Collection, in Estimation Journal of Official Statistics, Vol. 27, No. 1, 2011, pp. 1 21 The 2010 Morris Hansen Lecture Dealing with Survey Nonresponse in Data Collection, in Estimation Carl-Erik Särndal 1 In dealing with survey

More information

Nonparametric Regression Estimation of Finite Population Totals under Two-Stage Sampling

Nonparametric Regression Estimation of Finite Population Totals under Two-Stage Sampling Nonparametric Regression Estimation of Finite Population Totals under Two-Stage Sampling Ji-Yeon Kim Iowa State University F. Jay Breidt Colorado State University Jean D. Opsomer Colorado State University

More information

Methods Used for Estimating Statistics in EdSurvey Developed by Paul Bailey & Michael Cohen May 04, 2017

Methods Used for Estimating Statistics in EdSurvey Developed by Paul Bailey & Michael Cohen May 04, 2017 Methods Used for Estimating Statistics in EdSurvey 1.0.6 Developed by Paul Bailey & Michael Cohen May 04, 2017 This document describes estimation procedures for the EdSurvey package. It includes estimation

More information

Combining multiple observational data sources to estimate causal eects

Combining multiple observational data sources to estimate causal eects Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,

More information

MS&E 226: Small Data. Lecture 6: Bias and variance (v2) Ramesh Johari

MS&E 226: Small Data. Lecture 6: Bias and variance (v2) Ramesh Johari MS&E 226: Small Data Lecture 6: Bias and variance (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 47 Our plan today We saw in last lecture that model scoring methods seem to be trading o two di erent

More information

University of Michigan School of Public Health

University of Michigan School of Public Health University of Michigan School of Public Health The University of Michigan Department of Biostatistics Working Paper Series Year 003 Paper Weighting Adustments for Unit Nonresponse with Multiple Outcome

More information

A measurement error model approach to small area estimation

A measurement error model approach to small area estimation A measurement error model approach to small area estimation Jae-kwang Kim 1 Spring, 2015 1 Joint work with Seunghwan Park and Seoyoung Kim Ouline Introduction Basic Theory Application to Korean LFS Discussion

More information

2011 NATIONAL SURVEY ON DRUG USE AND HEALTH

2011 NATIONAL SURVEY ON DRUG USE AND HEALTH 2011 NATIONAL SURVEY ON DRUG USE AND HEALTH PERSON-LEVEL SAMPLING WEIGHT CALIBRATION Prepared for the 2011 Methodological Resource Book Contract No. HHSS283200800004C RTI Project No. 0211838.207.004 Phase

More information