of being selected and varying such probability across strata under optimal allocation leads to increased accuracy.

Size: px
Start display at page:

Download "of being selected and varying such probability across strata under optimal allocation leads to increased accuracy."

Transcription

1 5 Sampling with Unequal Probabilities Simple random sampling and systematic sampling are schemes where every unit in the population has the same chance of being selected We will now consider unequal probability sampling We have encountered an example under stratified sampling in which the units in stratum l have chance n l N l of being selected and varng such probability across strata under optimal allocation leads to increased accuracy 51 Sampling with Replacement Using with replacement sampling simplifies the calculations and if the sampling fraction is small this model should give a reasonable approximation to the exact behaviour of the estimators in without replacement sampling Let p j denote the probability of selecting unit y j on the ith draw, so P Y i = y j ) = p j, j = 1,,, N where Y i represents a rv, not a total value The first two moments are EY i ) = p j y j and VarY i ) = p j yj p j y j j=1 The Hansen and Hurwitz estimator for the population total Y is Ŷ HH = 1 /p i n This estimator is unbiased for Y since EŶHH) = 1 E Y i ) = n n p i n EY i ) = Y 1 p 1 + Y p + + Y N p N = Y p i p 1 p p N SydU STAT ) Second semester Dr J Chan 8 j=1 j=1

2 Under sampling with replacement the random variables Y i /p i ) are independent so VarŶHH) = 1 Var Y i ) = n ) Yi p n p i n i Y = 1 N ) Yi Y p i n p i and is estimated by [ varŷhh) = 1 1 n n 1 ) ] p ŶHH = i 1 nn 1) How do we choose the p i to minimise the variance? [ p i ) nŷ HH The minimum is achieved if we set p i = Y i /Y, i = 1,, N since VarŶHH) = 1 ) Yi p i Y = 1 p i Y Y ) = 0 n p i n Of course we cannot use these values in practice as not all Y i and hence Y ) are known Instead we look for another variable, X i, that is known and highly correlated with Y i to construct probability estimates Set p j = X j /X where X = N j=1 X j Then Ŷ HH = X n the mean-of-ratio estimator in the probability proportional to size PPS) sampling Note ) VarŶHH) = X n Var x i ) where Var is estimated by a sample variance of x i x i x i ] SydU STAT ) Second semester Dr J Chan 84

3 When p i = 1 N, ŶHH = 1 n in SRS STAT014/914 Applied Stat-Sampling C5-Uneq Prob sample p i = N n = Nȳ, the total estimator A natural alternative to the pps sampling here would be to use the ratio estimator whereas Ŷ R = X Ŷ HH = X n x i One is based on the ratio of the averages whilst the other is the average of the ratios Example: Half Ackroyd case) Consider the population of firms A, B and C only with N =, Y = = 7667 and n = The following table shows selection probabilities of a particular sampling procedures and the sample estimates x i Sample Outcomes and HH estimators Firm Sales x i Selection probability p i = x i X Employee A 1 1/4 = B 1 1/4 = C 9 9/4 = SydU STAT ) Second semester Dr J Chan 85

4 Sample Outcomes and HH Estimates Outcome Probability Y HH = 1 n nn p i 1 1 A,A 4 4 = / /4 ) = A,B 4 4 = / /4 ) = A,C 4 4 = / /4 ) = B,A 4 4 = / /4 ) = B,B 4 4 = / /4 ) = B,C 4 4 = / /4 ) = C,A 4 4 = / /4 ) = C,B 4 4 = / /4 ) = C,C 4 4 = / /4 ) = Then the expected values and variances of these estimates are EŶ HH) = 78461)0146) )00701) = VarŶ HH) = ) 0146) ) 00701) = Note that the estimators are indeed unbiased and that they agree with the direct calculation of their variances: VarŶ HH) = 1 N ) Y nn p i ) 9 = 1 ) 1/ / /4 = SydU STAT ) Second semester Dr J Chan 86

5 If we use SRS, VarŶ srs) = 1 n ) S y N n = 1 ) 9 7 6) ) ) which is larger = SydU STAT ) Second semester Dr J Chan 87

6 5 Inclusion PPS IPPS) Sampling If we are sampling without replacement the nature of the population changes after each selection and so if at each step we select using probabilities proportional to size the overall scheme will not necessarily be PPS One way to achieve a PPS scheme is to use systematic sampling where only one random selection is necessary Let denote the probability Y i is selected in the sample, not just the j- th draw Note that is a first order inclusion not selection) probability The Horvitz-Thompson estimator for the population total Y is Ŷ HT = Let the first order inclusion indicator is Write Using this form EŶHT) = so ŶHT is unbiased for Y I i = 1 if is selected = 0 otherwise Ŷ HT = EI i ) = The second order inclusion indicator is I ij I i = Y = I i I j = 1 if both and y j are selected = 0 otherwise SydU STAT ) Second semester Dr J Chan 88

7 Note I ii = I i Let the second order inclusion probability j = EI ij ) Then N ) VarŶHT) = Var I i π i = VarI i ) πi + CovI i, I j ) y j π j = 1 ) π i + j=1,i j j=1,i j j π j ) y j π j since VarI i ) = 1 ) and CovI i, I j ) = EI i I j ) EI i )EI j ) = j π j We estimate this via varŷht,1) = 1 ) πi + j=1,i j j π j j y j π j since 1 = N n say adjusts the sum of N terms to the sum of n terms and j π j = πi = 1 ) if i = j Hence Note: varŷht,1) = 1 π i y i + j=1,i<j 1 The first order inclusion probabilities satisfy j π j j π j y j = N ) EI i ) = E I i = n whereas p i = 1 SydU STAT ) Second semester Dr J Chan 89

8 The estimator in SRS is a special case of the HT estimator Y HT = 1 y1 + y + + y ) n N π 1 π π n = 1 y1 N n/n + y n/n + + y ) n = ȳ n/n The variance estimate can be negative if π j > j ) If we restrict the variance to 0 in these cases, we induce bias Example: Strip transect sampling) Consider a study area of 100 km partitioned into strips 1 km wide but varng in length A sample of n = 4 strips are selected by draw-by-draw with replacement The number of animals is counted in each of these 4 strips The data are shown below: Sample strip Length of strip in km) p i Estimate the total number of animals in this area using the HT estimator and report its standard error Solution: The sample is drawn with replacement such that n = 4 and s = {7,, 56} π = 1 1 p ) 4 = ) 4 = π 7 = 1 1 p 7 ) 4 = ) 4 = π 56 = 1 1 p 56 ) 4 = ) 4 = 0094 π,7 = π + π 7 [1 1 p 7 p ) 4 ] = } {{ } [ ) 4 ] }{{} 7, 7, 7, 7 7, 7, 7 = 0011 SydU STAT ) Second semester Dr J Chan 90

9 π 7,56 = π 7 + π 56 [1 1 p 7 p 56 ) 4 ] = [ ) 4 ] = π,56 = π + π 56 [1 1 p p 56 ) 4 ] = [ ) 4 ] = 000 Ŷ HT = i S = = varŷht,1) = 1 π + πij π j y j i S i π i<j i π j j = ) ) = 74, seŷht,1) = 74, = 79 5 IPPS sampling without replacement For sampling without replacement WOR), the inclusion probabilities are WOR: j = EI i I j ) = E I i N I j j=1,j i j=1,j i j=1,j i = E[I i n I i )] = n = n 1) j π j ) = n 1) n ) = 1 ) j i SydU STAT ) Second semester Dr J Chan 91

10 Then the variance for the HT estimator becomes = = = = = VarŶHT,) 1 ) j=1,i<j j=1,i j ) + j=1,i j ) j π j ) [ ) π j j ) + [ ) π j j ) + j=i+1 π j j ) j=i+1 We estimate this via Note: varŷht,) = j π j ) yj yj y ) j π j π j j=i+1 π j ) ] ) j=1,i<j π j j j ) ) yj π j π j j ) j=1,i<j ) ) ] yj π j π j j ) y ) j π j ) ) yj π j ) ) yj 1 The variance varŷht,) is guaranteed to be positive for any sampling WOR method satisfng π j > j Also varŷht,) takes negative values less often than varŷht,1) The calculating j depends on the way the sample is selected Note n ) values of πij need to be calculated If Y i are approximately proportional to an auxiliary variable X i, ie rx i and we set = nx i N x np i cx i, then for all i, j) i y j π j rx i cx i rx j cx j 0 SydU STAT ) Second semester Dr J Chan 9 π j

11 or VarŶHT,) 0 The IPPS without replacement esitmator is Ŷ HT = 1 n p i Moreover if and x i are perfectly correlated such that = rx i, i = 1,,, N, then = n /Y and Ŷ HT = = n /Y = Y n = Y We expect good performance if rx i for all i under PPS sampling 4 Systematic IPPS is popular because of its simplicity but an unbiased estimator for VarŶsys) is not available Hurtley and Rao 196) showed that when n x i X < 1 for all i, VarŶsys,pps) 1 n 1 n This expression can be estimated by varŷsys,pps) ) Y ) n 1 n 1 ) ) n π i ŶHT n SydU STAT ) Second semester Dr J Chan 9

12 Example: Half Ackroyd case) Consider the population of firms A, B and C only with N =, Y = = 7667 The following table shows the inclusion probabilities of a particular sampling procedures and the sample estimates Sample outcomes and HT estimator Sample nd order Inclusion prob j Ŷ HT = 1 n N A,B π 1 = 05 Y HT = ) = A,C π 1 = 0 Y HT = ) = B,C π = 0 Y HT = ) = The first-order inclusion probabilities are Note Firm A 9 π 1 = = 08 B 8 π = = 07 C 6 π = = 05 Total 0 P A or B) = P A)+P B) P A and B) = π 1 +π π 1 = = 1 since in a sample of size n = from N =, at least one element of the pair will be included The variance of the HT estimator is: SydU STAT ) Second semester Dr J Chan 94

13 VarŶ HT,) = 1 y1 [π N 1 π π 1 ) y ) y1 + π 1 π π 1) y π 1 π π 1 π ) + π π π ) y y ) ] π π = 1 9 [8 7 5) 8 8 ) ) ) ) ) ] 5 = The following calculation verifies that the HT estimator is unbiased and that its variance is indeed as calculated using: EŶ HT) = 75595)05) )0) )0) = VarŶ HT) = ) 05) ) 0) = The simple average estimator is biased when the inclusion probabilities are not the same for all elements Note that Outcome Inclusion probability 1 A,B 05 A,C B,C 0 10 Y = ȳ 9 + 8) = ) = ) = 70 Eȳ) = 0585) + 075) + 070) = Therefore, the sample mean ȳ is biased SydU STAT ) Second semester Dr J Chan 95

14 54 How to draw IPPS sample? It is very difficult Madow 1949) method for a systematic IPPS sample Index x Partial sum Assigned interval 1 x 1 S 1 = x 1 0, S 1 ) x S = x 1 + x S 1, S ) N x N S N = x x N = X S N 1, S N ) Define d = X n Step 1 Select a uniform random number u from a uniform dist U0, 1) and set u = ud 0, d) Step Select the units whose assigned intervals contain u, u + d, u + d,, u + n 1)d For systematic IPPS method, it can be shown that = x i d = nx i X = nz i Hence Ŷsys = is unbiased for the population total Y = P [u S i 1, S i ) or u + d S i 1, S i ) or or u + n 1)d S i 1, S i )] = P [u S i 1, S i ) mod d] = S i S i 1 d = x i d = nx i X = nz i Note that x i > d x i > X n n x i X > 1 nz i > 1 If nz i < 1 for all i, any unit has a probability = nz i of being selected and no unit is selected more than once If nz i > 1 for one or more i, such units may be selected more than once in the sample but the average frequency of selection is nz i SydU STAT ) Second semester Dr J Chan 96

15 Example: IPPS sample) For a population of 7 households, the number of visits to a local supermarket last week x i and this week are given below: Household i No of visit last week x i No of visit this week Use random numbers u = 650 Select a systematic IPPS sample of size n = using the Madow 1949) method Estimate the average number of visits to the supermarket this week and its standard error Solution: d = X n = 0 = 10 Note that nz = 11 0 = 0 unit can be selected more than once Index i > 1 Hence Assigned interval Assigned interval Width Prob x i S i z i = x i X S i 1, S i ) mod d = 10) 1 0 0,) 0,) ,4),4) ,15) 4,10) or 0,5) 6+5= ,1) 5,10) or 0,1) 1+5= ,5) 1,5) ,7) 5,7) ,0) 7,10) 10 Total Now u 1 = = 65 from the interval 0,10) The IPPS sample contains households {, 4, 6} A complete list of different samples based SydU STAT ) Second semester Dr J Chan 97

16 in u 0, 10) is given below STAT014/914 Applied Stat-Sampling C5-Uneq Prob sample u IP P S sample 0,1) {1,,4} 1,) {1,,5},4) {,,5} 4,5) {,,5} 5,7) {,4,6} 7,10) {,4,7} It is difficult to estimate se using the HT estimator because the second order inclusion probabilities j are unknown Hence the estimate of the average number of visits to the supermarket this week and its standard error using Hartley & Rao 196) estimator for systematic sampling are Y HT = 1 N varŷ sys,ipps) = 1 N = n 1 n ) ) Ŷsys n ) = 1 [ 1 ) ) ) 1 ) ) = = 4191 seŷsys,ipps) = 4191 = = ] ) SydU STAT ) Second semester Dr J Chan 98

Unequal Probability Designs

Unequal Probability Designs Unequal Probability Designs Department of Statistics University of British Columbia This is prepares for Stat 344, 2014 Section 7.11 and 7.12 Probability Sampling Designs: A quick review A probability

More information

Population element: 1 2 N. 1.1 Sampling with Replacement: Hansen-Hurwitz Estimator(HH)

Population element: 1 2 N. 1.1 Sampling with Replacement: Hansen-Hurwitz Estimator(HH) Chapter 1 Samplng wth Unequal Probabltes Notaton: Populaton element: 1 2 N varable of nterest Y : y1 y2 y N Let s be a sample of elements drawn by a gven samplng method. In other words, s s a subset of

More information

Cluster Sampling 2. Chapter Introduction

Cluster Sampling 2. Chapter Introduction Chapter 7 Cluster Sampling 7.1 Introduction In this chapter, we consider two-stage cluster sampling where the sample clusters are selected in the first stage and the sample elements are selected in the

More information

RESEARCH REPORT. Vanishing auxiliary variables in PPS sampling with applications in microscopy.

RESEARCH REPORT. Vanishing auxiliary variables in PPS sampling with applications in microscopy. CENTRE FOR STOCHASTIC GEOMETRY AND ADVANCED BIOIMAGING 2014 www.csgb.dk RESEARCH REPORT Ina Trolle Andersen, Ute Hahn and Eva B. Vedel Jensen Vanishing auxiliary variables in PPS sampling with applications

More information

The New sampling Procedure for Unequal Probability Sampling of Sample Size 2.

The New sampling Procedure for Unequal Probability Sampling of Sample Size 2. . The New sampling Procedure for Unequal Probability Sampling of Sample Size. Introduction :- It is a well known fact that in simple random sampling, the probability selecting the unit at any given draw

More information

ICES training Course on Design and Analysis of Statistically Sound Catch Sampling Programmes

ICES training Course on Design and Analysis of Statistically Sound Catch Sampling Programmes ICES training Course on Design and Analysis of Statistically Sound Catch Sampling Programmes Sara-Jane Moore www.marine.ie General Statistics - backed up by case studies General Introduction to sampling

More information

Conservative variance estimation for sampling designs with zero pairwise inclusion probabilities

Conservative variance estimation for sampling designs with zero pairwise inclusion probabilities Conservative variance estimation for sampling designs with zero pairwise inclusion probabilities Peter M. Aronow and Cyrus Samii Forthcoming at Survey Methodology Abstract We consider conservative variance

More information

Chapter 3: Element sampling design: Part 1

Chapter 3: Element sampling design: Part 1 Chapter 3: Element sampling design: Part 1 Jae-Kwang Kim Fall, 2014 Simple random sampling 1 Simple random sampling 2 SRS with replacement 3 Systematic sampling Kim Ch. 3: Element sampling design: Part

More information

REMAINDER LINEAR SYSTEMATIC SAMPLING

REMAINDER LINEAR SYSTEMATIC SAMPLING Sankhyā : The Indian Journal of Statistics 2000, Volume 62, Series B, Pt. 2, pp. 249 256 REMAINDER LINEAR SYSTEMATIC SAMPLING By HORNG-JINH CHANG and KUO-CHUNG HUANG Tamkang University, Taipei SUMMARY.

More information

Main sampling techniques

Main sampling techniques Main sampling techniques ELSTAT Training Course January 23-24 2017 Martin Chevalier Department of Statistical Methods Insee 1 / 187 Main sampling techniques Outline Sampling theory Simple random sampling

More information

Nonresponse weighting adjustment using estimated response probability

Nonresponse weighting adjustment using estimated response probability Nonresponse weighting adjustment using estimated response probability Jae-kwang Kim Yonsei University, Seoul, Korea December 26, 2006 Introduction Nonresponse Unit nonresponse Item nonresponse Basic strategy

More information

Efficient estimators for adaptive two-stage sequential sampling

Efficient estimators for adaptive two-stage sequential sampling 0.8Copyedited by: AA 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 Journal of Statistical Computation and

More information

SAMPLING II BIOS 662. Michael G. Hudgens, Ph.D. mhudgens :37. BIOS Sampling II

SAMPLING II BIOS 662. Michael G. Hudgens, Ph.D.  mhudgens :37. BIOS Sampling II SAMPLING II BIOS 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-11-17 14:37 BIOS 662 1 Sampling II Outline Stratified sampling Introduction Notation and Estimands

More information

OPTIMAL CONTROLLED SAMPLING DESIGNS

OPTIMAL CONTROLLED SAMPLING DESIGNS OPTIMAL CONTROLLED SAMPLING DESIGNS Rajender Parsad and V.K. Gupta I.A.S.R.I., Library Avenue, New Delhi 002 rajender@iasri.res.in. Introduction Consider a situation, where it is desired to conduct a sample

More information

Advanced Survey Sampling

Advanced Survey Sampling Lecture materials Advanced Survey Sampling Statistical methods for sample surveys Imbi Traat niversity of Tartu 2007 Statistical methods for sample surveys Lecture 1, Imbi Traat 2 1 Introduction Sample

More information

Chapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70

Chapter 5: Models used in conjunction with sampling. J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Chapter 5: Models used in conjunction with sampling J. Kim, W. Fuller (ISU) Chapter 5: Models used in conjunction with sampling 1 / 70 Nonresponse Unit Nonresponse: weight adjustment Item Nonresponse:

More information

C. J. Skinner Cross-classified sampling: some estimation theory

C. J. Skinner Cross-classified sampling: some estimation theory C. J. Skinner Cross-classified sampling: some estimation theory Article (Accepted version) (Refereed) Original citation: Skinner, C. J. (205) Cross-classified sampling: some estimation theory. Statistics

More information

Chapter 8: Estimation 1

Chapter 8: Estimation 1 Chapter 8: Estimation 1 Jae-Kwang Kim Iowa State University Fall, 2014 Kim (ISU) Ch. 8: Estimation 1 Fall, 2014 1 / 33 Introduction 1 Introduction 2 Ratio estimation 3 Regression estimator Kim (ISU) Ch.

More information

Sampling and Estimation in Network Graphs

Sampling and Estimation in Network Graphs Sampling and Estimation in Network Graphs Gonzalo Mateos Dept. of ECE and Goergen Institute for Data Science University of Rochester gmateosb@ece.rochester.edu http://www.ece.rochester.edu/~gmateosb/ March

More information

Figure Figure

Figure Figure Figure 4-12. Equal probability of selection with simple random sampling of equal-sized clusters at first stage and simple random sampling of equal number at second stage. The next sampling approach, shown

More information

A comparison of weighted estimators for the population mean. Ye Yang Weighting in surveys group

A comparison of weighted estimators for the population mean. Ye Yang Weighting in surveys group A comparison of weighted estimators for the population mean Ye Yang Weighting in surveys group Motivation Survey sample in which auxiliary variables are known for the population and an outcome variable

More information

Sampling from Finite Populations Jill M. Montaquila and Graham Kalton Westat 1600 Research Blvd., Rockville, MD 20850, U.S.A.

Sampling from Finite Populations Jill M. Montaquila and Graham Kalton Westat 1600 Research Blvd., Rockville, MD 20850, U.S.A. Sampling from Finite Populations Jill M. Montaquila and Graham Kalton Westat 1600 Research Blvd., Rockville, MD 20850, U.S.A. Keywords: Survey sampling, finite populations, simple random sampling, systematic

More information

Bootstrap inference for the finite population total under complex sampling designs

Bootstrap inference for the finite population total under complex sampling designs Bootstrap inference for the finite population total under complex sampling designs Zhonglei Wang (Joint work with Dr. Jae Kwang Kim) Center for Survey Statistics and Methodology Iowa State University Jan.

More information

Asymptotic Normality under Two-Phase Sampling Designs

Asymptotic Normality under Two-Phase Sampling Designs Asymptotic Normality under Two-Phase Sampling Designs Jiahua Chen and J. N. K. Rao University of Waterloo and University of Carleton Abstract Large sample properties of statistical inferences in the context

More information

Sampling. Jian Pei School of Computing Science Simon Fraser University

Sampling. Jian Pei School of Computing Science Simon Fraser University Sampling Jian Pei School of Computing Science Simon Fraser University jpei@cs.sfu.ca INTRODUCTION J. Pei: Sampling 2 What Is Sampling? Select some part of a population to observe estimate something about

More information

STA304H1F/1003HF Summer 2015: Lecture 11

STA304H1F/1003HF Summer 2015: Lecture 11 STA304H1F/1003HF Summer 2015: Lecture 11 You should know... What is one-stage vs two-stage cluster sampling? What are primary and secondary sampling units? What are the two types of estimation in cluster

More information

Empirical and Constrained Empirical Bayes Variance Estimation Under A One Unit Per Stratum Sample Design

Empirical and Constrained Empirical Bayes Variance Estimation Under A One Unit Per Stratum Sample Design Empirical and Constrained Empirical Bayes Variance Estimation Under A One Unit Per Stratum Sample Design Sepideh Mosaferi Abstract A single primary sampling unit (PSU) per stratum design is a popular design

More information

STAT 3014/3914. Semester 2 Applied Statistics Solution to Tutorial 13

STAT 3014/3914. Semester 2 Applied Statistics Solution to Tutorial 13 STAT 304/394 Semester Appled Statstcs 05 Soluton to Tutoral 3. Note that s the total mleage for branch. a) -stage cluster sample Cluster branches N ; n 4) Element cars M 80; m 40) Populaton mean no. of

More information

Biostat 2065 Analysis of Incomplete Data

Biostat 2065 Analysis of Incomplete Data Biostat 2065 Analysis of Incomplete Data Gong Tang Dept of Biostatistics University of Pittsburgh September 13 & 15, 2005 1. Complete-case analysis (I) Complete-case analysis refers to analysis based on

More information

SydU STAT3014 (2015) Second semester Dr. J. Chan 18

SydU STAT3014 (2015) Second semester Dr. J. Chan 18 STAT3014/3914 Appied Stat.-Samping C-Stratified rand. sampe Stratified Random Samping.1 Introduction Description The popuation of size N is divided into mutuay excusive and exhaustive subpopuations caed

More information

SAMPLING BIOS 662. Michael G. Hudgens, Ph.D. mhudgens :55. BIOS Sampling

SAMPLING BIOS 662. Michael G. Hudgens, Ph.D.   mhudgens :55. BIOS Sampling SAMPLIG BIOS 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-11-14 15:55 BIOS 662 1 Sampling Outline Preliminaries Simple random sampling Population mean Population

More information

FINAL EXAM STAT 5201 Fall 2016

FINAL EXAM STAT 5201 Fall 2016 FINAL EXAM STAT 5201 Fall 2016 Due on the class Moodle site or in Room 313 Ford Hall on Tuesday, December 20 at 11:00 AM In the second case please deliver to the office staff of the School of Statistics

More information

INSTRUMENTAL-VARIABLE CALIBRATION ESTIMATION IN SURVEY SAMPLING

INSTRUMENTAL-VARIABLE CALIBRATION ESTIMATION IN SURVEY SAMPLING Statistica Sinica 24 (2014), 1001-1015 doi:http://dx.doi.org/10.5705/ss.2013.038 INSTRUMENTAL-VARIABLE CALIBRATION ESTIMATION IN SURVEY SAMPLING Seunghwan Park and Jae Kwang Kim Seoul National Univeristy

More information

7 Estimation. 7.1 Population and Sample (P.91-92)

7 Estimation. 7.1 Population and Sample (P.91-92) 7 Estimation MATH1015 Biostatistics Week 7 7.1 Population and Sample (P.91-92) Suppose that we wish to study a particular health problem in Australia, for example, the average serum cholesterol level for

More information

Combining data from two independent surveys: model-assisted approach

Combining data from two independent surveys: model-assisted approach Combining data from two independent surveys: model-assisted approach Jae Kwang Kim 1 Iowa State University January 20, 2012 1 Joint work with J.N.K. Rao, Carleton University Reference Kim, J.K. and Rao,

More information

MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7

MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7 MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7 1 Random Vectors Let a 0 and y be n 1 vectors, and let A be an n n matrix. Here, a 0 and A are non-random, whereas y is

More information

Data Integration for Big Data Analysis for finite population inference

Data Integration for Big Data Analysis for finite population inference for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation

More information

SAMPLING III BIOS 662

SAMPLING III BIOS 662 SAMPLIG III BIOS 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2009-08-11 09:52 BIOS 662 1 Sampling III Outline One-stage cluster sampling Systematic sampling Multi-stage

More information

Stat472/572 Sampling: Theory and Practice Instructor: Yan Lu

Stat472/572 Sampling: Theory and Practice Instructor: Yan Lu Stat472/572 Sampling: Theory and Practice Instructor: Yan Lu 1 Chapter 5 Cluster Sampling with Equal Probability Example: Sampling students in high school. Take a random sample of n classes (The classes

More information

A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR

A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR Statistica Sinica 8(1998), 1165-1173 A MODEL-BASED EVALUATION OF SEVERAL WELL-KNOWN VARIANCE ESTIMATORS FOR THE COMBINED RATIO ESTIMATOR Phillip S. Kott National Agricultural Statistics Service Abstract:

More information

Imputation for Missing Data under PPSWR Sampling

Imputation for Missing Data under PPSWR Sampling July 5, 2010 Beijing Imputation for Missing Data under PPSWR Sampling Guohua Zou Academy of Mathematics and Systems Science Chinese Academy of Sciences 1 23 () Outline () Imputation method under PPSWR

More information

Fractional Imputation in Survey Sampling: A Comparative Review

Fractional Imputation in Survey Sampling: A Comparative Review Fractional Imputation in Survey Sampling: A Comparative Review Shu Yang Jae-Kwang Kim Iowa State University Joint Statistical Meetings, August 2015 Outline Introduction Fractional imputation Features Numerical

More information

BIAS-ROBUSTNESS AND EFFICIENCY OF MODEL-BASED INFERENCE IN SURVEY SAMPLING

BIAS-ROBUSTNESS AND EFFICIENCY OF MODEL-BASED INFERENCE IN SURVEY SAMPLING Statistica Sinica 22 (2012), 777-794 doi:http://dx.doi.org/10.5705/ss.2010.238 BIAS-ROBUSTNESS AND EFFICIENCY OF MODEL-BASED INFERENCE IN SURVEY SAMPLING Desislava Nedyalova and Yves Tillé University of

More information

Improvement in Estimating the Finite Population Mean Under Maximum and Minimum Values in Double Sampling Scheme

Improvement in Estimating the Finite Population Mean Under Maximum and Minimum Values in Double Sampling Scheme J. Stat. Appl. Pro. Lett. 2, No. 2, 115-121 (2015) 115 Journal of Statistics Applications & Probability Letters An International Journal http://dx.doi.org/10.12785/jsapl/020203 Improvement in Estimating

More information

Sanjay Chaudhuri Department of Statistics and Applied Probability, National University of Singapore

Sanjay Chaudhuri Department of Statistics and Applied Probability, National University of Singapore AN EMPIRICAL LIKELIHOOD BASED ESTIMATOR FOR RESPONDENT DRIVEN SAMPLED DATA Sanjay Chaudhuri Department of Statistics and Applied Probability, National University of Singapore Mark Handcock, Department

More information

ASYMPTOTIC NORMALITY UNDER TWO-PHASE SAMPLING DESIGNS

ASYMPTOTIC NORMALITY UNDER TWO-PHASE SAMPLING DESIGNS Statistica Sinica 17(2007), 1047-1064 ASYMPTOTIC NORMALITY UNDER TWO-PHASE SAMPLING DESIGNS Jiahua Chen and J. N. K. Rao University of British Columbia and Carleton University Abstract: Large sample properties

More information

Estimating Bounded Population Total Using Linear Regression in the Presence of Supporting Information

Estimating Bounded Population Total Using Linear Regression in the Presence of Supporting Information International Journal of Mathematics and Computational Science Vol. 4, No. 3, 2018, pp. 112-117 http://www.aiscience.org/journal/ijmcs ISSN: 2381-7011 (Print); ISSN: 2381-702X (Online) Estimating Bounded

More information

Empirical Likelihood Methods for Sample Survey Data: An Overview

Empirical Likelihood Methods for Sample Survey Data: An Overview AUSTRIAN JOURNAL OF STATISTICS Volume 35 (2006), Number 2&3, 191 196 Empirical Likelihood Methods for Sample Survey Data: An Overview J. N. K. Rao Carleton University, Ottawa, Canada Abstract: The use

More information

Gauss Markov & Predictive Distributions

Gauss Markov & Predictive Distributions Gauss Markov & Predictive Distributions Merlise Clyde STA721 Linear Models Duke University September 14, 2017 Outline Topics Gauss-Markov Theorem Estimability and Prediction Readings: Christensen Chapter

More information

Sampling and Estimation in Agricultural Surveys

Sampling and Estimation in Agricultural Surveys GS Training and Outreach Workshop on Agricultural Surveys Training Seminar: Sampling and Estimation in Cristiano Ferraz 24 October 2016 Download a free copy of the Handbook at: http://gsars.org/wp-content/uploads/2016/02/msf-010216-web.pdf

More information

Estimation under cross classified sampling with application to a childhood survey

Estimation under cross classified sampling with application to a childhood survey TSE 659 April 2016 Estimation under cross classified sampling with application to a childhood survey Hélène Juillard, Guillaume Chauvet and Anne Ruiz Gazen Estimation under cross-classified sampling with

More information

Estimation under cross-classified sampling with application to a childhood survey

Estimation under cross-classified sampling with application to a childhood survey Estimation under cross-classified sampling with application to a childhood survey arxiv:1511.00507v1 [math.st] 2 Nov 2015 Hélène Juillard Guillaume Chauvet Anne Ruiz-Gazen January 11, 2018 Abstract The

More information

arxiv: v1 [stat.ap] 7 Aug 2007

arxiv: v1 [stat.ap] 7 Aug 2007 IMS Lecture Notes Monograph Series Complex Datasets and Inverse Problems: Tomography, Networks and Beyond Vol. 54 (007) 11 131 c Institute of Mathematical Statistics, 007 DOI: 10.114/07491707000000094

More information

Comments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/21/03) Ed Stanek

Comments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/21/03) Ed Stanek Comments on Design-Based Prediction Using Auxilliary Information under Random Permutation Models (by Wenjun Li (5/2/03) Ed Stanek Here are comments on the Draft Manuscript. They are all suggestions that

More information

Intermediate Econometrics

Intermediate Econometrics Intermediate Econometrics Markus Haas LMU München Summer term 2011 15. Mai 2011 The Simple Linear Regression Model Considering variables x and y in a specific population (e.g., years of education and wage

More information

EC969: Introduction to Survey Methodology

EC969: Introduction to Survey Methodology EC969: Introduction to Survey Methodology Peter Lynn Tues 1 st : Sample Design Wed nd : Non-response & attrition Tues 8 th : Weighting Focus on implications for analysis What is Sampling? Identify the

More information

Optimal combination forecasts for hierarchical time series

Optimal combination forecasts for hierarchical time series Optimal combination forecasts for hierarchical time series Rob J. Hyndman Roman A. Ahmed Department of Econometrics and Business Statistics Outline 1 Review of hierarchical forecasting 2 3 Simulation study

More information

Design and Analysis of Experiments Embedded in Sample Surveys

Design and Analysis of Experiments Embedded in Sample Surveys Journal of Of cial Statistics, Vol. 14, No. 3, 1998, pp. 277±295 Design and Analysis of Experiments Embedded in Sample Surveys Jan van den Brakel and Robbert H. Renssen 1 This article focuses on the design

More information

Empirical likelihood inference for regression parameters when modelling hierarchical complex survey data

Empirical likelihood inference for regression parameters when modelling hierarchical complex survey data Empirical likelihood inference for regression parameters when modelling hierarchical complex survey data Melike Oguz-Alper Yves G. Berger Abstract The data used in social, behavioural, health or biological

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

Tribhuvan University Institute of Science and Technology 2065

Tribhuvan University Institute of Science and Technology 2065 1CSc. Stat. 108-2065 Tribhuvan University Institute of Science and Technology 2065 Bachelor Level/First Year/ First Semester/ Science Full Marks: 60 Computer Science and Information Technology (Stat. 108)

More information

Non-uniform coverage estimators for distance sampling

Non-uniform coverage estimators for distance sampling Abstract Non-uniform coverage estimators for distance sampling CREEM Technical report 2007-01 Eric Rexstad Centre for Research into Ecological and Environmental Modelling Research Unit for Wildlife Population

More information

O.O. DAWODU & A.A. Adewara Department of Statistics, University of Ilorin, Ilorin, Nigeria

O.O. DAWODU & A.A. Adewara Department of Statistics, University of Ilorin, Ilorin, Nigeria Efficiency of Alodat Sample Selection Procedure over Sen - Midzuno and Yates - Grundy Draw by Draw under Unequal Probability Sampling without Replacement Sample Size 2 O.O. DAWODU & A.A. Adewara Department

More information

Taking into account sampling design in DAD. Population SAMPLING DESIGN AND DAD

Taking into account sampling design in DAD. Population SAMPLING DESIGN AND DAD Taking into account sampling design in DAD SAMPLING DESIGN AND DAD With version 4.2 and higher of DAD, the Sampling Design (SD) of the database can be specified in order to calculate the correct asymptotic

More information

6. Fractional Imputation in Survey Sampling

6. Fractional Imputation in Survey Sampling 6. Fractional Imputation in Survey Sampling 1 Introduction Consider a finite population of N units identified by a set of indices U = {1, 2,, N} with N known. Associated with each unit i in the population

More information

Urvalsmetoder och Estimation 5. Sampling and Estimation

Urvalsmetoder och Estimation 5. Sampling and Estimation Urvalsmetoder och Estimation 5 Sampling and Estimation 5 2012-02-07 1 5. Estimation and sampling with general inclusion probabilities 5.1 Introduction Inclusion probabilities are central in (design-based)

More information

Regression - Modeling a response

Regression - Modeling a response Regression - Modeling a response We often wish to construct a model to Explain the association between two or more variables Predict the outcome of a variable given values of other variables. Regression

More information

Chapter 6: Sampling with Unequal Probability

Chapter 6: Sampling with Unequal Probability Chapter 6: Sampling with Unequal Probability 1 Introduction This chapter is the concluding chapter to an introduction to Survey Sampling and Design. Topics beyond this one are, for the most part standalone.

More information

Sampling Distribution: Week 6

Sampling Distribution: Week 6 Sampling Distribution: Week 6 Kwonsang Lee University of Pennsylvania kwonlee@wharton.upenn.edu February 27, 2015 Kwonsang Lee STAT111 February 27, 2015 1 / 16 Sampling Distribution: Sample Mean If X 1,

More information

The ESS Sample Design Data File (SDDF)

The ESS Sample Design Data File (SDDF) The ESS Sample Design Data File (SDDF) Documentation Version 1.0 Matthias Ganninger Tel: +49 (0)621 1246 282 E-Mail: matthias.ganninger@gesis.org April 8, 2008 Summary: This document reports on the creation

More information

You are allowed 3? sheets of notes and a calculator.

You are allowed 3? sheets of notes and a calculator. Exam 1 is Wed Sept You are allowed 3? sheets of notes and a calculator The exam covers survey sampling umbers refer to types of problems on exam A population is the entire set of (potential) measurements

More information

Chapter 2. Review of basic Statistical methods 1 Distribution, conditional distribution and moments

Chapter 2. Review of basic Statistical methods 1 Distribution, conditional distribution and moments Chapter 2. Review of basic Statistical methods 1 Distribution, conditional distribution and moments We consider two kinds of random variables: discrete and continuous random variables. For discrete random

More information

Successive Difference Replication Variance Estimation in Two-Phase Sampling

Successive Difference Replication Variance Estimation in Two-Phase Sampling Successive Difference Replication Variance Estimation in Two-Phase Sampling Jean D. Opsomer Colorado State University Michael White US Census Bureau F. Jay Breidt Colorado State University Yao Li Colorado

More information

On Efficiency of Midzuno-Sen Strategy under Two-phase Sampling

On Efficiency of Midzuno-Sen Strategy under Two-phase Sampling International Journal of Statistics and Analysis. ISSN 2248-9959 Volume 7, Number 1 (2017), pp. 19-26 Research India Publications http://www.ripublication.com On Efficiency of Midzuno-Sen Strategy under

More information

MN 400: Research Methods. CHAPTER 7 Sample Design

MN 400: Research Methods. CHAPTER 7 Sample Design MN 400: Research Methods CHAPTER 7 Sample Design 1 Some fundamental terminology Population the entire group of objects about which information is wanted Unit, object any individual member of the population

More information

Log-linear Modelling with Complex Survey Data

Log-linear Modelling with Complex Survey Data Int. Statistical Inst.: Proc. 58th World Statistical Congress, 20, Dublin (Session IPS056) p.02 Log-linear Modelling with Complex Survey Data Chris Sinner University of Southampton United Kingdom Abstract:

More information

Estimation of Parameters and Variance

Estimation of Parameters and Variance Estimation of Parameters and Variance Dr. A.C. Kulshreshtha U.N. Statistical Institute for Asia and the Pacific (SIAP) Second RAP Regional Workshop on Building Training Resources for Improving Agricultural

More information

Deriving indicators from representative samples for the ESF

Deriving indicators from representative samples for the ESF Deriving indicators from representative samples for the ESF Brussels, June 17, 2014 Ralf Münnich and Stefan Zins Lisa Borsi and Jan-Philipp Kolb GESIS Mannheim and University of Trier Outline 1 Choosing

More information

STAT5044: Regression and Anova. Inyoung Kim

STAT5044: Regression and Anova. Inyoung Kim STAT5044: Regression and Anova Inyoung Kim 2 / 51 Outline 1 Matrix Expression 2 Linear and quadratic forms 3 Properties of quadratic form 4 Properties of estimates 5 Distributional properties 3 / 51 Matrix

More information

A comparison of pivotal sampling and unequal. probability sampling with replacement

A comparison of pivotal sampling and unequal. probability sampling with replacement arxiv:1609.02688v2 [math.st] 13 Sep 2016 A comparison of pivotal sampling and unequal probability sampling with replacement Guillaume Chauvet 1 and Anne Ruiz-Gazen 2 1 ENSAI/IRMAR, Campus de Ker Lann,

More information

Estimation and Detection

Estimation and Detection stimation and Detection Lecture 2: Cramér-Rao Lower Bound Dr. ir. Richard C. Hendriks & Dr. Sundeep P. Chepuri 7//207 Remember: Introductory xample Given a process (DC in noise): x[n]=a + w[n], n=0,,n,

More information

Chapter Learning Objectives. Random Experiments Dfiii Definition: Dfiii Definition:

Chapter Learning Objectives. Random Experiments Dfiii Definition: Dfiii Definition: Chapter 2: Probability 2-1 Sample Spaces & Events 2-1.1 Random Experiments 2-1.2 Sample Spaces 2-1.3 Events 2-1 1.4 Counting Techniques 2-2 Interpretations & Axioms of Probability 2-3 Addition Rules 2-4

More information

Formalizing the Concepts: Simple Random Sampling. Juan Muñoz Kristen Himelein March 2012

Formalizing the Concepts: Simple Random Sampling. Juan Muñoz Kristen Himelein March 2012 Formalizing the Concepts: Simple Random Sampling Juan Muñoz Kristen Himelein March 2012 Purpose of sampling To study a portion of the population through observations at the level of the units selected,

More information

Part 4: Multi-parameter and normal models

Part 4: Multi-parameter and normal models Part 4: Multi-parameter and normal models 1 The normal model Perhaps the most useful (or utilized) probability model for data analysis is the normal distribution There are several reasons for this, e.g.,

More information

Sampling in Space and Time. Natural experiment? Analytical Surveys

Sampling in Space and Time. Natural experiment? Analytical Surveys Sampling in Space and Time Overview of Sampling Approaches Sampling versus Experimental Design Experiments deliberately perturb a portion of population to determine effect objective is to compare the mean

More information

arxiv: v1 [stat.me] 16 Jun 2016

arxiv: v1 [stat.me] 16 Jun 2016 Causal Inference in Rebuilding and Extending the Recondite Bridge between Finite Population Sampling and Experimental arxiv:1606.05279v1 [stat.me] 16 Jun 2016 Design Rahul Mukerjee *, Tirthankar Dasgupta

More information

Review of probability and statistics 1 / 31

Review of probability and statistics 1 / 31 Review of probability and statistics 1 / 31 2 / 31 Why? This chapter follows Stock and Watson (all graphs are from Stock and Watson). You may as well refer to the appendix in Wooldridge or any other introduction

More information

All variances and covariances appearing in this formula are understood to be defined in the usual manner for finite populations; example

All variances and covariances appearing in this formula are understood to be defined in the usual manner for finite populations; example 155 By: UBIASED COMPOET RATIO ESTIMATIO1 D. S. Robson and Chitra Vitbayasai, Cornell University ITRODUCTIO The precision of a ratio -type estimator such as can sometimes be substantially increased the

More information

Empirical Likelihood Methods

Empirical Likelihood Methods Handbook of Statistics, Volume 29 Sample Surveys: Theory, Methods and Inference Empirical Likelihood Methods J.N.K. Rao and Changbao Wu (February 14, 2008, Final Version) 1 Likelihood-based Approaches

More information

Chapter 5 Matrix Approach to Simple Linear Regression

Chapter 5 Matrix Approach to Simple Linear Regression STAT 525 SPRING 2018 Chapter 5 Matrix Approach to Simple Linear Regression Professor Min Zhang Matrix Collection of elements arranged in rows and columns Elements will be numbers or symbols For example:

More information

Formalizing the Concepts: Simple Random Sampling. Juan Muñoz Kristen Himelein March 2013

Formalizing the Concepts: Simple Random Sampling. Juan Muñoz Kristen Himelein March 2013 Formalizing the Concepts: Simple Random Sampling Juan Muñoz Kristen Himelein March 2013 Purpose of sampling To study a portion of the population through observations at the level of the units selected,

More information

Regression Review. Statistics 149. Spring Copyright c 2006 by Mark E. Irwin

Regression Review. Statistics 149. Spring Copyright c 2006 by Mark E. Irwin Regression Review Statistics 149 Spring 2006 Copyright c 2006 by Mark E. Irwin Matrix Approach to Regression Linear Model: Y i = β 0 + β 1 X i1 +... + β p X ip + ɛ i ; ɛ i iid N(0, σ 2 ), i = 1,..., n

More information

Specification Errors, Measurement Errors, Confounding

Specification Errors, Measurement Errors, Confounding Specification Errors, Measurement Errors, Confounding Kerby Shedden Department of Statistics, University of Michigan October 10, 2018 1 / 32 An unobserved covariate Suppose we have a data generating model

More information

ance estimators are commonly presented with THT. One (Horvitz and Thompson 1952) is

ance estimators are commonly presented with THT. One (Horvitz and Thompson 1952) is PREDICTION THEORY IN FINITE POPULATION SAMPLING: THE HORVITZ-THOMPSON ESTIMATOR AND ESTIMATES OF ITS VARIANCE William G. Cumberland, U.C.L.A. Richard M. Royall, Johns Hopkins University i. INTRODUCTION

More information

A Unified Theory of Empirical Likelihood Confidence Intervals for Survey Data with Unequal Probabilities and Non Negligible Sampling Fractions

A Unified Theory of Empirical Likelihood Confidence Intervals for Survey Data with Unequal Probabilities and Non Negligible Sampling Fractions A Unified Theory of Empirical Likelihood Confidence Intervals for Survey Data with Unequal Probabilities and Non Negligible Sampling Fractions Y.G. Berger O. De La Riva Torres Abstract We propose a new

More information

Bayesian Linear Regression

Bayesian Linear Regression Bayesian Linear Regression Sudipto Banerjee 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. September 15, 2010 1 Linear regression models: a Bayesian perspective

More information

Domain estimation and poststratification in national forest inventory

Domain estimation and poststratification in national forest inventory omain estimation and poststratification in national forest inventory Adrian Lanz 1 Radim Adolt 2 Jiří Fejfar 2 Berthold Traub 1 1 Swiss Federal Institute for Forest, Snow and Landscape Research WSL 2 Forest

More information

Mixed-Models. version 30 October 2011

Mixed-Models. version 30 October 2011 Mixed-Models version 30 October 2011 Mixed models Mixed models estimate a vector! of fixed effects and one (or more) vectors u of random effects Both fixed and random effects models always include a vector

More information

Incorporating Level of Effort Paradata in Nonresponse Adjustments. Paul Biemer RTI International University of North Carolina Chapel Hill

Incorporating Level of Effort Paradata in Nonresponse Adjustments. Paul Biemer RTI International University of North Carolina Chapel Hill Incorporating Level of Effort Paradata in Nonresponse Adjustments Paul Biemer RTI International University of North Carolina Chapel Hill Acknowledgements Patrick Chen, RTI International Kevin Wang, RTI

More information

Algorithms to Calculate Exact Inclusion Probabilities for a Non-Rejective Approximate πps Sampling Design

Algorithms to Calculate Exact Inclusion Probabilities for a Non-Rejective Approximate πps Sampling Design Revista Colombiana de Estadística Junio 2014, volumen 37, no. 1, pp. 127 a 140 Algorithms to Calculate Exact Inclusion Probabilities for a Non-Rejective Approximate πps Sampling Design Algoritmos para

More information