LECTURE - 4 SIMPLE RANDOM SAMPLING DR. SHALABH DEPARTMENT OF MATHEMATICS AND STATISTICS INDIAN INSTITUTE OF TECHNOLOGY KANPUR

Similar documents
Chapter -2 Simple Random Sampling

Chapter -2 Simple Random Sampling

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

Module 7. Lecture 7: Statistical parameter estimation

Chapter 3 Sampling For Proportions and Percentages

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

Sampling Theory MODULE X LECTURE - 35 TWO STAGE SAMPLING (SUB SAMPLING)

Chapter 11 Systematic Sampling

Lecture 3. Sampling, sampling distributions, and parameter estimation

CHAPTER VI Statistical Analysis of Experimental Data

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

STK4011 and STK9011 Autumn 2016

Estimation of Stress- Strength Reliability model using finite mixture of exponential distributions

Parameter, Statistic and Random Samples

Comparison of Dual to Ratio-Cum-Product Estimators of Population Mean

Chapter 8: Statistical Analysis of Simulated Data

Sampling Theory MODULE V LECTURE - 14 RATIO AND PRODUCT METHODS OF ESTIMATION

Chapter 10 Two Stage Sampling (Subsampling)

Econometric Methods. Review of Estimation

LINEAR REGRESSION ANALYSIS

Arithmetic Mean Suppose there is only a finite number N of items in the system of interest. Then the population arithmetic mean is

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

Continuous Distributions

Chapter 5 Properties of a Random Sample

PROPERTIES OF GOOD ESTIMATORS

MEASURES OF DISPERSION

Lecture Notes Types of economic variables

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

Summary of the lecture in Biostatistics

Chapter 14 Logistic Regression Models

Application of Calibration Approach for Regression Coefficient Estimation under Two-stage Sampling Design

Chapter 8. Inferences about More Than Two Population Central Values

Lecture 02: Bounding tail distributions of a random variable

22 Nonparametric Methods.

1. The weight of six Golden Retrievers is 66, 61, 70, 67, 92 and 66 pounds. The weight of six Labrador Retrievers is 54, 60, 72, 78, 84 and 67.

X ε ) = 0, or equivalently, lim

TESTS BASED ON MAXIMUM LIKELIHOOD

STATISTICAL INFERENCE

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Simulation Output Analysis

Median as a Weighted Arithmetic Mean of All Sample Observations

Simple Linear Regression

A Note on Ratio Estimators in two Stage Sampling

Class 13,14 June 17, 19, 2015

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best

Point Estimation: definition of estimators

Multivariate Transformation of Variables and Maximum Likelihood Estimation

Lecture 2: The Simple Regression Model

Investigation of Partially Conditional RP Model with Response Error. Ed Stanek

Analysis of Variance with Weibull Data

Linear Regression with One Regressor

GOALS The Samples Why Sample the Population? What is a Probability Sample? Four Most Commonly Used Probability Sampling Methods

2.28 The Wall Street Journal is probably referring to the average number of cubes used per glass measured for some population that they have chosen.

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations

Outline. Point Pattern Analysis Part I. Revisit IRP/CSR

A New Family of Transformations for Lifetime Data

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

Wu-Hausman Test: But if X and ε are independent, βˆ. ECON 324 Page 1

ENGI 4421 Propagation of Error Page 8-01

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

Statistics Descriptive and Inferential Statistics. Instructor: Daisuke Nagakura

9 U-STATISTICS. Eh =(m!) 1 Eh(X (1),..., X (m ) ) i.i.d

Bias Correction in Estimation of the Population Correlation Coefficient

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

Comparing Different Estimators of three Parameters for Transmuted Weibull Distribution

Recall MLR 5 Homskedasticity error u has the same variance given any values of the explanatory variables Var(u x1,...,xk) = 2 or E(UU ) = 2 I

Bounds on the expected entropy and KL-divergence of sampled multinomial distributions. Brandon C. Roy

BAYESIAN INFERENCES FOR TWO PARAMETER WEIBULL DISTRIBUTION

STRATIFIED SAMPLING IN AGRICULTURAL SURVEYS

= 1. UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Parameters and Statistics. Measures of Centrality

Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests. Soccer Goals in European Premier Leagues

Econometrics. 3) Statistical properties of the OLS estimator

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5

Chapter 4 Multiple Random Variables

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE EFFICIENCY OF EMPIRICAL LIKELIHOOD WITH NUISANCE PARAMETERS

Lecture 3 Probability review (cont d)

To use adaptive cluster sampling we must first make some definitions of the sampling universe:

Bayes (Naïve or not) Classifiers: Generative Approach

The Mathematical Appendix

A Combination of Adaptive and Line Intercept Sampling Applicable in Agricultural and Environmental Studies

THE ROYAL STATISTICAL SOCIETY 2010 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULE 2 STATISTICAL INFERENCE

f f... f 1 n n (ii) Median : It is the value of the middle-most observation(s).

Faculty Research Interest Seminar Department of Biostatistics, GSPH University of Pittsburgh. Gong Tang Feb. 18, 2005

Idea is to sample from a different distribution that picks points in important regions of the sample space. Want ( ) ( ) ( ) E f X = f x g x dx

SPECIAL CONSIDERATIONS FOR VOLUMETRIC Z-TEST FOR PROPORTIONS

Third handout: On the Gini Index

Maximum Likelihood Estimation

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model

ECON 5360 Class Notes GMM

å 1 13 Practice Final Examination Solutions - = CS109 Dec 5, 2018

Lecture 1 Review of Fundamental Statistical Concepts

C. Statistics. X = n geometric the n th root of the product of numerical data ln X GM = or ln GM = X 2. X n X 1

Modified Moment Estimation for a Two Parameter Gamma Distribution

Bayesian Inferences for Two Parameter Weibull Distribution Kipkoech W. Cheruiyot 1, Abel Ouko 2, Emily Kirimi 3

A Generalized Class of Ratio-Cum-Dual to Ratio Estimators of Finite Population Mean Using Auxiliary Information in Sample Surveys

Multiple Choice Test. Chapter Adequacy of Models for Regression

Transcription:

amplg Theory MODULE II LECTURE - 4 IMPLE RADOM AMPLIG DR. HALABH DEPARTMET OF MATHEMATIC AD TATITIC IDIA ITITUTE OF TECHOLOGY KAPUR

Estmato of populato mea ad populato varace Oe of the ma objectves after the selecto of a sample s to kow about the tedecy of the data to cluster aroud the cetral value ad the scatterdess of the data aroud the cetral value. Amog varous dcators of cetral tedecy ad dsperso, the popular choces are arthmetc mea ad varace. o the populato mea ad populato varablty are geerally measured by arthmetc mea (or weghted arthmetc mea) ad varace. There are varous popular estmators for estmatg the populato mea ad populato varace. Amog them, sample arthmetc mea ad sample varace are more popular tha other estmators. Oe of the reaso to use these estmators s that they possess ce statstcal propertes. Moreover, they are also obtaed through well establshed statstcal estmato procedures lke maxmum lkelhood estmato, least squares estmato, method of momets etc. uder several stadard statstcal dstrbutos. Oe may also cosder other dcators lke meda, mode, geometrc mea, harmoc mea for measurg the cetral tedecy ad mea devato, absolute devato, Ptma earess etc. for measurg the dsperso. The propertes of such estmators ca be studed by umercal procedures lke bootstrapg.

. Estmato of populato mea y = Let us cosder the sample arthmetc mea y = y ad verfy f s a ubased estmator of Y uder the two cases. as a estmator of populato mea Y = = Y R W O R E( y) = E y E( t) t ( where t y) = = = = = = =. y. = = Whe uts are sampled from uts by wthout replacemet, the each ut of the populato ca occur wth ( - ) other uts selected out of the remag ( - ) uts the populato ad each ut occurs so ow of the possble samples. o y = = = = y ( )!!( )! E( y) =. y = y = Y. ( )!( )!! = = Thus y s a ubased estmator of Y. 3

Alteratvely, the followg approach ca also be adopted to show that the sample mea s a ubased estmator of populato mea E( y) = E( yj ) j= = YP j() j= = = Y. j= = = Y = Y j= where Pj () deotes the probablty of selecto of th ut at j th stage. R W R E( y) = E y = = E( y) = ( YP +... + YP) = Y = Y = = = where P s the probablty of selecto of a ut. Thus s a ubased estmator of = for all =,,..., y populato mea uder RWR also. 4

Varace of the estmate Assume that each observato has same varace σ. The V( y) = E( y Y) = E ( y Y) = = E ( ) ( )( ) y Y + y Y yj Y = j = E( y Y) + E( y Y)( y Y) j = j = + σ = where = + K = E( y Y)( y Y). j K K j ow we fd the value of K uder the setups of RWR ad RWOR. 5

R W O R K = E( y Y)( y Y) Cosder ce j j E( y Y)( yj Y) = ( yk Y)( yl Y). ( ) k k k l k= k= k l k ( y Y) = ( y Y) + ( y Y)( y Y) l 0 = ( ) + ( yk Y)( yl Y) k l ( yk Y)( yl Y) = ( ) ( ) =. Thus K= ( ) k l 6

ad substtutg the value of K, the varace of y uder RWOR s V( ywor ) = ( ) =. R W R ow we obta the value of K uder RWR. K = E( y Y)( y Y) = E( y Y) E( y Y) = 0 j j j j because th ad j th draws ( j) are depedet. Thus the varace of y uder RWR s V y ( WR ) =. 7

It s to be oted that f s fte (large eough), the V( y) = both the cases of RWOR ad RWR. o the factor s resposble for chagg the varace of y whe the sample s draw from a fte populato comparso to a fte populato. Ths s why fte populato correcto (fpc). s called as It may be oted that =, so s close to f the rato of sample sze to populato sze s very small or eglgble. I such a case, the sze of the populato has o drect effect o the varace of y. The term s called as samplg fracto. I practce, fpc ca be gored wheever 5% < ad for may purposes eve f t s as hgh as 0%. Igorg fpc wll result the over estmato of varace of y. 8

Effcecy of y uder RWOR over RWR: ow we compare the varaces of sample meas uder RWOR ad RWR. V( y) V( y) WOR WR = = = + = V( y) + a postve quatty WOR V( y) > V( y) WR WOR ad so, RWOR s more effcet tha RWR. 9

Estmato of varace from a sample ce the expressos of varaces of sample mea volve whch s based o populato values, so these expressos ca ot be used real lfe applcatos. I order to estmate the varace of y o the bass of a sample, a estmator of (or equvaletly ) s eeded. Cosder s as a estmator of (or σ ) ad we vestgate ts basedess for the cases of RWOR ad RWR. σ Cosder s y y = ( ) = = ( y Y) ( y Y) = = ( ) ( ) y Y y Y = E( s ) = E( y Y) E( y Y) = = Var( y ) Var( y) = σ = Var( y) 0

I case of RWOR ad so Var( y) WOR = Es ( ) = σ = = I case of RWR ad so Var( y) WR = Es ( ) = σ = = = σ Hece Es ( ) = σ RWOR RWR

A ubased estmate of Var( y) Var ( y) WOR = s s case of RWOR. Var( y) WR =. s s = case of RWR.