Confidence Level We want to estimate the true mean of a random variable X economically and with confidence.

Similar documents
Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

MATH/STAT 352: Lecture 15

Topic 9: Sampling Distributions of Estimators

f(x i ; ) L(x; p) = i=1 To estimate the value of that maximizes L or equivalently ln L we will set =0, for i =1, 2,...,m p x i (1 p) 1 x i i=1

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Topic 9: Sampling Distributions of Estimators

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Common Large/Small Sample Tests 1/55

Computing Confidence Intervals for Sample Data

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Topic 9: Sampling Distributions of Estimators

Parameter, Statistic and Random Samples

Lecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63.

Exam II Review. CEE 3710 November 15, /16/2017. EXAM II Friday, November 17, in class. Open book and open notes.

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Stat 319 Theory of Statistics (2) Exercises

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date:

Sampling Distributions, Z-Tests, Power

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Simulation. Two Rule For Inverting A Distribution Function

AMS570 Lecture Notes #2

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

(7 One- and Two-Sample Estimation Problem )

Statistics 20: Final Exam Solutions Summer Session 2007

z is the upper tail critical value from the normal distribution

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Estimation for Complete Data

Lecture 3. Properties of Summary Statistics: Sampling Distribution

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

Module 1 Fundamentals in statistics

Distributions of Functions of. Normal Random Variables Version 27 Jan 2004

Lecture 7: Properties of Random Samples

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Estimation of the Mean and the ACVF

Stat 421-SP2012 Interval Estimation Section

January 25, 2017 INTRODUCTION TO MATHEMATICAL STATISTICS

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS

Economics Spring 2015

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

ECE 330:541, Stochastic Signals and Systems Lecture Notes on Limit Theorems from Probability Fall 2002

Homework for 2/3. 1. Determine the values of the following quantities: a. t 0.1,15 b. t 0.05,15 c. t 0.1,25 d. t 0.05,40 e. t 0.

1 Introduction to reducing variance in Monte Carlo simulations

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

STATISTICAL INFERENCE

Binomial Distribution

Solutions: Homework 3

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

UNIT 8: INTRODUCTION TO INTERVAL ESTIMATION

Stat 200 -Testing Summary Page 1

Power and Type II Error

Properties and Hypothesis Testing

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

Chapter 8: Estimating with Confidence

Chapter 6. Sampling and Estimation

Linear Regression Models

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: Confidence Interval Guesswork with Confidence

Random Variables, Sampling and Estimation

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:

Notes 5 : More on the a.s. convergence of sums

This section is optional.

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Exam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234

Parameter, Statistic and Random Samples

A Question. Output Analysis. Example. What Are We Doing Wrong? Result from throwing a die. Let X be the random variable

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

Statistical inference: example 1. Inferential Statistics

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

7.1 Convergence of sequences of random variables

The standard deviation of the mean

Expectation and Variance of a random variable

STAT Homework 1 - Solutions

Agenda: Recap. Lecture. Chapter 12. Homework. Chapt 12 #1, 2, 3 SAS Problems 3 & 4 by hand. Marquette University MATH 4740/MSCS 5740

Problem Set 4 Due Oct, 12

7.1 Convergence of sequences of random variables

STAT431 Review. X = n. n )

AMS 216 Stochastic Differential Equations Lecture 02 Copyright by Hongyun Wang, UCSC ( ( )) 2 = E X 2 ( ( )) 2

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Basis for simulation techniques

Topic 10: Introduction to Estimation

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Lecture 11 and 12: Basic estimation theory

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

IE 230 Seat # Name < KEY > Please read these directions. Closed book and notes. 60 minutes.

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Lecture 2: Monte Carlo Simulation

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Transcription:

Cofidece Iterval 700 Samples Sample Mea 03 Cofidece Level 095 Margi of Error 0037 We wat to estimate the true mea of a radom variable X ecoomically ad with cofidece True Mea μ from the Etire Populatio P e μ e < < + Cof Saple Mea from Samples

Approach 0: Estimate the mea of a Gaussia rv with kow variace Assumptio X ~ N, ( μ ) is kow Goodess of our assumptio Not practical at all But our Approach 0 is a ice steppig stoe toward a practical approach Procedure The solutio i Approach 0 cosists of the followig four steps: Step ) Fid the Sample Mea Get iidradom samples of X to form the sample mea ˆ X X j j, Sice the sum of idepedet Gaussia is Gaussia, EX [ ˆ ] μ VAR( ) is Gaussia with Step ) Normalize the Gaussia Sample Mea ~ N(0,) / Step 3) Fid the area of the ormalized Gaussia tail For a arbitrary > 0, fid the value of z such that P z / Alteratively u Qz e du π z

3 Step 4) Formulate the cofidece iterval z / Fially we have z z / ˆ X z μ + z ˆ ˆ PX z μ X + z is referred to as the cofidece level z is referred to as the margi of error ˆ, ˆ X z X + z is referred to as the cofidece iterval of μ For a large, we will have a small margi of error, but at a higher cost

4 Example Assume X ~ N( μ, 4) We wat to estimate μ We will get the sample mea from 6 samples of X Fid the 80% cofidece iterval Solutio 08 z z 8 0 The margi of error is z 8 064 6 The cofidece iterval is expressed as P 064 μ 064 + 08 For usig Excel, NORMSINV returs z Refer to Excel file ormal-dist kow variace Cofidece Iterval Example Note We caot expect to kow the variace whe we do t kow the mea What ca we do to alleviate the impracticality?

5 Sample Variace We still assume X ~ N( μ, ) We wat to estimate μ ad we do ot kow Defie the sample variace as ˆ ( j ) s X X wheere is the sample mea defied as before The sample variace s is a estimate of the true variace s is a fuctio of radom variables Theorem Es [ ] The sample variace is a ubiased estimate of Proof for For, ( ˆ j ) s X X substitutig X + X ad takig expectatio X+ X X+ X + s X X X X + X X X X otig X ad X are iid to X, X X

6 Proof for ay ( j ) s X {( X j μ) ( μ X) } + ˆ ˆ + ˆ ( X j μ) ( X j μ)( X μ) ( X μ) recallig { X j} s X X + ( μ) ( j μ)( μ) ( μ) ( j μ) are iid samples of X ( ) ˆ Recallig VAR X ad VAR( X ) ( μ) ( X j ) + Notig ( ˆ X ) + + X μ Review Note ˆ ( j ) j s X X NOT ˆ ( j ) j s X X

7 Approach : Estimatig μ with ukow variace Use the sample variace s i place of the variace Derive a cofidece iterval from P z s/ However is ot Gaussia s/ We must kow its distributio to fid the value of Gosset showed has the studet-t distributio with degrees of freedom s/ z Properties of the Studet-t distributio with k degrees of freedom Notatio: Y ~ t k k k + Γ + y fy ( y) + k k Γ Γ k < y < Oe ca show 04 Y 0 f ( y) f ( y) Y k VAR( Y ) k lim VAR( Y ) k t N(0,) for a large k k Y 03 0 0-4 - 4 pdf of t k for k,5,30, approachig N(0,) W S Gosset published uder the pseudoym, "A Studet" Ref Distributios various pdf ad cdfxls

8 Studet-t Distributio To show has studet-t distributio with degrees of freedom, s/ we eed the followig lemmas Write / s/ s ( ) Lemma : ~ N(0,) / s ( ) Lemma : ~ χ, chi - square with degrees of freedom Lemma 3 : s ( ) μ ad are idepedet / Lemma 4 : V W ~ t, the studet-t distributio with degrees of freedom provided that V ~ N(0,), W ~ χ, ad V ad W are idepedet

9 The st step for lemma We ca show the followig relatio: ( ) j ˆ s ( X X) j i j i j> i X XX e ( ) Proof of lemma, Chi-Square, for 3 For 3, from e, ( X X X3 ) ( XX XX 3 XX3) s ( ) + + + + 3 3 Let + 3 3 X X X X X + 3 3 U X+ X X 3 3 3 U X X 3 The we ca show ( ) U ~ N 0,, U ( ) ~ N 0,, ad U ad U are idepedet Therefore ad thus s s ( ), ( ) ~ χ ( U ) + ( U )

0 Proof of lemma, Chi-Square, for 3,4,5, For 3, s ( ) ( U) + ( U) X+ X3 U X 3 3 U X X3 For 4, s ( ) ( U) + ( U) + ( U3) 3 X + X3 + X4 U X 4 4 3 X3 + X4 U X 3 3 X3 X4 U3 For 5, s ( ) ( U) + ( U) + ( U3) + ( U4) 4 X+ X3 + X4 + X5 U X 5 5 4 3 X3 + X4 + X5 U X 4 4 3 X4 + X5 U3 X3 3 3 X4 X5 U4

Approach : Evaluatig the cofidece iterval with Sample Variace Step Assume the target radom variable X is Gaussia, N( μ, ) We wat to estimate μ, ad we do ot kow Step ˆ Get samples, ad calculate the sample mea X ad the sample variace s ˆ j ( j ) ˆ X X s X X has the studet-t distributio with degrees of freedom s/ Step 3 For a give cofidece level, determie z such that z t ( u) du Excel TINV (, deg of freedom) returs z Step 4 We kow P z s/ that is, ˆ s ˆ s P X z μ X + z ˆ s, ˆ s X z X + z is referred to as the cofidece iterval Example See t-dist example excel file

No-gaussia Radom Variables What ca we do if the target radom variable is ot gaussia? Solutio Batch Samples Suppose we wat to estimate the true mea of Y, but Y is ot Gaussia The take batches of samples Each batch cosists of m samples Y Y Y X m Y Y Y X m Y Y Y X m Yj+ Yj + + Yjm Compute the average i each batch: X j, j,,, m For a large value of m, we ca assume X is a sample of a Gaussia radom variable X Note that Yj+ Yj + + Yjm E[ X] E[ Y] m So we ca use a estimate of the mea of X as that of the mea of Y j The batch sample method cosists of the followig steps: Step ) Arrage the m samples i batches Each batch cosists of m samples Step ) Compute the average i each of the batches Call the averages as X, X,, X { X j} Treat as Gaussia samples Step 3) Formulate the sample mea ad the sample variace out of the Gaussia samples The rest is same as before The rest proceeds the same as before See t-dist Beroulli CI Example excel file

3 Guidelie to use t-dist cofidece iterval agaist Beroulli samples Margi of Error z s Icreasig the umber of samples makes the cofidece iterval tighter Icreasig also icreases the degree of freedom However, icreasig the degree of freedom beyod 0 reduces z by a isigificat amout A larger size of batches (a larger value of m) reduces the sample variace I our example of biomial samples; 006 for 4-sample batches, 003 for 0-sample batches s Also a larger batch size provides a safer groud for assumig the batch sample meas are Gaussia Also oe must ote that icreases sharply for the values of beyod 095 z