ENGI 4421 Central Limit Theorem Page Central Limit Theorem [Navidi, section 4.11; Devore sections ]

Similar documents
ENGI 4421 Central Limit Theorem Page Central Limit Theorem [Navidi, section 4.11; Devore sections ]

5.1 Two-Step Conditional Density Estimator

D.S.G. POLLOCK: TOPICS IN TIME-SERIES ANALYSIS STATISTICAL FOURIER ANALYSIS

, the random variable. and a sample size over the y-values 0:1:10.

Chapter 3.1: Polynomial Functions

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

AP Statistics Notes Unit Eight: Introduction to Inference

Ch. 1 Introduction to Estimation 1/15

BIO752: Advanced Methods in Biostatistics, II TERM 2, 2010 T. A. Louis. BIO 752: MIDTERM EXAMINATION: ANSWERS 30 November 2010

Quantum Mechanics for Scientists and Engineers. David Miller

Solutions. Definitions pertaining to solutions

Intermediate Division Solutions

A Study on Estimation of Lifetime Distribution with Covariates Under Misspecification

MATH/STAT 352: Lecture 15

Bootstrap Method > # Purpose: understand how bootstrap method works > obs=c(11.96, 5.03, 67.40, 16.07, 31.50, 7.73, 11.10, 22.38) > n=length(obs) >

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

MATH Midterm Examination Victor Matveev October 26, 2016

Topic 9: Sampling Distributions of Estimators

Comparative analysis of bayesian control chart estimation and conventional multivariate control chart

Chapter 6 Sampling Distributions

Topic 9: Sampling Distributions of Estimators

If σis unknown. Properties of t distribution. 6.3 One and Two Sample Inferences for Means. What is the correct multiplier? t

1036: Probability & Statistics

5.80 Small-Molecule Spectroscopy and Dynamics

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

MATHEMATICS 9740/01 Paper 1 14 Sep hours

(6) Fundamental Sampling Distribution and Data Discription

Topic 9: Sampling Distributions of Estimators

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Parameter, Statistic and Random Samples

The Simple Linear Regression Model: Theory

Author. Introduction. Author. o Asmir Tobudic. ISE 599 Computational Modeling of Expressive Performance

ALE 26. Equilibria for Cell Reactions. What happens to the cell potential as the reaction proceeds over time?

Probability and statistics: basic terms

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Review for cumulative test

Markov processes and the Kolmogorov equations

Statistics 511 Additional Materials

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

The Excel FFT Function v1.1 P. T. Debevec February 12, The discrete Fourier transform may be used to identify periodic structures in time ht.

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Parameter, Statistic and Random Samples

Binomial Distribution

CHAPTER 2. Mean This is the usual arithmetic mean or average and is equal to the sum of the measurements divided by number of measurements.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Active redundancy allocation in systems. R. Romera; J. Valdés; R. Zequeira*

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

, which yields. where z1. and z2

Grade 3 Mathematics Course Syllabus Prince George s County Public Schools

Expectation and Variance of a random variable

Outline Map Activity. Regional Atlas Activity A. Name Date Class

Multi-objective Programming Approach for. Fuzzy Linear Programming Problems

Final Review for MATH 3510

A Hartree-Fock Calculation of the Water Molecule

[1 & α(t & T 1. ' ρ 1

AP Statistics Notes Unit Two: The Normal Distributions

Module 1 Fundamentals in statistics

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

Identical Particles. We would like to move from the quantum theory of hydrogen to that for the rest of the periodic table

Eco411 Lab: Central Limit Theorem, Normal Distribution, and Journey to Girl State

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Statistical Intervals for a Single Sample

Chapter 6. Sampling and Estimation

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Confidence Intervals

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Physical Chemistry Laboratory I CHEM 445 Experiment 2 Partial Molar Volume (Revised, 01/13/03)

(7 One- and Two-Sample Estimation Problem )

Central Limit Theorem the Meaning and the Usage

STAT 515 fa 2016 Lec Sampling distribution of the mean, part 2 (central limit theorem)

An Investigation of Stratified Jackknife Estimators Using Simulated Establishment Data Under an Unequal Probability Sample Design

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Every gas consists of a large number of small particles called molecules moving with very high velocities in all possible directions.

UNIVERSITY OF TECHNOLOGY. Department of Mathematics PROBABILITY THEORY, STATISTICS AND OPERATIONS RESEARCH GROUP. Memorandum COSOR 76-10

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Chapter 4. Problem Solutions

STAT431 Review. X = n. n )

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

STAT 203 Chapter 18 Sampling Distribution Models

Example: Find the SD of the set {x j } = {2, 4, 5, 8, 5, 11, 7}.

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

Statistical inference: example 1. Inferential Statistics

PH 425 Quantum Measurement and Spin Winter SPINS Lab 1

Statistics 20: Final Exam Solutions Summer Session 2007

Lecture 21: Signal Subspaces and Sparsity

Lesson 10: Limits and Continuity

Basics of Probability Theory (for Theory of Computation courses)

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

7.1 Convergence of sequences of random variables

Internal vs. external validity. External validity. Internal validity

Working with Two Populations. Comparing Two Means

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Solutions Descriptive Statistics. None at all!

SUPPLEMENTARY MATERIAL GaGa: a simple and flexible hierarchical model for microarray data analysis

Lecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63.

Transcription:

ENGI 441 Cetral Limit Therem Page 11-01 Cetral Limit Therem [Navidi, secti 4.11; Devre sectis 5.3-5.4] If X i is t rmally distributed, but E X i, V X i ad is large (apprximately 30 r mre), the, t a gd apprximati, X ~ N, At "http://www.egr.mu.ca/~ggerge/441/dems/clt.html" is a demstrati web prgram t illustrate hw the sample mea appraches a rmal distributi eve fr highly -rmal discrete distributis f X. Csider the expetial distributi, whse p.d.f. (prbability desity fucti) is x 1 1 f x; e, x 0, 0 E X, VX It ca be shw that the exact p.d.f. f the sample mea fr sample size is 1 x x e f x;,, x 0, 0, X 1! 1 1 with mea ad variace E X, V X. [A -examiable derivati f this p.d.f. is available at " http://www.egr.mu.ca/~ggerge/441/dems/cltexp.pdf ".] Fr illustrati, settig = 1, the p.d.f. fr the sample mea fr sample sizes = 1,, 4 ad 8 are: 1: f x e x x : f x 4xe 3 4x 44x e 4: 8: f x f x X 3! X X 7 8 x e 88 7! x The ppulati mea = E[X] = 1 fr all sample sizes. The variace ad the psitive skew bth dimiish with icreasig sample size. The mde ad the media apprach the mea frm the left.

ENGI 441 Cetral Limit Therem Page 11-0 Fr a sample size f = 16, the sample mea X has the p.d.f. f X x 15 16 x e x 16 16 1 ad parameters E X 1 ad V X 15!. 16 A plt f the exact p.d.f is draw here, tgether with the rmal distributi that has the same mea ad variace. The apprach t rmality is clear. Beyd = 40 r s, the differece betwee the exact p.d.f. ad the Nrmal apprximati is egligible. It is geerally the case that, whatever the prbability distributi f a radm quatity may be, the prbability distributi f the sample mea X appraches rmality as the sample size icreases. Fr mst prbability distributis f practical iterest, the rmal apprximati becmes very gd beyd a sample size f 30. Example 11.01 A radm sample f 100 items is draw frm a expetial distributi with parameter = 0.04. Fid the prbabilities that (a) a sigle item has a value f mre tha 30; (b) the sample mea has a value f mre tha 30. (a)

ENGI 441 Cetral Limit Therem Page 11-03 Example 11.01 (ctiued) (b)

ENGI 441 Cetral Limit Therem Page 11-04 Sample Prprtis A Berulli radm quatity has tw pssible utcmes: x = 0 (= failure ) with prbability q1 p ad x = 1 (= success ) with prbability p. Suppse that all elemets f the set X 1, X, X 3,, X are idepedet Berulli radm quatities, (s that the set frms a radm sample). Let T X 1 X X 3 X umber f successes i the radm sample ad Pˆ T = prprti f successes i the radm sample, the T is

ENGI 441 Cetral Limit Therem Page 11-05 Example 11.0 55% f all custmers prefer brad A. Fid the prbability that a majrity i a radm sample f 100 custmers des t prefer brad A.

ENGI 441 Cfidece Itervals (Oe Sample) Page 11-06 Cfidece Itervals (Oe Sample) [Navidi, sectis 5.1-5.3; Devre chapter 7] S far we have cstructed prbability statemets hw likely certai sample values are, give kwledge f the ppulati frm which the radm sample came. Nw we shall reverse that situati: we have a kw sample i frt f us, frm which we ca ifer the values f the parameters f the ppulati frm which the sample was draw. This is the realm f iferetial statistics. If the radm quatity X is such that ~ N, X, the it is highly ulikely that X will be mre tha three stadard deviatis away frm its mea: P X 3 PX 3 PZ 3 3.00.003 Mre tha 99.7% f the time, X will be clser tha three stadard deviatis t its mea. Fr a sufficietly large radm sample, the cetral limit therem assures us that the sample mea X ~ N, (either exactly r t a excellet apprximati). Therefre we have 99.7% cfidece that a bserved sample mea x is withi three stadard errrs f the ppulati mea. This lie f reasig allws us t replace a pit estimate by a rage f plausible values f a ukw parameter a cfidece iterval. Mre geerally, whe X ~ N,, P z/ X z/ 1

ENGI 441 Cfidece Itervals (Oe Sample) Page 11-07 The cfidece iterval estimatr fr (at a level f cfidece f 1 ) is X z X z The 1 cfidece iterval estimatr fr is a radm iterval X z, X z The prbability is (1 ) that the abve radm iterval icludes the true value f.. 1 1 cfidece f all radm samples will prduce a iequality, (the iterval estimate fr ) x z x z that is true. Nte that the cfidece iterval estimate ctais radm quatities at all! The statemet is either abslutely certai t be true r abslutely certai t be false, (depedig the values f,, x, ad ). Iterpretati f a cfidece iterval [ = cfidece iterval estimate ] Oly 5% f all 95% cfidece iterval estimates fr fail t iclude.

ENGI 441 Cfidece Itervals (Oe Sample) Page 11-08 A ccise expressi fr the C.I. (cfidece iterval estimate fr ) is x z A Bayesia view f iterval estimati: If the ly quatity amg {,, x, ad } that we d t kw is, the represet the ukw by the radm quatity A. The A te abut the stadard rmal distributi ad the t distributi Let Z ~ N(0, 1) (stadard rmal distributi), s that P Z z z (cumulative distributi fucti fr the stadard rmal distributi). th The the 1 100 percetile f the stadard rmal distributi is z, which satisfies PZ z. It als fllws that 1 z z. 1 z The t distributi with ν degrees f freedm is als a bell shaped curve, with a mea, media ad mde at t 0, but with a greater variace tha the stadard rmal distributi. As the umber f degrees f freedm icreases, the t distributi appraches the z (stadard rmal) distributi. The graphs f t 1 ad t 5 are shw here, tgether with z, which is idistiguishable t the eye frm t fr abve 30 r s. Therefre lim t, t, z. Use the t distributi ly if the true ppulati variace is ukw.

ENGI 441 Cfidece Itervals (Oe Sample) Page 11-09 1 100 percetile z, use the fial rw i the table f critical values T fid the th f the t distributi ( page 17-0 r the iside back cver f the textbk): z t., The fial rw f the t tables is 0.1 0.05 0.05 0.01 0.005 1.8155 1.64485 1.95996.3635.57583 Therefre PZ 1.645.05 r equivaletly z.05 1.645 ; P 1.960.05 Z r equivaletly z.05 1.960 ; etc. Example 11.03 The rate f eergy lss X (watt) i a mtr is kw t be a rmally distributed radm quatity with stadard deviati 3.0 W. A radm sample f 100 such mtrs prduces a sample mea rate f eergy lss f 58.3 W. Fid a 99% cfidece iterval estimate fr the true mea rate f eergy lss.

ENGI 441 Cfidece Itervals (Oe Sample) Page 11-10 Example 11.03 (ctiued) Hw large must be fr the width f the 99% cfidece iterval estimate fr t be less tha 1.0? Chice f sample size The width f the cfidece iterval x z, x z is w z z w The sample size is iversely related t the square f the desired width. Edpits f a (1 ) CI fr : (a) kw: (b) ukw, large: (c) ukw, small: Whe is small, X must be early (r exactly) rmal.

ENGI 441 Cfidece Itervals (Oe Sample) Page 11-11 Example 11.04 The lifetime X f a particular brad f filamets is kw t be rmally distributed. A radm sample f six filamets is tested t destructi ad they are fud t last fr a average f 1,008 hurs with a sample stadard deviati f 6. hurs. (a) Fid a 95% cfidece iterval estimate fr the ppulati mea lifetime. (b) Is the evidece csistet with 1000? (c) Is the evidece csistet with > 1000?

ENGI 441 Cfidece Itervals (Oe Sample) Page 11-1 Prperties f a cfidece iterval If we thik f the width f the cfidece iterval as specifyig its precisi, the the cfidece level (r reliability) f the iterval is iversely related t its precisi. Estimati f Ppulati Prprti Whe a radm sample f size is draw frm a ppulati i which a prprti p f the items are successes, the, as we saw page 11.04, pq P ˆ ~ N p, fr sufficietly large p ad q, (amely, p > 10 ad q > 10 ). Cmpare this with X ~ N,, fr which the crrespdig cfidece iterval has edpits x z / s. Hwever, the variace pq is ukw because p ad q are ukw. A bvius remedy is t replace the ukw parameters p ad q by their pit estimates ˆp ad ˆq.

ENGI 441 Cfidece Itervals (Oe Sample) Page 11-13 Therefre, a simple 100(1 )% cfidece iterval estimatr fr p is P z / PQ ad the 100(1 )% cfidece iterval estimate fr p is pˆ z / pq ˆˆ Hwever, these cfidece itervals ca exhibit sigificat errrs whe either p r q is much less tha 100. Durig the 1990 s, mre reliable cfidece itervals fr p were develped. Oe f them is (Devre, sixth editi, secti 7., page 66) : z pq ˆˆ z pˆ 4 z / 1 / / z / Ather iterval, frm the Navidi textbk (page 339), is the Agresti-Cull iterval. If x is the bserved umber f successes i a radm sample f idepedet Berulli trials, the defie x* x ad * 4 s that p* x* x * 4 ad q* 1 p* The the 100(1 )% cfidece iterval estimate fr p is p* z / p* q* * It turs ut that the Bayesia pit estimate fr p is p*, t ˆp.

ENGI 441 Cfidece Itervals (Oe Sample) Page 11-14 Example 11.05 Frm a radm sample f e thusad silic wafers, 750 pass a quality ctrl test. Fid a 99% cfidece iterval estimate fr p (the true prprti f wafers i the ppulati that are gd). = 1000 ad x = 750 x 750 3 p ˆ 1000 4 1 q ˆ 1 pˆ 4.005 z.005 t.005,.576 Edpits f the C.I.: pq ˆˆ.75.5 pˆ z.75.576 =.75.035 7... 1000 Therefre the 99% cfidece iterval estimate fr p is crrect t three sigificat figures. 71.5% p 78.5% Usig the mre precise Agresti-Cull versi f the cfidece iterval yields x* = 750 + = 75, * = 1000 + 4 = 1004 p* x* 75 * 1004.749003 p* q*.749.50 * 1004.013683 The 99% CI is therefre p* q* p* z.005.749 *.576.0136.749003.03547 71.4% p 78.4%

ENGI 441 Cfidece Itervals (Oe Sample) Page 11-15 (1)100% Bayesia Cfidece Iterval fr [t i the Navidi r Devre textbks] Suppse that previus evidece leads us t believe that. The stregth f this belief is represeted by the variace (lwer variace crrespds t strger belief). We wish t update that estimate after a radm sample f size has bee examied. Assume that 30 (s that the Cetral Limit Therem will apply). Prir distributi: New evidece: X ~ N, Sample size = Sample mea = x Sample stadard deviati = s Calculate where wd, w x d w d w w 1, w w w are the weights f the data ad rigial ifrmati respectively, give by wd 1 1, w s d Psterir distributi: X ~ N, 1 100% Bayesia iterval fr : z / Cmpare with the classical 1 100% cfidece iterval fr : s x z / ( 30) r x z / I may applicatis, the Bayesia iterval is fte arrwer tha the classical cfidece iterval, because the Bayesia iterval icrprates mre ifrmati (previus evidece r belief abut the true value f ). [Nte: it is easy t shw that as (r if x the), * = x ad that as, * s, which are the classical expressis.]

ENGI 441 Cfidece Itervals (Oe Sample) Page 11-16 Examples f Bayesia Cfidece Itervals These examples are mdificatis f the previus examples f classical cfidece itervals fr. Example 11.06 (mdificati f Example 11.03) The rate f eergy lss X (watt) i a mtr is kw t be a rmally distributed radm quatity ad prir experiece suggests that the mea is 60 W with stadard deviati 3.0 W. A radm sample f 100 such mtrs prduces a sample mea rate f eergy lss f 58.3 W with sample stadard deviati.8 W. Fid a 99% cfidece iterval estimate fr the true mea rate f eergy lss.

ENGI 441 Cfidece Itervals (Oe Sample) Page 11-17 Example 11.07 (mdificati f Example 11.04) The lifetime X f a particular brad f filamets is kw t be rmally distributed. Prir experiece suggests that 1000 ad 6.0. A radm sample f six filamets is tested t destructi ad they are fud t last fr a average f 1,008 hurs with a sample stadard deviati f 6. hurs. (a) Fid a 95% cfidece iterval estimate fr the ppulati mea lifetime. (b) Is the evidece csistet with 1000? [Ed f Chapter 11]

ENGI 441 Cfidece Itervals (Oe Sample) Page 11-18 [Space fr Additial Ntes]