Central Limit Theorem the Meaning and the Usage

Similar documents
Homework 5 Solutions

Expectation and Variance of a random variable

Introduction to Probability and Statistics Twelfth Edition

STAT 515 fa 2016 Lec Sampling distribution of the mean, part 2 (central limit theorem)

PRACTICE PROBLEMS FOR THE FINAL

1036: Probability & Statistics

Confidence Intervals for the Population Proportion p

( ) = p and P( i = b) = q.

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

Parameter, Statistic and Random Samples

Binomial Distribution

Chapter 6 Sampling Distributions

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

(7 One- and Two-Sample Estimation Problem )

PRACTICE PROBLEMS FOR THE FINAL

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Frequentist Inference

Exam 2 Instructions not multiple versions

(6) Fundamental Sampling Distribution and Data Discription

Chapter 23: Inferences About Means

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

IE 230 Seat # Name < KEY > Please read these directions. Closed book and notes. 60 minutes.

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Stat 400: Georgios Fellouris Homework 5 Due: Friday 24 th, 2017

Parameter, Statistic and Random Samples

Statistics 511 Additional Materials

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Stat 225 Lecture Notes Week 7, Chapter 8 and 11

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Topic 9: Sampling Distributions of Estimators

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Understanding Samples

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

MATH/STAT 352: Lecture 15

Sampling Distributions, Z-Tests, Power

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

AMS570 Lecture Notes #2

Sample questions. 8. Let X denote a continuous random variable with probability density function f(x) = 4x 3 /15 for

Estimation of a population proportion March 23,

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

Data Description. Measure of Central Tendency. Data Description. Chapter x i

PH 425 Quantum Measurement and Spin Winter SPINS Lab 1

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Eco411 Lab: Central Limit Theorem, Normal Distribution, and Journey to Girl State

STP 226 EXAMPLE EXAM #1

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Chapter 1 (Definitions)

n outcome is (+1,+1, 1,..., 1). Let the r.v. X denote our position (relative to our starting point 0) after n moves. Thus X = X 1 + X 2 + +X n,

Statisticians use the word population to refer the total number of (potential) observations under consideration

Topic 6 Sampling, hypothesis testing, and the central limit theorem

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

STAT 203 Chapter 18 Sampling Distribution Models

independence of the random sample measurements, we have U = Z i ~ χ 2 (n) with σ / n 1. Now let W = σ 2. We then have σ 2 (x i µ + µ x ) 2 i =1 ( )

Chapter 8: Estimating with Confidence

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

IE 230 Probability & Statistics in Engineering I. Closed book and notes. No calculators. 120 minutes.

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

AAEC/ECON 5126 FINAL EXAM: SOLUTIONS

Lecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63.

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

a.) If random samples of size n=16 are selected, can we say anything about the x~ distribution of sample means?

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Topic 9: Sampling Distributions of Estimators

Topic 10: Introduction to Estimation

Discrete probability distributions

Working with Two Populations. Comparing Two Means

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Estimating the Population Mean - when a sample average is calculated we can create an interval centered on this average

Module 1 Fundamentals in statistics

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Economics 250 Assignment 1 Suggested Answers. 1. We have the following data set on the lengths (in minutes) of a sample of long-distance phone calls

Understanding Dissimilarity Among Samples

Elements of Statistical Methods Lots of Data or Large Samples (Ch 8)

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: Confidence Interval Guesswork with Confidence

6 Sample Size Calculations

Statistics 300: Elementary Statistics

Read through these prior to coming to the test and follow them when you take your test.

Stat 400, section 5.4 supplement: The Central Limit Theorem

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions

Distribution of Random Samples & Limit theorems

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

BIOSTATS 640 Intermediate Biostatistics Frequently Asked Questions Topic 1 FAQ 1 Review of BIOSTATS 540 Introductory Biostatistics

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

Math 140 Introductory Statistics

Topic 9: Sampling Distributions of Estimators

4. Partial Sums and the Central Limit Theorem

Example: Find the SD of the set {x j } = {2, 4, 5, 8, 5, 11, 7}.

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

CH5. Discrete Probability Distributions

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Transcription:

Cetral Limit Theorem the Meaig ad the Usage Covetio about otatio. N, We are usig otatio X is variable with mea ad stadard deviatio. i lieu of sayig that X is a ormal radom Assume a sample of measuremets (or couts) that are distributed with the same probability distributio that has mea ad stadard deviatio. *** Example: Take a sample of 30 GPAs of high school seiors. Now look at the average x of your sample. It has a particular value. 3.3 3.11.35 1.39 3.80 3.1.35.76 3.43 1.91.7.3.10 3.4 3.55.34 3.78 1.66 3.56 3.45.87 3.10 3.07 3.8 1.64 3.01 3.45 3.10.4 1.98 Average (sample mea) x.79. Take aother sample of equal size ad average of that oe has aother value. 3.9.1.78.19 3.50 3.30.35.75 3.3.8..3.10 3.4 3.45.34.35.45.56 3.45.87 3.40 3.43.8 1.89 3.11.55 3.3.55 1.67 Average x.73. Ad so o. The sample average x itself is a radom variable with a probability distributio. We shall hoor that fact by writig X. What kid of probability distributio? Fact A. If the origial measuremet X is (approximately) ormally distributed the the sample mea X is (approximately) ormally distributed with the mea ad the stadard deviatio ( i.e., variace ), if the sample size is.

But eve if the origial measuremet is ot ormally distributed we still have: Fact B. Cetral Limit Theorem: As the sample size () icreases the closer to the ormal radom variable the sample mea X is, regardless of the ature of the iitial radom variable. This ormal is with mea ad the stadard deviatio ( i.e., variace ). The rule of thumb we use for the safe applicatio of CLT is whe the sample size is at least 30 or more. This allows us to use ormal distributio to assess probabilities of sample mea beig i some particular rage. Note: The umber Example 1. is ofte called stadard error of the mea. The height (X) of 18-year-old me is approximately ormally distributed with the mea X N 70,3. 70 iches ad stadard deviatio 3 iches, i.e., (a) What is the chace that a 18-year old ma selected at radom is betwee 67 ad 73 iches tall? Aswer: P X 67 73 0.683 by usig empirical rule. (b) If we select the sample of ie 18-year-olds what is the chace that the mea height X of the sample is betwee 67 ad 73 iches? 3 9 Aswer: Usig the Fact A above we have that X is N 70, N70,1 P 67 X 73 0.997 by empirical rule agai., therefore Example. The white blood cell cout per cubic millimeter of blood is approximately ormal with mea 7500 ad stadard deviatio 1750. Doctor uses two-test average X. What is the chace that X 4500?

Aswer: Usig Fact A above we have that X is approximately ormal 1750 N 7500, thus 4500 7500 P X 4500 PZ PZ.4 0.0078. 1750 This is very ulikely, i less tha 1% of the tests i geeral populatio. Example 3. Average Math SAT i the US for college boud seiors i 004 was 518 with stadard deviatio of 118. We assume approximately ormal distributio of the scores. Professor Witt advises 4 freshme. Let X be their average Math SAT. What is the chace that this is (a) better tha 600? (b) less tha 400? (c) better tha 500 but less tha 600? 118 4 (a) 600 518 600 518 P X 600 PZ P Z PZ 1.39 0.083 118. 59 4 Aswers: First ote that X is approximately ormal, N 518, N518,59 (b) 400 518 P X 400 P Z P Z 0.08. 59 (c) 500 518 600 518 P 500 X 600 P Z P 0.31 Z 1.39 0.5394. 59 59.

Example 4. Suppose the populatio of adult males had a mea weight of 18 pouds with a stadard deviatio of 3 pouds. The umber of males treated for heart related problems i local healthcare facility was 53 i 011. Their average weight was 191. Would this lead to coclusio that males with heart related problems have more weight? Aswer: 3 By the Fact B from CLT we have that X is approximately ormal N 18, 53 If we assume that the mea weight of 18 applies to this group of idividuals the the chace that the weight would be above 191 is 19118 P X 191 PZ PZ.8487 0.00. 3 53 This idicates that it is very ulikely that mea weight of 18 ad stadard deviatio of 53 is applicable o this group. It is far more likely that the me with heart related problems are heavier tha 18 pouds o average. Example 5. The mea lifetime of a certai type of icadescet light bulb is 900 hours with a stadard deviatio of 900 hours. We have a batch of 100 of these light bulbs. What is the chace these light bulbs are goig last more tha 100,000 hours? Aswer: Let s deote the lifetime of a sigle light bulb with X. The questio is posed as where P X1 X X100 100000 X i meas the lifetime of i-th light bulb. We ca write this questio as X1 X X100 P 1000 P X 1000 100. Now we ca use CLT because X is approximately with mea 900 ad stadard deviatio 90. 900 N 900,. That meas ormal 100

1000 900 P X 1000 P Z PZ 1.11 1 0.8665 0.1335 90 The batch of bulbs has about 13% chace to last more tha 100,000 hours. Example 6. A shippig compay charges extra if the average shippig load durig oe week period exceeds the compay limit of 0 tos per cotaier. Meat packig compay ships o average 18.5 tos per cotaier with a stadard deviatio of 7.4 tos. If they shipped 68 cotaiers this week, what is the chace the average load was above 0 tos? Aswer: The shippig load per cotaier is a radom variable X we do ot kow the ature of, apart from mea ad stadard deviatio. Therefore to aswer the questio, what is the probability that X, the average of 68 shippig loads exceeds 0 tos we use CLT. 7.4 Therefore X is approximately ormal, N 18.5,. 68 0 18.5 P X 0 PZ PZ 1.67 0.0475 7.4 68 The aswer is approximately 5%. Homework: Check olie.

Biomial Distributio by Normal Approximatio (Optioal) Remider: X = B(, p) the cout of successes for biomial distributio with the umber of idepedet idetical trials ad probability of success o oe trial p. Every trial ca ed with oly two possible outcome success ad failure. Probability of failure is q = 1 p. The probability mass formula is P X r p q r r r Accordig to Cetral Limit Theorem, biomial distributio i case of a large ca be approximated with a ormal distributio with the same mea (p) ad the same stadard deviatio ( pq ). The implicatio is that respective probabilities for a biomial radom variable (cout of successes) ca be approximated by usig z values of the variable i the charts for stadard ormal distributio. There are two requiremets the approximatio eeds to meet: p 5, q 5. Example 1. 1 out of 4 wome that smoke will die from smokig related diseases. What is the probability that out of 1,000 wome smokers more tha 50 will die from smokig related diseases? What is the probability that the death from smokig related disease will be betwee 40 ad 60 cases ( 40 X 60 )? Aswer: X, the cout of deaths is 1000,0.5 sice we have p = 50, q = 750. B. This ca be approximated by N 50,13.7 40 50 60 50 P X P Z P Z 13.7 13.7 40 60 0.73 0.73 0.5346. Example. O average % of batteries produced by a car battery maufacturer are defective. What is the chace that i a stock of 10,000 batteries there are o more tha 180 defective?

Aswer: X, the cout of defective batteries i a stock of 10,000 is B 10000,0.0 approximated by N 00,196. Obviously p = 00 > 5, q = 9800 > 5. 180 00 P X 180 PZ PZ 1.43 0.0764. 14. This ca be Example 3 (Small rages or specific value of the variable). O average oe out of 6 smokers is goig to die from the lug cacer. If there are 10 smokers i my eighborhood, what is the chace there will be 0 of them that will die of lug cacer? Aswer: X, the cout of rouds wo is B 10,1/ 6. This ca be approximated by 0, 16.67 19.5 0 0.5 0 P X P X P Z P Z 4.08 4.08 N. 0 19.5 0.5 0.1 0.1 0.0956 Oe ca calculate this probability directly from formula for the mass of B 100,1/ 38, 0 100 10 1 5 P X 3 0.0973 0. 6 6 Obviously the error of ormal approximatio to biomial was less tha 0.%. Homework: Check olie.