Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date:

Similar documents
Mathacle PSet Stats, Confidence Interval Level Number Name: Date: Confidence Interval Guesswork with Confidence

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: Confidence Interval Guesswork with Confidence

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

Lecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63.

Chapter 8: Estimating with Confidence

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

MATH/STAT 352: Lecture 15

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

(7 One- and Two-Sample Estimation Problem )

AP Statistics Review Ch. 8

Confidence Intervals QMET103

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Chapter 23: Inferences About Means

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Common Large/Small Sample Tests 1/55

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Sampling Error. Chapter 6 Student Lecture Notes 6-1. Business Statistics: A Decision-Making Approach, 6e. Chapter Goals

Topic 9: Sampling Distributions of Estimators

University of California, Los Angeles Department of Statistics. Hypothesis testing

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS

Final Examination Solutions 17/6/2010

CONFIDENCE INTERVALS STUDY GUIDE

Topic 9: Sampling Distributions of Estimators

Statistical Intervals for a Single Sample

Statistics 511 Additional Materials

Topic 9: Sampling Distributions of Estimators

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Sample Size Determination (Two or More Samples)

Confidence Level We want to estimate the true mean of a random variable X economically and with confidence.

Confidence Interval for one population mean or one population proportion, continued. 1. Sample size estimation based on the large sample C.I.

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Chapter 6 Sampling Distributions

Module 1 Fundamentals in statistics

Working with Two Populations. Comparing Two Means

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?

Chapter 1 (Definitions)

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

Stat 225 Lecture Notes Week 7, Chapter 8 and 11

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

Economics Spring 2015

Confidence Intervals รศ.ดร. อน นต ผลเพ ม Assoc.Prof. Anan Phonphoem, Ph.D. Intelligent Wireless Network Group (IWING Lab)

STAT 155 Introductory Statistics Chapter 6: Introduction to Inference. Lecture 18: Estimation with Confidence

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

f(x i ; ) L(x; p) = i=1 To estimate the value of that maximizes L or equivalently ln L we will set =0, for i =1, 2,...,m p x i (1 p) 1 x i i=1

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Read through these prior to coming to the test and follow them when you take your test.

Random Variables, Sampling and Estimation

Stat 200 -Testing Summary Page 1

Confidence Intervals for the Population Proportion p

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1

Exam 2 Instructions not multiple versions

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

Probability and Statistics Estimation Chapter 7 Section 3 Estimating p in the Binomial Distribution

Topic 5 [434 marks] (i) Find the range of values of n for which. (ii) Write down the value of x dx in terms of n, when it does exist.

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Chapter 8 Interval Estimation

MA238 Assignment 4 Solutions (part a)

Error & Uncertainty. Error. More on errors. Uncertainty. Page # The error is the difference between a TRUE value, x, and a MEASURED value, x i :

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

UNIT 8: INTRODUCTION TO INTERVAL ESTIMATION

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Stat 319 Theory of Statistics (2) Exercises

Estimating the Population Mean - when a sample average is calculated we can create an interval centered on this average

Data Analysis and Statistical Methods Statistics 651

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Statistical inference: example 1. Inferential Statistics

STAT 203 Chapter 18 Sampling Distribution Models

z is the upper tail critical value from the normal distribution

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Statistics 20: Final Exam Solutions Summer Session 2007

1 Inferential Methods for Correlation and Regression Analysis

Expectation and Variance of a random variable

This is an introductory course in Analysis of Variance and Design of Experiments.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Confidence Intervals

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Simulation. Two Rule For Inverting A Distribution Function

A Confidence Interval for μ

CH19 Confidence Intervals for Proportions. Confidence intervals Construct confidence intervals for population proportions

Properties and Hypothesis Testing

Mathacle PSet Stats, Confidence Intervals and Estimation Level Number Name: Date: Unbiased Estimators So we don t have favorite.

Chapter 13, Part A Analysis of Variance and Experimental Design

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

Computing Confidence Intervals for Sample Data

Exam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234

Topic 10: Introduction to Estimation

1 Constructing and Interpreting a Confidence Interval

STAC51: Categorical data Analysis

Transcription:

PSet ----- Stats, Cocepts I Statistics 7.3. Cofidece Iterval for a Mea i Oe Sample [MATH] The Cetral Limit Theorem. Let...,,, be idepedet, idetically distributed (i.i.d.) radom variables havig mea µ ad fiite oero variace. Let + +... + =, the µ lim Pr = Φ( ) / The Cetral Limit Theorem says that whe the sample sie is large, this ubiased µ estimator ~ N( µ, ) ~ N(0,). Note that the radom variables eed ot be ormally distributed. [MATH] The correct hypergeometric model should add the correctio factor to the variace: N = N Whe the 0% rule is satisfied, the stadard deviatio is 34

PSet ----- Stats, Cocepts I Statistics N N = = N N The populatio mea ca be estimated i the followig two cases: CASE #: Large Sample Sie ad Kow Suppose...,,, be idepedet, idetically distributed (i.i.d.) radom variables havig mea µ ad fiite oero variace. The is the ubiased mea estimator: = + +... + The variace is + + + = = E[... E[ ] E i ] = µ = µ + [ ] +... + Var i = Var[ ] = Var = = = Whe the sigificat level is give ad sample sie is relatively large, the populatio mea µ ad the sample statistic are related through the ormal distributio: µ Pr > = i i= where the sample statistic =. The cofidece iterval CI is, CI =, + 35

PSet ----- Stats, Cocepts I Statistics Or, i the form of iterval otatio: < µ < + I the form of margi of error (MoE) : ± [PROCEDURE] Cofidece Iterval for a Mea with Large Sample Sie ad Kow Variace The coditios to obtai CI for the sample mea (statistic) with sample sie are.) The sample is a idepedet radom sample..) The sample sie is less tha 0% of populatio to reduce the w/o replacemet effect 3.) is kow ad > 30 for the coditio to use ormal distributio. 4.) The CI is costructed from the sample statistic for a give sigificace level : CI =, + Where is the terms of margi of error (MoE). [Ti-84] Cofidece Iterval for a Mea with Large Sample Sie ad Kow Variace.) STAT -> TESTS ->7. ZIterval, select Stats if the statistic is give..) Iput,,,. 36

PSet ----- Stats, Cocepts I Statistics Eample 7.3.. The Presidet of a large uiversity wishes to estimate the average age of the studets presetly erolled. For the past studies, the stadard deviatio is kow to be years. A sample of 50 studets is selected radomly, ad sample mea is foud to be 3. years. Fid the 95% cofidece iterval of the school s populatio mea. Use the calculator to verify your aswers. Solutio: The sample sie = 50 > 30, ad it is assumed that the school has more tha 500 studets to make the sample sie less tha 0% of the populatio so that the o replacemet effect is reduced. =, = 3., = 0. 05, = =. 96, CI =? or 3..96 < µ < 3. +.96 50 CI = (.6, 3.8) 50 That is, the Presidet ca say with 95% cofidece that the average age of studets is betwee.6 ad 3.8 years old. Eample 7.3.. From the last eample, the Presidet would like to be 99% cofidet that the estimate of average age should be accurate withi year whe the stadard deviatio of the ages is 3 years. How large a sample is ecessary? 37

PSet ----- Stats, Cocepts I Statistics Solutio: = 3, = 0. 0, = =. 58, =? < The sample sie eeds to be at least 60..58 3 < > 60 Eample 7.3.3. [CB] A large compay is cosiderig opeig a frachise i St. Louis ad wats to estimate the mea household icome for the area usig a simple radom sample of the households. Based o iformatio from a pilot study, the compay assumes that the stadard deviatio of household icomes is = $7,00. What is the least umber of households that should be surveyed to obtai a estimate that is withi $00 of the true mea houshold icome with 95 percet cofidece? Solutio: The variace is kow. = 7, 00, = 0.05, =.96, 00 ( ).96 700 = 4979 00 00 The least umber of household that should be surveyed is about 5000. 38

PSet ----- Stats, Cocepts I Statistics CASE #: Small Sample Sie or/ad Ukow [MATH] For a radom variable ~ T ( ), where T ( ) is the t-distributio with the sample sie of ad the degrees of freedom k =. The mea E [ ] = 0, the variace k 6 Var[ ] = for k >, the skewess is 0 for k > 3, ad the kurtosis is for k k 4 k > 4. Whe variace for,,, is ukow, the ubiased variace estimator is: That is, the epected value of s is s = i i= ( ) E[ s ] = S N S = i where ( ) N i= hypergeometric distributio,, which is the populatio stadard error. For the s is s s N = N Whe the 0% rule is satisfied, the stadard error of is s s Note that the sample stadard deviatio estimator s = i i= ( ) is a biased estimator of the populatio stadard error S. 39

PSet ----- Stats, Cocepts I Statistics Whe the sample sie is less tha 30, ad/or is ukow, similar to use ormal distributio, the mea estimator ca be studetied by the t-distributio µ ~ T ( ) s ad the cofidece level is the domai for µ that is determied by µ Pr > t s = The CI is s s CI = t, + t Or, i the iterval otatio: s t < µ < + t s I the form of margi of error t s : ± t s [PROCEDURE] Cofidece Iterval for a Mea with Small Sample Sie ad/or Ukow Variace The coditios to obtai CI for the sample mea (statistic) with sample sie are.) The sample is a idepedet radom sample..) The sample sie is less tha 0% of populatio. 3.) < 30 ad/or is ukow. 4.) The CI is costructed from the sample statistic for a give sigificace level : 40

PSet ----- Stats, Cocepts I Statistics s CI = t, + t s Where t s s = i i= ( ) is the term of margi of error (MoE). [Ti-84] Cofidece Iterval for a Mea with Small Sample Sie ad/or Ukow Variace.) STAT -> TESTS ->8. TIterval, select Stats if the statistic is give..) Iput, s,,. Eample 7.3.4. The average (mea) travel time from home to school for a sample of 8 THS teachers was 4.3 miles. The stadard deviatio of their travel time was miles. Fid the 95% cofidece iterval of true mea or populatio mea. Use the calculator to verify the results. Solutio: = 8 < 30 ad the is ukow. s =, = 4. 3, = 0. 05, the umber of degrees of 7 freedom is k = 8 = 7, t = =. 05, CI =? t 4

PSet ----- Stats, Cocepts I Statistics 4.3.05 < µ < 4.3 +.05 8 8 or 3.5 < µ < 5. CI = ( 3.5, 5.) Eample 7.3.5. [MC008] A test egieer wats to estimate the mea gas mileage µ (i miles per gallo) for a particular model of automobile. Eleve of these cars are subjected to a road test, ad the gas mileage is computed for each car. A dotplot of the gas-mileage values is roughly symmetrical ad has o outliers. The mea ad stadard deviatio of these values are 5.5 ad 3.0, respectively. Assumig that these automobiles ca be cosidered a simple radom sample of cars of this model, which of the followig is a correct statemet? 3.0 (A) A 95% cofidece iterval for µ is 5.5 ±.8. 3.0 (B) A 95% cofidece iterval for µ is 5.5 ±.0. 3.0 (C) A 95% cofidece iterval for µ is 5.5 ±.8. 0 3.0 (D) A 95% cofidece iterval for µ is 5.5 ±.0. 0 (E) The results caot be trusted; the sample is too small. Solutio: The aswer is A. 0 3.0 = = k = df = t ± 5.5,, 0,.8, 5.5.8 0.05 4

PSet ----- Stats, Cocepts I Statistics Eample 7.3.6. [FRQ30] A evirometal group coducted a study to determie whether crows i a certai regio were igestig food cotaiig uhealthy levels of lead. A biologist classified lead levels greater tha 6.0 parts per millio (ppm) as uhealthy. The lead levels of a radom sample of 3 crows i the regio were measured ad recorded. The data are show i the stemplot below. a.) What proportio of crows i the sample had lead levels that are classified by the biologist as uhealthy? b.) The mea lead level of the 3 crows i the sample was 4.90 ppm ad the stadard deviatio was. ppm. Costruct ad iterpret a 95 percet cofidece iterval for the mea lead level of crows i the regio. Solutio: 4 a.) p ˆ = = 0.74 3 df b.) = 4.90, s =., = 3 df = 3 =, t = t =.069 s. ± t = 4.90 ±.069 = 4.90 ± 0.483 3 CI = (4.47, 5.383) We ca be 95% cofidet that the populatio mea lead level amog all crows i this regio is betwee 4.46 ad 5.384 parts per millio. 0.05 43

PSet ----- Stats, Cocepts I Statistics Eample 7.3.7. For each of the followig problems of fidig cofidece iterval of populatio mea from oe-sample mea, fid the idicated variable. Assume that the sample is idepedet ad the sample sie is less tha 0% of the populatio. Case # s df Cof. Iterval Margi of error 0. 05 5 50 - - 0.0-7000 - - - 00 3 0. 05 5 0-4 - 5-00 - 30 Solutio: Case s df Cof. Iterval Margi of # error 0. 05 5 50 - - (4.446, 5.554) 0.554 0.0-89 7000 - - - 00 3 0. 05 5 0-9 (4.065, 5.935) 0.935 4 0.65-5 - 00-30 Case #: 30 = 0.05, * =.95, Case #: = 0.0, * =.576,, so use oe-sample Z-distributio for the CI. ± * = 5 ±.96 5 ± 0.554 or CI = (4.446, 5.554) 50.576(7000).576 * = 00 = 88.8 00 Case #3: < 30, so, use t-distributio for the CI. = 0.0, df = = 9, t = ivt (0.975,9).09, s ± t = 5 ±.09 5 ± 0.935, or CI = (4.065, 5.935) 0 Case #4: < 30, so, use t-distributio for the CI. t 00 = 30 t.6, = tcdf (.6,00,4) 0.3 0.65 5, so 89 44

PSet ----- Stats, Cocepts I Statistics Quick-Check 7.3.. Cofidece Iterval for a Mea QC 7.3... [MC06] A quality cotrol ispector must verify whether a machie that packages sack foods is workig correctly. The ispector will radomly select a sample of packages ad weigh the amout of sack food i each. Assume that the weights of food i packages filled by the machie have a stadard deviatio of 0.30 ouce. A estimate of the mea amout of sack food i each package must be reported with 99.6 percet cofidece ad a margi of error of o more tha 0. ouce. What would be the miimum sample sie for the umber of packages the ispector must select? (A) 8 (B) 5 (C) 5 (D) 5 (E) 60 QC 7.3... [MC974] A radom sample of costs of repair jobs at a large muffler repair shop produces a mea of $7.95 ad a stadard deviatio of $4.03. If the sie of this sample is 40, which of the followig is a approimate 90 percet cofidece iterval for the average cost of a repair at this repair shop? (A) $7.95 ± $4.87 (B) $7.95 ± $6.5 (C) $7.95 ± $7.45 (D) $7.95 ± $30.8 (E) $7.95 ± $39.53 QC 7.3..3. [MC037] A simple radom sample procedure produces a sample mea,, of 5. A 95 percet cofidece iterval for the correspodig populatio mea is 5 ± 3. Which of the followig statemets must be true? (A) Niety-five percet of the populatio measuremets fall betwee ad 8. (B) Niety-five percet of the sample measuremets fall betwee ad 8. (C) If 00 samples were take, 95 of the sample meas would fall betwee ad 8. (D) P( 8 ) = 0.95 (E) If µ = 9, this of 5 would be ulikely to occur. 45

PSet ----- Stats, Cocepts I Statistics QC 7.3..4. [MC033] A egieer for the Allied Steel Compay has the resposibility of estimatig the mea carbo cotet of a particular day's steel output, usig a radom sample of 5 rods from that day's output. The actual populatio distributio of carbo cotet is ot kow to be ormal, but graphic displays of the egieer's sample results idicate that the assumptio of ormality is ot ureasoable. The process is ewly developed, ad there are o historical data o the variability of the process. I estimatig this day's mea carbo cotet, the primary reaso the egieer should use a t-cofidece iterval rather tha a -cofidece iterval is because the egieer (A) is estimatig the populatio mea usig the sample mea. (B) is usig the sample variace as a estimate of the populatio variace. (C) is usig data, rather tha theory, to judge that the carbo cotet is ormal. (D) is usig data from a specific day oly. (E) has a small sample, ad a -cofidece iterval should ever be used with a small sample. 46

PSet ----- Stats, Cocepts I Statistics Aswers QC 7.3... D. Assume that the coditios to use ormal distributio are satisfied. = 0.3, = 0.004, =.878 4.03 QC 7.3... B..64 6.5 40 0.00 0.30.878 0. 5.7 QC 7.3..3. E. The populatio mea is out of CI. QC 7.3..4. B. The variace is ukow. 47