ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

Similar documents
STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

MATH/STAT 352: Lecture 15

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Common Large/Small Sample Tests 1/55

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

Chapter 8: Estimating with Confidence

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Topic 9: Sampling Distributions of Estimators

(7 One- and Two-Sample Estimation Problem )

Comparing your lab results with the others by one-way ANOVA

Statistical Intervals for a Single Sample

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

Exam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.

Economics Spring 2015

Lecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63.

Topic 9: Sampling Distributions of Estimators

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Final Examination Solutions 17/6/2010

Stat 421-SP2012 Interval Estimation Section

Power and Type II Error

Math 140 Introductory Statistics

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

BIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov

Sample Size Determination (Two or More Samples)

Computing Confidence Intervals for Sample Data

Introducing Sample Proportions

1 Inferential Methods for Correlation and Regression Analysis

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Confidence Intervals

Sampling Error. Chapter 6 Student Lecture Notes 6-1. Business Statistics: A Decision-Making Approach, 6e. Chapter Goals

Topic 9: Sampling Distributions of Estimators

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

ENGI 4421 Probability and Statistics Faculty of Engineering and Applied Science Problem Set 1 Solutions Descriptive Statistics. None at all!

Statistics 511 Additional Materials

AP Statistics Review Ch. 8

Data Analysis and Statistical Methods Statistics 651

Module 1 Fundamentals in statistics

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Working with Two Populations. Comparing Two Means

This is an introductory course in Analysis of Variance and Design of Experiments.

Chapter 6 Sampling Distributions

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?

Unit 6 Estimation Week #10 - Practice Problems SOLUTIONS

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Properties and Hypothesis Testing

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

6 Sample Size Calculations

Frequentist Inference

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

Lesson 2. Projects and Hand-ins. Hypothesis testing Chaptre 3. { } x=172.0 = 3.67

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Statistics 300: Elementary Statistics

Confidence Intervals for the Population Proportion p

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008

Linear Regression Models

Expectation and Variance of a random variable

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

STAT431 Review. X = n. n )

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Stat 200 -Testing Summary Page 1

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date:

6.3 Testing Series With Positive Terms

Output Analysis (2, Chapters 10 &11 Law)

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

Introducing Sample Proportions

STAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)

Sampling Distributions, Z-Tests, Power

Probability and Statistics Estimation Chapter 7 Section 3 Estimating p in the Binomial Distribution

Chapter 13, Part A Analysis of Variance and Experimental Design

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Parameter, Statistic and Random Samples

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Statistical Analysis on Uncertainty for Autocorrelated Measurements and its Applications to Key Comparisons

Random Variables, Sampling and Estimation

Statistical inference: example 1. Inferential Statistics

Transcription:

ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly large sample sizes from two idepedet populatios of meas, ad variaces,, the sample meas are distributed as ~ N,, ~ N,, with ~ lso, for sufficietly large sample sizes, we may estimate ukow values of, by s, s. Example.0 large corporatio wishes to determie the effectiveess of a ew traiig techique. radom sample of 64 employees is tested after udergoig the ew traiig techique ad obtais a mea test score of 6. with a stadard deviatio of 5.. other radom sample of 00 employees, servig as a cotrol group, is tested after udergoig the old traiig methods. The cotrol group has a sample mea test score of 58.3 with a stadard deviatio of 6.30. (a) (b) Use a two-sided cofidece iterval to determie whether the ew traiig techique has led to a sigificat chage i test scores. Use a oe-sided cofidece iterval to determie whether the ew traiig techique has led to a sigificat icrease i test scores.

ENGI 44 Cofidece Itervals (Two Samples) Page -0 Example.0 (cotiued) If a two-sided 00% cofidece iterval does ot iclude a value c, the the appropriate oe-sided 00% cofidece iterval will ot iclude c. If a oe-sided 00% CI icludes c, the the two-sided 00% CI icludes c.

ENGI 44 Cofidece Itervals (Two Samples) Page -03 Two Sample Cofidece Iterval for a Differece i Populatio Proportios [for bous questios oly] We kow that whe a radom sample cosists of idepedet eroulli radom quatities, with a probability p of success i each trial, the umber of successes i the radom sample follows a biomial distributio with p ad pq. Provided the expected umbers of successes (p) ad failures (q) are both more tha 0, the sample proportio P of successes i the radom sample follows a ormal distributio to a good approximatio: P ~ N p, pq If we draw radom samples of sizes ad (both large) from two idepedet populatios whose true probabilities of success are p ad p, the the differece i sample proportios is p q p q P P ~ N p p, This allows us to costruct cofidece itervals for the ukow p p. The traditioal approach util the 990 s was to replace the ukow values of p ad p i the expressio for the variace by the sample proportios, leadig to the 00% cofidece iterval pˆ ˆ ˆ ˆ q p q pˆ ˆ p z / However, simulatios show that this cofidece iterval ca be very iaccurate for sample sizes of about 30 or less. more accurate iterval (the gresti-caffo iterval) ca be achieved by addig oe to the observed umbers of successes, x* x ad y* y ad addig two to the sample sizes, * ad *. x* y* It the follows that p*, q* p*, p*, q* p* * * The two-sided 00% cofidece iterval for p p is therefore p* q* p* q* p* p* z / * * If the lower boud is below, the replace it by. If the upper boud is above +, the replace it by +. This iterval works well eve for small sample sizes.

ENGI 44 Cofidece Itervals (Two Samples) Page -04 Example.0 [ot examiable except for bous] radom sample of 5 compoets (produced by oe machie) yields 5 compoets that are loger tha 0.0 cm. other radom sample of 30 compoets (produced by aother machie) yields compoets that are loger tha 0.0 cm. Costruct a 95% cofidece iterval for the differece i populatio proportios of compoets that are loger tha 0.0 cm. Ca oe coclude that the two populatio proportios are differet? Usig the gresti-caffo 95% cofidece iterval for p p, x 6 x 5, 5 p * q * 7 7 3 9 x, 30 p* q* 3 3 6 3 6 p* p*.8634 7 3 864 leadig to the stadard error s p * q * p * q * 6 39 *.837 3 3 * * 7 3 z.05.959 Therefore the 95% CI is p * p * z s.86.959.8.86.5.05 Correct to 3 s.f., the 95% CI is.065 p p.438 This cofidece iterval icludes zero. Therefore p p is plausible from these data. NO, we caot coclude that the two populatio proportios are differet, (eve though the sample proportios differ by 0%!) We eed much larger sample sizes i order for a 0% differece i sample proportios to be sigificat. [Note: the traditioal 95% CI is.00 ±.60 = [.060, +.460] ]

ENGI 44 Cofidece Itervals (Two Samples) Page -05 Example.03 [ot examiable except for bous] Suppose that radom samples of equal size are draw from two idepedet populatios. Fid the smallest value of for which a observed differece i sample proportios of 0% would allow us to coclude, at a level of cofidece of 95%, that the populatio proportios are differet. The expressio for the stadard error i ˆ ˆ pq pq p q p q P P is where q p ad q p. The maximum possible value of occurs whe both p p ad p p are as large as they ca be. y x x x x is a quadratic fuctio whose uique maximum occurs whe ut dy x 0 dx, that is, at x. the maximum value of occurs whe p q p q : With pˆ ˆ p.0, the lower boud of the traditioal 95% cofidece iterval for p p is.0.95.0.95 For the CI to be guarateed to exclude zero,.95.0.95 0.0.95 9.5.0 9.5 9.07 Therefore a miimum sample size of 93 items from each populatio will guaratee that a observed differece of sample proportios of 0% will geerate a 95% cofidece iterval for p p that excludes zero. With such large sample sizes, the gresti-caffo CI will be very close to the traditioal CI.

ENGI 44 Cofidece Itervals (Two Samples) Page -06 Small Sample Cofidece Iterval for the Differece betwee Two Meas Suppose,,, is a radom sample of size draw from a populatio ~ N, ad that,,, is a radom sample of size draw from a populatio ~ N,, with idepedet of, the Z ~ N 0, If at least oe of the sample sizes is small (less tha 30 or so), the the cetral limit theorem caot be ivoked. Z will be ormal (exactly or to a acceptable approximatio) oly if both populatios are exactly or early ormal. If, i additio, both populatio variaces are kow, the Z follows the stadard ormal distributio (exactly or approximately) o matter how small the sample sizes may be. However, if the populatio variaces are ot kow, the they must be replaced by their poit estimators from the data (the sample variaces): T S S The additioal ucertaity, (itroduced by the fact that S ad S are themselves radom quatities), results i T followig Studet s t-distributio istead of the stadard ormal distributio. For ay particular pair of radom samples, the umber of degrees of freedom is s s INT s s

ENGI 44 Cofidece Itervals (Two Samples) Page -07 Example.04 ivestigator wats to kow which of two electric toasters has the greater ability to resist the abormally high electrical currets that occur durig a uprotected power surge. Radom samples of six toasters from factory ad five toasters from factory were subjected to a destructive test, i which each toaster was subjected to icreasig currets util it failed. The distributio of currets at failure (measured i amperes) is kow to be approximately ormal for both products. The results are as follows: Factory : 0 8 4 6 3 6 Factory : 8 9 7 (a) State the assumptios that you are makig. (b) Costruct a 95% cofidece iterval for the mea differece i failure currets. (c) Is there ay sigificat differece betwee the failure currets of the two types of toaster? (a) Give i the questio: ssumptio: (b) The summary statistics are 6 x 4.5 5 x 9.4 s 7.9 s 4.3 s.36 s 0.86 s s s. e..76 s. e..475353 The most tedious part of this set of calculatios is for the umber of degrees of freedom:

ENGI 44 Cofidece Itervals (Two Samples) Page -08 Example.04 (cotiued) s s INT s s.76 INT INT 8.9 8.36 0.86 5 4 For a 95% two-sided cofidece iterval with 8 degrees of freedom, we eed t.05, 8.30600 x x t s e.05, 8.. 4.5 9.4.30600.475353 5. 3.40 Therefore the 95% CI for is, correct to d.p.,.7, 8.5 Excel spreadsheet for this example is available at " www.egr.mu.ca/~ggeorge/44/demos/cismall_.xls " The textbooks (Navidi sectio 5.6; Devore sectio 9.) provide a alterative more precise cofidece iterval that may be used oly whe the populatio variaces ca safely be assumed to be equal ( pooled t procedures ). We shall ot employ that iterval i this course.

ENGI 44 Cofidece Itervals (Two Samples) Page -09 Cofidece Itervals for Paired Data Example.05 Nie voluteers are tested before ad after a traiig programme. ased o the data below, ca you coclude that the programme has improved test scores? Voluteer: 3 4 5 6 7 8 9 fter traiig: 75 66 69 45 54 85 58 9 6 efore traiig: 7 65 64 39 5 85 5 9 58 Let = score after traiig ad = score before traiig. INCORRECT METHOD: The summary statistics are 9, x 67., x 64., s s 5.94 3.993 943.5 943.5 9 8 546.5 s 546.5 9 8 s 8.94 3.438 s s 4490 s. e. 55.43 s. e. 7.44573 8 s s 55.43 INT INT s s 3.993 3.438 8 8 = INT(5.7...) = 5.050,5.75305 t We eed a oe-sided CI because we are lookig for evidece of a icrease, ot a chage.

ENGI 44 Cofidece Itervals (Two Samples) Page -0 Example.05 (cotiued) The lower boudary of the 95% oe-sided CI for is x x t s e.050,5.. 67. 64..75305 7.44573 3.0 3.05 We are 95% sure that 0.. Clearly zero is a very plausible value. We would coclude that the data are cosistet with ad therefore there is owhere ear eough evidece to coclude that the traiig has improved test scores. ut look agai at the test scores: Voluteer: 3 4 5 6 7 8 9 fter traiig: 75 66 69 45 54 85 58 9 6 efore traiig: 7 65 64 39 5 85 5 9 58 There is a clear patter of icreases from the before score to the after score for most idividuals. There is cosiderable variatio betwee idividuals, which is swampig the idividual icreases. The error i the aalysis above is that CORRECT METHOD: The remedy is to explore oly the differeces, ot the two sets of scores. Voluteer: 3 4 5 6 7 8 9 Differece d : Now we have a sigle sample, from which to costruct a oe-sided cofidece iterval for the true mea differece. D 933 7 3 D 9, d 3.0, s 6.5 sd.549 98 s D se.. 0.849836 t.050,8.85955 D d t.050, 8 s. e. 3.0.85 0.84 3.0.58.4

ENGI 44 Cofidece Itervals (Two Samples) Page - Paired vs. Upaired Cofidece Itervals Whe the sample sizes are equal, the questio arises as to which type of cofidece iterval to use: two-sample (upaired) or the iterval based o differeces (paired). If the samples are pairs of observatios of two differet effects o the same set of idividuals, the idepedece betwee the populatios is ulikely ad oe should use the paired cofidece iterval. Otherwise use the upaired cofidece iterval. The paired cofidece iterval is valid eve if the two populatios are strogly correlated, whereas the upaired cofidece iterval is based o the assumptio that the two populatios are idepedet (or at least ucorrelated). Example.06 (Navidi textbook, exercises 5.7, page 373, questio 6) sample of 0 diesel trucks were ru both hot ad cold to estimate the differece i fuel ecoomy. The results, i miles per gallo, are preseted i the followig table. (from I-use Emissios from Heavy-Duty Diesel Vehicles, J. aowitz, Ph.D. thesis, Colorado School of Mies, 00.) Truck Hot Cold 4.56 4.6 4.46 4.08 3 6.49 5.83 4 5.37 4.96 5 6.5 5.87 6 5.90 5.3 7 4. 3.9 8 3.85 3.69 9 4.5 3.74 0 4.69 4.9 Fid a 98% cofidece iterval for the differece i mea fuel mileage betwee hot ad cold egies. Choice of type of cofidece iterval:

ENGI 44 Cofidece Itervals (Two Samples) Page - Example.06 (cotiued) Summary statistics: Truck Hot Cold Differece 4.56 4.6 0.30 4.46 4.08 0.38 3 6.49 5.83 0.66 4 5.37 4.96 0.4 5 6.5 5.87 0.38 6 5.90 5.3 0.58 7 4. 3.9 0.0 8 3.85 3.69 0.6 9 4.5 3.74 0.4 0 4.69 4.9 0.50 d d 0, 3.98,.806 d 3.98 d 0.398, 0 d d 0.806 3.98.856 sd 09 90 0.04 84 sd 0.5 sd 0.55834 s. e. 0.049 79 0 [Ed of Chapter ]