Chapter 13, Part A Analysis of Variance and Experimental Design

Similar documents
Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Common Large/Small Sample Tests 1/55

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

Final Examination Solutions 17/6/2010

Chapter 6 Sampling Distributions

Properties and Hypothesis Testing

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Linear Regression Models

Topic 9: Sampling Distributions of Estimators

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Definitions of terms and examples. Experimental Design. Sampling versus experiments. For each experimental unit, measures of the variables of

Sample Size Determination (Two or More Samples)

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators

Power and Type II Error

LESSON 20: HYPOTHESIS TESTING

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

University of California, Los Angeles Department of Statistics. Hypothesis testing

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Statistical Inference About Means and Proportions With Two Populations

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Statistics. Chapter 10 Two-Sample Tests. Copyright 2013 Pearson Education, Inc. publishing as Prentice Hall. Chap 10-1

Comparing your lab results with the others by one-way ANOVA

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

1 Inferential Methods for Correlation and Regression Analysis

This is an introductory course in Analysis of Variance and Design of Experiments.

Lesson 2. Projects and Hand-ins. Hypothesis testing Chaptre 3. { } x=172.0 = 3.67

Multiple Comparisons Examples STAT 314

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Regression, Inference, and Model Building

Stat 200 -Testing Summary Page 1

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

GG313 GEOLOGICAL DATA ANALYSIS

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

A statistical method to determine sample size to estimate characteristic value of soil parameters

Frequentist Inference

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued)

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Math 140 Introductory Statistics

Chapter two: Hypothesis testing

A Statistical hypothesis is a conjecture about a population parameter. This conjecture may or may not be true. The null hypothesis, symbolized by H

Day 8-3. Prakash Balachandran Department of Mathematics & Statistics Boston University. Friday, October 28, 2011

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

UCLA STAT 110B Applied Statistics for Engineering and the Sciences

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

1036: Probability & Statistics

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Additional Notes and Computational Formulas CHAPTER 3

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday

Statistical inference: example 1. Inferential Statistics

Chapter 4 Tests of Hypothesis

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random

6 Sample Size Calculations

y ij = µ + α i + ɛ ij,

Notes on Hypothesis Testing, Type I and Type II Errors

UCLA STAT 110B Applied Statistics for Engineering and the Sciences

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Chapter 22: What is a Test of Significance?

MA238 Assignment 4 Solutions (part a)

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

Describing the Relation between Two Variables

Stat 139 Homework 7 Solutions, Fall 2015

Sampling Error. Chapter 6 Student Lecture Notes 6-1. Business Statistics: A Decision-Making Approach, 6e. Chapter Goals

Lecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS

Correlation Regression

Stat 319 Theory of Statistics (2) Exercises

Lecture 7: Non-parametric Comparison of Location. GENOME 560 Doug Fowler, GS

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

5. A formulae page and two tables are provided at the end of Part A of the examination PART A

Lecture 9: Independent Groups & Repeated Measures t-test

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

Dr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines)

Chapter 5: Hypothesis testing

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

Biostatistics for Med Students. Lecture 2

Data Analysis and Statistical Methods Statistics 651

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

Agenda: Recap. Lecture. Chapter 12. Homework. Chapt 12 #1, 2, 3 SAS Problems 3 & 4 by hand. Marquette University MATH 4740/MSCS 5740

SIMPLE LINEAR REGRESSION AND CORRELATION ANALYSIS


Testing Statistical Hypotheses for Compare. Means with Vague Data

11 Correlation and Regression

Transcription:

Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of k Populatio Meas Multiple Compariso Procedures Slide Itroductio to Aalysis of Variace Aalysis of Variace (ANOVA) ca be used to test for the equality of three or more populatio meas. Data obtaied from observatioal or eperimetal studies ca be used for the aalysis. We wat to use the sample results to test the followig hypotheses: H 0 : 1 = = 3 =... = k H a : Not all populatio meas are equal Slide 3 1

Itroductio to Aalysis of Variace H 0 : 1 = = 3 =... = k H a : Not all populatio meas are equal If H 0 is reected, we caot coclude that all populatio meas are differet. Reectig H 0 meas that at least two populatio meas have differet values. Slide 4 Itroductio to Aalysis of Variace Samplig Distributio of Give H 0 is True Sample meas are close together because there is oly oe samplig distributio whe H 0 is true. σ σ = 1 3 Slide 5 Itroductio to Aalysis of Variace Samplig Distributio of Give H 0 is False Sample meas come from differet samplig distributios ad are ot as close together whe H 0 is false. 3 1 1 3 Slide 6

Assumptios for Aalysis of Variace For each populatio, the respose variable is ormally distributed. The variace of the respose variable, deoted σ, is the same for all of the populatios. The observatios must be idepedet. Slide 7 Aalysis of Variace: Testig for the Equality of k Populatio Meas Betwee-Treatmets Estimate of Populatio Variace Withi-Treatmets Estimate of Populatio Variace Comparig the Variace Estimates: The F Test ANOVA Table Slide 8 Betwee-Treatmets Estimate of Populatio Variace A betwee-treatmet estimate of σ is called the mea square treatmet ad is deoted MSTR. MSTR = k ( ) = 1 k 1 Deomiator represets the degrees of freedom associated with SSTR Numerator is the sum of squares due to treatmets ad is deoted SSTR Slide 9 3

Withi-Samples Estimate of Populatio Variace The estimate of σ based o the variatio of the sample observatios withi each sample is called the mea square error ad is deoted by MSE. MSE = k =1 ( 1) s k T Deomiator represets the degrees of freedom associated with SSE Numerator is the sum of squares due to error ad is deoted SSE Slide 10 Comparig the Variace Estimates: The F Test If the ull hypothesis is true ad the ANOVA assumptios are valid, the samplig distributio of MSTR/MSE is a F distributio with MSTR d.f. equal to k - 1 ad MSE d.f. equal to T - k. If the meas of the k populatios are ot equal, the value of MSTR/MSE will be iflated because MSTR overestimates σ. Hece, we will reect H 0 if the resultig value of MSTR/MSE appears to be too large to have bee selected at radom from the appropriate F distributio. Slide 11 Test for the Equality of k Populatio Meas Hypotheses Test Statistic H 0 : 1 = = 3 =... = k H a : Not all populatio meas are equal F = MSTR/MSE Slide 1 4

Test for the Equality of k Populatio Meas Reectio Rule p-value Approach: Reect H 0 if p-value < α Critical Value Approach: Reect H 0 if F > F α where the value of F α is based o a F distributio with k - 1 umerator d.f. ad T - k deomiator d.f. Slide 13 Samplig Distributio of MSTR/MSE Reectio Regio Samplig Distributio of MSTR/MSE Reect H 0 Do Not Reect H 0 α F α Critical Value MSTR/MSE Slide 14 ANOVA Table Source of Variatio Treatmet Error Total Sum of Squares SSTR SSE SST Degrees of Freedom k 1 T k T -1 Mea Squares MSTR MSE F MSTR/MSE SST is partitioed ito SSTR ad SSE. SST s degrees of freedom (d.f.) are partitioed ito SSTR s d.f. ad SSE s d.f. Slide 15 5

ANOVA Table SST divided by its degrees of freedom T 1 is the overall sample variace that would be obtaied if we treated the etire set of observatios as oe data set. With the etire data set as oe sample, the formula for computig the total sum of squares, SST, is: k SST = ( i ) = SSTR + SSE = 1 i = 1 Slide 16 ANOVA Table ANOVA ca be viewed as the process of partitioig the total sum of squares ad the degrees of freedom ito their correspodig sources: treatmets ad error. Dividig the sum of squares by the appropriate degrees of freedom provides the variace estimates ad the F value used to test the hypothesis of equal populatio meas. Slide 17 Test for the Equality of k Populatio Meas Eample: Reed Maufacturig Jaet Reed would like to kow if there is ay sigificat differece i the mea umber of hours worked per week for the departmet maagers at her three maufacturig plats (i Buffalo, Pittsburgh, ad Detroit). Slide 18 6

Test for the Equality of k Populatio Meas Eample: Reed Maufacturig A simple radom sample of five maagers from each of the three plats was take ad the umber of hours worked by each maager for the previous week is show o the et slide. Coduct a F test usig α =.05. Slide 19 Test for the Equality of k Populatio Meas Observatio 1 3 4 5 Sample Mea Sample Variace Plat 1 Buffalo 48 54 57 54 6 Plat Pittsburgh 73 63 66 64 74 Plat 3 Detroit 51 63 61 54 56 55 68 57 6.0 6.5 4.5 Slide 0 Test for the Equality of k Populatio Meas p -Value ad Critical Value Approaches 1. Develop the hypotheses. H 0 : 1 = = 3 H a : Not all the meas are equal where: 1 = mea umber of hours worked per week by the maagers at Plat 1 = mea umber of hours worked per week by the maagers at Plat 3 = mea umber of hours worked per week by the maagers at Plat 3 Slide 1 7

Test for the Equality of k Populatio Meas p -Value ad Critical Value Approaches. Specify the level of sigificace. α =.05 3. Compute the value of the test statistic. Mea Square Due to Treatmets (Sample sizes are all equal.) = (55 + 68 + 57)/3 = 60 SSTR = 5(55-60) + 5(68-60) + 5(57-60) = 490 MSTR = 490/(3-1) = 45 Slide Test for the Equality of k Populatio Meas p -Value ad Critical Value Approaches 3. Compute the value of the test statistic. (cotiued) Mea Square Due to Error SSE = 4(6.0) + 4(6.5) + 4(4.5) = 308 MSE = 308/(15-3) = 5.667 F = MSTR/MSE = 45/5.667 = 9.55 Slide 3 Test for the Equality of k Populatio Meas ANOVA Table Source of Variatio Sum of Squares Degrees of Freedom Mea Squares F Treatmet Error Total 490 308 798 1 14 45 5.667 9.55 Slide 4 8

Test for the Equality of k Populatio Meas p Value Approach 4. Compute the p value. With umerator d.f. ad 1 deomiator d.f., the p-value is.01 for F = 6.93. Therefore, the p-value is less tha.01 for F = 9.55. 5. Determie whether to reect H 0. The p-value <.05, so we reect H 0. We have sufficiet evidece to coclude that the mea umber of hours worked per week by departmet maagers is ot the same at all 3 plat. Slide 5 Test for the Equality of k Populatio Meas Critical Value Approach 4. Determie the critical value ad reectio rule. Based o a F distributio with umerator d.f. ad 1 deomiator d.f., F.05 = 3.89. Reect H 0 if F > 3.89 5. Determie whether to reect H 0. Because F = 9.55 > 3.89, we reect H 0. We have sufficiet evidece to coclude that the mea umber of hours worked per week by departmet maagers is ot the same at all 3 plat. Slide 6 Multiple Compariso Procedures Suppose that aalysis of variace has provided statistical evidece to reect the ull hypothesis of equal populatio meas. Fisher s least sigificat differece (LSD) procedure ca be used to determie where the differeces occur. Slide 7 9

Fisher s LSD Procedure Hypotheses H : 0 i H : H a i Test Statistic t = i MSE( 1 1 + ) i Slide 8 Reectio Rule p-value Approach: Fisher s LSD Procedure Critical Value Approach: Reect H 0 if p-value < α Reect H 0 if t < -t a/ or t > t a/ where the value of t a/ is based o a t distributio with T - k degrees of freedom. Slide 9 Hypotheses Fisher s LSD Procedure Based o the Test Statistic i - H 0 : i H : H a i Test Statistic i Reectio Rule Reect H 0 if i > LSD where LSD = t / MSE( 1 1 α + ) i Slide 30 10

Fisher s LSD Procedure Based o the Test Statistic i - Eample: Reed Maufacturig Recall that Jaet Reed wats to kow if there is ay sigificat differece i the mea umber of hours worked per week for the departmet maagers at her three maufacturig plats. Aalysis of variace has provided statistical evidece to reect the ull hypothesis of equal populatio meas. Fisher s least sigificat differece (LSD) procedure ca be used to determie where the differeces occur. Slide 31 Fisher s LSD Procedure Based o the Test Statistic i - For α =.05 ad T - k = 15 3 = 1 degrees of freedom, t.05 =.179 LSD = t / MSE( 1 1 α + ) LSD =. 179 5. 667 ( 1 5 + 1 5 ) = 698. i MSE value was computed earlier Slide 3 Fisher s LSD Procedure Based o the Test Statistic i - LSD for Plats 1 ad Hypotheses (A) Reectio Rule Reect H 0 if > 6.98 Test Statistic = 55 68 = 13 1 H 0 : 1 H : H a 1 1 Coclusio The mea umber of hours worked at Plat 1 is ot equal to the mea umber worked at Plat. Slide 33 11

Fisher s LSD Procedure Based o the Test Statistic i - LSD for Plats 1 ad 3 Hypotheses (B) Reectio Rule Test Statistic = 55 57 = Coclusio Reect H 0 if > 6.98 1 3 H 0 : 1 3 H : H a 1 3 1 3 There is o sigificat differece betwee the mea umber of hours worked at Plat 1 ad the mea umber of hours worked at Plat 3. Slide 34 Fisher s LSD Procedure Based o the Test Statistic i - LSD for Plats ad 3 Hypotheses (C) Reectio Rule Test Statistic = 68 57 = 11 Coclusio Reect H 0 if > 6.98 3 H 0 : 3 H : H a 3 3 The mea umber of hours worked at Plat is ot equal to the mea umber worked at Plat 3. Slide 35 Type I Error Rates The comparisowise Type I error rate α idicates the level of sigificace associated with a sigle pairwise compariso. The eperimetwise Type I error rate α EW is the probability of makig a Type I error o at least oe of the (k 1)! pairwise comparisos. α EW = 1 (1 α) (k 1)! The eperimetwise Type I error rate gets larger for problems with more populatios (larger k). Slide 36 1

Ed of Chapter 13, Part A Slide 37 13