Battery Life. Factory

Similar documents
Sociology 6Z03 Review II

Unit 27 One-Way Analysis of Variance

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College

MATH Chapter 21 Notes Two Sample Problems

Example: Four levels of herbicide strength in an experiment on dry weight of treated plants.

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

Average weight of Eisenhower dollar: 23 grams

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College

Chap The McGraw-Hill Companies, Inc. All rights reserved.

ANOVA: Analysis of Variation

Elementary Statistics for the Biological and Life Sciences

STAT Chapter 10: Analysis of Variance

Inferences for Regression

Stat 217 Final Exam. Name: May 1, 2002

What If There Are More Than. Two Factor Levels?

One-way ANOVA. Experimental Design. One-way ANOVA

Multiple comparisons - subsequent inferences for two-way ANOVA

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

Lec 5: Factorial Experiment

Stat 427/527: Advanced Data Analysis I

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

Inference for the Regression Coefficient

SMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning

IEOR165 Discussion Week 12

Analysis of variance (ANOVA) Comparing the means of more than two groups

Factorial Analysis of Variance

Statistiek II. John Nerbonne using reworkings by Hartmut Fitz and Wilbert Heeringa. February 13, Dept of Information Science

Research Methods II MICHAEL BERNSTEIN CS 376

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.

Data Skills 08: General Linear Model

Table 1: Fish Biomass data set on 26 streams

CHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication

Factorial BG ANOVA. Psy 420 Ainsworth

Disadvantages of using many pooled t procedures. The sampling distribution of the sample means. The variability between the sample means

22s:152 Applied Linear Regression. Take random samples from each of m populations.

One-way Analysis of Variance. Major Points. T-test. Ψ320 Ainsworth

1 Introduction to One-way ANOVA

Lecture 14: ANOVA and the F-test

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA

16.3 One-Way ANOVA: The Procedure

Statistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

Variance Estimates and the F Ratio. ERSH 8310 Lecture 3 September 2, 2009

Week 7.1--IES 612-STA STA doc

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Two-Sample Inferential Statistics

Chemometrics. Matti Hotokka Physical chemistry Åbo Akademi University

Power & Sample Size Calculation

Difference in two or more average scores in different groups

3. Factorial Experiments (Ch.5. Factorial Experiments)

Factorial Treatment Structure: Part I. Lukas Meier, Seminar für Statistik

The Multiple Regression Model

Correlation Analysis

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

Ch 11- One Way Analysis of Variance

STAT Chapter 11: Regression

Analysis of Variance (ANOVA)

Mathematics for Economics MA course

Analysis Of Variance Compiled by T.O. Antwi-Asare, U.G

Repeated Measures Analysis of Variance

Ch18 links / ch18 pdf links Ch18 image t-dist table

CHAPTER 13: F PROBABILITY DISTRIBUTION

Chapter 11 - Lecture 1 Single Factor ANOVA

Note: k = the # of conditions n = # of data points in a condition N = total # of data points

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

1 Use of indicator random variables. (Chapter 8)

A discussion on multiple regression models

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:


Lecture 3: Inference in SLR

Multiple t Tests. Introduction to Analysis of Variance. Experiments with More than 2 Conditions

20.0 Experimental Design

One-Way Analysis of Variance (ANOVA)

Statistik för bioteknik sf2911 Föreläsning 15: Variansanalys

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs)

Group comparison test for independent samples

One-way ANOVA (Single-Factor CRD)

Statistics For Economics & Business

1 One-way Analysis of Variance

Recall that a measure of fit is the sum of squared residuals: where. The F-test statistic may be written as:

One-Way Analysis of Variance (ANOVA) Paul K. Strode, Ph.D.

Chapter 10: Analysis of variance (ANOVA)

One-way ANOVA Inference for one-way ANOVA

Chapter 23: Inferences About Means

Basic Business Statistics, 10/e

Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data

Analysis of Variance: Repeated measures

Regression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont.

Stat 500 Midterm 2 8 November 2007 page 0 of 4

Statistical Modelling in Stata 5: Linear Models

Lec 1: An Introduction to ANOVA

STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis

4.1. Introduction: Comparing Means

STA 4210 Practise set 2a

Statistics for EES Factorial analysis of variance

Cuckoo Birds. Analysis of Variance. Display of Cuckoo Bird Egg Lengths

Analysis of Variance (ANOVA)

Chapter 3 Multiple Regression Complete Example

Transcription:

Statistics 354 (Fall 2018) Analysis of Variance: Comparing Several Means Remark. These notes are from an elementary statistics class and introduce the Analysis of Variance technique for comparing several population means. They are meant as both a review and a reminder of how ANOVA works. In STAT 354, we will use an Analysis of Variance technique to assess the strength of the linear regression relationship which has the advantage that it can be extended quite easily from simple linear regression to multiple linear regression. Recall that the two sample t-test allows us to compare the means of two independent populations based on data from two independent samples. Suppose that we are interested in comparing the means from I 2 independent populations based on I independent simple random samples from those populations, i.e., we want to compare the means of more than two populations. The technique is called analysis of variance, or more compactly, ANOVA. Example. A particular brand of battery is manufactured in one of four factories. Ideally, the mean life of the batteries should be the same from each factory. Battery life (in weeks) of random samples from each factory are as follows. Factory 1 Factory 2 Factory 3 Factory 4 102 107 98 99 96 102 98 97 92 103 102 91 95 95 105 96 94 97 94 95 101 100 97 90 97 99 99 95 96 101 96 95 92 98 96 mean 96.60 100.43 99.25 94.70 SD 3.06 3.99 3.37 2.83 sample size 10 7 8 10 Do the data indicate that the mean battery life is different between the factories? Boxplots show that there is variability within each factory, but there is also variability between factory means. Battery Life 90 95 100 105 1 2 3 4 Factory

Our goal is to test the global hypothesis H 0 : µ 1 = µ 2 = µ 3 = µ 4. We will accomplish this using analysis of variance. It is important to note that the probability of a Type I error (rejecting H 0 when H 0 is true) for performing all pairs of comparisons is much greater than for a single comparison. For instance, in this example where I = 4, there are 6 possibilities, namely Therefore, we have 6 opportunities to reject (i) H 0 : µ 1 = µ 2, (ii) H 0 : µ 1 = µ 3, (iii) H 0 : µ 1 = µ 4, (iv) H 0 : µ 2 = µ 3, (v) H 0 : µ 2 = µ 4, (vi) H 0 : µ 3 = µ 4. H 0 : µ 1 = µ 2 = µ 3 = µ 4. Suppose that the significance level for each of these tests is 0.05. The overall significance level α (i.e., the probability of rejecting at least once) is α = P (rejecting at least once) = 1 P (rejecting none) = 1 (0.95) 6 = 0.2649. Thus, we need to develop a more useful way of performing multiple comparisons. In general, statistical methods for dealing with multiple comparisons usually have two steps. (1) An overall test to see if there is good evidence of any differences among the parameters we want to compare. (2) A detailed follow-up analysis to decide which of the parameters differ and to estimate how large the differences are. ANOVA is designed for the first step. The t-test can then be used as part of the second step. The ANOVA F -test Suppose that we have an independent simple random sample from each of I populations and that the jth population has a normal N(µ j, σ 2 ) distribution, where σ is the common standard deviation in all the populations. (It is irrelevant whether or not σ is known. We do, however, require that σ be the same for each population.) Assume that the jth sample has size n j, sample mean x j, and sample standard deviation s j. In general, we can list our notation as follows. Population 1 2 I Sample x 11 x 21 x I1 x 12 x 22 x I2... x 1n1 x 2n2 x InI Mean x 1 x 2 x I Standard Deviation s 1 s 2 s I

Our goal is to make comparisons between µ 1, µ 2,..., µ I based on the observed data. Remark. If all the sample sizes are the same, we sometimes say that the design is balanced. Let N = n 1 + n 2 + + n I denote the total number of observations, and let x = 1 N n I j x jk = 1 N j=1 k=1 I n j x j j=1 denote the average of all observations taken together. (We sometimes call x the grand mean.) In order to test the hypothesis that all I populations have the same mean, H 0 : µ 1 = µ 2 = = µ I against H a : not all of the µ j are equal, we use the ANOVA F statistic given by where the mean square for groups is F = MSG MSE and the mean square for error is MSG = n 1(x 1 x) 2 + n 2 (x 2 x) 2 + + n I (x I x) 2 I 1 MSE = (n 1 1)s 2 1 + (n 2 1)s 2 2 + + (n I 1)s 2 I. N I When H 0 is true, the F statistic has the F distribution with I 1 numerator degrees of freedom and N I denominator degrees of freedom. That is, the degrees of freedom of the F statistic are a pair (I 1, N I) which are the same as the denominators of MSG and MSE, respectively. We usually call the numerators of MSG and MSE the sums of squares so that the sum of squares among groups (or sum of squares between groups) is SS(among) = n 1 (x 1 x) 2 + n 2 (x 2 x) 2 + + n I (x I x) 2 and the sum of squares within groups is SS(within) = (n 1 1)s 2 1 + (n 2 1)s 2 2 + + (n I 1)s 2 I. An ANOVA table lists all of this information. Software packages will always give you an ANOVA table containing the following information. Source df SS M S variation among groups I 1 SS(among) M SG = SS/df variation within groups N I SS(within) M SE = SS/df

Example (continued). The ANOVA table for the battery factories is as follows. Source df SS M S variation among groups 4 1 = 3 170.9 MSG = 57.0 variation within groups 35 4 = 31 331.4 MSE = 10.7 Thus, the F test statistic is F = MSG MSE = 57.0 10.7 = 5.33 and the degress of freedom are df = (3, 31). From a table, we find that the critical values corresponding to df = (3, 25) are so that the P -value satisfies 4.68 < 5.33 < 7.45 0.001 < P -value < 0.01. Thus, we can reject H 0 at the 1% level and conclude that this is strong evidence that the mean battery life differs between the four factories. Conditions for validity of ANOVA The ANOVA procedure is valid if the following conditions are satisfied. (i) The groups of observations can be regarded as random samples from their respective populations. That is, we have I independent SRSs, one from each of the I populations. In particular, observations within each group are independent, and observations across groups are independent. (ii) The jth population has a normal N(µ j, σ 2 ) distribution with unknown mean µ j. (iii) All I populations have the same standard deviation σ. Like the t-test, the ANOVA procedure is robust meaning that departures from normality are permitted as long as the sample sizes are sufficiently large. In fact, even for roughly symmetric distributions with no outliers, the ANOVA procedure can be used for a sample of size 4. Furthermore, the results of the ANOVA F test are approximately correct when the largest sample standard deviation is no more than twice as large as the smallest sample size. Intuition for ANOVA Variability in the observed data results from variation in the population means (if they are truly different) and from variation within each population itself.

Differences between sample means estimate differences in the populations means, but the observed differences may simply reflect the error in estimating µ 1,..., µ I due to sampling variation within a population. Conclude the population means are not all equal if the differences in x 1, x 2,..., x I are sufficiently large that it is unlikely that the difference is due to estimation error due to within population variation. Additional remarks Rejection of H 0 : µ 1 = µ 2 = = µ I does not allow one to determine which means are different. For that, a more detailed analysis is needed (such as the Newman-Keuls procedure). ANOVA can be used for I = 2 populations in which case it gives exactly the same results as the two-sample t test. Failure to reject H 0 does not mean that H 0 is true. It simply means that the differences in the µ j may be too small to detect given the available data.