STAT Chapter 10: Analysis of Variance

Similar documents
1. The (dependent variable) is the variable of interest to be measured in the experiment.

CHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication

ANOVA: Comparing More Than Two Means

We need to define some concepts that are used in experiments.

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

One-Way Analysis of Variance (ANOVA)

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

1 The Randomized Block Design

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

Chapter 10. Design of Experiments and Analysis of Variance

Statistics For Economics & Business

STAT 115:Experimental Designs

1 Introduction to One-way ANOVA

Unit 27 One-Way Analysis of Variance

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College

ANOVA CIVL 7012/8012

Lecture 28 Chi-Square Analysis

16.3 One-Way ANOVA: The Procedure

Inference for Regression Simple Linear Regression

Basic Business Statistics, 10/e

Battery Life. Factory

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr.

Chapter 15: Analysis of Variance

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

Analysis Of Variance Compiled by T.O. Antwi-Asare, U.G

Analysis of Variance. Read Chapter 14 and Sections to review one-way ANOVA.

Chapter 8 Student Lecture Notes 8-1. Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance

Chapter 11 - Lecture 1 Single Factor ANOVA

EX1. One way ANOVA: miles versus Plug. a) What are the hypotheses to be tested? b) What are df 1 and df 2? Verify by hand. , y 3

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Design & Analysis of Experiments 7E 2009 Montgomery

The Multiple Regression Model

ANOVA (Analysis of Variance) output RLS 11/20/2016

Analysis of Variance

Chapter 3 Multiple Regression Complete Example

Chapter 10: Analysis of variance (ANOVA)

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Statistics and Quantitative Analysis U4320

In ANOVA the response variable is numerical and the explanatory variables are categorical.

Chapter 14 Student Lecture Notes 14-1

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

Data Analysis 1 LINEAR REGRESSION. Chapter 03

Analysis of Variance

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company

Chapter 7 Student Lecture Notes 7-1

Chapter 12. ANalysis Of VAriance. Lecture 1 Sections:

Multiple Regression: Chapter 13. July 24, 2015

Inferences for Regression

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

21.0 Two-Factor Designs

i=1 X i/n i=1 (X i X) 2 /(n 1). Find the constant c so that the statistic c(x X n+1 )/S has a t-distribution. If n = 8, determine k such that

Confidence Interval for the mean response

W&M CSCI 688: Design of Experiments Homework 2. Megan Rose Bryant

Ch14. Multiple Regression Analysis

Sociology 6Z03 Review II

Correlation Analysis

Analysis of Variance (ANOVA)

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal

STAT 705 Chapter 16: One-way ANOVA

ANOVA: Analysis of Variation

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

Inference for the Regression Coefficient

SIMPLE REGRESSION ANALYSIS. Business Statistics

Statistical Techniques II EXST7015 Simple Linear Regression

Ch 2: Simple Linear Regression

Analysis of Variance. Contents. 1 Analysis of Variance. 1.1 Review. Anthony Tanbakuchi Department of Mathematics Pima Community College

WELCOME! Lecture 13 Thommy Perlinger

CHAPTER 10 ONE-WAY ANALYSIS OF VARIANCE. It would be very unusual for all the research one might conduct to be restricted to

Model Building Chap 5 p251

Table 1: Fish Biomass data set on 26 streams

Example: Four levels of herbicide strength in an experiment on dry weight of treated plants.

Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8]

PLSC PRACTICE TEST ONE

Lecture 13 Extra Sums of Squares

Introduction. Chapter 8

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.

Analysis of Variance. ภาว น ศ ร ประภาน ก ล คณะเศรษฐศาสตร มหาว ทยาล ยธรรมศาสตร

Topic 13. Analysis of Covariance (ANCOVA) - Part II [ST&D Ch. 17]

Factorial designs. Experiments

Analysis of Variance

Chapter 14. Multiple Regression Models. Multiple Regression Models. Multiple Regression Models

One-Way Analysis of Variance (ANOVA) Paul K. Strode, Ph.D.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Simple Linear Regression: One Qualitative IV

Week 12 Hypothesis Testing, Part II Comparing Two Populations

Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

Chapter 4. Regression Models. Learning Objectives

Two-Way ANOVA. Chapter 15

Ch 11- One Way Analysis of Variance

STATS Analysis of variance: ANOVA

One-way ANOVA. Experimental Design. One-way ANOVA

Lecture 15 - ANOVA cont.

Variance Decomposition and Goodness of Fit

Statistics for Managers Using Microsoft Excel

What If There Are More Than. Two Factor Levels?

BALANCED INCOMPLETE BLOCK DESIGNS

Transcription:

STAT 515 -- Chapter 10: Analysis of Variance Designed Experiment A study in which the researcher controls the levels of one or more variables to determine their effect on the variable of interest (called the response variable or dependent variable). Response variable: Main variable of interest (continuous) Factors: Other variables (typically discrete) which may have an effect on the response. Quantitative factors are numerical. Qualitative factors are categorical. The levels are the different values (for each factor) used in the experiment. Example 1: Response variable: College GPA Factors: Gender (levels: Male, Female) # of AP courses (levels: 0, 1,, 3, 4+) The treatments of an experiment are the different factor level combinations. Treatments for Example 1:

Experimental Units: the objects on which the factors and response are observed or measured. Example 1? Designed experiment: The analyst controls which treatments to use and assigns experimental units to each treatment. Observational study: The analyst simply observes treatments and responses for a sample of units. Example : Plant growth study: Experimental Units: A sample of plants Response: Growth over one month Factors: Fertilizer Brand (levels: A, B, C) Environment (levels: Natural Sunlight, Artificial Lamp) There are how many treatments? (Could also have a quantitative factor ) If 5 plants are assigned to each treatment (5 replicates per treatment), there are how many observations in all?

Completely Randomized Design (CRD) A Completely Randomized Design is a design in which independent samples of experimental units are selected for each treatment. Suppose there are k treatments (usually k 3). We want to test for any differences in mean response among the treatments. Hypothesis Test: H 0 : 1 = = = k Ha: At least two of the treatment population means differ. Visually, we could compare all the sample means for the different treatments. (Dot plots, p. 51) If there are more than two treatments, we cannot just subtract sample mean values. Instead, we analyze the variance in the data: Q: Is the variance within each group small compared to the variance between groups (specifically, between group means)? Top figure? Bottom figure?

How do we measure the variance within each group and the variance between groups? The Sum of Squares for Treatments (SST) measures variation between group means. SST = k i 1 n ( X X ) i i n i = number of observations in group i X i X = sample mean response for group i = overall sample mean response SST measures how much each group sample mean varies from the overall sample mean. The Sum of Squares for Error (SSE) measures variation within groups. SSE = s i k ( i 1) si i 1 n = sample variance for group i SSE is a sum of the variances of each group, weighted by the sample sizes by each group. To make these measures comparable, we divide by their degrees of freedom and obtain:

Mean Square for Treatments (MST) = SST k 1 Mean Square for Error (MSE) = SSE n k MST The ratio MSE is called the ANOVA F-statistic. If F = MST MSE is much bigger than 1, then the variation between groups is much bigger than the variation within groups, and we would reject H 0 : 1 = = = k in favor of H a. Example (Table 10.3) Response: Distance a golf ball travels 4 treatments: Four different brands of ball X _ 1 = 50.8, X _ = 61.1, X _ 3 = 70.0, X _ 4 = 49.3. => X _ = 57.8. n 1 = 10, n = 10, n 3 = 10, n 4 = 10. => n = 40. Sample variances for each group: s 1 =.4, s = 14.95, s 3 = 0.6, s 4 = 7.07.

SST = SSE = MST = MSE = F = This information is summarized in an ANOVA table: Source df SS MS F Treatments k 1 SST MST MST/MSE Error n k SSE MSE Total n 1 SS(Total) Note that df(total) = df(trt) + df(error) and that SS(Total) = SST + SSE.

For our example, the ANOVA table is: Note: SS( Total) df ( Total) ( X X ) entire data set,. n 1 is simply the sample variance for the In example, we can see F = 43.99 is clearly bigger than 1 but how much bigger than 1 must it be for us to reject H 0? ANOVA F-test: If H 0 is true and all the population means are indeed equal, then this F-statistic has an F-distribution with numerator d.f. k 1 and denominator d.f. n k. We would reject H 0 if our F is unusually large. Picture:

H 0 : 1 = = = k Ha: At least two of the treatment population means differ. Rejection Region: F > F, where F based on (k 1, n k) d.f. Assumptions: We have random samples from the k populations. All k populations are normal. All k population variances are equal. Example: Perform ANOVA F-test using =.10. Which treatment means differ? Section 10.3 (Multiple Comparisons of Means) covers this issue.