Design & Analysis of Experiments 7E 2009 Montgomery

Similar documents
DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

What If There Are More Than. Two Factor Levels?

Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

Design & Analysis of Experiments 7E 2009 Montgomery

Chapter 4: Randomized Blocks and Latin Squares

Unit 12: Analysis of Single Factor Experiments

Formal Statement of Simple Linear Regression Model

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

OHSU OGI Class ECE-580-DOE :Statistical Process Control and Design of Experiments Steve Brainerd Basic Statistics Sample size?

Factorial designs. Experiments

1 Introduction to One-way ANOVA

Research Methods II MICHAEL BERNSTEIN CS 376

Analysis of Variance

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.

Lec 1: An Introduction to ANOVA

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

Basic Business Statistics 6 th Edition

Mathematics for Economics MA course

Solutions to Final STAT 421, Fall 2008

Lecture 9. ANOVA: Random-effects model, sample size

W&M CSCI 688: Design of Experiments Homework 2. Megan Rose Bryant

STAT Chapter 10: Analysis of Variance

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Simple Linear Regression

Statistics for Managers using Microsoft Excel 6 th Edition

Ch 2: Simple Linear Regression

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

1 One-way Analysis of Variance

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.

Business Statistics. Lecture 10: Course Review

16.3 One-Way ANOVA: The Procedure

Inferences for Regression

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs)

The entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.

Analysis of Variance. Read Chapter 14 and Sections to review one-way ANOVA.

DESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Design of Engineering Experiments Part 5 The 2 k Factorial Design

Psychology 282 Lecture #4 Outline Inferences in SLR

20.0 Experimental Design

Correlation Analysis

CHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication

Inference for the Regression Coefficient

Sociology 6Z03 Review II

Randomized Blocks, Latin Squares, and Related Designs. Dr. Mohammad Abuhaiba 1

Much of the material we will be covering for a while has to do with designing an experimental study that concerns some phenomenon of interest.

Two-Way Factorial Designs

df=degrees of freedom = n - 1

Review of Statistics 101

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

ANOVA (Analysis of Variance) output RLS 11/20/2016

9 One-Way Analysis of Variance

Chapter 4. Regression Models. Learning Objectives

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

Diagnostics and Remedial Measures

Lecture 3: Inference in SLR

Multiple comparisons - subsequent inferences for two-way ANOVA

F-tests and Nested Models

Remedial Measures, Brown-Forsythe test, F test

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company

STAT22200 Spring 2014 Chapter 8A

appstats27.notebook April 06, 2017

STA121: Applied Regression Analysis

One-way ANOVA (Single-Factor CRD)

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

28. SIMPLE LINEAR REGRESSION III

Business Statistics. Lecture 9: Simple Regression

The simple linear regression model discussed in Chapter 13 was written as

Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs

Group comparison test for independent samples

3. Diagnostics and Remedial Measures

Statistical Techniques II EXST7015 Simple Linear Regression

The Model Building Process Part I: Checking Model Assumptions Best Practice

Chapter 3. Diagnostics and Remedial Measures

Using regression to study economic relationships is called econometrics. econo = of or pertaining to the economy. metrics = measurement

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Analysis of Variance

The Model Building Process Part I: Checking Model Assumptions Best Practice (Version 1.1)

7. Response Surface Methodology (Ch.10. Regression Modeling Ch. 11. Response Surface Methodology)

Practice Problems Section Problems

Assessing Model Adequacy

Hypothesis Testing. We normally talk about two types of hypothesis: the null hypothesis and the research or alternative hypothesis.

Comparing Several Means: ANOVA

COMPARING SEVERAL MEANS: ANOVA

2.830 Homework #6. April 2, 2009

3. Factorial Experiments (Ch.5. Factorial Experiments)

Topic 20: Single Factor Analysis of Variance

Analysis of Variance

Chapter 13. Multiple Regression and Model Building

QUEEN MARY, UNIVERSITY OF LONDON

Chapter 27 Summary Inferences for Regression

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.

STA2601. Tutorial letter 203/2/2017. Applied Statistics II. Semester 2. Department of Statistics STA2601/203/2/2017. Solutions to Assignment 03

Six Sigma Black Belt Study Guides

Design and Analysis of Experiments Prof. Jhareswar Maiti Department of Industrial and Systems Engineering Indian Institute of Technology, Kharagpur

Power & Sample Size Calculation


DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

Transcription:

1

What If There Are More Than Two Factor Levels? The t-test does not directly apply ppy There are lots of practical situations where there are either more than two levels of interest, or there are several lfactors of simultaneous interest t The analysis of variance (ANOVA) is the appropriate analysis engine for these types of experiments The ANOVA was developed by Fisher in the early 1920s, and initially applied to agricultural experiments Used extensively today for industrial experiments 2

An Example (See pg. 61) An engineer is interested in investigating the relationship between een the RF power setting and the etch rate for this tool. The objective of an experiment like this is to model the relationship between etch rate and RF power, and to specify the power setting that will give a desired target etch rate. The response variable is etch rate. She is interested in a particular gas (C2F6) and gap (0.80 cm), and wants to test four levels of RF power: 160W, 180W, 200W, and 220W. She decided to test five wafers at each level of RF power. The experimenter chooses 4 levels of RF power 160W, 180W, 200W, and 220W The experiment is replicated 5 times runs made in random order 3

4

An Example (See pg. 62) 5

Does changing the power change the mean etch rate? Is there an optimum level for power? We would like to have an objective way to answer these questions The t-test test really doesn t apply here more than two factor levels 6

The Analysis of Variance (Sec. 3.2, pg. 62) In general, there will be a levels of the factor, or a treatments, and n replicates of the experiment, run in random order a completely randomized design (CRD) N = an total runs We consider the fixed effects case the random effects case will be discussed later Objective is to test hypotheses about the equality of the a treatment means 7

The Analysis of Variance The name analysis of variance stems from a partitioning of the total variability in the response variable into components that are consistent with a model for the experiment The basic single-factor ANOVA model is i 1,2,...,,, a, j 1,2,..., n y ij i ij ij an overall mean, ith treatment effect, 2 experimental error, NID(0, ) i 8

Models for the Data There are several ways to write a model for the data: yij i ij is called the effects model Let, then i i y is called the means model ij i ij Regression models can also be employed 9

The Analysis of Variance Total variability is measured by the total sum of squares: a n SS ( y y ) T i 1 j 1 The basic ANOVA partitioning is: a n a n 2 2 ( yij y.. ) [( yi. y.. ) ( yij yi. )] i 1 j 1 i 1 j 1 ij.. 2 a a n 2 2 (...) i ( ij i. ) i 1 i 1 j 1 n y y y y SS SS SS T Treatments t E 10

The Analysis of Variance SS SS SS T Treatments E A large value of SS Treatments reflects large differences in treatment means A small value of SS Treatments likely indicates no differences in treatment t t means Formal statistical hypotheses are: H H 0 1 2 1 : a : At least one mean is different 11

The Analysis of Variance While sums of squares cannot be directly compared to test the hypothesis of equal means, mean squares can be compared. A mean square is a sum of squares divided by its degrees of freedom: MS dftotal dftreatments dferror an 1 a 1 a( n 1) SSTreatments SSE, MSE a 1 a( n 1) Treatments If the treatment means are equal, the treatment and error mean squares will be (theoretically) equal. If treatment means differ, the treatment mean square will be larger than the error mean square. 12

The Analysis of Variance is Summarized in a Table Computing see text, pp 69 The reference distribution ib ti for F 0 is the F a-1, a(n-1) distribution ib ti Reject the null hypothesis (equal treatment means) if F 0 F, a 1, a( n 1) 13

14

Example: Etch Rate Data (in Å/min) from the Plasma Etching Experiment RF Power Observed Etch Rate Totals Average (W) 1 2 3 4 5 yi. yi.bar 160 575 542 530 539 570 2756 551.2 180 565 593 590 579 610 2937 587.4 220 600 651 610 637 629 3127 625.4 220 725 700 715 685 710 3535 707 15

ANOVA Table Example 3-1 16

The Reference Distribution: P-value 17

ANOVA calculations may be made simpler by coding the observation. Observation may be coded by subtracting each data by 600. We get: Based on the data above, the sum of squares may be computed in the usual manner. The sum of squares are similar to that calculated previously. Therefore subtracting a constant from the original data does not change the sum of squares. DOX 6E Montgomery 18

Model Adequacy Checking in the ANOVA Text reference, Section 3.4, pg. 75 Checking assumptions is important t Normality Constant variance Independence Have we fit the right model? Later we will talk about what to do if some of these assumptions are violated 19

The normality assumption could be verified by constructing a normal probability plot of the residuals. If the underlying error distribution is normal, this plot will resemble a straight line. DOX 6E Montgomery 20

Model Adequacy Checking in the ANOVA Examination of residuals (see text, Sec. 3-4, pg. 75) e y yˆ ij ij ij y y ij i. Computer software generates the residuals Residual plots are very useful Normal probability plot of residuals 21

Plot of residuals in time sequence Plotting the residuals in time order of data collection is helpful in detecting correlation between the residuals. A tendency to have runs of positive and negative residuals indicates positive correlation. This would imply that the independence assumption on the errors has been violated. Important to prevent the problem and proper randomization of the experiment is an important step in obtaining independence. DOX 6E Montgomery 22

Skill of the experimenter may improve as the experiment progresses or the process may drift. This will often result in a change in the error variance over time depicted by more spread at one end than at the other. DOX 6E Montgomery 23

Plot of residuals versus fitted values If the model is correct and if the assumptions are satisfied, the residuals should be structureless. A simple check is to plot the residuals versus the fitted values. This plot should not reveal any obvious pattern as shown in Fig. 3.6. Nonconstant variance may show up on this plot in a form of an outward opening funnel or megaphone. One way in dealing with nonconstant variance is to apply a variance stabilizing transformation DOX 6E Montgomery 24

The Regression Model 25

Post-ANOVA Comparison of Means The analysis of variance tests the hypothesis of equal treatment means Assume that residual analysis is satisfactory If that hypothesis is rejected, we don t know which specific means are different Determining which specific means differ following an ANOVA is called the multiple comparisons problem There are lots of ways to do this see text, Section 3.5, pg. 84 We will use pairwise t-tests tests on means sometimes sometimes called Fisher s Least Significant Difference (or Fisher s LSD) Method 26

Design-Expert Output 27

Graphical Comparison of Means Text, pg. 88 28

ANOVA calculations are usually done via computer Text exhibits sample calculations from three very popular software packages, Design-Expert Expert, JMP and Minitab See pages 98-100 Text discusses some of the summary statistics provided by these packages 29

Figure 3.12 (p. 99) Design-Expert computer output for Example 3-1. R 2 SS Model SS Total 66870.55 72209.7575 0.9261 R 2 is loosely interpreted as the proportion of the variability in the data explained by the ANOVA model. Values closer to 1 is desirable. Adjusted R 2 reflects the number of factors in the model and useful for evaluating the impact of increasing or decreasing the number of model terms. Others see text pg. 98 DOX 6E Montgomery 30

Figure 3.13 (p. 100) Minitab computer output for Example 3-1. DOX 6E Montgomery 31

Why Does the ANOVA Work? We are sampling from normal populations, so SS SS if H is true, and Cochran's theorem gives the independence of Treatments 2 E 2 2 a 1 0 2 a ( n 1) these two chi-square random variables 2 SSTreatments /( a 1) a 1 /( 1) a So F0 F 2 a 1, a( n 1) SSE /[ a( n 1)] a( n 1) /[ a( n 1)] n 2 n i 2 i 1 2 EMSE Finally, EMS ( Treatments ) and ( ) a 1 Therefore an upper-tail F test is appropriate. 32

Sample Size Determination Text, Section 3.7, pg. 101 FAQ in designed experiments Answer depends on lots of things; including what type of experiment is being contemplated, how it will be conducted, resources, and desired sensitivity Sensitivity refers to the difference in means that the experimenter wishes to detect Generally, increasing the number of replications increases the sensitivity or it makes it easier to detect small differences in means 33

Sample Size Determination Fixed Effects Case Can choose the sample size to detect a specific difference in means and achieve desired values of type I and type II errors Type I error reject H 0 when it is true ( ) Type II error fail to reject H 0 when it is false ( ) Power =1 - Operating characteristic curves plot against a parameter where n a 2 i 1 2 a 2 i 34

Sample Size Determination Fixed Effects Case---use of OC Curves The OC curves for the fixed effects model are in the Appendix, Table V A very common way to use these charts is to define a difference in two means D of interest, then the minimum value of 2is 2 2 nd 2 2a Typically work in term of the ratio of D / and try values of n until the desired d power is achieved Most statistics software packages will perform power and sample size calculations see page 103 There are some other methods discussed d in the textt 35

Power and sample size calculations from Minitab (Page 103) 36

37

38

39

40

41

Question to be attempted No. 3.1, 3.3, 3.6, 3.9 (No. 37 3.7, 38 3.8, 311 3.11, 35) 3.5) DOX 6E Montgomery 42