Fractional Factorial Designs

Similar documents
2 k, 2 k r and 2 k-p Factorial Designs

Two-Factor Full Factorial Design with Replications

CS 5014: Research Methods in Computer Science

CS 147: Computer Systems Performance Analysis

CS 5014: Research Methods in Computer Science. Experimental Design. Potential Pitfalls. One-Factor (Again) Clifford A. Shaffer.

Two Factor Full Factorial Design with Replications

One Factor Experiments

Factorial ANOVA. Psychology 3256

Chapter 11: Factorial Designs

If we have many sets of populations, we may compare the means of populations in each set with one experiment.

Suppose we needed four batches of formaldehyde, and coulddoonly4runsperbatch. Thisisthena2 4 factorial in 2 2 blocks.

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Statistics For Economics & Business

Analysis of Variance and Design of Experiments-I

Two-Way Analysis of Variance - no interaction

Allow the investigation of the effects of a number of variables on some response

Theorem A: Expectations of Sums of Squares Under the two-way ANOVA model, E(X i X) 2 = (µ i µ) 2 + n 1 n σ2

EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture 19

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Outline. Simulation of a Single-Server Queueing System. EEC 686/785 Modeling & Performance Evaluation of Computer Systems.

Multiple Comparisons. The Interaction Effects of more than two factors in an analysis of variance experiment. Submitted by: Anna Pashley

Two or more categorical predictors. 2.1 Two fixed effects

19. Blocking & confounding

Unit 8: 2 k Factorial Designs, Single or Unequal Replications in Factorial Designs, and Incomplete Block Designs

ST3232: Design and Analysis of Experiments

23. Fractional factorials - introduction

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

The 2 k Factorial Design. Dr. Mohammad Abuhaiba 1

Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data

Fractional Factorial Designs

Topic 9: Factorial treatment structures. Introduction. Terminology. Example of a 2x2 factorial

Outline Topic 21 - Two Factor ANOVA

Answer Keys to Homework#10

TWO OR MORE RANDOM EFFECTS. The two-way complete model for two random effects:

Assignment 9 Answer Keys

Statistical Hypothesis Testing

Unit 9: Confounding and Fractional Factorial Designs

Chapter 13 Experiments with Random Factors Solutions

Chapter 8: Hypothesis Testing Lecture 9: Likelihood ratio tests

Key Features: More than one type of experimental unit and more than one randomization.

Reference: Chapter 6 of Montgomery(8e) Maghsoodloo

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

MAT3378 ANOVA Summary

Multiple comparisons - subsequent inferences for two-way ANOVA

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

Factorial designs. Experiments

Written Exam (2 hours)

1 The Randomized Block Design

ANOVA Randomized Block Design

Definitions of terms and examples. Experimental Design. Sampling versus experiments. For each experimental unit, measures of the variables of

Two Factor Completely Between Subjects Analysis of Variance. 2/12/01 Two-Factor ANOVA, Between Subjects 1

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

Chapter 15: Analysis of Variance

Stat 217 Final Exam. Name: May 1, 2002

Two-Way Factorial Designs

Two-factor studies. STAT 525 Chapter 19 and 20. Professor Olga Vitek

Chapter 10: Analysis of variance (ANOVA)

Unit 27 One-Way Analysis of Variance

Unreplicated 2 k Factorial Designs

Confounding and Fractional Replication in Factorial Design

Blocks are formed by grouping EUs in what way? How are experimental units randomized to treatments?

Ch. 1: Data and Distributions

CS 5014: Research Methods in Computer Science

Chapter 10. Design of Experiments and Analysis of Variance

Statistical Analysis of Unreplicated Factorial Designs Using Contrasts

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

BALANCED INCOMPLETE BLOCK DESIGNS

Factorial Treatment Structure: Part I. Lukas Meier, Seminar für Statistik

STAT 705 Chapter 19: Two-way ANOVA

Chapter 6 Randomized Block Design Two Factor ANOVA Interaction in ANOVA

3. Factorial Experiments (Ch.5. Factorial Experiments)

STAT22200 Spring 2014 Chapter 8A

Grant MacEwan University STAT 252 Dr. Karen Buro Formula Sheet

Lecture 15 - ANOVA cont.

iron retention (log) high Fe2+ medium Fe2+ high Fe3+ medium Fe3+ low Fe2+ low Fe3+ 2 Two-way ANOVA

R version ( ) Copyright (C) 2009 The R Foundation for Statistical Computing ISBN

Factorial Designs. Prof. Daniel A. Menasce Dept. of fcomputer Science George Mason University. studied simultaneously.

. Example: For 3 factors, sse = (y ijkt. " y ijk

Contents. TAMS38 - Lecture 8 2 k p fractional factorial design. Lecturer: Zhenxia Liu. Example 0 - continued 4. Example 0 - Glazing ceramic 3

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

A Study on Factorial Designs with Blocks Influence and Inspection Plan for Radiated Emission Testing of Information Technology Equipment

Soo King Lim Figure 1: Figure 2: Figure 3: Figure 4: Figure 5: Figure 6: Figure 7: Figure 8: Figure 9: Figure 10: Figure 11: Figure 12: Figure 13:

CHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication

Unit 12: Analysis of Single Factor Experiments

STAT 705 Chapter 19: Two-way ANOVA

Example - Alfalfa (11.6.1) Lecture 14 - ANOVA cont. Alfalfa Hypotheses. Treatment Effect

Reference: CHAPTER 7 of Montgomery(8e)

STAT Final Practice Problems

IX. Complete Block Designs (CBD s)

20g g g Analyze the residuals from this experiment and comment on the model adequacy.

STAT Chapter 10: Analysis of Variance

ANOVA CIVL 7012/8012

Unit 6: Fractional Factorial Experiments at Three Levels

Fractional Replications

Example - Alfalfa (11.6.1) Lecture 16 - ANOVA cont. Alfalfa Hypotheses. Treatment Effect

STAT 506: Randomized complete block designs

We need to define some concepts that are used in experiments.

2 k Factorial Designs Raj Jain

Statistics 512: Applied Linear Models. Topic 7

Transcription:

k-p Fractional Factorial Designs Fractional Factorial Designs If we have 7 factors, a 7 factorial design will require 8 experiments How much information can we obtain from fewer experiments, e.g. 7-4 = 8 experiments? A k-p design allows the analysis of k twolevel factors with fewer experiments

A 7-4 Experimental Design Consider the 3 design below: Experiment # I A B C AB AC BC ABC - - - - - - - - 3 - - - - 4 - - - - 5 - - - - 6 - - - - 7 - - - - 8 If the factors, AB, AC, BC, ABC are replaced by D, E, F, and G we get a 7-4 design 3 A 7-4 design If the interactions AB, AC, AD,, ABCD are negligible we can use the table below Experiment # I A B C D E F G y - - - - 0 - - - - 35 3 - - - - 7 4 - - - - 4 5 - - - - 36 6 - - - - 50 7 - - - - 45 8 8 Total 37 0 35 09 43 47 3 Total/8 39.6.6 4.37 3.6 5.37 0. 5.9 0.37 Percent variation 37.6 4.74 43.4 6.75 0 8. 0.03 4

Preparing the sign table for a k-p design. Choose k-p factors and prepare a complete sign table for a full factorial design with k-p factors. There are k-p rows and columns in the table. The first column is marked I and consists of all s. The next k-p columns correspond to the k-p selected factors. The remaining columns correspond to the products of these factors.. Of the k-p -k+p- remaining columns, select p columns corresponding to the p factors that were not chosen in step. Note: there are several possibilities; the columns corresponding to negligible interactions should be chosen. 5 A 4- design Experiment # I A B C AB AC BC ABC D - - - - - - - - 3 - - - - 4 - - - - 5 - - - - 6 - - - - 7 - - - - 8 If the ABC interaction is negligible, we should replace ABC with D. If AB is negligible, we can replace AB with D. 6 3

Confounding The drawback of k-p designs is that the experiments only yield the combined effects of two or more factors. This is called confounding On the previous slide, the effects of ABC and D are confounded (denoted as ABC = D) In a k- design, every column represents a sum of two effects. For our example, A = BCD, B = ACD, C = ABD, AB = CD, AC = BD, BC = AD, ABC = D, I = ABCD This means that columns for A, B, C, D actually correspond to A + BCD, B + ACD, C + ABD, D+ ABC, etc. If we replace AB with D, I = ABD, A = BD, B = AD, C = A BCD, D = AB, AC = BCD, BC = ACD, ABC = CD In a k-p design, p effects are confounded 7 Algebra of Confounding Consider the first design in which ABC is replaced with D Here, I = ABCD All the confoundings can be generated using the following rules. I is treated as unity.e.g. I multiplied by A is A. Any term with a power of is erased, e.g. AB C is the same as AC. The polynomial I = ABCD is used to generate all the confoundings for this design, and is called the generator polynomial The second design in which AB was replaced by D in the sign table has generator polynomial I = ABD 8 4

Design Resolution The resolution of a design is measured by the order of effects that are confounded The effect ABCD is of order 4, while I is of order 0 If an i-th order effect is confounded with a j-th order term, the confounding is of order i+j The minimum of orders of all confoundings of a design is called its resolution We can easily determine the resolution of a design by looking at the generator polynomial, e.g. if I = ABCD, then the design has resolution 4, if I = ABD, the design has resolution 3 In general, higher resolution designs are considered better under the assumption that higher order interactions are smaller than lower-order effects 9 One Factor Experiment Design 0 5

One Factor Experiment Design So far: unlimited factors, two levels Now: unlimited levels, one factor Model: y ij = µ + α j + e ij, where y ij is the ith response with the factor at level j, µ is the mean response, α j is the effect of alternative j, and e ij is the error term The effects are computed such that α j = 0, e ij = 0, e ij = 0 i j i Model cont d Notation: j (factor level), i (replication), a (number of levels), r (number of replicas) Example: a = 3, r = 5 R 44 0 76 88 44 V 0 44 88 7 Z 30 80 4 374 30 6

Model cont d Notation: y.. = µ, grand mean (avg. of all responses i,j) y.j = µ + α j, column mean (avg of all responses for a particular factor level) R V Z 44 0 30 0 44 80 76 4 88 88 374 44 7 30 Column Sum 87 86 7 85 Column mean 74.4 y. 63. 5.4 87.7 y.. Column effect -3.3 α -4.5 37.7 3 Estimating experimental errors Estimated response for the jth alternative is For the example on previous slide, SSE = (44-87.7 + 3.3) + (0-87.7 + 4.5 ) + (30-87.7-37.7) + + (30-87.7-37.7) = 94,365.0 ˆ y j = µ + α j e ij = y ij ˆ y j SSE = e ij r a i= j= 4 7

Allocation of variation We can show that SSY = SS0 + SSA + SSE, where SSY is sum of squares of y, SS0 is the sum of squares of the grand mean, SSA is the sum of squares of effects, and SSE is the sum of squares of errors SSE can be easily calculated from SSY, SS0, and SSA Further, SST = SSY - SS0 = SSA + SSE In our example, SST = 05,357; SSA = 0,99 (0.4%) ; SSE = 94,365 (89.6%) SS0 = arµ, SSA = r α j Is SSA statistically significant? a j= 5 Analysis of variance Allocation of variation shows that almost 90% of the variation is due to SSE In general, relatively high SSE could mean Factor under consideration is not important Number of replicas is much larger than number of factor levels Maybe we have two few samples and a bad sample with high errors Statistical procedure for analysis of significance of various factors -- Analysis of Variance (ANOVA) 6 8

ANOVA Consider that Define SSY = SS0 + SSA + SSE ar = + (a -) + a (r-) Degrees of freedom MSA = SSA/(a-) MSE = SSE/a(r-) Mean Square of A Mean Square of E Then, the ratio MSA/MSE has a F-distribution with (a-) numerator degrees of freedom and a(r-) denominator degrees of freedom F(n,m) denotes F distribution where n and m and numerator and denominator degrees of freedom respectively 7 F-test Tests following null hypothesis: Response variable does not depend upon any factor α Acceptance criteria: MSA/MSE ratio does not exceed the -α quantile of F distribution So the question: is the factor statistically significant is equivalent to rejecting the null hypotheis above In other words, the F-statistic from our data should exceed the theoretical F value 8 9

F-test example For our example, MSA = SSA/(a-) = 0,99/ = 5496. MSE = SSE/a(r-) = 94,65/ = 7863.8 Computed F statistic = 5496./7963.8 = 0.7 Theoretical F(0.9;,) =.8 Thus, we can conclude that the factor under consideration is not statistically significant 9 Additional Reading Visual Diagnostic Tests for verifying assumptions - Section 0.6 Confidence intervals for Effects - Section 0.7 Unequal sample sizes - Section 0.8 0 0

Two factor full factorial designs without replications Experiment design with two factors, each of which can have an arbitrary number of levels Initially, we will not consider replications Factors A and B, with number of levels a and b respectively, number of experiments is ab Model y ij = µ + α j + β i + e ij, where y ij is the observed response with the factor A at level j and the factor B at level i, µ is the mean response, α j is the effect of factor A at level j, and β i is the effect of factor B at level i, and e ij is the error term The effects are computed such that α j = 0, β i = 0, e ij = 0 j i We obtain y.. = µ, y i. = µ + β i, y j. = µ + α j

Computation of Effects Workload Two Caches One Cache No Cache Row Sum Row Mean Row Effect ASM 54.0 55.0 06.0 5.0 7.7-0.5 β TECO 60.0 60.0 3.0 43.0 8.0 8.8 SIEVE 43.0 43.0 0.0 06.0 68.7-3.5 DHRYSTONE 49.0 5.0.0.0 70.7 -.5 SORT 49.0 50.0 08.0 07.0 69.0-3. Column Sum 55.0 60.0 568.0 083.0 Column Mean Column effect 5.0 -. 5.0-0. 3.6 4.4 7. y.. α 3 Estimating experimental errors ˆ y j = µ + α j + β i e ij = y ij ˆ y j = y ij µ α j β i b a SSE = e ij i= j= For our example, ˆ y = 7.. 0.5 = 50.5 e = 54 50.5 = 3.5 SSE = 3.5 + 0. +...+ (.4) = 36.80 4

Allocation of variation We can show SSY = SS0 + SSA + SSB + SSE and SST = SSY - SS0 = SSA+SSB + SSE For our example, SSY = 9,595; SS0 = 78,9.6; SSA =,857., SSB = 308.4, SST = 3,40.4 SSE = SST - SSA - SSB = 36.8 The percentage of variation explained by the cache is 95.9%, due to workloads is.3% and the unexplained variation is.8% 5 Analysis of variance (ANOVA) Similar to one-factor analysis MSA = SSA/(a-); MSB = SSB/(b-) MSE = SSE/(a-)(b-) F-ratio for factor A is F A = MSA/MSE and for factor B is F B = MSB/MSE If greater than theoretical F-value then factor is statistically significant For our example, F A = 7., F B =.6, theoretical F value =.8, so the first factor (cache) is statistically significant 6 3

Additional Reading Section.6 - confidence intervals for effects Section.7 - multiplicative models for two factor experiments Section.8 - handling missing observations (optional) 7 Two-factor full factorial design with replications y ijk = µ + α j + β i + γ ij + e ij, where y ijk is the observed response in the kth replication of the experiment with the factor A at level j and the factor B at level i, µ is the mean response, α j is the effect of factor A at level j, β i is the effect of factor B at level i, γ ij is the effect of interaction between factor A at level j and factor B at level i, and and e ij is the experimental error The effects are computed such that α j = 0, β i = 0, The interactions are computed so that their row as well as column sums are 0 The errors in each experiment add to 0 r k= e ijk = 0, i, j 8 4

Computation of effects The observations are arranged in b rows and a columns with each cell containing r observations Compute the average of the r observations for each cell Then, proceed as in the analysis of twofactor design without replication 9 Computation of errors ˆ y ij = µ + α j + β i + γ ij = y ij. The error in the kth replication of the experiment is e ijk = y ijk y ij. SSE = b a r i= j= k= e ijk (the average of the r observations in a cell) 30 5

Allocation of variation and ANOVA We can show SST = SSY - SS0 = SSA + SSB + SSAB + SSE ANOVA F-test: compute MSA/MSE, MSB/MSE, MSAB/MSE Degrees of freedom: SSA has a-, SSB has b-, SSAB has (a-)(b-), SSE has ab(r-) 3 Further Reading Section.6 - confidence intervals for effects Chapter 3 - General Full factorial designs with k factors Generalization of analysis techniques discussed Informal (non-statistical methods) for determining the important factors 3 6