Design and Analysis of Experiments

Similar documents
Lec 5: Factorial Experiment

Lecture 11: Nested and Split-Plot Designs

Lecture 10. Factorial experiments (2-way ANOVA etc)

Design and Analysis of Experiments

Chapter 5 Introduction to Factorial Designs

Design of Engineering Experiments Chapter 5 Introduction to Factorials

These are multifactor experiments that have

Factorial designs. Experiments

Contents. TAMS38 - Lecture 8 2 k p fractional factorial design. Lecturer: Zhenxia Liu. Example 0 - continued 4. Example 0 - Glazing ceramic 3

Lecture 9: Factorial Design Montgomery: chapter 5

Chapter 5 Introduction to Factorial Designs Solutions

Design & Analysis of Experiments 7E 2009 Montgomery

Reference: Chapter 14 of Montgomery (8e)

Factorial Treatment Structure: Part I. Lukas Meier, Seminar für Statistik

STAT Final Practice Problems

3. Factorial Experiments (Ch.5. Factorial Experiments)

Solution to Final Exam

Contents. TAMS38 - Lecture 6 Factorial design, Latin Square Design. Lecturer: Zhenxia Liu. Factorial design 3. Complete three factor design 4

1 Use of indicator random variables. (Chapter 8)

SMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning

Reference: Chapter 13 of Montgomery (8e)

STATISTICS 368 AN EXPERIMENT IN AIRCRAFT PRODUCTION Christopher Wiens & Douglas Wiens

Design and Analysis of Experiments

Section 4.6 Simple Linear Regression

FACTORIAL DESIGNS and NESTED DESIGNS

Addition of Center Points to a 2 k Designs Section 6-6 page 271

Stat 579: Generalized Linear Models and Extensions

Written Exam (2 hours)

2-way analysis of variance

Design and Analysis of Experiments. David Yanez Department of Biostatistics University of Washington

MODELS WITHOUT AN INTERCEPT

Part 5 Introduction to Factorials

Comparing Nested Models

Stat 217 Final Exam. Name: May 1, 2002

Inference for Regression

3. Design Experiments and Variance Analysis

Key Features: More than one type of experimental unit and more than one randomization.

Multiple Regression: Example

Contents. 2 2 factorial design 4

Stat 412/512 TWO WAY ANOVA. Charlotte Wickham. stat512.cwick.co.nz. Feb

Process/product optimization using design of experiments and response surface methodology

Exercise I.1 I.2 I.3 I.4 II.1 II.2 III.1 III.2 III.3 IV.1 Question (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) Answer

STAT 135 Lab 10 Two-Way ANOVA, Randomized Block Design and Friedman s Test

3.4. A computer ANOVA output is shown below. Fill in the blanks. You may give bounds on the P-value.

MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS FALL 1999

Unit 9: Confounding and Fractional Factorial Designs

Topic 9: Factorial treatment structures. Introduction. Terminology. Example of a 2x2 factorial

Chapter 9 Other Topics on Factorial and Fractional Factorial Designs

Analysis of Variance and Design of Experiments-I

Stat 5031 Quadratic Response Surface Methods (QRSM) Sanford Weisberg November 30, 2015

Design and Analysis of Experiments Prof. Jhareshwar Maiti Department of Industrial and Systems Engineering Indian Institute of Technology, Kharagpur

Open book and notes. 120 minutes. Covers Chapters 8 through 14 of Montgomery and Runger (fourth edition).

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

Chap The McGraw-Hill Companies, Inc. All rights reserved.

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests

Two-Way Factorial Designs

What If There Are More Than. Two Factor Levels?

Cuckoo Birds. Analysis of Variance. Display of Cuckoo Bird Egg Lengths

Stat 401B Final Exam Fall 2015

Multiple Linear Regression

Statistics GIDP Ph.D. Qualifying Exam Methodology

Chemical Engineering: 4C3/6C3 Statistics for Engineering McMaster University: Final examination

TWO-LEVEL FACTORIAL EXPERIMENTS: IRREGULAR FRACTIONS

Biostatistics 380 Multiple Regression 1. Multiple Regression

Chapter 12. Analysis of variance

Two (or more) factors, say A and B, with a and b levels, respectively.

Multiple Predictor Variables: ANOVA

Exam Applied Statistical Regression. Good Luck!

Regression. Marc H. Mehlman University of New Haven

Linear Model Specification in R

ANOVA: Comparing More Than Two Means

Analysis of Variance (ANOVA)

UNIVERSITY OF TORONTO Faculty of Arts and Science

TWO OR MORE RANDOM EFFECTS. The two-way complete model for two random effects:

ST430 Exam 1 with Answers

STAT 213 Interactions in Two-Way ANOVA

RESPONSE SURFACE MODELLING, RSM

Increasing precision by partitioning the error sum of squares: Blocking: SSE (CRD) à SSB + SSE (RCBD) Contrasts: SST à (t 1) orthogonal contrasts

Unit 8: 2 k Factorial Designs, Single or Unequal Replications in Factorial Designs, and Incomplete Block Designs

Workshop 7.4a: Single factor ANOVA

Allow the investigation of the effects of a number of variables on some response

. Example: For 3 factors, sse = (y ijkt. " y ijk

Institutionen för matematik och matematisk statistik Umeå universitet November 7, Inlämningsuppgift 3. Mariam Shirdel

Factorial and Unbalanced Analysis of Variance

7. Response Surface Methodology (Ch.10. Regression Modeling Ch. 11. Response Surface Methodology)

Stat 6640 Solution to Midterm #2

IE 316 Exam 1 Fall 2011

STAT22200 Spring 2014 Chapter 8A

Biostatistics for physicists fall Correlation Linear regression Analysis of variance

Suppose we needed four batches of formaldehyde, and coulddoonly4runsperbatch. Thisisthena2 4 factorial in 2 2 blocks.

Contents. TAMS38 - Lecture 10 Response surface. Lecturer: Jolanta Pielaszkiewicz. Response surface 3. Response surface, cont. 4

MSc / PhD Course Advanced Biostatistics. dr. P. Nazarov

Analysis of Variance

Response Surface Methodology

Part II { Oneway Anova, Simple Linear Regression and ANCOVA with R

EXAM IN TMA4255 EXPERIMENTAL DESIGN AND APPLIED STATISTICAL METHODS

Stat 401B Exam 2 Fall 2015

ST430 Exam 2 Solutions

Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals. Regression Output. Conditions for inference.

Transcription:

Design and Analysis of Experiments Part VIII: Plackett-Burman, 3 k, Mixed Level, Nested, and Split-Plot Designs Prof. Dr. Anselmo E de Oliveira anselmo.quimica.ufg.br anselmo.disciplinas@gmail.com

Plackett-Burman Designs Two-level fractional factorial designs developed for studying k = N 1 variables in N runs, where N is a multiple of 4 Nongeometric designs (cannot be represented as cubes) Nonregular designs Regular design is one in which all effects can be estimated independently of the other effects (e.g. 2 k design) and in the case of a fractional factorial, the effects that cannot be estimated are completely aliased with the other effects (e.g. 2 k-p design) Screening Main effects are, in general, heavily confounded with two-factor interactions Plackett-Burman designs are very efficient screening designs when only main effects are of interest. Web of Science: 2017/04: 47 2016: 202

Plus and minus signs for the Plackett-Burman Designs The design is obtained by writing down the appropriate row in the table below as a column (or row) A second column (or row) is then generated from this first one by moving the elements of the column (or row) down (or to the right) one position and placing the last element in the first position. A third column... N k Signs 12 11 + + - + + + - - - + - 16 15 + + + + - + - + + - - + - - - 20 19 + + - - + + + + - + - + - - - - + + - 24 23 + + + + + - + - + + - - + + - - + - + - - - - After the column k a row of minus sings is then added, completing the design

Plackett-Burman Design for N = 12, k = 11 X 1 X 2 X 3 X 4 X 5 X 6 X 7 X 8 X 9 X 10 X 11 1 +1-1 +1-1 -1-1 +1 +1 +1-1 +1 2 +1 +1-1 +1-1 -1 +1 +1 +1 +1-1 3-1 +1 +1-1 +1-1 -1-1 +1 +1 +1 4 +1-1 +1 +1-1 +1-1 -1-1 +1 +1 5 +1 +1-1 +1 +1-1 +1-1 -1-1 +1 6 +1 +1 +1-1 +1 +1-1 +1-1 -1-1 7-1 +1 +1 +1-1 +1 +1-1 +1-1 -1 8-1 -1 +1 +1 +1-1 +1 +1-1 +1-1 9-1 -1-1 +1 +1 +1-1 +1 +1-1 +1 10 +1-1 -1-1 +1 +1 +1-1 +1 +1-1 11-1 +1-1 -1-1 +1 +1 +1-1 +1 +1 12-1 -1-1 -1-1 -1-1 -1-1 -1-1

k = 8 N = 12 row of minus signs

? x 2

> library(frf2) > design<-pb(nruns=12,nfactos=11,randomize=false) > y<-c(0.371,0.336,0.576,0.216,0.228,0.328, + 0.746,0.562,0.389,0.438,0.430,0.292) > model<-aov(y~(.), data=design) > coef(model) (Intercept) A1 B1 C1 0.409333333 0.103000000-0.024000000-0.009833333 D1 E1 F1 G1-0.016166667-0.021500000-0.009166667 0.022500000 H1 J1 K1 L1 0.051666667-0.050500000 0.007333333 0.064000000

3 k Factorial Design Three levels: low(-1), intermediate(0), and high(+1) Regression Model y = β 0 + β 1 x 1 + β 2 x 2 + β 12 x 1 x 2 + β 11 x 2 2 11 + β 22 x 22 Possible choice to model a curvature in the response function It is not the most efficient way to model a quadratic relationship Response surface designs 2 k design augmented with central points (curvature)

The effects model y ijk = μ + τ i + β j + τβ ij + ε ijk The means model y ijk = μ ijk + ε ijk i = 1,2,, a j = 1,2,, b k = 1,2,, n where the mean of the ijth cell is μ ij = μ + τ i + β j + τβ ij i = 1,2,, a j = 1,2,, b k = 1,2,, n

In the two-factor factorial design we are interested in testing hypothesis abou the equality of row treatment effects, say H 0 : τ 1 = τ 2 = = τ a H 1 : at least one τ i 0 and the equality of column treatment effects, say H 0 : β 1 = β 2 = = β a H 1 : at least one β i 0 We are also interested in determining whether row and column treatment interact H 0 : τβ ij = 0 for all i, j H 1 : at least one τβ ij 0

Source of Variation Sum of Squares Degrees of Freedom Mean Square F 0 A treatments SS A a 1 SS A a 1 B treatments SS B b 1 SS B b 1 Interaction SS AB a 1 b 1 SS AB a 1 b 1 MS A MS E MS B MS E MS AB MS E Error SS E ab n 1 SS E ab n 1 Total SS T abn 1 SS A = 1 bn SS B = 1 an a i=1 b j=1 y 2 i.. y 2... abn y 2.j. y 2... abn SS AB = SS Subtotals SS A SS B SS Subtotals = 1 n a b i=1 j=1 y 2 ij. y 2... abn SS E = SS T SS A SS B SS AB SS T = a b n i=1 j=1 k=1 2 y ijk y 2... abn

A battery design experiment An engineer is designing a battery for use in a device that will be subjected to some extreme variations in temperature Three plate materials for the battery 3 2 factorial design Material Type Temperature ( F) 15 70 125 1 130 155 34 40 20 70 74 180 80 75 82 58 2 150 188 136 122 25 70 159 126 106 115 58 45 3 138 110 174 120 96 104 168 160 150 139 82 60

Material Type 1 2 3 y.j. Temperature ( F) 15 70 125 130 155 34 40 20 70 539 74 180 80 75 82 58 150 188 136 122 25 70 159 126 106 115 58 45 138 110 174 120 96 104 342 168 160 150 139 82 60 1738 1291 770 y i.. 998 1300 1501 y... = 3799 SS Material = 1 bn a i=1 y 2 i.. y 2... abn = 1 3 4 9982 +.. +1501 2 37992 36 = 10,683.72 SS Temperature = 1 3 4 17382 +.. +770 2 37992 36 = 39,118.72 SS Interaction = 1 4 5392 +.. +342 2 37992 36 10,683.72 39,118.72 SS T = 130 2 + 155 2 + + 60 2 37992 36 = 77,646.97 SS E = 77,646.97 10,683.72 39,118.72 9,613.78 = 18,230.75

Source of Variation Sum of Squares Degrees of Freedom Mean Square F 0 P-Value Material types 10,683.72 2 6,341.86 7.91 0.0020 Temperature 39,118.72 2 19,559.36 28.97 <0.0001 Interaction 9,613.78 4 2,403.44 3.56 0.0186 Error 18,230.75 27 675.21 Total 77,646.97 35 The main effects of material type and temperature are significant; The interaction effect is also significant

> library(doe.base) > design<-fac.design(nlevels=3, nfactors=2, + factor.names = list(temp=c(-1,0,1), + Mat=c(-1,0,1)), + replications=4, randomize=false) > design run.no run.no.std.rp Temp Mat Blocks 1 1 1.1-1 -1.1 2 2 2.1 0-1.1 3 3 3.1 1-1.1 4 4 4.1-1 0.1 5 5 5.1 0 0.1 6 6 6.1 1 0.1 7 7 7.1-1 1.1 8 8 8.1 0 1.1 9 9 9.1 1 1.1 10 10 1.2-1 -1.2 11 11 2.2 0-1.2 12 12 3.2 1-1.2 13 13 4.2-1 0.2 14 14 5.2 0 0.2 15 15 6.2 1 0.2 16 16 7.2-1 1.2 17 17 8.2 0 1.2 18 18 9.2 1 1.2 19 19 1.3-1 -1.3 20 20 2.3 0-1.3 21 21 3.3 1-1.3 22 22 4.3-1 0.3 23 23 5.3 0 0.3 24 24 6.3 1 0.3 25 25 7.3-1 1.3 26 26 8.3 0 1.3 27 27 9.3 1 1.3 28 28 1.4-1 -1.4 29 29 2.4 0-1.4 30 30 3.4 1-1.4 31 31 4.4-1 0.4 32 32 5.4 0 0.4 33 33 6.4 1 0.4 34 34 7.4-1 1.4 35 35 8.4 0 1.4 36 36 9.4 1 1.4 class=design, type= full factorial NOTE: columns run.no and run.no.std.rp are annotation, not part of the data frame

> y<-c(130,34,20,150,136,25,138,174,96, + 155,40,70,188,122,70,110,120,104, + 74,80,82,159,106,58,168,150,82, + 180,75,58,126,115,45,160,139,60) > design<-add.response(design=design,response=y) > summary(aov(y~temp*mat, data=design)) Df Sum Sq Mean Sq F value Pr(>F) Temp 2 39119 19559 28.968 1.91e-07 *** Mat 2 10684 5342 7.911 0.00198 ** Temp:Mat 4 9614 2403 3.560 0.01861 * Residuals 27 18231 675 --- Signif. codes: 0 *** 0.001 ** 0.01 * 0.05. 0.1 1

Tukey s Test Interaction is significant and comparisons between the means of one factor (e.g., A) may be obscured by the AB interaction One approach is to fix factor B at a specific level and apply Tukey s test to the means of factor A at that level Ex: To detect the differences among the means of the three material types: temperature level 2 (70 F) y 12. = 57.25; y 22. = 119.75; y 32. = 145.75 [XLS] Statistical Tables & Calculators T 0.05 = q 0.05,3,27 MS E n = 3.51 675.21 4 3 vs 1: 145.75 57.25 = 88.50 3 vs 2: 145.75 119.75 = 26.00 2 vs 1: 119.75 57.25 = 62.50 = 45.47

> x1<-as.numeric(levels(design$temp)[design$temp]) > x2<-as.numeric(levels(design$mat)[design$mat]) > x1x1<-x1*x1 > x2x2<-x2*x2 > model<-lm(y~x1*x2+x1x1+x2x2) > summary(model) Call: lm.default(formula = y ~ x1 * x2 + x1x1 + x2x2) Residuals: Min 1Q Median 3Q Max -53.160-18.248 2.601 16.913 52.840 Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) 110.389 11.282 9.785 7.58e-11 *** x1-40.333 6.179-6.527 3.23e-07 *** x2 20.958 6.179 3.392 0.00197 ** x1x1-3.083 10.703-0.288 0.77527 x2x2-4.208 10.703-0.393 0.69696 x1:x2 4.687 7.568 0.619 0.54035 --- Signif. codes: 0 *** 0.001 ** 0.01 * 0.05. 0.1 1 Residual standard error: 30.27 on 30 degrees of freedom Multiple R-squared: 0.6459, Adjusted R-squared: 0.5869 F-statistic: 10.95 on 5 and 30 DF, p-value: 4.638e-06

Model Adequacy Checking model1<-lm(y~x1+x2, data=design) summary(model1) par(mfrow=c(2,2)) plot(model1)

> library(plot3d) > x<-seq(-1,1,by=.1) > y<-seq(-1,1,by=.1) > M<-mesh(x,y) > par(mar = c(2, 2, 2, 2)) > surf3d(x=m$x,y=m$y, + z=110.389-40.333*m$x+20.958*m$y, + bty="b2",xlab="x1",ylab="x2", + zlab="y", + main=expression("y=110.389-40.333x"[1]*"+20.958x"[2]))

Montgomery, pg. 398

> design<-fac.design(nlevels=3, nfactors=3, + factor.names = list(noozle=c(-1,0,1), + Speed=c(-1,0,1), + Pressure=c(-1,0,1)), + replications=2, randomize=false) > y<-c(-35,17,-39,-45,-65,-55,-40,20,15, + 110,55,90,-10,-55,-28,80,110,110, + 4,-23,-30,-40,-64,-61,31,-20,54, + -25,24,-35,-60,-58,-67,15,4,-30, + 75,120,113,30,-44,-26,54,44,135, + 5,-5,-55,-30,-62,-52,36,-31,4) > design<-add.response(design=design,response=y) > summary(aov(y~noozle*speed*pressure, data=design)) Df Sum Sq Mean Sq F value Pr(>F) Noozle 2 994 497 1.165 0.327102 Speed 2 61190 30595 71.735 1.57e-11 *** Pressure 2 69105 34553 81.014 3.89e-12 *** Noozle:Speed 4 6301 1575 3.693 0.015950 * Noozle:Pressure 4 7514 1878 4.404 0.007187 ** Speed:Pressure 4 12854 3214 7.535 0.000327 *** Noozle:Speed:Pressure 8 4629 579 1.357 0.259496 Residuals 27 11516 427 --- Signif. codes: 0 *** 0.001 ** 0.01 * 0.05. 0.1 1

> x1<-as.numeric(levels(design$noozle)[design$noozle]) > x2<-as.numeric(levels(design$speed)[design$speed]) > x3<-as.numeric(levels(design$pressure)[design$pressure]) > x1x1<-x1*x1 > x2x2<-x2*x2 > x3x3<-x3*x3 > model<-lm(y~x1*x2*x3+x1x1+x2x2+x3x3) > summary(model) Call: lm.default(formula = y ~ x1 * x2 * x3 + x1x1 + x2x2 + x3x3) Residuals: Min 1Q Median 3Q Max -56.556-26.297 6.281 19.684 54.111 Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) 1.556 10.954 0.142 0.888 x1-3.111 5.071-0.614 0.543 x2 6.250 5.071 1.233 0.224 x3 3.333 5.071 0.657 0.514 x1x1 7.333 8.783 0.835 0.408 x2x2 70.583 8.783 8.036 4.21e-10 *** x3x3-75.667 8.783-8.615 6.49e-11 *** x1:x2 8.417 6.211 1.355 0.182 x1:x3-5.208 6.211-0.839 0.406 x2:x3 4.208 6.211 0.678 0.502 x1:x2:x3 3.813 7.606 0.501 0.619 --- Signif. codes: 0 *** 0.001 ** 0.01 * 0.05. 0.1 1 Residual standard error: 30.43 on 43 degrees of freedom Multiple R-squared: 0.7714, Adjusted R-squared: 0.7182 F-statistic: 14.51 on 10 and 43 DF, p-value: 8.554e-11

Regression Model (x 1 = nozzle, x 2 = speed, x 3 = pressure in coded units) y = 1.56 3.11x 1 + 6.25x 2 + 3.33x 3 + 8.42x 1 x 2 5.21x 1 x 3 + 4.21x 2 x 3 + 7.33x 1 2 + 70.58x 2 2 75.67x 3 2 + 3.81x 1 x 2 x 3 Nozzle Type (qualitative factor) 1 (x 1 = 1): y = 12 2.17x 2 + 8.54x 3 + 0.4x 2 x 3 + 70.58x 2 2 75.67x 3 2 2 (x 1 = 0): y = 1.56 + 6.25x 2 + 3.33x 3 + 4.21x 2 x 3 + 70.58x 2 2 75.67x 3 2 3 (x 1 = +1): y = 5.78 + 14.67x 2 1.88x 3 + 8.02x 2 x 3 + 70.58x 2 2 75.67x 3 2

> library(plot3d) > x<-seq(-1,1,by=.1) > y<-seq(-1,1,by=.1) > M<-mesh(x,y) > par(mar = c(2, 2, 2, 2)) > surf3d(x=m$x,y=m$y, + z=70.583*m$x*m$x+-75.667*m$y*m$y, + bty="b2",xlab="x2",ylab="x3", + zlab="y", + main=expression("y=70.583x"[2]^2*"-75.667x"[3]^2))

> contour2d(x=x,y=y, + z=70.583*m$x*m$x-75.667*m$y*m$y, + xlab="x2",ylab="x3",lwd=3, + drape = FALSE, + main=expression("y=70.583x"[2]^2*"- 75.667X"[3]^2)) The smallest observed contour is -60 (the objetive is to minimize syrup loss)

Factorial with Mixed Levels Two-level factorial and fractional designs are of great practical importance The three-level system is much less useful because the designs are relatively large even for a modest number of factors In some situations it is necessary to include a factor (or a few factors) that has more than two levels

Example Data As an example, assume that you conducted an experiment in which you were interested in the extent to which visual distraction affects younger and older people's learning and remembering. To do this, you obtained a group of younger adults and a separate group of older adults and had them learn under three conditions (eyes closed, eyes open looking at a blank field, eyes open looking at a distracting field of pictures). This is a 2 (age) x 3 (distraction condition) mixed factorial design. The scores on the data sheet below represent the number of words recalled out of ten under each distraction condition. http://eweb.furman.edu/~lpace/spss_tutorials/lesson9.html

> design2x3 <- regular.design(factors=list(age=1:2,distraction=1:3), + model=~age*distraction, + nunits=24) > print(design2x3@design) Age Distraction 1 1 1 2 1 2 3 1 3 4 1 1 5 1 2 6 1 3 7 1 1 8 1 2 9 1 3 10 1 1 11 1 2 12 1 3 13 2 1 14 2 2 15 2 3 16 2 1 17 2 2 18 2 3 19 2 1 20 2 2 21 2 3 22 2 1 23 2 2 24 2 3

> design2x3@design$y<-c(8,5,3, + 7,6,6, + 8,7,6, + 7,5,4, + 6,5,2, + 5,5,4, + 5,4,3, + 6,3,2) > design2x3.aov <- aov(y~age*distraction, data=design2x3) > summary(design2x3.aov) Df Sum Sq Mean Sq F value Pr(>F) Age 1 20.167 20.167 21.353 0.000212 *** Distraction 2 30.333 15.167 16.059 9.94e-05 *** Age:Distraction 2 0.333 0.167 0.176 0.839656 Residuals 18 17.000 0.944 --- Signif. codes: 0 *** 0.001 ** 0.01 * 0.05. 0.1 1 The test indicates: a) there is a significant effect of the age condition on word memory. b) that there is a significant effect of the distraction condition on word memorization; c) the lack of an interaction between distraction and age indicates that this effect is consistent for both younger and older participants.

Additional Designs Nested (or Hierarchical) Designs The levels of one factor (e.g. B) are similar but not identical for different levels of another factor (e.g. A) The levels of factor B are nested under the levels of factor A Example Two-stage nested design: suppliers batches observation» A company purchases raw material from three different suppliers» Is the purity of the raw material the same from each supplier?» There are four batches of raw material available from each supplier, and three determinations of purity from each batch

Why it is not a factorial example?» The batches from each supplier are unique from that particular supplier Batch 1 from supplier 1 has no connection with batch 1 from supplier 2, batch 2 from supplier 1 has no connection with batch 2 from supplier 2, and so forth.» To emphasize the fact that the batches from each supplier are different batches we may rename the batches:

Consider a company that buys raw material in batches from three different suppliers. The purity of this raw material varies considerably, which causes problems in manufacturing the finished product. We wish to determine whether the variability in purity is attributable to differences between suppliers. Four batches of raw material are selected at random from each supplier, and three determinations of purity are made on each batch.

P-Values: There is no signficant effect on purity due to suppliers The purity of batches of raw material from the same supplier does differ significantly

Split-Plot Designs In some experiments, we may be unable to completely randomize the order of the run Whole plots or main treatments Subplots or split-plots

Ex: a paper manufacturer is interested in three different pulp preparation methods and four differen cooking temperatures to study these effects on the tensile strength of the paper Each replicate requires 12 observations, and the experimenter has decided to run 3 replicates 36 runs A batch of pulp is produced by one of the three methods This batch is divided into four samples, and each sample is cooked at one of the four temperatures Then a second batch of pulp is made up using another of the three methods...

Why it is not a factorial example (three levels of preparation method, factor A, four levels of temperature, factor B)? If this is the case, then the order of the experimentation within each replicate should be completely randomized Treatment combinations (preparation method and temperature) should be randomly selected But, the experimenter did no collect the data this way He made up a batch of pulp and obtained observations for all temperatures from that batch The only feasible way to run this experiment: preparing the bathes (economics) and the size of the batches A completely randomized factorial experiment: 36 batches of pulp Split-plot design: 9 batches total

9 whole plots (preparation methods) 4 subplot treatments (temperatures)

The linear model y ijk = μ + τ i + β j + τβ ij + γ k + τγ ik + βγ jk + τβγ ijk + ε ijk i = 1,2,, r j = 1,2,, a k = 1,2,, b where τ i, β j, and τβ ij represent the whole plot and correspond, respectively, to replicates, main treatments (factor A), and whole-plot error (replicates A); and γ k, τγ ik, βγ jk, and τβγ ijk represent the subplot and correspond, respectively, to the subplot treatment (factor B), the replicates B and AB interactions, and the subplot error (replicates AB).

The split-plot design has an agricultural heritage Whole plots: large areas of land Subplots: smaller areas of land within large areas Ex: several varieties of a crop could be planted in different fields (whole plots), one variety to a field Each field could be divided into, say, four subplots, and each subplot could be treated with different type os fertilizer. Here the crop varietes are the main treatments and the different fertilizers are the subtreatments.

Lab