Design of Experiments: III. Hasse diagrams in designed experiments

Similar documents
Design of Experiments: I, II

Automatic analysis of variance for orthogonal plot structures

Abstract. The design key in single- and multi-phase experiments. Introduction of the design key. H. Desmond Patterson

Analysis of Variance and Co-variance. By Manza Ramesh

John Ashworth Nelder Contributions to Statistics

Small Units inside Large Units

Business Statistics. Lecture 10: Correlation and Linear Regression

STAT 350: Geometry of Least Squares

The definition of a vector space (V, +, )

Sleep data, two drugs Ch13.xls

An approach to the selection of multistratum fractional factorial designs

MATH 1120 (LINEAR ALGEBRA 1), FINAL EXAM FALL 2011 SOLUTIONS TO PRACTICE VERSION

Stat 579: Generalized Linear Models and Extensions

CHAPTER 2 POLYNOMIALS KEY POINTS

Design of Engineering Experiments Chapter 5 Introduction to Factorials

Written Exam (2 hours)

Figure 9.1: A Latin square of order 4, used to construct four types of design

SKILL BUILDER TEN. Graphs of Linear Equations with Two Variables. If x = 2 then y = = = 7 and (2, 7) is a solution.

Proof of Radon-Nikodym theorem

Simple Linear Regression

Statistics Univariate Linear Models Gary W. Oehlert School of Statistics 313B Ford Hall

This module focuses on the logic of ANOVA with special attention given to variance components and the relationship between ANOVA and regression.

PLSC PRACTICE TEST ONE

A discussion on multiple regression models

Factorial designs. Experiments

Ð"Ñ + Ð"Ñ, Ð"Ñ +, +, + +, +,,

Topic 7: Incomplete, double-blocked designs: Latin Squares [ST&D sections ]

MATH 31 - ADDITIONAL PRACTICE PROBLEMS FOR FINAL

20g g g Analyze the residuals from this experiment and comment on the model adequacy.

SCIE 4101 Spring Math Review Packet #2 Notes Algebra I

Combinatorics for algebraic geometers

Announcements Wednesday, November 01

with the usual assumptions about the error term. The two values of X 1 X 2 0 1

Mean Comparisons PLANNED F TESTS

Six Sigma Black Belt Study Guides

5.) For each of the given sets of vectors, determine whether or not the set spans R 3. Give reasons for your answers.

Design & Analysis of Experiments 7E 2009 Montgomery

Chapter 13: Analysis of variance for two-way classifications

VCE. VCE Maths Methods 1 and 2 Pocket Study Guide

Analysis of variance using orthogonal projections

Assignment 1 Math 5341 Linear Algebra Review. Give complete answers to each of the following questions. Show all of your work.

3.4. A computer ANOVA output is shown below. Fill in the blanks. You may give bounds on the P-value.

CORRELATION AND REGRESSION

Solving Equations Quick Reference

SCIE 4101 Fall Math Review Packet #2 Notes Patterns and Algebra I Topics

The linear model. Our models so far are linear. Change in Y due to change in X? See plots for: o age vs. ahe o carats vs.

MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS FALL 1999

Chapter 3. Vector spaces

Extra Problems for Math 2050 Linear Algebra I

CHAPTER 1: Functions

OHSU OGI Class ECE-580-DOE :Design of Experiments Steve Brainerd

Correlation Analysis

, a 1. , a 2. ,..., a n

21.0 Two-Factor Designs

Linear Models Review

Experimental design (DOE) - Design

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Conceptual Questions for Review

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras

Chapter 5 Introduction to Factorial Designs Solutions

Math 520 Exam 2 Topic Outline Sections 1 3 (Xiao/Dumas/Liaw) Spring 2008

23. Fractional factorials - introduction

Vector Spaces 4.5 Basis and Dimension

Order of Operations. Real numbers

Maths Class 11 Chapter 5 Part -1 Quadratic equations

5.3. Exercises on the curve analysis of polynomial functions

Chapter 4. Regression Models. Learning Objectives

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Outline for today. Two-way analysis of variance with random effects

What If There Are More Than. Two Factor Levels?

Maths 212: Homework Solutions

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

2-7 Solving Quadratic Inequalities. ax 2 + bx + c > 0 (a 0)

Taguchi Method and Robust Design: Tutorial and Guideline

Solution to Final Exam

Topic 9: Factorial treatment structures. Introduction. Terminology. Example of a 2x2 factorial

MODULE 8 Topics: Null space, range, column space, row space and rank of a matrix

Chapter 4: Regression Models

Preliminary/Qualifying Exam in Numerical Analysis (Math 502a) Spring 2012

Do not copy, post, or distribute

Factorial designs (Chapter 5 in the book)

1 Topology Definition of a topology Basis (Base) of a topology The subspace topology & the product topology on X Y 3

Willmar Public Schools Curriculum Map

Fractional Factorial Designs

Linear algebra I Homework #1 due Thursday, Oct Show that the diagonals of a square are orthogonal to one another.

DESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

Orthogonal, Planned and Unplanned Comparisons

The eigenvalues are the roots of the characteristic polynomial, det(a λi). We can compute

Mathematics 1 Lecture Notes Chapter 1 Algebra Review

Introduction to bivariate analysis

STUDY GUIDE Math 20. To accompany Intermediate Algebra for College Students By Robert Blitzer, Third Edition

Algebraic Methods in Combinatorics

Chapter 6 - Orthogonality

STAT 525 Fall Final exam. Tuesday December 14, 2010

22 Approximations - the method of least squares (1)

The absolute value (modulus) of a number

5.3. Polynomials and Polynomial Functions

COVARIANCE ANALYSIS. Rajender Parsad and V.K. Gupta I.A.S.R.I., Library Avenue, New Delhi

Graphs of polynomials. Sue Gordon and Jackie Nicholas

MATHEMATICAL METHODS UNIT 1 Chapter 1 Reviewing Linear Equations Chapter 2 Coordinate geometry & linear relations

Transcription:

Design of Experiments: III. Hasse diagrams in designed experiments R. A. Bailey r.a.bailey@qmul.ac.uk 57a Reunião Anual da RBras, May 202 Outline I: Factors II: Models I. Factors Part I Factors pen. Each pen was allocated to a certain of feed. Batches of this of feed were put into the pen; calves were free to eat as much of this as they liked. Calves were weighed individually. Wine Four wines are tasted and evaluated by each of eight judges. A plot is one tasting by one judge. Experiment on cultivars and fertilizer Partial order Infimum and Supremum Orthogonality Structures defined by factors Skeleton anova. Examples Three experiments Ladybirds Three pesticides were compared for their side-effects on ladybirds. A field was divided into three areas and one pesticide applied to each area. Ladybirds were counted on three samples from each area. Calves The treatments were 4 different feeds for calves. The calves were not fed individually. They were housed in 8 pens, with 0 calves per 0 60 240 60 80 0 60 80 80 0 60 80 80 0 60 240 0 240 240 240 0 80 240 60 Cropper Melba Melle Melba Cropper Melle.2 Factors Factors, levels and classes A factor F is a function for which we are more interested in knowing whether F(α) = F(β) than in knowing the value F(α).

So we can regard a factor as a partition into levels or classes. Let Ω = the set of observational units, and let F be a factor on Ω. We define the F-class containing α to be F[[α]] = {ω Ω : F(ω) = F(α)}. For example, if the observational units are plots in several fields, and F is the factor for fields and α is a plot, then F[[α]] consists of all the plots in the same field as the plot α. A factor is uniform if all of its classes have the same size. Cultivar example 0 60 240 60 80 0 60 80 80 0 60 80 80 0 60 240 0 240 240 240 0 80 240 60 Cropper Melba Melle Melba Cropper Melle E = plot strip field U strip cultivar Aliasing We say that F is aliased with G (written F G) if every F-class is also a G-class. Some aliasing is just renaming of levels: for example, F is days with names Monday, Tuesday,..., and G is dates with names, 2,.... Some aliasing is deliberate: for example, assign each fruit-picker to pick the apples from a single row in the orchard. Two special factors The universal factor U has just one class; it has the same level everywhere. The Hasse diagram There is one point for each factor. Write F G Draw G F Convention Write n F, the number of levels of F, beside the dot for F. The equality factor E has one level per observational unit; so each class consists of a single observational unit.. Partial order The partial order We say that F is finer than G (or G is coarser than F) (written F G) if every F-class is contained in a G-class but F G. We write F G if F G or F G. So E F U for every factor F. 2

Cultivar example: Hasse diagram on plot factors 0 60 240 60 80 0 60 80 80 0 60 80 Infimum on the Hasse diagram F G is the factor whose classes are the nonempty intersections of F-classes with G-classes. It is finer than both F and G: F G F and F G G. It has the least number of levels subject to this. 80 0 60 240 0 240 240 240 0 80 240 60 Cropper Melba Melle Melba Cropper Melle F G F G Infimum example on treatment factors U 2 field 6 strip cultivar fertilizer 4 treatment 2 cultivar fertilizer = treatment.4 Inf and sup 24 E Infimum of two factors Given two factors F and G, the factor F G is defined by (F G)[[ω]] = F[[ω]] G[[ω]]. Cultivar example cultivar fertilizer = treatment field cultivar = strip Supremum of two factors Given two factors F and G, the factor F G is the finest factor whose classes are unions of F-classes and unions of G-classes. If you try to fit F and G in a linear model, you will get into trouble unless you fit F G first. Cultivar example field fertilizer = U strip treatment = cultivar

Supremum on the Hasse diagram F G is the factor which is coarser than both F and G: F F G and G F G, and which has the largest number of levels subject to this. Its classes are the connected components of the graph whose edges are pairs with the same level of either F or G. F F G G Supremum example with mixed factors 0 60 240 60 80 0 60 80 80 0 60 80 Hasse diagram for factors on the observational units 2 6 24 U field strip E How many of each are there? Hasse diagram for factors on the treatments cultivar U = fertilizer cultivar fertilizer 4 2 T = fertlizer cultivar 80 0 60 240 0 240 Factorial treatments plus control 240 240 0 80 240 60 Cropper Melba Melle Melba Cropper Melle none single double Z S K M N cultivar 6 2 strip treatment strip treatment = cultivar = fumigant 2 9 U fumigant 5 treatment 4

Drugs at different stages of development A pharmaceutical company wants to compare 6 treatments for a certain disease. There are are different s of drug A, that has been under development for some time, and different s (not comparable with the previous ) of a new drug B, that has not been so extensively studied. A A2 A B B2 B U drug 2 2 2 A 2 4 4 4 B 4 4 4 2 E 2 4 5 6 U Orthogonality: general case F is orthogonal to G if, within each class of F G separately, F-classes meet G-classes proportionately. then In general, this condition means that, whenever (F G)(α) = (F G)(β), F[[α]] G[[β]] = and the following proportionality equation holds: (F G)[[α]] F[[α]] G[[α]] = (F G)[[β]] F[[β]] G[[β]]. A 4 2 6 drug B 4 E Three different possibilities, all orthogonal Z S K M N none 6 single 4 4 4 4 double 4 4 4 4.5 Orthogonality Orthogonality: special cases Let F and G be factors on the same set. F is orthogonal to G if, within each class of F G separately, F-classes meet G-classes proportionately. Some special cases We deem F to be orthogonal to itself. If F G, then F is orthogonal to G. If every F-class intersects every G-class and F G is uniform, then F is orthogonal to G. Z S K M N none 0 single 4 5 4 double 4 5 4 Z S K M N none 25 single 6 9 6 double 2 2 5

.6 Structures Orthogonal treatment structure An orthogonal treatment structure is a set G of factors on the set of treatments such that. U G; 2. if F G and G G then F G G;. if F G and G G then F is orthogonal to G. Factorial treatments plus control Z S K M N none single double = fumigant For each factor F in G, let W F be the set of vectors which, U are constant on every level of F and sum to zero on every level of G whenever F G and G G. Put d F = dim W F = degrees of freedom for F. Theorem. For an orthogonal treatment structure, the spaces W F are orthogonal to each other and d F = n F d G. G F Hasse diagram for factors on the treatments cultivar, 2, 2, 6 U = fertilizer cultivar fertilizer 4, T = fertlizer cultivar, 2, 9, fumigant 5, treatment Drugs at different stages of development A A2 A B B2 B U drug 2 2 2 A 2 4 4 4 B 4 4 4 2 E 2 4 5 6, U Degrees of freedom calculated by subtraction A 4, 2 2, drug B 4, 2 6, 0 E 6

Orthogonal plot structure An orthogonal plot structure is a set F of factors on the observational units such that. every factor in F is uniform; 2. U F;. E F; 4. if F F and G F then F G F; 5. if F F and G F then F G F; 6. if F F and G F then F is orthogonal to G. Conditions 2, 4 and 6 are the same as those for an orthogonal treatment structure, so if we define W F and d F as before then we have Theorem 2. For an orthogonal plot structure, the spaces W F are orthogonal to each other and d F = n F d G. G F Hasse diagram for factors on the observational units, 2, 6, 4 24, 8 U field strip E Covariance matrix How many of each are there? Degrees of freedom calculated by subtraction Theorem. If the observational units have an orthogonal plot structure, and the covariance of Y i and Y j depends only on which factors have the same level on i and j, then the eigenspaces of the covariance matrix (that is, the strata) are the spaces W F. This implies that, if we project the data onto any one stratum W F, then estimation and testing within that stratum proceed just as they do when the covariance matrix is Iσ 2. Null analysis of variance The null analysis of variance is a table showing the strata and their degrees of freedom., 2, 6, 4 24, 8 stratum U field strip E df U field strip 4 plot 8 Total 24 Orthogonal design An orthogonal design consists of a set of treatments (which affect expectation) having an orthogonal treatment structure a set of plots (which affect covariance) having an orthogonal plot structure an allocation of treatments to plots which satisfies treatment factors remain orthogonal to each other, even if treatments are not equally replicated treatment factors are orthogonal to plot factors if F is a plot factor and G is a treatment factor then F G is a treatment factor. 7

.7 Skeleton anova Combined Hasse diagram Draw the combined Hasse diagram using 2 different colours of dot. Each treatment dot G is immediately above a unique plot dot, say F. Estimate the effect of W G in stratum W F. Expand the null analysis of variance to the skeleton analysis of variance by allocating each treatment subspace to the stratum in which the corresponding treatment effect is estimated. Combined Hassed diagram for cultivar example U field cultivar fertilizer strip T E Skeleton analysis of variance for cultivar example stratum df source df U mean field field strip 4 cultivar 2 residual 2 plot 8 fertilizer Total 24 cultivar fertilizer 6 residual 9 Wine example: combined Hasse diagram Four wines are tasted and evaluated by each of eight judges. A plot is one tasting by one judge. U judge 8 4 wine 2 E The main effect of cultivar is estimated in the strip stratum. The main effect of fertilizer and the cultivar-byfertilizer interaction are both estimated in the plot stratum. 8

Calves example: combined Hasse diagram The treatments were 4 different feeds for calves. The calves were not fed individually. They were housed in 8 pens, with 0 calves per pen. Each pen was allocated to a certain of feed. Batches of this of feed were put into the pen; calves were free to eat as much of this as they liked. Calves were weighed individually. 4 8 U feed pen Linear models A linear model for variables Y,..., Y n has the form E(Y) M and Cov(Y) = σj 2 Q j, j where M is a known subspace of R n and the matrices Q j are known. For brevity, let us call M the model. Example model with one treatment factor Plot i has treatment f (i) and E(Y i ) = τ f (i). 80 E Then M consists of all vectors v with the property that Ladybirds example: combined Hasse diagram Three pesticides were compared for their sideeffects on ladybirds. A field was divided into three areas and one pesticide applied to each area. Ladybirds were counted on three samples from each area. and v i = v j if f (i) = f (j); dim M = number of treatments. 2 II. Models area 9 U pesticide E Example of a polynomial model There is a covariate x taking value x i on plot i, and E(Y i ) = a + bx i + cx 2 i. Then M consists of all vectors which are linear combinations of the three vectors (,,,..., ), (x, x 2, x,..., x n ) and (x 2, x2 2, x2,..., x2 n); and dim M =. Part II Expectation Models Partial order Orthogonality Examples Scaled Hasse digram 2. Partial order The partial order Suppose that M and M 2 are subspaces of R n. Write M < M 2 if M is a subspace of M 2 but M = M 2. Write M M 2 if M < M 2 or M = M 2. 9

The Hasse diagram on models There is one point for each model. Write M < M 2 Draw M2 M Convention Write dim M beside the dot for M. Family of models: intersection Let M be the family of expectation models that we are considering fitting to the data. Principle. If M and M 2 are both in M then M M 2 must also be in M. Why? If not, we might find that our data vector y is in M M 2 and then we cannot choose between M and M 2. (This principle is sometimes violated in fractional factorial designs.) M M 2 Family of models: sum M M 2 M + M 2 = {v + w : v M and w M 2 } Principle 2. If M and M 2 are both in M then M + M 2 must also be in M. Why? If not, we might find that v and w are allowed vectors of fitted values but v + w is not. (People who fit sparse models do not believe this.) 2.2 Orthogonality Family of models: orthogonality If M and M 2 are in M then in general they cannot be orthogonal to each other because they both contain M M 2. Principle. If M and M 2 are both in M and M 0 = M M 2 then M M0 should be orthogonal to M 2 M0. Why? If not, various paradoxes are possible in fitting the data. (Again, in practice, this is often ignored.) 2. Examples Hasse diagram of some polynomial models 4 2 cubic polynomial quadratic polynomial straight line constant Warning: the best-fitting quadratic polynomial is not usually obtained by taking the best-fitting cubic polynomial and removing the term in x. M + M 2 M M 2 Theorem 4. Two warnings If the models are defined by factors then the Hasse diagram for the models is the opposite way up from the Hasse diagram for the factors dim(m + M 2 ) = dim M + dim M 2 dim(m M 2 ). there may be more models than factors. 0

Drugs at different stages of development A A2 A B B2 B U drug 2 2 2 A 2 4 4 4 B 4 4 4 2 tmt 2 4 5 6 A 4 6 treatment B 4 2 drug constant Factorial: cultivars and fertilizer 2 treatments are all combinations of: factor levels Cultivar (C) Cropper, Melle, Melba Fertilizer (F) 0, 80, 60, 240 kg/ha 2 treatment Factorial treatments plus control none single double Z S K M N = fumigant 9 treatment 6 + 5 2 fumigant constant 2.4 Scaled Hasse diagram ANOVA table Each line of the ANOVA table corresponds to a difference between models, showing the difference between the sums of squares of their fitted values, and the difference between their dimensions. C 6 C + F F 4 constant We can summarize the ANOVA table graphically by scaling the lines in the Hasse diagram.

An experiment on biodiversity A, B, C, D, E, F six s of freshwater shrimp. Put 2 shrimps in a jar containing stream water and alder leaf litter. Measure how much leaf litter is eaten after 28 days. Experimental unit = jar. Richness Treatment Level 6 A,..., F monoculture 2 of A 5 AB,..., EF duoculture 6 of A, 6 of B 2 20 ABC,..., DEF triculture 4 of A, 4 of B, 4 of C 4 The Hasse diagram of our family of expectation The experiment was carried out in 4 blocks of 4 models jars. (4) Treatment What models did we fit? The biologist fitted the model Richness with parameters, one for each level of richness, and found no evidence of any differences between the levels. I suggested the model Type with 6 parameters α A,..., α F : monoculture A duoculture AB triculture ABC α A α A + α B 2 α A + α B + α C In other words, if there are x i shrimps of i then E(Y) = 6 a i x i where 2a i = α i. i= ( x i = 2 always, so no need for intercept.) (8) (8) Richness + Type Richness () Type (6) Constant () (blocks included in all models) Richness Type (a i can change with each level of richness but does not depend on what else is present) (add a different constant for each level of richness) 2

What the data showed: lengths are mean squares Treatment Richness Type Richness + Type. Type Conclusions: The model Richness does not explain the data. The model Type explains the data well. There is no evidence that any larger model does any better. Scale: residual mean square Richness. Constant Further Reading R. A. Bailey: Design of Comparative Experiments, Cambridge Series in Statistical and Probabilistic Mathematics, Cambridge University Press, Cambridge, 2008. J. Reiss, R. A. Bailey, D. M. Perkins, A. Pluchinotta & G. Woodward: Testing effects of consumer richness, evenness and body size on ecosystem functioning. Journal of Animal Ecology, published online May 20.