Chapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs

Similar documents
STAT 506: Randomized complete block designs

STAT 705 Chapter 19: Two-way ANOVA

STAT 705 Chapter 19: Two-way ANOVA

STAT 705 Chapter 16: One-way ANOVA

STAT 705 Chapters 23 and 24: Two factors, unequal sample sizes; multi-factor ANOVA

Analysis of Covariance

Stat 705: Completely randomized and complete block designs

Unit 6: Orthogonal Designs Theory, Randomized Complete Block Designs, and Latin Squares

Analysis of Covariance

Two-factor studies. STAT 525 Chapter 19 and 20. Professor Olga Vitek

IX. Complete Block Designs (CBD s)

Outline Topic 21 - Two Factor ANOVA

SAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model

1 Tomato yield example.

Statistics For Economics & Business

Nested Designs & Random Effects

Formal Statement of Simple Linear Regression Model

STAT 525 Fall Final exam. Tuesday December 14, 2010

Lec 5: Factorial Experiment

Lecture 9: Factorial Design Montgomery: chapter 5

Power & Sample Size Calculation

Statistics for exp. medical researchers Comparison of groups, T-tests and ANOVA

If we have many sets of populations, we may compare the means of populations in each set with one experiment.

STAT 705 Chapters 22: Analysis of Covariance

Stat 601 The Design of Experiments

CHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication

Ch 2: Simple Linear Regression

Unit 7: Random Effects, Subsampling, Nested and Crossed Factor Designs

Stat 217 Final Exam. Name: May 1, 2002

Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8]

Topic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction Review of regression concepts

Lecture 1 Linear Regression with One Predictor Variable.p2

Topic 13. Analysis of Covariance (ANCOVA) - Part II [ST&D Ch. 17]

VIII. ANCOVA. A. Introduction

Chapter 11 - Lecture 1 Single Factor ANOVA

Factorial and Unbalanced Analysis of Variance

Chap The McGraw-Hill Companies, Inc. All rights reserved.

PLS205!! Lab 9!! March 6, Topic 13: Covariance Analysis

PROBLEM TWO (ALKALOID CONCENTRATIONS IN TEA) 1. Statistical Design

Much of the material we will be covering for a while has to do with designing an experimental study that concerns some phenomenon of interest.

The Random Effects Model Introduction

Analysis of Variance and Design of Experiments-I

Outline. Topic 19 - Inference. The Cell Means Model. Estimates. Inference for Means Differences in cell means Contrasts. STAT Fall 2013

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

Unit 8: 2 k Factorial Designs, Single or Unequal Replications in Factorial Designs, and Incomplete Block Designs

STAT 115:Experimental Designs

STATS Analysis of variance: ANOVA

PLS205 Lab 6 February 13, Laboratory Topic 9

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Factorial designs. Experiments

The factors in higher-way ANOVAs can again be considered fixed or random, depending on the context of the study. For each factor:

Lecture 10: Experiments with Random Effects

Week 14 Comparing k(> 2) Populations

Least Squares Analyses of Variance and Covariance

Table 1: Fish Biomass data set on 26 streams

Lec 1: An Introduction to ANOVA

Outline. Topic 22 - Interaction in Two Factor ANOVA. Interaction Not Significant. General Plan

Factorial ANOVA. Psychology 3256

Confidence Intervals, Testing and ANOVA Summary

MAT3378 ANOVA Summary

Homework 2: Simple Linear Regression

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data

Topic 29: Three-Way ANOVA

Allow the investigation of the effects of a number of variables on some response

df=degrees of freedom = n - 1

Lecture 5: Comparing Treatment Means Montgomery: Section 3-5

Theorem A: Expectations of Sums of Squares Under the two-way ANOVA model, E(X i X) 2 = (µ i µ) 2 + n 1 n σ2

Lecture 10 Multiple Linear Regression

Correlation Analysis

Correlation and the Analysis of Variance Approach to Simple Linear Regression

3. Design Experiments and Variance Analysis

V. Experiments With Two Crossed Treatment Factors

Analysis of Variance. Read Chapter 14 and Sections to review one-way ANOVA.

One-way ANOVA (Single-Factor CRD)

STAT Final Practice Problems

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.

Two-Way Analysis of Variance - no interaction

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Two-Way ANOVA (Two-Factor CRD)

Multiple comparisons The problem with the one-pair-at-a-time approach is its error rate.

Lectures on Simple Linear Regression Stat 431, Summer 2012

Lecture 7 Randomized Complete Block Design (RCBD) [ST&D sections (except 9.6) and section 15.8]

Outline. Remedial Measures) Extra Sums of Squares Standardized Version of the Multiple Regression Model

ANOVA (Analysis of Variance) output RLS 11/20/2016

EX1. One way ANOVA: miles versus Plug. a) What are the hypotheses to be tested? b) What are df 1 and df 2? Verify by hand. , y 3

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

Topic 25 - One-Way Random Effects Models. Outline. Random Effects vs Fixed Effects. Data for One-way Random Effects Model. One-way Random effects

Introduction. Chapter 8

Two-Way Factorial Designs

SAS Program Part 1: proc import datafile="y:\iowa_classes\stat_5201_design\examples\2-23_drillspeed_feed\mont_5-7.csv" out=ds dbms=csv replace; run;

Homework 3 - Solution

Overview Scatter Plot Example

Contents. TAMS38 - Lecture 6 Factorial design, Latin Square Design. Lecturer: Zhenxia Liu. Factorial design 3. Complete three factor design 4

TWO-LEVEL FACTORIAL EXPERIMENTS: BLOCKING. Upper-case letters are associated with factors, or regressors of factorial effects, e.g.

Single Factor Experiments

STAT22200 Spring 2014 Chapter 8A

Definitions of terms and examples. Experimental Design. Sampling versus experiments. For each experimental unit, measures of the variables of

Transcription:

Chapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs Adapted from Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 14

Tukey s test for additivity Tukey s test for additivity provides a 1 df test for interaction, rather than a (a 1) (b 1) df test, which would be impractical for n ij 1. 0.5 The reduced model is additive: Y ij = µ + α i + β j + ɛ ij. The full model is Y ij = µ + α i + β j + Dα i β j + ɛ ij. This is more restrictive than using a general interaction (αβ) ij, leaving (a 1) (b 1) 1 = ab a b df to estimate error. 0.5 The full LS estimator would be nonlinear in our parameters. Instead, we solve for D separately, and then plug in LS estimates of α i and β j from the additive model. 2 / 14

Tukey s test for additivity a b i=1 j=1 ˆD = (Ȳ i Ȳ )(Ȳ j Ȳ )Y ij a i=1 (Ȳi Ȳ ) 2 b j=1 (Ȳ j Ȳ ). 2 SSAB = a b (αβ) ˆ 2 ij = i=1 j=1 a i=1 j=1 b ˆD 2 (Ȳ i Ȳ ) 2 (Ȳ j Ȳ ) 2, and SSTO=SSA+SSB+SSAB*+SSE*. F = if H 0 : D = 0 is true. SSAB SSE F (1, ab a b), /(ab a b) 3 / 14

Trick for additivity test Let Ŷ ij be fitted values from the additive model. Make a second pass through PROC GLM to fit the ANCOVA model Y ij = µ + α i + β j + γŷ 2 ij + ɛ ij. The test of H 0 : γ = 0 is the same as the test of H 0 : D = 0, the F-statistics are the same and the p-values are the same! 4 / 14

Confidence ratings Example (p. 895): Executives are exposed to one of three methods (treatment, i = 1 utility method, i = 2 worry method, i = 3 comparison method) of quantifying the maximum risk premium they would be willing to pay to avoid uncertainty in a business decision. The response variable is degree of confidence in the method on a scale from 0 (no confidence) to 20 (complete confidence). It is thought that confidence is related to age, so the subjects are blocked according to age (j = 1, 2, 3, 4, 5 from oldest to youngest). n T = 15 subjects are recruited, with three subjects in each of the 5 age categories. Within each age category, the three subjects are randomly given one of the three treatments. 5 / 14

Confidence ratings, pp. 895-896 data conf; input rating age$ method$ @@; datalines; 1 1 1 5 1 2 8 1 3 2 2 1 8 2 2 14 2 3 7 3 1 9 3 2 16 3 3 6 4 1 13 4 2 18 4 3 12 5 1 14 5 2 17 5 3 ; proc format; value $ac 1 = oldest 2 = age grp II 3 = age grp III 4= age grp IV 5= youngest ; value $mc 1 = utility 2 = worry 3= compare ; * first obtain interaction plot by fitting model V; * trajectories look reasonably parallel; proc glm data=conf plots=all; class age method; model rating=age method; run; 6 / 14

Confidence ratings * fit additive model; proc glm data=conf plots=all; class age method; format age $ac. method $mc.; model rating=age method / solution; output out=tukeytest p=p; * p=yhat values for Tukey s test; run; * Tukey test for additivity; * p-value=0.79 so model IV is okay; proc glm data=tukeytest; title Test for additivity is Type III p*p p-value ; class age method; model rating=age method p*p; run; 7 / 14

21.1 Randomized complete block designs Subjects are placed into homogeneous groups, called blocks. All treatment combinations assigned randomly to subjects within blocks. Example: Stream habitat is being tested for biodiversity. In an observational study, three different types of habitat along streams are tested: farmland, high-density rural, forested. Four streams are selected for testing: Myers Creek, Cedar Creek, Dry Branch and Toms Creek. For each stream and habitat type, a randomly selected 100-meter stretch is seined and electrofished to measure fish and macroinvertebrates. A single measure of biodiversity (IBI-Index of Biotic Integrity) is computed for each stream reach. The observational treatment here is Habitat, while the block is Stream. 8 / 14

RCBD designs, comments With thoughtful blocking, a RCBD can provide more precise results than completely randomized design. There is only one replication for each pairing of treatment and block; we need to assume no interaction between treatments and blocks to obtain an estimate of σ 2. Note, as in the example above, that some experiments lend themselves to a single replication by the nature of their response variable. If interaction were present, it has troubling implications for inference on the treatment. The blocking variable is observational, not experimental. We cannot infer a causal relationship. This is usually not a problem, since the experimenter care more about the treatment. It s a source of endless debates among statisticians though! 9 / 14

Model and inference One observation per block/treatment combination gives n T = n b r. We need to fit model IV to get SSE > 0 Y ij = µ + ρ }{{} i + τ j +ɛ ij, i = 1,..., n b, j = 1,..., r. }{{} Bl Trt Estimates are obtained via LS as usual, Q(ρ, τ ) = n b i=1 j=1 minimized subject to ρ nb = τ r = 0. r (Y ij [µ + ρ i + τ j ]) 2 10 / 14

ANOVA table Source SS df MS F Block SSBL = r r i=1 (Ȳ i Ȳ ) 2 n b 1 SSBL MSBL n b 1 MSBL.TR Trt SSTR = n rj=1 b (Ȳ j Ȳ ) 2 r 1 SSTR MSB r 1 MSE Error SSBL.TR = n b rj=1 i=1 (Y ij Ȳ i Ȳ j + Ȳ ) 2 (n b 1)(r 1) SSBL.TR (n b 1)(r 1) Total SSTO = n b rj=1 i=1 (Y ij Ȳ ) 2 n b r 1 Here, F = MSBL.TR MSBL tests H 0 : ρ 1 = = ρ nb = 0 (no blocking effect) and F = MSBL.TR MSTR tests H 0 : τ 1 = = τ r = 0 (no treatment effect). These appear in SAS as Type III tests. If we reject H 0 : τ j = 0, then we obtain inferences in treatment effects as usual, e.g. lsmeans B / pdiff adjust=tukey cl; 11 / 14

Diagnostics 1 Profile (spaghetti) plots of the Y ij vs. treatment j, connected by block i are useful. They should be somewhat parallel if the additive model is okay, but there is a lot of sampling variability here as ˆµ ij = Y ij. For this reason, the book actually recommends using ˆµ ij = Ȳi + Ȳ j Ȳ for inference. 2 Standard SAS diagnostic panel: e ij vs. Ŷ ij, normal probability plot of the {e ij }, etc. Can also look at e ij vs. either i or j, should show constant variance within blocks and treatments. 3 Friedman s test is a nonparametric test based on within-block ranks. Yes, it s named after Milton Friedman. This can be readily analyzed in PROC FREQ. 4 Tukey s test for additivity. 12 / 14

Diagnostics 1 Blocking can be assessed post hoc to see whether it was efficient. You can compare the error term in a blocked design (σ 2 b, estimated by MSBL.TR ) to σ2 r, the error term in a completely randomized design. The estimator for σ 2 r can be written a couple ways: ˆσ 2 r = (n b 1)MSBL + n b (r 1)MSBL.TR (n b r 1) = df blocksms blocks + (df Trt + df error )MSE df Blocks + df Trt + df error 2 The relative efficiency of blocking E = ˆσ 2 r /MSBL.TR measures the increase in sample size needed for a CRD to match the RCBD in efficiency. 13 / 14

RCBD model extensions n ij > 1 Factorial treatment structure (e.g., τ jk = β j + γ k + (βγ) jk ) is a straightforward extension of RCBD. Multiple blocking variables require a little more consideration A couple of these topics are explored further in STAT 706 14 / 14