Stat 705: Completely randomized and complete block designs

Similar documents
STAT 506: Randomized complete block designs

Analysis of Covariance

Chapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs

Chapter 1 Statistical Inference

Two-Sample Inferential Statistics

Experimental Design and Data Analysis for Biologists

Sleep data, two drugs Ch13.xls

Stat 579: Generalized Linear Models and Extensions

Nested Designs & Random Effects

8/04/2011. last lecture: correlation and regression next lecture: standard MR & hierarchical MR (MR = multiple regression)

STAT 705 Chapter 19: Two-way ANOVA

STAT 705 Chapter 19: Two-way ANOVA

STAT 525 Fall Final exam. Tuesday December 14, 2010

STAT 705 Chapters 22: Analysis of Covariance

Multiple comparisons The problem with the one-pair-at-a-time approach is its error rate.

Unit 12: Analysis of Single Factor Experiments

Mixed- Model Analysis of Variance. Sohad Murrar & Markus Brauer. University of Wisconsin- Madison. Target Word Count: Actual Word Count: 2755

Upon completion of this chapter, you should be able to:

Confidence Intervals, Testing and ANOVA Summary

PROBLEM TWO (ALKALOID CONCENTRATIONS IN TEA) 1. Statistical Design

RCB - Example. STA305 week 10 1

Written Exam (2 hours)

STAT 705 Chapter 16: One-way ANOVA

Stats fest Analysis of variance. Single factor ANOVA. Aims. Single factor ANOVA. Data

Harvard University. Rigorous Research in Engineering Education

VARIANCE COMPONENT ANALYSIS

, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1

Residual Analysis for two-way ANOVA The twoway model with K replicates, including interaction,

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Stat 6640 Solution to Midterm #2

Analysis of Variance (ANOVA)

Statistics For Economics & Business

What If There Are More Than. Two Factor Levels?

STAT 8200 Design of Experiments for Research Workers Lab 11 Due: Friday, Nov. 22, 2013

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Sections 3.4, 3.5. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)

Stat 217 Final Exam. Name: May 1, 2002

DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective

Chapter 12 - Lecture 2 Inferences about regression coefficient

Formal Statement of Simple Linear Regression Model

Chapter 12: Linear regression II

Time-Invariant Predictors in Longitudinal Models

VIII. ANCOVA. A. Introduction

Factorial designs. Experiments

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression

Cuckoo Birds. Analysis of Variance. Display of Cuckoo Bird Egg Lengths

Validation of Visual Statistical Inference, with Application to Linear Models

This gives us an upper and lower bound that capture our population mean.

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

The entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.

Blocks are formed by grouping EUs in what way? How are experimental units randomized to treatments?

Interactions and Factorial ANOVA

MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS FALL 1999

Interactions and Factorial ANOVA

STAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons:

Review of Statistics 101

Sampling and Sample Size. Shawn Cole Harvard Business School

STAT 705 Chapters 23 and 24: Two factors, unequal sample sizes; multi-factor ANOVA

Test Yourself! Methodological and Statistical Requirements for M.Sc. Early Childhood Research

Analysis of Variance and Co-variance. By Manza Ramesh

MATH Notebook 3 Spring 2018

Analysis of Variance

SAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model

DIFFERENT APPROACHES TO STATISTICAL INFERENCE: HYPOTHESIS TESTING VERSUS BAYESIAN ANALYSIS

STAT 512 MidTerm I (2/21/2013) Spring 2013 INSTRUCTIONS

(1) The explanatory or predictor variables may be qualitative. (We ll focus on examples where this is the case.)

Business Statistics. Lecture 9: Simple Regression

AP Final Review II Exploring Data (20% 30%)

Chapter 22. Comparing Two Proportions 1 /29

STAT 515 fa 2016 Lec Statistical inference - hypothesis testing

CHAPTER 4 & 5 Linear Regression with One Regressor. Kazu Matsuda IBEC PHBU 430 Econometrics

ANCOVA. Lecture 9 Andrew Ainsworth

22s:152 Applied Linear Regression. Take random samples from each of m populations.

Analysis of Variance

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

28. SIMPLE LINEAR REGRESSION III

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

Chapter 22. Comparing Two Proportions 1 /30

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Review for Final. Chapter 1 Type of studies: anecdotal, observational, experimental Random sampling

Lecture 10: F -Tests, ANOVA and R 2

Chapter 13: Analysis of variance for two-way classifications

L21: Chapter 12: Linear regression

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Module 6: Model Diagnostics

Open book and notes. 120 minutes. Covers Chapters 8 through 14 of Montgomery and Runger (fourth edition).

Contents. TAMS38 - Lecture 6 Factorial design, Latin Square Design. Lecturer: Zhenxia Liu. Factorial design 3. Complete three factor design 4

Chapter 4 Multi-factor Treatment Designs with Multiple Error Terms 93

Keppel, G. & Wickens, T.D. Design and Analysis Chapter 2: Sources of Variability and Sums of Squares

Chapter 3: Examining Relationships

Time-Invariant Predictors in Longitudinal Models

STAT 135 Lab 10 Two-Way ANOVA, Randomized Block Design and Friedman s Test

Swarthmore Honors Exam 2012: Statistics

Stat 601 The Design of Experiments

INTRODUCTION TO DESIGN AND ANALYSIS OF EXPERIMENTS

44.2. Two-Way Analysis of Variance. Introduction. Prerequisites. Learning Outcomes

Transcription:

Stat 705: Completely randomized and complete block designs Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 16

Experimental design Our department offers an entire course, STAT 706, on experimental design. In Stat 705 we will focus mainly on the analysis of common models: completely randomized designs, randomized complete block designs, ANCOVA, multifactor studies, hierarchical models (mixed-effects models), split-plots (e.g. longitudinal data analysis), Latin squares, and nested models. Some of the material in these notes is lifted from Ron Christensen s book Analysis of Variance, Design and Regression (Chapman and Hall, 1996). The rest of it is paraphrased from your textbook. Basic object of experimental design Obtain a valid estimate of variability σ 2 ; make the treatment inferences as sharp as feasibly possible by making the error variability σ 2 small. 2 / 16

Comments Smaller variance leads to sharper inference (e.g. tighter confidence intervals, more precise point estimates of mean treatment differences, etc.) The standard assumption of independent identically distributed error terms must be scrutinized and suitably approximated. Randomization in an experimental design helps us approximate this ideal situation. When treatments are randomly assigned to homogeneous experimental units, the only systematic differences between the units are the treatments themselves, which are included in the statistical model as simple treatment effects. Unlike observational studies, designed experiments allow us to (carefully) infer causation. We deliberately impose treatments on a homogeneous population and record some variable of interest. Since the population is homogeneous, we may infer that differences in the response are due solely to the treatments. 3 / 16

Components of an experimental design An experimental design includes: 1 Treatments 2 Subjects 3 A subject-specific response to be recorded 4 A rule to assign treatments to subjects or vise-versa 4 / 16

Completely randomized designs In a completely randomized design, the experimenter randomly assigns treatments to experimental units in pre-specified numbers (often the same number of units receives each treatment yielding a balanced design). Every experimental unit initially has an equal chance of receiving a particular treatment. The data collected is typically analyzed via a one-way (or multi-way) ANOVA model. 5 / 16

Quantifying noise pollution on mood Example: Denise has a pool of n T = 8 subjects to participate in an experiment. The 8 people are randomly drawn from a psychology class she is teaching and asked to participate, so inferences may drawn for the population of psychology students in her 3 PM class. Denise subjects 4 of the students, chosen uniformly from the initial 8 via a random number generator, to a tape of dogs loudly barking for a period of 30 minutes. The other 4 students listen to silence for 30 minutes. At the end of the experiment, Denise records the mood of participants on a scale from 1 (really bad, angry) to 10 (happy, content). 6 / 16

Denise, continued Denise uses the one-way ANOVA model Y ij = µ i + ɛ ij where i = 1, 2 for the treatment effects (dogs barking and silence, respectively) and j = 1,..., 5 for the replications, to model subject mood after being exposed to the treaments. She, not surprisingly, finds a 95% confidence interval for µ 2 µ 1 to be (2.2, 4.5). On average, those psychology students subjected to silence were 2.2 to 4.5 mood points higher than those subjected to dogs barking. 7 / 16

Randomized complete block designs Say there are b treatments to be considered. In a randomized complete block design, the experimenter constructs a blocks of b homogeneous subjects and (uniformly) randomly allocates the b treatments to everyone in each block. The treatments are assumed to act independent of the blocks, and the overall error variability σ 2 is reduced as some variability will be explained by the block differences. Initially we consider fixed block effects, but will explore random block effects shortly. A simple randomized complete block design is analyzed as a two-way ANOVA without replication. A valid estimate of σ 2 is obtained through blocking and assuming an additive model. 8 / 16

Aspects of a R.C.B. design The key to designing a good R.C.B. design is to pick blocks so that there is little within block variability. If all treatments cannot be administered in a block, we get an incomplete block design. Blocking variables are categorized into two types by your book, those variables that are characteristics of the experimental units (gender, age, income, etc.), and those variables that are associated with the experiment (observer, machine used, etc.) 9 / 16

Noise pollution, continued Denise refines her experiment by considering blocks defined by the number of dogs owned by students: (i = 1, 2, 3, 4 for no dogs, 1 dog, 2 dogs, 3 or more dogs); among her n T = 8 participants she now requires two from each of the a = 4 blocking categories. For each of the a = 4 blocks of b = 2 subjects she makes one endure the barking-tape and the other one gets silence. She proposes the following simple model for the mood scores: Y ij = µ + α i + β j + ɛ ij, where j = 1, 2 denotes treatment. 10 / 16

Denise, continued Denise finds the confidence interval for µ 2 µ 1 = β 2 β 1 now to be (2.8, 3.4). Again, she concludes there is a significant mean mood difference in subjects exposed to dogs barking versus not, and this difference is more precise than before, as the difference is examined within blocks. The p-value associated with the blocking variable number of dogs is 0.03 indicating that blocking made a significant difference in the analysis (i.e. we would reject that the blocking effect is null). Notice that we are assuming that there is no interaction between being exposed to dogs barking or not and number of dogs owned. Is this assumption reasonable? Whether you think so or not, it can be tested using Tukey s test for additivity. 11 / 16

Textbook s notation Note that in Section 21.2 your book rather uses the parameterization Y ij = µ + ρ i }{{} block effects + τ j }{{} treatment effects +ɛ ij. Using this notation instead, we are interested in testing H 0 : τ j = 0 and, if we reject, examining linear combinations of treatments L(c) = b j=1 c jτ j. The test for whether blocking effectively reduces variability H 0 : ρ i = 0 is also of (lesser) interest. Your textbook redoes everything in Section 21.3 in terms of the new parameterization, but nothing has really changed from our standard two-way model and approach except for naming one factor treatment and the other one blocks. 12 / 16

Another model... What does this analysis have in common with an ANCOVA (coming up in Chapter 22) model Y ij = µ + γx ij + τ j + ɛ ij, where x ij is the actual number of dogs (owned by the ith subject receiving treatment j) as a concomitant variable? Which model is simpler (requires fewer parameters?) Which is to be preferred? 13 / 16

Diagnostics for R.C.B. designs 1 Residuals e ij versus the block index i may be used to check for unequal error variance within blocks. Residuals e ij vs. the treatment index j may be used to check for unequal error variance by treatment. Also look at e ij vs. ˆµ ij (automatic in SAS), normal probability plot of {e ij }, and the deleted residuals t ij to check for outliers. 2 An interaction plot Y ij versus j (or i) may be used to check for a possible interaction between the blocking variate and the treatments. NOTE: No replication of treatments within blocks means there is only one observation to estimate ˆµ ij, ˆµ ij = Y ij. Therefore, when there truly is no interaction present, we expect to see the deviation from parallel curves to be much greater than when we have replication, unless σ 2 is very small relative to treatment/blocking effects. Tukey s test for additivity is a formal way to check the appropriateness of model IV with a p-value. 14 / 16

Generalization This simple model can be extended to multi-factor treatment structures (21.6 and 21.8), although by definition, in a R.C.B. design, there is no interaction between blocks and treatments, and the replication is achieved only through blocking. A generalized randomized block design (Sec. 21.7) assigns n subjects within each block instead of only one, yielding replication. In this model, an interaction between treatments and blocks can be tested as usual, and in fact is given automatically as a Type III test in SAS. Standard methods apply! 15 / 16

Noise pollution, finished Denise decides that she would like to test the effect of leaf-blowers as well. Now she selects n T = 16 students to participate and subjects each block of a = 4 students sorted according to how many dogs they own (i = 1, 2, 3, 4 as before) to one of: j = 1, k = 1 nothing, j = 2, k = 1 dogs barking, j = 1, k = 2 a running leaf-blower nearby, or j = 2, k = 2 both a tape of dogs barking and the leaf-blower. She used the following model: Y ijk = µ + ρ i + α j + β k + (αβ) jk + ɛ ijk. The model is fit as a three-way ANOVA and interpreted as usual. If there is no interaction between treatments and blocks (again, testable via Tukey), the model is valid and contrasts in the effects of interest, for example µ 2k µ 1k for k = 1, 2, are examined. If (αβ) jk = 0 is accepted, simply µ 2 µ 1 = α 2 α 1, may be examined. This is a R.C.B. design with factorial treatments. 16 / 16