STAT22200 Chapter 14

Size: px
Start display at page:

Download "STAT22200 Chapter 14"

Transcription

1 STAT00 Chapter 4 Yibi Huang Chapter 4 Incomplete Block Designs 4. Balanced Incomplete Block Designs (BIBD) Chapter 4 -

2 Incomplete Block Designs A Brief Introduction to a Class of Most Useful Designs in Practice Recall that for a randomized complete block design (RCBD) of g treatments, the size k of each block has to be g (or multiples of g). Under complete block design, each treatment occurs once and only once in each block, so that we can randomize and run all treatment combinations in each block. However, in practice, it often happens that the size of available blocks is smaller than the numbers of treatments (k < g). We cannot apply every treatment to every block. We then have Incomplete Block Design (IBD). IBD makes analysis harder, but sometimes cannot be avoided. The next best thing to a randomized complete block design (RCBD) is a Balanced Incomplete Block Design (BIBD). Chapter 4 -

3 An Example of Unavoidable Incomplete Blocks Eye irritation can be reduced with eyedrops. Three brands of eyedrops are to be compared for their ability to reduce eye irritation. As there is a strong individual effect, subjects should be used as blocks. If each subject can only be used during one treatment period, then the researchers must use one brand of drop in the left eye and another brand in the right eye. The design is restricted into blocks of size two. The study is force into incomplete blocks, with k = < 3 = g (block size) < (number of treatments) Chapter 4-3

4 Balanced Incomplete Block Designs (BIBD) BIBD not balanced in the general sense that all treatment-block combinations occur equally often. Rather they are balanced in the looser sense by the criteria described below. A balanced incomplete block design with g treatments, b blocks, k as the size of each block, r replications of each treatment, is a design satisfying the following: Incomplete: k < g. Balanced: Each treatment appears at most once per block. Each pair of treatments appear in a block the same number of times λ E.g., if treatments A and C appear in the same block 5 times, then treatments B and C must show up together in exactly 5 blocks. Chapter 4-4

5 The following design table is a BIBD. Block size k = 3, there are g = 5 treatments and b = 0 blocks. A B E A C D B E D C B D A C A E C B E D C A B D E A D C B E First Balancing Condition of BIBD If there are b blocks of size k each, then the total number of experimental units is N = bk. If there are g treatments, and each appears r times, then the total number of experimental units is N = rg. Therefore in a BIBD, we must have N = bk = rg. Chapter 4-5

6 Second Balancing Condition of BIBD Suppose each pair of treatments appears in the same block λ times. The second balancing condition r(k ) = λ(g ) comes from two different ways to count the total number of pairs including treatment i 0 in the experiment: g possible pairs, each appears in λ blocks, so λ(g ); treatment i 0 appears in r blocks. Within each block, there are k pairs including i 0, so r(k ) Remarks. In a BIBD, λ = r(k )/(g ) must be an integer. In the eyedrop example: g = 3, k =, r =. λ = r(k ) g = ( ) 3 = is a whole number, and we need b = rg/k = 3 blocks. Chapter 4-6

7 Example A Marketing Psychology Experiment Goal: to compare the effects of 5 advertisements, g = 5 Response: recall after being shown the ad. A subject is a block Block size: Subjects lose patience after being shown 3 ads, k = 3 What is the smallest feasible BIBD? b =? r =? r(k ) By N = rg = kb, and that is a whole number, we need g to make r = 3b 5 and r 4 both being whole numbers So r must be a multiple of and 3, and b be a multiple of and 5.Thus the smallest BIBD is with b = 0, and r = 6. The number of subjects (blocks) must be multiple of 0 (0, 0, 30,... ). Chapter 4-7

8 The following design table is a BIBD with g = 5, k = 3, b = 0, r = 6, λ = 3 (Oehlert Appendix C., p ) A B E A C D B E D C B D A C A E C B E D C A B D E A D C B E In the marketing psychology experiment, if we have b = 0 subjects (blocks), then we can do repetitions of the above design. One obvious randomization is to randomize subjects to columns, then randomize the order of treatments in each block based on the above design. Chapter 4-8

9 Still a Balanced design The following table shows that, in the marketing psychology example, in the 0 blocks, each treatment appears 6 times. Block Treatment A B C D E If we have b = 0 subjects (blocks), we can do repetitions of the above design, and each treatment appears r = 6 = times. Chapter 4-9

10 The following table shows that, in the 0 blocks, each treatment pair appear λ = 3 times. Block Treatment-pair AB AC AD AE BC BD BE CD CE DE If we have b = 0 subjects (blocks), we can do repetitions of the above design, and each treatment pair appears λ = 3 = 6 times. Chapter 4-0

11 Models for BIBD The model for BIBD looks familiar: y ij = µ + α i + β j + ε ij (treatment) (block) (i.i.d. N(0, σ )) for i =,..., g, and j =,..., b with g i= α i = b j= β j = 0. additive model (no trt-block interaction) Not all y ij exist because of incompleteness Nonorthogonality between treatments and blocks, just like unbalanced factorial designs. Thus, Type I Sum of squares will change with the order of terms in the model. Chapter 4 -

12 Parameter Estimates for BIBD Let I ij = {, if treatment i appears in block j, 0, otherwise. and define Q i = y i I ij y j, k j Q j = y j r I ij y i i the least square estimates for µ, α i, β j are µ = y N, α i = kq i λg, βj = kq j λb Remark: Can verify that i Q i = 0 i α i = 0 Chapter 4 -

13 ANOVA for BIBD (Type I Sum of Squares!) Source d.f. SS MS F -value Block b SS block MS block (MS block /MSE) Treatment g SS Trt MS Trt MS trt /MSE Error N g b+ SSE MSE Total N SS total {, if treatment i appears in block j, Let I ij = 0, otherwise. Then SS total = g i= i b j= I ij(y ij y ) SS block = k b j y ) j= (unadjusted) SS Trt = k Qi = λg α i λg k (adjusted for block) SSE = SS total SS block SS trt i Chapter 4-3

14 Pairwise Comparisons Estimate of α i α i is SE( α i α i ) = α i α i = k λg (Q i Q i ) MSE ( ) k λg t-statistic = α i α i SE with df = df of MSE Chapter 4-4

15 Example of BIBD Problem 4.3 on. p The State Board of Education has adopted basic skills tests for high school graduation. One of these is a writing test. The student writing samples are graded by professional graders, and the board is taking some care to be sure that the graders are grading to the same standard. We examine grader differences with the following experiment. There are 5 graders available. We select 30 writing samples at random, each writing sample will be graded by 5 graders. Thus each grader will grade 30 5/5 = 6 samples. Data file: Questions of Interest Do the 5 graders grade consistently? If graders don t grade consistently, how should we adjust the grades? If graders don t grade consistently, can we identify which graders fail to follow the grading standard? Chapter 4-5

16 38 Incomplete Block Designs Exam Grader Score Exam Grader Score Problem 4.4 Analyze these data to determine if graders differ, and if so, how. Be sure to describe the design. Here a exam is a writing sample. Which factor is the treatment factor? Graders or writing samples? Thirty consumers are asked to rate the softness of clothes washed by ten different detergents, but each consumer rates only four different detergents. The design and responses are given below: Which factor is the block factor? Graders or writing samples? Is this a BIBD? Trts Softness Trts Softness A B C D A B C D A B E F A B E F A C G H A C G H A D I J A D I J Chapter 4-6

17 Y ij = µ + α i + β j + ε ij (grader) (exam) As writing samples differ in levels, we expect β j not all equal. If the graders grade consistently, they should give the same score to a writing sample, i.e., α = α = = α 5 > pr4.3 = read.table( " h=t) > EXAM = as.factor(pr4.3$exam) > GRADER = as.factor(pr4.3$grader) > anova(lm(score ~ GRADER + EXAM, data=pr4.3)) Df Sum Sq Mean Sq F value Pr(>F) GRADER <.e-6 *** EXAM <.e-6 *** Residuals > anova(lm(score ~ EXAM + GRADER, data=pr4.3)) Df Sum Sq Mean Sq F value Pr(>F) EXAM <.e-6 *** GRADER e-08 *** Residuals Why are the two ANOVA tables different? Which one should we look at? Chapter 4-7

18 Model diagnostic for y ij = µ + α i + β j + ε ij λ log Likelihood 95% Fitted value Residual Exam Factor Residual Grader Factor Residual Chapter 4-8

19 As the 5 graders grade inconsistently, how to adjust the scores? Based on the model y ij = µ + α i + β j + ε ij, the score of the ith writing sample should be µ + β j, and is estimated by µ + β j. How to get β j in R? Recall that by default, R reports estimates of parameters using the baseline constraints α = β = 0, not the sum-to-zero constraints g i= α i = b j= β j = 0. One can use constrasts() and contr.sum() to force R using the the sum-to-zero constraints. > contrasts(exam) = contr.sum(30) > contrasts(grader) = contr.sum(5) > lm = lm(score ~ EXAM + GRADER, data=pr4.3); lm$coef (Intercept) EXAM EXAM... (omitted) EXAM8 EXAM9 GRADER... (omotted) GRADER Why there are neither estimates for exam30, nor for grader 5? Chapter 4-9

20 > muhat = lm$coef[] > betahat = vector("numeric",length=30) > betahat[:9] = lm$coef[:30] > betahat[30] = -sum(betahat[:9]) > adjustedscore = muhat + betahat; adjustedscore [] [9] [7] [5] > adjustedscore = array(adjustedscore,dim=c(,30)) > adjustedscore = data.frame(adjustedscore) > colnames(adjustedscore) = :30 > adjustedscore Chapter 4-0

21 How to identify inconsistent graders? We can do pairwise comparisons for the grader effects α i α i using the t-statistic = α i α i where SE ( ) ( ) k 5 SE = MSE = λg 5 with df = df of MSE = 96. The t-statistic needs to be at least t 0.05, to be significant at 5% level. That is, α i α i needs to be at least t 0.05,96 SE to be significant at 5% level. Chapter 4 -

22 How to identify inconsistent graders? We don t need to use the formula for estimates of α i s because we can obtain them in R α i α i = k λg (Q i Q i ) > lm$coef[3:54] GRADER GRADER GRADER3 GRADER4 GRADER5 GRADER6 GRADER7 GRADER GRADER9 GRADER0 GRADER GRADER GRADER3 GRADER4 GRADER5 GRADER GRADER7 GRADER8 GRADER9 GRADER0 GRADER GRADER GRADER3 GRADER > sort(lm$coef[3:54]) GRADER3 GRADER5 GRADER6 GRADER6 GRADER5 GRADER4 GRADER8 GRADER GRADER9 GRADER GRADER9 GRADER3 GRADER4 GRADER8 GRADER0 GRADER GRADER7 GRADER GRADER GRADER7 GRADER0 GRADER GRADER GRADER We can see that Grader, 3, 4, and 5 have their α i s differ from most of other α i s by more than They appeared to be inconsistent with other graders. Chapter 4 -

23 exam score Grader 3 always gave the lowest score among the 5 graders grading the same exam Grader 4 always gave the highest score, substantially higher than the scores given by the other graders grading the same exam. Grader tend to give higher scores, Grader 5 tended to give lower score, but not as much as Grader 3 and 4. Chapter 4-3

24 Problem 4. Incomplete Block Design but Not BIBD Milk can be strained through filter disks to remove dirt and debris. Filters are made by surface-bonding fiber webs to both sides of a disk. Goal of the experiment: to know how the construction of the filter affects the speed of milk flow through the filter Treatments: We have a 4 factorial structure for the filters. The 4 factors are fiber weight (normal or heavy), loft (thickness of the filter, normal or low), bonding solution on bottom surface (A or B), and bonding solution on top surface (A or B). Treatments through 6 are the factor-level combinations in standard order. Chapter 4-4

25 Response: filtration time when pouring a measured amount of milk experiment are given below (data from Connor 958): through the disk considerable variation from farm to farm, so block on farm. 6 farms were selected considerable variation from milking to milking, so only one milking per farm. Only 3 filters can be used at a single milking. So blocks size 3 Is this a BIBD? Actually, there is a second blocking factor order Sixteen farms were selected. At each farm there will be three strain at one milking, with the milk strained first with one filter, then a second, a third. Each treatment will be used three times in the design: once as a filter, once as second, and once as third. The treatments and responses for Treatments and Responses Filtration time Farm First Second Third What type of design is this? Analyze the data and report your findings on influence of the treatment factors on straining time. The Chapter State Board 4 - of 5Education has adopted basic skills tests for h

26 pr4. = read.table( " pr4.$wt = factor(pr4.$weight, labels=c("normal", "heavy")) pr4.$lt = factor(pr4.$loft, labels=c("normal","low")) pr4.$bn = factor(pr4.$bottom, labels=c("a","b")) pr4.$tp = factor(pr4.$top, labels=c("a","b")) pr4.$ord = as.factor(pr4.$order, labels=c("st","nd", "3rd")) pr4.$fm = as.factor(pr4.$farm) Chapter 4-6

27 Model diagnostic for log Likelihood % Y ijk = µ + α i + β j + γ k +ε ij (trt) (farm) (order) 0 λ Normal Q Q Plot Residuals Fitted Values Sample Quantiles Theoretical Quantiles Chapter 4-7

28 After removing observation # 39, log Likelihood After Removing Observation #39 95% 0 λ Sample Quantiles 0 Normal Q Q Plot 0 Theoretical Quantiles Residuals Fitted Values Chapter 4-8

29 > anova(lm) Analysis of Variance Table Df Sum Sq Mean Sq F value Pr(>F) fm e-05 *** ord wt e-07 *** lt bn tp *** wt:lt wt:bn *** lt:bn wt:tp lt:tp ** bn:tp wt:lt:bn wt:lt:tp wt:bn:tp lt:bn:tp wt:lt:bn:tp Residuals Chapter 4-9

30 > library(car) > Anova(lm, type=) Anova Table (Type II tests) Sum Sq Df F value Pr(>F) fm *** ord wt e-07 *** lt bn tp *** wt:lt wt:bn *** lt:bn wt:tp lt:tp ** bn:tp wt:lt:bn wt:lt:tp wt:bn:tp lt:bn:tp wt:lt:bn:tp Residuals Chapter 4-30

31 > lma = lm(time ~ fm + ord + wt*lt*bn*tp, data=pr4.[-39,]) > anova(lma) Df Sum Sq Mean Sq F value Pr(>F) fm e-07 *** ord * wt e-09 *** lt * bn ** tp e-06 *** wt:lt wt:bn e-07 *** lt:bn wt:tp lt:tp ** bn:tp wt:lt:bn ** wt:lt:tp ** wt:bn:tp lt:bn:tp wt:lt:bn:tp Residuals Chapter 4-3

32 > library(car) > Anova(lma, type=) Anova Table (Type II tests) Response: time Sum Sq Df F value Pr(>F) fm e-07 *** ord wt e-09 *** lt * bn ** tp e-06 *** wt:lt wt:bn e-06 *** lt:bn wt:tp lt:tp ** bn:tp wt:lt:bn ** wt:lt:tp ** wt:bn:tp lt:bn:tp wt:lt:bn:tp Residuals Chapter 4-3

33 mean of time wt heavy normal farm mean of time lt low normal farm Chapter 4-33

34 mean of time tp B A farm mean of time bn A B farm Chapter 4-34

35 > lm3a = lm(time ~ fm + ord + wt*lt*(bn+tp), data=pr4.[-39,]) > Anova(lm3a, type=) Anova Table (Type II tests) Sum Sq Df F value Pr(>F) fm e-08 *** ord wt e-0 *** lt * bn ** tp e-06 *** wt:lt wt:bn e-07 *** wt:tp lt:bn lt:tp * wt:lt:bn ** wt:lt:tp ** Residuals > anova(lm3a,lma) Analysis of Variance Table Model : time ~ fm + ord + wt * lt * (bn + tp) Model : time ~ fm + ord + wt * lt * bn * tp Res.Df RSS Df Sum of Sq F Pr(>F) Chapter 4-35

STAT22200 Spring 2014 Chapter 14

STAT22200 Spring 2014 Chapter 14 STAT22200 Spring 2014 Chapter 14 Yibi Huang May 27, 2014 Chapter 14 Incomplete Block Designs 14.1 Balanced Incomplete Block Designs (BIBD) Chapter 14-1 Incomplete Block Designs A Brief Introduction to

More information

Two-Way Factorial Designs

Two-Way Factorial Designs 81-86 Two-Way Factorial Designs Yibi Huang 81-86 Two-Way Factorial Designs Chapter 8A - 1 Problem 81 Sprouting Barley (p166 in Oehlert) Brewer s malt is produced from germinating barley, so brewers like

More information

STAT22200 Spring 2014 Chapter 8A

STAT22200 Spring 2014 Chapter 8A STAT22200 Spring 2014 Chapter 8A Yibi Huang May 13, 2014 81-86 Two-Way Factorial Designs Chapter 8A - 1 Problem 81 Sprouting Barley (p166 in Oehlert) Brewer s malt is produced from germinating barley,

More information

Power & Sample Size Calculation

Power & Sample Size Calculation Chapter 7 Power & Sample Size Calculation Yibi Huang Chapter 7 Section 10.3 Power & Sample Size Calculation for CRDs Power & Sample Size for Factorial Designs Chapter 7-1 Power & Sample Size Calculation

More information

Explanatory Variables Must be Linear Independent...

Explanatory Variables Must be Linear Independent... Explanatory Variables Must be Linear Independent... Recall the multiple linear regression model Y j = β 0 + β 1 X 1j + β 2 X 2j + + β p X pj + ε j, i = 1,, n. is a shorthand for n linear relationships

More information

Incomplete Block Designs

Incomplete Block Designs Incomplete Block Designs Recall: in randomized complete block design, each of a treatments was used once within each of b blocks. In some situations, it will not be possible to use each of a treatments

More information

Suppose we needed four batches of formaldehyde, and coulddoonly4runsperbatch. Thisisthena2 4 factorial in 2 2 blocks.

Suppose we needed four batches of formaldehyde, and coulddoonly4runsperbatch. Thisisthena2 4 factorial in 2 2 blocks. 58 2. 2 factorials in 2 blocks Suppose we needed four batches of formaldehyde, and coulddoonly4runsperbatch. Thisisthena2 4 factorial in 2 2 blocks. Some more algebra: If two effects are confounded with

More information

STAT 430 (Fall 2017): Tutorial 8

STAT 430 (Fall 2017): Tutorial 8 STAT 430 (Fall 2017): Tutorial 8 Balanced Incomplete Block Design Luyao Lin November 7th/9th, 2017 Department Statistics and Actuarial Science, Simon Fraser University Block Design Complete Random Complete

More information

STAT22200 Spring 2014 Chapter 13B

STAT22200 Spring 2014 Chapter 13B STAT22200 Spring 2014 Chapter 13B Yibi Huang May 27, 2014 13.3.1 Crossover Designs 13.3.4 Replicated Latin Square Designs 13.4 Graeco-Latin Squares Chapter 13B - 1 13.3.1 Crossover Design (A Special Latin-Square

More information

Example: Poisondata. 22s:152 Applied Linear Regression. Chapter 8: ANOVA

Example: Poisondata. 22s:152 Applied Linear Regression. Chapter 8: ANOVA s:5 Applied Linear Regression Chapter 8: ANOVA Two-way ANOVA Used to compare populations means when the populations are classified by two factors (or categorical variables) For example sex and occupation

More information

Multiple Predictor Variables: ANOVA

Multiple Predictor Variables: ANOVA Multiple Predictor Variables: ANOVA 1/32 Linear Models with Many Predictors Multiple regression has many predictors BUT - so did 1-way ANOVA if treatments had 2 levels What if there are multiple treatment

More information

23. Fractional factorials - introduction

23. Fractional factorials - introduction 173 3. Fractional factorials - introduction Consider a 5 factorial. Even without replicates, there are 5 = 3 obs ns required to estimate the effects - 5 main effects, 10 two factor interactions, 10 three

More information

Multiple Predictor Variables: ANOVA

Multiple Predictor Variables: ANOVA What if you manipulate two factors? Multiple Predictor Variables: ANOVA Block 1 Block 2 Block 3 Block 4 A B C D B C D A C D A B D A B C Randomized Controlled Blocked Design: Design where each treatment

More information

19. Blocking & confounding

19. Blocking & confounding 146 19. Blocking & confounding Importance of blocking to control nuisance factors - day of week, batch of raw material, etc. Complete Blocks. This is the easy case. Suppose we run a 2 2 factorial experiment,

More information

Lecture 15 Topic 11: Unbalanced Designs (missing data)

Lecture 15 Topic 11: Unbalanced Designs (missing data) Lecture 15 Topic 11: Unbalanced Designs (missing data) In the real world, things fall apart: plants are destroyed/trampled/eaten animals get sick volunteers quit assistants are sloppy accidents happen

More information

Chapter 11: Factorial Designs

Chapter 11: Factorial Designs Chapter : Factorial Designs. Two factor factorial designs ( levels factors ) This situation is similar to the randomized block design from the previous chapter. However, in addition to the effects within

More information

Written Exam (2 hours)

Written Exam (2 hours) M. Müller Applied Analysis of Variance and Experimental Design Summer 2015 Written Exam (2 hours) General remarks: Open book exam. Switch off your mobile phone! Do not stay too long on a part where you

More information

1 Use of indicator random variables. (Chapter 8)

1 Use of indicator random variables. (Chapter 8) 1 Use of indicator random variables. (Chapter 8) let I(A) = 1 if the event A occurs, and I(A) = 0 otherwise. I(A) is referred to as the indicator of the event A. The notation I A is often used. 1 2 Fitting

More information

Lec 1: An Introduction to ANOVA

Lec 1: An Introduction to ANOVA Ying Li Stockholm University October 31, 2011 Three end-aisle displays Which is the best? Design of the Experiment Identify the stores of the similar size and type. The displays are randomly assigned to

More information

Stat 412/512 TWO WAY ANOVA. Charlotte Wickham. stat512.cwick.co.nz. Feb

Stat 412/512 TWO WAY ANOVA. Charlotte Wickham. stat512.cwick.co.nz. Feb Stat 42/52 TWO WAY ANOVA Feb 6 25 Charlotte Wickham stat52.cwick.co.nz Roadmap DONE: Understand what a multiple regression model is. Know how to do inference on single and multiple parameters. Some extra

More information

Incomplete Block Designs. Lukas Meier, Seminar für Statistik

Incomplete Block Designs. Lukas Meier, Seminar für Statistik Incomplete Block Designs Lukas Meier, Seminar für Statistik Incomplete Block Designs Up to now we only considered complete block designs. This means we would see all treatments in each block. In some situations

More information

Unit 8: 2 k Factorial Designs, Single or Unequal Replications in Factorial Designs, and Incomplete Block Designs

Unit 8: 2 k Factorial Designs, Single or Unequal Replications in Factorial Designs, and Incomplete Block Designs Unit 8: 2 k Factorial Designs, Single or Unequal Replications in Factorial Designs, and Incomplete Block Designs STA 643: Advanced Experimental Design Derek S. Young 1 Learning Objectives Revisit your

More information

Stat 217 Final Exam. Name: May 1, 2002

Stat 217 Final Exam. Name: May 1, 2002 Stat 217 Final Exam Name: May 1, 2002 Problem 1. Three brands of batteries are under study. It is suspected that the lives (in weeks) of the three brands are different. Five batteries of each brand are

More information

SCHOOL OF MATHEMATICS AND STATISTICS

SCHOOL OF MATHEMATICS AND STATISTICS RESTRICTED OPEN BOOK EXAMINATION (Not to be removed from the examination hall) Data provided: Statistics Tables by H.R. Neave MAS5052 SCHOOL OF MATHEMATICS AND STATISTICS Basic Statistics Spring Semester

More information

Randomized Block Designs with Replicates

Randomized Block Designs with Replicates LMM 021 Randomized Block ANOVA with Replicates 1 ORIGIN := 0 Randomized Block Designs with Replicates prepared by Wm Stein Randomized Block Designs with Replicates extends the use of one or more random

More information

Statistics GIDP Ph.D. Qualifying Exam Methodology

Statistics GIDP Ph.D. Qualifying Exam Methodology Statistics GIDP Ph.D. Qualifying Exam Methodology January 9, 2018, 9:00am 1:00pm Instructions: Put your ID (not your name) on each sheet. Complete exactly 5 of 6 problems; turn in only those sheets you

More information

Section 4.6 Simple Linear Regression

Section 4.6 Simple Linear Regression Section 4.6 Simple Linear Regression Objectives ˆ Basic philosophy of SLR and the regression assumptions ˆ Point & interval estimation of the model parameters, and how to make predictions ˆ Point and interval

More information

Unbalanced Data in Factorials Types I, II, III SS Part 1

Unbalanced Data in Factorials Types I, II, III SS Part 1 Unbalanced Data in Factorials Types I, II, III SS Part 1 Chapter 10 in Oehlert STAT:5201 Week 9 - Lecture 2 1 / 14 When we perform an ANOVA, we try to quantify the amount of variability in the data accounted

More information

Recall that a measure of fit is the sum of squared residuals: where. The F-test statistic may be written as:

Recall that a measure of fit is the sum of squared residuals: where. The F-test statistic may be written as: 1 Joint hypotheses The null and alternative hypotheses can usually be interpreted as a restricted model ( ) and an model ( ). In our example: Note that if the model fits significantly better than the restricted

More information

Stat 5303 (Oehlert): Balanced Incomplete Block Designs 1

Stat 5303 (Oehlert): Balanced Incomplete Block Designs 1 Stat 5303 (Oehlert): Balanced Incomplete Block Designs 1 > library(stat5303libs);library(cfcdae);library(lme4) > weardata

More information

Factorial and Unbalanced Analysis of Variance

Factorial and Unbalanced Analysis of Variance Factorial and Unbalanced Analysis of Variance Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 04-Jan-2017 Nathaniel E. Helwig (U of Minnesota)

More information

22s:152 Applied Linear Regression. Take random samples from each of m populations.

22s:152 Applied Linear Regression. Take random samples from each of m populations. 22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each

More information

Blocks are formed by grouping EUs in what way? How are experimental units randomized to treatments?

Blocks are formed by grouping EUs in what way? How are experimental units randomized to treatments? VI. Incomplete Block Designs A. Introduction What is the purpose of block designs? Blocks are formed by grouping EUs in what way? How are experimental units randomized to treatments? 550 What if we have

More information

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA 22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each

More information

Homework 2: Simple Linear Regression

Homework 2: Simple Linear Regression STAT 4385 Applied Regression Analysis Homework : Simple Linear Regression (Simple Linear Regression) Thirty (n = 30) College graduates who have recently entered the job market. For each student, the CGPA

More information

Chapter 4: Randomized Blocks and Latin Squares

Chapter 4: Randomized Blocks and Latin Squares Chapter 4: Randomized Blocks and Latin Squares 1 Design of Engineering Experiments The Blocking Principle Blocking and nuisance factors The randomized complete block design or the RCBD Extension of the

More information

MODELS WITHOUT AN INTERCEPT

MODELS WITHOUT AN INTERCEPT Consider the balanced two factor design MODELS WITHOUT AN INTERCEPT Factor A 3 levels, indexed j 0, 1, 2; Factor B 5 levels, indexed l 0, 1, 2, 3, 4; n jl 4 replicate observations for each factor level

More information

CAS MA575 Linear Models

CAS MA575 Linear Models CAS MA575 Linear Models Boston University, Fall 2013 Midterm Exam (Correction) Instructor: Cedric Ginestet Date: 22 Oct 2013. Maximal Score: 200pts. Please Note: You will only be graded on work and answers

More information

Factorial designs. Experiments

Factorial designs. Experiments Chapter 5: Factorial designs Petter Mostad mostad@chalmers.se Experiments Actively making changes and observing the result, to find causal relationships. Many types of experimental plans Measuring response

More information

Stat 401B Final Exam Fall 2015

Stat 401B Final Exam Fall 2015 Stat 401B Final Exam Fall 015 I have neither given nor received unauthorized assistance on this exam. Name Signed Date Name Printed ATTENTION! Incorrect numerical answers unaccompanied by supporting reasoning

More information

Analysis of variance. Gilles Guillot. September 30, Gilles Guillot September 30, / 29

Analysis of variance. Gilles Guillot. September 30, Gilles Guillot September 30, / 29 Analysis of variance Gilles Guillot gigu@dtu.dk September 30, 2013 Gilles Guillot (gigu@dtu.dk) September 30, 2013 1 / 29 1 Introductory example 2 One-way ANOVA 3 Two-way ANOVA 4 Two-way ANOVA with interactions

More information

STAT 525 Fall Final exam. Tuesday December 14, 2010

STAT 525 Fall Final exam. Tuesday December 14, 2010 STAT 525 Fall 2010 Final exam Tuesday December 14, 2010 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will

More information

Stat 6640 Solution to Midterm #2

Stat 6640 Solution to Midterm #2 Stat 6640 Solution to Midterm #2 1. A study was conducted to examine how three statistical software packages used in a statistical course affect the statistical competence a student achieves. At the end

More information

Analysis of Variance

Analysis of Variance Analysis of Variance Math 36b May 7, 2009 Contents 2 ANOVA: Analysis of Variance 16 2.1 Basic ANOVA........................... 16 2.1.1 the model......................... 17 2.1.2 treatment sum of squares.................

More information

STAT 506: Randomized complete block designs

STAT 506: Randomized complete block designs STAT 506: Randomized complete block designs Timothy Hanson Department of Statistics, University of South Carolina STAT 506: Introduction to Experimental Design 1 / 10 Randomized complete block designs

More information

unadjusted model for baseline cholesterol 22:31 Monday, April 19,

unadjusted model for baseline cholesterol 22:31 Monday, April 19, unadjusted model for baseline cholesterol 22:31 Monday, April 19, 2004 1 Class Level Information Class Levels Values TRETGRP 3 3 4 5 SEX 2 0 1 Number of observations 916 unadjusted model for baseline cholesterol

More information

MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS FALL 1999

MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS FALL 1999 MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS 350 - FALL 1999 Instructor: A. Oyet Date: December 16, 1999 Name(Surname First): Student Number INSTRUCTIONS

More information

Randomized Complete Block Designs Incomplete Block Designs. Block Designs. 1 Randomized Complete Block Designs. 2 Incomplete Block Designs

Randomized Complete Block Designs Incomplete Block Designs. Block Designs. 1 Randomized Complete Block Designs. 2 Incomplete Block Designs Block Designs Randomized Complete Block Designs 1 Randomized Complete Block Designs 2 0 / 18 1 Randomized Complete Block Designs 2 1 / 18 Randomized Complete Block Design RCBD is the most widely used experimental

More information

Nested Designs & Random Effects

Nested Designs & Random Effects Nested Designs & Random Effects Timothy Hanson Department of Statistics, University of South Carolina Stat 506: Introduction to Design of Experiments 1 / 17 Bottling plant production A production engineer

More information

Stat 5102 Final Exam May 14, 2015

Stat 5102 Final Exam May 14, 2015 Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions

More information

ST430 Exam 2 Solutions

ST430 Exam 2 Solutions ST430 Exam 2 Solutions Date: November 9, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textbook are permitted but you may use a calculator. Giving

More information

More about Single Factor Experiments

More about Single Factor Experiments More about Single Factor Experiments 1 2 3 0 / 23 1 2 3 1 / 23 Parameter estimation Effect Model (1): Y ij = µ + A i + ɛ ij, Ji A i = 0 Estimation: µ + A i = y i. ˆµ = y..  i = y i. y.. Effect Modell

More information

Allow the investigation of the effects of a number of variables on some response

Allow the investigation of the effects of a number of variables on some response Lecture 12 Topic 9: Factorial treatment structures (Part I) Factorial experiments Allow the investigation of the effects of a number of variables on some response in a highly efficient manner, and in a

More information

FACTORIAL DESIGNS and NESTED DESIGNS

FACTORIAL DESIGNS and NESTED DESIGNS Experimental Design and Statistical Methods Workshop FACTORIAL DESIGNS and NESTED DESIGNS Jesús Piedrafita Arilla jesus.piedrafita@uab.cat Departament de Ciència Animal i dels Aliments Items Factorial

More information

Lec 5: Factorial Experiment

Lec 5: Factorial Experiment November 21, 2011 Example Study of the battery life vs the factors temperatures and types of material. A: Types of material, 3 levels. B: Temperatures, 3 levels. Example Study of the battery life vs the

More information

8/04/2011. last lecture: correlation and regression next lecture: standard MR & hierarchical MR (MR = multiple regression)

8/04/2011. last lecture: correlation and regression next lecture: standard MR & hierarchical MR (MR = multiple regression) psyc3010 lecture 7 analysis of covariance (ANCOVA) last lecture: correlation and regression next lecture: standard MR & hierarchical MR (MR = multiple regression) 1 announcements quiz 2 correlation and

More information

Regression and the 2-Sample t

Regression and the 2-Sample t Regression and the 2-Sample t James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Regression and the 2-Sample t 1 / 44 Regression

More information

Chapter 4 Experiments with Blocking Factors

Chapter 4 Experiments with Blocking Factors Chapter 4 Experiments with Blocking Factors 許湘伶 Design and Analysis of Experiments (Douglas C. Montgomery) hsuhl (NUK) DAE Chap. 4 1 / 54 The Randomized Complete Block Design (RCBD; 隨機化完全集區設計 ) 1 Variability

More information

Example: 1982 State SAT Scores (First year state by state data available)

Example: 1982 State SAT Scores (First year state by state data available) Lecture 11 Review Section 3.5 from last Monday (on board) Overview of today s example (on board) Section 3.6, Continued: Nested F tests, review on board first Section 3.4: Interaction for quantitative

More information

1 Multiple Regression

1 Multiple Regression 1 Multiple Regression In this section, we extend the linear model to the case of several quantitative explanatory variables. There are many issues involved in this problem and this section serves only

More information

Multiple Regression. Dr. Frank Wood. Frank Wood, Linear Regression Models Lecture 12, Slide 1

Multiple Regression. Dr. Frank Wood. Frank Wood, Linear Regression Models Lecture 12, Slide 1 Multiple Regression Dr. Frank Wood Frank Wood, fwood@stat.columbia.edu Linear Regression Models Lecture 12, Slide 1 Review: Matrix Regression Estimation We can solve this equation (if the inverse of X

More information

Multiple Linear Regression. Chapter 12

Multiple Linear Regression. Chapter 12 13 Multiple Linear Regression Chapter 12 Multiple Regression Analysis Definition The multiple regression model equation is Y = b 0 + b 1 x 1 + b 2 x 2 +... + b p x p + ε where E(ε) = 0 and Var(ε) = s 2.

More information

NC Births, ANOVA & F-tests

NC Births, ANOVA & F-tests Math 158, Spring 2018 Jo Hardin Multiple Regression II R code Decomposition of Sums of Squares (and F-tests) NC Births, ANOVA & F-tests A description of the data is given at http://pages.pomona.edu/~jsh04747/courses/math58/

More information

Answer Keys to Homework#10

Answer Keys to Homework#10 Answer Keys to Homework#10 Problem 1 Use either restricted or unrestricted mixed models. Problem 2 (a) First, the respective means for the 8 level combinations are listed in the following table A B C Mean

More information

Assignment 9 Answer Keys

Assignment 9 Answer Keys Assignment 9 Answer Keys Problem 1 (a) First, the respective means for the 8 level combinations are listed in the following table A B C Mean 26.00 + 34.67 + 39.67 + + 49.33 + 42.33 + + 37.67 + + 54.67

More information

What If There Are More Than. Two Factor Levels?

What If There Are More Than. Two Factor Levels? What If There Are More Than Chapter 3 Two Factor Levels? Comparing more that two factor levels the analysis of variance ANOVA decomposition of total variability Statistical testing & analysis Checking

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the

More information

Assignment 6 Answer Keys

Assignment 6 Answer Keys ssignment 6 nswer Keys Problem 1 (a) The treatment sum of squares can be calculated by SS Treatment = b a ȳi 2 Nȳ 2 i=1 = 5 (5.40 2 + 5.80 2 + 10 2 + 9.80 2 ) 20 7.75 2 = 92.95 Then the F statistic for

More information

Two-factor studies. STAT 525 Chapter 19 and 20. Professor Olga Vitek

Two-factor studies. STAT 525 Chapter 19 and 20. Professor Olga Vitek Two-factor studies STAT 525 Chapter 19 and 20 Professor Olga Vitek December 2, 2010 19 Overview Now have two factors (A and B) Suppose each factor has two levels Could analyze as one factor with 4 levels

More information

Lecture 10. Factorial experiments (2-way ANOVA etc)

Lecture 10. Factorial experiments (2-way ANOVA etc) Lecture 10. Factorial experiments (2-way ANOVA etc) Jesper Rydén Matematiska institutionen, Uppsala universitet jesper@math.uu.se Regression and Analysis of Variance autumn 2014 A factorial experiment

More information

Nonlinear Regression

Nonlinear Regression Nonlinear Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Nonlinear Regression 1 / 36 Nonlinear Regression 1 Introduction

More information

Models for Clustered Data

Models for Clustered Data Models for Clustered Data Edps/Psych/Soc 589 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline Notation NELS88 data Fixed Effects ANOVA

More information

Models for Clustered Data

Models for Clustered Data Models for Clustered Data Edps/Psych/Stat 587 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline Notation NELS88 data Fixed Effects ANOVA

More information

Stat 5303 (Oehlert): Analysis of CR Designs; January

Stat 5303 (Oehlert): Analysis of CR Designs; January Stat 5303 (Oehlert): Analysis of CR Designs; January 2016 1 > resin

More information

Lecture 10 Multiple Linear Regression

Lecture 10 Multiple Linear Regression Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable

More information

ANOVA: Analysis of Variation

ANOVA: Analysis of Variation ANOVA: Analysis of Variation The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative variables depend on which group (given by categorical

More information

BALANCED INCOMPLETE BLOCK DESIGNS

BALANCED INCOMPLETE BLOCK DESIGNS BALANCED INCOMPLETE BLOCK DESIGNS V.K. Sharma I.A.S.R.I., Library Avenue, New Delhi -110012. 1. Introduction In Incomplete block designs, as their name implies, the block size is less than the number of

More information

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test, October 2013

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test, October 2013 UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test, October 2013 STAC67H3 Regression Analysis Duration: One hour and fifty minutes Last Name: First Name: Student

More information

21.0 Two-Factor Designs

21.0 Two-Factor Designs 21.0 Two-Factor Designs Answer Questions 1 RCBD Concrete Example Two-Way ANOVA Popcorn Example 21.4 RCBD 2 The Randomized Complete Block Design is also known as the two-way ANOVA without interaction. A

More information

VIII. ANCOVA. A. Introduction

VIII. ANCOVA. A. Introduction VIII. ANCOVA A. Introduction In most experiments and observational studies, additional information on each experimental unit is available, information besides the factors under direct control or of interest.

More information

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College 1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College Spring 2010 The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative

More information

Swarthmore Honors Exam 2015: Statistics

Swarthmore Honors Exam 2015: Statistics Swarthmore Honors Exam 2015: Statistics 1 Swarthmore Honors Exam 2015: Statistics John W. Emerson, Yale University NAME: Instructions: This is a closed-book three-hour exam having 7 questions. You may

More information

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is

More information

R Output for Linear Models using functions lm(), gls() & glm()

R Output for Linear Models using functions lm(), gls() & glm() LM 04 lm(), gls() &glm() 1 R Output for Linear Models using functions lm(), gls() & glm() Different kinds of output related to linear models can be obtained in R using function lm() {stats} in the base

More information

Notes on Maxwell & Delaney

Notes on Maxwell & Delaney Notes on Maxwell & Delaney PSY710 12 higher-order within-subject designs Chapter 11 discussed the analysis of data collected in experiments that had a single, within-subject factor. Here we extend those

More information

Orthogonal contrasts for a 2x2 factorial design Example p130

Orthogonal contrasts for a 2x2 factorial design Example p130 Week 9: Orthogonal comparisons for a 2x2 factorial design. The general two-factor factorial arrangement. Interaction and additivity. ANOVA summary table, tests, CIs. Planned/post-hoc comparisons for the

More information

STATISTICS 174: APPLIED STATISTICS TAKE-HOME FINAL EXAM POSTED ON WEBPAGE: 6:00 pm, DECEMBER 6, 2004 HAND IN BY: 6:00 pm, DECEMBER 7, 2004 This is a

STATISTICS 174: APPLIED STATISTICS TAKE-HOME FINAL EXAM POSTED ON WEBPAGE: 6:00 pm, DECEMBER 6, 2004 HAND IN BY: 6:00 pm, DECEMBER 7, 2004 This is a STATISTICS 174: APPLIED STATISTICS TAKE-HOME FINAL EXAM POSTED ON WEBPAGE: 6:00 pm, DECEMBER 6, 2004 HAND IN BY: 6:00 pm, DECEMBER 7, 2004 This is a take-home exam. You are expected to work on it by yourself

More information

Chapter 8 Conclusion

Chapter 8 Conclusion 1 Chapter 8 Conclusion Three questions about test scores (score) and student-teacher ratio (str): a) After controlling for differences in economic characteristics of different districts, does the effect

More information

Topic 13. Analysis of Covariance (ANCOVA) - Part II [ST&D Ch. 17]

Topic 13. Analysis of Covariance (ANCOVA) - Part II [ST&D Ch. 17] Topic 13. Analysis of Covariance (ANCOVA) - Part II [ST&D Ch. 17] 13.5 Assumptions of ANCOVA The assumptions of analysis of covariance are: 1. The X s are fixed, measured without error, and independent

More information

MTH6134 / MTH6134P: Statistical Modelling II

MTH6134 / MTH6134P: Statistical Modelling II Main Examination period 2017 MTH6134 / MTH6134P: Statistical Modelling II Duration: 2 hours Student number Desk number Name and type of calculator used Apart from this page, you are not permitted to read

More information

Swarthmore Honors Exam 2012: Statistics

Swarthmore Honors Exam 2012: Statistics Swarthmore Honors Exam 2012: Statistics 1 Swarthmore Honors Exam 2012: Statistics John W. Emerson, Yale University NAME: Instructions: This is a closed-book three-hour exam having six questions. You may

More information

Outline Topic 21 - Two Factor ANOVA

Outline Topic 21 - Two Factor ANOVA Outline Topic 21 - Two Factor ANOVA Data Model Parameter Estimates - Fall 2013 Equal Sample Size One replicate per cell Unequal Sample size Topic 21 2 Overview Now have two factors (A and B) Suppose each

More information

Topic 9: Factorial treatment structures. Introduction. Terminology. Example of a 2x2 factorial

Topic 9: Factorial treatment structures. Introduction. Terminology. Example of a 2x2 factorial Topic 9: Factorial treatment structures Introduction A common objective in research is to investigate the effect of each of a number of variables, or factors, on some response variable. In earlier times,

More information

STAT 571A Advanced Statistical Regression Analysis. Chapter 8 NOTES Quantitative and Qualitative Predictors for MLR

STAT 571A Advanced Statistical Regression Analysis. Chapter 8 NOTES Quantitative and Qualitative Predictors for MLR STAT 571A Advanced Statistical Regression Analysis Chapter 8 NOTES Quantitative and Qualitative Predictors for MLR 2015 University of Arizona Statistics GIDP. All rights reserved, except where previous

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,

More information

Lecture 22 Mixed Effects Models III Nested designs

Lecture 22 Mixed Effects Models III Nested designs Lecture 22 Mixed Effects Models III Nested designs 94 Introduction: Crossed Designs The two-factor designs considered so far involve every level of the first factor occurring with every level of the second

More information

STAT 420: Methods of Applied Statistics

STAT 420: Methods of Applied Statistics STAT 420: Methods of Applied Statistics Model Diagnostics Transformation Shiwei Lan, Ph.D. Course website: http://shiwei.stat.illinois.edu/lectures/stat420.html August 15, 2018 Department

More information

Lecture 18: Simple Linear Regression

Lecture 18: Simple Linear Regression Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength

More information

Increasing precision by partitioning the error sum of squares: Blocking: SSE (CRD) à SSB + SSE (RCBD) Contrasts: SST à (t 1) orthogonal contrasts

Increasing precision by partitioning the error sum of squares: Blocking: SSE (CRD) à SSB + SSE (RCBD) Contrasts: SST à (t 1) orthogonal contrasts Lecture 13 Topic 9: Factorial treatment structures (Part II) Increasing precision by partitioning the error sum of squares: s MST F = = MSE 2 among = s 2 within SST df trt SSE df e Blocking: SSE (CRD)

More information

Unit 9: Confounding and Fractional Factorial Designs

Unit 9: Confounding and Fractional Factorial Designs Unit 9: Confounding and Fractional Factorial Designs STA 643: Advanced Experimental Design Derek S. Young 1 Learning Objectives Understand what it means for a treatment to be confounded with blocks Know

More information