Topic 29: Three-Way ANOVA

Similar documents
Topic 28: Unequal Replication in Two-Way ANOVA

SAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model

Topic 32: Two-Way Mixed Effects Model

Topic 20: Single Factor Analysis of Variance

Unbalanced Designs Mechanics. Estimate of σ 2 becomes weighted average of treatment combination sample variances.

Outline. Topic 22 - Interaction in Two Factor ANOVA. Interaction Not Significant. General Plan

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Outline Topic 21 - Two Factor ANOVA

Two-factor studies. STAT 525 Chapter 19 and 20. Professor Olga Vitek

Topic 25 - One-Way Random Effects Models. Outline. Random Effects vs Fixed Effects. Data for One-way Random Effects Model. One-way Random effects

STAT 705 Chapters 23 and 24: Two factors, unequal sample sizes; multi-factor ANOVA

Statistics 512: Applied Linear Models. Topic 9

Outline. Topic 19 - Inference. The Cell Means Model. Estimates. Inference for Means Differences in cell means Contrasts. STAT Fall 2013

SAS Program Part 1: proc import datafile="y:\iowa_classes\stat_5201_design\examples\2-23_drillspeed_feed\mont_5-7.csv" out=ds dbms=csv replace; run;

Lecture 9: Factorial Design Montgomery: chapter 5

Topic 23: Diagnostics and Remedies

Statistics 512: Applied Linear Models. Topic 7

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3

Topic 14: Inference in Multiple Regression

Chapter 8 Quantitative and Qualitative Predictors

Chapter 2 Inferences in Simple Linear Regression

One-Way Analysis of Variance (ANOVA) There are two key differences regarding the explanatory variable X.

Factorial Treatment Structure: Part I. Lukas Meier, Seminar für Statistik

6 Designs with Split Plots

Lec 5: Factorial Experiment

Topic 16: Multicollinearity and Polynomial Regression

Factorial and Unbalanced Analysis of Variance

Outline. Review regression diagnostics Remedial measures Weighted regression Ridge regression Robust regression Bootstrapping

Outline. Topic 20 - Diagnostics and Remedies. Residuals. Overview. Diagnostics Plots Residual checks Formal Tests. STAT Fall 2013

Chapter 19. More Complex ANOVA Designs Three-way ANOVA

Topic 18: Model Selection and Diagnostics

Lecture 27 Two-Way ANOVA: Interaction

Comparison of a Population Means

Analysis of Covariance

Chapter 1 Linear Regression with One Predictor

Single Factor Experiments

STAT 705 Chapter 16: One-way ANOVA

The Random Effects Model Introduction

SIZE = Vehicle size: 1 small, 2 medium, 3 large. SIDE : 1 right side of car, 2 left side of car

Lecture 10: 2 k Factorial Design Montgomery: Chapter 6

More than one treatment: factorials

6. Multiple regression - PROC GLM

using the beginning of all regression models

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

STAT 401A - Statistical Methods for Research Workers

Lecture 4. Checking Model Adequacy

y = µj n + β 1 b β b b b + α 1 t α a t a + e

Stat 217 Final Exam. Name: May 1, 2002

Lecture 10: Experiments with Random Effects

Lecture 12: 2 k Factorial Design Montgomery: Chapter 6

Multiple Sample Numerical Data

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Lecture 7: Latin Squares and Related Designs

Lecture 1 Linear Regression with One Predictor Variable.p2

Analysis of variance and regression. November 22, 2007

Analysis of Variance Bios 662

. Example: For 3 factors, sse = (y ijkt. " y ijk

1 Tomato yield example.

Overview Scatter Plot Example

STAT 8200 Design of Experiments for Research Workers Lab 11 Due: Friday, Nov. 22, 2013

Statistics 512: Applied Linear Models. Topic 1

Lecture 3: Inference in SLR

Outline. Analysis of Variance. Comparison of 2 or more groups. Acknowledgements. Comparison of serveral groups

Differences of Least Squares Means

PLS205 Lab 2 January 15, Laboratory Topic 3

Nesting and Mixed Effects: Part I. Lukas Meier, Seminar für Statistik

Outline. Analysis of Variance. Acknowledgements. Comparison of 2 or more groups. Comparison of serveral groups

Course Information Text:

Lecture 7: Latin Square and Related Design

Unbalanced Designs & Quasi F-Ratios

Lecture 5: Comparing Treatment Means Montgomery: Section 3-5

General Linear Model (Chapter 4)

Lecture 11: Blocking and Confounding in 2 k design

Linear models Analysis of Covariance

Chapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs

Linear models Analysis of Covariance

Linear Combinations. Comparison of treatment means. Bruce A Craig. Department of Statistics Purdue University. STAT 514 Topic 6 1

Lecture 12 Inference in MLR

Lecture 10 Multiple Linear Regression

Analysis of variance and regression. April 17, Contents Comparison of several groups One-way ANOVA. Two-way ANOVA Interaction Model checking

Multiple Regression Examples

T-test: means of Spock's judge versus all other judges 1 12:10 Wednesday, January 5, judge1 N Mean Std Dev Std Err Minimum Maximum

Power & Sample Size Calculation

Reference: Chapter 14 of Montgomery (8e)

Unbalanced Data in Factorials Types I, II, III SS Part 2

Analysis of variance. April 16, Contents Comparison of several groups

ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003

2-way analysis of variance

Analysis of variance. April 16, 2009

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS

data proc sort proc corr run proc reg run proc glm run proc glm run proc glm run proc reg CONMAIN CONINT run proc reg DUMMAIN DUMINT run proc reg

Sections 7.1, 7.2, 7.4, & 7.6

Incomplete Block Designs

Lecture 2 Simple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: Chapter 1

Orthogonal contrasts for a 2x2 factorial design Example p130

Assessing Model Adequacy

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.

N J SS W /df W N - 1

Stat 315c: Transposable Data Rasch model and friends

Transcription:

Topic 29: Three-Way ANOVA

Outline Three-way ANOVA Data Model Inference

Data for three-way ANOVA Y, the response variable Factor A with levels i = 1 to a Factor B with levels j = 1 to b Factor C with levels k = 1 to c Y ijkl is the l th observation in cell (i,j,k), l = 1 to n ijk A balanced design has n ijk =n

KNNL Example KNNL p 1005 Y is exercise tolerance, minutes until fatigue on a bicycle test A is sex, a=2 levels: male, female B is percent body fat, b=2 levels: high, low C is smoking history, c=2 levels: light, heavy n=3 persons aged 25-35 per (i,j,k) cell

Read and check the data data a1; infile 'c:\...\ch24ta04.txt'; input extol sex fat smoke; proc print data=a1; run;

Obs extol sex fat smoke 1 24.1 1 1 1 2 29.2 1 1 1 3 24.6 1 1 1 4 20.0 2 1 1 5 21.9 2 1 1 6 17.6 2 1 1 7 14.6 1 2 1 8 15.3 1 2 1 9 12.3 1 2 1 10 16.1 2 2 1 11 9.3 2 2 1 12 10.8 2 2 1 13 17.6 1 1 2... 24 6.1 2 2 2

Define variable for a plot data a1; set a1; if (sex eq 1)*(fat eq 1)*(smoke eq 1) then gfs='1_mfs'; if (sex eq 1)*(fat eq 2)*(smoke eq 1) then gfs='2_mfs'; if (sex eq 1)*(fat eq 1)*(smoke eq 2) then gfs='3_mfs'; if (sex eq 1)*(fat eq 2)*(smoke eq 2) then gfs='4_mfs'; if (sex eq 2)*(fat eq 1)*(smoke eq 1) then gfs='5_ffs'; if (sex eq 2)*(fat eq 2)*(smoke eq 1) then gfs='6_ffs'; if (sex eq 2)*(fat eq 1)*(smoke eq 2) then gfs='7_ffs'; if (sex eq 2)*(fat eq 2)*(smoke eq 2) then gfs='8_ffs'; run;

Obs extol sex fat smoke gfs 1 24.1 1 1 1 1_Mfs 2 29.2 1 1 1 1_Mfs 3 24.6 1 1 1 1_Mfs 4 17.6 1 1 2 3_MfS 5 18.8 1 1 2 3_MfS 6 23.2 1 1 2 3_MfS 7 14.6 1 2 1 2_MFs 8 15.3 1 2 1 2_MFs 9 12.3 1 2 1 2_MFs 10 14.9 1 2 2 4_MFS 11 20.4 1 2 2 4_MFS 12 12.8 1 2 2 4_MFS

Plot the data title1 'Plot of the data'; symbol1 v=circle i=none c=black; proc gplot data=a1; plot extol*gfs/frame; run;

Find the means proc sort data=a1; by sex fat smoke; proc means data=a1; output out=a2 mean=avextol; by sex fat smoke;

Define fat*smoke data a2; set a2; if (fat eq 1)*(smoke eq 1) then fs='1_fs'; if (fat eq 1)*(smoke eq 2) then fs='2_fs'; if (fat eq 2)*(smoke eq 1) then fs='3_fs'; if (fat eq 2)*(smoke eq 2) then fs='4_fs';

Obs sex fat smoke FR avextol fs 1 1 1 1 3 25.97 1_fs 2 2 1 1 3 19.83 1_fs 3 1 1 2 3 19.87 2_fS 4 2 1 2 3 12.13 2_fS 5 1 2 1 3 14.07 3_Fs 6 2 2 1 3 12.07 3_Fs 7 1 2 2 3 16.03 4_FS 8 2 2 2 3 10.20 4_FS

Plot the means proc sort data=a2; by fs; title1 'Plot of the means'; symbol1 v='m' i=join c=black; symbol2 v='f' i=join c=black; proc gplot data=a2; plot avextol*fs=sex/frame; run;

Cell means model Y ijkl = μ ijk + ε ijkl where μ ijk is the theoretical mean or expected value of all observations in cell (i,j,k) the ε ijkl are iid N(0, σ 2 ) Y ijkl ~ N(μ ijk, σ 2 ), independent

Estimates Estimate μ ijk by the mean of the observations in cell (i,j,k), Y ijk ˆ Y Y n ijk ijk l ijkl ijk For each (i,j,k) combination, we can get an estimate of the variance 2 2 s ijk Yijkl Y l ijk nijk 1 We need to combine these to get an estimate of σ 2

Pooled estimate of σ 2 We pool the s ijk2, giving weights proportional to the df, n ijk -1 The pooled estimate is s n ijk 1 s ijk nijk 1 2 2 ijk ijk

Factor effects model Model cell mean as μ ijk = μ + α i + β j + γ k + (αβ) ij + (αγ) ik + (βγ) jk + (αβγ) ijk μ is the overall mean α i, β j, γ k are the main effects of A, B, and C (αβ) ij, (αγ) ik, and (βγ) jk are the two-way interactions (first-order interactions) (αβγ) ijk is the three-way interaction (second-order interaction) Extension of the usual constraints apply

ANOVA table Sources of model variation are the three main effects, the three two-way interactions, and the one three-way interaction With balanced data the SS and DF add to the model SS and DF Still have Model + Error = Total Each effect is tested by an F statistic with MSE in the denominator

Run proc glm proc glm data=a1; class sex fat smoke; model extol=sex fat smoke sex*fat sex*smoke fat*smoke sex*fat*smoke; means sex*fat*smoke; run;

Run proc glm proc glm data=a1; class sex fat smoke; model extol=sex fat smoke; means sex*fat*smoke; run; Shorthand way to express model

SAS Parameter Estimates Solution option on the model statement gives parameter estimates for the glm parameterization These are as we have seen before; any main effect or interaction with a subscript of a, b, or c is zero These reproduce the cell means in the usual way

ANOVA Table Source DF Sum of Squares Mean Square F Value Pr > F Model 7 588.582917 84.0832738 9.01 0.0002 Error 16 149.366667 9.3354167 Corrected Total 23 737.949583 Type I and III SS the same here

Factor effects output Source DF Type I SS Mean Square F Value Pr > F sex 1 176.58375 176.5837500 18.92 0.0005 fat 1 242.57042 242.5704167 25.98 0.0001 sex*fat 1 13.650417 13.6504167 1.46 0.2441 smoke 1 70.383750 70.3837500 7.54 0.0144 sex*smoke 1 11.0704167 11.0704167 1.19 0.2923 fat*smoke 1 72.453750 72.4537500 7.76 0.0132 sex*fat*smoke 1 1.8704167 1.8704167 0.20 0.6604

Analytical Strategy First examine interactions highest order to lowest order Some options when one or more interactions are significant Interpret the plot of means Run analyses for each level of one factor, eg run A*B by C (lsmeans with slice option) Run as a one-way with abc levels Define a composite factor by combining two factors, eg AB with ab levels Use contrasts

Analytical Strategy Some options when no interactions are significant Use a multiple comparison procedure for the main effects Use contrasts When needed, rerun without the interactions

Example Interpretation Since there appears to be a fat by smoke interaction, let s run a two-way ANOVA (no interaction-note:pooling not necessary here) using the fat*smoke variable Note that we could also use the interaction plot to describe the interaction

Run glm proc glm data=a1; class sex fs; model extol=sex fs; means sex fs/tukey; run;

ANOVA Table Source DF Sum of Squares Mean Square F Value Pr > F Model 4 561.99167 140.4979167 15.17 <.0001 Error 19 175.95792 9.2609430 Corrected Total 23 737.94958

Factor effects output Source DF Type I SS Mean Square F Value Pr > F sex 1 176.58375 176.5837500 19.07 0.0003 fs 3 385.40792 128.4693056 13.87 <.0001 Both are significant as expected compare means

Means for sex Mean N sex A 18.983 12 1 B 13.558 12 2

Tukey comparisons for fs Mean N fs A 22.900 6 1_fs B 16.000 6 2_fS B B 13.117 6 4_FS B B 13.067 6 3_Fs

Conclusions sex difference with males having a roughly 5.5 minute higher exercise tolerance beneficial to add CI here There was a smoking history by body fat level interaction where those who were low body fat and had a light smoking history had a significantly higher exercise tolerance than the other three groups

Last slide Read NKNW Chapter 24 We used program topic29.sas to generate the output for today