5.3 Three-Stage Nested Design Example

Size: px
Start display at page:

Download "5.3 Three-Stage Nested Design Example"

Transcription

1 5.3 Three-Stage Nested Design Example A researcher designs an experiment to study the of a metal alloy. A three-stage nested design was conducted that included Two alloy chemistry compositions. Three ovens for each alloy chemistry composition (6 ovens were used). Four ingot molds were used to produce alloy ingots for each of the six combinations of alloy chemistry composition and oven (24 molds were used). Three ingots were produced from each of the 24 molds. Molds can only be used once. The experimental data in the table below contains alloy measurements. Alloy Chemistry 1 Oven Mold Alloy Chemistry 2 Oven Mold Analyze this data assuming alloy chemistry and ovens are fixed effects and ingot mold is a random effect. Assume data collection was randomized. Based on the expected mean squares, the F -statistics are F alloy = MS alloy MS mold(alloy oven) F oven(alloy) = MS oven(alloy) MS mold(alloy oven) F mold(oven alloy) = MS mold(oven alloy) MS E The tests for oven(alloy) and mold(oven*alloy) are significant with p-values of <.1 and.4, respectively. The variance component estimates are σ 2 = and σ mold 2 =. Thus, the random replication variability is estimated to be about 27% larger than the variability due to molds. THREE-STAGE NESTED DESIGN: ALLOY AND oven FIXED, MOLD RANDOM Class Level Information Class Levels Values alloy oven mold Number of Observations Read Number of Observations Used 72

2 Dependent Variable: Source DF Sum of Squares Mean Square F Value Pr > F Model <.1 Error Corrected Total R-Square Coeff Var Root MSE Mean alloy THREE-STAGE oven(alloy) NESTED DESIGN: ALLOY AND oven FIXED , MOLD <.1 RANDOM mold(alloy*oven) Source Type III Expected Mean Square alloy Var(Error) + 3 Var(mold(alloy*oven)) + Q(alloy,oven(alloy)) oven(alloy) Var(Error) + 3 Var(mold(alloy*oven)) + Q(oven(alloy)) THREE-STAGE NESTED DESIGN: ALLOY AND oven FIXED, MOLD RANDOM mold(alloy*oven) Var(Error) + 3 Var(mold(alloy*oven)) Tests of Hypotheses for Mixed Model Analysis of Variance Dependent Variable: * alloy oven(alloy) <.1 Error Error: MS(mold(alloy*oven)) * This test assumes one or more other fixed effects are zero. mold(alloy*oven) Error: MS(Error)

3 THREE-STAGE NESTED DESIGN: ALLOY AND oven FIXED, MOLD RANDOM 5 6 Distribution of THREE-STAGE NESTED DESIGN: ALLOY AND oven FIXED, MOLD RANDOM alloy Distribution of alloy N Mean Std Dev alloy 2 N Mean Std Dev THREE-STAGE NESTED DESIGN: ALLOY AND oven FIXED, MOLD RANDOM 5 6 Distribution of oven oven oven(alloy) alloy N Mean Std Dev alloy N Mean Std Dev

4 Var(mold(alloy*oven)) Number of Observations Used THREE-STAGE NESTED DESIGN: ALLOY AND oven FIXED, MOLD RANDOM THREE-STAGE NESTED DESIGN: ALLOY AND oven FIXED, MOLD RANDOM Distribution of Distribution of 6 6 VARIANCE COMPONENTS ANALYSIS 5 5 Variance Components Estimation Procedure mold(alloy*oven) Class Level Information Class Levels Values alloy oven mold(alloy*oven) mold mold alloy oven N Mean Std Dev mold alloy oven N Mean Std Dev Number 2. of Observations Read Number 1 THREE-STAGE of 1Observations 3NESTED Used DESIGN: ALLOY AND oven FIXED, MOLD RANDOM mold 59.8alloy oven N Mean Std Dev Dependent Variable: REML 3 Iterations Iteration Objective 2.Var(mold(alloy*oven)) Var(Error) VARIANCE COMPONENTS ANALYSIS Convergence criteria met. Variance Components Estimation Procedure REML Class Estimates Level Information Variance Component Estimate Class Levels Values Var(mold(alloy*oven)) alloy Var(Error) oven mold Asymptotic Covariance Matrix of Estimates Number of Observations Var(mold(alloy*oven)) 28 Read 72 Var(Error)

5 Analyze the data again but assume chemistry alloy is a fixed effect and oven and mold are random effects. That is, the 6 ovens were randomly selected from a larger population of ovens. Based on the expected mean squares, the F -statistics are F alloy = MS alloy MS oven(alloy) F oven(alloy) = MS oven(alloy) MS mold(alloy oven) F mold(oven alloy) = MS mold(oven alloy) MS E The tests for variance components for oven(alloy) and mold(oven*alloy) are significant with p-values of <.1 and.4, respectively. The variance component estimates are σ 2 =, σ 2 mold =, and σ2 oven = This implies there is very large variability due to changes in oven. It is estimated to be about 1.5 s larger than the random replication variability, and it is about 13.3 s larger than the variability due to molds. The engineers need to first focus on reducing the oven-to-oven variability. THREE-STAGE NESTED DESIGN: ALLOY FIXED, oven AND MOLD RANDOM Dependent Variable: Source DF Sum of Squares Mean Square F Value Pr > F Model <.1 Error Corrected Total R-Square Coeff Var Root MSE Mean alloy THREE-STAGE NESTED DESIGN: ALLOY FIXED, oven AND MOLD RANDOM oven(alloy) <.1 mold(alloy*oven) 18 The GLM Procedure Source alloy oven(alloy) Type III Expected Mean Square Var(Error) + 3 Var(mold(alloy*oven)) + 12 Var(oven(alloy)) + Q(alloy) Var(Error) + 3 Var(mold(alloy*oven)) + 12 Var(oven(alloy)) mold(alloy*oven) Var(Error) + 3 Var(mold(alloy*oven)) 29

6 THREE-STAGE NESTED DESIGN: ALLOY FIXED, oven AND MOLD RANDOM Tests of Hypotheses for Mixed Model Analysis of Variance Dependent Variable: Source VARIANCE DF Type COMPONENTS III SS Mean Square ANALYSIS F Value Pr > F alloy Variance Components Estimation Procedure.886 Error Class Level Error: MS(oven(alloy)) Information Class Levels Values Source DF alloy Type III SS2 Mean 1 2 Square F Value Pr > F oven(alloy) 4 oven <.1 Error 18 mold Error: MS(mold(alloy*oven)) Number of Observations Read 72 Source Number DFof Type Observations III SS Mean Used Square 72 F Value Pr > F mold(alloy*oven) Error: MS(Error) Dependent Variable: REML Iterations Iteration Objective Var(oven(alloy)) Var(mold(alloy*oven)) Var(Error) Convergence criteria met. REML Estimates Variance Component Estimate Var(oven(alloy)) Var(mold(alloy*oven)) Var(Error) Asymptotic Covariance Matrix of Estimates Var(oven(alloy)) Var(mold(alloy*oven)) Var(Error) Var(oven(alloy)) E-13 Var(mold(alloy*oven)) Var(Error) -6.2E

7 SAS Code for Three-Stage Nested Design Example DM LOG; CLEAR; OUT; CLEAR; ; ODS GRAPHICS ON; ODS PRINTER PDF file= C:\COURSES\ST541\NESTED3.PDF ; OPTIONS NODATE NONUMBER; **********************************; *** THREE-STAGE NESTED DESIGNS ***; **********************************; DATA in; DO alloy=1 TO 2; DO oven=1 TO 3; DO mold=1 TO 4; DO ingot = 1 to 3; OUTPUT; END; END; END; END; LINES; ; PROC GLM DATA=in; CLASS alloy oven mold; MODEL = alloy oven(alloy) mold(alloy oven) / SS3; RANDOM mold(alloy oven) / TEST; MEANS alloy oven(alloy) mold(alloy oven); TITLE THREE-STAGE NESTED DESIGN: ALLOY and OVEN FIXED, MOLD RANDOM ; PROC VARCOMP DATA=in METHOD=REML; CLASS alloy oven mold; MODEL = alloy oven(alloy) mold(alloy oven) / FIXED=2; TITLE VARIANCE COMPONENTS ANALYSIS ; PROC GLM DATA=in; CLASS alloy oven mold; MODEL = alloy oven(alloy) mold(alloy oven) / SS3; RANDOM oven(alloy) mold(alloy oven) / TEST; * MEANS alloy oven(alloy) mold(alloy oven); TITLE THREE-STAGE NESTED DESIGN: ALLOY FIXED, OVEN and MOLD RANDOM ; PROC VARCOMP DATA=in METHOD=REML; CLASS alloy oven mold; MODEL = alloy oven(alloy) mold(alloy oven) / FIXED=1; TITLE VARIANCE COMPONENTS ANALYSIS ; RUN; 211

8 5.4 Examples with Crossed and Nested Effects Three Factor Example An industrial engineer is studying the manual assembly of electronic components on circuit boards. The goal is to improve the speed of the assembly operation. The engineer has designed three assembly fixtures and two workplace layouts that seem promising. Operators are required to manually perform the assembly, and it is was decided to randomly select four operators for each fixture and layout combination. Because the workplaces are in different locations within the plant, it is difficult to use the same four operators for each layout. Therefore, the four operators chosen for layout 1 are different individuals than the four operators chosen for layout 2. Because the eight operators were also chosen at random, we have a mixed model containing both crossed (factorial) and nested effects. The data collection was completely randomized and two replicates were obtained. The assembly s measured in seconds are given in the table below. Layout 1 Layout 2 Operator y i Fixture Fixture Fixture y jk y j y = 1252 What is the model equatioin associated with this design? Class Level Information Class Levels Values fixture layout operator Number of Observations Read 48 Number of Observations Used

9 Dependent Variable: Source DF Sum of Squares Mean Square F Value Pr > F Model Error Corrected Total R-Square Coeff Var Root MSE Mean fixture <.1 layout fixture*layout operator(layout) fixtu*operat(layout) Fit Diagnostics for Residual RStudent RStudent Predicted Value Predicted Value Leverage Percent Residual Quantile Residual Predicted Value Fit Mean Residual Proportion Less Cook's D Observation Observations 48 Parameters 24 Error DF 24 MSE R-Square.818 Adj R-Square

10 Source fixture layout fixture*layout operator(layout) fixtu*operat(layout) ependent Variable: Type III Expected Mean Square Var(Error) + 2 Var(fixtu*operat(layout)) + Q(fixture,fixture*layout) Var(Error) + 2 Var(fixtu*operat(layout)) + 6 Var(operator(layout)) + Q(layout,fixture*layout) Var(Error) + 2 Var(fixtu*operat(layout)) + Q(fixture*layout) Var(Error) + 2 Var(fixtu*operat(layout)) + 6 Var(operator(layout)) Var(Error) + 2 Var(fixtu*operat(layout)) Tests of Hypotheses for Mixed Model Analysis of Variance * fixture fixture*layout operator(layout) Error Error: MS(fixtu*operat(layout)) * This test assumes one or more other fixed effects are zero. * layout Error Error: MS(operator(layout)) * This test assumes one or more other fixed effects are zero. fixtu*operat(layout) Error: MS(Error)

11 Distribution of 32.5 Distribution of fixture fixture 32.5 fixture N Mean Std Dev Distribution of fixture N Mean Std Dev Distribution of layout 2 Distribution of layout layout N Mean Std Dev layout N Mean Std Dev Distribution of fixture*layout fixture layout N Mean Std Dev fixture fixture*layout layout N Mean Std Dev operator operator(layout) operator layout N Mean Std Dev layout N Mean Std Dev

12 Distribution DESIGN of WITH NESTED AND CROSSED FACTORS 32.5 Distribution of The UNIVARIATE Procedure Variable: resid Moments N 48 Sum Weights fixture operator Mean Sum Observations fixtu*operat(layout) fixture operator fixtu*operat(layout) Std Deviation Variance Skewness Kurtosis Uncorrected SS 56 Corrected SS 56 Coeff Variation. Std Error Mean layout N Mean Std Dev layout N 1 Mean1Basic Std Statistical Dev 1 2 Measures Location Mean. Std Deviation Median Variance Mode Range Level 31. of Level of fixture operator 1Interquartile layout N Mean Std Dev Range Tests for Location: Mu= Test 1 1 Statistic p Value Student's 24.5t t 3 Pr > 1 t Variability Note: The mode displayed is the smallest of 2 modes with a count of 1. Sign M Pr >= M 1. Signed Rank S -67 Pr >= S Tests for Normality Test Statistic p Value Shapiro-Wilk W.9695 Pr < W.11 Kolmogorov-Smirnov D Pr > D <.1 Cramer-von Mises W-Sq Pr > W-Sq <.5 Anderson-Darling A-Sq Pr > A-Sq <.5 216

13 SAS Code for Three Factor Example DM LOG; CLEAR; OUT; CLEAR; ; ODS GRAPHICS ON; ODS PRINTER PDF file= C:\COURSES\ST541\NESTWF.PDF ; OPTIONS NODATE NONUMBER; **********************************; *** NESTED AND CROSSED FACTORS ***; **********************************; DATA in; DO fixture=1 TO 3; DO layout=1 TO 2; DO operator=1 TO 4; DO rep=1 TO 2; OUTPUT; END; END; END; END; LINES; ; PROC GLM DATA=in PLOTS=(ALL); CLASS fixture layout operator; MODEL = fixture layout fixture operator(layout) / SS3; RANDOM operator(layout) fixture*operator(layout) / TEST; MEANS fixture layout fixture operator(layout) ; OUTPUT OUT=diag R=resid; TITLE ; PROC UNIVARIATE DATA= diag NORMAL; VAR resid; RUN; Partitioning the SS total 217

14 5.4.2 Four Factor Example For the following quality control example, Suppliers and Operators are fixed effects. The two production runs for operator D are different than the two production runs for operator N. From each supplier, two random batches of raw materials were used in this study. Hence, Production Run and Batches are random effects. Supplier A B C Batch Batch Batch Run Operator D Run Run Operator N Run What is the model equation associated with this design? There are exact F -tests for Run(Operator), Batch(Supplier), Supplier*Run(Operator), Operator*Batch(Supplier), and Run(Operator)*Batch(Supplier). There are approximate F -tests for Operator, Supplier, and Operator*Supplier. The REML (restricted maximum likelihood) estimates of the variance components were generated. REML estimates do not allow for negative variance components estimates. That is why the one of the estimates is set to. The Supplier*Operator interaction plots indicate parallel lines which is consistent with the p-value of QUALITY CONTROL EXAMPLE-- 2 CROSSED AND 2 NESTED EFFECTS Class Level Information Class Levels Values operator 2 D N supplier 3 A B C run batch Number of Observations Read 72 Number of Observations 218 Used 72

15 Dependent Variable: Source DF Sum of Squares Mean Square F Value Pr > F Model <.1 Error Corrected Total R-Square Coeff Var Root MSE Mean operator <.1 supplier <.1 operator*supplier run(operator) <.1 batch(supplier) <.1 supplie*run(operato) operat*batch(suppli) QUALITY CONTROL EXAMPLE CROSSED AND NESTED EFFECTS <.1 run*batc(oper*suppl) 6The GLM Procedure <.1 Fit Diagnostics for Residual RStudent 2 RStudent Predicted Value Predicted Value Leverage Residual Cook's D Quantile Predicted Value Observation 3 Fit Mean Residual 1 Percent Observations 72 Parameters 24 Error DF 48 MSE.2587 R-Square.9946 Adj R-Square Residual Proportion Less 219

16 QUALITY CONTROL EXAMPLE-- 2 CROSSED AND 2 NESTED EFFECTS Distribution of Distribution of A B C supplier A B C supplier 35 QUALITY CONTROL EXAMPLE-- 2 CROSSED AND 2 NESTED EFFECTS supplier N Mean Std Dev A 24 Distribution of B C QUALITY CONTROL supplier EXAMPLE-- N 2 CROSSED Mean AND Std Dev 2 NESTED EFFEC A The 24GLM Procedure B Distribution of C D N operator operator N Mean Std Dev D D operator QUALITY CONTROL EXAMPLE-- 2 CROSSED AND 2 NESTED EFFECTS operator N Mean Std Dev 35 D Distribution of N N N D A D B D C N A N B N C operator*supplier operator supplier N Mean Std Dev D A D B D C N A D A D B D C N A N B N C operator*supplier N B N C operator supplier N Mean Std Dev D A D B D C N A N B N C

17 QUALITY CONTROL EXAMPLE-- 2 CROSSED AND 2 NESTED EFFECTS Tests of Hypotheses for Mixed Model Analysis of Variance Dependent Variable: * operator Error Error: MS(run(operator)) + MS(operat*batch(suppli)) - MS(run*batc(oper*suppl)) + 12E-16*MS(Error) * This test assumes one or more other fixed effects are zero. * supplier Error Error: MS(batch(supplier)) + MS(supplie*run(operato)) - MS(run*batc(oper*suppl)) + 89E-17*MS(Error) * This test assumes one or more other fixed effects are zero. operator*supplier Error Error: MS(supplie*run(operato)) + MS(operat*batch(suppli)) - MS(run*batc(oper*suppl)) - 89E-17*MS(Error) run(operator) Error Error: MS(supplie*run(operato)) QUALITY Source CONTROL EXAMPLE-- DF Type III 2 SS CROSSED Mean Square AND F 2 Value NESTED Pr > EFFECTS F batch(supplier) Error Tests of Hypotheses 3 for Mixed Model Analysis of Variance Error: MS(operat*batch(suppli)) Dependent Variable: supplie*run(operato) operat*batch(suppli) Error Error: MS(run*batc(oper*suppl)) run*batc(oper*suppl) <.1 Error: MS(Error)

18 VARIANCE COMPONENTS ANALYSIS Variance Components The GLM Estimation Procedure Procedure Source Type III Expected Mean Class Square Level Information operator Var(Error) + 3 Var(run*batc(oper*suppl)) Class Levels+ 6 Values Var(operat*batch(suppli)) + 6 Var(supplie*run(operato)) + 18 Var(run(operator)) + Q(operator,operator*supplier) operator 2 D N supplier Var(Error) + 3 Var(run*batc(oper*suppl)) supplier 3+ 6 A Var(operat*batch(suppli)) B C + 6 Var(supplie*run(operato)) + 12 Var(batch(supplier)) + Q(supplier,operator*supplier) operator*supplier run Var(Error) + 3 Var(run*batc(oper*suppl)) + 6 Var(operat*batch(suppli)) + 6 Var(supplie*run(operato)) batch + Q(operator*supplier) run(operator) Var(Error) + 3 Var(run*batc(oper*suppl)) + 6 Var(supplie*run(operato)) + 18 Var(run(operator)) batch(supplier) Var(Error) + 3 Var(run*batc(oper*suppl)) Number of Observations + 6 Var(operat*batch(suppli)) Read Var(batch(supplier)) supplie*run(operato) Number of Observations Used 72 Var(Error) + 3 Var(run*batc(oper*suppl)) + 6 Var(supplie*run(operato)) operat*batch(suppli) Var(Error) + 3 Var(run*batc(oper*suppl)) + 6 Var(operat*batch(suppli)) run*batc(oper*suppl) Dependent Variable: Var(Error) + 3 Var(run*batc(oper*suppl)) REML Iterations Iteration Objective Var(run(operator)) Var(batch(supplier)) Var(supplie*run(operato)) REML Iterations Iteration Var(operat*batch(suppli)) Var(run*batc(oper*suppl)) Var(Error) Convergence criteria met. VARIANCE COMPONENTS ANALYSIS Variance Components Estimation Procedure REML Estimates Variance Component Estimate Var(run(operator)) Var(batch(supplier)) Var(supplie*run(operato)) Var(operat*batch(suppli)) Var(run*batc(oper*suppl)) Var(Error) Asymptotic Covariance Matrix of Estimates Var(run(operator)) Var(batch(supplier)) Var(supplie*run(operato)) 222 Var(run(operator)) E-16

19 SAS Code for Four Factor Example DM LOG; CLEAR; OUT; CLEAR; ; ODS GRAPHICS ON; ODS PRINTER PDF file= C:\COURSES\ST541\NEST4.PDF ; OPTIONS NODATE NONUMBER; DATA comp; INPUT operator $ run supplier $ LINES; D 1 A D 1 A D 1 A D 1 A D 1 A D 1 A D 1 B D 1 B D 1 B D 1 B D 1 B D 1 B D 1 C D 1 C D 1 C D 1 C D 1 C D 1 C D 2 A D 2 A D 2 A D 2 A D 2 A D 2 A 6 2 D 2 B D 2 B D 2 B D 2 B D 2 B D 2 B D 2 C D 2 C D 2 C 2 8 D 2 C D 2 C D 2 C N 3 A N 3 A N 3 A N 3 A N 3 A N 3 A N 3 B N 3 B 1 1 N 3 B N 3 B N 3 B N 3 B N 3 C N 3 C N 3 C N 3 C N 3 C N 3 C N 4 A N 4 A N 4 A N 4 A N 4 A N 4 A N 4 B N 4 B N 4 B N 4 B N 4 B N 4 B N 4 C N 4 C N 4 C N 4 C N 4 C N 4 C ; PROC GLM DATA=comp PLOTS=(ALL); CLASS operator supplier run batch; MODEL =operator supplier run(operator) batch(supplier) supplier*run(operator) operator*batch(supplier) run*batch(operator supplier) / SS3; MEANS supplier operator; RANDOM run(operator) batch(supplier) supplier*run(operator) operator*batch(supplier) run*batch(operator supplier) / TEST; TITLE QUALITY CONTROL EXAMPLE-- 2 CROSSED AND 2 NESTED EFFECTS ; PROC VARCOMP DATA=comp METHOD=REML; CLASS operator supplier run batch; MODEL =operator supplier operator*supplier run(operator) batch(supplier) supplier*run(operator) operator*batch(supplier) run*batch(operator supplier) / FIXED=3; TITLE VARIANCE COMPONENTS ANALYSIS ; RUN; 223

unadjusted model for baseline cholesterol 22:31 Monday, April 19,

unadjusted model for baseline cholesterol 22:31 Monday, April 19, unadjusted model for baseline cholesterol 22:31 Monday, April 19, 2004 1 Class Level Information Class Levels Values TRETGRP 3 3 4 5 SEX 2 0 1 Number of observations 916 unadjusted model for baseline cholesterol

More information

This is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.

This is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed. EXST3201 Chapter 13c Geaghan Fall 2005: Page 1 Linear Models Y ij = µ + βi + τ j + βτij + εijk This is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.

More information

Chapter 8 (More on Assumptions for the Simple Linear Regression)

Chapter 8 (More on Assumptions for the Simple Linear Regression) EXST3201 Chapter 8b Geaghan Fall 2005: Page 1 Chapter 8 (More on Assumptions for the Simple Linear Regression) Your textbook considers the following assumptions: Linearity This is not something I usually

More information

EXST7015: Estimating tree weights from other morphometric variables Raw data print

EXST7015: Estimating tree weights from other morphometric variables Raw data print Simple Linear Regression SAS example Page 1 1 ********************************************; 2 *** Data from Freund & Wilson (1993) ***; 3 *** TABLE 8.24 : ESTIMATING TREE WEIGHTS ***; 4 ********************************************;

More information

Handout 1: Predicting GPA from SAT

Handout 1: Predicting GPA from SAT Handout 1: Predicting GPA from SAT appsrv01.srv.cquest.utoronto.ca> appsrv01.srv.cquest.utoronto.ca> ls Desktop grades.data grades.sas oldstuff sasuser.800 appsrv01.srv.cquest.utoronto.ca> cat grades.data

More information

1) Answer the following questions as true (T) or false (F) by circling the appropriate letter.

1) Answer the following questions as true (T) or false (F) by circling the appropriate letter. 1) Answer the following questions as true (T) or false (F) by circling the appropriate letter. T F T F T F a) Variance estimates should always be positive, but covariance estimates can be either positive

More information

Answer Keys to Homework#10

Answer Keys to Homework#10 Answer Keys to Homework#10 Problem 1 Use either restricted or unrestricted mixed models. Problem 2 (a) First, the respective means for the 8 level combinations are listed in the following table A B C Mean

More information

Assignment 9 Answer Keys

Assignment 9 Answer Keys Assignment 9 Answer Keys Problem 1 (a) First, the respective means for the 8 level combinations are listed in the following table A B C Mean 26.00 + 34.67 + 39.67 + + 49.33 + 42.33 + + 37.67 + + 54.67

More information

6 Designs with Split Plots

6 Designs with Split Plots 6 Designs with Split Plots Many factorial experimental designs are incorrectly analyzed because the assumption of complete randomization is not true. Many factorial experiments have one or more restrictions

More information

Odor attraction CRD Page 1

Odor attraction CRD Page 1 Odor attraction CRD Page 1 dm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" ---- + ---+= -/\*"; ODS LISTING; *** Table 23.2 ********************************************;

More information

COMPREHENSIVE WRITTEN EXAMINATION, PAPER III FRIDAY AUGUST 26, 2005, 9:00 A.M. 1:00 P.M. STATISTICS 174 QUESTION

COMPREHENSIVE WRITTEN EXAMINATION, PAPER III FRIDAY AUGUST 26, 2005, 9:00 A.M. 1:00 P.M. STATISTICS 174 QUESTION COMPREHENSIVE WRITTEN EXAMINATION, PAPER III FRIDAY AUGUST 26, 2005, 9:00 A.M. 1:00 P.M. STATISTICS 174 QUESTION Answer all parts. Closed book, calculators allowed. It is important to show all working,

More information

dm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" = -/\<>*"; ODS LISTING;

dm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR= = -/\<>*; ODS LISTING; dm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" ---- + ---+= -/\*"; ODS LISTING; *** Table 23.2 ********************************************; *** Moore, David

More information

Comparison of a Population Means

Comparison of a Population Means Analysis of Variance Interested in comparing Several treatments Several levels of one treatment Comparison of a Population Means Could do numerous two-sample t-tests but... ANOVA provides method of joint

More information

SAS Procedures Inference about the Line ffl model statement in proc reg has many options ffl To construct confidence intervals use alpha=, clm, cli, c

SAS Procedures Inference about the Line ffl model statement in proc reg has many options ffl To construct confidence intervals use alpha=, clm, cli, c Inference About the Slope ffl As with all estimates, ^fi1 subject to sampling var ffl Because Y jx _ Normal, the estimate ^fi1 _ Normal A linear combination of indep Normals is Normal Simple Linear Regression

More information

Single Factor Experiments

Single Factor Experiments Single Factor Experiments Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 4 1 Analysis of Variance Suppose you are interested in comparing either a different treatments a levels

More information

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3 Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3 Fall, 2013 Page 1 Tensile Strength Experiment Investigate the tensile strength of a new synthetic fiber. The factor is the

More information

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3 Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3 Page 1 Tensile Strength Experiment Investigate the tensile strength of a new synthetic fiber. The factor is the weight percent

More information

4.8 Alternate Analysis as a Oneway ANOVA

4.8 Alternate Analysis as a Oneway ANOVA 4.8 Alternate Analysis as a Oneway ANOVA Suppose we have data from a two-factor factorial design. The following method can be used to perform a multiple comparison test to compare treatment means as well

More information

7 The Analysis of Response Surfaces

7 The Analysis of Response Surfaces 7 The Analysis of Response Surfaces Goal: the researcher is seeking the experimental conditions which are most desirable, that is, determine optimum design variable levels. Once the researcher has determined

More information

Chapter 11 : State SAT scores for 1982 Data Listing

Chapter 11 : State SAT scores for 1982 Data Listing EXST3201 Chapter 12a Geaghan Fall 2005: Page 1 Chapter 12 : Variable selection An example: State SAT scores In 1982 there was concern for scores of the Scholastic Aptitude Test (SAT) scores that varied

More information

BE640 Intermediate Biostatistics 2. Regression and Correlation. Simple Linear Regression Software: SAS. Emergency Calls to the New York Auto Club

BE640 Intermediate Biostatistics 2. Regression and Correlation. Simple Linear Regression Software: SAS. Emergency Calls to the New York Auto Club BE640 Intermediate Biostatistics 2. Regression and Correlation Simple Linear Regression Software: SAS Emergency Calls to the New York Auto Club Source: Chatterjee, S; Handcock MS and Simonoff JS A Casebook

More information

Booklet of Code and Output for STAC32 Final Exam

Booklet of Code and Output for STAC32 Final Exam Booklet of Code and Output for STAC32 Final Exam December 8, 2014 List of Figures in this document by page: List of Figures 1 Popcorn data............................. 2 2 MDs by city, with normal quantile

More information

Introduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes)

Introduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes) Introduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes) Asheber Abebe Discrete and Statistical Sciences Auburn University Contents 1 Completely Randomized Design

More information

Outline. Review regression diagnostics Remedial measures Weighted regression Ridge regression Robust regression Bootstrapping

Outline. Review regression diagnostics Remedial measures Weighted regression Ridge regression Robust regression Bootstrapping Topic 19: Remedies Outline Review regression diagnostics Remedial measures Weighted regression Ridge regression Robust regression Bootstrapping Regression Diagnostics Summary Check normality of the residuals

More information

Biological Applications of ANOVA - Examples and Readings

Biological Applications of ANOVA - Examples and Readings BIO 575 Biological Applications of ANOVA - Winter Quarter 2010 Page 1 ANOVA Pac Biological Applications of ANOVA - Examples and Readings One-factor Model I (Fixed Effects) This is the same example for

More information

Lecture 4. Random Effects in Completely Randomized Design

Lecture 4. Random Effects in Completely Randomized Design Lecture 4. Random Effects in Completely Randomized Design Montgomery: 3.9, 13.1 and 13.7 1 Lecture 4 Page 1 Random Effects vs Fixed Effects Consider factor with numerous possible levels Want to draw inference

More information

Descriptions of post-hoc tests

Descriptions of post-hoc tests Experimental Statistics II Page 81 Descriptions of post-hoc tests Post-hoc or Post-ANOVA tests! Once you have found out some treatment(s) are different, how do you determine which one(s) are different?

More information

STATISTICS 479 Exam II (100 points)

STATISTICS 479 Exam II (100 points) Name STATISTICS 79 Exam II (1 points) 1. A SAS data set was created using the following input statement: Answer parts(a) to (e) below. input State $ City $ Pop199 Income Housing Electric; (a) () Give the

More information

PLS205 Lab 2 January 15, Laboratory Topic 3

PLS205 Lab 2 January 15, Laboratory Topic 3 PLS205 Lab 2 January 15, 2015 Laboratory Topic 3 General format of ANOVA in SAS Testing the assumption of homogeneity of variances by "/hovtest" by ANOVA of squared residuals Proc Power for ANOVA One-way

More information

Failure Time of System due to the Hot Electron Effect

Failure Time of System due to the Hot Electron Effect of System due to the Hot Electron Effect 1 * exresist; 2 option ls=120 ps=75 nocenter nodate; 3 title of System due to the Hot Electron Effect ; 4 * TIME = failure time (hours) of a system due to drift

More information

EXST Regression Techniques Page 1. We can also test the hypothesis H :" œ 0 versus H :"

EXST Regression Techniques Page 1. We can also test the hypothesis H : œ 0 versus H : EXST704 - Regression Techniques Page 1 Using F tests instead of t-tests We can also test the hypothesis H :" œ 0 versus H :" Á 0 with an F test.! " " " F œ MSRegression MSError This test is mathematically

More information

7.3 Ridge Analysis of the Response Surface

7.3 Ridge Analysis of the Response Surface 7.3 Ridge Analysis of the Response Surface When analyzing a fitted response surface, the researcher may find that the stationary point is outside of the experimental design region, but the researcher wants

More information

These are multifactor experiments that have

These are multifactor experiments that have Design of Engineering Experiments Nested Designs Text reference, Chapter 14, Pg. 525 These are multifactor experiments that have some important industrial applications Nested and split-plot designs frequently

More information

Lecture notes on Regression & SAS example demonstration

Lecture notes on Regression & SAS example demonstration Regression & Correlation (p. 215) When two variables are measured on a single experimental unit, the resulting data are called bivariate data. You can describe each variable individually, and you can also

More information

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing

More information

ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003

ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003 ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003 The MEANS Procedure DRINKING STATUS=1 Analysis Variable : TRIGL N Mean Std Dev Minimum Maximum 164 151.6219512 95.3801744

More information

Week 7.1--IES 612-STA STA doc

Week 7.1--IES 612-STA STA doc Week 7.1--IES 612-STA 4-573-STA 4-576.doc IES 612/STA 4-576 Winter 2009 ANOVA MODELS model adequacy aka RESIDUAL ANALYSIS Numeric data samples from t populations obtained Assume Y ij ~ independent N(μ

More information

Statistics for exp. medical researchers Comparison of groups, T-tests and ANOVA

Statistics for exp. medical researchers Comparison of groups, T-tests and ANOVA Faculty of Health Sciences Outline Statistics for exp. medical researchers Comparison of groups, T-tests and ANOVA Lene Theil Skovgaard Sept. 14, 2015 Paired comparisons: tests and confidence intervals

More information

Version 1: Equality of Distributions. 3. F (x) and G(x) represent the distribution functions corresponding to the Xs and Y s, respectively.

Version 1: Equality of Distributions. 3. F (x) and G(x) represent the distribution functions corresponding to the Xs and Y s, respectively. 4 Two-Sample Methods 4.1 The (Mann-Whitney) Wilcoxon Rank Sum Test Version 1: Equality of Distributions Assumptions: Given two independent random samples X 1, X 2,..., X n and Y 1, Y 2,..., Y m : 1. The

More information

Reference: Chapter 14 of Montgomery (8e)

Reference: Chapter 14 of Montgomery (8e) Reference: Chapter 14 of Montgomery (8e) 99 Maghsoodloo The Stage Nested Designs So far emphasis has been placed on factorial experiments where all factors are crossed (i.e., it is possible to study the

More information

SAS code for Levene s Test and Weighted Least Squares DM LOG; CLEAR; OUT; CLEAR; ; ODS GRAPHICS ON; OPTIONS NODATE NONUMBER;

SAS code for Levene s Test and Weighted Least Squares DM LOG; CLEAR; OUT; CLEAR; ; ODS GRAPHICS ON; OPTIONS NODATE NONUMBER; 4.5 4.9 Weighted WeightedLeast Least Squares Squares and and Transformation Transformation Example Example 97 8 SAS code for Levene s Test and Weighted Least Squares M LOG; LEAR; OUT; LEAR; ; OS GRAPHIS

More information

Statistics for exp. medical researchers Regression and Correlation

Statistics for exp. medical researchers Regression and Correlation Faculty of Health Sciences Regression analysis Statistics for exp. medical researchers Regression and Correlation Lene Theil Skovgaard Sept. 28, 2015 Linear regression, Estimation and Testing Confidence

More information

USE OF THE SAS VARCOMP PROCEDURE TO ESTIMATE ANALYTICAL REPEATABILITY. Anna Caroli Istituto di Zootecnica Veterinaria - Milano - Italy

USE OF THE SAS VARCOMP PROCEDURE TO ESTIMATE ANALYTICAL REPEATABILITY. Anna Caroli Istituto di Zootecnica Veterinaria - Milano - Italy INTRODUCTION USE OF THE SAS VARCOMP PROCEDURE TO ESTIMATE ANALYTICAL REPEATABILITY Anna Caroli Istituto di Zootecnica Veterinaria - Milano - Italy Researchers often have to assess if an analytical method

More information

Lecture 11: Simple Linear Regression

Lecture 11: Simple Linear Regression Lecture 11: Simple Linear Regression Readings: Sections 3.1-3.3, 11.1-11.3 Apr 17, 2009 In linear regression, we examine the association between two quantitative variables. Number of beers that you drink

More information

Mixed Model: Split plot with two whole-plot factors, one split-plot factor, and CRD at the whole-plot level (e.g. fancier split-plot p.

Mixed Model: Split plot with two whole-plot factors, one split-plot factor, and CRD at the whole-plot level (e.g. fancier split-plot p. 22s:173 Combining multiple factors into a single superfactor Mixed Model: Split plot with two whole-plot factors, one split-plot factor, and CRD at the whole-plot level (e.g. fancier split-plot p.422 Oehlert)

More information

Lecture 10: Experiments with Random Effects

Lecture 10: Experiments with Random Effects Lecture 10: Experiments with Random Effects Montgomery, Chapter 13 1 Lecture 10 Page 1 Example 1 A textile company weaves a fabric on a large number of looms. It would like the looms to be homogeneous

More information

EXST 7015 Fall 2014 Lab 08: Polynomial Regression

EXST 7015 Fall 2014 Lab 08: Polynomial Regression EXST 7015 Fall 2014 Lab 08: Polynomial Regression OBJECTIVES Polynomial regression is a statistical modeling technique to fit the curvilinear data that either shows a maximum or a minimum in the curve,

More information

Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8]

Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8] Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8] The completely randomized design Treatments are randomly assigned to e.u. such

More information

Topic 25 - One-Way Random Effects Models. Outline. Random Effects vs Fixed Effects. Data for One-way Random Effects Model. One-way Random effects

Topic 25 - One-Way Random Effects Models. Outline. Random Effects vs Fixed Effects. Data for One-way Random Effects Model. One-way Random effects Topic 5 - One-Way Random Effects Models One-way Random effects Outline Model Variance component estimation - Fall 013 Confidence intervals Topic 5 Random Effects vs Fixed Effects Consider factor with numerous

More information

Topic 2. Chapter 3: Diagnostics and Remedial Measures

Topic 2. Chapter 3: Diagnostics and Remedial Measures Topic Overview This topic will cover Regression Diagnostics Remedial Measures Statistics 512: Applied Linear Models Some other Miscellaneous Topics Topic 2 Chapter 3: Diagnostics and Remedial Measures

More information

Introduction to Linear regression analysis. Part 2. Model comparisons

Introduction to Linear regression analysis. Part 2. Model comparisons Introduction to Linear regression analysis Part Model comparisons 1 ANOVA for regression Total variation in Y SS Total = Variation explained by regression with X SS Regression + Residual variation SS Residual

More information

a = 4 levels of treatment A = Poison b = 3 levels of treatment B = Pretreatment n = 4 replicates for each treatment combination

a = 4 levels of treatment A = Poison b = 3 levels of treatment B = Pretreatment n = 4 replicates for each treatment combination In Box, Hunter, and Hunter Statistics for Experimenters is a two factor example of dying times for animals, let's say cockroaches, using 4 poisons and pretreatments with n=4 values for each combination

More information

IES 612/STA 4-573/STA Winter 2008 Week 1--IES 612-STA STA doc

IES 612/STA 4-573/STA Winter 2008 Week 1--IES 612-STA STA doc IES 612/STA 4-573/STA 4-576 Winter 2008 Week 1--IES 612-STA 4-573-STA 4-576.doc Review Notes: [OL] = Ott & Longnecker Statistical Methods and Data Analysis, 5 th edition. [Handouts based on notes prepared

More information

6 Single Sample Methods for a Location Parameter

6 Single Sample Methods for a Location Parameter 6 Single Sample Methods for a Location Parameter If there are serious departures from parametric test assumptions (e.g., normality or symmetry), nonparametric tests on a measure of central tendency (usually

More information

ANOVA Longitudinal Models for the Practice Effects Data: via GLM

ANOVA Longitudinal Models for the Practice Effects Data: via GLM Psyc 943 Lecture 25 page 1 ANOVA Longitudinal Models for the Practice Effects Data: via GLM Model 1. Saturated Means Model for Session, E-only Variances Model (BP) Variances Model: NO correlation, EQUAL

More information

Chapter 11. Analysis of Variance (One-Way)

Chapter 11. Analysis of Variance (One-Way) Chapter 11 Analysis of Variance (One-Way) We now develop a statistical procedure for comparing the means of two or more groups, known as analysis of variance or ANOVA. These groups might be the result

More information

T-test: means of Spock's judge versus all other judges 1 12:10 Wednesday, January 5, judge1 N Mean Std Dev Std Err Minimum Maximum

T-test: means of Spock's judge versus all other judges 1 12:10 Wednesday, January 5, judge1 N Mean Std Dev Std Err Minimum Maximum T-test: means of Spock's judge versus all other judges 1 The TTEST Procedure Variable: pcwomen judge1 N Mean Std Dev Std Err Minimum Maximum OTHER 37 29.4919 7.4308 1.2216 16.5000 48.9000 SPOCKS 9 14.6222

More information

Repeated Measures Part 2: Cartoon data

Repeated Measures Part 2: Cartoon data Repeated Measures Part 2: Cartoon data /*********************** cartoonglm.sas ******************/ options linesize=79 noovp formdlim='_'; title 'Cartoon Data: STA442/1008 F 2005'; proc format; /* value

More information

Analysis of variance and regression. April 17, Contents Comparison of several groups One-way ANOVA. Two-way ANOVA Interaction Model checking

Analysis of variance and regression. April 17, Contents Comparison of several groups One-way ANOVA. Two-way ANOVA Interaction Model checking Analysis of variance and regression Contents Comparison of several groups One-way ANOVA April 7, 008 Two-way ANOVA Interaction Model checking ANOVA, April 008 Comparison of or more groups Julie Lyng Forman,

More information

Possibly useful formulas for this exam: b1 = Corr(X,Y) SDY / SDX. confidence interval: Estimate ± (Critical Value) (Standard Error of Estimate)

Possibly useful formulas for this exam: b1 = Corr(X,Y) SDY / SDX. confidence interval: Estimate ± (Critical Value) (Standard Error of Estimate) Statistics 5100 Exam 2 (Practice) Directions: Be sure to answer every question, and do not spend too much time on any part of any question. Be concise with all your responses. Partial SAS output and statistical

More information

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED Maribeth Johnson Medical College of Georgia Augusta, GA Overview Introduction to longitudinal data Describe the data for examples

More information

PLS205 Lab 6 February 13, Laboratory Topic 9

PLS205 Lab 6 February 13, Laboratory Topic 9 PLS205 Lab 6 February 13, 2014 Laboratory Topic 9 A word about factorials Specifying interactions among factorial effects in SAS The relationship between factors and treatment Interpreting results of an

More information

Mixed Model: Split plot with two whole-plot factors, one split-plot factor, and CRD at the whole-plot level (e.g. fancier split-plot p.

Mixed Model: Split plot with two whole-plot factors, one split-plot factor, and CRD at the whole-plot level (e.g. fancier split-plot p. STAT:5201 Applied Statistic II Mixed Model: Split plot with two whole-plot factors, one split-plot factor, and CRD at the whole-plot level (e.g. fancier split-plot p.422 OLRT) Hamster example with three

More information

A Little Stats Won t Hurt You

A Little Stats Won t Hurt You A Little Stats Won t Hurt You Nate Derby Statis Pro Data Analytics Seattle, WA, USA Edmonton SAS Users Group, 11/13/09 Nate Derby A Little Stats Won t Hurt You 1 / 71 Outline Introduction 1 Introduction

More information

PLS205 KEY Winter Homework Topic 3. The following represents one way to program SAS for this question:

PLS205 KEY Winter Homework Topic 3. The following represents one way to program SAS for this question: PL05 KEY Winter 05 Homework Topic 3 Answers to Question [30 points] The following represents one way to program A for this question: Data Weeds; Input Cover $ Biomass; Cards; 66 634 63 63 633 645 69 63

More information

PLS205!! Lab 9!! March 6, Topic 13: Covariance Analysis

PLS205!! Lab 9!! March 6, Topic 13: Covariance Analysis PLS205!! Lab 9!! March 6, 2014 Topic 13: Covariance Analysis Covariable as a tool for increasing precision Carrying out a full ANCOVA Testing ANOVA assumptions Happiness! Covariable as a Tool for Increasing

More information

Split-plot Designs. Bruce A Craig. Department of Statistics Purdue University. STAT 514 Topic 21 1

Split-plot Designs. Bruce A Craig. Department of Statistics Purdue University. STAT 514 Topic 21 1 Split-plot Designs Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 21 1 Randomization Defines the Design Want to study the effect of oven temp (3 levels) and amount of baking soda

More information

Effect of Centering and Standardization in Moderation Analysis

Effect of Centering and Standardization in Moderation Analysis Effect of Centering and Standardization in Moderation Analysis Raw Data The CORR Procedure 3 Variables: govact negemot Simple Statistics Variable N Mean Std Dev Sum Minimum Maximum Label govact 4.58699

More information

Joseph O. Marker Marker Actuarial a Services, LLC and University of Michigan CLRS 2010 Meeting. J. Marker, LSMWP, CLRS 1

Joseph O. Marker Marker Actuarial a Services, LLC and University of Michigan CLRS 2010 Meeting. J. Marker, LSMWP, CLRS 1 Joseph O. Marker Marker Actuarial a Services, LLC and University of Michigan CLRS 2010 Meeting J. Marker, LSMWP, CLRS 1 Expected vs Actual Distribution Test distributions of: Number of claims (frequency)

More information

Analysis of variance. April 16, Contents Comparison of several groups

Analysis of variance. April 16, Contents Comparison of several groups Contents Comparison of several groups Analysis of variance April 16, 2009 One-way ANOVA Two-way ANOVA Interaction Model checking Acknowledgement for use of presentation Julie Lyng Forman, Dept. of Biostatistics

More information

The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.1/27

The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.1/27 The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification Brett Presnell Dennis Boos Department of Statistics University of Florida and Department of Statistics North Carolina State

More information

Analysis of variance. April 16, 2009

Analysis of variance. April 16, 2009 Analysis of variance April 16, 2009 Contents Comparison of several groups One-way ANOVA Two-way ANOVA Interaction Model checking Acknowledgement for use of presentation Julie Lyng Forman, Dept. of Biostatistics

More information

Covariance Structure Approach to Within-Cases

Covariance Structure Approach to Within-Cases Covariance Structure Approach to Within-Cases Remember how the data file grapefruit1.data looks: Store sales1 sales2 sales3 1 62.1 61.3 60.8 2 58.2 57.9 55.1 3 51.6 49.2 46.2 4 53.7 51.5 48.3 5 61.4 58.7

More information

Stat 500 Midterm 2 12 November 2009 page 0 of 11

Stat 500 Midterm 2 12 November 2009 page 0 of 11 Stat 500 Midterm 2 12 November 2009 page 0 of 11 Please put your name on the back of your answer book. Do NOT put it on the front. Thanks. Do not start until I tell you to. The exam is closed book, closed

More information

Topic 18: Model Selection and Diagnostics

Topic 18: Model Selection and Diagnostics Topic 18: Model Selection and Diagnostics Variable Selection We want to choose a best model that is a subset of the available explanatory variables Two separate problems 1. How many explanatory variables

More information

K. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij =

K. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij = K. Model Diagnostics We ve already seen how to check model assumptions prior to fitting a one-way ANOVA. Diagnostics carried out after model fitting by using residuals are more informative for assessing

More information

Suppose we needed four batches of formaldehyde, and coulddoonly4runsperbatch. Thisisthena2 4 factorial in 2 2 blocks.

Suppose we needed four batches of formaldehyde, and coulddoonly4runsperbatch. Thisisthena2 4 factorial in 2 2 blocks. 58 2. 2 factorials in 2 blocks Suppose we needed four batches of formaldehyde, and coulddoonly4runsperbatch. Thisisthena2 4 factorial in 2 2 blocks. Some more algebra: If two effects are confounded with

More information

WITHIN-PARTICIPANT EXPERIMENTAL DESIGNS

WITHIN-PARTICIPANT EXPERIMENTAL DESIGNS 1 WITHIN-PARTICIPANT EXPERIMENTAL DESIGNS I. Single-factor designs: the model is: yij i j ij ij where: yij score for person j under treatment level i (i = 1,..., I; j = 1,..., n) overall mean βi treatment

More information

data proc sort proc corr run proc reg run proc glm run proc glm run proc glm run proc reg CONMAIN CONINT run proc reg DUMMAIN DUMINT run proc reg

data proc sort proc corr run proc reg run proc glm run proc glm run proc glm run proc reg CONMAIN CONINT run proc reg DUMMAIN DUMINT run proc reg data one; input id Y group X; I1=0;I2=0;I3=0;if group=1 then I1=1;if group=2 then I2=1;if group=3 then I3=1; IINT1=I1*X;IINT2=I2*X;IINT3=I3*X; *************************************************************************;

More information

Lecture 7 Randomized Complete Block Design (RCBD) [ST&D sections (except 9.6) and section 15.8]

Lecture 7 Randomized Complete Block Design (RCBD) [ST&D sections (except 9.6) and section 15.8] Lecture 7 Randomized Complete Block Design () [ST&D sections 9.1 9.7 (except 9.6) and section 15.8] The Completely Randomized Design () 1. It is assumed that all experimental units (EU's) are uniform..

More information

Scenarios Where Utilizing a Spline Model in Developing a Regression Model Is Appropriate

Scenarios Where Utilizing a Spline Model in Developing a Regression Model Is Appropriate Paper 1760-2014 Scenarios Where Utilizing a Spline Model in Developing a Regression Model Is Appropriate Ning Huang, University of Southern California ABSTRACT Linear regression has been a widely used

More information

Lecture 11 Multiple Linear Regression

Lecture 11 Multiple Linear Regression Lecture 11 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 11-1 Topic Overview Review: Multiple Linear Regression (MLR) Computer Science Case Study 11-2 Multiple Regression

More information

Residuals from regression on original data 1

Residuals from regression on original data 1 Residuals from regression on original data 1 Obs a b n i y 1 1 1 3 1 1 2 1 1 3 2 2 3 1 1 3 3 3 4 1 2 3 1 4 5 1 2 3 2 5 6 1 2 3 3 6 7 1 3 3 1 7 8 1 3 3 2 8 9 1 3 3 3 9 10 2 1 3 1 10 11 2 1 3 2 11 12 2 1

More information

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is

More information

Overview. Prerequisites

Overview. Prerequisites Overview Introduction Practicalities Review of basic ideas Peter Dalgaard Department of Biostatistics University of Copenhagen Structure of the course The normal distribution t tests Determining the size

More information

One-way ANOVA Model Assumptions

One-way ANOVA Model Assumptions One-way ANOVA Model Assumptions STAT:5201 Week 4: Lecture 1 1 / 31 One-way ANOVA: Model Assumptions Consider the single factor model: Y ij = µ + α }{{} i ij iid with ɛ ij N(0, σ 2 ) mean structure random

More information

SPECIAL TOPICS IN REGRESSION ANALYSIS

SPECIAL TOPICS IN REGRESSION ANALYSIS 1 SPECIAL TOPICS IN REGRESSION ANALYSIS Representing Nominal Scales in Regression Analysis There are several ways in which a set of G qualitative distinctions on some variable of interest can be represented

More information

Outline. Topic 20 - Diagnostics and Remedies. Residuals. Overview. Diagnostics Plots Residual checks Formal Tests. STAT Fall 2013

Outline. Topic 20 - Diagnostics and Remedies. Residuals. Overview. Diagnostics Plots Residual checks Formal Tests. STAT Fall 2013 Topic 20 - Diagnostics and Remedies - Fall 2013 Diagnostics Plots Residual checks Formal Tests Remedial Measures Outline Topic 20 2 General assumptions Overview Normally distributed error terms Independent

More information

Detecting and Assessing Data Outliers and Leverage Points

Detecting and Assessing Data Outliers and Leverage Points Chapter 9 Detecting and Assessing Data Outliers and Leverage Points Section 9.1 Background Background Because OLS estimators arise due to the minimization of the sum of squared errors, large residuals

More information

Math 475. Jimin Ding. August 29, Department of Mathematics Washington University in St. Louis jmding/math475/index.

Math 475. Jimin Ding. August 29, Department of Mathematics Washington University in St. Louis   jmding/math475/index. istical A istic istics : istical Department of Mathematics Washington University in St. Louis www.math.wustl.edu/ jmding/math475/index.html August 29, 2013 istical August 29, 2013 1 / 18 istical A istic

More information

Multi-Equation Structural Models: Seemingly Unrelated Regression Models

Multi-Equation Structural Models: Seemingly Unrelated Regression Models Chapter 15 Multi-Equation Structural Models: Seemingly Unrelated Regression Models Section 15.1 Seemingly Unrelated Regression Models Modeling Approaches Econometric (Structural) Models Time-Series Models

More information

Assignment 6 Answer Keys

Assignment 6 Answer Keys ssignment 6 nswer Keys Problem 1 (a) The treatment sum of squares can be calculated by SS Treatment = b a ȳi 2 Nȳ 2 i=1 = 5 (5.40 2 + 5.80 2 + 10 2 + 9.80 2 ) 20 7.75 2 = 92.95 Then the F statistic for

More information

SAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model

SAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model Topic 23 - Unequal Replication Data Model Outline - Fall 2013 Parameter Estimates Inference Topic 23 2 Example Page 954 Data for Two Factor ANOVA Y is the response variable Factor A has levels i = 1, 2,...,

More information

STAT 350: Summer Semester Midterm 1: Solutions

STAT 350: Summer Semester Midterm 1: Solutions Name: Student Number: STAT 350: Summer Semester 2008 Midterm 1: Solutions 9 June 2008 Instructor: Richard Lockhart Instructions: This is an open book test. You may use notes, text, other books and a calculator.

More information

Z score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound

Z score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound 1 EDUR 8131 Chat 3 Notes 2 Normal Distribution and Standard Scores Questions Standard Scores: Z score Z = (X M) / SD Z = deviation score divided by standard deviation Z score indicates how far a raw score

More information

Random and mixed effects models

Random and mixed effects models Random and mixed effects models Fixed effect: Three fields were available for an agricultural yield experiment. The experiment is conducted on those fields. The mean yield of this particular strain of

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

Multicollinearity Exercise

Multicollinearity Exercise Multicollinearity Exercise Use the attached SAS output to answer the questions. [OPTIONAL: Copy the SAS program below into the SAS editor window and run it.] You do not need to submit any output, so there

More information

STA 303H1F: Two-way Analysis of Variance Practice Problems

STA 303H1F: Two-way Analysis of Variance Practice Problems STA 303H1F: Two-way Analysis of Variance Practice Problems 1. In the Pygmalion example from lecture, why are the average scores of the platoon used as the response variable, rather than the scores of the

More information

Chapter 6 Multiple Regression

Chapter 6 Multiple Regression STAT 525 FALL 2018 Chapter 6 Multiple Regression Professor Min Zhang The Data and Model Still have single response variable Y Now have multiple explanatory variables Examples: Blood Pressure vs Age, Weight,

More information