Reference: Chapter 13 of Montgomery (8e)

Similar documents
Reference: Chapter 14 of Montgomery (8e)

Reference: CHAPTER 7 of Montgomery(8e)

Reference: Chapter 6 of Montgomery(8e) Maghsoodloo

TWO OR MORE RANDOM EFFECTS. The two-way complete model for two random effects:

Stat 217 Final Exam. Name: May 1, 2002

The Random Effects Model Introduction

Chapter 13 Experiments with Random Factors Solutions

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

As an example, consider the Bond Strength data in Table 2.1, atop page 26 of y1 y 1j/ n , S 1 (y1j y 1) 0.

Lecture 10: Experiments with Random Effects

Chapter 8 of Devore , H 1 :

1; (f) H 0 : = 55 db, H 1 : < 55.

. Example: For 3 factors, sse = (y ijkt. " y ijk

Sleep data, two drugs Ch13.xls

Chapter 5 Introduction to Factorial Designs Solutions

Chapter Seven: Multi-Sample Methods 1/52

44.2. Two-Way Analysis of Variance. Introduction. Prerequisites. Learning Outcomes

Confidence Intervals, Testing and ANOVA Summary

Multiple comparisons - subsequent inferences for two-way ANOVA

Solution to Final Exam

Analysis of Variance

Analysis of Variance and Design of Experiments-I

Chapter 12 Comparing Two or More Means

Lecture 14 Topic 10: ANOVA models for random and mixed effects. Fixed and Random Models in One-way Classification Experiments

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

Lec 5: Factorial Experiment

MATH Notebook 3 Spring 2018

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

CHAPTER 10 ONE-WAY ANALYSIS OF VARIANCE. It would be very unusual for all the research one might conduct to be restricted to

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Design and Analysis of Experiments

Chapter 13: Analysis of variance for two-way classifications

Fractional Factorial Designs

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

Unit 7: Random Effects, Subsampling, Nested and Crossed Factor Designs

Open book and notes. 120 minutes. Covers Chapters 8 through 14 of Montgomery and Runger (fourth edition).

Factorial designs. Experiments

Chapter 7 Comparison of two independent samples

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

Design & Analysis of Experiments 7E 2009 Montgomery

Calculating Fobt for all possible combinations of variances for each sample Calculating the probability of (F) for each different value of Fobt

Multiple Regression Analysis

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

ST4241 Design and Analysis of Clinical Trials Lecture 4: 2 2 factorial experiments, a special cases of parallel groups study

Lectures 5 & 6: Hypothesis Testing

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs)

Design of Experiments. Factorial experiments require a lot of resources

Lecture 11: Nested and Split-Plot Designs

Experimental Design. IUFRO-SPDC Snowbird, UT September 29 Oct 3, 2014 Drs. Rolfe Leary and John A. Kershaw, Jr.

MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS FALL 1999

Inference for the Regression Coefficient

3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is

Unit 8: A Mixed Two-Factor Design

VARIANCE COMPONENT ANALYSIS

Contents. TAMS38 - Lecture 6 Factorial design, Latin Square Design. Lecturer: Zhenxia Liu. Factorial design 3. Complete three factor design 4

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

Topic 22 Analysis of Variance

Chapter 12. Analysis of variance

LECTURE 5 HYPOTHESIS TESTING

Multiple Comparisons. The Interaction Effects of more than two factors in an analysis of variance experiment. Submitted by: Anna Pashley

Simple Linear Regression. (Chs 12.1, 12.2, 12.4, 12.5)

1 Use of indicator random variables. (Chapter 8)

Psychology 282 Lecture #4 Outline Inferences in SLR

Sociology 6Z03 Review II

STAT Final Practice Problems

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

PSYC 331 STATISTICS FOR PSYCHOLOGISTS

Design of Engineering Experiments Part 2 Basic Statistical Concepts Simple comparative experiments

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Statistics 251: Statistical Methods

20.0 Experimental Design

Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs

Analysis of Variance (ANOVA)

Solutions to Final STAT 421, Fall 2008

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

2.830 Homework #6. April 2, 2009

NESTED FACTORS (Chapter 18)

Simple Linear Regression. Material from Devore s book (Ed 8), and Cengagebrain.com

Lecture 7: Hypothesis Testing and ANOVA

Random and Mixed Effects Models - Part III

Factorial Treatment Structure: Part I. Lukas Meier, Seminar für Statistik

ECO375 Tutorial 4 Introduction to Statistical Inference

Nesting and Mixed Effects: Part I. Lukas Meier, Seminar für Statistik

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.

Unit 8: A Mixed Two-Factor Design

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2)

Statistical Hypothesis Testing

Simple and Multiple Linear Regression

Reference: Chapter 8 of Montgomery (8e)

TAMS38 Experimental Design and Biostatistics, 4 p / 6 hp Examination on 19 April 2017, 8 12

DATA IN SERIES AND TIME I. Several different techniques depending on data and what one wants to do

Review of Statistics 101

CHAPTER 6 MACHINABILITY MODELS WITH THREE INDEPENDENT VARIABLES

Statistics Introductory Correlation

BASIC NOTIONS. x + y = 1 3, 3x 5y + z = A + 3B,C + 2D, DC are not defined. A + C =

RCB - Example. STA305 week 10 1

Week 14 Comparing k(> 2) Populations

Transcription:

Reference: Chapter 1 of Montgomery (8e) Maghsoodloo 89 Factorial Experiments with Random Factors So far emphasis has been placed on factorial experiments where all factors are at a, b, c,... fixed levels (i.e., their levels were not selected at random). In such experiments, conclusions must be restricted to the region of factor space spanned by the ranges of the factors. Then, regression may be used to predict the response, y, within the region of factor spaces. In general, we may claim that almost all quantitative factors (such as temperature, pressure, feed rate, etc.) considered in the design of an experiment are fixed while majority of qualitative factors are random. By a random factor we mean that its levels in the design phase of the experiment are actually selected at random from a (finite or infinite) population of levels. I would venture to say that 90% of all random factors in all designs of experiments in the universe are qualitative factors! Examples of qualitative factors are operators, parts, days, material type, batches of raw material, nozzle type, tool shape or geometry, etc., et cetera. When all factors in an experiment are fixed, the corresponding model is called fixed effects; when all factors in an experiment are random, the corresponding model is called randomeffects or the components of variance model; when some factors are fixed and the rest are at random levels, then the corresponding statistical model is called mixed effects. The SS s computations for the ANVA remain exactly the same as before for all three models, but obtaining the F statistic, the conclusions, and ST ANVA procedures are different for the three models. BAD NEWS So far everything has been gravy because we have dealt only with fixed level factors so that the denominators of all F statistics, i.e., the denominators of F 0 values, have always been the MS(Error) or MS(Residual Error). Unfortunately, if at least one factor in experiments has random levels, giving rise to a random or mixed effects model, no longer the denominator of all F 0 values will be the MS(Error), but rather, each denominator has to be determined separately based on its expected mean square (EMS). For example, if we have a 4 89

factorial design with A and B at and 4 fixed levels, respectively, factor C at random levels and n = random observations per cell, I would not know (with my 40 years of experience in the DE) what the denominator of each F statistic will be until I develop the EMS column for this 4 experiment. In order for us to learn how to do this systematically, we only need to learn an extra concept in designing experiments, namely nesting (which will be covered in Chapter 14). Factor B is said to be nested under factor A iff the levels of B cannot be crossed with those of A. As examples, see Figures 14.1 & 14. on page 605 of Montgomery s 8 th edition, where Batches are nested within (or under) Suppliers, and in Figure 14.5 on page 614 of Montgomery(8e), where Ingots are nested under Heats and Heats are nested within Alloy formulation (at levels). Although we have not yet covered nested factorials in this course, we need this concept because repeat observations (or replications within the same FLC) are indeed nested within each cell (or FLC), i.e., for the above 4 design with n = observations per cell, the ijkr term should actually be written as r(ijk) or (ijk)r because the index r is nested within the (ijk) cell. In obtaining the EMS (Expected Mean Squares) column, every index inside parentheses ( ) will be called dead and those outside ( ) will be called live. We now state the rules for the Bennett & Franklin (I am not certain of this reference) algorithm to obtain the EMS column for all the effects in a mixed effects factorial experiment. Rule 1. In each row, write 1 if the dead subscripts in the row matches column subscripts. Rule. For the case of restricted model, in each row, write 0 for fixed columns and write 1 for random columns wherever live subscripts match. For the unrestricted model, write 1 wherever live subscripts match. Rule. In the remaining empty row positions, write the number of levels shown in the column headings. We now use the above rules to obtain the following Table 0 needed to derive the EMS (expected mean squares) column for the CR (completely randomized) 4 factorial with repeat observations per cell and then will describe the last rule after constructing the Table. 90 In Table 0, A = i1 i A /, B = 4 B j /, and AB = 4 (A B) ij /6, where (A B) ij= ij j1 i1j1 90

Table 0. (Rand = R = Random; F = Fixed; the case of restricted model) 91 Model Fixed F Rand R Terms 4 n = i j k r EMS df A i 0 4 B j 0 (AB) ij 0 0 16 A + 8 AC + 1 B + 6 BC + 4 AB + ABC + 6 C k 4 1 (AC) ik 0 4 1 (BC) jk 0 1 (ABC) ijk 0 0 1 4 C + 8 AC + 6 BC + ABC + (ijk)r 1 1 1 1 1 6 4 i..j + ; further, the E(MS A ) was obtained by 1 st covering the entire column i, second using only the rows whose effects in the 1 st column has the index i and then multiplying the visible numbers by the corresponding variance component. The relevant rows for E(MS A ) are A i, (AB) ij, (AC) ik, (ABC) ijk and (ijk)r because these rows all contain the index i. Table 0 clearly shows that only the effects C, AC, BC, and ABC must be tested against MS(Error), i.e., the divisor of their F statistics must be MS(Error). However, Factor A must be tested against MS(AC); Factor B must be tested against MS(BC), and the statistic for testing the null hypothesis H 0 : (AB) ij = 0 is F 0 = MS(AB)/MS(ABC). Unfortunately, the above design is inadequate because the denominator of the F statistics for the fixed factors A and B have only and df, respectively. Generally, the denominator of the F test should have at least 5 df (preferably 6 df ) for the F test to be fairly stable and to possess sufficient power to reject H 0 if H 0 is false. Therefore, we have to redesign the above experiment by using at least 4 levels of the random factor C. 91

Exercise 6. Work problem 5.19 on pages 7 8 of Montgomery(8e) but omit the last (i.e., the rd ) observation in every cell so that you have a factorial with n = observations per cell. Further, assume that only A (Cycle Time indexed by i) and B (Temperature indexed by j) are fixed but the three operators were randomly selected from all the operators in the plant. Code the data by subtracting and comment on the adequacy of the design. 9 The Factor Factorial Design with Both Factors at Random Levels Consider the experiment in Example 1.1 of Montgomery on pp. 575 578, where factor (art Number) is at 0 random levels and factor (perators) is at random levels with n = observations per FLC. Therefore, the statistical model y ijk = + i + j + () ij + ijk, i = 1,,..., 0; j = 1,,, and k = 1, is called the components of variance (or random effects) model because both and were selected at random. The assumptions of this model are as follows: (1) i, j, () ij and ijk are NID distributed with variances V( i ) =, V( j ) =, V[() ij ] =, and V( ijk ) =. () Because of the independence assumption of the rv s i, j, () ij and ijk, the total variance is V(y ijk ) = + + model y ijk = + i + j + () ij + ijk, where +. Thus, there are 4 components of variance in the measures gauge repeatability, while ( + ) measures gauge reproducibility, i.e., V(Gauge) = ( + ) +. () There are three null hypotheses to be tested from the ANVA, which are H 0 : = 0, and H 0 : = = 0 VS H 1 : Eff > 0, where Eff =,, or. = 0, H 0 : Since there are only random factors involved in the experiment, the denominator of all F statistics will no longer be MS(Error) and have to be determined through the algorithm (Bennett and Franklin 1954) described earlier under Bad News above. Applying that algorithm to the experiment of Montgomery s Example 1.1 leads to the EMS column given in Table 1. Table 1 clearly shows that both random factors and must be tested against MS(), but the denominator of the F statistic for the effect must be MS(Error). The ANVA is provided on 9

Table 1. (The EMS for the Example 1.1 on pp. 575 8 of Montgomery(8e); R Random) 9 Source R R R 0 i j k df EMS = Expected Mean Square i 1 19 6 + + j 0 1 40 + + () ij 1 1 8 + (ij)k 1 1 1 60 = page 577 of Montgomery(8e) in Table 1., which shows that only art to art variation is highly significant; there is a slight variation due to operators ( value = 0.17), but the null hypothesis H 0 : = 0 VS the alternative H 1 : > 0 cannot at all be rejected ( value = 0.8614). As post ANVA, we first obtain point moment estimates of variance components using the corresponding EMS from the above Table 1. The EMS column clearly shows that E(MS ) = 6 + +. Note that every term in this equation is a parameter and has to be estimated from the ANVA Table. In order to estimate, we replace every term in this last parametric equation with their corresponding moment estimator, i.e., MS 6 MS() 40 ˆ ˆ = + MS() MS MS( ) = 6 6.908 0.71184 6 ˆ + = 10.80; similarly, MS() ˆ = (1.08 0.71184)/40 = 0.01491. However, an attempt to estimate in the same manner will lead to a negative value of ˆ, which is contradictory to the fact that variances can never be non positive. Hence, we have to assume ˆ that = 0. As Montgomery(8e) points out on his page 575, that this is a gauge R&R study (i.e., gauge repeatability and reproducibility study) where repeatability is estimated by MS(Error) = 0.99166 and reproducibility is measured by +, which in this case is 9

estimated by ˆ + ˆ = 0.01491. Therefore, ˆ gauge = 0.99166 + 0.01491 = 1.006570. Since V(y) = V(arts) + V(Gauge), then 91.08% (= 10.80/11.8657) of the total variability is due to arts, and hence, the gauge is fairly capable of measuring different parts dimensions! The reader should study the ANVA of the reduced model on page 578 of Montgomery(8e). Exercise 7. Work roblem 1.1 on page 601 of Montgomery(8e). You must code the data by subtracting 45 from all N = 10 = 60 observations, which were CR. Compute the % variation due to the measuring instrument. 94 The Two Factor Mixed Effects Model Suppose in the Example 1.1 of Montgomery there are only three operators in the plant that make use of the single gauge to make measurements of the parts, i.e., the entire population levels of factor were used in the experiment so that now (= perator) is a factor that is fixed at levels, i.e., our conclusions will pertain only to the three operators used in the experiment. Montgomery presents two mixed models but we will discuss and require the knowledge of the most commonly encountered one in these notes. If operators are assumed fixed, then the restrictions that will be placed in the model are ( )ij = 0; further, i s are rvs distributed as N(0, j 1 j = 0, j 1 ) and the rvs () ij are assumed N(0, ) and stochastically independent from i s. If the restriction ( ) ij= 0 is j1 removed, then the model is called the unrestricted mixed model, with different divisors for the F statistics. To obtain the EMS column that paves the way to the proper F tests, we modify Table 1 above as in Table. Table atop the next page, where Φ = j /, clearly j=1 shows that now only the fixed factor () must be tested against MS() but the two random effects, and, must be tested against the MS(Error) in the ANVA table. Further, V(y) = 94

Table. ( = ; F Fixed levels factor; R = Random levels factor) 95 Source R F R df 0 EMS i j k i 1 19 6 + j 0 0 40Φ + + () ij 1 0 8 + (ij)k 1 1 1 60 + +, and therefore, reproducibility is measured only by. Since perators are fixed, then operator effects can be estimated as Ô j = y.j. y... with se( y.j. ) = MS / ( 0) = MS / 40 because F()= 0 MS /MS. See the Example 1. on pp. 58 58 of Montgomery(8e). Mixed Model Factorials With at Least Random Factors Consider a mixed factorial experiment with or more factors where at least of the factors are at random levels. Factorial designs where there is only one random factor pose no problem from the standpoint of forming a direct F statistic to test H 0 : A i = 0 and H 0 : B j = 0, et cetera. However, in such a case the designer has to be careful to randomly select sufficient levels of the random factor so that the denominator of all F statistics have at least, say, 5 df (I would prefer 6 df if at all possible). Now, consider a factor CR experiment where factor A is at fixed levels, factor B has random levels, factor C is at c random levels, the value of c to be determined after the EMS column has been developed, and n = observations at each FLC. Table shows the EMS (Expected Mean Squares) column for this balanced c factorial with n = observations per cell. Table, atop the next page, clearly shows that there are direct F tests for every effect except for factor A, whose levels are fixed. Table clearly shows that the random factors B and C must be tested against MS(BC); the AB and AC effects must be tested against MS(ABC), and the null hypotheses H 0 : BC BC = 0 VS H 1 : > 0 and H 0 : ABC = 0 versus 95

Table. (The EMS for a factor CR experiment with random factors B & C) 96 Model Terms F i R j R c k R n= r EMS A i 0 c 6c A + c AB + 6 AC + + (ABC) + B j 1 c 4c B + 4 BC + (AB) ij 0 1 c c AB + ABC + df 1 C k 1 1 C + 4 BC + c 1 (AC) ik 0 1 6 AC + ABC + c 1 (BC) jk 1 1 4 BC + (ABC) ijk 0 1 1 ABC + (ijk)r 1 1 1 1 (c 1) (c 1) 6c H 1 : ABC > 0 must be tested against MS(Error). Since MS(BC) appears in the denominator of two of the F tests, then c > so that (c 1) 6, or c 4. Table also shows that, unfortunately, there is no direct F statistic for testing H 0 : A i = 0, where A is a fixed factor. However, the Table shows that E(MS AB + MS AC MS ABC ) = c AB + 6 AC + ABC +. This implies that if we let MS= MS A and MS = MS AB + MS AC MS ABC, then the statistic F 0 = MS/MS has an approximate (or pseudo) F distribution with 1 df for the numerator and q df for the denominator, where q is given in Eq. (1.1) on page 59 of Montgomery s 8 th edition. The expression for q, assuming MS = MS A and MS = MS AB + MS AC MS ABC and c = 4, is given by q = (MS ). MS / MS / ( MS ) / 6 AB AC ABC Note that Montgomery defines MS differently near the middle of his page 59, which also provides an appropriate F 0 = MS/ MS for testing H 0 : A i = 0. I would use Montgomery s expressions for MS and MS on p. 59 only in if MS ABC MS AB + MS AC. therwise, my definition provides a more direct pseudo F test for H 0 : A i = 0. Exercise 8. Study the Example 1.6 on pages 59 597 of Montgomery(8e) but 96

conduct the pseudo F test for the fixed factor A as I have outlined above. 97 Approximate Confidence Intervals for Variance Components Reference to my Chapter Notes on pages 5 & 6 indicates that for a normal parent universe, the statistic (n 1)S / has a distribution with = n 1 df. Using this and the fact that E(S ) = for any infinite universe, it was illustrated on pp. 5 & 6 of my notes how to construct a 95% CI for the parameter. For the Example 1.1 of Montgomery, this leads to 0.7141 1.46980, where 0.975,60 = 40.481748 and = 8.97675. 0.05,60 Unfortunately, as stated by Montgomery(8e) p. 598, in experiments involving several fixed and random factors, E(MS) will generally not equal to one of the variance components, and hence, it will not be possible to use the distribution directly to obtain a CI for, say, of the random factors in the experiment with general, not equal to B, where B is one B = b 1. What we mean is that E(MS B ) will, in B in a mixed model factorial experiment. As an example, for the R&R study of Example 1.1 on pages 575 578 of Montgomery(8e), if we let B =, then E(MS ) = 6 + +, which is quite different from. From the above discussion, it is clear that we have to develop a linear combination of MS s that is a point unbiased estimate of B, where B is a random factor. For the factor (= arts) of Montgomery s Example 1.1 because E(MS ) = 6 + E(MS ), then E(MS )E(MS ) = 6 E[(MS MS )/6] =. Thus, let this point estimator be denoted by MS MS( ) 6 ˆ = = (1/6)MS (1/6)MS(), which is a linear combination. Note that E( ˆ ) = ; further, 19MS / and 8MS()/ σ do not seem to have distributions with df equal to 19 and 8, respectively, because E(MS ) and E[MS()] σ. This last statement is not consistent with that of Montgomery s near the bottom of page 598 right after his Eq. (1.4). Consequently, Satterthwaite s (1946) pseudo F procedure can be used to 97

deduce that (r ˆ )/ has an approximate distribution with r df, whose value is given in Eq. (1.5) at the bottom of page 598 and r is generally close to df of the random factor, i.e., in the Example 1.1 the value of r should be close to 19 df (because there were 0 randomly selected parts). component We now use the above developments to obtain an approximate 95% CI for the variance of Example 1.1 on pages 575 578 of Montgomery(8e). To this end, we must first compute the df of the desired CI (recall that, from the Eq. (1.5) on p. 598, to be used in obtaining the r ˆ = 10.798, MS = 6.908, and MS = 0.71184.) 98 r = m i1 ( ˆ ) i MS / (k f ) (10.80) r = [(6.908) / (619)] [(0.71184) / (68)] i (Eq. 1.5 p. 598 of Montgomery 8e) (MS = MS ) = 18.56771 MS /19 MS / 8 where for the Example 1.1 of Montgomery and its variance component, m =, k = 6, f 1 = 19, and f = 8. To obtain a 95% CI for, we use the fact that (r ˆ )/ has an approximate distribution with r = 18.56771 degrees of freedom. Therefore, the r(8.61 r ˆ /.80) 0.95. This last confidence probability statement leads to the desired approximate 95% CI : 5.910.160. Exercise 9. Work problems 1.7, and 1.8 on pp. 60 60 of Montgomery(8e). 98