IX. Complete Block Designs (CBD s)

Size: px
Start display at page:

Download "IX. Complete Block Designs (CBD s)"

Transcription

1 IX. Complete Block Designs (CBD s) A.Background Noise Factors nuisance factors whose values can be controlled within the context of the experiment but not outside the context of the experiment Covariates - nuisance factors whose values can not be controlled but can be measured within the context of (prior to or during) the experiment) Block - nuisance factors whose values can not be measured or controlled within the context of the experiment (and can not be conveniently measured) Blocks: - are usually groups of experimental subjects who are similar in a manner that may effect the response - are b groups of experimental units that are (relatively) homogenous within and heterogeneous between wrt the blocking factor - are often used to represent a poorly or inexactly measured covariate - are particularly popular in agricultural, scientific, and manufacturing problems (where factors such as plot of land, location, and production lot can not be measured but must be considered) - do not have to be of equal size (but are easier to work with when they have common size k) and will be represented with the subscript h

2 B. Types of Block Designs - Complete Block Design random allocation of the same number of experimental units per block to each treatment (this requires k = c, c R) so that n hi = k/ = s - Randomized Complete Block Design random allocation of one experimental unit per block to each treatment (this requires k = ) - General Complete Block Design random allocation of the same number of experimental units per block to each treatment (this requires k = c, c R, c > 1) - Confounding result of a design for which all experimental units from each block are assigned to a separate treatment - Incomplete Block Design random allocation of different numbers of experimental units per block to each treatment (this requires k c, c R) - Partial Block Design random allocation where at least one block does not contribute any experimental units to at least one treatment (this requires k < ) the term Incomplete Block design is often reserved for this situation

3 C. The Randomized Complete Block Design We have treatments (which may be factorial treatment combinations) and b blocks of k = (relatively) homogenous within and heterogeneous between groups (so s = 1 and r i = b) Now let A be a factor with levels i = 1,,a B be a blocking factor with levels h = 1,,b Y hi = the response obtained from the h th block randomly assigned of the i th treatment hi = the error variable (or variation attributable to randomness) The basic model (often called the block-treatment model) is now Y = µ + θ + τ + hi h i hi We again need to make some assumptions about the error terms hi in order to statistically analyze the results of a RCBD experiment: - hi ~ N(0, σ ) - hi s are mutually independent, h = 1,,b, i = 1,, We can test these assumptions using techniques discussed earlier.

4 To understand the analysis of variance of an RCBD, note that the model looks similar to a two-way main effects model. However, the blocks are not random, but are intentional and deliberate. This generates controversy about testing hypotheses about statistically testing equality of block effects. This leaves one standard hypothesis for an RCBD: H : τ = τ = L = τ 0 1 i.e., there is no treatment effect. Note that this can be partitioned into hypothesis tests about various factors in more complex designs. To test this hypothesis, we need to develop formulas for the various sums of squares. The similarity to the two-way main effects models extends to sums of squares formulae: - the sum of squares for treatments is ( ) y i y sst = b y i - y = - b b i=1 i=1 - the sums of squares for blocks is b b ( ) yh y ssθ = yh - y = - b h=1 h=1

5 - the total sum of squares is y sstot = y - y = y - b b ( hi ) hi h=1 i=1 i=1 - these sums of squares can be used to find the sum of squares for error: sse = sstot - ssθ - sst Now if H 0 is true, then sst ~ χ σ -1 sse and since ~ χ b--b+1 σ and sst and sse are independent, the ratio of these two Chi-Square statistics (divided by their respective degrees of freedom) yields ssa sst ( -1) σ ( -1) mst = = ~ F sse sse mse ( b -b- +1) σ ( b -b- +1) which provides us with the means for testing H : τ = τ = L = τ 0 1-1,b-b-+1

6 We will reject when H : τ = τ = L = τ 0 1 mst >F-1,b--b+1,α mse The results of a RCBD Analysis of Variance can be summarized and displayed in an ANOVA Table: Source of Variation Degrees of Freedom (df) Sum of Squares (SS) Mean Square (MS) F-Ratio Block b-1 Treatment -1 Error b b +1 b ( h ) h=1 ( ) i i=1 sst = b y - y ssθ = y - y sse = sstot - ssθ -sst ssb msb = b-1 sst mst = -1 mst F= mse sse mse = b -b- +1 Total n - 1 ( b hi ) sstot = y - y h=1 i=1

7 Example Again consider the original One-Factor CRD Automobile Battery Life problem with the addition of information regarding the brand of battery (TryHard, NeverStart, WorryLast, or Fail-Safe): Automobile Battery Life (in Months) Brand New England Mid West South West TryHard NeverStart WorryLast Fail-Safe Region is the treatment (i = 1 for New England, i = for Mid West, and i = 3 for South West) and Brand is the block (h = 1 for TryHard, h = for NeverStart, h = 3 for WorryLast, and h = 4 for Fail-Safe). Use these data to perform a RCBD ANOVA. Here s the interaction plot for this data: Life (in Months) Interaction Plot Sub-Compact TryHard Compact NeverStart Midsized WorryLast Full Fail-Safe Sized 0 New TryHard England NeverStart Mid West WorryLast South West Brand Region There is very little evidence of interaction between regions and brand of battery (the relationships between regions and life of battery are similar for the different brands).

8 Or this interaction plot: Life (in Months) Interaction Plot New TryHard England Mid NeverStart West South WorryLast West 0 TryHard Sub- Compact NeverStart Compact Midsized WorryLast Fail-Safe Full Sized Brand Again, there is only moderate evidence of interaction between region and brand of battery. This could be used to make us comfortable in not interacting the treatment and block. So we proceed with a test of the main effect associated with region: H0 : τ NewEngland = τ MidWest = τsouthwest The appropriate sum of squares is ( ) y i y sst = b y i - y = - = = 88 b b i=1 i=1 and the corresponding mean square is sst 88 mst = = =

9 To calculate the F-statistic necessary to test the region effect hypothesis, we still need the sse: sse = sstot - ssθ - sst for which we need the sstot and ssθ to calculate. From our previous work we already know that sstot = 40.0, so if we find ssθ we can easily calculate sse. The ssθ for the RCBD main effects model is y y ssθ = - b = b h h=1 ( ) + ( ) + ( ) ( ) - 1 = = 1.67 so we have that sse = sstot - ssθ - sst = = and the corresponding mean squares is sse mse = = = b -b

10 which results in an F-statistic for the hypothesis test of a region main effect of mst 144 F = = = mse At 1 = and b b = 6 degrees of freedom, this test statistic has a p-value of , so we have fair evidence that the hypothesis of no treatment (region) effect is incorrect. The results of our RCBD Analysis of Variance can be summarized and displayed in an ANOVA Table: Source of Variation Degrees of Freedom (df) Sum of Squares (SS) Mean Square (MS) F-Ratio Block 3 ssθ = 1.67 msθ = 4.3 Treatment sst = 88.0 mst = F = 8.5 Error 6 sse = mse = Total 11 sstot = 40.0

11 SAS code for an RCBD Model: DATA battery; INPUT region $ autotype $ trtmnt ion life brand $; num=_n_; LABEL region= Region of Country' autotype='type of Auto (Mid- vs. Full-sized)' life = 'Observed Life of Tire in Months' ion = 'Received Ion Treament (yes=1, no=0)' numdays = 'Number of Days with High Temperature Below Freezing' brand= Brand of Battery' num = 'Order of Data'; CARDS; NewEngla Mid TryHard SouthWes Full Fail-Safe ; PROC GLM DATA=battery; TITLE4 'Using PROC GLM for an RCBD Analysis'; CLASS brand region; MODEL life=brand region; RUN; SAS output for an RCBD Model: Analysis of Variance AUTOMOBILE BATTERY LIFE DATA ANALYSIS FOR QA 605 WINTER QUARTER 006 Using PROC GLM for an RCBD Analysis Dependent Variable: life The GLM Procedure Observed Life of Battery in Months Sum of Source DF Squares Mean Square F Value Pr > F Model Error Corrected Total R-Square Coeff Var Root MSE life Mean Source DF Type I SS Mean Square F Value Pr > F brand region Source DF Type III SS Mean Square F Value Pr > F brand region

12 - Multiple comparisons and the RCBD the blocktreatment model is similar to a single replicate two-way main effects model, so the least squares estimator for each µ + θ h + τ i (h = 1,,b; i = 1,,) is similar to the least squares estimator for each µ + α i + β j (i = 1,,a; j = 1,,b) so analogously any contrast in the treatment effects c τ, c = 0 i i i=1 i=1 is estimable in the randomized complete block design and has least-squares estimator i c ˆτ = c Y i i i i i=1 i=1 This contrast also has variance ˆ ci Var ciτ i = Var ciy i = σ i=1 i=1 i=1 b which leads to a 100(1 α)% confidence interval for a treatment contrast of c ˆ ± ˆ iτ i ciτ i t α Var b-b-+1, cτ i i i=1 i=1 i=1 and multiple comparisons of the form ciτ i cy i i± w mse c i b i=1 i=1 i=1

13 For various multiple comparison procedures, we have the following critical coefficients: w B = t b-b- +1, α m ( ) w S = -1 F q w T =,b-b-+1,α -1,b-b- +1,α (where q, b -b + 1, α is taken from Table A.8 in D&V) w = t D (where ( 0.5) -1,b-b- +1,α ( 0.5) t -1,b-b-+1,α is taken from Table A.10 in D&V) ( 0.5) H D1-1,b-b- +1,α ( 0.5) t -1,b-b- +1,α w = w = t (where is taken from Table A.9 in D&V) D.The General Complete Block Design the Block-Treatment Model) We have treatments (which may be factorial treatment combinations) and b blocks of k > (relatively) homogenous within and heterogeneous between groups (so s > 1 and r > b) Now let A be a factor with levels i = 1,,a B be a blocking factor with levels h = 1,,b s be the number of replicates Y hit = the response obtained observation from the h th block randomly assigned of the i th treatment hit = the error variable (or variation attributable to randomness) The basic model is now Y = µ + θ + τ + hit h i hit

14 We again need to make some assumptions about the error terms hit in order to statistically analyze the results of a RCBD experiment: - hit ~ N(0, σ ) - hit s are mutually independent, t=1,,s, h = 1,,b, i = 1,, We can test these assumptions using techniques discussed earlier. This model is similar to the previous treatment-block model except that we have more than one replicate. So we have a single standard hypothesis: H : τ = τ = L = τ 0 1 i.e., there is no treatment effect. Note that this can be partitioned into hypothesis tests about various factors in more complex designs. To test this hypothesis, we modify our previous formulas for the various sums of squares to account for multiple replicates. - the sum of squares for treatments is y sst = bs y - y = - bs ( ) i i i=1 i=1 - the sums of squares for blocks is y ssθ = s y - y = - s y bs b b ( ) h h h=1 h=1 y bs

15 - the total sum of squares is y sstot = y - y = y - bs b s b s ( hit ) hit h=1 i=1 t=1 h=1 i=1 t=1 - these sums of squares can be used to find the sum of squares for error: sse = sstot - ssθ - sst Now if H 0 is true, then sst ~ χ σ -1 sse and since ~ χ bs-b-+1 σ and sst and sse are independent, the ratio of these two Chi-Square statistics (divided by their respective degrees of freedom) yields sst sst ( -1) σ ( -1) mst = = ~ F sse sse mse ( bs-b- +1) σ ( bs-b- +1) which provides us with the means for testing H : τ = τ = L = τ 0 1-1,bs-b-+1

16 We will reject when H : τ = τ = L = τ 0 1 mst >F-1,b--b+1,α mse The results of an Analysis of Variance for a RCBD with replicates can be summarized and displayed in an ANOVA Table: Source of Variation Degrees of Freedom (df) Sum of Squares (SS) Mean Square (MS) F-Ratio Block b-1 Treatment -1 Error bs b +1 b ( h ) h=1 ( ) i i=1 sst = bs y - y ssθ = s y - y sse = sstot - ssθ -sst ssb msb = b-1 sst mst = -1 mst F= mse sse mse = bs-b- +1 Total n - 1 b s ( hit ) sstot = y - y h=1 i=1 t=1

17 Example Now consider the One-Factor CRD Automobile Battery Life problem where we wish to consider type of battery (sealed or non-sealed) as a blocking factor: Automobile Battery Life (in Months) Sealed? New England Mid West South West No No Yes Yes Region is the treatment (i = 1 for New England, i = for Mid West, and i = 3 for South West) and Type of Battery is the block (j = 1 for Sealed and j = for Nonsealed). Use these data to perform a General RCBD ANOVA. Here s the interaction plot for this data: Life of Battery (Months) 60 Interaction Plot Sealed Nonsealed 10 0 New TryHard England NeverStart Mid West WorryLast South West Region Brand There is very little evidence of interaction between region and type of battery (the relationships between regions and life of battery are similar for the different types of battery).

18 Or this interaction plot: Life of Battery (Months) 60 Interaction Plot TryHard New England NeverStart Mid West WorryLast South West 10 0 Sealed Nonsealed Type of Battery Again, there is little evidence of interaction between region and type of battery. This could be used to make us comfortable in not interacting the treatment and block. So we proceed with a test the main effect associated with region: H0 : τ NewEngland = τ MidWest = τsouthwest The appropriate sum of squares is ( ) y i y sst = bs y i - y = - = = 88 bs bs i=1 i=1 and the corresponding mean square is sst 88 mst = = =

19 To calculate the F-statistic necessary to test the region effect hypothesis, we still need the sse: sse = sstot - ssθ - sst for which we need the sstot and ssθ to calculate. From our previous work we already know that sstot = 40.0, so if we find ssθ we can easily calculate sse. The ssθ for the RCBD main effects model is y y ssθ = - = = 8.33 bs b h h=1 s so we have that sse = sstot - ssθ - sst = = and the corresponding mean squares is sse mse = = = bs-b which results in an F-statistic for the hypothesis test of a region main effect of mst 144 F = = = mse 13.1 At 1 = and bs b = 8 degrees of freedom, this test statistic has a p-value of , so we have strong evidence that the hypothesis of no treatment (region) effect is incorrect.

20 The results of our General RCBD Analysis of Variance can be summarized and displayed in an ANOVA Table: Source of Variation Degrees of Freedom (df) Sum of Squares (SS) Mean Square (MS) F-Ratio Block 1 ssθ = 8.33 msθ = 8.33 Treatment sst = 88.0 mst = F = Error 8 sse = mse = 13.1 Total 11 sstot = 40.0 SAS code for an RCBD Model: DATA battery; INPUT region $ autotype $ trtmnt ion life numdays brand $; num=n_; LABEL region= Region of Country' autotype='type of Auto (Mid- vs. Full-sized)' life = 'Observed Life of Battery in Months' ion = 'Received Ion Treament (yes=1, no=0)' numdays = 'Number of Days with High Temperature Below Freezing' brand= Brand of Battery' num = 'Order of Data'; CARDS; NewEngla Mid TryHard SouthWes Full Fail-Safe ; PROC GLM DATA=battery; TITLE4 'Using PROC GLM for a General RCBD Analysis with Replicates'; CLASS type region; MODEL life=type region; RUN;

21 SAS output for an RCBD Model: Analysis of Variance AUTOMOBILE BATTERY LIFE DATA ANALYSIS FOR QA 605 WINTER QUARTER 006 Using PROC GLM for a General RCBD Analysis with Replicates Dependent Variable: life The GLM Procedure Observed Life of Battery in Months Sum of Source DF Squares Mean Square F Value Pr > F Model Error Corrected Total R-Square Coeff Var Root MSE life Mean Source DF Type I SS Mean Square F Value Pr > F type region Source DF Type III SS Mean Square F Value Pr > F type region E. The General Complete Block Design the Block-Treatment Interaction Model) We have treatments (which may be factorial treatment combinations) and b blocks of k > (relatively) homogenous within and heterogeneous between groups (so s > 1 and r > b) Now let A be a factor with levels i = 1,,a B be a blocking factor with levels h = 1,,b s be the number of replicates Y hit = the response obtained observation from the h th block randomly assigned of the i th treatment hit = the error variable (or variation attributable to randomness) The basic model is now ( ) Y = µ + θ + τ + θτ + hit h i hi hit

22 We again need to make some assumptions about the error terms hit in order to statistically analyze the results of a RCBD experiment: - hit ~ N(0, σ ) - hit s are mutually independent, t=1,,s, h = 1,,b, i = 1,, We can test these assumptions using techniques discussed earlier. This model is similar to the previous treatment-block with replicates model except that we now allow for a block-treatment interaction. Our first standard hypothesis for this model is: ( ) ( ) ( ) ( ) H : θτ - θτ - θτ + θτ = 0 i,j θt 0 hi h i i.e., there is no block-treatment interaction. To test this hypothesis, we modify our previous formulas for the various sums of squares to account for the blocktreatment interaction. - the sum of squares for treatments is unchanged: y sst = bs y - y = - bs ( ) i i i=1 i=1 y bs - the sums of squares for blocks is also unchanged: y ssθ = s y - y = - s b b ( ) h h h=1 h=1 y bs

23 - Of course, the total sum of squares is unchanged: y sstot = y - y = y - bs b s b s ( hit ) hit h=1 i=1 t=1 h=1 i=1 t=1 - these sums of squares can be used to find the sum of squares for error: sse = sstot - ssθ - sst - ssθt which can not be calculated in this manner until we find the sum of squares for the block-treatment interaction. - the sum of squares for the block-treatment interaction is: b b b yhi y i yh y ssθt =s ( yhi -y ) = s bs s bs h=1 i=1 h=1 i=1 i=1 h=1 Now we can find the sum of squares for error: sse = sstot - ssθ - sst - ssθt

24 ssθt Now if the null hypothesis is true, then ~ χ ( b-1)( -1) σ sse and since ~ χ b( s-1) σ and ssθt and sse are independent, the ratio of these two Chi-Square statistics (divided by their respective degrees of freedom) yields ssθt ssθt ( b-1) ( -1) σ ( b-1) ( -1) msθt = = ~ F sse sse, mse b ( s-1) σ b ( s-1) which provides us with the means for testing ( ) ( ) ( ) ( ) θt 0 hi h i ( b-1 )( -1) b ( s-1 ) H : θτ - θτ - θτ + θτ = 0 i,j We will reject ( ) ( ) ( ) ( ) H : θτ - θτ - θτ + θτ = 0 i,j θt 0 hi h i when mst >Fb-1 ( )( -1 ), b ( s-1 ) mse,α At this point we can proceed with the second standard hypothesis for this model: T H : τ = τ = L = τ 0 1 i.e., there is no treatment effect. Again note that this can be partitioned into hypothesis tests about various factors in more complex designs.

25 sst Now if the null hypothesis is true, then ~ χ -1 σ sse and since ~ χ b( s-1) σ and sst and sse are independent, the ratio of these two Chi-Square statistics (divided by their respective degrees of freedom) yields sst sst ( -1) σ ( -1) mst = = ~ F sse sse -1,b ( s-1 ) mse b ( s-1) σ b ( s-1) which provides us with the means for testing H : τ = τ = L = τ 0 1 We will reject when H : τ = τ = L = τ 0 1 mst >F-1,b ( s-1 ) mse,α

26 The results of an Analysis of Variance for a RCBD with replicates can be summarized and displayed in an ANOVA Table: Source of Variation Degrees of Freedom (df) Sum of Squares (SS) Mean Square (MS) F-Ratio Block b-1 Treatment -1 Interaction(b-1)( 1) Error b(s-1) Total n - 1 sst = bs b ( h ) ssθ = s y - y h=1 ( y - y ) i i=1 b ( hi ) ssθt =s y -y h=1 i=1 sse = sstot - ssθ- sst - ssθt b s ( hit ) sstot = y - y h=1 i=1 t=1 ssb msb = b-1 sst mst = -1 ssθt msθt = ( b-1 ) ( -1 ) sse mse = b s-1 ( ) mst F= mse msθt F= mse Example Again consider the One-Factor CRD Automobile Battery Life problem where we wish to consider whether the battery is sealed as a blocking factor: Automobile Battery Life (in Months) Sealed? New England Mid West South West No No Yes Yes Region is the treatment (i = 1 for New England, i = for Mid West, and i = 3 for South West) and Type of Battery is the block (j = 1 for Sealed and j = for Nonsealed). Use these data to perform a General Complete RCBD ANOVA with a block-treatment interaction.

27 First we test for a block-treatment interaction: ( ) ( ) ( ) ( ) ( ) ( ) ( ) 1 1 ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) 1 1 ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) H : θτ - θτ - θτ + θτ = 0, θt θτ - θτ - θτ + θτ = 0, θτ - θτ - θτ + θτ = 0, θτ - θτ - θτ + θτ = 0, θτ - θτ - θτ + θτ = 0, θτ - θτ - θτ + θτ = 0, 3 3 The appropriate sum of squares is b b yhi y i yh y ssθt = h=1 i=1 s i=1 bs h=1 s bs = = and the corresponding mean square is ssθt msθt = = = 9.33 ( b-1) ( -1) ( -1) ( 3-1) The revised sum of squares for error is sse = sstot - ssθ - sst - ssθt = = and the corresponding mean squares is sse mse = = = b s-1-1 ( ) ( )

28 The resulting F-statistic for testing the hypothesis no block-treatment interaction is msθt 9.33 F = = = mse At (b 1)( 1) = and b(s 1) = 6 degrees of freedom, this test statistic has a p-value of , so we have very weak evidence of the existence of a brand-treatment interaction. Now we execute the test for a treatment effect: H0 : τ TryHard = τ NeverStart = τworrylast We have already calculated the sum of squares for treatment to be ( ) y i y sst = bs y i - y = - = = 88 bs bs i=1 i=1 and the corresponding mean square is sst 88 mst = = =

29 The resulting F-statistic for testing the hypothesis no treatment effect is mst F = = = mse At (b 1)( 1) = and b(s 1) = 6 degrees of freedom, this test statistic has a p-value of , so we have fairly strong evidence of the existence of a treatment effect. The results of our Analysis of Variance for a RCBD with replicates can be summarized and displayed in an ANOVA Table: Source of Variation Degrees of Freedom (df) Sum of Squares (SS) Mean Square (MS) F-Ratio Block 1 ssθ = 8.33 msb = 8.33 Treatment mst = F = 9.93 sst=88 Interaction ssθt = msθt = 9.33 F = 0.64 Error 6 sse = 87.0 mse = Total 11 sstot = 40.0

30 SAS code for a RCBD Model with a Block-Treatment Interaction: DATA battery; INPUT region $ autotype $ trtmnt ion life numdays brand $; num=_n_; LABEL region= Region of Country' autotype='type of Auto (Mid- vs. Full-sized)' life = 'Observed Life of Battery in Months' ion = 'Received Ion Treament (yes=1, no=0)' numdays = 'Number of Days with High Temperature Below Freezing' brand= Brand of Battery' num = 'Order of Data'; CARDS; NewEngla Mid TryHard SouthWes Full Fail-Safe ; PROC GLM DATA=battery; TITLE4 'Using PROC GLM for a General Complete RCBD Analysis'; CLASS type region; MODEL life=type region; RUN; SAS output for a RCBD Model with a Block-Treatment Interaction: Analysis of Variance AUTOMOBILE BATTERY LIFE DATA ANALYSIS FOR QA 605 WINTER QUARTER 006 Using PROC GLM for a General Complete RCBD Analysis Dependent Variable: life The GLM Procedure Observed Life of Battery in Months Sum of Source DF Squares Mean Square F Value Pr > F Model Error Corrected Total R-Square Coeff Var Root MSE life Mean Source DF Type I SS Mean Square F Value Pr > F type region type*region Source DF Type III SS Mean Square F Value Pr > F type region type*region

31 - Multiple comparisons and the General CBD again, the block-treatment model is similar to a single replicate two-way main effects model, so the least squares estimator for each µ + θ h + τ i (h = 1,,b; i = 1,,) is similar to the least squares estimator for each µ + α i + β j (i = 1,,a; j = 1,,b) so analogously any contrast in the treatment effects c τ, c = 0 i i i=1 i=1 is estimable in the randomized complete block design and has least-squares estimator i c ˆτ = c Y i i i i i=1 i=1 This contrast also has variance ˆ ci Var ciτ i = Var ciy i = σ i=1 i=1 i=1 b which leads to a 100(1 α)% confidence interval for a treatment contrast of c ˆ ± ˆ iτ i ciτ i t α Var n-b--1, cτ i i i=1 i=1 i=1 and multiple comparisons of the form ciτ i cy i i± w mse c i b i=1 i=1 i=1

32 For various multiple comparison procedures, we have the following critical coefficients: w B = t n-b- +1, α m ( ) w S = -1 F q,n- b- +1,α w T = -1,n-b- +1,α (where q, n b - + 1, α is taken from Table A.8 in D&V) w = t D (where ( 0.5) -1,n-b- +1,α ( 0.5) t -1,n-b- +1,α is taken from Table A.10 in D&V) ( 0.5) H D1-1,n-b- +1,α ( 0.5) t -1,n-b- +1,α w = w = t (where is taken from Table A.9 in D&V) The block-treatment interaction model is analogous to the two-way complete model, and contrasts and multiple comparison tests (for both the block-treatment interaction and treatment effect) are handled in a similar manner.

33 F. Sample Size Calculations Recall that a complete block design has n = bs experimental units divided into b blocks of size k = s. Thus block size k must be chosen to accommodate the experimental conditions budget constraints desired power/width of confidence intervals Thus, computing the number of observations s per block that will achieve a given level of power of the test of no treatment differences is analogous sample size calculations for testing main effects in a two-way design: s σφ b G.Checking Model Assumptions and Responding to their Violations Assumptions on block models (block-treatment and block-treatment interaction models) must be checked in the usual manner this can be accomplished by plotting residuals against order of observation (for independence) predicted values y hit, levels of treatment factor, levels of block factor (homogeneity of variance and fit/outliers) normal scores also against each treatment (or block) if r (or k) is large (normality of residuals) Of course, other approaches to assessment of assumptions (HOV tests, Shapiro-Wilk test, etc.) should also be used where appropriate.

34 If the assumptions are not met, we have similar alternatives transform the response to normalize the distribution and/or stabilize the variance of the residuals Brown-Forsythe Modified F-Test (under nonconstant variance) Satterthwaite s approximation or Welch s test (for multiple comparisons) a nonparametric approach, i.e., Friedman s test for randomized complete block designs (under nonconstant variance and/or nonnormal residuals) Friedman s Rank Test for Differences in c Medians - a common distribution-free alternative to ANOVA for RCBDs. Friedman s Test considers the following hypothesis of medians (M i s): H 0 : M 1 = =M To use Friedman s test 1. Rank all observations in descending order within each block. Calculate Friedman s test statistic F R : 1 s F R = T i - 3k ( s + 1) ks s + 1 ( ) i=1 where T i is the sum of ranks associated with the i th treatment. Note that if H 0 is true (no differences in treatment medians exist) then H ~ χ k-1.

35 Example Again consider the original One-Factor CRD Automobile Battery Life problem with the addition of information regarding the brand of battery (TryHard, NeverStart, WorryLast, or Fail-Safe): Automobile Battery Life (in Months) Brand New England Mid West South West TryHard NeverStart WorryLast Fail-Safe Region is the treatment (i = 1 for New England, i = for Mid West, and i = 3 for South West) and Brand is the block (h = 1 for TryHard, h = for NeverStart, h = 3 for WorryLast, and h = 4 for Fail-Safe). Suppose we believe Brand and Type of Automobile do not interact. Use these data to perform Friedman s Test. First we rank all observations in descending order within each block: Automobile Battery Life (in Months) Brand New England Mid West South West TryHard 1 3 NeverStart 3 1 WorryLast 1 3 Fail-Safe 1 3 Note that the sums of the ranks for the treatments are T NewEngland = 9, T MidWest = 4, and T SouthWest =11.

36 Now calculate Friedman s test statistic F R : 1 F R = T - 3k s + 1 ks s + 1 ( ) s i i=1 ( ) 1 = = ( ) ( ) ( ) ( ) ( ) At k 1 = 3 degrees of freedom, the computed value of the test statistic has a p-value of , so we have some evidence against the null hypothesis New England = MidWest = SouthWest M M M SAS code for Friedman s test: DATA battery; INPUT region $ autotype $ trtmnt ion life ion numdays brand $; LABEL region= Region of Country' autotype='type of Auto (Mid- vs. Full-sized)' life = 'Observed Life of Battery in Months' ion = 'Received Ion Treament (yes=1, no=0)' numdays = 'Number of Days with High Temperature Below Freezing' brand= Brand of Battery'; CARDS; NewEngla Mid TryHard SouthWes Full Fail-Safe ; PROC SORT battery; BY size; RUN; PROC RANK DATA=battery OUT=rbattery; VAR life; BY brand; ranks rlife; run; proc freq data=rbattery; tables brand*region*rlife/noprint cmh; run;

37 SAS output for Friedman s test: Using PROC FREQ for Friedman s Test The FREQ Procedure Summary Statistics for region by rlife Controlling for brand Cochran-Mantel-Haenszel Statistics (Based on Table Scores) Statistic Alternative Hypothesis DF Value Prob ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ 1 Nonzero Correlation Row Mean Scores Differ General Association Total Sample Size = 1 H.Factorial Experiments and Blocking When treatments are factorial in nature, treatment parameter τ i in the complete block model can be replaced by the associated main effect and interaction parameters For an experiment with two interacting treatment factors in a randomized complete block design, the treatment-block model is Y = µ + θ + τ + hijt h ij hijt substitution of the main effect and interaction parameters for the treatment parameters τ ij results in the equivalent model ( ) Y = µ + θ + γ + δ + γδ + hijt h i j ij hijt This can be generalized to any number of treatment and/or blocking factors.

V. Experiments With Two Crossed Treatment Factors

V. Experiments With Two Crossed Treatment Factors V. Experiments With Two Crossed Treatment Factors A.The Experimental Design Completely Randomized Design (CRD) Let A be a factor with levels i = 1,,a B be a factor with levels j = 1,,b Y ijt = the response

More information

Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8]

Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8] Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8] The completely randomized design Treatments are randomly assigned to e.u. such

More information

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) BSTT523 Pagano & Gauvreau Chapter 13 1 Nonparametric Statistics Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) In particular, data

More information

Week 7.1--IES 612-STA STA doc

Week 7.1--IES 612-STA STA doc Week 7.1--IES 612-STA 4-573-STA 4-576.doc IES 612/STA 4-576 Winter 2009 ANOVA MODELS model adequacy aka RESIDUAL ANALYSIS Numeric data samples from t populations obtained Assume Y ij ~ independent N(μ

More information

STAT 115:Experimental Designs

STAT 115:Experimental Designs STAT 115:Experimental Designs Josefina V. Almeda 2013 Multisample inference: Analysis of Variance 1 Learning Objectives 1. Describe Analysis of Variance (ANOVA) 2. Explain the Rationale of ANOVA 3. Compare

More information

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1) Summary of Chapter 7 (Sections 7.2-7.5) and Chapter 8 (Section 8.1) Chapter 7. Tests of Statistical Hypotheses 7.2. Tests about One Mean (1) Test about One Mean Case 1: σ is known. Assume that X N(µ, σ

More information

One-way ANOVA Model Assumptions

One-way ANOVA Model Assumptions One-way ANOVA Model Assumptions STAT:5201 Week 4: Lecture 1 1 / 31 One-way ANOVA: Model Assumptions Consider the single factor model: Y ij = µ + α }{{} i ij iid with ɛ ij N(0, σ 2 ) mean structure random

More information

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Jurusan Teknik Industri Universitas Brawijaya Outline Introduction The Analysis of Variance Models for the Data Post-ANOVA Comparison of Means Sample

More information

The Random Effects Model Introduction

The Random Effects Model Introduction The Random Effects Model Introduction Sometimes, treatments included in experiment are randomly chosen from set of all possible treatments. Conclusions from such experiment can then be generalized to other

More information

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization. 1 Chapter 1: Research Design Principles The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization. 2 Chapter 2: Completely Randomized Design

More information

Statistics For Economics & Business

Statistics For Economics & Business Statistics For Economics & Business Analysis of Variance In this chapter, you learn: Learning Objectives The basic concepts of experimental design How to use one-way analysis of variance to test for differences

More information

Key Features: More than one type of experimental unit and more than one randomization.

Key Features: More than one type of experimental unit and more than one randomization. 1 SPLIT PLOT DESIGNS Key Features: More than one type of experimental unit and more than one randomization. Typical Use: When one factor is difficult to change. Example (and terminology): An agricultural

More information

RCB - Example. STA305 week 10 1

RCB - Example. STA305 week 10 1 RCB - Example An accounting firm wants to select training program for its auditors who conduct statistical sampling as part of their job. Three training methods are under consideration: home study, presentations

More information

Analysis of Variance and Design of Experiments-I

Analysis of Variance and Design of Experiments-I Analysis of Variance and Design of Experiments-I MODULE VIII LECTURE - 35 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS MODEL Dr. Shalabh Department of Mathematics and Statistics Indian

More information

Chapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs

Chapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs Chapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs Adapted from Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis

More information

Lecture 7 Randomized Complete Block Design (RCBD) [ST&D sections (except 9.6) and section 15.8]

Lecture 7 Randomized Complete Block Design (RCBD) [ST&D sections (except 9.6) and section 15.8] Lecture 7 Randomized Complete Block Design () [ST&D sections 9.1 9.7 (except 9.6) and section 15.8] The Completely Randomized Design () 1. It is assumed that all experimental units (EU's) are uniform..

More information

PLS205 Lab 6 February 13, Laboratory Topic 9

PLS205 Lab 6 February 13, Laboratory Topic 9 PLS205 Lab 6 February 13, 2014 Laboratory Topic 9 A word about factorials Specifying interactions among factorial effects in SAS The relationship between factors and treatment Interpreting results of an

More information

Exam details. Final Review Session. Things to Review

Exam details. Final Review Session. Things to Review Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit

More information

Factorial designs. Experiments

Factorial designs. Experiments Chapter 5: Factorial designs Petter Mostad mostad@chalmers.se Experiments Actively making changes and observing the result, to find causal relationships. Many types of experimental plans Measuring response

More information

Topic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction Review of regression concepts

Topic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction Review of regression concepts Topic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction The analysis of covariance (ANCOVA) is a technique that is occasionally useful for improving the precision of an experiment.

More information

W&M CSCI 688: Design of Experiments Homework 2. Megan Rose Bryant

W&M CSCI 688: Design of Experiments Homework 2. Megan Rose Bryant W&M CSCI 688: Design of Experiments Homework 2 Megan Rose Bryant September 25, 201 3.5 The tensile strength of Portland cement is being studied. Four different mixing techniques can be used economically.

More information

WITHIN-PARTICIPANT EXPERIMENTAL DESIGNS

WITHIN-PARTICIPANT EXPERIMENTAL DESIGNS 1 WITHIN-PARTICIPANT EXPERIMENTAL DESIGNS I. Single-factor designs: the model is: yij i j ij ij where: yij score for person j under treatment level i (i = 1,..., I; j = 1,..., n) overall mean βi treatment

More information

Fractional Factorial Designs

Fractional Factorial Designs k-p Fractional Factorial Designs Fractional Factorial Designs If we have 7 factors, a 7 factorial design will require 8 experiments How much information can we obtain from fewer experiments, e.g. 7-4 =

More information

Lecture 7: Hypothesis Testing and ANOVA

Lecture 7: Hypothesis Testing and ANOVA Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis

More information

PLS205 Lab 2 January 15, Laboratory Topic 3

PLS205 Lab 2 January 15, Laboratory Topic 3 PLS205 Lab 2 January 15, 2015 Laboratory Topic 3 General format of ANOVA in SAS Testing the assumption of homogeneity of variances by "/hovtest" by ANOVA of squared residuals Proc Power for ANOVA One-way

More information

Unit 27 One-Way Analysis of Variance

Unit 27 One-Way Analysis of Variance Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied

More information

PLS205!! Lab 9!! March 6, Topic 13: Covariance Analysis

PLS205!! Lab 9!! March 6, Topic 13: Covariance Analysis PLS205!! Lab 9!! March 6, 2014 Topic 13: Covariance Analysis Covariable as a tool for increasing precision Carrying out a full ANCOVA Testing ANOVA assumptions Happiness! Covariable as a Tool for Increasing

More information

Unit 6: Orthogonal Designs Theory, Randomized Complete Block Designs, and Latin Squares

Unit 6: Orthogonal Designs Theory, Randomized Complete Block Designs, and Latin Squares Unit 6: Orthogonal Designs Theory, Randomized Complete Block Designs, and Latin Squares STA 643: Advanced Experimental Design Derek S. Young 1 Learning Objectives Understand the basics of orthogonal designs

More information

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

Design & Analysis of Experiments 7E 2009 Montgomery

Design & Analysis of Experiments 7E 2009 Montgomery 1 What If There Are More Than Two Factor Levels? The t-test does not directly apply ppy There are lots of practical situations where there are either more than two levels of interest, or there are several

More information

df=degrees of freedom = n - 1

df=degrees of freedom = n - 1 One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:

More information

Lecture 4. Random Effects in Completely Randomized Design

Lecture 4. Random Effects in Completely Randomized Design Lecture 4. Random Effects in Completely Randomized Design Montgomery: 3.9, 13.1 and 13.7 1 Lecture 4 Page 1 Random Effects vs Fixed Effects Consider factor with numerous possible levels Want to draw inference

More information

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is

More information

Introduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes)

Introduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes) Introduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes) Asheber Abebe Discrete and Statistical Sciences Auburn University Contents 1 Completely Randomized Design

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

Topic 13. Analysis of Covariance (ANCOVA) - Part II [ST&D Ch. 17]

Topic 13. Analysis of Covariance (ANCOVA) - Part II [ST&D Ch. 17] Topic 13. Analysis of Covariance (ANCOVA) - Part II [ST&D Ch. 17] 13.5 Assumptions of ANCOVA The assumptions of analysis of covariance are: 1. The X s are fixed, measured without error, and independent

More information

STAT 430 (Fall 2017): Tutorial 8

STAT 430 (Fall 2017): Tutorial 8 STAT 430 (Fall 2017): Tutorial 8 Balanced Incomplete Block Design Luyao Lin November 7th/9th, 2017 Department Statistics and Actuarial Science, Simon Fraser University Block Design Complete Random Complete

More information

DESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

DESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya DESAIN EKSPERIMEN BLOCKING FACTORS Semester Genap Jurusan Teknik Industri Universitas Brawijaya Outline The Randomized Complete Block Design The Latin Square Design The Graeco-Latin Square Design Balanced

More information

STAT Chapter 10: Analysis of Variance

STAT Chapter 10: Analysis of Variance STAT 515 -- Chapter 10: Analysis of Variance Designed Experiment A study in which the researcher controls the levels of one or more variables to determine their effect on the variable of interest (called

More information

3. Design Experiments and Variance Analysis

3. Design Experiments and Variance Analysis 3. Design Experiments and Variance Analysis Isabel M. Rodrigues 1 / 46 3.1. Completely randomized experiment. Experimentation allows an investigator to find out what happens to the output variables when

More information

Chapter 4: Randomized Blocks and Latin Squares

Chapter 4: Randomized Blocks and Latin Squares Chapter 4: Randomized Blocks and Latin Squares 1 Design of Engineering Experiments The Blocking Principle Blocking and nuisance factors The randomized complete block design or the RCBD Extension of the

More information

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Chap The McGraw-Hill Companies, Inc. All rights reserved. 11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview

More information

Analysis of variance, multivariate (MANOVA)

Analysis of variance, multivariate (MANOVA) Analysis of variance, multivariate (MANOVA) Abstract: A designed experiment is set up in which the system studied is under the control of an investigator. The individuals, the treatments, the variables

More information

BALANCED INCOMPLETE BLOCK DESIGNS

BALANCED INCOMPLETE BLOCK DESIGNS BALANCED INCOMPLETE BLOCK DESIGNS V.K. Sharma I.A.S.R.I., Library Avenue, New Delhi -110012. 1. Introduction In Incomplete block designs, as their name implies, the block size is less than the number of

More information

Contrasts and Multiple Comparisons Supplement for Pages

Contrasts and Multiple Comparisons Supplement for Pages Contrasts and Multiple Comparisons Supplement for Pages 302-323 Brian Habing University of South Carolina Last Updated: July 20, 2001 The F-test from the ANOVA table allows us to test the null hypothesis

More information

One-Way Analysis of Variance (ANOVA)

One-Way Analysis of Variance (ANOVA) 1 One-Way Analysis of Variance (ANOVA) One-Way Analysis of Variance (ANOVA) is a method for comparing the means of a populations. This kind of problem arises in two different settings 1. When a independent

More information

1 The Randomized Block Design

1 The Randomized Block Design 1 The Randomized Block Design When introducing ANOVA, we mentioned that this model will allow us to include more than one categorical factor(explanatory) or confounding variables in the model. In a first

More information

Sleep data, two drugs Ch13.xls

Sleep data, two drugs Ch13.xls Model Based Statistics in Biology. Part IV. The General Linear Mixed Model.. Chapter 13.3 Fixed*Random Effects (Paired t-test) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch

More information

Allow the investigation of the effects of a number of variables on some response

Allow the investigation of the effects of a number of variables on some response Lecture 12 Topic 9: Factorial treatment structures (Part I) Factorial experiments Allow the investigation of the effects of a number of variables on some response in a highly efficient manner, and in a

More information

Lecture 13 Extra Sums of Squares

Lecture 13 Extra Sums of Squares Lecture 13 Extra Sums of Squares STAT 512 Spring 2011 Background Reading KNNL: 7.1-7.4 13-1 Topic Overview Extra Sums of Squares (Defined) Using and Interpreting R 2 and Partial-R 2 Getting ESS and Partial-R

More information

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous. COMPLETELY RANDOM DESIGN (CRD) Description of the Design -Simplest design to use. -Design can be used when experimental units are essentially homogeneous. -Because of the homogeneity requirement, it may

More information

Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti

Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti Putra Malaysia Serdang Use in experiment, quasi-experiment

More information

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr.

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr. Department of Economics Business Statistics Chapter 1 Chi-square test of independence & Analysis of Variance ECON 509 Dr. Mohammad Zainal Chapter Goals After completing this chapter, you should be able

More information

Chapter 10. Design of Experiments and Analysis of Variance

Chapter 10. Design of Experiments and Analysis of Variance Chapter 10 Design of Experiments and Analysis of Variance Elements of a Designed Experiment Response variable Also called the dependent variable Factors (quantitative and qualitative) Also called the independent

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X. Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.

More information

Chapter 15: Analysis of Variance

Chapter 15: Analysis of Variance Chapter 5: Analysis of Variance 5. Introduction In this chapter, we introduced the analysis of variance technique, which deals with problems whose objective is to compare two or more populations of quantitative

More information

Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data

Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data 1999 Prentice-Hall, Inc. Chap. 10-1 Chapter Topics The Completely Randomized Model: One-Factor

More information

Inferences About the Difference Between Two Means

Inferences About the Difference Between Two Means 7 Inferences About the Difference Between Two Means Chapter Outline 7.1 New Concepts 7.1.1 Independent Versus Dependent Samples 7.1. Hypotheses 7. Inferences About Two Independent Means 7..1 Independent

More information

CHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication

CHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication CHAPTER 4 Analysis of Variance One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication 1 Introduction In this chapter, expand the idea of hypothesis tests. We

More information

STAT 506: Randomized complete block designs

STAT 506: Randomized complete block designs STAT 506: Randomized complete block designs Timothy Hanson Department of Statistics, University of South Carolina STAT 506: Introduction to Experimental Design 1 / 10 Randomized complete block designs

More information

TA: Sheng Zhgang (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan (W 1:20) / 346 (Th 12:05) FINAL EXAM

TA: Sheng Zhgang (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan (W 1:20) / 346 (Th 12:05) FINAL EXAM STAT 301, Fall 2011 Name Lec 4: Ismor Fischer Discussion Section: Please circle one! TA: Sheng Zhgang... 341 (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan... 345 (W 1:20) / 346 (Th

More information

Inference for the Regression Coefficient

Inference for the Regression Coefficient Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates

More information

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables. One-Way Analysis of Variance With regression, we related two quantitative, typically continuous variables. Often we wish to relate a quantitative response variable with a qualitative (or simply discrete)

More information

Lecture 3: Inference in SLR

Lecture 3: Inference in SLR Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals

More information

Formal Statement of Simple Linear Regression Model

Formal Statement of Simple Linear Regression Model Formal Statement of Simple Linear Regression Model Y i = β 0 + β 1 X i + ɛ i Y i value of the response variable in the i th trial β 0 and β 1 are parameters X i is a known constant, the value of the predictor

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

Stat 217 Final Exam. Name: May 1, 2002

Stat 217 Final Exam. Name: May 1, 2002 Stat 217 Final Exam Name: May 1, 2002 Problem 1. Three brands of batteries are under study. It is suspected that the lives (in weeks) of the three brands are different. Five batteries of each brand are

More information

One-Way ANOVA Cohen Chapter 12 EDUC/PSY 6600

One-Way ANOVA Cohen Chapter 12 EDUC/PSY 6600 One-Way ANOVA Cohen Chapter 1 EDUC/PSY 6600 1 It is easy to lie with statistics. It is hard to tell the truth without statistics. -Andrejs Dunkels Motivating examples Dr. Vito randomly assigns 30 individuals

More information

Chapter 8 Student Lecture Notes 8-1. Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance

Chapter 8 Student Lecture Notes 8-1. Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance Chapter 8 Student Lecture Notes 8-1 Department of Economics Business Statistics Chapter 1 Chi-square test of independence & Analysis of Variance ECON 509 Dr. Mohammad Zainal Chapter Goals After completing

More information

Sample Size / Power Calculations

Sample Size / Power Calculations Sample Size / Power Calculations A Simple Example Goal: To study the effect of cold on blood pressure (mmhg) in rats Use a Completely Randomized Design (CRD): 12 rats are randomly assigned to one of two

More information

4.8 Alternate Analysis as a Oneway ANOVA

4.8 Alternate Analysis as a Oneway ANOVA 4.8 Alternate Analysis as a Oneway ANOVA Suppose we have data from a two-factor factorial design. The following method can be used to perform a multiple comparison test to compare treatment means as well

More information

Analysis of Variance (ANOVA)

Analysis of Variance (ANOVA) Analysis of Variance ANOVA) Compare several means Radu Trîmbiţaş 1 Analysis of Variance for a One-Way Layout 1.1 One-way ANOVA Analysis of Variance for a One-Way Layout procedure for one-way layout Suppose

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

In many situations, there is a non-parametric test that corresponds to the standard test, as described below:

In many situations, there is a non-parametric test that corresponds to the standard test, as described below: There are many standard tests like the t-tests and analyses of variance that are commonly used. They rest on assumptions like normality, which can be hard to assess: for example, if you have small samples,

More information

Lec 5: Factorial Experiment

Lec 5: Factorial Experiment November 21, 2011 Example Study of the battery life vs the factors temperatures and types of material. A: Types of material, 3 levels. B: Temperatures, 3 levels. Example Study of the battery life vs the

More information

Incomplete Block Designs

Incomplete Block Designs Incomplete Block Designs Recall: in randomized complete block design, each of a treatments was used once within each of b blocks. In some situations, it will not be possible to use each of a treatments

More information

Tentative solutions TMA4255 Applied Statistics 16 May, 2015

Tentative solutions TMA4255 Applied Statistics 16 May, 2015 Norwegian University of Science and Technology Department of Mathematical Sciences Page of 9 Tentative solutions TMA455 Applied Statistics 6 May, 05 Problem Manufacturer of fertilizers a) Are these independent

More information

Correlation and the Analysis of Variance Approach to Simple Linear Regression

Correlation and the Analysis of Variance Approach to Simple Linear Regression Correlation and the Analysis of Variance Approach to Simple Linear Regression Biometry 755 Spring 2009 Correlation and the Analysis of Variance Approach to Simple Linear Regression p. 1/35 Correlation

More information

Simple Linear Regression: One Quantitative IV

Simple Linear Regression: One Quantitative IV Simple Linear Regression: One Quantitative IV Linear regression is frequently used to explain variation observed in a dependent variable (DV) with theoretically linked independent variables (IV). For example,

More information

Topic 20: Single Factor Analysis of Variance

Topic 20: Single Factor Analysis of Variance Topic 20: Single Factor Analysis of Variance Outline Single factor Analysis of Variance One set of treatments Cell means model Factor effects model Link to linear regression using indicator explanatory

More information

Much of the material we will be covering for a while has to do with designing an experimental study that concerns some phenomenon of interest.

Much of the material we will be covering for a while has to do with designing an experimental study that concerns some phenomenon of interest. Experimental Design: Much of the material we will be covering for a while has to do with designing an experimental study that concerns some phenomenon of interest We wish to use our subjects in the best

More information

Example: Four levels of herbicide strength in an experiment on dry weight of treated plants.

Example: Four levels of herbicide strength in an experiment on dry weight of treated plants. The idea of ANOVA Reminders: A factor is a variable that can take one of several levels used to differentiate one group from another. An experiment has a one-way, or completely randomized, design if several

More information

2 k, 2 k r and 2 k-p Factorial Designs

2 k, 2 k r and 2 k-p Factorial Designs 2 k, 2 k r and 2 k-p Factorial Designs 1 Types of Experimental Designs! Full Factorial Design: " Uses all possible combinations of all levels of all factors. n=3*2*2=12 Too costly! 2 Types of Experimental

More information

1 DV is normally distributed in the population for each level of the within-subjects factor 2 The population variances of the difference scores

1 DV is normally distributed in the population for each level of the within-subjects factor 2 The population variances of the difference scores One-way Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti Putra Malaysia Serdang The purpose is to test the

More information

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6 STA 8 Applied Linear Models: Regression Analysis Spring 011 Solution for Homework #6 6. a) = 11 1 31 41 51 1 3 4 5 11 1 31 41 51 β = β1 β β 3 b) = 1 1 1 1 1 11 1 31 41 51 1 3 4 5 β = β 0 β1 β 6.15 a) Stem-and-leaf

More information

Analysis of Variance. ภาว น ศ ร ประภาน ก ล คณะเศรษฐศาสตร มหาว ทยาล ยธรรมศาสตร

Analysis of Variance. ภาว น ศ ร ประภาน ก ล คณะเศรษฐศาสตร มหาว ทยาล ยธรรมศาสตร Analysis of Variance ภาว น ศ ร ประภาน ก ล คณะเศรษฐศาสตร มหาว ทยาล ยธรรมศาสตร pawin@econ.tu.ac.th Outline Introduction One Factor Analysis of Variance Two Factor Analysis of Variance ANCOVA MANOVA Introduction

More information

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown Nonparametric Statistics Leah Wright, Tyler Ross, Taylor Brown Before we get to nonparametric statistics, what are parametric statistics? These statistics estimate and test population means, while holding

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal Email: yuppal@ysu.edu Chapter 13, Part A: Analysis of Variance and Experimental Design Introduction to Analysis of Variance Analysis

More information

Analysis Of Variance Compiled by T.O. Antwi-Asare, U.G

Analysis Of Variance Compiled by T.O. Antwi-Asare, U.G Analysis Of Variance Compiled by T.O. Antwi-Asare, U.G 1 ANOVA Analysis of variance compares two or more population means of interval data. Specifically, we are interested in determining whether differences

More information

ANOVA: Comparing More Than Two Means

ANOVA: Comparing More Than Two Means 1 ANOVA: Comparing More Than Two Means 10.1 ANOVA: The Completely Randomized Design Elements of a Designed Experiment Before we begin any calculations, we need to discuss some terminology. To make this

More information

Unit 7: Random Effects, Subsampling, Nested and Crossed Factor Designs

Unit 7: Random Effects, Subsampling, Nested and Crossed Factor Designs Unit 7: Random Effects, Subsampling, Nested and Crossed Factor Designs STA 643: Advanced Experimental Design Derek S. Young 1 Learning Objectives Understand how to interpret a random effect Know the different

More information

Two-factor studies. STAT 525 Chapter 19 and 20. Professor Olga Vitek

Two-factor studies. STAT 525 Chapter 19 and 20. Professor Olga Vitek Two-factor studies STAT 525 Chapter 19 and 20 Professor Olga Vitek December 2, 2010 19 Overview Now have two factors (A and B) Suppose each factor has two levels Could analyze as one factor with 4 levels

More information

1. The (dependent variable) is the variable of interest to be measured in the experiment.

1. The (dependent variable) is the variable of interest to be measured in the experiment. Chapter 10 Analysis of variance (ANOVA) 10.1 Elements of a designed experiment 1. The (dependent variable) is the variable of interest to be measured in the experiment. 2. are those variables whose effect

More information

Statistical Hypothesis Testing

Statistical Hypothesis Testing Statistical Hypothesis Testing Dr. Phillip YAM 2012/2013 Spring Semester Reference: Chapter 7 of Tests of Statistical Hypotheses by Hogg and Tanis. Section 7.1 Tests about Proportions A statistical hypothesis

More information

Three-Way Contingency Tables

Three-Way Contingency Tables Newsom PSY 50/60 Categorical Data Analysis, Fall 06 Three-Way Contingency Tables Three-way contingency tables involve three binary or categorical variables. I will stick mostly to the binary case to keep

More information

Lecture 6: Single-classification multivariate ANOVA (k-group( MANOVA)

Lecture 6: Single-classification multivariate ANOVA (k-group( MANOVA) Lecture 6: Single-classification multivariate ANOVA (k-group( MANOVA) Rationale and MANOVA test statistics underlying principles MANOVA assumptions Univariate ANOVA Planned and unplanned Multivariate ANOVA

More information