Unit 8: A Mixed Two-Factor Design
|
|
- Cathleen Nelson
- 6 years ago
- Views:
Transcription
1 Minitab Notes for STAT 6305 Dept. of Statistics CSU East Bay Unit 8: A Mixed Two-Factor Design 8.1. The Data We use data quoted in Brownlee: Statistical Theory and Methodology in Science and Engineering, 2nd ed., page 502, on the rupture strength of concrete beams. Six beams were cast from each of five batches of concrete. Of the six beams from each batch, two were treated with each of three different "capping" formulas (a process intended to add strength). The relevant measurement was the breaking strength of each beam in psi (lbs. per square inch). The results are shown below. Breaking Strength (psi) of Concrete Beams (Results for 3 capping formulas with each of 5 batches of concrete) Batch (B) Formula (A) Problems: Source: George Werner, "The effect of capping material on the compressive strength of concrete cylinders," Proceedings of the American Society for Testing and Materials, 58 (1958), (Somewhat abridged here.) For your convenience the breaking strength measurements in the table above have been repeated below, reading across each row of the table. Put them into c1 of a Minitab worksheet; label it PSI. Then use the patterned data feature to make subscript columns c2 and c3 for A and B, respectively. What "pattern codes" can be used to generate these columns? Finally, look in the worksheet or print it out to verify that its rows contain the information shown at the top of the next page Use Minitab to make a table similar to the one shown above. Also make a table that shows the means for each cell, row, and column Make box plots or dot plots of these data broken out by Formula. Even though these plots do not account for the fact that the data come from five different batches of concrete, do they suggest that one capping formula may result in beams that are significantly stronger (or weaker) than the others? Is there any evidence of outliers or skewness among the observations for any of the Formulas? Based on graphical displays similar to those in the previous problem, does it seem that there is significant variability among the batches?
2 Minitab Notes for STAT 6305 Unit 8-2 c1 c2 c3 c1 c2 c3 c1 c2 c3 ROW PSI A B ROW PSI A B ROW PSI A B The Model In this experiment there are two factors that may influence the strength of a particular concrete beam: the capping Formula used to enhance its strength and the Batch of concrete from which it was made. We consider Formula to be a fixed effect. Only these three capping formulas are currently under study. We would like to know if they differ significantly as to the strength they impart to the beams to which they are applied. If, so we would like to establish the pattern of significant differences. In particular, we would like to know if one capping formula is better than the others. We consider Batch to be a random effect. The five batches are of interest only because variability among them may contribute to variability in strength of the beams. We would like to know if the variability among batches is noticeable over and above the inevitable variability in strength from beam to beam, even ones made in the same way from the same batch. In addition, there might be interaction between these two factors. (As a purely speculative scenario, some batches may be mixed in such a way that voids from air bubbles show at the surface of beams cast from them. Unfilled, these voids may weaken a beam. One capping formula may not be as good as the others overall, but it may be particularly effective in filling such voids.) We are able to test for interaction here because there is more than one observation per cell, so that interaction and error effects are not confounded. Formally, this is a two-way mixed-model analysis of variance (mixed because one effect is fixed and the other is random). There are a = 3 levels of the fixed factor, b = 5 levels of the random factor, and n = 2 replications within each cell. The model is: Y ijk = µ + α i + B j + (αb) ij + e ijk, for i = 1, 2,3, j = 1,..., 5, and k = 1, 2. The parameters α i correspond to the fixed effects of the 3 levels of Formula, the random variables B j correspond to the random effects of the 5 levels of Batch, and the random variables (αb) ij correspond to 15 random "non-additive" adjustments, one for each cell. (Non-additive means that for a given k, the value of Y ijk cannot be expressed as the sum of mean and effects µ + α i + B j.) Moreover, the distributions of the random variables in this model are as follows: B j are identically distributed according to N(0, σ B 2 ), (αb) ij are distributed identically according to N(0, σ αb 2 ), and
3 e ijk are identically distributed according to N(0, σ 2 ). Minitab Notes for STAT 6305 Unit 8-3 Moreover, in the unrestricted version of the model all of these random variables are mutually independent. In our ANOVA we shall perform tests of three null hypotheses: No Formula effect [H 0 : Σ i α i 2 = 0], no Batch effect [H 0 : σ B 2 = 0], and no Interaction [H 0 : σ αb 2 = 0]. For each test the alternative is that the relevant sum of squared parameters or variance component is positive. About restricted and unrestricted models. Some authors place restrictions on the parameters α i and (αb) ij and some authors do not. The restriction Σ i α i = 0 is a matter of notational and computational convenience, but it does not affect the way the data are analyzed. Another possible restriction is that Σ i (αb) ij = 0, for each value of j, where each sum is taken over the full range of the subscript i. If this restriction is made, then Cov[(αB) ij, (αb) i'j ] = σ αb 2 /(a 1), for i i'. If this restriction is made, then, for a given j and different levels i, the random variables (αb) ij cannot be independent because of the restriction that they sum to 0 over the index i. Some authors do not put restrictions on the random variables (αb) ij that represent the interaction effect. One speaks of the distinction between restricted ANOVA models and unrestricted ones. The distinction can make a difference in how data is analyzed in some cases. This is because the decision to restrict or not makes a difference in how EMSs are computed, and thus how F-ratios are formed. Although there are specific experimental situations in which it seems clear that these restrictions on interaction terms may be appropriate, the tendency seems to be to use unrestricted models as a general practice. In O/L 5e all computations of mixed two-factor models seem to have used the restricted model, even though this was not explicitly stated in specifying the models. In Minitab, a subcommand is required if restrictions are to be made. It is difficult to use restricted models in SAS. It is simpler to derive EMSs for restricted models than for unrestricted ones. In O/L 6e it is explicitly stated that unrestricted models are used. (This required large-scale revisions in the text, a few of which do not seem to me to have been made correctly. I will try to point out the errors that matter for purposes of this course.) In Minitab, unrestricted models are the default (requiring no subcommand). Problems: Look at the interaction plot that has Batches on the horizontal axis, so that there are three broken-line traces representing the three capping formulas. (See the next page.) If you had to guess just from looking at this plot, would you say that there is significant interaction? If interaction were significant, would it be disorderly with respect to the Formula effect?
4 Minitab Notes for STAT 6305 Unit 8-4 Interaction Plot (data means) for PSI A A B B (a) Because we are interested in the randomly chosen batches only as a possible source of variability, we will not be interested in determining specific patterns of differences among the batches, even if the Batch effect turns out to be significant. But use the profile plot with Formulas on the horizontal axis to speculate whether some of our randomly chosen Batches happened to produce stronger (weaker) beams than others. (b) If the Formula effect turns out to be significant, is there a formula that seems better (or worse) than the others? Suppose that you were given the 15 cell means rather than the 30 individual observations. Write the model you would use to analyze the results Analysis and Interpretation Here we use Minitab to make the ANOVA table and to interpret the results of this mixed two-factor model. The model for this ANOVA is more complex than for those that have gone before. It is very important to include every key feature of the formal model (Section 2) in specifying the model in Minitab. Notice the following requirements for using Minitab correctly: You must include the interaction term in specifying the model on the command line. There are two ways to do this: either include the term A * B in the model or put a vertical bar between the designations of the two main effects: A B. Minitab assumes that effects are fixed unless declared as random. You must declare Batch as a random effect, either using the subcommand random Batch or, if using menus, by putting Batch in the "text box" for random factors. A proper interpretation of the resulting ANOVA table is based on an understanding of the expected mean squares (EMS table). Include the subcommand ems or by marking the check box for EMS under Results.
5 Minitab Notes for STAT 6305 Unit 8-5 As always, we will want to look at a normal probability plot of the residuals. This requires the subcommand residuals cx, where cx denotes an empty worksheet column. Alternatively, in the Balanced ANOVA menu, check the box for residuals under Storage, in which case Minitab selects an empty column automatically. (In menus, you can also get a normal probability plot of the residuals by marking the appropriate check box under Graphs, but this plot will not have confidence bands.) STAT > ANOVA > Balanced, Declare B random, Results to show EMS, Storage of residuals [or Graph normal probability plot of residuals]. MTB > anova PSI = A B; SUBC> random B; SUBC> ems; SUBC> resid c4. ANOVA: PSI versus A, B Factor Type Levels Values A fixed 3 1, 2, 3 B random 5 1, 2, 3, 4, 5 Analysis of Variance for PSI Source DF SS MS F P A B A*B Error Total S = R-Sq = 90.01% R-Sq(adj) = 80.69% Expected Mean Square Variance Error for Each Term (using Source component term unrestricted model) 1 A 3 (4) + 2 (3) + Q[1] 2 B (4) + 2 (3) + 6 (2) 3 A*B (4) + 2 (3) 4 Error (4) In all of the models we have seen to date, MS(Error) has been used in the denominator of the F-statistic for each test of significance. The situation is now somewhat more complex. The following table interprets Minitab's EMS table in terms of our symbols above, where we define θ α = nb[σ i α i 2 ]/(a 1) and where the column "Error Term" indicates which MS is used in the denominator of the F-statistic for the corresponding row. Two-Way Mixed Model (Unrestricted) Source Error Term Expected Mean Square Formula (A Fixed) Interaction σ 2 + 2σ 2 αb + θ α Batch (B Random) Interaction σ 2 + 2σ 2 2 αb + 6σ B Interaction (A*B Mixed) Error σ σ αb Error σ 2 We begin our interpretation of the ANOVA table by noticing that the Interaction effect is not significant, P = Thus we can make straightforward interpretations of the main effects.
6 Minitab Notes for STAT 6305 Unit 8-6 Both of the main effects are very highly significant (with very small P-values). A final interpretation as to which capping formula is best or worst awaits an analysis of multiple comparisons (see Problem 8.3.2). As for the Batch, we now know that batch-to-batch variation is a significant source of variability in the strengths of the beams. Of course there is also variability within each cell of the data table because no two beams will ever be exactly alike and because of measurement errors. But the component of variance due to Batches significantly stands out above this within-cell variability. There is no point in determining the exact pattern of differences among the Batches because we will never see these particular batches again. It may, however, be worth pondering whether something can be done to decrease this component of variance in order to get a more uniform product. The important thing to notice about the tests of the main effects is that the F-ratio for testing the Formula effect is MS(A)/MS(A*B) and the F-ratio for testing the Batch effect is MS(B)/MS(A*B). That is, the interaction mean square is used as the denominator for both F-ratios. The rationale for using MS(A*B) in the denominator of F(A) is as follows. If there is no Formula effect, then all of the α i are 0 so that θ α = 0. In that case, we have EMS(A) = EMS(A*B) = σ 2 + 2σ αb 2, so that MS(A)/MS(A*B) has an F-distribution with 4 and 8 degrees of freedom. Similar arguments based on the EMS table lead us to conclude that the Batch effect is also "tested against Interaction" and that Interaction should be "tested against Error." Problems: Look at the normal probability plot of the residuals. What is your interpretation? What is the P-value of the Anderson-Darling test for normality? Because Interaction is not significant one could remove the interaction term from the ANOVA model before testing the main effects. (a) What happens to the df for Interaction when you "pool" in this fashion? What is the correct "error term" for testing the main effects in this model? (b) Use a calculator to perform Tukey's HSD procedure to determine the pattern of differences among Formulas. Use the "pooled" MS(Error) and df(error) in your computations. Recall that each Formula mean is based on 10 observations. Verify your results using Tukey comparisons in Minitab's general linear models (GLM) procedure Perform the ANOVA including the interaction term, but specifying the restricted model. What important difference in the formation of F-ratios is implied by Minitab's EMS table? Note: For several decades, the difference in error terms (between the restricted and unrestricted model) that you see here has been the cause of controversy and debate among statisticians in the interpretation of mixed-model ANOVAs. The question has been: Which model is correct: restricted or unrestricted? Here are comments on recent discussions: Some texts suggest that the restricted model is appropriate in some applications and the unrestricted model in others. (An example is the text by Oehlert.) The texts by Brownlee and Snedecor and Cochran discuss the restricted model exclusively. O/L 5e also used the restricted model, but O/L 6e uses the unrestricted model. Thus, sensible or not, using the restricted model in Minitab for every ANOVA with any fixed effect will give the same results as in the books by Brownlee, Snedecor / Cochran, and earlier editions of Ott's book. More advanced books on ANOVA generally prefer the unrestricted model. Also, some software packages also prefer the unrestricted model: SAS seems to do the restricted model only with special programming, and Minitab uses the unrestricted model by default, doing restricted analyses only if requested by a subcommand.
7 Minitab Notes for STAT 6305 Unit 8-7 In practice, situations in which the distinction between restricted and unrestricted models will lead to different practical interpretations seem to be rare. For example, in the current two-way mixed model ANOVA, the importance of the distinction evaporates if interaction is far from significant and the interaction term is removed from the model (as in Problem 8.3.2). Even when the interaction term remains, for the practical interpretation of main effects it may not make much difference whether MS(Error) or MS(Interaction) is put in the denominator of F Suppose that the 30 observations given at the beginning of this unit were collected exactly in order reading across the rows of the original data table, with Formula 1 tests being run on Monday, Formula 2 on Tuesday, and Formula 3 on Wednesday. Furthermore, suppose that on each of these days the Day shift ran Batch 1, the Swing shift ran Batches 2, 3, and 4, and the Night shift ran Batch 5. What effect would this knowledge have on the report you would write about your conclusions from the experiment? Suppose that the beams that produced the values 613 and 656 in the first two cells with Formula 1 had been damaged in handling so that these two values are missing. Further suppose that for each of the cells in Batch 3 a third beam was tested with results: 649, 612, 614 for Formulas 1, 2, 3, respectively. How would you analyze the resulting unbalanced design? Do the missing and extra observations change the overall interpretation of the results? 8.4. Comparison with Fixed and Random Models In this section we compare the mixed model just analyzed with a model in which both factors are fixed and one in which both factors are random. We do this by "changing the story" behind the data on strengths of concrete beams. These fantasy models are incorrect for the data given, but they are instructive from a computational point of view. If Both Factors Were Fixed. First, suppose that we have access to five major suppliers of concrete, and that Factor B represents "brands" of concrete rather than batches from the same supplier. Suppose that Factor A remains capping formulas as in the true story. In this case both factors would be fixed effects. HYPOTHETICAL FIXED-EFFECTS MODEL MTB > anova PSI = A B; SUBC> ems. ANOVA: PSI versus A, B Factor Type Levels Values A fixed 3 1, 2, 3 B fixed 5 1, 2, 3, 4, 5 Analysis of Variance for PSI Source DF SS MS F P A B A*B Error Total S = R-Sq = 90.01% R-Sq(adj) = 80.69% Expected Mean Square Variance Error for Each Term (using Source component term unrestricted model) 1 A 4 (4) + Q[1,3] 2 B 4 (4) + Q[2,3] 3 A*B 4 (4) + Q[3] 4 Error (4)
8 Minitab Notes for STAT 6305 Unit 8-8 Notice that MS(Error) is used in the denominator for F(A*B), F(A) and F(B) when both effects are fixed. There is not much point in giving interpretations of the F-ratios here because our phony model was not used to collect the data. A legitimate example of a fixed two-factor ANOVA with interaction was discussed in a prior unit. If Both Factors Were Random. Second, pretend that Factor B refers to randomly chosen batches as in the true story. However, imagine that Factor A is also a random factor perhaps corresponding to randomly chosen batches of a single capping formula. The issue would be how much of the variability in measured strength of the beams is due to random concrete batch variability, how much to random variability in batches of the capping formula, how much to any interaction between these two random effects, and how much to unexplained random error. To use Minitab for this fanciful scenario, it is necessary to declare both factors as random. We declare the restricted model here just for consistency. Whether to "restrict" is not an issue unless there is at least one fixed effect; the only result of the restrict subcommand is in the heading of the EMS table. (The residuals are the same as for the previous two models.) HYPOTHETICAL RANDOM-EFFECTS MODEL MTB > anova PSI = A B; SUBC> random A B; SUBC> ems. ANOVA: PSI versus A, B Factor Type Levels Values A random 3 1, 2, 3 B random 5 1, 2, 3, 4, 5 Analysis of Variance for PSI Source DF SS MS F P A B A*B Error Total S = R-Sq = 90.01% R-Sq(adj) = 80.69% Expected Mean Square Variance Error for Each Term (using Source component term unrestricted model) 1 A (4) + 2 (3) + 10 (1) 2 B (4) + 2 (3) + 6 (2) 3 A*B (4) + 2 (3) 4 Error (4) In this case, the EMS information tells us to use the same F-ratios as in the model where A is fixed and B is random. EMS Tables in O/L 6e for Fixed, Mixed, and Random Models. Table in O/L 6e gives general formulas for EMSs in the two-factor fixed, mixed, and random models. The coefficients of the terms in each EMS depend on the number of levels of a and b of the two factors and the number n of replications in each cell. These are similar to the EMS tables from Minitab, except that the components for fixed effects (θ A, θ B, θ AB in O/L and Q[ ] in Minitab, as appropriate) are defined differently.
9 Minitab Notes for STAT 6305 Unit 8-9 O/L 6e Sect presents an algorithm claimed to produce EMS tables for a wide variety of experimental designs. This section is not required for our course because we do not feel any important insights are gained by memorizing this rather complicated procedure, and because the algorithm programmed into Minitab (and most other serious software packages) can provide the EMS tables we need. If interaction is not significant and the reduced model (leaving out interaction) is used, then all twoway models (fixed/mixed/random, restricted/unrestricted) have the same F-ratios: both main effects are tested against error. Problems: Compare two-way ANOVA tables for fixed, mixed, and random models. Which of the following columns are unchanged, regardless of which factors are declared as random: DF, SS, MS, F? In Problem we saw that, for the mixed model, Minitab's F-ratios are different depending on whether the restricted model is declared or whether the default unrestricted model is used. Is there a similar difference in F-ratios for the two-way model with both effects random? Same as problem 8.4.2, but consider both effects to be fixed? First, use the unrestricted model and then the restricted model, showing EMS tables in both cases. Comment on similarities and differences. Which EMS table agrees with what is shown in O/L 6e, Table 17.17, p1057? For the data in this unit, the most fundamental interpretation has been the same whether with the true story (mixed model), or with the two imaginary scenarios (both fixed, both random): Interaction is not significant, A-effect highly significant, and B-effect highly significant. Thus, with these data an untrained "statistician" who does not know the difference between fixed and random effects would get the same results for the three F-tests no matter which model he or she happened to choose. Under what circumstances might the results of these F-tests depend on getting the correct model? (Of course, the interpretation of profile plots, use of multiple comparison procedures for significant main effects, and explanation of the findings would depend on knowing the difference between fixed and random effects in any case.) Carefully write the models with both factors fixed (restricted) and both factors random (where the issue of restriction does not arise). Show the equation (using Greek letters for parameters and Latin letters for random variables), ranges of subscripts, restrictions (as appropriate), and distributional assumptions. Minitab Notes for Statistics 6305 by Bruce E. Trumbo, Department of Statistics, CSU East Bay, Hayward CA, 94542, bruce.trumbo@csueastbau.edu. Comments and corrections welcome. Copyright 1991, 2010 by Bruce E. Trumbo. All rights reserved. These notes are intended primarily for use at California State University, East Bay. For other uses, please contact the author. Preparation of earlier versions of this document was partially supported by NSF grant USE Modified: 2/10
Unit 8: A Mixed Two-Factor Design
Minitab Notes for STAT 3503 Dept. of Statistics CSU Hayward Unit 8: A Mixed Two-Factor Design 8.1. The Data We use data quoted in Brownlee: Statistical Theory and Methodology in Science and Engineering,
More informationResidual Analysis for two-way ANOVA The twoway model with K replicates, including interaction,
Residual Analysis for two-way ANOVA The twoway model with K replicates, including interaction, is Y ijk = µ ij + ɛ ijk = µ + α i + β j + γ ij + ɛ ijk with i = 1,..., I, j = 1,..., J, k = 1,..., K. In carrying
More informationTWO OR MORE RANDOM EFFECTS. The two-way complete model for two random effects:
TWO OR MORE RANDOM EFFECTS Example: The factors that influence the breaking strength of a synthetic fiber are being studied. Four production machines and three operators are randomly selected. A two-way
More informationSleep data, two drugs Ch13.xls
Model Based Statistics in Biology. Part IV. The General Linear Mixed Model.. Chapter 13.3 Fixed*Random Effects (Paired t-test) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch
More informationINTRODUCTION TO ANALYSIS OF VARIANCE
CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two
More information9 One-Way Analysis of Variance
9 One-Way Analysis of Variance SW Chapter 11 - all sections except 6. The one-way analysis of variance (ANOVA) is a generalization of the two sample t test to k 2 groups. Assume that the populations of
More informationTwo-Way Analysis of Variance - no interaction
1 Two-Way Analysis of Variance - no interaction Example: Tests were conducted to assess the effects of two factors, engine type, and propellant type, on propellant burn rate in fired missiles. Three engine
More informationDISTRIBUTIONS USED IN STATISTICAL WORK
DISTRIBUTIONS USED IN STATISTICAL WORK In one of the classic introductory statistics books used in Education and Psychology (Glass and Stanley, 1970, Prentice-Hall) there was an excellent chapter on different
More information1 Introduction to One-way ANOVA
Review Source: Chapter 10 - Analysis of Variance (ANOVA). Example Data Source: Example problem 10.1 (dataset: exp10-1.mtw) Link to Data: http://www.auburn.edu/~carpedm/courses/stat3610/textbookdata/minitab/
More information1. Least squares with more than one predictor
Statistics 1 Lecture ( November ) c David Pollard Page 1 Read M&M Chapter (skip part on logistic regression, pages 730 731). Read M&M pages 1, for ANOVA tables. Multiple regression. 1. Least squares with
More informationUnit 27 One-Way Analysis of Variance
Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied
More informationTwo or more categorical predictors. 2.1 Two fixed effects
Two or more categorical predictors Here we extend the ANOVA methods to handle multiple categorical predictors. The statistician has to watch carefully to see whether the effects being considered are properly
More informationChap The McGraw-Hill Companies, Inc. All rights reserved.
11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview
More informationMORE ON MULTIPLE REGRESSION
DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 816 MORE ON MULTIPLE REGRESSION I. AGENDA: A. Multiple regression 1. Categorical variables with more than two categories 2. Interaction
More informationRegression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.
Regression, part II I. What does it all mean? A) Notice that so far all we ve done is math. 1) One can calculate the Least Squares Regression Line for anything, regardless of any assumptions. 2) But, if
More informationContents. TAMS38 - Lecture 6 Factorial design, Latin Square Design. Lecturer: Zhenxia Liu. Factorial design 3. Complete three factor design 4
Contents Factorial design TAMS38 - Lecture 6 Factorial design, Latin Square Design Lecturer: Zhenxia Liu Department of Mathematics - Mathematical Statistics 28 November, 2017 Complete three factor design
More informationOne-Way Analysis of Variance (ANOVA)
1 One-Way Analysis of Variance (ANOVA) One-Way Analysis of Variance (ANOVA) is a method for comparing the means of a populations. This kind of problem arises in two different settings 1. When a independent
More information1 Introduction to Minitab
1 Introduction to Minitab Minitab is a statistical analysis software package. The software is freely available to all students and is downloadable through the Technology Tab at my.calpoly.edu. When you
More informationTopic 32: Two-Way Mixed Effects Model
Topic 3: Two-Way Mixed Effects Model Outline Two-way mixed models Three-way mixed models Data for two-way design Y is the response variable Factor A with levels i = 1 to a Factor B with levels j = 1 to
More informationStat 6640 Solution to Midterm #2
Stat 6640 Solution to Midterm #2 1. A study was conducted to examine how three statistical software packages used in a statistical course affect the statistical competence a student achieves. At the end
More informationUnit 12: Analysis of Single Factor Experiments
Unit 12: Analysis of Single Factor Experiments Statistics 571: Statistical Methods Ramón V. León 7/16/2004 Unit 12 - Stat 571 - Ramón V. León 1 Introduction Chapter 8: How to compare two treatments. Chapter
More informationReference: Chapter 13 of Montgomery (8e)
Reference: Chapter 1 of Montgomery (8e) Maghsoodloo 89 Factorial Experiments with Random Factors So far emphasis has been placed on factorial experiments where all factors are at a, b, c,... fixed levels
More informationThe entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.
One-Way ANOVA Summary The One-Way ANOVA procedure is designed to construct a statistical model describing the impact of a single categorical factor X on a dependent variable Y. Tests are run to determine
More informationSMAM 314 Practice Final Examination Winter 2003
SMAM 314 Practice Final Examination Winter 2003 You may use your textbook, one page of notes and a calculator. Please hand in the notes with your exam. 1. Mark the following statements True T or False
More informationStatistics 512: Applied Linear Models. Topic 9
Topic Overview Statistics 51: Applied Linear Models Topic 9 This topic will cover Random vs. Fixed Effects Using E(MS) to obtain appropriate tests in a Random or Mixed Effects Model. Chapter 5: One-way
More informationDESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya
DESAIN EKSPERIMEN BLOCKING FACTORS Semester Genap Jurusan Teknik Industri Universitas Brawijaya Outline The Randomized Complete Block Design The Latin Square Design The Graeco-Latin Square Design Balanced
More informationLAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION
LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION In this lab you will learn how to use Excel to display the relationship between two quantitative variables, measure the strength and direction of the
More informationMultiple Pairwise Comparison Procedures in One-Way ANOVA with Fixed Effects Model
Biostatistics 250 ANOVA Multiple Comparisons 1 ORIGIN 1 Multiple Pairwise Comparison Procedures in One-Way ANOVA with Fixed Effects Model When the omnibus F-Test for ANOVA rejects the null hypothesis that
More informationIndependent Samples ANOVA
Independent Samples ANOVA In this example students were randomly assigned to one of three mnemonics (techniques for improving memory) rehearsal (the control group; simply repeat the words), visual imagery
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More information5:1LEC - BETWEEN-S FACTORIAL ANOVA
5:1LEC - BETWEEN-S FACTORIAL ANOVA The single-factor Between-S design described in previous classes is only appropriate when there is just one independent variable or factor in the study. Often, however,
More informationChapter 1 Statistical Inference
Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations
More information9 Correlation and Regression
9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the
More informationMathematical Notation Math Introduction to Applied Statistics
Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor
More informationOne-way ANOVA (Single-Factor CRD)
One-way ANOVA (Single-Factor CRD) STAT:5201 Week 3: Lecture 3 1 / 23 One-way ANOVA We have already described a completed randomized design (CRD) where treatments are randomly assigned to EUs. There is
More informationRCB - Example. STA305 week 10 1
RCB - Example An accounting firm wants to select training program for its auditors who conduct statistical sampling as part of their job. Three training methods are under consideration: home study, presentations
More informationDesign & Analysis of Experiments 7E 2009 Montgomery
1 What If There Are More Than Two Factor Levels? The t-test does not directly apply ppy There are lots of practical situations where there are either more than two levels of interest, or there are several
More informationANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS
ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing
More informationProbability and Statistics
Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT
More informationCorrelation & Simple Regression
Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.
More informationMultiple Regression Examples
Multiple Regression Examples Example: Tree data. we have seen that a simple linear regression of usable volume on diameter at chest height is not suitable, but that a quadratic model y = β 0 + β 1 x +
More informationCHAPTER 13: F PROBABILITY DISTRIBUTION
CHAPTER 13: F PROBABILITY DISTRIBUTION continuous probability distribution skewed to the right variable values on horizontal axis are 0 area under the curve represents probability horizontal asymptote
More informationy response variable x 1, x 2,, x k -- a set of explanatory variables
11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate
More informationFactorial designs. Experiments
Chapter 5: Factorial designs Petter Mostad mostad@chalmers.se Experiments Actively making changes and observing the result, to find causal relationships. Many types of experimental plans Measuring response
More informationMULTIPLE LINEAR REGRESSION IN MINITAB
MULTIPLE LINEAR REGRESSION IN MINITAB This document shows a complicated Minitab multiple regression. It includes descriptions of the Minitab commands, and the Minitab output is heavily annotated. Comments
More informationAnalysis of 2x2 Cross-Over Designs using T-Tests
Chapter 234 Analysis of 2x2 Cross-Over Designs using T-Tests Introduction This procedure analyzes data from a two-treatment, two-period (2x2) cross-over design. The response is assumed to be a continuous
More informationSimple Linear Regression: One Quantitative IV
Simple Linear Regression: One Quantitative IV Linear regression is frequently used to explain variation observed in a dependent variable (DV) with theoretically linked independent variables (IV). For example,
More informationDepartment of Mathematics & Statistics Stat 2593 Final Examination 19 April 2001
Department of Mathematics & Statistics Stat 2593 Final Examination 19 April 2001 TIME: 3 hours. Total Marks: 60. Indicate your answers clearly. Show all work. Remember to answer as a statistician should.
More informationCHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups 10. Comparing Two Means The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Comparing Two Means Learning
More informationCHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups 10.2 Comparing Two Means The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Comparing Two Means Learning
More informationMULTIPLE REGRESSION METHODS
DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 816 MULTIPLE REGRESSION METHODS I. AGENDA: A. Residuals B. Transformations 1. A useful procedure for making transformations C. Reading:
More information1 Use of indicator random variables. (Chapter 8)
1 Use of indicator random variables. (Chapter 8) let I(A) = 1 if the event A occurs, and I(A) = 0 otherwise. I(A) is referred to as the indicator of the event A. The notation I A is often used. 1 2 Fitting
More informationPower Analysis for One-Way ANOVA
Chapter 12 Power Analysis for One-Way ANOVA Recall that the power of a statistical test is the probability of rejecting H 0 when H 0 is false, and some alternative hypothesis H 1 is true. We saw earlier
More informationKeppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means
Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means 4.1 The Need for Analytical Comparisons...the between-groups sum of squares averages the differences
More informationUnit 5: How a Randomized Block Design Differs From a One-Way ANOVA
Minitab Notes for STT 6305: nalysis of Variance Models Department of Statistics and iostatistics SU East ay Unit 5: How a Randomized lock Design Differs From a One-Way NOV 5.1. First Dataset Real Estate
More informationCHAPTER 13: F PROBABILITY DISTRIBUTION
CHAPTER 13: F PROBABILITY DISTRIBUTION continuous probability distribution skewed to the right variable values on horizontal axis are 0 area under the curve represents probability horizontal asymptote
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More information8.1-4 Test of Hypotheses Based on a Single Sample
8.1-4 Test of Hypotheses Based on a Single Sample Example 1 (Example 8.6, p. 312) A manufacturer of sprinkler systems used for fire protection in office buildings claims that the true average system-activation
More informationChapter 14. Simple Linear Regression Preliminary Remarks The Simple Linear Regression Model
Chapter 14 Simple Linear Regression 14.1 Preliminary Remarks We have only three lectures to introduce the ideas of regression. Worse, they are the last three lectures when you have many other things academic
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More informationIntroduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs
Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs The Analysis of Variance (ANOVA) The analysis of variance (ANOVA) is a statistical technique
More informationPLS205 Lab 2 January 15, Laboratory Topic 3
PLS205 Lab 2 January 15, 2015 Laboratory Topic 3 General format of ANOVA in SAS Testing the assumption of homogeneity of variances by "/hovtest" by ANOVA of squared residuals Proc Power for ANOVA One-way
More informationAnalysis of Covariance (ANCOVA) with Two Groups
Chapter 226 Analysis of Covariance (ANCOVA) with Two Groups Introduction This procedure performs analysis of covariance (ANCOVA) for a grouping variable with 2 groups and one covariate variable. This procedure
More informationThe simple linear regression model discussed in Chapter 13 was written as
1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple
More informationSoil Phosphorus Discussion
Solution: Soil Phosphorus Discussion Summary This analysis is ambiguous: there are two reasonable approaches which yield different results. Both lead to the conclusion that there is not an independent
More informationEX1. One way ANOVA: miles versus Plug. a) What are the hypotheses to be tested? b) What are df 1 and df 2? Verify by hand. , y 3
EX. Chapter 8 Examples In an experiment to investigate the performance of four different brands of spark plugs intended for the use on a motorcycle, plugs of each brand were tested and the number of miles
More informationSTAT 350: Summer Semester Midterm 1: Solutions
Name: Student Number: STAT 350: Summer Semester 2008 Midterm 1: Solutions 9 June 2008 Instructor: Richard Lockhart Instructions: This is an open book test. You may use notes, text, other books and a calculator.
More information, (1) e i = ˆσ 1 h ii. c 2016, Jeffrey S. Simonoff 1
Regression diagnostics As is true of all statistical methodologies, linear regression analysis can be a very effective way to model data, as along as the assumptions being made are true. For the regression
More informationSwarthmore Honors Exam 2012: Statistics
Swarthmore Honors Exam 2012: Statistics 1 Swarthmore Honors Exam 2012: Statistics John W. Emerson, Yale University NAME: Instructions: This is a closed-book three-hour exam having six questions. You may
More informationUsing SPSS for One Way Analysis of Variance
Using SPSS for One Way Analysis of Variance This tutorial will show you how to use SPSS version 12 to perform a one-way, between- subjects analysis of variance and related post-hoc tests. This tutorial
More informationOpen book and notes. 120 minutes. Covers Chapters 8 through 14 of Montgomery and Runger (fourth edition).
IE 330 Seat # Open book and notes 10 minutes Covers Chapters 8 through 14 of Montgomery and Runger (fourth edition) Cover page and eight pages of exam No calculator ( points) I have, or will, complete
More informationOrthogonal contrasts for a 2x2 factorial design Example p130
Week 9: Orthogonal comparisons for a 2x2 factorial design. The general two-factor factorial arrangement. Interaction and additivity. ANOVA summary table, tests, CIs. Planned/post-hoc comparisons for the
More informationSTAT 705 Chapter 19: Two-way ANOVA
STAT 705 Chapter 19: Two-way ANOVA Adapted from Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 41 Two-way ANOVA This material is covered in Sections
More informationMATH Notebook 3 Spring 2018
MATH448001 Notebook 3 Spring 2018 prepared by Professor Jenny Baglivo c Copyright 2010 2018 by Jenny A. Baglivo. All Rights Reserved. 3 MATH448001 Notebook 3 3 3.1 One Way Layout........................................
More informationSix Sigma Black Belt Study Guides
Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships
More informationThe One-Way Independent-Samples ANOVA. (For Between-Subjects Designs)
The One-Way Independent-Samples ANOVA (For Between-Subjects Designs) Computations for the ANOVA In computing the terms required for the F-statistic, we won t explicitly compute any sample variances or
More informationSTAT 705 Chapter 19: Two-way ANOVA
STAT 705 Chapter 19: Two-way ANOVA Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 38 Two-way ANOVA Material covered in Sections 19.2 19.4, but a bit
More informationUnbalanced Designs Mechanics. Estimate of σ 2 becomes weighted average of treatment combination sample variances.
Unbalanced Designs Mechanics Estimate of σ 2 becomes weighted average of treatment combination sample variances. Types of SS Difference depends on what hypotheses are tested and how differing sample sizes
More informationContents. 2 2 factorial design 4
Contents TAMS38 - Lecture 10 Response surface methodology Lecturer: Zhenxia Liu Department of Mathematics - Mathematical Statistics 12 December, 2017 2 2 factorial design Polynomial Regression model First
More informationHow To: Analyze a Split-Plot Design Using STATGRAPHICS Centurion
How To: Analyze a SplitPlot Design Using STATGRAPHICS Centurion by Dr. Neil W. Polhemus August 13, 2005 Introduction When performing an experiment involving several factors, it is best to randomize the
More information10 Model Checking and Regression Diagnostics
10 Model Checking and Regression Diagnostics The simple linear regression model is usually written as i = β 0 + β 1 i + ɛ i where the ɛ i s are independent normal random variables with mean 0 and variance
More informationMultiple samples: Modeling and ANOVA
Multiple samples: Modeling and Patrick Breheny April 29 Patrick Breheny Introduction to Biostatistics (171:161) 1/23 Multiple group studies In the latter half of this course, we have discussed the analysis
More informationDESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya
DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Jurusan Teknik Industri Universitas Brawijaya Outline Introduction The Analysis of Variance Models for the Data Post-ANOVA Comparison of Means Sample
More informationANOVA: Analysis of Variation
ANOVA: Analysis of Variation The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative variables depend on which group (given by categorical
More informationContrasts and Multiple Comparisons Supplement for Pages
Contrasts and Multiple Comparisons Supplement for Pages 302-323 Brian Habing University of South Carolina Last Updated: July 20, 2001 The F-test from the ANOVA table allows us to test the null hypothesis
More informationChapter Seven: Multi-Sample Methods 1/52
Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze
More informationUnbalanced Data in Factorials Types I, II, III SS Part 1
Unbalanced Data in Factorials Types I, II, III SS Part 1 Chapter 10 in Oehlert STAT:5201 Week 9 - Lecture 2 1 / 14 When we perform an ANOVA, we try to quantify the amount of variability in the data accounted
More informationTwo-factor studies. STAT 525 Chapter 19 and 20. Professor Olga Vitek
Two-factor studies STAT 525 Chapter 19 and 20 Professor Olga Vitek December 2, 2010 19 Overview Now have two factors (A and B) Suppose each factor has two levels Could analyze as one factor with 4 levels
More information2 Naïve Methods. 2.1 Complete or available case analysis
2 Naïve Methods Before discussing methods for taking account of missingness when the missingness pattern can be assumed to be MAR in the next three chapters, we review some simple methods for handling
More information(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.
FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December
More information1 Using standard errors when comparing estimated values
MLPR Assignment Part : General comments Below are comments on some recurring issues I came across when marking the second part of the assignment, which I thought it would help to explain in more detail
More informationWarm-up Using the given data Create a scatterplot Find the regression line
Time at the lunch table Caloric intake 21.4 472 30.8 498 37.7 335 32.8 423 39.5 437 22.8 508 34.1 431 33.9 479 43.8 454 42.4 450 43.1 410 29.2 504 31.3 437 28.6 489 32.9 436 30.6 480 35.1 439 33.0 444
More informationVariance Estimates and the F Ratio. ERSH 8310 Lecture 3 September 2, 2009
Variance Estimates and the F Ratio ERSH 8310 Lecture 3 September 2, 2009 Today s Class Completing the analysis (the ANOVA table) Evaluating the F ratio Errors in hypothesis testing A complete numerical
More informationChapter 3. Introduction to Linear Correlation and Regression Part 3
Tuesday, December 12, 2000 Ch3 Intro Correlation Pt 3 Page: 1 Richard Lowry, 1999-2000 All rights reserved. Chapter 3. Introduction to Linear Correlation and Regression Part 3 Regression The appearance
More informationKey Features: More than one type of experimental unit and more than one randomization.
1 SPLIT PLOT DESIGNS Key Features: More than one type of experimental unit and more than one randomization. Typical Use: When one factor is difficult to change. Example (and terminology): An agricultural
More informationTAMS38 Experimental Design and Biostatistics, 4 p / 6 hp Examination on 19 April 2017, 8 12
Kurskod: TAMS38 - Provkod: TEN1 TAMS38 Experimental Design and Biostatistics, 4 p / 6 hp Examination on 19 April 2017, 8 12 The collection of the formulas in mathematical statistics prepared by Department
More informationappstats27.notebook April 06, 2017
Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves
More informationCOMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.
COMPLETELY RANDOM DESIGN (CRD) Description of the Design -Simplest design to use. -Design can be used when experimental units are essentially homogeneous. -Because of the homogeneity requirement, it may
More informationAn Old Research Question
ANOVA An Old Research Question The impact of TV on high-school grade Watch or not watch Two groups The impact of TV hours on high-school grade Exactly how much TV watching would make difference Multiple
More informationOpen book, but no loose leaf notes and no electronic devices. Points (out of 200) are in parentheses. Put all answers on the paper provided to you.
ISQS 5347 Final Exam Spring 2017 Open book, but no loose leaf notes and no electronic devices. Points (out of 200) are in parentheses. Put all answers on the paper provided to you. 1. Recall the commute
More information