1 Tomato yield example.
|
|
- Hollie McCoy
- 5 years ago
- Views:
Transcription
1 ST706 - Linear Models II. Spring 2013 Two-way Analysis of Variance examples. Here we illustrate what happens analyzing two way data in proc glm in SAS. Similar issues come up with other software where we need to worry about balance, possible weighting in defining main effects, etc.. In SAS the type I analysis use hierarchical sums of squares (test for that effect accounting only for effects above it in the table) while the type III sums of squares test for an effect accounting for all other things in the model. Note that the default in SAS is to assume that the weights used to define marginal means are equal and so effects add to 0; e.g., i α i = 0. If we want to test for equal marginal means with other weighting it needs to be customized, as illustrated below. Assume constant variance throughout; but, in practice would be evaluated and remedied as needed. 1 Tomato yield example. From Devore s book Probability and Statistics for Engineering and the Sciences and took text describing the experiment from a Duke web site. Three different varieties of tomato (Harvester, Pusa Early Dwarf, and Ife No. 1) and four different plant densities (10, 20, 30 and 40 thousand plants per hectare) are being considered for planting in a particular region. The goal of the experiment is to determine whether variety and density affect yield ( Effects of Plant Density on Tomato Yields in Western Nigeria, Experimental Agriculture, 1976: 43:47). Both factors are randomized, where the unit here is a plot of land. There are 36 such plots and the assumption is that the treatment combinations are randomized to these 36 subject to the constraint that there are 3 plots for each of the 12 treatment combinations. This is actually a setting where the marginal means are probably not of interest if there is interaction, but we use it for illustrating the computational issues nonetheless. Analysis of the tomato yield data, balanced. Illustrates balanced data with equal weights used in defining the marginal and overall means. This is a pretty straightforward analysis. Analysis Variable : Yield Variety Density N Mean Std Dev Minimum Maximum H Ife P *** SOME OF THE SAS CODE *** /* Two-way analysis with inferences for main effects */ proc glm data=a; class density variety; model yield = density variety density*variety; lsmeans density variety/cl pdiff tdiff; lsmeans density variety/cl pdiff tdiff adjust=bon; lsmeans density variety/cl pdiff tdiff adjust=tukey; 1
2 lsmeans density variety/cl pdiff tdiff adjust=scheffe; *** TWO WAY ANALYSIS. **** Sum of Source DF Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total Source DF Type I SS Mean Square F Value Pr > F Density <.0001 Variety <.0001 Density*Variety Source DF Type III SS Mean Square F Value Density Variety Density*Variety ***BECAUSE OF BALANCED DATA THE TYPE I AND TYPE III SUM OF SQUARES ARE THE SAME.*** ****ONE-AT-A-TIME INFERENCES Density Yield LSMEAN 95% Confidence Limits Difference Between 95% Confidence Limits for LSMEAN Variety Yield LSMEAN Number H Ife P Variety Yield LSMEAN 95% Confidence Limits H Ife P Difference Between 95% Confidence Limits for Adjustment for Multiple Comparisons: ****BONFERONNI ****** Least Squares Means for Effect Density 2
3 Between Confidence Limits for Between Confidence Limits for Adjustment for Multiple Comparisons: *******TUKEY******* Least Squares Means for Effect Density Between Confidence Limits for Least Squares Means for Effect Variety Between Confidence Limits for Adjustment for Multiple Comparisons: ******SCHEFFE ***** Between Confidence Limits for Least Squares Means for Effect Variety Between Confidence Limits for With balanced data means and lsmeans will give the same answer. MORE ANALYSIS OF TOMATO YIELD DATA 3
4 Using the original balanced data. Consider estimating specific linear combinations directly. Since the parameterization is in terms of effects in general we need to write c ij µ ij = µc.. + c i. α i + c.j τ j + c ij γ ij. ij i j ij Examples: 1. µ i. = µ + α i has a coefficient of 1 for µ (intercept) and 1 for α i 2. µ.j = i v iµ ij = µ + i v iα i + τ j + i v iγ ij Illustration: µ.j =.3µ 1j +.2µ 2j +.25µ 3j +.25µ 4j (v 1 =.3, v 2 =.2, etc.) = marginal mean of variety j = expected yield if planted in 30% at density 10000, 20% at density 20000, etc. 3. µ ij has coefficient 1 for µ,α i, τ j and γ ij. How to test a general hypothesis about a set of contrast? Consider testing H 0 : µ.1 = µ.2 = µ.3. This is equivalent to H 0 : µ.1 µ.2 = 0 and µ.1 µ.3 = 0. This is a hypothesis that two linear combinations equal zero. (There are many other sets of two linearly independent contrasts that can be used, but the test is independent of which two we choose.). This is used to show how to easily construct a test for equal marginal means in the presence of interaction when unequal weights are used to define the marginal means and associated main effects. proc glm data=a; class density variety; model yield = density variety density*variety/e clparm; /* ESTIMATE LINEAR COMBINATIONS WITH EQUAL WEIGHTING */ estimate variety1 versus variety3 variety 1 0-1; estimate mean variety 3 intercept 1 variety 0 0 1; estimate mean density 2 variety 3 intercept 1 density variety density*variety ; /* Two-way model with variety marginal means weighted over densities using weights.3,.2,.25 and.25 */ estimate weighted mean for variety1 intercept 1 density variety density*variety ; estimate weighted mean for variety2 intercept 1 density variety density*variety ; estimate weighted mean for variety3 intercept 1 density variety density*variety ; run; /* Showing how to test for equal marginal means with unequal weighting; tests that the specified combinationd equal 0*/ contrast test for variety with unequal weighting variety density*variety , 4
5 variety density*variety ; run; THIS COMES FROM THE e OPTION. IT SHOWS THE FORM OF ESTIMABLE FUNCTIONS ASSUMING THAT EFFECTS ADD TO 0 (SO EQUAL WEIGHTS EVERYWHERE) Effect Intercept General Form of Estimable Functions Coefficients L1 Density L2 Density L3 Density L4 Density L1-L2-L3-L4 Variety H L6 Variety Ife L7 Variety P L1-L6-L7 Density*Variety H Density*Variety Ife Density*Variety P Density*Variety H Density*Variety Ife Density*Variety P Density*Variety H Density*Variety Ife Density*Variety P Density*Variety H Density*Variety Ife Density*Variety P L9 L10 L2-L9-L10 L12 L13 L3-L12-L13 L15 L16 L4-L15-L16 L6-L9-L12-L15 L7-L10-L13-L16 L1-L2-L3-L4-L6-L7+L9+ L10+L12+L13+L15+L16 FROM ESTIMATE STATEMENTS Standard Parameter Estimate Error t Value variety1 versus variety mean variety mean density 2 variety Parameter Pr > t 95% Confidence Limits variety1 versus variety3 < mean variety 3 < mean density 2 variety 3 < Parameter Estimate Error t Value weighted mean for variety weighted mean for variety weighted mean for variety Parameter Pr > t 95% Confidence Limits weighted mean for variety1 < weighted mean for variety2 <
6 weighted mean for variety3 < TEST FROM USE OF THE CONTRAST STATEMENT Contrast DF Contrast SS test for variety with unequal weighting Contrast Mean Square F Value test for variety with unequal weighting Contrast Pr > F test for variety with unequal weighting <.0001 Fitting an additive model using the balanced data. This part fits an additive model. Note that the sums of squares associated with variety and density are the same as in the model with no interaction. This is because of the balance in the data. The test for the main effects is correct no matter what weights are used in defining the main effects (unlike the situation with interaction where the test in the anova table for the main effects in the presence of interaction is only okay if equal weights are involved.) The reason is that since µ ij = µ + α i + τ j, µ.j µ.j = τ j τ j for any weighting (similarly, µ i. µ i. = α i α i ). The consequence here is that the original SS due to interaction (and its associated degrees of freedom) are collapsed into the error term. The Type I and Type III analyses are the same. /* Fitting a model with no interaction */ proc glm data=a; class density variety; model yield = density variety; means density variety; run; Sum of Source DF Squares Mean Square F Value Pr > F Model <.0001 Error Corrected Total Source DF Type III SS Mean Square F Value Pr > F Density <.0001 Variety < Fitting using unbalanced data. Some observations were eliminated to create an unbalanced data set. group N Mean Std Dev 10000H Ife P H Ife P H Ife P H
7 40000Ife P Analysis of model with interaction. The type III analysis is the correct one to use to test for main effects in the presence of interaction, not the type I. (The correctness of the Type III analysis assumes that effects are defined using equal weights; otherwise we need to customize the test as done with the balanced data.) As seen below the Type I sums of squares are the ones that are applicable for a model with no interaction. Notice that the test for interaction is the same in both analyses. Assuming effects are defined with equal weights the lsmeans provides the right analysis of main effects. For example the estimate of the mean at density 1000 is the average of 9.85, and 16.3 which is the given by lsmeans. The means statement just averages all of the values at the given level of a factor. This is a weighted average of the corresponding cell means and would only be appropriate if the marginal means were defined using weights that were proportional to the sample size. proc glm data=au; /* using unbalanced data*/ class density variety; model yield = density variety density*variety; lsmeans density variety/cl pdiff tdiff; means density variety/clm cldiff t; run; Sum of Source DF Squares Mean Square F Value Model Error Corrected Total Source DF Type I SS Mean Square F Value Pr > F Density <.0001 Variety <.0001 Density*Variety Source DF Type III SS Mean Square F Value Density Variety Density*Variety FROM LSMEAN Density Yield LSMEAN 95% Confidence Limits FROM MEANS t Confidence Intervals for Yield Density N Mean 95% Confidence Limits
8 Analysis with no interaction. Type I and Type III SS s agree. These values shows that the Type I sums of squares given above in the ANOVA table with interaction are the ones that apply to an additive model. The analyses of main effects should still be carried out using the lsmeans option (assuming equal weighting). Dependent Variable: Yield Source DF Type I SS Mean Square F Value Density Variety Source DF Type III SS Mean Square F Value Density Variety Senic data example. This data (described in Appendix C.1, see also problem 19.1 of Kutner et al.) is from a sample of hospitals in the U.S. One problem of interest is examining how the infection rates vary with geographical region (A) and whether the hospital has a medical school affiliation or not (B). This is an example where neither factor is experimental. The definition of marginal means and the overall mean are more general than layout in the notes. Suppose in the population the number of hospitals at geographic region i and affiliation j is N ij and the total number of hospitals is N. Define a ij = N ij /N. Then the overall population mean is µ = ij a ijµ ij while the marginal means are given by µ i. = j N ijµ ij /N i. = j w ijµ ij, where w ij = N ij /N i. and µ.j = i v ijµ ij, where v ij = N ij /N.j. So, the weights are proportional to the population sizes. If unknown and we have an overall random sample then we can estimated weights using weights that are proportional to the sample sizes; e.g., w ij = n ij /n i.. Here the use of the means option in SAS produces a correct analysis for the marginal means (estimates and confidence intervals) since if we take a weighted average of cell means using weights proportional to sample size we simply get a mean of all observations over cells that are being combined. In the glm analysis, the test for interaction is okay. It also turns out in this case that the Type I analysis gives the correct test for a main effect in the presence of interaction as long as that main effect is the first one entered into the model. (This is not always true but happens here because the weights are proportional to the sample sizes.) proc glm data=a; class medsch region; model risk = medsch region medsch*region; *lsmeans medsch region/cl; /* Incorrect with weighting*/ means medsch region/clm t; FROM PROC MEANS region medsch N Mean Std Dev region N Mean Std Dev
9 FROM THE GLM ANALYSIS Sum of Source DF Squares Mean Square F Value Model Error Source DF Type I SS Mean Square F Value Pr > F medsch region medsch*region Source DF Type III SS Mean Square F Value Pr > F medsch region medsch*region t Confidence Intervals for risk Error Degrees of Freedom 105 Error Mean Square Critical Value of t % Confidence medsch N Mean Limits % Confidence region N Mean Limits ***ADDITIVE MODEL **** model risk = medsch region; Sum of Source DF Squares Mean Square F Value Model Error Source DF Type I SS Mean Square F Value medsch region Source Pr > F medsch region Source DF Type III SS Mean Square F Value medsch region Source Pr > F medsch region The Type I sums of squares depend on the order of the variables in the model statement. With A = medsch and B = region. SSA = SSE(µ) SSE(µ, α), where SSE(µ) is the error sum of squares for model with just µ in in 9
10 and SSE(µ, α) from the model with µ + α i. SSB = SSE(µ, α) SSE(µ, α, τ). Additional Comments on the senic example. Below is a confidence interval for difference in the two med school levels and simultaneous confidence intervals for the pairwise differences in regions. I ve shown the Bonferroni intervals since they are shorter than Scheffe s. The Tukey intervals are even shorter but with unequal sample sizes Tukey s method (called Tukey-Kramer) is only approximate and performance can be questionable with very unequal sample sizes. The simultaneous pairwise comparisons can be used to carry out a test of equal marginal means across regions. We would reject since we reject that the means for regions 1 and 3 are the same. This also shows how to explicitly carry out a test for equal means across the two medical school categories while allowing interactions using an estimate statement with weights based on the sample sizes. We can use estimate here since there is only one difference. However, constructing the tests this way is not necessary. With weights proportional to sample sizes the estimates of the marginal means are the overall means and the sums of squares for the main effects are the same sums of squares as we would compute for them if viewed as a one-way problem; see page 113 of the notes. This is exactly what gets calculated for the first main effect in the Type I SS. So, the test for medschool from the Type I analysis above is correct with F = 6.61 and a p-value of.0115 agrees with the t-test below, any difference being due to rounding in the weighting. proc glm data=a; class medsch region; model risk = medsch region medsch*region; means medsch/clm cldiff t; means region/clm cldiff bon scheffe tukey; estimate test for medsch with unequal weighting medsch 1-1 region medsch*region ; t Tests (LSD) for risk Alpha 0.05 Error Degrees of Freedom 105 Error Mean Square Critical Value of t Difference medsch Between 95% Confidence Comparison Means Limits *** *** Bonferroni (Dunn) t Tests for risk Difference Simultaneous region Between 95% Confidence Comparison Means Limits ***
11 *** Parameter test for medsch with unequal weighting Standard Estimate Error t Value Pr > t Similarly one could construct an overall F-test for equal means across regions using contrasts. In this case three contrasts are needed since there are four regions; for example with region as factor B you can use µ.1 µ.2, µ.1 µ.3 and µ.1 µ.4. But, as with testing for medschool, you don t need to do this in this case. The analysis above for regions is not correct but we can get the correct test by running the analysis with region first in the model statement and using the Type I analysis. The F-test with a p-value of.0404 below is the correct test for differences among regions. proc glm data=a; class medsch region; model risk = region medsch medsch*region; run; Source DF Type I SS Mean Square F Value P value region medsch medsch*region BIBD example Analysis of BIBD example; martindale wear testing. BIBD EXAMPLE (Martindale wear data). title bibd example ; data values; infile wear.dat ; input run type loss; proc glm; class run type; model loss = run type; lsmeans type/stderr cl tdiff; estimate type 1 mean-i intercept 1 type 1; estimate type 1 mean-ii intercept 7 type 7 run /divisor=7; estimate type1-type2 type 1-1; lsmeans type/cl pdiff=all adjust=tukey; run; Dependent Variable: LOSS Sum of Mean Source DF Squares Square F Value Pr > F Model Error Corrected Total R-Square C.V. Root MSE LOSS Mean
12 Source DF Type I SS Mean Square F Value Pr > F RUN TYPE Source DF Type III SS Mean Square F Value Pr > F RUN TYPE **The means option does not produce anything useful here as it uses mean for a level of a factor over the nonempty cells for that level. Instead we use lsmeans** Standard LSMEAN type loss LSMEAN Error Pr > t Number < < < < < < < Least Squares Means for Effect type t for H0: LSMean(i)=LSMean(j) / Pr > t Dependent Variable: loss i/j < <.0001 < <.0001 <.0001 <.0001 < < < < < < <.0001 < < < <.0001 <.0001 <.0001 < < <.0001 type loss LSMEAN 95% Confidence Limits Least Squares Means Adjustment for Multiple Comparisons: Tukey-Kramer Least Squares Means for effect type Pr > t for H0: LSMean(i)=LSMean(j) Dependent Variable: loss i/j
13 <.0001 <.0001 < < < < < < < < <.0001 <.0001 <.0001 < < <.0001 Least Squares Means for Effect type Between Confidence Limits for Standard Parameter Estimate Error t Value Pr > t type 1 mean-i <.0001 type 1 mean-ii <.0001 type1-type <
Outline. Topic 19 - Inference. The Cell Means Model. Estimates. Inference for Means Differences in cell means Contrasts. STAT Fall 2013
Topic 19 - Inference - Fall 2013 Outline Inference for Means Differences in cell means Contrasts Multiplicity Topic 19 2 The Cell Means Model Expressed numerically Y ij = µ i + ε ij where µ i is the theoretical
More informationSAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model
Topic 23 - Unequal Replication Data Model Outline - Fall 2013 Parameter Estimates Inference Topic 23 2 Example Page 954 Data for Two Factor ANOVA Y is the response variable Factor A has levels i = 1, 2,...,
More informationT-test: means of Spock's judge versus all other judges 1 12:10 Wednesday, January 5, judge1 N Mean Std Dev Std Err Minimum Maximum
T-test: means of Spock's judge versus all other judges 1 The TTEST Procedure Variable: pcwomen judge1 N Mean Std Dev Std Err Minimum Maximum OTHER 37 29.4919 7.4308 1.2216 16.5000 48.9000 SPOCKS 9 14.6222
More informationSTAT 705 Chapter 19: Two-way ANOVA
STAT 705 Chapter 19: Two-way ANOVA Adapted from Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 41 Two-way ANOVA This material is covered in Sections
More informationSTAT 705 Chapter 19: Two-way ANOVA
STAT 705 Chapter 19: Two-way ANOVA Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 38 Two-way ANOVA Material covered in Sections 19.2 19.4, but a bit
More information4.8 Alternate Analysis as a Oneway ANOVA
4.8 Alternate Analysis as a Oneway ANOVA Suppose we have data from a two-factor factorial design. The following method can be used to perform a multiple comparison test to compare treatment means as well
More informationLinear Combinations of Group Means
Linear Combinations of Group Means Look at the handicap example on p. 150 of the text. proc means data=mth567.disability; class handicap; var score; proc sort data=mth567.disability; by handicap; proc
More informationLecture 5: Comparing Treatment Means Montgomery: Section 3-5
Lecture 5: Comparing Treatment Means Montgomery: Section 3-5 Page 1 Linear Combination of Means ANOVA: y ij = µ + τ i + ɛ ij = µ i + ɛ ij Linear combination: L = c 1 µ 1 + c 1 µ 2 +...+ c a µ a = a i=1
More informationSTA 303H1F: Two-way Analysis of Variance Practice Problems
STA 303H1F: Two-way Analysis of Variance Practice Problems 1. In the Pygmalion example from lecture, why are the average scores of the platoon used as the response variable, rather than the scores of the
More informationTopic 28: Unequal Replication in Two-Way ANOVA
Topic 28: Unequal Replication in Two-Way ANOVA Outline Two-way ANOVA with unequal numbers of observations in the cells Data and model Regression approach Parameter estimates Previous analyses with constant
More informationLinear Combinations. Comparison of treatment means. Bruce A Craig. Department of Statistics Purdue University. STAT 514 Topic 6 1
Linear Combinations Comparison of treatment means Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 6 1 Linear Combinations of Means y ij = µ + τ i + ǫ ij = µ i + ǫ ij Often study
More informationLeast Squares Analyses of Variance and Covariance
Least Squares Analyses of Variance and Covariance One-Way ANOVA Read Sections 1 and 2 in Chapter 16 of Howell. Run the program ANOVA1- LS.sas, which can be found on my SAS programs page. The data here
More informationLec 1: An Introduction to ANOVA
Ying Li Stockholm University October 31, 2011 Three end-aisle displays Which is the best? Design of the Experiment Identify the stores of the similar size and type. The displays are randomly assigned to
More informationSTAT 705 Chapters 23 and 24: Two factors, unequal sample sizes; multi-factor ANOVA
STAT 705 Chapters 23 and 24: Two factors, unequal sample sizes; multi-factor ANOVA Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 22 Balanced vs. unbalanced
More informationST505/S697R: Fall Homework 2 Solution.
ST505/S69R: Fall 2012. Homework 2 Solution. 1. 1a; problem 1.22 Below is the summary information (edited) from the regression (using R output); code at end of solution as is code and output for SAS. a)
More informationdata proc sort proc corr run proc reg run proc glm run proc glm run proc glm run proc reg CONMAIN CONINT run proc reg DUMMAIN DUMINT run proc reg
data one; input id Y group X; I1=0;I2=0;I3=0;if group=1 then I1=1;if group=2 then I2=1;if group=3 then I3=1; IINT1=I1*X;IINT2=I2*X;IINT3=I3*X; *************************************************************************;
More informationDescriptions of post-hoc tests
Experimental Statistics II Page 81 Descriptions of post-hoc tests Post-hoc or Post-ANOVA tests! Once you have found out some treatment(s) are different, how do you determine which one(s) are different?
More informationANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS
ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing
More informationChapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs
Chapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs Adapted from Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis
More informationTwo-factor studies. STAT 525 Chapter 19 and 20. Professor Olga Vitek
Two-factor studies STAT 525 Chapter 19 and 20 Professor Olga Vitek December 2, 2010 19 Overview Now have two factors (A and B) Suppose each factor has two levels Could analyze as one factor with 4 levels
More informationN J SS W /df W N - 1
One-Way ANOVA Source Table ANOVA MODEL: ij = µ* + α j + ε ij H 0 : µ = µ =... = µ j or H 0 : Σα j = 0 Source Sum of Squares df Mean Squares F J Between Groups nj( j * ) J - SS B /(J ) MS B /MS W = ( N
More informationOutline Topic 21 - Two Factor ANOVA
Outline Topic 21 - Two Factor ANOVA Data Model Parameter Estimates - Fall 2013 Equal Sample Size One replicate per cell Unequal Sample size Topic 21 2 Overview Now have two factors (A and B) Suppose each
More informationWITHIN-PARTICIPANT EXPERIMENTAL DESIGNS
1 WITHIN-PARTICIPANT EXPERIMENTAL DESIGNS I. Single-factor designs: the model is: yij i j ij ij where: yij score for person j under treatment level i (i = 1,..., I; j = 1,..., n) overall mean βi treatment
More informationTopic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model
Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is
More informationPLS205!! Lab 9!! March 6, Topic 13: Covariance Analysis
PLS205!! Lab 9!! March 6, 2014 Topic 13: Covariance Analysis Covariable as a tool for increasing precision Carrying out a full ANCOVA Testing ANOVA assumptions Happiness! Covariable as a Tool for Increasing
More informationPLS205 Lab 2 January 15, Laboratory Topic 3
PLS205 Lab 2 January 15, 2015 Laboratory Topic 3 General format of ANOVA in SAS Testing the assumption of homogeneity of variances by "/hovtest" by ANOVA of squared residuals Proc Power for ANOVA One-way
More informationLaboratory Topics 4 & 5
PLS205 Lab 3 January 23, 2014 Orthogonal contrasts Class comparisons in SAS Trend analysis in SAS Multiple mean comparisons Laboratory Topics 4 & 5 Orthogonal contrasts Planned, single degree-of-freedom
More informationV. Experiments With Two Crossed Treatment Factors
V. Experiments With Two Crossed Treatment Factors A.The Experimental Design Completely Randomized Design (CRD) Let A be a factor with levels i = 1,,a B be a factor with levels j = 1,,b Y ijt = the response
More information171:162 Design and Analysis of Biomedical Studies, Summer 2011 Exam #3, July 16th
Name 171:162 Design and Analysis of Biomedical Studies, Summer 2011 Exam #3, July 16th Use the selected SAS output to help you answer the questions. The SAS output is all at the back of the exam on pages
More informationSTAT 115:Experimental Designs
STAT 115:Experimental Designs Josefina V. Almeda 2013 Multisample inference: Analysis of Variance 1 Learning Objectives 1. Describe Analysis of Variance (ANOVA) 2. Explain the Rationale of ANOVA 3. Compare
More informationPairwise multiple comparisons are easy to compute using SAS Proc GLM. The basic statement is:
Pairwise Multiple Comparisons in SAS Pairwise multiple comparisons are easy to compute using SAS Proc GLM. The basic statement is: means effects / options Here, means is the statement initiat, effects
More informationTopic 20: Single Factor Analysis of Variance
Topic 20: Single Factor Analysis of Variance Outline Single factor Analysis of Variance One set of treatments Cell means model Factor effects model Link to linear regression using indicator explanatory
More informationTopic 32: Two-Way Mixed Effects Model
Topic 3: Two-Way Mixed Effects Model Outline Two-way mixed models Three-way mixed models Data for two-way design Y is the response variable Factor A with levels i = 1 to a Factor B with levels j = 1 to
More informationT.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS
ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only
More informationIntroduction. Chapter 8
Chapter 8 Introduction In general, a researcher wants to compare one treatment against another. The analysis of variance (ANOVA) is a general test for comparing treatment means. When the null hypothesis
More informationOutline. Topic 22 - Interaction in Two Factor ANOVA. Interaction Not Significant. General Plan
Topic 22 - Interaction in Two Factor ANOVA - Fall 2013 Outline Strategies for Analysis when interaction not present when interaction present when n ij = 1 when factor(s) quantitative Topic 22 2 General
More informationChapter 10. Design of Experiments and Analysis of Variance
Chapter 10 Design of Experiments and Analysis of Variance Elements of a Designed Experiment Response variable Also called the dependent variable Factors (quantitative and qualitative) Also called the independent
More informationCOMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.
COMPLETELY RANDOM DESIGN (CRD) Description of the Design -Simplest design to use. -Design can be used when experimental units are essentially homogeneous. -Because of the homogeneity requirement, it may
More informationThis is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.
EXST3201 Chapter 13c Geaghan Fall 2005: Page 1 Linear Models Y ij = µ + βi + τ j + βτij + εijk This is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.
More informationIntroduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes)
Introduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes) Asheber Abebe Discrete and Statistical Sciences Auburn University Contents 1 Completely Randomized Design
More informationTopic 23: Diagnostics and Remedies
Topic 23: Diagnostics and Remedies Outline Diagnostics residual checks ANOVA remedial measures Diagnostics Overview We will take the diagnostics and remedial measures that we learned for regression and
More informationSimple, Marginal, and Interaction Effects in General Linear Models: Part 1
Simple, Marginal, and Interaction Effects in General Linear Models: Part 1 PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 2: August 24, 2012 PSYC 943: Lecture 2 Today s Class Centering and
More informationAnalysis of Covariance
Analysis of Covariance (ANCOVA) Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 10 1 When to Use ANCOVA In experiment, there is a nuisance factor x that is 1 Correlated with y 2
More informationUnbalanced Data in Factorials Types I, II, III SS Part 2
Unbalanced Data in Factorials Types I, II, III SS Part 2 Chapter 10 in Oehlert STAT:5201 Week 9 - Lecture 2b 1 / 29 Types of sums of squares Type II SS The Type II SS relates to the extra variability explained
More information2 Hand-out 2. Dr. M. P. M. M. M c Loughlin Revised 2018
Math 403 - P. & S. III - Dr. McLoughlin - 1 2018 2 Hand-out 2 Dr. M. P. M. M. M c Loughlin Revised 2018 3. Fundamentals 3.1. Preliminaries. Suppose we can produce a random sample of weights of 10 year-olds
More informationANOVA Analysis of Variance
ANOVA Analysis of Variance ANOVA Analysis of Variance Extends independent samples t test ANOVA Analysis of Variance Extends independent samples t test Compares the means of groups of independent observations
More informationStatistics for exp. medical researchers Comparison of groups, T-tests and ANOVA
Faculty of Health Sciences Outline Statistics for exp. medical researchers Comparison of groups, T-tests and ANOVA Lene Theil Skovgaard Sept. 14, 2015 Paired comparisons: tests and confidence intervals
More informationLecture 3: Inference in SLR
Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals
More informationThe legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.
1 Chapter 1: Research Design Principles The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization. 2 Chapter 2: Completely Randomized Design
More informationTopic 14: Inference in Multiple Regression
Topic 14: Inference in Multiple Regression Outline Review multiple linear regression Inference of regression coefficients Application to book example Inference of mean Application to book example Inference
More informationCorrelation and the Analysis of Variance Approach to Simple Linear Regression
Correlation and the Analysis of Variance Approach to Simple Linear Regression Biometry 755 Spring 2009 Correlation and the Analysis of Variance Approach to Simple Linear Regression p. 1/35 Correlation
More informationAnalysis of variance and regression. April 17, Contents Comparison of several groups One-way ANOVA. Two-way ANOVA Interaction Model checking
Analysis of variance and regression Contents Comparison of several groups One-way ANOVA April 7, 008 Two-way ANOVA Interaction Model checking ANOVA, April 008 Comparison of or more groups Julie Lyng Forman,
More informationChap The McGraw-Hill Companies, Inc. All rights reserved.
11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview
More information4:3 LEC - PLANNED COMPARISONS AND REGRESSION ANALYSES
4:3 LEC - PLANNED COMPARISONS AND REGRESSION ANALYSES FOR SINGLE FACTOR BETWEEN-S DESIGNS Planned or A Priori Comparisons We previously showed various ways to test all possible pairwise comparisons for
More informationSTAT 506: Randomized complete block designs
STAT 506: Randomized complete block designs Timothy Hanson Department of Statistics, University of South Carolina STAT 506: Introduction to Experimental Design 1 / 10 Randomized complete block designs
More informationLecture 27 Two-Way ANOVA: Interaction
Lecture 27 Two-Way ANOVA: Interaction STAT 512 Spring 2011 Background Reading KNNL: Chapter 19 27-1 Topic Overview Review: Two-way ANOVA Models Basic Strategy for Analysis Studying Interactions 27-2 Two-way
More informationBiological Applications of ANOVA - Examples and Readings
BIO 575 Biological Applications of ANOVA - Winter Quarter 2010 Page 1 ANOVA Pac Biological Applications of ANOVA - Examples and Readings One-factor Model I (Fixed Effects) This is the same example for
More informationSTAT 401A - Statistical Methods for Research Workers
STAT 401A - Statistical Methods for Research Workers One-way ANOVA Jarad Niemi (Dr. J) Iowa State University last updated: October 10, 2014 Jarad Niemi (Iowa State) One-way ANOVA October 10, 2014 1 / 39
More informationAnalysis of variance. April 16, Contents Comparison of several groups
Contents Comparison of several groups Analysis of variance April 16, 2009 One-way ANOVA Two-way ANOVA Interaction Model checking Acknowledgement for use of presentation Julie Lyng Forman, Dept. of Biostatistics
More informationIncomplete Block Designs
Incomplete Block Designs Recall: in randomized complete block design, each of a treatments was used once within each of b blocks. In some situations, it will not be possible to use each of a treatments
More informationDESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective
DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective Second Edition Scott E. Maxwell Uniuersity of Notre Dame Harold D. Delaney Uniuersity of New Mexico J,t{,.?; LAWRENCE ERLBAUM ASSOCIATES,
More informationRepeated Measures Part 2: Cartoon data
Repeated Measures Part 2: Cartoon data /*********************** cartoonglm.sas ******************/ options linesize=79 noovp formdlim='_'; title 'Cartoon Data: STA442/1008 F 2005'; proc format; /* value
More informationAnalysis of variance. April 16, 2009
Analysis of variance April 16, 2009 Contents Comparison of several groups One-way ANOVA Two-way ANOVA Interaction Model checking Acknowledgement for use of presentation Julie Lyng Forman, Dept. of Biostatistics
More informationThree Factor Completely Randomized Design with One Continuous Factor: Using SPSS GLM UNIVARIATE R. C. Gardner Department of Psychology
Data_Analysis.calm Three Factor Completely Randomized Design with One Continuous Factor: Using SPSS GLM UNIVARIATE R. C. Gardner Department of Psychology This article considers a three factor completely
More informationOne-Way Analysis of Variance (ANOVA) There are two key differences regarding the explanatory variable X.
One-Way Analysis of Variance (ANOVA) Also called single factor ANOVA. The response variable Y is continuous (same as in regression). There are two key differences regarding the explanatory variable X.
More informationTopic 29: Three-Way ANOVA
Topic 29: Three-Way ANOVA Outline Three-way ANOVA Data Model Inference Data for three-way ANOVA Y, the response variable Factor A with levels i = 1 to a Factor B with levels j = 1 to b Factor C with levels
More informationTopic 12. The Split-plot Design and its Relatives (continued) Repeated Measures
12.1 Topic 12. The Split-plot Design and its Relatives (continued) Repeated Measures 12.9 Repeated measures analysis Sometimes researchers make multiple measurements on the same experimental unit. We have
More informationOrthogonal and Non-orthogonal Polynomial Constrasts
Orthogonal and Non-orthogonal Polynomial Constrasts We had carefully reviewed orthogonal polynomial contrasts in class and noted that Brian Yandell makes a compelling case for nonorthogonal polynomial
More informationOutline. Analysis of Variance. Comparison of 2 or more groups. Acknowledgements. Comparison of serveral groups
Outline Analysis of Variance Analysis of variance and regression course http://staff.pubhealth.ku.dk/~jufo/varianceregressionf2011.html Comparison of serveral groups Model checking Marc Andersen, mja@statgroup.dk
More information1. (Problem 3.4 in OLRT)
STAT:5201 Homework 5 Solutions 1. (Problem 3.4 in OLRT) The relationship of the untransformed data is shown below. There does appear to be a decrease in adenine with increased caffeine intake. This is
More informationLecture 13 Extra Sums of Squares
Lecture 13 Extra Sums of Squares STAT 512 Spring 2011 Background Reading KNNL: 7.1-7.4 13-1 Topic Overview Extra Sums of Squares (Defined) Using and Interpreting R 2 and Partial-R 2 Getting ESS and Partial-R
More informationAnalysis of Variance II Bios 662
Analysis of Variance II Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-10-24 17:21 BIOS 662 1 ANOVA II Outline Multiple Comparisons Scheffe Tukey Bonferroni
More informationdf=degrees of freedom = n - 1
One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:
More informationStatistics For Economics & Business
Statistics For Economics & Business Analysis of Variance In this chapter, you learn: Learning Objectives The basic concepts of experimental design How to use one-way analysis of variance to test for differences
More informationAnalysis of Variance
1 / 70 Analysis of Variance Analysis of variance and regression course http://staff.pubhealth.ku.dk/~lts/regression11_2 Marc Andersen, mja@statgroup.dk Analysis of variance and regression for health researchers,
More informationLecture 15 Topic 11: Unbalanced Designs (missing data)
Lecture 15 Topic 11: Unbalanced Designs (missing data) In the real world, things fall apart: plants are destroyed/trampled/eaten animals get sick volunteers quit assistants are sloppy accidents happen
More informationSTAT22200 Spring 2014 Chapter 5
STAT22200 Spring 2014 Chapter 5 Yibi Huang April 29, 2014 Chapter 5 Multiple Comparisons Chapter 5-1 Chapter 5 Multiple Comparisons Note the t-tests and C.I. s are constructed assuming we only do one test,
More informationMultiple comparisons - subsequent inferences for two-way ANOVA
1 Multiple comparisons - subsequent inferences for two-way ANOVA the kinds of inferences to be made after the F tests of a two-way ANOVA depend on the results if none of the F tests lead to rejection of
More informationExample: Four levels of herbicide strength in an experiment on dry weight of treated plants.
The idea of ANOVA Reminders: A factor is a variable that can take one of several levels used to differentiate one group from another. An experiment has a one-way, or completely randomized, design if several
More informationHypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =
Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,
More informationTopic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction Review of regression concepts
Topic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction The analysis of covariance (ANCOVA) is a technique that is occasionally useful for improving the precision of an experiment.
More informationFactorial and Unbalanced Analysis of Variance
Factorial and Unbalanced Analysis of Variance Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 04-Jan-2017 Nathaniel E. Helwig (U of Minnesota)
More information1 The Randomized Block Design
1 The Randomized Block Design When introducing ANOVA, we mentioned that this model will allow us to include more than one categorical factor(explanatory) or confounding variables in the model. In a first
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More informationModels for Clustered Data
Models for Clustered Data Edps/Psych/Soc 589 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline Notation NELS88 data Fixed Effects ANOVA
More informationDifferences of Least Squares Means
STAT:5201 Homework 9 Solutions 1. We have a model with two crossed random factors operator and machine. There are 4 operators, 8 machines, and 3 observations from each operator/machine combination. (a)
More informationSTAT 350: Summer Semester Midterm 1: Solutions
Name: Student Number: STAT 350: Summer Semester 2008 Midterm 1: Solutions 9 June 2008 Instructor: Richard Lockhart Instructions: This is an open book test. You may use notes, text, other books and a calculator.
More informationOutline. Analysis of Variance. Acknowledgements. Comparison of 2 or more groups. Comparison of serveral groups
Outline Analysis of Variance Analysis of variance and regression course http://staff.pubhealth.ku.dk/~lts/regression10_2/index.html Comparison of serveral groups Model checking Marc Andersen, mja@statgroup.dk
More information22s:152 Applied Linear Regression. Take random samples from each of m populations.
22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each
More informationANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003
ANALYSES OF NCGS DATA FOR ALCOHOL STATUS CATEGORIES 1 22:46 Sunday, March 2, 2003 The MEANS Procedure DRINKING STATUS=1 Analysis Variable : TRIGL N Mean Std Dev Minimum Maximum 164 151.6219512 95.3801744
More informationModels for Clustered Data
Models for Clustered Data Edps/Psych/Stat 587 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline Notation NELS88 data Fixed Effects ANOVA
More informationStatistics for exp. medical researchers Regression and Correlation
Faculty of Health Sciences Regression analysis Statistics for exp. medical researchers Regression and Correlation Lene Theil Skovgaard Sept. 28, 2015 Linear regression, Estimation and Testing Confidence
More informationSAS Program Part 1: proc import datafile="y:\iowa_classes\stat_5201_design\examples\2-23_drillspeed_feed\mont_5-7.csv" out=ds dbms=csv replace; run;
STAT:5201 Applied Statistic II (two-way ANOVA with contrasts Two-Factor experiment Drill Speed: 125 and 200 Feed Rate: 0.02, 0.03, 0.05, 0.06 Response: Force All 16 runs were done in random order. This
More informationUnit 12: Analysis of Single Factor Experiments
Unit 12: Analysis of Single Factor Experiments Statistics 571: Statistical Methods Ramón V. León 7/16/2004 Unit 12 - Stat 571 - Ramón V. León 1 Introduction Chapter 8: How to compare two treatments. Chapter
More informationTopic 25 - One-Way Random Effects Models. Outline. Random Effects vs Fixed Effects. Data for One-way Random Effects Model. One-way Random effects
Topic 5 - One-Way Random Effects Models One-way Random effects Outline Model Variance component estimation - Fall 013 Confidence intervals Topic 5 Random Effects vs Fixed Effects Consider factor with numerous
More information22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA
22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each
More informationPROBLEM TWO (ALKALOID CONCENTRATIONS IN TEA) 1. Statistical Design
PROBLEM TWO (ALKALOID CONCENTRATIONS IN TEA) 1. Statistical Design The purpose of this experiment was to determine differences in alkaloid concentration of tea leaves, based on herb variety (Factor A)
More informationAnalysis of Variance
Analysis of Variance Math 36b May 7, 2009 Contents 2 ANOVA: Analysis of Variance 16 2.1 Basic ANOVA........................... 16 2.1.1 the model......................... 17 2.1.2 treatment sum of squares.................
More informationOutline. PubH 5450 Biostatistics I Prof. Carlin. Confidence Interval for the Mean. Part I. Reviews
Outline Outline PubH 5450 Biostatistics I Prof. Carlin Lecture 11 Confidence Interval for the Mean Known σ (population standard deviation): Part I Reviews σ x ± z 1 α/2 n Small n, normal population. Large
More informationSTA 303H1F: Two-way Analysis of Variance Practice Problems
STA 303H1F: Two-way Analysis of Variance Practice Problems 1. In the Pygmalion example from lecture, why are the average scores of the platoon used as the response variable, rather than the scores of the
More information