RCB - Example. STA305 week 10 1

Size: px
Start display at page:

Download "RCB - Example. STA305 week 10 1"

Transcription

1 RCB - Example An accounting firm wants to select training program for its auditors who conduct statistical sampling as part of their job. Three training methods are under consideration: home study, presentations by local staff and training sessions at head office. 30 auditors were placed into one of 10 groups of 3 based on time since graduation. Within each group, one auditor was randomly allocated to each of the 3 training methods a proficiency exam was administered at the end of the training period. The results of the proficiency test are given in the following table: STA305 week 10 1

2 STA305 week 10 2

3 As we can see on average, training at the head office tends to produce the highest proficiency scores. The blocks appear to be quite heterogeneous with respect to proficiency. The sums of squares and the ANOV are Test that the 3 training methods result in the same mean proficiency scores STA305 week 10 3

4 Linear Contrasts As we have seen previously, the ANOVA will tell us only if there is an overall difference between the treatment means. In order to test hypotheses about the equality of specific treatment means, we use linear contrasts. The contrasts, the corresponding hypothesis and the test statistic are the same as before. Hypothesis: H 0 : c 1 μ 1 + c 2 μ c a μ a = 0 H a : c 1 μ 1 + c 2 μ c a μ a 0 Test statistic is: F obs ( ) a c Y i= 1 i i = ~ F a 2 b MSE c / i= 1 i b So P-value = P(F (1,(a-1)(b-1)) > F obs ) 2 ( 1, ( a 1)( 1) ) STA305 week 10 4

5 Tests Concerning Block Means From an examination of the expected mean squares, it would seem reasonable to test : H 0 : β 1 = β 2 =... = β b = 0 by comparing MS Bl / MS E to the critical F value. However, when we applied randomization, we did so only within blocks. Some authors argue that because of the restriction on randomization that we imposed, this F test would be a test of the equality of block plus restriction effect. STA305 week 10 5

6 Increased Precision Due to Blocking In designing the study, we decided to use blocks and impose a restriction on randomization. We did this because we believed that by controlling the nuisance factor we would decrease haphazard error. We might be interested in knowing how much we gained by blocking. The test for equality of block means is open to interpretation, and even it wasn t, it would not help quantify any gains due to blocking. Several authors have proposed a measure of the relative gain in efficiency due to inclusion of the blocking factor. Their measure takes into account the fact that in the completely randomized (CR) design there are more degrees for freedom for estimating error variability. STA305 week 10 6

7 It is given by R = 2 ( df BL + 1)( dfcr + 3) σ 2 ( df + 3)( df + 1) σ BL CR CR BL 2 σ CR In this expression, is the experimental error variance of the 2 completely randomized design, and σ BL is the corresponding quantity from the randomized complete block design. 2 σ CR 2 σ BL In order to estimate R, we must first estimate and. We can use MS E from the randomized block design to estimate. 2 σ CR Further, it can be shown that an unbiased estimator of is 2 ( b 1) MSBL + b( a 1) MSE σ CR = ab 1 where MS BL is the mean square for blocks from the randomized complete block design. 2 σ BL STA305 week 10 7

8 Example Consider the example of the accounting firm that was interested in selecting the best of 3training methods. The study used time since graduation as a blocking variable. To find the efficiency of this design relative to the completely randomized design, we need the following STA305 week 10 8

9 RCB ANOVA Using SAS The SAS procedure that we will use to conduct the analysis of variance in the randomized complete block design is GLM. In fact, we will use the same commands as in the two factor fixed effect model. As usual, start by inputting the data The ANOVA for the previous example is produced as follows: proc glm data = example ; class years program ; model score=years program ; run ; STA305 week 10 9

10 Assessing Model Adequacy As in the case of the one factor model, we made a number of assumptions about the data that we must verify. Once the model has been fit, the residuals should be examined to assess the validity of the normality assumption. Similarly, the variance should be examined within each treatment and within each block to ensure homogeneity. The same tools can be used here as in the one factor case: scatterplots, box plots, and normal probability plots. STA305 week 10 10

11 Replication and Interaction The model we used for the randomized complete block design assumes that the size and direction of the treatment effect is the same within each block. We would need more than 1 observation per treatment within each block to look for interaction of treatments with blocks. We will handle this case in the same manner as in the two-factor design. STA305 week 10 11

12 Estimating Model Parameters One method of estimating the model parameters involves minimizing the squared distances from the fitted model to the observed response values. That is, we need to find the values of μ, τ i, and β j which minimize Q = a b ( Y ij μ τ i β j ) i= 1 j= 1 Differentiate Q with respect to each of the parameters, set resulting equations to 0, and solve for the parameter values. 2 STA305 week 10 12

13 Exercise What is the distribution of the least squares estimates? Y Caution: and Y i are correlated! STA305 week 10 13

14 Latin Square Design Recall: the purpose of blocking was to minimize variability due to known & controllable nuisance factor. There may be more than one nuisance factor that is known & controllable. We will now examine the case where there are 2 such nuisance factors. STA305 week 10 14

15 Example Output of 4 industrial processes to be compared. The processes usually only run on Mon, Wed, Fri, and Sat. We can run several processes on each day. Processes are affected by external conditions such as weather which vary day to day. The external conditions also vary by time of day. In studying the 4 processes, we need to ensure each process runs in each time slot on each day. The 4 processes form the treatments in this study, while day and time of day are 2 nuisance factors which are known & controllable. STA305 week 10 15

16 One possible way to run this study is described in the following table This allocation ensures that there is a balance with respect to day of the week and time of day. That is, each process is run on each day and each process is run in each time slot. This is an example of a Latin Square design. Each of the nuisance factors must have the same number of levels as the factor being studied. STA305 week 10 16

17 Randomization For a given number of treatments, a, there are several possible Latin squares. In order to randomize the experiment, a Latin square would ideally be chosen from among all possible squares. As the number of treatments increases, so does the number of possible Latin squares. In general, STA305 week 10 17

18 For example, in the case where there are 3 treatments the possible Latin squares are: It would be tedious and time consuming to write out all possible Latin squares, and for a greater than 3 this is not feasible. So if we can t do this then how do we randomly select a Latin square? STA305 week 10 18

19 Standard Latin Squares A standard Latin square is one which has both the first row and the first column in numeric (or alphabetic) order. For example, To randomize, start with the standard Latin square and randomly permute the rows and columns. This is still difficult when the number of treatments is large. Cyclic squares, in which treatments always follow each other in the same order are often used in large designs. Disadvantages: order effect, know which treatment comes next. STA305 week 10 19

20 The Model for Latin Square We will assume that the effect of the treatment is the same regardless of the level of the 2 nuisance factors, i.e., we assume that the factors do not interact with each other. The model is then. STA305 week 10 20

21 Notation We will use the dot notation with 3 subscripts STA305 week 10 21

22 Sources of Variation Possible sources of variability in this model, include: Differences in treatment means Differences in means for each row level Differences in means for each column level Random variability The total sum of squares can be partitioned in the usual manner: STA305 week 10 22

23 Hypothesis Testing Although this design involves 3 factors, only one is of primary interest. The other 2 are nuisance factors and are included in the design to minimize experimental error. The hypothesis about the equality of treatment means can be tested in a manner similar to that used in the CR and randomized complete block designs. The hypothesis of interest is: H 0 : μ 1 = μ 2 =...= μ a versus Ha: not all μ i equal or H 0 :τ 1 = τ 2 =... = τ a =0 versus H a : not all τ i = 0 To begin, construct the analysis of variance table. It is given in the next slide STA305 week 10 23

24 STA305 week 10 24

25 Example A medical researcher was interested in comparing four formulas that were fed to newborn infants. The outcome of interest was the weight gain (in ounces per day) after one week. The study was designed in such a way that each formula was to be fed to each infant for 1 week. Four infants participated in the study, and 4 weeks were needed to complete the study Although primary interest was in the difference in mean weight gain between the 4 formulas, two nuisance factors which could affect the outcome were identified. STA305 week 10 25

26 These were: (a) infants, i.e. some infants may gain weight more quickly than others. (b) week i.e. the week determines the age of the infant, which could affect outcome and week could also be an indicator for external conditions i.e. in week x there was a flu virus circulating which caused weight loss. The following table indicates Latin square that was randomly selected to determine when infants would receive formulas 1-4 STA305 week 10 26

27 The study was conducted and the results are: The mean weight gain for each formula is: F 1 : F 2 : F 3 : F 4 : Do these data provide any evidence that there is a difference in mean weight gain between the 4 formulas? STA305 week 10 27

28 Solution Before proceeding to the analysis of variance, examine the sample means so we know what to expect. We blocked on 2 factors in the belief that there was considerable variability within blocks, and less variability between blocks. Does blocking on these 2 variables appear to have been worthwhile? One of the blocking factors was infant; there seems to be quite a bit a difference in average weight gain for the 4 infants, so perhaps this was worthwhile including in the design. The second blocking factor was week; the mean weight gain for weeks 2-4 do not appear to differ very much; however, the weight gain in week 1 was quite a bit less, so perhaps it was a good idea to have included this in the study design. STA305 week 10 28

29 The factor of primary interest was the type of formula that was fed to the infants. Although there are some differences between the formulas, these differences aren t as pronounced as the differences between infants. Need to conduct the ANOVA to determine whether the differences are significant. Start by calculating the sums of squares STA305 week 10 29

30 Latin Squares Using SAS To obtain the ANOVA for the previous example using SAS, input the data data example; input infant week formula wtgain ; cards ; ; run ; Conduct the ANOVA using PROC GLM proc glm data = example ; class formula infant week ; Model wtgain=formula infant week ; run ; STA305 week 10 30

31 Advantages / Disadvantages of Latin Square Design One of the advantages of the Latin square design is that the use of 2 blocking variables can greatly reduce experimental error. The total number of experimental units required is relatively small, which makes this design very practical for pilot or preliminary studies. One of the drawbacks of using a Latin square design is that the number of levels of each nuisance factor must equal the number of treatments. In some cases, the complexity of the randomization can be a disadvantage. The degrees of freedom for estimating σ 2 are relatively small, meaning that we will not have a very precise estimate. STA305 week 10 31

32 Relative Efficiency The Latin square is a design one level more complicated that the randomized complete block design. It is worth asking whether the added complexity of the Latin square paid off in terms of increased precision and power. Consider the simpler randomized complete block design where control of the row factor is part of the design, and column effects will (hopefully) be averaged out by randomization. STA305 week 10 32

33 The relative efficiency of the Latin square design compared to this blocked design is 2 RCB 2 LS The denominator can be estimated by MS E from the Latin square design. The numerator can be estimated by R = σ σ ( a ) MScol + 1 ME ˆ σ 2 RCB = a where MS Col is the mean square for the column facture in the Latin square design, and MS E is also from the Latin square design. E STA305 week 10 33

34 Example In the infant formula example, we might wonder whether there was any gain in efficiency by blocking on weeks and infants, as compared to just blocking on infants. An estimate of the relative efficiency is obtained by first calculating the numerator ˆ σ 2 RCB = MS col + ( a 1) a ME = = So the relative efficiency is R = / = E Had we decided to use a design which blocked on infant and randomized order of formula within each infant, we would have required approximately twice the number of observations to obtain the same precision. STA305 week 10 34

35 Replicated Latin Squares In general, even when the number of treatments is large, residual degrees of freedom in Latin Square remains relatively small. When there are only two treatments (a = 2) there are no degrees of freedom for error. It may be desirable to replicate the experiment in order to increase precision. That is, we want to increase the number of experimental units and still maintain the balance. To do this we must add one or more Latin squares to the experiment. Depending on the study, there may be more than one way to add replicates. STA305 week 10 35

36 Example The output from manufacturing process depends on settings of the machine used. Three settings will be studied to determine if there is a difference in mean output. It is believed that output may also be affected by the particular machine and by the person operating the machine. In order to control for these 2 nuisance factors, and Latin square design will be used Machine 1 Machine 2 Machine 3 Operator 1 S3 S1 S2 Operator 2 S2 S3 S1 Operator 3 S1 S2 S3 STA305 week 10 36

37 If we wish to increase size of experiment and to add more experimental units, we can do it in several ways by adding more squares. We can either use the same operators and machines, or if possible by including new operators and machines. Several options are presented STA305 week 10 37

38 Example Retraining program to teach automobile repair skills to individuals. Three incentive methods are being tested for use in the retraining program. It is believed that age and level of formal education also have an impact on outcome of training. There are 18 study participants available to participate in this experiment. Two Latin squares with the same rows and columns were used to design the STA305 week 10 38

39 The study was conducted, and achievement scores were : STA305 week 10 39

40 Graeco-Latin Squares The Latin square design was used to control for 2 nuisance factors. When there is a 3rd nuisance factor to be included in the design of an experiment, Graeco- Latin squares are used. A Graeco-Latin square consists of superimposing 2 orthogonal Latin squares. Consider 2 Latin squares, one with Latin letters and one with Greek letters. The 2 squares are said to be orthogonal if each Latin letter appears exactly once with each Greek letter. STA305 week 10 40

41 Example STA305 week 10 41

Incomplete Block Designs

Incomplete Block Designs Incomplete Block Designs Recall: in randomized complete block design, each of a treatments was used once within each of b blocks. In some situations, it will not be possible to use each of a treatments

More information

The Random Effects Model Introduction

The Random Effects Model Introduction The Random Effects Model Introduction Sometimes, treatments included in experiment are randomly chosen from set of all possible treatments. Conclusions from such experiment can then be generalized to other

More information

Chapter 4: Randomized Blocks and Latin Squares

Chapter 4: Randomized Blocks and Latin Squares Chapter 4: Randomized Blocks and Latin Squares 1 Design of Engineering Experiments The Blocking Principle Blocking and nuisance factors The randomized complete block design or the RCBD Extension of the

More information

DESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

DESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya DESAIN EKSPERIMEN BLOCKING FACTORS Semester Genap Jurusan Teknik Industri Universitas Brawijaya Outline The Randomized Complete Block Design The Latin Square Design The Graeco-Latin Square Design Balanced

More information

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization. 1 Chapter 1: Research Design Principles The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization. 2 Chapter 2: Completely Randomized Design

More information

Lecture 7: Latin Square and Related Design

Lecture 7: Latin Square and Related Design Lecture 7: Latin Square and Related Design Montgomery: Section 4.2-4.3 Page 1 Automobile Emission Experiment Four cars and four drivers are employed in a study for possible differences between four gasoline

More information

Figure 9.1: A Latin square of order 4, used to construct four types of design

Figure 9.1: A Latin square of order 4, used to construct four types of design 152 Chapter 9 More about Latin Squares 9.1 Uses of Latin squares Let S be an n n Latin square. Figure 9.1 shows a possible square S when n = 4, using the symbols 1, 2, 3, 4 for the letters. Such a Latin

More information

IX. Complete Block Designs (CBD s)

IX. Complete Block Designs (CBD s) IX. Complete Block Designs (CBD s) A.Background Noise Factors nuisance factors whose values can be controlled within the context of the experiment but not outside the context of the experiment Covariates

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Unit 6: Orthogonal Designs Theory, Randomized Complete Block Designs, and Latin Squares

Unit 6: Orthogonal Designs Theory, Randomized Complete Block Designs, and Latin Squares Unit 6: Orthogonal Designs Theory, Randomized Complete Block Designs, and Latin Squares STA 643: Advanced Experimental Design Derek S. Young 1 Learning Objectives Understand the basics of orthogonal designs

More information

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent: Activity #10: AxS ANOVA (Repeated subjects design) Resources: optimism.sav So far in MATH 300 and 301, we have studied the following hypothesis testing procedures: 1) Binomial test, sign-test, Fisher s

More information

Sleep data, two drugs Ch13.xls

Sleep data, two drugs Ch13.xls Model Based Statistics in Biology. Part IV. The General Linear Mixed Model.. Chapter 13.3 Fixed*Random Effects (Paired t-test) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

Inference for the Regression Coefficient

Inference for the Regression Coefficient Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates

More information

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is

More information

Stat 705: Completely randomized and complete block designs

Stat 705: Completely randomized and complete block designs Stat 705: Completely randomized and complete block designs Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis II 1 / 16 Experimental design Our department offers

More information

Assignment 6 Answer Keys

Assignment 6 Answer Keys ssignment 6 nswer Keys Problem 1 (a) The treatment sum of squares can be calculated by SS Treatment = b a ȳi 2 Nȳ 2 i=1 = 5 (5.40 2 + 5.80 2 + 10 2 + 9.80 2 ) 20 7.75 2 = 92.95 Then the F statistic for

More information

PLS205 Lab 2 January 15, Laboratory Topic 3

PLS205 Lab 2 January 15, Laboratory Topic 3 PLS205 Lab 2 January 15, 2015 Laboratory Topic 3 General format of ANOVA in SAS Testing the assumption of homogeneity of variances by "/hovtest" by ANOVA of squared residuals Proc Power for ANOVA One-way

More information

MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS FALL 1999

MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS FALL 1999 MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS 350 - FALL 1999 Instructor: A. Oyet Date: December 16, 1999 Name(Surname First): Student Number INSTRUCTIONS

More information

Lecture 7: Latin Squares and Related Designs

Lecture 7: Latin Squares and Related Designs Lecture 7: Latin Squares and Related Designs Montgomery: Section 4.2 and 4.3 1 Lecture 7 Page 1 Automobile Emission Experiment Four cars and four drivers are employed in a study of four gasoline additives(a,b,c,

More information

VIII. ANCOVA. A. Introduction

VIII. ANCOVA. A. Introduction VIII. ANCOVA A. Introduction In most experiments and observational studies, additional information on each experimental unit is available, information besides the factors under direct control or of interest.

More information

Deciphering Math Notation. Billy Skorupski Associate Professor, School of Education

Deciphering Math Notation. Billy Skorupski Associate Professor, School of Education Deciphering Math Notation Billy Skorupski Associate Professor, School of Education Agenda General overview of data, variables Greek and Roman characters in math and statistics Parameters vs. Statistics

More information

Topic 20: Single Factor Analysis of Variance

Topic 20: Single Factor Analysis of Variance Topic 20: Single Factor Analysis of Variance Outline Single factor Analysis of Variance One set of treatments Cell means model Factor effects model Link to linear regression using indicator explanatory

More information

Orthogonal and Non-orthogonal Polynomial Constrasts

Orthogonal and Non-orthogonal Polynomial Constrasts Orthogonal and Non-orthogonal Polynomial Constrasts We had carefully reviewed orthogonal polynomial contrasts in class and noted that Brian Yandell makes a compelling case for nonorthogonal polynomial

More information

Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8]

Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8] Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8] The completely randomized design Treatments are randomly assigned to e.u. such

More information

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models EPSY 905: Multivariate Analysis Spring 2016 Lecture #12 April 20, 2016 EPSY 905: RM ANOVA, MANOVA, and Mixed Models

More information

Assessing Model Adequacy

Assessing Model Adequacy Assessing Model Adequacy A number of assumptions were made about the model, and these need to be verified in order to use the model for inferences. In cases where some assumptions are violated, there are

More information

psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests

psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests last lecture: introduction to factorial designs next lecture: factorial between-ps ANOVA II: (effect sizes and follow-up tests) 1 general

More information

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous. COMPLETELY RANDOM DESIGN (CRD) Description of the Design -Simplest design to use. -Design can be used when experimental units are essentially homogeneous. -Because of the homogeneity requirement, it may

More information

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit

More information

First we look at some terms to be used in this section.

First we look at some terms to be used in this section. 8 Hypothesis Testing 8.1 Introduction MATH1015 Biostatistics Week 8 In Chapter 7, we ve studied the estimation of parameters, point or interval estimates. The construction of CI relies on the sampling

More information

Contrasts and Multiple Comparisons Supplement for Pages

Contrasts and Multiple Comparisons Supplement for Pages Contrasts and Multiple Comparisons Supplement for Pages 302-323 Brian Habing University of South Carolina Last Updated: July 20, 2001 The F-test from the ANOVA table allows us to test the null hypothesis

More information

44.2. Two-Way Analysis of Variance. Introduction. Prerequisites. Learning Outcomes

44.2. Two-Way Analysis of Variance. Introduction. Prerequisites. Learning Outcomes Two-Way Analysis of Variance 44 Introduction In the one-way analysis of variance (Section 441) we consider the effect of one factor on the values taken by a variable Very often, in engineering investigations,

More information

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal Email: yuppal@ysu.edu Chapter 13, Part A: Analysis of Variance and Experimental Design Introduction to Analysis of Variance Analysis

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

Topic 7: Incomplete, double-blocked designs: Latin Squares [ST&D sections ]

Topic 7: Incomplete, double-blocked designs: Latin Squares [ST&D sections ] Topic 7: Incomplete, double-blocked designs: Latin Squares [ST&D sections 9.10 9.15] 7.1. Introduction The Randomized Complete Block Design is commonly used to improve the ability of an experiment to detect

More information

Analysis of Variance

Analysis of Variance Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also

More information

INTRODUCTION TO ANALYSIS OF VARIANCE

INTRODUCTION TO ANALYSIS OF VARIANCE CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two

More information

Example - Alfalfa (11.6.1) Lecture 16 - ANOVA cont. Alfalfa Hypotheses. Treatment Effect

Example - Alfalfa (11.6.1) Lecture 16 - ANOVA cont. Alfalfa Hypotheses. Treatment Effect (11.6.1) Lecture 16 - ANOVA cont. Sta102 / BME102 Colin Rundel October 28, 2015 Researchers were interested in the effect that acid has on the growth rate of alfalfa plants. They created three treatment

More information

Simple Linear Regression: One Quantitative IV

Simple Linear Regression: One Quantitative IV Simple Linear Regression: One Quantitative IV Linear regression is frequently used to explain variation observed in a dependent variable (DV) with theoretically linked independent variables (IV). For example,

More information

INTRODUCTION TO DESIGN AND ANALYSIS OF EXPERIMENTS

INTRODUCTION TO DESIGN AND ANALYSIS OF EXPERIMENTS GEORGE W. COBB Mount Holyoke College INTRODUCTION TO DESIGN AND ANALYSIS OF EXPERIMENTS Springer CONTENTS To the Instructor Sample Exam Questions To the Student Acknowledgments xv xxi xxvii xxix 1. INTRODUCTION

More information

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000 Lecture 14 Analysis of Variance * Correlation and Regression Outline Analysis of Variance (ANOVA) 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) Outline Lecture 14 Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) 11-1 Introduction 11- Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Applied Statistics Preliminary Examination Theory of Linear Models August 2017

Applied Statistics Preliminary Examination Theory of Linear Models August 2017 Applied Statistics Preliminary Examination Theory of Linear Models August 2017 Instructions: Do all 3 Problems. Neither calculators nor electronic devices of any kind are allowed. Show all your work, clearly

More information

Wolf River. Lecture 19 - ANOVA. Exploratory analysis. Wolf River - Data. Sta 111. June 11, 2014

Wolf River. Lecture 19 - ANOVA. Exploratory analysis. Wolf River - Data. Sta 111. June 11, 2014 Aldrin in the Wolf River Wolf River Lecture 19 - Sta 111 Colin Rundel June 11, 2014 The Wolf River in Tennessee flows past an abandoned site once used by the pesticide industry for dumping wastes, including

More information

Orthogonal, Planned and Unplanned Comparisons

Orthogonal, Planned and Unplanned Comparisons This is a chapter excerpt from Guilford Publications. Data Analysis for Experimental Design, by Richard Gonzalez Copyright 2008. 8 Orthogonal, Planned and Unplanned Comparisons 8.1 Introduction In this

More information

EX1. One way ANOVA: miles versus Plug. a) What are the hypotheses to be tested? b) What are df 1 and df 2? Verify by hand. , y 3

EX1. One way ANOVA: miles versus Plug. a) What are the hypotheses to be tested? b) What are df 1 and df 2? Verify by hand. , y 3 EX. Chapter 8 Examples In an experiment to investigate the performance of four different brands of spark plugs intended for the use on a motorcycle, plugs of each brand were tested and the number of miles

More information

Implementing contrasts using SAS Proc GLM is a relatively straightforward process. A SAS Proc GLM contrast statement has the following form:

Implementing contrasts using SAS Proc GLM is a relatively straightforward process. A SAS Proc GLM contrast statement has the following form: Contrasts for Comparison of Means If the analysis of variance produces a significant F-statistic, this signals the need for further analyses. The most common post ANOVA analysis is the comparison of means.

More information

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010

MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 MIXED MODELS FOR REPEATED (LONGITUDINAL) DATA PART 2 DAVID C. HOWELL 4/1/2010 Part 1 of this document can be found at http://www.uvm.edu/~dhowell/methods/supplements/mixed Models for Repeated Measures1.pdf

More information

Y it = µ + τ i + ε it ; ε it ~ N(0, σ 2 ) (1)

Y it = µ + τ i + ε it ; ε it ~ N(0, σ 2 ) (1) DOE (Linear Model) Strategy for checking experimental model assumptions Part 1 When we discuss experiments whose data are described or analyzed by the one-way analysis of variance ANOVA model, we note

More information

Experimental Design. IUFRO-SPDC Snowbird, UT September 29 Oct 3, 2014 Drs. Rolfe Leary and John A. Kershaw, Jr.

Experimental Design. IUFRO-SPDC Snowbird, UT September 29 Oct 3, 2014 Drs. Rolfe Leary and John A. Kershaw, Jr. Experimental Design IUFRO-SPDC Snowbird, UT September 29 Oct 3, 2014 Drs. Rolfe Leary and John A. Kershaw, Jr. Three scenarios The Good You designed the experiment You have the data Now what? The Bad

More information

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments. Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a

More information

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1 Notes for Wee 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1 Exam 3 is on Friday May 1. A part of one of the exam problems is on Predictiontervals : When randomly sampling from a normal population

More information

Population 1 Population 2

Population 1 Population 2 Two Population Case Testing the Difference Between Two Population Means Sample of Size n _ Sample mean = x Sample s.d.=s x Sample of Size m _ Sample mean = y Sample s.d.=s y Pop n mean=μ x Pop n s.d.=

More information

Hypothesis T e T sting w ith with O ne O One-Way - ANOV ANO A V Statistics Arlo Clark Foos -

Hypothesis T e T sting w ith with O ne O One-Way - ANOV ANO A V Statistics Arlo Clark Foos - Hypothesis Testing with One-Way ANOVA Statistics Arlo Clark-Foos Conceptual Refresher 1. Standardized z distribution of scores and of means can be represented as percentile rankings. 2. t distribution

More information

PROBLEM TWO (ALKALOID CONCENTRATIONS IN TEA) 1. Statistical Design

PROBLEM TWO (ALKALOID CONCENTRATIONS IN TEA) 1. Statistical Design PROBLEM TWO (ALKALOID CONCENTRATIONS IN TEA) 1. Statistical Design The purpose of this experiment was to determine differences in alkaloid concentration of tea leaves, based on herb variety (Factor A)

More information

Topic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction Review of regression concepts

Topic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction Review of regression concepts Topic 13. Analysis of Covariance (ANCOVA) [ST&D chapter 17] 13.1 Introduction The analysis of covariance (ANCOVA) is a technique that is occasionally useful for improving the precision of an experiment.

More information

Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means

Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means Keppel, G. & Wickens, T. D. Design and Analysis Chapter 4: Analytical Comparisons Among Treatment Means 4.1 The Need for Analytical Comparisons...the between-groups sum of squares averages the differences

More information

21.0 Two-Factor Designs

21.0 Two-Factor Designs 21.0 Two-Factor Designs Answer Questions 1 RCBD Concrete Example Two-Way ANOVA Popcorn Example 21.4 RCBD 2 The Randomized Complete Block Design is also known as the two-way ANOVA without interaction. A

More information

In social and biological sciences, a useful and frequently used statistical technique is the

In social and biological sciences, a useful and frequently used statistical technique is the Testing the additivity assumption in repeated measures ANOV A Robert I. Zambarano University of Texas Computation Center In social and biological sciences, a useful and frequently used statistical technique

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information. STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory

More information

Stat 579: Generalized Linear Models and Extensions

Stat 579: Generalized Linear Models and Extensions Stat 579: Generalized Linear Models and Extensions Mixed models Yan Lu Feb, 2018, week 7 1 / 17 Some commonly used experimental designs related to mixed models Two way or three way random/mixed effects

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

Linear Combinations of Group Means

Linear Combinations of Group Means Linear Combinations of Group Means Look at the handicap example on p. 150 of the text. proc means data=mth567.disability; class handicap; var score; proc sort data=mth567.disability; by handicap; proc

More information

Analysis of Variance and Co-variance. By Manza Ramesh

Analysis of Variance and Co-variance. By Manza Ramesh Analysis of Variance and Co-variance By Manza Ramesh Contents Analysis of Variance (ANOVA) What is ANOVA? The Basic Principle of ANOVA ANOVA Technique Setting up Analysis of Variance Table Short-cut Method

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan COSC 341 Human Computer Interaction Dr. Bowen Hui University of British Columbia Okanagan 1 Last Topic Distribution of means When it is needed How to build one (from scratch) Determining the characteristics

More information

Topic 12. The Split-plot Design and its Relatives (continued) Repeated Measures

Topic 12. The Split-plot Design and its Relatives (continued) Repeated Measures 12.1 Topic 12. The Split-plot Design and its Relatives (continued) Repeated Measures 12.9 Repeated measures analysis Sometimes researchers make multiple measurements on the same experimental unit. We have

More information

OHSU OGI Class ECE-580-DOE :Design of Experiments Steve Brainerd

OHSU OGI Class ECE-580-DOE :Design of Experiments Steve Brainerd Why We Use Analysis of Variance to Compare Group Means and How it Works The question of how to compare the population means of more than two groups is an important one to researchers. Let us suppose that

More information

SAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model

SAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model Topic 23 - Unequal Replication Data Model Outline - Fall 2013 Parameter Estimates Inference Topic 23 2 Example Page 954 Data for Two Factor ANOVA Y is the response variable Factor A has levels i = 1, 2,...,

More information

MATH Notebook 3 Spring 2018

MATH Notebook 3 Spring 2018 MATH448001 Notebook 3 Spring 2018 prepared by Professor Jenny Baglivo c Copyright 2010 2018 by Jenny A. Baglivo. All Rights Reserved. 3 MATH448001 Notebook 3 3 3.1 One Way Layout........................................

More information

STAT 350. Assignment 4

STAT 350. Assignment 4 STAT 350 Assignment 4 1. For the Mileage data in assignment 3 conduct a residual analysis and report your findings. I used the full model for this since my answers to assignment 3 suggested we needed the

More information

Chapter 12: Inference about One Population

Chapter 12: Inference about One Population Chapter 1: Inference about One Population 1.1 Introduction In this chapter, we presented the statistical inference methods used when the problem objective is to describe a single population. Sections 1.

More information

PLS205 Lab 6 February 13, Laboratory Topic 9

PLS205 Lab 6 February 13, Laboratory Topic 9 PLS205 Lab 6 February 13, 2014 Laboratory Topic 9 A word about factorials Specifying interactions among factorial effects in SAS The relationship between factors and treatment Interpreting results of an

More information

PLS205!! Lab 9!! March 6, Topic 13: Covariance Analysis

PLS205!! Lab 9!! March 6, Topic 13: Covariance Analysis PLS205!! Lab 9!! March 6, 2014 Topic 13: Covariance Analysis Covariable as a tool for increasing precision Carrying out a full ANCOVA Testing ANOVA assumptions Happiness! Covariable as a Tool for Increasing

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

A Introduction to Matrix Algebra and the Multivariate Normal Distribution

A Introduction to Matrix Algebra and the Multivariate Normal Distribution A Introduction to Matrix Algebra and the Multivariate Normal Distribution PRE 905: Multivariate Analysis Spring 2014 Lecture 6 PRE 905: Lecture 7 Matrix Algebra and the MVN Distribution Today s Class An

More information

Analysis of Variance (ANOVA)

Analysis of Variance (ANOVA) Analysis of Variance ANOVA) Compare several means Radu Trîmbiţaş 1 Analysis of Variance for a One-Way Layout 1.1 One-way ANOVA Analysis of Variance for a One-Way Layout procedure for one-way layout Suppose

More information

1. Least squares with more than one predictor

1. Least squares with more than one predictor Statistics 1 Lecture ( November ) c David Pollard Page 1 Read M&M Chapter (skip part on logistic regression, pages 730 731). Read M&M pages 1, for ANOVA tables. Multiple regression. 1. Least squares with

More information

[ z = 1.48 ; accept H 0 ]

[ z = 1.48 ; accept H 0 ] CH 13 TESTING OF HYPOTHESIS EXAMPLES Example 13.1 Indicate the type of errors committed in the following cases: (i) H 0 : µ = 500; H 1 : µ 500. H 0 is rejected while H 0 is true (ii) H 0 : µ = 500; H 1

More information

Allow the investigation of the effects of a number of variables on some response

Allow the investigation of the effects of a number of variables on some response Lecture 12 Topic 9: Factorial treatment structures (Part I) Factorial experiments Allow the investigation of the effects of a number of variables on some response in a highly efficient manner, and in a

More information

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3 Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3.1 through 3.3 Fall, 2013 Page 1 Tensile Strength Experiment Investigate the tensile strength of a new synthetic fiber. The factor is the

More information

Factorial designs. Experiments

Factorial designs. Experiments Chapter 5: Factorial designs Petter Mostad mostad@chalmers.se Experiments Actively making changes and observing the result, to find causal relationships. Many types of experimental plans Measuring response

More information

Group Comparisons: Differences in Composition Versus Differences in Models and Effects

Group Comparisons: Differences in Composition Versus Differences in Models and Effects Group Comparisons: Differences in Composition Versus Differences in Models and Effects Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 15, 2015 Overview.

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analysis of Variance and Design of Experiment-I MODULE IX LECTURE - 38 EXERCISES Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Example (Completely randomized

More information

STA121: Applied Regression Analysis

STA121: Applied Regression Analysis STA121: Applied Regression Analysis Linear Regression Analysis - Chapters 3 and 4 in Dielman Artin Department of Statistical Science September 15, 2009 Outline 1 Simple Linear Regression Analysis 2 Using

More information

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3

Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3 Lecture 3. Experiments with a Single Factor: ANOVA Montgomery 3-1 through 3-3 Page 1 Tensile Strength Experiment Investigate the tensile strength of a new synthetic fiber. The factor is the weight percent

More information

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS

ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing

More information

EXST Regression Techniques Page 1. We can also test the hypothesis H :" œ 0 versus H :"

EXST Regression Techniques Page 1. We can also test the hypothesis H : œ 0 versus H : EXST704 - Regression Techniques Page 1 Using F tests instead of t-tests We can also test the hypothesis H :" œ 0 versus H :" Á 0 with an F test.! " " " F œ MSRegression MSError This test is mathematically

More information

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Chap The McGraw-Hill Companies, Inc. All rights reserved. 11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview

More information

Advanced Experimental Design

Advanced Experimental Design Advanced Experimental Design Topic Four Hypothesis testing (z and t tests) & Power Agenda Hypothesis testing Sampling distributions/central limit theorem z test (σ known) One sample z & Confidence intervals

More information

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company Multiple Regression Inference for Multiple Regression and A Case Study IPS Chapters 11.1 and 11.2 2009 W.H. Freeman and Company Objectives (IPS Chapters 11.1 and 11.2) Multiple regression Data for multiple

More information

Inference for Regression Simple Linear Regression

Inference for Regression Simple Linear Regression Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating

More information

PSYC 331 STATISTICS FOR PSYCHOLOGISTS

PSYC 331 STATISTICS FOR PSYCHOLOGISTS PSYC 331 STATISTICS FOR PSYCHOLOGISTS Session 4 A PARAMETRIC STATISTICAL TEST FOR MORE THAN TWO POPULATIONS Lecturer: Dr. Paul Narh Doku, Dept of Psychology, UG Contact Information: pndoku@ug.edu.gh College

More information

Group comparison test for independent samples

Group comparison test for independent samples Group comparison test for independent samples The purpose of the Analysis of Variance (ANOVA) is to test for significant differences between means. Supposing that: samples come from normal populations

More information

1 Use of indicator random variables. (Chapter 8)

1 Use of indicator random variables. (Chapter 8) 1 Use of indicator random variables. (Chapter 8) let I(A) = 1 if the event A occurs, and I(A) = 0 otherwise. I(A) is referred to as the indicator of the event A. The notation I A is often used. 1 2 Fitting

More information

20.0 Experimental Design

20.0 Experimental Design 20.0 Experimental Design Answer Questions 1 Philosophy One-Way ANOVA Egg Sample Multiple Comparisons 20.1 Philosophy Experiments are often expensive and/or dangerous. One wants to use good techniques that

More information

Longitudinal Data Analysis of Health Outcomes

Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis Workshop Running Example: Days 2 and 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development

More information

Topic 9: Factorial treatment structures. Introduction. Terminology. Example of a 2x2 factorial

Topic 9: Factorial treatment structures. Introduction. Terminology. Example of a 2x2 factorial Topic 9: Factorial treatment structures Introduction A common objective in research is to investigate the effect of each of a number of variables, or factors, on some response variable. In earlier times,

More information