Power & Sample Size Calculation

Size: px
Start display at page:

Download "Power & Sample Size Calculation"

Transcription

1 Chapter 7 Power & Sample Size Calculation Yibi Huang Chapter 7 Section 10.3 Power & Sample Size Calculation for CRDs Power & Sample Size for Factorial Designs Chapter 7-1

2 Power & Sample Size Calculation When proposing an experiment (applying for funding etc), nowadays one needs to show that the proposed sample size (i.e. the number of experiment units) is neither so small that scientifically interesting effects will be swamped by random variation (i.e., has enough power to reject a false H 0 ) nor larger than necessary, wasting resources (time & money) Chapter 7-2

3 Errors and Power in Hypothesis Testing A Type I error occurs when H 0 is true but is rejected. A Type II error occurs when H 0 is false but is accepted. The (significance) level of a test is the chance of making a Type I error, i.e., the chance to reject a H 0 when it is true. The power of the test is the the chance of rejecting H 0 when H a is true: power = 1 P(making type II error H 0 is false) = 1 β = P(correctly reject H 0 H 0 is false) A good test should have small test-level and large power. H 0 is true H 0 is false H a is rejected H 0 is rejected (H 0 is accepted) (H a is accepted) α = P(Type I error) β = P(Type II error) Chapter 7-3

4 Power Calculation for ANOVA For a CRD design with g treatments, recall the means model: y ij = µ i + ε ij and the effects model: y ij = µ + α i + ε ij, for i = 1,..., g, and j = 1,..., n i. The two models are related via the relationship µ i = µ + α i. For CRDs, we use the means model more often. However, for power and sample size calculation, we need to use the effects model and µ must be defined as µ = 1 g n i µ i = 1 g n i µ i. N N i=1 j=1 which implies that g i=1 n iα i = 0. Here N = g i=1 n i is the total number of experimental units. Chapter 7-4 i=1

5 Recall the H 0 and H a for the ANOVA F test are H 0 : µ 1 = = µ g v.s. H a : µ i s not all equal, in terms of the means model, or equivalently in terms of the effects model, they are H 0 : α 1 = = α g = 0 v.s. H a : Not all α i s are 0. Recall we reject H 0 at level α if the test statistic F = MS Trt MSE exceeds the critical value F α,g 1,N g. So Power = P(Reject H 0 H a is true) = P(F > F α, g 1, N g ). To find P(F > F α, g 1, N g ), we must know the distribution of F. What is the distribution of F under H 0? F g 1,N g. And under H a? Chapter 7-5

6 For F = MS Trt MSE = SS Trt/(g 1), recall in Ch3 we ve shown that SSE/(N g) no matter under H 0 or H a, it is always true that SSE σ 2 χ2 N g, and E(MSE) = σ2 For the numerator, SS Trt /σ 2 has a χ 2 g 1 distribution under H 0. But under H a, it has a non-central Chi-square distribution χ 2 g 1,δ with non-central parameter Moreover, δ = g i=1 n iα 2 i σ 2. E(SS Trt ) = (g 1)σ 2 + g n iα 2 i=1 i }{{} δσ { 2 (g 1)σ 2 under H 0 = (g 1 + δ)σ 2 under H a Chapter 7-6

7 Non-Central F -Distribution Under H a, it can be shown that F = MS Trt MSE has a non-central F -distribution on degrees of freedom g 1 and N g, with non-centrality parameter δ, denoted as g i=1 F F g 1,N g,δ where δ = n iαi 2 σ 2. Recall that α i = µ i µ, and µ = 1 N g n i µ i = 1 N i=1 j=1 g n i µ i. Non-central F -distribution is also right skewed, and the greater the non-centrality parameter δ, the further away the peak of the distribution from 0. Chapter 7-7 i=1

8 Example 1: Power Calculation (1) g = 5 treatment groups group sizes: n 1 = n 2 = n 3 = 5, n 4 = 6, n 5 = 4 assume σ = 0.8 desired significance level α = 0.05 find the power of the test when H a is true with µ 1 = 1.6, µ 2 = 0.6, µ 3 = 2, µ 4 = 0, µ 5 = 1. Solution. The grand mean µ is g i=1 µ = n iµ i = N = = 1 The α i s are thus α i = µ i µ = µ i 1: α 1 = 0.6, α 2 = 0.4, α 3 = 1, α 4 = 1, α 5 = 0. Chapter 7-8

9 Example 1: Power Calculation (2) The non-centrality parameter is g i=1 δ = n iαi 2 σ 2 = ( 0.4) ( 1) = = So F = MS Trt MSE { F g 1, N g = F 4, 25 5 under H 0 F g 1, N g,δ = F 4, 25 5, under H a H 0 : F F 4,20 H a : F F 4,20, Chapter 7-9

10 Example 1: Power Calculation (3) The critical value to reject H 0 keeping the significance level at α = 0.05 is F 0.05,4, > qf(0.05,4,20, lower.tail=f) [1] When H 0 is true, the chance that H 0 is rejected is only 0.05 (the red shaded area.) Accept H 0 Reject H 0 F 0.05,4,20 Area = Reminder: Don t confuse the following two: in most STAT books, F α,df 1,df 2 means the area of the upper tail is α. in R, qf(alpha, df1, df2) means the the area of the lower Chapter 7-10

11 Example 1: Power Calculation (4) If H a is true and µ 1 = 1.6, µ 2 = 0.6, µ 3 = 2, µ 4 = 0, µ 5 = 1, we know then F F 4,20,δ= The power to reject H 0 is the area under the density of F 4,20,δ=21.25 beyond the critical value (the blue shaded area,) which is > pf(qf(.95,4,20),4,20, ncp=21.25, lower.tail=f) [1] In the R codes, ncp means the non-centrality parameter Accept H 0 Reject H 0 Power F 0.05,4, Chapter 7-11

12 Example 1: Power Calculation (5) Remark The power of a test is not a single value, but a function of the parameters in H a. If parameter µ i s change their values, the power of the test also changes. It makes no sense to talk about the power of a test without specifying the parameters in H a Chapter 7-12

13 Power Of A Test Is Affected By... The larger the non-centrality parameter the greater the power. δ = The power of a test will increase if g i=1 n iα 2 i σ 2, the number of replicate n i per treatment increases the magnitude of treatment effects α i s increase (under the constraint i n iα i = 0) the size of noise σ 2 decreases. Chapter 7-13

14 Example 2: Sample Size Calculation (1) g = 5 treatment groups of equal sample size n i = n for all i assume σ = 0.8 desired significance level α = 0.05 Assuming the design is balanced that all groups have identical sample size n, what is the minimal sample size n per treatment to have power 0.95 when α 1 = 0.5, α 2 = 0.5, α 3 = 1, α 4 = 1, α 5 = 0? Solution. The non-centrality parameter is g i=1 δ = n iαi 2 σ 2 = n [ ( 0.5) ( 1) ] = n 2.5 = 3.9n 0.82 So F = MS Trt MSE { F g 1, N g = F 4, 5n 5 under H 0 F g 1, N g,δ = F 4, 5n 5, 3.9n under H a Chapter 7-14

15 Recall the critical value F for rejecting H 0 at level α = 0.05 is F = F α, g 1, N g = F 0.05, 4, 5n 5 which can be find in R via the command > F.crit = qf(alpha, g-1, N-g, lower.tail=f) # syntax > F.crit = qf(0.05, 4, 5*n-5, lower.tail=f) # sub-in the values By definition, Power = P(reject H 0 H a is true) = P(the non central F statistic F ) = P(F (g 1, N g, δ) F ) = P(F (4, 5n 5, 3.9n) F ) which can be found in R via the command > pf(f.crit, g-1, N-g, ncp=delta, lower.tail=f) # syntax > pf(f.crit, 4, 5*n-5, ncp=3.9*n, lower.tail=f) # sub-in values In the R codes, ncp means the non-centrality parameter. Chapter 7-15

16 Now we find the R code to find the power of the ANOVA F -test when n is known. Let s plug in different values of n and see what is the smallest n to make power > F.crit = qf(alpha, g-1, N-g, lower.tail=f) > pf(f.crit, g-1, N-g, ncp=delta, lower.tail=f) # syntax > n = 5 > F.crit = qf(0.05, 4, 5*n-5, lower.tail=f) > pf(f.crit, 4, 5*n-5, ncp=3.9*n, lower.tail=f) [1] # less than 0.95, not high enough > n = 6 > F.crit = qf(0.05, 4, 5*n-5, lower.tail=f) > pf(f.crit, 4, 5*n-5, ncp=3.9*n, lower.tail=f) [1] # greater than 0.95, bingo! So we need 6 replicates in each of the 5 treatment groups to ensure a power of 0.95 when α 1 = 0.5, α 2 = 0.5, α 3 = 1, α 4 = 1, α 5 = 0. Remark: Again, we have to specify the H a fully to to find the appropriate sample size. Chapter 7-16

17 But σ 2 is Unknown... As σ 2 is usually unknown, here are a few ways to make a guess. Make a small-sample pilot study to get an estimate of σ. Based on prior studies or knowledge about the experimental units, can you think of a range of plausible values for σ? If so, choose the biggest one. You could repeat the sample size calculations for various levels of σ to see how it affects the needed sample size. Chapter 7-17

18 How to Specify the H a? As the power of a test depends on the alternative hypothesis H a, that is, the α i s, one might has to try several sets of α i s to find the appropriate sample size. But how many H a s we have to try? Here is a useful trick. 1. Suppose we would be interested if any two means differed by D or more. 2. The smallest value of δ in this case is when two means differ by exactly, D, and the other g 2 means are halfway between. 3. So try α 1 = D/2, α 2 = D/2, and α i = 0 for all other τ. Assuming equal sample sizes, the non-centrality parameter is δ = i nαi 2 σ 2 = n(d2 /4 + D 2 /4) σ 2 = nd2 2σ 2. Chapter 7-18

19 Power and Sample Size Calculation for Complete Block Designs For complete block designs y ij = µ + α i + β j + ε ij y ijk = µ + α i + β j + γ k + ε ijk y ijkl = µ + α i + β j + γ k + δ l + ε ijkl RCBD Latin Square Graeco-Latin Square where α i s are the treatment effect, and β j, γ k, δ l are effects of blocking factors, when H a is true, treatments have different effects, the ANOVA F -statistic also has a non-central F -distribution F = MS trt MSE { F g 1, df of MSE under H 0 under H a F g 1, df of MSE,δ where the non-centrality parameter δ is δ = r i α2 i σ 2, where r = # of replicates per treatment Note that the df changes from N g to the df of MSE. Chapter 7-19

20 Power and Sample Size Calculation for Balanced Factorial Designs Section 10.3 in Oehlert s book We may ignore factorial structure and compute power and sample size for the overall null hypothesis of no model effects. We will address methods for balanced data only because most factorial experiments are designed to be balanced, and simple formulae Power for balanced data for noncentrality parameters exist only for balanced data. For factorial data, we usually test H 0 about main effects or interactions in addition to the overall H 0 of no model effects. Chapter 7-20

21 Non-Centrality Parameter for Balanced Factorial Designs Ex, consider a 3-way factorial design y ijkl = µ + α i + β j + γ k + αβ ij + βγ jk + αγ ij + αβγ ijk + ε ijkl to calculate the power of the F -test of a main effect or an interaction effect under H a, the non-centrality parameter can be calculated as follows: Square the factorial effects (using the zero-sum constraint) and sum them, Multiply this sum by the total number of data in the design divided by the number of levels in the effect, and Divide that product by the error variance. Chapter 7-21

22 E.g., consider a a b c factorial design with r replicates per treatment y ijkl = µ + α i + β j + γ k + αβ ij + βγ jk + αγ ij + αβγ ijk + ε ijkl To test about the B main effects H 0 : β 1 = = β b = 0 v.s. H a : not all β j are 0 the ANOVA F -statistic for B main-effect is F = MS B MSE { F b 1, df of MSE under H 0 under H a F b 1, df of MSE,δ where the non-centrality parameter δ is ( N ) / ( δ = b j β2 j σ 2 = rac ) / j β2 j σ 2. Here N = abcr is the total number of observations. Chapter 7-22

23 E.g., consider a a b c factorial design with r replicates per treatment y ijkl = µ + α i + β j + γ k + αβ ij + βγ jk + αγ ij + αβγ ijk + ε ijkl To test about the AB interactions H 0 : all αβ ij = 0 v.s. H a : not all αβ ij = 0 the ANOVA F -statistic for AB interactions F = MS AB MSE { F (a 1)(b 1), df of MSE under H 0 under H a F (a 1)(b 1), df of MSE,δ where the non-centrality parameter δ is ( N ) / ( δ = ab (αβ ij) 2 σ 2 = rc (αβ ij) 2) / σ 2. ij ij Here N = abcr is the total number of observations. Chapter 7-23

24 Sample Size Should Be The Last Thing To Consider In practice, sample size calculation is the last thing to do in designing an experiment. Consider the followings before calculating the required sample size. asking the right question, choosing an appropriate response, thinking about what factors need to be blocked on, choosing a right design, setting up the right procedure to recruit subjects, and so on, All the above are more important than doing the right sample size calculation! Sometimes, choosing a better design can substantially reduces the required sample size. Chapter 7-24

Factorial Treatment Structure: Part I. Lukas Meier, Seminar für Statistik

Factorial Treatment Structure: Part I. Lukas Meier, Seminar für Statistik Factorial Treatment Structure: Part I Lukas Meier, Seminar für Statistik Factorial Treatment Structure So far (in CRD), the treatments had no structure. So called factorial treatment structure exists if

More information

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Jurusan Teknik Industri Universitas Brawijaya Outline Introduction The Analysis of Variance Models for the Data Post-ANOVA Comparison of Means Sample

More information

STAT22200 Spring 2014 Chapter 8A

STAT22200 Spring 2014 Chapter 8A STAT22200 Spring 2014 Chapter 8A Yibi Huang May 13, 2014 81-86 Two-Way Factorial Designs Chapter 8A - 1 Problem 81 Sprouting Barley (p166 in Oehlert) Brewer s malt is produced from germinating barley,

More information

Two-Way Factorial Designs

Two-Way Factorial Designs 81-86 Two-Way Factorial Designs Yibi Huang 81-86 Two-Way Factorial Designs Chapter 8A - 1 Problem 81 Sprouting Barley (p166 in Oehlert) Brewer s malt is produced from germinating barley, so brewers like

More information

Lec 5: Factorial Experiment

Lec 5: Factorial Experiment November 21, 2011 Example Study of the battery life vs the factors temperatures and types of material. A: Types of material, 3 levels. B: Temperatures, 3 levels. Example Study of the battery life vs the

More information

STAT22200 Spring 2014 Chapter 13B

STAT22200 Spring 2014 Chapter 13B STAT22200 Spring 2014 Chapter 13B Yibi Huang May 27, 2014 13.3.1 Crossover Designs 13.3.4 Replicated Latin Square Designs 13.4 Graeco-Latin Squares Chapter 13B - 1 13.3.1 Crossover Design (A Special Latin-Square

More information

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1) Summary of Chapter 7 (Sections 7.2-7.5) and Chapter 8 (Section 8.1) Chapter 7. Tests of Statistical Hypotheses 7.2. Tests about One Mean (1) Test about One Mean Case 1: σ is known. Assume that X N(µ, σ

More information

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization. 1 Chapter 1: Research Design Principles The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization. 2 Chapter 2: Completely Randomized Design

More information

16.3 One-Way ANOVA: The Procedure

16.3 One-Way ANOVA: The Procedure 16.3 One-Way ANOVA: The Procedure Tom Lewis Fall Term 2009 Tom Lewis () 16.3 One-Way ANOVA: The Procedure Fall Term 2009 1 / 10 Outline 1 The background 2 Computing formulas 3 The ANOVA Identity 4 Tom

More information

Theorem A: Expectations of Sums of Squares Under the two-way ANOVA model, E(X i X) 2 = (µ i µ) 2 + n 1 n σ2

Theorem A: Expectations of Sums of Squares Under the two-way ANOVA model, E(X i X) 2 = (µ i µ) 2 + n 1 n σ2 identity Y ijk Ȳ = (Y ijk Ȳij ) + (Ȳi Ȳ ) + (Ȳ j Ȳ ) + (Ȳij Ȳi Ȳ j + Ȳ ) Theorem A: Expectations of Sums of Squares Under the two-way ANOVA model, (1) E(MSE) = E(SSE/[IJ(K 1)]) = (2) E(MSA) = E(SSA/(I

More information

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent: Activity #10: AxS ANOVA (Repeated subjects design) Resources: optimism.sav So far in MATH 300 and 301, we have studied the following hypothesis testing procedures: 1) Binomial test, sign-test, Fisher s

More information

Analysis of Variance

Analysis of Variance Analysis of Variance Math 36b May 7, 2009 Contents 2 ANOVA: Analysis of Variance 16 2.1 Basic ANOVA........................... 16 2.1.1 the model......................... 17 2.1.2 treatment sum of squares.................

More information

Chapter 11 - Lecture 1 Single Factor ANOVA

Chapter 11 - Lecture 1 Single Factor ANOVA Chapter 11 - Lecture 1 Single Factor ANOVA April 7th, 2010 Means Variance Sum of Squares Review In Chapter 9 we have seen how to make hypothesis testing for one population mean. In Chapter 10 we have seen

More information

STAT22200 Spring 2014 Chapter 14

STAT22200 Spring 2014 Chapter 14 STAT22200 Spring 2014 Chapter 14 Yibi Huang May 27, 2014 Chapter 14 Incomplete Block Designs 14.1 Balanced Incomplete Block Designs (BIBD) Chapter 14-1 Incomplete Block Designs A Brief Introduction to

More information

Analysis of Variance

Analysis of Variance Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also

More information

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.

COMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous. COMPLETELY RANDOM DESIGN (CRD) Description of the Design -Simplest design to use. -Design can be used when experimental units are essentially homogeneous. -Because of the homogeneity requirement, it may

More information

Power. Week 8: Lecture 1 STAT: / 48

Power. Week 8: Lecture 1 STAT: / 48 Power STAT:5201 Week 8: Lecture 1 1 / 48 Power We have already described Type I and II errors. Decision Reality/True state Accept H o Reject H o H o is true good Type I error H o is false Type II error

More information

Contents. TAMS38 - Lecture 6 Factorial design, Latin Square Design. Lecturer: Zhenxia Liu. Factorial design 3. Complete three factor design 4

Contents. TAMS38 - Lecture 6 Factorial design, Latin Square Design. Lecturer: Zhenxia Liu. Factorial design 3. Complete three factor design 4 Contents Factorial design TAMS38 - Lecture 6 Factorial design, Latin Square Design Lecturer: Zhenxia Liu Department of Mathematics - Mathematical Statistics 28 November, 2017 Complete three factor design

More information

Battery Life. Factory

Battery Life. Factory Statistics 354 (Fall 2018) Analysis of Variance: Comparing Several Means Remark. These notes are from an elementary statistics class and introduce the Analysis of Variance technique for comparing several

More information

Factorial designs. Experiments

Factorial designs. Experiments Chapter 5: Factorial designs Petter Mostad mostad@chalmers.se Experiments Actively making changes and observing the result, to find causal relationships. Many types of experimental plans Measuring response

More information

Lecture 9. ANOVA: Random-effects model, sample size

Lecture 9. ANOVA: Random-effects model, sample size Lecture 9. ANOVA: Random-effects model, sample size Jesper Rydén Matematiska institutionen, Uppsala universitet jesper@math.uu.se Regressions and Analysis of Variance fall 2015 Fixed or random? Is it reasonable

More information

Chapter 4: Randomized Blocks and Latin Squares

Chapter 4: Randomized Blocks and Latin Squares Chapter 4: Randomized Blocks and Latin Squares 1 Design of Engineering Experiments The Blocking Principle Blocking and nuisance factors The randomized complete block design or the RCBD Extension of the

More information

. Example: For 3 factors, sse = (y ijkt. " y ijk

. Example: For 3 factors, sse = (y ijkt.  y ijk ANALYSIS OF BALANCED FACTORIAL DESIGNS Estimates of model parameters and contrasts can be obtained by the method of Least Squares. Additional constraints must be added to estimate non-estimable parameters.

More information

Chapter 10: Analysis of variance (ANOVA)

Chapter 10: Analysis of variance (ANOVA) Chapter 10: Analysis of variance (ANOVA) ANOVA (Analysis of variance) is a collection of techniques for dealing with more general experiments than the previous one-sample or two-sample tests. We first

More information

STAT22200 Spring 2014 Chapter 5

STAT22200 Spring 2014 Chapter 5 STAT22200 Spring 2014 Chapter 5 Yibi Huang April 29, 2014 Chapter 5 Multiple Comparisons Chapter 5-1 Chapter 5 Multiple Comparisons Note the t-tests and C.I. s are constructed assuming we only do one test,

More information

CHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication

CHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication CHAPTER 4 Analysis of Variance One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication 1 Introduction In this chapter, expand the idea of hypothesis tests. We

More information

y = µj n + β 1 b β b b b + α 1 t α a t a + e

y = µj n + β 1 b β b b b + α 1 t α a t a + e The contributions of distinct sets of explanatory variables to the model are typically captured by breaking up the overall regression (or model) sum of squares into distinct components This is useful quite

More information

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Chap The McGraw-Hill Companies, Inc. All rights reserved. 11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview

More information

Design & Analysis of Experiments 7E 2009 Montgomery

Design & Analysis of Experiments 7E 2009 Montgomery 1 What If There Are More Than Two Factor Levels? The t-test does not directly apply ppy There are lots of practical situations where there are either more than two levels of interest, or there are several

More information

DESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

DESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya DESAIN EKSPERIMEN BLOCKING FACTORS Semester Genap Jurusan Teknik Industri Universitas Brawijaya Outline The Randomized Complete Block Design The Latin Square Design The Graeco-Latin Square Design Balanced

More information

Solutions to Final STAT 421, Fall 2008

Solutions to Final STAT 421, Fall 2008 Solutions to Final STAT 421, Fall 2008 Fritz Scholz 1. (8) Two treatments A and B were randomly assigned to 8 subjects (4 subjects to each treatment) with the following responses: 0, 1, 3, 6 and 5, 7,

More information

STATS Analysis of variance: ANOVA

STATS Analysis of variance: ANOVA STATS 1060 Analysis of variance: ANOVA READINGS: Chapters 28 of your text book (DeVeaux, Vellman and Bock); on-line notes for ANOVA; on-line practice problems for ANOVA NOTICE: You should print a copy

More information

Analysis of Variance and Design of Experiments-I

Analysis of Variance and Design of Experiments-I Analysis of Variance and Design of Experiments-I MODULE VIII LECTURE - 35 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS MODEL Dr. Shalabh Department of Mathematics and Statistics Indian

More information

STAT22200 Chapter 14

STAT22200 Chapter 14 STAT00 Chapter 4 Yibi Huang Chapter 4 Incomplete Block Designs 4. Balanced Incomplete Block Designs (BIBD) Chapter 4 - Incomplete Block Designs A Brief Introduction to a Class of Most Useful Designs in

More information

Chapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs

Chapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs Chapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs Adapted from Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis

More information

SIZE = Vehicle size: 1 small, 2 medium, 3 large. SIDE : 1 right side of car, 2 left side of car

SIZE = Vehicle size: 1 small, 2 medium, 3 large. SIDE : 1 right side of car, 2 left side of car THREE-WAY ANOVA MODELS (CHAPTER 7) Consider a completely randomized design for an experiment with three treatment factors A, B and C. We will assume that every combination of levels of A, B and C is observed

More information

Week 14 Comparing k(> 2) Populations

Week 14 Comparing k(> 2) Populations Week 14 Comparing k(> 2) Populations Week 14 Objectives Methods associated with testing for the equality of k(> 2) means or proportions are presented. Post-testing concepts and analysis are introduced.

More information

Factorial ANOVA. Psychology 3256

Factorial ANOVA. Psychology 3256 Factorial ANOVA Psychology 3256 Made up data.. Say you have collected data on the effects of Retention Interval 5 min 1 hr 24 hr on memory So, you do the ANOVA and conclude that RI affects memory % corr

More information

PLSC PRACTICE TEST ONE

PLSC PRACTICE TEST ONE PLSC 724 - PRACTICE TEST ONE 1. Discuss briefly the relationship between the shape of the normal curve and the variance. 2. What is the relationship between a statistic and a parameter? 3. How is the α

More information

Stat 217 Final Exam. Name: May 1, 2002

Stat 217 Final Exam. Name: May 1, 2002 Stat 217 Final Exam Name: May 1, 2002 Problem 1. Three brands of batteries are under study. It is suspected that the lives (in weeks) of the three brands are different. Five batteries of each brand are

More information

Part 1.) We know that the probability of any specific x only given p ij = p i p j is just multinomial(n, p) where p k1 k 2

Part 1.) We know that the probability of any specific x only given p ij = p i p j is just multinomial(n, p) where p k1 k 2 Problem.) I will break this into two parts: () Proving w (m) = p( x (m) X i = x i, X j = x j, p ij = p i p j ). In other words, the probability of a specific table in T x given the row and column counts

More information

Sample Size / Power Calculations

Sample Size / Power Calculations Sample Size / Power Calculations A Simple Example Goal: To study the effect of cold on blood pressure (mmhg) in rats Use a Completely Randomized Design (CRD): 12 rats are randomly assigned to one of two

More information

STAT Final Practice Problems

STAT Final Practice Problems STAT 48 -- Final Practice Problems.Out of 5 women who had uterine cancer, 0 claimed to have used estrogens. Out of 30 women without uterine cancer 5 claimed to have used estrogens. Exposure Outcome (Cancer)

More information

3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is

3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is Stat 501 Solutions and Comments on Exam 1 Spring 005-4 0-4 1. (a) (5 points) Y ~ N, -1-4 34 (b) (5 points) X (X,X ) = (5,8) ~ N ( 11.5, 0.9375 ) 3 1 (c) (10 points, for each part) (i), (ii), and (v) are

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness

More information

Unit 12: Analysis of Single Factor Experiments

Unit 12: Analysis of Single Factor Experiments Unit 12: Analysis of Single Factor Experiments Statistics 571: Statistical Methods Ramón V. León 7/16/2004 Unit 12 - Stat 571 - Ramón V. León 1 Introduction Chapter 8: How to compare two treatments. Chapter

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 03 The Chi-Square Distributions Dr. Neal, Spring 009 The chi-square distributions can be used in statistics to analyze the standard deviation of a normally distributed measurement and to test the

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

Chapter 11 - Lecture 1 Single Factor ANOVA

Chapter 11 - Lecture 1 Single Factor ANOVA April 5, 2013 Chapter 9 : hypothesis testing for one population mean. Chapter 10: hypothesis testing for two population means. What comes next? Chapter 9 : hypothesis testing for one population mean. Chapter

More information

Much of the material we will be covering for a while has to do with designing an experimental study that concerns some phenomenon of interest.

Much of the material we will be covering for a while has to do with designing an experimental study that concerns some phenomenon of interest. Experimental Design: Much of the material we will be covering for a while has to do with designing an experimental study that concerns some phenomenon of interest We wish to use our subjects in the best

More information

21.0 Two-Factor Designs

21.0 Two-Factor Designs 21.0 Two-Factor Designs Answer Questions 1 RCBD Concrete Example Two-Way ANOVA Popcorn Example 21.4 RCBD 2 The Randomized Complete Block Design is also known as the two-way ANOVA without interaction. A

More information

Random and Mixed Effects Models - Part III

Random and Mixed Effects Models - Part III Random and Mixed Effects Models - Part III Statistics 149 Spring 2006 Copyright 2006 by Mark E. Irwin Quasi-F Tests When we get to more than two categorical factors, some times there are not nice F tests

More information

STAT 525 Fall Final exam. Tuesday December 14, 2010

STAT 525 Fall Final exam. Tuesday December 14, 2010 STAT 525 Fall 2010 Final exam Tuesday December 14, 2010 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

20.0 Experimental Design

20.0 Experimental Design 20.0 Experimental Design Answer Questions 1 Philosophy One-Way ANOVA Egg Sample Multiple Comparisons 20.1 Philosophy Experiments are often expensive and/or dangerous. One wants to use good techniques that

More information

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

Difference in two or more average scores in different groups

Difference in two or more average scores in different groups ANOVAs Analysis of Variance (ANOVA) Difference in two or more average scores in different groups Each participant tested once Same outcome tested in each group Simplest is one-way ANOVA (one variable as

More information

Nesting and Mixed Effects: Part I. Lukas Meier, Seminar für Statistik

Nesting and Mixed Effects: Part I. Lukas Meier, Seminar für Statistik Nesting and Mixed Effects: Part I Lukas Meier, Seminar für Statistik Where do we stand? So far: Fixed effects Random effects Both in the factorial context Now: Nested factor structure Mixed models: a combination

More information

Topic 9: Factorial treatment structures. Introduction. Terminology. Example of a 2x2 factorial

Topic 9: Factorial treatment structures. Introduction. Terminology. Example of a 2x2 factorial Topic 9: Factorial treatment structures Introduction A common objective in research is to investigate the effect of each of a number of variables, or factors, on some response variable. In earlier times,

More information

Keppel, G. & Wickens, T.D. Design and Analysis Chapter 2: Sources of Variability and Sums of Squares

Keppel, G. & Wickens, T.D. Design and Analysis Chapter 2: Sources of Variability and Sums of Squares Keppel, G. & Wickens, T.D. Design and Analysis Chapter 2: Sources of Variability and Sums of Squares K&W introduce the notion of a simple experiment with two conditions. Note that the raw data (p. 16)

More information

Analysis Of Variance Compiled by T.O. Antwi-Asare, U.G

Analysis Of Variance Compiled by T.O. Antwi-Asare, U.G Analysis Of Variance Compiled by T.O. Antwi-Asare, U.G 1 ANOVA Analysis of variance compares two or more population means of interval data. Specifically, we are interested in determining whether differences

More information

Unit 27 One-Way Analysis of Variance

Unit 27 One-Way Analysis of Variance Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied

More information

Statistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts

Statistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts Statistical methods for comparing multiple groups Lecture 7: ANOVA Sandy Eckel seckel@jhsph.edu 30 April 2008 Continuous data: comparing multiple means Analysis of variance Binary data: comparing multiple

More information

STAT Chapter 10: Analysis of Variance

STAT Chapter 10: Analysis of Variance STAT 515 -- Chapter 10: Analysis of Variance Designed Experiment A study in which the researcher controls the levels of one or more variables to determine their effect on the variable of interest (called

More information

ANOVA Analysis of Variance

ANOVA Analysis of Variance ANOVA Analysis of Variance ANOVA Analysis of Variance Extends independent samples t test ANOVA Analysis of Variance Extends independent samples t test Compares the means of groups of independent observations

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Motivations for the ANOVA We defined the F-distribution, this is mainly used in

More information

ANOVA (Analysis of Variance) output RLS 11/20/2016

ANOVA (Analysis of Variance) output RLS 11/20/2016 ANOVA (Analysis of Variance) output RLS 11/20/2016 1. Analysis of Variance (ANOVA) The goal of ANOVA is to see if the variation in the data can explain enough to see if there are differences in the means.

More information

TWO OR MORE RANDOM EFFECTS. The two-way complete model for two random effects:

TWO OR MORE RANDOM EFFECTS. The two-way complete model for two random effects: TWO OR MORE RANDOM EFFECTS Example: The factors that influence the breaking strength of a synthetic fiber are being studied. Four production machines and three operators are randomly selected. A two-way

More information

Statistics For Economics & Business

Statistics For Economics & Business Statistics For Economics & Business Analysis of Variance In this chapter, you learn: Learning Objectives The basic concepts of experimental design How to use one-way analysis of variance to test for differences

More information

MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS FALL 1999

MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS FALL 1999 MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS 350 - FALL 1999 Instructor: A. Oyet Date: December 16, 1999 Name(Surname First): Student Number INSTRUCTIONS

More information

One-Way ANOVA Calculations: In-Class Exercise Psychology 311 Spring, 2013

One-Way ANOVA Calculations: In-Class Exercise Psychology 311 Spring, 2013 One-Way ANOVA Calculations: In-Class Exercise Psychology 311 Spring, 2013 1. You are planning an experiment that will involve 4 equally sized groups, including 3 experimental groups and a control. Each

More information

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables. One-Way Analysis of Variance With regression, we related two quantitative, typically continuous variables. Often we wish to relate a quantitative response variable with a qualitative (or simply discrete)

More information

One-way ANOVA (Single-Factor CRD)

One-way ANOVA (Single-Factor CRD) One-way ANOVA (Single-Factor CRD) STAT:5201 Week 3: Lecture 3 1 / 23 One-way ANOVA We have already described a completed randomized design (CRD) where treatments are randomly assigned to EUs. There is

More information

STAT 506: Randomized complete block designs

STAT 506: Randomized complete block designs STAT 506: Randomized complete block designs Timothy Hanson Department of Statistics, University of South Carolina STAT 506: Introduction to Experimental Design 1 / 10 Randomized complete block designs

More information

Ma 3/103: Lecture 25 Linear Regression II: Hypothesis Testing and ANOVA

Ma 3/103: Lecture 25 Linear Regression II: Hypothesis Testing and ANOVA Ma 3/103: Lecture 25 Linear Regression II: Hypothesis Testing and ANOVA March 6, 2017 KC Border Linear Regression II March 6, 2017 1 / 44 1 OLS estimator 2 Restricted regression 3 Errors in variables 4

More information

Multiple Testing. Gary W. Oehlert. January 28, School of Statistics University of Minnesota

Multiple Testing. Gary W. Oehlert. January 28, School of Statistics University of Minnesota Multiple Testing Gary W. Oehlert School of Statistics University of Minnesota January 28, 2016 Background Suppose that you had a 20-sided die. Nineteen of the sides are labeled 0 and one of the sides is

More information

Lecture 3: Inference in SLR

Lecture 3: Inference in SLR Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals

More information

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1 Notes for Wee 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1 Exam 3 is on Friday May 1. A part of one of the exam problems is on Predictiontervals : When randomly sampling from a normal population

More information

Multiple comparisons The problem with the one-pair-at-a-time approach is its error rate.

Multiple comparisons The problem with the one-pair-at-a-time approach is its error rate. Multiple comparisons The problem with the one-pair-at-a-time approach is its error rate. Each confidence interval has a 95% probability of making a correct statement, and hence a 5% probability of making

More information

Allow the investigation of the effects of a number of variables on some response

Allow the investigation of the effects of a number of variables on some response Lecture 12 Topic 9: Factorial treatment structures (Part I) Factorial experiments Allow the investigation of the effects of a number of variables on some response in a highly efficient manner, and in a

More information

One-Way ANOVA Source Table J - 1 SS B / J - 1 MS B /MS W. Pairwise Post-Hoc Comparisons of Means

One-Way ANOVA Source Table J - 1 SS B / J - 1 MS B /MS W. Pairwise Post-Hoc Comparisons of Means One-Way ANOVA Source Table ANOVA MODEL: ij = µ* + α j + ε ij H 0 : µ 1 = µ =... = µ j or H 0 : Σα j = 0 Source Sum of Squares df Mean Squares F Between Groups n j ( j - * ) J - 1 SS B / J - 1 MS B /MS

More information

Lec 1: An Introduction to ANOVA

Lec 1: An Introduction to ANOVA Ying Li Stockholm University October 31, 2011 Three end-aisle displays Which is the best? Design of the Experiment Identify the stores of the similar size and type. The displays are randomly assigned to

More information

Topic 7: Incomplete, double-blocked designs: Latin Squares [ST&D sections ]

Topic 7: Incomplete, double-blocked designs: Latin Squares [ST&D sections ] Topic 7: Incomplete, double-blocked designs: Latin Squares [ST&D sections 9.10 9.15] 7.1. Introduction The Randomized Complete Block Design is commonly used to improve the ability of an experiment to detect

More information

Analysis of Variance

Analysis of Variance Analysis of Variance Blood coagulation time T avg A 62 60 63 59 61 B 63 67 71 64 65 66 66 C 68 66 71 67 68 68 68 D 56 62 60 61 63 64 63 59 61 64 Blood coagulation time A B C D Combined 56 57 58 59 60 61

More information

One-way ANOVA. Experimental Design. One-way ANOVA

One-way ANOVA. Experimental Design. One-way ANOVA Method to compare more than two samples simultaneously without inflating Type I Error rate (α) Simplicity Few assumptions Adequate for highly complex hypothesis testing 09/30/12 1 Outline of this class

More information

Week 12 Hypothesis Testing, Part II Comparing Two Populations

Week 12 Hypothesis Testing, Part II Comparing Two Populations Week 12 Hypothesis Testing, Part II Week 12 Hypothesis Testing, Part II Week 12 Objectives 1 The principle of Analysis of Variance is introduced and used to derive the F-test for testing the model utility

More information

Your schedule of coming weeks. One-way ANOVA, II. Review from last time. Review from last time /22/2004. Create ANOVA table

Your schedule of coming weeks. One-way ANOVA, II. Review from last time. Review from last time /22/2004. Create ANOVA table Your schedule of coming weeks One-way ANOVA, II 9.07 //00 Today: One-way ANOVA, part II Next week: Two-way ANOVA, parts I and II. One-way ANOVA HW due Thursday Week of May Teacher out of town all week

More information

EXST Regression Techniques Page 1. We can also test the hypothesis H :" œ 0 versus H :"

EXST Regression Techniques Page 1. We can also test the hypothesis H : œ 0 versus H : EXST704 - Regression Techniques Page 1 Using F tests instead of t-tests We can also test the hypothesis H :" œ 0 versus H :" Á 0 with an F test.! " " " F œ MSRegression MSError This test is mathematically

More information

If we have many sets of populations, we may compare the means of populations in each set with one experiment.

If we have many sets of populations, we may compare the means of populations in each set with one experiment. Statistical Methods in Business Lecture 3. Factorial Design: If we have many sets of populations we may compare the means of populations in each set with one experiment. Assume we have two factors with

More information

Lecture 7: Hypothesis Testing and ANOVA

Lecture 7: Hypothesis Testing and ANOVA Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis

More information

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV Theory of Engineering Experimentation Chapter IV. Decision Making for a Single Sample Chapter IV 1 4 1 Statistical Inference The field of statistical inference consists of those methods used to make decisions

More information

2 Hand-out 2. Dr. M. P. M. M. M c Loughlin Revised 2018

2 Hand-out 2. Dr. M. P. M. M. M c Loughlin Revised 2018 Math 403 - P. & S. III - Dr. McLoughlin - 1 2018 2 Hand-out 2 Dr. M. P. M. M. M c Loughlin Revised 2018 3. Fundamentals 3.1. Preliminaries. Suppose we can produce a random sample of weights of 10 year-olds

More information

Note: The problem numbering below may not reflect actual numbering in DGE.

Note: The problem numbering below may not reflect actual numbering in DGE. Stat664 Year 1999 DGE Note: The problem numbering below may not reflect actual numbering in DGE. 1. For a balanced one-way random effect model, (a) write down the model and assumptions; (b) write down

More information

Stat 6640 Solution to Midterm #2

Stat 6640 Solution to Midterm #2 Stat 6640 Solution to Midterm #2 1. A study was conducted to examine how three statistical software packages used in a statistical course affect the statistical competence a student achieves. At the end

More information

IEOR165 Discussion Week 12

IEOR165 Discussion Week 12 IEOR165 Discussion Week 12 Sheng Liu University of California, Berkeley Apr 15, 2016 Outline 1 Type I errors & Type II errors 2 Multiple Testing 3 ANOVA IEOR165 Discussion Sheng Liu 2 Type I errors & Type

More information

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test 1 Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test Learning Objectives After this section, you should be able to DESCRIBE the relationship between the significance level of a test, P(Type

More information

STAT 536: Genetic Statistics

STAT 536: Genetic Statistics STAT 536: Genetic Statistics Tests for Hardy Weinberg Equilibrium Karin S. Dorman Department of Statistics Iowa State University September 7, 2006 Statistical Hypothesis Testing Identify a hypothesis,

More information

Relating Graph to Matlab

Relating Graph to Matlab There are two related course documents on the web Probability and Statistics Review -should be read by people without statistics background and it is helpful as a review for those with prior statistics

More information

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal Email: yuppal@ysu.edu Chapter 13, Part A: Analysis of Variance and Experimental Design Introduction to Analysis of Variance Analysis

More information