Power & Sample Size Calculation
|
|
- Frederick Burke
- 5 years ago
- Views:
Transcription
1 Chapter 7 Power & Sample Size Calculation Yibi Huang Chapter 7 Section 10.3 Power & Sample Size Calculation for CRDs Power & Sample Size for Factorial Designs Chapter 7-1
2 Power & Sample Size Calculation When proposing an experiment (applying for funding etc), nowadays one needs to show that the proposed sample size (i.e. the number of experiment units) is neither so small that scientifically interesting effects will be swamped by random variation (i.e., has enough power to reject a false H 0 ) nor larger than necessary, wasting resources (time & money) Chapter 7-2
3 Errors and Power in Hypothesis Testing A Type I error occurs when H 0 is true but is rejected. A Type II error occurs when H 0 is false but is accepted. The (significance) level of a test is the chance of making a Type I error, i.e., the chance to reject a H 0 when it is true. The power of the test is the the chance of rejecting H 0 when H a is true: power = 1 P(making type II error H 0 is false) = 1 β = P(correctly reject H 0 H 0 is false) A good test should have small test-level and large power. H 0 is true H 0 is false H a is rejected H 0 is rejected (H 0 is accepted) (H a is accepted) α = P(Type I error) β = P(Type II error) Chapter 7-3
4 Power Calculation for ANOVA For a CRD design with g treatments, recall the means model: y ij = µ i + ε ij and the effects model: y ij = µ + α i + ε ij, for i = 1,..., g, and j = 1,..., n i. The two models are related via the relationship µ i = µ + α i. For CRDs, we use the means model more often. However, for power and sample size calculation, we need to use the effects model and µ must be defined as µ = 1 g n i µ i = 1 g n i µ i. N N i=1 j=1 which implies that g i=1 n iα i = 0. Here N = g i=1 n i is the total number of experimental units. Chapter 7-4 i=1
5 Recall the H 0 and H a for the ANOVA F test are H 0 : µ 1 = = µ g v.s. H a : µ i s not all equal, in terms of the means model, or equivalently in terms of the effects model, they are H 0 : α 1 = = α g = 0 v.s. H a : Not all α i s are 0. Recall we reject H 0 at level α if the test statistic F = MS Trt MSE exceeds the critical value F α,g 1,N g. So Power = P(Reject H 0 H a is true) = P(F > F α, g 1, N g ). To find P(F > F α, g 1, N g ), we must know the distribution of F. What is the distribution of F under H 0? F g 1,N g. And under H a? Chapter 7-5
6 For F = MS Trt MSE = SS Trt/(g 1), recall in Ch3 we ve shown that SSE/(N g) no matter under H 0 or H a, it is always true that SSE σ 2 χ2 N g, and E(MSE) = σ2 For the numerator, SS Trt /σ 2 has a χ 2 g 1 distribution under H 0. But under H a, it has a non-central Chi-square distribution χ 2 g 1,δ with non-central parameter Moreover, δ = g i=1 n iα 2 i σ 2. E(SS Trt ) = (g 1)σ 2 + g n iα 2 i=1 i }{{} δσ { 2 (g 1)σ 2 under H 0 = (g 1 + δ)σ 2 under H a Chapter 7-6
7 Non-Central F -Distribution Under H a, it can be shown that F = MS Trt MSE has a non-central F -distribution on degrees of freedom g 1 and N g, with non-centrality parameter δ, denoted as g i=1 F F g 1,N g,δ where δ = n iαi 2 σ 2. Recall that α i = µ i µ, and µ = 1 N g n i µ i = 1 N i=1 j=1 g n i µ i. Non-central F -distribution is also right skewed, and the greater the non-centrality parameter δ, the further away the peak of the distribution from 0. Chapter 7-7 i=1
8 Example 1: Power Calculation (1) g = 5 treatment groups group sizes: n 1 = n 2 = n 3 = 5, n 4 = 6, n 5 = 4 assume σ = 0.8 desired significance level α = 0.05 find the power of the test when H a is true with µ 1 = 1.6, µ 2 = 0.6, µ 3 = 2, µ 4 = 0, µ 5 = 1. Solution. The grand mean µ is g i=1 µ = n iµ i = N = = 1 The α i s are thus α i = µ i µ = µ i 1: α 1 = 0.6, α 2 = 0.4, α 3 = 1, α 4 = 1, α 5 = 0. Chapter 7-8
9 Example 1: Power Calculation (2) The non-centrality parameter is g i=1 δ = n iαi 2 σ 2 = ( 0.4) ( 1) = = So F = MS Trt MSE { F g 1, N g = F 4, 25 5 under H 0 F g 1, N g,δ = F 4, 25 5, under H a H 0 : F F 4,20 H a : F F 4,20, Chapter 7-9
10 Example 1: Power Calculation (3) The critical value to reject H 0 keeping the significance level at α = 0.05 is F 0.05,4, > qf(0.05,4,20, lower.tail=f) [1] When H 0 is true, the chance that H 0 is rejected is only 0.05 (the red shaded area.) Accept H 0 Reject H 0 F 0.05,4,20 Area = Reminder: Don t confuse the following two: in most STAT books, F α,df 1,df 2 means the area of the upper tail is α. in R, qf(alpha, df1, df2) means the the area of the lower Chapter 7-10
11 Example 1: Power Calculation (4) If H a is true and µ 1 = 1.6, µ 2 = 0.6, µ 3 = 2, µ 4 = 0, µ 5 = 1, we know then F F 4,20,δ= The power to reject H 0 is the area under the density of F 4,20,δ=21.25 beyond the critical value (the blue shaded area,) which is > pf(qf(.95,4,20),4,20, ncp=21.25, lower.tail=f) [1] In the R codes, ncp means the non-centrality parameter Accept H 0 Reject H 0 Power F 0.05,4, Chapter 7-11
12 Example 1: Power Calculation (5) Remark The power of a test is not a single value, but a function of the parameters in H a. If parameter µ i s change their values, the power of the test also changes. It makes no sense to talk about the power of a test without specifying the parameters in H a Chapter 7-12
13 Power Of A Test Is Affected By... The larger the non-centrality parameter the greater the power. δ = The power of a test will increase if g i=1 n iα 2 i σ 2, the number of replicate n i per treatment increases the magnitude of treatment effects α i s increase (under the constraint i n iα i = 0) the size of noise σ 2 decreases. Chapter 7-13
14 Example 2: Sample Size Calculation (1) g = 5 treatment groups of equal sample size n i = n for all i assume σ = 0.8 desired significance level α = 0.05 Assuming the design is balanced that all groups have identical sample size n, what is the minimal sample size n per treatment to have power 0.95 when α 1 = 0.5, α 2 = 0.5, α 3 = 1, α 4 = 1, α 5 = 0? Solution. The non-centrality parameter is g i=1 δ = n iαi 2 σ 2 = n [ ( 0.5) ( 1) ] = n 2.5 = 3.9n 0.82 So F = MS Trt MSE { F g 1, N g = F 4, 5n 5 under H 0 F g 1, N g,δ = F 4, 5n 5, 3.9n under H a Chapter 7-14
15 Recall the critical value F for rejecting H 0 at level α = 0.05 is F = F α, g 1, N g = F 0.05, 4, 5n 5 which can be find in R via the command > F.crit = qf(alpha, g-1, N-g, lower.tail=f) # syntax > F.crit = qf(0.05, 4, 5*n-5, lower.tail=f) # sub-in the values By definition, Power = P(reject H 0 H a is true) = P(the non central F statistic F ) = P(F (g 1, N g, δ) F ) = P(F (4, 5n 5, 3.9n) F ) which can be found in R via the command > pf(f.crit, g-1, N-g, ncp=delta, lower.tail=f) # syntax > pf(f.crit, 4, 5*n-5, ncp=3.9*n, lower.tail=f) # sub-in values In the R codes, ncp means the non-centrality parameter. Chapter 7-15
16 Now we find the R code to find the power of the ANOVA F -test when n is known. Let s plug in different values of n and see what is the smallest n to make power > F.crit = qf(alpha, g-1, N-g, lower.tail=f) > pf(f.crit, g-1, N-g, ncp=delta, lower.tail=f) # syntax > n = 5 > F.crit = qf(0.05, 4, 5*n-5, lower.tail=f) > pf(f.crit, 4, 5*n-5, ncp=3.9*n, lower.tail=f) [1] # less than 0.95, not high enough > n = 6 > F.crit = qf(0.05, 4, 5*n-5, lower.tail=f) > pf(f.crit, 4, 5*n-5, ncp=3.9*n, lower.tail=f) [1] # greater than 0.95, bingo! So we need 6 replicates in each of the 5 treatment groups to ensure a power of 0.95 when α 1 = 0.5, α 2 = 0.5, α 3 = 1, α 4 = 1, α 5 = 0. Remark: Again, we have to specify the H a fully to to find the appropriate sample size. Chapter 7-16
17 But σ 2 is Unknown... As σ 2 is usually unknown, here are a few ways to make a guess. Make a small-sample pilot study to get an estimate of σ. Based on prior studies or knowledge about the experimental units, can you think of a range of plausible values for σ? If so, choose the biggest one. You could repeat the sample size calculations for various levels of σ to see how it affects the needed sample size. Chapter 7-17
18 How to Specify the H a? As the power of a test depends on the alternative hypothesis H a, that is, the α i s, one might has to try several sets of α i s to find the appropriate sample size. But how many H a s we have to try? Here is a useful trick. 1. Suppose we would be interested if any two means differed by D or more. 2. The smallest value of δ in this case is when two means differ by exactly, D, and the other g 2 means are halfway between. 3. So try α 1 = D/2, α 2 = D/2, and α i = 0 for all other τ. Assuming equal sample sizes, the non-centrality parameter is δ = i nαi 2 σ 2 = n(d2 /4 + D 2 /4) σ 2 = nd2 2σ 2. Chapter 7-18
19 Power and Sample Size Calculation for Complete Block Designs For complete block designs y ij = µ + α i + β j + ε ij y ijk = µ + α i + β j + γ k + ε ijk y ijkl = µ + α i + β j + γ k + δ l + ε ijkl RCBD Latin Square Graeco-Latin Square where α i s are the treatment effect, and β j, γ k, δ l are effects of blocking factors, when H a is true, treatments have different effects, the ANOVA F -statistic also has a non-central F -distribution F = MS trt MSE { F g 1, df of MSE under H 0 under H a F g 1, df of MSE,δ where the non-centrality parameter δ is δ = r i α2 i σ 2, where r = # of replicates per treatment Note that the df changes from N g to the df of MSE. Chapter 7-19
20 Power and Sample Size Calculation for Balanced Factorial Designs Section 10.3 in Oehlert s book We may ignore factorial structure and compute power and sample size for the overall null hypothesis of no model effects. We will address methods for balanced data only because most factorial experiments are designed to be balanced, and simple formulae Power for balanced data for noncentrality parameters exist only for balanced data. For factorial data, we usually test H 0 about main effects or interactions in addition to the overall H 0 of no model effects. Chapter 7-20
21 Non-Centrality Parameter for Balanced Factorial Designs Ex, consider a 3-way factorial design y ijkl = µ + α i + β j + γ k + αβ ij + βγ jk + αγ ij + αβγ ijk + ε ijkl to calculate the power of the F -test of a main effect or an interaction effect under H a, the non-centrality parameter can be calculated as follows: Square the factorial effects (using the zero-sum constraint) and sum them, Multiply this sum by the total number of data in the design divided by the number of levels in the effect, and Divide that product by the error variance. Chapter 7-21
22 E.g., consider a a b c factorial design with r replicates per treatment y ijkl = µ + α i + β j + γ k + αβ ij + βγ jk + αγ ij + αβγ ijk + ε ijkl To test about the B main effects H 0 : β 1 = = β b = 0 v.s. H a : not all β j are 0 the ANOVA F -statistic for B main-effect is F = MS B MSE { F b 1, df of MSE under H 0 under H a F b 1, df of MSE,δ where the non-centrality parameter δ is ( N ) / ( δ = b j β2 j σ 2 = rac ) / j β2 j σ 2. Here N = abcr is the total number of observations. Chapter 7-22
23 E.g., consider a a b c factorial design with r replicates per treatment y ijkl = µ + α i + β j + γ k + αβ ij + βγ jk + αγ ij + αβγ ijk + ε ijkl To test about the AB interactions H 0 : all αβ ij = 0 v.s. H a : not all αβ ij = 0 the ANOVA F -statistic for AB interactions F = MS AB MSE { F (a 1)(b 1), df of MSE under H 0 under H a F (a 1)(b 1), df of MSE,δ where the non-centrality parameter δ is ( N ) / ( δ = ab (αβ ij) 2 σ 2 = rc (αβ ij) 2) / σ 2. ij ij Here N = abcr is the total number of observations. Chapter 7-23
24 Sample Size Should Be The Last Thing To Consider In practice, sample size calculation is the last thing to do in designing an experiment. Consider the followings before calculating the required sample size. asking the right question, choosing an appropriate response, thinking about what factors need to be blocked on, choosing a right design, setting up the right procedure to recruit subjects, and so on, All the above are more important than doing the right sample size calculation! Sometimes, choosing a better design can substantially reduces the required sample size. Chapter 7-24
Factorial Treatment Structure: Part I. Lukas Meier, Seminar für Statistik
Factorial Treatment Structure: Part I Lukas Meier, Seminar für Statistik Factorial Treatment Structure So far (in CRD), the treatments had no structure. So called factorial treatment structure exists if
More informationDESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya
DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Jurusan Teknik Industri Universitas Brawijaya Outline Introduction The Analysis of Variance Models for the Data Post-ANOVA Comparison of Means Sample
More informationSTAT22200 Spring 2014 Chapter 8A
STAT22200 Spring 2014 Chapter 8A Yibi Huang May 13, 2014 81-86 Two-Way Factorial Designs Chapter 8A - 1 Problem 81 Sprouting Barley (p166 in Oehlert) Brewer s malt is produced from germinating barley,
More informationTwo-Way Factorial Designs
81-86 Two-Way Factorial Designs Yibi Huang 81-86 Two-Way Factorial Designs Chapter 8A - 1 Problem 81 Sprouting Barley (p166 in Oehlert) Brewer s malt is produced from germinating barley, so brewers like
More informationLec 5: Factorial Experiment
November 21, 2011 Example Study of the battery life vs the factors temperatures and types of material. A: Types of material, 3 levels. B: Temperatures, 3 levels. Example Study of the battery life vs the
More informationSTAT22200 Spring 2014 Chapter 13B
STAT22200 Spring 2014 Chapter 13B Yibi Huang May 27, 2014 13.3.1 Crossover Designs 13.3.4 Replicated Latin Square Designs 13.4 Graeco-Latin Squares Chapter 13B - 1 13.3.1 Crossover Design (A Special Latin-Square
More informationSummary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)
Summary of Chapter 7 (Sections 7.2-7.5) and Chapter 8 (Section 8.1) Chapter 7. Tests of Statistical Hypotheses 7.2. Tests about One Mean (1) Test about One Mean Case 1: σ is known. Assume that X N(µ, σ
More informationThe legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.
1 Chapter 1: Research Design Principles The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization. 2 Chapter 2: Completely Randomized Design
More information16.3 One-Way ANOVA: The Procedure
16.3 One-Way ANOVA: The Procedure Tom Lewis Fall Term 2009 Tom Lewis () 16.3 One-Way ANOVA: The Procedure Fall Term 2009 1 / 10 Outline 1 The background 2 Computing formulas 3 The ANOVA Identity 4 Tom
More informationTheorem A: Expectations of Sums of Squares Under the two-way ANOVA model, E(X i X) 2 = (µ i µ) 2 + n 1 n σ2
identity Y ijk Ȳ = (Y ijk Ȳij ) + (Ȳi Ȳ ) + (Ȳ j Ȳ ) + (Ȳij Ȳi Ȳ j + Ȳ ) Theorem A: Expectations of Sums of Squares Under the two-way ANOVA model, (1) E(MSE) = E(SSE/[IJ(K 1)]) = (2) E(MSA) = E(SSA/(I
More informationIn a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:
Activity #10: AxS ANOVA (Repeated subjects design) Resources: optimism.sav So far in MATH 300 and 301, we have studied the following hypothesis testing procedures: 1) Binomial test, sign-test, Fisher s
More informationAnalysis of Variance
Analysis of Variance Math 36b May 7, 2009 Contents 2 ANOVA: Analysis of Variance 16 2.1 Basic ANOVA........................... 16 2.1.1 the model......................... 17 2.1.2 treatment sum of squares.................
More informationChapter 11 - Lecture 1 Single Factor ANOVA
Chapter 11 - Lecture 1 Single Factor ANOVA April 7th, 2010 Means Variance Sum of Squares Review In Chapter 9 we have seen how to make hypothesis testing for one population mean. In Chapter 10 we have seen
More informationSTAT22200 Spring 2014 Chapter 14
STAT22200 Spring 2014 Chapter 14 Yibi Huang May 27, 2014 Chapter 14 Incomplete Block Designs 14.1 Balanced Incomplete Block Designs (BIBD) Chapter 14-1 Incomplete Block Designs A Brief Introduction to
More informationAnalysis of Variance
Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also
More informationCOMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.
COMPLETELY RANDOM DESIGN (CRD) Description of the Design -Simplest design to use. -Design can be used when experimental units are essentially homogeneous. -Because of the homogeneity requirement, it may
More informationPower. Week 8: Lecture 1 STAT: / 48
Power STAT:5201 Week 8: Lecture 1 1 / 48 Power We have already described Type I and II errors. Decision Reality/True state Accept H o Reject H o H o is true good Type I error H o is false Type II error
More informationContents. TAMS38 - Lecture 6 Factorial design, Latin Square Design. Lecturer: Zhenxia Liu. Factorial design 3. Complete three factor design 4
Contents Factorial design TAMS38 - Lecture 6 Factorial design, Latin Square Design Lecturer: Zhenxia Liu Department of Mathematics - Mathematical Statistics 28 November, 2017 Complete three factor design
More informationBattery Life. Factory
Statistics 354 (Fall 2018) Analysis of Variance: Comparing Several Means Remark. These notes are from an elementary statistics class and introduce the Analysis of Variance technique for comparing several
More informationFactorial designs. Experiments
Chapter 5: Factorial designs Petter Mostad mostad@chalmers.se Experiments Actively making changes and observing the result, to find causal relationships. Many types of experimental plans Measuring response
More informationLecture 9. ANOVA: Random-effects model, sample size
Lecture 9. ANOVA: Random-effects model, sample size Jesper Rydén Matematiska institutionen, Uppsala universitet jesper@math.uu.se Regressions and Analysis of Variance fall 2015 Fixed or random? Is it reasonable
More informationChapter 4: Randomized Blocks and Latin Squares
Chapter 4: Randomized Blocks and Latin Squares 1 Design of Engineering Experiments The Blocking Principle Blocking and nuisance factors The randomized complete block design or the RCBD Extension of the
More information. Example: For 3 factors, sse = (y ijkt. " y ijk
ANALYSIS OF BALANCED FACTORIAL DESIGNS Estimates of model parameters and contrasts can be obtained by the method of Least Squares. Additional constraints must be added to estimate non-estimable parameters.
More informationChapter 10: Analysis of variance (ANOVA)
Chapter 10: Analysis of variance (ANOVA) ANOVA (Analysis of variance) is a collection of techniques for dealing with more general experiments than the previous one-sample or two-sample tests. We first
More informationSTAT22200 Spring 2014 Chapter 5
STAT22200 Spring 2014 Chapter 5 Yibi Huang April 29, 2014 Chapter 5 Multiple Comparisons Chapter 5-1 Chapter 5 Multiple Comparisons Note the t-tests and C.I. s are constructed assuming we only do one test,
More informationCHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication
CHAPTER 4 Analysis of Variance One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication 1 Introduction In this chapter, expand the idea of hypothesis tests. We
More informationy = µj n + β 1 b β b b b + α 1 t α a t a + e
The contributions of distinct sets of explanatory variables to the model are typically captured by breaking up the overall regression (or model) sum of squares into distinct components This is useful quite
More informationChap The McGraw-Hill Companies, Inc. All rights reserved.
11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview
More informationDesign & Analysis of Experiments 7E 2009 Montgomery
1 What If There Are More Than Two Factor Levels? The t-test does not directly apply ppy There are lots of practical situations where there are either more than two levels of interest, or there are several
More informationDESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya
DESAIN EKSPERIMEN BLOCKING FACTORS Semester Genap Jurusan Teknik Industri Universitas Brawijaya Outline The Randomized Complete Block Design The Latin Square Design The Graeco-Latin Square Design Balanced
More informationSolutions to Final STAT 421, Fall 2008
Solutions to Final STAT 421, Fall 2008 Fritz Scholz 1. (8) Two treatments A and B were randomly assigned to 8 subjects (4 subjects to each treatment) with the following responses: 0, 1, 3, 6 and 5, 7,
More informationSTATS Analysis of variance: ANOVA
STATS 1060 Analysis of variance: ANOVA READINGS: Chapters 28 of your text book (DeVeaux, Vellman and Bock); on-line notes for ANOVA; on-line practice problems for ANOVA NOTICE: You should print a copy
More informationAnalysis of Variance and Design of Experiments-I
Analysis of Variance and Design of Experiments-I MODULE VIII LECTURE - 35 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS MODEL Dr. Shalabh Department of Mathematics and Statistics Indian
More informationSTAT22200 Chapter 14
STAT00 Chapter 4 Yibi Huang Chapter 4 Incomplete Block Designs 4. Balanced Incomplete Block Designs (BIBD) Chapter 4 - Incomplete Block Designs A Brief Introduction to a Class of Most Useful Designs in
More informationChapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs
Chapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs Adapted from Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Data Analysis
More informationSIZE = Vehicle size: 1 small, 2 medium, 3 large. SIDE : 1 right side of car, 2 left side of car
THREE-WAY ANOVA MODELS (CHAPTER 7) Consider a completely randomized design for an experiment with three treatment factors A, B and C. We will assume that every combination of levels of A, B and C is observed
More informationWeek 14 Comparing k(> 2) Populations
Week 14 Comparing k(> 2) Populations Week 14 Objectives Methods associated with testing for the equality of k(> 2) means or proportions are presented. Post-testing concepts and analysis are introduced.
More informationFactorial ANOVA. Psychology 3256
Factorial ANOVA Psychology 3256 Made up data.. Say you have collected data on the effects of Retention Interval 5 min 1 hr 24 hr on memory So, you do the ANOVA and conclude that RI affects memory % corr
More informationPLSC PRACTICE TEST ONE
PLSC 724 - PRACTICE TEST ONE 1. Discuss briefly the relationship between the shape of the normal curve and the variance. 2. What is the relationship between a statistic and a parameter? 3. How is the α
More informationStat 217 Final Exam. Name: May 1, 2002
Stat 217 Final Exam Name: May 1, 2002 Problem 1. Three brands of batteries are under study. It is suspected that the lives (in weeks) of the three brands are different. Five batteries of each brand are
More informationPart 1.) We know that the probability of any specific x only given p ij = p i p j is just multinomial(n, p) where p k1 k 2
Problem.) I will break this into two parts: () Proving w (m) = p( x (m) X i = x i, X j = x j, p ij = p i p j ). In other words, the probability of a specific table in T x given the row and column counts
More informationSample Size / Power Calculations
Sample Size / Power Calculations A Simple Example Goal: To study the effect of cold on blood pressure (mmhg) in rats Use a Completely Randomized Design (CRD): 12 rats are randomly assigned to one of two
More informationSTAT Final Practice Problems
STAT 48 -- Final Practice Problems.Out of 5 women who had uterine cancer, 0 claimed to have used estrogens. Out of 30 women without uterine cancer 5 claimed to have used estrogens. Exposure Outcome (Cancer)
More information3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is
Stat 501 Solutions and Comments on Exam 1 Spring 005-4 0-4 1. (a) (5 points) Y ~ N, -1-4 34 (b) (5 points) X (X,X ) = (5,8) ~ N ( 11.5, 0.9375 ) 3 1 (c) (10 points, for each part) (i), (ii), and (v) are
More informationThe Chi-Square Distributions
MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness
More informationUnit 12: Analysis of Single Factor Experiments
Unit 12: Analysis of Single Factor Experiments Statistics 571: Statistical Methods Ramón V. León 7/16/2004 Unit 12 - Stat 571 - Ramón V. León 1 Introduction Chapter 8: How to compare two treatments. Chapter
More informationThe Chi-Square Distributions
MATH 03 The Chi-Square Distributions Dr. Neal, Spring 009 The chi-square distributions can be used in statistics to analyze the standard deviation of a normally distributed measurement and to test the
More informationCHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)
FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter
More informationChapter 11 - Lecture 1 Single Factor ANOVA
April 5, 2013 Chapter 9 : hypothesis testing for one population mean. Chapter 10: hypothesis testing for two population means. What comes next? Chapter 9 : hypothesis testing for one population mean. Chapter
More informationMuch of the material we will be covering for a while has to do with designing an experimental study that concerns some phenomenon of interest.
Experimental Design: Much of the material we will be covering for a while has to do with designing an experimental study that concerns some phenomenon of interest We wish to use our subjects in the best
More information21.0 Two-Factor Designs
21.0 Two-Factor Designs Answer Questions 1 RCBD Concrete Example Two-Way ANOVA Popcorn Example 21.4 RCBD 2 The Randomized Complete Block Design is also known as the two-way ANOVA without interaction. A
More informationRandom and Mixed Effects Models - Part III
Random and Mixed Effects Models - Part III Statistics 149 Spring 2006 Copyright 2006 by Mark E. Irwin Quasi-F Tests When we get to more than two categorical factors, some times there are not nice F tests
More informationSTAT 525 Fall Final exam. Tuesday December 14, 2010
STAT 525 Fall 2010 Final exam Tuesday December 14, 2010 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More information20.0 Experimental Design
20.0 Experimental Design Answer Questions 1 Philosophy One-Way ANOVA Egg Sample Multiple Comparisons 20.1 Philosophy Experiments are often expensive and/or dangerous. One wants to use good techniques that
More informationTopic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model
Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More informationDifference in two or more average scores in different groups
ANOVAs Analysis of Variance (ANOVA) Difference in two or more average scores in different groups Each participant tested once Same outcome tested in each group Simplest is one-way ANOVA (one variable as
More informationNesting and Mixed Effects: Part I. Lukas Meier, Seminar für Statistik
Nesting and Mixed Effects: Part I Lukas Meier, Seminar für Statistik Where do we stand? So far: Fixed effects Random effects Both in the factorial context Now: Nested factor structure Mixed models: a combination
More informationTopic 9: Factorial treatment structures. Introduction. Terminology. Example of a 2x2 factorial
Topic 9: Factorial treatment structures Introduction A common objective in research is to investigate the effect of each of a number of variables, or factors, on some response variable. In earlier times,
More informationKeppel, G. & Wickens, T.D. Design and Analysis Chapter 2: Sources of Variability and Sums of Squares
Keppel, G. & Wickens, T.D. Design and Analysis Chapter 2: Sources of Variability and Sums of Squares K&W introduce the notion of a simple experiment with two conditions. Note that the raw data (p. 16)
More informationAnalysis Of Variance Compiled by T.O. Antwi-Asare, U.G
Analysis Of Variance Compiled by T.O. Antwi-Asare, U.G 1 ANOVA Analysis of variance compares two or more population means of interval data. Specifically, we are interested in determining whether differences
More informationUnit 27 One-Way Analysis of Variance
Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied
More informationStatistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts
Statistical methods for comparing multiple groups Lecture 7: ANOVA Sandy Eckel seckel@jhsph.edu 30 April 2008 Continuous data: comparing multiple means Analysis of variance Binary data: comparing multiple
More informationSTAT Chapter 10: Analysis of Variance
STAT 515 -- Chapter 10: Analysis of Variance Designed Experiment A study in which the researcher controls the levels of one or more variables to determine their effect on the variable of interest (called
More informationANOVA Analysis of Variance
ANOVA Analysis of Variance ANOVA Analysis of Variance Extends independent samples t test ANOVA Analysis of Variance Extends independent samples t test Compares the means of groups of independent observations
More informationData Analysis and Statistical Methods Statistics 651
Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Motivations for the ANOVA We defined the F-distribution, this is mainly used in
More informationANOVA (Analysis of Variance) output RLS 11/20/2016
ANOVA (Analysis of Variance) output RLS 11/20/2016 1. Analysis of Variance (ANOVA) The goal of ANOVA is to see if the variation in the data can explain enough to see if there are differences in the means.
More informationTWO OR MORE RANDOM EFFECTS. The two-way complete model for two random effects:
TWO OR MORE RANDOM EFFECTS Example: The factors that influence the breaking strength of a synthetic fiber are being studied. Four production machines and three operators are randomly selected. A two-way
More informationStatistics For Economics & Business
Statistics For Economics & Business Analysis of Variance In this chapter, you learn: Learning Objectives The basic concepts of experimental design How to use one-way analysis of variance to test for differences
More informationMEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS FALL 1999
MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS 350 - FALL 1999 Instructor: A. Oyet Date: December 16, 1999 Name(Surname First): Student Number INSTRUCTIONS
More informationOne-Way ANOVA Calculations: In-Class Exercise Psychology 311 Spring, 2013
One-Way ANOVA Calculations: In-Class Exercise Psychology 311 Spring, 2013 1. You are planning an experiment that will involve 4 equally sized groups, including 3 experimental groups and a control. Each
More informationOne-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.
One-Way Analysis of Variance With regression, we related two quantitative, typically continuous variables. Often we wish to relate a quantitative response variable with a qualitative (or simply discrete)
More informationOne-way ANOVA (Single-Factor CRD)
One-way ANOVA (Single-Factor CRD) STAT:5201 Week 3: Lecture 3 1 / 23 One-way ANOVA We have already described a completed randomized design (CRD) where treatments are randomly assigned to EUs. There is
More informationSTAT 506: Randomized complete block designs
STAT 506: Randomized complete block designs Timothy Hanson Department of Statistics, University of South Carolina STAT 506: Introduction to Experimental Design 1 / 10 Randomized complete block designs
More informationMa 3/103: Lecture 25 Linear Regression II: Hypothesis Testing and ANOVA
Ma 3/103: Lecture 25 Linear Regression II: Hypothesis Testing and ANOVA March 6, 2017 KC Border Linear Regression II March 6, 2017 1 / 44 1 OLS estimator 2 Restricted regression 3 Errors in variables 4
More informationMultiple Testing. Gary W. Oehlert. January 28, School of Statistics University of Minnesota
Multiple Testing Gary W. Oehlert School of Statistics University of Minnesota January 28, 2016 Background Suppose that you had a 20-sided die. Nineteen of the sides are labeled 0 and one of the sides is
More informationLecture 3: Inference in SLR
Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals
More informationNotes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1
Notes for Wee 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1 Exam 3 is on Friday May 1. A part of one of the exam problems is on Predictiontervals : When randomly sampling from a normal population
More informationMultiple comparisons The problem with the one-pair-at-a-time approach is its error rate.
Multiple comparisons The problem with the one-pair-at-a-time approach is its error rate. Each confidence interval has a 95% probability of making a correct statement, and hence a 5% probability of making
More informationAllow the investigation of the effects of a number of variables on some response
Lecture 12 Topic 9: Factorial treatment structures (Part I) Factorial experiments Allow the investigation of the effects of a number of variables on some response in a highly efficient manner, and in a
More informationOne-Way ANOVA Source Table J - 1 SS B / J - 1 MS B /MS W. Pairwise Post-Hoc Comparisons of Means
One-Way ANOVA Source Table ANOVA MODEL: ij = µ* + α j + ε ij H 0 : µ 1 = µ =... = µ j or H 0 : Σα j = 0 Source Sum of Squares df Mean Squares F Between Groups n j ( j - * ) J - 1 SS B / J - 1 MS B /MS
More informationLec 1: An Introduction to ANOVA
Ying Li Stockholm University October 31, 2011 Three end-aisle displays Which is the best? Design of the Experiment Identify the stores of the similar size and type. The displays are randomly assigned to
More informationTopic 7: Incomplete, double-blocked designs: Latin Squares [ST&D sections ]
Topic 7: Incomplete, double-blocked designs: Latin Squares [ST&D sections 9.10 9.15] 7.1. Introduction The Randomized Complete Block Design is commonly used to improve the ability of an experiment to detect
More informationAnalysis of Variance
Analysis of Variance Blood coagulation time T avg A 62 60 63 59 61 B 63 67 71 64 65 66 66 C 68 66 71 67 68 68 68 D 56 62 60 61 63 64 63 59 61 64 Blood coagulation time A B C D Combined 56 57 58 59 60 61
More informationOne-way ANOVA. Experimental Design. One-way ANOVA
Method to compare more than two samples simultaneously without inflating Type I Error rate (α) Simplicity Few assumptions Adequate for highly complex hypothesis testing 09/30/12 1 Outline of this class
More informationWeek 12 Hypothesis Testing, Part II Comparing Two Populations
Week 12 Hypothesis Testing, Part II Week 12 Hypothesis Testing, Part II Week 12 Objectives 1 The principle of Analysis of Variance is introduced and used to derive the F-test for testing the model utility
More informationYour schedule of coming weeks. One-way ANOVA, II. Review from last time. Review from last time /22/2004. Create ANOVA table
Your schedule of coming weeks One-way ANOVA, II 9.07 //00 Today: One-way ANOVA, part II Next week: Two-way ANOVA, parts I and II. One-way ANOVA HW due Thursday Week of May Teacher out of town all week
More informationEXST Regression Techniques Page 1. We can also test the hypothesis H :" œ 0 versus H :"
EXST704 - Regression Techniques Page 1 Using F tests instead of t-tests We can also test the hypothesis H :" œ 0 versus H :" Á 0 with an F test.! " " " F œ MSRegression MSError This test is mathematically
More informationIf we have many sets of populations, we may compare the means of populations in each set with one experiment.
Statistical Methods in Business Lecture 3. Factorial Design: If we have many sets of populations we may compare the means of populations in each set with one experiment. Assume we have two factors with
More informationLecture 7: Hypothesis Testing and ANOVA
Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis
More informationME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV
Theory of Engineering Experimentation Chapter IV. Decision Making for a Single Sample Chapter IV 1 4 1 Statistical Inference The field of statistical inference consists of those methods used to make decisions
More information2 Hand-out 2. Dr. M. P. M. M. M c Loughlin Revised 2018
Math 403 - P. & S. III - Dr. McLoughlin - 1 2018 2 Hand-out 2 Dr. M. P. M. M. M c Loughlin Revised 2018 3. Fundamentals 3.1. Preliminaries. Suppose we can produce a random sample of weights of 10 year-olds
More informationNote: The problem numbering below may not reflect actual numbering in DGE.
Stat664 Year 1999 DGE Note: The problem numbering below may not reflect actual numbering in DGE. 1. For a balanced one-way random effect model, (a) write down the model and assumptions; (b) write down
More informationStat 6640 Solution to Midterm #2
Stat 6640 Solution to Midterm #2 1. A study was conducted to examine how three statistical software packages used in a statistical course affect the statistical competence a student achieves. At the end
More informationIEOR165 Discussion Week 12
IEOR165 Discussion Week 12 Sheng Liu University of California, Berkeley Apr 15, 2016 Outline 1 Type I errors & Type II errors 2 Multiple Testing 3 ANOVA IEOR165 Discussion Sheng Liu 2 Type I errors & Type
More informationSection 10.1 (Part 2 of 2) Significance Tests: Power of a Test
1 Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test Learning Objectives After this section, you should be able to DESCRIBE the relationship between the significance level of a test, P(Type
More informationSTAT 536: Genetic Statistics
STAT 536: Genetic Statistics Tests for Hardy Weinberg Equilibrium Karin S. Dorman Department of Statistics Iowa State University September 7, 2006 Statistical Hypothesis Testing Identify a hypothesis,
More informationRelating Graph to Matlab
There are two related course documents on the web Probability and Statistics Review -should be read by people without statistics background and it is helpful as a review for those with prior statistics
More informationEcon 3790: Business and Economic Statistics. Instructor: Yogesh Uppal
Econ 3790: Business and Economic Statistics Instructor: Yogesh Uppal Email: yuppal@ysu.edu Chapter 13, Part A: Analysis of Variance and Experimental Design Introduction to Analysis of Variance Analysis
More information