Sample Size / Power Calculations
|
|
- Patrick Warner
- 6 years ago
- Views:
Transcription
1 Sample Size / Power Calculations
2 A Simple Example Goal: To study the effect of cold on blood pressure (mmhg) in rats Use a Completely Randomized Design (CRD): 12 rats are randomly assigned to one of two different temperature environments Trt 1: Normal environment (20 C) n=6 Trt 2: Cold environment (5 C) n=6 Expect higher mean BP when under Trt 2 Is n=6 enough to detect important difference? 2
3 Statistical Approach Two competing hypotheses: H o : µ 1 =µ 2 i.e. consider one-tailed test (for now) H 1 : µ 1 <µ 2 Will quantify evidence against null hypothesis in order to make conclusion Compare the P-value of some statistical test relative to a predetermined significance level (α) There are two possible incorrect conclusions based on the analysis of the data Commonly present conclusions/truths in terms of a 2x2 table 3
4 Type I and Type II errors True state H o : is true H 1 : is true What the data indicate: Fail to reject H o : (P>α) No error Type II error (Prob = β) Reject H o : (P α) Type I error (Prob is α) No error 4
5 Is n = 6 rats large enough? Rephrase: Do we have enough statistical power? To do calculations, need to know several unknowns Thus, power analysis involves educated guessing How large is the true mean difference (δ = µ 2 -µ 1 )? 1) What do you anticipate or want to make sure you can detect? 2) What would be economically/practically important? Suppose researchers believe that δ =20 mmhg is important How much variability (σ) exists between rats within a grp? Some prior information potentially available from previously published studies or small pilot study May also have to guess Suppose researchers believe that σ = 15 mmhg 5
6 One way to elicit/guess values for σ Use an empirical rule: Consider range of responses to be equal to 4σ Question to client: What would be the likely range (max-min) of responses for rats within the same trt? Suppose the answer was 60 mmhg R = 60 σ = 15 mmhg. Can often find similar published studies with estimates of σ. Always round up to be a little conservative. R 4σ 6
7 Quantifying evidence against H o Consider using a two-sample z test (σ known) Under H o : Under H 1 : 2 2σ y2 y1 ~ N 0, n 2 2σ y2 y1 ~ N δ, n Difference is in the means of the distributions. Assuming equal variances here but that is not necessary. Conduct one-tailed z-test for a certain α Reject H o : if z y 2 1 = > 2 2σ n y z α y y > z 2 1 α 2 2σ n 7
8 Distributions of y y 2 1 H o : H 1 : Power = 1-β ( ) 2 zα δ σ n = 1 Φ 2 / z α 2 2σ n α 1 β 0 δ 8
9 More reasonable statistical test t-test Rarely can you assume variance(s) to be known One-sided: Reject H o if 2 1 = > 2 2 s1 s2 Two-sided (H 1 : µ 1 µ 2 ): Reject H o : if t y n y n t α, df Must use non-central t distribution in power power calculations + t > t α /2,df Choice of df depends on assumptions regarding variances 9
10 More reasonable statistical test? Nonparametric test Often proposed simply because data not Normal or sample sizes are small In practice, t test approximately correct provided that y is approximately Normal (CLT) 2 y 1 When sample sizes small, t test more powerful if data are truly Normal Similar power calculations are available, usually based on large sample assumptions 10
11 Using SAS for power analysis proc power; twosamplemeans alpha=.05 nulldiff=0 sides=1 meandiff=20 npergroup=6 stddev=15 power=.; run; One-sided t-test or proc power; onewayanova alpha=.05 test=overall groupmeans=(0 20) npergroup=6 stddev=15 power=.; run; Similar to two-sided t-test 11
12 SAS Output Two-sample t Test for Mean Difference Fixed Scenario Elements Distribution Normal Method Exact Number of Sides 1 Null Difference 0 Alpha 0.05 Mean Difference 20 Standard Deviation 15 Sample Size Per Group 6 Computed Power Power
13 SAS Output Overall F Test for One-Way ANOVA Fixed Scenario Elements Method Exact Alpha 0.05 Group Means 0 20 Standard Deviation 15 Sample Size Per Group 6 Computed Power Power Typically want power to be larger than 80%... 13
14 Using SAS for sample size estimation proc power; twosamplemeans alpha=.05 nulldiff=0 sides=1 meandiff=20 npergroup=. stddev=15 power=.80; run; or proc power; onewayanova alpha=.05 test=overall groupmeans=(0 20) npergroup=. stddev=15 power=.80; run; 14
15 SAS Output Two-sample t Test for Mean Difference Fixed Scenario Elements Distribution Normal Method Exact Number of Sides 1 Null Difference 0 Alpha 0.05 Mean Difference 20 Standard Deviation 15 Nominal Power 0.8 Computed N Per Group Actual N Per Power Group
16 SAS Output Overall F Test for One-Way ANOVA Fixed Scenario Elements Method Exact Alpha 0.05 Group Means 0 20 Standard Deviation 15 Nominal Power 0.8 Computed N Per Group Actual N Per Power Group
17 Generating a Power Curve proc power; twosamplemeans alpha=.05 nulldiff=0 sides=1 meandiff=20 stddev=15 power=. npergroup=3 to 20 by 1; plot interpol=join yopts=(ref=0.80); run; 17
18 Power Curve for one-sided t test Power Sample Size Per Group 18
19 What if more than two treatments? In a study of vitamin D supplementation, subjects are assigned to each of 5 treatment groups and bone density changes over a twelve week period are to be recorded. Anticipate mean responses of 3.9, 4.1, 4.2, 4.3, and 4.5 mg/cm 2 for each of the five treatments Based on a previous study, they anticipate a within-treatment variance of about 0.30 They want to know if n=4 subjects per treatment would provide sufficient power for the ANOVA F- test. 19
20 Linear model written two ways 1) Y = µ + e ij i ij 2) Y = µ + α + e ij i ij Cell means model Factor level effects model i= 1,...,t=5; j = 1,2,,n=4 ( 2 e ~ NIID 0, σ ) ij e t µ i t i= 1 µ = α i = µ i µ αi t i= 1 = 0 i.e. Sum-to-zero constraint 20
21 One-way ANOVA table Source Df SS MS EMS Treatment t-1 SSA MSA σ 2 function + t i= 1 α 2 i Error t(n-1) SSE MSE σ 2 ANOVA F -test: 1) H o : µ 1 =µ 2 =µ 3 =µ 4 =µ 5 versus H 1 : at least one µ i µ i 2) H o : ALL α i = 0 versus. H 1 : at least one α i 0 Equivalent specs. Note: if H o : is true then both EMS = σ 2 such that F = MSA/MSE ~ F t-1, t(n-1) Central F-distribution 21
22 Power determination for F-test Under H 1 : µ µ µ 3 = 4.2 µ µ 5 4.5, or α1 0.3 α α 3 = 0.0 α α with µ = 4.2 such that F = MSA/MSE ~ F t-1,t(n-1),φ t n φ = α 2 σ i = 1 2 i is the non-centrality parameter Non central F-distribution (if φ 0) Corrected sum of squared means (CSSM) =(-0.3) 2 +(-0.1) 2 + +(0.0) 2 + (0.1) 2 +(+0.3) 2 =0.20 for example 22
23 SAS Code proc power; onewayanova alpha=.05 test=overall groupmeans=( ) npergroup=4 stddev= power=.; run; This is the square root of
24 SAS Output Overall F Test for One-Way ANOVA Fixed Scenario Elements Method Exact Alpha 0.05 Group Means Standard Deviation Sample Size Per Group 4 Computed Power Power Very poorly underpowered.as designed, this would be a waste of time and money to run!! 24
25 SAS Code proc power; onewayanova alpha=.05 test=overall groupmeans=( ) npergroup=4 to 30 stddev= power=.; plot interpol=join yopts=(ref=.80); run; Let s look at a power curve to get an idea of the necessary sample size 25
26 Power Looks like we need about 19 animals per group (almost 5 times the number before) Sample Size Per Group 26
27 What if treatment means unknown? Use the worst case scenario Conservative assessment of power Just have to know the difference between the largest and smallest means or the difference you think is meaningful (say = 0.6). Assume all the other means clump in the middle. Minimizes 2 αi 27
28 SAS Code proc power; onewayanova alpha=.05 test=overall groupmeans=( ) npergroup=4 to 30 stddev= power=.; plot interpol=join yopts=(ref=.80); run; 28
29 Power Power Looks like we need about 21 animals per group in the worst case Sample Size 15 Per Group Sample Size Per Group 29
30 There is actually a trick to computing φ using ANOVA software like PROC GLM/MIXED (O Brien and Lohr, 1984) 1) Substitute true means for data in ANOVA. 2) Use the ANOVA table to compute the noncentrality parameter 3) Then use that computed value in power calculations! 30
31 Using true means for data data oneway; input treatment mean; datalines; ; Suppose you are interested in 3 treatments. Anticipate true mean responses of 4.0, 4.3 and 4.6 Anticipate residual variance of 0.30 Wish to compute power based on sample size of n= 4 for each treatment. proc mixed data=oneway noprofile; class treatment; model mean = treatment; parms (0.30) /noiter; ods output tests3 = tests3; run; Output the ANOVA table to a file called tests3 31
32 Trick to compute φ Compute the ANOVA treatment F ratio " F Treatment " Obs Effect NumDF DenDF FValue ProbF 1 treatment Multiple F Treatment by numerator degrees of freedom (NumDF) to get φ: φ = " F " df = 1.2* 2 = 2. 4 Treatment * Treatment F Treatment is a function of corrected sum of squared means 32
33 Use φ to computer power data power; set tests3; noncent = Fvalue*numdf; alpha = 0.05; criticalvalue = Finv(1-alpha,numdf,dendf,0); Power = 1-Probf(criticalvalue,numdf,dendf,noncent); run; proc print data=power; run; Effect Num DF Den DF The critical value separating the acceptance region from the rejection region Probability of falling in rejection region if H1 is true. FValue ProbF noncent alpha Critical value Power treatment
34 PROC GLMPOWER does this data example1; input FactorA $ mean; datalines; run; proc glmpower data=example1 ; class FactorA ; model mean = FactorA ; power stddev =.548 ntotal = 12 power =. alpha=0.05; run; Much simpler data step Total number of experimental units The GLMPOWER Procedure Fixed Scenario Elements Dependent Variable mean Source FactorA Alpha 0.05 Error Standard Deviation Total Sample Size 12 Test Degrees of Freedom 2 Error Degrees of Freedom 9 Computed Power Power
35 Mixed Model Power Calculations GLMPOWER is technically for fixed-effects models only Can use it for mixed models when comparisons only involves the error variance Can extend the PROC MIXED trick to handle all mixed models 35
36 RCBD mixed model Y = µ + ρ + α + ij i j ij ρ i = 1...,b, ( 2 0, σ ) ( ) ~ NIID ρ j = 1,...,a. i across all i e 2 e where ij ~ NIID 0, σ across all i and j. µ is the overall mean α j is the fixed effect of the jth diet ρ i is the random effect of the ith litter e ij is the residual error term specific to the ijth experimental unit. 36
37 Example using MIXED trick Suppose we d like to address power for a new RCBD study with 5 treatments and anticipate µ. j = µ + α j µ.1 µ.2 µ.3 µ.4 µ.5 = 70 = 72 = 74 = 76 = 78 Block (e.g. litter) variance: Residual variance: σ ρ σ 2 = 50 2 = 20 How much power would 10 blocks provide for the ANOVA F-test on treatments? 37
38 Entering the data data rcbd; input diet mean; datalines; ; data rcbd; set rcbd; do litter=1 to 10; output; end; run; 38
39 SAS code proc mixed data=rcbd noprofile; class litter diet; model mean = diet ; random litter ; parms (20) (50) /noiter; ods output tests3 = ANOVAtest; run; σ 2 ρ = 20 σ 2 = 50 39
40 Remaining code data power; set ANOVAtest; alpha = 0.05; noncent = numdf*fvalue; crit = Finv(1-alpha,numdf,dendf,0); Power = 1-Probf(crit,numdf,dendf,noncent); run; proc print data=power; var alpha noncent crit numdf dendf power; run; 40
41 Num Den Obs alpha noncent crit DF DF Power Density Central F (under H o ) Noncentral F (under H 1 ) Type I error rate α = Prob(F>F α H 0 ) = F-ratio 41
42 What if you consider GLMPOWER proc glmpower data=rcbd; class Diet Litter; model mean = Diet Litter; power stddev = ntotal = 50 power =. alpha=0.05; run; Litter no longer considered random 42
43 PROC GLMPOWER OUTPUT Fixed Scenario Elements Dependent Variable mean Alpha 0.05 Error Standard Deviation Total Sample Size 50 Error Degrees of Freedom 36 Computed Power Test Index Source DF Power 1 diet litter
44 Summary In the case of an RCBD, results can be obtained via PROC GLMPOWER because litter variance does not enter standard error Cannot use GLMPOWER for incomplete block designs or repeated measures designs 44
45 Latin Square Design Consider a 3 period crossover design Measurements taken at 4 time points Period 1 Period 2 Period 3 Y 1 Y 2 Y 3 Y 4 Plan to compare Y 2 -Y 1, Y 3 -Y 2, and Y 4 -Y 3 Differences remove subject variability but... Var(Y 2 -Y 1 )=2σ 2 Cov(Y k -Y k-1, Y k+1 -Y k )=-σ 2 See Kononoff, P.J. and K.J. Hanford, 2006 for other latin square examples 45
46 SAS Code data means; input order period trt cards; 1 1 A 1 2 B 1 3 C 2 1 B 2 2 C 2 3 A 3 1 C 3 2 A 3 3 B 4 1 B 4 2 A 4 3 C 5 1 A 5 2 C 5 3 B 6 1 C 6 2 B 6 3 A ; data meanset; set means; do subject=1 to 6; if(trt= A ) then mnresp=5.1; if(trt= B ) then mnresp=5.2; if(trt= C ) then mnresp=0.0; output; end; Can alter number of replications and the important treatment difference 46
47 SAS Code Proc Mixed noprofile data=meanset; class subject order trt period; model mnresp = trt period; repeated / subject=subject(order) type=toep(2) ; parms ( ) / NoIter; estimate 'T1 VS T3 trt 1 0-1; contrast 'T1 VS T3' trt 1 0-1; contrast 'T2 VS T3' trt 0 1-1; ODS Output Tests3=anova; ODS Output Contrasts=contr; run; 47
48 SAS Output Data Power; Set anova contr; if Effect ne ' ' then Source=Effect; if Label ne ' ' then Source=Label; ALPHA = 0.05; * The desired Type I error rate ; FCRIT = FINV(1- ALPHA,NumDF, DenDF, 0); * The 0 represents non-centrality parameter ; NONCENT = NumDF*Fvalue; * The non-centrality parameter ; POWER = 1 - PROBF(FCRIT, NumDF, DenDF, NONCENT); if NONCENT > 3000 then POWER = 1.0; * When NC large PROBF does not return value; PROC PRINT; Var Source numdf dendf NonCent Power; run; 48
49 Output Num Den Obs Source DF DF NONCENT POWER 1 trt period T1 VS T T2 VS T
50 Layout of Split Plot Design Two factor treatment structure Consider a completely randomized design in whole plot Whole plot EUs serve as blocks for subplot Treatment A1 Treatment A2 B1 B2 B4 B3 WP EU 1 B2 B4 WP EU 2 B3 B1 WP EU 3 B1 B3 WP EU 4 B4 B2 B3 B4 B2 B1 50
51 Linear Model Y = µ + A + Rep ) + B + ( AB) + ijk i j ( i k ik e ijk Two-factor treatment structure Several comparisons of interest 1. Compare main effects of A 2. Compare main effects of B 3. Compare levels of A for fixed level of B 4. Compare levels of B for fixed level of A 51
52 Approach #1 For each type of comparison, compute SE 1. Comparing main effects of A Var( y i.. y ) = Var(Rep Rep + e e i.. = 2 ( 2 2 σ / n +σ / bn) 2. Comparing levels of A at fixed level of B R.( i).( i ) i.. i.. ) Var( y i. k y ) = Var(Rep Rep + e e i. k = 2 ( 2 2 σ / n +σ / n) R.( i).( i ) i. k i. k ) 52
53 Approach #2 Using Proc Mixed Specify means (A has 4 levels, B has 3 levels) Specify the Rep and Error variances data ideal; input A B cards; ; data idealrep; set ideal; do rep=1 to 6; output; end; run; Can alter the number of EUs per level of A 53
54 Interaction Plot 54
55 SAS code proc mixed data=idealrep noprofile; class A Rep B; model mu = A B A*B; random rep(a); parms (2) (1) / noiter; 2 σ R = 2, σ = 1 contrast 'A1 vs A2' A A*B / df=20; contrast 'A11 vs A21' A A*B ; estimate 'A1 vs A2' A A*B / df=20; estimate 'A11 vs A21' A A*B ; ods output tests3 = anova contrasts = contr; run; 2 Documentation states the df for a contrast are associated with the last term specified. Since this is comparison of the WP factor, I use the DF associated with rep(a) 55
56 Selected Output Covariance Parameter Estimates Cov Parm Subject Estimate Intercept rep(a) Residual SEs agree with previous formulas Estimates Standard Label Estimate Error DF t Value Pr > t A1 vs A A11 vs A Num Den Obs Source DF DF NONCENT POWER 1 A B A*B A1 vs A A11 vs A
57 Comments Note that ddfm=kr has been recommended for use but not included here. That is because you cannot use ddfm= in this sample size estimation approach. ddfm= will have an effect on df and SE for comparison #3 (page 51). Should not affect others My suggestion would be to run this analysis to get a sample size and then use simulation (say 1000 data sets using the kr option) to confirm the sample size specified if you plan to power up for this comparison. 57
58 SAS code for simulation data simulate; do sim=1 to 10; do A=1 to 4; do rep=1 to 6; rep_eff = normal(612)*sqrt(2); do B=1 to 3; res_eff = normal(612)*sqrt(1); resp = rep_eff + res_eff; if A=1 & B=1 then resp = 5 + rep_eff + res_eff; if A=1 & B=2 then resp = 6 + rep_eff + res_eff; if A=1 & B=3 then resp = 7 + rep_eff + res_eff; if A=2 & B=1 then resp = 7 + rep_eff + res_eff; if A=4 & B=1 then resp = rep_eff + res_eff; if A=4 & B=2 then resp = rep_eff + res_eff; output; end; end; end; end; 58
Lecture 5: Determining Sample Size
Lecture 5: Determining Sample Size Montgomery: Section 3.7 and 13.4 1 Lecture 5 Page 1 Choice of Sample Size: Fixed Effects Can determine the sample size for OverallF test Contrasts of interest For simplicity,
More information4.8 Alternate Analysis as a Oneway ANOVA
4.8 Alternate Analysis as a Oneway ANOVA Suppose we have data from a two-factor factorial design. The following method can be used to perform a multiple comparison test to compare treatment means as well
More informationPLS205 Lab 2 January 15, Laboratory Topic 3
PLS205 Lab 2 January 15, 2015 Laboratory Topic 3 General format of ANOVA in SAS Testing the assumption of homogeneity of variances by "/hovtest" by ANOVA of squared residuals Proc Power for ANOVA One-way
More informationPaper Equivalence Tests. Fei Wang and John Amrhein, McDougall Scientific Ltd.
Paper 11683-2016 Equivalence Tests Fei Wang and John Amrhein, McDougall Scientific Ltd. ABSTRACT Motivated by the frequent need for equivalence tests in clinical trials, this paper provides insights into
More informationPower Analysis for One-Way ANOVA
Chapter 12 Power Analysis for One-Way ANOVA Recall that the power of a statistical test is the probability of rejecting H 0 when H 0 is false, and some alternative hypothesis H 1 is true. We saw earlier
More informationTopic 12. The Split-plot Design and its Relatives (continued) Repeated Measures
12.1 Topic 12. The Split-plot Design and its Relatives (continued) Repeated Measures 12.9 Repeated measures analysis Sometimes researchers make multiple measurements on the same experimental unit. We have
More informationPower & Sample Size Calculation
Chapter 7 Power & Sample Size Calculation Yibi Huang Chapter 7 Section 10.3 Power & Sample Size Calculation for CRDs Power & Sample Size for Factorial Designs Chapter 7-1 Power & Sample Size Calculation
More informationSplit-plot Designs. Bruce A Craig. Department of Statistics Purdue University. STAT 514 Topic 21 1
Split-plot Designs Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 21 1 Randomization Defines the Design Want to study the effect of oven temp (3 levels) and amount of baking soda
More informationPLS205 KEY Winter Homework Topic 3. The following represents one way to program SAS for this question:
PL05 KEY Winter 05 Homework Topic 3 Answers to Question [30 points] The following represents one way to program A for this question: Data Weeds; Input Cover $ Biomass; Cards; 66 634 63 63 633 645 69 63
More informationCOMPLETELY RANDOM DESIGN (CRD) -Design can be used when experimental units are essentially homogeneous.
COMPLETELY RANDOM DESIGN (CRD) Description of the Design -Simplest design to use. -Design can be used when experimental units are essentially homogeneous. -Because of the homogeneity requirement, it may
More informationEXST Regression Techniques Page 1. We can also test the hypothesis H :" œ 0 versus H :"
EXST704 - Regression Techniques Page 1 Using F tests instead of t-tests We can also test the hypothesis H :" œ 0 versus H :" Á 0 with an F test.! " " " F œ MSRegression MSError This test is mathematically
More informationTopic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model
Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is
More informationCMU MSP 36726: Power
CMU MSP 36726: Power H. Seltman 2/21/2018 I. Consider three experiments: 1) Recruitment, randomization, priming with your group does well/poorly in math, math testing 2) Recruitment, each subject gets
More informationSTAT 115:Experimental Designs
STAT 115:Experimental Designs Josefina V. Almeda 2013 Multisample inference: Analysis of Variance 1 Learning Objectives 1. Describe Analysis of Variance (ANOVA) 2. Explain the Rationale of ANOVA 3. Compare
More informationOne-Way Analysis of Variance (ANOVA)
1 One-Way Analysis of Variance (ANOVA) One-Way Analysis of Variance (ANOVA) is a method for comparing the means of a populations. This kind of problem arises in two different settings 1. When a independent
More informationLecture 3: Inference in SLR
Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals
More informationE509A: Principle of Biostatistics. GY Zou
E509A: Principle of Biostatistics (Week 4: Inference for a single mean ) GY Zou gzou@srobarts.ca Example 5.4. (p. 183). A random sample of n =16, Mean I.Q is 106 with standard deviation S =12.4. What
More informationIX. Complete Block Designs (CBD s)
IX. Complete Block Designs (CBD s) A.Background Noise Factors nuisance factors whose values can be controlled within the context of the experiment but not outside the context of the experiment Covariates
More informationOne-way ANOVA (Single-Factor CRD)
One-way ANOVA (Single-Factor CRD) STAT:5201 Week 3: Lecture 3 1 / 23 One-way ANOVA We have already described a completed randomized design (CRD) where treatments are randomly assigned to EUs. There is
More informationLecture 11: Nested and Split-Plot Designs
Lecture 11: Nested and Split-Plot Designs Montgomery, Chapter 14 1 Lecture 11 Page 1 Crossed vs Nested Factors Factors A (a levels)and B (b levels) are considered crossed if Every combinations of A and
More informationSTAT 5200 Handout #23. Repeated Measures Example (Ch. 16)
Motivating Example: Glucose STAT 500 Handout #3 Repeated Measures Example (Ch. 16) An experiment is conducted to evaluate the effects of three diets on the serum glucose levels of human subjects. Twelve
More informationdf=degrees of freedom = n - 1
One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:
More information1 Use of indicator random variables. (Chapter 8)
1 Use of indicator random variables. (Chapter 8) let I(A) = 1 if the event A occurs, and I(A) = 0 otherwise. I(A) is referred to as the indicator of the event A. The notation I A is often used. 1 2 Fitting
More informationCorrelation and the Analysis of Variance Approach to Simple Linear Regression
Correlation and the Analysis of Variance Approach to Simple Linear Regression Biometry 755 Spring 2009 Correlation and the Analysis of Variance Approach to Simple Linear Regression p. 1/35 Correlation
More informationLecture 7: Latin Square and Related Design
Lecture 7: Latin Square and Related Design Montgomery: Section 4.2-4.3 Page 1 Automobile Emission Experiment Four cars and four drivers are employed in a study for possible differences between four gasoline
More informationUnbalanced Data in Factorials Types I, II, III SS Part 1
Unbalanced Data in Factorials Types I, II, III SS Part 1 Chapter 10 in Oehlert STAT:5201 Week 9 - Lecture 2 1 / 14 When we perform an ANOVA, we try to quantify the amount of variability in the data accounted
More informationThe legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.
1 Chapter 1: Research Design Principles The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization. 2 Chapter 2: Completely Randomized Design
More informationMixed Model: Split plot with two whole-plot factors, one split-plot factor, and CRD at the whole-plot level (e.g. fancier split-plot p.
STAT:5201 Applied Statistic II Mixed Model: Split plot with two whole-plot factors, one split-plot factor, and CRD at the whole-plot level (e.g. fancier split-plot p.422 OLRT) Hamster example with three
More informationOutline Topic 21 - Two Factor ANOVA
Outline Topic 21 - Two Factor ANOVA Data Model Parameter Estimates - Fall 2013 Equal Sample Size One replicate per cell Unequal Sample size Topic 21 2 Overview Now have two factors (A and B) Suppose each
More informationLecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2
Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y
More informationAnswer to exercise: Blood pressure lowering drugs
Answer to exercise: Blood pressure lowering drugs The data set bloodpressure.txt contains data from a cross-over trial, involving three different formulations of a drug for lowering of blood pressure:
More informationChapter 11. Analysis of Variance (One-Way)
Chapter 11 Analysis of Variance (One-Way) We now develop a statistical procedure for comparing the means of two or more groups, known as analysis of variance or ANOVA. These groups might be the result
More informationChap The McGraw-Hill Companies, Inc. All rights reserved.
11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview
More informationOutline. Topic 19 - Inference. The Cell Means Model. Estimates. Inference for Means Differences in cell means Contrasts. STAT Fall 2013
Topic 19 - Inference - Fall 2013 Outline Inference for Means Differences in cell means Contrasts Multiplicity Topic 19 2 The Cell Means Model Expressed numerically Y ij = µ i + ε ij where µ i is the theoretical
More informationTable 1: Fish Biomass data set on 26 streams
Math 221: Multiple Regression S. K. Hyde Chapter 27 (Moore, 5th Ed.) The following data set contains observations on the fish biomass of 26 streams. The potential regressors from which we wish to explain
More informationOdor attraction CRD Page 1
Odor attraction CRD Page 1 dm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" ---- + ---+= -/\*"; ODS LISTING; *** Table 23.2 ********************************************;
More informationLecture 4. Random Effects in Completely Randomized Design
Lecture 4. Random Effects in Completely Randomized Design Montgomery: 3.9, 13.1 and 13.7 1 Lecture 4 Page 1 Random Effects vs Fixed Effects Consider factor with numerous possible levels Want to draw inference
More information13. The Cochran-Satterthwaite Approximation for Linear Combinations of Mean Squares
13. The Cochran-Satterthwaite Approximation for Linear Combinations of Mean Squares opyright c 2018 Dan Nettleton (Iowa State University) 13. Statistics 510 1 / 18 Suppose M 1,..., M k are independent
More informationLecture 10: Experiments with Random Effects
Lecture 10: Experiments with Random Effects Montgomery, Chapter 13 1 Lecture 10 Page 1 Example 1 A textile company weaves a fabric on a large number of looms. It would like the looms to be homogeneous
More informationTopic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8]
Topic 6. Two-way designs: Randomized Complete Block Design [ST&D Chapter 9 sections 9.1 to 9.7 (except 9.6) and section 15.8] The completely randomized design Treatments are randomly assigned to e.u. such
More informationSAS Commands. General Plan. Output. Construct scatterplot / interaction plot. Run full model
Topic 23 - Unequal Replication Data Model Outline - Fall 2013 Parameter Estimates Inference Topic 23 2 Example Page 954 Data for Two Factor ANOVA Y is the response variable Factor A has levels i = 1, 2,...,
More informationTopic 25 - One-Way Random Effects Models. Outline. Random Effects vs Fixed Effects. Data for One-way Random Effects Model. One-way Random effects
Topic 5 - One-Way Random Effects Models One-way Random effects Outline Model Variance component estimation - Fall 013 Confidence intervals Topic 5 Random Effects vs Fixed Effects Consider factor with numerous
More informationTwo-factor studies. STAT 525 Chapter 19 and 20. Professor Olga Vitek
Two-factor studies STAT 525 Chapter 19 and 20 Professor Olga Vitek December 2, 2010 19 Overview Now have two factors (A and B) Suppose each factor has two levels Could analyze as one factor with 4 levels
More informationExample: Four levels of herbicide strength in an experiment on dry weight of treated plants.
The idea of ANOVA Reminders: A factor is a variable that can take one of several levels used to differentiate one group from another. An experiment has a one-way, or completely randomized, design if several
More informationUNIVERSITY EXAMINATIONS NJORO CAMPUS SECOND SEMESTER 2011/2012
UNIVERSITY EXAMINATIONS NJORO CAMPUS SECOND SEMESTER 2011/2012 THIRD YEAR EXAMINATION FOR THE AWARD BACHELOR OF SCIENCE IN AGRICULTURE AND BACHELOR OF SCIENCE IN FOOD TECHNOLOGY AGRO 391 AGRICULTURAL EXPERIMENTATION
More information" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2
Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the
More informationAnalysis of Variance. Read Chapter 14 and Sections to review one-way ANOVA.
Analysis of Variance Read Chapter 14 and Sections 15.1-15.2 to review one-way ANOVA. Design of an experiment the process of planning an experiment to insure that an appropriate analysis is possible. Some
More informationOne-Way ANOVA. Some examples of when ANOVA would be appropriate include:
One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement
More informationIntroduction to Power and Sample Size Analysis
Four chapters from SAS/STAT User's Guide 13.2: (18) Introduction to Power and Sample Size Analysis (47) The GLMPOWER Procedure (77) The POWER Procedure (78) The Power and Sample Size Application Chapter
More informationSample Size/Power Calculation by Software/Online Calculators
Sample Size/Power Calculation by Software/Online Calculators May 24, 2018 Li Zhang, Ph.D. li.zhang@ucsf.edu Associate Professor Department of Epidemiology and Biostatistics Division of Hematology and Oncology
More informationIn a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:
Activity #10: AxS ANOVA (Repeated subjects design) Resources: optimism.sav So far in MATH 300 and 301, we have studied the following hypothesis testing procedures: 1) Binomial test, sign-test, Fisher s
More informationLecture 1 Linear Regression with One Predictor Variable.p2
Lecture Linear Regression with One Predictor Variablep - Basics - Meaning of regression parameters p - β - the slope of the regression line -it indicates the change in mean of the probability distn of
More informationInferential Statistical Analysis of Microarray Experiments 2007 Arizona Microarray Workshop
Inferential Statistical Analysis of Microarray Experiments 007 Arizona Microarray Workshop μ!! Robert J Tempelman Department of Animal Science tempelma@msuedu HYPOTHESIS TESTING (as if there was only one
More informationChapter 4: Randomized Blocks and Latin Squares
Chapter 4: Randomized Blocks and Latin Squares 1 Design of Engineering Experiments The Blocking Principle Blocking and nuisance factors The randomized complete block design or the RCBD Extension of the
More informationStatistics 512: Applied Linear Models. Topic 9
Topic Overview Statistics 51: Applied Linear Models Topic 9 This topic will cover Random vs. Fixed Effects Using E(MS) to obtain appropriate tests in a Random or Mixed Effects Model. Chapter 5: One-way
More informationKey Features: More than one type of experimental unit and more than one randomization.
1 SPLIT PLOT DESIGNS Key Features: More than one type of experimental unit and more than one randomization. Typical Use: When one factor is difficult to change. Example (and terminology): An agricultural
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More informationCHAPTER 4 Analysis of Variance. One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication
CHAPTER 4 Analysis of Variance One-way ANOVA Two-way ANOVA i) Two way ANOVA without replication ii) Two way ANOVA with replication 1 Introduction In this chapter, expand the idea of hypothesis tests. We
More informationAllow the investigation of the effects of a number of variables on some response
Lecture 12 Topic 9: Factorial treatment structures (Part I) Factorial experiments Allow the investigation of the effects of a number of variables on some response in a highly efficient manner, and in a
More informationFactorial designs. Experiments
Chapter 5: Factorial designs Petter Mostad mostad@chalmers.se Experiments Actively making changes and observing the result, to find causal relationships. Many types of experimental plans Measuring response
More informationdm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" = -/\<>*"; ODS LISTING;
dm'log;clear;output;clear'; options ps=512 ls=99 nocenter nodate nonumber nolabel FORMCHAR=" ---- + ---+= -/\*"; ODS LISTING; *** Table 23.2 ********************************************; *** Moore, David
More informationSingle Factor Experiments
Single Factor Experiments Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 4 1 Analysis of Variance Suppose you are interested in comparing either a different treatments a levels
More informationDESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya
DESAIN EKSPERIMEN BLOCKING FACTORS Semester Genap Jurusan Teknik Industri Universitas Brawijaya Outline The Randomized Complete Block Design The Latin Square Design The Graeco-Latin Square Design Balanced
More informationThis is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.
EXST3201 Chapter 13c Geaghan Fall 2005: Page 1 Linear Models Y ij = µ + βi + τ j + βτij + εijk This is a Randomized Block Design (RBD) with a single factor treatment arrangement (2 levels) which are fixed.
More informationChapter 12. Analysis of variance
Serik Sagitov, Chalmers and GU, January 9, 016 Chapter 1. Analysis of variance Chapter 11: I = samples independent samples paired samples Chapter 1: I 3 samples of equal size J one-way layout two-way layout
More informationAnalysis of Variance and Design of Experiments-I
Analysis of Variance and Design of Experiments-I MODULE VIII LECTURE - 35 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS MODEL Dr. Shalabh Department of Mathematics and Statistics Indian
More informationTopic 9: Factorial treatment structures. Introduction. Terminology. Example of a 2x2 factorial
Topic 9: Factorial treatment structures Introduction A common objective in research is to investigate the effect of each of a number of variables, or factors, on some response variable. In earlier times,
More informationTwo-Way Analysis of Variance - no interaction
1 Two-Way Analysis of Variance - no interaction Example: Tests were conducted to assess the effects of two factors, engine type, and propellant type, on propellant burn rate in fired missiles. Three engine
More informationAnalysis of Variance
Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also
More informationDESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya
DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Jurusan Teknik Industri Universitas Brawijaya Outline Introduction The Analysis of Variance Models for the Data Post-ANOVA Comparison of Means Sample
More informationStatistics For Economics & Business
Statistics For Economics & Business Analysis of Variance In this chapter, you learn: Learning Objectives The basic concepts of experimental design How to use one-way analysis of variance to test for differences
More informationTopic 32: Two-Way Mixed Effects Model
Topic 3: Two-Way Mixed Effects Model Outline Two-way mixed models Three-way mixed models Data for two-way design Y is the response variable Factor A with levels i = 1 to a Factor B with levels j = 1 to
More informationSubject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study
Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study 1.4 0.0-6 7 8 9 10 11 12 13 14 15 16 17 18 19 age Model 1: A simple broken stick model with knot at 14 fit with
More informationANOVA CIVL 7012/8012
ANOVA CIVL 7012/8012 ANOVA ANOVA = Analysis of Variance A statistical method used to compare means among various datasets (2 or more samples) Can provide summary of any regression analysis in a table called
More informationGregory Carey, 1998 Regression & Path Analysis - 1 MULTIPLE REGRESSION AND PATH ANALYSIS
Gregory Carey, 1998 Regression & Path Analysis - 1 MULTIPLE REGRESSION AND PATH ANALYSIS Introduction Path analysis and multiple regression go hand in hand (almost). Also, it is easier to learn about multivariate
More informationApplication of Ghosh, Grizzle and Sen s Nonparametric Methods in. Longitudinal Studies Using SAS PROC GLM
Application of Ghosh, Grizzle and Sen s Nonparametric Methods in Longitudinal Studies Using SAS PROC GLM Chan Zeng and Gary O. Zerbe Department of Preventive Medicine and Biometrics University of Colorado
More informationANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS
ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing
More informationMixed Model: Split plot with two whole-plot factors, one split-plot factor, and CRD at the whole-plot level (e.g. fancier split-plot p.
22s:173 Combining multiple factors into a single superfactor Mixed Model: Split plot with two whole-plot factors, one split-plot factor, and CRD at the whole-plot level (e.g. fancier split-plot p.422 Oehlert)
More informationN J SS W /df W N - 1
One-Way ANOVA Source Table ANOVA MODEL: ij = µ* + α j + ε ij H 0 : µ = µ =... = µ j or H 0 : Σα j = 0 Source Sum of Squares df Mean Squares F J Between Groups nj( j * ) J - SS B /(J ) MS B /MS W = ( N
More informationRepeated Measures Data
Repeated Measures Data Mixed Models Lecture Notes By Dr. Hanford page 1 Data where subjects are measured repeatedly over time - predetermined intervals (weekly) - uncontrolled variable intervals between
More informationAN IMPROVEMENT TO THE ALIGNED RANK STATISTIC
Journal of Applied Statistical Science ISSN 1067-5817 Volume 14, Number 3/4, pp. 225-235 2005 Nova Science Publishers, Inc. AN IMPROVEMENT TO THE ALIGNED RANK STATISTIC FOR TWO-FACTOR ANALYSIS OF VARIANCE
More informationRCB - Example. STA305 week 10 1
RCB - Example An accounting firm wants to select training program for its auditors who conduct statistical sampling as part of their job. Three training methods are under consideration: home study, presentations
More informationUnit 7: Random Effects, Subsampling, Nested and Crossed Factor Designs
Unit 7: Random Effects, Subsampling, Nested and Crossed Factor Designs STA 643: Advanced Experimental Design Derek S. Young 1 Learning Objectives Understand how to interpret a random effect Know the different
More information6 Single Sample Methods for a Location Parameter
6 Single Sample Methods for a Location Parameter If there are serious departures from parametric test assumptions (e.g., normality or symmetry), nonparametric tests on a measure of central tendency (usually
More informationIncomplete Block Designs
Incomplete Block Designs Recall: in randomized complete block design, each of a treatments was used once within each of b blocks. In some situations, it will not be possible to use each of a treatments
More informationTopic 20: Single Factor Analysis of Variance
Topic 20: Single Factor Analysis of Variance Outline Single factor Analysis of Variance One set of treatments Cell means model Factor effects model Link to linear regression using indicator explanatory
More informationMEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS FALL 1999
MEMORIAL UNIVERSITY OF NEWFOUNDLAND DEPARTMENT OF MATHEMATICS AND STATISTICS FINAL EXAM - STATISTICS 350 - FALL 1999 Instructor: A. Oyet Date: December 16, 1999 Name(Surname First): Student Number INSTRUCTIONS
More informationSTAT 401A - Statistical Methods for Research Workers
STAT 401A - Statistical Methods for Research Workers One-way ANOVA Jarad Niemi (Dr. J) Iowa State University last updated: October 10, 2014 Jarad Niemi (Iowa State) One-way ANOVA October 10, 2014 1 / 39
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More information16.3 One-Way ANOVA: The Procedure
16.3 One-Way ANOVA: The Procedure Tom Lewis Fall Term 2009 Tom Lewis () 16.3 One-Way ANOVA: The Procedure Fall Term 2009 1 / 10 Outline 1 The background 2 Computing formulas 3 The ANOVA Identity 4 Tom
More informationAnalysis Of Variance Compiled by T.O. Antwi-Asare, U.G
Analysis Of Variance Compiled by T.O. Antwi-Asare, U.G 1 ANOVA Analysis of variance compares two or more population means of interval data. Specifically, we are interested in determining whether differences
More informationBIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES
BIOL 458 - Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES PART 1: INTRODUCTION TO ANOVA Purpose of ANOVA Analysis of Variance (ANOVA) is an extremely useful statistical method
More informationIntroduction to Crossover Trials
Introduction to Crossover Trials Stat 6500 Tutorial Project Isaac Blackhurst A crossover trial is a type of randomized control trial. It has advantages over other designed experiments because, under certain
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationPower and Sample Size Bios 662
Power and Sample Size Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-10-31 14:06 BIOS 662 1 Power and Sample Size Outline Introduction One sample: continuous
More information21.0 Two-Factor Designs
21.0 Two-Factor Designs Answer Questions 1 RCBD Concrete Example Two-Way ANOVA Popcorn Example 21.4 RCBD 2 The Randomized Complete Block Design is also known as the two-way ANOVA without interaction. A
More informationDifferences of Least Squares Means
STAT:5201 Homework 9 Solutions 1. We have a model with two crossed random factors operator and machine. There are 4 operators, 8 machines, and 3 observations from each operator/machine combination. (a)
More informationSTAT 350 Final (new Material) Review Problems Key Spring 2016
1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,
More informationUnbalanced Designs Mechanics. Estimate of σ 2 becomes weighted average of treatment combination sample variances.
Unbalanced Designs Mechanics Estimate of σ 2 becomes weighted average of treatment combination sample variances. Types of SS Difference depends on what hypotheses are tested and how differing sample sizes
More informationTwo-Way Factorial Designs
81-86 Two-Way Factorial Designs Yibi Huang 81-86 Two-Way Factorial Designs Chapter 8A - 1 Problem 81 Sprouting Barley (p166 in Oehlert) Brewer s malt is produced from germinating barley, so brewers like
More information