Week 3 & 4 Randomized Blocks, Latin Squares, and Related Designs

Size: px
Start display at page:

Download "Week 3 & 4 Randomized Blocks, Latin Squares, and Related Designs"

Transcription

1 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 1/81 Week 3 & 4 Randomized Blocks, Latin Squares, and Related Designs Joslin Goh Simon Fraser University

2 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 2/81 Outline 31 Randomized Complete Block Design 32 Balanced Incomplete Block Design 33 Latin Square Design 34 Graeco-Latin Square Design 35 Analysis of Covariance

3 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 3/81 31 Randomized Complete Block Design Blocking is to systematically eliminate its effect on treatment effects Within each block, subjects are assumed to be homogeneous The variations within blocks should be less than the variations between blocks Nuisance factor: a design factor that may have an effect on the response but is not of primary interest Unknown and uncontrollable: randomization Known and uncontrollable: analysis of covariance Known and controllable: blocking

4 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 4/81 Examples a Four training processes for IQ Score, 16 Students, each of 4 students are from the same department b Test 3 crop varieties on 5 fields c An industrial engineer is conducting an experiment on eye focus time He is interested in the effect of the distance of the object from the eye on the focus time Four different distances are of interest He has five subjects available for the experiment

5 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 5/81 Model y ij = µ+τ i +β j +ǫ ij,i = 1,,a,j = 1,,b µ: grand/overall mean τ i : ith treatment effect, a i=1 τ i = 0 β j : jth block effect, b j=1 β j = 0 ǫ ij : experimental error iid N(0,σ 2 )

6 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 6/81 Estimates y ij = ˆµ+ ˆτ i + ˆβ j +r ij

7 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 7/81 ANOVA decomposition a i=1 b a b (y ij ȳ ) 2 = b (ȳ i ȳ ) 2 +a (ȳ j ȳ ) 2 j=1 i=1 i=1 j=1 j=1 a b + (y ij ȳ i ȳ j +ȳ ) 2 SS Total = SS Treatments +SS Blocks +SS Error

8 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 8/81 ANOVA Table Source Sum of Square DOF Mean Square F Treatments b a i=1 (ȳ i ȳ ) 2 a-1 MS MS Treatments Treatments MS Error Blocks a b j=1 (ȳ j ȳ ) 2 b-1 MS Blocks Error SS Total SS Treatments SS Blocks (a-1)(b-1) MS Error Total b j=1 (y ij ȳ ) 2 N-1 a i=1

9 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 9/81 E(MS Error ) = σ 2 E(MS Treatments ) = σ 2 + b a i=1 τ2 i a 1 E(MS Blocks ) = σ 2 + a b j=1 β2 j b 1 Effective blocking

10 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 10/81 Hypothesis testing H 0 : τ 1 = τ 2 = = τ a H 1 : i j such that τ i τ j If F > F α,a 1,(a 1)(b 1), reject H 0 at level α

11 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 11/81 Assumption checking Normality: normal probability plot Independence: Constant variance: Additivity between blocks and treatments (Tukey s non-additivity test)

12 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 12/81 Multiple comparisons a If H 0 is rejected, multiple comparisons of τ i s should be performed b Tukey multiple comparison method declares treatments i and j are statistical different if t ij = c Simultaneous CI for τ i τ j are ȳ i ȳ j ˆσ 1/b+1/b > 1 2 q α,a,(a 1)(b 1) ȳ i ȳ j ±q α,a,(a 1)(b 1) ˆσ b

13 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 13/81 Example of Penicillin Comparison of four variants of a penicillin product process It was known that an important raw material, corn steep liquor, was quite variable Each blend of corn steep liquor is sufficient for four runs The experiment was protected from extraneous unknown sources of bias by running the treatments in random order within each block Block Treatment Block (blend of corn steep liquor) A B C D average blend 1 89 (1) 88 (3) 97 (2) 94 (4) 92 blend 2 84 (4) 77 (2) 92 (3) 79 (1) 83 blend 3 81 (2) 87 (1) 87 (4) 85 (3) 85 blend 4 87 (1) 92 (3) 89 (2) 84 (4) 88 blend 5 79 (3) 81 (4) 80 (1) 88 (2) 82 Treatment Average =Grand Average

14 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 14/81 Graphical check yield yield A B C D treat Blend1 Blend2 Blend3 Blend4 Blend5 blend plot(yield blend+treat,data=penicillin)

15 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 15/81 Interaction plots penicillin$blend penicillin$treat mean of penicillin$yield Blend1 Blend5 Blend3 Blend4 Blend2 mean of penicillin$yield D B C A A B C D penicillin$treat Blend1 Blend2 Blend3 Blend4 Blend5 penicillin$blend interactionplot(penicillin$treat,penicillin$blend,penicillin$yield) interactionplot(penicillin$blend,penicillin$treat,penicillin$yield)

16 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 16/81 Fit the model > g <- lm(yield treat+blend,penicillin) > summary(g) Call: lm(formula = yield treat + blend, data = penicillin) Coefficients: Estimate Std Error t value Pr(> t ) (Intercept) e-13 *** treatb treatc treatd blendblend * blendblend * blendblend blendblend ** --- Signif codes: 0 *** 0001 ** 001 * Residual standard error: 434 on 12 degrees of freedom Multiple R-squared: 05964, Adjusted R-squared: 0361 F-statistic: 2534 on 7 and 12 DF, p-value:

17 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 17/81 Normality assumption Normal Q Q Plot Sample Quantiles Theoretical Quantiles

18 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 18/81 Independence assumption Residuals Run order

19 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 19/81 Constant variance assumption Residuals Fitted

20 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 20/81 Constant variance assumption > bartletttest(yield treat,penicillin) Bartlett test of homogeneity of variances data: yield by treat Bartlett s K-squared = 06901, df = 3, p-value = > bartletttest(yield blend,penicillin) Bartlett test of homogeneity of variances data: yield by blend Bartlett s K-squared = 23859, df = 4, p-value = 06652

21 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 21/81 Tukey s Non-additivity Test Use package alr3 > tukeynonaddtest(g) t value Pr(> t )

22 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 22/81 > anova(g) Analysis of Variance Table Response: yield Df Sum Sq Mean Sq F value Pr(>F) treat blend * Residuals Signif codes: 0 *** 0001 ** 001 *

23 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 23/81 Fitting the model with the interaction term > alpha <- c(0,g$coef[2:4]) > alpha treatb treatc treatd > beta <- c(0,g$coef[5:8]) > beta blendblend2 blendblend3 blendblend4 blendblend > ab <- rep(alpha,5)*rep(beta,rep(4,5)) > h <- update(g, +ab) > anova(h) Analysis of Variance Table Response: yield Df Sum Sq Mean Sq F value Pr(>F) treat blend ab Residuals Signif codes: 0 *** 0001 ** 001 *

24 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 24/81 Multiple comparisons Fisher s LSD ( Least Significant Difference ) Bonferroni method Tukey method

25 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 25/81 Example 2: This experiment is to assess the effects of four different cholesterol-reducing diets on persons who have hypercholesterolemia To run a CRBD, the blocks are chosen to represent combinations of the various gender-age-body size categories of interest For each block of subjects, the four diets are randomly assigned to the sample of four persons in the block Each subject is then followed for one year, after which the change in cholesterol level is recorded Gender: Male/Female Age: Above/Below 50 Body Size: Quatelet index above/below 35

26 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 26/81 Factor: Diet Treatments: Diet 1, 2, 3, 4

27 > g <- lm(change diet+block,cholesterol) > summary(g) Call: lm(formula = change diet + block, data = cholesterol) Coefficients:Estimate Std Error t value Pr(> t ) (Intercept) e-15 *** dietb *** dietc *** dietd e-05 *** blockb e-07 *** blockb e-06 *** blockb *** blockb *** blockb blockb e-10 *** blockb Signif codes: 0 *** 0001 ** 001 * Residual standard error: on 21 degrees of freedom Multiple R-squared: 09618, Adjusted R-squared: F-statistic: 5289 on 10 and 21 DF, p-value: 1312e-12 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 27/81

28 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 28/81 > anova(g) Analysis of Variance Table Response: change Df Sum Sq Mean Sq F value Pr(>F) diet e-05 *** block e-13 *** Residuals

29 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 29/81 32 Balanced Incomplete Block Design Gathering faculty teaching evaluations by in-class and online surveys: their effects on response rates and evaluations by Curt J Dommeyer, Paul Baum, Robert W Hanna and Kenneth S Chapman Experimental Design: The study was conducted using undergraduate business majors at California State University, Northridge A total of 16 instructors participated in the study Although the sample of instructors represents a convenience sample, the courses represent a cross-section of lower and upper division core courses that are required for business majors Courses by area online treatment Accounting Business Law Economics Finance Management Marketing Management Sci Total Grade incentive Grade feedback Incentive Demo Control group Total

30 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 30/81 Each of the instructors in this study was assigned to have one of his/her sections evaluated in-class and the other evaluated online In the online evaluation, each instructor was assigned either to a control group or to one of the following online treatments: 1 a very modest grade incentive (one-quarter of a percent) for completing the online evaluations; 2 an in-class demonstration of how to log on to the web site and complete the form; 3 an early grade feedback incentive in which students were told they would receive early feedback of their course grades (by postcard and/or posting the grades online) if at least two-thirds of the class completed online evaluations

31 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 31/81 Seven different hardwood concentrations are being studied to determine their effect on the strength of the paper produced However, the pilot plant can only produce three runs each day As days may differ, the analyst uses the following design Hardwood Days Concentration(%)

32 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 32/81 Definition and notation A Balanced Incomplete Block Design BIBD(N,a,k,b,r,λ) is an incomplete block design in which

33 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 33/81 a: the number of treatments k: the block size b: the number of blocks r: the number of times each treatment occurs λ: the number of times each pair of treatments appears in the same block N: the number of observations

34 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 34/81 Some contraints Note that for some values of a,r,b,k, there do not exist a BIBD For example, a = 8,r = 8,k = 4,b = 16

35 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 35/81 Examples block 1 A C D block 2 A B C block 3 B C D block 4 A B D block 1 A B block 2 A C block 3 B C

36 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 36/81 Model y ij = µ+τ i +β j +ǫ ij,i = 1,,a,j = 1,b µ: overall mean τ i : ith treatment effect, i τ i = 0 β j : jth block effect, j β j = 0 ǫ ij : random error ǫ ij iid N(0,σ 2 ) Note: 1 Not all y ij exists 2 Treatments and blocks are not orthogonal (independent)

37 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 37/81 Estimation where ˆµ = ȳ ; ˆτ i = kq i λa ; ˆβj = rq j λb Q i = y i 1 k b j=1 n ijy j ; Q j = y j 1 r a i=1 n ijy i ; n ij is 1 if treatment i appears in block j and 0 otherwise; i ˆτ i = 0; j ˆβ j = 0; i Q i = 0; Var(Q i ) = (k 1)r k σ 2 and Var(ˆτ i ) = k(a 1) λa 2 σ 2 ; Q i is the adjusted total for ith treatment and they are not independent (Var(ˆτ i ˆτ j ) = 2kσ 2 /(λa))

38 Note the blocks are incomplete and thus ȳ i ȳ is not an unbiased estimate of τ i For example E(ȳ 1 ) = µ+τ 1 +(β 1 +β 2 +β 3 )/3 E(ȳ ) = µ E(ȳ 1 ȳ ) = τ 1 +(β 1 +β 2 +β 3 )/3 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 38/81

39 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 39/81 ANOVA Table Define: SS Total = i,j (y ij ȳ ) 2 = i,j y 2 ij y2 N SS Blocks = k j (ȳ j ȳ ) 2 = 1 k j y 2 j y2 N SS Treatments(adjusted) = k a i=1 Q2 i λa SS Error = SS Total SS Blocks SS Treatments(adjusted)

40 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 40/81 Source Sum of Squares DOF Mean Squares F Blocks SS Blocks b 1 SS Blocks /(b 1) Trts(adjusted) SS Treatments(adjusted) a 1 SS Treatments(adjusted) /(a 1) Error SS Error N a b+1 SS Error /(N a b+1) Total SS Total N 1 MS Treamtments(adjusted) SS Error Test statistic for testing the equality of the treatment effect is F = MS Treamtments(adjusted) SS Error

41 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 41/81 Assessing Block Effects To assess the block effects, we use and SS Blocks(adjusted) = r b j=1 (Q j )2 λb SS Total = SS Treatments + SS Blocks(adjusted) + SS Error But SS Total SS Treatments(adjusted) + SS Blocks(adjusted) + SS Error

42 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 42/81 Confidence interval For τ i τ j Fisher LSD CI Tukey s CI ˆτ i ˆτ j ±t α/2,n a b+1 2k λa MS Error ˆτ i ˆτ j ± q α,a,n a b+1 2 2k λa MS Error

43 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 43/81 Working Example Suppose that a chemical engineer thinks that the time of reaction for a chemical process is a function of the type of catalyst employed Four catalysts are currently being investigated The experimental procedure consists of selecting a batch of raw material, loading the pilot plant, applying each catalyst in a separate run of the pilot plant, and observing the reaction time Because variations in the batches of raw material may affect the performance of the catalysts, the engineer decides to use batches of raw material as blocks However, each batch is only large enough to permit three catalysts to be run Therefore a randomized incomplete block design must be used Treatment Blocks(Batch of Raw Material) (Catalyst) y i y j =y

44 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 44/81 In the example, we have Q 1 = ( ) = 9/3 Q 2 = ( ) = 7/3 Q 3 = ( ) = 4/3 Q 4 = ( ) = 20/3

45 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 45/81 For example: SS Total = i,j y2 ij y2 N SS Blocks = 1 k = = 81 j y2 j y2 N = 1 3 ( ) = 55 SS Treatments(adjusted) = [( 9/3)2 +( 7/3) 2 +( 4/3) 2 +(20/3) 2 ]) 2 4 = 2275 SS Error = = 325

46 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 46/81 Example Incorrect analysis for testing equality of treatment effects > g<-lm(y Treatment + Block, catalyst) > anova(g) Analysis of Variance Table Response: y Df Sum Sq Mean Sq F value Pr(>F) Treatment * Block *** Residuals Signif codes: 0 *** 0001 ** 001 *

47 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 47/81 Correct analysis for testing equality of treatment effects > g<-lm(y Block + Treatment, catalyst) > anova(g) Analysis of Variance Table Response: y Df Sum Sq Mean Sq F value Pr(>F) Block ** Treatment * Residuals Signif codes: 0 *** 0001 ** 001 *

48 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 48/81 #set the parameters a<-4 b<-4 k<-3 r<-3 n<-12 lambda<-r*(k-1)/(a-1) # create n_ij Nij<-matrix(0,a,a) for(i in 1:n) Nij[asinteger(Treatment[i]),asinteger(Block[i])]<-1 # compute y_i ysumi<-vector( numeric,a) for(i in 1:a) ysumi[i]<-sum(y[((i-1)*r+1):(i*r)]) #compute y_j ysumj<-vector( numeric,b) for(i in 1:n) {ysumj[asinteger(block[i])]<- ysumj[asinteger(block[i])] + y[i]} #compute Q_i Qi<-vector("numeric",a) for(i in 1:a) Qi[i]<-ysumi[i]-sum(Nij[i,]*ysumj)/k #compute hat of tau_i tauihat<-k*qi/(lambda*a)

49 > #(1-alpha)100% Fisher LSD CI for tau_i- tau_j > alpha<-005 > mse<-065 > for(i in 1:(a-1)) + for(j in (i+1):a) + { + print( + c(tauihat[i]-tauihat[j]-qt(1-alpha/2,n-a-b+1)*sqrt(mse*2*k/(lambda*a)), + tauihat[i]-tauihat[j]+qt(1-alpha/2,n-a-b+1)*sqrt(mse*2*k/(lambda*a)))) + } [1] [1] [1] [1] [1] [1] Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 49/81

50 > #(1-alpha)100% Tukey CI for tau_i- tau_j > alpha<-005 > mse<-065 > for(i in 1:(a-1)) + for(j in (i+1):a) + { + print( + c(tauihat[i]-tauihat[j]-(qtukey(1-alpha,a,n-a-b+1)/sqrt(2)) + *sqrt(mse*2*k/(lambda*a)), + tauihat[i]-tauihat[j]+(qtukey(1-alpha,a,n-a-b+1)/sqrt(2)) + *sqrt(mse*2*k/(lambda*a)))) + } [1] [1] [1] [1] [1] [1] Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 50/81

51 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 51/81 Other incomplete block designs Youden square Partially balanced incomplete block design Cubic and rectangular Lattices Cyclic designs

52 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 52/81 33 Latin Square Designs Agronomy experiments: Suppose we want to test the relative effectiveness of 5 different fertilizer mixtures on oats The five experiments cannot be carried out on the same plot of land Even contiguous plots may vary in fertility because of a moisture gradient, different previous use of the land, or some other reasons Dividing a single plot into a 5 5 grid of subplots, and administering the fertilizers according to a Latin square arrangement as in the figure below:

53 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 53/81 Typewriters experiments: A company is interested in purchasing one brand of typewriter from among the 5 kinds available in the market To make decision, a study is performed to evaluate the typewriter It is suspected that the time of the day as well the day of the week in which the test is carried out affects the output on the machine Days Shifts Total Treatments Mon D B A C E Tue E C B D A Wed B E C A D Thu A D E B C Fri C A D E B Total

54 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 54/81 Example: A hardness testing machine presses a pointed rod (the tip ) into a metal specimen (a coupon ), with a known force The depth of the depression is a measure of the hardness of the specimen It is feared that, depending on the kind of tip used, the machine might give different readings The experimenter wants 4 observations on each of the 4 types of tips Suppose that the coupon and the operator of the testing machine are thought to be factors Suppose there are p = 4 operators, p = 4 coupons, and p = 4 tips The first two are nuisance factors, the last is the treatment factor Operator Coupon k = 1 k = 2 k = 3 k = 4 i = 1 A = 93 B = 93 C = 95 D = 102 i = 2 B = 94 A = 94 D = 10 C = 97 i = 3 C = 92 D = 96 A = 96 B = 99 i = 4 D = 97 C = 94 B = 98 A = 10

55 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 55/81 Definition Consider one treatment factor and two nuisance/blocking factors A design is called Latin square design if each treatment appears exactly once in each row and in each column The number of levels of each blocking factor must equal the number of levels of the treatment factor Small run sizes Randomization Randomly select a Latin square Randomize the experiment units and trial orders

56 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 56/81 Model y ijk = µ+α i +τ j +β k +ǫ ijk,i,j,k = 1,,p y ijk : the response in row i, column k using treatment j α i : row effect, i α i = 0 τ j : treatment effect, j τ j = 0 β k : column effect, k β k = 0 ǫ ijk iid N(0,σ 2 )

57 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 57/81 Estimation ˆµ = ȳ ˆα i = ȳ i ȳ ˆτ j = ȳ j ȳ ˆβ k = ȳ k ȳ r ijk = y ijk (ȳ i +ȳ j +ȳ k )+2ȳ

58 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 58/81 ANOVA table Define: SS Total = i (y ijk ȳ ) 2 j k SS Rows = p i (ȳ i ȳ ) 2 SS Treatments = p j (ȳ j ȳ ) 2 SS Columns = p k (ȳ k ȳ ) 2 SS Error = i [y ijk (ȳ i +ȳ j +ȳ k )+2ȳ ] 2 j k

59 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 59/81 Source Sum of Squares DO F Mean Squares F Treatments SS Treatments p 1 SS Treatments /(p 1) Rows SS Rows p 1 SS Rows /(p 1) Columns SS Columns p 1 SS Columns /(p 1) Error SS Error (p 2)(p 1) SS Error /[(p 2)(p 1)] Total SS Total p 2 1 MS Treatments MS Error Compare F to F α,p 1,(p 2)(p 1) to reject or accept H 0 : τ 1 = = τ p

60 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 60/81 > anova(g) Analysis of Variance Table Response: y Df Sum Sq Mean Sq F value Pr(>F) tip *** operator e-05 *** coupon * Residuals Signif codes: 0 *** 0001 ** 001 *

61 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 61/81 Multiple comparisons If H 0 is rejected, multiple comparisons of τ i and τ j should be performed The test statistic is t ij = ȳi ȳ j ˆσ 1p + 1 p where ˆσ 2 is the mean square error in the ANOVA table Tukey multiple comparison method claims treatments i and j are statistically significantly different if t ij > 1 2 q α,p,(p 2)(p 1)

62 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 62/81 Application Crossover design is a type of randomized clinical trial Each participants in the design is randomized to either group 1 or group 2 All participants in group 1 receive drug A in the first treatment period and drug B in the second period All participants in group 2 receive drug B in the first treatment period and drug A in the second treatment period Often there is a washout period between the two active drug periods During the washout period, they receive no study medication The purpose of the washout period is to reduce the likelihood that study medication taken in the first period will have an effect that carries over the next period Latin Squares I II III IV V Subject Period 1 A B B A B A A B A B Period 2 B A A B A B B A B A

63 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 63/81 Consider a clinical trial comparing Motrin versus placebo for the treatment of tennis elbow Each participant was randomized to receive either Motrin (group 1) or placebo (group 2) for a 3-week period All participants then had 2-week washout period during which they receive no study medication All participants were then crossed-over for a second 3-week period to receive the opposite study medication from that initially received ANOVA table for crossover design Source DOF Subjects (Columns) 9 Periods (Rows) 1 Treatments (Letters) 1 Error 18 Total 29

64 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 64/81 33 Graeco-Latin Squares Definition: Orthogonal Latin squares Two Latin squares are said to be orthogonal if each pair of letters appears exactly once in the superimposed squares The superimposed square is called Graeco-Latin square Aα Bβ Cγ Bγ Cα Aβ Cβ Aγ Bα

65 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 65/81 Model y ijkl = µ+θ i +τ j +ω k +φ l +ǫ ijkl,i,j,k,l = 1,,p y ijkl : the observation in row i and column l for Latin letter j and Greek letter k θ i : ith row effect τ j : jth Latin letter treatment effect ω k : kth Greek letter treatment effect φ l : lth column effect ǫ ijkl iid N(0,σ 2 )

66 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 66/81 ANOVA Table Source Sum of Squares DOF Rows p i (ȳ i ȳ ) 2 p 1 Latin letter treatments p j (ȳ j ȳ ) 2 p 1 Greek letter treatments p k (ȳ k ȳ ) 2 p 1 Columns p l (ȳ l ȳ ) 2 p 1 Error SS Error (by substraction) (p 3)(p 1) Total i j k l (y ijkl ȳ ) 2 p 2 1 F-test and Tukey s multiple comparison are similar as those in Latin square design

67 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 67/81 35 Analysis of Covariance (ANCOVA) Consider an experiment (Flurry, 1939) to study the breaking strength (y) in grams of three types of starch film The breaking strength is also known to depend on the thickness of the film (x) as measured in 10 4 inches Because film thickness varies from run to run and its values cannot be controlled or chosen prior to the experiment, it should be treated as a covariate whose effect on strength needs to be accounted for before comparing the three types of starch Objective: perform the treatment comparisons by incorporating the information of auxiliary covariate x

68 Data for starch experiment Canna Starch corn Potato y x y x y x Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 68/81

69 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 69/81 Model y ij = µ+τ i +γx ij +ǫ ij,i = 1,,k;j = 1,,n i where µ: overall mean τ i : ith treatment effect, k i=1 τ i = 0 or τ 1 = 0 x ij : the covariate value for y ij γ:: the regression coefficient for the covariate iid ǫ ij N(0,σ 2 ) Special cases a γx ij = 0, reduces to complete randomized design b τ i = 0, reduces to simple linear regression

70 Estimation Assume τ 1 = 0 We rewrite the model such that y = Xβ +ǫ where y = y 11 y 1n1 y k1 y knk, x = x x 1n x x 2n x k x knk, β = µ τ 2 τ k γ, ǫ = ǫ 11 ǫ 1n1 ǫ k1 ǫ knk Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 70/81

71 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 71/81 We have ˆβ = (X T X) 1 X T y = ˆµ ˆτ 2 ˆτ kˆγ and thus we also obtain ˆτ i ˆτ j = ˆµ i ˆµ j

72 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 72/81 Hypothesis test H 0 : τ 1 = = τ k Model I: y ij = µ+γx ij +ǫ ij,i = 1,,k;j = 1,,n i Model II: y ij = µ+τ i +γx ij +ǫ ij,i = 1,,k;j = 1,,n i We have SS(Reg, Model I) = SS Total SS Error (Model I) SS Treatments = SS Error (Model I) SS Error (Model II)

73 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 73/81 ANOVA table Source Sum of Square DOF Mean Square F Covariate SS(Reg, Model I) 1 SS(Reg, Model I)/1 MS x /MS Error Treatments SS Treatments k 1 SS Treatments /(k 1) MS Treatments /MS Error Error SS Error (Model II) i n i 1 k MS Error Total SS Total i n i 1

74 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 74/81 Multiple Comparisons To compute Var(ˆτ i ˆτ j ), note that Var(ˆτ i ˆτ j ) = Var(ˆτ i )+Var(ˆτ j ) 2Cov(ˆτ i,ˆτ j ) Use the fact Var(ˆβ) = σ 2 (X T X) 1 in multiple regression Test statistic t ij = ˆτ i ˆτ j ˆ Var(ˆτ i ˆτ j )

75 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 75/81 > # set the model matrix > x<-matrix(0,n,4) > x[,1]<-rep(1,n) > x[14:32,2]<-1 > x[33:49,3]<-1 > x[,4]<-dataset[,2] > # fit the model > g<-lm(y x[,2]+x[,3]+x[,4])

76 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 76/81 > summary(g) Call: lm(formula = y x[, 2] + x[, 3] + x[, 4]) Coefficients: Estimate Std Error t value Pr(> t ) (Intercept) x[, 2] x[, 3] x[, 4] *** --- Signif codes: 0 *** 0001 ** 001 * Residual standard error: 1647 on 45 degrees of freedom Multiple R-squared: 06815, Adjusted R-squared: F-statistic: 3209 on 3 and 45 DF, p-value: 3001e-11

77 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 77/81 Effect Estimate Standard Error t value p value intercept (µ) γ τ 2 τ τ 3 τ

78 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 78/81 > starch<-vector("character",n) > starch[1:13]<-"a" > starch[14:32]<-"b" > starch[33:49]<-"c" > anova(lm(y x[,4]+starch)) #ANOVA table Analysis of Variance Table Response: y Df Sum Sq Mean Sq F value Pr(>F) x[, 4] e-12 *** starch Residuals Signif codes: 0 *** 0001 ** 001 *

79 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 79/81 Incorrect analysis > anova(lm(y starch+x[,4])) #ANOVA table Analysis of Variance Table Response: y Df Sum Sq Mean Sq F value Pr(>F) starch e-11 *** x[, 4] *** Residuals Signif codes: 0 *** 0001 ** 001 *

80 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 80/81 > anova(lm(y starch)) #ANOVA table ignoring thickness Analysis of Variance Table Response: y Df Sum Sq Mean Sq F value Pr(>F) starch e-09 *** Residuals Signif codes: 0 *** 0001 ** 001 *

81 Week 3 & 4Randomized Blocks, Latin Squares, and Related Designs p 81/81 > solve(t(x)%*%x) [,1] [,2] [,3] [,4] [1,] [2,] [3,] [4,] > txxinv<-solve(t(x)%*%x) > sqrt(27110*(txxinv[2,2]+txxinv[3,3]-2*txxinv[2,3])) [1]

Lecture 7: Latin Square and Related Design

Lecture 7: Latin Square and Related Design Lecture 7: Latin Square and Related Design Montgomery: Section 4.2-4.3 Page 1 Automobile Emission Experiment Four cars and four drivers are employed in a study for possible differences between four gasoline

More information

Lecture 7: Latin Squares and Related Designs

Lecture 7: Latin Squares and Related Designs Lecture 7: Latin Squares and Related Designs Montgomery: Section 4.2 and 4.3 1 Lecture 7 Page 1 Automobile Emission Experiment Four cars and four drivers are employed in a study of four gasoline additives(a,b,c,

More information

Chapter 4: Randomized Blocks and Latin Squares

Chapter 4: Randomized Blocks and Latin Squares Chapter 4: Randomized Blocks and Latin Squares 1 Design of Engineering Experiments The Blocking Principle Blocking and nuisance factors The randomized complete block design or the RCBD Extension of the

More information

Chapter 4 Experiments with Blocking Factors

Chapter 4 Experiments with Blocking Factors Chapter 4 Experiments with Blocking Factors 許湘伶 Design and Analysis of Experiments (Douglas C. Montgomery) hsuhl (NUK) DAE Chap. 4 1 / 54 The Randomized Complete Block Design (RCBD; 隨機化完全集區設計 ) 1 Variability

More information

DESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

DESAIN EKSPERIMEN BLOCKING FACTORS. Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya DESAIN EKSPERIMEN BLOCKING FACTORS Semester Genap Jurusan Teknik Industri Universitas Brawijaya Outline The Randomized Complete Block Design The Latin Square Design The Graeco-Latin Square Design Balanced

More information

STAT22200 Spring 2014 Chapter 13B

STAT22200 Spring 2014 Chapter 13B STAT22200 Spring 2014 Chapter 13B Yibi Huang May 27, 2014 13.3.1 Crossover Designs 13.3.4 Replicated Latin Square Designs 13.4 Graeco-Latin Squares Chapter 13B - 1 13.3.1 Crossover Design (A Special Latin-Square

More information

STAT 430 (Fall 2017): Tutorial 8

STAT 430 (Fall 2017): Tutorial 8 STAT 430 (Fall 2017): Tutorial 8 Balanced Incomplete Block Design Luyao Lin November 7th/9th, 2017 Department Statistics and Actuarial Science, Simon Fraser University Block Design Complete Random Complete

More information

Chapter 12. Analysis of variance

Chapter 12. Analysis of variance Serik Sagitov, Chalmers and GU, January 9, 016 Chapter 1. Analysis of variance Chapter 11: I = samples independent samples paired samples Chapter 1: I 3 samples of equal size J one-way layout two-way layout

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

3. Design Experiments and Variance Analysis

3. Design Experiments and Variance Analysis 3. Design Experiments and Variance Analysis Isabel M. Rodrigues 1 / 46 3.1. Completely randomized experiment. Experimentation allows an investigator to find out what happens to the output variables when

More information

Lec 1: An Introduction to ANOVA

Lec 1: An Introduction to ANOVA Ying Li Stockholm University October 31, 2011 Three end-aisle displays Which is the best? Design of the Experiment Identify the stores of the similar size and type. The displays are randomly assigned to

More information

STAT 525 Fall Final exam. Tuesday December 14, 2010

STAT 525 Fall Final exam. Tuesday December 14, 2010 STAT 525 Fall 2010 Final exam Tuesday December 14, 2010 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will

More information

RCB - Example. STA305 week 10 1

RCB - Example. STA305 week 10 1 RCB - Example An accounting firm wants to select training program for its auditors who conduct statistical sampling as part of their job. Three training methods are under consideration: home study, presentations

More information

22s:152 Applied Linear Regression. Take random samples from each of m populations.

22s:152 Applied Linear Regression. Take random samples from each of m populations. 22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each

More information

Ch 3: Multiple Linear Regression

Ch 3: Multiple Linear Regression Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery

More information

2-way analysis of variance

2-way analysis of variance 2-way analysis of variance We may be considering the effect of two factors (A and B) on our response variable, for instance fertilizer and variety on maize yield; or therapy and sex on cholesterol level.

More information

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA 22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each

More information

Stat 217 Final Exam. Name: May 1, 2002

Stat 217 Final Exam. Name: May 1, 2002 Stat 217 Final Exam Name: May 1, 2002 Problem 1. Three brands of batteries are under study. It is suspected that the lives (in weeks) of the three brands are different. Five batteries of each brand are

More information

Lecture 11: Nested and Split-Plot Designs

Lecture 11: Nested and Split-Plot Designs Lecture 11: Nested and Split-Plot Designs Montgomery, Chapter 14 1 Lecture 11 Page 1 Crossed vs Nested Factors Factors A (a levels)and B (b levels) are considered crossed if Every combinations of A and

More information

Unit 6: Orthogonal Designs Theory, Randomized Complete Block Designs, and Latin Squares

Unit 6: Orthogonal Designs Theory, Randomized Complete Block Designs, and Latin Squares Unit 6: Orthogonal Designs Theory, Randomized Complete Block Designs, and Latin Squares STA 643: Advanced Experimental Design Derek S. Young 1 Learning Objectives Understand the basics of orthogonal designs

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression MATH 282A Introduction to Computational Statistics University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/ eariasca/math282a.html MATH 282A University

More information

Unit 8: 2 k Factorial Designs, Single or Unequal Replications in Factorial Designs, and Incomplete Block Designs

Unit 8: 2 k Factorial Designs, Single or Unequal Replications in Factorial Designs, and Incomplete Block Designs Unit 8: 2 k Factorial Designs, Single or Unequal Replications in Factorial Designs, and Incomplete Block Designs STA 643: Advanced Experimental Design Derek S. Young 1 Learning Objectives Revisit your

More information

Stat 502 Design and Analysis of Experiments General Linear Model

Stat 502 Design and Analysis of Experiments General Linear Model 1 Stat 502 Design and Analysis of Experiments General Linear Model Fritz Scholz Department of Statistics, University of Washington December 6, 2013 2 General Linear Hypothesis We assume the data vector

More information

Section 4.6 Simple Linear Regression

Section 4.6 Simple Linear Regression Section 4.6 Simple Linear Regression Objectives ˆ Basic philosophy of SLR and the regression assumptions ˆ Point & interval estimation of the model parameters, and how to make predictions ˆ Point and interval

More information

Assignment 6 Answer Keys

Assignment 6 Answer Keys ssignment 6 nswer Keys Problem 1 (a) The treatment sum of squares can be calculated by SS Treatment = b a ȳi 2 Nȳ 2 i=1 = 5 (5.40 2 + 5.80 2 + 10 2 + 9.80 2 ) 20 7.75 2 = 92.95 Then the F statistic for

More information

Lecture 4. Random Effects in Completely Randomized Design

Lecture 4. Random Effects in Completely Randomized Design Lecture 4. Random Effects in Completely Randomized Design Montgomery: 3.9, 13.1 and 13.7 1 Lecture 4 Page 1 Random Effects vs Fixed Effects Consider factor with numerous possible levels Want to draw inference

More information

STAT 510 Final Exam Spring 2015

STAT 510 Final Exam Spring 2015 STAT 510 Final Exam Spring 2015 Instructions: The is a closed-notes, closed-book exam No calculator or electronic device of any kind may be used Use nothing but a pen or pencil Please write your name and

More information

Lecture 6 Multiple Linear Regression, cont.

Lecture 6 Multiple Linear Regression, cont. Lecture 6 Multiple Linear Regression, cont. BIOST 515 January 22, 2004 BIOST 515, Lecture 6 Testing general linear hypotheses Suppose we are interested in testing linear combinations of the regression

More information

Contents. TAMS38 - Lecture 6 Factorial design, Latin Square Design. Lecturer: Zhenxia Liu. Factorial design 3. Complete three factor design 4

Contents. TAMS38 - Lecture 6 Factorial design, Latin Square Design. Lecturer: Zhenxia Liu. Factorial design 3. Complete three factor design 4 Contents Factorial design TAMS38 - Lecture 6 Factorial design, Latin Square Design Lecturer: Zhenxia Liu Department of Mathematics - Mathematical Statistics 28 November, 2017 Complete three factor design

More information

The Distribution of F

The Distribution of F The Distribution of F It can be shown that F = SS Treat/(t 1) SS E /(N t) F t 1,N t,λ a noncentral F-distribution with t 1 and N t degrees of freedom and noncentrality parameter λ = t i=1 n i(µ i µ) 2

More information

Lecture 10. Factorial experiments (2-way ANOVA etc)

Lecture 10. Factorial experiments (2-way ANOVA etc) Lecture 10. Factorial experiments (2-way ANOVA etc) Jesper Rydén Matematiska institutionen, Uppsala universitet jesper@math.uu.se Regression and Analysis of Variance autumn 2014 A factorial experiment

More information

Vladimir Spokoiny Design of Experiments in Science and Industry

Vladimir Spokoiny Design of Experiments in Science and Industry W eierstraß-institut für Angew andte Analysis und Stochastik Vladimir Spokoiny Design of Experiments in Science and Industry www.ndt.net/index.php?id=8310 Mohrenstr. 39, 10117 Berlin spokoiny@wias-berlin.de

More information

Lec 5: Factorial Experiment

Lec 5: Factorial Experiment November 21, 2011 Example Study of the battery life vs the factors temperatures and types of material. A: Types of material, 3 levels. B: Temperatures, 3 levels. Example Study of the battery life vs the

More information

More about Single Factor Experiments

More about Single Factor Experiments More about Single Factor Experiments 1 2 3 0 / 23 1 2 3 1 / 23 Parameter estimation Effect Model (1): Y ij = µ + A i + ɛ ij, Ji A i = 0 Estimation: µ + A i = y i. ˆµ = y..  i = y i. y.. Effect Modell

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

Analysis of Covariance

Analysis of Covariance Analysis of Covariance (ANCOVA) Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 10 1 When to Use ANCOVA In experiment, there is a nuisance factor x that is 1 Correlated with y 2

More information

MATH 644: Regression Analysis Methods

MATH 644: Regression Analysis Methods MATH 644: Regression Analysis Methods FINAL EXAM Fall, 2012 INSTRUCTIONS TO STUDENTS: 1. This test contains SIX questions. It comprises ELEVEN printed pages. 2. Answer ALL questions for a total of 100

More information

Lecture 1: Linear Models and Applications

Lecture 1: Linear Models and Applications Lecture 1: Linear Models and Applications Claudia Czado TU München c (Claudia Czado, TU Munich) ZFS/IMS Göttingen 2004 0 Overview Introduction to linear models Exploratory data analysis (EDA) Estimation

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

STK4900/ Lecture 3. Program

STK4900/ Lecture 3. Program STK4900/9900 - Lecture 3 Program 1. Multiple regression: Data structure and basic questions 2. The multiple linear regression model 3. Categorical predictors 4. Planned experiments and observational studies

More information

Factorial and Unbalanced Analysis of Variance

Factorial and Unbalanced Analysis of Variance Factorial and Unbalanced Analysis of Variance Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 04-Jan-2017 Nathaniel E. Helwig (U of Minnesota)

More information

QUEEN MARY, UNIVERSITY OF LONDON

QUEEN MARY, UNIVERSITY OF LONDON QUEEN MARY, UNIVERSITY OF LONDON MTH634 Statistical Modelling II Solutions to Exercise Sheet 4 Octobe07. We can write (y i. y.. ) (yi. y i.y.. +y.. ) yi. y.. S T. ( Ti T i G n Ti G n y i. +y.. ) G n T

More information

Stat 6640 Solution to Midterm #2

Stat 6640 Solution to Midterm #2 Stat 6640 Solution to Midterm #2 1. A study was conducted to examine how three statistical software packages used in a statistical course affect the statistical competence a student achieves. At the end

More information

Lecture 2. The Simple Linear Regression Model: Matrix Approach

Lecture 2. The Simple Linear Regression Model: Matrix Approach Lecture 2 The Simple Linear Regression Model: Matrix Approach Matrix algebra Matrix representation of simple linear regression model 1 Vectors and Matrices Where it is necessary to consider a distribution

More information

Two-Way Analysis of Variance - no interaction

Two-Way Analysis of Variance - no interaction 1 Two-Way Analysis of Variance - no interaction Example: Tests were conducted to assess the effects of two factors, engine type, and propellant type, on propellant burn rate in fired missiles. Three engine

More information

Lecture 5: Comparing Treatment Means Montgomery: Section 3-5

Lecture 5: Comparing Treatment Means Montgomery: Section 3-5 Lecture 5: Comparing Treatment Means Montgomery: Section 3-5 Page 1 Linear Combination of Means ANOVA: y ij = µ + τ i + ɛ ij = µ i + ɛ ij Linear combination: L = c 1 µ 1 + c 1 µ 2 +...+ c a µ a = a i=1

More information

Models for Clustered Data

Models for Clustered Data Models for Clustered Data Edps/Psych/Soc 589 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline Notation NELS88 data Fixed Effects ANOVA

More information

Sleep data, two drugs Ch13.xls

Sleep data, two drugs Ch13.xls Model Based Statistics in Biology. Part IV. The General Linear Mixed Model.. Chapter 13.3 Fixed*Random Effects (Paired t-test) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch

More information

Unit 12: Analysis of Single Factor Experiments

Unit 12: Analysis of Single Factor Experiments Unit 12: Analysis of Single Factor Experiments Statistics 571: Statistical Methods Ramón V. León 7/16/2004 Unit 12 - Stat 571 - Ramón V. León 1 Introduction Chapter 8: How to compare two treatments. Chapter

More information

21.0 Two-Factor Designs

21.0 Two-Factor Designs 21.0 Two-Factor Designs Answer Questions 1 RCBD Concrete Example Two-Way ANOVA Popcorn Example 21.4 RCBD 2 The Randomized Complete Block Design is also known as the two-way ANOVA without interaction. A

More information

Models for Clustered Data

Models for Clustered Data Models for Clustered Data Edps/Psych/Stat 587 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline Notation NELS88 data Fixed Effects ANOVA

More information

STATISTICS 368 AN EXPERIMENT IN AIRCRAFT PRODUCTION Christopher Wiens & Douglas Wiens

STATISTICS 368 AN EXPERIMENT IN AIRCRAFT PRODUCTION Christopher Wiens & Douglas Wiens STATISTICS 368 AN EXPERIMENT IN AIRCRAFT PRODUCTION Christopher Wiens & Douglas Wiens April 21, 2005 The progress of science. 1. Preliminary description of experiment We set out to determine the factors

More information

Lecture 9: Factorial Design Montgomery: chapter 5

Lecture 9: Factorial Design Montgomery: chapter 5 Lecture 9: Factorial Design Montgomery: chapter 5 Page 1 Examples Example I. Two factors (A, B) each with two levels (, +) Page 2 Three Data for Example I Ex.I-Data 1 A B + + 27,33 51,51 18,22 39,41 EX.I-Data

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression In simple linear regression we are concerned about the relationship between two variables, X and Y. There are two components to such a relationship. 1. The strength of the relationship.

More information

SCHOOL OF MATHEMATICS AND STATISTICS

SCHOOL OF MATHEMATICS AND STATISTICS SHOOL OF MATHEMATIS AND STATISTIS Linear Models Autumn Semester 2015 16 2 hours Marks will be awarded for your best three answers. RESTRITED OPEN BOOK EXAMINATION andidates may bring to the examination

More information

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments. Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a

More information

Unit 7: Random Effects, Subsampling, Nested and Crossed Factor Designs

Unit 7: Random Effects, Subsampling, Nested and Crossed Factor Designs Unit 7: Random Effects, Subsampling, Nested and Crossed Factor Designs STA 643: Advanced Experimental Design Derek S. Young 1 Learning Objectives Understand how to interpret a random effect Know the different

More information

Extensions of One-Way ANOVA.

Extensions of One-Way ANOVA. Extensions of One-Way ANOVA http://www.pelagicos.net/classes_biometry_fa18.htm What do I want You to Know What are two main limitations of ANOVA? What two approaches can follow a significant ANOVA? How

More information

19. Blocking & confounding

19. Blocking & confounding 146 19. Blocking & confounding Importance of blocking to control nuisance factors - day of week, batch of raw material, etc. Complete Blocks. This is the easy case. Suppose we run a 2 2 factorial experiment,

More information

Chapter 4 Randomized Blocks, Latin Squares, and Related Designs Solutions

Chapter 4 Randomized Blocks, Latin Squares, and Related Designs Solutions Solutions from Montgomery, D. C. (008) Design and Analysis of Experiments, Wiley, NY Chapter 4 Randomized Blocks, Latin Squares, and Related Designs Solutions 4.. The ANOVA from a randomized complete block

More information

Extensions of One-Way ANOVA.

Extensions of One-Way ANOVA. Extensions of One-Way ANOVA http://www.pelagicos.net/classes_biometry_fa17.htm What do I want You to Know What are two main limitations of ANOVA? What two approaches can follow a significant ANOVA? How

More information

Stat 5102 Final Exam May 14, 2015

Stat 5102 Final Exam May 14, 2015 Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions

More information

Linear Regression. In this lecture we will study a particular type of regression model: the linear regression model

Linear Regression. In this lecture we will study a particular type of regression model: the linear regression model 1 Linear Regression 2 Linear Regression In this lecture we will study a particular type of regression model: the linear regression model We will first consider the case of the model with one predictor

More information

1. The (dependent variable) is the variable of interest to be measured in the experiment.

1. The (dependent variable) is the variable of interest to be measured in the experiment. Chapter 10 Analysis of variance (ANOVA) 10.1 Elements of a designed experiment 1. The (dependent variable) is the variable of interest to be measured in the experiment. 2. are those variables whose effect

More information

Stat 579: Generalized Linear Models and Extensions

Stat 579: Generalized Linear Models and Extensions Stat 579: Generalized Linear Models and Extensions Mixed models Yan Lu Feb, 2018, week 7 1 / 17 Some commonly used experimental designs related to mixed models Two way or three way random/mixed effects

More information

Multiple comparisons - subsequent inferences for two-way ANOVA

Multiple comparisons - subsequent inferences for two-way ANOVA 1 Multiple comparisons - subsequent inferences for two-way ANOVA the kinds of inferences to be made after the F tests of a two-way ANOVA depend on the results if none of the F tests lead to rejection of

More information

STAT Final Practice Problems

STAT Final Practice Problems STAT 48 -- Final Practice Problems.Out of 5 women who had uterine cancer, 0 claimed to have used estrogens. Out of 30 women without uterine cancer 5 claimed to have used estrogens. Exposure Outcome (Cancer)

More information

Written Exam (2 hours)

Written Exam (2 hours) M. Müller Applied Analysis of Variance and Experimental Design Summer 2015 Written Exam (2 hours) General remarks: Open book exam. Switch off your mobile phone! Do not stay too long on a part where you

More information

http://www.statsoft.it/out.php?loc=http://www.statsoft.com/textbook/ Group comparison test for independent samples The purpose of the Analysis of Variance (ANOVA) is to test for significant differences

More information

Matrices and vectors A matrix is a rectangular array of numbers. Here s an example: A =

Matrices and vectors A matrix is a rectangular array of numbers. Here s an example: A = Matrices and vectors A matrix is a rectangular array of numbers Here s an example: 23 14 17 A = 225 0 2 This matrix has dimensions 2 3 The number of rows is first, then the number of columns We can write

More information

Homework 3 - Solution

Homework 3 - Solution STAT 526 - Spring 2011 Homework 3 - Solution Olga Vitek Each part of the problems 5 points 1. KNNL 25.17 (Note: you can choose either the restricted or the unrestricted version of the model. Please state

More information

THE UNIVERSITY OF CHICAGO Booth School of Business Business 41912, Spring Quarter 2012, Mr. Ruey S. Tsay

THE UNIVERSITY OF CHICAGO Booth School of Business Business 41912, Spring Quarter 2012, Mr. Ruey S. Tsay THE UNIVERSITY OF CHICAGO Booth School of Business Business 41912, Spring Quarter 2012, Mr. Ruey S. Tsay Lecture 3: Comparisons between several multivariate means Key concepts: 1. Paired comparison & repeated

More information

ST505/S697R: Fall Homework 2 Solution.

ST505/S697R: Fall Homework 2 Solution. ST505/S69R: Fall 2012. Homework 2 Solution. 1. 1a; problem 1.22 Below is the summary information (edited) from the regression (using R output); code at end of solution as is code and output for SAS. a)

More information

STAT 135 Lab 10 Two-Way ANOVA, Randomized Block Design and Friedman s Test

STAT 135 Lab 10 Two-Way ANOVA, Randomized Block Design and Friedman s Test STAT 135 Lab 10 Two-Way ANOVA, Randomized Block Design and Friedman s Test Rebecca Barter April 13, 2015 Let s now imagine a dataset for which our response variable, Y, may be influenced by two factors,

More information

Multi-Factor Experiments

Multi-Factor Experiments ETH p. 1/31 Multi-Factor Experiments Twoway anova Interactions More than two factors ETH p. 2/31 Hypertension: Effect of biofeedback Biofeedback Biofeedback Medication Control + Medication 158 188 186

More information

ANOVA (Analysis of Variance) output RLS 11/20/2016

ANOVA (Analysis of Variance) output RLS 11/20/2016 ANOVA (Analysis of Variance) output RLS 11/20/2016 1. Analysis of Variance (ANOVA) The goal of ANOVA is to see if the variation in the data can explain enough to see if there are differences in the means.

More information

What is Experimental Design?

What is Experimental Design? One Factor ANOVA What is Experimental Design? A designed experiment is a test in which purposeful changes are made to the input variables (x) so that we may observe and identify the reasons for change

More information

Example: Poisondata. 22s:152 Applied Linear Regression. Chapter 8: ANOVA

Example: Poisondata. 22s:152 Applied Linear Regression. Chapter 8: ANOVA s:5 Applied Linear Regression Chapter 8: ANOVA Two-way ANOVA Used to compare populations means when the populations are classified by two factors (or categorical variables) For example sex and occupation

More information

Two-Way Factorial Designs

Two-Way Factorial Designs 81-86 Two-Way Factorial Designs Yibi Huang 81-86 Two-Way Factorial Designs Chapter 8A - 1 Problem 81 Sprouting Barley (p166 in Oehlert) Brewer s malt is produced from germinating barley, so brewers like

More information

Lecture 4 Multiple linear regression

Lecture 4 Multiple linear regression Lecture 4 Multiple linear regression BIOST 515 January 15, 2004 Outline 1 Motivation for the multiple regression model Multiple regression in matrix notation Least squares estimation of model parameters

More information

Introduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes)

Introduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes) Introduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes) Asheber Abebe Discrete and Statistical Sciences Auburn University Contents 1 Completely Randomized Design

More information

Incomplete Block Designs

Incomplete Block Designs Incomplete Block Designs Recall: in randomized complete block design, each of a treatments was used once within each of b blocks. In some situations, it will not be possible to use each of a treatments

More information

CAS MA575 Linear Models

CAS MA575 Linear Models CAS MA575 Linear Models Boston University, Fall 2013 Midterm Exam (Correction) Instructor: Cedric Ginestet Date: 22 Oct 2013. Maximal Score: 200pts. Please Note: You will only be graded on work and answers

More information

VIII. ANCOVA. A. Introduction

VIII. ANCOVA. A. Introduction VIII. ANCOVA A. Introduction In most experiments and observational studies, additional information on each experimental unit is available, information besides the factors under direct control or of interest.

More information

Analysis of Variance. Read Chapter 14 and Sections to review one-way ANOVA.

Analysis of Variance. Read Chapter 14 and Sections to review one-way ANOVA. Analysis of Variance Read Chapter 14 and Sections 15.1-15.2 to review one-way ANOVA. Design of an experiment the process of planning an experiment to insure that an appropriate analysis is possible. Some

More information

Lecture 6: Linear models and Gauss-Markov theorem

Lecture 6: Linear models and Gauss-Markov theorem Lecture 6: Linear models and Gauss-Markov theorem Linear model setting Results in simple linear regression can be extended to the following general linear model with independently observed response variables

More information

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as ST 51, Summer, Dr. Jason A. Osborne Homework assignment # - Solutions 1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available

More information

Page: 3, Line: 24 ; Replace 13 Building... with 13 More on Multiple Regression. Page: 83, Line: 9 ; Replace The ith ordered... with The jth ordered...

Page: 3, Line: 24 ; Replace 13 Building... with 13 More on Multiple Regression. Page: 83, Line: 9 ; Replace The ith ordered... with The jth ordered... ERRATA FOR THE OTT/LONGNECKER BOOK AN INTRODUCTION TO STATISTICAL METHODS AND DATA ANALYSIS (Some of these typos have been corrected in later printings) Page: xiv, Line: 15 ; Replace these with the Page:

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html 1 / 42 Passenger car mileage Consider the carmpg dataset taken from

More information

The Pennsylvania State University The Graduate School Eberly College of Science INTRABLOCK, INTERBLOCK AND COMBINED ESTIMATES

The Pennsylvania State University The Graduate School Eberly College of Science INTRABLOCK, INTERBLOCK AND COMBINED ESTIMATES The Pennsylvania State University The Graduate School Eberly College of Science INTRABLOCK, INTERBLOCK AND COMBINED ESTIMATES IN INCOMPLETE BLOCK DESIGNS: A NUMERICAL STUDY A Thesis in Statistics by Yasin

More information

Swarthmore Honors Exam 2012: Statistics

Swarthmore Honors Exam 2012: Statistics Swarthmore Honors Exam 2012: Statistics 1 Swarthmore Honors Exam 2012: Statistics John W. Emerson, Yale University NAME: Instructions: This is a closed-book three-hour exam having six questions. You may

More information

PROBLEM TWO (ALKALOID CONCENTRATIONS IN TEA) 1. Statistical Design

PROBLEM TWO (ALKALOID CONCENTRATIONS IN TEA) 1. Statistical Design PROBLEM TWO (ALKALOID CONCENTRATIONS IN TEA) 1. Statistical Design The purpose of this experiment was to determine differences in alkaloid concentration of tea leaves, based on herb variety (Factor A)

More information

What If There Are More Than. Two Factor Levels?

What If There Are More Than. Two Factor Levels? What If There Are More Than Chapter 3 Two Factor Levels? Comparing more that two factor levels the analysis of variance ANOVA decomposition of total variability Statistical testing & analysis Checking

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

[y i α βx i ] 2 (2) Q = i=1

[y i α βx i ] 2 (2) Q = i=1 Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation

More information

Single Factor Experiments

Single Factor Experiments Single Factor Experiments Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 4 1 Analysis of Variance Suppose you are interested in comparing either a different treatments a levels

More information

Introduction and Background to Multilevel Analysis

Introduction and Background to Multilevel Analysis Introduction and Background to Multilevel Analysis Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning Background and

More information

with the usual assumptions about the error term. The two values of X 1 X 2 0 1

with the usual assumptions about the error term. The two values of X 1 X 2 0 1 Sample questions 1. A researcher is investigating the effects of two factors, X 1 and X 2, each at 2 levels, on a response variable Y. A balanced two-factor factorial design is used with 1 replicate. The

More information

Multiple Regression: Example

Multiple Regression: Example Multiple Regression: Example Cobb-Douglas Production Function The Cobb-Douglas production function for observed economic data i = 1,..., n may be expressed as where O i is output l i is labour input c

More information

Analysis of Variance (ANOVA) Cancer Research UK 10 th of May 2018 D.-L. Couturier / R. Nicholls / M. Fernandes

Analysis of Variance (ANOVA) Cancer Research UK 10 th of May 2018 D.-L. Couturier / R. Nicholls / M. Fernandes Analysis of Variance (ANOVA) Cancer Research UK 10 th of May 2018 D.-L. Couturier / R. Nicholls / M. Fernandes 2 Quick review: Normal distribution Y N(µ, σ 2 ), f Y (y) = 1 2πσ 2 (y µ)2 e 2σ 2 E[Y ] =

More information