Stat 601 The Design of Experiments

Similar documents
Stat 217 Final Exam. Name: May 1, 2002

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

Lectures on Simple Linear Regression Stat 431, Summer 2012

Week 14 Comparing k(> 2) Populations

STAT 5200 Handout #7a Contrasts & Post hoc Means Comparisons (Ch. 4-5)

W&M CSCI 688: Design of Experiments Homework 2. Megan Rose Bryant

Confidence Intervals, Testing and ANOVA Summary

Unit 7: Random Effects, Subsampling, Nested and Crossed Factor Designs

Chapter 20 : Two factor studies one case per treatment Chapter 21: Randomized complete block designs

Two-Way Analysis of Variance - no interaction

Formal Statement of Simple Linear Regression Model

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

Correlation Analysis

Unit 6: Orthogonal Designs Theory, Randomized Complete Block Designs, and Latin Squares

Lecture 5: Comparing Treatment Means Montgomery: Section 3-5

One-way ANOVA (Single-Factor CRD)

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression

STAT Final Practice Problems

Chapter 11 - Lecture 1 Single Factor ANOVA

Ch 2: Simple Linear Regression

We like to capture and represent the relationship between a set of possible causes and their response, by using a statistical predictive model.

STAT 525 Fall Final exam. Tuesday December 14, 2010

More about Single Factor Experiments

K. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij =

Lec 1: An Introduction to ANOVA

Inferences for Regression

STAT 540: Data Analysis and Regression

Lec 5: Factorial Experiment

Factorial designs. Experiments

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College

Stat 710: Mathematical Statistics Lecture 41

PART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable,

Lecture 15 Topic 11: Unbalanced Designs (missing data)

4.1. Introduction: Comparing Means

Inference for Regression Simple Linear Regression

Multiple comparisons - subsequent inferences for two-way ANOVA

Stat 705: Completely randomized and complete block designs

Multiple comparisons The problem with the one-pair-at-a-time approach is its error rate.

Figure 1: The fitted line using the shipment route-number of ampules data. STAT5044: Regression and ANOVA The Solution of Homework #2 Inyoung Kim

This document contains 3 sets of practice problems.

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

AMS 315/576 Lecture Notes. Chapter 11. Simple Linear Regression

Ch 3: Multiple Linear Regression

PROBLEM TWO (ALKALOID CONCENTRATIONS IN TEA) 1. Statistical Design

STAT 506: Randomized complete block designs

Statistics For Economics & Business

Ch. 1: Data and Distributions

I i=1 1 I(J 1) j=1 (Y ij Ȳi ) 2. j=1 (Y j Ȳ )2 ] = 2n( is the two-sample t-test statistic.

Basic Business Statistics 6 th Edition

ANOVA: Comparing More Than Two Means

Homework 2: Simple Linear Regression

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

[4+3+3] Q 1. (a) Describe the normal regression model through origin. Show that the least square estimator of the regression parameter is given by

. Example: For 3 factors, sse = (y ijkt. " y ijk

Tentamentsskrivning: Statistisk slutledning 1. Tentamentsskrivning i Statistisk slutledning MVE155/MSG200, 7.5 hp.

STAT 705 Chapter 19: Two-way ANOVA

STAT22200 Spring 2014 Chapter 8A

Introduction to Design and Analysis of Experiments with the SAS System (Stat 7010 Lecture Notes)

STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis

STAT 705 Chapter 19: Two-way ANOVA

Section 4.6 Simple Linear Regression

Outline. Remedial Measures) Extra Sums of Squares Standardized Version of the Multiple Regression Model

ST430 Exam 2 Solutions

ANOVA (Analysis of Variance) output RLS 11/20/2016

Stat 579: Generalized Linear Models and Extensions

Linear models and their mathematical foundations: Simple linear regression

Chapter 12. Analysis of variance

STAT 511. Lecture : Simple linear regression Devore: Section Prof. Michael Levine. December 3, Levine STAT 511

Problems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B

ANOVA Multiple Comparisons

Outline Topic 21 - Two Factor ANOVA

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

Correlation and Regression

Chapter 5 Introduction to Factorial Designs Solutions

STATS Analysis of variance: ANOVA

Stat 601 The Design of Experiments

Content by Week Week of October 14 27

The Random Effects Model Introduction

In ANOVA the response variable is numerical and the explanatory variables are categorical.

Homework 3 - Solution

Design and Analysis of Experiments Prof. Jhareswar Maiti Department of Industrial and Systems Engineering Indian Institute of Technology, Kharagpur

EXAM IN TMA4255 EXPERIMENTAL DESIGN AND APPLIED STATISTICAL METHODS

1 One-way Analysis of Variance

Sampling Distributions: Central Limit Theorem

Statistics GIDP Ph.D. Qualifying Exam Methodology

ST 305: Final Exam ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) ( ) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ Y. σ X. σ n.

Introduction. Chapter 8

Chapter 12 - Lecture 2 Inferences about regression coefficient

STAT 360-Linear Models

Residual Analysis for two-way ANOVA The twoway model with K replicates, including interaction,

Inference for the Regression Coefficient

Chapter 7, continued: MANOVA

Ph.D. Preliminary Examination Statistics June 2, 2014

MATH Notebook 3 Spring 2018

3. Design Experiments and Variance Analysis

STAT Chapter 11: Regression

EX1. One way ANOVA: miles versus Plug. a) What are the hypotheses to be tested? b) What are df 1 and df 2? Verify by hand. , y 3

Econometrics. 4) Statistical inference

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College

This gives us an upper and lower bound that capture our population mean.

Transcription:

Stat 601 The Design of Experiments Yuqing Xu Department of Statistics University of Wisconsin Madison, WI 53706, USA October 28, 2016 Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 1 / 16

Logistics Exam on next Wed. Bring calculator and tables (t, F and Studentized Range). Office Hours adjustment: (One time adjustment) Mon: 12:15-1:15pm, Wed 9:00-10:00am, @ MSC 6740 MSC 6740: Sixth floor, on the East side of MSC, take lift No.13 Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 2 / 16

RCBD with Random Block Effect Statistic Models for RCBD with Random Effect Y ij = µ + b i + t j + ɛ ij where i = 1,..., b, j = 1,..., t, ɛ ij N(0, σ 2 ) and b i N(0, σ 2 β ). Why Random When we are trying to learn about variation in the treatment effects: we want to design an experiment and look at the variability that arises in sample. Random effects models are also used in subsampling situations. Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 3 / 16

ANOVA RCBD with Fixed Effect Source DF SS MS E[MS] b Block b-1 i=1 t(y i. y.. ) 2 SSB (b 1) σ 2 + t t Treatment t-1 j=1 b(y.j y.. ) 2 SSTr t 1 σ 2 + b Error (b-1)(t-1) i j (y ij y i. y.j + y.. ) 2 SSE df σ 2 Total n-1 S total = i j (y ij y.. ) 2 i b2 i b 1 j t2 j t 1 RCBD with Random Effect Source DF SS MS E[MS] b Block b-1 i=1 t(y i. y.. ) 2 SSB (b 1) σ 2 + tσb 2 t Treatment t-1 j=1 b(y.j y.. ) 2 SSTr t 1 σ 2 + b Error (b-1)(t-1) i j (y ij y i. y.j + y.. ) 2 SSE df σ 2 Total n-1 S total = i j (y ij y.. ) 2 j t2 j t 1 Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 4 / 16

RCBD with Random Block Effect Final Question Y ij = µ + b i + t j + ɛ ij where i = 1,..., b, j = 1,..., t, ɛ ij N(0, σ 2 ) and b i N(0, σβ 2). How to estimate σ 2? How about σβ 2? Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 5 / 16

Model Diagnostics This is far too rough but important 1 What if independence is not satisfied? 2 What if homoscedasticity or normality is not satisfied? 3 What if you have CRD but fitted RCBD? or you have RCBD but fitted CRD? 4 What kind of transformation should we use? 5 After transformation, what value is the most important one? 3 You will lose power of the test. 4 Monotone function. 5 Median. Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 6 / 16

Model Diagnostics This is far too rough but important Median/quantiles will not be changed by monotone transformation Suppose we did transformation: ỹ jk = log y jk Then median( µ j ) = log[median(µ j )] CI for median( µ j ) = log[ci for median(µ j )] This is not always true for mean. Think about Jensen s inequality: E(logy) log(ey) Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 7 / 16

Review of Selected Topics in CRD Example 1 Four different brands of pillows were tested by consumers. The rates on a scale of 1(inferior) to 7(superior) is summarized below: Pillow brand Rating A 1,3,5,7,2,3,4 B 7,6,7,7,6 C 1,2,3,2,3,2,1 D 4,3,4,1,5 Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 8 / 16

Example 1 Solution: Source DF SS MSS F Between Tr t-1 S T = t j=1 n j(y j. y.. ) 2 S T t 1 Within Tr n-t S E = j k (y jk y j. ) 2 S E n t Total n-1 S total = j k (y jk y.. ) 2 S T /(t 1) S E /(n t) Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 9 / 16

Example 1 Questions (b) Based on this study, manufacturers of pillow B want to claim their pillow is significantly better than brand C. Test this contrast. (c)how large must the difference δ = ȳ B ȳ C be in order to declare significant at the 0.05 level by Scheffe s method? Is the observed difference significant? (d)how large must the difference δ = ȳ B ȳ C be in order to declare significant at the 0.05 level by Tukey s method? Is the observed difference significant? (e)how large must the difference δ = ȳ B ȳ C be in order to declare significant at the 0.05 level by Bonferroni s method? Is the observed difference significant? (consider m = ( 4 2), FWER=0.05) Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 10 / 16

Example 1 (b) Solution: Hypothesis: H 0 : µ 2 = µ 3, H 1 : µ 2 > µ 3 Contrast: C = (0, 1, 1, 0) t-statistic: (c)(d)(e) Solution: Sheffe: ( µ ˆ B µ ˆ C ) > j c j ˆµ j j c jµ j ˆσ2 j c2 j /n j Tukey: ( µ ˆ B µ ˆ C ) > q t,n t (α)ˆσ Bonferroni: t n t (t 1)F t 1,n t (α) ˆσ 2 (n 1 B (n 1 B + n 1 C )/2 ( µ ˆ B µ ˆ C ) > ˆσ2 (n 1 B + n 1 C ) + n 1 C )t n t( α 2m ) Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 11 / 16

Example 2 Example 2 We are planning an experiment on the quality of video tape and have purchased 24 tapes, four tapes from each of six types. The six types of tape were: 1) brand A high cost, 2) brand A low cost, 3) brand B high cost, 4) brand B low cost, 5) brand C high cost, 6) brand C low cost. Each tape will be recorded with a series of standard test patterns, replayed 10 times, and then replayed an eleventh time into a device that measures the distortion on the tape. The distortion measure is the response, and the tapes will be recorded and replayed in random order. Previous similar tests had an error variance of about.25. Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 12 / 16

Example 2 Question How large should the sample size have been to have a 95% brand A versus brand B confidence interval of no wider than 2? Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 13 / 16

Example 3 HW6 Q1 A chemist wishes to test the effect of four chemical agents on the strength of a particular type of cloth. Because there might be variability from one bolt to another, the chemist decides to use a randomized block design, with the bolts of cloth considered as blocks. She selects five bolts and applies all four chemicals in random order to each bolt. The resulting tensile strengths are given below. Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 14 / 16

Example 3 Questions (a) Analyze the data and state your conclusions. (b) Suppose that the observations for chemical type 2 and bolt 3 and chemical type 4 and bolt 4 are missing. i. Estimate the missing values by differentiating the residual sum of squares with respect to them, equating the results to zero, and solving the equations. Analyze the data using these two estimates of the missing values. ii. Another way to estimate the missing values is to set their residuals to zero. Does this method give the same estimates as the previous one? Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 15 / 16

Example 3 HW3 Q1 (b): SSE = RSS = i Let y 32 = x 1, y 44 = x 2. Then (y ij y i. y.j + y.. ) 2 j y 3. = (74 + x 1 + 78 + 75)/4 y.2 = (73 + 67 + x 1 + 72 + 70)/5 y 4. = (71 + 72 + 73 + x 2 )/4 y.4 = (73 + 71 + 75 + x 2 + 69)/5 y.. = (... + x 1 +... + x 2 +...)/20 i. Find solutions to SSE x 1 = 0, SSE x 2 = 0. ii. Find x 1, x 2 such that y 32 y 3. y.2 + y.. = 0 and y 44 y 4. y.4 + y.. = 0 Yuqing Xu (UW-Madison) Stat 601 Week 8 October 28, 2016 16 / 16