STAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons:

Similar documents
Chapter 12 - Lecture 2 Inferences about regression coefficient

Mathematical statistics

Multiple comparisons - subsequent inferences for two-way ANOVA

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs)

Lec 1: An Introduction to ANOVA

STAT 5200 Handout #7a Contrasts & Post hoc Means Comparisons (Ch. 4-5)

Statistiek II. John Nerbonne using reworkings by Hartmut Fitz and Wilbert Heeringa. February 13, Dept of Information Science

Review of Statistics 101

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

Ch 2: Simple Linear Regression

Inferences for Regression

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Solutions to Final STAT 421, Fall 2008

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

Correlation Analysis

STAT 525 Fall Final exam. Tuesday December 14, 2010

Lecture 4 January 23

Design of Experiments. Factorial experiments require a lot of resources

Multiple t Tests. Introduction to Analysis of Variance. Experiments with More than 2 Conditions

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /8/2016 1/38

CHL 5225H Advanced Statistical Methods for Clinical Trials: Multiplicity

Introduction to Business Statistics QM 220 Chapter 12

Analysis of Variance II Bios 662

Stat 579: Generalized Linear Models and Extensions

ANOVA Analysis of Variance

Chapter 12. Analysis of variance

Econometrics. 4) Statistical inference

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

Specific Differences. Lukas Meier, Seminar für Statistik

UNIVERSITY OF MASSACHUSETTS. Department of Mathematics and Statistics. Basic Exam - Applied Statistics. Tuesday, January 17, 2017

Lecture 6 Multiple Linear Regression, cont.

PSYC 331 STATISTICS FOR PSYCHOLOGISTS

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.


Stat 135 Fall 2013 FINAL EXAM December 18, 2013

Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs

2 Hand-out 2. Dr. M. P. M. M. M c Loughlin Revised 2018

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Master s Written Examination

STAT 461/561- Assignments, Year 2015

Garvan Ins)tute Biosta)s)cal Workshop 16/7/2015. Tuan V. Nguyen. Garvan Ins)tute of Medical Research Sydney, Australia

Two-Way ANOVA. Chapter 15

Association studies and regression

Analysis of variance

Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test

Analysis of Variance

Linear Combinations. Comparison of treatment means. Bruce A Craig. Department of Statistics Purdue University. STAT 514 Topic 6 1

22s:152 Applied Linear Regression. Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA)

Week 14 Comparing k(> 2) Populations

DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective

I i=1 1 I(J 1) j=1 (Y ij Ȳi ) 2. j=1 (Y j Ȳ )2 ] = 2n( is the two-sample t-test statistic.

Central Limit Theorem ( 5.3)

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

Analysis of variance (ANOVA) Comparing the means of more than two groups

STAT763: Applied Regression Analysis. Multiple linear regression. 4.4 Hypothesis testing

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College

20.1. Balanced One-Way Classification Cell means parametrization: ε 1. ε I. + ˆɛ 2 ij =

Basic Business Statistics 6 th Edition

Chapter 11 - Lecture 1 Single Factor ANOVA

Lecture 5: ANOVA and Correlation

Analysis of Variance (ANOVA)

Chapter Seven: Multi-Sample Methods 1/52

More about Single Factor Experiments

Notes on Maxwell & Delaney

Outline. Topic 19 - Inference. The Cell Means Model. Estimates. Inference for Means Differences in cell means Contrasts. STAT Fall 2013

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Independent Samples ANOVA

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr.

Multiple Comparison Procedures Cohen Chapter 13. For EDUC/PSY 6600

Applying the Benjamini Hochberg procedure to a set of generalized p-values

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

Comparisons among means (or, the analysis of factor effects)

Multiple Testing. Tim Hanson. January, Modified from originals by Gary W. Oehlert. Department of Statistics University of South Carolina

Statistics for Managers using Microsoft Excel 6 th Edition

Group comparison test for independent samples

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES

Sociology 6Z03 Review II

assumes a linear relationship between mean of Y and the X s with additive normal errors the errors are assumed to be a sample from N(0, σ 2 )

Lecture 14: ANOVA and the F-test

Step-down FDR Procedures for Large Numbers of Hypotheses

The scatterplot is the basic tool for graphically displaying bivariate quantitative data.

Analysis of Variance (ANOVA)

STAT 540: Data Analysis and Regression

Stat 710: Mathematical Statistics Lecture 41

STATISTICS 141 Final Review

Analysis of Variance Bios 662

Chapter 8 Student Lecture Notes 8-1. Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

Looking at the Other Side of Bonferroni

STA2601. Tutorial letter 203/2/2017. Applied Statistics II. Semester 2. Department of Statistics STA2601/203/2/2017. Solutions to Assignment 03

Lecture 15. Hypothesis testing in the linear model

One-way ANOVA (Single-Factor CRD)

Analysis of Variance: Repeated measures

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

Statistics 210 Statistical Methods. Hal S. Stern Department of Statistics University of California, Irvine

High-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018

Factorial designs. Experiments

Transcription:

STAT 263/363: Experimental Design Winter 206/7 Lecture January 9 Lecturer: Minyong Lee Scribe: Zachary del Rosario. Design of Experiments Why perform Design of Experiments (DOE)? There are at least two reasons:. To do more efficient data gathering 2. To infer causality By efficiency, we mean reducing the variance of some estimate V ( ˆβ) (say from regression) for some fixed number of samples n. For inferring causality, note that a regression by itself is passive it only allows us to learn some correlation. The best way to get at causality in a reliable way is to design an experiment. Definition. An experiment is a purposeful setting of input variables with observations of corresponding output variable(s). This may include (purposeful) randomization of the input variables. Definition 2. A controlled experiment is an experiment which changes only one variable, in order to isolate the result. Note that this also includes (purposeful) randomization. Example: (Ch. of Modern Experimental Design) A math teacher has many students in their class (40), and is considering a new approach for teaching math concepts. In order to test this new technique, they are considering splitting the class to compare the old and new approaches. Suppose there are an equal number of male and female students, and the teacher has settled on performing an even split of 20 students under the old approach, and 20 under the new method. There are at least 3 ways to perform this split:. Split by gender (20 boys and 20 girls) 2. Split randomly 3. Block the students by gender, then randomly assign Approach () is a terrible idea, as it introduces a confounding factor (aka lurking variable); the performance difference between the groups could be attributed to either the teaching method, or the gender of the students. Approach (2) is better, but may still suffer from the confound if the gender ratios are not equal. Approach (3) guarantees an equal ratio in both groups; it is the best of these options. There are some cases where regression without a designed experiment can still infer causality. This is not the case for arbitrary experiments, though. -

.2 Paired Data Example: (Shoe sole robustness) A group of experimenters want to quantify the robustness of two different types of shoe soles; A and B versions. To this end, they study 0 kids, and assign them specially made shoes with a different sole for each foot. They randomly assign subjects to have an A left sole and B right sole or vise-versa, and measure the wear on each sole after a set period of time. The results are shown in Figure. Figure.. Example paired data; on average A is more robust than B. This is most clear from the paired data, but less clear from the estimated densities, which hide the pairwise nature of the data. Note that from considering the pairwise differences, it is clear that A is more robust than B. This is less clear from the distributions, where we ignore the pairwise nature of the data. This suggests that a pairwise test will have more power, an idea which can be made rigorous by comparing the two statistical tests..2. Unpaired t-test Given two random variables X N(µ X, σ 2 ) and Y N(µ Y, σ 2 ), suppose we have measurements x,..., x m and y,..., y n. We define a t-statistic t = where x = m x i/m, ȳ = n y i/n, and s 2 = have t t n+m 2 with a null hypothesis of x = ȳ. x ȳ s /m + /n, (.) n (x i x) 2 + m (y i ȳ) 2 m+n 2. In this case, we -2

.2.2 Paired t-test Given two random variables X, Y whose difference D = X Y is normally distributed D N(, σ 2 ), suppose we have data pairs (x i, y i ) for i =,..., n, and define d i = x i y i. We define a t-statistic t = d s D, (.2) where d = n d i/n and s 2 D = n (d i d) 2 /(n ). In this case, we have t t n with a null hypothesis of d = 0. Comparing Equations. and.2, we see that their numerators are identical, while their denominators (variance estimates) are not. Since we reject the null hypothesis when t is large, the smaller the variance estimate the easier it is to reject (a large variance suppresses the numerator). Note that in the unpaired case, we add together the contributions from X and Y, while in the paired case we consider only the differences; thus s D s. 2.3 Normal Theory via One-way ANOVA Suppose we have Y ij (µ i, σ 2 ) for i =,..., K; j =,..., n i, with N = K n i. We use the model Y ij = µ + α i + ɛ ij, where the effects α i sum to zero; i.e. k α i = 0. We may test the null hypothesis µ i = = µ k by using the 2 log likelihood 3 σ 2 k n i (Ȳi Ȳ ) 2, (.3) j= which is asymptotically (n ) distributed as a χ 2 variable. However, we may apply a different test for finite n this leads to the ANOVA F-test..3. One-way ANOVA Identity and Test With the model above, we have (Y ij Ȳ ) 2 = j= (Y ij Ȳi ) 2 + (Ȳi Ȳ ) 2, (.4) j= j= SS total = SS error + SS treatment. 4 To test the null hypothesis defined above, we define the following F-statistic F = K SS treatment N K SS error, (.5) 2 Note that this argument breaks down if X, Y are independent. 3 What does this terminology mean? 4 Note that SS error is sometimes called SS within, and SS treatment is aka SS between. -3

which is distributed according to F F K,N K. Under an alternative hypothesis H A with fixed α i, one can show that E(SS treatment ) = = K n i E((Ȳi Ȳ ) 2 ), K n i αi 2 + (K )σ 2, E(SS error ) = (N K)σ 2, (.6) and our F statistic defined above is distributed according to a non-central F distribution F F K,N K ( σ 2 K n iα 2 i )..3.2 ANOVA Table When computing an ANOVA via standard statistical software (e.g. R), a table similar to the following will be produced. Table.3.2 labels the entries of such a table. Source df SS M S F Treatment K SS treatment SS treatment /(K ) MS treatment /MS error Error N K SS error SS error /(N K) Total N SS total.4 Post-hoc Analysis of ANOVA Results An ANOVA can only detect if a difference between treatments exists; further/different analysis is required to determine which treatments differ. There are various methods to test for these differences. 5.4. Pairwise t-tests If there are k treatments, then there are ( K 2 ) pairwise t-tests to perform; with null hypotheses α i α j = 0. However, a blind testing of each possible pairwise test will yield a large number of false discoveries; supposing a type error rate of α, we expect to see α ( K 2 ) false positives..4.2 Bonferroni To solve the false discovery issue of pairwise testing, the Bonferroni correction enforces a significance level of α = α/ ( ) K 2, where α is the overall desired significance level. This is a (very) conservative approach. 5 Note that here we are performing sequential tests; i.e. we choose to perform further statistical tests based on the results of an initial test. It can be difficult to analyze the statistical effects of this sequential procedure. -4

.4.3 Scheffé The Scheffe test (apparently?) builds simultaneous confidence intervals for contrasts. It is intended to solve the pairwise issue noted above..4.4 Benjamini-Hochberg Procedure The Benjamini-Hochberg procedure is intended to control the false discovery rate (FDR) at some specified level. This is in contrast with the Bonferroni approach, which controls the probability of making at least one false discovery. Thus, this is a less conservative approach than Bonferroni. -5