IEOR165 Discussion Week 12

Similar documents
ANOVA: Comparing More Than Two Means

Chapter Seven: Multi-Sample Methods 1/52

Sociology 6Z03 Review II

1 DV is normally distributed in the population for each level of the within-subjects factor 2 The population variances of the difference scores

Battery Life. Factory

Average weight of Eisenhower dollar: 23 grams

Statistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts

Lec 1: An Introduction to ANOVA

Analysis of Variance

Inferences for Regression

Example: Four levels of herbicide strength in an experiment on dry weight of treated plants.

Specific Differences. Lukas Meier, Seminar für Statistik

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

One-way ANOVA. Experimental Design. One-way ANOVA

Lecture 6 April

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

Week 5 Video 1 Relationship Mining Correlation Mining

Contrasts and Multiple Comparisons Supplement for Pages

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

16.3 One-Way ANOVA: The Procedure

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis

Analysis of Variance

STAT22200 Spring 2014 Chapter 5

Multiple t Tests. Introduction to Analysis of Variance. Experiments with More than 2 Conditions

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

STAT 5200 Handout #7a Contrasts & Post hoc Means Comparisons (Ch. 4-5)

Statistical Applications in Genetics and Molecular Biology

Hypothesis testing: Steps

Analysis of Variance: Part 1

ANOVA Analysis of Variance

An inferential procedure to use sample data to understand a population Procedures

High-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018

Hypothesis testing: Steps

Multiple comparisons - subsequent inferences for two-way ANOVA

Performance Evaluation and Comparison

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.

COMPARING SEVERAL MEANS: ANOVA

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs)

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

Multiple testing: Intro & FWER 1


Chapter 7 Comparison of two independent samples

Chapter 10: Analysis of variance (ANOVA)

One-way Analysis of Variance. Major Points. T-test. Ψ320 Ainsworth

Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs

Linear Combinations. Comparison of treatment means. Bruce A Craig. Department of Statistics Purdue University. STAT 514 Topic 6 1

(Foundation of Medical Statistics)

IMPROVING TWO RESULTS IN MULTIPLE TESTING

Visual interpretation with normal approximation

Testing a secondary endpoint after a group sequential test. Chris Jennison. 9th Annual Adaptive Designs in Clinical Trials

Announcements. Unit 4: Inference for numerical variables Lecture 4: ANOVA. Data. Statistics 104

Table 1: Fish Biomass data set on 26 streams

Econometrics. 4) Statistical inference

1. What does the alternate hypothesis ask for a one-way between-subjects analysis of variance?

Analysis Of Variance Compiled by T.O. Antwi-Asare, U.G

Introduction to Statistical Hypothesis Testing

Analysis of variance (ANOVA) ANOVA. Null hypothesis for simple ANOVA. H 0 : Variance among groups = 0

IEOR165 Discussion Week 5

Partitioning the Parameter Space. Topic 18 Composite Hypotheses

Introduction to the Analysis of Variance (ANOVA)

Ch 2: Simple Linear Regression

Tests about a population mean

Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. T=number of type 2 errors

Comparing Several Means

Week 14 Comparing k(> 2) Populations

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

Outline. Topic 19 - Inference. The Cell Means Model. Estimates. Inference for Means Differences in cell means Contrasts. STAT Fall 2013

HYPOTHESIS TESTING. Hypothesis Testing

9 One-Way Analysis of Variance

Sample Size and Power Calculation in Microarray Studies Using the sizepower package.

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

CHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains:

Lecture 21: October 19

High-throughput Testing

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

Introduction to Business Statistics QM 220 Chapter 12

Note: k = the # of conditions n = # of data points in a condition N = total # of data points

Multiple Testing. Gary W. Oehlert. January 28, School of Statistics University of Minnesota

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Multiple Testing. Hoang Tran. Department of Statistics, Florida State University

Week 12 Hypothesis Testing, Part II Comparing Two Populations

Power Analysis. Introduction to Power

Summary and discussion of: Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

Linear Regression for Air Pollution Data

Statistiek II. John Nerbonne using reworkings by Hartmut Fitz and Wilbert Heeringa. February 13, Dept of Information Science

Tukey Complete Pairwise Post-Hoc Comparison

Difference in two or more average scores in different groups

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

LECTURE 5. Introduction to Econometrics. Hypothesis testing

Power & Sample Size Calculation

Chapter 10. Design of Experiments and Analysis of Variance

ANOVA Multiple Comparisons

20.0 Experimental Design

ON STEPWISE CONTROL OF THE GENERALIZED FAMILYWISE ERROR RATE. By Wenge Guo and M. Bhaskara Rao

The t-statistic. Student s t Test

One-Way Analysis of Variance (ANOVA) Paul K. Strode, Ph.D.

Unit 27 One-Way Analysis of Variance

Transcription:

IEOR165 Discussion Week 12 Sheng Liu University of California, Berkeley Apr 15, 2016

Outline 1 Type I errors & Type II errors 2 Multiple Testing 3 ANOVA IEOR165 Discussion Sheng Liu 2

Type I errors & Type II errors Table: Error types in hypothesis testing Null Hypothesis Decision TRUE FALSE Reject Type I Correct Fail to reject Correct Type II Type I error Reject H 0 when it is true, also called false positive P (Type I error) = P (reject H 0 H 0 is true) = α Type II error Fail to reject H 0 when it is false, also called false negative P (Type II error) = P (fail to reject H 0 H 0 is false) = β IEOR165 Discussion Sheng Liu 3

Tradeoff We want to perform a test that yields low errors: both type I and type II errors However, choosing a lower level of type I level will increase the chance of type II error. Intuitively, if it is hard to reject a true null hypothesis (low type I error), it is also very likely that a false null hypothesis is not rejected (high type II error) When fixing the significance level, are we restricting the probability of type I error or Type II error? Only way to reduce both type I and type II errors: increase the sample size IEOR165 Discussion Sheng Liu 4

Probability of Type I and Type II errors IEOR165 Discussion Sheng Liu 5

Tradeoff Reference: James B. Ramsey, 1998 http: //www.econ.nyu.edu/user/ramseyj/textbook/chapter11.pdf IEOR165 Discussion Sheng Liu 6

Familywise Error Rate (FWER) Let fix the significance level as α = 0.05 When doing a single hypothesis testing, the probability of not making a Type I error is 1 α = 1 0.05 = 0.95 When doing k hypothesis testings at the same time, the probability of not making a type I error on all k tests is (1 α) k = 0.95 k 0.95 4 = 0.814, 0.95 6 = 0.735 You need to improve on this The probability of making one or more Type I errors on the family of tests (familywise error rate), with the assumption of independence F W ER = 1 (1 α) k IEOR165 Discussion Sheng Liu 7

Bonferroni Correction We want to bound our familywise error rate by α F (differentiate it with error rate per test α T ), the key question is, if we set α F = 0.05, what should be our α T? Directly from the equation (also called Šidàk equation) α T = 1 (1 α F ) 1/k Bonferroni approximation (Taylor expansion) α F = 0.05 and k = 4: α T α F k α T = 1 (1 0.05) 1/4 = 0.0127 α T = α F k = 0.05 = 0.0125 4 IEOR165 Discussion Sheng Liu 8

From Bonferroni to Holm-Bonferroni Compare p i with α F /k, which is equal to comparing k p i with α F Bonferroni correction actually drops the independence assumption and thus is very general but conservative. Requiring each test at the significance level of α/k is too strict and may lead to higher type II errors A more complicated but more powerful test is Holm-Bonferroni test: 1 Sort p-values from the smallest to the largest. 2 Compare p (1) with α/k, if p (1) > α/k, accept all null hypothesis. 3 Otherwise we continue to compare p (i) and α/(k i + 1): if p (i) > α/(k i + 1), we stop; otherwise we reject H (i) and move on. Intuitively, you can see why this method is less conservative IEOR165 Discussion Sheng Liu 9

Example We want to test three hypothesis at the same time, their p-values are 0.0004, 0.0201 and 0.0612. Use both the Bonferroni correction and Holm-Bonferroni method. IEOR165 Discussion Sheng Liu 10

Basics We have k groups of data and each group j has n j observations. Let X j be the sample average in group j and X be the sample average of all observations. The total sum of squares SS T = n k j (X j i X)2 j=1 i=1 The between group sum of squares (between groups variation) SS B = k n j (X j X) 2 j=1 The within groups sum of squares (within groups variation) SS W = n k j (X j i Xj ) 2 j=1 i=1 IEOR165 Discussion Sheng Liu 11

Basics We can show that SS T = SS B + SS W And similar equation applies to degrees of freedom df(ss T ) = df(ss B ) + df(ss W ) N 1 = k 1 + N k Now assume these groups have the same variance σ 2 (unknown), we have E(SS W /(N k)) = σ 2 If group means are the same, we have E(SS B /(k 1)) = σ 2 IEOR165 Discussion Sheng Liu 12

ANOVA H 0 : µ 1 = µ 2 = = µ k (σ 2 1 = σ 2 2 = = σ 2 k) Under the null hypothesis, we have SS B /(k 1) SS W /(N k) = MSG F (k 1, N k) MSE When the null hypothesis is true, this ratio should be close to one. In the contrary, if their means are different (there is a treatment effect), this ratio should be greater than 1: p value = P (F > SS B/(k 1) SS W /(N k) ) We only do one-tailed test here, why? IEOR165 Discussion Sheng Liu 13

Example Low Calorie Low Fat Low Carbohydrate 8 2 5 9 4 7 4 3 3 Calculate SS T, SS B, SS W, MSG, MSE and test whether the means of these three groups are the same or not (assume normal population and the same variance across groups) IEOR165 Discussion Sheng Liu 14

References ANOVA: http://www.lboro.ac.uk/media/wwwlboroacuk/content/ mlsc/downloads/1.5_onewayanova.pdf http://sphweb.bumc.bu.edu/otlt/mph-modules/bs/bs704_ HypothesisTesting-ANOVA/BS704_HypothesisTesting-Anova_ print.html Holm and Bonferroni: https: //www.utdallas.edu/~herve/abdi-holm2010-pretty.pdf IEOR165 Discussion Sheng Liu 15