The problem of base rates

Similar documents
Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Applied Statistics for the Behavioral Sciences

For use only in [the name of your school] 2014 S4 Note. S4 Notes (Edexcel)

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE Tests of Statistical Hypotheses

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test

LECTURE 5. Introduction to Econometrics. Hypothesis testing

t-test for b Copyright 2000 Tom Malloy. All rights reserved. Regression

Partitioning the Parameter Space. Topic 18 Composite Hypotheses

Introductory Econometrics. Review of statistics (Part II: Inference)

HYPOTHESIS TESTING. Hypothesis Testing

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

Tests about a population mean

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Hypothesis testing. Data to decisions

Statistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests

Announcements. Unit 3: Foundations for inference Lecture 3: Decision errors, significance levels, sample size, and power.

Lab #12: Exam 3 Review Key

Rigorous Science - Based on a probability value? The linkage between Popperian science and statistical analysis

Rigorous Science - Based on a probability value? The linkage between Popperian science and statistical analysis

Two-Sample Inferential Statistics

Mathematical statistics

Chapter Three. Hypothesis Testing

Data Analysis and Statistical Methods Statistics 651

Parameter Estimation, Sampling Distributions & Hypothesis Testing

Mathematical statistics

Unit 19 Formulating Hypotheses and Making Decisions

Hypothesis Testing and Confidence Intervals (Part 2): Cohen s d, Logic of Testing, and Confidence Intervals

The Purpose of Hypothesis Testing

Single Sample Means. SOCY601 Alan Neustadtl

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm

Section 9.1 (Part 2) (pp ) Type I and Type II Errors

Rigorous Science - Based on a probability value? The linkage between Popperian science and statistical analysis

Class 19. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

10/4/2013. Hypothesis Testing & z-test. Hypothesis Testing. Hypothesis Testing

Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text)

Hypothesis Tests Solutions COR1-GB.1305 Statistics and Data Analysis

Probability and Statistics

MTMS Mathematical Statistics

Lecture 21: October 19

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

Soc 3811 Basic Social Statistics Second Midterm Exam Spring Your Name [50 points]: ID #: ANSWERS

Scientific Method. Section 1. Observation includes making measurements and collecting data. Main Idea

hypothesis a claim about the value of some parameter (like p)

Hypotheses Test Procedures. Is the claim wrong?

Chapter 5: HYPOTHESIS TESTING

Probability and Statistics. Joyeeta Dutta-Moscato June 29, 2015

Introduction to Statistics

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations

Lecturer: Dr. Adote Anum, Dept. of Psychology Contact Information:

Topic 17 Simple Hypotheses

Statistics of Small Signals

Chapter 9 Inferences from Two Samples

Introduction to Artificial Intelligence. Prof. Inkyu Moon Dept. of Robotics Engineering, DGIST

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Power Analysis. Ben Kite KU CRMDA 2015 Summer Methodology Institute

t test for independent means

Bios 6649: Clinical Trials - Statistical Design and Monitoring

Sampling Distributions

Background to Statistics

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

Psychology 282 Lecture #4 Outline Inferences in SLR

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing

10/31/2012. One-Way ANOVA F-test

Power Analysis. Introduction to Power

Introduction 1. STA442/2101 Fall See last slide for copyright information. 1 / 33

Inferential Statistics Hypothesis tests Confidence intervals

CENTRAL LIMIT THEOREM (CLT)

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Rejection regions for the bivariate case

Bayesian Information Criterion as a Practical Alternative to Null-Hypothesis Testing Michael E. J. Masson University of Victoria

ECO220Y Hypothesis Testing: Type I and Type II Errors and Power Readings: Chapter 12,

Bayesian Statistics: An Introduction

STAT Chapter 8: Hypothesis Tests

Spearman Rho Correlation

Examine characteristics of a sample and make inferences about the population

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

Introduction to Statistical Hypothesis Testing

Uni- and Bivariate Power

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA

20 Hypothesis Testing, Part I

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

hypothesis testing 1

Finansiell Statistik, GN, 15 hp, VT2008 Lecture 10-11: Statistical Inference: Hypothesis Testing

Exact Inference by Complete Enumeration

Confidence Intervals and Hypothesis Tests

Sampling Distributions: Central Limit Theorem

AP Statistics Ch 12 Inference for Proportions

Mitosis Data Analysis: Testing Statistical Hypotheses By Dana Krempels, Ph.D. and Steven Green, Ph.D.

Homework Exercises. 1. You want to conduct a test of significance for p the population proportion.

Statistics Introductory Correlation

Survey on Population Mean

This gives us an upper and lower bound that capture our population mean.

Hypothesis Testing. Testing Hypotheses MIT Dr. Kempthorne. Spring MIT Testing Hypotheses

Hypothesis Testing. ) the hypothesis that suggests no change from previous experience

Bayesian Concept Learning

Probability and Statistics. Terms and concepts

Transcription:

Psychology 205: Research Methods in Psychology William Revelle Department of Psychology Northwestern University Evanston, Illinois USA October, 2015 1 / 14

Outline Inferential statistics 2 / 14

Hypothesis testing using inferential statistics How likely are the observed data given the hypothesis that an Independent Variable has no effect. Bayesian statistics compare the likelihood of the data given the hypothesis of no differences as contrasted to the likelihood of the data given competing hypotheses. This takes into account our prior willingness to believe that the IV could have an effect. Also takes into account our strength of belief in the hypothesis of no effect Conventional tests report the probability of the data given the Null hypothesis of no difference. The less likely the data are to be observed given the Null, the more we tend to discount the Null. Three kinds of inferential errors: Type I, Type II and Type III Type I is rejecting the Null when in fact it is true Type II is failing to reject the Null when it is in fact not true Type III is asking the wrong question 3 / 14

Hypothesis Testing Table : The ways we can be mistaken True False Scientists says True Valid Positive Type I error False Type II error Valid rejection Type III error is asking the wrong question! 4 / 14

Hypothesis Testing α and β Table : The ways we can be mistaken True False Scientists says True Valid Positive (Type I error) = α False p(type II error) = β Valid rejection Power as 1-β We need to think about power. Type III error is asking the wrong question! Probability that a significant effect is a type I error = α False α False+(1 β) True 5 / 14

Consider a number of scenarios 1. 1000 studies all together Consider α =.05,.01 Consider 1 β =.5,.8,.95, 1.0 2. But, also consider the state of the world ( What is the a priori likelihood of the outcome) a 50-50 chance (boring result) a 20-80 chance (interesting finding) a 10-90 chance (very interesting finding) a 1-99 chance (Wow, you found that!) 3. Probability that a significant effect is a type I error = α False α False+(1 β) True 6 / 14

Hypothesis Testing α and β Table : The ways we can be mistaken True False Scientists says True VP = (1 β) True p(type I)= α * False False p(type II) = β True VR = (1 α) False Power as 1-β We need to think about power. probability that a significant effect is a type I error = α False α False+(1 β) True 7 / 14

Hypothesis Testing α and β likely event, high power Table : A 50-50 chance high power True False Total Scientists says True 475 25 500 False 25 475 500 Total 500 500 1000 p (False Positive finding was significant) = 25 25+475 =.05 8 / 14

Hypothesis Testing α and β likely event, 50% power Table : A 50-50 chance low power True False Total Scientists says True 250 25 275 False 250 475 725 Total 500 500 1000 p (False Positive finding was significant) = 25 25+250 =.09 9 / 14

Hypothesis Testing α and β unlikely event, 50% power Table : A 10-90 chance low power True False Total Scientists says True 50 45 95 False 50 855 905 Total 100 900 1000 p (False Positive finding was significant) = 45 45+50 =.47 10 / 14

Hypothesis Testing α and β very unlikely event, 80% power Table : A 1-99 chance good power True False Total Scientists says True 8 49.5 57.5 False 2 940.5 942.5 Total 10 990 1000 p (False Positive finding was significant) = 49.5 49.5+8 =.86 11 / 14

Hypothesis Testing α and β very unlikely event, perfect power Table : A 1-99 chance perfect power True False Total Scientists says True 10 49.5 59.5 False 0 940.5 94.5 Total 10 990 1000 p (False Positive finding was significant) = 49.5 49.5+10 =.83 12 / 14

Power, α level, and the excitement of the finding 1. There is a natural tendency to want to show the unlikely. Showing your grandmother is right is not as interesting as showing she is wrong Showing that what most people expect is wrong is very exciting 2. But, the less likely the effect is to be there, the more likely that a significant effect is actually a type I error. Need to increase our Power and be sensitive to the replicability of our results. 3. The power of a good graphic to show the problem. Five lines: alpha =.05,.01,.001 Power =.8 or 1 13 / 14

P(Type I) given alpha, power, sexiness P(Type I) given alpha, power, sexiness P(Type I) 0.0 0.2 0.4 0.6 0.8 1.0 P(Type I) 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.5 0.6 0.7 0.8 0.9 1.0 Sexiness of finding = (1-p) Sexiness of finding = (1-p) P(Type I) given alpha, power, sexiness P(Type I) given alpha, power, sexiness P(Type I) 0.0 0.2 0.4 0.6 0.8 1.0 P(Type I) 0.0 0.2 0.4 0.6 0.8 1.0 0.80 0.85 0.90 0.95 1.00 0.90 0.92 0.94 0.96 0.98 1.00 14 / 14