Power Analysis. Introduction to Power

Similar documents
One-Way Analysis of Variance: ANOVA

Independent Samples t tests. Background for Independent Samples t test

Power and the computation of sample size

Difference in two or more average scores in different groups

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Introduction and Background to Multilevel Analysis

Introduction and Single Predictor Regression. Correlation

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

Hypothesis testing. Data to decisions

Introduction to Statistical Hypothesis Testing

Sample Size Calculations

Statistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts

Power Analysis. Ben Kite KU CRMDA 2015 Summer Methodology Institute

Multiple t Tests. Introduction to Analysis of Variance. Experiments with More than 2 Conditions

Introduction to Business Statistics QM 220 Chapter 12

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

LECTURE 5. Introduction to Econometrics. Hypothesis testing

Power and sample size calculations

Harvard University. Rigorous Research in Engineering Education

Visual interpretation with normal approximation

Hypothesis Testing and Confidence Intervals (Part 2): Cohen s d, Logic of Testing, and Confidence Intervals

The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

PSYC 331 STATISTICS FOR PSYCHOLOGISTS

In a one-way ANOVA, the total sums of squares among observations is partitioned into two components: Sums of squares represent:

Section 6.2 Hypothesis Testing

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs)

Probability theory and inference statistics! Dr. Paola Grosso! SNE research group!! (preferred!)!!

The problem of base rates

OHSU OGI Class ECE-580-DOE :Design of Experiments Steve Brainerd

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA

Sampling Distributions: Central Limit Theorem

Section 9.1 (Part 2) (pp ) Type I and Type II Errors

Independent Samples ANOVA

16.3 One-Way ANOVA: The Procedure

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

Power of a hypothesis test

HYPOTHESIS TESTING. Hypothesis Testing

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

Hypothesis testing: Steps

Hypothesis Testing hypothesis testing approach formulation of the test statistic

Wolf River. Lecture 19 - ANOVA. Exploratory analysis. Wolf River - Data. Sta 111. June 11, 2014

Performance Evaluation and Comparison

Variance Decomposition and Goodness of Fit

Power and Sample Size Bios 662

Rejection regions for the bivariate case

Sociology 6Z03 Review II

Power and nonparametric methods Basic statistics for experimental researchersrs 2017

Value Added Modeling

Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs

IEOR165 Discussion Week 12

Power and sample size calculations

Simple Linear Regression: One Quantitative IV

One-sample categorical data: approximate inference

MATH 240. Chapter 8 Outlines of Hypothesis Tests

Lecture 10: Alternatives to OLS with limited dependent variables. PEA vs APE Logit/Probit Poisson

Review. One-way ANOVA, I. What s coming up. Multiple comparisons

Lecture 7: Hypothesis Testing and ANOVA

One-Way Analysis of Variance. With regression, we related two quantitative, typically continuous variables.

z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests

Hypothesis testing: Steps

Power Analysis using GPower 3.1

Chapter Seven: Multi-Sample Methods 1/52

Lecture Slides for INTRODUCTION TO. Machine Learning. ETHEM ALPAYDIN The MIT Press,

Comparing Several Means: ANOVA

ANOVA Analysis of Variance

Machine Learning. Lecture Slides for. ETHEM ALPAYDIN The MIT Press, h1p://

1. What does the alternate hypothesis ask for a one-way between-subjects analysis of variance?

Regression With a Categorical Independent Variable

Statistical Inference

Chapter 7: Hypothesis Testing

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

Uni- and Bivariate Power

Binary Logistic Regression

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

What p values really mean (and why I should care) Francis C. Dane, PhD

This gives us an upper and lower bound that capture our population mean.

Analysis of Variance: Repeated measures

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Two-Sample Inferential Statistics

Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti

Lecture 21: October 19

ANALYTICAL COMPARISONS AMONG TREATMENT MEANS (CHAPTER 4)

Wolf River. Lecture 15 - ANOVA. Exploratory analysis. Wolf River - Data. Sta102 / BME102. October 22, 2014

Statistical Distribution Assumptions of General Linear Models

Probability and Statistics Notes

Two sample Hypothesis tests in R.

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test

Study Guide #3: OneWay ANALYSIS OF VARIANCE (ANOVA)

Partitioning the Parameter Space. Topic 18 Composite Hypotheses

Lecture 30. DATA 8 Summer Regression Inference

1 Introduction to One-way ANOVA

Power. January 12, 2019

Wolf River. Lecture 15 - ANOVA. Exploratory analysis. Wolf River - Data. Sta102 / BME102. October 26, 2015

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

Lecture (chapter 10): Hypothesis testing III: The analysis of variance

Review. December 4 th, Review

Using SPSS for One Way Analysis of Variance

Transcription:

Power Analysis Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning When testing a specific null hypothesis (H 0 ), we have four possible outcomes. True State Decision Made H 0 = True H 0 = False Reject Type I error Correct Decision Fail to Reject Correct Decision Type II error In the above table, α is referred to as our Type I error rate. Therefore, if we set α = 0.05, then we have a 5% chance of making a Type I error.

Power Definition cont. From the table on the previous slide, we call the probability of making a Type II error, β. Based on these definitions for α and β, we then define power as the probability of rejecting the H 0 when it is false, or (1 β). Recall that this is specifically what we often want to do. Power is making a correct decision. Therefore, we maximize power when we minimize β. Power will have bounds 0 (1 β) 1 with larger values meaning more power. These can also be thought of as a percent. Factors Affecting Power 1. α Level As the α level increases (say from 0.05 to 0.10), β will decrease, meaning that power will go up (1 β). Conversely, as the α level decreases (say from 0.05 to 0.01), β will increase, meaning that power will go down (1 β). 2. Sample Size As sample size increases, the standard error of measurement will likewise decrease, all other things being equal since σ X = σ X / n. 3. Effect Size As the effect size increases, so does the power of the analysis. All other things remaining constant, as the size of the effect increases, the relative power associated with rejecting the null also increases.

Power for the Single Sample t test Suppose that we have a hypothetical dataset where µ = 100, X = 105, SD X = 15, and n = 30. In this case we can compute t as: t = X µ σ/ n 105 100 = 15/ 30 = 1.826 We can then compute the probability of t given our data and likewise β as: > pt(qt(0.975, 29), 29, 1.826) [1] 0.5770932 Graphical Representation of Heuristic Data > curve(pt(x, 29, ncp = 1.826), from = 0, to = 6) > abline(v = qt(0.975, 29)) pt(x, 29, ncp = 1.826) 0.0 0.2 0.4 0.6 0.8 1.0 0 1 2 3 4 5 6 x

In order to compute power, we need to know a few things about our data. 1. n 2. SD 3. Effect Size (or difference between means) 4. α level Knowing these four things will then let you compute the missing parameter (1 β), or power. > power.t.test(delta = 5, sd = 15, n = 30,, + type = "one.sample") One-sample t test power calculation n = 30 delta = 5 sd = 15 power = 0.4228091 alternative = two.sided Computing n based on Power Suppose that you are conducting a single sample experiment and you want to know how many people you should sample in order to attain a power of 0.85. We can compute this by the work backwards principle since n is the only missing piece of data. > power.t.test(delta = 5, sd = 15, power = 0.85,, + type = "one.sample") One-sample t test power calculation n = 82.74835 delta = 5 sd = 15 power = 0.85 alternative = two.sided This means that we need to sample 83 people to obtain a power of 0.85 given our study parameters.

Ways of Improving Power Reduce α from 0.05 to say 0.10. Consider methods for reducing the within-group variability (thus increasing the effect size). Increase the number of individuals in each group. Power for the One-Way ANOVA For the ANOVA, we can calculate relative group sample size given our estimated power, α, effect-size, and K. > library(pwr) > pwr.anova.test(f = 0.28, k = 4,, + power = 0.8) Balanced one-way analysis of variance power calculation k = 4 n = 35.75789 f = 0.28 power = 0.8 NOTE: n is number in each group

Post-Hoc Power for the One-Way ANOVA Likewise for the ANOVA, we can calculate post-hoc power based on α, effect-size, the average n, and K. > pwr.anova.test(f = 0.28, k = 4, n = 20, ) Balanced one-way analysis of variance power calculation k = 4 n = 20 f = 0.28 power = 0.5149793 NOTE: n is number in each group Other Types of Tests For all types of statistical tests, we can compute power or the needed sample size based on desired power. Almost all of the power calculations for statistical tests you will run are covered in Cohen (1988) Statistical Power Analysis for the Behavioral Sciences.