James H. Steiger. Department of Psychology and Human Development Vanderbilt University. Introduction Factors Influencing Power

Similar documents
Confidence Interval Estimation

Understanding p Values

POLI 443 Applied Political Research

Inference for Single Proportions and Means T.Scofield

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

The Chi-Square and F Distributions

Hypothesis Testing and Interval Estimation

Normal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT):

Design of the Fuzzy Rank Tests Package

Data Science for Engineers Department of Computer Science and Engineering Indian Institute of Technology, Madras

Introduction to the Analysis of Variance (ANOVA)

280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE Tests of Statistical Hypotheses

Introductory Econometrics. Review of statistics (Part II: Inference)

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017

Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

HYPOTHESIS TESTING. Hypothesis Testing

Summary: the confidence interval for the mean (σ 2 known) with gaussian assumption

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs)

MATH 240. Chapter 8 Outlines of Hypothesis Tests

16.3 One-Way ANOVA: The Procedure

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test

Statistical inference

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

Adv. App. Stat. Presentation On a paradoxical property of the Kolmogorov Smirnov two-sample test

Lecture 06. DSUR CH 05 Exploring Assumptions of parametric statistics Hypothesis Testing Power

A3. Statistical Inference

Industrial Engineering Prof. Inderdeep Singh Department of Mechanical & Industrial Engineering Indian Institute of Technology, Roorkee

Hypothesis Testing One Sample Tests

The Finite Sample Properties of the Least Squares Estimator / Basic Hypothesis Testing

Probability Methods in Civil Engineering Prof. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.

The t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras. Lecture 11 t- Tests

ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing

Null Hypothesis Significance Testing p-values, significance level, power, t-tests

Power and sample size calculations

Psych 230. Psychological Measurement and Statistics

CBA4 is live in practice mode this week exam mode from Saturday!

Independent Samples ANOVA

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

Sampling Distributions

Ch. 1: Data and Distributions

Stat 5421 Lecture Notes Fuzzy P-Values and Confidence Intervals Charles J. Geyer March 12, Discreteness versus Hypothesis Tests

Hypothesis Testing. We normally talk about two types of hypothesis: the null hypothesis and the research or alternative hypothesis.

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

WISE Power Tutorial Answer Sheet

Tutorial on Mathematical Induction

Robustness. James H. Steiger. Department of Psychology and Human Development Vanderbilt University. James H. Steiger (Vanderbilt University) 1 / 37

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Bayesian Inference for Normal Mean

Descriptive Statistics-I. Dr Mahmoud Alhussami

We saw in the previous lectures that continuity and differentiability help to understand some aspects of a

BIOLOGY 111. CHAPTER 1: An Introduction to the Science of Life

Econometrics. 4) Statistical inference

Coulomb s Law and Electric Fields

Power Analysis. Introduction to Power

What p values really mean (and why I should care) Francis C. Dane, PhD

10/31/2012. One-Way ANOVA F-test

Module 5 Practice problem and Homework answers

Lecture 6 April

Review of Basic Probability Theory

Applied Statistics for the Behavioral Sciences

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

Probability theory and inference statistics! Dr. Paola Grosso! SNE research group!! (preferred!)!!

Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018

Review. One-way ANOVA, I. What s coming up. Multiple comparisons

PSYC 331 STATISTICS FOR PSYCHOLOGIST

Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS.

Lab #12: Exam 3 Review Key

Key Algebraic Results in Linear Regression

How is the Statistical Power of Hypothesis Tests affected by Dose Uncertainty?

COMPARING SEVERAL MEANS: ANOVA

POLI 443 Applied Political Research

8/23/2018. One-Way ANOVA F-test. 1. Situation/hypotheses. 2. Test statistic. 3.Distribution. 4. Assumptions

Week 12 Hypothesis Testing, Part II Comparing Two Populations

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

One-Way ANOVA Calculations: In-Class Exercise Psychology 311 Spring, 2013

Rank-Based Methods. Lukas Meier

The problem of base rates

Lecture 18: Analysis of variance: ANOVA

Sampling Distributions: Central Limit Theorem

Hypothesis Testing and Confidence Intervals (Part 2): Cohen s d, Logic of Testing, and Confidence Intervals

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

The Central Limit Theorem

Frequentist Statistics and Hypothesis Testing Spring

PSYC 331 STATISTICS FOR PSYCHOLOGISTS

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Regression and the 2-Sample t

Hypotheses Test Procedures. Is the claim wrong?

STAT Chapter 8: Hypothesis Tests

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm

Fin285a:Computer Simulations and Risk Assessment Section 2.3.2:Hypothesis testing, and Confidence Intervals

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests:

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

Advanced Operations Research Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology, Madras

Lecture 5. 1 Review (Pairwise Independence and Derandomization)

Transcription:

Department of Psychology and Human Development Vanderbilt University

1 2 3 4 5

In the preceding lecture, we examined hypothesis testing as a general strategy. Then, we examined how to calculate critical values, power, and required sample size in the specific case of a hypothesis test on a single mean, when σ is somehow known.

At the end of the lecture, we introduced the notion of standardized effect size E s, the amount by which the null hypothesis is wrong in standard deviation units. E s = µ µ 0 σ We found that power can be calculated from the simplified formula Power = Φ( ne s Z crit ) (2) where Φ is the normal curve cumulative probability (pnorm in R) and Z crit is the critical value for the Z-statistic. (1)

We then worked with the formula showing that the sample size n required to achieve a desired level of power can be calculated directly as ( (Zcrit ) ) + Z 2 power n required = ceiling (3) where Z power is the normal curve quantile corresponding to the desired level of power. E s

In this module, we want to re-examine these formulas to see what they can tell us about 1 The factors influencing power of a hypothesis test 2 The factors influencing the n required to achieve a given level of power Let s begin by examining the factors influencing power.

As we saw in Equation 2, power is evaluated by computing Φ(K), where K = ne s Z crit. The normal curve cumulative probability function Φ is a strictly increasing S-shaped function, as shown in the plot on the next slide. So anything that increases K must also increase power, all other things being equal. Likewise, anything that decreases K must also decrease power.

Φ(K) 0.0 0.2 0.4 0.6 0.8 1.0 4 2 0 2 4 K = ne s Z crit

Since n and E s and Z crit are all assumed to be positive when this formula is used, it is clear that K will increase as a function of n and E s, while it will decrease as a function of Z crit. So, all other things being equal, 1 As n increases, power increases. This is true because the left side of the formula for K increases as a function of the square root of n. 2 As E s increases, power increases. Two factors can cause E s to increase. (1) The null hypothesis can become more false because of an increased experimental effect (due to, for example, a more powerful manipulation on the part of the experimenter), or (2) The natural variation σ in the measurement of the dependent variable decreases because of refinements in the measurement method or a decrease in variability in the population. 3 As α increases, power increases. This is because increasing α allows the critical value Z crit to decrease, thereby reducing K in the plot. For example, relaxing α from 0.01 to the less stringent 0.05 will increase power. 4 Shifting from a 2-tailed test to a correctly specified 1-tailed test with the same α will increase power. This is because shifting from a 2-tailed test to a 1-tailed test will move half the rejection area to the other rejection region. This will decrease Z crit and increase power. On the other hand, of course, shifting from a 1-tailed test to a 2-tailed test will cost you power.

Since n and power are directly related, we can conclude that other factors we discussed that increase power will, all other things being equal, decrease the required sample size n. Another way of examining the issue is to re-examine the formula for required n, i.e., ( (Zcrit ) ) + Z 2 power n required = ceiling (4) Anything increasing either term in the numerator will increase required n, and anything decreasing E s will increase the required n. Decreasing ( tightening ) α or switching from a 1-tailed to a two-tailed test will increase Z crit. Increasing required power will increase Z power. Any decrease in the experimental effect or increase in σ will decrease E s. Taking these facts into consideration, we emerge with the rules summarized on the next slide. E s

All other things being equal, Increasing ( relaxing ) α will decrease the required n. Switching from a 2-tailed test to a correctly specified 1-tailed test will decrease the required n. Increasing the experimental effect (with a stronger manipulation) will increase E s and decrease the required n. Decreasing the natural variation in measurements will increase E s and decrease the required n. Decreasing the required power will decrease the required n.

In the previous sections, we developed rules of thumb for how various factors affect power and/or required sample size (n). We developed the rules by a careful consideration of the formulas for calculating power and required n in the context of the 1-sample Z test for a single mean. However, we will find that these results generalized across a wide variety of statistical tests. In the next module, we extend the 1-sample Z test to the situation in which the population standard deviation σ is not known.