Sample Size Calculations

Similar documents
Power Analysis. Introduction to Power

Sample Size. Vorasith Sornsrivichai, MD., FETP Epidemiology Unit, Faculty of Medicine Prince of Songkla University

Sample Size Determination

Power of a hypothesis test

a Sample By:Dr.Hoseyn Falahzadeh 1

Sampling Distributions: Central Limit Theorem

Difference in two or more average scores in different groups

Philosophy and History of Statistics

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA

Comparing Several Means: ANOVA

HYPOTHESIS TESTING. Hypothesis Testing

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

Estimating the accuracy of a hypothesis Setting. Assume a binary classification setting

Contrasts (in general)

Two sample Hypothesis tests in R.

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Review. One-way ANOVA, I. What s coming up. Multiple comparisons

Using SPSS for One Way Analysis of Variance

How do we compare the relative performance among competing models?

Two-sample inference: Continuous Data

Question. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not?

The Purpose of Hypothesis Testing

Parameter Estimation, Sampling Distributions & Hypothesis Testing

Cherry Blossom run (1) The credit union Cherry Blossom Run is a 10 mile race that takes place every year in D.C. In 2009 there were participants

CH.9 Tests of Hypotheses for a Single Sample

Business Statistics. Lecture 10: Course Review

Comparing the effects of two treatments on two ordinal outcome variables

Study Ch. 9.3, #47 53 (45 51), 55 61, (55 59)

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:

Confidence Intervals and Hypothesis Tests

Power and sample size calculations

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

LECTURE 5. Introduction to Econometrics. Hypothesis testing

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Hypothesis testing. Data to decisions

SMAM 314 Exam 3 Name. 1. Mark the following statements true or false. (6 points 2 each)

Statistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers

Contingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels.

The Components of a Statistical Hypothesis Testing Problem

Power and the computation of sample size

Exam Applied Statistical Regression. Good Luck!

Stat 529 (Winter 2011) Experimental Design for the Two-Sample Problem. Motivation: Designing a new silver coins experiment

Power and Sample Sizes in RCTs Cape Town 2007

Multiple Comparisons

Hypothesis Testing with Z and T

Sociology 6Z03 Review II

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

Relating Graph to Matlab

On the Triangle Test with Replications

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Review of Statistics 101

Hypothesis Testing and Confidence Intervals (Part 2): Cohen s d, Logic of Testing, and Confidence Intervals

Lecture 5: ANOVA and Correlation

COMPARING SEVERAL MEANS: ANOVA

Hypothesis testing, part 2. With some material from Howard Seltman, Blase Ur, Bilge Mutlu, Vibha Sazawal

Lecture Outline. Biost 518 Applied Biostatistics II. Choice of Model for Analysis. Choice of Model. Choice of Model. Lecture 10: Multiple Regression:

Appendix C: Accuracy, Precision, and Uncertainty

Data Analysis and Statistical Methods Statistics 651

Hypothesis testing: Steps

Note: k = the # of conditions n = # of data points in a condition N = total # of data points

Intro. To Exptl. Design

PSY 216. Assignment 9 Answers. Under what circumstances is a t statistic used instead of a z-score for a hypothesis test

Analysis of Variance (ANOVA)

Sampling Distributions

Tests for the Odds Ratio of Two Proportions in a 2x2 Cross-Over Design

Power Analysis. Ben Kite KU CRMDA 2015 Summer Methodology Institute

A Clinical Trial Simulation System, Its Applications, and Future Challenges. Peter Westfall, Texas Tech University Kuenhi Tsai, Merck Research Lab

One-way between-subjects ANOVA. Comparing three or more independent means

Power and nonparametric methods Basic statistics for experimental researchersrs 2017

We need to define some concepts that are used in experiments.

Chapter Six: Two Independent Samples Methods 1/51

Decision Trees. CSC411/2515: Machine Learning and Data Mining, Winter 2018 Luke Zettlemoyer, Carlos Guestrin, and Andrew Moore

Categorical Data Analysis. The data are often just counts of how many things each category has.

Introduction to Signal Detection and Classification. Phani Chavali

Lecture Testing Hypotheses: The Neyman-Pearson Paradigm

Sample Size Calculations for Group Randomized Trials with Unequal Sample Sizes through Monte Carlo Simulations

Chi square test of independence

Dose-response modeling with bivariate binary data under model uncertainty

The Difference in Proportions Test

(1) Introduction to Bayesian statistics

CUSUM(t) D data. Supplementary Figure 1: Examples of changes in linear trend and CUSUM. (a) The existence of

STT 315 Problem Set #3

Math 3361-Modern Algebra Lecture 08 9/26/ Cardinality

Chapter 6. Chapter A random sample of size n = 452 yields 113 successes. Calculate the 95% confidence interval

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

Political Science 236 Hypothesis Testing: Review and Bootstrapping

Chapter 26: Comparing Counts (Chi Square)

Welcome! Webinar Biostatistics: sample size & power. Thursday, April 26, 12:30 1:30 pm (NDT)

STA Module 10 Comparing Two Proportions

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.

Logistic Regression Analysis

STAT Chapter 8: Hypothesis Tests

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests

1 Statistical inference for a population mean

Exam 1. Problem 1: True or false

CSE 103 Homework 8: Solutions November 30, var(x) = np(1 p) = P r( X ) 0.95 P r( X ) 0.

Clinical Trials. Olli Saarela. September 18, Dalla Lana School of Public Health University of Toronto.

16.400/453J Human Factors Engineering. Design of Experiments II

Transcription:

Sample Size Calculations Analyses rely on means rather than individual values Means are more precise Precision measured by σ n So precision depends on n This can be used, directly or indirectly, as a basis for setting a sample size 1

Why set a sample size? Should aim for a degree of precision appropriate to purposes of the study Too much precision is wasteful and possibly unethical Too little precision will yield a study that is uninformative (and so possibly unethical) Methods usually based on either: Specifying width of confidence interval or Power of a hypothesis test

Principal Difficulties Methods usually use formulae, tables or a computer program All require values of unknown parameters Study is a method of estimating these parameters So do sample size calculations put the cart before the horse? To an extent, yes Must use imprecise estimates, so results must not be taken too numerically can rule out silly studies 3

Method based on confidence intervals Confidence interval is: x ± ts / n Use z in place of t, OK for n>30 Width of confidence interval is zs/ n 4s/ n Limit on confidence interval is equivalent to limit on SE SE is less than L if s and/or σ. n > s / L - use literature to get estimate of 1 1 For two-groups, SE=s +, leading to n n n 1 s L 4

Method based on hypothesis tests Principles but not details apply whatever the type of outcome Start with Normal outcome, two groups Slightly different emphasis to hypothesis test: Type I error if a true null hypothesis is rejected Type II error if a false null hypothesis is not rejected. 5

Constraining error rates Test null hypothesis by referring z = x x 1 + 1 s 1 n n 1 to SNV If you reject null hypothesis if z>1.96 or z<-1.96 then Type I error rate is 5% (independent of sample size) Type II error rate depends on sample size, so Type II error rate is the thing we seek to control through sample size. Note POWER = 1 -Type II error rate = Probability reject a false null hypothesis Heuristic explanation of process can be given 6

0.08 0.00-3 0 5 0.4 0.3 0. 0.1 0.0-1.5 0.0 5.0 Basic Ideas Two cases: different means, same se and same means different ses Dashed has twice SE of solid 0 Mean 1 Mean Difference between groups 0 Mean Difference between groups Will be able to tell μ1 μ 0 more easily is μ1 μ larger OR if se is smaller, i.e. sample size larger 7

Power as a function Null hypothesis specifies be false in an infinite number of ways μ μ ( = ) whereas the null hypothesis can 1 0 It follows that power is not a single number, but depends on μ μ 1 1.0 Probability of rejecting H 0 0.8 0.6 0.4 0. 0.0-1.5 0.0 1.5 (μ 1 μ )/σ μ μ All curves must go through (0,0.05) [or (0,α)] and tend to 1 as 1 increases. 8

Sample size formula for Normal outcomes Two groups with common SD, testing they have a common mean. Number in each group is n. Power to detect difference μ μ is β at two-sided level α if: 1 n = β α 1 σ ( z + z1 ) ( μ μ ) z β cuts off top proportion β from a SNV etc. 9

Common values Formula can be written 1 1 ) ( ) ( = = μ μ σ μ μ σ A A n Power 80% 90% 95% A 15.7 1.0 6.0 (Type I error 0.05) σ needs to be estimated from literature/pilot study (latter can be very variable) 10

Specifying μ Power to detect some values of μ 1 μ 1 Huge studies needed to detect small μ μ μ will always be small 1 But small values of μ μ will not be of interest 1 Need to specify μ clinical interest. μ 1 as smallest difference that is of Essentially set μ1 μ as limit so that investigators are prepared to accept that their study may miss differences less than μ μ 1 11

Example Placebo controlled trial of zanamivir, a new treatment for influenza (MIST study group, Lancet 35, 1877-81) Outcome is number of days to alleviation of symptoms. Common SD is taken as.75 days Decided that a mean improvement of 1 day is minimum that is clinically valuable For 90% power at 5% level A=1.0, so number required in each group is.75 1.0 = 159: for σ=, 3.5 we get 84 or 58 1 1

Binary outcome α, β and π π 1 specified as before π ( 1 π) No analogue of σ: remember SE(r/n) = n information about σ comes through π itself., i.e. (sample size for change 0.1 to 0.3 different from 0.4 to 0.6) Actually need to specify one of π π 1, as well as π 1 π Table available, in terms of π smaller and π larger π smaller Note change from 0.6 to 0.8 is same problem as change from 0. to 0.4 13

Concluding remarks Sample size calculation is a bit of a black art: depends on unknown parameters that need to be estimated by whatever means you can - still leaves you with very variable estimates Is a sample size estimate that is plausibly between 400 and 800 any use? Yes, if all you can manage is 100. Can use formula to get power from a given n - need to be cautious Anticipate dropouts, but remember that loss of power may not be your main concern about dropouts Post hoc power calculations inappropriate - use confidence intervals. 14