Sample Size/Power Calculation by Software/Online Calculators

Size: px
Start display at page:

Download "Sample Size/Power Calculation by Software/Online Calculators"

Transcription

1 Sample Size/Power Calculation by Software/Online Calculators May 24, 2018 Li Zhang, Ph.D. Associate Professor Department of Epidemiology and Biostatistics Division of Hematology and Oncology Department of Medicine University of California, San Francisco

2 Topics R packages SAS Proc Online calculator: CTSI sample size calculator Online calculator for clinical trial: SWOG Software: G*Power 2

3 Power Analysis with R 3

4 Power/Sample size calculation for one or two proportions Power calculations for proportion tests (one sample) H 0 : p=p 1 vs. H a : p p 1 pwr.p.test(h, n, sig.level, power, alternative = c("two.sided","less","greater")) Power calculation for two proportions (same sample size) H 0 : p 1 =p 2 vs. H a : p 1 p 2 pwr.2p.test(h, n, sig.level, power, alternative=c("two.sided","less","greater")) Power calculation for two proportions (different sample sizes) H 0 : p 1 =p 2 vs. H a : p 1 p 2 pwr.2p2n.test(h, n1, n2, sig.level, power, alternative = c("two.sided", "less","greater")) Effect size calculation R Demo 4

5 Power calculations for chi-squared tests pwr.chisq.test(w = NULL, N = NULL, df = NULL, sig.level = 0.05, power = NULL) ES.w1(P0, P1): Effect size calculation in the chi-squared test for goodness of fit, which is the sum of differences between observed and expected outcome frequencies Compute effect size w for two sets of k probabilities P0 (null hypothesis) and P1 (alternative hypothesis) ES.w2(P0, P1): Effect size calculation in the chi-squared test for association Compute effect size w for a two-way probability table corresponding to the alternative hypothesis in the chisquared test of association in two-way contingency tables 5

6 Power/Sample size calculation for one or two means Power calculations for t-tests of means (one sample, two samples and paired samples) One sample: H 0 : μ = μ 1 vs. H a : μ μ 1 Two sample or paired samples: H 0 : μ 1 = μ 2 vs. H a : μ 1 μ 2 pwr.t.test(n, d, sig.level, power, type = c("two.sample", "one.sample", "paired"), alternative = c("two.sided", "less", "greater")) Power calculations for two samples (different sizes) t-tests of means H 0 : μ 1 = μ 2 vs. H a : μ 1 μ 2 pwr.t2n.test(n1, n2, d, sig.level = 0.05, power, alternative = c("two.sided", "less","greater")) R Demo 6

7 Power calculations for balanced one-way analysis of variance tests pwr.anova.test(k = NULL, n = NULL, f = NULL, sig.level = 0.05, power = NULL) k n f Number of groups Number of observations (per group) Effect size 7

8 Power calculations for the general linear model pwr.f2.test(u = NULL, v = NULL, f2 = NULL, sig.level = 0.05, power = NULL) u and v are the numerator and denominator degrees of freedom. We use f2 as the effect size measure. when evaluating the impact of a set of predictors on an outcome when evaluating the impact of one set of predictors above and beyond a second set of predictors (or covariates) 8

9 Other R Packages for Sample Size Calculation powersurvepi: Power and Sample Size Calculation for Survival Analysis of Epidemiological Studies epir: Sample size cohort study, case-control study, cross-sectional study, under one or two-stage cluster sampling kappasize: Sample Size Estimation Functions for Studies of Interobserver Agreement powermediation: Power/Sample size calculation for mediation analysis, simple linear regression, logistic regression, or longitudinal study power.roc.test {proc}: Computes sample size, power, significance level or minimum AUC for ROC curves. RNASeqPower: Sample Size for RNA-Seq and similar Studies 9

10 Case-control study by library(epir) A matched case control study is to be carried out to quantify the association between exposure A and an outcome B. Assume the prevalence of exposure in controls is 0.60 and the correlation between case and control exposures for matched pairs (rho) is 0.20 (moderate). Assuming an equal number of cases and controls, how many subjects need to be enrolled into the study to detect an odds ratio of 3.0 with 0.80 power using a two-sided 0.05 test? epi.ccsize(or = 3.0, p0 = 0.60, n = NA, power = 0.80, r = 1, rho = 0.2, design = 1, sided.test = 2, conf.level = 0.95, method = "matched", fleiss = FALSE) A total of 162 subjects need to be enrolled in the study: 81 cases and 81 controls. 10

11 Case-control study by library(epir) How many cases and controls are required if we select three controls per case? epi.ccsize(or = 3.0, p0 = 0.60, n = NA, power = 0.80, r = 3, rho = 0.2, design = 1, sided.test = 2, conf.level = 0.95, method = "matched", fleiss = FALSE) A total of 204 subjects need to be enrolled in the study: 51 cases and 153 controls. 11

12 kappasize: Sample Size Estimation Functions for Studies of Interobserver Agreement Library(kappaSize) Can handle binary to 5 categories Confidence Interval Approach E.g. CI3Cats Calculation of the Lowest Expected Value E.g., FixedN4Cats Power-Based Approach E.g., PowerBinary 12

13 Computes sample size/power/minimum AUC for ROC curves power.roc.test(...) One or Two ROC curves test with roc objects: power.roc.test(roc1, roc2, sig.level = 0.05, power = NULL, alternative = c("two.sided", "one.sided"), reuse.auc=true, method = c("delong", "bootstrap", "obuchowski"),...) One ROC curve with a given AUC: power.roc.test(auc = NULL, ncontrols = NULL, ncases = NULL, sig.level = 0.05, power = NULL, kappa = 1, alternative = c("two.sided", "one.sided"),...) Two ROC curves with the given parameters: power.roc.test(parslist, ncontrols = NULL, ncases = NULL, sig.level = 0.05, power = NULL, kappa = 1, alternative = c("two.sided", "one.sided"),...) 13

14 RNASeqPower: Sample Size for RNA-Seq and Similar Studies rnapower(depth, n, n2 = n, cv, cv2 = cv, effect, alpha, power) depth average depth of coverage for the transcript or gene of interest. Common values are 5-20, any numeric value >0 is valid. n sample size in group 1 (or both) n2 sample size in group 2 cv biological coefficient of variation in group 1 (or both). cv2 biological coefficient of variation in group 2 effect size target effect size 14

15 Comments about R packages Pros: Free A lot of resources for different tests/study designs Generate a figure/table easily for different options of parameters, for example, sample size calculation for sequencing data Cons: Need to write codes Hard to implement sometimes Not sure about reliability 15

16 Power Analysis with SAS SAS PROC POWER t-tests, equivalence tests, and confidence intervals for means tests, equivalence tests, and confidence intervals for binomial proportions multiple regression tests of correlation and partial correlation one-way analysis of variance rank tests for comparing two survival curves logistic regression with binary response Wilcoxon-Mann-Whitney (rank-sum) test PROC GLMPOWER: Compute Power and Sample Size for Repeated Measures 16

17 SAS Example: Calculate power for Pearson chi-squared tests Same sample size, two-sided test of proportions proc power; twosamplefreq test=pchi groupproportions=( ) npergroup=30 power=.; run; The SAS System The POWER Procedure Pearson Chi-square Test for Proportion Difference Fixed Scenario Elements Distribution Asymptotic normal Method Normal approximation Group 1 Proportion 0.1 Group 2 Proportion 0.5 Computed Power Power Sample Size per Group 30 Number of Sides 2 Null Proportion Difference 0 Alpha

18 SAS Example: Calculate power for Pearson chi-squared tests Different sample size, two-sided test of proportions proc power; twosamplefreq test=pchi groupproportions=( ) groupns=25 50 power=.; run; The SAS System The POWER Procedure Pearson Chi-square Test for Proportion Difference Fixed Scenario Elements Distribution Asymptotic normal Method Normal approximation Group 1 Proportion 0.1 Group 2 Proportion 0.5 Group 1 Sample Size 25 Group 2 Sample Size 50 Computed Power Power Number of Sides 2 Null Proportion Difference 0 Alpha

19 SAS Example: Calculate power for t- tests Two independent samples, same size proc power; twosamplemeans test=diff meandiff=2 stddev=2.8 npergroup=30 power=.; run; The SAS System The POWER Procedure Two-Sample t Test for Mean Difference Fixed Scenario Elements Distribution Normal Method Exact Mean Difference 2 Standard Deviation 2.8 Computed Power Power Sample Size per Group 30 Number of Sides 2 Null Difference 0 Alpha

20 SAS Example: Calculate power for t- tests One sample proc power; onesamplemeans test=t mean=2 stddev=2.8 ntotal=30 power=.; run; The SAS System The POWER Procedure One-Sample t Test for Mean Fixed Scenario Elements Distribution Normal Method Exact Mean 2 Standard Deviation 2.8 Computed Power Power Total Sample Size 30 Number of Sides 2 Null Mean 0 Alpha

21 SAS Example: Calculate power for t- tests Paired samples proc power; pairedmeans test=diff meandiff=2 corr=0.5 stddev=2.8 npairs=30 power=.; run; The SAS System The POWER Procedure Paired t Test for Mean Difference Fixed Scenario Elements Distribution Normal Method Exact Mean Difference 2 Standard Deviation 2.8 Correlation 0.5 Computed Power Power Number of Pairs 30 Number of Sides 2 Null Difference 0 Alpha

22 SAS Example: Calculate power for t- tests Two independent samples, different sizes proc power; twosamplemeans test=diff meandiff=2 stddev=2.8 groupns=(20 40) power=.; run; The SAS System The POWER Procedure Two-Sample t Test for Mean Difference Fixed Scenario Elements Distribution Normal Method Exact Mean Difference 2 Standard Deviation 2.8 Computed Power Power Group 1 Sample Size 20 Group 2 Sample Size 40 Number of Sides 2 Null Difference 0 Alpha

23 UCSF CTSI Sample Size Calculators Can do most of the popular tests Compare the mean of a continuous measurement in two samples which allow for clustered sampling. A cluster randomized controlled trial is a type of randomized controlled trial in which groups of subjects (as opposed to individual subjects) are randomised. 23

24 Online Calculators: SWOG Primary objective is not a hypothesis, just estimation, then provide the precision of the estimation Example: The expected adherence rate is 80%, n=50 95% CI is (66.3%, 90.0%) One arm binomial: H 0 : P=0.1 vs. H a : P 0.1 One arm survival: Length of the accrual period Length of the follow-up period, i.e. the time from end of accrual to analysis H 0 : median OS=6 months vs. H a : median OS>6 months H 0 : s(t)=0.5 at 6 months vs. H a : s(t) = 0.5 at 9 months 24

25 Online Calculators: SWOG (cont.) Two-arm binomial: H 0 : P 1 =P 2 vs. H a : P 1 P 2 P 1 = 0.1 vs. P 2 = 0.25 Two-arm survival: Length of the accrual period Length of the follow-up period, i.e. the time from end of accrual to analysis H 0 : HR = 2 vs. H a : HR 2 (Median OS = 6months for null, 12-month accrual and 12-month followup) 25

26 Online Calculators: SWOG (cont.) Two stage a l r l a 2 r 2 If the number of successes after completing the first stage is < al, we reject the alternative hypothesis that p > Pa. If the number of successes after completing the first stage is > r l, we reject the null hypothesis that p < P 0. If the number of successes after completing the trial is < a 2 then we reject the alternative hypothesis. If the number of successes after completing the trial is > r 2 then we reject the null hypothesis. 26

27 Online Calculators: SWOG (cont.) Other options: Survival noninferiority Competing Risk: the hazard of the competing risk random variable Hazard ratios between experimental and standard defining equivalence Hazard ratio must be less than hazard ratio defining equivalence Expected Deaths Make a table of expected death information Provide expected deaths for a given time Provide expected deaths for a time at which the expected proportion of deaths have occurred. 27

28 Online Calculators: Simon s two stage design nstwostagedesign.aspx One arm Phase II clinical trial Endpoint: Response rate or binary outcome Incorporate interim analysis for futility One-sided test Example: H 0 : P=0.1 vs. H a : P>0.1 Simon's two-stage design (Simon, 1989) will be used. The null hypothesis that the true response rate is 0.1 will be tested against a onesided alternative. In the first stage, 22 patients will be accrued. If there are 2 or fewer responses in these 22 patients, the study will be stopped. Otherwise, 18 additional patients will be accrued for a total of 40.The null hypothesis will be rejected if 8 or more responses are observed in 40 patients. This design yields a type I error rate of 0.04 and power of 80% when the true response rate is

29 G* Power G*Power is a tool to compute statistical power analyses for many different t tests, F tests, χ2 tests, z tests and some exact tests. G*Power can also be used to compute effect sizes and to display graphically the results of power analyses. It is free, both Windows and Mac version. 29

30 Exact: Proportion - inequality, two dependent groups (McNemar) Standard Treatment Yes No Yes p 11 p 12 p t No p 21 p 22 1 p t p s 1 p s 1 H 0 : p 12 /p 21 = 1 H 1 : p 12 /p 21 6= 1. Standard Treatment Yes No Yes No e the proportion of discordant pa Select Type of power analysis: Post hoc Options Computation: Exact Input Tail(s): Two Odds ratio: 0.25 α err prob: 0.05 Total sample size: 50 Prop discordant pairs: 0.4 Output Power (1-β err prob): 0.80 Actual α: 0.04 Proportion p12: 0.08 Proportion p21:

31 F test: Fixed effects One-Way ANOVA means to more than two groups. The null hypothesis is that all k means are identical H 0 : µ 1 = µ 2 =... = µ k. The alternative hypothesis states that at least two of the k means differ. H 1 : µ i 6= µ j, for at least one pair i, j with 1 apple i, j apple k. Example: We compare 10 groups, and we have reason to expect a "medium" effect size (f =.25). How many subjects do we need in a test with α = 0.05 to achieve a power of 0.95? Select Type of power analysis: A priori Input Effect size f : 0.25 α err prob: 0.05 Power (1-β err prob): 0.95 Number of groups: 10 Output Noncentrality parameter λ: Critical F: Numerator df: 9 Denominator df: 380 Total sample size: 390 Actual Power:

32 Questions? 32

POWER FOR COMPARING TWO PROPORTIONS WITH INDEPENDENT SAMPLES

POWER FOR COMPARING TWO PROPORTIONS WITH INDEPENDENT SAMPLES This handout covers material found in Section 0.5 of the text. POWER FOR COMPARING TWO PROPORTIONS WITH INDEPENDENT SAMPLES EXAMPLE: Otolaryngology (Example 0.3 of your text, page 405). Suppose a study

More information

APPENDIX B Sample-Size Calculation Methods: Classical Design

APPENDIX B Sample-Size Calculation Methods: Classical Design APPENDIX B Sample-Size Calculation Methods: Classical Design One/Paired - Sample Hypothesis Test for the Mean Sign test for median difference for a paired sample Wilcoxon signed - rank test for one or

More information

Sample size and power calculation using R and SAS proc power. Ho Kim GSPH, SNU

Sample size and power calculation using R and SAS proc power. Ho Kim GSPH, SNU Sample size and power calculation using R and SAS proc power Ho Kim GSPH, SNU Pvalue (1) We want to show that the means of two populations are different! Y 1 a sample mean from the 1st pop Y 2 a sample

More information

Turning a research question into a statistical question.

Turning a research question into a statistical question. Turning a research question into a statistical question. IGINAL QUESTION: Concept Concept Concept ABOUT ONE CONCEPT ABOUT RELATIONSHIPS BETWEEN CONCEPTS TYPE OF QUESTION: DESCRIBE what s going on? DECIDE

More information

Introduction to Statistical Analysis

Introduction to Statistical Analysis Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive

More information

The Design of a Survival Study

The Design of a Survival Study The Design of a Survival Study The design of survival studies are usually based on the logrank test, and sometimes assumes the exponential distribution. As in standard designs, the power depends on The

More information

Using SAS Proc Power to Perform Model-based Power Analysis for Clinical Pharmacology Studies Peng Sun, Merck & Co., Inc.

Using SAS Proc Power to Perform Model-based Power Analysis for Clinical Pharmacology Studies Peng Sun, Merck & Co., Inc. PharmaSUG00 - Paper SP05 Using SAS Proc Power to Perform Model-based Power Analysis for Clinical Pharmacology Studies Peng Sun, Merck & Co., Inc., North Wales, PA ABSRAC In this paper, we demonstrate that

More information

Tests for Two Correlated Proportions in a Matched Case- Control Design

Tests for Two Correlated Proportions in a Matched Case- Control Design Chapter 155 Tests for Two Correlated Proportions in a Matched Case- Control Design Introduction A 2-by-M case-control study investigates a risk factor relevant to the development of a disease. A population

More information

Question. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not?

Question. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not? Hypothesis testing Question Very frequently: what is the possible value of μ? Sample: we know only the average! μ average. Random deviation or not? Standard error: the measure of the random deviation.

More information

Textbook Examples of. SPSS Procedure

Textbook Examples of. SPSS Procedure Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 3: Bivariate association : Categorical variables Proportion in one group One group is measured one time: z test Use the z distribution as an approximation to the binomial

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline

More information

Paper Equivalence Tests. Fei Wang and John Amrhein, McDougall Scientific Ltd.

Paper Equivalence Tests. Fei Wang and John Amrhein, McDougall Scientific Ltd. Paper 11683-2016 Equivalence Tests Fei Wang and John Amrhein, McDougall Scientific Ltd. ABSTRACT Motivated by the frequent need for equivalence tests in clinical trials, this paper provides insights into

More information

Statistics Handbook. All statistical tables were computed by the author.

Statistics Handbook. All statistical tables were computed by the author. Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance

More information

HOW TO DETERMINE THE NUMBER OF SUBJECTS NEEDED FOR MY STUDY?

HOW TO DETERMINE THE NUMBER OF SUBJECTS NEEDED FOR MY STUDY? HOW TO DETERMINE THE NUMBER OF SUBJECTS NEEDED FOR MY STUDY? TUTORIAL ON SAMPLE SIZE AND POWER CALCULATIONS FOR INEQUALITY TESTS. John Zavrakidis j.zavrakidis@nki.nl May 28, 2018 J.Zavrakidis Sample and

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Power and Sample Size Bios 662

Power and Sample Size Bios 662 Power and Sample Size Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-10-31 14:06 BIOS 662 1 Power and Sample Size Outline Introduction One sample: continuous

More information

Statistical Aspects of Futility Analyses. Kevin J Carroll. nd 2013

Statistical Aspects of Futility Analyses. Kevin J Carroll. nd 2013 Statistical Aspects of Futility Analyses Kevin J Carroll March Spring 222013 nd 2013 1 Contents Introduction The Problem in Statistical Terms Defining Futility Three Common Futility Rules The Maths An

More information

Lecture 25. Ingo Ruczinski. November 24, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University

Lecture 25. Ingo Ruczinski. November 24, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University Lecture 25 Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University November 24, 2015 1 2 3 4 5 6 7 8 9 10 11 1 Hypothesis s of homgeneity 2 Estimating risk

More information

Small n, σ known or unknown, underlying nongaussian

Small n, σ known or unknown, underlying nongaussian READY GUIDE Summary Tables SUMMARY-1: Methods to compute some confidence intervals Parameter of Interest Conditions 95% CI Proportion (π) Large n, p 0 and p 1 Equation 12.11 Small n, any p Figure 12-4

More information

Introduction to Power and Sample Size Analysis

Introduction to Power and Sample Size Analysis Four chapters from SAS/STAT User's Guide 13.2: (18) Introduction to Power and Sample Size Analysis (47) The GLMPOWER Procedure (77) The POWER Procedure (78) The Power and Sample Size Application Chapter

More information

Pubh 8482: Sequential Analysis

Pubh 8482: Sequential Analysis Pubh 8482: Sequential Analysis Joseph S. Koopmeiners Division of Biostatistics University of Minnesota Week 5 Course Summary So far, we have discussed Group sequential procedures for two-sided tests Group

More information

Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm

Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam June 8 th, 2016: 9am to 1pm Instructions: 1. This is exam is to be completed independently. Do not discuss your work with

More information

G * Power 3.1 manual March 1, 2017

G * Power 3.1 manual March 1, 2017 G * Power 3.1 manual March 1, 2017 This manual is not yet complete. We will be adding help on more tests in the future. If you cannot find help for your test in this version of the manual, then please

More information

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of

More information

Study Design: Sample Size Calculation & Power Analysis

Study Design: Sample Size Calculation & Power Analysis Study Design: Sample Size Calculation & Power Analysis RCMAR/CHIME/EXPORT April 21, 2008 Honghu Liu, Ph.D. Contents Background Common Designs Examples Computer Software Summary & Discussion Background

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

10: Crosstabs & Independent Proportions

10: Crosstabs & Independent Proportions 10: Crosstabs & Independent Proportions p. 10.1 P Background < Two independent groups < Binary outcome < Compare binomial proportions P Illustrative example ( oswege.sav ) < Food poisoning following church

More information

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3 University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.

More information

E509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou.

E509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou. E509A: Principle of Biostatistics (Week 11(2): Introduction to non-parametric methods ) GY Zou gzou@robarts.ca Sign test for two dependent samples Ex 12.1 subj 1 2 3 4 5 6 7 8 9 10 baseline 166 135 189

More information

Lecture 7: Hypothesis Testing and ANOVA

Lecture 7: Hypothesis Testing and ANOVA Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis

More information

Sample Size Determination

Sample Size Determination Sample Size Determination 018 The number of subjects in a clinical study should always be large enough to provide a reliable answer to the question(s addressed. The sample size is usually determined by

More information

Hypothesis testing, part 2. With some material from Howard Seltman, Blase Ur, Bilge Mutlu, Vibha Sazawal

Hypothesis testing, part 2. With some material from Howard Seltman, Blase Ur, Bilge Mutlu, Vibha Sazawal Hypothesis testing, part 2 With some material from Howard Seltman, Blase Ur, Bilge Mutlu, Vibha Sazawal 1 CATEGORICAL IV, NUMERIC DV 2 Independent samples, one IV # Conditions Normal/Parametric Non-parametric

More information

7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis

7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis Lecture 6: Logistic Regression Analysis Christopher S. Hollenbeak, PhD Jane R. Schubart, PhD The Outcomes Research Toolbox Review Homework 2 Overview Logistic regression model conceptually Logistic regression

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

BIOS 312: Precision of Statistical Inference

BIOS 312: Precision of Statistical Inference and Power/Sample Size and Standard Errors BIOS 312: of Statistical Inference Chris Slaughter Department of Biostatistics, Vanderbilt University School of Medicine January 3, 2013 Outline Overview and Power/Sample

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA Sample Size and Power I: Binary Outcomes James Ware, PhD Harvard School of Public Health Boston, MA Sample Size and Power Principles: Sample size calculations are an essential part of study design Consider

More information

Formulas and Tables by Mario F. Triola

Formulas and Tables by Mario F. Triola Copyright 010 Pearson Education, Inc. Ch. 3: Descriptive Statistics x f # x x f Mean 1x - x s - 1 n 1 x - 1 x s 1n - 1 s B variance s Ch. 4: Probability Mean (frequency table) Standard deviation P1A or

More information

Welcome! Webinar Biostatistics: sample size & power. Thursday, April 26, 12:30 1:30 pm (NDT)

Welcome! Webinar Biostatistics: sample size & power. Thursday, April 26, 12:30 1:30 pm (NDT) . Welcome! Webinar Biostatistics: sample size & power Thursday, April 26, 12:30 1:30 pm (NDT) Get started now: Please check if your speakers are working and mute your audio. Please use the chat box to

More information

Chapter 7 Comparison of two independent samples

Chapter 7 Comparison of two independent samples Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N

More information

Statistical. Psychology

Statistical. Psychology SEVENTH у *i km m it* & П SB Й EDITION Statistical M e t h o d s for Psychology D a v i d C. Howell University of Vermont ; \ WADSWORTH f% CENGAGE Learning* Australia Biaall apan Korea Меяко Singapore

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

Exam details. Final Review Session. Things to Review

Exam details. Final Review Session. Things to Review Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit

More information

BIOS 6222: Biostatistics II. Outline. Course Presentation. Course Presentation. Review of Basic Concepts. Why Nonparametrics.

BIOS 6222: Biostatistics II. Outline. Course Presentation. Course Presentation. Review of Basic Concepts. Why Nonparametrics. BIOS 6222: Biostatistics II Instructors: Qingzhao Yu Don Mercante Cruz Velasco 1 Outline Course Presentation Review of Basic Concepts Why Nonparametrics The sign test 2 Course Presentation Contents Justification

More information

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests PSY 307 Statistics for the Behavioral Sciences Chapter 20 Tests for Ranked Data, Choosing Statistical Tests What To Do with Non-normal Distributions Tranformations (pg 382): The shape of the distribution

More information

Power and the computation of sample size

Power and the computation of sample size 9 Power and the computation of sample size A statistical test will not be able to detect a true difference if the sample size is too small compared with the magnitude of the difference. When designing

More information

Checking model assumptions with regression diagnostics

Checking model assumptions with regression diagnostics @graemeleehickey www.glhickey.com graeme.hickey@liverpool.ac.uk Checking model assumptions with regression diagnostics Graeme L. Hickey University of Liverpool Conflicts of interest None Assistant Editor

More information

E509A: Principle of Biostatistics. GY Zou

E509A: Principle of Biostatistics. GY Zou E509A: Principle of Biostatistics (Effect measures ) GY Zou gzou@robarts.ca We have discussed inference procedures for 2 2 tables in the context of comparing two groups. Yes No Group 1 a b n 1 Group 2

More information

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Patrick J. Heagerty PhD Department of Biostatistics University of Washington 166 ISCB 2010 Session Four Outline Examples

More information

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014 Nemours Biomedical Research Statistics Course Li Xie Nemours Biostatistics Core October 14, 2014 Outline Recap Introduction to Logistic Regression Recap Descriptive statistics Variable type Example of

More information

Bios 6649: Clinical Trials - Statistical Design and Monitoring

Bios 6649: Clinical Trials - Statistical Design and Monitoring Bios 6649: Clinical Trials - Statistical Design and Monitoring Spring Semester 2015 John M. Kittelson Department of Biostatistics & Informatics Colorado School of Public Health University of Colorado Denver

More information

= 1 i. normal approximation to χ 2 df > df

= 1 i. normal approximation to χ 2 df > df χ tests 1) 1 categorical variable χ test for goodness-of-fit ) categorical variables χ test for independence (association, contingency) 3) categorical variables McNemar's test for change χ df k (O i 1

More information

How do we compare the relative performance among competing models?

How do we compare the relative performance among competing models? How do we compare the relative performance among competing models? 1 Comparing Data Mining Methods Frequent problem: we want to know which of the two learning techniques is better How to reliably say Model

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

Epidemiology Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures. John Koval

Epidemiology Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures. John Koval Epidemiology 9509 Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

Non-parametric methods

Non-parametric methods Eastern Mediterranean University Faculty of Medicine Biostatistics course Non-parametric methods March 4&7, 2016 Instructor: Dr. Nimet İlke Akçay (ilke.cetin@emu.edu.tr) Learning Objectives 1. Distinguish

More information

Sample Size / Power Calculations

Sample Size / Power Calculations Sample Size / Power Calculations A Simple Example Goal: To study the effect of cold on blood pressure (mmhg) in rats Use a Completely Randomized Design (CRD): 12 rats are randomly assigned to one of two

More information

Power assessment in group sequential design with multiple biomarker subgroups for multiplicity problem

Power assessment in group sequential design with multiple biomarker subgroups for multiplicity problem Power assessment in group sequential design with multiple biomarker subgroups for multiplicity problem Lei Yang, Ph.D. Statistical Scientist, Roche (China) Holding Ltd. Aug 30 th 2018, Shanghai Jiao Tong

More information

Adaptive Trial Designs

Adaptive Trial Designs Adaptive Trial Designs Wenjing Zheng, Ph.D. Methods Core Seminar Center for AIDS Prevention Studies University of California, San Francisco Nov. 17 th, 2015 Trial Design! Ethical:!eg.! Safety!! Efficacy!

More information

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Guosheng Yin Department of Statistics and Actuarial Science The University of Hong Kong Joint work with J. Xu PSI and RSS Journal

More information

Inference for Binomial Parameters

Inference for Binomial Parameters Inference for Binomial Parameters Dipankar Bandyopadhyay, Ph.D. Department of Biostatistics, Virginia Commonwealth University D. Bandyopadhyay (VCU) BIOS 625: Categorical Data & GLM 1 / 58 Inference for

More information

BINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference

BINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference BINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference BINF702 SPRING 2014 Chapter 7 Hypothesis Testing 1 Section 7.9 One-Sample c 2 Test for the Variance of a Normal Distribution Eq. 7.40

More information

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.1 Logistic Regression (Dose - Response)

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.1 Logistic Regression (Dose - Response) Model Based Statistics in Biology. Part V. The Generalized Linear Model. Logistic Regression ( - Response) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9, 10, 11), Part IV

More information

Transition Passage to Descriptive Statistics 28

Transition Passage to Descriptive Statistics 28 viii Preface xiv chapter 1 Introduction 1 Disciplines That Use Quantitative Data 5 What Do You Mean, Statistics? 6 Statistics: A Dynamic Discipline 8 Some Terminology 9 Problems and Answers 12 Scales of

More information

Longitudinal Modeling with Logistic Regression

Longitudinal Modeling with Logistic Regression Newsom 1 Longitudinal Modeling with Logistic Regression Longitudinal designs involve repeated measurements of the same individuals over time There are two general classes of analyses that correspond to

More information

Session 3 The proportional odds model and the Mann-Whitney test

Session 3 The proportional odds model and the Mann-Whitney test Session 3 The proportional odds model and the Mann-Whitney test 3.1 A unified approach to inference 3.2 Analysis via dichotomisation 3.3 Proportional odds 3.4 Relationship with the Mann-Whitney test Session

More information

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation Biost 58 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 5: Review Purpose of Statistics Statistics is about science (Science in the broadest

More information

16.400/453J Human Factors Engineering. Design of Experiments II

16.400/453J Human Factors Engineering. Design of Experiments II J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential

More information

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC CHI SQUARE ANALYSIS I N T R O D U C T I O N T O N O N - P A R A M E T R I C A N A L Y S E S HYPOTHESIS TESTS SO FAR We ve discussed One-sample t-test Dependent Sample t-tests Independent Samples t-tests

More information

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) BSTT523 Pagano & Gauvreau Chapter 13 1 Nonparametric Statistics Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) In particular, data

More information

Unit 9: Inferences for Proportions and Count Data

Unit 9: Inferences for Proportions and Count Data Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 12/15/2008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)

More information

A3. Statistical Inference Hypothesis Testing for General Population Parameters

A3. Statistical Inference Hypothesis Testing for General Population Parameters Appendix / A3. Statistical Inference / General Parameters- A3. Statistical Inference Hypothesis Testing for General Population Parameters POPULATION H 0 : θ = θ 0 θ is a generic parameter of interest (e.g.,

More information

High-Throughput Sequencing Course

High-Throughput Sequencing Course High-Throughput Sequencing Course DESeq Model for RNA-Seq Biostatistics and Bioinformatics Summer 2017 Outline Review: Standard linear regression model (e.g., to model gene expression as function of an

More information

Overrunning in Clinical Trials: a Methodological Review

Overrunning in Clinical Trials: a Methodological Review Overrunning in Clinical Trials: a Methodological Review Dario Gregori Unit of Biostatistics, Epidemiology and Public Health Department of Cardiac, Thoracic and Vascular Sciences dario.gregori@unipd.it

More information

df=degrees of freedom = n - 1

df=degrees of freedom = n - 1 One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:

More information

Introduction to Inferential Statistics. Jaranit Kaewkungwal, Ph.D. Faculty of Tropical Medicine Mahidol University

Introduction to Inferential Statistics. Jaranit Kaewkungwal, Ph.D. Faculty of Tropical Medicine Mahidol University Introduction to Inferential Statistics Jaranit Kaewkungwal, Ph.D. Faculty of Tropical Medicine Mahidol University 1 Data & Variables 2 Types of Data QUALITATIVE Data expressed by type Data that has been

More information

Formulas and Tables. for Elementary Statistics, Tenth Edition, by Mario F. Triola Copyright 2006 Pearson Education, Inc. ˆp E p ˆp E Proportion

Formulas and Tables. for Elementary Statistics, Tenth Edition, by Mario F. Triola Copyright 2006 Pearson Education, Inc. ˆp E p ˆp E Proportion Formulas and Tables for Elementary Statistics, Tenth Edition, by Mario F. Triola Copyright 2006 Pearson Education, Inc. Ch. 3: Descriptive Statistics x Sf. x x Sf Mean S(x 2 x) 2 s Å n 2 1 n(sx 2 ) 2 (Sx)

More information

Correlation and the Analysis of Variance Approach to Simple Linear Regression

Correlation and the Analysis of Variance Approach to Simple Linear Regression Correlation and the Analysis of Variance Approach to Simple Linear Regression Biometry 755 Spring 2009 Correlation and the Analysis of Variance Approach to Simple Linear Regression p. 1/35 Correlation

More information

Analysis of Categorical Data. Nick Jackson University of Southern California Department of Psychology 10/11/2013

Analysis of Categorical Data. Nick Jackson University of Southern California Department of Psychology 10/11/2013 Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/2013 1 Overview Data Types Contingency Tables Logit Models Binomial Ordinal Nominal 2 Things not

More information

PubH 7405: REGRESSION ANALYSIS INTRODUCTION TO LOGISTIC REGRESSION

PubH 7405: REGRESSION ANALYSIS INTRODUCTION TO LOGISTIC REGRESSION PubH 745: REGRESSION ANALYSIS INTRODUCTION TO LOGISTIC REGRESSION Let Y be the Dependent Variable Y taking on values and, and: π Pr(Y) Y is said to have the Bernouilli distribution (Binomial with n ).

More information

Unit 9: Inferences for Proportions and Count Data

Unit 9: Inferences for Proportions and Count Data Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 1/15/008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)

More information

Lecture 12: Effect modification, and confounding in logistic regression

Lecture 12: Effect modification, and confounding in logistic regression Lecture 12: Effect modification, and confounding in logistic regression Ani Manichaikul amanicha@jhsph.edu 4 May 2007 Today Categorical predictor create dummy variables just like for linear regression

More information

Bayes Factor Single Arm Time-to-event User s Guide (Version 1.0.0)

Bayes Factor Single Arm Time-to-event User s Guide (Version 1.0.0) Bayes Factor Single Arm Time-to-event User s Guide (Version 1.0.0) Department of Biostatistics P. O. Box 301402, Unit 1409 The University of Texas, M. D. Anderson Cancer Center Houston, Texas 77230-1402,

More information

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010 1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of

More information

CHL 5225H Advanced Statistical Methods for Clinical Trials: Multiplicity

CHL 5225H Advanced Statistical Methods for Clinical Trials: Multiplicity CHL 5225H Advanced Statistical Methods for Clinical Trials: Multiplicity Prof. Kevin E. Thorpe Dept. of Public Health Sciences University of Toronto Objectives 1. Be able to distinguish among the various

More information

BIOMETRICS INFORMATION

BIOMETRICS INFORMATION BIOMETRICS INFORMATION Index of Pamphlet Topics (for pamphlets #1 to #60) as of December, 2000 Adjusted R-square ANCOVA: Analysis of Covariance 13: ANCOVA: Analysis of Covariance ANOVA: Analysis of Variance

More information

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES Cox s regression analysis Time dependent explanatory variables Henrik Ravn Bandim Health Project, Statens Serum Institut 4 November 2011 1 / 53

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

E509A: Principle of Biostatistics. GY Zou

E509A: Principle of Biostatistics. GY Zou E509A: Principle of Biostatistics (Week 4: Inference for a single mean ) GY Zou gzou@srobarts.ca Example 5.4. (p. 183). A random sample of n =16, Mean I.Q is 106 with standard deviation S =12.4. What

More information

SPSS Guide For MMI 409

SPSS Guide For MMI 409 SPSS Guide For MMI 409 by John Wong March 2012 Preface Hopefully, this document can provide some guidance to MMI 409 students on how to use SPSS to solve many of the problems covered in the D Agostino

More information

Analysis of 2x2 Cross-Over Designs using T-Tests

Analysis of 2x2 Cross-Over Designs using T-Tests Chapter 234 Analysis of 2x2 Cross-Over Designs using T-Tests Introduction This procedure analyzes data from a two-treatment, two-period (2x2) cross-over design. The response is assumed to be a continuous

More information

Lecture 25: Models for Matched Pairs

Lecture 25: Models for Matched Pairs Lecture 25: Models for Matched Pairs Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina Lecture

More information

Outline The Rank-Sum Test Procedure Paired Data Comparing Two Variances Lab 8: Hypothesis Testing with R. Week 13 Comparing Two Populations, Part II

Outline The Rank-Sum Test Procedure Paired Data Comparing Two Variances Lab 8: Hypothesis Testing with R. Week 13 Comparing Two Populations, Part II Week 13 Comparing Two Populations, Part II Week 13 Objectives Coverage of the topic of comparing two population continues with new procedures and a new sampling design. The week concludes with a lab session.

More information

Experimental Design and Data Analysis for Biologists

Experimental Design and Data Analysis for Biologists Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1

More information

Measuring relationships among multiple responses

Measuring relationships among multiple responses Measuring relationships among multiple responses Linear association (correlation, relatedness, shared information) between pair-wise responses is an important property used in almost all multivariate analyses.

More information

Descriptive Statistics-I. Dr Mahmoud Alhussami

Descriptive Statistics-I. Dr Mahmoud Alhussami Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.

More information