Sample Size/Power Calculation by Software/Online Calculators
|
|
- Trevor Harris
- 5 years ago
- Views:
Transcription
1 Sample Size/Power Calculation by Software/Online Calculators May 24, 2018 Li Zhang, Ph.D. Associate Professor Department of Epidemiology and Biostatistics Division of Hematology and Oncology Department of Medicine University of California, San Francisco
2 Topics R packages SAS Proc Online calculator: CTSI sample size calculator Online calculator for clinical trial: SWOG Software: G*Power 2
3 Power Analysis with R 3
4 Power/Sample size calculation for one or two proportions Power calculations for proportion tests (one sample) H 0 : p=p 1 vs. H a : p p 1 pwr.p.test(h, n, sig.level, power, alternative = c("two.sided","less","greater")) Power calculation for two proportions (same sample size) H 0 : p 1 =p 2 vs. H a : p 1 p 2 pwr.2p.test(h, n, sig.level, power, alternative=c("two.sided","less","greater")) Power calculation for two proportions (different sample sizes) H 0 : p 1 =p 2 vs. H a : p 1 p 2 pwr.2p2n.test(h, n1, n2, sig.level, power, alternative = c("two.sided", "less","greater")) Effect size calculation R Demo 4
5 Power calculations for chi-squared tests pwr.chisq.test(w = NULL, N = NULL, df = NULL, sig.level = 0.05, power = NULL) ES.w1(P0, P1): Effect size calculation in the chi-squared test for goodness of fit, which is the sum of differences between observed and expected outcome frequencies Compute effect size w for two sets of k probabilities P0 (null hypothesis) and P1 (alternative hypothesis) ES.w2(P0, P1): Effect size calculation in the chi-squared test for association Compute effect size w for a two-way probability table corresponding to the alternative hypothesis in the chisquared test of association in two-way contingency tables 5
6 Power/Sample size calculation for one or two means Power calculations for t-tests of means (one sample, two samples and paired samples) One sample: H 0 : μ = μ 1 vs. H a : μ μ 1 Two sample or paired samples: H 0 : μ 1 = μ 2 vs. H a : μ 1 μ 2 pwr.t.test(n, d, sig.level, power, type = c("two.sample", "one.sample", "paired"), alternative = c("two.sided", "less", "greater")) Power calculations for two samples (different sizes) t-tests of means H 0 : μ 1 = μ 2 vs. H a : μ 1 μ 2 pwr.t2n.test(n1, n2, d, sig.level = 0.05, power, alternative = c("two.sided", "less","greater")) R Demo 6
7 Power calculations for balanced one-way analysis of variance tests pwr.anova.test(k = NULL, n = NULL, f = NULL, sig.level = 0.05, power = NULL) k n f Number of groups Number of observations (per group) Effect size 7
8 Power calculations for the general linear model pwr.f2.test(u = NULL, v = NULL, f2 = NULL, sig.level = 0.05, power = NULL) u and v are the numerator and denominator degrees of freedom. We use f2 as the effect size measure. when evaluating the impact of a set of predictors on an outcome when evaluating the impact of one set of predictors above and beyond a second set of predictors (or covariates) 8
9 Other R Packages for Sample Size Calculation powersurvepi: Power and Sample Size Calculation for Survival Analysis of Epidemiological Studies epir: Sample size cohort study, case-control study, cross-sectional study, under one or two-stage cluster sampling kappasize: Sample Size Estimation Functions for Studies of Interobserver Agreement powermediation: Power/Sample size calculation for mediation analysis, simple linear regression, logistic regression, or longitudinal study power.roc.test {proc}: Computes sample size, power, significance level or minimum AUC for ROC curves. RNASeqPower: Sample Size for RNA-Seq and similar Studies 9
10 Case-control study by library(epir) A matched case control study is to be carried out to quantify the association between exposure A and an outcome B. Assume the prevalence of exposure in controls is 0.60 and the correlation between case and control exposures for matched pairs (rho) is 0.20 (moderate). Assuming an equal number of cases and controls, how many subjects need to be enrolled into the study to detect an odds ratio of 3.0 with 0.80 power using a two-sided 0.05 test? epi.ccsize(or = 3.0, p0 = 0.60, n = NA, power = 0.80, r = 1, rho = 0.2, design = 1, sided.test = 2, conf.level = 0.95, method = "matched", fleiss = FALSE) A total of 162 subjects need to be enrolled in the study: 81 cases and 81 controls. 10
11 Case-control study by library(epir) How many cases and controls are required if we select three controls per case? epi.ccsize(or = 3.0, p0 = 0.60, n = NA, power = 0.80, r = 3, rho = 0.2, design = 1, sided.test = 2, conf.level = 0.95, method = "matched", fleiss = FALSE) A total of 204 subjects need to be enrolled in the study: 51 cases and 153 controls. 11
12 kappasize: Sample Size Estimation Functions for Studies of Interobserver Agreement Library(kappaSize) Can handle binary to 5 categories Confidence Interval Approach E.g. CI3Cats Calculation of the Lowest Expected Value E.g., FixedN4Cats Power-Based Approach E.g., PowerBinary 12
13 Computes sample size/power/minimum AUC for ROC curves power.roc.test(...) One or Two ROC curves test with roc objects: power.roc.test(roc1, roc2, sig.level = 0.05, power = NULL, alternative = c("two.sided", "one.sided"), reuse.auc=true, method = c("delong", "bootstrap", "obuchowski"),...) One ROC curve with a given AUC: power.roc.test(auc = NULL, ncontrols = NULL, ncases = NULL, sig.level = 0.05, power = NULL, kappa = 1, alternative = c("two.sided", "one.sided"),...) Two ROC curves with the given parameters: power.roc.test(parslist, ncontrols = NULL, ncases = NULL, sig.level = 0.05, power = NULL, kappa = 1, alternative = c("two.sided", "one.sided"),...) 13
14 RNASeqPower: Sample Size for RNA-Seq and Similar Studies rnapower(depth, n, n2 = n, cv, cv2 = cv, effect, alpha, power) depth average depth of coverage for the transcript or gene of interest. Common values are 5-20, any numeric value >0 is valid. n sample size in group 1 (or both) n2 sample size in group 2 cv biological coefficient of variation in group 1 (or both). cv2 biological coefficient of variation in group 2 effect size target effect size 14
15 Comments about R packages Pros: Free A lot of resources for different tests/study designs Generate a figure/table easily for different options of parameters, for example, sample size calculation for sequencing data Cons: Need to write codes Hard to implement sometimes Not sure about reliability 15
16 Power Analysis with SAS SAS PROC POWER t-tests, equivalence tests, and confidence intervals for means tests, equivalence tests, and confidence intervals for binomial proportions multiple regression tests of correlation and partial correlation one-way analysis of variance rank tests for comparing two survival curves logistic regression with binary response Wilcoxon-Mann-Whitney (rank-sum) test PROC GLMPOWER: Compute Power and Sample Size for Repeated Measures 16
17 SAS Example: Calculate power for Pearson chi-squared tests Same sample size, two-sided test of proportions proc power; twosamplefreq test=pchi groupproportions=( ) npergroup=30 power=.; run; The SAS System The POWER Procedure Pearson Chi-square Test for Proportion Difference Fixed Scenario Elements Distribution Asymptotic normal Method Normal approximation Group 1 Proportion 0.1 Group 2 Proportion 0.5 Computed Power Power Sample Size per Group 30 Number of Sides 2 Null Proportion Difference 0 Alpha
18 SAS Example: Calculate power for Pearson chi-squared tests Different sample size, two-sided test of proportions proc power; twosamplefreq test=pchi groupproportions=( ) groupns=25 50 power=.; run; The SAS System The POWER Procedure Pearson Chi-square Test for Proportion Difference Fixed Scenario Elements Distribution Asymptotic normal Method Normal approximation Group 1 Proportion 0.1 Group 2 Proportion 0.5 Group 1 Sample Size 25 Group 2 Sample Size 50 Computed Power Power Number of Sides 2 Null Proportion Difference 0 Alpha
19 SAS Example: Calculate power for t- tests Two independent samples, same size proc power; twosamplemeans test=diff meandiff=2 stddev=2.8 npergroup=30 power=.; run; The SAS System The POWER Procedure Two-Sample t Test for Mean Difference Fixed Scenario Elements Distribution Normal Method Exact Mean Difference 2 Standard Deviation 2.8 Computed Power Power Sample Size per Group 30 Number of Sides 2 Null Difference 0 Alpha
20 SAS Example: Calculate power for t- tests One sample proc power; onesamplemeans test=t mean=2 stddev=2.8 ntotal=30 power=.; run; The SAS System The POWER Procedure One-Sample t Test for Mean Fixed Scenario Elements Distribution Normal Method Exact Mean 2 Standard Deviation 2.8 Computed Power Power Total Sample Size 30 Number of Sides 2 Null Mean 0 Alpha
21 SAS Example: Calculate power for t- tests Paired samples proc power; pairedmeans test=diff meandiff=2 corr=0.5 stddev=2.8 npairs=30 power=.; run; The SAS System The POWER Procedure Paired t Test for Mean Difference Fixed Scenario Elements Distribution Normal Method Exact Mean Difference 2 Standard Deviation 2.8 Correlation 0.5 Computed Power Power Number of Pairs 30 Number of Sides 2 Null Difference 0 Alpha
22 SAS Example: Calculate power for t- tests Two independent samples, different sizes proc power; twosamplemeans test=diff meandiff=2 stddev=2.8 groupns=(20 40) power=.; run; The SAS System The POWER Procedure Two-Sample t Test for Mean Difference Fixed Scenario Elements Distribution Normal Method Exact Mean Difference 2 Standard Deviation 2.8 Computed Power Power Group 1 Sample Size 20 Group 2 Sample Size 40 Number of Sides 2 Null Difference 0 Alpha
23 UCSF CTSI Sample Size Calculators Can do most of the popular tests Compare the mean of a continuous measurement in two samples which allow for clustered sampling. A cluster randomized controlled trial is a type of randomized controlled trial in which groups of subjects (as opposed to individual subjects) are randomised. 23
24 Online Calculators: SWOG Primary objective is not a hypothesis, just estimation, then provide the precision of the estimation Example: The expected adherence rate is 80%, n=50 95% CI is (66.3%, 90.0%) One arm binomial: H 0 : P=0.1 vs. H a : P 0.1 One arm survival: Length of the accrual period Length of the follow-up period, i.e. the time from end of accrual to analysis H 0 : median OS=6 months vs. H a : median OS>6 months H 0 : s(t)=0.5 at 6 months vs. H a : s(t) = 0.5 at 9 months 24
25 Online Calculators: SWOG (cont.) Two-arm binomial: H 0 : P 1 =P 2 vs. H a : P 1 P 2 P 1 = 0.1 vs. P 2 = 0.25 Two-arm survival: Length of the accrual period Length of the follow-up period, i.e. the time from end of accrual to analysis H 0 : HR = 2 vs. H a : HR 2 (Median OS = 6months for null, 12-month accrual and 12-month followup) 25
26 Online Calculators: SWOG (cont.) Two stage a l r l a 2 r 2 If the number of successes after completing the first stage is < al, we reject the alternative hypothesis that p > Pa. If the number of successes after completing the first stage is > r l, we reject the null hypothesis that p < P 0. If the number of successes after completing the trial is < a 2 then we reject the alternative hypothesis. If the number of successes after completing the trial is > r 2 then we reject the null hypothesis. 26
27 Online Calculators: SWOG (cont.) Other options: Survival noninferiority Competing Risk: the hazard of the competing risk random variable Hazard ratios between experimental and standard defining equivalence Hazard ratio must be less than hazard ratio defining equivalence Expected Deaths Make a table of expected death information Provide expected deaths for a given time Provide expected deaths for a time at which the expected proportion of deaths have occurred. 27
28 Online Calculators: Simon s two stage design nstwostagedesign.aspx One arm Phase II clinical trial Endpoint: Response rate or binary outcome Incorporate interim analysis for futility One-sided test Example: H 0 : P=0.1 vs. H a : P>0.1 Simon's two-stage design (Simon, 1989) will be used. The null hypothesis that the true response rate is 0.1 will be tested against a onesided alternative. In the first stage, 22 patients will be accrued. If there are 2 or fewer responses in these 22 patients, the study will be stopped. Otherwise, 18 additional patients will be accrued for a total of 40.The null hypothesis will be rejected if 8 or more responses are observed in 40 patients. This design yields a type I error rate of 0.04 and power of 80% when the true response rate is
29 G* Power G*Power is a tool to compute statistical power analyses for many different t tests, F tests, χ2 tests, z tests and some exact tests. G*Power can also be used to compute effect sizes and to display graphically the results of power analyses. It is free, both Windows and Mac version. 29
30 Exact: Proportion - inequality, two dependent groups (McNemar) Standard Treatment Yes No Yes p 11 p 12 p t No p 21 p 22 1 p t p s 1 p s 1 H 0 : p 12 /p 21 = 1 H 1 : p 12 /p 21 6= 1. Standard Treatment Yes No Yes No e the proportion of discordant pa Select Type of power analysis: Post hoc Options Computation: Exact Input Tail(s): Two Odds ratio: 0.25 α err prob: 0.05 Total sample size: 50 Prop discordant pairs: 0.4 Output Power (1-β err prob): 0.80 Actual α: 0.04 Proportion p12: 0.08 Proportion p21:
31 F test: Fixed effects One-Way ANOVA means to more than two groups. The null hypothesis is that all k means are identical H 0 : µ 1 = µ 2 =... = µ k. The alternative hypothesis states that at least two of the k means differ. H 1 : µ i 6= µ j, for at least one pair i, j with 1 apple i, j apple k. Example: We compare 10 groups, and we have reason to expect a "medium" effect size (f =.25). How many subjects do we need in a test with α = 0.05 to achieve a power of 0.95? Select Type of power analysis: A priori Input Effect size f : 0.25 α err prob: 0.05 Power (1-β err prob): 0.95 Number of groups: 10 Output Noncentrality parameter λ: Critical F: Numerator df: 9 Denominator df: 380 Total sample size: 390 Actual Power:
32 Questions? 32
POWER FOR COMPARING TWO PROPORTIONS WITH INDEPENDENT SAMPLES
This handout covers material found in Section 0.5 of the text. POWER FOR COMPARING TWO PROPORTIONS WITH INDEPENDENT SAMPLES EXAMPLE: Otolaryngology (Example 0.3 of your text, page 405). Suppose a study
More informationAPPENDIX B Sample-Size Calculation Methods: Classical Design
APPENDIX B Sample-Size Calculation Methods: Classical Design One/Paired - Sample Hypothesis Test for the Mean Sign test for median difference for a paired sample Wilcoxon signed - rank test for one or
More informationSample size and power calculation using R and SAS proc power. Ho Kim GSPH, SNU
Sample size and power calculation using R and SAS proc power Ho Kim GSPH, SNU Pvalue (1) We want to show that the means of two populations are different! Y 1 a sample mean from the 1st pop Y 2 a sample
More informationTurning a research question into a statistical question.
Turning a research question into a statistical question. IGINAL QUESTION: Concept Concept Concept ABOUT ONE CONCEPT ABOUT RELATIONSHIPS BETWEEN CONCEPTS TYPE OF QUESTION: DESCRIBE what s going on? DECIDE
More informationIntroduction to Statistical Analysis
Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive
More informationThe Design of a Survival Study
The Design of a Survival Study The design of survival studies are usually based on the logrank test, and sometimes assumes the exponential distribution. As in standard designs, the power depends on The
More informationUsing SAS Proc Power to Perform Model-based Power Analysis for Clinical Pharmacology Studies Peng Sun, Merck & Co., Inc.
PharmaSUG00 - Paper SP05 Using SAS Proc Power to Perform Model-based Power Analysis for Clinical Pharmacology Studies Peng Sun, Merck & Co., Inc., North Wales, PA ABSRAC In this paper, we demonstrate that
More informationTests for Two Correlated Proportions in a Matched Case- Control Design
Chapter 155 Tests for Two Correlated Proportions in a Matched Case- Control Design Introduction A 2-by-M case-control study investigates a risk factor relevant to the development of a disease. A population
More informationQuestion. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not?
Hypothesis testing Question Very frequently: what is the possible value of μ? Sample: we know only the average! μ average. Random deviation or not? Standard error: the measure of the random deviation.
More informationTextbook Examples of. SPSS Procedure
Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of
More informationStatistics in medicine
Statistics in medicine Lecture 3: Bivariate association : Categorical variables Proportion in one group One group is measured one time: z test Use the z distribution as an approximation to the binomial
More informationCorrelation and Simple Linear Regression
Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline
More informationPaper Equivalence Tests. Fei Wang and John Amrhein, McDougall Scientific Ltd.
Paper 11683-2016 Equivalence Tests Fei Wang and John Amrhein, McDougall Scientific Ltd. ABSTRACT Motivated by the frequent need for equivalence tests in clinical trials, this paper provides insights into
More informationStatistics Handbook. All statistical tables were computed by the author.
Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance
More informationHOW TO DETERMINE THE NUMBER OF SUBJECTS NEEDED FOR MY STUDY?
HOW TO DETERMINE THE NUMBER OF SUBJECTS NEEDED FOR MY STUDY? TUTORIAL ON SAMPLE SIZE AND POWER CALCULATIONS FOR INEQUALITY TESTS. John Zavrakidis j.zavrakidis@nki.nl May 28, 2018 J.Zavrakidis Sample and
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationPower and Sample Size Bios 662
Power and Sample Size Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-10-31 14:06 BIOS 662 1 Power and Sample Size Outline Introduction One sample: continuous
More informationStatistical Aspects of Futility Analyses. Kevin J Carroll. nd 2013
Statistical Aspects of Futility Analyses Kevin J Carroll March Spring 222013 nd 2013 1 Contents Introduction The Problem in Statistical Terms Defining Futility Three Common Futility Rules The Maths An
More informationLecture 25. Ingo Ruczinski. November 24, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University
Lecture 25 Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University November 24, 2015 1 2 3 4 5 6 7 8 9 10 11 1 Hypothesis s of homgeneity 2 Estimating risk
More informationSmall n, σ known or unknown, underlying nongaussian
READY GUIDE Summary Tables SUMMARY-1: Methods to compute some confidence intervals Parameter of Interest Conditions 95% CI Proportion (π) Large n, p 0 and p 1 Equation 12.11 Small n, any p Figure 12-4
More informationIntroduction to Power and Sample Size Analysis
Four chapters from SAS/STAT User's Guide 13.2: (18) Introduction to Power and Sample Size Analysis (47) The GLMPOWER Procedure (77) The POWER Procedure (78) The Power and Sample Size Application Chapter
More informationPubh 8482: Sequential Analysis
Pubh 8482: Sequential Analysis Joseph S. Koopmeiners Division of Biostatistics University of Minnesota Week 5 Course Summary So far, we have discussed Group sequential procedures for two-sided tests Group
More informationName: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm
Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam June 8 th, 2016: 9am to 1pm Instructions: 1. This is exam is to be completed independently. Do not discuss your work with
More informationG * Power 3.1 manual March 1, 2017
G * Power 3.1 manual March 1, 2017 This manual is not yet complete. We will be adding help on more tests in the future. If you cannot find help for your test in this version of the manual, then please
More informationIntroduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.
Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of
More informationStudy Design: Sample Size Calculation & Power Analysis
Study Design: Sample Size Calculation & Power Analysis RCMAR/CHIME/EXPORT April 21, 2008 Honghu Liu, Ph.D. Contents Background Common Designs Examples Computer Software Summary & Discussion Background
More informationStatistics in medicine
Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu
More informationGlossary for the Triola Statistics Series
Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling
More information10: Crosstabs & Independent Proportions
10: Crosstabs & Independent Proportions p. 10.1 P Background < Two independent groups < Binary outcome < Compare binomial proportions P Illustrative example ( oswege.sav ) < Food poisoning following church
More informationPrerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3
University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.
More informationE509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou.
E509A: Principle of Biostatistics (Week 11(2): Introduction to non-parametric methods ) GY Zou gzou@robarts.ca Sign test for two dependent samples Ex 12.1 subj 1 2 3 4 5 6 7 8 9 10 baseline 166 135 189
More informationLecture 7: Hypothesis Testing and ANOVA
Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis
More informationSample Size Determination
Sample Size Determination 018 The number of subjects in a clinical study should always be large enough to provide a reliable answer to the question(s addressed. The sample size is usually determined by
More informationHypothesis testing, part 2. With some material from Howard Seltman, Blase Ur, Bilge Mutlu, Vibha Sazawal
Hypothesis testing, part 2 With some material from Howard Seltman, Blase Ur, Bilge Mutlu, Vibha Sazawal 1 CATEGORICAL IV, NUMERIC DV 2 Independent samples, one IV # Conditions Normal/Parametric Non-parametric
More information7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis
Lecture 6: Logistic Regression Analysis Christopher S. Hollenbeak, PhD Jane R. Schubart, PhD The Outcomes Research Toolbox Review Homework 2 Overview Logistic regression model conceptually Logistic regression
More informationCHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)
FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter
More informationBIOS 312: Precision of Statistical Inference
and Power/Sample Size and Standard Errors BIOS 312: of Statistical Inference Chris Slaughter Department of Biostatistics, Vanderbilt University School of Medicine January 3, 2013 Outline Overview and Power/Sample
More informationGlossary. The ISI glossary of statistical terms provides definitions in a number of different languages:
Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the
More informationSample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA
Sample Size and Power I: Binary Outcomes James Ware, PhD Harvard School of Public Health Boston, MA Sample Size and Power Principles: Sample size calculations are an essential part of study design Consider
More informationFormulas and Tables by Mario F. Triola
Copyright 010 Pearson Education, Inc. Ch. 3: Descriptive Statistics x f # x x f Mean 1x - x s - 1 n 1 x - 1 x s 1n - 1 s B variance s Ch. 4: Probability Mean (frequency table) Standard deviation P1A or
More informationWelcome! Webinar Biostatistics: sample size & power. Thursday, April 26, 12:30 1:30 pm (NDT)
. Welcome! Webinar Biostatistics: sample size & power Thursday, April 26, 12:30 1:30 pm (NDT) Get started now: Please check if your speakers are working and mute your audio. Please use the chat box to
More informationChapter 7 Comparison of two independent samples
Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N
More informationStatistical. Psychology
SEVENTH у *i km m it* & П SB Й EDITION Statistical M e t h o d s for Psychology D a v i d C. Howell University of Vermont ; \ WADSWORTH f% CENGAGE Learning* Australia Biaall apan Korea Меяко Singapore
More informationDETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics
DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and
More informationExam details. Final Review Session. Things to Review
Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit
More informationBIOS 6222: Biostatistics II. Outline. Course Presentation. Course Presentation. Review of Basic Concepts. Why Nonparametrics.
BIOS 6222: Biostatistics II Instructors: Qingzhao Yu Don Mercante Cruz Velasco 1 Outline Course Presentation Review of Basic Concepts Why Nonparametrics The sign test 2 Course Presentation Contents Justification
More informationPSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests
PSY 307 Statistics for the Behavioral Sciences Chapter 20 Tests for Ranked Data, Choosing Statistical Tests What To Do with Non-normal Distributions Tranformations (pg 382): The shape of the distribution
More informationPower and the computation of sample size
9 Power and the computation of sample size A statistical test will not be able to detect a true difference if the sample size is too small compared with the magnitude of the difference. When designing
More informationChecking model assumptions with regression diagnostics
@graemeleehickey www.glhickey.com graeme.hickey@liverpool.ac.uk Checking model assumptions with regression diagnostics Graeme L. Hickey University of Liverpool Conflicts of interest None Assistant Editor
More informationE509A: Principle of Biostatistics. GY Zou
E509A: Principle of Biostatistics (Effect measures ) GY Zou gzou@robarts.ca We have discussed inference procedures for 2 2 tables in the context of comparing two groups. Yes No Group 1 a b n 1 Group 2
More informationPart IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation
Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Patrick J. Heagerty PhD Department of Biostatistics University of Washington 166 ISCB 2010 Session Four Outline Examples
More informationNemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014
Nemours Biomedical Research Statistics Course Li Xie Nemours Biostatistics Core October 14, 2014 Outline Recap Introduction to Logistic Regression Recap Descriptive statistics Variable type Example of
More informationBios 6649: Clinical Trials - Statistical Design and Monitoring
Bios 6649: Clinical Trials - Statistical Design and Monitoring Spring Semester 2015 John M. Kittelson Department of Biostatistics & Informatics Colorado School of Public Health University of Colorado Denver
More information= 1 i. normal approximation to χ 2 df > df
χ tests 1) 1 categorical variable χ test for goodness-of-fit ) categorical variables χ test for independence (association, contingency) 3) categorical variables McNemar's test for change χ df k (O i 1
More informationHow do we compare the relative performance among competing models?
How do we compare the relative performance among competing models? 1 Comparing Data Mining Methods Frequent problem: we want to know which of the two learning techniques is better How to reliably say Model
More informationTesting Independence
Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1
More informationEpidemiology Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures. John Koval
Epidemiology 9509 Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered
More information" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2
Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the
More informationNon-parametric methods
Eastern Mediterranean University Faculty of Medicine Biostatistics course Non-parametric methods March 4&7, 2016 Instructor: Dr. Nimet İlke Akçay (ilke.cetin@emu.edu.tr) Learning Objectives 1. Distinguish
More informationSample Size / Power Calculations
Sample Size / Power Calculations A Simple Example Goal: To study the effect of cold on blood pressure (mmhg) in rats Use a Completely Randomized Design (CRD): 12 rats are randomly assigned to one of two
More informationPower assessment in group sequential design with multiple biomarker subgroups for multiplicity problem
Power assessment in group sequential design with multiple biomarker subgroups for multiplicity problem Lei Yang, Ph.D. Statistical Scientist, Roche (China) Holding Ltd. Aug 30 th 2018, Shanghai Jiao Tong
More informationAdaptive Trial Designs
Adaptive Trial Designs Wenjing Zheng, Ph.D. Methods Core Seminar Center for AIDS Prevention Studies University of California, San Francisco Nov. 17 th, 2015 Trial Design! Ethical:!eg.! Safety!! Efficacy!
More informationTwo-stage Adaptive Randomization for Delayed Response in Clinical Trials
Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Guosheng Yin Department of Statistics and Actuarial Science The University of Hong Kong Joint work with J. Xu PSI and RSS Journal
More informationInference for Binomial Parameters
Inference for Binomial Parameters Dipankar Bandyopadhyay, Ph.D. Department of Biostatistics, Virginia Commonwealth University D. Bandyopadhyay (VCU) BIOS 625: Categorical Data & GLM 1 / 58 Inference for
More informationBINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference
BINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference BINF702 SPRING 2014 Chapter 7 Hypothesis Testing 1 Section 7.9 One-Sample c 2 Test for the Variance of a Normal Distribution Eq. 7.40
More informationModel Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.1 Logistic Regression (Dose - Response)
Model Based Statistics in Biology. Part V. The Generalized Linear Model. Logistic Regression ( - Response) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch 9, 10, 11), Part IV
More informationTransition Passage to Descriptive Statistics 28
viii Preface xiv chapter 1 Introduction 1 Disciplines That Use Quantitative Data 5 What Do You Mean, Statistics? 6 Statistics: A Dynamic Discipline 8 Some Terminology 9 Problems and Answers 12 Scales of
More informationLongitudinal Modeling with Logistic Regression
Newsom 1 Longitudinal Modeling with Logistic Regression Longitudinal designs involve repeated measurements of the same individuals over time There are two general classes of analyses that correspond to
More informationSession 3 The proportional odds model and the Mann-Whitney test
Session 3 The proportional odds model and the Mann-Whitney test 3.1 A unified approach to inference 3.2 Analysis via dichotomisation 3.3 Proportional odds 3.4 Relationship with the Mann-Whitney test Session
More informationBiost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation
Biost 58 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 5: Review Purpose of Statistics Statistics is about science (Science in the broadest
More information16.400/453J Human Factors Engineering. Design of Experiments II
J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential
More informationCHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC
CHI SQUARE ANALYSIS I N T R O D U C T I O N T O N O N - P A R A M E T R I C A N A L Y S E S HYPOTHESIS TESTS SO FAR We ve discussed One-sample t-test Dependent Sample t-tests Independent Samples t-tests
More informationData are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)
BSTT523 Pagano & Gauvreau Chapter 13 1 Nonparametric Statistics Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) In particular, data
More informationUnit 9: Inferences for Proportions and Count Data
Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 12/15/2008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)
More informationA3. Statistical Inference Hypothesis Testing for General Population Parameters
Appendix / A3. Statistical Inference / General Parameters- A3. Statistical Inference Hypothesis Testing for General Population Parameters POPULATION H 0 : θ = θ 0 θ is a generic parameter of interest (e.g.,
More informationHigh-Throughput Sequencing Course
High-Throughput Sequencing Course DESeq Model for RNA-Seq Biostatistics and Bioinformatics Summer 2017 Outline Review: Standard linear regression model (e.g., to model gene expression as function of an
More informationOverrunning in Clinical Trials: a Methodological Review
Overrunning in Clinical Trials: a Methodological Review Dario Gregori Unit of Biostatistics, Epidemiology and Public Health Department of Cardiac, Thoracic and Vascular Sciences dario.gregori@unipd.it
More informationdf=degrees of freedom = n - 1
One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:
More informationIntroduction to Inferential Statistics. Jaranit Kaewkungwal, Ph.D. Faculty of Tropical Medicine Mahidol University
Introduction to Inferential Statistics Jaranit Kaewkungwal, Ph.D. Faculty of Tropical Medicine Mahidol University 1 Data & Variables 2 Types of Data QUALITATIVE Data expressed by type Data that has been
More informationFormulas and Tables. for Elementary Statistics, Tenth Edition, by Mario F. Triola Copyright 2006 Pearson Education, Inc. ˆp E p ˆp E Proportion
Formulas and Tables for Elementary Statistics, Tenth Edition, by Mario F. Triola Copyright 2006 Pearson Education, Inc. Ch. 3: Descriptive Statistics x Sf. x x Sf Mean S(x 2 x) 2 s Å n 2 1 n(sx 2 ) 2 (Sx)
More informationCorrelation and the Analysis of Variance Approach to Simple Linear Regression
Correlation and the Analysis of Variance Approach to Simple Linear Regression Biometry 755 Spring 2009 Correlation and the Analysis of Variance Approach to Simple Linear Regression p. 1/35 Correlation
More informationAnalysis of Categorical Data. Nick Jackson University of Southern California Department of Psychology 10/11/2013
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/2013 1 Overview Data Types Contingency Tables Logit Models Binomial Ordinal Nominal 2 Things not
More informationPubH 7405: REGRESSION ANALYSIS INTRODUCTION TO LOGISTIC REGRESSION
PubH 745: REGRESSION ANALYSIS INTRODUCTION TO LOGISTIC REGRESSION Let Y be the Dependent Variable Y taking on values and, and: π Pr(Y) Y is said to have the Bernouilli distribution (Binomial with n ).
More informationUnit 9: Inferences for Proportions and Count Data
Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 1/15/008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)
More informationLecture 12: Effect modification, and confounding in logistic regression
Lecture 12: Effect modification, and confounding in logistic regression Ani Manichaikul amanicha@jhsph.edu 4 May 2007 Today Categorical predictor create dummy variables just like for linear regression
More informationBayes Factor Single Arm Time-to-event User s Guide (Version 1.0.0)
Bayes Factor Single Arm Time-to-event User s Guide (Version 1.0.0) Department of Biostatistics P. O. Box 301402, Unit 1409 The University of Texas, M. D. Anderson Cancer Center Houston, Texas 77230-1402,
More informationStat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010
1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of
More informationCHL 5225H Advanced Statistical Methods for Clinical Trials: Multiplicity
CHL 5225H Advanced Statistical Methods for Clinical Trials: Multiplicity Prof. Kevin E. Thorpe Dept. of Public Health Sciences University of Toronto Objectives 1. Be able to distinguish among the various
More informationBIOMETRICS INFORMATION
BIOMETRICS INFORMATION Index of Pamphlet Topics (for pamphlets #1 to #60) as of December, 2000 Adjusted R-square ANCOVA: Analysis of Covariance 13: ANCOVA: Analysis of Covariance ANOVA: Analysis of Variance
More informationADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables
ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES Cox s regression analysis Time dependent explanatory variables Henrik Ravn Bandim Health Project, Statens Serum Institut 4 November 2011 1 / 53
More informationParametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami
Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous
More informationE509A: Principle of Biostatistics. GY Zou
E509A: Principle of Biostatistics (Week 4: Inference for a single mean ) GY Zou gzou@srobarts.ca Example 5.4. (p. 183). A random sample of n =16, Mean I.Q is 106 with standard deviation S =12.4. What
More informationSPSS Guide For MMI 409
SPSS Guide For MMI 409 by John Wong March 2012 Preface Hopefully, this document can provide some guidance to MMI 409 students on how to use SPSS to solve many of the problems covered in the D Agostino
More informationAnalysis of 2x2 Cross-Over Designs using T-Tests
Chapter 234 Analysis of 2x2 Cross-Over Designs using T-Tests Introduction This procedure analyzes data from a two-treatment, two-period (2x2) cross-over design. The response is assumed to be a continuous
More informationLecture 25: Models for Matched Pairs
Lecture 25: Models for Matched Pairs Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina Lecture
More informationOutline The Rank-Sum Test Procedure Paired Data Comparing Two Variances Lab 8: Hypothesis Testing with R. Week 13 Comparing Two Populations, Part II
Week 13 Comparing Two Populations, Part II Week 13 Objectives Coverage of the topic of comparing two population continues with new procedures and a new sampling design. The week concludes with a lab session.
More informationExperimental Design and Data Analysis for Biologists
Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1
More informationMeasuring relationships among multiple responses
Measuring relationships among multiple responses Linear association (correlation, relatedness, shared information) between pair-wise responses is an important property used in almost all multivariate analyses.
More informationDescriptive Statistics-I. Dr Mahmoud Alhussami
Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.
More information