Power and Sample Size Bios 662

Size: px
Start display at page:

Download "Power and Sample Size Bios 662"

Transcription

1 Power and Sample Size Bios 662 Michael G. Hudgens, Ph.D. mhudgens :06 BIOS Power and Sample Size

2 Outline Introduction One sample: continuous outcome, Z test continuous outcome, t test binary outcome, Z test binary outcome, exact test BIOS Power and Sample Size

3 Introduction Read Chs 5.8 and of text In a test of a hypothesis, we are testing whether some population parameter has a particular value H 0 : θ = θ 0, where θ 0 is a known constant Generally, H A : θ θ 0 BIOS Power and Sample Size

4 Introduction Once the data are collected, we will compute a statistic related to θ, say S(ˆθ) S(ˆθ) is a random variable, since it is computed from a sample (and hence it has a probability distribution) BIOS Power and Sample Size

5 Power Pr[Type I error] = α = Pr[S(ˆθ) C α H 0 ] Pr[Type II error] = β = Pr[S(ˆθ) / C α H A ] P ower = 1 β = Pr[S(ˆθ) C α H A ] BIOS Power and Sample Size

6 One sample Z test Example: One sample test Study: collect data on continuous outcome (Y ) on N individuals E(Y ) = µ, V (Y ) = σ 2 H 0 : µ = µ 0 vs H A : µ > µ 0 S(ˆθ) = Z = Ȳ µ 0 σ/ N BIOS Power and Sample Size

7 One sample Z test Pr[S(ˆθ) C α H 0 ] = Pr[Z > z 1 α H 0 ] = α Pr[S(ˆθ) C α H A ] = Pr[Z > z 1 α H A ] = 1 β Choose a value µ A H A Q: what sample size do we need to detect this alternative with 1 β power? BIOS Power and Sample Size

8 One sample Z test Under H A : µ = µ A Note Thus Z = Y µ A σ/ N N(0, 1) Z + µ A µ 0 σ/ N = Y µ 0 σ/ N = Z Z N ( ) µ A µ 0 σ/ N, 1 BIOS Power and Sample Size

9 Distribution of Z-statistic under H 0 and H A Density H 0 :µ=µ 0 H A :µ=µ 0 +3σ N Pr(Z>1.65 Ha) Pr(Z>1.65 Ho) Z BIOS Power and Sample Size

10 Power is given by One sample Z test 1 β = Pr[Z > z 1 α µ = µ A ] Therefore = Pr[Z > z 1 α + z 1 α + N(µ0 µ A ) σ N(µ0 µ A ) σ µ = µ A ] = z β = z 1 β BIOS Power and Sample Size

11 Equivalently For a two sided test One sample Z test N = σ2 (z 1 α + z 1 β ) 2 (µ 0 µ A ) 2 N = σ2 (z 1 α/2 + z 1 β ) 2 (µ 0 µ A ) 2 BIOS Power and Sample Size

12 Equivalent form One sample Z test N = ( z1 α/2 + z 1 β ) 2 where = µ 0 µ A σ is called the standardized distance or difference BIOS Power and Sample Size

13 One sample Z test To apply this formula, we need to know α, β, µ 0 µ A, and σ 2 Example: a study to determine the effect of a drug on blood pressure. BP measured, drug administered, and BP measured 2 hours later Y = (BP after BP before ) H 0 : µ = 0 vs H A : µ 0 BIOS Power and Sample Size

14 One sample Z test Choose α = 0.05, 1 β = 0.9 Estimate of σ 2 : need data from literature or pilot study; note we need variance of difference after-before The choice of µ A is a subject matter decision. In example: a drug that changes BP only 1 mmhg would not be of practical importance, but a drug that changes BP 5 mmhg might be of interest BIOS Power and Sample Size

15 One sample Z test Suppose α = 0.05, 1 β =.9, σ 2 = 225, and µ A = 5 225( )2 N = 5 2 = Often compute N for various different values of α, β, σ 2, and µ A Note ( ) , so that N BIOS Power and Sample Size

16 One sample Z test α 1 β σ 2 µ A N α 1 β σ 2 µ A N BIOS Power and Sample Size

17 One sample Z test α N 1 β N σ 2 N µ 0 µ A N BIOS Power and Sample Size

18 One sample Z test Sometimes N is fixed and we estimate the power of the test N = σ2 (z 1 α/2 + z 1 β ) 2 (µ 0 µ A ) 2 z 1 β = µ 0 µ A N z σ 1 α/2 { µ 1 β = Φ 0 µ A } N z σ 1 α/2 BIOS Power and Sample Size

19 One sample Z test Investigator says there are only 50 patients, α =.05, σ 2 = 225, µ 0 µ A = 5 z 1 β = = β = BIOS Power and Sample Size

20 One sample t test In practice, σ is not know, so we use a t test instead of a Z test Results above should be viewed as approximations in this case Power of t test is as follows BIOS Power and Sample Size

21 One sample t test Need the following result: if U N(λ, 1) and V χ 2 ν where U V, then U V/ν t ν,λ i.e., a non-central t distribution with df ν and noncentrality parameter λ BIOS Power and Sample Size

22 One sample t test Consider H 0 : µ = 0 versus H A : µ = µ A for µ A 0 Under H A, Thus Recall Y N(µ A, σ 2 /N) Y N σ (N 1)s 2 σ 2 N(µ A N σ, 1) χ 2 N 1 BIOS Power and Sample Size

23 Since Y s 2, One sample t test T = where λ = µ A N/σ Y s/ N t N 1,λ So power of two-sided t-test for µ A > 0 where T t N 1,µA N/σ Pr[T t N 1,0;1 α/2 ] BIOS Power and Sample Size

24 One sample t test: R # by hand > 1-pt(qt(.975,49), 49, 5/15*sqrt(50)) [1] > power.t.test(n=50, sd=15, delta=5, type="one.sample") One-sample t test power calculation n = 50 delta = 5 sd = 15 sig.level = 0.05 power = alternative = two.sided BIOS Power and Sample Size

25 proc power; onesamplemeans mean = 5 ntotal = 50 stddev = 15 power =.; run; One sample t test: SAS The POWER Procedure One-sample t Test for Mean Fixed Scenario Elements Distribution Normal Method Exact Mean 5 Standard Deviation 15 Total Sample Size 50 Number of Sides 2 Null Mean 0 Alpha 0.05 Computed Power Power BIOS Power and Sample Size

26 Null hypothesis One sample Z test: Binary Outcome H 0 : π = π 0 Test statistic Z = ˆp π 0 π0 (1 π 0 )/N Sample size formula ( z 1 α/2 π0 (1 π 0 ) + z 1 β πa (1 π A ) N = (π A π 0 ) 2 ) 2 BIOS Power and Sample Size

27 One sample Z test: Binary Outcome Example: Study of risk of breast cancer in sibs. Prevalence in general population of women years old: 2% Sample sisters of women with breast cancer H 0 : π = 0.02 vs H A : π.02 BIOS Power and Sample Size

28 One sample Z test: Binary Outcome Suppose α =.05, 1 β =.9, π A =.05 Then N = ( (.98) ) 2.05(.95) (.05.02) 2 = BIOS Power and Sample Size

29 One sample Z test with binary outcome: SAS proc power; onesamplefreq test = z method = normal nullp =.02 p =.05 power =.9 ntotal =.; run; The POWER Procedure Z Test for Binomial Proportion Fixed Scenario Elements Method Normal approximation Null Proportion 0.02 Binomial Proportion 0.05 Nominal Power 0.9 Number of Sides 2 Alpha 0.05 Actual N Power Total BIOS Power and Sample Size

30 One sample exact test: binary outcome What is power of exact test? Let Y, the number of successes, be test statistic. Under H 0, Y Binomial(N, π 0 ) Under H A, Y Binomial(N, π A ) Power Pr[Y y 1 α/2 π = π A ] + Pr[Y y α/2 π = π A ] where y α/2 and y 1 α/2 determined as in Counting Data slides BIOS Power and Sample Size

31 One sample exact test: binary outcome Exact example. Suppose N = 20, π 0 =.2, π A =.5, α = What is exact power of two-sided test? Based on CDF of Binomial(20,.2), choose y α/2 = 0 and y 1 α/2 = 9 such that power equals Pr[Y 9 π =.5] + Pr[Y 0 π =.5] = BIOS Power and Sample Size

32 One sample exact test, binary outcome: SAS proc power; onesamplefreq test = exact method = exact nullp =.2 p =.5 power =. ntotal = 20; run; Computed Power Lower Upper Crit Crit Actual Val Val Alpha Power BIOS Power and Sample Size

E509A: Principle of Biostatistics. GY Zou

E509A: Principle of Biostatistics. GY Zou E509A: Principle of Biostatistics (Week 4: Inference for a single mean ) GY Zou gzou@srobarts.ca Example 5.4. (p. 183). A random sample of n =16, Mean I.Q is 106 with standard deviation S =12.4. What

More information

Point and Interval Estimation II Bios 662

Point and Interval Estimation II Bios 662 Point and Interval Estimation II Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2006-09-13 17:17 BIOS 662 1 Point and Interval Estimation II Nonparametric CI

More information

Power and the computation of sample size

Power and the computation of sample size 9 Power and the computation of sample size A statistical test will not be able to detect a true difference if the sample size is too small compared with the magnitude of the difference. When designing

More information

Sample size and power calculation using R and SAS proc power. Ho Kim GSPH, SNU

Sample size and power calculation using R and SAS proc power. Ho Kim GSPH, SNU Sample size and power calculation using R and SAS proc power Ho Kim GSPH, SNU Pvalue (1) We want to show that the means of two populations are different! Y 1 a sample mean from the 1st pop Y 2 a sample

More information

Analysis of Variance Bios 662

Analysis of Variance Bios 662 Analysis of Variance Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-10-21 13:34 BIOS 662 1 ANOVA Outline Introduction Alternative models SS decomposition

More information

STAT 430 (Fall 2017): Tutorial 2

STAT 430 (Fall 2017): Tutorial 2 STAT 430 (Fall 2017): Tutorial 2 A review of statistical power analysis Luyao Lin September 19/21, 2017 Department Statistics and Actuarial Science, Simon Fraser University Hypothesis Testing A statistical

More information

BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings

BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings Yujin Chung October 4th, 2016 Fall 2016 Yujin Chung Lec6: Statistical hypothesis testings Fall 2016 1/30 Previous Two types of statistical

More information

Topic 19 Extensions on the Likelihood Ratio

Topic 19 Extensions on the Likelihood Ratio Topic 19 Extensions on the Likelihood Ratio Two-Sided Tests 1 / 12 Outline Overview Normal Observations Power Analysis 2 / 12 Overview The likelihood ratio test is a popular choice for composite hypothesis

More information

Analysis of Variance II Bios 662

Analysis of Variance II Bios 662 Analysis of Variance II Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-10-24 17:21 BIOS 662 1 ANOVA II Outline Multiple Comparisons Scheffe Tukey Bonferroni

More information

Power and sample size calculations

Power and sample size calculations Power and sample size calculations Susanne Rosthøj Biostatistisk Afdeling Institut for Folkesundhedsvidenskab Københavns Universitet sr@biostat.ku.dk April 8, 2014 Planning an investigation How many individuals

More information

SAMPLING BIOS 662. Michael G. Hudgens, Ph.D. mhudgens :55. BIOS Sampling

SAMPLING BIOS 662. Michael G. Hudgens, Ph.D.   mhudgens :55. BIOS Sampling SAMPLIG BIOS 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-11-14 15:55 BIOS 662 1 Sampling Outline Preliminaries Simple random sampling Population mean Population

More information

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA Sample Size and Power I: Binary Outcomes James Ware, PhD Harvard School of Public Health Boston, MA Sample Size and Power Principles: Sample size calculations are an essential part of study design Consider

More information

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Sample Size/Power Calculation by Software/Online Calculators

Sample Size/Power Calculation by Software/Online Calculators Sample Size/Power Calculation by Software/Online Calculators May 24, 2018 Li Zhang, Ph.D. li.zhang@ucsf.edu Associate Professor Department of Epidemiology and Biostatistics Division of Hematology and Oncology

More information

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and

More information

Power Analysis. Introduction to Power

Power Analysis. Introduction to Power Power Analysis Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning When testing a specific null hypothesis (H 0 ), we

More information

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y

More information

a Sample By:Dr.Hoseyn Falahzadeh 1

a Sample By:Dr.Hoseyn Falahzadeh 1 In the name of God Determining ee the esize eof a Sample By:Dr.Hoseyn Falahzadeh 1 Sample Accuracy Sample accuracy: refers to how close a random sample s statistic is to the true population s value it

More information

557: MATHEMATICAL STATISTICS II HYPOTHESIS TESTING: EXAMPLES

557: MATHEMATICAL STATISTICS II HYPOTHESIS TESTING: EXAMPLES 557: MATHEMATICAL STATISTICS II HYPOTHESIS TESTING: EXAMPLES Example Suppose that X,..., X n N, ). To test H 0 : 0 H : the most powerful test at level α is based on the statistic λx) f π) X x ) n/ exp

More information

SAMPLING II BIOS 662. Michael G. Hudgens, Ph.D. mhudgens :37. BIOS Sampling II

SAMPLING II BIOS 662. Michael G. Hudgens, Ph.D.  mhudgens :37. BIOS Sampling II SAMPLING II BIOS 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-11-17 14:37 BIOS 662 1 Sampling II Outline Stratified sampling Introduction Notation and Estimands

More information

Welcome! Webinar Biostatistics: sample size & power. Thursday, April 26, 12:30 1:30 pm (NDT)

Welcome! Webinar Biostatistics: sample size & power. Thursday, April 26, 12:30 1:30 pm (NDT) . Welcome! Webinar Biostatistics: sample size & power Thursday, April 26, 12:30 1:30 pm (NDT) Get started now: Please check if your speakers are working and mute your audio. Please use the chat box to

More information

Paired comparisons. We assume that

Paired comparisons. We assume that To compare to methods, A and B, one can collect a sample of n pairs of observations. Pair i provides two measurements, Y Ai and Y Bi, one for each method: If we want to compare a reaction of patients to

More information

The t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18

The t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18 and t-tests Patrick Breheny October 15 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18 Introduction The t-test As we discussed previously, W.S. Gossett derived the t-distribution as a way of

More information

Economics 583: Econometric Theory I A Primer on Asymptotics: Hypothesis Testing

Economics 583: Econometric Theory I A Primer on Asymptotics: Hypothesis Testing Economics 583: Econometric Theory I A Primer on Asymptotics: Hypothesis Testing Eric Zivot October 12, 2011 Hypothesis Testing 1. Specify hypothesis to be tested H 0 : null hypothesis versus. H 1 : alternative

More information

Inference for Binomial Parameters

Inference for Binomial Parameters Inference for Binomial Parameters Dipankar Bandyopadhyay, Ph.D. Department of Biostatistics, Virginia Commonwealth University D. Bandyopadhyay (VCU) BIOS 625: Categorical Data & GLM 1 / 58 Inference for

More information

Factorial Analysis of Variance

Factorial Analysis of Variance Factorial Analysis of Variance Overview of the Factorial ANOVA In the context of ANOVA, an independent variable (or a quasiindependent variable) is called a factor, and research studies with multiple factors,

More information

Sampling Distribution: Week 6

Sampling Distribution: Week 6 Sampling Distribution: Week 6 Kwonsang Lee University of Pennsylvania kwonlee@wharton.upenn.edu February 27, 2015 Kwonsang Lee STAT111 February 27, 2015 1 / 16 Sampling Distribution: Sample Mean If X 1,

More information

HYPOTHESIS TESTING. Hypothesis Testing

HYPOTHESIS TESTING. Hypothesis Testing MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.

More information

PubH 5450 Biostatistics I Prof. Carlin. Lecture 13

PubH 5450 Biostatistics I Prof. Carlin. Lecture 13 PubH 5450 Biostatistics I Prof. Carlin Lecture 13 Outline Outline Sample Size Counts, Rates and Proportions Part I Sample Size Type I Error and Power Type I error rate: probability of rejecting the null

More information

Discrete Multivariate Statistics

Discrete Multivariate Statistics Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are

More information

Power. January 12, 2019

Power. January 12, 2019 Power January 12, 2019 Contents Definition of power Z-test example If H 0 is true If H 0 is false The 2x2 matrix of All Things That Can Happen Pr(Type I error) = α Pr(Type II error) = β power = 1 β Things

More information

Answer Key for STAT 200B HW No. 7

Answer Key for STAT 200B HW No. 7 Answer Key for STAT 200B HW No. 7 May 5, 2007 Problem 2.2 p. 649 Assuming binomial 2-sample model ˆπ =.75, ˆπ 2 =.6. a ˆτ = ˆπ 2 ˆπ =.5. From Ex. 2.5a on page 644: ˆπ ˆπ + ˆπ 2 ˆπ 2.75.25.6.4 = + =.087;

More information

Chapter 8 of Devore , H 1 :

Chapter 8 of Devore , H 1 : Chapter 8 of Devore TESTING A STATISTICAL HYPOTHESIS Maghsoodloo A statistical hypothesis is an assumption about the frequency function(s) (i.e., PDF or pdf) of one or more random variables. Stated in

More information

Basic Statistics and Probability Chapter 9: Inferences Based on Two Samples: Confidence Intervals and Tests of Hypotheses

Basic Statistics and Probability Chapter 9: Inferences Based on Two Samples: Confidence Intervals and Tests of Hypotheses Basic Statistics and Probability Chapter 9: Inferences Based on Two Samples: Confidence Intervals and Tests of Hypotheses Identifying the Target Parameter Comparing Two Population Means: Independent Sampling

More information

7 Estimation. 7.1 Population and Sample (P.91-92)

7 Estimation. 7.1 Population and Sample (P.91-92) 7 Estimation MATH1015 Biostatistics Week 7 7.1 Population and Sample (P.91-92) Suppose that we wish to study a particular health problem in Australia, for example, the average serum cholesterol level for

More information

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2)

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2) Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2) B.H. Robbins Scholars Series June 23, 2010 1 / 29 Outline Z-test χ 2 -test Confidence Interval Sample size and power Relative effect

More information

Sample Size Estimation for Studies of High-Dimensional Data

Sample Size Estimation for Studies of High-Dimensional Data Sample Size Estimation for Studies of High-Dimensional Data James J. Chen, Ph.D. National Center for Toxicological Research Food and Drug Administration June 3, 2009 China Medical University Taichung,

More information

Comparison of Two Population Means

Comparison of Two Population Means Comparison of Two Population Means Esra Akdeniz March 15, 2015 Independent versus Dependent (paired) Samples We have independent samples if we perform an experiment in two unrelated populations. We have

More information

BIOS 312: Precision of Statistical Inference

BIOS 312: Precision of Statistical Inference and Power/Sample Size and Standard Errors BIOS 312: of Statistical Inference Chris Slaughter Department of Biostatistics, Vanderbilt University School of Medicine January 3, 2013 Outline Overview and Power/Sample

More information

POWER FOR COMPARING TWO PROPORTIONS WITH INDEPENDENT SAMPLES

POWER FOR COMPARING TWO PROPORTIONS WITH INDEPENDENT SAMPLES This handout covers material found in Section 0.5 of the text. POWER FOR COMPARING TWO PROPORTIONS WITH INDEPENDENT SAMPLES EXAMPLE: Otolaryngology (Example 0.3 of your text, page 405). Suppose a study

More information

COMPARING GROUPS PART 1CONTINUOUS DATA

COMPARING GROUPS PART 1CONTINUOUS DATA COMPARING GROUPS PART 1CONTINUOUS DATA Min Chen, Ph.D. Assistant Professor Quantitative Biomedical Research Center Department of Clinical Sciences Bioinformatics Shared Resource Simmons Comprehensive Cancer

More information

Regression techniques provide statistical analysis of relationships. Research designs may be classified as experimental or observational; regression

Regression techniques provide statistical analysis of relationships. Research designs may be classified as experimental or observational; regression LOGISTIC REGRESSION Regression techniques provide statistical analysis of relationships. Research designs may be classified as eperimental or observational; regression analyses are applicable to both types.

More information

Two Sample Problems. Two sample problems

Two Sample Problems. Two sample problems Two Sample Problems Two sample problems The goal of inference is to compare the responses in two groups. Each group is a sample from a different population. The responses in each group are independent

More information

Statistics for Business and Economics

Statistics for Business and Economics Statistics for Business and Economics Chapter 6 Sampling and Sampling Distributions Ch. 6-1 6.1 Tools of Business Statistics n Descriptive statistics n Collecting, presenting, and describing data n Inferential

More information

Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption

Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption Two-sample t-tests. - Independent samples - Pooled standard devation - The equal variance assumption Last time, we used the mean of one sample to test against the hypothesis that the true mean was a particular

More information

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval Epidemiology 9509 Principles of Biostatistics Chapter 10 - Inferences about John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. differences in

More information

CSE 103 Homework 8: Solutions November 30, var(x) = np(1 p) = P r( X ) 0.95 P r( X ) 0.

CSE 103 Homework 8: Solutions November 30, var(x) = np(1 p) = P r( X ) 0.95 P r( X ) 0. () () a. X is a binomial distribution with n = 000, p = /6 b. The expected value, variance, and standard deviation of X is: E(X) = np = 000 = 000 6 var(x) = np( p) = 000 5 6 666 stdev(x) = np( p) = 000

More information

High-Throughput Sequencing Course

High-Throughput Sequencing Course High-Throughput Sequencing Course DESeq Model for RNA-Seq Biostatistics and Bioinformatics Summer 2017 Outline Review: Standard linear regression model (e.g., to model gene expression as function of an

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

Tests for Population Proportion(s)

Tests for Population Proportion(s) Tests for Population Proportion(s) Esra Akdeniz April 6th, 2016 Motivation We are interested in estimating the prevalence rate of breast cancer among 50- to 54-year-old women whose mothers have had breast

More information

Power of a test. Hypothesis testing

Power of a test. Hypothesis testing Hypothesis testing February 11, 2014 Debdeep Pati Power of a test 1. Assuming standard deviation is known. Calculate power based on one-sample z test. A new drug is proposed for people with high intraocular

More information

Lecture 01: Introduction

Lecture 01: Introduction Lecture 01: Introduction Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina Lecture 01: Introduction

More information

Logistic Regression. Interpretation of linear regression. Other types of outcomes. 0-1 response variable: Wound infection. Usual linear regression

Logistic Regression. Interpretation of linear regression. Other types of outcomes. 0-1 response variable: Wound infection. Usual linear regression Logistic Regression Usual linear regression (repetition) y i = b 0 + b 1 x 1i + b 2 x 2i + e i, e i N(0,σ 2 ) or: y i N(b 0 + b 1 x 1i + b 2 x 2i,σ 2 ) Example (DGA, p. 336): E(PEmax) = 47.355 + 1.024

More information

10: Crosstabs & Independent Proportions

10: Crosstabs & Independent Proportions 10: Crosstabs & Independent Proportions p. 10.1 P Background < Two independent groups < Binary outcome < Compare binomial proportions P Illustrative example ( oswege.sav ) < Food poisoning following church

More information

On Assumptions. On Assumptions

On Assumptions. On Assumptions On Assumptions An overview Normality Independence Detection Stem-and-leaf plot Study design Normal scores plot Correction Transformation More complex models Nonparametric procedure e.g. time series Robustness

More information

Epidemiology Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures. John Koval

Epidemiology Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures. John Koval Epidemiology 9509 Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered

More information

Statistics 135 Fall 2008 Final Exam

Statistics 135 Fall 2008 Final Exam Name: SID: Statistics 135 Fall 2008 Final Exam Show your work. The number of points each question is worth is shown at the beginning of the question. There are 10 problems. 1. [2] The normal equations

More information

1 Statistical inference for a population mean

1 Statistical inference for a population mean 1 Statistical inference for a population mean 1. Inference for a large sample, known variance Suppose X 1,..., X n represents a large random sample of data from a population with unknown mean µ and known

More information

Sections 3.4 and 3.5

Sections 3.4 and 3.5 Sections 3.4 and 3.5 Shiwen Shen Department of Statistics University of South Carolina Elementary Statistics for the Biological and Life Sciences (STAT 205) Continuous variables So far we ve dealt with

More information

Mathematical statistics

Mathematical statistics November 15 th, 2018 Lecture 21: The two-sample t-test Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 14 Probability reviews Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation

More information

STA 303H1F: Two-way Analysis of Variance Practice Problems

STA 303H1F: Two-way Analysis of Variance Practice Problems STA 303H1F: Two-way Analysis of Variance Practice Problems 1. In the Pygmalion example from lecture, why are the average scores of the platoon used as the response variable, rather than the scores of the

More information

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Guosheng Yin Department of Statistics and Actuarial Science The University of Hong Kong Joint work with J. Xu PSI and RSS Journal

More information

Power and sample size calculations

Power and sample size calculations Power and sample size calculations Susanne Rosthøj Biostatistisk Afdeling Institut for Folkesundhedsvidenskab Københavns Universitet sr@biostat.ku.dk October 28 2013 Planning an investigation How many

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

Chapter 5 Formulas Distribution Formula Characteristics n. π is the probability Function. x trial and n is the. where x = 0, 1, 2, number of trials

Chapter 5 Formulas Distribution Formula Characteristics n. π is the probability Function. x trial and n is the. where x = 0, 1, 2, number of trials SPSS Program Notes Biostatistics: A Guide to Design, Analysis, and Discovery Second Edition by Ronald N. Forthofer, Eun Sul Lee, Mike Hernandez Chapter 5: Probability Distributions Chapter 5 Formulas Distribution

More information

Contents 1. Contents

Contents 1. Contents Contents 1 Contents 6 Distributions of Functions of Random Variables 2 6.1 Transformation of Discrete r.v.s............. 3 6.2 Method of Distribution Functions............. 6 6.3 Method of Transformations................

More information

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving

More information

T-tests for 2 Independent Means

T-tests for 2 Independent Means T-tests for 2 Independent Means February 22, 208 Contents t-test for 2 independent means Tutorial Example : one-tailed test for independent means, equal sample sizes Error Bars Example 2: two-tailed test

More information

T-tests for 2 Independent Means

T-tests for 2 Independent Means T-tests for 2 Independent Means January 15, 2019 Contents t-test for 2 independent means Tutorial Example 1: one-tailed test for independent means, equal sample sizes Error Bars Example 2: two-tailed test

More information

Power and sample size calculations

Power and sample size calculations Patrick Breheny October 20 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 26 Planning a study Introduction What is power? Why is it important? Setup One of the most important

More information

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses. 1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately

More information

Factorial Analysis of Variance

Factorial Analysis of Variance Factorial Analysis of Variance Conceptual Example A repeated-measures t-test is more likely to lead to rejection of the null hypothesis if a) *Subjects show considerable variability in their change scores.

More information

Continuous Probability Distributions

Continuous Probability Distributions 1 Chapter 5 Continuous Probability Distributions 5.1 Probability density function Example 5.1.1. Revisit Example 3.1.1. 11 12 13 14 15 16 21 22 23 24 25 26 S = 31 32 33 34 35 36 41 42 43 44 45 46 (5.1.1)

More information

STAT 513 fa 2018 Lec 02

STAT 513 fa 2018 Lec 02 STAT 513 fa 2018 Lec 02 Inference about the mean and variance of a Normal population Karl B. Gregory Fall 2018 Inference about the mean and variance of a Normal population Here we consider the case in

More information

CHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains:

CHAPTER 8. Test Procedures is a rule, based on sample data, for deciding whether to reject H 0 and contains: CHAPTER 8 Test of Hypotheses Based on a Single Sample Hypothesis testing is the method that decide which of two contradictory claims about the parameter is correct. Here the parameters of interest are

More information

Lecture 9. ANOVA: Random-effects model, sample size

Lecture 9. ANOVA: Random-effects model, sample size Lecture 9. ANOVA: Random-effects model, sample size Jesper Rydén Matematiska institutionen, Uppsala universitet jesper@math.uu.se Regressions and Analysis of Variance fall 2015 Fixed or random? Is it reasonable

More information

Solution: First note that the power function of the test is given as follows,

Solution: First note that the power function of the test is given as follows, Problem 4.5.8: Assume the life of a tire given by X is distributed N(θ, 5000 ) Past experience indicates that θ = 30000. The manufacturere claims the tires made by a new process have mean θ > 30000. Is

More information

Sample Size / Power Calculations

Sample Size / Power Calculations Sample Size / Power Calculations A Simple Example Goal: To study the effect of cold on blood pressure (mmhg) in rats Use a Completely Randomized Design (CRD): 12 rats are randomly assigned to one of two

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 3: Bivariate association : Categorical variables Proportion in one group One group is measured one time: z test Use the z distribution as an approximation to the binomial

More information

Asymptotic Statistics-VI. Changliang Zou

Asymptotic Statistics-VI. Changliang Zou Asymptotic Statistics-VI Changliang Zou Kolmogorov-Smirnov distance Example (Kolmogorov-Smirnov confidence intervals) We know given α (0, 1), there is a well-defined d = d α,n such that, for any continuous

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

1; (f) H 0 : = 55 db, H 1 : < 55.

1; (f) H 0 : = 55 db, H 1 : < 55. Reference: Chapter 8 of J. L. Devore s 8 th Edition By S. Maghsoodloo TESTING a STATISTICAL HYPOTHESIS A statistical hypothesis is an assumption about the frequency function(s) (i.e., pmf or pdf) of one

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1.

(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1. Problem 1 (21 points) An economist runs the regression y i = β 0 + x 1i β 1 + x 2i β 2 + x 3i β 3 + ε i (1) The results are summarized in the following table: Equation 1. Variable Coefficient Std. Error

More information

Hypothesis Testing. ECE 3530 Spring Antonio Paiva

Hypothesis Testing. ECE 3530 Spring Antonio Paiva Hypothesis Testing ECE 3530 Spring 2010 Antonio Paiva What is hypothesis testing? A statistical hypothesis is an assertion or conjecture concerning one or more populations. To prove that a hypothesis is

More information

SAMPLING III BIOS 662

SAMPLING III BIOS 662 SAMPLIG III BIOS 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2009-08-11 09:52 BIOS 662 1 Sampling III Outline One-stage cluster sampling Systematic sampling Multi-stage

More information

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Fundamentals to Biostatistics Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Statistics collection, analysis, interpretation of data development of new

More information

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION

More information

Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018

Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Sampling A trait is measured on each member of a population. f(y) = propn of individuals in the popn with measurement

More information

Multivariate Statistical Analysis

Multivariate Statistical Analysis Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Addressing ourliers 1 Addressing ourliers 2 Outliers in Multivariate samples (1) For

More information

Tests for Two Correlated Proportions in a Matched Case- Control Design

Tests for Two Correlated Proportions in a Matched Case- Control Design Chapter 155 Tests for Two Correlated Proportions in a Matched Case- Control Design Introduction A 2-by-M case-control study investigates a risk factor relevant to the development of a disease. A population

More information

Comparing two independent samples

Comparing two independent samples In many applications it is necessary to compare two competing methods (for example, to compare treatment effects of a standard drug and an experimental drug). To compare two methods from statistical point

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

Chapter Six: Two Independent Samples Methods 1/51

Chapter Six: Two Independent Samples Methods 1/51 Chapter Six: Two Independent Samples Methods 1/51 6.3 Methods Related To Differences Between Proportions 2/51 Test For A Difference Between Proportions:Introduction Suppose a sampling distribution were

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

ph: 5.2, 5.6, 5.8, 6.4, 6.5, 6.8, 6.9, 7.2, 7.5 sample mean = sample sd = sample size, n = 9

ph: 5.2, 5.6, 5.8, 6.4, 6.5, 6.8, 6.9, 7.2, 7.5 sample mean = sample sd = sample size, n = 9 Name: SOLUTIONS Final Part 1 (100 pts) and Final Part 2 (120 pts) For all of the questions below, please show enough work that it is completely clear how your final solution was derived. Sit at least one

More information

One-sample categorical data: approximate inference

One-sample categorical data: approximate inference One-sample categorical data: approximate inference Patrick Breheny October 6 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction It is relatively easy to think about the distribution

More information

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical

More information

Using SAS Proc Power to Perform Model-based Power Analysis for Clinical Pharmacology Studies Peng Sun, Merck & Co., Inc.

Using SAS Proc Power to Perform Model-based Power Analysis for Clinical Pharmacology Studies Peng Sun, Merck & Co., Inc. PharmaSUG00 - Paper SP05 Using SAS Proc Power to Perform Model-based Power Analysis for Clinical Pharmacology Studies Peng Sun, Merck & Co., Inc., North Wales, PA ABSRAC In this paper, we demonstrate that

More information