PubH 5450 Biostatistics I Prof. Carlin. Lecture 13
|
|
- Anastasia Sparks
- 6 years ago
- Views:
Transcription
1 PubH 5450 Biostatistics I Prof. Carlin Lecture 13
2 Outline Outline Sample Size Counts, Rates and Proportions
3 Part I Sample Size
4 Type I Error and Power Type I error rate: probability of rejecting the null when the null is true a mistake!
5 Type I Error and Power Type I error rate: probability of rejecting the null when the null is true a mistake! Power: probability of rejecting the null when the alternative is true NOT a mistake!
6 Sample Size Calculation: Requirements 1. Distribution of the test statistic under the alternative (normal for two-sample t-tests.)
7 Sample Size Calculation: Requirements 1. Distribution of the test statistic under the alternative (normal for two-sample t-tests.) 2. Type I error rate: usually α = 0.05.
8 Sample Size Calculation: Requirements 1. Distribution of the test statistic under the alternative (normal for two-sample t-tests.) 2. Type I error rate: usually α = The (minimal) power: say, 1 β = 0.8.
9 Sample Size Calculation: Requirements 1. Distribution of the test statistic under the alternative (normal for two-sample t-tests.) 2. Type I error rate: usually α = The (minimal) power: say, 1 β = The (minimal) magnitude of the effect µ 1 µ 2 to be detected.
10 Sample Size Calculation: Requirements 1. Distribution of the test statistic under the alternative (normal for two-sample t-tests.) 2. Type I error rate: usually α = The (minimal) power: say, 1 β = The (minimal) magnitude of the effect µ 1 µ 2 to be detected. 5. Variability: σ 2 (if we can assume equal variances)
11 Sample Size for Two-Sample Tests n is a function of the standardized difference between the two populations: = µ 1 µ 2. σ
12 Sample Size for Two-Sample Tests n is a function of the standardized difference between the two populations: = µ 1 µ 2. σ For a two-sided test, the required sample size per group is n = 2( z 1 α/2 + z 1 β ) 2 2.
13 Sample Size for Two-Sample Tests n is a function of the standardized difference between the two populations: = µ 1 µ 2. σ For a two-sided test, the required sample size per group is n = 2( z 1 α/2 + z 1 β ) 2 2. (Rule of thumb) For α = 0.05 and 1 β = 0.8, n 16/ 2.
14 Notes on Sample Size Formula It assumes n 1 = n 2, which gives the best power when n 1 + n 2 is fixed.
15 Notes on Sample Size Formula It assumes n 1 = n 2, which gives the best power when n 1 + n 2 is fixed. To detect half the effect, the sample size needs to be quadrupled.
16 Notes on Sample Size Formula It assumes n 1 = n 2, which gives the best power when n 1 + n 2 is fixed. To detect half the effect, the sample size needs to be quadrupled. Rule of thumb: When σ 2 is estimated (from previous studies), add 1 to each group.
17 Sample Size for One-Sample Tests For one sample tests, the standardized difference is = µ µ 0. σ
18 Sample Size for One-Sample Tests For one sample tests, the standardized difference is = µ µ 0. σ For a two-sided test, the required sample size in the group is n = ( z1 α/2 + z 1 β ) 2 2.
19 Sample Size for One-Sample Tests For one sample tests, the standardized difference is = µ µ 0. σ For a two-sided test, the required sample size in the group is n = ( z1 α/2 + z 1 β ) 2 2. Rule of thumb: For α = 0.05 and 1 β = 0.8, n 8/ 2.
20 Notes on Sample Size for One-Sample Tests For a matched case-control study (paired, dependent samples), you still need 2n subjects.
21 Notes on Sample Size for One-Sample Tests For a matched case-control study (paired, dependent samples), you still need 2n subjects. That is still only half the sample size needed for an unmatched design (more variability in two independent groups need more samples)
22 Notes on Sample Size for One-Sample Tests For a matched case-control study (paired, dependent samples), you still need 2n subjects. That is still only half the sample size needed for an unmatched design (more variability in two independent groups need more samples) Rule of thumb: When σ 2 is estimated (from previous studies), add 2 to n.
23 One-Sided Tests When doing a sample size calculation for a one-sided test, replace z 1 α/2 by z 1 α in the formulae above.
24 One-Sided Tests When doing a sample size calculation for a one-sided test, replace z 1 α/2 by z 1 α in the formulae above. For α = 0.05, these are of course 1.96 and 1.645, respectively.
25 Unequal Sample Sizes In general, for a two-sample problem, when the total sample size n 1 + n 2 is fixed it is most efficient to have n 1 = n 2.
26 Unequal Sample Sizes In general, for a two-sample problem, when the total sample size n 1 + n 2 is fixed it is most efficient to have n 1 = n 2. Situations where unequal sample sizes should be considered:
27 Unequal Sample Sizes In general, for a two-sample problem, when the total sample size n 1 + n 2 is fixed it is most efficient to have n 1 = n 2. Situations where unequal sample sizes should be considered: One group of people is difficult to recruit.
28 Unequal Sample Sizes In general, for a two-sample problem, when the total sample size n 1 + n 2 is fixed it is most efficient to have n 1 = n 2. Situations where unequal sample sizes should be considered: One group of people is difficult to recruit. The costs of the two treatments are different.
29 Unequal Sample Sizes In general, for a two-sample problem, when the total sample size n 1 + n 2 is fixed it is most efficient to have n 1 = n 2. Situations where unequal sample sizes should be considered: One group of people is difficult to recruit. The costs of the two treatments are different. The variances of the two populations are different.
30 Counts, Rates and Proportions Part II Counts, Rates and Proportions
31 Counts, Rates and Proportions Binomial Distribution Refresher A binomial variable X with distribution B(n, p) can be interpreted as the total number of successes in n independent and identical Bernoulli trials with success probability p.
32 Counts, Rates and Proportions Binomial Distribution Refresher A binomial variable X with distribution B(n, p) can be interpreted as the total number of successes in n independent and identical Bernoulli trials with success probability p. The mean of X is np and its variance is np(1 p).
33 Counts, Rates and Proportions Binomial Distribution Refresher A binomial variable X with distribution B(n, p) can be interpreted as the total number of successes in n independent and identical Bernoulli trials with success probability p. The mean of X is np and its variance is np(1 p). ˆp = X /n is an estimator of p with variance (ˆp(1 ˆp))/n.
34 Counts, Rates and Proportions Binomial Distribution Refresher A binomial variable X with distribution B(n, p) can be interpreted as the total number of successes in n independent and identical Bernoulli trials with success probability p. The mean of X is np and its variance is np(1 p). ˆp = X /n is an estimator of p with variance (ˆp(1 ˆp))/n. The sampling probability (the mean of ˆp) and the population proportion (p) are equal only under simple random sampling.
35 Counts, Rates and Proportions What are these rates? Definitions The incidence of a disease is the number of new cases diagnosed during the time interval.
36 Counts, Rates and Proportions What are these rates? Definitions The incidence of a disease is the number of new cases diagnosed during the time interval. The prevalence of a disease is the number of individuals with the disease at a fixed time point.
37 Counts, Rates and Proportions Cautions in Comparing Proportions What are the numerators?
38 Counts, Rates and Proportions Cautions in Comparing Proportions What are the numerators? What are the denominators?
39 Counts, Rates and Proportions Confidence Intervals for Proportions Wilson s 95% CI: p ± 1.96 p(1 p) n + 4, where p = X + 2 n + 4.
40 Counts, Rates and Proportions Confidence Intervals for Proportions Wilson s 95% CI: where p ± 1.96 p(1 p) n + 4, p = X + 2 n + 4. This technique has a Bayesian interpretation: note it is as if we are adding two successes and two failures to the actual observed dataset.
41 Counts, Rates and Proportions Confidence Intervals for Proportions Wilson s 95% CI: where p ± 1.96 p(1 p) n + 4, p = X + 2 n + 4. This technique has a Bayesian interpretation: note it is as if we are adding two successes and two failures to the actual observed dataset. It is still more common to use the ordinary ˆp = X /n (instead of p) when all we want is a point estimate of p.
42 Counts, Rates and Proportions Rare Events Wilson s CI does not work very well when p is very close to 0 or 1: the result of our Bayesian prior belief that p is close to 1/2 (our fake preliminary data are balanced: 2 successes, 2 failures)
43 Counts, Rates and Proportions Rare Events Wilson s CI does not work very well when p is very close to 0 or 1: the result of our Bayesian prior belief that p is close to 1/2 (our fake preliminary data are balanced: 2 successes, 2 failures) The rule of threes : If in n trials, no success is observed, the estimated success probability is 0, with an approximate 95% upper bound 3 n.
44 Counts, Rates and Proportions Large-sample testing for a population proportion To test H 0 : p = p 0, use the z-statistic: where ˆp = X /n. z = ˆp p 0 p 0 (1 p 0 ) n,
45 Counts, Rates and Proportions Large-sample testing for a population proportion To test H 0 : p = p 0, use the z-statistic: where ˆp = X /n. z = ˆp p 0 p 0 (1 p 0 ) n Note that p 0 is used and Z has a standard normal distribution (when n is large, e.g., np 0 > 10 and n(1 p 0 ) > 10 or np 0 (1 p 0 ) > 5).,
46 Counts, Rates and Proportions Large-sample testing for a population proportion To test H 0 : p = p 0, use the z-statistic: where ˆp = X /n. z = ˆp p 0 p 0 (1 p 0 ) n Note that p 0 is used and Z has a standard normal distribution (when n is large, e.g., np 0 > 10 and n(1 p 0 ) > 10 or np 0 (1 p 0 ) > 5). The p-value again depends on H 1 : H 1 : p > p 0 use Pr(Z z) H 1 : p < p 0 use Pr(Z z) H 1 : p p 0 use Pr( Z z ) = 2 Pr(Z z ),
47 Counts, Rates and Proportions Choosing a sample size for a desired margin of error Recall the margin of error for our large-sample Wilson CI is z SE p = z p(1 p) n + 4 where typically z = 1.96, the upper.025 point of Z.
48 Counts, Rates and Proportions Choosing a sample size for a desired margin of error Recall the margin of error for our large-sample Wilson CI is z SE p = z p(1 p) n + 4 where typically z = 1.96, the upper.025 point of Z. When doing a sample size calculation, we must guess the value of p; call it p. We can either Use an estimate of p from an earlier, pilot study, or Use p = 0.5, since this will maximize the margin of error conservative! (safe regardless of what p turns out to be)
49 Counts, Rates and Proportions Choosing a sample size for a desired margin of error Recall the margin of error for our large-sample Wilson CI is z SE p = z p(1 p) n + 4 where typically z = 1.96, the upper.025 point of Z. When doing a sample size calculation, we must guess the value of p; call it p. We can either Use an estimate of p from an earlier, pilot study, or Use p = 0.5, since this will maximize the margin of error conservative! (safe regardless of what p turns out to be) Using the conservative p, the required sample size is ( ) z 2 n = 4, 2m provided this number is still positive!
Outline. PubH 5450 Biostatistics I Prof. Carlin. Confidence Interval for the Mean. Part I. Reviews
Outline Outline PubH 5450 Biostatistics I Prof. Carlin Lecture 11 Confidence Interval for the Mean Known σ (population standard deviation): Part I Reviews σ x ± z 1 α/2 n Small n, normal population. Large
More information1 Statistical inference for a population mean
1 Statistical inference for a population mean 1. Inference for a large sample, known variance Suppose X 1,..., X n represents a large random sample of data from a population with unknown mean µ and known
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More informationEpidemiology Wonders of Biostatistics Chapter 11 (continued) - probability in a single population. John Koval
Epidemiology 9509 Wonders of Biostatistics Chapter 11 (continued) - probability in a single population John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being
More informationBIO5312 Biostatistics Lecture 6: Statistical hypothesis testings
BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings Yujin Chung October 4th, 2016 Fall 2016 Yujin Chung Lec6: Statistical hypothesis testings Fall 2016 1/30 Previous Two types of statistical
More informationTopic 16 Interval Estimation
Topic 16 Interval Estimation Additional Topics 1 / 9 Outline Linear Regression Interpretation of the Confidence Interval 2 / 9 Linear Regression For ordinary linear regression, we have given least squares
More informationChapter 9 Inferences from Two Samples
Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review
More informationUnit 9: Inferences for Proportions and Count Data
Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 12/15/2008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)
More informationTUTORIAL 8 SOLUTIONS #
TUTORIAL 8 SOLUTIONS #9.11.21 Suppose that a single observation X is taken from a uniform density on [0,θ], and consider testing H 0 : θ = 1 versus H 1 : θ =2. (a) Find a test that has significance level
More informationUnit 9: Inferences for Proportions and Count Data
Unit 9: Inferences for Proportions and Count Data Statistics 571: Statistical Methods Ramón V. León 1/15/008 Unit 9 - Stat 571 - Ramón V. León 1 Large Sample Confidence Interval for Proportion ( pˆ p)
More informationPubh 8482: Sequential Analysis
Pubh 8482: Sequential Analysis Joseph S. Koopmeiners Division of Biostatistics University of Minnesota Week 12 Review So far... We have discussed the role of phase III clinical trials in drug development
More informationBIOS 6222: Biostatistics II. Outline. Course Presentation. Course Presentation. Review of Basic Concepts. Why Nonparametrics.
BIOS 6222: Biostatistics II Instructors: Qingzhao Yu Don Mercante Cruz Velasco 1 Outline Course Presentation Review of Basic Concepts Why Nonparametrics The sign test 2 Course Presentation Contents Justification
More informationHypothesis Testing. ECE 3530 Spring Antonio Paiva
Hypothesis Testing ECE 3530 Spring 2010 Antonio Paiva What is hypothesis testing? A statistical hypothesis is an assertion or conjecture concerning one or more populations. To prove that a hypothesis is
More informationThe Components of a Statistical Hypothesis Testing Problem
Statistical Inference: Recall from chapter 5 that statistical inference is the use of a subset of a population (the sample) to draw conclusions about the entire population. In chapter 5 we studied one
More informationSTAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).
STAT 515 -- Chapter 13: Categorical Data Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure). Many studies allow for more than 2 categories. Example
More informationHypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2)
Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2) B.H. Robbins Scholars Series June 23, 2010 1 / 29 Outline Z-test χ 2 -test Confidence Interval Sample size and power Relative effect
More informationLecture 3: Measures of effect: Risk Difference Attributable Fraction Risk Ratio and Odds Ratio
Lecture 3: Measures of effect: Risk Difference Attributable Fraction Risk Ratio and Odds Ratio Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK March 3-5,
More informationBinomial and Poisson Probability Distributions
Binomial and Poisson Probability Distributions Esra Akdeniz March 3, 2016 Bernoulli Random Variable Any random variable whose only possible values are 0 or 1 is called a Bernoulli random variable. What
More informationPump failure data. Pump Failures Time
Outline 1. Poisson distribution 2. Tests of hypothesis for a single Poisson mean 3. Comparing multiple Poisson means 4. Likelihood equivalence with exponential model Pump failure data Pump 1 2 3 4 5 Failures
More informationLecture 11 - Tests of Proportions
Lecture 11 - Tests of Proportions Statistics 102 Colin Rundel February 27, 2013 Research Project Research Project Proposal - Due Friday March 29th at 5 pm Introduction, Data Plan Data Project - Due Friday,
More informationLecture 01: Introduction
Lecture 01: Introduction Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina Lecture 01: Introduction
More informationMargin of Error for Proportions
for Proportions Gene Quinn for Proportions p.1/8 An interval estimate for a population proportion p is often reported not as a confidence interval, but as a margin of error. for Proportions p.2/8 An interval
More informationLecture 9 Two-Sample Test. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech
Lecture 9 Two-Sample Test Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech Computer exam 1 18 Histogram 14 Frequency 9 5 0 75 83.33333333
More informationSection Inference for a Single Proportion
Section 8.1 - Inference for a Single Proportion Statistics 104 Autumn 2004 Copyright c 2004 by Mark E. Irwin Inference for a Single Proportion For most of what follows, we will be making two assumptions
More informationQUEEN S UNIVERSITY FINAL EXAMINATION FACULTY OF ARTS AND SCIENCE DEPARTMENT OF ECONOMICS APRIL 2018
Page 1 of 4 QUEEN S UNIVERSITY FINAL EXAMINATION FACULTY OF ARTS AND SCIENCE DEPARTMENT OF ECONOMICS APRIL 2018 ECONOMICS 250 Introduction to Statistics Instructor: Gregor Smith Instructions: The exam
More informationLecture 6: Point Estimation and Large Sample Confidence Intervals. Readings: Sections
Lecture 6: Point Estimation and Large Sample Confidence Intervals Readings: Sections 7.1-7.3 1 Point Estimation Objective of point estimation: use a sample to compute a number that represents in some sense
More informationFrequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<=
A frequency distribution is a kind of probability distribution. It gives the frequency or relative frequency at which given values have been observed among the data collected. For example, for age, Frequency
More informationIntroduction to Bayesian Learning. Machine Learning Fall 2018
Introduction to Bayesian Learning Machine Learning Fall 2018 1 What we have seen so far What does it mean to learn? Mistake-driven learning Learning by counting (and bounding) number of mistakes PAC learnability
More informationSections 7.1 and 7.2. This chapter presents the beginning of inferential statistics. The two major applications of inferential statistics
Sections 7.1 and 7.2 This chapter presents the beginning of inferential statistics. The two major applications of inferential statistics Estimate the value of a population parameter Test some claim (or
More informationBernoulli Trials, Binomial and Cumulative Distributions
Bernoulli Trials, Binomial and Cumulative Distributions Sec 4.4-4.6 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 9-3339 Cathy Poliak,
More informationComparing p s Dr. Don Edwards notes (slightly edited and augmented) The Odds for Success
Comparing p s Dr. Don Edwards notes (slightly edited and augmented) The Odds for Success When the experiment consists of a series of n independent trials, and each trial may end in either success or failure,
More informationInferences for Proportions and Count Data
Inferences for Proportions and Count Data Corresponds to Chapter 9 of Tamhane and Dunlop Slides prepared by Elizabeth Newton (MIT), with some slides by Ramón V. León (University of Tennessee) 1 Inference
More informationBernoulli Trials and Binomial Distribution
Bernoulli Trials and Binomial Distribution Sec 4.4-4.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 10-3339 Cathy Poliak, Ph.D. cathy@math.uh.edu
More informationReports of the Institute of Biostatistics
Reports of the Institute of Biostatistics No 02 / 2008 Leibniz University of Hannover Natural Sciences Faculty Title: Properties of confidence intervals for the comparison of small binomial proportions
More informationPHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1
PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population
More informationLecture 7: Confidence interval and Normal approximation
Lecture 7: Confidence interval and Normal approximation 26th of November 2015 Confidence interval 26th of November 2015 1 / 23 Random sample and uncertainty Example: we aim at estimating the average height
More informationGEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs
STATISTICS 4 Summary Notes. Geometric and Exponential Distributions GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs P(X = x) = ( p) x p x =,, 3,...
More informationLecture 3. Biostatistics in Veterinary Science. Feb 2, Jung-Jin Lee Drexel University. Biostatistics in Veterinary Science Lecture 3
Lecture 3 Biostatistics in Veterinary Science Jung-Jin Lee Drexel University Feb 2, 2015 Review Let S be the sample space and A, B be events. Then 1 P (S) = 1, P ( ) = 0. 2 If A B, then P (A) P (B). In
More informationTests for Population Proportion(s)
Tests for Population Proportion(s) Esra Akdeniz April 6th, 2016 Motivation We are interested in estimating the prevalence rate of breast cancer among 50- to 54-year-old women whose mothers have had breast
More informationProbability: Why do we care? Lecture 2: Probability and Distributions. Classical Definition. What is Probability?
Probability: Why do we care? Lecture 2: Probability and Distributions Sandy Eckel seckel@jhsph.edu 22 April 2008 Probability helps us by: Allowing us to translate scientific questions into mathematical
More informationBernoulli Trials and Binomial Distribution
Bernoulli Trials and Binomial Distribution Sec 4.4-4.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 9-3339 Cathy Poliak, Ph.D. cathy@math.uh.edu
More informationBINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference
BINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference BINF702 SPRING 2014 Chapter 7 Hypothesis Testing 1 Section 7.9 One-Sample c 2 Test for the Variance of a Normal Distribution Eq. 7.40
More informationECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12
ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12 Winter 2012 Lecture 13 (Winter 2011) Estimation Lecture 13 1 / 33 Review of Main Concepts Sampling Distribution of Sample Mean
More informationConfidence Intervals for Normal Data Spring 2018
Confidence Intervals for Normal Data 18.05 Spring 2018 Agenda Exam on Monday April 30. Practice questions posted. Friday s class is for review (no studio) Today Review of critical values and quantiles.
More informationLecture 2: Discrete Probability Distributions
Lecture 2: Discrete Probability Distributions IB Paper 7: Probability and Statistics Carl Edward Rasmussen Department of Engineering, University of Cambridge February 1st, 2011 Rasmussen (CUED) Lecture
More informationOne-sample categorical data: approximate inference
One-sample categorical data: approximate inference Patrick Breheny October 6 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction It is relatively easy to think about the distribution
More informationTopic 12 Overview of Estimation
Topic 12 Overview of Estimation Classical Statistics 1 / 9 Outline Introduction Parameter Estimation Classical Statistics Densities and Likelihoods 2 / 9 Introduction In the simplest possible terms, the
More informationData Analysis and Statistical Methods Statistics 651
Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 26 (MWF) Tests and CI based on two proportions Suhasini Subba Rao Comparing proportions in
More informationChapters 3.2 Discrete distributions
Chapters 3.2 Discrete distributions In this section we study several discrete distributions and their properties. Here are a few, classified by their support S X. There are of course many, many more. For
More informationLecture 14: Introduction to Poisson Regression
Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why
More informationModelling counts. Lecture 14: Introduction to Poisson Regression. Overview
Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week
More informationSTA 101 Final Review
STA 101 Final Review Statistics 101 Thomas Leininger June 24, 2013 Announcements All work (besides projects) should be returned to you and should be entered on Sakai. Office Hour: 2 3pm today (Old Chem
More informationLecture Slides. Elementary Statistics. Tenth Edition. by Mario F. Triola. and the Triola Statistics Series
Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 7 Estimates and Sample Sizes 7-1 Overview 7-2 Estimating a Population Proportion 7-3
More informationCOMPARING GROUPS PART 1CONTINUOUS DATA
COMPARING GROUPS PART 1CONTINUOUS DATA Min Chen, Ph.D. Assistant Professor Quantitative Biomedical Research Center Department of Clinical Sciences Bioinformatics Shared Resource Simmons Comprehensive Cancer
More information1 Hypothesis testing for a single mean
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationPercentage point z /2
Chapter 8: Statistical Intervals Why? point estimate is not reliable under resampling. Interval Estimates: Bounds that represent an interval of plausible values for a parameter There are three types of
More informationan introduction to bayesian inference
with an application to network analysis http://jakehofman.com january 13, 2010 motivation would like models that: provide predictive and explanatory power are complex enough to describe observed phenomena
More informationE509A: Principle of Biostatistics. GY Zou
E509A: Principle of Biostatistics (Week 4: Inference for a single mean ) GY Zou gzou@srobarts.ca Example 5.4. (p. 183). A random sample of n =16, Mean I.Q is 106 with standard deviation S =12.4. What
More informationConfidence Intervals for the Mean of Non-normal Data Class 23, Jeremy Orloff and Jonathan Bloom
Confidence Intervals for the Mean of Non-normal Data Class 23, 8.05 Jeremy Orloff and Jonathan Bloom Learning Goals. Be able to derive the formula for conservative normal confidence intervals for the proportion
More informationAdvanced Herd Management Probabilities and distributions
Advanced Herd Management Probabilities and distributions Anders Ringgaard Kristensen Slide 1 Outline Probabilities Conditional probabilities Bayes theorem Distributions Discrete Continuous Distribution
More informationPractice Questions: Statistics W1111, Fall Solutions
Practice Questions: Statistics W, Fall 9 Solutions Question.. The standard deviation of Z is 89... P(=6) =..3. is definitely inside of a 95% confidence interval for..4. (a) YES (b) YES (c) NO (d) NO Questions
More informationConfidence Intervals for Normal Data Spring 2014
Confidence Intervals for Normal Data 18.05 Spring 2014 Agenda Today Review of critical values and quantiles. Computing z, t, χ 2 confidence intervals for normal data. Conceptual view of confidence intervals.
More information1 Matched pair comparison(p430-)
[1] ST301(AKI) LEC 25 2010/11/30 ST 301 (AKI) LECTURE #25 1 Matched pair comparison(p430-) This has a quite different assumption (matched pair) from the other three methods. Remember LEC 32 page 1 example:
More informationLecture Slides for INTRODUCTION TO. Machine Learning. ETHEM ALPAYDIN The MIT Press,
Lecture Slides for INTRODUCTION TO Machine Learning ETHEM ALPAYDIN The MIT Press, 2004 alpaydin@boun.edu.tr http://www.cmpe.boun.edu.tr/~ethem/i2ml CHAPTER 14: Assessing and Comparing Classification Algorithms
More informationA proportion is the fraction of individuals having a particular attribute. Can range from 0 to 1!
Proportions A proportion is the fraction of individuals having a particular attribute. It is also the probability that an individual randomly sampled from the population will have that attribute Can range
More informationTopic 19 Extensions on the Likelihood Ratio
Topic 19 Extensions on the Likelihood Ratio Two-Sided Tests 1 / 12 Outline Overview Normal Observations Power Analysis 2 / 12 Overview The likelihood ratio test is a popular choice for composite hypothesis
More informationStatistics in medicine
Statistics in medicine Lecture 3: Bivariate association : Categorical variables Proportion in one group One group is measured one time: z test Use the z distribution as an approximation to the binomial
More informationProbability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur
Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation
More informationConfidence Intervals. Confidence interval for sample mean. Confidence interval for sample mean. Confidence interval for sample mean
Confidence Intervals Confidence interval for sample mean The CLT tells us: as the sample size n increases, the sample mean is approximately Normal with mean and standard deviation Thus, we have a standard
More informationAMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015
AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking
More informationLecture 25. Ingo Ruczinski. November 24, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University
Lecture 25 Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University November 24, 2015 1 2 3 4 5 6 7 8 9 10 11 1 Hypothesis s of homgeneity 2 Estimating risk
More informationChapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides
Chapter 7 Inference for Distributions Introduction to the Practice of STATISTICS SEVENTH EDITION Moore / McCabe / Craig Lecture Presentation Slides Chapter 7 Inference for Distributions 7.1 Inference for
More informationSummary of Chapters 7-9
Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two
More informationApproximate and Fiducial Confidence Intervals for the Difference Between Two Binomial Proportions
Approximate and Fiducial Confidence Intervals for the Difference Between Two Binomial Proportions K. Krishnamoorthy 1 and Dan Zhang University of Louisiana at Lafayette, Lafayette, LA 70504, USA SUMMARY
More informationMedical statistics part I, autumn 2010: One sample test of hypothesis
Medical statistics part I, autumn 2010: One sample test of hypothesis Eirik Skogvoll Consultant/ Professor Faculty of Medicine Dept. of Anaesthesiology and Emergency Medicine 1 What is a hypothesis test?
More informationSTAT 4385 Topic 01: Introduction & Review
STAT 4385 Topic 01: Introduction & Review Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2016 Outline Welcome What is Regression Analysis? Basics
More informationExpected Value - Revisited
Expected Value - Revisited An experiment is a Bernoulli Trial if: there are two outcomes (success and failure), the probability of success, p, is always the same, the trials are independent. Expected Value
More informationhypothesis a claim about the value of some parameter (like p)
Testing hypotheses hypothesis a claim about the value of some parameter (like p) significance test procedure to assess the strength of evidence provided by a sample of data against the claim of a hypothesized
More informationPoint and Interval Estimation II Bios 662
Point and Interval Estimation II Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2006-09-13 17:17 BIOS 662 1 Point and Interval Estimation II Nonparametric CI
More informationp = q ˆ = 1 -ˆp = sample proportion of failures in a sample size of n x n Chapter 7 Estimates and Sample Sizes
Chapter 7 Estimates and Sample Sizes 7-1 Overview 7-2 Estimating a Population Proportion 7-3 Estimating a Population Mean: σ Known 7-4 Estimating a Population Mean: σ Not Known 7-5 Estimating a Population
More informationChapter 6 Estimation and Sample Sizes
Chapter 6 Estimation and Sample Sizes This chapter presents the beginning of inferential statistics.! The two major applications of inferential statistics! Estimate the value of a population parameter!
More informationUnobservable Parameter. Observed Random Sample. Calculate Posterior. Choosing Prior. Conjugate prior. population proportion, p prior:
Pi Priors Unobservable Parameter population proportion, p prior: π ( p) Conjugate prior π ( p) ~ Beta( a, b) same PDF family exponential family only Posterior π ( p y) ~ Beta( a + y, b + n y) Observed
More information# of 6s # of times Test the null hypthesis that the dice are fair at α =.01 significance
Practice Final Exam Statistical Methods and Models - Math 410, Fall 2011 December 4, 2011 You may use a calculator, and you may bring in one sheet (8.5 by 11 or A4) of notes. Otherwise closed book. The
More informationFoundations of Statistical Inference
Foundations of Statistical Inference Julien Berestycki Department of Statistics University of Oxford MT 2016 Julien Berestycki (University of Oxford) SB2a MT 2016 1 / 20 Lecture 6 : Bayesian Inference
More informationPubh 8482: Sequential Analysis
Pubh 8482: Sequential Analysis Joseph S. Koopmeiners Division of Biostatistics University of Minnesota Week 10 Class Summary Last time... We began our discussion of adaptive clinical trials Specifically,
More informationConditional Probabilities
Lecture Outline BIOST 514/517 Biostatistics I / pplied Biostatistics I Kathleen Kerr, Ph.D. ssociate Professor of Biostatistics University of Washington Probability Diagnostic Testing Random variables:
More informationThis does not cover everything on the final. Look at the posted practice problems for other topics.
Class 7: Review Problems for Final Exam 8.5 Spring 7 This does not cover everything on the final. Look at the posted practice problems for other topics. To save time in class: set up, but do not carry
More informationACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions
ACMS 20340 Statistics for Life Sciences Chapter 13: Sampling Distributions Sampling We use information from a sample to infer something about a population. When using random samples and randomized experiments,
More informationTesting Independence
Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1
More informationSignificance Tests. Review Confidence Intervals. The Gauss Model. Genetics
15.0 Significance Tests Review Confidence Intervals The Gauss Model Genetics Significance Tests 1 15.1 CI Review The general formula for a two-sided C% confidence interval is: L, U = pe ± se cv (1 C)/2
More informationInference for Single Proportions and Means T.Scofield
Inference for Single Proportions and Means TScofield Confidence Intervals for Single Proportions and Means A CI gives upper and lower bounds between which we hope to capture the (fixed) population parameter
More informationBusiness Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing
Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology
More informationInference for Proportions
Inference for Proportions Marc H. Mehlman marcmehlman@yahoo.com University of New Haven Based on Rare Event Rule: rare events happen but not to me. Marc Mehlman (University of New Haven) Inference for
More informationREVIEW: Midterm Exam. Spring 2012
REVIEW: Midterm Exam Spring 2012 Introduction Important Definitions: - Data - Statistics - A Population - A census - A sample Types of Data Parameter (Describing a characteristic of the Population) Statistic
More informationProbability and Probability Distributions. Dr. Mohammed Alahmed
Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about
More informationThe Multinomial Model
The Multinomial Model STA 312: Fall 2012 Contents 1 Multinomial Coefficients 1 2 Multinomial Distribution 2 3 Estimation 4 4 Hypothesis tests 8 5 Power 17 1 Multinomial Coefficients Multinomial coefficient
More informationCarolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li)
Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li) Department of Educational Psychology University of Illinois at Urbana-Champaign 1 Inferential
More informationCentral Limit Theorem and the Law of Large Numbers Class 6, Jeremy Orloff and Jonathan Bloom
Central Limit Theorem and the Law of Large Numbers Class 6, 8.5 Jeremy Orloff and Jonathan Bloom Learning Goals. Understand the statement of the law of large numbers. 2. Understand the statement of the
More informationSTAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)
STAT 515 -- Chapter 9: Two-Sample Problems Paired Differences (Section 9.3) Examples of Paired Differences studies: Similar subjects are paired off and one of two treatments is given to each subject in
More information2011 Pearson Education, Inc
Statistics for Business and Economics Chapter 7 Inferences Based on Two Samples: Confidence Intervals & Tests of Hypotheses Content 1. Identifying the Target Parameter 2. Comparing Two Population Means:
More information