HOW TO DETERMINE THE NUMBER OF SUBJECTS NEEDED FOR MY STUDY?
- Paulina Snow
- 5 years ago
Transcription
1 HOW TO DETERMINE THE NUMBER OF SUBJECTS NEEDED FOR MY STUDY? TUTORIAL ON SAMPLE SIZE AND POWER CALCULATIONS FOR INEQUALITY TESTS. John Zavrakidis May 28, 2018 J.Zavrakidis Sample and Power Calculations
2 OUTLINE
Introduction
Sample Size Calculation
General Information
Sample Size Calculation for Continuous Outcome
Sample Size Calculation for Binary Outcome
Sample Size Calculation for Survival Outcome
Summary
3 HOW MUCH DATA?
It is not feasible to collect data on the entire population of interest, so a random sample is collected.
How many subjects should the random sample contain?
If there are no data collection constraints: the more data the better.
If there are data collection constraints: enough data to ensure that the results are accurate, efficient and credible.
Sample size calculation is intended to determine the minimal amount of data (the sample size) required to detect a relevant result. It should be done at the planning stage of the investigation.
4 HOW MUCH DATA? Sample size & statistical power
Too small a sample size: powerless experiment, unreliable results.
Too large a sample size: too powerful, no meaningful results (even negligible effects can reach statistical significance).
More power increases the chance of finding significant results.
More power increases the chance of replicating prior findings.
More power increases confidence in the results, whether significant or not.
5 PROCEDURE FOR A PRIORI SAMPLE SIZE CALCULATION
1. Decide on the outcome variables and their measurement level.
2. Choose a statistical model: specify the null and the alternative hypothesis, and choose the test statistic.
3. Prespecify the effect size: decide on the expected difference, i.e. the smallest effect size that can be considered clinically important.
4. Select the desired power and α level.
   α (significance level of the test): probability of rejecting the null hypothesis when it is true.
   Power: probability of rejecting the null hypothesis when it is false, i.e. the probability of correctly rejecting H0.
6 TYPES OF OUTCOME AND STATISTICAL MODEL
Examples:
Means: t-test or ANOVA
Proportions: Z test for 2 proportions
Bivariate relationship: test for 2 correlations
Multiple regression: test for 2 slopes
7 HYPOTHESIS TESTING
Non-inferiority/superiority testing: H0: parameter ≤ -δ; H1: parameter > -δ
Equivalence testing: H0: parameter ≤ -δ or parameter ≥ δ; H1: -δ < parameter < δ
Equality/inequality testing: H0: parameter = δ; H1: parameter ≠ δ (or parameter > δ, or parameter < δ)
δ: non-inferiority/superiority/equivalence margin
8 INEQUALITY HYPOTHESIS TESTING
H0: parameter = parameter value
H1: parameter ≠ parameter value (or parameter > parameter value, or parameter < parameter value)

State of nature | Decision: reject H0      | Decision: do not reject H0
H0 true         | Type I error (α)         | Correct decision (1 - α)
H0 false        | Correct decision (1 - β) | Type II error (β)
9 INEQUALITY HYPOTHESIS TESTING
Significance level of the test (α): probability of incorrectly concluding that there is a significant effect when it does not really exist in the population.
o α is often set to 0.05 (5%), sometimes to 0.01 (1%) or 0.10 (10%).
[Figure: sampling distribution associated with H0, with a rejection region of area α/2 in each tail]
Power: probability of correctly concluding that there is a significant effect when it really exists in the population.
o It is generally accepted that the power should be 0.8 or greater.
o Power = 1 - β
10 INEQUALITY HYPOTHESIS TESTING
Test statistic = effect size / standard error
The test statistic has a sampling distribution.
[Figure: sampling distribution associated with H0 and sampling distribution associated with H1]
13 EFFECT SIZE
Effect size is a measure of the magnitude of a difference or relationship.
Unstandardized effect size, e.g. (mean group 1 - mean group 2).
Standardized effect size, e.g. (mean group 1 - mean group 2) / pooled standard deviation. Used when the metrics of the variables being studied do not have intrinsic meaning, when results from multiple studies are being combined, or when some or all of the studies use different scales.
Determining the effect size, based on:
1. substantive knowledge (clinically relevant effect)
2. findings from prior research
3. a pilot study
4. conventions, e.g. those defined by Cohen (means comparison: d = 0.20 is a small effect, d = 0.50 is a medium effect, d = 0.80 is a large effect)
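As a small illustration of the two kinds of effect size above, the following sketch computes an unstandardized difference and a standardized one (Cohen's d with a pooled standard deviation). The function names, the cut points used for labelling and the summary statistics are illustrative assumptions, not values from the slides.

```python
import math


def pooled_sd(sd1, n1, sd2, n2):
    """Pooled standard deviation of two independent groups."""
    return math.sqrt(((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / (n1 + n2 - 2))


def unstandardized_effect(mean1, mean2):
    """Raw difference in means, in the units of the outcome."""
    return mean1 - mean2


def cohens_d(mean1, mean2, sd1, n1, sd2, n2):
    """Standardized difference: mean difference divided by the pooled SD."""
    return (mean1 - mean2) / pooled_sd(sd1, n1, sd2, n2)


def cohen_label(d):
    """Rough label using 0.20 / 0.50 / 0.80 as cut points (an illustrative choice)."""
    size = abs(d)
    if size < 0.20:
        return "below small"
    if size < 0.50:
        return "small"
    if size < 0.80:
        return "medium"
    return "large"


# Hypothetical summary statistics, not taken from the slides.
d = cohens_d(52.0, 48.0, 8.0, 40, 8.0, 40)
print(unstandardized_effect(52.0, 48.0), round(d, 2), cohen_label(d))
```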
14 DIFFERENCE IN TWO INDEPENDENT MEANS
$H_0: \mu_C = \mu_E$ vs $H_1: \mu_C > \mu_E$
Test statistic $= (\bar{Y}_C - \bar{Y}_E) / \sqrt{S_C^2/n_C + S_E^2/n_E}$
Under H0: test statistic ~ normal distribution with mean $m_1 = 0$
Under H1: test statistic ~ normal distribution with mean $m_2 = (\mu_C - \mu_E) / \sqrt{\sigma_C^2/n_C + \sigma_E^2/n_E}$
$\mu_C, \mu_E$: population means; $\bar{Y}_C, \bar{Y}_E$: sample means; $\sigma_C, \sigma_E$: population standard deviations; $S_C, S_E$: sample standard deviations; $n_C, n_E$: sample sizes
The statistical power is the probability that the test statistic is above the critical value $z_{1-\alpha}$ when the alternative hypothesis is true.
[Figure: sampling distributions associated with H0 (mean $m_1$) and H1 (mean $m_2$), the critical value $z_{1-\alpha}$, and the power as the area under the H1 curve beyond the critical value]
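The power definition above can be sketched numerically under the normal approximation used on this slide. The function name and the input values are hypothetical, and the population standard deviations are treated as known.

```python
from math import sqrt
from statistics import NormalDist


def power_two_means(mu_c, mu_e, sd_c, sd_e, n_c, n_e, alpha=0.05):
    """One-sided power for H1: mu_C > mu_E under the normal approximation."""
    z = NormalDist()
    # Mean of the test statistic under H1 (m2 on the slide).
    m2 = (mu_c - mu_e) / sqrt(sd_c**2 / n_c + sd_e**2 / n_e)
    z_crit = z.inv_cdf(1 - alpha)  # critical value z_{1-alpha}
    # P(test statistic > z_crit) when the statistic is N(m2, 1).
    return 1 - z.cdf(z_crit - m2)


# Hypothetical inputs, not taken from the slides.
print(round(power_two_means(10.0, 8.0, 4.0, 4.0, 50, 50), 3))
```

When the two population means are equal, m2 is 0 and the function returns α, which is a quick sanity check on the definition.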
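15 DIFFERENCE IN TWO INDEPENDENT MEANS
Derivation of the sample size formula (F: standard normal cumulative distribution function)
$1 - \beta = P\left(Z + \frac{\mu_C - \mu_E}{\sqrt{\sigma_C^2/n_C + \sigma_E^2/n_E}} > z_{1-\alpha}\right) = P\left(Z > z_{1-\alpha} - \frac{\mu_C - \mu_E}{\sqrt{\sigma_C^2/n_C + \sigma_E^2/n_E}}\right)$
$= P\left(Z < -z_{1-\alpha} + \frac{\mu_C - \mu_E}{\sqrt{\sigma_C^2/n_C + \sigma_E^2/n_E}}\right) = F\left(-z_{1-\alpha} + \frac{\mu_C - \mu_E}{\sqrt{\sigma_C^2/n_C + \sigma_E^2/n_E}}\right)$
$z_{1-\beta} = -z_{1-\alpha} + \frac{\mu_C - \mu_E}{\sqrt{\sigma_C^2/n_C + \sigma_E^2/n_E}}$
(some algebra)
$n_C = \left(s_C^2 + s_E^2/k\right) \left(\frac{z_{1-\alpha} + z_{1-\beta}}{\bar{y}_C - \bar{y}_E}\right)^2$, $k = n_E/n_C$
If $H_1: \mu_C \neq \mu_E$, then $z_{1-\alpha}$ is replaced by $z_{1-\alpha/2}$.
[Figure: sampling distributions associated with H0 (mean $m_1$) and H1 (mean $m_2$), the critical value $z_{1-\alpha}$, and the power]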
16 DIFFERENCE IN TWO INDEPENDENT MEANS
$z_{1-\alpha}$, $z_{1-\beta}$: critical values from the standard normal distribution
α = 5%: $z_{1-\alpha} = 1.65$, $z_{1-\alpha/2} = 1.96$
α = 1%: $z_{1-\alpha} = 2.33$, $z_{1-\alpha/2} = 2.58$
Power = 80%: $z_{1-\beta} = 0.84$
Power = 85%: $z_{1-\beta} = 1.04$
Power = 90%: $z_{1-\beta} = 1.28$
Power = 95%: $z_{1-\beta} = 1.65$
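The critical values listed above can be reproduced with the standard normal quantile function. This is a minimal sketch using Python's standard library; the helper name `z_quantile` is an illustrative choice. Note that the slide rounds 1.645 up to 1.65.

```python
from statistics import NormalDist


def z_quantile(p):
    """Quantile z_p of the standard normal distribution."""
    return NormalDist().inv_cdf(p)


# One-sided and two-sided critical values for the significance levels on the slide.
for alpha in (0.05, 0.01):
    print(f"alpha={alpha}: z_(1-alpha)={z_quantile(1 - alpha):.3f}, "
          f"z_(1-alpha/2)={z_quantile(1 - alpha / 2):.3f}")

# z_(1-beta) for the power levels on the slide.
for power in (0.80, 0.85, 0.90, 0.95):
    print(f"power={power}: z_(1-beta)={z_quantile(power):.3f}")
```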
17 DIFFERENCE IN TWO INDEPENDENT MEANS
$H_0: \mu_C = \mu_E$ vs $H_1: \mu_C > \mu_E$
Test statistic $= (\bar{Y}_C - \bar{Y}_E) / \sqrt{S_C^2/n_C + S_E^2/n_E}$
Under H0: test statistic ~ Student-t distribution with non-centrality parameter 0 and $df = n_C + n_E - 2$
Under H1: test statistic ~ Student-t distribution with non-centrality parameter $\lambda = (\mu_C - \mu_E) / \sqrt{\sigma_C^2/n_C + \sigma_E^2/n_E}$ and $df = n_C + n_E - 2$
[Figure: sampling distributions associated with H0 and H1, with the power shaded]
18 DIFFERENCE IN TWO INDEPENDENT MEANS
Example with breast cancer patients: quality of life (QoL) after mastectomy compared to QoL after breast conserving surgery; breast conserving surgery may improve QoL more than mastectomy.
$\bar{Y}_M = 15$, $\bar{Y}_{BCS} = 25$, $S_C = S_E = 13$
How many women are needed to detect the difference with a power of 90%?
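One way to work through this example is to plug the numbers into the normal-approximation formula for n_C from the derivation slide. The slide does not state α, the allocation ratio or the sidedness for this example, so α = 0.05, k = 1 and a one-sided test are assumptions here; the function name is also illustrative.

```python
from math import ceil
from statistics import NormalDist


def n_control_two_means(mean_c, mean_e, sd_c, sd_e, power,
                        alpha=0.05, k=1.0, two_sided=False):
    """Control-group size n_C from the normal-approximation formula; n_E = k * n_C."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2 if two_sided else 1 - alpha)
    z_beta = z.inv_cdf(power)
    n_c = (sd_c**2 + sd_e**2 / k) * ((z_alpha + z_beta) / (mean_c - mean_e)) ** 2
    return ceil(n_c)  # round up to whole subjects


# Slide values: means 15 and 25, both SDs 13, power 90%.
# Assumed (not on the slide): alpha = 0.05, equal group sizes.
print(n_control_two_means(15, 25, 13, 13, power=0.90))                  # one-sided: 29 per group
print(n_control_two_means(15, 25, 13, 13, power=0.90, two_sided=True))  # two-sided: 36 per group
```

Under these assumptions the one-sided test needs 29 women per group (58 in total). A t-based calculation, as on the previous slide, would typically give a slightly larger number.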
19 DIFFERENCE IN TWO INDEPENDENT PROPORTIONS
$H_0: \pi_C = \pi_E$ vs $H_1: \pi_C > \pi_E$
Test statistic $= (p_C - p_E) / \sqrt{p_C(1 - p_C)/n_C + p_E(1 - p_E)/n_E}$
Under H0: test statistic ~ normal distribution with mean = 0
Under H1: test statistic ~ normal distribution with mean $= (\pi_C - \pi_E) / \sqrt{\pi_C(1 - \pi_C)/n_C + \pi_E(1 - \pi_E)/n_E}$
$\pi_C, \pi_E$: population proportions; $p_C, p_E$: sample proportions; $n_C, n_E$: sample sizes
The statistical power is the probability that the test statistic is above the critical value $z_{1-\alpha}$ when the alternative hypothesis is true.
$1 - \beta = P\left(Z + \frac{\pi_C - \pi_E}{\sqrt{\pi_C(1 - \pi_C)/n_C + \pi_E(1 - \pi_E)/n_E}} > z_{1-\alpha}\right)$
$-z_{1-\alpha} + \frac{\pi_C - \pi_E}{\sqrt{\pi_C(1 - \pi_C)/n_C + \pi_E(1 - \pi_E)/n_E}} = z_{1-\beta}$
(some algebra)
$n_C = \left(p_C(1 - p_C) + p_E(1 - p_E)/k\right) \left(\frac{z_{1-\alpha} + z_{1-\beta}}{p_C - p_E}\right)^2$, $k = n_E/n_C$
20 DIFFERENCE IN TWO INDEPENDENT PROPORTIONS, EXAMPLE
Example with sarcoma patients: standard treatment (25x2Gy radiotherapy without chemotherapy) compared to new treatment (25x2Gy radiotherapy with chemotherapy); the new treatment may increase the proportion of patients with necrosis induction.
$\hat{p}_C = 30\%$, $\hat{p}_E = 45\%$, $n_C = 50$, $n_E = 50$
What is the power level that we can reach?
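A hedged sketch of how this question could be answered with the normal approximation from the previous slide. The slide does not give α or the sidedness, so a one-sided test at α = 0.05 in the direction of an increase under the new treatment is assumed; the function name is illustrative.

```python
from math import sqrt
from statistics import NormalDist


def power_two_proportions(p_c, p_e, n_c, n_e, alpha=0.05):
    """One-sided power to detect p_E > p_C under the normal approximation."""
    z = NormalDist()
    se = sqrt(p_c * (1 - p_c) / n_c + p_e * (1 - p_e) / n_e)
    mean_h1 = (p_e - p_c) / se  # mean of the test statistic under H1
    return z.cdf(mean_h1 - z.inv_cdf(1 - alpha))


# Slide values: 30% vs 45%, 50 patients per arm. alpha = 0.05 one-sided is assumed.
print(round(power_two_proportions(0.30, 0.45, 50, 50), 2))  # about 0.47
```

Under these assumptions the achievable power is a little below 50%, well short of the conventional 0.8 target mentioned earlier.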
21 DIFFERENCE IN TWO INDEPENDENT PROPORTIONS, EXAMPLE
Example with sarcoma patients: standard treatment (25x2Gy radiotherapy without chemotherapy) compared to new treatment (25x2Gy radiotherapy with chemotherapy); the new treatment may increase the proportion of patients with necrosis induction.
$\hat{p}_C = 30\%$, $n_C = 50$, $n_E = 50$
Which effect size can be detected with a power of 80%?
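Because the standard error depends on the unknown proportion in the experimental arm, the detectable effect has no simple closed form. One possible approach, sketched here, is a bisection search on that proportion. A one-sided test at α = 0.05 is assumed (the slide does not state it), and the function names are illustrative.

```python
from math import sqrt
from statistics import NormalDist


def power_two_proportions(p_c, p_e, n_c, n_e, alpha=0.05):
    """One-sided power to detect p_E > p_C under the normal approximation."""
    z = NormalDist()
    se = sqrt(p_c * (1 - p_c) / n_c + p_e * (1 - p_e) / n_e)
    return z.cdf((p_e - p_c) / se - z.inv_cdf(1 - alpha))


def detectable_p_e(p_c, n_c, n_e, power=0.80, alpha=0.05, tol=1e-10):
    """Smallest p_E > p_C reaching the target power, found by bisection."""
    lo, hi = p_c, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if power_two_proportions(p_c, mid, n_c, n_e, alpha) < power:
            lo = mid  # not enough power yet: a larger p_E is needed
        else:
            hi = mid
    return (lo + hi) / 2


# Slide values: p_C = 30%, 50 patients per arm, target power 80%.
p_e = detectable_p_e(0.30, 50, 50)
print(f"detectable p_E = {p_e:.3f}, difference = {p_e - 0.30:.3f}")
```

The search relies on the power increasing with p_E over the interval (p_C, 1), which holds for these inputs.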
22 RATIO BETWEEN TWO INDEPENDENT ODDS
$H_0: \frac{\pi_C}{1 - \pi_C} = \frac{\pi_E}{1 - \pi_E}$ vs $H_1: \frac{\pi_C}{1 - \pi_C} > \frac{\pi_E}{1 - \pi_E}$
$OR = \frac{\pi_C/(1 - \pi_C)}{\pi_E/(1 - \pi_E)}$; $\widehat{OR} = \frac{p_C/(1 - p_C)}{p_E/(1 - p_E)}$
Test statistic $= (p_C - p_E) \sqrt{\frac{N b (1 - b)}{p (1 - p)}}$, with $p = (1 - b) p_C + b p_E$
Under H0: test statistic ~ normal distribution with mean = 0
Under H1: test statistic ~ normal distribution with mean $= (\pi_C - \pi_E) \sqrt{\frac{N b (1 - b)}{\pi (1 - \pi)}}$
OR: population odds ratio; $\widehat{OR}$: sample odds ratio; $\pi_C, \pi_E$: population event rates; $p_C, p_E$: sample event rates; $\pi$: overall population event rate; $p$: overall sample event rate; N: total sample size; b: proportion of the sample in the E group
The statistical power is the probability that the test statistic is above the critical value $z_{1-\alpha}$ when the alternative hypothesis is true.
$1 - \beta = P\left(Z + \sqrt{\frac{(\pi_C - \pi_E)^2 N b (1 - b)}{\pi (1 - \pi)}} > z_{1-\alpha}\right)$
$-z_{1-\alpha} + \sqrt{\frac{(\pi_C - \pi_E)^2 N b (1 - b)}{\pi (1 - \pi)}} = z_{1-\beta}$
$N = \frac{\hat{p}(1 - \hat{p}) (z_{1-\alpha} + z_{1-\beta})^2}{b (1 - b) (p_C - p_E)^2}$
23 RATIO BETWEEN TWO INDEPENDENT ODDS
Example with bladder cancer patients: is bladder cancer associated with cigarette smoking?
$b = 80\%$, $\hat{p}_C = 20\%$, $\widehat{OR} = 2$
What is the power with N = 500?
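The following sketch shows one way to turn the example inputs into a power estimate with the formula from the previous slide. Two things are assumptions rather than statements from the slides: the odds ratio is taken as the C odds divided by the E odds (as the definition reads in this transcription), and the test is one-sided at α = 0.05. If the ratio is meant the other way round, p_E and the resulting power change. The function names are illustrative.

```python
from math import sqrt
from statistics import NormalDist


def p_e_from_odds_ratio(p_c, odds_ratio):
    """Event rate in group E, assuming OR = odds_C / odds_E."""
    odds_c = p_c / (1 - p_c)
    odds_e = odds_c / odds_ratio
    return odds_e / (1 + odds_e)


def power_odds_ratio(n_total, b, p_c, p_e, alpha=0.05):
    """One-sided power under the normal approximation; b is the share of N in group E."""
    z = NormalDist()
    p = (1 - b) * p_c + b * p_e  # overall event rate
    mean_h1 = abs(p_c - p_e) * sqrt(n_total * b * (1 - b) / (p * (1 - p)))
    return z.cdf(mean_h1 - z.inv_cdf(1 - alpha))


# Slide values: b = 80%, p_C = 20%, OR = 2, N = 500.
p_e = p_e_from_odds_ratio(0.20, 2.0)
print(f"p_E = {p_e:.3f}, power = {power_odds_ratio(500, 0.80, 0.20, p_e):.3f}")
```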
28 RATIO BETWEEN TWO INDEPENDENT HAZARDS
$H_0: HR = 1$ vs $H_1: HR > 1$, where $HR = h_1(t)/h_2(t)$ for all t
Test statistic $= \log(\widehat{HR}) \sqrt{q_C q_E d N}$
Under H0: test statistic ~ normal distribution with mean = 0
Under H1: test statistic ~ normal distribution with mean $= \log(HR) \sqrt{q_C q_E d N}$
HR: population hazard ratio; $\widehat{HR}$: sample hazard ratio; $q_C, q_E$: proportions of N in each group; N: total sample size; d: overall baseline probability of an event
The statistical power is the probability that the test statistic is above the critical value $z_{1-\alpha}$ when the alternative hypothesis is true.
$1 - \beta = P\left(Z + \log(HR) \sqrt{q_C q_E d N} > z_{1-\alpha}\right)$
$-z_{1-\alpha} + \log(HR) \sqrt{q_C q_E d N} = z_{1-\beta}$
$N = \frac{(z_{1-\alpha} + z_{1-\beta})^2}{\log(HR)^2 \, q_C q_E d}$
29 RATIO BETWEEN TWO INDEPENDENT HAZARDS, EXAMPLE
Example with breast cancer patients: is dementia associated with chemotherapy?
$HR = 1.5$, $q_C = q_E = 50\%$, $d = 10\%$
How many patients are needed to reach a power of 80%?
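The example can be worked through with the formula for N from the previous slide. The slide does not state α, so a one-sided test at α = 0.05 (matching H1: HR > 1) is assumed here, with the two-sided variant shown for comparison; the function name is illustrative.

```python
from math import ceil, log
from statistics import NormalDist


def total_n_hazard_ratio(hr, q_c, q_e, d, power, alpha=0.05, two_sided=False):
    """Total sample size N for detecting a hazard ratio under the normal approximation."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2 if two_sided else 1 - alpha)
    z_beta = z.inv_cdf(power)
    n = (z_alpha + z_beta) ** 2 / (log(hr) ** 2 * q_c * q_e * d)
    return ceil(n)  # round up to whole patients


# Slide values: HR = 1.5, equal allocation, event probability 10%, power 80%.
n_one_sided = total_n_hazard_ratio(1.5, 0.5, 0.5, 0.10, power=0.80)
n_two_sided = total_n_hazard_ratio(1.5, 0.5, 0.5, 0.10, power=0.80, two_sided=True)
print(n_one_sided, n_two_sided)
print("expected number of events (one-sided):", round(0.10 * n_one_sided))
```

Under these assumptions roughly 1,500 patients are needed for the one-sided test and roughly 1,900 for the two-sided test. The low event probability (10%) is what drives the total so high, since only about a tenth of the patients contribute an event.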
34 PRIOR KNOWLEDGE
For any sample size calculation we need to know:
The type of test (e.g., independent t-test, paired t-test, ANOVA, regression, etc.)
The significance level
The expected effect size
The power
Fixing the significance level, the expected effect size and the sample size, we can calculate the power.
Fixing the significance level, the expected effect size and the power, we can calculate the sample size.
Fixing the significance level, the power and the sample size, we can calculate the effect size.
35 PRIOR KNOWLEDGE
General rules:
smaller effect size → larger N
smaller α or greater power → larger N
larger measurement variability → larger N
2-tailed test → larger N than for a 1-tailed test
36 GENERAL RULES
Measurement variability influence
[Figure: two panels, each showing the sampling distributions associated with H0 (mean $m_1$) and H1 (mean $m_2$), the critical value $z_{1-\alpha}$, and the power]
37 GENERAL RULES
Effect size influence
[Figure: two panels, each showing the sampling distributions associated with H0 (mean $m_1$) and H1 (mean $m_2$), the critical value $z_{1-\alpha}$, and the power]
38 GENERAL RULES
α and test type influence
[Figure: two panels showing the sampling distributions associated with H0 (mean $m_1$) and H1 (mean $m_2$) and the power, one with the critical value $z_{1-\alpha/2}$ and one with the critical value $z_{1-\alpha}$]
39 POWER & SAMPLE SIZE
44 LIMITATIONS OF SAMPLE SIZE ANALYSIS
Based on assumptions and educated guesses, the analyses give a "best case scenario" estimate of the necessary sample size.
A good strategy is to compute the required sample size for different levels of effect size, α and power, and to present N as a range instead of a single number.
[Table: N for combinations of $q_C$, $q_E$, d, HR and power; the values were not preserved in the transcription]
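The strategy of reporting N as a range can be sketched by evaluating the hazard-ratio formula from the earlier slide over a small grid. The grid values below are hypothetical choices for illustration and are not the values of the slide's table, which did not survive transcription. A one-sided α = 0.05 and equal allocation are also assumed.

```python
from math import ceil, log
from statistics import NormalDist


def total_n_hazard_ratio(hr, q_c, q_e, d, power, alpha=0.05):
    """Total N for a one-sided test of HR > 1 under the normal approximation."""
    z = NormalDist()
    z_sum = z.inv_cdf(1 - alpha) + z.inv_cdf(power)
    return ceil(z_sum**2 / (log(hr) ** 2 * q_c * q_e * d))


# Hypothetical grid of planning assumptions (not the values from the slide).
rows = []
for hr in (1.3, 1.5, 1.7):
    for d in (0.10, 0.20):
        for power in (0.80, 0.90):
            rows.append((hr, d, power, total_n_hazard_ratio(hr, 0.5, 0.5, d, power)))

print(f"{'HR':>4} {'d':>5} {'power':>6} {'N':>6}")
for hr, d, power, n in rows:
    print(f"{hr:>4} {d:>5.2f} {power:>6.0%} {n:>6}")

sizes = [n for _, _, _, n in rows]
print("N ranges from", min(sizes), "to", max(sizes))
```

Presenting the whole table, or at least its minimum and maximum, makes it visible how strongly the required N depends on the assumed effect size and event probability.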
45 ADJUSTMENTS OF SAMPLE SIZE ANALYSIS
Loss to follow-up: participants withdraw from the study or are lost to follow-up, or information on key variables is missing.
Multiple regression: multiple hypotheses are tested since many effect sizes are estimated; N > 10 × (number of prognostic factors).
Interaction: subgroup analyses require a sample size at least four times larger than analyses of the overall association.
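These adjustments can be expressed as small helper functions. The slide does not say how to correct for loss to follow-up; dividing by the expected retention fraction is a common convention and is an assumption here, as are the function names and the example numbers.

```python
from math import ceil


def inflate_for_dropout(n, dropout_rate):
    """Enrol enough subjects that about n remain after the expected dropout.

    Assumption: the common n / (1 - dropout_rate) correction, not given on the slide.
    """
    return ceil(n / (1 - dropout_rate))


def satisfies_regression_rule(n, n_prognostic_factors):
    """Rule of thumb from the slide: N > 10 x (number of prognostic factors)."""
    return n > 10 * n_prognostic_factors


def n_for_interaction(n_overall):
    """Rule of thumb from the slide: subgroup analyses need at least 4x the overall N."""
    return 4 * n_overall


# Hypothetical planning numbers, not taken from the slides.
n_required = 90
print(inflate_for_dropout(n_required, 0.25))
print(satisfies_regression_rule(n_required, 8))
print(n_for_interaction(n_required))
```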
46 SMALL SAMPLE SIZE ANALYSIS
Small sample: studies that typically have between 5 and 30 observations.
Only large effects can be detected.
There are fewer options with respect to appropriate statistical procedures, e.g. correlations, logistic regression and multilevel modeling are not appropriate.
The generalizability of the results may also be questionable.
Point estimates are less precise: the confidence interval is wider with a smaller sample size.
47 SOFTWARE
G*Power 3 (Statistical Power Analyses for Windows and Mac), Power, PASS Sample Size, Statsdirect
48 REFERENCES
Ryan TP. Sample size determination and power. John Wiley & Sons, Inc.
Chow SC, Shao J, Wang H. Sample Size Calculations in Clinical Research, 2nd Edition. Chapman & Hall/CRC.
Hsieh FY, Bloch DA, Larsen MD. A simple method of sample size calculation for linear and logistic regression. Statistics in Medicine, 1998, 17:
Garcia-Closas M, Lubin JH. Power and sample size calculations in case-control studies of gene-environmental interactions: Comments on different approaches. American Journal of Epidemiology, 1999, 149:
Faul F, Erdfelder E, Lang A-G, Buchner A. G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behavior Research Methods, 2007, 39 (2):
Statistics in medicine Lecture 3: Bivariate association : Categorical variables Proportion in one group One group is measured one time: z test Use the z distribution as an approximation to the binomial
More informationStudy Design: Sample Size Calculation & Power Analysis