Power and nonparametric methods Basic statistics for experimental researchersrs 2017
|
|
- Stephanie Tyler
- 6 years ago
- Views:
Transcription
1 Faculty of Health Sciences Outline Power and nonparametric methods Basic statistics for experimental researchersrs 2017 Statistical power Julie Lyng Forman Department of Biostatistics, University of Copenhagen Nonparametric methods 2 / 30 What s the conclusion? Errors in statistical testing Does the treatment have an effect? Three answers you can get from your statistical analysis.. Yes, there is a significant treatment effect. No, there is no evidence of a treatment effect and the confidence limits are sufficiently narrow that we can rule out a practically relevant difference. Maybe. We don t see a significant difference but the confidence limits are wide so cannot rule out there is a relevant difference either. Can we do anything to avoid the maybes? Probabilities of making the right or wrong decission Decission Truth Accept H 0 Reject H 0 H 0 true 1-α α error of type I significance level H 0 false β 1-β error of type II power Usually the significance level is fixed at α = 5%. Preferably 1 β should be at least 80% (but this is not always so). 3 / 30 4 / 30
2 Science as a learning process What does power depend on? If the power is low, then many real differences / effective treatments go undetected through our investigations and a higher proportion of the significant discoveries will be false. Sample size. True difference / effect size. Variability in outcome, i.e. the population standard deviation. Significance level (usually fixed at 5%). Today we only consider t-tests, otherwise: Other model parameters. Statistical method for analysis. Experimental design. 5 / 30 6 / 30 Detectable differences: a rough impression Choosing the relevant difference To detect a 1/2 SD difference with 80% power n = 64 is needed in each group. To detect a 1 SD difference with 80% power n = 17 is needed in each group. To detect a 2 SD difference with 80% power n = 6 is needed in each group. Based on a two sample t-test assuming outcomes are normally distributed with equal standard deviations in the two populations / under the two treatments. What difference / effect should be detected by the experiment? Principled choice: The minimum relevant difference is the smallest difference that could be of practical or clinical relevance or general scientific interest BUT: small differences are more difficult to detect.... Pragmatic choices: The smallest difference that it would be embarrasing to overlook. The minimum detectable difference that can be fould with the available sample size. 7 / 30 8 / 30
3 Estimating variability Approximate power of the two-sample t-test To make a sensible estimate of the power we need to estimate the variability in the outcome. An estimate from a pilot study (beware of statistical uncertainty). An expert guess or similar experiments in the literature. (Normal distribution: 4 SDs from lower to upper limit of the normal range.) After data has been collected, it might be a good idea to review the power calculation. Was the assumption you made about the standard deviation too optimistic or pessimistic? Could you plan better in the future? Textbook formula: The required sample size (in each of two equal size groups) to detect a difference of δ with power 1 β at significance level α is approximately given by where n = 2 (z 1 α/2 + z 1 β ) 2 (δ/σ) 2. σ is the standard deviation of the outcome, z 1 α/2 = 1.96 for α = 0.05, z 1 β = 0.84 for 1 β = (the z s are quantiles of the standard normal distribution). ATT: This formula is only valid for larger sample sizes (better use R). 9 / / 30 R: Exact power of the two-sample t-test Use the power.t.test-function in R. power.t.test(delta=1, sd=1, power=0.80, sig=0.05, type= two.sample ) Two-sample t test power calculation n = delta = 1 sd = 1 sig.level = 0.05 power = 0.8 alternative = two.sided NOTE: n is number in *each* group Note: The arguments sig=0.05 and type= two.sample are set at their defaults and can be omitted. Power of the one-sample/paired t-test The required sample size (number of pairs) to detect a difference of δ with power 1 β at significance level α is approximately n = (z 1 α/2 + z 1 β ) 2 (δ/σ d ) 2. where σ d is the standard deviation of the differences σ d. But again we can use R for exact power calculations: power.t.test(delta=1, sd=0.8, power=0.80, sig=0.05, type= paired ) Paired t test power calculation n = delta = 1 sd = 0.8 sig.level = 0.05 power = 0.8 alternative = two.sided 11 / / 30 NOTE: n is number of *pairs*, sd is std.dev. of *differences* within pai
4 The standard deviation of the differences Sometimes we do not have a natural estimate of the standard deviation of the differences, then it is useful to know the following formula: σ 2 d = 2 (1 ρ) σ 2 σ is the standard deviation of a sigle outcome (the population SD). ρ is the correlation between the two measurements in a pair (more about correlations in lecture 3) Note: If the correlation is reasonably strong (> 0.50), then the differences are less variable than the single outcomes. This is the reason why paired t-tests are often more powerful than two-sample t-tests. Attainable power and least detectable difference If you know in advance that you can only get a limited number of observations, then your power calculation should focus on... What power do I have for detecting the relevant difference? power.t.test(n=10, delta=0.5, sd=0.8) Two-sample t test power calculation power = What is the smallest difference I can detect with a decent power? power.t.test(n=10, sd=0.8, power=0.80) Two-sample t test power calculation delta = / 30 If the answers you get aren t reasonable, then it migth be a better idea to give up on the investigation instead of wasting time and money. 14 / 30 Power in other situations Outline Simple power calculators / textbook formulae are available for: Two-sample t-tests with unequal sample sizes / variances. Statistical power Comparing two frequencies (2x2-table) Simple linear regression / correlation. Nonparametric methods In any other case: Talk to a statistician about it. 15 / / 30
5 The normal assumption Alternative: nonparametric methods How important is the normal assumption for the t-tests? Small sample size, n < 10: Important for obtaining valid results. But hard to assess (experience, other studies)... Medium sample size, 10 n 30: Results are most often valid. Beware of marked skewness and/or outliers. Does it make sense to interpret the mean as the typical outcome? Larger samples, n > 30: Rarely important for obtaining valid results. Beware of many/extreme outliers. Does it make sense to interpret the mean as the typical outcome? 17 / 30 Most traditional analyses of continuous outcomes, t-tests, ANOVA, linear regression, (Pearson) correlation. rely on an assumption that data are normally distributed. Statistical methods avoiding to make such distributional assumptions are termed nonparametric or distribution free 18 / 30 Note: Distribution free is not the same as assumption free. Why use nonparametric statistics? Classical nonparametric methods and their conventional counterparts. 1. Data is obviously not normally distributed. 2. Don t know whether data is normally distributed or not because sample size is too small to tell. 3. Want to do a fast analysis without having to check whether or not data is normally distributed. J.H. Tukey referred to nonparametric methods as quick and dirty Normal distribution Paired t-test Two-sample t-test One-way ANOVA Two-way ANOVA with repeated measurements Pearson s correlation Nonparametric Sign test Wilcoxons signed rank test Man-Whitney U-test equivalent to Wilcoxon rank sum test Kruskal-Wallis test Fridman test Spearman s correlation 19 / / 30
6 Rank-based analysis Classical nonparametric analyses are carried out in two steps. First: observations are replaced with their ranks. Smallest observation gets rank 1, second smallest rank 2, etc Example: 17.5 (5), 8.5 (1), 12.6 (4), 8.7 (2), 10.8 (3). Secondly, these are used for hypothesis testing. Two-sample testing The Wilconxon-Man-Whitney test: Assumes that observations have continuous distributions. The null hypothesis is that the two populations have the same distribution (not just the same median!). Compares the ranks between the two groups. With the further assumption that the two distributions differ only by a shift in location, it is possible to get a confidence interval for this shift. If there is no difference between the groups / no effect of treatment, all distributions of ranks between the groups/treatments are equally likely due to randomisation. We have evidence against the null hypothesis if e.g. one group has all the high ranks and the other all the low ranks. 21 / / 30 Two versions of the same test. Trouble shooting: Tied data The Wilconxon rank sum test (originally for equal sample sizes): Add up the ranks from the two groups: R 1 and R 2. Use the smallest rank sum as test statistic: W = min(r 1, R 2 ). The Man-Whitney U-test (extension to unequal sample sizes): Add up the ranks from the two groups: R 1 and R 2. Compute U i = n 1 n 2 + ni(ni+1) 2 R i for i = 1, 2. Use max(u 1, U 2 ) as test statistic. The two tests are equivalent - You get the exact same p-value! Problem with ranking if some observations are identical: Solution: replace ranks with midranks (i.e. average of ranks). Example: 17.5 (5), 8.5 (1), 10.8 (3.5), 8.7 (2), 10.8 (3.5), ranks 3 and 4 are replaced by midranks 3.5 and 3.5. Traditionally tests have been adjusted for ties: Adjusted P-values are approximate, not exact. This is problematic if data is small or many observations are tied. Today exact p-values can be obatained by use of a permutationtest. R-package coin can do it, but beware of other software. 23 / / 30
7 Permutation tests Paired testing The Wilcoxon-Man-Whitney test is an example of a permutation test. The distribution of a teststatistic under the null hypothesis can be simulated by randomly assigning ranks to the two treatment groups. I.e. if treatment has no effect, it is just a random labeling. The idea is quite general and dates back to Fisher s exact test (1934). A wide range of applications is possible today using computer simulations to randomly reassigning treatments to subjects. Permutation tests do not make any distributional assumptions and are valid for all sample sizes. In case of repeated measurements the pairing/clustering must be preserved in the permutations. A range of permutation tests are available in the R-package coin. 25 / 30 Analysis is based on the differences between the paired observations. The sign test: No assumptions save from that obervation pairs have been sampled independently from the same population. The null hypothesis is that the median difference is zero. Counts the positive/negative differences (zeros don t count). The Wilcoxon signed rank test: Assumes that the differences has a symmetric distribution. The null hypothesis is that the median difference is zero. Ranks the absolute values of the differences. Note: We get a confidence interval for the median difference. 26 / 30 Beware of skewness Drawbacks of nonparametric statistics Many nonparametric methods aim solely at hypothesis testing. With few exeptions, you get no quantification from the analysis, that is no estimates or confidence intervals. Limited conclusions can be drawn from testing alone; Accept: There may or may not be a difference we might not have sufficient data to tell. Reject: It is very likely that there is a difference but we don t know how big it is and cannot compare with other studies. Power calculations are difficult: Sample median = 0 and P = 1 for the sign test, but P = 0.03 for the Wilcoxon signed rank test. We need to assume a particular distribution of the data. Then we can estimate the power from computer simulations. 27 / / 30
8 Common (mis)beliefs about nonparametric tests Nonparametric tests are less efficient (powerful) than parametric tests. This is only true if the distributional assumptions for the parametric model are correct. The efficiency of the Wilcoxon rank sum test is never less than 86% of the two-sample t-test. This is only true if the two distributions are identical save from a possible shift in location. The relative efficiency between different test depends on the true distribution of the data. R-demo # Sign test. SignTest(ckd0$aixchange) # Wilcoxon s signed rank test (assuming symmetric distribution) # approximate p-value and confidence interval with: wilcox.test(ckd0$aix0, ckd0$aix24, paired=t, conf.int=true) # More tests with R-package coin: library(coin) # Wilcoxon s signed rank test # exact p-value, but no confidence interval with: wilcoxsign_test(aix0~aix24, data=ckd0, distribution= exact ) # Wilcoxon-Man-Whitney test # exact p-value and confidence interval with: wilcox_test(aixchange~group, data=ckd.complete, + distribution= exact, conf.int=true) 29 / / 30
Non-parametric tests, part A:
Two types of statistical test: Non-parametric tests, part A: Parametric tests: Based on assumption that the data have certain characteristics or "parameters": Results are only valid if (a) the data are
More informationPower and the computation of sample size
9 Power and the computation of sample size A statistical test will not be able to detect a true difference if the sample size is too small compared with the magnitude of the difference. When designing
More informationRank-Based Methods. Lukas Meier
Rank-Based Methods Lukas Meier 20.01.2014 Introduction Up to now we basically always used a parametric family, like the normal distribution N (µ, σ 2 ) for modeling random data. Based on observed data
More informationNonparametric Statistics
Nonparametric Statistics Nonparametric or Distribution-free statistics: used when data are ordinal (i.e., rankings) used when ratio/interval data are not normally distributed (data are converted to ranks)
More informationCHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC
CHI SQUARE ANALYSIS I N T R O D U C T I O N T O N O N - P A R A M E T R I C A N A L Y S E S HYPOTHESIS TESTS SO FAR We ve discussed One-sample t-test Dependent Sample t-tests Independent Samples t-tests
More informationHYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă
HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and
More informationIntroduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.
Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of
More informationIntroduction to Nonparametric Statistics
Introduction to Nonparametric Statistics by James Bernhard Spring 2012 Parameters Parametric method Nonparametric method µ[x 2 X 1 ] paired t-test Wilcoxon signed rank test µ[x 1 ], µ[x 2 ] 2-sample t-test
More informationLecture 7: Hypothesis Testing and ANOVA
Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis
More informationST4241 Design and Analysis of Clinical Trials Lecture 9: N. Lecture 9: Non-parametric procedures for CRBD
ST21 Design and Analysis of Clinical Trials Lecture 9: Non-parametric procedures for CRBD Department of Statistics & Applied Probability 8:00-10:00 am, Friday, September 9, 2016 Outline Nonparametric tests
More informationSEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics
SEVERAL μs AND MEDIANS: MORE ISSUES Business Statistics CONTENTS Post-hoc analysis ANOVA for 2 groups The equal variances assumption The Kruskal-Wallis test Old exam question Further study POST-HOC ANALYSIS
More informationIntuitive Biostatistics: Choosing a statistical test
pagina 1 van 5 < BACK Intuitive Biostatistics: Choosing a statistical This is chapter 37 of Intuitive Biostatistics (ISBN 0-19-508607-4) by Harvey Motulsky. Copyright 1995 by Oxfd University Press Inc.
More information3. Nonparametric methods
3. Nonparametric methods If the probability distributions of the statistical variables are unknown or are not as required (e.g. normality assumption violated), then we may still apply nonparametric tests
More informationPSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests
PSY 307 Statistics for the Behavioral Sciences Chapter 20 Tests for Ranked Data, Choosing Statistical Tests What To Do with Non-normal Distributions Tranformations (pg 382): The shape of the distribution
More informationNonparametric Statistics Notes
Nonparametric Statistics Notes Chapter 5: Some Methods Based on Ranks Jesse Crawford Department of Mathematics Tarleton State University (Tarleton State University) Ch 5: Some Methods Based on Ranks 1
More informationNonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown
Nonparametric Statistics Leah Wright, Tyler Ross, Taylor Brown Before we get to nonparametric statistics, what are parametric statistics? These statistics estimate and test population means, while holding
More informationResampling Methods. Lukas Meier
Resampling Methods Lukas Meier 20.01.2014 Introduction: Example Hail prevention (early 80s) Is a vaccination of clouds really reducing total energy? Data: Hail energy for n clouds (via radar image) Y i
More informationPower and sample size calculations
Power and sample size calculations Susanne Rosthøj Biostatistisk Afdeling Institut for Folkesundhedsvidenskab Københavns Universitet sr@biostat.ku.dk April 8, 2014 Planning an investigation How many individuals
More informationGlossary. The ISI glossary of statistical terms provides definitions in a number of different languages:
Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the
More informationBusiness Statistics MEDIAN: NON- PARAMETRIC TESTS
Business Statistics MEDIAN: NON- PARAMETRIC TESTS CONTENTS Hypotheses on the median The sign test The Wilcoxon signed ranks test Old exam question HYPOTHESES ON THE MEDIAN The median is a central value
More informationParametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami
Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous
More informationNon-parametric methods
Eastern Mediterranean University Faculty of Medicine Biostatistics course Non-parametric methods March 4&7, 2016 Instructor: Dr. Nimet İlke Akçay (ilke.cetin@emu.edu.tr) Learning Objectives 1. Distinguish
More informationChapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics
Section 15.1: An Overview of Nonparametric Statistics Understand Difference between Parametric and Nonparametric Statistical Procedures Parametric statistical procedures inferential procedures that rely
More informationNonparametric Methods
Nonparametric Methods Marc H. Mehlman marcmehlman@yahoo.com University of New Haven Nonparametric Methods, or Distribution Free Methods is for testing from a population without knowing anything about the
More informationNonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health
Nonparametric statistic methods Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Measurement What are the 4 levels of measurement discussed? 1. Nominal or Classificatory Scale Gender,
More informationCHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)
FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter
More informationWhat is a Hypothesis?
What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population mean Example: The mean monthly cell phone bill in this city is μ = $42 population proportion Example:
More informationMy data doesn t look like that..
Testing assumptions My data doesn t look like that.. We have made a big deal about testing model assumptions each week. Bill Pine Testing assumptions Testing assumptions We have made a big deal about testing
More informationNon-parametric (Distribution-free) approaches p188 CN
Week 1: Introduction to some nonparametric and computer intensive (re-sampling) approaches: the sign test, Wilcoxon tests and multi-sample extensions, Spearman s rank correlation; the Bootstrap. (ch14
More informationUnit 14: Nonparametric Statistical Methods
Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based
More informationStatistics: revision
NST 1B Experimental Psychology Statistics practical 5 Statistics: revision Rudolf Cardinal & Mike Aitken 29 / 30 April 2004 Department of Experimental Psychology University of Cambridge Handouts: Answers
More informationNonparametric tests, Bootstrapping
Nonparametric tests, Bootstrapping http://www.isrec.isb-sib.ch/~darlene/embnet/ Hypothesis testing review 2 competing theories regarding a population parameter: NULL hypothesis H ( straw man ) ALTERNATIVEhypothesis
More informationAnalysis of variance (ANOVA) Comparing the means of more than two groups
Analysis of variance (ANOVA) Comparing the means of more than two groups Example: Cost of mating in male fruit flies Drosophila Treatments: place males with and without unmated (virgin) females Five treatments
More informationIntroduction to Statistical Data Analysis III
Introduction to Statistical Data Analysis III JULY 2011 Afsaneh Yazdani Preface Major branches of Statistics: - Descriptive Statistics - Inferential Statistics Preface What is Inferential Statistics? The
More informationWhat Are Nonparametric Statistics and When Do You Use Them? Jennifer Catrambone
What Are Nonparametric Statistics and When Do You Use Them? Jennifer Catrambone First, a bit about Parametric Statistics Data are expected to be randomly drawn from a normal population Minimum sample size
More informationDETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics
DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and
More informationBackground to Statistics
FACT SHEET Background to Statistics Introduction Statistics include a broad range of methods for manipulating, presenting and interpreting data. Professional scientists of all kinds need to be proficient
More informationSample Size Determination
Sample Size Determination 018 The number of subjects in a clinical study should always be large enough to provide a reliable answer to the question(s addressed. The sample size is usually determined by
More informationappstats27.notebook April 06, 2017
Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves
More informationModule 9: Nonparametric Statistics Statistics (OA3102)
Module 9: Nonparametric Statistics Statistics (OA3102) Professor Ron Fricker Naval Postgraduate School Monterey, California Reading assignment: WM&S chapter 15.1-15.6 Revision: 3-12 1 Goals for this Lecture
More informationMitosis Data Analysis: Testing Statistical Hypotheses By Dana Krempels, Ph.D. and Steven Green, Ph.D.
Mitosis Data Analysis: Testing Statistical Hypotheses By Dana Krempels, Ph.D. and Steven Green, Ph.D. The number of cells in various stages of mitosis in your treatment and control onions are your raw
More informationMATH Notebook 3 Spring 2018
MATH448001 Notebook 3 Spring 2018 prepared by Professor Jenny Baglivo c Copyright 2010 2018 by Jenny A. Baglivo. All Rights Reserved. 3 MATH448001 Notebook 3 3 3.1 One Way Layout........................................
More informationCorrelation and Simple Linear Regression
Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline
More informationSelection should be based on the desired biological interpretation!
Statistical tools to compare levels of parasitism Jen_ Reiczigel,, Lajos Rózsa Hungary What to compare? The prevalence? The mean intensity? The median intensity? Or something else? And which statistical
More informationDo not copy, post, or distribute. Independent-Samples t Test and Mann- C h a p t e r 13
C h a p t e r 13 Independent-Samples t Test and Mann- Whitney U Test 13.1 Introduction and Objectives This chapter continues the theme of hypothesis testing as an inferential statistical procedure. In
More information4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures
Non-parametric Test Stephen Opiyo Overview Distinguish Parametric and Nonparametric Test Procedures Explain commonly used Nonparametric Test Procedures Perform Hypothesis Tests Using Nonparametric Procedures
More informationIn many situations, there is a non-parametric test that corresponds to the standard test, as described below:
There are many standard tests like the t-tests and analyses of variance that are commonly used. They rest on assumptions like normality, which can be hard to assess: for example, if you have small samples,
More informationChapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.
Chapter 24 Comparing Means Copyright 2010 Pearson Education, Inc. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side. For example:
More informationStatistical Inference: Estimation and Confidence Intervals Hypothesis Testing
Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More informationSPSS Guide For MMI 409
SPSS Guide For MMI 409 by John Wong March 2012 Preface Hopefully, this document can provide some guidance to MMI 409 students on how to use SPSS to solve many of the problems covered in the D Agostino
More informationLecture 26. December 19, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.
s Sign s Lecture 26 Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University December 19, 2007 s Sign s 1 2 3 s 4 Sign 5 6 7 8 9 10 s s Sign 1 Distribution-free
More informationData Analysis and Statistical Methods Statistics 651
Data Analysis and Statistical Methods Statistics 65 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Review In the previous lecture we considered the following tests: The independent
More informationSTAT Section 5.8: Block Designs
STAT 518 --- Section 5.8: Block Designs Recall that in paired-data studies, we match up pairs of subjects so that the two subjects in a pair are alike in some sense. Then we randomly assign, say, treatment
More informationViolating the normal distribution assumption. So what do you do if the data are not normal and you still need to perform a test?
Violating the normal distribution assumption So what do you do if the data are not normal and you still need to perform a test? Remember, if your n is reasonably large, don t bother doing anything. Your
More informationANOVA - analysis of variance - used to compare the means of several populations.
12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.
More informationThings you always wanted to know about statistics but were afraid to ask
Things you always wanted to know about statistics but were afraid to ask Christoph Amma Felix Putze Design and Evaluation of Innovative User Interfaces 6.12.13 1/43 Overview In the last lecture, we learned
More informationWilcoxon Test and Calculating Sample Sizes
Wilcoxon Test and Calculating Sample Sizes Dan Spencer UC Santa Cruz Dan Spencer (UC Santa Cruz) Wilcoxon Test and Calculating Sample Sizes 1 / 33 Differences in the Means of Two Independent Groups When
More informationEverything is not normal
Everything is not normal According to the dictionary, one thing is considered normal when it s in its natural state or conforms to standards set in advance. And this is its normal meaning. But, like many
More informationTentative solutions TMA4255 Applied Statistics 16 May, 2015
Norwegian University of Science and Technology Department of Mathematical Sciences Page of 9 Tentative solutions TMA455 Applied Statistics 6 May, 05 Problem Manufacturer of fertilizers a) Are these independent
More informationPower Analysis. Introduction to Power
Power Analysis Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning When testing a specific null hypothesis (H 0 ), we
More informationChapter 27 Summary Inferences for Regression
Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test
More informationIntroduction to Statistics with GraphPad Prism 7
Introduction to Statistics with GraphPad Prism 7 Outline of the course Power analysis with G*Power Basic structure of a GraphPad Prism project Analysis of qualitative data Chi-square test Analysis of quantitative
More informationMany natural processes can be fit to a Poisson distribution
BE.104 Spring Biostatistics: Poisson Analyses and Power J. L. Sherley Outline 1) Poisson analyses 2) Power What is a Poisson process? Rare events Values are observational (yes or no) Random distributed
More informationData are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)
BSTT523 Pagano & Gauvreau Chapter 13 1 Nonparametric Statistics Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) In particular, data
More informationHypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true
Hypothesis esting Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true Statistical Hypothesis: conjecture about a population parameter
More informationMultiple Comparisons
Multiple Comparisons Error Rates, A Priori Tests, and Post-Hoc Tests Multiple Comparisons: A Rationale Multiple comparison tests function to tease apart differences between the groups within our IV when
More informationLECTURE 5. Introduction to Econometrics. Hypothesis testing
LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will
More informationSTA Module 10 Comparing Two Proportions
STA 2023 Module 10 Comparing Two Proportions Learning Objectives Upon completing this module, you should be able to: 1. Perform large-sample inferences (hypothesis test and confidence intervals) to compare
More informationContents. Acknowledgments. xix
Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables
More informationTwo-sample inference: Continuous Data
Two-sample inference: Continuous Data November 5 Diarrhea Diarrhea is a major health problem for babies, especially in underdeveloped countries Diarrhea leads to dehydration, which results in millions
More informationTransition Passage to Descriptive Statistics 28
viii Preface xiv chapter 1 Introduction 1 Disciplines That Use Quantitative Data 5 What Do You Mean, Statistics? 6 Statistics: A Dynamic Discipline 8 Some Terminology 9 Problems and Answers 12 Scales of
More informationIntroduction to Statistical Analysis
Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive
More informationGlossary for the Triola Statistics Series
Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling
More informationLecture 10: Non- parametric Comparison of Loca6on. GENOME 560, Spring 2015 Doug Fowler, GS
Lecture 10: Non- parametric Comparison of Loca6on GENOME 560, Spring 2015 Doug Fowler, GS (dfowler@uw.edu) 1 Review What do we mean by nonparametric? What is a desirable localon stalslc for ordinal data?
More informationNonparametric Tests. Mathematics 47: Lecture 25. Dan Sloughter. Furman University. April 20, 2006
Nonparametric Tests Mathematics 47: Lecture 25 Dan Sloughter Furman University April 20, 2006 Dan Sloughter (Furman University) Nonparametric Tests April 20, 2006 1 / 14 The sign test Suppose X 1, X 2,...,
More informationUnit 12: Analysis of Single Factor Experiments
Unit 12: Analysis of Single Factor Experiments Statistics 571: Statistical Methods Ramón V. León 7/16/2004 Unit 12 - Stat 571 - Ramón V. León 1 Introduction Chapter 8: How to compare two treatments. Chapter
More information1.0 Hypothesis Testing
.0 Hypothesis Testing The Six Steps of Hypothesis Testing Rejecting or Accepting the Null Samples and Data. Parametric Tests for Comparing Two Populations Testing Equality of Population Means (T Test!
More informationThe entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.
One-Way ANOVA Summary The One-Way ANOVA procedure is designed to construct a statistical model describing the impact of a single categorical factor X on a dependent variable Y. Tests are run to determine
More informationRama Nada. -Ensherah Mokheemer. 1 P a g e
- 9 - Rama Nada -Ensherah Mokheemer - 1 P a g e Quick revision: Remember from the last lecture that chi square is an example of nonparametric test, other examples include Kruskal Wallis, Mann Whitney and
More informationHarvard University. Rigorous Research in Engineering Education
Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected
More informationPower and sample size calculations
Power and sample size calculations Susanne Rosthøj Biostatistisk Afdeling Institut for Folkesundhedsvidenskab Københavns Universitet sr@biostat.ku.dk October 28 2013 Planning an investigation How many
More informationInferential Statistics
Inferential Statistics Eva Riccomagno, Maria Piera Rogantin DIMA Università di Genova riccomagno@dima.unige.it rogantin@dima.unige.it Part G Distribution free hypothesis tests 1. Classical and distribution-free
More informationDescriptive Statistics-I. Dr Mahmoud Alhussami
Descriptive Statistics-I Dr Mahmoud Alhussami Biostatistics What is the biostatistics? A branch of applied math. that deals with collecting, organizing and interpreting data using well-defined procedures.
More informationGS Analysis of Microarray Data
GS01 0163 Analysis of Microarray Data Keith Baggerly and Kevin Coombes Section of Bioinformatics Department of Biostatistics and Applied Mathematics UT M. D. Anderson Cancer Center kabagg@mdanderson.org
More informationDegrees of freedom df=1. Limitations OR in SPSS LIM: Knowing σ and µ is unlikely in large
Z Test Comparing a group mean to a hypothesis T test (about 1 mean) T test (about 2 means) Comparing mean to sample mean. Similar means = will have same response to treatment Two unknown means are different
More informationOne-way ANOVA Model Assumptions
One-way ANOVA Model Assumptions STAT:5201 Week 4: Lecture 1 1 / 31 One-way ANOVA: Model Assumptions Consider the single factor model: Y ij = µ + α }{{} i ij iid with ɛ ij N(0, σ 2 ) mean structure random
More informationCorrelation and Regression
Correlation and Regression Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 Learning Objectives Upon successful completion of this module, the student should
More informationN Utilization of Nursing Research in Advanced Practice, Summer 2008
University of Michigan Deep Blue deepblue.lib.umich.edu 2008-07 536 - Utilization of ursing Research in Advanced Practice, Summer 2008 Tzeng, Huey-Ming Tzeng, H. (2008, ctober 1). Utilization of ursing
More informationIntroduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test
Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test la Contents The two sample t-test generalizes into Analysis of Variance. In analysis of variance ANOVA the population consists
More informationProbability and Statistics
Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT
More informationChapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides
Chapter 7 Inference for Distributions Introduction to the Practice of STATISTICS SEVENTH EDITION Moore / McCabe / Craig Lecture Presentation Slides Chapter 7 Inference for Distributions 7.1 Inference for
More information1 ONE SAMPLE TEST FOR MEDIAN: THE SIGN TEST
NON-PARAMETRIC STATISTICS ONE AND TWO SAMPLE TESTS Non-parametric tests are normally based on ranks of the data samples, and test hypotheses relating to quantiles of the probability distribution representing
More informationE509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou.
E509A: Principle of Biostatistics (Week 11(2): Introduction to non-parametric methods ) GY Zou gzou@robarts.ca Sign test for two dependent samples Ex 12.1 subj 1 2 3 4 5 6 7 8 9 10 baseline 166 135 189
More informationDr. Maddah ENMG 617 EM Statistics 10/12/12. Nonparametric Statistics (Chapter 16, Hines)
Dr. Maddah ENMG 617 EM Statistics 10/12/12 Nonparametric Statistics (Chapter 16, Hines) Introduction Most of the hypothesis testing presented so far assumes normally distributed data. These approaches
More informationChapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.
Chapter 23 Inferences About Means Sampling Distributions of Means Now that we know how to create confidence intervals and test hypotheses about proportions, we do the same for means. Just as we did before,
More informationHypothesis testing, part 2. With some material from Howard Seltman, Blase Ur, Bilge Mutlu, Vibha Sazawal
Hypothesis testing, part 2 With some material from Howard Seltman, Blase Ur, Bilge Mutlu, Vibha Sazawal 1 CATEGORICAL IV, NUMERIC DV 2 Independent samples, one IV # Conditions Normal/Parametric Non-parametric
More informationPsychology 282 Lecture #4 Outline Inferences in SLR
Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations
More informationFish SR P Diff Sgn rank Fish SR P Diff Sng rank
Nonparametric tests Distribution free methods require fewer assumptions than parametric methods Focus on testing rather than estimation Not sensitive to outlying observations Especially useful for cruder
More informationChapter 7 Comparison of two independent samples
Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N
More information