Selection should be based on the desired biological interpretation!

Similar documents
Non-parametric tests, part A:

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Turning a research question into a statistical question.

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics

Lecture 7: Hypothesis Testing and ANOVA

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

Nonparametric Statistics

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Exam details. Final Review Session. Things to Review

NON-PARAMETRIC STATISTICS * (

Unit 14: Nonparametric Statistical Methods

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown

Chapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics

3 Joint Distributions 71

Comparison of two samples

Analysis of Variance (ANOVA) Cancer Research UK 10 th of May 2018 D.-L. Couturier / R. Nicholls / M. Fernandes

Comparison of Two Samples

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

Biostatistics 270 Kruskal-Wallis Test 1. Kruskal-Wallis Test

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014

Non-parametric (Distribution-free) approaches p188 CN

4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures

3. Nonparametric methods

Chapter 18 Resampling and Nonparametric Approaches To Data

One-way ANOVA Model Assumptions

Module 9: Nonparametric Statistics Statistics (OA3102)

CDA Chapter 3 part II

STATISTICS 4, S4 (4769) A2

Rank-Based Methods. Lukas Meier

Inferential Statistics

Small n, σ known or unknown, underlying nongaussian

Contents. Acknowledgments. xix

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Statistics Handbook. All statistical tables were computed by the author.

The entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.

Basic Statistical Analysis

Degrees of freedom df=1. Limitations OR in SPSS LIM: Knowing σ and µ is unlikely in large

What Are Nonparametric Statistics and When Do You Use Them? Jennifer Catrambone

Wednesday, 20 November 13. p<0.05

ANOVA - analysis of variance - used to compare the means of several populations.

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)

My data doesn t look like that..

Transition Passage to Descriptive Statistics 28

TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1

SPSS Guide For MMI 409

Data Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data?

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health

Introduction to Statistical Analysis

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

Types of Statistical Tests DR. MIKE MARRAPODI

Agonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data?

Textbook Examples of. SPSS Procedure

Nonparametric Statistics Notes

sphericity, 5-29, 5-32 residuals, 7-1 spread and level, 2-17 t test, 1-13 transformations, 2-15 violations, 1-19

Non-parametric Tests

Intuitive Biostatistics: Choosing a statistical test

Power and nonparametric methods Basic statistics for experimental researchersrs 2017

Chapter 1 - Lecture 3 Measures of Location

Quantifying differences in parasite numbers between samples of hosts

Statistical Procedures for Testing Homogeneity of Water Quality Parameters

Glossary for the Triola Statistics Series

Bootstrap tests. Patrick Breheny. October 11. Bootstrap vs. permutation tests Testing for equality of location

STATISTIKA INDUSTRI 2 TIN 4004

GROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION

Pooled Variance t Test

QUANTITATIVE TECHNIQUES

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

What is a Hypothesis?

Introduction to Nonparametric Statistics

Chapter 7 Comparison of two independent samples

Nonparametric Location Tests: k-sample

Introduction to Statistics with GraphPad Prism 7

Statistical Inference Theory Lesson 46 Non-parametric Statistics

ST4241 Design and Analysis of Clinical Trials Lecture 9: N. Lecture 9: Non-parametric procedures for CRBD

Basic Business Statistics, 10/e

Review for Final. Chapter 1 Type of studies: anecdotal, observational, experimental Random sampling

Data analysis and Geostatistics - lecture VII

Non-parametric methods

An introduction to biostatistics: part 1

1.0 Hypothesis Testing

= 1 i. normal approximation to χ 2 df > df

Exploratory Data Analysis

Analysis of variance (ANOVA) Comparing the means of more than two groups

Physics 509: Non-Parametric Statistics and Correlation Testing

Everything is not normal

Frequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<=

Correlation and Regression

Chapter 3: Statistical methods for estimation and testing. Key reference: Statistical methods in bioinformatics by Ewens & Grant (2001).

Lecture 10: Non- parametric Comparison of Loca6on. GENOME 560, Spring 2015 Doug Fowler, GS

E509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou.


Dr. Maddah ENMG 617 EM Statistics 10/12/12. Nonparametric Statistics (Chapter 16, Hines)

Statistical Hypothesis Testing with SAS and R

Analysis of variance

Analysis of Variance (ANOVA)

Transcription:

Statistical tools to compare levels of parasitism Jen_ Reiczigel,, Lajos Rózsa Hungary What to compare? The prevalence? The mean intensity? The median intensity? Or something else? And which statistical test to use if there are more of them? Selection should be based on the desired biological interpretation!

Comparing prevalence between two (or more) populations Interpretation: comparing frequency of infection Chi-square test All expected counts should be 5 Approximate p-values are obtained Fisher s exact test Computer time expensive Example: No. of infected No. of uninfected Total Sample 1 15 33 48 Sample 2 12 42 54 Sample 3 10 65 75 Both tests are based on this k x 2 table. Note that Fisher s exact test is available for k x 2 tables (not only for 2 x 2 tables)!

Comparing mean intensity between two populations Interpretation: comparing parasite population sizes Student s t-test Normal (=Gaussian) distribution of intensity or large samples ( 30) Equality of variances Welch t-test (t-test for unequal variances) Normal (=Gaussian) distribution of intensity or large samples ( 30) Bootstrap t-test Model should be specified (shift, stretch, etc.) Few computer programs available Student s t-test on transformed data (square root, log, etc.) does not compare mean intensities! Mann-Whitney U-test (=Wilcoxon s rank sum test) does not compare mean intensities in general!

Comparing mean intensity in several populations Interpretation: comparing parasite population sizes ANOVA Normal (=Gaussian) distribution of intensity Equality of variances Bootstrap ANOVA Practically no computer program available ANOVA on transformed data (square root, log, etc.) does not compare mean intensities! Kruskal-Wallis test does not compare mean intensities in general!

Comparing median intensity in two (or more) populations Interpretation: comparing typical states of hosts Mood s median test Few computer programs available Mann-Whitney U-test (=Wilcoxon s rank sum test) does not compare median intensities in general! Kruskal-Wallis test does not compare median intensities in general! Procedures similar to Mood s median test can be applied for the comparison of quantiles (quartiles, percentiles, etc.). Testing for differences in the tails of the parasite distributions. Testing for differences in population proportions beyond a certain intensity threshold. Such questions arise if there is a certain effect but it does not affect the whole population, just a part of it. In this case neither the population mean nor the median may appropriately reflect the situation.

Example: 24 4 37 0 Here neither the difference between the means nor that between the medians is significant (p = 0.1795 and p = 1.0000). However, choosing a threshold at 60 and applying Fisher s exact test to the above 2 x 2 table results in p = 0.0204. Choose always a natural threshold to get interpretable results! If choosing the threshold optimally with respect to the sample at hand, the obtained p-value will be biased, leading to wrong conclusions. (This may be the case in the above example it is just for illustration.)

Comparing distribution of intensity in two populations Interpretation: comparing typical states of hosts in terms of stochastic equality or dominance Stochastic equality means that if choosing randomly a host from population A and another from population B, then P(host from A has more parasites) = P(host from B has more parasites) Stochastic dominance means that if choosing randomly 2 hosts in the same way, then P(host from A has more parasites) > P(host from B has more parasites) Mann-Whitney U test (=Wilcoxon s rank sum test) Welch t-test on ranks Bootstrap WMW test Intensity distributions are of the same shape, the difference is a pure shift (almost never holds in parasitology!) Wider area of application but not clearly understood properties Under preparation Model should be specified (shift, stretch, etc.) Mann-Whitney U-test compares mean or median intensities only under very special (and unrealistic!) conditions, like e.g. the shift model. In general circumstances it is not at all a location test! (There exist distributions such that A < B < C < A in terms of stochastic dominance.) Mann-Whitney U-test is invariant to monotone transformations. Student s t-test on ranks is equivalent with Mann-Whitney U-test. (An idea if someone has no computer program for the Mann-Whitney test.)

Comparing distribution of intensity in several populations Interpretation: comparing typical states of hosts in terms of stochastic homogeneity or dominance Kruskal-Wallis test Intensity distributions are of the same shape, the differences are pure shifts (almost never holds in parasitology!) Kruskal-Wallis test compares mean or median intensities only under very special (and unrealistic!) conditions, like e.g. the shift model. Kruskal-Wallis test is invariant to monotone transformations. Comparing distribution of intensity in 2 or more populations Interpretation: comparing states of hosts Chi-square test Intensities should be categorized All expected counts should be 5 Approximate p-values are obtained