Biostatistics 270 Kruskal-Wallis Test 1. Kruskal-Wallis Test

Similar documents
CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

ANOVA Randomized Block Design

Multiple Pairwise Comparison Procedures in One-Way ANOVA with Fixed Effects Model

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics

Nonparametric Statistics

4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Non-parametric tests, part A:

Statistical Inference Theory Lesson 46 Non-parametric Statistics

Lecture 7: Hypothesis Testing and ANOVA

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)

ST4241 Design and Analysis of Clinical Trials Lecture 7: N. Lecture 7: Non-parametric tests for PDG data

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data

Agonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data?

Non-parametric (Distribution-free) approaches p188 CN

Data analysis and Geostatistics - lecture VII

16. Nonparametric Methods. Analysis of ordinal data

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /8/2016 1/38

Data Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data?

Types of Statistical Tests DR. MIKE MARRAPODI

Workshop Research Methods and Statistical Analysis

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown

13: Additional ANOVA Topics

I i=1 1 I(J 1) j=1 (Y ij Ȳi ) 2. j=1 (Y j Ȳ )2 ] = 2n( is the two-sample t-test statistic.

Chapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics

Textbook Examples of. SPSS Procedure

Dr. Maddah ENMG 617 EM Statistics 10/12/12. Nonparametric Statistics (Chapter 16, Hines)

Analysis of variance (ANOVA) Comparing the means of more than two groups

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Rank-Based Methods. Lukas Meier

Non-parametric methods

Analysis of Variance. Read Chapter 14 and Sections to review one-way ANOVA.

3. Nonparametric methods

Lecture 14: ANOVA and the F-test

Pooled Variance t Test

Week 14 Comparing k(> 2) Populations

Chapter 18 Resampling and Nonparametric Approaches To Data

BIO 682 Nonparametric Statistics Spring 2010

Degrees of freedom df=1. Limitations OR in SPSS LIM: Knowing σ and µ is unlikely in large

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

22s:152 Applied Linear Regression. Take random samples from each of m populations.

CDA Chapter 3 part II

Selection should be based on the desired biological interpretation!

Inferential Statistics

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

Empirical Power of Four Statistical Tests in One Way Layout

Chapter Seven: Multi-Sample Methods 1/52

Analysis of Variance (ANOVA) Cancer Research UK 10 th of May 2018 D.-L. Couturier / R. Nicholls / M. Fernandes

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

Rama Nada. -Ensherah Mokheemer. 1 P a g e

Comparison of two samples

SPSS Guide For MMI 409

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Statistical Significance of Ranking Paradoxes

Tentative solutions TMA4255 Applied Statistics 16 May, 2015

Kolmogorov-Smirnov Test for Goodness of Fit in an ordered sequence

Multiple comparisons - subsequent inferences for two-way ANOVA

Exam details. Final Review Session. Things to Review

Lec 3: Model Adequacy Checking

Module 9: Nonparametric Statistics Statistics (OA3102)

Analysis of Variance

Contents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47

Basic Business Statistics, 10/e

Non-parametric Tests

S D / n t n 1 The paediatrician observes 3 =

NON-PARAMETRIC STATISTICS * (

Extending the Robust Means Modeling Framework. Alyssa Counsell, Phil Chalmers, Matt Sigal, Rob Cribbie

Inferences About the Difference Between Two Means

Contents. Acknowledgments. xix

STATISTIKA INDUSTRI 2 TIN 4004

Nonparametric Location Tests: k-sample

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

My data doesn t look like that..

3 Joint Distributions 71

CORELATION - Pearson-r - Spearman-rho

Introduction to Nonparametric Statistics

Biostatistics 380 Multiple Regression 1. Multiple Regression

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution

Analysis Of Variance Compiled by T.O. Antwi-Asare, U.G

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers

Kruskal-Wallis and Friedman type tests for. nested effects in hierarchical designs 1

One-Way Analysis of Variance (ANOVA)

Nonparametric Statistics Notes

Non-Parametric Statistics: When Normal Isn t Good Enough"

Application of Variance Homogeneity Tests Under Violation of Normality Assumption

Chapter 12. Analysis of variance

Non-parametric Hypothesis Testing

Why should I use a Kruskal-Wallis test? (With Minitab) Why should I use a Kruskal-Wallis test? (With SPSS)

Chap The McGraw-Hill Companies, Inc. All rights reserved.

1.0 Hypothesis Testing

Comparing the means of more than two groups

Transcription:

Biostatistics 270 Kruskal-Wallis Test 1 ORIGIN 1 Kruskal-Wallis Test The Kruskal-Wallis is a non-parametric analog to the One-Way ANOVA F-Test of means. It is useful when the k samples appear not to come from underlying Normal Distributions, or when variance in the different samples are of greatly different magnitudes (non-homogeneous). As with other rank-based tests, it does not have as much power as the fully parametric tests, but nevertheless enjoys wide use. Note that when the number of samples k=2, this test is identical to the Mann-Whitney Test. Data Structure: k groups with not necessarily the same numbers of observations and different means. Let index i,j indicate the ith column (treatment class) and jth row (object). Assumptions: - Observations in each class(block) are a random sample. - Observations in each block are independent of observations in other class. - Underlying distribution of observations in each cell are continuous. - Measurement scale is at least ordinal. One-Way ANOVA Treatment Classes: Objects (Replicates) #1 #2 # #k 1 2 n n1 n2 n nk means: X1bar X2bar Xbar Xkbar Zar Example 10.10 number loc rank RankSum 4.2 1 10 9.6 1 12 H 0 : = 0 < No population differences 5 10.2 1 1 H 1 : 0 2 12.1 1 14 < Two Sided Test 1 14 1 15 64 7 5.1 2 2 5.5 2 4 10 6. 2 6 9 6.6 2 7 6.4 2 11 0 14 4.1 1 15 5.4 1 5. 5 11 6.9 12 7. 9 26 Hypotheses: Criterion for Normal Approximation: IF n i 5 THEN Normal Approximation Applies OTHERWISE use Special Tables e.g. Rosner Table 15 p. 44

Biostatistics 270 Kruskal-Wallis Test 2 Normal Approximation: Rank Data and Sum: - Pool the data over all treatment classes - Total sample size N = n i - Assign Data to Ranks. In the case of ties, t observations in a rank are assigned the appropriate average rank. - Compute the Rank Sum (R i ) for each treatment class i. N 15 k R 64 n 5 1 1 R 0 n 5 2 2 R 26 n 5 Test Statistic: i 1 12 N ( N 1).72 R i 2 ( N 1) n i i < where R i are the Rank sums for each treatment class i IF no ties, THEN: H < no correction factor in this case... OTHERWISE: m 0 correction factor > 1 m j 1 t j t j N N Sampling Distribution of the Test Statistic H: H x < t represent the number of observations that are tied in groups 1 to g < Corrected Test Statistic If Assumptions hold, H 0 is true, and assuming Normal Approximation then H ~ 2 (k-1) Critical Value of the Test: 0.05 < Probability of Type I error must be explicitly set C qchisq 1 k 1 C 5.9915 < Normal approximation < Note: df = (k-1) Decision Rule: C 5.70 < Critical Value from Zar Appendix B-1 IF H > C THEN REJECT H 0 OTHERWISE ACCEPT H 0 H.72 Probability Value: P ( 1 pchisq( Hk 1) ) P 0.012 < Normal approximation

Biostatistics 270 Kruskal-Wallis Test Zar Example 10.11 Hypotheses: H 0 : = 0 H 1 : 0 < No population differences < Two Sided Test ph pond rank RankSum n 7.6 1 1 7.69 1 2 7.7 1.5 7.7 1.5 7.72 1 7.7 1 10 7.7 1 10 7.76 1 17 55 7.71 2 6 7.7 2 10 7.74 2 1.5 7.74 2 1.5 7.7 2 20 7.7 2 20 7. 2 2.5 7.1 2 26 12.5 7.74 1.5 7.75 16 7.77 1 7.7 20 7. 2.5 7.1 26 7.4 2 145 7 7.71 4 6 7.71 4 6 7.74 4 1.5 7.79 4 22 7.1 4 26 7.5 4 29 7.7 4 0 7.91 4 1 16.5 Criterion for Normal Approximation: - IF n i 5 THEN Normal Approximation Applies OTHERWISE use Special Tables e.g. Rosner Table 15 p. 44

Biostatistics 270 Kruskal-Wallis Test 4 Normal Approximation: Rank Data and Sum: - Pool the data over all treatment classes - Total sample size N = n i - Assign Data to Ranks. In the case of ties, t observations in a rank are assigned the appropriate average rank. - Compute the Rank Sum (R i ) for each treatment class i. k 4 R Test Statistic: 55 12.5 145 16.5 i 1 k 12 R i 2 < where R i are the Rank sums ( N 1) N ( N 1) n for each treatment class i i i 11.761 IF no ties, THEN: H x < no correction factor... OTHERWISE: 2 m 7 < t represent the number of t 4 tied observations in each of the m groups of tied ranks 2 m t j t j correction factor > 1 j 1 0.9944 N N H H 11.945 < Corrected Test Statistic in this case Distribution of the Test Statistic H: n N n N 1 7 If H 0 is true and assuming Normal Approximation then H ~ 2 (k-1) Critical Value of the Test: 0.05 < Probability of Type I error must be explicitly set C qchisq 1 k 1 C 7.147 < Normal approximation < Note: df = (k-1) Decision Rule: IF H > C THEN REJECT H 0 OTHERWISE ACCEPT H 0

Biostatistics 270 Kruskal-Wallis Test 5 Probability Value: P ( 1 pchisq( Hk 1) ) P 0.0076 < Normal approximation Prototype in R: #KRUSKAL WALLIS TEST #ZAR EXAMPLE 10.10 ZAR=read.table("c:/DATA/Biostascs/ZarEX10.10R.txt") ZAR aach(zar) kruskal.test(number,loc) Kruskal Wallis rank sum test data: number and loc Kruskal Wallis chi squared =.72, df = 2, p value = 0.0127 #ZAR EXAMPLE 10.11 ZAR=read.table("c:/DATA/Biostascs/ZarEX10.11R.txt") ZAR aach(zar) kruskal.test(ph,pond) Kruskal Wallis rank sum test data: ph and pond Kruskal Wallis chi squared = 11.945, df =, p value = 0.007579