Non-Parametric Statistics: When Normal Isn't Good Enough


1 Non-Parametric Statistics: When Normal Isn't Good Enough
Professor Ron Fricker
Naval Postgraduate School
Monterey, California
1/28/13

2 A Bit About Me
Academic credentials:
- Ph.D. and M.A. in Statistics, Yale University
- M.S. in Ops Research, George Washington University
- B.S. in Ops Analysis, U.S. Naval Academy
(Some) research interests:
- Biosurveillance: how to quickly detect a bioterrorist attack
- Survey design and analysis
- Quality control and statistical process control
- Military manpower and personnel issues: recruiting, retention, effects of deployment
Contact information:
- rdfricker@nps.edu

3 Outline
- Discuss advantages and disadvantages of nonparametric tests
- Describe some nonparametric tests
  - One sample data
  - Paired data
  - Multiple groups
- Illustrate application with various real-world datasets
- Show how to implement them in Excel, JMP, and R

4 Parametric vs. Nonparametric Hypothesis Testing
Parametric hypothesis testing:
- The distribution of the test statistic is specified (often normal)
- Often follows from the Central Limit Theorem, but sometimes the CLT assumptions don't fit or apply
  - Perhaps the sample size is too small or the underlying population distribution is too skewed
  - For ANOVA, the CLT doesn't apply: the data are assumed to follow a normal distribution
Nonparametric hypothesis testing:
- Does not assume a particular probability distribution
- Often called "distribution free"
- Generally based on ordering or order statistics

5 Challenges in Parametric Hypothesis Testing
- Sometimes experiments give responses that defy exact quantification
  - Want to rank the "utility" of four weapons systems
  - Gives an ordering, but can be impossible to say things like "System A is twice as useful as B"
- Sometimes distributional assumptions of parametric tests are violated
  - Want to compare two LVS maintenance programs, but data clearly non-normal
- If the data do not fit the assumptions of the (parametric) tests, what to do?

6 Advantages of Nonparametric Tests
- Nonparametric tests make less stringent demands on the data
  - E.g., they require fewer assumptions
  - Usually require independent observations (or independence of paired differences)
  - Sometimes assume continuity of the measure
- Can be more appropriate:
  - When measures are not precise
  - For ordinal data where scale is not obvious
  - When only ordering of data is available

7 Disadvantages of Nonparametric Tests
- They may "throw away" information
  - E.g., the sign test only uses the signs (+ or -) of the data, not the numeric values
  - If the other information is available and there is an appropriate parametric test, that test will be more powerful
- The trade-off:
  - Parametric tests are more powerful if the assumptions are met
  - Nonparametric tests provide a more general result if they are powerful enough to reject

8 (Some) Nonparametric Tests
Parametric test: equivalent nonparametric test
- One-sample t-test: sign test, signed rank test
- Two-sample t-test: sign test, Wilcoxon rank sum test / Mann-Whitney U test
- Paired t-test: sign test, signed rank test
- One-way ANOVA: Kruskal-Wallis test
Also:
- Friedman test
- Bootstrap-based methods
- Spearman's rank correlation coefficient
- Kolmogorov-Smirnov test for comparing two distributions
- McNemar's test

9 Sign Test Hypotheses and Assumptions
- Basic nonparametric test procedure to assess the plausibility of the null
- Assumption: observations $x_1, \ldots, x_n$ are independent and identically distributed from some unknown distribution $F$
- Hypotheses: $H_0: F(x) = p_0$ vs. $H_a: F(x) \neq p_0$
- Common to test $p_0 = 0.5$; i.e., test the median
  - For symmetric distributions, equivalent to testing the mean
- Can also test quartiles or any other percentile

10 Sign Test: Methodology
- First, delete all observations equal to $x$ (and decrement $n$ accordingly)
- Calculate $S(x) = \#\{x_i \le x\}$ and note that if the null hypothesis is true, $S(X) \sim \mathrm{Bin}(n, p_0)$
- Test the appropriate alternative hypothesis, where $p = F(x)$:
  - $H_a: p > p_0$: p-value $= \Pr(S(X) \ge S(x)) = 1 - \mathrm{BINOMDIST}(S(x) - 1, n, p_0, 1)$
  - $H_a: p < p_0$: p-value $= \Pr(S(X) \le S(x)) = \mathrm{BINOMDIST}(S(x), n, p_0, 1)$
  - $H_a: p \neq p_0$: p-value $= 2\Pr(S(X) \ge S(x))$ if $S(x) > np_0$, and $2\Pr(S(X) \le S(x))$ if $S(x) < np_0$
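The exact sign test procedure above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `binom_cdf` and `sign_test` and the sample data are hypothetical, and `binom_cdf` simply plays the role of Excel's cumulative `BINOMDIST`.

```python
from math import comb

def binom_cdf(k, n, p):
    """Pr(S <= k) for S ~ Bin(n, p); analogue of BINOMDIST(k, n, p, 1)."""
    if k < 0:
        return 0.0
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(min(k, n) + 1))

def sign_test(data, x0, p0=0.5, alternative="two-sided"):
    """Exact sign test of H0: F(x0) = p0. Returns (S, n, p_value)."""
    kept = [v for v in data if v != x0]      # delete observations equal to x0
    n = len(kept)
    s = sum(1 for v in kept if v < x0)       # S(x0): number of observations below x0
    upper = 1.0 - binom_cdf(s - 1, n, p0)    # Pr(S(X) >= S(x))
    lower = binom_cdf(s, n, p0)              # Pr(S(X) <= S(x))
    if alternative == "greater":             # H_a: p > p0
        p_value = upper
    elif alternative == "less":              # H_a: p < p0
        p_value = lower
    elif s > n * p0:                         # two-sided, upper tail doubled
        p_value = min(1.0, 2 * upper)
    elif s < n * p0:                         # two-sided, lower tail doubled
        p_value = min(1.0, 2 * lower)
    else:
        p_value = 1.0
    return s, n, p_value

# Hypothetical data: is 10 a plausible median?
sample = [3, 12, 15, 8, 22, 10, 31, 18, 27, 14, 40, 11]
s, n, p_value = sign_test(sample, x0=10)
print(s, n, p_value)
```

Here one observation equals 10 and is dropped, leaving n = 11 with S = 2 observations below 10, so the two-sided p-value is twice the lower binomial tail.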

11 Large Sample Approximation
- For large $n$, the binomial can be approximated by the normal:
  $\frac{S(x)}{n} \sim N\left(F(x), \frac{F(x)(1 - F(x))}{n}\right) = N\left(p_0, \frac{p_0(1 - p_0)}{n}\right)$
- Requirement: $np_0 > 5$ and $n(1 - p_0) > 5$
- Then, for a two-sided test, with
  $z = \frac{S(x) - np_0}{\sqrt{np_0(1 - p_0)}}$
  the p-value is approximately $2\Phi(-|z|)$
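A minimal sketch of the large-sample calculation, using only the Python standard library. The names `norm_cdf` and `sign_test_normal` are hypothetical, and the inputs (S = 21 out of n = 30) are made-up values chosen for illustration, not data from the slides.

```python
from math import erf, sqrt

def norm_cdf(z):
    """Standard normal CDF, Phi(z), via the error function."""
    return 0.5 * (1.0 + erf(z / sqrt(2.0)))

def sign_test_normal(s, n, p0=0.5):
    """Normal approximation to the sign test. Returns (z, two-sided p-value)."""
    if n * p0 <= 5 or n * (1 - p0) <= 5:
        raise ValueError("need n*p0 > 5 and n*(1-p0) > 5 for the approximation")
    z = (s - n * p0) / sqrt(n * p0 * (1 - p0))
    return z, 2.0 * norm_cdf(-abs(z))

# Hypothetical count: 21 of 30 observations fall below the tested value.
z, p_two_sided = sign_test_normal(21, 30)
print(round(z, 2), round(p_two_sided, 4))
```

For a one-sided alternative, the corresponding tail probability `norm_cdf(-abs(z))` would be used without doubling, provided the data fall on the side the alternative predicts.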

12 Example #2: MK-48 LVS Down Time
- Data set has 9,505 observations
  - Clearly not normally distributed
- Mean of all 9,505 obs = … days and the median is 35 days
- Think of this set as the "population"
[Figure: histogram of down time; axes: Days Down vs. Number]

13 Example #2, continued
- What if only had 30 observations
  - Okay to assume mean is normally distributed?
  - Nope; in this case, n=30 is not sufficient for the CLT
[Figure: Histogram of Means of Random Samples of Size n; axes: means vs. Density]

14 Example #2, continued
- Then let's use the sign test to test whether the median down time is greater than 10 days
- Since $np_0 = 30 \times 0.5 = 15 > 5$, can do large sample calc:
  $z = \frac{S(x) - np_0}{\sqrt{np_0(1 - p_0)}} = \frac{S(x) - 15}{\sqrt{30(0.5)(1 - 0.5)}} = 2.19$
- So, p-value $\approx \Phi(-2.19)$

15 Sign Test in Other Software: JMP
[Screenshot annotations: "$p_0$ and $1 - p_0$"; "estimated p-value"]

16 Sign Test in Other Software: R
- Just use the binom.test() function
- From previous example:

17 Sign Test for Paired Observations
- Consider $n$ pairs of observations $(x_i, y_i)$
  - $x_1, \ldots, x_n$ is a random sample from $F$, and
  - $y_1, \ldots, y_n$ is a random sample from $G$, where $G(y) = F(y - \theta)$ for some unknown value $\theta$
- Want to test the hypothesis that the distribution of the Xs and Ys is the same except perhaps for the location: $H_0: \theta = 0$ vs. $H_a: \theta \neq 0$
- Can easily adapt the sign test to this problem
  - Idea: define $d_i = x_i - y_i$. Then under the null hypothesis, the probability that $d_i$ is positive is 0.5
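The paired adaptation can be illustrated with a short Python sketch. The function name `paired_sign_test` and the two samples are hypothetical; zero differences are dropped, mirroring the deletion of ties in the one-sample sign test, and the two-sided p-value doubles the smaller binomial tail under Bin(n, 0.5).

```python
from math import comb

def paired_sign_test(x, y):
    """Two-sided sign test on paired differences d_i = x_i - y_i.

    Returns (number of positive differences, n after dropping zeros, p-value).
    """
    d = [a - b for a, b in zip(x, y) if a != b]   # drop zero differences
    n = len(d)
    pos = sum(1 for v in d if v > 0)              # count of positive d_i
    tail = min(pos, n - pos)                      # size of the smaller tail
    # Under H0 the count is Bin(n, 0.5), so Pr(S <= tail) = sum C(n, i) / 2^n.
    p_value = min(1.0, 2 * sum(comb(n, i) for i in range(tail + 1)) / 2**n)
    return pos, n, p_value

# Hypothetical paired measurements.
x = [12, 15, 11, 18, 20, 14, 16, 19]
y = [10, 15, 9, 12, 21, 11, 13, 15]
pos, n, p_value = paired_sign_test(x, y)
print(pos, n, p_value)
```

With these made-up pairs, one difference is zero and is discarded, six of the remaining seven differences are positive, and the two-sided p-value is 16/128.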

18 Location Shift Assumption
[Figure: curves $F(y)$ and $G(y)$ separated by a shift $\theta$, with $G(y) = F(y - \theta)$; horizontal axis $y$]

19 Example #3: INSURV Material Inspections vs. Ship Trials
- For data, test whether there is a location difference in percent SAT for ship trials versus MIs
- Solution:

20 Example #4: AAAV Maintenance Rates at I and III MEF
- Test whether the AAAV maintenance rates are different between the two MEFs
- Perhaps could use a parametric approach to test the mean maintenance rates:
[Figures: Normal Probability Plot for I MEF; Normal Probability Plot for III MEF; Normal Probability Plot for I MEF - III MEF; axes: Theoretical Quantiles vs. Sample Quantiles]

21 Example #4: AAAV Maintenance Rates at I and III MEF
- Paired t-test (parametric test) result
- Pick your preferred software output

22 Example #4: AAAV Maintenance Rates at I and III MEF
- Sign test confirms I MEF rates higher without having to assume normality

23 Signed Rank Test Hypotheses and Assumptions
- Most often used to make inferences about the median of an unknown distribution $F$
- Assumptions:
  - Observations are independent and identically distributed from $F$
  - If the distribution is symmetric, median = mean and both can be denoted as $\mu = F^{-1}(0.5)$
- Hypotheses: $H_0: \mu = \mu_0$ vs. $H_a: \mu \neq \mu_0$
- Signed rank test is more sensitive than the sign test
  - It uses the magnitudes of the deviations from the median as well as the signs in its calculations

24 Signed Rank Test: Methodology (1)
- First, calculate deviations from the hypothesized mean value: $d_i(\mu_0) = x_i - \mu_0$
- Compute the ranks of the absolute values of the deviations: $r_i(\mu_0) = \mathrm{rank}(|d_i(\mu_0)|)$
  - Delete any observations equal to $\mu_0$
  - If two observations are tied, give them both the average rank
- Calculate $S_+(\mu_0)$, the sum of the ranks of the positive deviations

25 Interpretation of the Signed Rank Test Statistic

26 Signed Rank Test: Methodology (3)
- If $S_+(\mu_0)$ is far from $n(n+1)/4$, then reject the null
  - For small sample sizes, tables/software are necessary to determine "far"
  - If the sample size is large, use the normal approximation to judge "far"
- The test statistic
  $Z_+(\mu_0) = \frac{S_+(\mu_0) - n(n+1)/4}{\sqrt{n(n+1)(2n+1)/24}}$
  is approximately standard normally distributed

27 Example #2, continued
- Calcs give $S_+(\mu_0) = 407.5$
- Table: reject at significance level $\alpha < 0.01$
- Large sample calcs:
  $Z_+(\mu_0) = \frac{S_+(\mu_0) - n(n+1)/4}{\sqrt{n(n+1)(2n+1)/24}} = 3.6$
- So, p-value $\approx \Phi(-3.6)$
- Conclude median > 10
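The signed rank steps (deviations, average ranks for ties, sum of positive ranks, normal approximation) can be sketched in Python as follows. The function names and the small data set are hypothetical. The final check plugs in the example's $S_+ = 407.5$ and assumes n = 30 (the sample size from the earlier Example #2 slide, with no observations deleted), which is an assumption rather than something the slide states.

```python
from math import sqrt

def signed_rank_statistic(data, mu0):
    """Return (S_plus, n): sum of ranks of positive deviations from mu0."""
    d = [v - mu0 for v in data if v != mu0]       # delete observations equal to mu0
    n = len(d)
    order = sorted(range(n), key=lambda i: abs(d[i]))
    ranks = [0.0] * n
    i = 0
    while i < n:                                  # walk runs of tied |d| values
        j = i
        while j + 1 < n and abs(d[order[j + 1]]) == abs(d[order[i]]):
            j += 1
        avg_rank = (i + j) / 2.0 + 1.0            # average of ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    s_plus = sum(r for r, v in zip(ranks, d) if v > 0)
    return s_plus, n

def signed_rank_z(s_plus, n):
    """Large-sample statistic Z_+ = (S_+ - n(n+1)/4) / sqrt(n(n+1)(2n+1)/24)."""
    return (s_plus - n * (n + 1) / 4.0) / sqrt(n * (n + 1) * (2 * n + 1) / 24.0)

# Hypothetical data tested against mu0 = 10.
s_plus, n = signed_rank_statistic([12, 7, 15, 10, 13, 4, 18], mu0=10)
print(s_plus, n)

# Check against the example's S_+ = 407.5, assuming n = 30.
z_example = signed_rank_z(407.5, 30)
print(round(z_example, 1))
```

Under the n = 30 assumption the statistic rounds to 3.6, which agrees with the value reported on the slide.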

28 Signed Rank Variants
- Can also use the signed rank test for paired data
  - Do the test on the differences of the pairs

29 Signed Rank Test in Other Software: JMP

30 Signed Rank Test in Other Software: R
- Use the wilcox.test() function for two sample tests

31 Rank Sum Test Hypotheses and Assumptions
- One- or two-sided test for hypotheses about the means of two independent samples
  - $x_1, \ldots, x_n$ is a random sample from $F$, and
  - $y_1, \ldots, y_m$ is a random sample from $G$, where $G(y) = F(y - \theta)$ for some unknown value $\theta$
- Want to test the hypothesis that the distribution of the Xs and Ys is the same except perhaps for the location: $H_0: \theta = 0$ vs. $H_a: \theta \neq 0$

32 Idea of the Test
- If two samples come from the same distribution, the data ought to be mixed in together
  - If not, it is likely that one sample of data will be systematically higher or lower than the other sample
- Use ranks to quantify where the data fall
  - I.e., smallest observation gets rank 1, next smallest gets rank 2, etc.
- Sum of the ranks quantifies whether the whole sample is systematically different

33 Calculating the Test Statistic
- In JMP, assuming no ties, the test statistic is
  $Z = \frac{S_x - E(S_x)}{\sqrt{\mathrm{Var}(S_x)}}$
  where $E(S_x) = m(m+n+1)/2$ and $\mathrm{Var}(S_x) = mn(m+n+1)/12$
- There are other ways to calculate
- And there are details for handling ties
- We'll just let JMP or R calculate for us

34 Example #5
- Comparison of age distribution of US vs. non-US Iraq war casualties
- 4,123 casualties from … to …
  - 3,820 US (n) and 301 non-US (m)
[Figures: Age Distribution of US Casualties; Age Distribution of non-US Casualties; non-US vs. US comparison; axes: Age (years) vs. Frequency]

35 Calculating in JMP
- Analyze > Fit Y by X
  - Be sure X is nominal and Y is continuous
- Under red triangle > Nonparametric > Wilcoxon Test
- Check the calcs:
  - $E(S_x) = m(m+n+1)/2 = 295(4{,}112)/2 = 606{,}520$
  - $\mathrm{Var}(S_x) = mn(m+n+1)/12 = 295 \times 3{,}816 \times 4{,}112 / 12 = 385{,}746{,}720$
  - $Z = \frac{S_x - E(S_x)}{\sqrt{\mathrm{Var}(S_x)}} = \frac{778{,}\ldots - 606{,}520}{\sqrt{385{,}746{,}720}}$
- Conclusion: Location different
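As a rough illustration of the rank sum calculation (no ties), the sketch below ranks the pooled data, sums the ranks of the sample of size m, and standardizes using the mean and variance formulas from the slides. The function name `rank_sum_z` and the two small samples are hypothetical; the last two lines only re-check the slide's arithmetic for m = 295 and n = 3,816.

```python
from math import sqrt

def rank_sum_z(sample_m, sample_n):
    """Rank sum statistic for the first sample (size m), assuming no ties.

    Returns (S, E(S), Var(S), Z).
    """
    m, n = len(sample_m), len(sample_n)
    pooled = sorted(sample_m + sample_n)
    if len(set(pooled)) != len(pooled):
        raise ValueError("this sketch assumes no tied observations")
    rank = {v: i + 1 for i, v in enumerate(pooled)}   # smallest value gets rank 1
    s = sum(rank[v] for v in sample_m)                # sum of ranks of first sample
    mean = m * (m + n + 1) / 2
    var = m * n * (m + n + 1) / 12
    return s, mean, var, (s - mean) / sqrt(var)

# Hypothetical samples.
s, mean, var, z = rank_sum_z([1.1, 2.3, 3.8], [2.9, 4.5, 5.2, 6.0])
print(s, mean, var, round(z, 3))

# Re-check the slide's expected value and variance for m = 295, n = 3,816.
m, n = 295, 3816
print(m * (m + n + 1) // 2, m * n * (m + n + 1) // 12)
```

A strongly negative or positive Z indicates that the first sample's ranks are systematically low or high relative to what mixing under the null would give.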

36 Calculating in R
- Use the wilcox.test() function with option paired=FALSE

37 Kruskal-Wallis Test Hypotheses and Assumptions
- Generalization of the rank sum test for k > 2 samples
  - $n_T$ observations divided into k groups
  - Nonparametric alternative to one-way ANOVA
- Null hypothesis is that all k samples are from the same population
- Assumption: error terms are identically distributed
  - Need not be a normal distribution

38 Kruskal-Wallis Test: Methodology
- Idea: combine the k samples into one large sample, order them, and rank the observations from 1 to $n_T$
- Denote the rank of observation $x_{ij}$ as $r_{ij}$, $1 \le i \le k$, $1 \le j \le n_i$
- Further denote the average ranks as $\bar{r}_{1\cdot}, \ldots, \bar{r}_{k\cdot}$, where
  $\bar{r}_{i\cdot} = \frac{r_{i1} + \cdots + r_{in_i}}{n_i}$

39 Calculate the Test Statistic
- The test statistic is
  $H = \frac{12}{n_T(n_T + 1)} \sum_{i=1}^{k} n_i \bar{r}_{i\cdot}^2 - 3(n_T + 1)$
- Calculate the p-value as $\Pr(X > H)$, where $X$ has a chi-square distribution with $k - 1$ df
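The statistic H can be computed directly from the pooled ranks, as in this minimal Python sketch (no ties assumed). The function name `kruskal_wallis_h` and the three groups are hypothetical. The p-value line relies on the fact that a chi-square distribution with 2 df has upper tail $\Pr(X > H) = e^{-H/2}$, so it only applies to this k = 3 example; other values of k need a general chi-square tail function.

```python
from math import exp

def kruskal_wallis_h(groups):
    """Kruskal-Wallis H for a list of samples, assuming no tied observations."""
    pooled = sorted(v for g in groups for v in g)
    n_t = len(pooled)
    if len(set(pooled)) != n_t:
        raise ValueError("this sketch assumes no tied observations")
    rank = {v: i + 1 for i, v in enumerate(pooled)}    # ranks 1..n_T in pooled sample
    total = 0.0
    for g in groups:
        mean_rank = sum(rank[v] for v in g) / len(g)   # average rank of group i
        total += len(g) * mean_rank ** 2               # n_i * (average rank)^2
    return 12.0 / (n_t * (n_t + 1)) * total - 3.0 * (n_t + 1)

# Hypothetical data: three perfectly separated groups.
groups = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]
h = kruskal_wallis_h(groups)
p_value = exp(-h / 2.0)   # valid only because k - 1 = 2 df here
print(round(h, 3), round(p_value, 4))
```

For these groups the average ranks are 2, 5, and 8, giving H = 7.2.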

40 Example #6: Helo Back Pain Survey
- Survey of 683 US Navy helo pilots and copilots about back pain in the cockpit
  - Question 29: How often do you have back and/or neck pain during or immediately after a flight? (1=Rarely, 5=Very Often)
- Compare by question 11: In what type of helicopter do you have the most flight hours? (H-46, H-53, TH-57, H-60)
- 553 respondents answered both questions:

41 Kruskal-Wallis Test in JMP
- Analyze > Fit Y by X > red triangle > Nonparametric > Wilcoxon Test
  - Same as Wilcoxon rank sum, just need more than two levels in X
- p-value: cannot reject null

42 Parametric vs. Nonparametric
[Screenshots: One-way ANOVA; Kruskal-Wallis Test]

43 Calculating in R
- Use the kruskal.test() function
- No statistical difference by type of helo

44 Good References
- My OA3102 course notes: faculty.nps.edu/rdfricke/oa3102.htm
- Hayter, A.J., Probability and Statistics for Engineers and Scientists, Duxbury Press, 2006.
- Wackerly, D., W. Mendenhall, and R.L. Scheaffer, Mathematical Statistics with Applications, Duxbury Press, 2007.
- Conover, W.J., Practical Nonparametric Statistics, Wiley, 1999.

45 What We Covered
- Discussed advantages and disadvantages of nonparametric tests
- Described some nonparametric tests
  - One sample data
  - Paired data
  - Multiple groups
- Illustrated application with various real-world datasets
- Showed how to implement them in Excel, JMP, and R


More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 1 1-1 Basic Business Statistics 11 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Basic Business Statistics, 11e 009 Prentice-Hall, Inc. Chap 1-1 Learning Objectives In this chapter,

More information

Biostatistics 270 Kruskal-Wallis Test 1. Kruskal-Wallis Test

Biostatistics 270 Kruskal-Wallis Test 1. Kruskal-Wallis Test Biostatistics 270 Kruskal-Wallis Test 1 ORIGIN 1 Kruskal-Wallis Test The Kruskal-Wallis is a non-parametric analog to the One-Way ANOVA F-Test of means. It is useful when the k samples appear not to come

More information

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics SEVERAL μs AND MEDIANS: MORE ISSUES Business Statistics CONTENTS Post-hoc analysis ANOVA for 2 groups The equal variances assumption The Kruskal-Wallis test Old exam question Further study POST-HOC ANALYSIS

More information

Linear Regression Analysis for Survey Data. Professor Ron Fricker Naval Postgraduate School Monterey, California

Linear Regression Analysis for Survey Data. Professor Ron Fricker Naval Postgraduate School Monterey, California Linear Regression Analysis for Survey Data Professor Ron Fricker Naval Postgraduate School Monterey, California 1 Goals for this Lecture Linear regression How to think about it for Lickert scale dependent

More information

Wilcoxon Test and Calculating Sample Sizes

Wilcoxon Test and Calculating Sample Sizes Wilcoxon Test and Calculating Sample Sizes Dan Spencer UC Santa Cruz Dan Spencer (UC Santa Cruz) Wilcoxon Test and Calculating Sample Sizes 1 / 33 Differences in the Means of Two Independent Groups When

More information

Kruskal-Wallis and Friedman type tests for. nested effects in hierarchical designs 1

Kruskal-Wallis and Friedman type tests for. nested effects in hierarchical designs 1 Kruskal-Wallis and Friedman type tests for nested effects in hierarchical designs 1 Assaf P. Oron and Peter D. Hoff Department of Statistics, University of Washington, Seattle assaf@u.washington.edu, hoff@stat.washington.edu

More information

Nonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I

Nonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I 1 / 16 Nonparametric tests Timothy Hanson Department of Statistics, University of South Carolina Stat 704: Data Analysis I Nonparametric one and two-sample tests 2 / 16 If data do not come from a normal

More information

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

Everything is not normal

Everything is not normal Everything is not normal According to the dictionary, one thing is considered normal when it s in its natural state or conforms to standards set in advance. And this is its normal meaning. But, like many

More information

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F. Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks

More information

Chapter 6: Functions of Random Variables

Chapter 6: Functions of Random Variables Chapter 6: Functions of Random Variables Professor Ron Fricker Naval Postgraduate School Monterey, California 3/15/15 Reading Assignment: Sections 6.1 6.4 & 6.8 1 Goals for this Chapter Introduce methods

More information

Statistical Procedures for Testing Homogeneity of Water Quality Parameters

Statistical Procedures for Testing Homogeneity of Water Quality Parameters Statistical Procedures for ing Homogeneity of Water Quality Parameters Xu-Feng Niu Professor of Statistics Department of Statistics Florida State University Tallahassee, FL 3306 May-September 004 1. Nonparametric

More information

Inferences About the Difference Between Two Means

Inferences About the Difference Between Two Means 7 Inferences About the Difference Between Two Means Chapter Outline 7.1 New Concepts 7.1.1 Independent Versus Dependent Samples 7.1. Hypotheses 7. Inferences About Two Independent Means 7..1 Independent

More information

22s:152 Applied Linear Regression. Take random samples from each of m populations.

22s:152 Applied Linear Regression. Take random samples from each of m populations. 22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each

More information

One-Sample Numerical Data

One-Sample Numerical Data One-Sample Numerical Data quantiles, boxplot, histogram, bootstrap confidence intervals, goodness-of-fit tests University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html

More information

Types of Statistical Tests DR. MIKE MARRAPODI

Types of Statistical Tests DR. MIKE MARRAPODI Types of Statistical Tests DR. MIKE MARRAPODI Tests t tests ANOVA Correlation Regression Multivariate Techniques Non-parametric t tests One sample t test Independent t test Paired sample t test One sample

More information

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01 An Analysis of College Algebra Exam s December, 000 James D Jones Math - Section 0 An Analysis of College Algebra Exam s Introduction Students often complain about a test being too difficult. Are there

More information

TMA4255 Applied Statistics V2016 (23)

TMA4255 Applied Statistics V2016 (23) TMA4255 Applied Statistics V2016 (23) Part 7: Nonparametric tests Signed-Rank test [16.2] Wilcoxon Rank-sum test [16.3] Anna Marie Holand April 19, 2016, wiki.math.ntnu.no/tma4255/2016v/start 2 Outline

More information

Chapter 18 Resampling and Nonparametric Approaches To Data

Chapter 18 Resampling and Nonparametric Approaches To Data Chapter 18 Resampling and Nonparametric Approaches To Data 18.1 Inferences in children s story summaries (McConaughy, 1980): a. Analysis using Wilcoxon s rank-sum test: Younger Children Older Children

More information

Violating the normal distribution assumption. So what do you do if the data are not normal and you still need to perform a test?

Violating the normal distribution assumption. So what do you do if the data are not normal and you still need to perform a test? Violating the normal distribution assumption So what do you do if the data are not normal and you still need to perform a test? Remember, if your n is reasonably large, don t bother doing anything. Your

More information

Agonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data?

Agonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data? Agonistic Display in Betta splendens: Data Analysis By Joanna Weremjiwicz, Simeon Yurek, and Dana Krempels Once you have collected data with your ethogram, you are ready to analyze that data to see whether

More information

Lecture Slides for INTRODUCTION TO. Machine Learning. ETHEM ALPAYDIN The MIT Press,

Lecture Slides for INTRODUCTION TO. Machine Learning. ETHEM ALPAYDIN The MIT Press, Lecture Slides for INTRODUCTION TO Machine Learning ETHEM ALPAYDIN The MIT Press, 2004 alpaydin@boun.edu.tr http://www.cmpe.boun.edu.tr/~ethem/i2ml CHAPTER 14: Assessing and Comparing Classification Algorithms

More information

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA 22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

Advanced Statistics II: Non Parametric Tests

Advanced Statistics II: Non Parametric Tests Advanced Statistics II: Non Parametric Tests Aurélien Garivier ParisTech February 27, 2011 Outline Fitting a distribution Rank Tests for the comparison of two samples Two unrelated samples: Mann-Whitney

More information

psychological statistics

psychological statistics psychological statistics B Sc. Counselling Psychology 011 Admission onwards III SEMESTER COMPLEMENTARY COURSE UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION CALICUT UNIVERSITY.P.O., MALAPPURAM, KERALA,

More information

Solutions exercises of Chapter 7

Solutions exercises of Chapter 7 Solutions exercises of Chapter 7 Exercise 1 a. These are paired samples: each pair of half plates will have about the same level of corrosion, so the result of polishing by the two brands of polish are

More information

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1) Summary of Chapter 7 (Sections 7.2-7.5) and Chapter 8 (Section 8.1) Chapter 7. Tests of Statistical Hypotheses 7.2. Tests about One Mean (1) Test about One Mean Case 1: σ is known. Assume that X N(µ, σ

More information

Workshop Research Methods and Statistical Analysis

Workshop Research Methods and Statistical Analysis Workshop Research Methods and Statistical Analysis Session 2 Data Analysis Sandra Poeschl 08.04.2013 Page 1 Research process Research Question State of Research / Theoretical Background Design Data Collection

More information

Degrees of freedom df=1. Limitations OR in SPSS LIM: Knowing σ and µ is unlikely in large

Degrees of freedom df=1. Limitations OR in SPSS LIM: Knowing σ and µ is unlikely in large Z Test Comparing a group mean to a hypothesis T test (about 1 mean) T test (about 2 means) Comparing mean to sample mean. Similar means = will have same response to treatment Two unknown means are different

More information

S D / n t n 1 The paediatrician observes 3 =

S D / n t n 1 The paediatrician observes 3 = Non-parametric tests Paired t-test A paediatrician measured the blood cholesterol of her patients and was worried to note that some had levels over 00mg/100ml To investigate whether dietary regulation

More information

Nonparametric hypothesis tests and permutation tests

Nonparametric hypothesis tests and permutation tests Nonparametric hypothesis tests and permutation tests 1.7 & 2.3. Probability Generating Functions 3.8.3. Wilcoxon Signed Rank Test 3.8.2. Mann-Whitney Test Prof. Tesler Math 283 Fall 2018 Prof. Tesler Wilcoxon

More information

Rama Nada. -Ensherah Mokheemer. 1 P a g e

Rama Nada. -Ensherah Mokheemer. 1 P a g e - 9 - Rama Nada -Ensherah Mokheemer - 1 P a g e Quick revision: Remember from the last lecture that chi square is an example of nonparametric test, other examples include Kruskal Wallis, Mann Whitney and

More information

Lecture 10: Non- parametric Comparison of Loca6on. GENOME 560, Spring 2015 Doug Fowler, GS

Lecture 10: Non- parametric Comparison of Loca6on. GENOME 560, Spring 2015 Doug Fowler, GS Lecture 10: Non- parametric Comparison of Loca6on GENOME 560, Spring 2015 Doug Fowler, GS (dfowler@uw.edu) 1 Review What do we mean by nonparametric? What is a desirable localon stalslc for ordinal data?

More information

Data Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data?

Data Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data? Data Analysis: Agonistic Display in Betta splendens By Joanna Weremjiwicz, Simeon Yurek, and Dana Krempels Once you have collected data with your ethogram, you are ready to analyze that data to see whether

More information

Basic Statistical Analysis

Basic Statistical Analysis indexerrt.qxd 8/21/2002 9:47 AM Page 1 Corrected index pages for Sprinthall Basic Statistical Analysis Seventh Edition indexerrt.qxd 8/21/2002 9:47 AM Page 656 Index Abscissa, 24 AB-STAT, vii ADD-OR rule,

More information

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical

More information

16. Nonparametric Methods. Analysis of ordinal data

16. Nonparametric Methods. Analysis of ordinal data 16. Nonparametric Methods 數 Analysis of ordinal data 料 1 Data : Non-interval data : nominal data, ordinal data Interval data but not normally distributed Nonparametric tests : Two dependent samples pair

More information