Non-Parametric Two-Sample Analysis: The Mann-Whitney U Test

Size: px
Start display at page:

Download "Non-Parametric Two-Sample Analysis: The Mann-Whitney U Test"

Transcription

1 Non-Parametric Two-Sample Analysis: The Mann-Whitney U Test

2 When samples do not meet the assumption of normality parametric tests should not be used. To overcome this problem, non-parametric tests can be used. These tests are distribution-free (do not assume normality. They are fairly robust and nearly as powerful as parametric tests. They often use RANKS rather than observed values.

3 Earthquake Depth

4 Chilean earthquakes Kolmogorov-Smirnov(a) Tests of Normality (May) Shapiro-Wilk Statistic df Sig. Statistic df Sig. Mag Tests of Normality (June) Kolmogorov-Smirnov(a) Shapiro-Wilk Statistic df Sig. Statistic df Sig. Mag

5 Magnitude Equal variances assumed Equal variances not assumed Levene's Test for Equality of Variances F Sig. Independent Samples Test t df Sig. (2-tailed) t-test for Equality of Means Me an Difference 95% Confidence Interval of the Std. Error Difference Difference Lower Upper Using a t test gives the result that the magnitude of the earthquakes between May and June were significantly different.

6 Magnitude Mo nth 5 6 Total Ranks N Mean Rank Sum of Ranks Test Statistics b Mann-Whitney U Wilcoxon W Z As ymp. Sig. (2-tailed) Exact Sig. [2*(1-tailed Sig.)] a. Not corrected for ties. Magnitude b. Groupi ng Variable: Month a Using a non-parametric test gives the result that the magnitude of the earthquakes between May and June was not significantly different.

7 When the distribution of the data sets deviate substantially from normal, it is better to use non-parametric (distribution free) tests. There are no assumptions made concerning the sample distributions. Tied ranks are assigned the average rank of the tied observations. The Mann-Whitney U test is approximately 95% as powerful as the t test. If the data are severely non-normal, the Mann-Whitney U test is substantially more powerful than the t test.

8 The Mann-Whitney U test (2-tailed) U = U ' = n n 1 n n n1 ( n1 + 1) + 2 U R 1 where R 1 is the sum of the ranks for group1 Compare the critical U value to either U or U, whichever is larger.

9 The sample space The theoretical sum of all ranks for group 1 U U = ' = n n 1 n n n 1 U ( n1 + 1) 2 R 1 The actual sum of all ranks for group 1 This equation is essentially comparing the theoretical sum of the ranks from group 1 to the actual sum of the ranks for group 1 while taking into account the sample space. If the group samples get smaller the test gets more conservative.

10 Observations are first sorted. Tied ranks are dealt with by assigning the average rank to the tied observations: Obs Value Rank (with Ties) Rank (tied) (tied) (tied (tied) (tied) 7.5 ( ) / 3 = 4 (7 + 8) / 2 = 7.5

11 The U test uses the rank of the pooled observations. For a 2- tailed test, ranks can be from highest to lowest or lowest to highest. Earthquake Magnitudes in Chile Earthquake Location Magnitude Rank Oceanic Earthquake Location Magnitude Rank Continental Oceanic Continental Oceanic Continental Oceanic Continental Oceanic Continental Oceanic Continental Oceanic Continental Oceanic Continental Oceanic Continental Oceanic Continental 5 17 Oceanic Continental Oceanic Continental Oceanic Σ 147

12 H o : There is no significant difference magnitude of oceanic versus continental earthquakes in Chile. H a : There is a significant difference magnitude of oceanic versus continental earthquakes in Chile. α = 0.05 n1 = 12 n2 = 11 df = n1, n2 = 12, 11 Note that we are performing a 2-tailed test, so we will use the larger of the test statistics either U or U.

13 U U 12(12 + 1) = (12)(11) + 2 = U = 63 U ' = (12)(11) 63 U ' = U is larger, so it will be used. df = 12,11 U critical = 99 IMPORTANT: This Mann-Whitney table is 1-tailed. Our α level is For a 2- tailed test using a 1-tailed table, you MUST divide the α level between each of the 2 tails So the α level we look up on the table is 0.025, or ½ of 0.05.

14

15 69 < 99 Since U is less than U Critical, Accept H o. There is no significant difference in the magnitudes of oceanic versus continental earthquakes in Chile (U 69, p > 0.10). SPSS Test Statistics a Magnitude Mann-Whitney U Wilcoxon W Z Asymp. Sig. (2-tailed).853 Exact Sig. [2*(1-tailed Sig.)].880 b a. Grouping Variable: Location Note that SPSS calculates the exact probability b. Not corrected for ties.

16 Mann-Whitney U test (1-tailed) Performing a 1-tailed Mann-Whitney test is somewhat different than other methods. The appropriate test statistic is determined using the following method: This technique simply forces one to declare in which tail the difference will be found in advance since U is to the right of the mean (greater than) and U is to the left of the mean (less than).

17 Using depth of epicenter data for the same Chilean earthquakes, a 1-tailed test is performed with the data ranked from low to high and continental earthquakes as group 1. H o : Continental earthquake depths are not significantly deeper than oceanic earthquakes in Chile. H a : Continental earthquake depths are significantly deeper than oceanic earthquakes in Chile.

18 Using depth of epicenter data for the same Chilean earthquakes, a 1-tailed test is performed with the data ranked from low to high and continental earthquakes as group 1. H o : Continental earthquake depths are not significantly deeper than oceanic earthquakes in Chile. H a : Continental earthquake depths are significantly deeper than oceanic earthquakes in Chile. Therefore the test statistic will be:

19 Earthquake Depths (km) in Chile Oceanic Rank Oceanic Continental Rank Continental Σ 180.5

20 U U 11(11+ 1) = (11)(12) + 2 = U = 17.5 U U ' ' = (12)(11) 17.5 = df = 12,11 = 94 U critical > 94 Since U is greater than U Critical, reject H o. Continental earthquake depths are significantly deeper than oceanic earthquakes in Chile (U 114.5, > p > 0.001).

21

22 Therefore, this table is used for 2 purposes: 1. Declaring which group is 1 and which is Declaring which group is greater (or less then) which. It is important to do this because we could easily restate the direction of H o and H a as: H o : Continental earthquake depths are not significantly deeper than oceanic earthquakes in Chile. H a : Oceanic earthquake depths are significantly shallower than continental earthquakes in Chile.

23 The table of which U value to choose helps keep things straight, regardless of how the null and alternate hypotheses are framed (either Group 1 > Group 2 or Group 2 < Group 1. This is clearly demonstrated on the next slide. No matter how we frame the H o and H a, we will use the appropriate statistic.

24 Continental = Group 1, Ranked Low to High Ho: Continental = Oceanic Ha: Continental < Oceanic Oceanic = Group 1, Ranked Low to High Ho: Continental = Oceanic Ha: Continental > Oceanic U U ' ' 11(11+ 1) = (11)(12) + 2 = U U ' ' 12(12 + 1) = (12)(11) + 2 = U ' = 17.5 U = U = U ' = U = U = 17.5 Continental = Group 1, Ranked High to Low Ho: Continental = Oceanic Ha: Continental < Oceanic Oceanic = Group 1, Ranked High to Low Ho: Continental = Oceanic Ha: Continental > Oceanic U U ' ' 11(11+ 1) = (11)(12) + 2 = U U ' ' 12(12 + 1) = (12)(11) + 2 = U ' = U = U = 17.5 U ' = 17.5 U = U = 114.5

25 SPSS uses a different technique that reports the smaller calculated value, regardless of how you arrange the groups. Note that the sum of the ranks does change.

26 Paired Sample t Test

27 Paired Sample t Test This t test is used ONLY when the data are repeat measurements (e.g. measurement at time 1 and time 2 ) or when samples are paired in some manner. The equation is: t = s d d n where d is the mean difference between paired observations, and is the standard deviation of the paired differences. s d Let s test the null hypothesis that unemployment rates in 2007 were lower than in 2008 for selected cities.

28 Determining which data column to subtract from which depends on your hypothesis: Test only difference: does not matter. Testing pair 1 > pair 2: subtract pair 1 from pair 2. Testing pair 1 < pair 2: subtract pair 2 from pair 1. So in this example subtract the 2008 (pair 2) from the 2007 (pair 1) unemployment rate.

29 Unemployment Rate for Selected Cities 2007 (pair 1) 2008 (pair2) d Los Angeles San Francisco Washington DC Bethesda Fort Lauderdale Miami Chicago Boston Detroit Long Island Newark Camden Philadelphia Wilmington Dallas-Fort Worth Seattle Tacoma t = t critical(1) = = = t= > 1.746, reject H o. Remember, the sign just tells us direction, not magnitude. Therefore: Unemployment in 2007 was significantly lower than in 2008 for selected cities (t , p < ). n=17 d v=17-1=16 S d 0.583

30 The paired t test does not have the assumption of normality of the groups or of equality of variances. This is because we are using the paired differences rather than the actual observations. The only assumption is that the paired differences are normally distributed. This test is considered to be fairly robust.

31 Non-Parametric Paired Sample Test Wilcoxon T

32 The Wilcoxon paired-sample test is used when the paired differences are non-normal. The paired t test is fairly robust for slightly non-normal paired differences are not typically a problem If the differences are very non-normal, especially if there is activity in the tails, this test is more appropriate. As with the Mann-Whitney U test, two values are calculated: T + : the sum of the positive ranked differences. T - : the sum of the negative ranked differences.

33 The same subtraction rules for the paired t test apply here. 1. First determined the paired differences. 2. Rank the differences from lowest to highest, ignoring the sign. 3. Apply the signs of the differences to the ranks (called the signed-ranks.).

34 For a 1-tailed test If Ha: Pair1 > Pair 2, reject Ho if T- < the critical value. If Ha: Pair1 < Pair 2, reject Ho if T+ < the critical value. For a 2-tailed test Use the smaller value.

35 H o : Monthly precipitation in 2013 was not greater than in H a : Monthly precipitation in 2013 was greater than in Since our Ha is 2012 (Pair 1) < 2013 (Pair 2) we use T+. Total Precipitation for Shippensburg d Rank d Ranks d Signed January February March April May June July August September October November December The signs are simply used to create 2 groups whose values are summed. n = 12 T + = = 33.5 T - = = 44.5 T critical = 17 Assign the ranks the signs.

36 Calculated value is about here. The Wilcoxon table is one of the few where larger statistics result in accepting the null hypothesis. Make sure you note this.

37 Since we are testing for a positive difference between 2012 and 2013 use the T + statistic. 33 > 13 accept the null hypothesis. Monthly precipitation in 2013 was not greater than in 2012 (Wilcoxon Matched-Pairs T 33, p > 0.25).

38 The treatment should lead to an increase in the measurement. Did the treatment work? Sample Before Treatment After Treatment If Ha: Pair1 > Pair 2, reject Ho if T- < the critical value. If Ha: Pair1 < Pair 2, reject Ho if T+ < the critical value.

Solutions exercises of Chapter 7

Solutions exercises of Chapter 7 Solutions exercises of Chapter 7 Exercise 1 a. These are paired samples: each pair of half plates will have about the same level of corrosion, so the result of polishing by the two brands of polish are

More information

Non-parametric tests, part A:

Non-parametric tests, part A: Two types of statistical test: Non-parametric tests, part A: Parametric tests: Based on assumption that the data have certain characteristics or "parameters": Results are only valid if (a) the data are

More information

The independent-means t-test:

The independent-means t-test: The independent-means t-test: Answers the question: is there a "real" difference between the two conditions in my experiment? Or is the difference due to chance? Previous lecture: (a) Dependent-means t-test:

More information

Distribution-Free Procedures (Devore Chapter Fifteen)

Distribution-Free Procedures (Devore Chapter Fifteen) Distribution-Free Procedures (Devore Chapter Fifteen) MATH-5-01: Probability and Statistics II Spring 018 Contents 1 Nonparametric Hypothesis Tests 1 1.1 The Wilcoxon Rank Sum Test........... 1 1. Normal

More information

Regression Analysis II

Regression Analysis II Regression Analysis II Measures of Goodness of fit Two measures of Goodness of fit Measure of the absolute fit of the sample points to the sample regression line Standard error of the estimate An index

More information

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01 An Analysis of College Algebra Exam s December, 000 James D Jones Math - Section 0 An Analysis of College Algebra Exam s Introduction Students often complain about a test being too difficult. Are there

More information

Chapter 7 Comparison of two independent samples

Chapter 7 Comparison of two independent samples Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N

More information

Testing for Normality

Testing for Normality Testing for Normality For each mean and standard deviation combination a theoretical normal distribution can be determined. This distribution is based on the proportions shown below. This theoretical normal

More information

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics SEVERAL μs AND MEDIANS: MORE ISSUES Business Statistics CONTENTS Post-hoc analysis ANOVA for 2 groups The equal variances assumption The Kruskal-Wallis test Old exam question Further study POST-HOC ANALYSIS

More information

Degrees of freedom df=1. Limitations OR in SPSS LIM: Knowing σ and µ is unlikely in large

Degrees of freedom df=1. Limitations OR in SPSS LIM: Knowing σ and µ is unlikely in large Z Test Comparing a group mean to a hypothesis T test (about 1 mean) T test (about 2 means) Comparing mean to sample mean. Similar means = will have same response to treatment Two unknown means are different

More information

Inferences About the Difference Between Two Means

Inferences About the Difference Between Two Means 7 Inferences About the Difference Between Two Means Chapter Outline 7.1 New Concepts 7.1.1 Independent Versus Dependent Samples 7.1. Hypotheses 7. Inferences About Two Independent Means 7..1 Independent

More information

Nonparametric tests. Mark Muldoon School of Mathematics, University of Manchester. Mark Muldoon, November 8, 2005 Nonparametric tests - p.

Nonparametric tests. Mark Muldoon School of Mathematics, University of Manchester. Mark Muldoon, November 8, 2005 Nonparametric tests - p. Nonparametric s Mark Muldoon School of Mathematics, University of Manchester Mark Muldoon, November 8, 2005 Nonparametric s - p. 1/31 Overview The sign, motivation The Mann-Whitney Larger Larger, in pictures

More information

Discrete distribution. Fitting probability models to frequency data. Hypotheses for! 2 test. ! 2 Goodness-of-fit test

Discrete distribution. Fitting probability models to frequency data. Hypotheses for! 2 test. ! 2 Goodness-of-fit test Discrete distribution Fitting probability models to frequency data A probability distribution describing a discrete numerical random variable For example,! Number of heads from 10 flips of a coin! Number

More information

Testing for Normality

Testing for Normality Testing for Normality For each mean and standard deviation combination a theoretical normal distribution can be determined. This distribution is based on the proportions shown below. This theoretical normal

More information

Levene's Test of Equality of Error Variances a

Levene's Test of Equality of Error Variances a BUTTERFAT DATA: INTERACTION MODEL Levene's Test of Equality of Error Variances a Dependent Variable: Butterfat (%) F df1 df2 Sig. 2.711 9 90.008 Tests the null hypothesis that the error variance of the

More information

Lab Activity: Climate Variables

Lab Activity: Climate Variables Name: Date: Period: Water and Climate The Physical Setting: Earth Science Lab Activity: Climate Variables INTRODUCTION:! The state of the atmosphere continually changes over time in response to the uneven

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Nonparametric statistic methods Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Measurement What are the 4 levels of measurement discussed? 1. Nominal or Classificatory Scale Gender,

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only

More information

S D / n t n 1 The paediatrician observes 3 =

S D / n t n 1 The paediatrician observes 3 = Non-parametric tests Paired t-test A paediatrician measured the blood cholesterol of her patients and was worried to note that some had levels over 00mg/100ml To investigate whether dietary regulation

More information

Agonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data?

Agonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data? Agonistic Display in Betta splendens: Data Analysis By Joanna Weremjiwicz, Simeon Yurek, and Dana Krempels Once you have collected data with your ethogram, you are ready to analyze that data to see whether

More information

Comparison of Two Population Means

Comparison of Two Population Means Comparison of Two Population Means Esra Akdeniz March 15, 2015 Independent versus Dependent (paired) Samples We have independent samples if we perform an experiment in two unrelated populations. We have

More information

Nonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I

Nonparametric tests. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 704: Data Analysis I 1 / 16 Nonparametric tests Timothy Hanson Department of Statistics, University of South Carolina Stat 704: Data Analysis I Nonparametric one and two-sample tests 2 / 16 If data do not come from a normal

More information

Introduction to hypothesis testing

Introduction to hypothesis testing Introduction to hypothesis testing Review: Logic of Hypothesis Tests Usually, we test (attempt to falsify) a null hypothesis (H 0 ): includes all possibilities except prediction in hypothesis (H A ) If

More information

MANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA:

MANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA: MULTIVARIATE ANALYSIS OF VARIANCE MANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA: 1. Cell sizes : o

More information

Statistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data

Statistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data Statistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data 999 Prentice-Hall, Inc. Chap. 9 - Chapter Topics Comparing Two Independent Samples: Z Test for the Difference

More information

Rank-Based Methods. Lukas Meier

Rank-Based Methods. Lukas Meier Rank-Based Methods Lukas Meier 20.01.2014 Introduction Up to now we basically always used a parametric family, like the normal distribution N (µ, σ 2 ) for modeling random data. Based on observed data

More information

Introduction to Statistical Data Analysis III

Introduction to Statistical Data Analysis III Introduction to Statistical Data Analysis III JULY 2011 Afsaneh Yazdani Preface Major branches of Statistics: - Descriptive Statistics - Inferential Statistics Preface What is Inferential Statistics? The

More information

13.7 ANOTHER TEST FOR TREND: KENDALL S TAU

13.7 ANOTHER TEST FOR TREND: KENDALL S TAU 13.7 ANOTHER TEST FOR TREND: KENDALL S TAU In 1969 the U.S. government instituted a draft lottery for choosing young men to be drafted into the military. Numbers from 1 to 366 were randomly assigned to

More information

Difference between means - t-test /25

Difference between means - t-test /25 Difference between means - t-test 1 Discussion Question p492 Ex 9-4 p492 1-3, 6-8, 12 Assume all variances are not equal. Ignore the test for variance. 2 Students will perform hypothesis tests for two

More information

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC CHI SQUARE ANALYSIS I N T R O D U C T I O N T O N O N - P A R A M E T R I C A N A L Y S E S HYPOTHESIS TESTS SO FAR We ve discussed One-sample t-test Dependent Sample t-tests Independent Samples t-tests

More information

4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures

4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures Non-parametric Test Stephen Opiyo Overview Distinguish Parametric and Nonparametric Test Procedures Explain commonly used Nonparametric Test Procedures Perform Hypothesis Tests Using Nonparametric Procedures

More information

Hotel Industry Overview. UPDATE: Trends and outlook for Northern California. Vail R. Brown

Hotel Industry Overview. UPDATE: Trends and outlook for Northern California. Vail R. Brown Hotel Industry Overview UPDATE: Trends and outlook for Northern California Vail R. Brown Senior Vice President, Global Business Development & Marketing vbrown@str.com @vail_str 2016 STR, Inc. All Rights

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 004 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER II STATISTICAL METHODS The Society provides these solutions to assist candidates preparing for the examinations in future

More information

Colorado State University, Fort Collins, CO Weather Station Monthly Summary Report

Colorado State University, Fort Collins, CO Weather Station Monthly Summary Report Colorado State University, Fort Collins, CO Weather Station Monthly Summary Report Month: December Year: 2017 Temperature: Mean T max was 47.2 F which is 4.4 above the 1981-2010 normal for the month. This

More information

Data Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data?

Data Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data? Data Analysis: Agonistic Display in Betta splendens By Joanna Weremjiwicz, Simeon Yurek, and Dana Krempels Once you have collected data with your ethogram, you are ready to analyze that data to see whether

More information

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests PSY 307 Statistics for the Behavioral Sciences Chapter 20 Tests for Ranked Data, Choosing Statistical Tests What To Do with Non-normal Distributions Tranformations (pg 382): The shape of the distribution

More information

Textbook Examples of. SPSS Procedure

Textbook Examples of. SPSS Procedure Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of

More information

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true Hypothesis esting Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true Statistical Hypothesis: conjecture about a population parameter

More information

WHEN IS IT EVER GOING TO RAIN? Table of Average Annual Rainfall and Rainfall For Selected Arizona Cities

WHEN IS IT EVER GOING TO RAIN? Table of Average Annual Rainfall and Rainfall For Selected Arizona Cities WHEN IS IT EVER GOING TO RAIN? Table of Average Annual Rainfall and 2001-2002 Rainfall For Selected Arizona Cities Phoenix Tucson Flagstaff Avg. 2001-2002 Avg. 2001-2002 Avg. 2001-2002 October 0.7 0.0

More information

This is particularly true if you see long tails in your data. What are you testing? That the two distributions are the same!

This is particularly true if you see long tails in your data. What are you testing? That the two distributions are the same! Two sample tests (part II): What to do if your data are not distributed normally: Option 1: if your sample size is large enough, don't worry - go ahead and use a t-test (the CLT will take care of non-normal

More information

Graphing Sea Ice Extent in the Arctic and Antarctic

Graphing Sea Ice Extent in the Arctic and Antarctic Graphing Sea Ice Extent in the Arctic and Antarctic 1. Large amounts of ice form in some seasons in the oceans near the North Pole and the South Pole (the Arctic Ocean and the Southern Ocean). This ice,

More information

Multiple Comparisons

Multiple Comparisons Multiple Comparisons Error Rates, A Priori Tests, and Post-Hoc Tests Multiple Comparisons: A Rationale Multiple comparison tests function to tease apart differences between the groups within our IV when

More information

Relating Graph to Matlab

Relating Graph to Matlab There are two related course documents on the web Probability and Statistics Review -should be read by people without statistics background and it is helpful as a review for those with prior statistics

More information

Chapter 4: Displaying and Summarizing Quantitative Data

Chapter 4: Displaying and Summarizing Quantitative Data Chapter 4: Displaying and Summarizing Quantitative Data This chapter discusses methods of displaying quantitative data. The objective is describe the distribution of the data. The figure below shows three

More information

Making a Climograph: GLOBE Data Explorations

Making a Climograph: GLOBE Data Explorations Making a Climograph: A GLOBE Data Exploration Purpose Students learn how to construct and interpret climographs and understand how climate differs from weather. Overview Students calculate and graph maximum

More information

Exam details. Final Review Session. Things to Review

Exam details. Final Review Session. Things to Review Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit

More information

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Chap The McGraw-Hill Companies, Inc. All rights reserved. 11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview

More information

Public Library Use and Economic Hard Times: Analysis of Recent Data

Public Library Use and Economic Hard Times: Analysis of Recent Data Public Library Use and Economic Hard Times: Analysis of Recent Data A Report Prepared for The American Library Association by The Library Research Center University of Illinois at Urbana Champaign April

More information

Basics on t-tests Independent Sample t-tests Single-Sample t-tests Summary of t-tests Multiple Tests, Effect Size Proportions. Statistiek I.

Basics on t-tests Independent Sample t-tests Single-Sample t-tests Summary of t-tests Multiple Tests, Effect Size Proportions. Statistiek I. Statistiek I t-tests John Nerbonne CLCG, Rijksuniversiteit Groningen http://www.let.rug.nl/nerbonne/teach/statistiek-i/ John Nerbonne 1/46 Overview 1 Basics on t-tests 2 Independent Sample t-tests 3 Single-Sample

More information

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit

More information

Name: JMJ April 10, 2017 Trigonometry A2 Trimester 2 Exam 8:40 AM 10:10 AM Mr. Casalinuovo

Name: JMJ April 10, 2017 Trigonometry A2 Trimester 2 Exam 8:40 AM 10:10 AM Mr. Casalinuovo Name: JMJ April 10, 2017 Trigonometry A2 Trimester 2 Exam 8:40 AM 10:10 AM Mr. Casalinuovo Part 1: You MUST answer this problem. It is worth 20 points. 1) Temperature vs. Cricket Chirps: Crickets make

More information

Scaling in Biology. How do properties of living systems change as their size is varied?

Scaling in Biology. How do properties of living systems change as their size is varied? Scaling in Biology How do properties of living systems change as their size is varied? Example: How does basal metabolic rate (heat radiation) vary as a function of an animal s body mass? Mouse Hamster

More information

ANOVA - analysis of variance - used to compare the means of several populations.

ANOVA - analysis of variance - used to compare the means of several populations. 12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.

More information

STAT 135 Lab 8 Hypothesis Testing Review, Mann-Whitney Test by Normal Approximation, and Wilcoxon Signed Rank Test.

STAT 135 Lab 8 Hypothesis Testing Review, Mann-Whitney Test by Normal Approximation, and Wilcoxon Signed Rank Test. STAT 135 Lab 8 Hypothesis Testing Review, Mann-Whitney Test by Normal Approximation, and Wilcoxon Signed Rank Test. Rebecca Barter March 30, 2015 Mann-Whitney Test Mann-Whitney Test Recall that the Mann-Whitney

More information

How Do Scientists Find the Epicenter of an Earthquake?

How Do Scientists Find the Epicenter of an Earthquake? 3.4 Explore How Do Scientists Find the Epicenter of an Earthquake? Seismograph data says that the earthquake is 100 km (62 mi) away, but at which point on the circle is the earthquake located? EE 116 3.4

More information

Do not copy, post, or distribute. Independent-Samples t Test and Mann- C h a p t e r 13

Do not copy, post, or distribute. Independent-Samples t Test and Mann- C h a p t e r 13 C h a p t e r 13 Independent-Samples t Test and Mann- Whitney U Test 13.1 Introduction and Objectives This chapter continues the theme of hypothesis testing as an inferential statistical procedure. In

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

Non-parametric methods

Non-parametric methods Eastern Mediterranean University Faculty of Medicine Biostatistics course Non-parametric methods March 4&7, 2016 Instructor: Dr. Nimet İlke Akçay (ilke.cetin@emu.edu.tr) Learning Objectives 1. Distinguish

More information

Transferability of Household Travel Data Across Geographic Areas Using NHTS 2001

Transferability of Household Travel Data Across Geographic Areas Using NHTS 2001 Transferability of Household Travel Data Across Geographic Areas Using NHTS 2001 Jane Lin PhD Assistant Professor Department of Civil and Materials Engineering Institute for Environmental Science and Policy

More information

Frequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<=

Frequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<= A frequency distribution is a kind of probability distribution. It gives the frequency or relative frequency at which given values have been observed among the data collected. For example, for age, Frequency

More information

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown Nonparametric Statistics Leah Wright, Tyler Ross, Taylor Brown Before we get to nonparametric statistics, what are parametric statistics? These statistics estimate and test population means, while holding

More information

Statistics: revision

Statistics: revision NST 1B Experimental Psychology Statistics practical 5 Statistics: revision Rudolf Cardinal & Mike Aitken 29 / 30 April 2004 Department of Experimental Psychology University of Cambridge Handouts: Answers

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions Part 1: Probability Distributions Purposes of Data Analysis True Distributions or Relationships in the Earths System Probability Distribution Normal Distribution Student-t Distribution Chi Square Distribution

More information

Wilcoxon Test and Calculating Sample Sizes

Wilcoxon Test and Calculating Sample Sizes Wilcoxon Test and Calculating Sample Sizes Dan Spencer UC Santa Cruz Dan Spencer (UC Santa Cruz) Wilcoxon Test and Calculating Sample Sizes 1 / 33 Differences in the Means of Two Independent Groups When

More information

arxiv: v1 [q-bio.pe] 19 Dec 2012

arxiv: v1 [q-bio.pe] 19 Dec 2012 Week 49 Influenza Forecast for the 2012-2013 U.S. Season JEFFREY SHAMAN Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, New York arxiv:1212.4678v1

More information

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n = Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,

More information

Tentative solutions TMA4255 Applied Statistics 16 May, 2015

Tentative solutions TMA4255 Applied Statistics 16 May, 2015 Norwegian University of Science and Technology Department of Mathematical Sciences Page of 9 Tentative solutions TMA455 Applied Statistics 6 May, 05 Problem Manufacturer of fertilizers a) Are these independent

More information

Statistics Handbook. All statistical tables were computed by the author.

Statistics Handbook. All statistical tables were computed by the author. Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance

More information

Nonparametric Statistics

Nonparametric Statistics Nonparametric Statistics Nonparametric or Distribution-free statistics: used when data are ordinal (i.e., rankings) used when ratio/interval data are not normally distributed (data are converted to ranks)

More information

Stat 427/527: Advanced Data Analysis I

Stat 427/527: Advanced Data Analysis I Stat 427/527: Advanced Data Analysis I Review of Chapters 1-4 Sep, 2017 1 / 18 Concepts you need to know/interpret Numerical summaries: measures of center (mean, median, mode) measures of spread (sample

More information

ST4241 Design and Analysis of Clinical Trials Lecture 7: N. Lecture 7: Non-parametric tests for PDG data

ST4241 Design and Analysis of Clinical Trials Lecture 7: N. Lecture 7: Non-parametric tests for PDG data ST4241 Design and Analysis of Clinical Trials Lecture 7: Non-parametric tests for PDG data Department of Statistics & Applied Probability 8:00-10:00 am, Friday, September 2, 2016 Outline Non-parametric

More information

Violating the normal distribution assumption. So what do you do if the data are not normal and you still need to perform a test?

Violating the normal distribution assumption. So what do you do if the data are not normal and you still need to perform a test? Violating the normal distribution assumption So what do you do if the data are not normal and you still need to perform a test? Remember, if your n is reasonably large, don t bother doing anything. Your

More information

Lecture 7: Hypothesis Testing and ANOVA

Lecture 7: Hypothesis Testing and ANOVA Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis

More information

The Fibonacci Sequence

The Fibonacci Sequence The Fibonacci Sequence MATH 100 Survey of Mathematical Ideas J. Robert Buchanan Department of Mathematics Summer 2018 The Fibonacci Sequence In 1202 Leonardo of Pisa (a.k.a Fibonacci) wrote a problem in

More information

Research Update: Race and Male Joblessness in Milwaukee: 2008

Research Update: Race and Male Joblessness in Milwaukee: 2008 Research Update: Race and Male Joblessness in Milwaukee: 2008 by: Marc V. Levine University of Wisconsin Milwaukee Center for Economic Development Briefing Paper September 2009 Overview Over the past decade,

More information

Name Period Date. Analyzing Climographs

Name Period Date. Analyzing Climographs Name Period Date Analyzing Climographs Climographs: It is often helpful to plot two different types of data on the same graph. For example, a climograph is a single graph that charts both the average temperature

More information

STATISTICS 4, S4 (4769) A2

STATISTICS 4, S4 (4769) A2 (4769) A2 Objectives To provide students with the opportunity to explore ideas in more advanced statistics to a greater depth. Assessment Examination (72 marks) 1 hour 30 minutes There are four options

More information

13: Additional ANOVA Topics. Post hoc Comparisons

13: Additional ANOVA Topics. Post hoc Comparisons 13: Additional ANOVA Topics Post hoc Comparisons ANOVA Assumptions Assessing Group Variances When Distributional Assumptions are Severely Violated Post hoc Comparisons In the prior chapter we used ANOVA

More information

Module 9: Nonparametric Statistics Statistics (OA3102)

Module 9: Nonparametric Statistics Statistics (OA3102) Module 9: Nonparametric Statistics Statistics (OA3102) Professor Ron Fricker Naval Postgraduate School Monterey, California Reading assignment: WM&S chapter 15.1-15.6 Revision: 3-12 1 Goals for this Lecture

More information

Lecture 8 CORRELATION AND LINEAR REGRESSION

Lecture 8 CORRELATION AND LINEAR REGRESSION Announcements CBA5 open in exam mode - deadline midnight Friday! Question 2 on this week s exercises is a prize question. The first good attempt handed in to me by 12 midday this Friday will merit a prize...

More information

Analysis of variance (ANOVA) Comparing the means of more than two groups

Analysis of variance (ANOVA) Comparing the means of more than two groups Analysis of variance (ANOVA) Comparing the means of more than two groups Example: Cost of mating in male fruit flies Drosophila Treatments: place males with and without unmated (virgin) females Five treatments

More information

Creating a Travel Brochure

Creating a Travel Brochure DISCOVERING THE WORLD! Creating a Travel Brochure Objective: Create a travel brochure to a well-known city including weather data and places to visit! Resources provided: www.weather.com, internet Your

More information

Final Exam. DO NOT SEPARATE the answer sheet from the rest of the test. Work and CIRCLE the answer to each problem INSIDE the test.

Final Exam. DO NOT SEPARATE the answer sheet from the rest of the test. Work and CIRCLE the answer to each problem INSIDE the test. Final Exam Math 114 - Statistics for Business Fall 2009 Name: Section: (circle one) 1 2 3 4 5 6 INSTRUCTIONS: This exam contains 24 problems. The first 16 are multiple-choice, and are 3 points each. Record

More information

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and

More information

Lab Activity: Weather Variables

Lab Activity: Weather Variables Name: Date: Period: Weather The Physical Setting: Earth Science Lab Activity: Weather Variables INTRODUCTION: A meteorologist is an individual with specialized education who uses scientific principles

More information

Non-parametric (Distribution-free) approaches p188 CN

Non-parametric (Distribution-free) approaches p188 CN Week 1: Introduction to some nonparametric and computer intensive (re-sampling) approaches: the sign test, Wilcoxon tests and multi-sample extensions, Spearman s rank correlation; the Bootstrap. (ch14

More information

Analysis of 2x2 Cross-Over Designs using T-Tests

Analysis of 2x2 Cross-Over Designs using T-Tests Chapter 234 Analysis of 2x2 Cross-Over Designs using T-Tests Introduction This procedure analyzes data from a two-treatment, two-period (2x2) cross-over design. The response is assumed to be a continuous

More information

Nonparametric Location Tests: k-sample

Nonparametric Location Tests: k-sample Nonparametric Location Tests: k-sample Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 04-Jan-2017 Nathaniel E. Helwig (U of Minnesota)

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 7 Inferences Based on Two Samples: Confidence Intervals & Tests of Hypotheses Content 1. Identifying the Target Parameter 2. Comparing Two Population Means:

More information

Fair Game Review. Chapter. Order the integers from least to greatest. 1. 9, 8, 0, 3, , 4, 1, 2, , 6, 8, 5, 9 4.

Fair Game Review. Chapter. Order the integers from least to greatest. 1. 9, 8, 0, 3, , 4, 1, 2, , 6, 8, 5, 9 4. Name Date Chapter 1 Fair Game Review Order the integers from least to greatest. 1. 9, 8, 0, 3, 7.,, 1,, 1 3. 11, 6, 8, 5, 9.,, 5, 0, 7 Use the graph to write an ordered pair corresponding to the point.

More information

TMA4255 Applied Statistics V2016 (23)

TMA4255 Applied Statistics V2016 (23) TMA4255 Applied Statistics V2016 (23) Part 7: Nonparametric tests Signed-Rank test [16.2] Wilcoxon Rank-sum test [16.3] Anna Marie Holand April 19, 2016, wiki.math.ntnu.no/tma4255/2016v/start 2 Outline

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 65 http://www.stat.tamu.edu/~suhasini/teaching.html Suhasini Subba Rao Review In the previous lecture we considered the following tests: The independent

More information

What Is ANOVA? Comparing Groups. One-way ANOVA. One way ANOVA (the F ratio test)

What Is ANOVA? Comparing Groups. One-way ANOVA. One way ANOVA (the F ratio test) What Is ANOVA? One-way ANOVA ANOVA ANalysis Of VAriance ANOVA compares the means of several groups. The groups are sometimes called "treatments" First textbook presentation in 95. Group Group σ µ µ σ µ

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

3. Nonparametric methods

3. Nonparametric methods 3. Nonparametric methods If the probability distributions of the statistical variables are unknown or are not as required (e.g. normality assumption violated), then we may still apply nonparametric tests

More information

Lecture 06. DSUR CH 05 Exploring Assumptions of parametric statistics Hypothesis Testing Power

Lecture 06. DSUR CH 05 Exploring Assumptions of parametric statistics Hypothesis Testing Power Lecture 06 DSUR CH 05 Exploring Assumptions of parametric statistics Hypothesis Testing Power Introduction Assumptions When broken then we are not able to make inference or accurate descriptions about

More information

One-Sample and Two-Sample Means Tests

One-Sample and Two-Sample Means Tests One-Sample and Two-Sample Means Tests 1 Sample t Test The 1 sample t test allows us to determine whether the mean of a sample data set is different than a known value. Used when the population variance

More information

Statistics for EES and MEME 5. Rank-sum tests

Statistics for EES and MEME 5. Rank-sum tests Statistics for EES and MEME 5. Rank-sum tests Dirk Metzler June 4, 2018 Wilcoxon s rank sum test is also called Mann-Whitney U test References Contents [1] Wilcoxon, F. (1945). Individual comparisons by

More information