Statistical Inference Theory Lesson 46 Non-parametric Statistics

Similar documents
Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Nonparametric Statistics

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

Lecture 7: Hypothesis Testing and ANOVA

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

Chapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Biostatistics 270 Kruskal-Wallis Test 1. Kruskal-Wallis Test

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Non-parametric tests, part A:

Non-parametric (Distribution-free) approaches p188 CN

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)

HYPOTHESIS TESTING. Hypothesis Testing

STAT Section 5.8: Block Designs

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

Contents. Acknowledgments. xix

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health

4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown

Frequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<=

Classroom Activity 7 Math 113 Name : 10 pts Intro to Applied Stats

Data Mining. CS57300 Purdue University. March 22, 2018

Non-parametric Tests

Inference About Two Means: Independent Samples

Non-parametric methods

Chapter 10: Inferences based on two samples

Sign test. Josemari Sarasola - Gizapedia. Statistics for Business. Josemari Sarasola - Gizapedia Sign test 1 / 13

Data analysis and Geostatistics - lecture VII

Contents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47

Formulas and Tables by Mario F. Triola

Chapter 18 Resampling and Nonparametric Approaches To Data

ANOVA - analysis of variance - used to compare the means of several populations.

STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis

Unit 14: Nonparametric Statistical Methods

Transition Passage to Descriptive Statistics 28

QUANTITATIVE TECHNIQUES

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

Comparison of Two Samples

Glossary for the Triola Statistics Series

Statistics Handbook. All statistical tables were computed by the author.

3 Joint Distributions 71

Types of Statistical Tests DR. MIKE MARRAPODI

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Inferential Statistics

The Student s t Distribution

Rank-Based Methods. Lukas Meier

Analysis of variance

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

Data Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data?

Agonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data?

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

Solutions exercises of Chapter 7

NON-PARAMETRIC STATISTICS * (

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics

What Are Nonparametric Statistics and When Do You Use Them? Jennifer Catrambone

Formulas and Tables. for Elementary Statistics, Tenth Edition, by Mario F. Triola Copyright 2006 Pearson Education, Inc. ˆp E p ˆp E Proportion

1.0 Hypothesis Testing

You may not use your books/notes on this exam. You may use calculator.

Swarthmore Honors Exam 2012: Statistics

Everything is not normal

= 1 i. normal approximation to χ 2 df > df

GROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

S D / n t n 1 The paediatrician observes 3 =

Math 2000 Practice Final Exam: Homework problems to review. Problem numbers

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Module 9: Nonparametric Statistics Statistics (OA3102)

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

Nonparametric tests. Mark Muldoon School of Mathematics, University of Manchester. Mark Muldoon, November 8, 2005 Nonparametric tests - p.

3. Nonparametric methods

Rama Nada. -Ensherah Mokheemer. 1 P a g e

3. Tests in the Bernoulli Model

Statistical Significance of Ranking Paradoxes

a. When a data set is not normally distributed, what should you try in order to appropriately make statistical tests on that data?

Non-parametric Hypothesis Testing

Statistics: revision

Exam details. Final Review Session. Things to Review

Hypothesis Testing. ECE 3530 Spring Antonio Paiva

STATISTIKA INDUSTRI 2 TIN 4004

N Utilization of Nursing Research in Advanced Practice, Summer 2008

Dr. Maddah ENMG 617 EM Statistics 10/12/12. Nonparametric Statistics (Chapter 16, Hines)

Background to Statistics

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc.

CDA Chapter 3 part II

Problem of the Day. 7.3 Hypothesis Testing for Mean (Small Samples n<30) Objective(s): Find critical values in a t-distribution

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Statistics and Data Analysis in Geology

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests

(b). What is an expression for the exact value of P(X = 4)? 2. (a). Suppose that the moment generating function for X is M (t) = 2et +1 3

MATH4427 Notebook 5 Fall Semester 2017/2018

1 ONE SAMPLE TEST FOR MEDIAN: THE SIGN TEST

ST4241 Design and Analysis of Clinical Trials Lecture 7: N. Lecture 7: Non-parametric tests for PDG data

Discrete Probability distribution Discrete Probability distribution

Inferences About the Difference Between Two Means

Transcription:

46.1-The Sign Test Statistical Inference Theory Lesson 46 Non-parametric Statistics 46.1 - Problem 1: (a). Let p equal the proportion of supermarkets that charge less than $2.15 a pound. H o : p 0.50 H a : p > 0.50 H a is the claim of the Agency. (b). Step 1: Super Market 1 2 3 4 5 6 7 8 9 10 Price $ per Pound 2.05 2.19 1.98 2.0 1.79 2.77 2.05 2.01 2.18 2.79 +/- - + - - - + - - + + Step 2: Since we have no zeros, we assume N = 10. Step 3: The number of + signs is N + = 4 and the number of minus signs is N - = 6. Therefore, there are 6 supermarkets that charge less than $2.15 a pound. Step 4: Rule 2 assumes that the distribution of signs is binomial. Step5 : Assume H o is true. The following is the cumulative binomial distribution for N = 10 and p = 0.50. k 0 1 2 3 4 5 6 7 8 9 10 P{N + k} 1 0.99 0.99 0.94 0.83 0.62 0.38 0.17 0.05 0.01 0 Step 7: Since we have a one-sided test, we use = 0.01. 615

Using the above cumulation table, P{K 6} = 0.38 > 0.01, and therefore, we reject H a. We have no statistical basis for accepting the Agency s claim. 46.2 - The Mann-Whitney U Test 46.2 - Problem 1: (a). H o : There is no difference in the team's total scores between playing at home or away. H a : The team has greater scores when playing away from home. (b). We will apply the four rules above. Step 1: Using rule 1 we have 56 57 60 61 65 66 67 68 72 74 75 77 88 89 91 96 98 99 100 101 Step 2: Using rule 2 we have 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 56 57 60 61 65 66 67 68 72 74 75 77 88 89 91 96 98 99 100 20 101 Step 3:Using rule 3, we have Final scores of games played at home and rank Final scores of games played away from home and rank Grade Rank Grade Rank 67 7 65 5 89 14 91 15 72 9 74 10 66 6 100 19 101 20 56 1 616

98 17 68 8 77 12 57 2 99 18 60 3 96 16 61 4 88 13 75 11 Sum = R 1 = 132 Sum = R 2 = 78 Step 4: Since N 1 = 10, N 2 = 10, R 1 = 132, (c). Step 1: = = 50 = 23. Step 2: 175 Step 3: Since U is approximately normal, we use which is normal with mean 0 and variance 1. Step 4: - 2.04 Step 5: Since we have a one-sided test, Therefore, the corresponding z value is z = -1.64. Step 6. Since -2.04 < -1.64, we reject H o and conclude there is a significant difference between the team playing at home or away. 46.3 - The Kruskal-Wallis H Test 46.3 - Problem 1: (a). H o : There is no significant weight loss due to different drugs. 617

H a : There is a significant weight loss due to different drugs. (b). Step 1: From the above table, combine and rank the data: 17.6 19.1 19.7 20.8 21.2 21.3 21.5 21.7 22.1 22.1 29 30.5 30.9 1 2 3 4 5 6 7 8 9 10 11 12 13 32.06 33.43 14 15 Step 2: Apply this ranking for each brand: Drug A Rank Drug B Rank Drug C Rank 21.30 6 22.11 9 33.43 15 32.06 14 19.18 2 29.00 11 21.78 8 17.66 1 21.26 5 22.12 10 19.79 3 30.58 12 30.98 13 20.89 4 21.55 7 Sum R 1 = 51 Sum R 2 = 19 Sum R 3 = 50 Step 3: Compute H from the formula: N = N 1 + N 2 + N 3 = 5 + 5 + 5 = 15. = = 6.62 (c). Step 1: The degrees of freedom is k - 1 = 3-1 = 2. Step 2: For = 0.01, the chi-square table give us 2 = 9.21. Step 3: Since H = 6.62 < 5.99, we conclude that at = 0.01 significance level we cannot conclude there is a significant difference in the weight reduction resulting from these drugs. 618

46.4 - The Spearman's Rank Correlation 46.4 - Problem 1: Step 1: List the Patients weight in ascending order and rank: Patients' weight 175 197 210 211 225 235 237 255 278 298 Rank 1 2 3 4 5 6 7 8 9 10 Step 2: List the cholesterol level in ascending order and rank: Cholesterol level 172 190 205 210 220 225 250 320 325 256 Rank 1 2 3 4 5 6 7 8 9 10 Step 3: We now place the ranking in the above table and compute D and D 2 : Patients' weight Rank Cholesterol level Rank D D 2 175 1 225 6 1-6 = -5 25 197 2 205 3 2-3 = -1 1 210 3 220 5 3-5 = -2 4 211 4 320 8 4-8 = -4 16 225 5 190 2 5-2 = 3 9 235 6 172 1 6-1 = 5 25 237 7 250 7 7-7 = 0 0 255 8 210 4 8-4 = 4 16 278 9 325 9 9-9 = 0 0 298 10 256 10 10-10 = 0 0 Sum = 96 Step 4: Since N = 7, r 1-58= 0.42. 619

Supplementary Problems 1. rank 10 9 8 7 6 5 4 3 2 1 y 10 9 8 7 6 5 4 3 2 1 rank 1 2 3 4 5 6 7 8 9 10 x 1 2 3 4 5 6 7 8 9 10 x rank y rank D D 2 1 1 10 10 1-10 = -9 81 2 2 9 9 2-9 = -7 49 3 3 8 8 3-8 = -5 25 4 4 7 7 4-7 = -3 9 5 5 6 6 5-6 = -1 1 6 6 5 5 6-5 = 1 1 7 7 4 4 7-4 = 3 9 8 8 3 3 8-3 = 5 25 9 9 2 2 9-2 = 7 49 10 10 1 1 10-1 = 9 81 Sum = 330 = 1-2 = -1 2. Let a = h and b = t. 620

From the above groups, N 1 = 11, N 2 = 9. 3. a. Since we don t know the outcome of the sample of 200 chips, we estimate N 1 and N 2, as follows: N 1 = 0.05(200) = 10(number of defective chips) and N 2 = (0.95)(200) = 190(number of non-defective chips). R 1.31 b. If the system is stable then 5% or less are defective. Therefore, H o : R 20 (The system is stable) H a : R > 20 (The system is not stable). c. Decision rule: If c* R then assume the system is not stable; otherwise assume the system is stable. c* = + z R = 20 + z1.31 For = 0.05, from the normal distribution table, z = 1.64. Therefore, c* = 20 + 1.64(1.31) 22. Decision rule: If 22 R then assume the system is not stable; otherwise assume the system is stable. 621

d. We estimate N 1 = (0.08)(200) =16 (number of defective chips), N 2 = (0.92)(200) = 184 (number of non-defective chips). R = 2.05 Therefore from the normal distribution table, the probability is zero that the system is assumed stable. = 0. e. Since the system is stable, from problem (a), = 20 and = 1.31. Therefore, from the normal distribution table, the probability is zero that the system will result in at least 25 runs. 622