THE PAIR CHART I. Dana Quade. University of North Carolina. Institute of Statistics Mimeo Series No ~.:. July 1967
|
|
- Hortense Dennis
- 5 years ago
- Views:
Transcription
1 . _ e THE PAR CHART by Dana Quade University of North Carolina nstitute of Statistics Mimeo Series No. 537., ~.:. July 1967 Supported by U. S. Public Health Service Grant No. 3-Tl-ES-6l-0l. DEPARTMENT OF BOSTATSTCS UNVERSTY OF NORTH CAROLNA Chapel Hill, N. C.
2 . _ - unknown 1 The Pair Chart by Dana Quade University of North Carolina Let Xl,XZ,...,X m be a random sample of m observations on a variable X with distribution function F, and let Yl'YZ'...Y n be a random sample of n observations on a variable Y with unknown distribution function G. classic "two sample problem" is then to compare such samples, and in particular to test the null hypothesis H that F =G. My purpose in this paper is to o show how a certain diagram, which call a pair chart, may give insight into the problem and may also shorten the computations. The pair chart is constructed as follows. The First delineate, on a sheet of ordinary graph paper, a rectangle m units wide and n units high. Starting at the lower left corner of this rectangle, which may be designated as the point (0,0), draw a line one unit to the right if the smallest observation in the two samples combined is an X, and one unit up if it is a Y. Then, from the end of this first line, draw a second line, one unit to the right.if the second smallest observation in the two samples combined is an X, and one unit up if it is a Y. Continue in the same manner; except that, on coming to a between-sample tie involving, say m' XS and n' Y's, draw a sloping line from the end of the last line drawn to the point m' to the right and n' units above it. (Within-sample ties present no difficulties to the construction.) units When lines have been drawn corresponding to all the (m + n) observations, the upper right hand corner (m,n) of the original rectangle will have been reached. 1 Supported by U. S. Public Health Service grant no. 3-Tl-ES to the University of North Carolina nstitute for Environmental Health Studies.
3 For example, suppose the two samples are: x 2,4,4,4,6,6,7,9 (m = 8) 2 _ y 4,7,8,8,8,11 (n = 6) Then the combined sample may be arranged as follows: X (XXXY) (XX) (XY) (yyy) and the corresponding pair chart is as shown in Figure 1. 9 X 11 y Figure 1..- Y (0,0) 2 ~ ~ ~ ~ X 6 6 1/ 7 9 (8,6)
4 3. The explanation of the name "pair chart" is this: each unit square within the rectangle represents one of the mn possible pairs (X.,Y.) which ~ J include one observation from each sample. Thus, the shaded square in Figure 1 corresponds to the pair X 3 = 4, Y s = 8. f the chart includes sloping lines, corresponding to between-sample ties, consider forming boxes such that the sloping lines are their diagonals. n Figure 1 there are two such boxes, outlined with dots. Then if X. = Y. for any particular pair, the corresponding ~ J square is inside such a box; if X. > Y. ~ J the corresponding square lies in the region at the lower right portion of the rectangle; and if X. < Y. lies in the remaining region at the upper left. ~ J The areas of the three the square regions---the one consisting of all the boxes, the lower right one, and the upper left one---are equal, respectively, to the numbers of pairs of the three types: already aware of its interpretation in terms of pairs, Drion did not ever since. those where X. = Y., X. > Y., or X. < Y.. Thus counting the squares ~ J ~ J ~ J in the three regions of Figure 1 shows that there are 4 pairs (X.,Y.) such ~ J that X. = Y., 8 pairs such that X. > ~ J ~ total of mn = 48 in all. Y., and 36 such that X. < Y., out of the J ~ J The pair chart seems to have been invented by Drion [lj, who used it only as a tool for obtaining certain theoretical results concerning the two-sample Kolmogorov-Smirnov test (discussed below). Although he was develop his diagram further, and its potentialities have been overlooked Consider the locus of points (X,Y) = (F(z)), G(z)) for ~ < z < ~. is in general a nondecreasing curve which joins the points (0,0) and (1,1), except that there may be gaps in it, corresponding to discontinuities in F This
5 4. or G: let any such gaps be bridged by straight lines. What results may be called the relative distribution function (RDF) of Y with respect to X; the RDF the line x = y. of X with respect to Y can be obtained from it by reflection in of the two samples: and Now let F and G be the empirical distribution functions m n that is, F (z) = m number of sample XS < z m Then the empirical relative distribution function of Y with respect to X is the locus of points (x,y) = (F (z), G (z», with gaps bridged as for the true m n RDF; G (z) = n number of sample Y's < z n again, reflection in the line x = y gives the empirical RDF of X with respect to Y. The empirical RDF is essentially equivalent to the diagram which Wi1k and Gnanadesikan [7J have called a P-p1ot. t is the sample estimator of the true RDF, which it will approach in the limit with increasing m and n. t may also be thought of as a standardized version of the pair chart, from which it differs only by a change of scale, since the path of lines from (0,0) to (m,n) in the pair chart is based on the locus of points (x,y) = (mf (z), ng (z». The pair chart has the advantage in simplicity m n of construction, however, especially when m and n are unequal. f (and only if) F =G, then the RDF of Y with respect to X, or X with respect to Y, is the diagonal of the unit square with corners (0,0), (0,1), (1,0), (1,1). Various manners of departure from this situation can be given clear interpretations. For example, the RDF of Y with respect to
6 . e 5 X lies generally above the diagonal if the XS tend to be larger than the Y's. And the RDF crosses the diagonal, lying first below and then above it, if the XS tend to be more dispersed than the y's. Thus the form of an RDF may give an extremely useful picture of the comparison between the two populations. This is illustrated in Figures 2 and 3. Figure 2 exemplifies the effects on the RDF of differences in mean and variance between X and Y, by showing the RDF's of some nonstandard normal variables with respect to a standard normal variable. Figure 3 exemplifies the effects of differences in distributional form, by showing the RDF's of various nonnormal variables with respect to a normal variable with the same mean and variance. t may be noted that some of the RDF's in Figure 3 are almost indistinguishable from the diagonal of the unit square. This suggests that, with a rank-order method, it will be difficult to distinguish a normal distribution from, say, a logistic or triangular if the mean and variance are the same. The empirical RDF or, equivalently, the pair chart yields similar interpretations with respect to the samples. Thus the path in Figure 1 lies entirely below the diagonal of the rectangle, and this illustrates how the observed values of X are in general smaller than those of Y. n addition, various tests of the hypothesis that F =G may be obtained by agreeing to reject if the empirical RDF, or pair chart, lies too far, in some appropriate sense, from the diagonal. The occurrence of ties,or at least of ties between samples, is troublesome to nearly all nonparametric procedures in that it complicates calculation of exact P-values. The following general rule for use in dealing with such ties can be stated, however: the correct P-value must be no less
7 . _ a =1 y j.l y=1 Figure 2. RELATVE DSTRBUTON FUNCTONS OF SOME NONSTANDARD NORMAL VARABLES WTH RESPECT TO A STANDARD NORMAL VARABLE ij1x = 0, ax = 1). a =1 y j.l y=2 6 a =1 y j.l y=0 a =2 y a =4 y j.l y=1 j.l y=1 a =2 y a =4 y j.l y=2 j.l y=2 a =2 y a =4 y
8 . _ Figure 3. RELATVE DSTRBUTON FUNCTONS OF SOME NONNORMAL VARABLES WTH RESPECT TO A NORMAL VARABLE HAVNG THE SAME MEAN AND VARANCE. Logistic Laplace 1 Cauchy 7 1 Triangular Chi-square (3 d.f.) Rectangular Chi-square (2 d.f.) V-shaped Chi-square (1 d.f.) Normal variable has same quarti1es rather than same variance.
9 8. than what is obtained if all ties are broken in whatever manner least favors the hypothesis, but it must be no more than what is obtained when the ties are broken in the most favorable manner for the hypothesis. are presented below and are considered in the light of this rule. The Wilcoxon and Mann-Whitney Procedures. Let U x Several tests be the area on the X-side of the path in the pair chart, below the path, so that U x is equal to the number of pairs of observations (X.,Y.) such that X. > Y., plus one half the number of pairs (if any) 1 J 1 J that Xi Y j. Similarly, let U y be the opposite area on the Y-side, so that U y = mn - U X ' Then on dividing U x and U y by mn, the total number of pairs, one obtains estimates of the probabilities P x = P {X > Y} + ~P {X = Y} such and P = P {X < Y} + ~P {X = Y}. Y These estimates, which are unbiased and consistent, are the areas within the unit square below and above the empirical RDF P x and P y the null hypothesis H o [5J, [4J, or [3J. are the corresponding areas for the true RDF. The Mann-Whitney two-sample test rejects H, o alternative that P x P y, if an extreme value of U x occurs. of Y with respect to X, while in favor of the Assuming that FaG is true, and in addition that F and G are continuous, so that ties are impossible, U x has mean mn/2 and variance mn(m + n + 1)/12. ts small-sample distribution has been tabulated: see ts large-sample distribution is normal, so the test is practicable in that case also; it is then customary to make a continuity correction by shifting the observed value of U x half a unit toward its mean.
10 . When between-sample ties occur, the U obtained from the pair chart is the x same as if half the ties had been broken in one way and half in the other. Hence the corresponding P-va1ue is a compromise between the two extremes of the general rule given above; in practice it may be either too large or too small but is usually very nearly correct. The area under the path in Figure 1 is U x = 10, and a table of the exact distribution gives the corresponding P-va1ue, for the two-sided test as described above, as.081. By the general rule, a lower bound is.043 (for U x = 8) and an upper bound is.142 (for U x = 12). A one-sided test could also have been performed, if the alternative P x < P y had been specified in advance; it would give P-va1ues half as great as the two-sided test. Another method of computation, due to Wilcoxon, requires ranking the observations of the two samples together. f the ranking is done from smallest first, with average ranks used for ties, then (as is now we11- known) we have L: (ranks of XS in combined sample) = U x + m(m+1)/2, L: (ranks of yls in combined sample) = U y + n(n+1)/2. n the example the XS have ranks 1, 3.5, 3.5, 3.5, 6.5, 6.5, 8.5, 13 in the combined sample, with sum 46; and this is equal to U = 10 plus m(m+1)/2 = 36. x The test can then be performed either using tables based directly on the rank sum or after converting to a U. The pair-chart method is at least as convenient computationally, however, unless the samples are large. The Ko1mogorov-Smirnov Procedures. The standard two-sample Ko1mogorov-Smirnov test rejects H : F =G o if too large a value of 9
11 . e is observed, where D- = sup {F (z) - G (z)}, m n -OO<z<= D+ = sup {G (z) -F(z)}; n m -OO<z<o:> the test is consistent against every alternative F ~ G. One-sided tests can - + be based on D and D. Small-sample tabulations of the null-hypothesis distribution, assuming no between-sample ties, are available: and [2J. and n large samples, P{D :: c} for every c > 0, where N = mn/(m+n). 2 exp (-2Nc ), 10 see [5J, [4J, The computational method usually proposed for these tests requires actually drawing the two empirical distribution functions F and G and then m n searching for the greatest vertical separation between them. This may, of course, involve considerable labor. A more convenient computational method is as follows: let (x,y-) be the point on the path in the pair chart farthest below the diagonal (if two or more points are equally far below it, anyone of them may be used), and let (x+, y+) be the point farthest above the diagonal; then D ( x m y ) n and + = ( y m + x ). n The key to understanding this method lies in seeing that vertical distances between two distribution functions are the same as vertical (or horizontal)
12 . distances from their relative distribution function to the diagonal of the unit square. To illustrate, note that the point on the path farthest below the diagonal in Figure 1 is (x-,y-) = (6,1), and hence D = (6/8-1/6) = 7/12. The path is nowhere above the diagonal, so (x +,y + ) may be taken as (0,0) or (8,6), yielding D+ = either way. The P-value corresponding to D = max (D -,D + ) = 7/12 is.139, according to the table of Massey [2J; corresponding to D = 7/12 and D+ = the P-values are.070 and 1.000, respectively. Because of the labor required by the usual computational methods, it is tempting to perform a preliminary grouping of the data; for instance, Siegel [5J does so in both of the examples which he presents. 11 But grouping will tend to increase the number of between-sample ties, and thus to reduce the power of the test. This is because the Kolmogorov-Smirnov tests are conservative when such ties occur; they use the smallest value of D which could possibly result from any breaking of the ties. Thus, in Figure 1, if the crucial tie at X 7 = Y 2 = 7 were broken by making X 7 > Y 2, which would be favorable to the hypothesis, then the value of D would remain the same as found previously, with the corresponding upper bound.139 on the true P-value; but breaking the tie in the opposite way would yield (x-,y-) = (7,1), D = 17/24, and hence the lower bound.043 on the P-value. The pair-chart method, however, yields such a dramatic reduction in computational effort that grouping is no longer necessary. Runs. another. Assume that all between-sample ties have been broken, one way or Then the pair chart will exhibit no sloping lines, but only horizontal and vertical ones, each corresponding to a "run" of observations
13 ., from the same sample. Let R be the total number of these runs. Wa1d and Wo1fowitz [6J showed that under H : Fe G the expected value of R is o (2mn + m + n)/(m + n) and that this is the maximum expected value for any F and G, under certain restrictions; hence they proposed rejecting H for o small observed values of R. Some insight into their results may be provided by contemplating the pair chart, as follows: 12 if F =G then the pair-chart curve should be "close" to the diagonal of the rectangle, and in order to lie as close as possible it must be made up of many short segments which continually cross and recross the diagonal; if it were made up of fewer and longer segments then it would have to lie farther away on the whole. t is not suggested that the pair chart simplifies computatons for this test, however, nor indeed is the runs test to be recommended for practical use, since it is in general less powerful than competitors such as the Ko1mogorov-Smirnov test. Owen [4J For further discussion and tables see the books of Siegel [5J and (note that Siegel's table is valid for the.025 level rather than.05 as stated). n Figure 1 there are 6 or 8 runs, depending on how the crucial tie at X 7 = Y 2 = enough for significance at any usual level. 7 is broken; even 6, however, is not nearly small = -
14 . e [lj [2J [3J [4J [5J [6J [ 7J REFERENCES E. F. Drion, "Some distribution-free tests for the difference between two empirical cumulative distribution functions", Annals of Mathematical Statistics 23, (1952) F. J. Massey, Jr., "Distribution table for the deviation between two sample cumulatives", Annals of Mathematical Statistics 23, (1952) Roy C. Milton, "An extended table of critical values for the Mann Whitney (Wilcoxon) two sample statistic", Journal of the American Statistical Association 59, (1964) D. B. Owen, Handbook of Statistical Tables, Addison-Wesley (1962). Sidney Siegel, Nonparametric Statistics for the Behavioral Sciences, McGraw-Hill (1956). A. Wald and J. Wolfowitz, "On a test whether two samples are from the same population", Annals of Mathematical Statistics 11, (1940) M. B. Wilk and R. Gnanadesikan, "Probability plotting methods for the analysis of data", invited paper presented at joint meeting of American Statistical Association, Biometric Society (ENAR), and nstitute of Mathematical Statistics, Philadelphia, September
Distribution-Free Procedures (Devore Chapter Fifteen)
Distribution-Free Procedures (Devore Chapter Fifteen) MATH-5-01: Probability and Statistics II Spring 018 Contents 1 Nonparametric Hypothesis Tests 1 1.1 The Wilcoxon Rank Sum Test........... 1 1. Normal
More informationNONPARAMETRIC TESTS. LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-12
NONPARAMETRIC TESTS LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-1 lmb@iasri.res.in 1. Introduction Testing (usually called hypothesis testing ) play a major
More informationPSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests
PSY 307 Statistics for the Behavioral Sciences Chapter 20 Tests for Ranked Data, Choosing Statistical Tests What To Do with Non-normal Distributions Tranformations (pg 382): The shape of the distribution
More informationAgonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data?
Agonistic Display in Betta splendens: Data Analysis By Joanna Weremjiwicz, Simeon Yurek, and Dana Krempels Once you have collected data with your ethogram, you are ready to analyze that data to see whether
More informationFormulas and Tables by Mario F. Triola
Copyright 010 Pearson Education, Inc. Ch. 3: Descriptive Statistics x f # x x f Mean 1x - x s - 1 n 1 x - 1 x s 1n - 1 s B variance s Ch. 4: Probability Mean (frequency table) Standard deviation P1A or
More informationContents Kruskal-Wallis Test Friedman s Two-way Analysis of Variance by Ranks... 47
Contents 1 Non-parametric Tests 3 1.1 Introduction....................................... 3 1.2 Advantages of Non-parametric Tests......................... 4 1.3 Disadvantages of Non-parametric Tests........................
More informationUnit 14: Nonparametric Statistical Methods
Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based
More informationKumaun University Nainital
Kumaun University Nainital Department of Statistics B. Sc. Semester system course structure: 1. The course work shall be divided into six semesters with three papers in each semester. 2. Each paper in
More informationDr. Maddah ENMG 617 EM Statistics 10/12/12. Nonparametric Statistics (Chapter 16, Hines)
Dr. Maddah ENMG 617 EM Statistics 10/12/12 Nonparametric Statistics (Chapter 16, Hines) Introduction Most of the hypothesis testing presented so far assumes normally distributed data. These approaches
More informationEconometrics II - Problem Set 2
Deadline for solutions: 18.5.15 Econometrics II - Problem Set Problem 1 The senior class in a particular high school had thirty boys. Twelve boys lived on farms and the other eighteen lived in town. A
More informationNAG Library Chapter Introduction. G08 Nonparametric Statistics
NAG Library Chapter Introduction G08 Nonparametric Statistics Contents 1 Scope of the Chapter.... 2 2 Background to the Problems... 2 2.1 Parametric and Nonparametric Hypothesis Testing... 2 2.2 Types
More informationUNIVERSITY OF NORTH CAROLINA Department of Statistics Chapel Hill, N. C. RICHARD F. POTTHOFF February, Contract No.
UNIVERSITY OF NORTH CAROLINA Department of Statistics Chapel Hill, N. C. COlPARING THE MEDIANS OF TvlO SYNMEl'RICAL POPULATICNS by RICHARD F. POTTHOFF February, 962 Contract No. AF 49(638)-23 This paper
More informationData Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data?
Data Analysis: Agonistic Display in Betta splendens By Joanna Weremjiwicz, Simeon Yurek, and Dana Krempels Once you have collected data with your ethogram, you are ready to analyze that data to see whether
More informationAdvanced Statistics II: Non Parametric Tests
Advanced Statistics II: Non Parametric Tests Aurélien Garivier ParisTech February 27, 2011 Outline Fitting a distribution Rank Tests for the comparison of two samples Two unrelated samples: Mann-Whitney
More informationNAG Library Chapter Introduction. g08 Nonparametric Statistics
g08 Nonparametric Statistics Introduction g08 NAG Library Chapter Introduction g08 Nonparametric Statistics Contents 1 Scope of the Chapter.... 2 2 Background to the Problems... 2 2.1 Parametric and Nonparametric
More informationInferential Statistics
Inferential Statistics Eva Riccomagno, Maria Piera Rogantin DIMA Università di Genova riccomagno@dima.unige.it rogantin@dima.unige.it Part G Distribution free hypothesis tests 1. Classical and distribution-free
More informationNonparametric Location Tests: k-sample
Nonparametric Location Tests: k-sample Nathaniel E. Helwig Assistant Professor of Psychology and Statistics University of Minnesota (Twin Cities) Updated 04-Jan-2017 Nathaniel E. Helwig (U of Minnesota)
More informationChapter 7 Comparison of two independent samples
Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N
More informationTransition Passage to Descriptive Statistics 28
viii Preface xiv chapter 1 Introduction 1 Disciplines That Use Quantitative Data 5 What Do You Mean, Statistics? 6 Statistics: A Dynamic Discipline 8 Some Terminology 9 Problems and Answers 12 Scales of
More information5 Introduction to the Theory of Order Statistics and Rank Statistics
5 Introduction to the Theory of Order Statistics and Rank Statistics This section will contain a summary of important definitions and theorems that will be useful for understanding the theory of order
More informationChapter 12 - Part I: Correlation Analysis
ST coursework due Friday, April - Chapter - Part I: Correlation Analysis Textbook Assignment Page - # Page - #, Page - # Lab Assignment # (available on ST webpage) GOALS When you have completed this lecture,
More informationNonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health
Nonparametric statistic methods Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Measurement What are the 4 levels of measurement discussed? 1. Nominal or Classificatory Scale Gender,
More informationGlossary. The ISI glossary of statistical terms provides definitions in a number of different languages:
Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the
More informationTESTS BASED ON EMPIRICAL DISTRIBUTION FUNCTION. Submitted in partial fulfillment of the requirements for the award of the degree of
TESTS BASED ON EMPIRICAL DISTRIBUTION FUNCTION Submitted in partial fulfillment of the requirements for the award of the degree of MASTER OF SCIENCE IN MATHEMATICS AND COMPUTING Submitted by Gurpreet Kaur
More informationStatistical Significance of Ranking Paradoxes
Statistical Significance of Ranking Paradoxes Anna E. Bargagliotti and Raymond N. Greenwell 1 February 28, 2009 1 Anna E. Bargagliotti is an Assistant Professor in the Department of Mathematical Sciences
More informationRecall the Basics of Hypothesis Testing
Recall the Basics of Hypothesis Testing The level of significance α, (size of test) is defined as the probability of X falling in w (rejecting H 0 ) when H 0 is true: P(X w H 0 ) = α. H 0 TRUE H 1 TRUE
More informationH 2 : otherwise. that is simply the proportion of the sample points below level x. For any fixed point x the law of large numbers gives that
Lecture 28 28.1 Kolmogorov-Smirnov test. Suppose that we have an i.i.d. sample X 1,..., X n with some unknown distribution and we would like to test the hypothesis that is equal to a particular distribution
More informationChapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics
Section 15.1: An Overview of Nonparametric Statistics Understand Difference between Parametric and Nonparametric Statistical Procedures Parametric statistical procedures inferential procedures that rely
More informationComputer simulation on homogeneity testing for weighted data sets used in HEP
Computer simulation on homogeneity testing for weighted data sets used in HEP Petr Bouř and Václav Kůs Department of Mathematics, Faculty of Nuclear Sciences and Physical Engineering, Czech Technical University
More informationST4241 Design and Analysis of Clinical Trials Lecture 7: N. Lecture 7: Non-parametric tests for PDG data
ST4241 Design and Analysis of Clinical Trials Lecture 7: Non-parametric tests for PDG data Department of Statistics & Applied Probability 8:00-10:00 am, Friday, September 2, 2016 Outline Non-parametric
More informationLinear Equations. Find the domain and the range of the following set. {(4,5), (7,8), (-1,3), (3,3), (2,-3)}
Linear Equations Domain and Range Domain refers to the set of possible values of the x-component of a point in the form (x,y). Range refers to the set of possible values of the y-component of a point in
More informationAnalysis of 2x2 Cross-Over Designs using T-Tests
Chapter 234 Analysis of 2x2 Cross-Over Designs using T-Tests Introduction This procedure analyzes data from a two-treatment, two-period (2x2) cross-over design. The response is assumed to be a continuous
More informationData analysis and Geostatistics - lecture VII
Data analysis and Geostatistics - lecture VII t-tests, ANOVA and goodness-of-fit Statistical testing - significance of r Testing the significance of the correlation coefficient: t = r n - 2 1 - r 2 with
More informationGTPS Curriculum 6 th Grade Math. Topic: Topic 1- Numeration
9 days / September Compute fluently with multi-digit numbers and find common factors and multiples. 6.NS.3 Fluently add, subtract, multiply, and divide multi-digit decimals using the standard algorithm
More informationCHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)
FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter
More informationSTATISTIKA INDUSTRI 2 TIN 4004
STATISTIKA INDUSTRI 2 TIN 4004 Pertemuan 11 & 12 Outline: Nonparametric Statistics Referensi: Walpole, R.E., Myers, R.H., Myers, S.L., Ye, K., Probability & Statistics for Engineers & Scientists, 9 th
More informationIntroduction to Biostatistics: Part 5, Statistical Inference Techniques for Hypothesis Testing With Nonparametric Data
SPECIAL CONTRIBUTION biostatistics Introduction to Biostatistics: Part 5, Statistical Inference Techniques for Hypothesis Testing With Nonparametric Data Specific statistical tests are used when the null
More informationUNIT 5:Random number generation And Variation Generation
UNIT 5:Random number generation And Variation Generation RANDOM-NUMBER GENERATION Random numbers are a necessary basic ingredient in the simulation of almost all discrete systems. Most computer languages
More informationNon-Parametric Two-Sample Analysis: The Mann-Whitney U Test
Non-Parametric Two-Sample Analysis: The Mann-Whitney U Test When samples do not meet the assumption of normality parametric tests should not be used. To overcome this problem, non-parametric tests can
More informationDETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics
DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and
More informationRank-Based Methods. Lukas Meier
Rank-Based Methods Lukas Meier 20.01.2014 Introduction Up to now we basically always used a parametric family, like the normal distribution N (µ, σ 2 ) for modeling random data. Based on observed data
More informationA Monte Carlo Simulation of the Robust Rank- Order Test Under Various Population Symmetry Conditions
Journal of Modern Applied Statistical Methods Volume 12 Issue 1 Article 7 5-1-2013 A Monte Carlo Simulation of the Robust Rank- Order Test Under Various Population Symmetry Conditions William T. Mickelson
More informationNonparametric hypothesis tests and permutation tests
Nonparametric hypothesis tests and permutation tests 1.7 & 2.3. Probability Generating Functions 3.8.3. Wilcoxon Signed Rank Test 3.8.2. Mann-Whitney Test Prof. Tesler Math 283 Fall 2018 Prof. Tesler Wilcoxon
More informationChapter 18 Resampling and Nonparametric Approaches To Data
Chapter 18 Resampling and Nonparametric Approaches To Data 18.1 Inferences in children s story summaries (McConaughy, 1980): a. Analysis using Wilcoxon s rank-sum test: Younger Children Older Children
More informationChapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing
Chapter Fifteen Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Copyright 2010 Pearson Education, Inc. publishing as Prentice Hall 15-1 Internet Usage Data Table 15.1 Respondent Sex Familiarity
More informationISQS 5349 Final Exam, Spring 2017.
ISQS 5349 Final Exam, Spring 7. Instructions: Put all answers on paper other than this exam. If you do not have paper, some will be provided to you. The exam is OPEN BOOKS, OPEN NOTES, but NO ELECTRONIC
More informationMy data doesn t look like that..
Testing assumptions My data doesn t look like that.. We have made a big deal about testing model assumptions each week. Bill Pine Testing assumptions Testing assumptions We have made a big deal about testing
More informationA MONTE CARLO INVESTIGATION INTO THE PROPERTIES OF A PROPOSED ROBUST ONE-SAMPLE TEST OF LOCATION. Bercedis Peterson
A MONTE CARLO INVESTIGATION INTO THE PROPERTIES OF A PROPOSED ROBUST ONE-SAMPLE TEST OF LOCATION by Bercedis Peterson Department of Biostatistics University of North Carolina at Chapel Hill Institute of
More informationChapte The McGraw-Hill Companies, Inc. All rights reserved.
er15 Chapte Chi-Square Tests d Chi-Square Tests for -Fit Uniform Goodness- Poisson Goodness- Goodness- ECDF Tests (Optional) Contingency Tables A contingency table is a cross-tabulation of n paired observations
More information4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures
Non-parametric Test Stephen Opiyo Overview Distinguish Parametric and Nonparametric Test Procedures Explain commonly used Nonparametric Test Procedures Perform Hypothesis Tests Using Nonparametric Procedures
More informationRandom Number Generation. CS1538: Introduction to simulations
Random Number Generation CS1538: Introduction to simulations Random Numbers Stochastic simulations require random data True random data cannot come from an algorithm We must obtain it from some process
More informationSolutions exercises of Chapter 7
Solutions exercises of Chapter 7 Exercise 1 a. These are paired samples: each pair of half plates will have about the same level of corrosion, so the result of polishing by the two brands of polish are
More informationExam details. Final Review Session. Things to Review
Exam details Final Review Session Short answer, similar to book problems Formulae and tables will be given You CAN use a calculator Date and Time: Dec. 7, 006, 1-1:30 pm Location: Osborne Centre, Unit
More informationANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS
ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing
More informationNON-PARAMETRIC STATISTICS * (http://www.statsoft.com)
NON-PARAMETRIC STATISTICS * (http://www.statsoft.com) 1. GENERAL PURPOSE 1.1 Brief review of the idea of significance testing To understand the idea of non-parametric statistics (the term non-parametric
More informationTHE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE
THE ROYAL STATISTICAL SOCIETY 004 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER II STATISTICAL METHODS The Society provides these solutions to assist candidates preparing for the examinations in future
More informationStatistical Procedures for Testing Homogeneity of Water Quality Parameters
Statistical Procedures for ing Homogeneity of Water Quality Parameters Xu-Feng Niu Professor of Statistics Department of Statistics Florida State University Tallahassee, FL 3306 May-September 004 1. Nonparametric
More informationSystems Simulation Chapter 7: Random-Number Generation
Systems Simulation Chapter 7: Random-Number Generation Fatih Cavdur fatihcavdur@uludag.edu.tr April 22, 2014 Introduction Introduction Random Numbers (RNs) are a necessary basic ingredient in the simulation
More informationTwo-by-two ANOVA: Global and Graphical Comparisons Based on an Extension of the Shift Function
Journal of Data Science 7(2009), 459-468 Two-by-two ANOVA: Global and Graphical Comparisons Based on an Extension of the Shift Function Rand R. Wilcox University of Southern California Abstract: When comparing
More informationInvestigation Find the area of the triangle. (See student text.)
Selected ACE: Looking For Pythagoras Investigation 1: #20, #32. Investigation 2: #18, #38, #42. Investigation 3: #8, #14, #18. Investigation 4: #12, #15, #23. ACE Problem Investigation 1 20. Find the area
More informationParametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami
Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous
More informationINEQUALITIES M.K. HOME TUITION. Mathematics Revision Guides Level: GCSE Higher Tier
Mathematics Revision Guides Inequalities Page 1 of 17 M.K. HOME TUITION Mathematics Revision Guides Level: GCSE Higher Tier INEQUALITIES Version: 3.5 Date: 22-10-2015 Mathematics Revision Guides Inequalities
More informationY i = η + ɛ i, i = 1,...,n.
Nonparametric tests If data do not come from a normal population (and if the sample is not large), we cannot use a t-test. One useful approach to creating test statistics is through the use of rank statistics.
More informationINDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -27 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.
INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY Lecture -27 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc. Summary of the previous lecture Frequency factors Normal distribution
More informationModule 9: Nonparametric Statistics Statistics (OA3102)
Module 9: Nonparametric Statistics Statistics (OA3102) Professor Ron Fricker Naval Postgraduate School Monterey, California Reading assignment: WM&S chapter 15.1-15.6 Revision: 3-12 1 Goals for this Lecture
More informationANOVA - analysis of variance - used to compare the means of several populations.
12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.
More informationStatistics: revision
NST 1B Experimental Psychology Statistics practical 5 Statistics: revision Rudolf Cardinal & Mike Aitken 29 / 30 April 2004 Department of Experimental Psychology University of Cambridge Handouts: Answers
More informationNon-parametric Hypothesis Testing
Non-parametric Hypothesis Testing Procedures Hypothesis Testing General Procedure for Hypothesis Tests 1. Identify the parameter of interest.. Formulate the null hypothesis, H 0. 3. Specify an appropriate
More information13.7 ANOTHER TEST FOR TREND: KENDALL S TAU
13.7 ANOTHER TEST FOR TREND: KENDALL S TAU In 1969 the U.S. government instituted a draft lottery for choosing young men to be drafted into the military. Numbers from 1 to 366 were randomly assigned to
More informationNonparametric Statistics Notes
Nonparametric Statistics Notes Chapter 5: Some Methods Based on Ranks Jesse Crawford Department of Mathematics Tarleton State University (Tarleton State University) Ch 5: Some Methods Based on Ranks 1
More information14.30 Introduction to Statistical Methods in Economics Spring 2009
MIT OpenCourseWare http://ocw.mit.edu 4.0 Introduction to Statistical Methods in Economics Spring 009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
More informationResearch Article A Nonparametric Two-Sample Wald Test of Equality of Variances
Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner
More informationGlossary for the Triola Statistics Series
Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling
More informationNon-Parametric Statistics: When Normal Isn t Good Enough"
Non-Parametric Statistics: When Normal Isn t Good Enough" Professor Ron Fricker" Naval Postgraduate School" Monterey, California" 1/28/13 1 A Bit About Me" Academic credentials" Ph.D. and M.A. in Statistics,
More informationarxiv: v1 [stat.me] 14 Jan 2019
arxiv:1901.04443v1 [stat.me] 14 Jan 2019 An Approach to Statistical Process Control that is New, Nonparametric, Simple, and Powerful W.J. Conover, Texas Tech University, Lubbock, Texas V. G. Tercero-Gómez,Tecnológico
More information3. Nonparametric methods
3. Nonparametric methods If the probability distributions of the statistical variables are unknown or are not as required (e.g. normality assumption violated), then we may still apply nonparametric tests
More informationNONPARAMETRICS. Statistical Methods Based on Ranks E. L. LEHMANN HOLDEN-DAY, INC. McGRAW-HILL INTERNATIONAL BOOK COMPANY
NONPARAMETRICS Statistical Methods Based on Ranks E. L. LEHMANN University of California, Berkeley With the special assistance of H. J. M. D'ABRERA University of California, Berkeley HOLDEN-DAY, INC. San
More informationComparison of Two Population Means
Comparison of Two Population Means Esra Akdeniz March 15, 2015 Independent versus Dependent (paired) Samples We have independent samples if we perform an experiment in two unrelated populations. We have
More informationIntroduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.
Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of
More informationThe entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.
One-Way ANOVA Summary The One-Way ANOVA procedure is designed to construct a statistical model describing the impact of a single categorical factor X on a dependent variable Y. Tests are run to determine
More informationAnalyzing Small Sample Experimental Data
Analyzing Small Sample Experimental Data Session 2: Non-parametric tests and estimators I Dominik Duell (University of Essex) July 15, 2017 Pick an appropriate (non-parametric) statistic 1. Intro to non-parametric
More informationWhat to do today (Nov 22, 2018)?
What to do today (Nov 22, 2018)? Part 1. Introduction and Review (Chp 1-5) Part 2. Basic Statistical Inference (Chp 6-9) Part 3. Important Topics in Statistics (Chp 10-13) Part 4. Further Topics (Selected
More informationMinimum distance tests and estimates based on ranks
Minimum distance tests and estimates based on ranks Authors: Radim Navrátil Department of Mathematics and Statistics, Masaryk University Brno, Czech Republic (navratil@math.muni.cz) Abstract: It is well
More informationR1: Sets A set is a collection of objects sets are written using set brackets each object in onset is called an element or member
Chapter R Review of basic concepts * R1: Sets A set is a collection of objects sets are written using set brackets each object in onset is called an element or member Ex: Write the set of counting numbers
More informationComparison of Power between Adaptive Tests and Other Tests in the Field of Two Sample Scale Problem
Comparison of Power between Adaptive Tests and Other Tests in the Field of Two Sample Scale Problem Chikhla Jun Gogoi 1, Dr. Bipin Gogoi 2 1 Research Scholar, Department of Statistics, Dibrugarh University,
More informationNon-parametric methods
Eastern Mediterranean University Faculty of Medicine Biostatistics course Non-parametric methods March 4&7, 2016 Instructor: Dr. Nimet İlke Akçay (ilke.cetin@emu.edu.tr) Learning Objectives 1. Distinguish
More informationp(z)
Chapter Statistics. Introduction This lecture is a quick review of basic statistical concepts; probabilities, mean, variance, covariance, correlation, linear regression, probability density functions and
More informationA comparison study of the nonparametric tests based on the empirical distributions
통계연구 (2015), 제 20 권제 3 호, 1-12 A comparison study of the nonparametric tests based on the empirical distributions Hyo-Il Park 1) Abstract In this study, we propose a nonparametric test based on the empirical
More informationDistribution Fitting (Censored Data)
Distribution Fitting (Censored Data) Summary... 1 Data Input... 2 Analysis Summary... 3 Analysis Options... 4 Goodness-of-Fit Tests... 6 Frequency Histogram... 8 Comparison of Alternative Distributions...
More informationTextbook Examples of. SPSS Procedure
Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of
More informationKruskal-Wallis and Friedman type tests for. nested effects in hierarchical designs 1
Kruskal-Wallis and Friedman type tests for nested effects in hierarchical designs 1 Assaf P. Oron and Peter D. Hoff Department of Statistics, University of Washington, Seattle assaf@u.washington.edu, hoff@stat.washington.edu
More informationSAMPLE. 1.2 Prime factor form. 1.3 Finding the Highest Common Factor (HCF)
2 Exploring numbers.2 Prime factor form You can write any number as the product of its prime factors. These factors are prime numbers: 720 = 2 60 = 2 2 80 = 2 2 2 90 = 2 2 2 2 45 = 2 2 2 2 5 = 2 2 2 2
More informationA Signed-Rank Test Based on the Score Function
Applied Mathematical Sciences, Vol. 10, 2016, no. 51, 2517-2527 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2016.66189 A Signed-Rank Test Based on the Score Function Hyo-Il Park Department
More informationAlternative Presentation of the Standard Normal Distribution
A2 A APPENDIX A Alternative Presentation of the Standard Normal Distribution WHAT YOU SHOULD LEARN How to find areas under the standard normal curve INSIGHT Because every normal distribution can be transformed
More informationHYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă
HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and
More informationDesign of the Fuzzy Rank Tests Package
Design of the Fuzzy Rank Tests Package Charles J. Geyer July 15, 2013 1 Introduction We do fuzzy P -values and confidence intervals following Geyer and Meeden (2005) and Thompson and Geyer (2007) for three
More informationA nonparametric two-sample wald test of equality of variances
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 211 A nonparametric two-sample wald test of equality of variances David
More informationStatistics Handbook. All statistical tables were computed by the author.
Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance
More informationTabulation means putting data into tables. A table is a matrix of data in rows and columns, with the rows and the columns having titles.
1 Tabulation means putting data into tables. A table is a matrix of data in rows and columns, with the rows and the columns having titles. 2 converting the set of numbers into the form of a grouped frequency
More informationIntroduction to Statistical Data Analysis III
Introduction to Statistical Data Analysis III JULY 2011 Afsaneh Yazdani Preface Major branches of Statistics: - Descriptive Statistics - Inferential Statistics Preface What is Inferential Statistics? The
More information