A Comparison of the Power of the Discrete Kolmogorov-Smirnov and Chi- Square Goodnessof-Fit

Size: px
Start display at page:

Download "A Comparison of the Power of the Discrete Kolmogorov-Smirnov and Chi- Square Goodnessof-Fit"

Transcription

1 Bond University From the SelectedWorks of Mike Steele 22 A Comparison of the Power of the Discrete Kolmogorov-Smirnov and Chi- Square Goodnessof-Fit Tests. Mike Steele, Bond University Neil Smart, Bond University Cameron Hurst, Queensland University of Technology Janet Chaseling, Griffith University Available at:

2 A comparison of the power of the discrete Kolmogorov-Smirnov and Chi-Square goodness-of-fit tests Michael Steele, Neil Smart 2, Cameron Hurst 3 and Janet Chaseling 4 School of Business, Bond University, Australia. 2 Faculty of Health Science and Medicine, Bond University, Australia 3 School of Public Health, Queensland University of Technology, Australia. 4 Griffith School of Environment, Griffith University, Australia This is the original author version of the paper. The definitive version is: Chaseling, J., Steele, M., Smart, N., & Hurst, C. (22). A comparison of the power of the discrete Kolmorgorov- Smirnov and Chi-Square Goodness of Fit tests. In K. Kumar & A. Chaturvedi (Eds.). Some Recent Developments in Statistical Theory and Applications (pp. 4-9). Boca Raton, Florida, U.S.A: Brown Walker Press Abstract Goodness-of-fit tests for discrete data are widely used in the health and economic disciplines but only a limited number of published power studies exist which make recommendations on relative power of these test statistics. This paper compares the power of Pearson s Chi-Square test with the discrete Kolmogorov-Smirnov test for a uniform null distribution against a number of predefined alternative distributions and makes recommendations on which is the more powerful discrete goodness-of-fit test.. Introduction Goodness-of-fit tests are widely used in economic and health research areas however limited power studies are available in the literature (Choulakian et al. 994; Pettitt and Stephens 977; Steele and Chaseling 26; Ampadu et al. 29). Although Chi-Square type tests are widely used on discrete data for economic and health projects this paper shows situations where the power of other goodnessof-fit test statistics are relatively higher. In this paper the power of Pearson s Chi-Square test statistic (Pearson 9) is compared with that of the discrete Kolmogorov-Smirnov test statistic defined by Pettitt and Stephens (977). The power for each test statistic defined in Section 2 is approximated for a uniform null against various alternative distributions defined in Section 3. The power study results are given in Section 4 and in Section 5 recommendations are made on which of the two test statistics the researcher should use in relation to power. 2. The test statistics defined The two test statistics in this power study are the discrete Kolmogorov-Smirnov (Pettitt and Stephens 977) and Pearson s Chi-Square (Pearson 9) and are defined in equations 2. and 2.2. max i k Z (2.) i k 2 i O E 2 i E i i where k is the number of cells, O i and E i are the observed and expected frequencies for cell i, and Z i is the cumulative sum of the differences between the observed and expected frequencies up to and including cell i. (2.2)

3 3. Techniques to determine power To obtain consistent results a uniform null distribution over ten cells is used for each alternative distribution defined in Table below. The powers of each test statistic is calculated for sample sizes of, 2, 3, 5, and 2 (or, 2, 3, 5, and 2 observations per cell under the uniform null distribution). The power is estimated from simulated random samples by generating a null and alternative distribution of each test statistic. The simulated null distribution of each test statistic is discrete so an exact significance level of 5% is generally not possible. To account for this the power is linearly interpolated about the 5% level to enable consistent estimates of the power for each test statistic. Table. Alternative distributions used in the power studies Description Cell Probability (2 Decimal Places) Decreasing Step Triangular Platykurtic Leptokurtic Bimodal Results of the power study 4. Decreasing alternative For the smaller sample sizes Figure shows that the test statistic has greater power than the χ 2 test statistic. For larger sample sizes of at least five observations per cell this difference is shown to be negligible Step type alternative Fig.. Power of and χ 2 for a uniform null and decreasing alternative In Figure 2 the results are similar to the trend type alternative in Section 4. with the power of being substantially greater than that of the χ 2 test statistic for sample sizes of up to per cell under the null distribution.

4 Power Fig. 2. Power of and χ 2 for a uniform null and step type alternative 4.3 Triangular alternative distribution The power of is shown in Figure 3 to be less than that of the χ 2 test statistic for most of the sample sizes. The exception being the larger sample sizes where the power of both test statistics is approximately equal. Clearly the χ 2 test statistic is the more powerful for this particular alternative distribution. 4.4 Platykurtic alternative Fig. 3. Power of and χ 2 for a uniform null and triangular alternative The power of the test statistic is shown in Figure 4 to be very low for all sample sizes. Although the power of the χ 2 test statistic is shown to be much greater than the test statistic for larger sample sizes its power for smaller sample sizes is however quite poor. Although the χ 2 test statistic is shown to have relatively higher power than its use for the smaller sample sizes is still with very low power.

5 Power Fig. 4. Power of and χ 2 for a uniform null and platykurtic alternative 4.5 Leptokurtic alternative The power of is shown in Figure 5 to be relatively less than the χ 2 test statistic for smaller sample sizes however for sample sizes of at least five per cell the powers of both test statistics are shown to be approximately the same and high for this alternative distribution. Overall the χ 2 test statistic could be used with higher power than the test statistic for the leptokurtic alternative distribution, particularly for smaller sample sizes Fig. 5. Power of and χ 2 for a uniform null and leptokurtic alternative 4.6 Bimodal alternative The test statistic is shown in Figure 6 to be very low regardless of the sample size and the power of the χ 2 test statistic is also shown to be very low for sample sizes less than ten per cell under the null distribution. Clearly unless the sample size is quite large that none of these two test statistics can be used with high power for a bimodal alternative distribution.

6 Fig. 6. Power of and χ 2 for a uniform null and bimodal alternative 5 Conclusions and Recommendations The power study results show that neither of the test statistics can realistically recommended to the applied econometrician as having higher power for all situations. These results and the summary in Table 2 can at least give the applied econometrician some guide to the choice of alternative goodnessof-fit test statistics with respect to power. Table 2. General summary of the power of the two test statistics Alternative Ranking of the Powers Distribution Decreasing > χ 2 Step > χ 2 Triangular χ 2 > Platykurtic Χ 2 > Leptokurtic χ 2 > Bimodal χ 2 > References Ampadu C, Wang D, Steele M (29) Simulated power of some discrete goodness-of-fit test statistics for testing the null hypothesis of a zig-zag distribution. Far East Journal of Theoretical Statistics 28:57-7 Choulakian V, Lockhart RA, Stephens MA (994) Cramér-von Mises test statistics for discrete distributions. The Canadian Journal of Statistics 22:25-37 Pearson K (9) On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. Philosophical Magazine Series Five 5:57-75 Pettitt AN, Stephens MA (977) The Kolmogorov-Smirnov goodness-of-fit statistic with discrete and grouped data. Technometrics 9:25-2 Steele M, Chaseling J (26) Powers of discrete goodness-of-fit test statistics for a uniform null against a selection of alternative distributions. Communications in Statistics Simulation and Computation 35:67-75.

SIMULATED POWER OF SOME DISCRETE GOODNESS- OF-FIT TEST STATISTICS FOR TESTING THE NULL HYPOTHESIS OF A ZIG-ZAG DISTRIBUTION

SIMULATED POWER OF SOME DISCRETE GOODNESS- OF-FIT TEST STATISTICS FOR TESTING THE NULL HYPOTHESIS OF A ZIG-ZAG DISTRIBUTION Far East Journal of Theoretical Statistics Volume 28, Number 2, 2009, Pages 57-7 This paper is available online at http://www.pphmj.com 2009 Pushpa Publishing House SIMULATED POWER OF SOME DISCRETE GOODNESS-

More information

Some Recent Developments in Statistical Theory and Applications

Some Recent Developments in Statistical Theory and Applications Some Recent Developments in Statistical Theory and Applications Some Recent Developments in Statistical Theory and Applications Selected Proceedings of the International Conference on Recent Developments

More information

MODEL FOR DISTRIBUTION OF WAGES

MODEL FOR DISTRIBUTION OF WAGES 19th Applications of Mathematics and Statistics in Economics AMSE 216 MODEL FOR DISTRIBUTION OF WAGES MICHAL VRABEC, LUBOŠ MAREK University of Economics, Prague, Faculty of Informatics and Statistics,

More information

Recall the Basics of Hypothesis Testing

Recall the Basics of Hypothesis Testing Recall the Basics of Hypothesis Testing The level of significance α, (size of test) is defined as the probability of X falling in w (rejecting H 0 ) when H 0 is true: P(X w H 0 ) = α. H 0 TRUE H 1 TRUE

More information

A note on vector-valued goodness-of-fit tests

A note on vector-valued goodness-of-fit tests A note on vector-valued goodness-of-fit tests Vassilly Voinov and Natalie Pya Kazakhstan Institute of Management, Economics and Strategic Research 050010 Almaty, Kazakhstan (e-mail: voinovv@kimep.kz, pya@kimep.kz)

More information

Statistic Distribution Models for Some Nonparametric Goodness-of-Fit Tests in Testing Composite Hypotheses

Statistic Distribution Models for Some Nonparametric Goodness-of-Fit Tests in Testing Composite Hypotheses Communications in Statistics - Theory and Methods ISSN: 36-926 (Print) 532-45X (Online) Journal homepage: http://www.tandfonline.com/loi/lsta2 Statistic Distribution Models for Some Nonparametric Goodness-of-Fit

More information

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters Communications for Statistical Applications and Methods 2017, Vol. 24, No. 5, 519 531 https://doi.org/10.5351/csam.2017.24.5.519 Print ISSN 2287-7843 / Online ISSN 2383-4757 Goodness-of-fit tests for randomly

More information

Lecture 2: CDF and EDF

Lecture 2: CDF and EDF STAT 425: Introduction to Nonparametric Statistics Winter 2018 Instructor: Yen-Chi Chen Lecture 2: CDF and EDF 2.1 CDF: Cumulative Distribution Function For a random variable X, its CDF F () contains all

More information

H 2 : otherwise. that is simply the proportion of the sample points below level x. For any fixed point x the law of large numbers gives that

H 2 : otherwise. that is simply the proportion of the sample points below level x. For any fixed point x the law of large numbers gives that Lecture 28 28.1 Kolmogorov-Smirnov test. Suppose that we have an i.i.d. sample X 1,..., X n with some unknown distribution and we would like to test the hypothesis that is equal to a particular distribution

More information

CHAPTER 1 CLASSES OF FIXED-ORDER AND ADAPTIVE SMOOTH GOODNESS-OF-FIT TESTS WITH DISCRETE RIGHT-CENSORED DATA

CHAPTER 1 CLASSES OF FIXED-ORDER AND ADAPTIVE SMOOTH GOODNESS-OF-FIT TESTS WITH DISCRETE RIGHT-CENSORED DATA CHAPTER 1 CLASSES OF FIXED-ORDER AND ADAPTIVE SMOOTH GOODNESS-OF-FIT TESTS WITH DISCRETE RIGHT-CENSORED DATA Edsel A. Peña Department of Statistics University of South Carolina Columbia, SC 29208 USA E-mail:

More information

Finding Outliers in Monte Carlo Computations

Finding Outliers in Monte Carlo Computations Finding Outliers in Monte Carlo Computations Prof. Michael Mascagni Department of Computer Science Department of Mathematics Department of Scientific Computing Graduate Program in Molecular Biophysics

More information

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Chapte The McGraw-Hill Companies, Inc. All rights reserved. er15 Chapte Chi-Square Tests d Chi-Square Tests for -Fit Uniform Goodness- Poisson Goodness- Goodness- ECDF Tests (Optional) Contingency Tables A contingency table is a cross-tabulation of n paired observations

More information

Application of Homogeneity Tests: Problems and Solution

Application of Homogeneity Tests: Problems and Solution Application of Homogeneity Tests: Problems and Solution Boris Yu. Lemeshko (B), Irina V. Veretelnikova, Stanislav B. Lemeshko, and Alena Yu. Novikova Novosibirsk State Technical University, Novosibirsk,

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 004 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER II STATISTICAL METHODS The Society provides these solutions to assist candidates preparing for the examinations in future

More information

Exact goodness-of-fit tests for censored data

Exact goodness-of-fit tests for censored data Exact goodness-of-fit tests for censored data Aurea Grané Statistics Department. Universidad Carlos III de Madrid. Abstract The statistic introduced in Fortiana and Grané (23, Journal of the Royal Statistical

More information

Irr. Statistical Methods in Experimental Physics. 2nd Edition. Frederick James. World Scientific. CERN, Switzerland

Irr. Statistical Methods in Experimental Physics. 2nd Edition. Frederick James. World Scientific. CERN, Switzerland Frederick James CERN, Switzerland Statistical Methods in Experimental Physics 2nd Edition r i Irr 1- r ri Ibn World Scientific NEW JERSEY LONDON SINGAPORE BEIJING SHANGHAI HONG KONG TAIPEI CHENNAI CONTENTS

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Investigation of goodness-of-fit test statistic distributions by random censored samples

Investigation of goodness-of-fit test statistic distributions by random censored samples d samples Investigation of goodness-of-fit test statistic distributions by random censored samples Novosibirsk State Technical University November 22, 2010 d samples Outline 1 Nonparametric goodness-of-fit

More information

Spare parts inventory management: new evidence from distribution fitting

Spare parts inventory management: new evidence from distribution fitting Spare parts inventory management: new evidence from distribution fitting Laura Turrini Joern Meissner IIF Workshop on Supply Chain Forecasting for Operations Lancaster 29. Juni 2016 FORECASTING SPARE PARTS:

More information

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Nonparametric statistic methods Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Measurement What are the 4 levels of measurement discussed? 1. Nominal or Classificatory Scale Gender,

More information

NAG Toolbox for Matlab. g08cd.1

NAG Toolbox for Matlab. g08cd.1 G08 Nonparametric Statistics NAG Toolbox for Matlab 1 Purpose performs the two sample Kolmogorov Smirnov distribution test. 2 Syntax [d, z, p, sx, sy, ifail] = (x, y, ntype, n1, n1, n2, n2) 3 Description

More information

A TEST OF FIT FOR THE GENERALIZED PARETO DISTRIBUTION BASED ON TRANSFORMS

A TEST OF FIT FOR THE GENERALIZED PARETO DISTRIBUTION BASED ON TRANSFORMS A TEST OF FIT FOR THE GENERALIZED PARETO DISTRIBUTION BASED ON TRANSFORMS Dimitrios Konstantinides, Simos G. Meintanis Department of Statistics and Acturial Science, University of the Aegean, Karlovassi,

More information

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1,

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1, Economics 520 Lecture Note 9: Hypothesis Testing via the Neyman-Pearson Lemma CB 8., 8.3.-8.3.3 Uniformly Most Powerful Tests and the Neyman-Pearson Lemma Let s return to the hypothesis testing problem

More information

Hypothesis testing:power, test statistic CMS:

Hypothesis testing:power, test statistic CMS: Hypothesis testing:power, test statistic The more sensitive the test, the better it can discriminate between the null and the alternative hypothesis, quantitatively, maximal power In order to achieve this

More information

Exact Statistical Inference in. Parametric Models

Exact Statistical Inference in. Parametric Models Exact Statistical Inference in Parametric Models Audun Sektnan December 2016 Specialization Project Department of Mathematical Sciences Norwegian University of Science and Technology Supervisor: Professor

More information

Asymptotic Statistics-VI. Changliang Zou

Asymptotic Statistics-VI. Changliang Zou Asymptotic Statistics-VI Changliang Zou Kolmogorov-Smirnov distance Example (Kolmogorov-Smirnov confidence intervals) We know given α (0, 1), there is a well-defined d = d α,n such that, for any continuous

More information

Goodness-of-fit Tests for the Normal Distribution Project 1

Goodness-of-fit Tests for the Normal Distribution Project 1 Goodness-of-fit Tests for the Normal Distribution Project 1 Jeremy Morris September 29, 2005 1 Kolmogorov-Smirnov Test The Kolmogorov-Smirnov Test (KS test) is based on the cumulative distribution function

More information

COMPARISON OF GMM WITH SECOND-ORDER LEAST SQUARES ESTIMATION IN NONLINEAR MODELS. Abstract

COMPARISON OF GMM WITH SECOND-ORDER LEAST SQUARES ESTIMATION IN NONLINEAR MODELS. Abstract Far East J. Theo. Stat. 0() (006), 179-196 COMPARISON OF GMM WITH SECOND-ORDER LEAST SQUARES ESTIMATION IN NONLINEAR MODELS Department of Statistics University of Manitoba Winnipeg, Manitoba, Canada R3T

More information

BENFORD S LAW AND WILCOXON TEST

BENFORD S LAW AND WILCOXON TEST Journal of Mathematical Sciences: Advances and Applications Volume 52, 2018, Pages 69-81 Available at http://scientificadvances.co.in DOI: http://dx.doi.org/10.18642/jmsaa_7100121981 BENFORD S LAW AND

More information

Finding Relationships Among Variables

Finding Relationships Among Variables Finding Relationships Among Variables BUS 230: Business and Economic Research and Communication 1 Goals Specific goals: Re-familiarize ourselves with basic statistics ideas: sampling distributions, hypothesis

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

Sources of randomness

Sources of randomness Random Number Generator Chapter 7 In simulations, we generate random values for variables with a specified distribution Ex., model service times using the exponential distribution Generation of random

More information

Best Fit Probability Distributions for Monthly Radiosonde Weather Data

Best Fit Probability Distributions for Monthly Radiosonde Weather Data Best Fit Probability Distributions for Monthly Radiosonde Weather Data Athulya P. S 1 and K. C James 2 1 M.Tech III Semester, 2 Professor Department of statistics Cochin University of Science and Technology

More information

The Goodness-of-fit Test for Gumbel Distribution: A Comparative Study

The Goodness-of-fit Test for Gumbel Distribution: A Comparative Study MATEMATIKA, 2012, Volume 28, Number 1, 35 48 c Department of Mathematics, UTM. The Goodness-of-fit Test for Gumbel Distribution: A Comparative Study 1 Nahdiya Zainal Abidin, 2 Mohd Bakri Adam and 3 Habshah

More information

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE Course Title: Probability and Statistics (MATH 80) Recommended Textbook(s): Number & Type of Questions: Probability and Statistics for Engineers

More information

A MONTE CARLO INVESTIGATION INTO THE PROPERTIES OF A PROPOSED ROBUST ONE-SAMPLE TEST OF LOCATION. Bercedis Peterson

A MONTE CARLO INVESTIGATION INTO THE PROPERTIES OF A PROPOSED ROBUST ONE-SAMPLE TEST OF LOCATION. Bercedis Peterson A MONTE CARLO INVESTIGATION INTO THE PROPERTIES OF A PROPOSED ROBUST ONE-SAMPLE TEST OF LOCATION by Bercedis Peterson Department of Biostatistics University of North Carolina at Chapel Hill Institute of

More information

Chapter Eight: Assessment of Relationships 1/42

Chapter Eight: Assessment of Relationships 1/42 Chapter Eight: Assessment of Relationships 1/42 8.1 Introduction 2/42 Background This chapter deals, primarily, with two topics. The Pearson product-moment correlation coefficient. The chi-square test

More information

Testing Homogeneity Of A Large Data Set By Bootstrapping

Testing Homogeneity Of A Large Data Set By Bootstrapping Testing Homogeneity Of A Large Data Set By Bootstrapping 1 Morimune, K and 2 Hoshino, Y 1 Graduate School of Economics, Kyoto University Yoshida Honcho Sakyo Kyoto 606-8501, Japan. E-Mail: morimune@econ.kyoto-u.ac.jp

More information

Textbook Examples of. SPSS Procedure

Textbook Examples of. SPSS Procedure Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of

More information

Spline Density Estimation and Inference with Model-Based Penalities

Spline Density Estimation and Inference with Model-Based Penalities Spline Density Estimation and Inference with Model-Based Penalities December 7, 016 Abstract In this paper we propose model-based penalties for smoothing spline density estimation and inference. These

More information

Modeling Hydrologic Chanae

Modeling Hydrologic Chanae Modeling Hydrologic Chanae Statistical Methods Richard H. McCuen Department of Civil and Environmental Engineering University of Maryland m LEWIS PUBLISHERS A CRC Press Company Boca Raton London New York

More information

Exact goodness-of-fit tests for censored data

Exact goodness-of-fit tests for censored data Ann Inst Stat Math ) 64:87 3 DOI.7/s463--356-y Exact goodness-of-fit tests for censored data Aurea Grané Received: February / Revised: 5 November / Published online: 7 April The Institute of Statistical

More information

Economic Reliability Test Plans using the Generalized Exponential Distribution

Economic Reliability Test Plans using the Generalized Exponential Distribution ISSN 684-843 Journal of Statistics Volume 4, 27, pp. 53-6 Economic Reliability Test Plans using the Generalized Exponential Distribution Muhammad Aslam and Muhammad Qaisar Shahbaz 2 Abstract Economic Reliability

More information

Chapter 9: Association Between Variables Measured at the Ordinal Level

Chapter 9: Association Between Variables Measured at the Ordinal Level Chapter 9: Association Between Variables Measured at the Ordinal Level After this week s class: SHOULD BE ABLE TO COMPLETE ALL APLIA ASSIGNMENTS DUE END OF THIS WEEK: APLIA ASSIGNMENTS 5-7 DUE: Friday

More information

Statistical Significance of Ranking Paradoxes

Statistical Significance of Ranking Paradoxes Statistical Significance of Ranking Paradoxes Anna E. Bargagliotti and Raymond N. Greenwell 1 February 28, 2009 1 Anna E. Bargagliotti is an Assistant Professor in the Department of Mathematical Sciences

More information

Correlation. Martin Bland. Correlation. Correlation coefficient. Clinical Biostatistics

Correlation. Martin Bland. Correlation. Correlation coefficient. Clinical Biostatistics Clinical Biostatistics Correlation Martin Bland Professor of Health Statistics University of York http://martinbland.co.uk/ Correlation Example: Muscle and height in 42 alcoholics A scatter diagram: How

More information

14.30 Introduction to Statistical Methods in Economics Spring 2009

14.30 Introduction to Statistical Methods in Economics Spring 2009 MIT OpenCourseWare http://ocw.mit.edu 4.0 Introduction to Statistical Methods in Economics Spring 009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Topic 21 Goodness of Fit

Topic 21 Goodness of Fit Topic 21 Goodness of Fit Contingency Tables 1 / 11 Introduction Two-way Table Smoking Habits The Hypothesis The Test Statistic Degrees of Freedom Outline 2 / 11 Introduction Contingency tables, also known

More information

Testing for Normality

Testing for Normality Testing for Normality For each mean and standard deviation combination a theoretical normal distribution can be determined. This distribution is based on the proportions shown below. This theoretical normal

More information

Answer Keys to Homework#10

Answer Keys to Homework#10 Answer Keys to Homework#10 Problem 1 Use either restricted or unrestricted mixed models. Problem 2 (a) First, the respective means for the 8 level combinations are listed in the following table A B C Mean

More information

(a) II and III (b) I (c) I and III (d) I and II and III (e) None are true.

(a) II and III (b) I (c) I and III (d) I and II and III (e) None are true. 1 Which of the following statements is always true? I The null space of an m n matrix is a subspace of R m II If the set B = {v 1,, v n } spans a vector space V and dimv = n, then B is a basis for V III

More information

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Econometrics Working Paper EWP0402 ISSN 1485-6441 Department of Economics TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Lauren Bin Dong & David E. A. Giles Department

More information

Statistics for Managers Using Microsoft Excel

Statistics for Managers Using Microsoft Excel Statistics for Managers Using Microsoft Excel 7 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Statistics for Managers Using Microsoft Excel 7e Copyright 014 Pearson Education, Inc. Chap

More information

Multivariate Time Series: Part 4

Multivariate Time Series: Part 4 Multivariate Time Series: Part 4 Cointegration Gerald P. Dwyer Clemson University March 2016 Outline 1 Multivariate Time Series: Part 4 Cointegration Engle-Granger Test for Cointegration Johansen Test

More information

Assignment 9 Answer Keys

Assignment 9 Answer Keys Assignment 9 Answer Keys Problem 1 (a) First, the respective means for the 8 level combinations are listed in the following table A B C Mean 26.00 + 34.67 + 39.67 + + 49.33 + 42.33 + + 37.67 + + 54.67

More information

Dr. Wolfgang Rolke University of Puerto Rico - Mayaguez

Dr. Wolfgang Rolke University of Puerto Rico - Mayaguez Dr. Wolfgang Rolke University of Puerto Rico - Mayaguez The Archetypical Statistics Problem Example: Is the die fair? Most Famous Answer: Pearson X 2 Pearson s Reasoning Hypothesis Testing Basics Another

More information

Kumaun University Nainital

Kumaun University Nainital Kumaun University Nainital Department of Statistics B. Sc. Semester system course structure: 1. The course work shall be divided into six semesters with three papers in each semester. 2. Each paper in

More information

1.0 I] MICROCOPY RESOLUTION TEST CHART ..: -,.,-..., ,,..,e ' ilhi~am &32. Il W tle p 10 A..

1.0 I] MICROCOPY RESOLUTION TEST CHART ..: -,.,-..., ,,..,e ' ilhi~am &32. Il W tle p 10 A.. AD-R124 835 A NEW GOODNESS OF FIT TEST FOR THE UNIFORM DISTRIBUTION i WITH UNSPECIFIED PRRRMETERS(U) AIR FORCE INST OF TECH WRIGHT-PRTTERSON RFB OH SCHOOL OF ENGI. L B WOODBURY UNLSIIDDEC 82 RFIT/GOR/NA/82D-6

More information

Latin Hypercube Sampling with Multidimensional Uniformity

Latin Hypercube Sampling with Multidimensional Uniformity Latin Hypercube Sampling with Multidimensional Uniformity Jared L. Deutsch and Clayton V. Deutsch Complex geostatistical models can only be realized a limited number of times due to large computational

More information

Systems Simulation Chapter 7: Random-Number Generation

Systems Simulation Chapter 7: Random-Number Generation Systems Simulation Chapter 7: Random-Number Generation Fatih Cavdur fatihcavdur@uludag.edu.tr April 22, 2014 Introduction Introduction Random Numbers (RNs) are a necessary basic ingredient in the simulation

More information

Fall 2012 Analysis of Experimental Measurements B. Eisenstein/rev. S. Errede

Fall 2012 Analysis of Experimental Measurements B. Eisenstein/rev. S. Errede Hypothesis Testing: Suppose we have two or (in general) more simple hypotheses which can describe a set of data Simple means explicitly defined, so if parameters have to be fitted, that has already been

More information

8/29/2015 Connect Math Plus

8/29/2015 Connect Math Plus Page 773 Appendix A Tables Table A Factorials Table B The Binomial Distribution Table C The Poisson Distribution Table D Random Numbers Table E The Standard Normal Distribution Table F The t Distribution

More information

STATISTICS ( CODE NO. 08 ) PAPER I PART - I

STATISTICS ( CODE NO. 08 ) PAPER I PART - I STATISTICS ( CODE NO. 08 ) PAPER I PART - I 1. Descriptive Statistics Types of data - Concepts of a Statistical population and sample from a population ; qualitative and quantitative data ; nominal and

More information

Binary choice 3.3 Maximum likelihood estimation

Binary choice 3.3 Maximum likelihood estimation Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Output of the estimation We explain here the various outputs from the maximum likelihood estimation procedure. Solution of the maximum likelihood

More information

My data doesn t look like that..

My data doesn t look like that.. Testing assumptions My data doesn t look like that.. We have made a big deal about testing model assumptions each week. Bill Pine Testing assumptions Testing assumptions We have made a big deal about testing

More information

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers: Use of Matching Methods for Causal Inference in Experimental and Observational Studies Kosuke Imai Department of Politics Princeton University April 27, 2007 Kosuke Imai (Princeton University) Matching

More information

HOSTOS COMMUNITY COLLEGE DEPARTMENT OF MATHEMATICS

HOSTOS COMMUNITY COLLEGE DEPARTMENT OF MATHEMATICS HOSTOS COMMUNITY COLLEGE DEPARTMENT OF MATHEMATICS MAT 217 Linear Algebra CREDIT HOURS: 4.0 EQUATED HOURS: 4.0 CLASS HOURS: 4.0 PREREQUISITE: PRE/COREQUISITE: MAT 210 Calculus I MAT 220 Calculus II RECOMMENDED

More information

Statistical Hypothesis Testing with SAS and R

Statistical Hypothesis Testing with SAS and R Statistical Hypothesis Testing with SAS and R Statistical Hypothesis Testing with SAS and R Dirk Taeger Institute for Prevention and Occupational Medicine of the German Social Accident Insurance, Institute

More information

Goodness of Fit Test and Test of Independence by Entropy

Goodness of Fit Test and Test of Independence by Entropy Journal of Mathematical Extension Vol. 3, No. 2 (2009), 43-59 Goodness of Fit Test and Test of Independence by Entropy M. Sharifdoost Islamic Azad University Science & Research Branch, Tehran N. Nematollahi

More information

14.2 Do Two Distributions Have the Same Means or Variances?

14.2 Do Two Distributions Have the Same Means or Variances? 14.2 Do Two Distributions Have the Same Means or Variances? 609 If a distribution has a strong central tendency, so that most of its area is under a single peak, then the median is an estimator of the

More information

A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED

A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED by W. Robert Reed Department of Economics and Finance University of Canterbury, New Zealand Email: bob.reed@canterbury.ac.nz

More information

Rank-Based Methods. Lukas Meier

Rank-Based Methods. Lukas Meier Rank-Based Methods Lukas Meier 20.01.2014 Introduction Up to now we basically always used a parametric family, like the normal distribution N (µ, σ 2 ) for modeling random data. Based on observed data

More information

A goodness of fit test for the Pareto distribution

A goodness of fit test for the Pareto distribution Chilean Journal of Statistics Vol. 9, No. 1, April 2018, 33-46 A goodness of fit test for the Pareto distribution Javier Suárez-Espinosa, José A. Villaseñor-Alva, Annel Hurtado-Jaramillo and Paulino Pérez-Rodríguez

More information

unadjusted model for baseline cholesterol 22:31 Monday, April 19,

unadjusted model for baseline cholesterol 22:31 Monday, April 19, unadjusted model for baseline cholesterol 22:31 Monday, April 19, 2004 1 Class Level Information Class Levels Values TRETGRP 3 3 4 5 SEX 2 0 1 Number of observations 916 unadjusted model for baseline cholesterol

More information

Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption

Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption Alisa A. Gorbunova and Boris Yu. Lemeshko Novosibirsk State Technical University Department of Applied Mathematics,

More information

Analysis of Statistical Algorithms. Comparison of Data Distributions in Physics Experiments

Analysis of Statistical Algorithms. Comparison of Data Distributions in Physics Experiments for the Comparison of Data Distributions in Physics Experiments, Andreas Pfeiffer, Maria Grazia Pia 2 and Alberto Ribon CERN, Geneva, Switzerland 2 INFN Genova, Genova, Italy st November 27 GoF-Tests (and

More information

CPSC 531: Random Numbers. Jonathan Hudson Department of Computer Science University of Calgary

CPSC 531: Random Numbers. Jonathan Hudson Department of Computer Science University of Calgary CPSC 531: Random Numbers Jonathan Hudson Department of Computer Science University of Calgary http://www.ucalgary.ca/~hudsonj/531f17 Introduction In simulations, we generate random values for variables

More information

Testing for Normality

Testing for Normality Testing for Normality For each mean and standard deviation combination a theoretical normal distribution can be determined. This distribution is based on the proportions shown below. This theoretical normal

More information

Transmuted Pareto distribution

Transmuted Pareto distribution ProbStat Forum, Volume 7, January 214, Pages 1 11 ISSN 974-3235 ProbStat Forum is an e-journal For details please visit wwwprobstatorgin Transmuted Pareto distribution Faton Merovci a, Llukan Puka b a

More information

R Short Course Session 4

R Short Course Session 4 R Short Course Session 4 Daniel Zhao, PhD Sixia Chen, PhD Department of Biostatistics and Epidemiology College of Public Health, OUHSC 11/13/2015 Outline Random distributions Summary statistics Statistical

More information

Statistical Procedures for Testing Homogeneity of Water Quality Parameters

Statistical Procedures for Testing Homogeneity of Water Quality Parameters Statistical Procedures for ing Homogeneity of Water Quality Parameters Xu-Feng Niu Professor of Statistics Department of Statistics Florida State University Tallahassee, FL 3306 May-September 004 1. Nonparametric

More information

Two-sample scale rank procedures optimal for the generalized secant hyperbolic distribution

Two-sample scale rank procedures optimal for the generalized secant hyperbolic distribution Two-sample scale rank procedures optimal for the generalized secant hyperbolic distribution O.Y. Kravchuk School of Physical Sciences, School of Land and Food Sciences, University of Queensland, Australia

More information

MODIFIED NON-OVERLAPPING TEMPLATE MATCHING TEST AND PROPOSAL ON SETTING TEMPLATE

MODIFIED NON-OVERLAPPING TEMPLATE MATCHING TEST AND PROPOSAL ON SETTING TEMPLATE J. Jpn. Soc. Comp. Statist., 27(2014), 49 60 DOI:10.5183/jjscs.1311001 208 MODIFIED NON-OVERLAPPING TEMPLATE MATCHING TEST AND PROPOSAL ON SETTING TEMPLATE Yuichi Takeda, Mituaki Huzii, Norio Watanabe

More information

Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy

Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy This article was downloaded by: [Ferdowsi University] On: 16 April 212, At: 4:53 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 172954 Registered office: Mortimer

More information

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of

More information

Sequential Testing in Reliability and Validity Studies With Repeated Measurements per Subject

Sequential Testing in Reliability and Validity Studies With Repeated Measurements per Subject http://ijsp.ccsenet.org International Journal of Statistics and Probability Vol. 8, No. 1; 019 Sequential Testing in Reliability and Validity Studies With Repeated Measurements per Subject Steven B. Kim

More information

Journal of Biostatistics and Epidemiology

Journal of Biostatistics and Epidemiology Journal of Biostatistics and Epidemiology Original Article Robust correlation coefficient goodness-of-fit test for the Gumbel distribution Abbas Mahdavi 1* 1 Department of Statistics, School of Mathematical

More information

Quadratic Statistics for the Goodness-of-Fit Test of the Inverse Gaussian Distribution

Quadratic Statistics for the Goodness-of-Fit Test of the Inverse Gaussian Distribution 118 IEEE TRANSACTIONS ON RELIABILITY, VOL. 41, NO. 1, 1992 MARCH Quadratic Statistics for the Goodness-of-Fit Test of the Inverse Gaussian Distribution Robert J. Pavur University of North Texas, Denton

More information

PROBABILITY DISTRIBUTIONS IN LOTTERY GAMES

PROBABILITY DISTRIBUTIONS IN LOTTERY GAMES PROBABILITY DISTRIBUTIONS IN LOTTERY GAMES Jiří Rozkovec Abstract The aim of this paper is to analyse a probability distribution of percentiles of a lottery game, Sportka, and a distribution of drawn numbers

More information

Contents LIST OF TABLES... LIST OF FIGURES... xvii. LIST OF LISTINGS... xxi PREFACE. ...xxiii

Contents LIST OF TABLES... LIST OF FIGURES... xvii. LIST OF LISTINGS... xxi PREFACE. ...xxiii LIST OF TABLES... xv LIST OF FIGURES... xvii LIST OF LISTINGS... xxi PREFACE...xxiii CHAPTER 1. PERFORMANCE EVALUATION... 1 1.1. Performance evaluation... 1 1.2. Performance versus resources provisioning...

More information

Non-parametric Hypothesis Testing

Non-parametric Hypothesis Testing Non-parametric Hypothesis Testing Procedures Hypothesis Testing General Procedure for Hypothesis Tests 1. Identify the parameter of interest.. Formulate the null hypothesis, H 0. 3. Specify an appropriate

More information

Tests for spatial randomness based on spacings

Tests for spatial randomness based on spacings Tests for spatial randomness based on spacings Lionel Cucala and Christine Thomas-Agnan LSP, Université Paul Sabatier and GREMAQ, Université Sciences-Sociales, Toulouse, France E-mail addresses : cucala@cict.fr,

More information

Modeling and Performance Analysis with Discrete-Event Simulation

Modeling and Performance Analysis with Discrete-Event Simulation Simulation Modeling and Performance Analysis with Discrete-Event Simulation Chapter 9 Input Modeling Contents Data Collection Identifying the Distribution with Data Parameter Estimation Goodness-of-Fit

More information

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr.

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr. Simulation Discrete-Event System Simulation Chapter 8 Input Modeling Purpose & Overview Input models provide the driving force for a simulation model. The quality of the output is no better than the quality

More information

QUANTITATIVE TECHNIQUES

QUANTITATIVE TECHNIQUES UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION (For B Com. IV Semester & BBA III Semester) COMPLEMENTARY COURSE QUANTITATIVE TECHNIQUES QUESTION BANK 1. The techniques which provide the decision maker

More information

Bivariate Paired Numerical Data

Bivariate Paired Numerical Data Bivariate Paired Numerical Data Pearson s correlation, Spearman s ρ and Kendall s τ, tests of independence University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html

More information

Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8]

Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8] 1 Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8] Insights: Price movements in one market can spread easily and instantly to another market [economic globalization and internet

More information

Study of a Nuclear Power Plant Containment Damage Caused by Impact of a Plane

Study of a Nuclear Power Plant Containment Damage Caused by Impact of a Plane International Refereed Journal of Engineering and Science (IRJES) ISSN (Online) 2319-183X, (Print) 2319-1821 Volume 1, Issue 4(December 2012), PP.48-53 Study of a Nuclear Power Plant Containment Damage

More information

Part III: Unstructured Data

Part III: Unstructured Data Inf1-DA 2010 2011 III: 51 / 89 Part III Unstructured Data Data Retrieval: III.1 Unstructured data and data retrieval Statistical Analysis of Data: III.2 Data scales and summary statistics III.3 Hypothesis

More information

Testing for Poisson Behavior

Testing for Poisson Behavior Testing for Poisson Behavior Philip B. Stark Department of Statistics, UC Berkeley joint with Brad Luen 17 April 2012 Seismological Society of America Annual Meeting San Diego, CA Quake Physics versus

More information