E MERGING T ECHNOLOGIES

Size: px
Start display at page:

Download "E MERGING T ECHNOLOGIES"

Transcription

1 E MERGING T ECHNOLOGIES A Remedy for the Overzealous Bonferroni Technique for Multiple Statistical Tests Zvi Drezner Steven G. Mihaylo College of Business and Economics, California State University-Fullerton, Fullerton, California USA Taly Dawn Drezner Department of Geography, N430 Ross, Yor University, 4700 Keele Street, Toronto, Ontario M3J 1P3 Canada Abstract. Ecological research often involves multiple statistical tests. It is common practice to employ the Bonferroni technique or its more advanced sequential variant for such multiple tests. Indeed, Moran ( Oios, 100, 2003, 403) found that 13% of ecological papers apply this technique. The seminal paper by Rice ( Evolution, 43, 1989, 223 ) that introduced this technique to the ecological community, is cited to date over times. However, these techniques are conservative and some null hypotheses that should be rejected are not. Using order statistics we find that significant results are correlated even when the data consist of independent events. The Bonferroni methods assume independent significant results which results in Type II error with their application. We propose a simple approach, which we term the correlated Bonferroni technique, to rectify this shortcoming, which reduces rejection of significant results. Ecologists may be able to confirm the significance of their results while they are unable to confirm it using the original Bonferroni technique. Researchers may revisit their projects and find that significant results were mistaenly ignored. We provide an Excel file (see supplement) that researchers can easily use. We illustrate the correlated Bonferroni technique with an example. Key words: Bonferroni ; correlated events ; multiple statistical tests ; statistics Introduction The vast majority of ecological (and biological) papers incorporate multiple statistical tests and/or analyses throughout the process of hypothesis testing. Statistically, this appears to introduce type I error (false positives) as when many tests are run and multiple significant results are obtained, the lielihood Emerging Technologies January

2 Emerging Technologies of obtaining P = 0.05 from among those tests is higher than 5%. Thus, if one runs enough tests, results may yield spurious significant results. The original Bonferroni technique (BT) suggested by Bonferroni ( 1936 ), Miller ( 1981 ), and the sequential Bonferroni technique (SBT) suggested by Holm ( 1979 ), Rice ( 1989 ) who aimed to correct this issue and their use has been generally adopted in many studies, including by one of us (e.g., Drezner 2003, 2008, 2011 ). Recently, ecologists in particular have questioned whether such a correction is valid for continued use in ecological research. A wide variety of issues have been raised, including that the SBT is too conservative and that ecological research often relies on multiple tests (Cabin and Mitchell 2000 ). More analyses lead to a greater number of significant results. However, this in turn drives down the P - value cut- offs using the SBT eventually reversing all or many of the originally significant test results when the SBT is applied. Ultimately this process discourages in depth and continued analysis. Greater analytical depth should be encouraged rather than discouraged (Moran 2003 ). Also, there is no consensus for when and how to apply the SBT. Rules for grouping tests for the application of the technique vary even among journal editors (Cabin and Mitchell 2000, Moran 2003 ). As Moran ( 2003 ) argues, many significant results, even with P - values that are not far below 0.05, may be more telling of ecological conditions than one isolated but very small P - value. He also argues that the number of tests should be considered. For example, if 5 of 10 tests are significant at P = 0.05 but each is not much lower than 0.05, then the BT or SBT will fail to reject all null hypotheses. However, the probability that 5 tests of 10 are significant at the P = 0.05 level is only which means that some of the results must be significant (Moran 2003 ). It is well established (e.g., Cabin and Mitchell 2000, Moran 2003 ) that the BT and SBT are conservative leading to a Type II error when they are applied. It is assumed in the BT and SBT that ordered statistics are independent, while they actually are not. By order statistics theory (Arnold et al ), the largest statistic and the second largest statistic are correlated and this correlation can be calculated. The BT and SBT assume that the correlation coefficient is zero or nearly so. For example, by a simulation based on the methods in Arnold et al. ( 1992 ), for 10 significant results of a normal distribution, the expected correlation between the largest statistic and the second largest is even when the data are randomly generated from independent standardized normal distributions! In this article, we propose the correlated Bonferroni technique (CBT) that accounts for the correlation that exists between ordered data. When it is used sequentially, we term it the sequential correlated Bonferroni technique (SCBT). When a correlation of zero is assumed, the CBT is equivalent to the BT and the SCBT is equivalent to the SBT. The correlated Bonferroni technique The CBT and its variant SCBT are designed to incorporate the correlation between sorted events and thus enable us to reject some null hypotheses that mistaenly are not rejected by applying the BT or SBT. By the BT and SBT, the critical value when events are more significant than α is approximately α /. We derive the critical values for a given correlation coefficient between significant results ( ρ ) and in the Appendix. We provide an easy to use Excel spreadsheet (see supplement) in which the critical P - values are calculated for any value of when the number of significant results and α is entered. 92 Bulletin of the Ecological Society of America, 97(1)

3 Table 1. Results for the number of seeds example. Nine significant test results are sorted in column 2. In column 3 we depict the SBT critical values. The results from the Excel spreadsheet (see supplement) are shown in column 4. Note that the BT and SBT give the same values in column 3. Sorted Critical values P-values SBT SCBT We define θ (ρ, ) as the critical value for the th significant result. If the smallest P-value of significant results is less than θ (ρ, ) (given in Eq. 1 below), the th null hypothesis can be rejected with significance α. θ(ρ,)= α ( (1) + α α ) ρ λ(ρ,) where λ (ρ, ) is given by Eq. 10 in the Appendix. These critical values for a given ρ and are embedded in the Excel file (supplement). When ρ = 0 is used, the critical values by (1) are the same as the BT critical values (approximately α / ). The critical values for rejecting the null hypothesis increase as ρ increases. Let s be the number of significant results. To estimate ρ for a given s, we found the correlation coefficient for 3 s 100 by simulation of randomly generated normal variates. By linear regression we found the relationship (significance ) ρ s s s s 2. (2) This equation is also embedded in the Excel file (supplement). An application example Consider the mean number of seeds for a species in 10 different populations, to determine which populations are significantly different from one another. The data consist of 10 columns with each containing the number of seeds on each plant in that population. This sets up a one- way ANOVA. Emerging Technologies January

4 Emerging Technologies A Tuey post hoc test is performed on the 10 column ANOVA. Suppose that s = 9 pairs (out of 45) were significant with P - values less than A sorted list of such = 1,,9 P - values are given in Table 1. In the third column, we give 0.05/ which are the BT or SBT critical values and in column 4, we give the results from our Excel file (supplement). When applying the SBT, only the smallest two P-values are significant at the α = 0.05 level and the other seven null hypotheses cannot be rejected despite their original P - values being less than However, using our SCBT, we conclude that all nine P-values are significant at the α = 0.05 level and thus all nine null hypotheses should be rejected. Literature cited Abramowitz, M., and I. Stegun Handboo of mathematical functions. Dover Publications Inc., New Yor, NY, USA. Arnold, B. C., N. Balarishnan, and H. N. Nagaraja A first course in order statistics. John Wiley & Sons, New Yor, NY, USA. Bonferroni, C. E Teoria statistica delle classi e calcolo delle probabilita. Libreria internazionale Seeber, Economiche e commerciali di Firenze 8 : Cabin, R. J., and R. J. Mitchell To bonferroni or not to bonferroni: when and how are the questions. Bulletin of the Ecological Society of America, 81 : Drezner, Z Computation of the multivariate normal integral. ACM Transactions on Mathematical Software, 18 : Drezner, T. D A test of the relationship between seasonal rainfall and saguaro cacti branching patterns. Ecography, 26 : Drezner, T. D Variation in age and height of onset of reproduction in the saguaro cactus (carnegiea gigantea) in the sonoran desert. Plant Ecology, 194 : Drezner, T. D Cactus surface temperatures are impacted by seasonality, spines and height on plant. Environmental and Experimental Botany, 74 : Holm, S A simple sequentially rejective multiple test procedure. Scandinavian journal of statistics, 6 (2 ): Johnson, N. L., and S. Kotz Distributions in statistics: continuous multivariate distributions. Wiley, New Yor, NY, USA. Miller, R. G Simultaneous statistical inference. McGraw Hill, New Yor, NY, USA. Moran, M. D Arguments for rejecting the sequential bonferroni in ecological studies. Oios, 100 : Rice, W. R Analyzing tables of statistical tests. Evolution, 43 (1 ): Appendix: Incorporating Association between Probabilities Let the probability that a hypothesis is not true be 1 P j = Pr (Z j h j ) by some distribution. The probability P that at least one hypothesis is true is determined by a multivariate distribution based on a correlation matrix R between hypotheses: P = 1 Pr ( Z 1 h 1,Z 2 h 2,,Z h ) (3) 94 Bulletin of the Ecological Society of America, 97(1)

5 The multivariate normal case It is reasonable to assume that the individual probabilities are generated by a normal distribution. If the probabilities are a result of a normal distribution, i.e. P j = Pr (Z h j ), correlated probabilities are a result of a multivariate normal distribution. For a multivariate normal distribution with a correlation matrix R (Drezner 1992, Johnson and Kotz 1972 ): Pr ( ) 1 Z 1 h 1,Z 2 h 2,,Z h = (2π) R h 1 h e 1 2 zt R 1z dz dz 1 (4) where R is the determinant of R. We derive the expression for the probability P, when the correlation coefficient between all events is the same, by a multivariate normal distribution based on a correlation matrix R with equal off diagonal values. Equal off diagonal correlations For the multivariate normal distribution, when the correlation matrix R has equal off diagonal correlations ρ 0, by Johnson and Kotz ( 1972 :48), the -dimensional integral (4) can be reduced to a one- dimensional integral: Pr ( ) 1 Z 1 h 1,Z 2 h 2,,Z h = 2π Φ j=1 ( hj z ) ρ 1 ρ e z 2 2 dz (5) where Φ( ) is the cumulative standardized normal probability. The integral (5) can be calculated using either Simpson s integration formula or Gaussian quadrature formulas based on Hermite polynomials (Abramowitz and Stegun 1972 ). Let From (6) we get Pr(Z j h j )=Φ(h j )=1 P j. h j =Φ 1 ( 1 P j ) (6) (7) and the probability P that at least one hypothesis is true is: P = 1 1 2π ( Φ 1 1 P j ) z ρ Φ e 1 ρ z 2 2 dz j=1 (8) Emerging Technologies January

6 Emerging Technologies Table 2. The calculated critical values for α = ρ = 0 ρ = 0.1 ρ = 0.2 ρ = 0.3 ρ = 0.4 ρ = 0.5 ρ = 0.6 ρ = 0.7 ρ = 0.8 ρ = Bulletin of the Ecological Society of America, 97(1)

7 Table 2. (Continued) ρ = 0 ρ = 0.1 ρ = 0.2 ρ = 0.3 ρ = 0.4 ρ = 0.5 ρ = 0.6 ρ = 0.7 ρ = 0.8 ρ = Emerging Technologies January

8 Emerging Technologies When P j = P 0 for all j, the probability that at least one of them occurs is by (8) : [ ( P = 1 1 z0 z )] ρ Φ e z 2 2 dz 2π 1 ρ (9) where Φ( ) is the standardized normal distribution and Φ(z0 ) = 1 P. For ρ = 0 P = 1 (1 P 0 0 ) which leads to the original Bonferroni. For ρ = 1 P = P 0 for any. Rather than using P 0 α, we use the value of P0 that yields P = α in (9). For any α, ρ we can define ( 0 θ 1 such that the probability P 0 = θ yields in (9) P = α. For the original Bonferroni ( ρ = 0), θ = 1 (1 α) 1 ) α. For ρ = 1, θ = α. For 0 < ρ < 1, θ should be between these two values. We calculated θ for α = 0.05 and = 2,3,,30, ρ = 0,0.1,0.2,,0.9 listed in Table 2. ρ = 0 is the original Bonferroni. We fit θ by the expression which is exact for ρ = 0 and ρ = 1. θ(ρ,)= 1 ( ) ρ λ(ρ,) The power λ was fit by multiple regression yielding with significance of : λ(ρ,) ( ) ( (1 ρ) ) 1 ρ 2 (10) Supporting Information Additional supporting information may be found in the online version of this article at library.wiley.com/doi/ /bes2.1214/suppinfo Table S1. Calculating the critical values for a given number of significant results. 98 Bulletin of the Ecological Society of America, 97(1)

Online Appendix for Targeting Policies: Multiple Testing and Distributional Treatment Effects

Online Appendix for Targeting Policies: Multiple Testing and Distributional Treatment Effects Online Appendix for Targeting Policies: Multiple Testing and Distributional Treatment Effects Steven F Lehrer Queen s University, NYU Shanghai, and NBER R Vincent Pohl University of Georgia November 2016

More information

Tests about a population mean

Tests about a population mean October 2 nd, 2017 Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 1: Descriptive statistics Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation Chapter 8: Confidence

More information

STAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons:

STAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons: STAT 263/363: Experimental Design Winter 206/7 Lecture January 9 Lecturer: Minyong Lee Scribe: Zachary del Rosario. Design of Experiments Why perform Design of Experiments (DOE)? There are at least two

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Class 6 AMS-UCSC Thu 26, 2012 Winter 2012. Session 1 (Class 6) AMS-132/206 Thu 26, 2012 1 / 15 Topics Topics We will talk about... 1 Hypothesis testing

More information

Econometrics. 4) Statistical inference

Econometrics. 4) Statistical inference 30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution

More information

A Power Analysis of Variable Deletion Within the MEWMA Control Chart Statistic

A Power Analysis of Variable Deletion Within the MEWMA Control Chart Statistic A Power Analysis of Variable Deletion Within the MEWMA Control Chart Statistic Jay R. Schaffer & Shawn VandenHul University of Northern Colorado McKee Greeley, CO 869 jay.schaffer@unco.edu gathen9@hotmail.com

More information

IEOR165 Discussion Week 12

IEOR165 Discussion Week 12 IEOR165 Discussion Week 12 Sheng Liu University of California, Berkeley Apr 15, 2016 Outline 1 Type I errors & Type II errors 2 Multiple Testing 3 ANOVA IEOR165 Discussion Sheng Liu 2 Type I errors & Type

More information

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

y ˆ i = ˆ  T u i ( i th fitted value or i th fit) 1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u

More information

Continuous Univariate Distributions

Continuous Univariate Distributions Continuous Univariate Distributions Volume 1 Second Edition NORMAN L. JOHNSON University of North Carolina Chapel Hill, North Carolina SAMUEL KOTZ University of Maryland College Park, Maryland N. BALAKRISHNAN

More information

Visual interpretation with normal approximation

Visual interpretation with normal approximation Visual interpretation with normal approximation H 0 is true: H 1 is true: p =0.06 25 33 Reject H 0 α =0.05 (Type I error rate) Fail to reject H 0 β =0.6468 (Type II error rate) 30 Accept H 1 Visual interpretation

More information

Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test

Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test la Contents The two sample t-test generalizes into Analysis of Variance. In analysis of variance ANOVA the population consists

More information

Control of Directional Errors in Fixed Sequence Multiple Testing

Control of Directional Errors in Fixed Sequence Multiple Testing Control of Directional Errors in Fixed Sequence Multiple Testing Anjana Grandhi Department of Mathematical Sciences New Jersey Institute of Technology Newark, NJ 07102-1982 Wenge Guo Department of Mathematical

More information

Journal of Forensic & Investigative Accounting Volume 10: Issue 2, Special Issue 2018

Journal of Forensic & Investigative Accounting Volume 10: Issue 2, Special Issue 2018 Detecting Fraud in Non-Benford Transaction Files William R. Fairweather* Introduction An auditor faced with the need to evaluate a number of files for potential fraud might wish to focus his or her efforts,

More information

Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups. Acknowledgements:

Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups. Acknowledgements: Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements:

More information

MULTIVARIATE ANALYSIS OF VARIANCE

MULTIVARIATE ANALYSIS OF VARIANCE MULTIVARIATE ANALYSIS OF VARIANCE RAJENDER PARSAD AND L.M. BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 0 0 lmb@iasri.res.in. Introduction In many agricultural experiments,

More information

Multiple Dependent Hypothesis Tests in Geographically Weighted Regression

Multiple Dependent Hypothesis Tests in Geographically Weighted Regression Multiple Dependent Hypothesis Tests in Geographically Weighted Regression Graeme Byrne 1, Martin Charlton 2, and Stewart Fotheringham 3 1 La Trobe University, Bendigo, Victoria Austrlaia Telephone: +61

More information

A note on tree gatekeeping procedures in clinical trials

A note on tree gatekeeping procedures in clinical trials STATISTICS IN MEDICINE Statist. Med. 2008; 06:1 6 [Version: 2002/09/18 v1.11] A note on tree gatekeeping procedures in clinical trials Alex Dmitrienko 1, Ajit C. Tamhane 2, Lingyun Liu 2, Brian L. Wiens

More information

Logistic Regression Analysis

Logistic Regression Analysis Logistic Regression Analysis Predicting whether an event will or will not occur, as well as identifying the variables useful in making the prediction, is important in most academic disciplines as well

More information

Chapter Seven: Multi-Sample Methods 1/52

Chapter Seven: Multi-Sample Methods 1/52 Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze

More information

Introduction to Business Statistics QM 220 Chapter 12

Introduction to Business Statistics QM 220 Chapter 12 Department of Quantitative Methods & Information Systems Introduction to Business Statistics QM 220 Chapter 12 Dr. Mohammad Zainal 12.1 The F distribution We already covered this topic in Ch. 10 QM-220,

More information

Modified Simes Critical Values Under Positive Dependence

Modified Simes Critical Values Under Positive Dependence Modified Simes Critical Values Under Positive Dependence Gengqian Cai, Sanat K. Sarkar Clinical Pharmacology Statistics & Programming, BDS, GlaxoSmithKline Statistics Department, Temple University, Philadelphia

More information

arxiv: v4 [stat.me] 20 Oct 2017

arxiv: v4 [stat.me] 20 Oct 2017 A simple method for implementing Monte Carlo tests Dong Ding, Axel Gandy, and Georg Hahn Department of Mathematics, Imperial College London arxiv:1611.01675v4 [stat.me] 20 Oct 2017 Abstract We consider

More information

A. Motivation To motivate the analysis of variance framework, we consider the following example.

A. Motivation To motivate the analysis of variance framework, we consider the following example. 9.07 ntroduction to Statistics for Brain and Cognitive Sciences Emery N. Brown Lecture 14: Analysis of Variance. Objectives Understand analysis of variance as a special case of the linear model. Understand

More information

F n and theoretical, F 0 CDF values, for the ordered sample

F n and theoretical, F 0 CDF values, for the ordered sample Material E A S E 7 AMPTIAC Jorge Luis Romeu IIT Research Institute Rome, New York STATISTICAL ANALYSIS OF MATERIAL DATA PART III: ON THE APPLICATION OF STATISTICS TO MATERIALS ANALYSIS Introduction This

More information

Revisiting the returns-volume relationship: Time variation, alternative measures and the nancial crisis

Revisiting the returns-volume relationship: Time variation, alternative measures and the nancial crisis *Manuscript Click here to view linked References Revisiting the returns-volume relationship: Time variation, alternative measures and the nancial crisis Steve Cook ; y and Duncan Watson z September 20,

More information

Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances

Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances Preface Power is the probability that a study will reject the null hypothesis. The estimated probability is a function

More information

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Jeremy S. Conner and Dale E. Seborg Department of Chemical Engineering University of California, Santa Barbara, CA

More information

DIFFERENT APPROACHES TO STATISTICAL INFERENCE: HYPOTHESIS TESTING VERSUS BAYESIAN ANALYSIS

DIFFERENT APPROACHES TO STATISTICAL INFERENCE: HYPOTHESIS TESTING VERSUS BAYESIAN ANALYSIS DIFFERENT APPROACHES TO STATISTICAL INFERENCE: HYPOTHESIS TESTING VERSUS BAYESIAN ANALYSIS THUY ANH NGO 1. Introduction Statistics are easily come across in our daily life. Statements such as the average

More information

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown Nonparametric Statistics Leah Wright, Tyler Ross, Taylor Brown Before we get to nonparametric statistics, what are parametric statistics? These statistics estimate and test population means, while holding

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

High-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018

High-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018 High-Throughput Sequencing Course Multiple Testing Biostatistics and Bioinformatics Summer 2018 Introduction You have previously considered the significance of a single gene Introduction You have previously

More information

SIMULATED POWER OF SOME DISCRETE GOODNESS- OF-FIT TEST STATISTICS FOR TESTING THE NULL HYPOTHESIS OF A ZIG-ZAG DISTRIBUTION

SIMULATED POWER OF SOME DISCRETE GOODNESS- OF-FIT TEST STATISTICS FOR TESTING THE NULL HYPOTHESIS OF A ZIG-ZAG DISTRIBUTION Far East Journal of Theoretical Statistics Volume 28, Number 2, 2009, Pages 57-7 This paper is available online at http://www.pphmj.com 2009 Pushpa Publishing House SIMULATED POWER OF SOME DISCRETE GOODNESS-

More information

Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process

Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process Applied Mathematical Sciences, Vol. 4, 2010, no. 62, 3083-3093 Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process Julia Bondarenko Helmut-Schmidt University Hamburg University

More information

A REVERSE TO THE JEFFREYS LINDLEY PARADOX

A REVERSE TO THE JEFFREYS LINDLEY PARADOX PROBABILITY AND MATHEMATICAL STATISTICS Vol. 38, Fasc. 1 (2018), pp. 243 247 doi:10.19195/0208-4147.38.1.13 A REVERSE TO THE JEFFREYS LINDLEY PARADOX BY WIEBE R. P E S T M A N (LEUVEN), FRANCIS T U E R

More information

A comparison of inverse transform and composition methods of data simulation from the Lindley distribution

A comparison of inverse transform and composition methods of data simulation from the Lindley distribution Communications for Statistical Applications and Methods 2016, Vol. 23, No. 6, 517 529 http://dx.doi.org/10.5351/csam.2016.23.6.517 Print ISSN 2287-7843 / Online ISSN 2383-4757 A comparison of inverse transform

More information

Using Simulation and Symbolic Computing in Linear Programming

Using Simulation and Symbolic Computing in Linear Programming Proceedings of the 6th WSEAS International Conference on Simulation, Modelling and Optimization, Lisbon, Portugal, September 22-24, 2006 571 Using Simulation and Symbolic Computing in Linear Programming

More information

HYPOTHESIS TESTING. Hypothesis Testing

HYPOTHESIS TESTING. Hypothesis Testing MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.

More information

Introduction to Statistical Data Analysis III

Introduction to Statistical Data Analysis III Introduction to Statistical Data Analysis III JULY 2011 Afsaneh Yazdani Preface Major branches of Statistics: - Descriptive Statistics - Inferential Statistics Preface What is Inferential Statistics? The

More information

Applied Regression Modeling

Applied Regression Modeling Applied Regression Modeling A Business Approach Iain Pardoe University of Oregon Charles H. Lundquist College of Business Eugene, Oregon WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION CONTENTS

More information

Week 5 Video 1 Relationship Mining Correlation Mining

Week 5 Video 1 Relationship Mining Correlation Mining Week 5 Video 1 Relationship Mining Correlation Mining Relationship Mining Discover relationships between variables in a data set with many variables Many types of relationship mining Correlation Mining

More information

An Approximate Test for Homogeneity of Correlated Correlation Coefficients

An Approximate Test for Homogeneity of Correlated Correlation Coefficients Quality & Quantity 37: 99 110, 2003. 2003 Kluwer Academic Publishers. Printed in the Netherlands. 99 Research Note An Approximate Test for Homogeneity of Correlated Correlation Coefficients TRIVELLORE

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Analysis of Variance (ANOVA)

Analysis of Variance (ANOVA) Analysis of Variance (ANOVA) Two types of ANOVA tests: Independent measures and Repeated measures Comparing 2 means: X 1 = 20 t - test X 2 = 30 How can we Compare 3 means?: X 1 = 20 X 2 = 30 X 3 = 35 ANOVA

More information

Multivariate Regression Generalized Likelihood Ratio Tests for FMRI Activation

Multivariate Regression Generalized Likelihood Ratio Tests for FMRI Activation Multivariate Regression Generalized Likelihood Ratio Tests for FMRI Activation Daniel B Rowe Division of Biostatistics Medical College of Wisconsin Technical Report 40 November 00 Division of Biostatistics

More information

Oikos. Appendix 1 and 2. o20751

Oikos. Appendix 1 and 2. o20751 Oikos o20751 Rosindell, J. and Cornell, S. J. 2013. Universal scaling of species-abundance distributions across multiple scales. Oikos 122: 1101 1111. Appendix 1 and 2 Universal scaling of species-abundance

More information

Chapter 13 Section D. F versus Q: Different Approaches to Controlling Type I Errors with Multiple Comparisons

Chapter 13 Section D. F versus Q: Different Approaches to Controlling Type I Errors with Multiple Comparisons Explaining Psychological Statistics (2 nd Ed.) by Barry H. Cohen Chapter 13 Section D F versus Q: Different Approaches to Controlling Type I Errors with Multiple Comparisons In section B of this chapter,

More information

MULTISTAGE AND MIXTURE PARALLEL GATEKEEPING PROCEDURES IN CLINICAL TRIALS

MULTISTAGE AND MIXTURE PARALLEL GATEKEEPING PROCEDURES IN CLINICAL TRIALS Journal of Biopharmaceutical Statistics, 21: 726 747, 2011 Copyright Taylor & Francis Group, LLC ISSN: 1054-3406 print/1520-5711 online DOI: 10.1080/10543406.2011.551333 MULTISTAGE AND MIXTURE PARALLEL

More information

Hypothesis T e T sting w ith with O ne O One-Way - ANOV ANO A V Statistics Arlo Clark Foos -

Hypothesis T e T sting w ith with O ne O One-Way - ANOV ANO A V Statistics Arlo Clark Foos - Hypothesis Testing with One-Way ANOVA Statistics Arlo Clark-Foos Conceptual Refresher 1. Standardized z distribution of scores and of means can be represented as percentile rankings. 2. t distribution

More information

Types of Statistical Tests DR. MIKE MARRAPODI

Types of Statistical Tests DR. MIKE MARRAPODI Types of Statistical Tests DR. MIKE MARRAPODI Tests t tests ANOVA Correlation Regression Multivariate Techniques Non-parametric t tests One sample t test Independent t test Paired sample t test One sample

More information

Mathematical statistics

Mathematical statistics October 20 th, 2018 Lecture 17: Tests of Hypotheses Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 14 Probability reviews Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation

More information

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES BIOL 458 - Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES PART 1: INTRODUCTION TO ANOVA Purpose of ANOVA Analysis of Variance (ANOVA) is an extremely useful statistical method

More information

Difference in two or more average scores in different groups

Difference in two or more average scores in different groups ANOVAs Analysis of Variance (ANOVA) Difference in two or more average scores in different groups Each participant tested once Same outcome tested in each group Simplest is one-way ANOVA (one variable as

More information

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Correlation. A statistics method to measure the relationship between two variables. Three characteristics Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction

More information

Hotelling s One- Sample T2

Hotelling s One- Sample T2 Chapter 405 Hotelling s One- Sample T2 Introduction The one-sample Hotelling s T2 is the multivariate extension of the common one-sample or paired Student s t-test. In a one-sample t-test, the mean response

More information

Tutorial 2: Power and Sample Size for the Paired Sample t-test

Tutorial 2: Power and Sample Size for the Paired Sample t-test Tutorial 2: Power and Sample Size for the Paired Sample t-test Preface Power is the probability that a study will reject the null hypothesis. The estimated probability is a function of sample size, variability,

More information

This document contains 3 sets of practice problems.

This document contains 3 sets of practice problems. P RACTICE PROBLEMS This document contains 3 sets of practice problems. Correlation: 3 problems Regression: 4 problems ANOVA: 8 problems You should print a copy of these practice problems and bring them

More information

Statistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers

Statistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers Statistical Inference Greg C Elvers 1 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population and not just the sample that we used But our sample

More information

INTRODUCTION TO INTERSECTION-UNION TESTS

INTRODUCTION TO INTERSECTION-UNION TESTS INTRODUCTION TO INTERSECTION-UNION TESTS Jimmy A. Doi, Cal Poly State University San Luis Obispo Department of Statistics (jdoi@calpoly.edu Key Words: Intersection-Union Tests; Multiple Comparisons; Acceptance

More information

Inverse-variance Weighted Average

Inverse-variance Weighted Average Fixed Effects Meta-Analysisand Homogeneity Evaluation Jeff Valentine University of Louisville Campbell Collaboration Annual Colloquium Oslo 2009 Inverse-variance Weighted Average All effect sizes are not

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

Independent Samples ANOVA

Independent Samples ANOVA Independent Samples ANOVA In this example students were randomly assigned to one of three mnemonics (techniques for improving memory) rehearsal (the control group; simply repeat the words), visual imagery

More information

Study Ch. 9.3, #47 53 (45 51), 55 61, (55 59)

Study Ch. 9.3, #47 53 (45 51), 55 61, (55 59) GOALS: 1. Understand that 2 approaches of hypothesis testing exist: classical or critical value, and p value. We will use the p value approach. 2. Understand the critical value for the classical approach

More information

A Note on Hypothesis Testing with Random Sample Sizes and its Relationship to Bayes Factors

A Note on Hypothesis Testing with Random Sample Sizes and its Relationship to Bayes Factors Journal of Data Science 6(008), 75-87 A Note on Hypothesis Testing with Random Sample Sizes and its Relationship to Bayes Factors Scott Berry 1 and Kert Viele 1 Berry Consultants and University of Kentucky

More information

Continuous Univariate Distributions

Continuous Univariate Distributions Continuous Univariate Distributions Volume 2 Second Edition NORMAN L. JOHNSON University of North Carolina Chapel Hill, North Carolina SAMUEL KOTZ University of Maryland College Park, Maryland N. BALAKRISHNAN

More information

A discussion on multiple regression models

A discussion on multiple regression models A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value

More information

Approximate Test for Comparing Parameters of Several Inverse Hypergeometric Distributions

Approximate Test for Comparing Parameters of Several Inverse Hypergeometric Distributions Approximate Test for Comparing Parameters of Several Inverse Hypergeometric Distributions Lei Zhang 1, Hongmei Han 2, Dachuan Zhang 3, and William D. Johnson 2 1. Mississippi State Department of Health,

More information

Closure properties of classes of multiple testing procedures

Closure properties of classes of multiple testing procedures AStA Adv Stat Anal (2018) 102:167 178 https://doi.org/10.1007/s10182-017-0297-0 ORIGINAL PAPER Closure properties of classes of multiple testing procedures Georg Hahn 1 Received: 28 June 2016 / Accepted:

More information

Testing for Unit Roots with Cointegrated Data

Testing for Unit Roots with Cointegrated Data Discussion Paper No. 2015-57 August 19, 2015 http://www.economics-ejournal.org/economics/discussionpapers/2015-57 Testing for Unit Roots with Cointegrated Data W. Robert Reed Abstract This paper demonstrates

More information

Statistical methods and data analysis

Statistical methods and data analysis Statistical methods and data analysis Teacher Stefano Siboni Aim The aim of the course is to illustrate the basic mathematical tools for the analysis and modelling of experimental data, particularly concerning

More information

Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances. Acknowledgements:

Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances. Acknowledgements: Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported in

More information

Non-specific filtering and control of false positives

Non-specific filtering and control of false positives Non-specific filtering and control of false positives Richard Bourgon 16 June 2009 bourgon@ebi.ac.uk EBI is an outstation of the European Molecular Biology Laboratory Outline Multiple testing I: overview

More information

Lecture 5: Unit Roots, Cointegration and Error Correction Models The Spurious Regression Problem

Lecture 5: Unit Roots, Cointegration and Error Correction Models The Spurious Regression Problem Lecture 5: Unit Roots, Cointegration and Error Correction Models The Spurious Regression Problem Prof. Massimo Guidolin 20192 Financial Econometrics Winter/Spring 2018 Overview Defining cointegration Vector

More information

Limited Dependent Variables and Panel Data

Limited Dependent Variables and Panel Data and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement

More information

Distribution Theory. Comparison Between Two Quantiles: The Normal and Exponential Cases

Distribution Theory. Comparison Between Two Quantiles: The Normal and Exponential Cases Communications in Statistics Simulation and Computation, 34: 43 5, 005 Copyright Taylor & Francis, Inc. ISSN: 0361-0918 print/153-4141 online DOI: 10.1081/SAC-00055639 Distribution Theory Comparison Between

More information

Model Analysis for Partitioning of Dry Matter and Plant Nitrogen for Stem and Leaf in Alfalfa

Model Analysis for Partitioning of Dry Matter and Plant Nitrogen for Stem and Leaf in Alfalfa Communications in Soil Science and Plant Analysis, 36: 1163 1175, 2005 Copyright # Taylor & Francis, Inc. ISSN 0010-3624 print/1532-2416 online DOI: 10.1081/CSS-200056889 Model Analysis for Partitioning

More information

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval

Epidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval Epidemiology 9509 Principles of Biostatistics Chapter 10 - Inferences about John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. differences in

More information

Causal Model Selection Hypothesis Tests in Systems Genetics

Causal Model Selection Hypothesis Tests in Systems Genetics 1 Causal Model Selection Hypothesis Tests in Systems Genetics Elias Chaibub Neto and Brian S Yandell SISG 2012 July 13, 2012 2 Correlation and Causation The old view of cause and effect... could only fail;

More information

COMPARING SEVERAL MEANS: ANOVA

COMPARING SEVERAL MEANS: ANOVA LAST UPDATED: November 15, 2012 COMPARING SEVERAL MEANS: ANOVA Objectives 2 Basic principles of ANOVA Equations underlying one-way ANOVA Doing a one-way ANOVA in R Following up an ANOVA: Planned contrasts/comparisons

More information

A Test of Homogeneity Against Umbrella Scale Alternative Based on Gini s Mean Difference

A Test of Homogeneity Against Umbrella Scale Alternative Based on Gini s Mean Difference J. Stat. Appl. Pro. 2, No. 2, 145-154 (2013) 145 Journal of Statistics Applications & Probability An International Journal http://dx.doi.org/10.12785/jsap/020207 A Test of Homogeneity Against Umbrella

More information

Example. χ 2 = Continued on the next page. All cells

Example. χ 2 = Continued on the next page. All cells Section 11.1 Chi Square Statistic k Categories 1 st 2 nd 3 rd k th Total Observed Frequencies O 1 O 2 O 3 O k n Expected Frequencies E 1 E 2 E 3 E k n O 1 + O 2 + O 3 + + O k = n E 1 + E 2 + E 3 + + E

More information

Contrasts and Multiple Comparisons Supplement for Pages

Contrasts and Multiple Comparisons Supplement for Pages Contrasts and Multiple Comparisons Supplement for Pages 302-323 Brian Habing University of South Carolina Last Updated: July 20, 2001 The F-test from the ANOVA table allows us to test the null hypothesis

More information

Neuendorf MANOVA /MANCOVA. Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y4. Like ANOVA/ANCOVA:

Neuendorf MANOVA /MANCOVA. Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y4. Like ANOVA/ANCOVA: 1 Neuendorf MANOVA /MANCOVA Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y1 Y2 Y3 Y4 Like ANOVA/ANCOVA: 1. Assumes equal variance (equal covariance matrices) across cells (groups defined by

More information

Research Article The Laplace Likelihood Ratio Test for Heteroscedasticity

Research Article The Laplace Likelihood Ratio Test for Heteroscedasticity International Mathematics and Mathematical Sciences Volume 2011, Article ID 249564, 7 pages doi:10.1155/2011/249564 Research Article The Laplace Likelihood Ratio Test for Heteroscedasticity J. Martin van

More information

Fundamentals of Probability Theory and Mathematical Statistics

Fundamentals of Probability Theory and Mathematical Statistics Fundamentals of Probability Theory and Mathematical Statistics Gerry Del Fiacco Math Center Metropolitan State University St. Paul, Minnesota June 6, 2016 1 Preface This collection of material was researched,

More information

NONPARAMETRIC TESTS. LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-12

NONPARAMETRIC TESTS. LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-12 NONPARAMETRIC TESTS LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-1 lmb@iasri.res.in 1. Introduction Testing (usually called hypothesis testing ) play a major

More information

Electronic Research Archive of Blekinge Institute of Technology

Electronic Research Archive of Blekinge Institute of Technology Electronic Research Archive of Blekinge Institute of Technology http://www.bth.se/fou/ This is an author produced version of a ournal paper. The paper has been peer-reviewed but may not include the final

More information

Mathematical statistics

Mathematical statistics November 1 st, 2018 Lecture 18: Tests about a population mean Overview 9.1 Hypotheses and test procedures test procedures errors in hypothesis testing significance level 9.2 Tests about a population mean

More information

Estimation of Stress-Strength Reliability for Kumaraswamy Exponential Distribution Based on Upper Record Values

Estimation of Stress-Strength Reliability for Kumaraswamy Exponential Distribution Based on Upper Record Values International Journal of Contemporary Mathematical Sciences Vol. 12, 2017, no. 2, 59-71 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ijcms.2017.7210 Estimation of Stress-Strength Reliability for

More information

Multiple testing: Intro & FWER 1

Multiple testing: Intro & FWER 1 Multiple testing: Intro & FWER 1 Mark van de Wiel mark.vdwiel@vumc.nl Dep of Epidemiology & Biostatistics,VUmc, Amsterdam Dep of Mathematics, VU 1 Some slides courtesy of Jelle Goeman 1 Practical notes

More information

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Example 10-2: Absences/Final Grades Please enter the data below in L1 and L2. The data appears on page 537 of your textbook.

More information

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving

More information

Testing for a break in persistence under long-range dependencies and mean shifts

Testing for a break in persistence under long-range dependencies and mean shifts Testing for a break in persistence under long-range dependencies and mean shifts Philipp Sibbertsen and Juliane Willert Institute of Statistics, Faculty of Economics and Management Leibniz Universität

More information

FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING

FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING Vishwanath Mantha Department for Electrical and Computer Engineering Mississippi State University, Mississippi State, MS 39762 mantha@isip.msstate.edu ABSTRACT

More information

STAT 328 (Statistical Packages)

STAT 328 (Statistical Packages) Department of Statistics and Operations Research College of Science King Saud University Exercises STAT 328 (Statistical Packages) nashmiah r.alshammari ^-^ Excel and Minitab - 1 - Write the commands of

More information

Data envelopment analysis

Data envelopment analysis 15 Data envelopment analysis The purpose of data envelopment analysis (DEA) is to compare the operating performance of a set of units such as companies, university departments, hospitals, bank branch offices,

More information

Optimal SPRT and CUSUM Procedures using Compressed Limit Gauges

Optimal SPRT and CUSUM Procedures using Compressed Limit Gauges Optimal SPRT and CUSUM Procedures using Compressed Limit Gauges P. Lee Geyer Stefan H. Steiner 1 Faculty of Business McMaster University Hamilton, Ontario L8S 4M4 Canada Dept. of Statistics and Actuarial

More information

Analysis of Variance. ภาว น ศ ร ประภาน ก ล คณะเศรษฐศาสตร มหาว ทยาล ยธรรมศาสตร

Analysis of Variance. ภาว น ศ ร ประภาน ก ล คณะเศรษฐศาสตร มหาว ทยาล ยธรรมศาสตร Analysis of Variance ภาว น ศ ร ประภาน ก ล คณะเศรษฐศาสตร มหาว ทยาล ยธรรมศาสตร pawin@econ.tu.ac.th Outline Introduction One Factor Analysis of Variance Two Factor Analysis of Variance ANCOVA MANOVA Introduction

More information

A multiple testing procedure for input variable selection in neural networks

A multiple testing procedure for input variable selection in neural networks A multiple testing procedure for input variable selection in neural networks MicheleLaRoccaandCiraPerna Department of Economics and Statistics - University of Salerno Via Ponte Don Melillo, 84084, Fisciano

More information