E MERGING T ECHNOLOGIES
|
|
- Steven Tyler
- 5 years ago
- Views:
Transcription
1 E MERGING T ECHNOLOGIES A Remedy for the Overzealous Bonferroni Technique for Multiple Statistical Tests Zvi Drezner Steven G. Mihaylo College of Business and Economics, California State University-Fullerton, Fullerton, California USA Taly Dawn Drezner Department of Geography, N430 Ross, Yor University, 4700 Keele Street, Toronto, Ontario M3J 1P3 Canada Abstract. Ecological research often involves multiple statistical tests. It is common practice to employ the Bonferroni technique or its more advanced sequential variant for such multiple tests. Indeed, Moran ( Oios, 100, 2003, 403) found that 13% of ecological papers apply this technique. The seminal paper by Rice ( Evolution, 43, 1989, 223 ) that introduced this technique to the ecological community, is cited to date over times. However, these techniques are conservative and some null hypotheses that should be rejected are not. Using order statistics we find that significant results are correlated even when the data consist of independent events. The Bonferroni methods assume independent significant results which results in Type II error with their application. We propose a simple approach, which we term the correlated Bonferroni technique, to rectify this shortcoming, which reduces rejection of significant results. Ecologists may be able to confirm the significance of their results while they are unable to confirm it using the original Bonferroni technique. Researchers may revisit their projects and find that significant results were mistaenly ignored. We provide an Excel file (see supplement) that researchers can easily use. We illustrate the correlated Bonferroni technique with an example. Key words: Bonferroni ; correlated events ; multiple statistical tests ; statistics Introduction The vast majority of ecological (and biological) papers incorporate multiple statistical tests and/or analyses throughout the process of hypothesis testing. Statistically, this appears to introduce type I error (false positives) as when many tests are run and multiple significant results are obtained, the lielihood Emerging Technologies January
2 Emerging Technologies of obtaining P = 0.05 from among those tests is higher than 5%. Thus, if one runs enough tests, results may yield spurious significant results. The original Bonferroni technique (BT) suggested by Bonferroni ( 1936 ), Miller ( 1981 ), and the sequential Bonferroni technique (SBT) suggested by Holm ( 1979 ), Rice ( 1989 ) who aimed to correct this issue and their use has been generally adopted in many studies, including by one of us (e.g., Drezner 2003, 2008, 2011 ). Recently, ecologists in particular have questioned whether such a correction is valid for continued use in ecological research. A wide variety of issues have been raised, including that the SBT is too conservative and that ecological research often relies on multiple tests (Cabin and Mitchell 2000 ). More analyses lead to a greater number of significant results. However, this in turn drives down the P - value cut- offs using the SBT eventually reversing all or many of the originally significant test results when the SBT is applied. Ultimately this process discourages in depth and continued analysis. Greater analytical depth should be encouraged rather than discouraged (Moran 2003 ). Also, there is no consensus for when and how to apply the SBT. Rules for grouping tests for the application of the technique vary even among journal editors (Cabin and Mitchell 2000, Moran 2003 ). As Moran ( 2003 ) argues, many significant results, even with P - values that are not far below 0.05, may be more telling of ecological conditions than one isolated but very small P - value. He also argues that the number of tests should be considered. For example, if 5 of 10 tests are significant at P = 0.05 but each is not much lower than 0.05, then the BT or SBT will fail to reject all null hypotheses. However, the probability that 5 tests of 10 are significant at the P = 0.05 level is only which means that some of the results must be significant (Moran 2003 ). It is well established (e.g., Cabin and Mitchell 2000, Moran 2003 ) that the BT and SBT are conservative leading to a Type II error when they are applied. It is assumed in the BT and SBT that ordered statistics are independent, while they actually are not. By order statistics theory (Arnold et al ), the largest statistic and the second largest statistic are correlated and this correlation can be calculated. The BT and SBT assume that the correlation coefficient is zero or nearly so. For example, by a simulation based on the methods in Arnold et al. ( 1992 ), for 10 significant results of a normal distribution, the expected correlation between the largest statistic and the second largest is even when the data are randomly generated from independent standardized normal distributions! In this article, we propose the correlated Bonferroni technique (CBT) that accounts for the correlation that exists between ordered data. When it is used sequentially, we term it the sequential correlated Bonferroni technique (SCBT). When a correlation of zero is assumed, the CBT is equivalent to the BT and the SCBT is equivalent to the SBT. The correlated Bonferroni technique The CBT and its variant SCBT are designed to incorporate the correlation between sorted events and thus enable us to reject some null hypotheses that mistaenly are not rejected by applying the BT or SBT. By the BT and SBT, the critical value when events are more significant than α is approximately α /. We derive the critical values for a given correlation coefficient between significant results ( ρ ) and in the Appendix. We provide an easy to use Excel spreadsheet (see supplement) in which the critical P - values are calculated for any value of when the number of significant results and α is entered. 92 Bulletin of the Ecological Society of America, 97(1)
3 Table 1. Results for the number of seeds example. Nine significant test results are sorted in column 2. In column 3 we depict the SBT critical values. The results from the Excel spreadsheet (see supplement) are shown in column 4. Note that the BT and SBT give the same values in column 3. Sorted Critical values P-values SBT SCBT We define θ (ρ, ) as the critical value for the th significant result. If the smallest P-value of significant results is less than θ (ρ, ) (given in Eq. 1 below), the th null hypothesis can be rejected with significance α. θ(ρ,)= α ( (1) + α α ) ρ λ(ρ,) where λ (ρ, ) is given by Eq. 10 in the Appendix. These critical values for a given ρ and are embedded in the Excel file (supplement). When ρ = 0 is used, the critical values by (1) are the same as the BT critical values (approximately α / ). The critical values for rejecting the null hypothesis increase as ρ increases. Let s be the number of significant results. To estimate ρ for a given s, we found the correlation coefficient for 3 s 100 by simulation of randomly generated normal variates. By linear regression we found the relationship (significance ) ρ s s s s 2. (2) This equation is also embedded in the Excel file (supplement). An application example Consider the mean number of seeds for a species in 10 different populations, to determine which populations are significantly different from one another. The data consist of 10 columns with each containing the number of seeds on each plant in that population. This sets up a one- way ANOVA. Emerging Technologies January
4 Emerging Technologies A Tuey post hoc test is performed on the 10 column ANOVA. Suppose that s = 9 pairs (out of 45) were significant with P - values less than A sorted list of such = 1,,9 P - values are given in Table 1. In the third column, we give 0.05/ which are the BT or SBT critical values and in column 4, we give the results from our Excel file (supplement). When applying the SBT, only the smallest two P-values are significant at the α = 0.05 level and the other seven null hypotheses cannot be rejected despite their original P - values being less than However, using our SCBT, we conclude that all nine P-values are significant at the α = 0.05 level and thus all nine null hypotheses should be rejected. Literature cited Abramowitz, M., and I. Stegun Handboo of mathematical functions. Dover Publications Inc., New Yor, NY, USA. Arnold, B. C., N. Balarishnan, and H. N. Nagaraja A first course in order statistics. John Wiley & Sons, New Yor, NY, USA. Bonferroni, C. E Teoria statistica delle classi e calcolo delle probabilita. Libreria internazionale Seeber, Economiche e commerciali di Firenze 8 : Cabin, R. J., and R. J. Mitchell To bonferroni or not to bonferroni: when and how are the questions. Bulletin of the Ecological Society of America, 81 : Drezner, Z Computation of the multivariate normal integral. ACM Transactions on Mathematical Software, 18 : Drezner, T. D A test of the relationship between seasonal rainfall and saguaro cacti branching patterns. Ecography, 26 : Drezner, T. D Variation in age and height of onset of reproduction in the saguaro cactus (carnegiea gigantea) in the sonoran desert. Plant Ecology, 194 : Drezner, T. D Cactus surface temperatures are impacted by seasonality, spines and height on plant. Environmental and Experimental Botany, 74 : Holm, S A simple sequentially rejective multiple test procedure. Scandinavian journal of statistics, 6 (2 ): Johnson, N. L., and S. Kotz Distributions in statistics: continuous multivariate distributions. Wiley, New Yor, NY, USA. Miller, R. G Simultaneous statistical inference. McGraw Hill, New Yor, NY, USA. Moran, M. D Arguments for rejecting the sequential bonferroni in ecological studies. Oios, 100 : Rice, W. R Analyzing tables of statistical tests. Evolution, 43 (1 ): Appendix: Incorporating Association between Probabilities Let the probability that a hypothesis is not true be 1 P j = Pr (Z j h j ) by some distribution. The probability P that at least one hypothesis is true is determined by a multivariate distribution based on a correlation matrix R between hypotheses: P = 1 Pr ( Z 1 h 1,Z 2 h 2,,Z h ) (3) 94 Bulletin of the Ecological Society of America, 97(1)
5 The multivariate normal case It is reasonable to assume that the individual probabilities are generated by a normal distribution. If the probabilities are a result of a normal distribution, i.e. P j = Pr (Z h j ), correlated probabilities are a result of a multivariate normal distribution. For a multivariate normal distribution with a correlation matrix R (Drezner 1992, Johnson and Kotz 1972 ): Pr ( ) 1 Z 1 h 1,Z 2 h 2,,Z h = (2π) R h 1 h e 1 2 zt R 1z dz dz 1 (4) where R is the determinant of R. We derive the expression for the probability P, when the correlation coefficient between all events is the same, by a multivariate normal distribution based on a correlation matrix R with equal off diagonal values. Equal off diagonal correlations For the multivariate normal distribution, when the correlation matrix R has equal off diagonal correlations ρ 0, by Johnson and Kotz ( 1972 :48), the -dimensional integral (4) can be reduced to a one- dimensional integral: Pr ( ) 1 Z 1 h 1,Z 2 h 2,,Z h = 2π Φ j=1 ( hj z ) ρ 1 ρ e z 2 2 dz (5) where Φ( ) is the cumulative standardized normal probability. The integral (5) can be calculated using either Simpson s integration formula or Gaussian quadrature formulas based on Hermite polynomials (Abramowitz and Stegun 1972 ). Let From (6) we get Pr(Z j h j )=Φ(h j )=1 P j. h j =Φ 1 ( 1 P j ) (6) (7) and the probability P that at least one hypothesis is true is: P = 1 1 2π ( Φ 1 1 P j ) z ρ Φ e 1 ρ z 2 2 dz j=1 (8) Emerging Technologies January
6 Emerging Technologies Table 2. The calculated critical values for α = ρ = 0 ρ = 0.1 ρ = 0.2 ρ = 0.3 ρ = 0.4 ρ = 0.5 ρ = 0.6 ρ = 0.7 ρ = 0.8 ρ = Bulletin of the Ecological Society of America, 97(1)
7 Table 2. (Continued) ρ = 0 ρ = 0.1 ρ = 0.2 ρ = 0.3 ρ = 0.4 ρ = 0.5 ρ = 0.6 ρ = 0.7 ρ = 0.8 ρ = Emerging Technologies January
8 Emerging Technologies When P j = P 0 for all j, the probability that at least one of them occurs is by (8) : [ ( P = 1 1 z0 z )] ρ Φ e z 2 2 dz 2π 1 ρ (9) where Φ( ) is the standardized normal distribution and Φ(z0 ) = 1 P. For ρ = 0 P = 1 (1 P 0 0 ) which leads to the original Bonferroni. For ρ = 1 P = P 0 for any. Rather than using P 0 α, we use the value of P0 that yields P = α in (9). For any α, ρ we can define ( 0 θ 1 such that the probability P 0 = θ yields in (9) P = α. For the original Bonferroni ( ρ = 0), θ = 1 (1 α) 1 ) α. For ρ = 1, θ = α. For 0 < ρ < 1, θ should be between these two values. We calculated θ for α = 0.05 and = 2,3,,30, ρ = 0,0.1,0.2,,0.9 listed in Table 2. ρ = 0 is the original Bonferroni. We fit θ by the expression which is exact for ρ = 0 and ρ = 1. θ(ρ,)= 1 ( ) ρ λ(ρ,) The power λ was fit by multiple regression yielding with significance of : λ(ρ,) ( ) ( (1 ρ) ) 1 ρ 2 (10) Supporting Information Additional supporting information may be found in the online version of this article at library.wiley.com/doi/ /bes2.1214/suppinfo Table S1. Calculating the critical values for a given number of significant results. 98 Bulletin of the Ecological Society of America, 97(1)
Online Appendix for Targeting Policies: Multiple Testing and Distributional Treatment Effects
Online Appendix for Targeting Policies: Multiple Testing and Distributional Treatment Effects Steven F Lehrer Queen s University, NYU Shanghai, and NBER R Vincent Pohl University of Georgia November 2016
More informationTests about a population mean
October 2 nd, 2017 Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 1: Descriptive statistics Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation Chapter 8: Confidence
More informationSTAT 263/363: Experimental Design Winter 2016/17. Lecture 1 January 9. Why perform Design of Experiments (DOE)? There are at least two reasons:
STAT 263/363: Experimental Design Winter 206/7 Lecture January 9 Lecturer: Minyong Lee Scribe: Zachary del Rosario. Design of Experiments Why perform Design of Experiments (DOE)? There are at least two
More informationINTERVAL ESTIMATION AND HYPOTHESES TESTING
INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,
More informationStatistical Inference
Statistical Inference Classical and Bayesian Methods Class 6 AMS-UCSC Thu 26, 2012 Winter 2012. Session 1 (Class 6) AMS-132/206 Thu 26, 2012 1 / 15 Topics Topics We will talk about... 1 Hypothesis testing
More informationEconometrics. 4) Statistical inference
30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution
More informationA Power Analysis of Variable Deletion Within the MEWMA Control Chart Statistic
A Power Analysis of Variable Deletion Within the MEWMA Control Chart Statistic Jay R. Schaffer & Shawn VandenHul University of Northern Colorado McKee Greeley, CO 869 jay.schaffer@unco.edu gathen9@hotmail.com
More informationIEOR165 Discussion Week 12
IEOR165 Discussion Week 12 Sheng Liu University of California, Berkeley Apr 15, 2016 Outline 1 Type I errors & Type II errors 2 Multiple Testing 3 ANOVA IEOR165 Discussion Sheng Liu 2 Type I errors & Type
More informationy ˆ i = ˆ " T u i ( i th fitted value or i th fit)
1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u
More informationContinuous Univariate Distributions
Continuous Univariate Distributions Volume 1 Second Edition NORMAN L. JOHNSON University of North Carolina Chapel Hill, North Carolina SAMUEL KOTZ University of Maryland College Park, Maryland N. BALAKRISHNAN
More informationVisual interpretation with normal approximation
Visual interpretation with normal approximation H 0 is true: H 1 is true: p =0.06 25 33 Reject H 0 α =0.05 (Type I error rate) Fail to reject H 0 β =0.6468 (Type II error rate) 30 Accept H 1 Visual interpretation
More informationIntroduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test
Introduction to Statistical Inference Lecture 10: ANOVA, Kruskal-Wallis Test la Contents The two sample t-test generalizes into Analysis of Variance. In analysis of variance ANOVA the population consists
More informationControl of Directional Errors in Fixed Sequence Multiple Testing
Control of Directional Errors in Fixed Sequence Multiple Testing Anjana Grandhi Department of Mathematical Sciences New Jersey Institute of Technology Newark, NJ 07102-1982 Wenge Guo Department of Mathematical
More informationJournal of Forensic & Investigative Accounting Volume 10: Issue 2, Special Issue 2018
Detecting Fraud in Non-Benford Transaction Files William R. Fairweather* Introduction An auditor faced with the need to evaluate a number of files for potential fraud might wish to focus his or her efforts,
More informationTutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups. Acknowledgements:
Tutorial 5: Power and Sample Size for One-way Analysis of Variance (ANOVA) with Equal Variances Across Groups Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements:
More informationMULTIVARIATE ANALYSIS OF VARIANCE
MULTIVARIATE ANALYSIS OF VARIANCE RAJENDER PARSAD AND L.M. BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi - 0 0 lmb@iasri.res.in. Introduction In many agricultural experiments,
More informationMultiple Dependent Hypothesis Tests in Geographically Weighted Regression
Multiple Dependent Hypothesis Tests in Geographically Weighted Regression Graeme Byrne 1, Martin Charlton 2, and Stewart Fotheringham 3 1 La Trobe University, Bendigo, Victoria Austrlaia Telephone: +61
More informationA note on tree gatekeeping procedures in clinical trials
STATISTICS IN MEDICINE Statist. Med. 2008; 06:1 6 [Version: 2002/09/18 v1.11] A note on tree gatekeeping procedures in clinical trials Alex Dmitrienko 1, Ajit C. Tamhane 2, Lingyun Liu 2, Brian L. Wiens
More informationLogistic Regression Analysis
Logistic Regression Analysis Predicting whether an event will or will not occur, as well as identifying the variables useful in making the prediction, is important in most academic disciplines as well
More informationChapter Seven: Multi-Sample Methods 1/52
Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze
More informationIntroduction to Business Statistics QM 220 Chapter 12
Department of Quantitative Methods & Information Systems Introduction to Business Statistics QM 220 Chapter 12 Dr. Mohammad Zainal 12.1 The F distribution We already covered this topic in Ch. 10 QM-220,
More informationModified Simes Critical Values Under Positive Dependence
Modified Simes Critical Values Under Positive Dependence Gengqian Cai, Sanat K. Sarkar Clinical Pharmacology Statistics & Programming, BDS, GlaxoSmithKline Statistics Department, Temple University, Philadelphia
More informationarxiv: v4 [stat.me] 20 Oct 2017
A simple method for implementing Monte Carlo tests Dong Ding, Axel Gandy, and Georg Hahn Department of Mathematics, Imperial College London arxiv:1611.01675v4 [stat.me] 20 Oct 2017 Abstract We consider
More informationA. Motivation To motivate the analysis of variance framework, we consider the following example.
9.07 ntroduction to Statistics for Brain and Cognitive Sciences Emery N. Brown Lecture 14: Analysis of Variance. Objectives Understand analysis of variance as a special case of the linear model. Understand
More informationF n and theoretical, F 0 CDF values, for the ordered sample
Material E A S E 7 AMPTIAC Jorge Luis Romeu IIT Research Institute Rome, New York STATISTICAL ANALYSIS OF MATERIAL DATA PART III: ON THE APPLICATION OF STATISTICS TO MATERIALS ANALYSIS Introduction This
More informationRevisiting the returns-volume relationship: Time variation, alternative measures and the nancial crisis
*Manuscript Click here to view linked References Revisiting the returns-volume relationship: Time variation, alternative measures and the nancial crisis Steve Cook ; y and Duncan Watson z September 20,
More informationTutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances
Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances Preface Power is the probability that a study will reject the null hypothesis. The estimated probability is a function
More informationAnalysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems
Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Jeremy S. Conner and Dale E. Seborg Department of Chemical Engineering University of California, Santa Barbara, CA
More informationDIFFERENT APPROACHES TO STATISTICAL INFERENCE: HYPOTHESIS TESTING VERSUS BAYESIAN ANALYSIS
DIFFERENT APPROACHES TO STATISTICAL INFERENCE: HYPOTHESIS TESTING VERSUS BAYESIAN ANALYSIS THUY ANH NGO 1. Introduction Statistics are easily come across in our daily life. Statements such as the average
More informationNonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown
Nonparametric Statistics Leah Wright, Tyler Ross, Taylor Brown Before we get to nonparametric statistics, what are parametric statistics? These statistics estimate and test population means, while holding
More informationThe simple linear regression model discussed in Chapter 13 was written as
1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationHigh-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018
High-Throughput Sequencing Course Multiple Testing Biostatistics and Bioinformatics Summer 2018 Introduction You have previously considered the significance of a single gene Introduction You have previously
More informationSIMULATED POWER OF SOME DISCRETE GOODNESS- OF-FIT TEST STATISTICS FOR TESTING THE NULL HYPOTHESIS OF A ZIG-ZAG DISTRIBUTION
Far East Journal of Theoretical Statistics Volume 28, Number 2, 2009, Pages 57-7 This paper is available online at http://www.pphmj.com 2009 Pushpa Publishing House SIMULATED POWER OF SOME DISCRETE GOODNESS-
More informationSequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process
Applied Mathematical Sciences, Vol. 4, 2010, no. 62, 3083-3093 Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process Julia Bondarenko Helmut-Schmidt University Hamburg University
More informationA REVERSE TO THE JEFFREYS LINDLEY PARADOX
PROBABILITY AND MATHEMATICAL STATISTICS Vol. 38, Fasc. 1 (2018), pp. 243 247 doi:10.19195/0208-4147.38.1.13 A REVERSE TO THE JEFFREYS LINDLEY PARADOX BY WIEBE R. P E S T M A N (LEUVEN), FRANCIS T U E R
More informationA comparison of inverse transform and composition methods of data simulation from the Lindley distribution
Communications for Statistical Applications and Methods 2016, Vol. 23, No. 6, 517 529 http://dx.doi.org/10.5351/csam.2016.23.6.517 Print ISSN 2287-7843 / Online ISSN 2383-4757 A comparison of inverse transform
More informationUsing Simulation and Symbolic Computing in Linear Programming
Proceedings of the 6th WSEAS International Conference on Simulation, Modelling and Optimization, Lisbon, Portugal, September 22-24, 2006 571 Using Simulation and Symbolic Computing in Linear Programming
More informationHYPOTHESIS TESTING. Hypothesis Testing
MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.
More informationIntroduction to Statistical Data Analysis III
Introduction to Statistical Data Analysis III JULY 2011 Afsaneh Yazdani Preface Major branches of Statistics: - Descriptive Statistics - Inferential Statistics Preface What is Inferential Statistics? The
More informationApplied Regression Modeling
Applied Regression Modeling A Business Approach Iain Pardoe University of Oregon Charles H. Lundquist College of Business Eugene, Oregon WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION CONTENTS
More informationWeek 5 Video 1 Relationship Mining Correlation Mining
Week 5 Video 1 Relationship Mining Correlation Mining Relationship Mining Discover relationships between variables in a data set with many variables Many types of relationship mining Correlation Mining
More informationAn Approximate Test for Homogeneity of Correlated Correlation Coefficients
Quality & Quantity 37: 99 110, 2003. 2003 Kluwer Academic Publishers. Printed in the Netherlands. 99 Research Note An Approximate Test for Homogeneity of Correlated Correlation Coefficients TRIVELLORE
More informationGlossary. The ISI glossary of statistical terms provides definitions in a number of different languages:
Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the
More informationAnalysis of Variance (ANOVA)
Analysis of Variance (ANOVA) Two types of ANOVA tests: Independent measures and Repeated measures Comparing 2 means: X 1 = 20 t - test X 2 = 30 How can we Compare 3 means?: X 1 = 20 X 2 = 30 X 3 = 35 ANOVA
More informationMultivariate Regression Generalized Likelihood Ratio Tests for FMRI Activation
Multivariate Regression Generalized Likelihood Ratio Tests for FMRI Activation Daniel B Rowe Division of Biostatistics Medical College of Wisconsin Technical Report 40 November 00 Division of Biostatistics
More informationOikos. Appendix 1 and 2. o20751
Oikos o20751 Rosindell, J. and Cornell, S. J. 2013. Universal scaling of species-abundance distributions across multiple scales. Oikos 122: 1101 1111. Appendix 1 and 2 Universal scaling of species-abundance
More informationChapter 13 Section D. F versus Q: Different Approaches to Controlling Type I Errors with Multiple Comparisons
Explaining Psychological Statistics (2 nd Ed.) by Barry H. Cohen Chapter 13 Section D F versus Q: Different Approaches to Controlling Type I Errors with Multiple Comparisons In section B of this chapter,
More informationMULTISTAGE AND MIXTURE PARALLEL GATEKEEPING PROCEDURES IN CLINICAL TRIALS
Journal of Biopharmaceutical Statistics, 21: 726 747, 2011 Copyright Taylor & Francis Group, LLC ISSN: 1054-3406 print/1520-5711 online DOI: 10.1080/10543406.2011.551333 MULTISTAGE AND MIXTURE PARALLEL
More informationHypothesis T e T sting w ith with O ne O One-Way - ANOV ANO A V Statistics Arlo Clark Foos -
Hypothesis Testing with One-Way ANOVA Statistics Arlo Clark-Foos Conceptual Refresher 1. Standardized z distribution of scores and of means can be represented as percentile rankings. 2. t distribution
More informationTypes of Statistical Tests DR. MIKE MARRAPODI
Types of Statistical Tests DR. MIKE MARRAPODI Tests t tests ANOVA Correlation Regression Multivariate Techniques Non-parametric t tests One sample t test Independent t test Paired sample t test One sample
More informationMathematical statistics
October 20 th, 2018 Lecture 17: Tests of Hypotheses Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 14 Probability reviews Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation
More informationBIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES
BIOL 458 - Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES PART 1: INTRODUCTION TO ANOVA Purpose of ANOVA Analysis of Variance (ANOVA) is an extremely useful statistical method
More informationDifference in two or more average scores in different groups
ANOVAs Analysis of Variance (ANOVA) Difference in two or more average scores in different groups Each participant tested once Same outcome tested in each group Simplest is one-way ANOVA (one variable as
More informationCorrelation. A statistics method to measure the relationship between two variables. Three characteristics
Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction
More informationHotelling s One- Sample T2
Chapter 405 Hotelling s One- Sample T2 Introduction The one-sample Hotelling s T2 is the multivariate extension of the common one-sample or paired Student s t-test. In a one-sample t-test, the mean response
More informationTutorial 2: Power and Sample Size for the Paired Sample t-test
Tutorial 2: Power and Sample Size for the Paired Sample t-test Preface Power is the probability that a study will reject the null hypothesis. The estimated probability is a function of sample size, variability,
More informationThis document contains 3 sets of practice problems.
P RACTICE PROBLEMS This document contains 3 sets of practice problems. Correlation: 3 problems Regression: 4 problems ANOVA: 8 problems You should print a copy of these practice problems and bring them
More informationStatistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers
Statistical Inference Greg C Elvers 1 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population and not just the sample that we used But our sample
More informationINTRODUCTION TO INTERSECTION-UNION TESTS
INTRODUCTION TO INTERSECTION-UNION TESTS Jimmy A. Doi, Cal Poly State University San Luis Obispo Department of Statistics (jdoi@calpoly.edu Key Words: Intersection-Union Tests; Multiple Comparisons; Acceptance
More informationInverse-variance Weighted Average
Fixed Effects Meta-Analysisand Homogeneity Evaluation Jeff Valentine University of Louisville Campbell Collaboration Annual Colloquium Oslo 2009 Inverse-variance Weighted Average All effect sizes are not
More informationBusiness Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal
Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing
More informationIndependent Samples ANOVA
Independent Samples ANOVA In this example students were randomly assigned to one of three mnemonics (techniques for improving memory) rehearsal (the control group; simply repeat the words), visual imagery
More informationStudy Ch. 9.3, #47 53 (45 51), 55 61, (55 59)
GOALS: 1. Understand that 2 approaches of hypothesis testing exist: classical or critical value, and p value. We will use the p value approach. 2. Understand the critical value for the classical approach
More informationA Note on Hypothesis Testing with Random Sample Sizes and its Relationship to Bayes Factors
Journal of Data Science 6(008), 75-87 A Note on Hypothesis Testing with Random Sample Sizes and its Relationship to Bayes Factors Scott Berry 1 and Kert Viele 1 Berry Consultants and University of Kentucky
More informationContinuous Univariate Distributions
Continuous Univariate Distributions Volume 2 Second Edition NORMAN L. JOHNSON University of North Carolina Chapel Hill, North Carolina SAMUEL KOTZ University of Maryland College Park, Maryland N. BALAKRISHNAN
More informationA discussion on multiple regression models
A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value
More informationApproximate Test for Comparing Parameters of Several Inverse Hypergeometric Distributions
Approximate Test for Comparing Parameters of Several Inverse Hypergeometric Distributions Lei Zhang 1, Hongmei Han 2, Dachuan Zhang 3, and William D. Johnson 2 1. Mississippi State Department of Health,
More informationClosure properties of classes of multiple testing procedures
AStA Adv Stat Anal (2018) 102:167 178 https://doi.org/10.1007/s10182-017-0297-0 ORIGINAL PAPER Closure properties of classes of multiple testing procedures Georg Hahn 1 Received: 28 June 2016 / Accepted:
More informationTesting for Unit Roots with Cointegrated Data
Discussion Paper No. 2015-57 August 19, 2015 http://www.economics-ejournal.org/economics/discussionpapers/2015-57 Testing for Unit Roots with Cointegrated Data W. Robert Reed Abstract This paper demonstrates
More informationStatistical methods and data analysis
Statistical methods and data analysis Teacher Stefano Siboni Aim The aim of the course is to illustrate the basic mathematical tools for the analysis and modelling of experimental data, particularly concerning
More informationTutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances. Acknowledgements:
Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported in
More informationNon-specific filtering and control of false positives
Non-specific filtering and control of false positives Richard Bourgon 16 June 2009 bourgon@ebi.ac.uk EBI is an outstation of the European Molecular Biology Laboratory Outline Multiple testing I: overview
More informationLecture 5: Unit Roots, Cointegration and Error Correction Models The Spurious Regression Problem
Lecture 5: Unit Roots, Cointegration and Error Correction Models The Spurious Regression Problem Prof. Massimo Guidolin 20192 Financial Econometrics Winter/Spring 2018 Overview Defining cointegration Vector
More informationLimited Dependent Variables and Panel Data
and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement
More informationDistribution Theory. Comparison Between Two Quantiles: The Normal and Exponential Cases
Communications in Statistics Simulation and Computation, 34: 43 5, 005 Copyright Taylor & Francis, Inc. ISSN: 0361-0918 print/153-4141 online DOI: 10.1081/SAC-00055639 Distribution Theory Comparison Between
More informationModel Analysis for Partitioning of Dry Matter and Plant Nitrogen for Stem and Leaf in Alfalfa
Communications in Soil Science and Plant Analysis, 36: 1163 1175, 2005 Copyright # Taylor & Francis, Inc. ISSN 0010-3624 print/1532-2416 online DOI: 10.1081/CSS-200056889 Model Analysis for Partitioning
More informationEpidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval
Epidemiology 9509 Principles of Biostatistics Chapter 10 - Inferences about John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. differences in
More informationCausal Model Selection Hypothesis Tests in Systems Genetics
1 Causal Model Selection Hypothesis Tests in Systems Genetics Elias Chaibub Neto and Brian S Yandell SISG 2012 July 13, 2012 2 Correlation and Causation The old view of cause and effect... could only fail;
More informationCOMPARING SEVERAL MEANS: ANOVA
LAST UPDATED: November 15, 2012 COMPARING SEVERAL MEANS: ANOVA Objectives 2 Basic principles of ANOVA Equations underlying one-way ANOVA Doing a one-way ANOVA in R Following up an ANOVA: Planned contrasts/comparisons
More informationA Test of Homogeneity Against Umbrella Scale Alternative Based on Gini s Mean Difference
J. Stat. Appl. Pro. 2, No. 2, 145-154 (2013) 145 Journal of Statistics Applications & Probability An International Journal http://dx.doi.org/10.12785/jsap/020207 A Test of Homogeneity Against Umbrella
More informationExample. χ 2 = Continued on the next page. All cells
Section 11.1 Chi Square Statistic k Categories 1 st 2 nd 3 rd k th Total Observed Frequencies O 1 O 2 O 3 O k n Expected Frequencies E 1 E 2 E 3 E k n O 1 + O 2 + O 3 + + O k = n E 1 + E 2 + E 3 + + E
More informationContrasts and Multiple Comparisons Supplement for Pages
Contrasts and Multiple Comparisons Supplement for Pages 302-323 Brian Habing University of South Carolina Last Updated: July 20, 2001 The F-test from the ANOVA table allows us to test the null hypothesis
More informationNeuendorf MANOVA /MANCOVA. Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y4. Like ANOVA/ANCOVA:
1 Neuendorf MANOVA /MANCOVA Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y1 Y2 Y3 Y4 Like ANOVA/ANCOVA: 1. Assumes equal variance (equal covariance matrices) across cells (groups defined by
More informationResearch Article The Laplace Likelihood Ratio Test for Heteroscedasticity
International Mathematics and Mathematical Sciences Volume 2011, Article ID 249564, 7 pages doi:10.1155/2011/249564 Research Article The Laplace Likelihood Ratio Test for Heteroscedasticity J. Martin van
More informationFundamentals of Probability Theory and Mathematical Statistics
Fundamentals of Probability Theory and Mathematical Statistics Gerry Del Fiacco Math Center Metropolitan State University St. Paul, Minnesota June 6, 2016 1 Preface This collection of material was researched,
More informationNONPARAMETRIC TESTS. LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-12
NONPARAMETRIC TESTS LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-1 lmb@iasri.res.in 1. Introduction Testing (usually called hypothesis testing ) play a major
More informationElectronic Research Archive of Blekinge Institute of Technology
Electronic Research Archive of Blekinge Institute of Technology http://www.bth.se/fou/ This is an author produced version of a ournal paper. The paper has been peer-reviewed but may not include the final
More informationMathematical statistics
November 1 st, 2018 Lecture 18: Tests about a population mean Overview 9.1 Hypotheses and test procedures test procedures errors in hypothesis testing significance level 9.2 Tests about a population mean
More informationEstimation of Stress-Strength Reliability for Kumaraswamy Exponential Distribution Based on Upper Record Values
International Journal of Contemporary Mathematical Sciences Vol. 12, 2017, no. 2, 59-71 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ijcms.2017.7210 Estimation of Stress-Strength Reliability for
More informationMultiple testing: Intro & FWER 1
Multiple testing: Intro & FWER 1 Mark van de Wiel mark.vdwiel@vumc.nl Dep of Epidemiology & Biostatistics,VUmc, Amsterdam Dep of Mathematics, VU 1 Some slides courtesy of Jelle Goeman 1 Practical notes
More informationChapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1
Chapter 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Example 10-2: Absences/Final Grades Please enter the data below in L1 and L2. The data appears on page 537 of your textbook.
More informationClass 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving
More informationTesting for a break in persistence under long-range dependencies and mean shifts
Testing for a break in persistence under long-range dependencies and mean shifts Philipp Sibbertsen and Juliane Willert Institute of Statistics, Faculty of Economics and Management Leibniz Universität
More informationFACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING
FACTOR ANALYSIS AND MULTIDIMENSIONAL SCALING Vishwanath Mantha Department for Electrical and Computer Engineering Mississippi State University, Mississippi State, MS 39762 mantha@isip.msstate.edu ABSTRACT
More informationSTAT 328 (Statistical Packages)
Department of Statistics and Operations Research College of Science King Saud University Exercises STAT 328 (Statistical Packages) nashmiah r.alshammari ^-^ Excel and Minitab - 1 - Write the commands of
More informationData envelopment analysis
15 Data envelopment analysis The purpose of data envelopment analysis (DEA) is to compare the operating performance of a set of units such as companies, university departments, hospitals, bank branch offices,
More informationOptimal SPRT and CUSUM Procedures using Compressed Limit Gauges
Optimal SPRT and CUSUM Procedures using Compressed Limit Gauges P. Lee Geyer Stefan H. Steiner 1 Faculty of Business McMaster University Hamilton, Ontario L8S 4M4 Canada Dept. of Statistics and Actuarial
More informationAnalysis of Variance. ภาว น ศ ร ประภาน ก ล คณะเศรษฐศาสตร มหาว ทยาล ยธรรมศาสตร
Analysis of Variance ภาว น ศ ร ประภาน ก ล คณะเศรษฐศาสตร มหาว ทยาล ยธรรมศาสตร pawin@econ.tu.ac.th Outline Introduction One Factor Analysis of Variance Two Factor Analysis of Variance ANCOVA MANOVA Introduction
More informationA multiple testing procedure for input variable selection in neural networks
A multiple testing procedure for input variable selection in neural networks MicheleLaRoccaandCiraPerna Department of Economics and Statistics - University of Salerno Via Ponte Don Melillo, 84084, Fisciano
More information