U.S. Department of Agriculture, Beltsville, Maryland 20705
|
|
- Myrtle Andrews
- 6 years ago
- Views:
Transcription
1 AN EVALUATION OF MULTIPLE COMPARISON PROCEDURES D. R. Waldo 1'2 U.S. Department of Agriculture, Beltsville, Maryland SUMMARY Least significant difference, Duncan's multiple range test, Student-Newman-Keuls, Tukey's significant difference, and Scheffe's significant difference procedures are compared. The criteria of comparison are Type 1 and Type II error probabilities. Type I error probabilities on 2-treatment and n-treatment bases are theoretically related. The theoretical error probabilities are compared to published observed error probabilities. The choice of a test procedure rests on the relative importance of Type I and Type II errors. (Key Words: Multiple Comparisons, Least Significant Difference, Duncan's Multiple Range Test, Student-Newman-Keuls Test, Tukey's Significant Difference, Scheffe's Significant Difference.) INTRODUCTION Multiple comparison procedures have been available for several years. Probably the most widely used is that of Duncan (1955), which has even been included in the Statistical Analysis System (Barr and Goodnight, 1972) program. However, Gill (1973) has attacked Duncan's test on the basis that it does not control experimentwise Type I error probabilities. Some journals have developed or are developing editorial policies that either strongly support or strongly reject Duncan's test. Even with several recent papers on multiple comparison procedures (Warde, 1970; O'Neill and Wetherill, 1971; Carmer and Swanson, 1973; Gill, 1973) very different viewpoints persist. These more ' 4 I A.R.S., Nutrition Institute, Ruminant Nutrition Laboratory, BARe-East. 2The author deeply appreciates the statistical discussions with David B. Duncan, Johns Hopkins University and Judson U. McGuire, Jr., A.R.S., U.S.D.A. Patricia M. Cochran, A.R.S., U.S.D.A., improved the grammar and the mathematical clarity, precision, and consistency. The author appreciates the difficult typing job performed by Jo Ann Brown and Jenny Hysan. recent and more applied papers have used very little of the more fundamental statistical theory described by Harter (1957) and Duncan (1965). The objective of this paper is to present a comparison in a precise, point-by-point discussion of fundamental relationships and support these with published Monte Carlo results. Guidance for practical decisions on the use of these methods will be based on these theoretical relationships. BASES FOR EVALUATION Basically, two types of error are possible when a statistical decision is made, as indicated in table I. Deciding that a treatment difference exists, when in fact it does not, results in a Type I error which has probability, r Deciding that a treatment difference does not exist, when in fact it does, results in a Type II error which has probability,/3. For multiple comparisons testing, the Type I error requires further subdivisions. These subdivisions are usually termed comparisonwise, ac, and experimentwise, a E, error (Tukey, 1953) and are defined as: ~C = ~E = Number of erroneous inferences Number of inferences Number of experiments with one or more erroneous inferences Number of experiments Although the term, experimentwise, is correct for a simple one-way analysis or a completely random experiment, it is a misnomer for more complicated experiments because it loses its implied relation to the conventional F test. Consider a 2 3 factorial experiment conducted in a completely random design. Assuming five replications the analysis of variance is given in table 2. This one experiment could be tested with three F tests. If only three F tests are used with each at e~ =.05, the error probability in the experiment rises to This relation is described by (l--& E) = (1--ac)k for 539 JOURNAL OF ANIMAL SCIENCE, Vol. 42, No. 2, 1976
2 540 WALDO TABLE 1. TYPES OF ERRORS ARISING FROM DECISIONS ABOUT POPULATIONS Population null hypothesis Decision Type of Probability (H o) on H o error of error True Accept None... True Reject Type I a False Accept Type II 3 False Reject None... k independent comparisons (Steel; 1961). Even though the mean squares in the numerator are independent the tests are not independent because the same mean square for error was used (Kempthorne 1952, p. 245). The effect of this lack of independence decreases with increasing degrees of freedom for error. The dependence introduced by the repeated use of the same mean square for error was ignored in this paper. For this discussion, the terminology of Duncan (1955) seems more consistent. His generalized 'p-mean significance level' or O:p will be used, where a2 is equivalent to the comparisonwise ~ and a n is equivalent to the experimentwise a of the one way classification. For more complex designs the a n is equivalent to the error chosen for the F test of any one line in any complex analysis. A third type of error, Type III, is possible in multiple comparisons testing (Harter, undated; Carmer and Swanson, 1973). Type III error was considered outside the scope of this paper. BASES FOR TESTS Three of the five tests to be described are based on the studentized range, q (ap, p, n2), where q is studentized range statistic, ap is the Type 1 error probability, p = 2, 3... n l is the number of treatment means, and n2 is the error degrees of freedom. An example of the tabu- Source TABLE 2. ANALYSIS OF VARIANCE TABLE Whole plots 30 Mean 1 A 1 B 2 AB 2 Error 24 df lated values given a =.05, nl = 5, n2 = 10 is shown in table 3. The first of three tests is the least significant difference (LSD) or muhiple t test (Student, 1908). I.SD = q (a2, 2, n2) Sx (1) where c~2 is the 2-treatment error rate and p = 2; i.e., it has a single and minimum critical value. The second test is Tukey's (1953) significant difference (TSD), TSD = q (an nl, n2) Sx (2) where ~n is the n-treatment error rate and p = n~; i.e., it has a single and maximum critical value. The third test is Student (1927)--Newman (1939)--Keuls (1952) or (SNK), SNKp = q (ap, p, n2)s~ (3) where ap is the p-treatment error rate and p = 2, 3,..., nl; i.e., it has n1-1 critical values. These critical values are intermediate between the LSD and the TSD. The fourth test is Duncan's (1955) multiple range test (MRT), MRTp = Q (t~p, p, n2 ) s,~ (4) where Q is the multiple range statistic, ap is the p-treatment error rate, and p = 2, 3,..., nl ; i.e., it has n~-i critical values. The C~p = 1-(1-a2)P-1 by definition (Duncan, 1955). The Q values are tabulated in the same way as q values. An example is given in table 3. Critical values to four significant digits and a =.10,.05,.01,.005 and.001 are available (Harter, 1960). The tabulated values of Q are less than those of q for p>2, so the critical values of MRT, except when n~ = 2, are always greater than the critical value of LSD and less than the critical values of SNK. The fifth test is Scheffe's (1953) significant difference (SSD),
3 MULTIPLE COMPARISON PROCEDURES 541 TABLE 3. STUDENTIZED RANGES p = number of treatment means Error Range df a Ordinary, qa Special, Qb asteel and Torrie, Table A.8. Federer, Table 1I-1. bsteel and Torrie, Table A.7. Federer, Table il-3. Harter, Table 1 gives critical values with four digits. SSD= [(nl--1)f(an, nl-l, n2)] 1/2Sd (5) where nl-1 is the numerator df, F is the F statistic, a n is the n-treatment error rate or conventional F-test error rate, and n2 is the denominator df. To make this test consistent with the other four tests, use the relation s~ = s d giving SSD= [(nt-1)f(a n,nx-l, n2)l 1/2 42s x " (6) Then for the example in table 4 where nl = 5, n2 = 10 and a n =.05, F = So with s x = 1, SSD = [4(3.48)] 1/2 42 = Except when nl = 2 where all five tests are equivalent, the SSD is always greater than the TSD. POWER OR TYPE II ERROR PROBABILITY The relative sizes of the Type II error probabilities are indicated by the relative critical values tabulated in table 4. All of these critical values must be multiplied by s~- before their use. Smaller critical values imply narrower confidence intervals which imply more powerful tests, which further imply lower Type II error probabilities on a theoretical basis. The ob- served results of Monte Carlo experiments (Carmer and Swanson, 1973) support this ranking of the relative power of these tests. The relative power of these five tests is influenced by both nl and n2. Increasing nl increases the difference between the tests. When nl = 2, results of all tests are identical. When nl increases to 5, as in table 4, the differences between the tests increase. As n2 increases, the differences in relative power decrease. This difference can be seen by constructing a table such as table 4 but by letting n2 = 20. TYPE I ERROR The second major criterion for evaluating these five tests is the Type I error probability. However, as described earlier, it should be recalled that the tests have different Type I error bases. In review, these are listed in table 5. The orthogonal contrast or single degree of freedom test is also included in this discussion. To make the different error bases comparable it is necessary to know the translation equations. The relation of error bases for independent tests is derived from Duncan (1955) or Steel (1961) as: TABLE 4. RELATIVE CRITICAL VALUES a p = number of treatment means Test Theoretical Rank Observed b LSD 3.15 ~ 1 MRTp SNKp TSD ~ SSD ( arequire multiplier of s x. Assumed error df = 10 and Type I error =.05. bcarmer and Swanson, 1973.
4 542 WALDO r: xo00 eq ~- 0 ~l" ~ ' ~ 0 0 O0 0 " 0 0 0,-~ o,10,"-~ o Lr..] >., e,i,,.,. ~~176 ~ ~ ~ 0 O 0 O r 7 I v I < I= II d II A t A I:,X ii u,~ - II g~ r-, el & '4 d ~ ",d e,,i e,,i e~l eq ~ I:I.:-, 0 ~ N m m
5 MULTIPLE COMPARISON PROCEDURES 543 0/nl = 1--(1--0/2) nl-1 (7) This equation is independent of and is the same for all n2. These translation equations apply to orthogonal contrasts because of their independence and to MRT by its definition. When all possible tests among n means are made, the relations of the error bases are given by Duncan (1965) as probability statements. For calculating 0/n when given o/2, 0/n = Pr [qn > t 4-21 (8) where Pr is a probability. Substituting q2 for t ~2 gives, 0/n=Pr [qn >q2] " (9) Using the complementary relation 1--0/p = Pp, where P is defined as a protection level, Pn can substitute for 0/n and at the same time the inequality reverses to give, Pn = Pr [qn < q21 " (10) For calculating 0/2 when given 0/n, P2 = Pr [t < qn / ~] " (11) Multiplying the inequality by ~ tuting q2 for t 4-2-gives, and substi- P2 =Pr [q2<qn] " (12) These relations are dependent on n2, just as q was dependent on n2. These translation equations apply to finding 0/n from tx2 for LSD and finding 0/2 from 0/n for TSD. For practical applications, these changes of error base are easily made by the use of table B-1 of Harter (undated). Example data in the same format are presented in table 6. Similar translations of LSD and TSD error rates may be made by the use of tables of Harter (1957) or Harter et al, (1959) with identical results. For SSD, which is based on the F test, the proper equation for computing the 2-treatment error is given by Duncan (Private communication, 1974). 0/2 = 2 Pr [t > ((nl-1) F) 1/2] (13) In the example in table 5, nl = 5 and F= 3.48 so 0/2 = 2 Pr [t >(4 (3.48)1/2 ] or2pr[t > 3.73]. From table 9 of Pearson and Hartley (1966) with t = 3.7 and n2 = o = 10, Pr [t > 3.7] = =.002 and 0/2 = 2 Pr or.004. Similar translations may be made from table II, p. 149 of Locks et al., (1963) with KP = 0, i.e. non-centrality = 0. The SNK test has p critical values from the tables of q with 0/n theoretically equal to the chosen a, and 0/2 ~< 0/n =.05. Implicitly 0/2 is defined as the probability of getting a wrongful rejection of a null hypothesis for the two-mean test. This definition of 0/2 should not be confused with the maximum for the same probability that is sometimes denoted by 0/2, for example by Duncan (1955). These theoretical equations and their theoretical values are presented in table 5. Base 0/ was assumed to be.05. The criticism of MRT by Gill (1973) has resulted from a comparison of 0/n for MRT with 0/2 for orthogonal contrasts. Of the 0e2 based tests, the orthogonal and MRT have identical 0/n. This identity is a result of the definition of the Qp values. These will be identical for all nl. The proper comparison must be consistently based on either 0/2 or 0/n. The LSD test has a higher n-treatment error than either orthogonal contrasts or MRT. When the LSD and MRT are used only after a significant F test, the n-treatment error is limited to that used in the F test. In applica- TABLE 6. PROBABILITY INTEGRAL (Pr) OF THE STUDENTIZED RANGE a p = number of treatment means q Setting Pp =.95 when p = Setting Pp =.95 when p = n = aharter, undate& Table B1, p. 492 with n~ = 10.
6 544 WALDO tion, this pretesting with F is a common practice. But when the LSD is used after a significant F test, the aa..., an-1 can be as high as that for the LSD without prior F-test significance (Duncan, 1965). Significance statements based on a multiple comparison test should state not only which test was used but also whether it was preceded by a significant F test in the case of LSD and MRT. The tests based on a n =.05 have very low theoretical a 2 values even with nl = 5. With larger nl, the difference between a2 and a n increases. The observed values of Carmer and Swanson (1973) are in good agreement with the theoretical values as presented in table 5. When their observed 2-treatment Type I errors are used, the ranking of all tests is exactly the inverse of the ranking on Type li errors. A quote from Warde (1970) summarizes this paper. "Thus the choice of which procedure to use will depend upon the importance attached by the experimentor to each of the two types of error." Those wishing to minimize Type II errors will tend toward LSD whereas those wishing to minimize Type I errors will tend toward SSD. Duncan (1965) and Waller and Duncan (1969) have proposed Bayesian approaches that optimize these errors, but these approaches were considered beyond the scope of this paper. CONCLUSIONS 1. Where orthogonal contrasts, orthogonal polynominals, or factorial analyses are appropriate, do not use multiple comparisons. 2. State which test is being used. 3. State whether LSD and MRT are used only after a significant F test. LITERATURE CITED Barr, A. J. and J. H. Goodnight Statistical analysis system. Student Supply Stores, North Carolina State Univ., Raleigh Carmer, S. G. and M. R. Swanson An evaluation of ten pairwise multiple comparison procedures by Monte Carlo methods. J. Amer. Statist. Ass. 68:66. Duncan, D. B Multiple range and multiple F tests. Biometrics 11:1. Duncan, D. B A Bayesian approach to multiple comparisons. Technometrics. 7 : 171. Federer, W. T Experimental Design Theory and Applications. The Macmillian Company. New York. Gill, J. L Current status of multiple compari- sons of means in designed experiments. J. Dairy Sci. 56:973. Hatter, H. L Error rates and sample sizes for range tests in multiple comparisons. Biometrics 13:511. Hatter, H. L Critical values for Duncan's new multiple range test. Biometrics 16:671. Hatter, H. L. Undated (but must be 1969 or 1970). Order Statistics and Their Use in Testing and Estimation. Volume 1. Tests Based on Range and Studentized Range of Samples from a Normal Population. Aerospace Research Laboratories. Office of Aerospace Research. United States Air Force. For sale by the Superintendent of Documents, U.S. Government Printing Office, Washington, D.C Harter, H. L., D. S. Clemm and E. H. Guthrie The Probability Integrals of the Range and of the Studentized Range: Probability Integral and Percentage Points of the Studentized Range; Critical Values for Duncan's New Multiple Range Test. WADC Technical Rep Volume II. Wright Air Development Center, Wright-Patterson Air Force Base. Ohio. Kempthorne, O The Design and Analysis of Experiments. John Wiley and Sons, Inc., New York. Keuls, M The use of the studentized range in connection with an analysis of variance. Euphytica. 1:112. Locks, M. O., M. J. Alexander and B. J. Byars New Tables of the Noncentral t Distribution. Aeronautical Research Laboratories. Office of Aerospace Research. United States Air Force. For sale by the Office of Technical Services. U.S. Department of Commerce. Washington 25, D.C. Newman, D The distribution of range in samples from a normal population, expressed in terms of an independent estimate of standard deviation. Biometrika 31:20. O'Neill, R. and G. B. Wetherill The present state of multiple comparison methods. J. Royal Stat. Soc. B33:218. Pearson, E. S. and H. O. Hartley Biometrika Tables for Statisticians. Volume I, 3rd ed. Cambridge University Press. Cambridge, England. Scheffe, H A method for judging all contrasts in the analysis of variance. Biometrika 40:87. Steel, R. G. D Query: Error rates in multiple comparisons. Biometrika 17: 326. Steel, R. G. D. and J. H. Torrie Principles and Procedures of Statistics. McGraw-Hill Book Company, Inc., New York. Student The probable error of a mean. Biometrika 6:1. Student Errors of routine analysis. Biometrika 19:151. Tukey, J. W The problem of multiple comparisons. Unpublished dittoed notes. Princeton University, Princeton, N.J. 396 pp. (As cited by Federer, 1955 or Steel and Torrie, 1960). Waller, R. A. and D. B. Duncan A Bayes rule for the symmetric multiple comparisons problem. J. Amer. Statist. Ass. 65:1484. Warde, W. D A review of multiple comparison procedures. Agricultural Experiment Station. Statistical Laboratory. Iowa State Univ.
Linear Combinations. Comparison of treatment means. Bruce A Craig. Department of Statistics Purdue University. STAT 514 Topic 6 1
Linear Combinations Comparison of treatment means Bruce A Craig Department of Statistics Purdue University STAT 514 Topic 6 1 Linear Combinations of Means y ij = µ + τ i + ǫ ij = µ i + ǫ ij Often study
More informationLec 1: An Introduction to ANOVA
Ying Li Stockholm University October 31, 2011 Three end-aisle displays Which is the best? Design of the Experiment Identify the stores of the similar size and type. The displays are randomly assigned to
More informationANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS
ANALYSIS OF VARIANCE OF BALANCED DAIRY SCIENCE DATA USING SAS Ravinder Malhotra and Vipul Sharma National Dairy Research Institute, Karnal-132001 The most common use of statistics in dairy science is testing
More informationAPPLICATION AND POWER OF PARAMETRIC CRITERIA FOR TESTING THE HOMOGENEITY OF VARIANCES. PART IV
DOI 10.1007/s11018-017-1213-4 Measurement Techniques, Vol. 60, No. 5, August, 2017 APPLICATION AND POWER OF PARAMETRIC CRITERIA FOR TESTING THE HOMOGENEITY OF VARIANCES. PART IV B. Yu. Lemeshko and T.
More informationOutline. Topic 19 - Inference. The Cell Means Model. Estimates. Inference for Means Differences in cell means Contrasts. STAT Fall 2013
Topic 19 - Inference - Fall 2013 Outline Inference for Means Differences in cell means Contrasts Multiplicity Topic 19 2 The Cell Means Model Expressed numerically Y ij = µ i + ε ij where µ i is the theoretical
More informationIntroduction. Chapter 8
Chapter 8 Introduction In general, a researcher wants to compare one treatment against another. The analysis of variance (ANOVA) is a general test for comparing treatment means. When the null hypothesis
More informationLecture 5: Comparing Treatment Means Montgomery: Section 3-5
Lecture 5: Comparing Treatment Means Montgomery: Section 3-5 Page 1 Linear Combination of Means ANOVA: y ij = µ + τ i + ɛ ij = µ i + ɛ ij Linear combination: L = c 1 µ 1 + c 1 µ 2 +...+ c a µ a = a i=1
More informationCOMPARING RELATIVE AND TOTAL COST MULTIPLE COMPARISON PROCEDURES VIA JAMES-STEIN ESTIMATORS. Roy E. Welsch. WP December 1976
COMPARING RELATIVE AND TOTAL COST MULTIPLE COMPARISON PROCEDURES VIA JAMES-STEIN ESTIMATORS Roy E. Welsch WP 892-76 December 1976 U COMPARING MULTIPLE COMPARISON PROCEDURES 1. INTRODUCTION In a recent
More informationMultiple Comparison Procedures Cohen Chapter 13. For EDUC/PSY 6600
Multiple Comparison Procedures Cohen Chapter 13 For EDUC/PSY 6600 1 We have to go to the deductions and the inferences, said Lestrade, winking at me. I find it hard enough to tackle facts, Holmes, without
More informationLinear Combinations of Group Means
Linear Combinations of Group Means Look at the handicap example on p. 150 of the text. proc means data=mth567.disability; class handicap; var score; proc sort data=mth567.disability; by handicap; proc
More informationThe One-Way Independent-Samples ANOVA. (For Between-Subjects Designs)
The One-Way Independent-Samples ANOVA (For Between-Subjects Designs) Computations for the ANOVA In computing the terms required for the F-statistic, we won t explicitly compute any sample variances or
More informationCH.9 Tests of Hypotheses for a Single Sample
CH.9 Tests of Hypotheses for a Single Sample Hypotheses testing Tests on the mean of a normal distributionvariance known Tests on the mean of a normal distributionvariance unknown Tests on the variance
More informationA Note on a Method for the Analysis of Significances en masse. Paul Seeger. Technometrics, Vol. 10, No. 3. (Aug., 1968), pp
A Note on a Method for the Analysis of Significances en masse Paul Seeger Technometrics, Vol. 10, No. 3. (Aug., 1968), pp. 586-593. Stable URL: http://links.jstor.org/sici?sici=0040-1706%28196808%2910%3a3%3c586%3aanoamf%3e2.0.co%3b2-j
More informationA posteriori multiple comparison tests
A posteriori multiple comparison tests 11/15/16 1 Recall the Lakes experiment Source of variation SS DF MS F P Lakes 58.000 2 29.400 8.243 0.006 Error 42.800 12 3.567 Total 101.600 14 The ANOVA tells us
More informationGENERAL PROBLEMS OF METROLOGY AND MEASUREMENT TECHNIQUE
DOI 10.1007/s11018-017-1141-3 Measurement Techniques, Vol. 60, No. 1, April, 2017 GENERAL PROBLEMS OF METROLOGY AND MEASUREMENT TECHNIQUE APPLICATION AND POWER OF PARAMETRIC CRITERIA FOR TESTING THE HOMOGENEITY
More informationLaboratory Topics 4 & 5
PLS205 Lab 3 January 23, 2014 Orthogonal contrasts Class comparisons in SAS Trend analysis in SAS Multiple mean comparisons Laboratory Topics 4 & 5 Orthogonal contrasts Planned, single degree-of-freedom
More informationSTAT 5200 Handout #7a Contrasts & Post hoc Means Comparisons (Ch. 4-5)
STAT 5200 Handout #7a Contrasts & Post hoc Means Comparisons Ch. 4-5) Recall CRD means and effects models: Y ij = µ i + ϵ ij = µ + α i + ϵ ij i = 1,..., g ; j = 1,..., n ; ϵ ij s iid N0, σ 2 ) If we reject
More informationDescriptive Statistics
Descriptive Statistics Once an experiment is carried out and the results are measured, the researcher has to decide whether the results of the treatments are different. This would be easy if the results
More informationIntroduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs
Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs The Analysis of Variance (ANOVA) The analysis of variance (ANOVA) is a statistical technique
More informationThe legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.
1 Chapter 1: Research Design Principles The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization. 2 Chapter 2: Completely Randomized Design
More informationhttp://www.statsoft.it/out.php?loc=http://www.statsoft.com/textbook/ Group comparison test for independent samples The purpose of the Analysis of Variance (ANOVA) is to test for significant differences
More informationSt. Xavier s College Autonomous Mumbai T.Y.B.A. Syllabus For 6 th Semester Courses in Statistics (June 2016 onwards)
St. Xavier s College Autonomous Mumbai T.Y.B.A Syllabus For 6 th Semester Courses in Statistics (June 2016 onwards) Contents: Theory Syllabus for Courses: A.STA.6.01 Probability & Sampling Distributions
More informationSTAT 115:Experimental Designs
STAT 115:Experimental Designs Josefina V. Almeda 2013 Multisample inference: Analysis of Variance 1 Learning Objectives 1. Describe Analysis of Variance (ANOVA) 2. Explain the Rationale of ANOVA 3. Compare
More informationThese are all actually contrasts (the coef sum to zero). What are these contrasts representing? What would make them large?
Lecture 12 Comparing treatment effects Orthogonal Contrasts What use are contrasts? Recall the Cotton data In this case, the treatment levels have an ordering to them this is not always the case) Consider
More informationSimultaneous Concentration Bands for Continuous Random Samples
Worcester Polytechnic Institute Digital WPI Mathematical Sciences Faculty Publications Department of Mathematical Sciences 10-1-2004 Simultaneous Concentration Bands for Continuous Random Samples Balgobin
More informationH0: Tested by k-grp ANOVA
Pairwise Comparisons ANOVA for multiple condition designs Pairwise comparisons and RH Testing Alpha inflation & Correction LSD & HSD procedures Alpha estimation reconsidered H0: Tested by k-grp ANOVA Regardless
More informationThe entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.
One-Way ANOVA Summary The One-Way ANOVA procedure is designed to construct a statistical model describing the impact of a single categorical factor X on a dependent variable Y. Tests are run to determine
More informationPSYC 331 STATISTICS FOR PSYCHOLOGISTS
PSYC 331 STATISTICS FOR PSYCHOLOGISTS Session 4 A PARAMETRIC STATISTICAL TEST FOR MORE THAN TWO POPULATIONS Lecturer: Dr. Paul Narh Doku, Dept of Psychology, UG Contact Information: pndoku@ug.edu.gh College
More informationAN EMPIRICAL INVESTIGATION OF TUKEY 1 S HONESTLY SIGNIFICANT DIFFERENCE TEST WITH VARIANCE HETEROGENEITY AND EQUAL SAMPLE SIZES,
37
More informationSTATISTICS 174: APPLIED STATISTICS FINAL EXAM DECEMBER 10, 2002
Time allowed: 3 HOURS. STATISTICS 174: APPLIED STATISTICS FINAL EXAM DECEMBER 10, 2002 This is an open book exam: all course notes and the text are allowed, and you are expected to use your own calculator.
More informationFundamental Probability and Statistics
Fundamental Probability and Statistics "There are known knowns. These are things we know that we know. There are known unknowns. That is to say, there are things that we know we don't know. But there are
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More informationPresented to the Graduate Council of the. North Texas State University. in Partial. Fulfillment of the Requirements. For the Degree of.
AN EMPIRICAL INVESTIGATION OF TUKEY'S HONESTLY SIGNIFICANT DIFFERENCE TEST WITH VARIANCE HETEROGENEITY AND UNEQUAL SAMPLE SIZES, UTILIZING KRAMER'S PROCEDURE AND THE HARMONIC MEAN DISSERTATION Presented
More information1 One-way Analysis of Variance
1 One-way Analysis of Variance Suppose that a random sample of q individuals receives treatment T i, i = 1,,... p. Let Y ij be the response from the jth individual to be treated with the ith treatment
More informationMean Comparisons PLANNED F TESTS
Mean Comparisons F-tests provide information on significance of treatment effects, but no information on what the treatment effects are. Comparisons of treatment means provide information on what the treatment
More information16.400/453J Human Factors Engineering. Design of Experiments II
J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential
More informationBiological Applications of ANOVA - Examples and Readings
BIO 575 Biological Applications of ANOVA - Winter Quarter 2010 Page 1 ANOVA Pac Biological Applications of ANOVA - Examples and Readings One-factor Model I (Fixed Effects) This is the same example for
More informationBattery Life. Factory
Statistics 354 (Fall 2018) Analysis of Variance: Comparing Several Means Remark. These notes are from an elementary statistics class and introduce the Analysis of Variance technique for comparing several
More informationInferences About Two Proportions
Inferences About Two Proportions Quantitative Methods II Plan for Today Sampling two populations Confidence intervals for differences of two proportions Testing the difference of proportions Examples 1
More informationSpecific Differences. Lukas Meier, Seminar für Statistik
Specific Differences Lukas Meier, Seminar für Statistik Problem with Global F-test Problem: Global F-test (aka omnibus F-test) is very unspecific. Typically: Want a more precise answer (or have a more
More informationGroup comparison test for independent samples
Group comparison test for independent samples The purpose of the Analysis of Variance (ANOVA) is to test for significant differences between means. Supposing that: samples come from normal populations
More informationDETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics
DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and
More informationApplication of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption
Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption Alisa A. Gorbunova and Boris Yu. Lemeshko Novosibirsk State Technical University Department of Applied Mathematics,
More informationChapter Seven: Multi-Sample Methods 1/52
Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze
More informationH0: Tested by k-grp ANOVA
Analyses of K-Group Designs : Omnibus F, Pairwise Comparisons & Trend Analyses ANOVA for multiple condition designs Pairwise comparisons and RH Testing Alpha inflation & Correction LSD & HSD procedures
More informationSTAT 135 Lab 7 Distributions derived from the normal distribution, and comparing independent samples.
STAT 135 Lab 7 Distributions derived from the normal distribution, and comparing independent samples. Rebecca Barter March 16, 2015 The χ 2 distribution The χ 2 distribution We have seen several instances
More informationLecture Notes #3: Contrasts and Post Hoc Tests 3-1
Lecture Notes #3: Contrasts and Post Hoc Tests 3-1 Richard Gonzalez Psych 613 Version 2.6 (9/2016) LECTURE NOTES #3: Contrasts and Post Hoc tests Reading assignment Read MD chs 4, 5, & 6 Read G chs 7 &
More informationPsychology 282 Lecture #4 Outline Inferences in SLR
Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations
More informationOrthogonal, Planned and Unplanned Comparisons
This is a chapter excerpt from Guilford Publications. Data Analysis for Experimental Design, by Richard Gonzalez Copyright 2008. 8 Orthogonal, Planned and Unplanned Comparisons 8.1 Introduction In this
More information2 Hand-out 2. Dr. M. P. M. M. M c Loughlin Revised 2018
Math 403 - P. & S. III - Dr. McLoughlin - 1 2018 2 Hand-out 2 Dr. M. P. M. M. M c Loughlin Revised 2018 3. Fundamentals 3.1. Preliminaries. Suppose we can produce a random sample of weights of 10 year-olds
More informationMultiple comparisons of slopes of regression lines. Jolanta Wojnar, Wojciech Zieliński
Multiple comparisons of slopes of regression lines Jolanta Wojnar, Wojciech Zieliński Institute of Statistics and Econometrics University of Rzeszów ul Ćwiklińskiej 2, 35-61 Rzeszów e-mail: jwojnar@univrzeszowpl
More informationWhat p values really mean (and why I should care) Francis C. Dane, PhD
What p values really mean (and why I should care) Francis C. Dane, PhD Session Objectives Understand the statistical decision process Appreciate the limitations of interpreting p values Value the use of
More informationExamining Multiple Comparison Procedures According to Error Rate, Power Type and False Discovery Rate
Journal of Modern Applied Statistical Methods Volume 11 Issue 2 Article 7 11-1-2012 Examining Multiple Comparison Procedures According to Error Rate, Power Type and False Discovery Rate Guven Ozkaya Uludag
More informationHypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc.
Hypothesis Tests and Estimation for Population Variances 11-1 Learning Outcomes Outcome 1. Formulate and carry out hypothesis tests for a single population variance. Outcome 2. Develop and interpret confidence
More information7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between
7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation
More informationSTAT 536: Genetic Statistics
STAT 536: Genetic Statistics Tests for Hardy Weinberg Equilibrium Karin S. Dorman Department of Statistics Iowa State University September 7, 2006 Statistical Hypothesis Testing Identify a hypothesis,
More information4:3 LEC - PLANNED COMPARISONS AND REGRESSION ANALYSES
4:3 LEC - PLANNED COMPARISONS AND REGRESSION ANALYSES FOR SINGLE FACTOR BETWEEN-S DESIGNS Planned or A Priori Comparisons We previously showed various ways to test all possible pairwise comparisons for
More informationTopic 12. The Split-plot Design and its Relatives (continued) Repeated Measures
12.1 Topic 12. The Split-plot Design and its Relatives (continued) Repeated Measures 12.9 Repeated measures analysis Sometimes researchers make multiple measurements on the same experimental unit. We have
More informationUnified approach to the classical statistical analysis of small signals
PHYSICAL REVIEW D VOLUME 57, NUMBER 7 1 APRIL 1998 Unified approach to the classical statistical analysis of small signals Gary J. Feldman * Department of Physics, Harvard University, Cambridge, Massachusetts
More informationMultiple Testing. Gary W. Oehlert. January 28, School of Statistics University of Minnesota
Multiple Testing Gary W. Oehlert School of Statistics University of Minnesota January 28, 2016 Background Suppose that you had a 20-sided die. Nineteen of the sides are labeled 0 and one of the sides is
More informationAn Introduction to Multivariate Statistical Analysis
An Introduction to Multivariate Statistical Analysis Third Edition T. W. ANDERSON Stanford University Department of Statistics Stanford, CA WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION Contents
More informationMultiple comparisons - subsequent inferences for two-way ANOVA
1 Multiple comparisons - subsequent inferences for two-way ANOVA the kinds of inferences to be made after the F tests of a two-way ANOVA depend on the results if none of the F tests lead to rejection of
More informationOn Consistency of Tests for Stationarity in Autoregressive and Moving Average Models of Different Orders
American Journal of Theoretical and Applied Statistics 2016; 5(3): 146-153 http://www.sciencepublishinggroup.com/j/ajtas doi: 10.11648/j.ajtas.20160503.20 ISSN: 2326-8999 (Print); ISSN: 2326-9006 (Online)
More informationRedacted for Privacy David B. Duncan
AN ABSTRACT OF THE THESIS OF Gene A. Penne llo for the degree of Doctor of Philosophy in Statistics presented on September 28, 1993. Title: Multiple Comparisons for the Balanced Two-Way Factorial: An Applied
More informationMore Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order Restriction
Sankhyā : The Indian Journal of Statistics 2007, Volume 69, Part 4, pp. 700-716 c 2007, Indian Statistical Institute More Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order
More informationMultiple Comparison Methods for Means
SIAM REVIEW Vol. 44, No. 2, pp. 259 278 c 2002 Society for Industrial and Applied Mathematics Multiple Comparison Methods for Means John A. Rafter Martha L. Abell James P. Braselton Abstract. Multiple
More informationA consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation
Ann. Hum. Genet., Lond. (1975), 39, 141 Printed in Great Britain 141 A consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation BY CHARLES F. SING AND EDWARD D.
More informationChap The McGraw-Hill Companies, Inc. All rights reserved.
11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview
More informationAnalysis of variance
Analysis of variance 1 Method If the null hypothesis is true, then the populations are the same: they are normal, and they have the same mean and the same variance. We will estimate the numerical value
More informationDisadvantages of using many pooled t procedures. The sampling distribution of the sample means. The variability between the sample means
Stat 529 (Winter 2011) Analysis of Variance (ANOVA) Reading: Sections 5.1 5.3. Introduction and notation Birthweight example Disadvantages of using many pooled t procedures The analysis of variance procedure
More informationAdvanced Experimental Design
Advanced Experimental Design Topic Four Hypothesis testing (z and t tests) & Power Agenda Hypothesis testing Sampling distributions/central limit theorem z test (σ known) One sample z & Confidence intervals
More informationEvaluation requires to define performance measures to be optimized
Evaluation Basic concepts Evaluation requires to define performance measures to be optimized Performance of learning algorithms cannot be evaluated on entire domain (generalization error) approximation
More informationSTAT 515 fa 2016 Lec Statistical inference - hypothesis testing
STAT 515 fa 2016 Lec 20-21 Statistical inference - hypothesis testing Karl B. Gregory Wednesday, Oct 12th Contents 1 Statistical inference 1 1.1 Forms of the null and alternate hypothesis for µ and p....................
More informationStatistical Data Analysis Stat 3: p-values, parameter estimation
Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,
More informationN J SS W /df W N - 1
One-Way ANOVA Source Table ANOVA MODEL: ij = µ* + α j + ε ij H 0 : µ = µ =... = µ j or H 0 : Σα j = 0 Source Sum of Squares df Mean Squares F J Between Groups nj( j * ) J - SS B /(J ) MS B /MS W = ( N
More informationApplication of Variance Homogeneity Tests Under Violation of Normality Assumption
Application of Variance Homogeneity Tests Under Violation of Normality Assumption Alisa A. Gorbunova, Boris Yu. Lemeshko Novosibirsk State Technical University Novosibirsk, Russia e-mail: gorbunova.alisa@gmail.com
More informationAn Old Research Question
ANOVA An Old Research Question The impact of TV on high-school grade Watch or not watch Two groups The impact of TV hours on high-school grade Exactly how much TV watching would make difference Multiple
More informationStatistical Inference: Estimation and Confidence Intervals Hypothesis Testing
Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire
More informationMarcia Gumpertz and Sastry G. Pantula Department of Statistics North Carolina State University Raleigh, NC
A Simple Approach to Inference in Random Coefficient Models March 8, 1988 Marcia Gumpertz and Sastry G. Pantula Department of Statistics North Carolina State University Raleigh, NC 27695-8203 Key Words
More informationA Test of Homogeneity Against Umbrella Scale Alternative Based on Gini s Mean Difference
J. Stat. Appl. Pro. 2, No. 2, 145-154 (2013) 145 Journal of Statistics Applications & Probability An International Journal http://dx.doi.org/10.12785/jsap/020207 A Test of Homogeneity Against Umbrella
More information20.0 Experimental Design
20.0 Experimental Design Answer Questions 1 Philosophy One-Way ANOVA Egg Sample Multiple Comparisons 20.1 Philosophy Experiments are often expensive and/or dangerous. One wants to use good techniques that
More informationWeek 14 Comparing k(> 2) Populations
Week 14 Comparing k(> 2) Populations Week 14 Objectives Methods associated with testing for the equality of k(> 2) means or proportions are presented. Post-testing concepts and analysis are introduced.
More informationYour schedule of coming weeks. One-way ANOVA, II. Review from last time. Review from last time /22/2004. Create ANOVA table
Your schedule of coming weeks One-way ANOVA, II 9.07 //00 Today: One-way ANOVA, part II Next week: Two-way ANOVA, parts I and II. One-way ANOVA HW due Thursday Week of May Teacher out of town all week
More informationA semi-bayesian study of Duncan's Bayesian multiple
A semi-bayesian study of Duncan's Bayesian multiple comparison procedure Juliet Popper Shaer, University of California, Department of Statistics, 367 Evans Hall # 3860, Berkeley, CA 94704-3860, USA February
More informationOrthogonal contrasts and multiple comparisons
BIOL 933 Lab 4 Fall 2017 Orthogonal contrasts Class comparisons in R Trend analysis in R Multiple mean comparisons Orthogonal contrasts and multiple comparisons Orthogonal contrasts Planned, single degree-of-freedom
More informationOnline publication date: 22 March 2010
This article was downloaded by: [South Dakota State University] On: 25 March 2010 Access details: Access Details: [subscription number 919556249] Publisher Taylor & Francis Informa Ltd Registered in England
More informationConfidence Intervals of the Simple Difference between the Proportions of a Primary Infection and a Secondary Infection, Given the Primary Infection
Biometrical Journal 42 (2000) 1, 59±69 Confidence Intervals of the Simple Difference between the Proportions of a Primary Infection and a Secondary Infection, Given the Primary Infection Kung-Jong Lui
More informationLarge Sample Properties of Estimators in the Classical Linear Regression Model
Large Sample Properties of Estimators in the Classical Linear Regression Model 7 October 004 A. Statement of the classical linear regression model The classical linear regression model can be written in
More informationStatistical Inference
Statistical Inference Classical and Bayesian Methods Class 6 AMS-UCSC Thu 26, 2012 Winter 2012. Session 1 (Class 6) AMS-132/206 Thu 26, 2012 1 / 15 Topics Topics We will talk about... 1 Hypothesis testing
More informationFirst we look at some terms to be used in this section.
8 Hypothesis Testing 8.1 Introduction MATH1015 Biostatistics Week 8 In Chapter 7, we ve studied the estimation of parameters, point or interval estimates. The construction of CI relies on the sampling
More information9 One-Way Analysis of Variance
9 One-Way Analysis of Variance SW Chapter 11 - all sections except 6. The one-way analysis of variance (ANOVA) is a generalization of the two sample t test to k 2 groups. Assume that the populations of
More informationEC2001 Econometrics 1 Dr. Jose Olmo Room D309
EC2001 Econometrics 1 Dr. Jose Olmo Room D309 J.Olmo@City.ac.uk 1 Revision of Statistical Inference 1.1 Sample, observations, population A sample is a number of observations drawn from a population. Population:
More informationSTAT22200 Spring 2014 Chapter 5
STAT22200 Spring 2014 Chapter 5 Yibi Huang April 29, 2014 Chapter 5 Multiple Comparisons Chapter 5-1 Chapter 5 Multiple Comparisons Note the t-tests and C.I. s are constructed assuming we only do one test,
More informationDefense Technical Information Center Compilation Part Notice
UNCLASSIFIED Defense Technical Information Center Compilation Part Notice ADP014614 TITLE: Precision of Simultaneous Measurement Procedures DISTRIBUTION: Approved for public release, distribution unlimited
More informationGeneral Linear Models (GLM) for Fixed Factors
Chapter 224 General Linear Models (GLM) for Fixed Factors Introduction This procedure performs analysis of variance (ANOVA) and analysis of covariance (ANCOVA) for factorial models that include fixed factors
More informationTesting Equality of Two Intercepts for the Parallel Regression Model with Non-sample Prior Information
Testing Equality of Two Intercepts for the Parallel Regression Model with Non-sample Prior Information Budi Pratikno 1 and Shahjahan Khan 2 1 Department of Mathematics and Natural Science Jenderal Soedirman
More informationSimultaneous Estimation in a Restricted Linear Model*
journal of multivariate analysis 61, 6166 (1997) article no. MV971657 Simultaneous Estimation in a Restricted Linear Model* C. Rueda, B. Salvador, and M. A. Ferna ndez Universidad de Valladolid, Spain
More informationT.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS
ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only
More informationChapter 11. Correlation and Regression
Chapter 11. Correlation and Regression The word correlation is used in everyday life to denote some form of association. We might say that we have noticed a correlation between foggy days and attacks of
More informationDesign of Screening Experiments with Partial Replication
Design of Screening Experiments with Partial Replication David J. Edwards Department of Statistical Sciences & Operations Research Virginia Commonwealth University Robert D. Leonard Department of Information
More information