Journal of Educational and Behavioral Statistics

Size: px
Start display at page:

Download "Journal of Educational and Behavioral Statistics"

Transcription

1 Journal of Educational and Behavioral Statistics Theory of Estimation and Testing of Effect Sizes: Use in Meta-Analysis Helena Chmura Kraemer JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS : 93 DOI: / The online version of this article can be found at: Published on behalf of American Educational Research Association and Additional services and information for Journal of Educational and Behavioral Statistics can be found at: Alerts: Subscriptions: Reprints: Permissions: Citations: >> Version of Record - Jan 1, 1983 What is This?

2 Journal of Educational Statistics Summer 1983, Volume 8, Number 2,pp THEORY OF ESTIMATION AND TESTING OF EFFECT SIZES: USE IN META-ANALYSIS HELENA CHMURA KRAEMER Stanford University Key words: Meta-analysis, Research Synthesis, Effect Sizes ABSTRACT. Approximations to the distribution of a common form of effect size are presented. Single sample tests, confidence interval formulation, tests of homogeneity and pooling procedures are based on these approximations. Caveats are presented concerning statistical procedures as applied to sample effect sizes commonly used in meta-analysis. When policy decisions are based on research, such decisions are generally not based on one research study, but on the consensus of many, nor on statistical significance alone, but on evaluation of practical significance (i.e., cost-effectiveness) as well. In recent years there has been growing emphasis on systematic syntheses of the results of research studies, meta-analysis (Glass, McGaw, & Smith, 1981), based on the use of quantitative measures indicating practical as well as statistical significance: effect sizes. The application of such methods has, in many ways, preceded the development of sound mathematical theory necessary to implement its application. Consequently, there is much controversy and doubt about the validity of results based on meta-analysis. The focus here is on the most frequently used sample effect size: d= (x E -x c )/S, where x E is the mean response of n E subjects in an experimental group, x c that of n c subjects in a separate control group, and S 2 is a sample variance. Generally, it is assumed that x Ei ~%(ii E9 o ) 9 /= 1,2,...,*, x C/ ~9l(/x c,a<?), /= 1,2,...,w c, 2 2 Further, S 2 is an estimate of a 2, independent of x E and x c with Under these assumptions (Winer, 1971), [Np(l-p)V /2 d~t> r ([Np(l-p)y 2 8), 93

3 94 Helena Chmura Kraemer where N = n E + n c is the total sample size; p = n E /N reflects balance; = di E jti c )/a is the population effect size; and ^(X) represents a noncentral /-distribution with v degrees of freedom and noncentrality parameter X. If S 2 is the pooled within-group variance, v N 2. If S 2 is based on the control group alone, v = n c 1. The mathematical form of the noncentral / distribution is known, and has been tabled (Resnikoff & Lieberman, 1957). Use of either the exact form or tables as a basis of statistical analysis of effect sizes is extremely difficult. However, when n E and n c both exceed about 10, accurate approximations to the noncentral / distribution are available and provide an accessible approach to applications. When N is small, or when groups sizes are disparate, or when S 2 is injudiciously chosen, distribution theory is highly sensitive to departures from the assumptions (Scheffe, 1959). Interpretation of 8 also becomes problematic (Kraemer & Andrews, 1982). Finally, the approximation procedures are of doubtful accuracy. For all these reasons, attention is here restricted only to the situation in which N>20, when A^p<.6 and when S 2 is the pooled within-group variance (v = N 2). To test the null hypothesis H 0 : 8 = 0 versus 8 > 0, one would reject H 0 at the a-level of significance if [Np(l-p)V /2 d>^a, where t v a is the upper a-level critical value of the / distribution with v degrees of freedom. Since computation of effect size is usually done when one has an a priori reason to believe 8 > 0, this particular test is minimally important. More useful are procedures, in any single study, to: 1. Test the null hypothesis H Q : 8 <8 0 versus A: 8 > 8 0, and compute the power of this test; 2. compute a confidence interval for 8; or, in meta-analysis, to: 3. test the homogeneity of 8,, 8 2,...,8 m ; 4. obtain a pooled estimate and compute a confidence interval for 8 on the basis of a set of sample effect sizes d x,d 2,...,d m \ and 5. to examine statistically those factors producing heterogeneity of effect sizes. Approximation to the Distribution of Sample Effect Size One approximation with little mathematical justification (cf. Hedges, 1982a) is: [Np(l - p)v /2 (d - S) ^91(0,1).

4 Theory of Estimation and Testing of Effect Sizes 95 Yet this approximation underlies many of the statistical procedures commonly used in meta-analysis (e.g., Hsu, 1980). It should be noted that 1. rrjv.(^/2) 1/2 r((.-i)/2) E { d ) ~ W ) Hence d always overestimates 8 (Hedges, 1981). 2. For large sample sizes, var(</)~- 1 + ^ p{\~p) 2 (Hedges, 1981). Thus, not only is the variance consistently underestimated, but the variance is not independent of 8. For this reason, applications of test procedures to sample effect sizes such as / tests, analysis of variance, or linear regression, which assume homoscedasticity, are of questionable validity. 3. The distribution of d is both skewed and heavy tailed and is here approximated by the normal distribution which is neither. Better approximations are the Johnson-Welch procedure (1940) and the Kraemer-Paik procedure (1979). The Johnson-Welch procedure describes a normal approximation to the noncentral / distribution which, applied to effect size, would yield:?r{d^d) «Pr{z>[tf/>(l ~ P)Y /2 (D - 8)/[l +Z> 2 /2/] 1/2 }, where /= v/np{\ p) (/«4). This procedure justifies tests such as those proposed by Hedges (1982a, 1982b, 1982c). More accurate for small noncentrality parameters, and more useful in this context, is the Kraemer-Paik procedure (1979). Applied to effect sizes, this procedure indicates that if and r = d/(d*+f) l/l, p = «/(«2 +/) 1/2 then w(r, p) = u (r p)/(l rp) is approximately distributed according to the null distribution of the product moment correlation coefficient; that is, v x / 2 u/{\-u 2 ) X/2 ~t v. Thus percentile points of w(r, p), say C v P, where Pr{ W (r,p)>c^}=/>,

5 96 Helena Chmura Kraemer are tabled (Fisher & Yates, 1957). Alternatively, since c v, P = t^p/(tl P + p) l/ \ percentile points of w(r, p) may be computed from tables of the / distribution. Furthermore, since w(r, p) is distributed as is the product moment correlation coefficient, Fisher's z transformation; that is, z(u) = 2^(1^) = tanh ~ lw is both a variance-stabilizing and a normalizing transformation. Since then z(u) = z(r) - z(p), Computing percentile points is somewhat less accurate and more tedious using this transformation, rather than using C v P directly, but there are many other applications in which having a variance independent of the mean is crucial. Single Sample Tests Consider the null hypothesis H 0 : 8 < S 0 versus A: 8 > 8 0. One would reject the null hypothesis at the a level of significance if: where z(r)^%(z(p), v^- [ ). u{r,p 0 )^C va, that is, if or in terms of d: PO = V(«O 2 +/)' /2 ; 'MQ,«+ Po)/(i + Q,«Po)> <*>«;.. + po)/[(i - Q 2, a )(i - PDY /2. The power of this test at 5, > 8 0 can easily be estimated, for Power(5,) = Pr{r > (C,, + p 0 )/(1 + C v, apo ) p = p,} = Pr{«(r, Pl) > «[(<;, + Po)/(l + C,, apo ), p,] p = p,} = Pr{«, (r, Pl) > (C,, a - A)/(l - C,, A) p = p,}, where A = (Pi -Po)/(l -PiPo)-

6 Theory of Estimation and Testing of Effect Sizes 97 Thus, since r'/ 2 «(r,p.)/[l-«2 (^.Pi)],/2 ~^ Power(S,) = Vr{t, > v^(c a - A)/[(l - C*.)(l - A 2 )] 1/2 ). Table I presents values of A as a function of v and P. One notes that to detect a separation between effect sizes yielding A =.4, one needs approximately a sample size of 60 for 95 percent power, about 50 for 90 percent power, about 35 for 80 percent power, and so forth. For a sample size of 20 (v = 18) one has less than an even chance of detecting such a separation. The import of this observation is clear only if one realizes how the metric A reflects separation between effect sizes. In Table II are presented A for paired values of S 0 < 8,. A separation between effect size S 0 = 0 and 8 } =.8, that is, between null effect size and one quite large as compared to those TABLE I Table of A to Achieve a Power of P with v Degrees of Freedom (N = v 4-2) T^ % % TABLE II A as a Function of8 0,8 l (/ = 4) \ *

7 98 Helena Chmura Kraemer reported in the literature, yields A =.371. Consequently, with a sample size of 20, one would have less than an even chance of discriminating no effect from a large effect. Once again, from yet another viewpoint, there is serious difficulty in using effect sizes when sample sizes are small. Confidence Interval for 8 Since w(r, p) has a distribution independent of p, one- or two-sided confidence intervals for p and hence for 5=/'/V0-p 2 ) 1/2 are readily obtained. For example, where Thus or r = d/(di+f) l/ \ Pr{ W (r,p)<q, tt }~l-«. Pr{p > (r - C)/(l - rc)} «1 - a, C = C va9 Pr{«^/! / 2 (r - C)/[(l - r 2 )(l - C 2 )] 1/2 } ~ 1 - a. It can readily be verified that for small N, confidence intervals for S will be very wide. For example, if one observed d = 1.0 (r =.45) for N = 20 {v = 18, C.Q5-38), the one-tailed 95 percent confidence interval for 8 will be 8>2(.4S-.38) = 1? (.80 X.86) 1/2 A Test of Homogeneity/Pooling Effect Sizes If d l9 d 2,...,d m are independent sample effect sizes (i.e., based on different samples), one might wish to test whether they all estimate the same effect size, that is, H 0 : S x =8 2 = = 8 m (cf. Hedges, 1982a, 1982b). Because it is assumed that/«4 for all included effect sizes, this null hypothesis is equivalent to H 0 : p x p 2 ' P m where P, = V(«, 2 + 4)' /2. This, then, becomes a test of homogeneity of correlation coefficients. One estimates the common p, under the null hypothesis, by p where The test statistic is 2M - 1) m *=2(',-i)(*(/;)-*0>)) 2,

8 Theory of Estimation and Testing of Effect Sizes 99 which, under i/ 0, has approximately a x^distribution with (m - 1) degrees of freedom (Kraemer, 1975, 1979). Furthermore, then the pooled estimate of 8 is 8 where 5 = 2p/(l-p 2 ),/2. Then z(f>) ~ 9t(z(p), \/m.(v - 1)), where p = 2,-^/m.. On this basis, tests or confidence intervals for p are readily formulated. Note that $ is not a weighted average old x,d 2,...,d m and that 8 itself is not normally distributed. These are important points to consider in meta-analysis because pooling effect sizes is usually implemented by using a weighted average of d x,..., d m say 4* = 2«,-4/2«,-» where co, are positive weights based on sample sizes. This statistic is asymptotically normally distributed if the sample size underlying each d i is large or if m, the number of studies, is large, but the number of subjects per study and the number of different research studies is rarely large enough to warrent using asymptotic theory. If all the studies are small, even asymptotically as the number of studies increases, d u is biased. However, as Hedges points out (1981), one might replace d i by an unbiased estimate of 8. Even then, however, the variance of d i9 is not independent of the unknown 8. Obtaining valid confidence intervals or tests based on d u under these circumstances is indeed a problem, particularly when sample sizes are small. General Applications There are many other statistical questions related to use of sample effect sizes in meta-analysis. One might compile effect sizes for each of g interventions and wish to compare these using t tests or analysis of variance. One might examine characteristics of the studies yielding different effect sizes (size of class, intensity or duration of intervention, length of follow-up, etc.) and wish to assess the influence of such factors on effect size using Multiple Linear Regression. All such evaluations are questionable when applied to sample effect sizes directly (cf. Hedges, 1982c). The above statistical considerations, however, suggest a strategy to implement such procedures, namely: (1) Studies with group size less than 10 or which are seriously unbalanced (p <.4, p >.6) should be set aside. Such studies may be valid for purposes of testing, but estimation of effect sizes from such studies, for reasons detailed above, are problematic. (2) Only one effect size per study can be used to ensure independence.

9 100 Helena Chmura Kraemer (3) All remaining effect sizes are transformed as follows: r l = rf / /(rf l 2 + 4) 1/2 f *,- = *(*/) Analytic procedures are then applied, not to d i9 but to z,. The effect of this transformation is to attenuate size. When d t is small, there is little change; that is, z, «d r When d t =.80, for example, z,- =.78. When d i = 2.0, z, = 1.8. When d t = 10.0, z,. = Sample effect size, </,., has a skew distribution with heavy tails; z, has approximately a normal distribution with mean z(p) 9 (p = 5/(^2 + 4) 1/2 ) and variance equal (*> 1) _1. Hedges and Olkin (1981) suggest a somewhat different variance stabilizing transformation, one essentially based on the Johnson-Welch approximation to the distribution of d. They suggest (in the balanced case) using with Here we suggest using: with z H (d) = j2smh- l (d\2jl) z H (d)~%(z lf (S),l/N). z K (d) = tanh- 1 (rf/y / </ 2 + 4), z K (d)^%(z K (S),l/(N-3)). In Table III are presented values of Z H and Z K for effect sizes d 0 to 2.0. For the typical range of effect sizes, only for relatively small sample size will results based on the two approaches differ. In any one study, because effect sizes are generally small, use of either transformation rather than d will make little difference. However, the relatively TABLE III Comparison of Two Variance Stabilizing Transformations for d: z K,z H. d Z k Z H !

10 Theory of Estimation and Testing of Effect Sizes 101 small errors incurred by using d rather than z K or z H, since they tend to be in the same direction, cumulate over studies in a meta-analysis and do not cancel each other out. As a result, there may be major impact on what inferences are drawn from meta-analysis and ultimately what recommendations are based thereon. References Fisher, R. A., & Yates, F. Statistical tables. London: Oliver and Boyd, Glass, G. V, Primary, secondary and meta-analysis of research. Educational Researcher, 1976,5,3-8. Glass, G. V, McGaw, B., & Smith, M. L. Meta-analysis in social research. Beverly Hills: Sage, Hedges, L. V. Distribution theory for Glass's estimator of effect size and related estimators. Journal of Educational Statistics, 1981, 6(2), Hedges, L. V. Estimation and testing for differences in effect size: Comment on Hsu. Psychological Bulletin, 1982, 97, (a) Hedges, L. V. Estimation of effect size from a series of independent experiments. Psychological Bulletin, 1982, 92, (b) Hedges, L. V. Fitting categorical models to effect sizes from a series of experiments. Journal of Educational Statistics, 1982, 7, (c) Hedges, L. V., & Olkin, I. Clustering estimates of effect magnitude from independent studies (Technical Report No. 173). Stanford, Calif: Stanford University, April, Hsu, L. M. Tests of differences in /^-levels as tests for differences in effect size. Psychological Bulletin, 1980, 88, Johnson, N. L., & Welch, B. L. Applications of the noncentral /-distribution. Biometrika, 1940,57, Kraemer, H. C. On estimation and hypothesis testing problems for correlation coefficients. Psychometrika, 1975, 40(4), Kraemer, H. C. Tests of homogeneity of independent correlation coefficients. Psychometrika, 1979, 44(3), Kraemer, H. C, & Andrews, G. A non-parametric technique for meta-analysis effect size calculation. Psychological Bulletin, 1982, 97(2), Kraemer, H. C, & Paik, M. A central / approximation to the noncentral /-distribution. Technometrics, 1979, 27(3), Resnikoff, G. J., & Lieberman, G. J. Tables of the noncentral t-distribution. Stanford. Calif.: Stanford University Press, Scheffe, H. The analysis of variance. New York: John Wiley & Sons, Winer, B. J. Statistical principles in experimental design (2d ed.). New York: McGraw-Hill Book Company, Author KRAEMER, HELENA CHMURA. Associate Professor of Biostatistics in Psychiatry. Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, California, Specializations: Statistical applications in bio-behavioral areas, particularly of correlation techniques.

Estimation of Effect Size From a Series of Experiments Involving Paired Comparisons

Estimation of Effect Size From a Series of Experiments Involving Paired Comparisons Journal of Educational Statistics Fall 1993, Vol 18, No. 3, pp. 271-279 Estimation of Effect Size From a Series of Experiments Involving Paired Comparisons Robert D. Gibbons Donald R. Hedeker John M. Davis

More information

HYPOTHESIS TESTING. Hypothesis Testing

HYPOTHESIS TESTING. Hypothesis Testing MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.

More information

Using Power Tables to Compute Statistical Power in Multilevel Experimental Designs

Using Power Tables to Compute Statistical Power in Multilevel Experimental Designs A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to

More information

r(equivalent): A Simple Effect Size Indicator

r(equivalent): A Simple Effect Size Indicator r(equivalent): A Simple Effect Size Indicator The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Rosenthal, Robert, and

More information

Workshop on Statistical Applications in Meta-Analysis

Workshop on Statistical Applications in Meta-Analysis Workshop on Statistical Applications in Meta-Analysis Robert M. Bernard & Phil C. Abrami Centre for the Study of Learning and Performance and CanKnow Concordia University May 16, 2007 Two Main Purposes

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

A Monte Carlo Simulation of the Robust Rank- Order Test Under Various Population Symmetry Conditions

A Monte Carlo Simulation of the Robust Rank- Order Test Under Various Population Symmetry Conditions Journal of Modern Applied Statistical Methods Volume 12 Issue 1 Article 7 5-1-2013 A Monte Carlo Simulation of the Robust Rank- Order Test Under Various Population Symmetry Conditions William T. Mickelson

More information

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner

More information

A nonparametric two-sample wald test of equality of variances

A nonparametric two-sample wald test of equality of variances University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 211 A nonparametric two-sample wald test of equality of variances David

More information

Another Look at the Confidence Intervals for the Noncentral T Distribution

Another Look at the Confidence Intervals for the Noncentral T Distribution Journal of Modern Applied Statistical Methods Volume 6 Issue 1 Article 11 5-1-007 Another Look at the Confidence Intervals for the Noncentral T Distribution Bruno Lecoutre Centre National de la Recherche

More information

S Abelman * Keywords: Multivariate analysis of variance (MANOVA), hypothesis testing.

S Abelman * Keywords: Multivariate analysis of variance (MANOVA), hypothesis testing. S Afr Optom 2006 65 (2) 62 67 A p p l i c a t i o n o f m u l t i v a r i a t e a n a l y s i s o f v a r i - a n c e ( M A N O VA ) t o d i s t a n c e r e f r a c t i v e v a r i - a b i l i t y a n

More information

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between 7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation

More information

Probabilistic Hindsight: A SAS Macro for Retrospective Statistical Power Analysis

Probabilistic Hindsight: A SAS Macro for Retrospective Statistical Power Analysis 1 Paper 0-6 Probabilistic Hindsight: A SAS Macro for Retrospective Statistical Power Analysis Kristine Y. Hogarty and Jeffrey D. Kromrey Department of Educational Measurement and Research, University of

More information

A Monte-Carlo study of asymptotically robust tests for correlation coefficients

A Monte-Carlo study of asymptotically robust tests for correlation coefficients Biometrika (1973), 6, 3, p. 661 551 Printed in Great Britain A Monte-Carlo study of asymptotically robust tests for correlation coefficients BY G. T. DUNCAN AND M. W. J. LAYAKD University of California,

More information

ScienceDirect. Who s afraid of the effect size?

ScienceDirect. Who s afraid of the effect size? Available online at www.sciencedirect.com ScienceDirect Procedia Economics and Finance 0 ( 015 ) 665 669 7th International Conference on Globalization of Higher Education in Economics and Business Administration,

More information

Methodology Review: Applications of Distribution Theory in Studies of. Population Validity and Cross Validity. James Algina. University of Florida

Methodology Review: Applications of Distribution Theory in Studies of. Population Validity and Cross Validity. James Algina. University of Florida Distribution Theory 1 Methodology eview: Applications of Distribution Theory in Studies of Population Validity and Cross Validity by James Algina University of Florida and H. J. Keselman University of

More information

Lectures 5 & 6: Hypothesis Testing

Lectures 5 & 6: Hypothesis Testing Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number

More information

Two-Sample Inferential Statistics

Two-Sample Inferential Statistics The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is

More information

TWO-FACTOR AGRICULTURAL EXPERIMENT WITH REPEATED MEASURES ON ONE FACTOR IN A COMPLETE RANDOMIZED DESIGN

TWO-FACTOR AGRICULTURAL EXPERIMENT WITH REPEATED MEASURES ON ONE FACTOR IN A COMPLETE RANDOMIZED DESIGN Libraries Annual Conference on Applied Statistics in Agriculture 1995-7th Annual Conference Proceedings TWO-FACTOR AGRICULTURAL EXPERIMENT WITH REPEATED MEASURES ON ONE FACTOR IN A COMPLETE RANDOMIZED

More information

NEW APPROXIMATE INFERENTIAL METHODS FOR THE RELIABILITY PARAMETER IN A STRESS-STRENGTH MODEL: THE NORMAL CASE

NEW APPROXIMATE INFERENTIAL METHODS FOR THE RELIABILITY PARAMETER IN A STRESS-STRENGTH MODEL: THE NORMAL CASE Communications in Statistics-Theory and Methods 33 (4) 1715-1731 NEW APPROXIMATE INFERENTIAL METODS FOR TE RELIABILITY PARAMETER IN A STRESS-STRENGT MODEL: TE NORMAL CASE uizhen Guo and K. Krishnamoorthy

More information

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit

More information

On Selecting Tests for Equality of Two Normal Mean Vectors

On Selecting Tests for Equality of Two Normal Mean Vectors MULTIVARIATE BEHAVIORAL RESEARCH, 41(4), 533 548 Copyright 006, Lawrence Erlbaum Associates, Inc. On Selecting Tests for Equality of Two Normal Mean Vectors K. Krishnamoorthy and Yanping Xia Department

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Fall 213 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific

More information

Single Sample Means. SOCY601 Alan Neustadtl

Single Sample Means. SOCY601 Alan Neustadtl Single Sample Means SOCY601 Alan Neustadtl The Central Limit Theorem If we have a population measured by a variable with a mean µ and a standard deviation σ, and if all possible random samples of size

More information

CBA4 is live in practice mode this week exam mode from Saturday!

CBA4 is live in practice mode this week exam mode from Saturday! Announcements CBA4 is live in practice mode this week exam mode from Saturday! Material covered: Confidence intervals (both cases) 1 sample hypothesis tests (both cases) Hypothesis tests for 2 means as

More information

Distribution-Free Procedures (Devore Chapter Fifteen)

Distribution-Free Procedures (Devore Chapter Fifteen) Distribution-Free Procedures (Devore Chapter Fifteen) MATH-5-01: Probability and Statistics II Spring 018 Contents 1 Nonparametric Hypothesis Tests 1 1.1 The Wilcoxon Rank Sum Test........... 1 1. Normal

More information

THE 'IMPROVED' BROWN AND FORSYTHE TEST FOR MEAN EQUALITY: SOME THINGS CAN'T BE FIXED

THE 'IMPROVED' BROWN AND FORSYTHE TEST FOR MEAN EQUALITY: SOME THINGS CAN'T BE FIXED THE 'IMPROVED' BROWN AND FORSYTHE TEST FOR MEAN EQUALITY: SOME THINGS CAN'T BE FIXED H. J. Keselman Rand R. Wilcox University of Manitoba University of Southern California Winnipeg, Manitoba Los Angeles,

More information

Bootstrap Procedures for Testing Homogeneity Hypotheses

Bootstrap Procedures for Testing Homogeneity Hypotheses Journal of Statistical Theory and Applications Volume 11, Number 2, 2012, pp. 183-195 ISSN 1538-7887 Bootstrap Procedures for Testing Homogeneity Hypotheses Bimal Sinha 1, Arvind Shah 2, Dihua Xu 1, Jianxin

More information

A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED

A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED by W. Robert Reed Department of Economics and Finance University of Canterbury, New Zealand Email: bob.reed@canterbury.ac.nz

More information

A consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation

A consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation Ann. Hum. Genet., Lond. (1975), 39, 141 Printed in Great Britain 141 A consideration of the chi-square test of Hardy-Weinberg equilibrium in a non-multinomial situation BY CHARLES F. SING AND EDWARD D.

More information

Maximum-Likelihood Estimation: Basic Ideas

Maximum-Likelihood Estimation: Basic Ideas Sociology 740 John Fox Lecture Notes Maximum-Likelihood Estimation: Basic Ideas Copyright 2014 by John Fox Maximum-Likelihood Estimation: Basic Ideas 1 I The method of maximum likelihood provides estimators

More information

Inference in Normal Regression Model. Dr. Frank Wood

Inference in Normal Regression Model. Dr. Frank Wood Inference in Normal Regression Model Dr. Frank Wood Remember We know that the point estimator of b 1 is b 1 = (Xi X )(Y i Ȳ ) (Xi X ) 2 Last class we derived the sampling distribution of b 1, it being

More information

Inferences About the Difference Between Two Means

Inferences About the Difference Between Two Means 7 Inferences About the Difference Between Two Means Chapter Outline 7.1 New Concepts 7.1.1 Independent Versus Dependent Samples 7.1. Hypotheses 7. Inferences About Two Independent Means 7..1 Independent

More information

A Random Effects Model for Effect Sizes

A Random Effects Model for Effect Sizes Psychological Bulletin 1983, Vol. 93, No., 3 8-395 Copyright 1983 by the American Psychological Association, Inc. 0033-909/83/930-0388S00.75 A Random Effects Model for Effect Sizes Larry V. Hedges Department

More information

F n and theoretical, F 0 CDF values, for the ordered sample

F n and theoretical, F 0 CDF values, for the ordered sample Material E A S E 7 AMPTIAC Jorge Luis Romeu IIT Research Institute Rome, New York STATISTICAL ANALYSIS OF MATERIAL DATA PART III: ON THE APPLICATION OF STATISTICS TO MATERIALS ANALYSIS Introduction This

More information

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017 Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent

More information

Psicológica ISSN: Universitat de València España

Psicológica ISSN: Universitat de València España Psicológica ISSN: 0211-2159 psicologica@uv.es Universitat de València España Zimmerman, Donald W.; Zumbo, Bruno D. Hazards in Choosing Between Pooled and Separate- Variances t Tests Psicológica, vol. 30,

More information

Statistics: CI, Tolerance Intervals, Exceedance, and Hypothesis Testing. Confidence intervals on mean. CL = x ± t * CL1- = exp

Statistics: CI, Tolerance Intervals, Exceedance, and Hypothesis Testing. Confidence intervals on mean. CL = x ± t * CL1- = exp Statistics: CI, Tolerance Intervals, Exceedance, and Hypothesis Lecture Notes 1 Confidence intervals on mean Normal Distribution CL = x ± t * 1-α 1- α,n-1 s n Log-Normal Distribution CL = exp 1-α CL1-

More information

Neuendorf MANOVA /MANCOVA. Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y4. Like ANOVA/ANCOVA:

Neuendorf MANOVA /MANCOVA. Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y4. Like ANOVA/ANCOVA: 1 Neuendorf MANOVA /MANCOVA Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y1 Y2 Y3 Y4 Like ANOVA/ANCOVA: 1. Assumes equal variance (equal covariance matrices) across cells (groups defined by

More information

Confidence Interval Estimation

Confidence Interval Estimation Department of Psychology and Human Development Vanderbilt University 1 Introduction 2 3 4 5 Relationship to the 2-Tailed Hypothesis Test Relationship to the 1-Tailed Hypothesis Test 6 7 Introduction In

More information

Neuendorf MANOVA /MANCOVA. Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y4. Like ANOVA/ANCOVA:

Neuendorf MANOVA /MANCOVA. Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y4. Like ANOVA/ANCOVA: 1 Neuendorf MANOVA /MANCOVA Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y1 Y2 Y3 Y4 Like ANOVA/ANCOVA: 1. Assumes equal variance (equal covariance matrices) across cells (groups defined by

More information

Distribution Theory. Comparison Between Two Quantiles: The Normal and Exponential Cases

Distribution Theory. Comparison Between Two Quantiles: The Normal and Exponential Cases Communications in Statistics Simulation and Computation, 34: 43 5, 005 Copyright Taylor & Francis, Inc. ISSN: 0361-0918 print/153-4141 online DOI: 10.1081/SAC-00055639 Distribution Theory Comparison Between

More information

On Testing for Uniformity of Fit in Regression: An Econometric Case Study

On Testing for Uniformity of Fit in Regression: An Econometric Case Study On Testing for Uniformity of Fit in Regression: An Econometric Case Study JOHN L. PRATSCHKE* IT is frequently necessary to test regression results for uniformity of fit i.e. to test the randomness of the

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny April 6 Patrick Breheny University of Iowa to Biostatistics (BIOS 4120) 1 / 36 Our next several lectures will deal with two-sample inference for continuous

More information

Outline. PubH 5450 Biostatistics I Prof. Carlin. Confidence Interval for the Mean. Part I. Reviews

Outline. PubH 5450 Biostatistics I Prof. Carlin. Confidence Interval for the Mean. Part I. Reviews Outline Outline PubH 5450 Biostatistics I Prof. Carlin Lecture 11 Confidence Interval for the Mean Known σ (population standard deviation): Part I Reviews σ x ± z 1 α/2 n Small n, normal population. Large

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS Page 1 MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level

More information

A Cautionary Note on Estimating the Reliability of a Mastery Test with the Beta-Binomial Model

A Cautionary Note on Estimating the Reliability of a Mastery Test with the Beta-Binomial Model A Cautionary Note on Estimating the Reliability of a Mastery Test with the Beta-Binomial Model Rand R. Wilcox University of Southern California Based on recently published papers, it might be tempting

More information

Exam 2 (KEY) July 20, 2009

Exam 2 (KEY) July 20, 2009 STAT 2300 Business Statistics/Summer 2009, Section 002 Exam 2 (KEY) July 20, 2009 Name: USU A#: Score: /225 Directions: This exam consists of six (6) questions, assessing material learned within Modules

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

The exact bootstrap method shown on the example of the mean and variance estimation

The exact bootstrap method shown on the example of the mean and variance estimation Comput Stat (2013) 28:1061 1077 DOI 10.1007/s00180-012-0350-0 ORIGINAL PAPER The exact bootstrap method shown on the example of the mean and variance estimation Joanna Kisielinska Received: 21 May 2011

More information

The Chi-Square and F Distributions

The Chi-Square and F Distributions Department of Psychology and Human Development Vanderbilt University Introductory Distribution Theory 1 Introduction 2 Some Basic Properties Basic Chi-Square Distribution Calculations in R Convergence

More information

THE EFFECTS OF NONNORMAL DISTRIBUTIONS ON CONFIDENCE INTERVALS AROUND THE STANDARDIZED MEAN DIFFERENCE: BOOTSTRAP AND PARAMETRIC CONFIDENCE INTERVALS

THE EFFECTS OF NONNORMAL DISTRIBUTIONS ON CONFIDENCE INTERVALS AROUND THE STANDARDIZED MEAN DIFFERENCE: BOOTSTRAP AND PARAMETRIC CONFIDENCE INTERVALS EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT 10.1177/0013164404264850 KELLEY THE EFFECTS OF NONNORMAL DISTRIBUTIONS ON CONFIDENCE INTERVALS AROUND THE STANDARDIZED MEAN DIFFERENCE: BOOTSTRAP AND PARAMETRIC

More information

Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption

Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption Application of Parametric Homogeneity of Variances Tests under Violation of Classical Assumption Alisa A. Gorbunova and Boris Yu. Lemeshko Novosibirsk State Technical University Department of Applied Mathematics,

More information

Inferential statistics

Inferential statistics Inferential statistics Inference involves making a Generalization about a larger group of individuals on the basis of a subset or sample. Ahmed-Refat-ZU Null and alternative hypotheses In hypotheses testing,

More information

Chapter 7 Comparison of two independent samples

Chapter 7 Comparison of two independent samples Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N

More information

NONPARAMETRIC TESTS. LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-12

NONPARAMETRIC TESTS. LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-12 NONPARAMETRIC TESTS LALMOHAN BHAR Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-1 lmb@iasri.res.in 1. Introduction Testing (usually called hypothesis testing ) play a major

More information

PROGRAM STATISTICS RESEARCH

PROGRAM STATISTICS RESEARCH An Alternate Definition of the ETS Delta Scale of Item Difficulty Paul W. Holland and Dorothy T. Thayer @) PROGRAM STATISTICS RESEARCH TECHNICAL REPORT NO. 85..64 EDUCATIONAL TESTING SERVICE PRINCETON,

More information

Confidence Intervals for the Process Capability Index C p Based on Confidence Intervals for Variance under Non-Normality

Confidence Intervals for the Process Capability Index C p Based on Confidence Intervals for Variance under Non-Normality Malaysian Journal of Mathematical Sciences 101): 101 115 2016) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal homepage: http://einspem.upm.edu.my/journal Confidence Intervals for the Process Capability

More information

INTRODUCTION TO ANALYSIS OF VARIANCE

INTRODUCTION TO ANALYSIS OF VARIANCE CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two

More information

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Fundamentals to Biostatistics Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Statistics collection, analysis, interpretation of data development of new

More information

Chapter 8 Handout: Interval Estimates and Hypothesis Testing

Chapter 8 Handout: Interval Estimates and Hypothesis Testing Chapter 8 Handout: Interval Estimates and Hypothesis esting Preview Clint s Assignment: aking Stock General Properties of the Ordinary Least Squares (OLS) Estimation Procedure Estimate Reliability: Interval

More information

Biostat Methods STAT 5820/6910 Handout #9a: Intro. to Meta-Analysis Methods

Biostat Methods STAT 5820/6910 Handout #9a: Intro. to Meta-Analysis Methods Biostat Methods STAT 5820/6910 Handout #9a: Intro. to Meta-Analysis Methods Meta-analysis describes statistical approach to systematically combine results from multiple studies [identified follong an exhaustive

More information

POWER FUNCTION CHARTS FOR SPECIFICATION OF SAMPLE SIZE IN ANALYSIS OF VARIANCE

POWER FUNCTION CHARTS FOR SPECIFICATION OF SAMPLE SIZE IN ANALYSIS OF VARIANCE PSYCHOMETRIKA--VOL. 23, NO. 3 SEPTEMBER, 1958 POWER FUNCTION CHARTS FOR SPECIFICATION OF SAMPLE SIZE IN ANALYSIS OF VARIANCE LEONARD S. FELDT STATE UNIVERSITY OF IOWA AND 5~OHARRAM W. MAHMOUD EGYPTIAN

More information

Types of Statistical Tests DR. MIKE MARRAPODI

Types of Statistical Tests DR. MIKE MARRAPODI Types of Statistical Tests DR. MIKE MARRAPODI Tests t tests ANOVA Correlation Regression Multivariate Techniques Non-parametric t tests One sample t test Independent t test Paired sample t test One sample

More information

PIRLS 2016 Achievement Scaling Methodology 1

PIRLS 2016 Achievement Scaling Methodology 1 CHAPTER 11 PIRLS 2016 Achievement Scaling Methodology 1 The PIRLS approach to scaling the achievement data, based on item response theory (IRT) scaling with marginal estimation, was developed originally

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at Some Applications of Exponential Ordered Scores Author(s): D. R. Cox Source: Journal of the Royal Statistical Society. Series B (Methodological), Vol. 26, No. 1 (1964), pp. 103-110 Published by: Wiley

More information

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 16 Introduction

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 16 Introduction Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 16 Introduction ReCap. Parts I IV. The General Linear Model Part V. The Generalized Linear Model 16 Introduction 16.1 Analysis

More information

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). Basic Statistics There are three types of error: 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). 2. Systematic error - always too high or too low

More information

Bayesian Estimation of Prediction Error and Variable Selection in Linear Regression

Bayesian Estimation of Prediction Error and Variable Selection in Linear Regression Bayesian Estimation of Prediction Error and Variable Selection in Linear Regression Andrew A. Neath Department of Mathematics and Statistics; Southern Illinois University Edwardsville; Edwardsville, IL,

More information

Tutorial 1: Power and Sample Size for the One-sample t-test. Acknowledgements:

Tutorial 1: Power and Sample Size for the One-sample t-test. Acknowledgements: Tutorial 1: Power and Sample Size for the One-sample t-test Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported in large part by the National

More information

A Hypothesis Test for the End of a Common Source Outbreak

A Hypothesis Test for the End of a Common Source Outbreak Johns Hopkins University, Dept. of Biostatistics Working Papers 9-20-2004 A Hypothesis Test for the End of a Common Source Outbreak Ron Brookmeyer Johns Hopkins Bloomberg School of Public Health, Department

More information

On the Triangle Test with Replications

On the Triangle Test with Replications On the Triangle Test with Replications Joachim Kunert and Michael Meyners Fachbereich Statistik, University of Dortmund, D-44221 Dortmund, Germany E-mail: kunert@statistik.uni-dortmund.de E-mail: meyners@statistik.uni-dortmund.de

More information

Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances

Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances Preface Power is the probability that a study will reject the null hypothesis. The estimated probability is a function

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

Two-by-two ANOVA: Global and Graphical Comparisons Based on an Extension of the Shift Function

Two-by-two ANOVA: Global and Graphical Comparisons Based on an Extension of the Shift Function Journal of Data Science 7(2009), 459-468 Two-by-two ANOVA: Global and Graphical Comparisons Based on an Extension of the Shift Function Rand R. Wilcox University of Southern California Abstract: When comparing

More information

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions Journal of Modern Applied Statistical Methods Volume 8 Issue 1 Article 13 5-1-2009 Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny November 11 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data

More information

IMPACT OF ALTERNATIVE DISTRIBUTIONS ON QUANTILE-QUANTILE NORMALITY PLOT

IMPACT OF ALTERNATIVE DISTRIBUTIONS ON QUANTILE-QUANTILE NORMALITY PLOT Colloquium Biometricum 45 2015, 67 78 IMPACT OF ALTERNATIVE DISTRIBUTIONS ON QUANTILE-QUANTILE NORMALITY PLOT Zofia Hanusz, Joanna Tarasińska Department of Applied Mathematics and Computer Science University

More information

Research Note: A more powerful test statistic for reasoning about interference between units

Research Note: A more powerful test statistic for reasoning about interference between units Research Note: A more powerful test statistic for reasoning about interference between units Jake Bowers Mark Fredrickson Peter M. Aronow August 26, 2015 Abstract Bowers, Fredrickson and Panagopoulos (2012)

More information

TEST POWER IN COMPARISON DIFFERENCE BETWEEN TWO INDEPENDENT PROPORTIONS

TEST POWER IN COMPARISON DIFFERENCE BETWEEN TWO INDEPENDENT PROPORTIONS TEST POWER IN COMPARISON DIFFERENCE BETWEEN TWO INDEPENDENT PROPORTIONS Mehmet MENDES PhD, Associate Professor, Canakkale Onsekiz Mart University, Agriculture Faculty, Animal Science Department, Biometry

More information

Two Measurement Procedures

Two Measurement Procedures Test of the Hypothesis That the Intraclass Reliability Coefficient is the Same for Two Measurement Procedures Yousef M. Alsawalmeh, Yarmouk University Leonard S. Feldt, University of lowa An approximate

More information

Paper Robust Effect Size Estimates and Meta-Analytic Tests of Homogeneity

Paper Robust Effect Size Estimates and Meta-Analytic Tests of Homogeneity Paper 27-25 Robust Effect Size Estimates and Meta-Analytic Tests of Homogeneity Kristine Y. Hogarty and Jeffrey D. Kromrey Department of Educational Measurement and Research, University of South Florida

More information

Analysis of variance

Analysis of variance Analysis of variance Andrew Gelman March 4, 2006 Abstract Analysis of variance (ANOVA) is a statistical procedure for summarizing a classical linear model a decomposition of sum of squares into a component

More information

Jerome Kaltenhauser and Yuk Lee

Jerome Kaltenhauser and Yuk Lee Correlation Coefficients for Binary Data In Factor Analysis Jerome Kaltenhauser and Yuk Lee The most commonly used factor analytic models employ a symmetric matrix of correlation coefficients as input.

More information

SOME ASPECTS OF MULTIVARIATE BEHRENS-FISHER PROBLEM

SOME ASPECTS OF MULTIVARIATE BEHRENS-FISHER PROBLEM SOME ASPECTS OF MULTIVARIATE BEHRENS-FISHER PROBLEM Junyong Park Bimal Sinha Department of Mathematics/Statistics University of Maryland, Baltimore Abstract In this paper we discuss the well known multivariate

More information

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006 Hypothesis Testing Part I James J. Heckman University of Chicago Econ 312 This draft, April 20, 2006 1 1 A Brief Review of Hypothesis Testing and Its Uses values and pure significance tests (R.A. Fisher)

More information

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics

Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics A short review of the principles of mathematical statistics (or, what you should have learned in EC 151).

More information

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 9 Inferences from Two Samples 9-1 Overview 9-2 Inferences About Two Proportions 9-3

More information

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y

More information

A Practitioner s Guide to Cluster-Robust Inference

A Practitioner s Guide to Cluster-Robust Inference A Practitioner s Guide to Cluster-Robust Inference A. C. Cameron and D. L. Miller presented by Federico Curci March 4, 2015 Cameron Miller Cluster Clinic II March 4, 2015 1 / 20 In the previous episode

More information

Neuendorf MANOVA /MANCOVA. Model: MAIN EFFECTS: X1 (Factor A) X2 (Factor B) INTERACTIONS : X1 x X2 (A x B Interaction) Y4. Like ANOVA/ANCOVA:

Neuendorf MANOVA /MANCOVA. Model: MAIN EFFECTS: X1 (Factor A) X2 (Factor B) INTERACTIONS : X1 x X2 (A x B Interaction) Y4. Like ANOVA/ANCOVA: 1 Neuendorf MANOVA /MANCOVA Model: MAIN EFFECTS: X1 (Factor A) X2 (Factor B) Y1 Y2 INTERACTIONS : Y3 X1 x X2 (A x B Interaction) Y4 Like ANOVA/ANCOVA: 1. Assumes equal variance (equal covariance matrices)

More information

Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances. Acknowledgements:

Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances. Acknowledgements: Tutorial 3: Power and Sample Size for the Two-sample t-test with Equal Variances Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported in

More information

BIOL 4605/7220 CH 20.1 Correlation

BIOL 4605/7220 CH 20.1 Correlation BIOL 4605/70 CH 0. Correlation GPT Lectures Cailin Xu November 9, 0 GLM: correlation Regression ANOVA Only one dependent variable GLM ANCOVA Multivariate analysis Multiple dependent variables (Correlation)

More information

Outline. Possible Reasons. Nature of Heteroscedasticity. Basic Econometrics in Transportation. Heteroscedasticity

Outline. Possible Reasons. Nature of Heteroscedasticity. Basic Econometrics in Transportation. Heteroscedasticity 1/25 Outline Basic Econometrics in Transportation Heteroscedasticity What is the nature of heteroscedasticity? What are its consequences? How does one detect it? What are the remedial measures? Amir Samimi

More information

Upon completion of this chapter, you should be able to:

Upon completion of this chapter, you should be able to: 1 Chaptter 7:: CORRELATIION Upon completion of this chapter, you should be able to: Explain the concept of relationship between variables Discuss the use of the statistical tests to determine correlation

More information

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008 MIT OpenCourseWare http://ocw.mit.edu.830j / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

y ˆ i = ˆ  T u i ( i th fitted value or i th fit) 1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information