Abstract Title Page Title: A method for improving power in cluster randomized experiments by using prior information about the covariance structure.

Size: px
Start display at page:

Download "Abstract Title Page Title: A method for improving power in cluster randomized experiments by using prior information about the covariance structure."

Transcription

1 Abstract Title Page Title: A method for improving power in cluster randomized experiments by using prior information about the covariance structure. Author(s): Chris Rhoads, University of Connecticut. 1

2 2 Abstract Body Background/context: Description of prior research, its intellectual context and its policy context. The educational research community has become quite aware in recent years that experiments which randomize entire clusters (e.g. schools) to treatments result in estimates of the average treatment effect which are less precise than estimates from individually randomized experiments with the same total number of subjects. (Raudenbush, Martinez and Spybrook, 2007; Bloom, Richburg-Hayes and Black, 2007). In particular, we let σb 2 represent the within treatment group variance between school mean test scores and σw 2 represent the variance of individual test scores around the school means. We define σt 2 = σ2 b + σ2 w and ρ = σ2 b. The σt 2 quantity ρ is referred to as the intra-class correlation coefficient (ICC). Then the variance in a cluster randomized design is inflated by a factor of 1 + (n 1)ρ (where n is the sample size within each cluster). We assume n to be the same within each cluster for simplicity of exposition. Less well known is the fact that a second penalty is paid when clusters are randomized. Not only is variance inflated, but there are also many fewer degrees of freedom available to estimate that variance. The two penalties that are paid when one chooses cluster randomization were the subject of a short but widely cited paper by Cornfield (1978). Specifically, Cornfield noted that if there are 2m clusters in the experiment, divided equally between treatment and control groups, then the usual t test has only 2m 2 degrees of freedom when clusters are randomized. In contrast, when individuals are randomized the usual t test has 2mn 2 degrees of freedom. In situations where cluster randomization cannot be avoided it is impossible to avoid a treatment effect estimator with increased variance. However, Blair and Higgins (1986) pointed out that if the ICC is known prior to conducting the experiment, generalized least squares (GLS) methods can be used and Cornfield s second penalty can be avoided. Thus, when the ICC is known ahead of time, a test of the treatment effect can be conducted with 2mn 2 rather than 2m 2 degrees of freedom. Konstantopoulos (forthcoming) has made the same point with regard to experimental designs with two levels of nesting. While the methods proposed by Blair and Higgins (1986) and Konstantopoulos (forthcoming) are intriguing ways to boost power, they can only be used if the exact value of the ICC (or ICCs in the case of a three level design) is known prior to the experiment. In many situations some estimator of the ICC will be available from data external to the main experiment. For example, a pilot study may provide information about the ICC. However, the pilot study may not be very large and so the available estimate of the ICC will be subject to considerable imprecision due to sampling variance. In a situation like this it is not reasonable to assume that the ICC is known exactly. However, it would still be useful to use the prior information about the ICC in order to improve the power of the experiment. Utilizing an estimate of the ICC to improve power is the subject of two papers by Blitstein and colleagues: Blitstein, Hannan, Murray and Shadish (2005), and Blitstein, Murray, Hannan and Shadish (2005). These papers describe a method for utilizing external information about ρ when the external estimator of ρ is subject to sampling variance (so that ρ is not known exactly).

3 3 Purpose / objective / research question / focus of study: Description of what the research focused on and why At SREE 2010 I presented bounds on the amount of power improvement that can be achieved if we have information about the ICC ahead of time as compared to when we have no prior information. I also discussed some of the flaws with the method of df described in Blitstein, Hannan, Murray and Shadish (2005), and Blitstein, Murray, Hannan and Shadish (2005). In the current paper I illustrate a new method for incorporating prior information about the ICC into an analysis of a cluster randomized experiment with the objective of increasing power. The method can be used with multi-level designs with any number of levels (although its application is greatly simplified if there is only a single level of nesting). Unlike previous methods, the current procedure can be applied in a way that guarantees that type I error rates are at their nominal level. The design situations where it would be useful to utilize prior information about the ICC are explored in some detail. Significance / novelty of study: Description of what is missing in previous work and the contribution the study makes. Given the large number of schools (and hence, expense) necessary to do well powered experiments that randomize schools to treatments (Raudenbush, Martinez and Spybrook, 2007) it is natural for the educational research community to attempt to find methods that help achieve sufficient power with fewer schools. Utilizing prior information about the ICC holds out the promise of being one such method. However, the methods described in Blair and Higgins (1986) and Konstantopoulos (forthcoming) apply only when the ICC is known exactly prior to the experiment. When only an estimate of the ICC is known prior to the study, the method of df cannot guarantee that type I error rates will be controlled at the nominal level. Furthermore, existing methods for incorporating prior information about the correlation structure into an analysis have only dealt with certain designs (specifically, designs with no more than two random effects and all random effects nested). The current method can both (a) guarantee the correct type I error rate when prior information about the covariance structure is subject to sampling error; and (b) is broadly applicable to designs with any sort of covariance structure whatsoever (provided, of course, that prior information about this covariance structure is available). Statistical, Measurement, or Econometric Model: Description of the proposed new methods or novel applications of existing methods. I consider first the case of a cluster randomized design with a single level of nesting. Denote the test statistic that is used when ρ is known exactly prior to the study as t GLS (ρ). I consider the use of this test statistic with a prior estimate of ρ (denoted ˆρ ex ) substituted for ρ. Space constraints prevent a full exposition, but the basic steps in the method are as follows: (1) I note that conditional on ˆρ ex, t GLS (ˆρ ex ) is approximately distributed as a constant k times a t random variable with h degrees of freedom. The values of k and h are functions of both ρ and ˆρ ex. Hence, given any critical value, we can determine the conditional size of the test (which depends on ρ and ˆρ ex ). (2) Average the values of the conditional size over the sampling distribution of ˆρ ex to obtain an unconditional size which depends only on ρ. Denote this unconditional

4 4 size as α U (ρ). I note that if we make the usual normality assumptions about the distribution of the random effects in the model then the sampling distribution of ˆρ ex is a simple transformation of the F distribution. (3) Use a root finding algorithm to determine the critical value that satisfies the condition that max ρ [0,1] α U (ρ) = α. Using this critical value results in a level α test of the desired null hypothesis. The method can be easily modified for use with any sort of assumed correlation structure for the data. In particular, assume that the data y from the experiment can be reasonably modeled by a multivariate normal distribution with known correlation matrix V. Assume that we have specified a linear model for the mean of y with parameter vector b. Then any linear hypothesis of the form H b = θ, can be tested by forming the appropriate test statistic which utilizes the GLS estimate for the vector b. The test statistic will depend on the value of V and will follow the F distribution under the null hypothesis. Denote this test statistic F GLS (V), and denote the test statistic with the prior estimate of the correlation matrix substituted for V as F GLS ( ˆV ex ). Assume that the correlation matrix V for data can be parameterized in terms of a vector ρ = (ρ 1,..., ρ q ) [0, 1] q and that a prior estimate of ρ, ˆρ ex, is available. Then the same three steps outlined above may be followed to derive a level α test of the desired null hypothesis. That is: (1) Given a critical value c we can compute the size of the test conditional on ρ and ˆρ ex. (2) We can average over the distribution of ˆρ ex to obtain the unconditional size. (3) We use a multivariate root finding algorithm to determine the critical value that satisfies the condition (1) maxρ [0,1] q α U(ρ) = α. The above presentation may be impractical to implement as it involves (a) deriving the distribution of the q dimensional vector ˆρ ex and (b) finding the root of the equation given in (1). The paper points out that frequently the difficulty can be avoided by identifying a sum of squares like quantity (call it SS un ) such that: (a) SS un has an expected mean square equal to the expected mean square of the the numerator of the F statistic, (b) the degrees of freedom of a test statistic utilizing SS un in the denominator are large enough that further increases in degrees of freedom would produce negligible changes in the critical value used for the test, and (c) the distribution of SS un depends on only one or two elements of the q dimensional vector ρ. Usefulness / Applicability of Method: Demonstration of the usefulness of the proposed methods using hypothetical or real data. I focus on results for the simple case of a cluster randomized experiment with only one level of nesting. It is desirable to know under which conditions can external information about ρ improve power. In particular, it is useful to know how precise the prior estimate

5 of ρ needs to be in order to improve power. Some results in this area are presented below. The test based on the method outlined above is referred to as the CEGLS test, n ex refers to the within cluster sample size in the data providing the prior estimate of ρ and m ex refers to the number of clusters that provide prior information about ρ. The GLS test is the test that would be conducted if ρ were known exactly prior to the experiment and the cluster means (CM) test is the test that would be conducted if prior information about the ICC were not used at all. Results assume that the restricted maximum likelihood estimator of ρ is used to obtain the prior estimate of ρ and that hypothesis tests are one-sided at level This last assumption was made since in most practical situations the power of the one sided test at level will correspond to the power of the two sided test at level However, by assuming one-sided tests we avoid certain odd results that might occur when there is substantial power in the lower tail of the distribution. Figure 2 displays the power improvement of the CEGLS test over the usual (CM) test as a function of the true value of the ICC when m = 6, n = 25, m ex = 10 and n ex = 5, 25, 100. The graph makes clear that in this case the CEGLS R test performs no better than the CM test, and when n ex = 5 it performs significantly worse. The next figure, figure 2, keeps the value of n at 25, the value of δ at 0.5 and again varies n ex from 5 to 100. However, now m has been changed to 3, and m ex = 30. Unlike figure 1 in this case we see substantial power improvements, especially for value of ρ less than 0.2. The exception is when n ex = 5, in which case power improvement is very slight and is even negative for very small value of ρ. Figure 3 sets m to 4 and n = n ex to 25. The effect size remains at 0.5 and α is still The value of m ex is varied from 5 to 50. We see that when our prior estimate of ρ is based on only 5 clusters it is better not to use this information at all. When our prior estimate is based on 10 clusters it is useful to have prior information, but power improvement is substantially less than when ρ is known exactly. When our prior estimate is based on 20 clusters or more power improvement is almost as much as it would have been if rho had been known exactly. The pattern seen in figures 1 and 2, where power improvement is much less when n ex < n appears to hold in general. This fact is confirmed in table 1, where the power of the CEGLS test that uses the REML estimate of ρ is reported for a variety of values of n, m, n ex and m ex. As long as the prior estimate of ρ is based on at least 20 clusters it is preferable to have external information unless n ex < n. In this case it is frequently the case that power is actually worse when the external information about the ICC is utilized. Conclusions: Description of conclusions and recommendations based on findings and overall study. The current paper presents a method for improving power in experiments with clustering when prior information about the covariance structure is available. The method controls type I error to the nominal level and can be used with any sort of assumed covariance structure. It is shown that when prior estimates of the ICC are very noisy the experimenter will often be better off ignoring this information altogether, rather than trying to use it in her analysis. Additionally, if the size of the clusters providing the prior estimate of the ICC are substantially smaller than the size of the clusters in the main experiment it is difficult to obtain power improvements via the use of prior ICC information. However, if prior estimates are based on a sample of 20 or more clusters of size 25 or more and the cluster size in the 5

6 6 main experiment and the prior data are not too discrepant it will usually be advantageous to utilize the prior ICC estimate.

7 7 Appendix A. References Appendices Not included in page count. References [1] Blair, R.C. and Higgins, J.J. (1986). Comment on Statistical power with group mean as the unit of analysis. Journal of Educational Statistics, 11, [2] Blitstein, J.L., Hannan, P.J., Murray, D.M. and Shadish, W.R. (2005a). Increasing the degrees of freedom in existing group randomized trials through the use of external estimates of the intraclass correlation: The df* approach. Evaluation Review, 29, [3] Blitstein, J.L., Murray, D.M.,Hannan, P.J. and Shadish, W.R. (2005b). Increasing the degrees of freedom in future group randomized trials through the use of external estimates of the intraclass correlation: The df* approach. Evaluation Review, 29, [4] Bloom, H. S., Richburg-Hayes, L., and Black, A. R. (2007). Using covariates to improve precision: Empirical guidelines for studies that randomize schools to measure the impacts of educational interventions. Educational Evaluation and Policy Analysis, 29, [5] Cornfield, J. (1978). Randomization by Group: a formal analysis. American Journal of Epidemiology, 108, [6] Konstantopoulos, S (forthcoming). Constructing a More Powerful Test in Three-Level Cluster Randomized Designs. Journal of Research on Educational Effectiveness. [7] Raudenbush, S., Martinez, A. and Spybrook, J. (2007). Strategies for improving precision in grouprandomized experiments. Educational Evaluation and Policy Analysis, 29(1), Appendix B. Tables and Figures

8 8 Figure 1. Power CEGLS R (GLS)-CM test, m = 6, n = 25, m ex = 10, α =.025, δ = 0.5, vary n ex

9 Figure 2. Power CEGLS R (GLS)-CM test, m = 3, n = 25, m ex = 30, α =.025, δ = 0.5, vary n ex 9

10 10 Figure 3. Power CEGLS R (GLS)-CM test, m = 4, n = n ex = 25, α =.025, δ = 0.5, vary m ex

11 mex = 10 mex = 20 mex = 50 mex = 100 m n nex= CEGLSR pow Improve crit val CEGLSR pow Improve crit val CEGLSR pow Improve crit val CEGLSR pow Improve crit val CEGLSR pow Improve crit val CEGLSR pow Improve crit val CEGLSR pow Improve crit val CEGLSR pow Improve crit val CEGLSR pow Improve crit val CEGLSR pow Improve crit val CEGLSR pow Improve crit val CEGLSR pow Improve crit val Table 1. CEGLSR power and improvement over CM test, δ = 0.5, ρ =.02, α =

Incorporating Cost in Power Analysis for Three-Level Cluster Randomized Designs

Incorporating Cost in Power Analysis for Three-Level Cluster Randomized Designs DISCUSSION PAPER SERIES IZA DP No. 75 Incorporating Cost in Power Analysis for Three-Level Cluster Randomized Designs Spyros Konstantopoulos October 008 Forschungsinstitut zur Zukunft der Arbeit Institute

More information

Statistical Power and Autocorrelation for Short, Comparative Interrupted Time Series Designs with Aggregate Data

Statistical Power and Autocorrelation for Short, Comparative Interrupted Time Series Designs with Aggregate Data Statistical Power and Autocorrelation for Short, Comparative Interrupted Time Series Designs with Aggregate Data Andrew Swanlund, American Institutes for Research (aswanlund@air.org) Kelly Hallberg, University

More information

Using Power Tables to Compute Statistical Power in Multilevel Experimental Designs

Using Power Tables to Compute Statistical Power in Multilevel Experimental Designs A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to the Practical Assessment, Research & Evaluation. Permission is granted to

More information

Assessing the Precision of Multisite Trials for Estimating the Parameters Of Cross-site Distributions of Program Effects. Howard S.

Assessing the Precision of Multisite Trials for Estimating the Parameters Of Cross-site Distributions of Program Effects. Howard S. Assessing the Precision of Multisite Trials for Estimating the Parameters Of Cross-site Distributions of Program Effects Howard S. Bloom MDRC Jessaca Spybrook Western Michigan University May 10, 016 Manuscript

More information

Abstract Title Page. Title: Degenerate Power in Multilevel Mediation: The Non-monotonic Relationship Between Power & Effect Size

Abstract Title Page. Title: Degenerate Power in Multilevel Mediation: The Non-monotonic Relationship Between Power & Effect Size Abstract Title Page Title: Degenerate Power in Multilevel Mediation: The Non-monotonic Relationship Between Power & Effect Size Authors and Affiliations: Ben Kelcey University of Cincinnati SREE Spring

More information

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models EPSY 905: Multivariate Analysis Spring 2016 Lecture #12 April 20, 2016 EPSY 905: RM ANOVA, MANOVA, and Mixed Models

More information

One-way ANOVA. Experimental Design. One-way ANOVA

One-way ANOVA. Experimental Design. One-way ANOVA Method to compare more than two samples simultaneously without inflating Type I Error rate (α) Simplicity Few assumptions Adequate for highly complex hypothesis testing 09/30/12 1 Outline of this class

More information

Three-Level Modeling for Factorial Experiments With Experimentally Induced Clustering

Three-Level Modeling for Factorial Experiments With Experimentally Induced Clustering Three-Level Modeling for Factorial Experiments With Experimentally Induced Clustering John J. Dziak The Pennsylvania State University Inbal Nahum-Shani The University of Michigan Copyright 016, Penn State.

More information

Introducing Generalized Linear Models: Logistic Regression

Introducing Generalized Linear Models: Logistic Regression Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and

More information

Statistics Introductory Correlation

Statistics Introductory Correlation Statistics Introductory Correlation Session 10 oscardavid.barrerarodriguez@sciencespo.fr April 9, 2018 Outline 1 Statistics are not used only to describe central tendency and variability for a single variable.

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011) Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October

More information

STT 843 Key to Homework 1 Spring 2018

STT 843 Key to Homework 1 Spring 2018 STT 843 Key to Homework Spring 208 Due date: Feb 4, 208 42 (a Because σ = 2, σ 22 = and ρ 2 = 05, we have σ 2 = ρ 2 σ σ22 = 2/2 Then, the mean and covariance of the bivariate normal is µ = ( 0 2 and Σ

More information

Assessing the relation between language comprehension and performance in general chemistry. Appendices

Assessing the relation between language comprehension and performance in general chemistry. Appendices Assessing the relation between language comprehension and performance in general chemistry Daniel T. Pyburn a, Samuel Pazicni* a, Victor A. Benassi b, and Elizabeth E. Tappin c a Department of Chemistry,

More information

STRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013

STRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013 STRUCTURAL EQUATION MODELING Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013 Introduction: Path analysis Path Analysis is used to estimate a system of equations in which all of the

More information

Hypothesis Testing hypothesis testing approach

Hypothesis Testing hypothesis testing approach Hypothesis Testing In this case, we d be trying to form an inference about that neighborhood: Do people there shop more often those people who are members of the larger population To ascertain this, we

More information

Maximum-Likelihood Estimation: Basic Ideas

Maximum-Likelihood Estimation: Basic Ideas Sociology 740 John Fox Lecture Notes Maximum-Likelihood Estimation: Basic Ideas Copyright 2014 by John Fox Maximum-Likelihood Estimation: Basic Ideas 1 I The method of maximum likelihood provides estimators

More information

Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression

Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression Association Between Variables Measured at the Interval-Ratio Level: Bivariate Correlation and Regression Last couple of classes: Measures of Association: Phi, Cramer s V and Lambda (nominal level of measurement)

More information

Stat472/572 Sampling: Theory and Practice Instructor: Yan Lu

Stat472/572 Sampling: Theory and Practice Instructor: Yan Lu Stat472/572 Sampling: Theory and Practice Instructor: Yan Lu 1 Chapter 5 Cluster Sampling with Equal Probability Example: Sampling students in high school. Take a random sample of n classes (The classes

More information

Multilevel/Mixed Models and Longitudinal Analysis Using Stata

Multilevel/Mixed Models and Longitudinal Analysis Using Stata Multilevel/Mixed Models and Longitudinal Analysis Using Stata Isaac J. Washburn PhD Research Associate Oregon Social Learning Center Summer Workshop Series July 2010 Longitudinal Analysis 1 Longitudinal

More information

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs) The One-Way Repeated-Measures ANOVA (For Within-Subjects Designs) Logic of the Repeated-Measures ANOVA The repeated-measures ANOVA extends the analysis of variance to research situations using repeated-measures

More information

PSYC 331 STATISTICS FOR PSYCHOLOGISTS

PSYC 331 STATISTICS FOR PSYCHOLOGISTS PSYC 331 STATISTICS FOR PSYCHOLOGISTS Session 4 A PARAMETRIC STATISTICAL TEST FOR MORE THAN TWO POPULATIONS Lecturer: Dr. Paul Narh Doku, Dept of Psychology, UG Contact Information: pndoku@ug.edu.gh College

More information

Lecture 18: Analysis of variance: ANOVA

Lecture 18: Analysis of variance: ANOVA Lecture 18: Announcements: Exam has been graded. See website for results. Lecture 18: Announcements: Exam has been graded. See website for results. Reading: Vasilj pp. 83-97. Lecture 18: Announcements:

More information

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong Statistics Primer ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong 1 Quick Overview of Statistics 2 Descriptive vs. Inferential Statistics Descriptive Statistics: summarize and describe data

More information

Supplemental Materials. In the main text, we recommend graphing physiological values for individual dyad

Supplemental Materials. In the main text, we recommend graphing physiological values for individual dyad 1 Supplemental Materials Graphing Values for Individual Dyad Members over Time In the main text, we recommend graphing physiological values for individual dyad members over time to aid in the decision

More information

Robust covariance estimator for small-sample adjustment in the generalized estimating equations: A simulation study

Robust covariance estimator for small-sample adjustment in the generalized estimating equations: A simulation study Science Journal of Applied Mathematics and Statistics 2014; 2(1): 20-25 Published online February 20, 2014 (http://www.sciencepublishinggroup.com/j/sjams) doi: 10.11648/j.sjams.20140201.13 Robust covariance

More information

Intraclass Correlations in One-Factor Studies

Intraclass Correlations in One-Factor Studies CHAPTER Intraclass Correlations in One-Factor Studies OBJECTIVE The objective of this chapter is to present methods and techniques for calculating the intraclass correlation coefficient and associated

More information

Longitudinal Data Analysis of Health Outcomes

Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis Workshop Running Example: Days 2 and 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development

More information

Impact of serial correlation structures on random effect misspecification with the linear mixed model.

Impact of serial correlation structures on random effect misspecification with the linear mixed model. Impact of serial correlation structures on random effect misspecification with the linear mixed model. Brandon LeBeau University of Iowa file:///c:/users/bleb/onedrive%20 %20University%20of%20Iowa%201/JournalArticlesInProgress/Diss/Study2/Pres/pres.html#(2)

More information

ANALYSIS OF CORRELATED DATA SAMPLING FROM CLUSTERS CLUSTER-RANDOMIZED TRIALS

ANALYSIS OF CORRELATED DATA SAMPLING FROM CLUSTERS CLUSTER-RANDOMIZED TRIALS ANALYSIS OF CORRELATED DATA SAMPLING FROM CLUSTERS CLUSTER-RANDOMIZED TRIALS Background Independent observations: Short review of well-known facts Comparison of two groups continuous response Control group:

More information

Introduction to Within-Person Analysis and RM ANOVA

Introduction to Within-Person Analysis and RM ANOVA Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides

More information

A Practitioner s Guide to Cluster-Robust Inference

A Practitioner s Guide to Cluster-Robust Inference A Practitioner s Guide to Cluster-Robust Inference A. C. Cameron and D. L. Miller presented by Federico Curci March 4, 2015 Cameron Miller Cluster Clinic II March 4, 2015 1 / 20 In the previous episode

More information

The Common Factor Model. Measurement Methods Lecture 15 Chapter 9

The Common Factor Model. Measurement Methods Lecture 15 Chapter 9 The Common Factor Model Measurement Methods Lecture 15 Chapter 9 Today s Class Common Factor Model Multiple factors with a single test ML Estimation Methods New fit indices because of ML Estimation method

More information

Clustered Measurement in Cluster Randomized Trials

Clustered Measurement in Cluster Randomized Trials in Cluster Randomized Trials Robert W. Department of Epidemiology, Biostatistics, and Occupational Health Department of Pediatrics University June 2009 Outline Background Cluster RCTs Intra-cluster correlation

More information

Sociology 593 Exam 1 Answer Key February 17, 1995

Sociology 593 Exam 1 Answer Key February 17, 1995 Sociology 593 Exam 1 Answer Key February 17, 1995 I. True-False. (5 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. A researcher regressed Y on. When

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Question. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not?

Question. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not? Hypothesis testing Question Very frequently: what is the possible value of μ? Sample: we know only the average! μ average. Random deviation or not? Standard error: the measure of the random deviation.

More information

Sample Size and Power Considerations for Longitudinal Studies

Sample Size and Power Considerations for Longitudinal Studies Sample Size and Power Considerations for Longitudinal Studies Outline Quantities required to determine the sample size in longitudinal studies Review of type I error, type II error, and power For continuous

More information

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review

More information

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t = 2. The distribution of t values that would be obtained if a value of t were calculated for each sample mean for all possible random of a given size from a population _ t ratio: (X - µ hyp ) t s x The result

More information

Psychology Seminar Psych 406 Dr. Jeffrey Leitzel

Psychology Seminar Psych 406 Dr. Jeffrey Leitzel Psychology Seminar Psych 406 Dr. Jeffrey Leitzel Structural Equation Modeling Topic 1: Correlation / Linear Regression Outline/Overview Correlations (r, pr, sr) Linear regression Multiple regression interpreting

More information

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Correlation. A statistics method to measure the relationship between two variables. Three characteristics Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction

More information

Hypothesis Testing hypothesis testing approach formulation of the test statistic

Hypothesis Testing hypothesis testing approach formulation of the test statistic Hypothesis Testing For the next few lectures, we re going to look at various test statistics that are formulated to allow us to test hypotheses in a variety of contexts: In all cases, the hypothesis testing

More information

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin Equivalency Test for Model Fit 1 Running head: EQUIVALENCY TEST FOR MODEL FIT An Equivalency Test for Model Fit Craig S. Wells University of Massachusetts Amherst James. A. Wollack Ronald C. Serlin University

More information

Likelihood-Based Methods

Likelihood-Based Methods Likelihood-Based Methods Handbook of Spatial Statistics, Chapter 4 Susheela Singh September 22, 2016 OVERVIEW INTRODUCTION MAXIMUM LIKELIHOOD ESTIMATION (ML) RESTRICTED MAXIMUM LIKELIHOOD ESTIMATION (REML)

More information

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables /4/04 Structural Equation Modeling and Confirmatory Factor Analysis Advanced Statistics for Researchers Session 3 Dr. Chris Rakes Website: http://csrakes.yolasite.com Email: Rakes@umbc.edu Twitter: @RakesChris

More information

A Significance Test for the Lasso

A Significance Test for the Lasso A Significance Test for the Lasso Lockhart R, Taylor J, Tibshirani R, and Tibshirani R Ashley Petersen May 14, 2013 1 Last time Problem: Many clinical covariates which are important to a certain medical

More information

Sample size determination for a binary response in a superiority clinical trial using a hybrid classical and Bayesian procedure

Sample size determination for a binary response in a superiority clinical trial using a hybrid classical and Bayesian procedure Ciarleglio and Arendt Trials (2017) 18:83 DOI 10.1186/s13063-017-1791-0 METHODOLOGY Open Access Sample size determination for a binary response in a superiority clinical trial using a hybrid classical

More information

T-TEST FOR HYPOTHESIS ABOUT

T-TEST FOR HYPOTHESIS ABOUT T-TEST FOR HYPOTHESIS ABOUT Previously we tested the hypothesis that a sample comes from a population with a specified using the normal distribution and a z-test. But the z-test required the population

More information

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1 Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

Statistics Handbook. All statistical tables were computed by the author.

Statistics Handbook. All statistical tables were computed by the author. Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance

More information

An inferential procedure to use sample data to understand a population Procedures

An inferential procedure to use sample data to understand a population Procedures Hypothesis Test An inferential procedure to use sample data to understand a population Procedures Hypotheses, the alpha value, the critical region (z-scores), statistics, conclusion Two types of errors

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

Lecture 6: Single-classification multivariate ANOVA (k-group( MANOVA)

Lecture 6: Single-classification multivariate ANOVA (k-group( MANOVA) Lecture 6: Single-classification multivariate ANOVA (k-group( MANOVA) Rationale and MANOVA test statistics underlying principles MANOVA assumptions Univariate ANOVA Planned and unplanned Multivariate ANOVA

More information

Applied Statistics Preliminary Examination Theory of Linear Models August 2017

Applied Statistics Preliminary Examination Theory of Linear Models August 2017 Applied Statistics Preliminary Examination Theory of Linear Models August 2017 Instructions: Do all 3 Problems. Neither calculators nor electronic devices of any kind are allowed. Show all your work, clearly

More information

Harvard University. The use of permutation tests for the analysis of parallel and stepped-wedge cluster randomized trials. Rui Wang Victor DeGruttola

Harvard University. The use of permutation tests for the analysis of parallel and stepped-wedge cluster randomized trials. Rui Wang Victor DeGruttola Harvard University Harvard University Biostatistics Working Paper Series Year 2016 Paper 205 The use of permutation tests for the analysis of parallel and stepped-wedge cluster randomized trials Rui Wang

More information

10/4/2013. Hypothesis Testing & z-test. Hypothesis Testing. Hypothesis Testing

10/4/2013. Hypothesis Testing & z-test. Hypothesis Testing. Hypothesis Testing & z-test Lecture Set 11 We have a coin and are trying to determine if it is biased or unbiased What should we assume? Why? Flip coin n = 100 times E(Heads) = 50 Why? Assume we count 53 Heads... What could

More information

Inverse-variance Weighted Average

Inverse-variance Weighted Average Fixed Effects Meta-Analysisand Homogeneity Evaluation Jeff Valentine University of Louisville Campbell Collaboration Annual Colloquium Oslo 2009 Inverse-variance Weighted Average All effect sizes are not

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

Simultaneous Equations and Weak Instruments under Conditionally Heteroscedastic Disturbances

Simultaneous Equations and Weak Instruments under Conditionally Heteroscedastic Disturbances Simultaneous Equations and Weak Instruments under Conditionally Heteroscedastic Disturbances E M. I and G D.A. P Michigan State University Cardiff University This version: July 004. Preliminary version

More information

APPENDICES TO Protest Movements and Citizen Discontent. Appendix A: Question Wordings

APPENDICES TO Protest Movements and Citizen Discontent. Appendix A: Question Wordings APPENDICES TO Protest Movements and Citizen Discontent Appendix A: Question Wordings IDEOLOGY: How would you describe your views on most political matters? Generally do you think of yourself as liberal,

More information

Applied Statistics for the Behavioral Sciences

Applied Statistics for the Behavioral Sciences Applied Statistics for the Behavioral Sciences Chapter 8 One-sample designs Hypothesis testing/effect size Chapter Outline Hypothesis testing null & alternative hypotheses alpha ( ), significance level,

More information

Appendix from L. J. Revell, On the Analysis of Evolutionary Change along Single Branches in a Phylogeny

Appendix from L. J. Revell, On the Analysis of Evolutionary Change along Single Branches in a Phylogeny 008 by The University of Chicago. All rights reserved.doi: 10.1086/588078 Appendix from L. J. Revell, On the Analysis of Evolutionary Change along Single Branches in a Phylogeny (Am. Nat., vol. 17, no.

More information

Chapter 4: Factor Analysis

Chapter 4: Factor Analysis Chapter 4: Factor Analysis In many studies, we may not be able to measure directly the variables of interest. We can merely collect data on other variables which may be related to the variables of interest.

More information

Strati cation in Multivariate Modeling

Strati cation in Multivariate Modeling Strati cation in Multivariate Modeling Tihomir Asparouhov Muthen & Muthen Mplus Web Notes: No. 9 Version 2, December 16, 2004 1 The author is thankful to Bengt Muthen for his guidance, to Linda Muthen

More information

Non-Inferiority Tests for the Ratio of Two Proportions in a Cluster- Randomized Design

Non-Inferiority Tests for the Ratio of Two Proportions in a Cluster- Randomized Design Chapter 236 Non-Inferiority Tests for the Ratio of Two Proportions in a Cluster- Randomized Design Introduction This module provides power analysis and sample size calculation for non-inferiority tests

More information

BIOS 312: Precision of Statistical Inference

BIOS 312: Precision of Statistical Inference and Power/Sample Size and Standard Errors BIOS 312: of Statistical Inference Chris Slaughter Department of Biostatistics, Vanderbilt University School of Medicine January 3, 2013 Outline Overview and Power/Sample

More information

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs)

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs) The One-Way Independent-Samples ANOVA (For Between-Subjects Designs) Computations for the ANOVA In computing the terms required for the F-statistic, we won t explicitly compute any sample variances or

More information

Poisson regression: Further topics

Poisson regression: Further topics Poisson regression: Further topics April 21 Overdispersion One of the defining characteristics of Poisson regression is its lack of a scale parameter: E(Y ) = Var(Y ), and no parameter is available to

More information

ScholarlyCommons. University of Pennsylvania. Rebecca A. Maynard University of Pennsylvania, Nianbo Dong

ScholarlyCommons. University of Pennsylvania. Rebecca A. Maynard University of Pennsylvania, Nianbo Dong University of Pennsylvania ScholarlyCommons GSE Publications Graduate School of Education 0 PowerUp!: A Tool for Calculating Minimum Detectable Effect Sizes and Minimum equired Sample Sizes for Experimental

More information

Heteroskedasticity. Part VII. Heteroskedasticity

Heteroskedasticity. Part VII. Heteroskedasticity Part VII Heteroskedasticity As of Oct 15, 2015 1 Heteroskedasticity Consequences Heteroskedasticity-robust inference Testing for Heteroskedasticity Weighted Least Squares (WLS) Feasible generalized Least

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

Inverse Sampling for McNemar s Test

Inverse Sampling for McNemar s Test International Journal of Statistics and Probability; Vol. 6, No. 1; January 27 ISSN 1927-7032 E-ISSN 1927-7040 Published by Canadian Center of Science and Education Inverse Sampling for McNemar s Test

More information

Model Estimation Example

Model Estimation Example Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions

More information

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate

More information

Adaptive designs beyond p-value combination methods. Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013

Adaptive designs beyond p-value combination methods. Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013 Adaptive designs beyond p-value combination methods Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013 Outline Introduction Combination-p-value method and conditional error function

More information

AR, MA and ARMA models

AR, MA and ARMA models AR, MA and AR by Hedibert Lopes P Based on Tsay s Analysis of Financial Time Series (3rd edition) P 1 Stationarity 2 3 4 5 6 7 P 8 9 10 11 Outline P Linear Time Series Analysis and Its Applications For

More information

ANOVA CIVL 7012/8012

ANOVA CIVL 7012/8012 ANOVA CIVL 7012/8012 ANOVA ANOVA = Analysis of Variance A statistical method used to compare means among various datasets (2 or more samples) Can provide summary of any regression analysis in a table called

More information

Analysis of Variance: Part 1

Analysis of Variance: Part 1 Analysis of Variance: Part 1 Oneway ANOVA When there are more than two means Each time two means are compared the probability (Type I error) =α. When there are more than two means Each time two means are

More information

This gives us an upper and lower bound that capture our population mean.

This gives us an upper and lower bound that capture our population mean. Confidence Intervals Critical Values Practice Problems 1 Estimation 1.1 Confidence Intervals Definition 1.1 Margin of error. The margin of error of a distribution is the amount of error we predict when

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Research Design: Topic 18 Hierarchical Linear Modeling (Measures within Persons) 2010 R.C. Gardner, Ph.d.

Research Design: Topic 18 Hierarchical Linear Modeling (Measures within Persons) 2010 R.C. Gardner, Ph.d. Research Design: Topic 8 Hierarchical Linear Modeling (Measures within Persons) R.C. Gardner, Ph.d. General Rationale, Purpose, and Applications Linear Growth Models HLM can also be used with repeated

More information

Repeated-Measures ANOVA in SPSS Correct data formatting for a repeated-measures ANOVA in SPSS involves having a single line of data for each

Repeated-Measures ANOVA in SPSS Correct data formatting for a repeated-measures ANOVA in SPSS involves having a single line of data for each Repeated-Measures ANOVA in SPSS Correct data formatting for a repeated-measures ANOVA in SPSS involves having a single line of data for each participant, with the repeated measures entered as separate

More information

Reports of the Institute of Biostatistics

Reports of the Institute of Biostatistics Reports of the Institute of Biostatistics No 02 / 2008 Leibniz University of Hannover Natural Sciences Faculty Title: Properties of confidence intervals for the comparison of small binomial proportions

More information

Downloaded from:

Downloaded from: Hossain, A; DiazOrdaz, K; Bartlett, JW (2017) Missing binary outcomes under covariate-dependent missingness in cluster randomised trials. Statistics in medicine. ISSN 0277-6715 DOI: https://doi.org/10.1002/sim.7334

More information

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline. Practitioner Course: Portfolio Optimization September 10, 2008 Before we define dependence, it is useful to define Random variables X and Y are independent iff For all x, y. In particular, F (X,Y ) (x,

More information

Design and analysis of trials with a partially nested design and a binary outcome measure

Design and analysis of trials with a partially nested design and a binary outcome measure Research Article Received: 10 April 2014, Accepted: 10 November 2015 Published online 15 December 2015 in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.6828 Design and analysis of trials

More information

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 Lecture 2: Linear Models Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector

More information

6-1. Canonical Correlation Analysis

6-1. Canonical Correlation Analysis 6-1. Canonical Correlation Analysis Canonical Correlatin analysis focuses on the correlation between a linear combination of the variable in one set and a linear combination of the variables in another

More information

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

Ron Heck, Fall Week 3: Notes Building a Two-Level Model Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level

More information

Peng Li * and David T Redden

Peng Li * and David T Redden Li and Redden BMC Medical Research Methodology (2015) 15:38 DOI 10.1186/s12874-015-0026-x RESEARCH ARTICLE Open Access Comparing denominator degrees of freedom approximations for the generalized linear

More information

The Estimation of Simultaneous Equation Models under Conditional Heteroscedasticity

The Estimation of Simultaneous Equation Models under Conditional Heteroscedasticity The Estimation of Simultaneous Equation Models under Conditional Heteroscedasticity E M. I and G D.A. P University of Alicante Cardiff University This version: April 2004. Preliminary and to be completed

More information

,..., θ(2),..., θ(n)

,..., θ(2),..., θ(n) Likelihoods for Multivariate Binary Data Log-Linear Model We have 2 n 1 distinct probabilities, but we wish to consider formulations that allow more parsimonious descriptions as a function of covariates.

More information

On Realized Volatility, Covariance and Hedging Coefficient of the Nikkei-225 Futures with Micro-Market Noise

On Realized Volatility, Covariance and Hedging Coefficient of the Nikkei-225 Futures with Micro-Market Noise On Realized Volatility, Covariance and Hedging Coefficient of the Nikkei-225 Futures with Micro-Market Noise Naoto Kunitomo and Seisho Sato October 22, 2008 Abstract For the estimation problem of the realized

More information

Are Forecast Updates Progressive?

Are Forecast Updates Progressive? CIRJE-F-736 Are Forecast Updates Progressive? Chia-Lin Chang National Chung Hsing University Philip Hans Franses Erasmus University Rotterdam Michael McAleer Erasmus University Rotterdam and Tinbergen

More information

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis

More information

Spatial inference. Spatial inference. Accounting for spatial correlation. Multivariate normal distributions

Spatial inference. Spatial inference. Accounting for spatial correlation. Multivariate normal distributions Spatial inference I will start with a simple model, using species diversity data Strong spatial dependence, Î = 0.79 what is the mean diversity? How precise is our estimate? Sampling discussion: The 64

More information

A simulation study for comparing testing statistics in response-adaptive randomization

A simulation study for comparing testing statistics in response-adaptive randomization RESEARCH ARTICLE Open Access A simulation study for comparing testing statistics in response-adaptive randomization Xuemin Gu 1, J Jack Lee 2* Abstract Background: Response-adaptive randomizations are

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information