Estimation and Centering

Size: px
Start display at page:

Download "Estimation and Centering"

Transcription

1 Estimation and Centering PSYED 3486 Feifei Ye University of Pittsburgh

2 Main Topics Estimating the level-1 coefficients for a particular unit Reading: R&B, Chapter 3 (p85-94) Centering-Location of X Reading Chapters 2 (p31-35) & 5 (p ) Kreft1995 Hofmann DA & Gavin MB (1998) Centering decisions in hierarchical linear models: Implications for research in organizations Journal of Management, 23, Estimation and Centering p1/32

3 General Two-Level HLM Model Level 1: p Y ij = β 0j + β kj X kij + r ij (1) k=1 r ij N(0,σ 2 ) (2) Level 2: β kj = γ k0 + q γ kl W lj + u kj, for k = 0,1,2,,p (3) l=1 Estimation and Centering p2/32

4 General Two-Level HLM Model u 0j u 1j u 2j u pj N , τ 00 τ 01 τ 02 τ 0p τ 10 τ 11 τ 12 τ 1p τ 20 τ 21 τ 22 τ 2p τ p0 τ p1 τ p2 τ pp (4) and independent between macro units and independent of r ij (5) Estimation and Centering p3/32

5 Types of Parameters Fixed effects (γs) Variance-covariance components σ 2 τ 00, τ 11, τ 01 etc Random level one coefficients (βs) Estimation and Centering p4/32

6 Methods of Estimation Generalized Least Squares for fixed effects (γs) Maximum Likelihood for variance-covariance components FIML: Full Maximum Likelihood One shortcoming of the FEML is that the estimates are conditional on point estimates of the fixed effects REML: Restricted Maximum Likelihood REML estimates adjust for uncertainty about the fixed effects; FIML estimates do not If J is large, the two procedures are often similar If J is small, the FIML estimates for variance components tend to be smaller than the REML estimates therefore, the FIML components are biased in small samples Empirical Bayes for the randomly varying level 1 coefficients (βs) Estimation and Centering p5/32

7 Reliability Definition Reliability = λ kj = Reliability( ˆ β kj ) = Variance of True Scores Variance of observed scores (6) τ kk τ kk + σ 2 /n j, for k = 0,1,,p (7) Estimation and Centering p6/32

8 Factors impacting reliability Larger amounts of between school variance (relative to within school variance) increase reliability Larger n j s give you higher reliability Larger ICCs increases reliability: Larger ICC means that within school group variance is small relative to between group variability The reliability will be higher when the group means vary substantially across level-2 units (holding constant the sample size per group) So increasing group size, increasing homogeneity within clusters, and increasing heterogeneity between clusters will increase reliability Very low reliabilities ( 10) may indicate that a random coefficient could be fixed in subsequent analyses (HLM 6 Users Manual, p 82) Or, 05 in R&B, page 125 Estimation and Centering p7/32

9 Consequence of Reliability for Empirical Bayes Estimation Different groups have different precision (reliability) in the estimation of the group mean and level-1 slopes EB estimates borrows strength from information contained in other units or groups EB estimates optimally combine two alternative but feasible estimates for the level-1 coefficients by weighting the estimates based on the reliability (precision) of the coefficients substantially across level-2 units (holding constant the sample size per group) Estimation and Centering p8/32

10 EB estimates EB estimate of β 0j for random effects ANOVA β0j = λ j Ȳ j + (1 λ j ) γˆ 00 (8) shrinkage estimators : the Y-bar estimate is shrunk towards the model-based estimate The higher the reliability, the less shrinkage; the lower the reliability, the more shrinkage Reliability of 1= No shrinkage; Reliability of 0 = Complete Shrinkage Estimation and Centering p9/32

11 Empirical Bayes Estimates EB estimate is based not only on its own data but also takes into account estimates for other groups and the characteristics groups share Shrinkage estimates of level 1 coefficients- EB shrinks the group specific estimate towards the overall mean (although when the overall mean is greater than the group specific estimate, the shrunken or empirical Bayes estimate may actually be greater than the group specific estimate) Lower Reliability = Greater Shrinkage Estimation and Centering p10/32

12 Empirical Bayes Estimates Useful in estimating parameters for a group with few within group observations Weighted average shifts the group specific estimate toward the mean for similar groups The less precise the group specific estimate and the less the variability observed across groups, the greater the shift towards the overall group mean a a From Rouxs Glossary of Multilevel Termshttp://wwwpahoorg/English/DD/AIS/be v24n4-multilevelhtm Estimation and Centering p11/32

13 Centering Variables Centering refers to the act of shifting the location of a predictor to some other value such as zero by adding or subtracting a constant In standard linear regression, centering variables leads to the same model (ie, same fit), but centering affects the interpretation of the lower-order terms In a model with only main effects, centering of predictors influences only the interpretation of the intercept In a model that includes interactions, centering of predictors also influences the interpretation of the main effects Estimation and Centering p12/32

14 Effects of Centering Impacts the estimation of parameters Affects their variances and covariances Alters their interpretation No simple answer or one way to always approach centering decisions Make decisions early, before the analysis, to avoid misinterpretation, misspecification of the model, or drawing conclusions that dont mirror the overall goals for the analysis Estimation and Centering p13/32

15 What is Centering Centering issues usually focus on how the level one explanatory variables are treated in the model Usually done for continuous IVs although it is possible to center dummy coded variables For level 1 predictor, there are typically three choices Raw-score or uncentered data (no transformation RAW): X ij Group-mean centering (centering within contexts CWC): X ij X j Grand-mean centering (centering around the grand mean CGM): X ij X Estimation and Centering p14/32

16 Raw Scores Maintaining the raw scale for level 1 predictors can be useful if the predictors have a natural and meaningful scale The intercept of a model (here β 0j ) is always interpreted as the expected value when all predictors are equal to zero If zero is outside of the logical score range of the predictor, the intercept will be difficult to interpret eg, SAT can serve as a predictor The intercept will be the expected outcome for a student in school j who has an SAT of zero Intercept parameter is meaningless because the lowest SAT score is 200 Correlation between the intercept and slope will tend toward -10 The intercept is essentially determined by the slope Schools with strong positive SAT-outcome slopes will tend to have very low intercepts Schools with negligible slopes will have higher intercepts Meaningful X value of zero: egl, drug dosage Estimation and Centering p15/32

17 Grand-mean Centering Under grand mean centering, we subtract the overall mean from each persons value of the IV: X ij X The intercept, β 0j, is the expected outcome for a subject whose value on X ij is equal to the grand mean This is the standard choice of location for X ij in the classical ANCOVA model Grand-mean centering yields an intercept that can be interpreted as an adjusted mean for group j β 0j = µ Yj β 1j ( X j X ) (9) Variance of intercept, τ 00, is the variance among the level 2 units in the adjusted means (adjusted for level one predictors) Estimation and Centering p16/32

18 Grand-mean Centering Y X = 0 GM X Estimation and Centering p17/32

19 Group-mean Centering Center the original predictors around their corresponding level 2 unit means: X ij X j The zero point for the predictor differs for each group Two individuals may have the same X ij, but X ij X j will be dependent on group mean The group intercept, β 0j, is the unadjusted mean for the group β 0j = µ Yj (10) Variance of intercept, τ 00, is the variance among the level 2 unit means (µ Yj ) Estimation and Centering p18/32

20 Group-mean Centering Y Group Means X = 0 X Predictions for each GROUP when X is at the GROUP MEAN Estimation and Centering p19/32

21 Group-mean Centering In organizational research (using HSB as an example), coefficient of interest is the pooled within-group relationship between math achievement and student SES We want to estimate the level-one relationship net of any group membership effects That means we want an unbiased estimate of the slope for SES on math achievement BUT: frog pond effect Estimation and Centering p20/32

22 Frog Pond Effects The overall SES for a school may also be related to the outcome of math achievement For the same frog, the effect of being in a pond with big frogs is different from the being in a pond with small frogs The effect of SES on math achievement is dependent on relative standing of the student within his or her own school Group mean centering addresses the frog pond effect, but it also changes the model in a complex way Estimation and Centering p21/32

23 Group-mean Centering We need to consider how the aggregate of the person-level SES is related to the outcome, even after controlling for the effect of individual SES Compositional effect: difference between the group or organizational level effect and the person-level effect When aggregate is added back in at level two, we can get estimates of both effects through group mean centering Group mean centering removes between group variability from the model (since we are using as predictors just the deviations from the group mean) Thus, to examine contextual effects we need to reintroduce the means at level two, otherwise the between and within group relationship is confounded (only looking at relative standing) Estimation and Centering p22/32

24 Within- and between-group effects Y between-group regression line regression line within group 2 regression line within group 3 regression line within group 1 X Estimation and Centering p23/32

25 Confounding of Within- and between-group effects Raw and Grand mean centering: within-group and between-group effects are confounded when Level 1 predictors are raw scale or grand-mean centered Combined model for grand-mean centering Y ij = γ 00 + γ 10 (X ij X ) + u 0j + r ij (11) = γ 00 + γ 10 (X ij X j + X j X ) + u 0j + r ij (12) = γ 00 + γ }{{} 10 (X ij X j ) + γ 10 ( }{{} X j X ) + u 0j + r ij within between (13) Estimation and Centering p24/32

26 Within- and between-group effects For raw and Grand mean centering, including the group mean as a predictor separates the between-group effect from the within group effect Level 1: Y ij = β 0j + β 1j (X ij X ) + r ij (14) Level 2: β 0j = γ 00 + γ 01 X j + u 0j (15) β 1j = γ 10 (16) Combined: Y ij = γ 00 + γ }{{} 10 (X ij X ) + γ 01 X j + u }{{} 0j + r ij β w β b β w β c = β b β w is called the context or compositional effect (17) Estimation and Centering p25/32

27 Within- and between-group effects For group mean centering, within and between group effects are orthogonal Including a group mean centered predictor will only decrease level 1 variance; including the group means will only decrease level 2 variance Level 1: Y ij = β 0j + β 1j (X ij X j ) + r ij (18) Level 2: β 0j = γ 00 + γ 01 X j + u 0j (19) β 1j = γ 10 (20) Combined: Y ij = γ 00 + γ }{{} 10 (X ij X j ) + γ }{{} 01 β w β b X j + u 0j + r ij (21) Estimation and Centering p26/32

28 Compositional Effects Compositional effect: when the aggregate of a person level predictor (here, MeanSES determined from the sample of children within each school) is related to the outcome Y, even after controlling for the effect of individual characteristics CWC CGM Y ( SES SES ) r ij 0 j 1 j ij j ij ( MeanSES) u 0 j j 0 j u 1 j 10 1 j Y ( SES SES ) r ij 0 j 1 j ij ij ( MeanSES) u 0 j j 0 j u 1 j 10 1 j how much does the organizational level effect (ie, between-groups) differ from the individual level effect (ie, within-groups)? γ cwc 01 = β between group γ cwc 10 = β within group (22) γ cgm 01 = γ cwc 01 γ cwc 10 = β compositional effect (23) Estimation and Centering p27/32

29 Summary of centering predictors Scaling of predictors has direct implications on the ability to disentangle between and within effects Raw-scale predictor Intercept represents the group mean when the predictor equals zero Intercept is only directly interpretable if the scale has a natural zero-point Grand-mean centered predictor Intercept represents the adjusted group mean, holding the predictor constant at the grand mean for all groups Group-mean centered predictor Intercept represents the unadjusted group mean Inclusion of group mean as level 2 predictors disentangles between and within group effects Estimation and Centering p28/32

30 Summary Centering decisions impact both the intercept and the slope estimates in level two (our examples will make this clear) Group mean centering with reintroduction of the aggregates at level two will always provide an unbiased estimate of the within group slope Need to carefully consider the needs of your analysis and your assumptions (theory) about how contextual variables are measured, how they relate to level one variables and your outcome Be aware of how your choice affects the model and the question you are trying to answer Estimation and Centering p29/32

31 Considerations Researchers tend to differ in how they approach centering primarily because the decision hinges on goals for the analysis Theres not one way thats correct All approaches are correct if they accommodate your theory regarding contextual effects of variables and address the research question you really want to be asking! Estimation and Centering p30/32

32 Some guidelines For raw score (uncentered models) generally the interest is more in effects of variables on individuals rather than group effects For centering without reintroducing the means (aggregates of level one variables) at level two, interest is generally for situations where that aggregate is not of interest, but other level two variables might be For centering with introduction of the aggregates at level two, interest is on separating out individual effects of a variable (like SES) from group-level effects of that variable (like MeanSES for a particular school) Estimation and Centering p31/32

33 Recommendation When to use raw scores or grand mean centering If you are more interested in the effects in individuals performance than in group effects eg, clustering is a nuisance factor When raw score does not allow a meaningful interpretation of the intercept, use grand mean centering When to use group mean centering Theory says that individual and group effects are separate For technical reasons Smaller correlation between random intercept and random slope Smaller correlation between level 1 and level 2 variables and cross-level interactions This will stabilize model (coefficients are more or less independent estimates) Estimation and Centering p32/32

Multilevel Modeling: A Second Course

Multilevel Modeling: A Second Course Multilevel Modeling: A Second Course Kristopher Preacher, Ph.D. Upcoming Seminar: February 2-3, 2017, Ft. Myers, Florida What this workshop will accomplish I will review the basics of multilevel modeling

More information

The Implications of Centering in a Three-Level Multilevel Model

The Implications of Centering in a Three-Level Multilevel Model University of Miami Scholarly Repository Open Access Dissertations Electronic Theses and Dissertations 2012-04-29 The Implications of Centering in a Three-Level Multilevel Model Ahnalee Brincks University

More information

Mixed Models II - Behind the Scenes Report Revised June 11, 2002 by G. Monette

Mixed Models II - Behind the Scenes Report Revised June 11, 2002 by G. Monette Mixed Models II - Behind the Scenes Report Revised June 11, 2002 by G. Monette What is a mixed model "really" estimating? Paradox lost - paradox regained - paradox lost again. "Simple example": 4 patients

More information

Multi-level Models: Idea

Multi-level Models: Idea Review of 140.656 Review Introduction to multi-level models The two-stage normal-normal model Two-stage linear models with random effects Three-stage linear models Two-stage logistic regression with random

More information

Hierarchical Linear Models (HLM) Using R Package nlme. Interpretation. 2 = ( x 2) u 0j. e ij

Hierarchical Linear Models (HLM) Using R Package nlme. Interpretation. 2 = ( x 2) u 0j. e ij Hierarchical Linear Models (HLM) Using R Package nlme Interpretation I. The Null Model Level 1 (student level) model is mathach ij = β 0j + e ij Level 2 (school level) model is β 0j = γ 00 + u 0j Combined

More information

Centering Predictor and Mediator Variables in Multilevel and Time-Series Models

Centering Predictor and Mediator Variables in Multilevel and Time-Series Models Centering Predictor and Mediator Variables in Multilevel and Time-Series Models Tihomir Asparouhov and Bengt Muthén Part 2 May 7, 2018 Tihomir Asparouhov and Bengt Muthén Part 2 Muthén & Muthén 1/ 42 Overview

More information

multilevel modeling: concepts, applications and interpretations

multilevel modeling: concepts, applications and interpretations multilevel modeling: concepts, applications and interpretations lynne c. messer 27 october 2010 warning social and reproductive / perinatal epidemiologist concepts why context matters multilevel models

More information

The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs

The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs Chad S. Briggs, Kathie Lorentz & Eric Davis Education & Outreach University Housing Southern Illinois

More information

Introduction and Background to Multilevel Analysis

Introduction and Background to Multilevel Analysis Introduction and Background to Multilevel Analysis Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning Background and

More information

Random Intercept Models

Random Intercept Models Random Intercept Models Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline A very simple case of a random intercept

More information

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as

1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available as ST 51, Summer, Dr. Jason A. Osborne Homework assignment # - Solutions 1. (Rao example 11.15) A study measures oxygen demand (y) (on a log scale) and five explanatory variables (see below). Data are available

More information

Models for Clustered Data

Models for Clustered Data Models for Clustered Data Edps/Psych/Soc 589 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline Notation NELS88 data Fixed Effects ANOVA

More information

Models for Clustered Data

Models for Clustered Data Models for Clustered Data Edps/Psych/Stat 587 Carolyn J Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline Notation NELS88 data Fixed Effects ANOVA

More information

Introduction to Within-Person Analysis and RM ANOVA

Introduction to Within-Person Analysis and RM ANOVA Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides

More information

WU Weiterbildung. Linear Mixed Models

WU Weiterbildung. Linear Mixed Models Linear Mixed Effects Models WU Weiterbildung SLIDE 1 Outline 1 Estimation: ML vs. REML 2 Special Models On Two Levels Mixed ANOVA Or Random ANOVA Random Intercept Model Random Coefficients Model Intercept-and-Slopes-as-Outcomes

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs) 36-309/749 Experimental Design for Behavioral and Social Sciences Dec 1, 2015 Lecture 11: Mixed Models (HLMs) Independent Errors Assumption An error is the deviation of an individual observed outcome (DV)

More information

Categorical Predictor Variables

Categorical Predictor Variables Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively

More information

Designing Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D.

Designing Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D. Designing Multilevel Models Using SPSS 11.5 Mixed Model John Painter, Ph.D. Jordan Institute for Families School of Social Work University of North Carolina at Chapel Hill 1 Creating Multilevel Models

More information

Formula for the t-test

Formula for the t-test Formula for the t-test: How the t-test Relates to the Distribution of the Data for the Groups Formula for the t-test: Formula for the Standard Error of the Difference Between the Means Formula for the

More information

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

Ron Heck, Fall Week 3: Notes Building a Two-Level Model Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level

More information

8/04/2011. last lecture: correlation and regression next lecture: standard MR & hierarchical MR (MR = multiple regression)

8/04/2011. last lecture: correlation and regression next lecture: standard MR & hierarchical MR (MR = multiple regression) psyc3010 lecture 7 analysis of covariance (ANCOVA) last lecture: correlation and regression next lecture: standard MR & hierarchical MR (MR = multiple regression) 1 announcements quiz 2 correlation and

More information

Latent Variable Centering of Predictors and Mediators in Multilevel and Time-Series Models

Latent Variable Centering of Predictors and Mediators in Multilevel and Time-Series Models Latent Variable Centering of Predictors and Mediators in Multilevel and Time-Series Models Tihomir Asparouhov and Bengt Muthén August 5, 2018 Abstract We discuss different methods for centering a predictor

More information

Prediction in Multilevel Models

Prediction in Multilevel Models Journal of Educational and Behavioral Statistics Spring 005, Vol. 30, No. 1, pp. 00 00 Prediction in Multilevel Models David Afshartous University of Miami Jan de Leeuw University of California, Los Angeles

More information

Describing Change over Time: Adding Linear Trends

Describing Change over Time: Adding Linear Trends Describing Change over Time: Adding Linear Trends Longitudinal Data Analysis Workshop Section 7 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

Longitudinal Data Analysis

Longitudinal Data Analysis Longitudinal Data Analysis Mike Allerhand This document has been produced for the CCACE short course: Longitudinal Data Analysis. No part of this document may be reproduced, in any form or by any means,

More information

Swarthmore Honors Exam 2012: Statistics

Swarthmore Honors Exam 2012: Statistics Swarthmore Honors Exam 2012: Statistics 1 Swarthmore Honors Exam 2012: Statistics John W. Emerson, Yale University NAME: Instructions: This is a closed-book three-hour exam having six questions. You may

More information

Multilevel modelling of fish abundance in streams

Multilevel modelling of fish abundance in streams Multilevel modelling of fish abundance in streams Marco A. Rodríguez Université du Québec à Trois-Rivières Hierarchies and nestedness are common in nature Biological data often have clustered or nested

More information

The Factorial Survey

The Factorial Survey The Factorial Survey Design Selection and Its Impact on Reliability and Internal Validity LCSR-EBES Training Methodological Workshop Economic and Socail Changes: Values Effects across Eurasia Antalya,

More information

Regression in R. Seth Margolis GradQuant May 31,

Regression in R. Seth Margolis GradQuant May 31, Regression in R Seth Margolis GradQuant May 31, 2018 1 GPA What is Regression Good For? Assessing relationships between variables This probably covers most of what you do 4 3.8 3.6 3.4 Person Intelligence

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

Mixed- Model Analysis of Variance. Sohad Murrar & Markus Brauer. University of Wisconsin- Madison. Target Word Count: Actual Word Count: 2755

Mixed- Model Analysis of Variance. Sohad Murrar & Markus Brauer. University of Wisconsin- Madison. Target Word Count: Actual Word Count: 2755 Mixed- Model Analysis of Variance Sohad Murrar & Markus Brauer University of Wisconsin- Madison The SAGE Encyclopedia of Educational Research, Measurement and Evaluation Target Word Count: 3000 - Actual

More information

Interactions among Continuous Predictors

Interactions among Continuous Predictors Interactions among Continuous Predictors Today s Class: Simple main effects within two-way interactions Conquering TEST/ESTIMATE/LINCOM statements Regions of significance Three-way interactions (and beyond

More information

Introduction to Random Effects of Time and Model Estimation

Introduction to Random Effects of Time and Model Estimation Introduction to Random Effects of Time and Model Estimation Today s Class: The Big Picture Multilevel model notation Fixed vs. random effects of time Random intercept vs. random slope models How MLM =

More information

Hierarchical Linear Models. Jeff Gill. University of Florida

Hierarchical Linear Models. Jeff Gill. University of Florida Hierarchical Linear Models Jeff Gill University of Florida I. ESSENTIAL DESCRIPTION OF HIERARCHICAL LINEAR MODELS II. SPECIAL CASES OF THE HLM III. THE GENERAL STRUCTURE OF THE HLM IV. ESTIMATION OF THE

More information

A multivariate multilevel model for the analysis of TIMMS & PIRLS data

A multivariate multilevel model for the analysis of TIMMS & PIRLS data A multivariate multilevel model for the analysis of TIMMS & PIRLS data European Congress of Methodology July 23-25, 2014 - Utrecht Leonardo Grilli 1, Fulvia Pennoni 2, Carla Rampichini 1, Isabella Romeo

More information

Research Design: Topic 18 Hierarchical Linear Modeling (Measures within Persons) 2010 R.C. Gardner, Ph.d.

Research Design: Topic 18 Hierarchical Linear Modeling (Measures within Persons) 2010 R.C. Gardner, Ph.d. Research Design: Topic 8 Hierarchical Linear Modeling (Measures within Persons) R.C. Gardner, Ph.d. General Rationale, Purpose, and Applications Linear Growth Models HLM can also be used with repeated

More information

Analysing data: regression and correlation S6 and S7

Analysing data: regression and correlation S6 and S7 Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association

More information

Notation Survey. Paul E. Johnson 1 2 MLM 1 / Department of Political Science

Notation Survey. Paul E. Johnson 1 2 MLM 1 / Department of Political Science MLM 1 / 52 Notation Survey Paul E. Johnson 1 2 1 Department of Political Science 2 Center for Research Methods and Data Analysis, University of Kansas 2015 MLM 2 / 52 Outline 1 Raudenbush and Bryk 2 Snijders

More information

HLM Models and Type VI Errors

HLM Models and Type VI Errors Models and Type VI Errors Understanding Models and Type VI Errors: The Need for Reflection Susan M. Tracz Isadore Newman David O. Newman California State University, Fresno Florida International University

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models Introduction to generalized models Models for binary outcomes Interpreting parameter

More information

TABLE OF CONTENTS INTRODUCTION TO MIXED-EFFECTS MODELS...3

TABLE OF CONTENTS INTRODUCTION TO MIXED-EFFECTS MODELS...3 Table of contents TABLE OF CONTENTS...1 1 INTRODUCTION TO MIXED-EFFECTS MODELS...3 Fixed-effects regression ignoring data clustering...5 Fixed-effects regression including data clustering...1 Fixed-effects

More information

Three-Level Modeling for Factorial Experiments With Experimentally Induced Clustering

Three-Level Modeling for Factorial Experiments With Experimentally Induced Clustering Three-Level Modeling for Factorial Experiments With Experimentally Induced Clustering John J. Dziak The Pennsylvania State University Inbal Nahum-Shani The University of Michigan Copyright 016, Penn State.

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors

More information

ANCOVA. ANCOVA allows the inclusion of a 3rd source of variation into the F-formula (called the covariate) and changes the F-formula

ANCOVA. ANCOVA allows the inclusion of a 3rd source of variation into the F-formula (called the covariate) and changes the F-formula ANCOVA Workings of ANOVA & ANCOVA ANCOVA, Semi-Partial correlations, statistical control Using model plotting to think about ANCOVA & Statistical control You know how ANOVA works the total variation among

More information

DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective

DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective Second Edition Scott E. Maxwell Uniuersity of Notre Dame Harold D. Delaney Uniuersity of New Mexico J,t{,.?; LAWRENCE ERLBAUM ASSOCIATES,

More information

Review of CLDP 944: Multilevel Models for Longitudinal Data

Review of CLDP 944: Multilevel Models for Longitudinal Data Review of CLDP 944: Multilevel Models for Longitudinal Data Topics: Review of general MLM concepts and terminology Model comparisons and significance testing Fixed and random effects of time Significance

More information

Assessing the relation between language comprehension and performance in general chemistry. Appendices

Assessing the relation between language comprehension and performance in general chemistry. Appendices Assessing the relation between language comprehension and performance in general chemistry Daniel T. Pyburn a, Samuel Pazicni* a, Victor A. Benassi b, and Elizabeth E. Tappin c a Department of Chemistry,

More information

psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests

psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests last lecture: introduction to factorial designs next lecture: factorial between-ps ANOVA II: (effect sizes and follow-up tests) 1 general

More information

LINEAR MULTILEVEL MODELS. Data are often hierarchical. By this we mean that data contain information

LINEAR MULTILEVEL MODELS. Data are often hierarchical. By this we mean that data contain information LINEAR MULTILEVEL MODELS JAN DE LEEUW ABSTRACT. This is an entry for The Encyclopedia of Statistics in Behavioral Science, to be published by Wiley in 2005. 1. HIERARCHICAL DATA Data are often hierarchical.

More information

A Re-Introduction to General Linear Models (GLM)

A Re-Introduction to General Linear Models (GLM) A Re-Introduction to General Linear Models (GLM) Today s Class: You do know the GLM Estimation (where the numbers in the output come from): From least squares to restricted maximum likelihood (REML) Reviewing

More information

STA441: Spring Multiple Regression. More than one explanatory variable at the same time

STA441: Spring Multiple Regression. More than one explanatory variable at the same time STA441: Spring 2016 Multiple Regression More than one explanatory variable at the same time This slide show is a free open source document. See the last slide for copyright information. One Explanatory

More information

ANCOVA. Lecture 9 Andrew Ainsworth

ANCOVA. Lecture 9 Andrew Ainsworth ANCOVA Lecture 9 Andrew Ainsworth What is ANCOVA? Analysis of covariance an extension of ANOVA in which main effects and interactions are assessed on DV scores after the DV has been adjusted for by the

More information

Hierarchical Linear Modeling. Lesson Two

Hierarchical Linear Modeling. Lesson Two Hierarchical Linear Modeling Lesson Two Lesson Two Plan Multivariate Multilevel Model I. The Two-Level Multivariate Model II. Examining Residuals III. More Practice in Running HLM I. The Two-Level Multivariate

More information

INTRODUCTION TO ANALYSIS OF VARIANCE

INTRODUCTION TO ANALYSIS OF VARIANCE CHAPTER 22 INTRODUCTION TO ANALYSIS OF VARIANCE Chapter 18 on inferences about population means illustrated two hypothesis testing situations: for one population mean and for the difference between two

More information

Machine Learning for OR & FE

Machine Learning for OR & FE Machine Learning for OR & FE Regression II: Regularization and Shrinkage Methods Martin Haugh Department of Industrial Engineering and Operations Research Columbia University Email: martin.b.haugh@gmail.com

More information

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person

More information

Part 8: GLMs and Hierarchical LMs and GLMs

Part 8: GLMs and Hierarchical LMs and GLMs Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course

More information

Longitudinal Data Analysis of Health Outcomes

Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis Workshop Running Example: Days 2 and 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development

More information

Describing Within-Person Change over Time

Describing Within-Person Change over Time Describing Within-Person Change over Time Topics: Multilevel modeling notation and terminology Fixed and random effects of linear time Predicted variances and covariances from random slopes Dependency

More information

MLA Software for MultiLevel Analysis of Data with Two Levels

MLA Software for MultiLevel Analysis of Data with Two Levels MLA Software for MultiLevel Analysis of Data with Two Levels User s Guide for Version 4.1 Frank M. T. A. Busing Leiden University, Department of Psychology, P.O. Box 9555, 2300 RB Leiden, The Netherlands.

More information

Correlation and Regression Bangkok, 14-18, Sept. 2015

Correlation and Regression Bangkok, 14-18, Sept. 2015 Analysing and Understanding Learning Assessment for Evidence-based Policy Making Correlation and Regression Bangkok, 14-18, Sept. 2015 Australian Council for Educational Research Correlation The strength

More information

ANCOVA. Psy 420 Andrew Ainsworth

ANCOVA. Psy 420 Andrew Ainsworth ANCOVA Psy 420 Andrew Ainsworth What is ANCOVA? Analysis of covariance an extension of ANOVA in which main effects and interactions are assessed on DV scores after the DV has been adjusted for by the DV

More information

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is

More information

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information. STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory

More information

Distribution-free ROC Analysis Using Binary Regression Techniques

Distribution-free ROC Analysis Using Binary Regression Techniques Distribution-free ROC Analysis Using Binary Regression Techniques Todd A. Alonzo and Margaret S. Pepe As interpreted by: Andrew J. Spieker University of Washington Dept. of Biostatistics Update Talk 1

More information

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */

over Time line for the means). Specifically, & covariances) just a fixed variance instead. PROC MIXED: to 1000 is default) list models with TYPE=VC */ CLP 944 Example 4 page 1 Within-Personn Fluctuation in Symptom Severity over Time These data come from a study of weekly fluctuation in psoriasis severity. There was no intervention and no real reason

More information

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company Multiple Regression Inference for Multiple Regression and A Case Study IPS Chapters 11.1 and 11.2 2009 W.H. Freeman and Company Objectives (IPS Chapters 11.1 and 11.2) Multiple regression Data for multiple

More information

VIII. ANCOVA. A. Introduction

VIII. ANCOVA. A. Introduction VIII. ANCOVA A. Introduction In most experiments and observational studies, additional information on each experimental unit is available, information besides the factors under direct control or of interest.

More information

Regression, Ridge Regression, Lasso

Regression, Ridge Regression, Lasso Regression, Ridge Regression, Lasso Fabio G. Cozman - fgcozman@usp.br October 2, 2018 A general definition Regression studies the relationship between a response variable Y and covariates X 1,..., X n.

More information

Technical Appendix C: Methods

Technical Appendix C: Methods Technical Appendix C: Methods As not all readers may be familiar with the multilevel analytical methods used in this study, a brief note helps to clarify the techniques. The general theory developed in

More information

Simple Linear Regression for the Climate Data

Simple Linear Regression for the Climate Data Prediction Prediction Interval Temperature 0.2 0.0 0.2 0.4 0.6 0.8 320 340 360 380 CO 2 Simple Linear Regression for the Climate Data What do we do with the data? y i = Temperature of i th Year x i =CO

More information

BIOSTATISTICAL METHODS

BIOSTATISTICAL METHODS BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH Cross-over Designs #: DESIGNING CLINICAL RESEARCH The subtraction of measurements from the same subject will mostly cancel or minimize effects

More information

Ridge regression. Patrick Breheny. February 8. Penalized regression Ridge regression Bayesian interpretation

Ridge regression. Patrick Breheny. February 8. Penalized regression Ridge regression Bayesian interpretation Patrick Breheny February 8 Patrick Breheny High-Dimensional Data Analysis (BIOS 7600) 1/27 Introduction Basic idea Standardization Large-scale testing is, of course, a big area and we could keep talking

More information

Introduction. Dottorato XX ciclo febbraio Outline. A hierarchical structure. Hierarchical structures: type 2. Hierarchical structures: type 1

Introduction. Dottorato XX ciclo febbraio Outline. A hierarchical structure. Hierarchical structures: type 2. Hierarchical structures: type 1 Outline Introduction to multilevel analysis. Introduction >. > Leonardo Grilli 3. Estimation > 4. Software & Books > Email: grilli@ds.unifi.it Web: http://www.ds.unifi.it/grilli/ Department of Statistics

More information

Advancing the Formulation and Testing of Multilevel Mediation and Moderated Mediation Models

Advancing the Formulation and Testing of Multilevel Mediation and Moderated Mediation Models Advancing the Formulation and Testing of Multilevel Mediation and Moderated Mediation Models A Thesis Presented in Partial Fulfillment of the Requirements for the Degree Master of Arts in the Graduate

More information

23. Inference for regression

23. Inference for regression 23. Inference for regression The Practice of Statistics in the Life Sciences Third Edition 2014 W. H. Freeman and Company Objectives (PSLS Chapter 23) Inference for regression The regression model Confidence

More information

MLMED. User Guide. Nicholas J. Rockwood The Ohio State University Beta Version May, 2017

MLMED. User Guide. Nicholas J. Rockwood The Ohio State University Beta Version May, 2017 MLMED User Guide Nicholas J. Rockwood The Ohio State University rockwood.19@osu.edu Beta Version May, 2017 MLmed is a computational macro for SPSS that simplifies the fitting of multilevel mediation and

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

Outline

Outline 2559 Outline cvonck@111zeelandnet.nl 1. Review of analysis of variance (ANOVA), simple regression analysis (SRA), and path analysis (PA) 1.1 Similarities and differences between MRA with dummy variables

More information

Module 2. General Linear Model

Module 2. General Linear Model D.G. Bonett (9/018) Module General Linear Model The relation between one response variable (y) and q 1 predictor variables (x 1, x,, x q ) for one randomly selected person can be represented by the following

More information

Least Squares Analyses of Variance and Covariance

Least Squares Analyses of Variance and Covariance Least Squares Analyses of Variance and Covariance One-Way ANOVA Read Sections 1 and 2 in Chapter 16 of Howell. Run the program ANOVA1- LS.sas, which can be found on my SAS programs page. The data here

More information

High-dimensional regression

High-dimensional regression High-dimensional regression Advanced Methods for Data Analysis 36-402/36-608) Spring 2014 1 Back to linear regression 1.1 Shortcomings Suppose that we are given outcome measurements y 1,... y n R, and

More information

Research Design - - Topic 19 Multiple regression: Applications 2009 R.C. Gardner, Ph.D.

Research Design - - Topic 19 Multiple regression: Applications 2009 R.C. Gardner, Ph.D. Research Design - - Topic 19 Multiple regression: Applications 2009 R.C. Gardner, Ph.D. Curve Fitting Mediation analysis Moderation Analysis 1 Curve Fitting The investigation of non-linear functions using

More information

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data Today s Class: Review of concepts in multivariate data Introduction to random intercepts Crossed random effects models

More information

SAS Syntax and Output for Data Manipulation: CLDP 944 Example 3a page 1

SAS Syntax and Output for Data Manipulation: CLDP 944 Example 3a page 1 CLDP 944 Example 3a page 1 From Between-Person to Within-Person Models for Longitudinal Data The models for this example come from Hoffman (2015) chapter 3 example 3a. We will be examining the extent to

More information

Exploiting TIMSS and PIRLS combined data: multivariate multilevel modelling of student achievement

Exploiting TIMSS and PIRLS combined data: multivariate multilevel modelling of student achievement Exploiting TIMSS and PIRLS combined data: multivariate multilevel modelling of student achievement Second meeting of the FIRB 2012 project Mixture and latent variable models for causal-inference and analysis

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Analysis of Covariance

Analysis of Covariance Analysis of Covariance Using categorical and continuous predictor variables Example An experiment is set up to look at the effects of watering on Oak Seedling establishment Three levels of watering: (no

More information

Multilevel Modeling Day 2 Intermediate and Advanced Issues: Multilevel Models as Mixed Models. Jian Wang September 18, 2012

Multilevel Modeling Day 2 Intermediate and Advanced Issues: Multilevel Models as Mixed Models. Jian Wang September 18, 2012 Multilevel Modeling Day 2 Intermediate and Advanced Issues: Multilevel Models as Mixed Models Jian Wang September 18, 2012 What are mixed models The simplest multilevel models are in fact mixed models:

More information

Hierarchical Linear Models-Redux

Hierarchical Linear Models-Redux Hierarchical Linear Models-Redux Joseph Stevens, Ph.D., University of Oregon (541) 346-2445, stevensj@uoregon.edu Stevens, 2008 1 Overview and resources Overview Listserv: http://www.jiscmail.ac.uk/lists/multilevel.html

More information

Correlation and Linear Regression

Correlation and Linear Regression Correlation and Linear Regression Correlation: Relationships between Variables So far, nearly all of our discussion of inferential statistics has focused on testing for differences between group means

More information

Sparse Linear Models (10/7/13)

Sparse Linear Models (10/7/13) STA56: Probabilistic machine learning Sparse Linear Models (0/7/) Lecturer: Barbara Engelhardt Scribes: Jiaji Huang, Xin Jiang, Albert Oh Sparsity Sparsity has been a hot topic in statistics and machine

More information

Estimation: Problems & Solutions

Estimation: Problems & Solutions Estimation: Problems & Solutions Edps/Psych/Stat 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline 1. Introduction: Estimation of

More information

Advanced Experimental Design

Advanced Experimental Design Advanced Experimental Design Topic 8 Chapter : Repeated Measures Analysis of Variance Overview Basic idea, different forms of repeated measures Partialling out between subjects effects Simple repeated

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor

More information

Spring RMC Professional Development Series January 14, Generalized Linear Mixed Models (GLMMs): Concepts and some Demonstrations

Spring RMC Professional Development Series January 14, Generalized Linear Mixed Models (GLMMs): Concepts and some Demonstrations Spring RMC Professional Development Series January 14, 2016 Generalized Linear Mixed Models (GLMMs): Concepts and some Demonstrations Ann A. O Connell, Ed.D. Professor, Educational Studies (QREM) Director,

More information

Coping with Additional Sources of Variation: ANCOVA and Random Effects

Coping with Additional Sources of Variation: ANCOVA and Random Effects Coping with Additional Sources of Variation: ANCOVA and Random Effects 1/49 More Noise in Experiments & Observations Your fixed coefficients are not always so fixed Continuous variation between samples

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies

More information

Analysis of Variance

Analysis of Variance Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also

More information