The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs

Similar documents
Research Design: Topic 18 Hierarchical Linear Modeling (Measures within Persons) 2010 R.C. Gardner, Ph.d.

Exploiting TIMSS and PIRLS combined data: multivariate multilevel modelling of student achievement

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)

Estimation and Centering

Multilevel Modeling: A Second Course

Regression in R. Seth Margolis GradQuant May 31,

Designing Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D.

multilevel modeling: concepts, applications and interpretations

HLM Models and Type VI Errors

Assessing the relation between language comprehension and performance in general chemistry. Appendices

A multivariate multilevel model for the analysis of TIMMS & PIRLS data

Introduction to. Multilevel Analysis

Hierarchical Linear Modeling. Lesson Two

Random Intercept Models

Using Power Tables to Compute Statistical Power in Multilevel Experimental Designs

Introducing Generalized Linear Models: Logistic Regression

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

Technical Appendix C: Methods

Mixed- Model Analysis of Variance. Sohad Murrar & Markus Brauer. University of Wisconsin- Madison. Target Word Count: Actual Word Count: 2755

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

ROBUSTNESS OF MULTILEVEL PARAMETER ESTIMATES AGAINST SMALL SAMPLE SIZES

Measurement of Academic Growth of Individual Students toward Variable and Meaningful Academic Standards 1

Hierarchical Linear Models-Redux

Review of the General Linear Model

psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests

Review of Multiple Regression

Simple Linear Regression: One Qualitative IV

Utilizing Hierarchical Linear Modeling in Evaluation: Concepts and Applications

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

Technical Appendix C: Methods. Multilevel Regression Models

Introduction to Random Effects of Time and Model Estimation

Impact of serial correlation structures on random effect misspecification with the linear mixed model.

Review of CLDP 944: Multilevel Models for Longitudinal Data

WU Weiterbildung. Linear Mixed Models

Three-Level Modeling for Factorial Experiments With Experimentally Induced Clustering

Hypothesis Testing for Var-Cov Components

Estimating a Piecewise Growth Model with Longitudinal Data that Contains Individual Mobility across Clusters

Modelling heterogeneous variance-covariance components in two-level multilevel models with application to school effects educational research

Generalized Linear Probability Models in HLM R. B. Taylor Department of Criminal Justice Temple University (c) 2000 by Ralph B.

Summary of the model specified

Centering Predictor and Mediator Variables in Multilevel and Time-Series Models

Longitudinal Modeling with Logistic Regression

Models for Clustered Data

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Models for Clustered Data

Determining Sample Sizes for Surveys with Data Analyzed by Hierarchical Linear Models

Additional Notes: Investigating a Random Slope. When we have fixed level-1 predictors at level 2 we show them like this:

Hierarchical Linear Models. Jeff Gill. University of Florida

Testing and Interpreting Interaction Effects in Multilevel Models

Workshop on Statistical Applications in Meta-Analysis

Simple Linear Regression: One Quantitative IV

CAMPBELL COLLABORATION

Introduction to Within-Person Analysis and RM ANOVA

LINEAR MULTILEVEL MODELS. Data are often hierarchical. By this we mean that data contain information

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!

Partitioning variation in multilevel models.

Extensions of One-Way ANOVA.

Summary of the model specified

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Formula for the t-test

ANCOVA. Lecture 9 Andrew Ainsworth

9. Linear Regression and Correlation

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data

Multiple Linear Regression

DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective

General Principles Within-Cases Factors Only Within and Between. Within Cases ANOVA. Part One

TABLE OF CONTENTS INTRODUCTION TO MIXED-EFFECTS MODELS...3

Simple, Marginal, and Interaction Effects in General Linear Models: Part 1

Technical and Practical Considerations in applying Value Added Models to estimate teacher effects

Introduction and Background to Multilevel Analysis

Notation Survey. Paul E. Johnson 1 2 MLM 1 / Department of Political Science

BIOS 6649: Handout Exercise Solution

Ronald Heck Week 14 1 EDEP 768E: Seminar in Categorical Data Modeling (F2012) Nov. 17, 2012

Categorical Predictor Variables

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised )

Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti

Advanced Quantitative Data Analysis

Extensions of One-Way ANOVA.

Multiple Regression. More Hypothesis Testing. More Hypothesis Testing The big question: What we really want to know: What we actually know: We know:

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Statistical Inference When Classroom Quality is Measured With Error

Deciphering Math Notation. Billy Skorupski Associate Professor, School of Education

Multilevel Analysis of Assessment Data

ANCOVA. ANCOVA allows the inclusion of a 3rd source of variation into the F-formula (called the covariate) and changes the F-formula

Two-Level Models for Clustered* Data

This is the publisher s copyrighted version of this article.

Incorporating Cost in Power Analysis for Three-Level Cluster Randomized Designs

Multilevel Analysis, with Extensions

Econometrics Review questions for exam

Spring RMC Professional Development Series January 14, Generalized Linear Mixed Models (GLMMs): Concepts and some Demonstrations

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7

This document contains 3 sets of practice problems.

8/04/2011. last lecture: correlation and regression next lecture: standard MR & hierarchical MR (MR = multiple regression)

Difference in two or more average scores in different groups

Hierarchical Linear Models (HLM) Using R Package nlme. Interpretation. 2 = ( x 2) u 0j. e ij

Final Exam - Solutions

The Implications of Centering in a Three-Level Multilevel Model

THE EFFECTS OF MULTICOLLINEARITY IN MULTILEVEL MODELS

Multi-level Models: Idea

Transcription:

The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs Chad S. Briggs, Kathie Lorentz & Eric Davis Education & Outreach University Housing Southern Illinois University Carbondale

Southern Illinois University Carbondale Large doctoral research public university located in the southern tip of Illinois Six hours from Chicago Rural community Large number of students from the Chicago land area

Enrollment On Campus Enrollment at the time of study 19,124 On-Campus Residence Hall Enrollment at the time of study 4,314 students

Primary Purpose and Applications for Hierarchical Linear Modeling (HLM) HLM allows us to assess and model the variable effects of context or environment Example Housing and FYE Applications Students nested within Classrooms (e.g., freshman seminars) Programs (e.g., LLCs or Peer Mentoring) Residence halls and floors Universities (cross-institutional research)

Additional Applications for HLM Item Analysis Items nested within respondents Growth Modeling or Longitudinal Research Observations over time nested within students Cross-Classification Students nested within more than one group (e.g., floors and classrooms, or different living environments across time) Meta-Analysis Coefficients nested within studies

Failing to Capture the Social Ecology of First-Year Experience Programs Participation in FYE programs typically takes place within a group context AND this context often influences individual outcomes In fact, Living-Learning Communities (LLCs) are designed to capitalize on the resources and dynamics of group membership to yield desired outcomes (e.g., GPA and persistence) Yet, FYE and Housing evaluation efforts rarely use HLM to model the influence of these context effects

Literature Review Located Housing and First-Year Experience (FYE) articles that used HLM in their analysis 4 major databases were searched EBSCO, ERIC JSTOR and MUSE Keywords for HLM, Housing and First-Year Experience programs were cross-referenced in each database Results Just 5 articles* were found * Please contact presenters for references

Traditional Methods of Modeling Context (or Group-Level) Effects Disaggregation of Group Characteristics Group characteristics are assigned to everyone in a group Violates assumption of independence Aggregation of Individual Characteristics Mean individual characteristics assigned to group Loss of sample size, within-group variability and power Consequence Biased estimates of effect inaccurate/misleading results

Hierarchical Linear Modeling (HLM) HLM allows us to obtain unbiased estimates of effect for group context variables Hierarchical Indicates that Level 1 (or student-level) coefficients become outcomes at Level 2 (grouplevel)

Regression Refresher Y-axis (DV) Random error Slope Intercept Predicted outcome for Student i Y i B B1( X ) o i r i X-axis (IV)

Linear Model Terminology Where, Y ij = Outcome for person i in group j β 0j = Intercept for group j Value of Y when X = mean of group j X If X is centered around the grand mean ( ij ), then the intercept equals the value of Y for a person with an X equal to the average X across all groups β 1j = Slope for group j Change in Y associated with a 1 unit change in X X j X. ij = group-mean centered value of Level 1 variable for person i in group j r ij = random error term (predicted Y ij observed Y ij ) for person i in group j r ij ~ N(0,σ 2 ), or r ij is assumed normally distributed with a mean of 0 and a constant variance equal to sigma-squared. X..

Regression with Multiple Groups Two Group Case Y Y B B ( X ) r i1 01 11 i1 i1 B B ( X ) r i2 02 12 i2 i2 J Group Case Y ij 0 j 1 j ( X X ij. j ) r ij

Two-Level Model Level 1 Equation Y ij 0 j 1 j ( X X ij. j ) r ij Level 2 Equations u 0 j 00 01 W j 0 j u 1 j 10 11 W j 1 j

Mixed Two-Level Model Grand Mean Main Effects of W j and X ij Y ij ( W ) ( X X 00 01 j 10 i. j j ) Interaction Effect of W j and X ij W ( Xi X 11 j j. j ) Random Error Terms for Intercept, Slope and Student ) 0 j 1 ( X X j i j. j r ij

Two-Level Model Terminology Generally, 00 = Grand Intercept (mean of Y across all groups) 01 = avg. difference between Grand Intercept and Intercept for group j given W j 10 = Grand slope (slope across all groups) 11 = avg. difference between Grand Slope and slope for group j given W j and X ij µ 0j = random deviation of group means about the grand intercept µ 1j = random deviation of group slopes about the grand slope W j = Level 2 predictor for group j

Variance-Covariance Components Var(r ij ) = σ 2 Within-group variability Var(µ 0j ) = τ 00 Variability about grand mean Var(µ 1j ) = τ 11 Variability about grand slope Cov(µ 0j, µ 1j ) = τ 01 Covariance of slopes and intercepts

EXAMPLE ANALYSIS

Background on Housing Hierarchy University Housing Halls/Floors Brush Towers 32 Floors 2 Halls Thompson Point 33 Floors 11 Halls University Park 56 Floors 11 Halls Totals 121 Floors 24 Halls

Living-Learning Community (LLC) Program at SIUC in 2005 LLC Program Components Academic/Special Emphasis Floors (AEFs) First Implemented in 1996 12 Academic/Special Emphasis Floors Freshman Interest Groups (FIGs) First implemented in 2001 17 FIGs offered Some FIGs were nested on AEFs

Data Collection All example data were collected via university records Sampling LLC Students (n = 421) All FIG students (n = 223) Random sample of AEF students (n = 147) All FIG students nested on AEFs (n = 51) Random sample of Comparison students (n = 237)

2005 Cohort Sample Demographics Percent African Percent Percent American Mean Group N Female White ACT LLC 421 38% 64% 25% 22.44 Comparison 237 46% 57% 36% 21.21 Total 658 41% 62% 31% 22.00

Hierarchical 2-Level Dataset Level 1 657 Students Level 2 93 Floors 1 to 31 students (mean = 7) populated each floor Low n-sizes per floor are not ideal, but HLM makes it possible to estimate coefficients with some accuracy via Bayes estimation.

Variables Included in Example Analysis Outcome (Y) First-Semester GPA (Fall 2005) Student-Level Variables (X s) Student ACT score Student LLC Program Participation (LLC student = 1, Other = 0) Floor-Level Variables (W s) MEAN_ACT (average floor ACT score) LLC Participation Rate (Percent LLC students on floor)

Research Questions Addressed in Example Analysis How much do residence hall floors vary in terms of firstsemester GPA? Do floors with high MEAN ACT scores also have high first-semester GPAs? Does the strength of the relationship between the student-level variables (e.g., student ACT) and GPA vary across floors? Are ACT effects greater at the student- or floor-level? Does participation in an LLC (and participation rate per floor) influence first-semester GPA after controlling for student- and floor-level ACT? Are there any student-by-environment interactions?

Model Building HLM involves five model building steps: 1. One-Way ANOVA with random effects 2. Means-as-Outcomes 3. One-Way ANCOVA with random effects 4. Random Intercepts-and-Slopes 5. Intercepts-and-Slopes-as-Outcomes

One-Way ANOVA with Random Effects LEVEL 1 MODEL F05GPA ij = 0j + r ij LEVEL 2 MODEL Level-1 Slope is set equal to 0. 0j = 00 + u 0j MIXED MODEL F05GPA ij = 00 + u 0j + r ij

ANOVA Results and Auxiliary Statistics Point Estimate for Grand Mean 00 = 2.59*** Variance Components σ 2 =.93 τ 00 =.05* Because τ 00 is significant, HLM is appropriate Auxiliary Statistics Plausible Range of Floor Means 95%CI = 00 +- 1.96(τ 00 ) ½ = 2.13 to 3.05 Intraclass Correlation Coefficient (ICC) ICC =.06 (6% of the var. in first-semester GPA is between floors) Reliability (of sample means) λ =.24

Means-as-Outcomes LEVEL 1 MODEL F05GPA ij = 0j + r ij LEVEL 2 MODEL 0j = 00 + 01 (MEAN_ACT j ) + u 0j Level-2 Predictor Added to Intercept Model MIXED MODEL F05GPA ij = 00 + 01 MEAN_ACT j + u 0j + r ij

Means-as-Outcomes Results and Auxiliary Statistics Fixed Coefficients 01 = 0.92*** (effect of floor-level ACT) Variance Components σ 2 =.93 τ 00 =.04 (ns, p =.11) Auxiliary Statistics Proportion Reduction in Variance (PRV) PRV =.28 (MEAN ACT accounted for 28% of between-group var.) Conditional ICC ICC =.04 (Remaining unexplained variance between floors = 4%) Conditional Reliability λ =.20 (reliability with which we can discriminate among floors with identical MEAN ACT values)

One-Way ANCOVA with Random Effects LEVEL 1 MODEL F05GPA ij = 0j + 1j (ACT ij - ACT. j ) + r ij Level-1 Covariate Added to Model LEVEL 2 MODEL 0j = 00 + u 0j 1j = 10 MIXED MODEL F05GPA ij = 00 + 10 (ACT ij - ACT.j ) + u 0j + r ij

ANCOVA Results and Auxiliary Statistics Fixed Coefficients 10 = 0.07*** (effect of student ACT) Variance Components σ 2 =.88 Auxiliary Statistics PRV due to Student SES PRV =.05 Student ACT accounted for 5% of the within-group variance MEAN ACT accounted for 28% of the between-group variance» Thus, ACT seems to have more of an influence on firstsemester GPA at the group (or floor) level than at the individual-level

Random Coefficients LEVEL 1 MODEL F05GPA ij = 0j + 1j (ACT ij - ACT. j ) + r ij LEVEL 2 MODEL 0j 1j = 00 + u 0j = 10 + u 1j Both Intercepts and Slopes are Set to Randomly Varying Across Level-2 Units MIXED MODEL F05GPA ij = 00 + 10 (ACT ij - ACT.j ) + u 0j + u 1j (ACT ij - ACT.j ) + r ij

Random Coefficients Results Variance-Covariance Components σ 2 =.88 τ 00 =.06** τ 11 =.00 (ns, p =.40) Because τ 11 is non-sig., slopes are constant across floors, and term can be dropped Dropping τ 11 also increases efficiency because µ 1j, τ 11 and τ 01 don t have to be estimated. τ 01 =.001

Random Coefficients Auxiliary Statistics Auxiliary Statistics Reliability of Intercepts and Slopes λ (β 0 ) =.31 Reliability with which we can discriminate among floor means after student SES has been taken into account λ (β 1 ) =.00 Reliability with which we can discriminate among the floor slopes; in this case, the grand slope adequately describes the slope for each floor. Correlation Between Floor Intercepts and Slopes ρ =.66 Floors with high MEAN ACT scores also have high mean firstsemester GPAs

Intercepts-and-Slopes-as-Outcomes (ISO) LEVEL 1 MODEL Added student-level participation in LLC program to Level 1 model; set slope to nonrandomly vary across floors F05GPA ij = 0j + 1j (LLC ij ) + 2j (ACT ij - ACT. j ) + r ij LEVEL 2 MODEL 0j = 00 + 01 (LLC j ) + 02 (MEAN_ACT j ) + u 0j 1j = 10 + 11 (LLC j ) + 12 (MEAN_ACT j ) 2j = 20 + 21 (LLC j ) + 22 (MEAN_ACT j )

I-S-O Mixed Model MIXED MODEL ) ( ) ( 05 02 01 00 j j ij MEANACT LLC GPA F ij j j ij j j ij j j ij ij j ij j ij r ACT ACT MEANACT ACT ACT LLC ACT ACT LLC MEANACT LLC LLC LLC 0. 22. 21. 20 12 11 10 ) )( ( ) )( ( ) ( ) )( ( ) )( ( ) (

Interpretation of Mixed I-S-O Model First-Semester GPA = Grand Mean 4 Main Effects Percentage of Students on Floor Participating in an LLC Floor s Mean ACT score Student-Level Participation in a LLC Student s ACT score 4 Interaction Terms Floor LLC by Student LLC Floor ACT by Student LLC Floor LLC by Student ACT Floor ACT by Student ACT 2 Random Error Components Residual deviation about grand mean Residual deviation about floor mean

I-S-O Results Fixed Effects Coefficient SE Sig. Grand Intercept, -0.18 0.58 ns Main Effects LLC Floor-Level Participation Rate, 0.15 0.22 ns Floor MEAN ACT, 0.12 0.03 *** Student LLC Participation, 2.03 0.83 * Student-Level ACT (Grand Slope), -.07.14 ns Interaction Effects 00 02 10 01 20 11 12 21 22 Floor LLC Rate by Student LLC, 0.29 0.32 ns Floor MEAN ACT by Student LLC, -0.10 0.04 ** Floor LLC Rate by Student ACT, 0.07 0.03 * Floor MEAN ACT by Student ACT, 0.00 0.01 ns *** p <.001 ** p <.01 * p <.05 ^ p <.10

Floor MEAN ACT by Student LLC 2.85 LLC = 0 LLC = 1 2.67 F05GPA 2.48 2.30 2.11 18.00 19.50 21.00 22.50 24.00 MEAN_ACT

Caveats of Using HLM Large Sample Sizes Sample sizes of j by n can quickly get out of hand (and expensive) Large n- and j-sizes not required, but reliability of estimates increase as n and effect size increase No magic number, but central limit theorem suggests that (multivariate) normality can be achieved with around 30 per group, with 30 groups Advanced Statistical Procedure Use of HLM requires: A solid background in multivariate statistics Time to learn the statistical language SSI offers seminars, but statistical language should be familiar before attending HLM is only a Statistical Technique HLM s efficacy is limited by quality of research design and data collection procedures

Resources and Costs Scientific Software International (SSI) - www.ssicentral.com HLM 6.06 software (single license = $425) Free student version available Software User s Manual ($35) Raudenbush, S., Bryk, A., Cheong, Y. F., & Congdon, R. (2004). HLM 6: Hierarchical Linear and Nonlinear Modeling. Scientific Software International: Lincolnwood, IL. Textbook ($85) Raudenbush, S. W., & Bryk, A. S. (2002). Hierarchical Linear Models: Applications and Data Analysis Methods. Sage Publications: Thousand Oaks, CA.

Presentation Reference Briggs, C. S., Lorentz, K., & Davis, E. (2009). The application and promise of hierarchical linear modeling in studying first-year student programs. Presented at the annual conference on The First-Year Experience, Orlando, FL.

Further Information For further information, please contact: Chad Briggs (briggs@siu.edu, 618-453-7535) Kathie Lorentz (klorentz@siu.edu, 618-453-7993)