ROBUSTNESS OF MULTILEVEL PARAMETER ESTIMATES AGAINST SMALL SAMPLE SIZES

Similar documents
Multilevel Modeling: When and Why 1. 1 Why multilevel data need multilevel models

Multilevel Analysis of Grouped and Longitudinal Data

Partitioning variation in multilevel models.

Multiple-Objective Optimal Designs for the Hierarchical Linear Model

Determining Sample Sizes for Surveys with Data Analyzed by Hierarchical Linear Models

Hypothesis Testing for Var-Cov Components

CENTERING IN MULTILEVEL MODELS. Consider the situation in which we have m groups of individuals, where

1 Introduction. 2 Example

The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs

Using Power Tables to Compute Statistical Power in Multilevel Experimental Designs

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Logistic regression analysis with multidimensional random effects: A comparison of three approaches

The Basic Two-Level Regression Model

A Monte Carlo Power Analysis of Traditional Repeated Measures and Hierarchical Multivariate Linear Models in Longitudinal Data Analysis

Multilevel Statistical Models: 3 rd edition, 2003 Contents

LINEAR MULTILEVEL MODELS. Data are often hierarchical. By this we mean that data contain information

Multilevel Structural Equation Modeling

Latent Variable Centering of Predictors and Mediators in Multilevel and Time-Series Models

Hierarchical Linear Models-Redux

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised )

MLA Software for MultiLevel Analysis of Data with Two Levels

Multilevel regression mixture analysis

Centering Predictor and Mediator Variables in Multilevel and Time-Series Models

BIBLIOGRAPHY. Azzalini, A., Bowman, A., and Hardle, W. (1986), On the Use of Nonparametric Regression for Model Checking, Biometrika, 76, 1, 2-12.

AN APPLICATION OF MULTILEVEL MODEL PREDICTION TO NELS:88

multilevel modeling: concepts, applications and interpretations

Exploiting TIMSS and PIRLS combined data: multivariate multilevel modelling of student achievement

Random Intercept Models

Hierarchical Linear Models. Jeff Gill. University of Florida

Introduction to. Multilevel Analysis

Sampling. February 24, Multilevel analysis and multistage samples

Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior

Introducing Generalized Linear Models: Logistic Regression

Notation Survey. Paul E. Johnson 1 2 MLM 1 / Department of Political Science

Ronald Heck Week 14 1 EDEP 768E: Seminar in Categorical Data Modeling (F2012) Nov. 17, 2012

APPLIED MULTILEVEL ANALYSIS. J.J. Hox. TT-Publikaties, Amsterdam

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Generalized Linear Probability Models in HLM R. B. Taylor Department of Criminal Justice Temple University (c) 2000 by Ralph B.

Multilevel Analysis. Techniques and Applications. Second Edition. Joop J. Hox. Utrecht University, The Netherlands

Plausible Values for Latent Variables Using Mplus

A Non-parametric bootstrap for multilevel models

This is the publisher s copyrighted version of this article.

Bayesian Methods in Multilevel Regression

HLM Models and Type VI Errors

Discrete Response Multilevel Models for Repeated Measures: An Application to Voting Intentions Data

Time Invariant Predictors in Longitudinal Models

Variance partitioning in multilevel logistic models that exhibit overdispersion

Psychology 282 Lecture #4 Outline Inferences in SLR

FIT CRITERIA PERFORMANCE AND PARAMETER ESTIMATE BIAS IN LATENT GROWTH MODELS WITH SMALL SAMPLES

Multilevel Regression Mixture Analysis

Spring RMC Professional Development Series January 14, Generalized Linear Mixed Models (GLMMs): Concepts and some Demonstrations

Hierarchical linear models for the quantitative integration of effect sizes in single-case research

Library of Congress Cataloging-in-Publication Data

WU Weiterbildung. Linear Mixed Models

Nonlinear multilevel models, with an application to discrete response data

Summary of the model specified

Consistent Parameter Estimation for Lagged Multilevel Models

Designing Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D.

DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective

THE EFFECTS OF MULTICOLLINEARITY IN MULTILEVEL MODELS

Follow this and additional works at:

IMPACT OF NOT FULLY ADDRESSING CROSS-CLASSIFIED MULTILEVEL STRUCTURE IN TESTING MEASUREMENT INVARIANCE AND CONDUCTING MULTILEVEL MIXTURE MODELING

Modelling heterogeneous variance-covariance components in two-level multilevel models with application to school effects educational research

Efficient Analysis of Mixed Hierarchical and Cross-Classified Random Structures Using a Multilevel Model

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

A multivariate multilevel model for the analysis of TIMMS & PIRLS data

Multilevel Modeling: A Second Course

Perugia, 13 maggio 2011

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

Time-Invariant Predictors in Longitudinal Models

Summary of the model specified

Incorporating Cost in Power Analysis for Three-Level Cluster Randomized Designs

Introduction. Dottorato XX ciclo febbraio Outline. A hierarchical structure. Hierarchical structures: type 2. Hierarchical structures: type 1

Longitudinal Modeling with Logistic Regression

Estimation and Centering

A Comparison of Multilevel Logistic Regression Models with Parametric and Nonparametric Random Intercepts

Hierarchical Linear Modeling. Lesson Two

Dyadic Data Analysis. Richard Gonzalez University of Michigan. September 9, 2010

TABLE OF CONTENTS INTRODUCTION TO MIXED-EFFECTS MODELS...3

Handbook of Multilevel Analysis

Research Design: Topic 18 Hierarchical Linear Modeling (Measures within Persons) 2010 R.C. Gardner, Ph.d.

Measurement of Academic Growth of Individual Students toward Variable and Meaningful Academic Standards 1

Logistic And Probit Regression

Statistical Power and Autocorrelation for Short, Comparative Interrupted Time Series Designs with Aggregate Data

Describing Change over Time: Adding Linear Trends

Geoffrey Woodhouse; Min Yang; Harvey Goldstein; Jon Rasbash

From Micro to Macro: Multilevel modelling with group-level outcomes

Testing and Interpreting Interaction Effects in Multilevel Models

AN INVESTIGATION OF THE ALIGNMENT METHOD FOR DETECTING MEASUREMENT NON- INVARIANCE ACROSS MANY GROUPS WITH DICHOTOMOUS INDICATORS

The linear mixed model: introduction and the basic model

Multilevel bootstrap analysis with assumptions violated

STATS DOESN T SUCK! ~ CHAPTER 16

Two-Level Models for Clustered* Data

Utilizing Hierarchical Linear Modeling in Evaluation: Concepts and Applications

Multilevel modeling and panel data analysis in educational research (Case study: National examination data senior high school in West Java)

Prediction in Multilevel Models

Introduction to Within-Person Analysis and RM ANOVA

Modeling Heterogeneity in Indirect Effects: Multilevel Structural Equation Modeling Strategies. Emily Fall

MODEL IMPLIED INSTRUMENTAL VARIABLE ESTIMATION FOR MULTILEVEL CONFIRMATORY FACTOR ANALYSIS. Michael L. Giordano

Technical Appendix C: Methods. Multilevel Regression Models

Transcription:

ROBUSTNESS OF MULTILEVEL PARAMETER ESTIMATES AGAINST SMALL SAMPLE SIZES Cora J.M. Maas 1 Utrecht University, The Netherlands Joop J. Hox Utrecht University, The Netherlands In social sciences, research questions often refer to hierarchically structured data. For instance, the achievements of pupils in classes (in schools) are studied, or the work satisfaction of employees in organizations. The main problem of studying such hierarchical systems, which have more than one level, is the dependence of the observations at the lower levels. Multilevel analyzing programs account for this dependence and in recent years these programs are widely accepted. In this paper we will discuss the influence of different circumstances on the robustness of the parameter estimates (regressioncoefficients and the variance) in two level situations. A simulation study is used to determine the influence of small sample sizes at both level one and level two, and different variance distributions between the levels (the so called intraclass correlation ). Key words: Multilevel analysis, robustness of parameter estimates INTRODUCTION Social science often studies systems with a hierarchical structure. For instance, the achievements of pupils in classes (in schools) are studied, or the work satisfaction of 1 E-mail: C.Maas@fss.uu.nl and J.Hox@fss.uu.nl 1

employees in organizations. This hierarchical structure of the data creates problems, because the observations at the lower levels are not independent. Multilevel analysis techniques have been developed for the linear regression model, which account for this dependence (Bryk & Raudenbush, 199; Goldstein, 1995), and specialized software is now widely available (Bryk, Raudenbush & Congdon, 1996; Rasbash & Woodhouse, 1995). In this paper we will discuss the influence of different circumstances on the robustness of the parameter estimates in two level situations. Three conditions are varied in the simulation: (1) Number of Groups (3 conditions: 30, 50 and 1), () Group Size (3 conditions: 5, 30 and 50) and (3) Intraclass Correlation (3 conditions: low: 0.1, medium: 0. and high 0.3). The multilevel model We will use a simple two level model, for data obtained from N individuals, nested within J groups, each containing N j individuals. At the individual level we specify one explanatory variable (X ) and at the group level also one explanatory variable (Z j ). The following model is specified: = γ + γ 10 X + γ 01Z j + γ 11Z j X + u1 j Y X + u + e oj (1.1) The u-terms u 0j and u 1j are residual error terms at the group level, with variance and 11. The covariance between the u-terms is 01. The e is the residual error at the indivudual level, with variance e. In the above specified model four fixed parameters have to be estimated (the γ-coefficients) and four random parameters (the variance components). If we delete all explanatory variables of equation (1.1), we get the 'intercept-onlymodel, as specified in equation (1.) Y γ + u + e = oj (1.) This equation is used to calculate the intraclass correlation (ICC), that is the estimated proportion of group level variance compared to the estimated total variance: ICC = + / e (1.3)

Sample sizes In hierarchically nested data, we have two sample sizes. First the Group Size (GS), that means the number of N j indivuduals in group J, and second the Number of Groups (NG), that is the number of J groups. Both conditions have influence on the estimates of the above specified parameters. For the estimation of the fixed parameters and their standard errors a large nuber of groups appears more important than a large number of individuals per group (Van der Leeden & Busing, 1994; Mok,1995; Snders & Bosker, 1994). The estimates of the random parameters and their standard errors at the lowest level are generally accurate. The group level variance components are generally underestimated. For accurate estimates many groups (more than 1) are needed (Busing, 1993; Van der Leeden & Busing, 1994; Afshartous, 1995). Kreft (1996) suggests a rule of thumb wich she calls the '30/30' rule. To be on the safe side, with respect to the estimates of all the parameters and their standard errors, a sample of at least 30 groups with at least 30 individuals per group is necessary. The intraclass correlation Another interesting question concerns the influence of the intraclass correlation on the estimation of the parameters, although the intraclass correlations can't be influenced by the researcher, which was the case in the above discussed influence of the sample sizes. Simulation studies by Muthén, Wisnicky and Nelson (1991) and Hox and Maas (1) in the context of multilevel structural equation modeling (SEM) suggest that the intraclass correlation also affects the accuracy of the estimates. METHOD 3

The Simulation Model We used the above specified simple two level model (equation 1.1) for the simulation study. The only exception is that there was no covariance specified at the second level. Simulation Procedure We will look at three conditions in this simulation: (1) Number of Groups (3 conditions: 30, 50 and 1), () Group Size (3 conditions: 5, 30 and 50) and (3) Intraclass Correlation (3 conditions: low: 0.1, medium: 0. and high 0.3). The number of groups (30-50-1) is chosen so that the highest number is almost the maximum achievable number. Only in longitudinal research or large-scale research more than 1 units at the second level are usual. In practice, 50 groups is a frequently occuring number. Although, according to Busing (1993) this is possibly too low, Kreft (1996) argued that 30 groups is the absolute minimum. In longitudinal research, the lowest level is the number of repeated measures. A number of 5 often occurs. Also in family research, the number of 5 units at the lower level is quite normal (parents with 3 children). Therefore, the lowest group size is 5. In educational research, a group size of 30 is normal. (See also the 30/30 rule of Kreft 1996). The highest number, a group size of 50, must be sufficient. In educational research, most ICC's are below 0.0. However, in family research, or when group chararchteristics such as sociometric status are studied, ICC's above 0.33 do occur. There are 3x3x3=7 conditions. For each condition, we generate 10 data sets of two variables. Both the lower level variable X, as the second level variable Z, are randomly drawn from a normal distribution. The mulilevel program MLn (Rasbach & Woodhouse, 1995) is used to specify the model and to generate the simulation data. First the intercept and the crosslevel interaction between the variabels X and Z are computed. The regression coefficients are specified as follows: 1. for the intercept and the medium effect of.3 for the other regression coefficients (Cohen, 1988). The random 4

variance of the intercept at the lowest level ( ) is.5. The random variance of the intercept at the second level ( e ) follows from the specification of the ICC and e, given formula (1.3). Busing (1993) shows that both the pattern and the size of the and the 01 (random variances of the variables of the lower level) are the same. So, the same value for 01 is used as for and 01 is set to zero.. To keep a relative simple model, the covariance between The foregoing sumulation decisions are summarized in table 1. Table 1 Simulation decisions Assumption Formula Fixing X ~ N(0,1) Z j ~ N(0,1) e = 0.5 X Z j e ICC = (0.1, 0., 0.3) ICC = /( + e ) = 01 NG = (30, 50, 1) GS = (5, 30, 50) 01 NG GS Two different estimations methods are used in the multilevel software, the 'Restricted Iterative Generalized Least Squares' (RIGLS) estimation method and the Iterative Generalized Least Squares (IGLS) estimation method. In this paper the RIGLS method is used, because this method is always as good as the IGLS method and sometimes better (Browne, 1998). RESULTS Analysis 5

The relative bias and the 'Mean squared error' (MSE) are commonly used measures of the accuracy of the parameter estimates. Let θˆ be the estimation of the population parameter θ. Then the percentage relative bias is given by: θ θˆ. The MSE is given by: ( θˆ θ ). In addition we also present the observed coverage of the 95% confidence interval. Relative bias of the parameter estimates The fixed parameter estimates have an overall relative bias of 0.0%, with no differences across the conditions. There are no significant effects of the specified conditions (α = 0.01). In table the overall relative bias for the fixed effects (regression coefficients) in the various conditions are presented. The estimated mean bias in all the conditions is 1.. The mean estimates of the parameters are thus almost perfect. Table Relative bias of fixed parameters Number Group ICC of Groups Size Low (0.1) Medium (0.) High (0.3) 30 5.998 1.7.99 30 1.1 1. 1.3 50.998 1.0 1.3 50 5 1.3 1.0 1.0 30 1.1 1. 1.1 50 1.0 1..997 1 5 1.0.999.998 30 1.1 1.0 1.1 50 1.1.999 1.0 The estimates of the random parameters at the second level have also an overall relative bias of 0.0% (no significant effects). Tabel 3 shows the overall relative bias. The mean estimates show some deviations of 0.01% and one of 0.04%. So, also the mean estimates of the random effects of the second level are almost perfect. 6

Table 3 Relative bias of random parameters at level Number Group ICC of Groups Size Low (0.1) Medium (0.) High (0.3) 30 5 1.037 1.9.99 30 1.010 1.4.993 50 1.4 1.5.994 50 5 1.8.997.994 30.997.991 1.011 50 1.6 1.010 1.0 1 5 1.4.998 1.6 30 1.010.997 1.3 50.997 1.0 1.1 The estimates of the random parameter of the lowest level has an overall relative bias of 0.0%. There is a significant effect of the Group Size (p = 0.5). In Table 4 the relative bias of this variance component is presented. In table 5 the overall relative bias for all the conditions are presented. The differences in the relative mean bias for the different conditions in table 4 are less than 0.01%. In table 5, the mean estimates show some deviations of 0.01%. Concluding, we can say that the relative bias of the parameter estimates is negligible. Table 4 Relative bias of random parameter at level 1 with respect to the number of persons per group Group Size rel. bias 5.997 30 1.0 50 1.0 Table 5 Relative bias of the random parameter at level 1 Number Group ICC of Groups Size Low (0.1) Medium (0.) High (0.3) 30 5.990.998.998 30.998 1.0.999 50 1.0 1. 1.0 7

50 5.998 1.5.995 30 1.1 1.1 1.1 50.999 1.0.999 1 5.997.995.998 30 1.0 1.0 1.0 50 1.1 1.1 1.1 Absolute bias of the parameter estimates The analyses of the absolute bias of the parameters show different results than the above presented relative bais. With respect to the estimates of the fixed parameters, the overall absolute bias is 0.0%, but all the conditions, and interactions between them, have significant effects. Though, the size of the absolute bias is very little. Only the signficiant effects of the ICC are of interest, because the effects of Number of Groups and Group Size are simply a reflection of less sampling variability due to larger sample sizes. In table 6, the absolute bias for different values of the ICC is presented. The differences between the ICC-conditions are less than 0.01. The interactions of the ICC with the Number of Groups and Group Size have also differences between conditions less than 0.01. Table 6 Absolute bias of the ICC ICC abs. bias low 0.09 medium 0.035 high 0.041 95% confidence intervals For each parameter is counted how many times the confidence intervals cover the true popopulation value of the parameter. Most of the random parameters are affected by the 8

Number of Groups and the Size of the Group. Some of the fixed parameters are affected by the Number of Groups too. Table 7 represents the influence of the Number of Groups and table 8 the influence of the Group Size. Table 7 Influence of the Number of Groups on the non-covarage of the 95% confidence interval Number of Groups 30 50 1 U0 0.089 0.074 0.060 U1 0.088 0.07 0.057 E0 0.058 0.056 0.049 INT 0.064 0.057 0.053 X 0.064 0.057 0.05 Table 8 Influence of the Group Size on the non-covarage of the 95% confidence interval Group Size 5 30 50 U1 0.074 0.075 0.074 E0 0.061 0.051 0.051 Concluding we can state, that only when the Number of Groups is 30, the covarage of the 95% confidence interval of the variance components of the second level is really too small (almost 9% outside the interval). All the other effects are of minor importance. DISCUSSION The conditions that were varied in this simulation have very little impact on the accuracy of the estimates of the fixed parameters. Only the confidence intervals and hence the significance tests of the random parameters at the second level are not accurate when the number of groups is as small as 30. Almost 9 percent of the random parameter estimates of the second level lay outside the 95% percent confidence interval when the number of 9

groups is 30. We conclude therfore that the 30/30 rule of Kreft (1996) only holds with respect to the fixed parameters and the random parameter of the lowest level and not with respect to the significance tests of the random parameters of the second level. A remarkable result of this study is further that all simulations did converge. This is in contradiction with Busing (1993), who found nonconvergence especially in conditions with 10 to 50 groups and in conditions with 5 or 10 individuals in the groups. Also in contradiction with Leeden & Busing (1994) are the surprisingly accurate estimates. Are these the results of better software? Another explanation could be that these results are found because we have a model with a relatively high percentage explained variance at the lower level (about 35 percent). However, when we repeated the simulations with only about 10 percent explained variance, almost the same results were found. A final remark we want to make is about the estimation procedure. The simulations are done with the RIGLS-estimation procedure. This procedure gives always better (or the same) results as the IGLS-estimation procedure. So the results can not be generalized to IGLS-estimations. 10

References Afshartous, D. (1995): Determination of Sample Size for Multilevel Model Design. Paper AERA Conference, San Francisco, 18- april 1995. Browne, (1998) Bryk, A.S. & Raudenbush, S.W. (199). Hierarchical Linear Models. Newbury Park, CA: Sage. Busing, F. (1993): Distribution Characteristics of Variance Estimates in Two-level Models. Department of Psychometrica and research Methodology, Leiden University, Leiden. Bryk, A.S., Raudenbush, S.W., & Congdon, R.T. (1996). HLM. Hierarchical Linear and Nonlinear Modeling with the HLM/L adn HLM/3L Programs. Chicago: Scientific software International. Cohen, J. (1988). Statistical Power Analysis. Hillsdale, New Jersey, Erlbaum. Goldstein, H. (1995). Multilevel Statistical Model. London: Griffin. Hox, Joop J. & Maas, Cora J.M. (1). Multilevel SEM with Pseudobalanced Groups. Structural Equation Modeling. Kreft, Ita, G.G. (1996). Are Multilevel Techniques Necessary? An Overview, Including Simulation Studies. California State University, Los Angeles. Mok, M. (1995): 'Sample size requirements for -level designs in educational research'. Multilevel Modeling Newsletter, 7 (), 11-15. Muthén, B., Wisnicky, K.S. & Nelson, G. (1991). Estimation of covariance structure models with multilevel (clustered) data. Paper presented at the annual meeting of the American Educational Research Association, Chicago, April 3-7, 1991. Rasbash, J. & Woodhouse, G. (1995). MLn Command Reference. London: Institute of Education, University of London. Snders, T.A.B. & Bosker, R. (1994): Modeled Variance in Two-level Models. Journal of Educational Statistics, 18, 73-59. Van der Leeden, R. & Busing, F. (1994). First Iteration versus IGLS/RIGLS Estimates in Two-level Models: a Monte Carlo Study with ML3. Department of Psychometrica and research Methodology, Leiden University, Leiden. 11