Introduction. Dottorato XX ciclo febbraio Outline. A hierarchical structure. Hierarchical structures: type 2. Hierarchical structures: type 1

Size: px
Start display at page:

Download "Introduction. Dottorato XX ciclo febbraio Outline. A hierarchical structure. Hierarchical structures: type 2. Hierarchical structures: type 1"

Transcription

1 Outline Introduction to multilevel analysis. Introduction >. > Leonardo Grilli 3. Estimation > 4. Software & Books > Web: Department of Statistics G. Parenti - University of Florence L. Grilli - Introduction to multilevel analysis - April 6 Introduction A hierarchical structure Multilevel structures Vocabulary Aggregated vs. disaggregated analysis Random effects ANOVA Design effect Relationships within and between clusters district level 4 school level 3 school class class level class 3 class 4 s s s3 s4 s5 s6 s7 s8 s9 s0 s s level - students L. Grilli - Introduction to multilevel analysis - April 6 4 Hierarchical structures: type Hierarchical structures: type Units within clusters pupil, class, school patient, doctor, hospital worker, firm individual, family, region interviewed, interviewer Often the sampling design reflects the hierarchical structure (multi-stage sampling), but this is not necessary!! Multiple responses (level : responses, level : individuals) Multivariate data Longitudinal data (panel, repeated measurements) Remark: a hierarchical structure may combine multiple responses and true clusters: e.g. questionnaire on the students -> item, student, school L. Grilli - Introduction to multilevel analysis - April 6 5 L. Grilli - Introduction to multilevel analysis - April 6 6 Dottorato XX ciclo febbraio 6

2 Beyond standard hierarchical structures Which is the relevant structure? cross-classified structures, e.g. pupil classified by quarter and school Usually it is assumed that each level k unit belongs to one and only one level k+ unit. Otherwise -> multiple membership models e.g. pupils changing their school during the observation interval L. Grilli - Introduction to multilevel analysis - April 6 7 Usually the phenomenon under study can be modelled through several alternative structures: e.g. Pupil, class Pupil, class, school Pupil, school Pupil, teacher Pupil, (school by quarter) The structure to be used in the analysis depends on the aims of the research Even if a complex structure may appear more realistic, for most research purposes a simple structure with or 3 levels is enough L. Grilli - Introduction to multilevel analysis - April 6 8 Vocabulary Vocabulary Populations a) Hierarchical b) Nested c) Cross-classified d) Multilevel Ian Plewis (997) on Multilevel Modelling Newsletter, Vol. 9 No. a) and b) are interchangeable d) incorporates each of a), b) and c) L. Grilli - Introduction to multilevel analysis - April 6 9 Models Multilevel (MLM) Hierarchical linear (HLM) Mixed linear Random coefficients Random effects Subect/unit specific Random intercept Variance components L. Grilli - Introduction to multilevel analysis - April 6 0 Two types of macro (cluster) variables Relationships between levels GLOBAL: feature of the macro unit (cluster) with no corresponding micro level measure Example: public/private school, number of teachers CONTEXTUAL: macro indicator obtained through aggregation of micro level measures (summary of the features of the micro units) Example: average class size, proportion of females, average grade Z Z X Z X macro: level, e.g. school macro-micro relationship micro: level, e.g. pupil adusted macro-micro relationship cross-level interaction X Z micro-macro relationship L. Grilli - Introduction to multilevel analysis - April 6 L. Grilli - Introduction to multilevel analysis - April 6 Dottorato XX ciclo febbraio 6

3 The unit of analysis dilemma Problems with aggregated analysis One can choose to analyse the data at Individual level (e.g. pupil) -> disaggregated analysis Cluster level (e.g. school) -> aggregated analysis (the aggregated analysis entails reducing the dataset by taking the cluster means) Both choices lead to several problems Shift of meaning: the macro variables obtained through aggregation refer to the cluster (not the individual) -> they cannot be used to investigate micro level relationships Ecological fallacy (aggregation bias): macro level relationships micro level relationships Interactions between levels: an aggregated analysis precludes the study of the relationships between levels L. Grilli - Introduction to multilevel analysis - April 6 3 L. Grilli - Introduction to multilevel analysis - April 6 4 Problems with disaggregated analysis Which type of model? Inference on the clusters: impossible to make inference on the clusters (i.e. treat the clusters as a random sample from a population of clusters) Wrong sample size for the macro variables (their sample size should be the number of clusters) Dependence: the units of the same cluster are alike with a positive within cluster correlation the independence assumption of many standard models (e.g. GLM) is violated biased standard errors (often underestimated, leading to type I error rates higher than the nominal level α) Basic question: the hierarchical structure (along with the relationships within and between levels and the associated correlation structure) is of primary interest for the research? es, it is of primary interest multilevel models No, it is merely a nuisance (e.g. the sampling design is multistage but interest is limited to individual level relationships) methods able to correct the standard errors: GEE, robust (sandwich) estimation of the covariance matrix of the estimators L. Grilli - Introduction to multilevel analysis - April 6 5 L. Grilli - Introduction to multilevel analysis - April 6 6 One-Way random effects ANOVA i.e. the simplest multilevel model One-Way random effects ANOVA i.e. the simplest multilevel model ε, E( ε ) = 0, Var( ε ) = σ i i i ε u i i, = µ + u + ε i i =,, clusters (in ANOVA terminology "levels of the factor") i =,, n units in cluster N = n total sample size u, E( u ) = 0, Var( u ) = = Be aware: is the variance, not the standard deviation L. Grilli - Introduction to multilevel analysis - April 6 7 Useful to think in terms of a two-stage data generation process: = µ + u + ε i i ) sampling cluster -> take a realization of u from its distribution (so cluster has a mean of µ+u ) ) sampling unit i within cluster -> take a realization of ε i from its distribution (so unit i of cluster has value µ+u +ε i ) L. Grilli - Introduction to multilevel analysis - April 6 8 Dottorato XX ciclo febbraio 6 3

4 One-Way random effects ANOVA: variances and covariances One-Way random effects ANOVA: covariance matrix Var( ) = Var( u + ε ) = + σ i i = µ + u + ε i i 0 if Cov( i, i ) = if = and i i' Variance of i decomposed in two components: cluster (between) level + individual (within) level Observations belonging to the same cluster are positively correlated Remark: the correlation is necessarily positive since it is generated by a shared latent variable u (it is the same basic idea of factor models, where u is called factor indeed the GLLAMM class of Rabe-Hesketh and Skrondal includes both multilevel and factor models as special cases) L. Grilli - Introduction to multilevel analysis - April 6 9 Example =, n =, n = 3 + σ + σ Var( ) = + σ + σ + σ Block diagonal structure (empty space stands for zero) L. Grilli - Introduction to multilevel analysis - April 6 0 One-Way ANOVA: fixed or random effects? One-Way random effects ANOVA: intraclass correlation coefficient i = µ + α + εi parameters Fixed effects ANOVA when the clusters (levels of the factor) are few and represent an exhaustive classification distributional assumptions are avoided, but impossible to generalize the results to a popoulation of clusters = µ + u + ε i i random variables Random effects ANOVA when the clusters (levels of the factor) are a sample from a population of clusters or anyway when the clusters are many parisimonious description of the observed variability among clusters + generalizability L. Grilli - Introduction to multilevel analysis - April 6 ρ denotes the ICC (intraclass correlation coefficient) cluster variance ρ = Corr( i, i ) = = ρ 0, + σ total variance ρ is a measure of the degree of homogeneity of units belonging to the same cluster when ρ is high it is crucial to adopt an estimation method able to account for the correlation [ ] L. Grilli - Introduction to multilevel analysis - April 6 The consequences of correlation The design effect N µ σ,, (, ) N identically distributed N = i is an estimator of µ N i= ( ) E = µ unbiased regardless of correlation BUT Var σ N ( ) = + Cov( i, i' ) precision of depends on correlation σ Remark : for positive correlations Var ( ) > N i i' L. Grilli - Introduction to multilevel analysis - April 6 3 For simple random sampling (SRS), the variance of the sample mean is σ SRS / N For a design other than SRS, the theory of survey sampling defines deff = σ design / σ SRS (usually deff >) effective sample size = N / deff So the effective sample size is the sample size that under SRS would lead to the same variance of the sample mean L. Grilli - Introduction to multilevel analysis - April 6 4 Dottorato XX ciclo febbraio 6 4

5 The design effect Relationships within and between clusters 8 In a balanced two-stage design ( clusters of size n, so N= n): 7 Example from Sniders & Bosker, p. 7 deff = + (n-) ρ where ρ is the ICC The loss of precision of a balanced two-stage design (w.r.t. SRS) depends on both ρ and n (but not on ) Within Total i X_i X_. _i _ Example: =0, n=0, ρ=0. deff=.9, so the nominal sample size is N= but the effective sample size is /.9 69! 0 Between Difference Between-Within: Ecological fallacy L. Grilli - Introduction to multilevel analysis - April 6 5 L. Grilli - Introduction to multilevel analysis - April 6 6 Relationships within and between clusters Relationships within and between clusters = x i = 8..x.. i i =. +.( xi x. ) i = 8..x. +.( xi x. ) Total Between cluster means Within clusters Multilevel ( ) X β = η β + η β total X between cluster means within clusters SSBX η X = SSTX Remark : β total correlation ratio is a linear combination The multilevel regression model allows to study between and within relationships at the same time L. Grilli - Introduction to multilevel analysis - April 6 7 L. Grilli - Introduction to multilevel analysis - April 6 8 Motivating example: school effectiveness (one covariate at level ) (one covariate at level + one covariate at level ) Centering the covariates Issues in the random slope model Example: school effectiveness Levels: pupils (level ); schools (level ) Response variable : score on the final test Explanatory variable (at level ) X: score on the initial test First consider a single school: = β + β x + ε i 0 i i ε i ~ N(0, σ ) L. Grilli - Introduction to multilevel analysis - April 6 30 Dottorato XX ciclo febbraio 6 5

6 Example: school effectiveness (one covariate at level ) Sample of schools (from a population of schools) Comparison between schools A and B School A more effective (higher predicted for all the range of X) School A more equitable (lower slope) A Level model Equation for the -th school: i = β0 + βx i + εi ε ~ N(0, σ ) i effectiveness : β > β equity : β < β 0A 0B A B B X Remark: each school has its own slope and intercept L. Grilli - Introduction to multilevel analysis - April 6 3 L. Grilli - Introduction to multilevel analysis - April 6 3 (one covariate at level ) (one covariate at level ) Each school has a couple of parameters ( β, β ) 0 Assumption: the parameters ( β0, β) are random variables with a bivariate Normal distribution in the population of schools β N γ 0 0, β γ 0 ( 0, ) β β independent from ε i Other distributions are possible, but the Normal has nice properties and works well in many cases Model parameters γ mean intercept γ 0 mean slope Intercept variance Slope variance 0 Slope-intercept covariance Residual variance (level ) σ Fixed parameters Random parameters L. Grilli - Introduction to multilevel analysis - April 6 33 L. Grilli - Introduction to multilevel analysis - April 6 34 (one covariate at level ) (one covariate at level ) Correlation between slopes and intercepts: 0 corr( β0, β ) = β 0 Example of negative correlation Level model: Level model: Combined model: = β + β x + ε i 0 i i β = γ + u β = γ + u = γ + γ x + u x + u + ε i 0 i i 0 i β Fixed part Random part L. Grilli - Introduction to multilevel analysis - April 6 35 L. Grilli - Introduction to multilevel analysis - April 6 36 Dottorato XX ciclo febbraio 6 6

7 (one covariate at level ) (one covariate at level ) Level errors (random effects): u u = β γ Var( u ) = = β γ Var( u ) = Random effect = unexplained deviation of the value of the parameter in the -th cluster from the mean value in the population The covariates may contribute to explain the deviations (so reducing the corresponding variances) Usually the distributional assumptions are made on the random effects (and not on the random intercepts/slopes): u 0 0, 0 u0 N u 0 εi u indep. from L. Grilli - Introduction to multilevel analysis - April 6 37 The total error is 0 i i implying heteroschedasticity: u + u x + ε Var( x ) = + x + x + σ i i 0 i i non-homogeneous correlation among the responses of the units of the same cluster : Cov(, x, x ) = + ( x + x ) + x x i i' i i' 0 i i' i i' L. Grilli - Introduction to multilevel analysis - April 6 38 (one covariate at level ) (one covariate at level ) Var( x ) = + x + x + σ i i 0 i i The variance function is a parabola with minimum in 0 / Depending on the range of x, in a given application the variance function can be descending, ascending or U-shaped (but never shaped!) Var( x ) i 0 / L. Grilli - Introduction to multilevel analysis - April 6 39 i x i What about estimating the parameters γ and γ 0 using OLS (Ordinary Least Squares) or ML under the standard assumptions of the linear model? = γ + γ x + ε * i 0 i i ε = u + u x + ε * The model is linear but the error term i 0 i i violates the homoschedasticity and uncorrelatedness assumptions Inefficient estimators Biased standard errors L. Grilli - Introduction to multilevel analysis - April 6 40 (one covariate at level + one covariate at level ) (one covariate at level + one covariate at level ) Introduction of level covariates: level covariates represent features of the clusters useful to define a model for the level parameters ( β0, β ) and so reduce the level variances (, ) Example: W is a binary variable coded =public school; 0=private school Level model: i 0 i i Level model: Combined model: = β + β x + ε β = γ + γ w + u β = γ + γ w + u = γ + γ w + γ x + γ w x i 0 0 i i 0 i i Here it becomes clear why the γ have a double index Fixed part + u + u x + ε Random part L. Grilli - Introduction to multilevel analysis - April 6 4 L. Grilli - Introduction to multilevel analysis - April 6 4 Dottorato XX ciclo febbraio 6 7

8 (one covariate at level + one covariate at level ) (one covariate at level + one covariate at level ) Level model: β0 = γ + γ0w + u0 β = γ0 + γw + u γ 0 γ u 0 u Var( u ) = Var( u ) = 0 mean difference in intercept between private and public school mean difference in slope between private and public school deviation of school from the corresponding mean intercept deviation of school from the corresponding mean slope Remark: the distributional assumptions on the random effects are the same as before, but now the variances have a different meaning (remind: the variances are residual w.r.t. to the model covariates) L. Grilli - Introduction to multilevel analysis - April 6 43 In the combined model there is a cross-level interaction wx It arises because the level coefficient depends on the level covariate w A multilevel model can be written down in two ways: a) a single combined equation or, equivalently, b) a system of hierarchical equations. Who uses approach b usually ends up with a more complex model (notably, with more cross-level interactions) i β L. Grilli - Introduction to multilevel analysis - April 6 44 Model assumptions Centering a covariate Level Level εi ~ N(0, σ ) u 0 0, 0 N u 0 u u 0 indep. from εi The critical assumption is that E(u 0 )=0 and E(u )=0 for every value of the covariate, implying that E( x) is correctly specified The normality assumption is usually not so critical (especially with regard to level errors) A covariate can be centered w.r.t. a given constant, such as the grand mean: this affects the intercept γ (in a random slope model also the intercept variance and the intercept-slope covariance, see later on) the cluster mean (CM centering), so if the cluster means are different the centering varies from cluster to cluster: this affects the slope (total effect vs. within effect) L. Grilli - Introduction to multilevel analysis - April 6 45 L. Grilli - Introduction to multilevel analysis - April 6 46 Centering a covariate Centering a covariate Cronbach model: CM centering & cluster mean = γ + γ ( x x ) + γ x + u + ε i 0 i i = γ + γ x + u + ε Between slope = γ ( x x ) + ( ε ε ) i. 0 i. i. Within slope L. Grilli - Introduction to multilevel analysis - April 6 47 contextual model: no CM centering, but cluster mean = γ + γ x + γ x + u + ε i 0 i 0. 0 i replacing xi with ( xi x. ) + x. yields = γ + γ ( x x ) + ( γ + γ ) x + u + ε i 0 i i ust a reparametrization of the Cronbach model!! γ0 = γ0 = within slope γ = γ γ = between slope within slope L. Grilli - Introduction to multilevel analysis - April 6 48 Dottorato XX ciclo febbraio 6 8

9 Centering a covariate Centering a covariate raw covariate model: no CM centering, no cluster mean = γ + γ x + u + ε i 0 i 0 i = γ + γ x + u + ε = γ ( x x ) + ( ε ε ) i. 0 i. i. This model implicitly assumes that the between and within slopes are identical!! L. Grilli - Introduction to multilevel analysis - April 6 49 Summary: ) = + γ x + i total i ) = + γ x + ( γ γ ) x + i within i between within. 3) = + γ ( x x ) i within i. 4) = + γ ( x x ) + γ x + i within i. between. More details in Raudenbush & Bryk and Sniders & Bosker L. Grilli - Introduction to multilevel analysis - April 6 50 The fixed effects model (also called fixed effects ANCOVA) = γ + γ x + u + ε i 0 i 0 i replaced by parameters α so no distributional assumptions on u 0!!! In this way the level variability is completely explained -> impossible to add level covariates (multicollinearity) If many clusters -> too many parameters (inefficiency, overfitting) For a small cluster the α parameter is estimated with low precision Impossible to generalize to a population of clusters Remark: in the fixed effecs model the slope γ 0 is not the total effect, but the within effect (in panel data the corresponding estimator is known as the fixed effects estimator) L. Grilli - Introduction to multilevel analysis - April 6 5 Endogeneity Assume the true model is i = γ + γ 0xi + γ 0x. + u0 + εi where γ, i.e. the between and within slope are different 0 = 0 Assume the model is specified without the cluster mean = α + βx + u + ε * * i i 0 i ( ) u x u E u x * * 0 = γ = γ0. 0 Econometriacians say there is endogeneity, since the random effect is correlated with the covariate estimate of slope is biased L. Grilli - Introduction to multilevel analysis - April 6 5 Endogeneity Versions of the two-level model Solutions to endogeneity:. Replace the random effects with fixed effects. Keep the random effects but add the cluster mean in the equation The famous Hausman specification test is ust a test for the equality of between and within slopes, i.e. H0 : γ 0 = 0 Some econometriacians believe that when the Hausman test reects the null hypotesis one is forced to use solution. But also solution is feasible! L. Grilli - Introduction to multilevel analysis - April 6 53 i = β0 + βxi + εi β0 = γ + u0 β = γ0 + u T 0 = ε i ~ N(0, σ ) u0 0 0 ~ N, u 0 Variance matrix of the random effects Let us consider some special cases of T Remark: when one of the variances is null, also the covariance 0 is null L. Grilli - Introduction to multilevel analysis - April 6 54 Dottorato XX ciclo febbraio 6 9

10 Versions of the two-level model Versions of the two-level model Special case T=0 No variability among clusters Fixed coefficients, same as a standard regression model y x L. Grilli - Introduction to multilevel analysis - April 6 55 Special case 0 T = 0 The variance of the slope is null (and so is the slope-intercept covariance) The variance of the intercept does not depend on X (i.e. centering X is irrelevant) The cluster regression lines are parallel The clusters can be ranked Equi-correlation within clusters (so residual ICC is meaningful) y L. Grilli - Introduction to multilevel analysis - April 6 56 x Versions of the two-level model Versions of the two-level model Special case T singular (rank=) 0 = All the regression lines cross at a point x * If x * is outside the range of X megaphone configuration, so the clusters can be ranked If x * is inside the range of X radial configuration, so there are two specular rankings of the clusters depending on x>x * or x<x * y y X General case T full rank There are many crossing points The clusters cannot be ranked Heterogeneous correlation within clusters (no unique residual ICC) y x L. Grilli - Introduction to multilevel analysis - April 6 57 x L. Grilli - Introduction to multilevel analysis - April 6 58 Versions of the two-level model Unit of measure General case T full rank What it the unit of measure of the variances/covariances? The intercept variance ( ) and the slope-intercept covariance ( 0 ) depend on X and 0 refer to X=0 so it is useful to center X Since the origin of X is often arbitrary the covariance should not be constrained to be zero y x m = unit of measure of (e.g. Kilogrammes) m X = unit of measure of X (e.g. Metres) Then is expressed in (m ) is expressed in (m / m X ) 0 is expressed in (m ) / m X y x Be careful in the interpretation! L. Grilli - Introduction to multilevel analysis - April 6 59 L. Grilli - Introduction to multilevel analysis - April 6 60 Dottorato XX ciclo febbraio 6 0

11 Extensions of the model The three level variance component model Many covariates at both levels: X, X,,W, W, Complex error structure: At level : e.g. heteroschedasticity Var( εi ) = σ x At level : e.g. many random slopes More than two hierarchical levels But be aware that the imagination of the researchers can easily outrun the capacity of the data, the computer, and current optimization techniques to provide robust estimates (Di Prete & Forristal) L. Grilli - Introduction to multilevel analysis - April 6 6 i k =,,, K level 3 units (e.g. schools) =,, i =,, I k k k level units (e.g. classes) level units (e.g. pupils) [ ] = fixed part + v + u + ε ik k k ik v ~ N(0, σ ) u (3) ~ N(0, σ ) k ik () ε ~ N(0, σ ) () independence among levels L. Grilli - Introduction to multilevel analysis - April 6 6 Estimation Estimation in random effects ANOVA Shrinkage estimators and reliability Model specification and likelihood Type of inference: maximum likelihood vs. Bayesian Maximum likelihood: full information vs. restricted Hypotesis testing Diagnostics based on the residuals Comparing the residuals Parameter estimation i= γ+ γ0w+ γ0xi+ γwxi Fixed part + u + u x + ε Random part 0 i i Maximum likelihood step : estimation of fixed parameters (γ, γ 0, γ 0, γ ) and random parameters (σ,, 0, ) step : prediction ofrandom effects (u 0,u : =,,) Bayesian inference the parameters are random variables with a prior distribution no distinction between parameters and random effects L. Grilli - Introduction to multilevel analysis - April 6 64 One-way random effects ANOVA: estimation of γ One-way random effects ANOVA: estimation of γ = γ + u + ε i 0 i Cluster mean. = γ + u0 + ε. E( ) = γ Var( u ) = Var( εi ) = σ 0. σ (. ) = Var( u0 + ε. ) = + = n Var Each cluster mean is un unbiased estimator of γ, but it is inefficient Idea : estimate γ,, combining.. L. Grilli - Introduction to multilevel analysis - April 6 65 The minimum variance unbiased estimator of γ depends on the design (all the sample means Balanced design: n = n = have the same variance) γ =. = Unbalanced design: ˆ γ =. = = is the precision of. Remark: in the unbalanced case it is necessary to have an estimate of the best estimator depends on the variances L. Grilli - Introduction to multilevel analysis - April 6 66 Dottorato XX ciclo febbraio 6

12 One-way random effects ANOVA: estimation of the variance components One-way random effects ANOVA: estimation of the variance components SST = SSW + SSB n n ( i.. ) = ( i. ) + n (... ) = i= = i= = For simplicity consider only the balanced case n =n Two possible estimators of the variance components: n SSW SSB W = = ( i. ) B = = (...) N N = i= n( ) = S S L. Grilli - Introduction to multilevel analysis - April 6 67 S σ E( SW ) = σ E( SB) = + > n σ Var(. ) = + n Not all the variability of the sample cluster means. is due to the variability of the population cluster means µ + u B overestimates since it actually estimates Remark : this fact underlies the classical ANOVA F test ( H : = 0) ˆ ˆ B Un unbiased estimator of is = S ˆ σ ˆ < ˆ = 0 n Remark : can be negative! If S B then 0 ˆ σ n 0 L. Grilli - Introduction to multilevel analysis - April 6 68 oint estimation of fixed and random parameters To estimate the fixed parameters it is necessary to have an estimate of the random parameters However the converse is also true: to estimate the random parameters it is necessary to have an estimate of the fixed parameters In general it is necessary to rely upon iterative procedures Exception: balanced design (all clusters with the same size and the same matrix of covariates) In a balanced design there are closed-form estimators of the variance components (see the textbook by Searle, McCulloch and Casella) L. Grilli - Introduction to multilevel analysis - April 6 69 Effect of the covariates on the variances Consider starting with a random effects ANOVA model = µ + u + ε i 0 i Var( u ) = Var( ε ) = σ 0 both level and level covariates can be added: A level covariate reduces (or leaves unchanged) the level variance but (being constant within each cluster) cannot affect the level variance σ A level covariate reduces (or leaves unchanged) the level variance but its effect on the level variance is unpredictable σ L. Grilli - Introduction to multilevel analysis - April 6 70 i Effect of the covariates on the variances a level covariate by definition varies only between clusters purely between a level covariate can be written as the sum of: A purely between component Reduces and does not affect σ A purely within component Reduces σ and thus increases x i x x If a level covariate is purely within it increases Usually a level covariate varies both within and between the effect on is unpredictable (often reduces it) L. Grilli - Introduction to multilevel analysis - April 6 7 ˆ σ ˆ = SB n This effect is strong only if n is small Effect of the covariates on the variances General rule: a covariate at an arbitrary level does not affect the variances at lower levels reduces (or leaves unchanged) the variances at the same level has an unpredictable effect on the variances at higher levels L. Grilli - Introduction to multilevel analysis - April 6 7 Dottorato XX ciclo febbraio 6

13 One-way random effects ANOVA: prediction of β 0 or u 0 One-way random effects ANOVA: prediction of β 0 or u 0 From. = β0 + ε., with ε. ~ N(0, σ / n), it follows that. is an (unbiased) estimator of β0 with variance σ / n From β0 = γ + u0, with u0 ~ N(0, ), it follows that also ˆ γ is an estimator (biased) of β =. = = Remark: here unbiasedness is evaluated conditional on β 0 (i.e. hold u 0 and calculate E( ) under repeated sampling of ε i ) Good news: A linear combination of estimators and yields a new estimator better then both and in terms of MSE!!! 0 L. Grilli - Introduction to multilevel analysis - April 6 73 The best estimator of of estimators and λ β 0 is given by a linear combination EB β 0. ( ) ˆ = λ + λ γ the weight is a measure of the reliability of. as an estimator of β 0 The superscrip EB stands for Empirical Bayes since 0 is also the posterior mean of calculated by pluggin-in the ML estimates β 0 βˆ EB L. Grilli - Introduction to multilevel analysis - April 6 74 One-way random effects ANOVA: prediction of β 0 or u 0 The reliability coefficient EB β 0. ( ) ˆ = λ + λ γ EB EB u 0 β ˆ 0 (. ˆ γ λ γ ) = = shrinkage estimator OLS residual shrinkage factor Borrowing strength L. Grilli - Introduction to multilevel analysis - April 6 75 The coefficient in the linear combination in β is λ = In fact from the psychometric theory of tests (i=item, =individual) γ + u 0. + σ / n λ true score of individual observed score of individual λ = = + σ / n ˆ EB 0 known as the reliability coefficient variance of true scores λ varies from cluster variance of observed scores to cluster (as it depends also on the cluster size) L. Grilli - Introduction to multilevel analysis - April 6 76 Likelihood Likelihood The equation defining a random effects model includes the random effects, but they are not observable (so cannot appear in the likelihood) the random effects must be integrated out! f ( y u, θ) p( u θ) Distribution of responses, conditional on random effects and parameters Distribution of random effects, conditional on parameters L( θ) = f ( y u, θ) p( u θ) du Likelihood Problem: the integral has analytical solution only for conugate distributions (e.g. Normal-Normal, Binomial-Beta, ) L. Grilli - Introduction to multilevel analysis - April 6 77 Multiple random effects -> multivariate distribution -> the Normal distribution is preferable (and in fact it is the standard choice in applications) Linear model -> Normal-Normal (conugate) -> the integral has analytical solution Non linear model -> f ( y u, θ) -Normal (non conugate) -> the likelihood must be evaluated through approximate integration methods L. Grilli - Introduction to multilevel analysis - April 6 78 Dottorato XX ciclo febbraio 6 3

14 Maximum Likelihood Bayesian inference θ ML = arg max L( θ) = f ( y u, θ) p( u θ) du Once the ML estimates of fixed and random parameters have been obtained -> Empirical Bayes estimation of the random effects p( u y, θ ML) = θ f ( y u, θ ) ( ML p u θ ML ) f ( y u, θ ) ( ML p u θ ML ) du For Bayesian inference a prior distribution p( θ) and the oint posterior distribution is used p( u, θ y ) = Inference on the parameters Inference on the random effects f ( y u, θ) p( u θ) p( θ) f ( y u, θ) p( u θ) p( θ) dudθ is defined p( θ y ) = p( θ, u y ) du p( u y) = p( θ, u y ) dθ L. Grilli - Introduction to multilevel analysis - April 6 79 L. Grilli - Introduction to multilevel analysis - April 6 80 Bayesian inference Hierarchical specification of the model (useful to write down the likelihood) Even for the linear model it is necessary to use approximate integration algorithms (e.g. Gibbs sampling) The Bayesian approach has some well-known pros and cons An important advantage in complex multilevel models is that the estimates properly account for all the uncertainty -> Bayesian methods yield good estimates of the variance components and confidence intervals with appropriate coverage even in highly complex models, where ML methods show a poor performance L. Grilli - Introduction to multilevel analysis - April 6 8 The simple random intercept model = γ + γ x + u + ε i 0 i 0 i can also be written hierarchically (which is the standard way to specify multilevel GLMs) Independence stems from. i xi, u0 N ( γ + γ0xi + u0, σ ) conditioning on the random effects. u N 0, ( ) 0 Remark: in the hierarchical specification the level errors ε i are not written (only their variance) L. Grilli - Introduction to multilevel analysis - April 6 8 Hierarchical construction of the likelihood for the random intercept model Maximum Likelihood The likelihood can be written in steps by exploiting the conditional independencies shown by the hierarchical formulation n L ( ψ u ) = L ( ψ u ) by conditional indep. given u 0 i 0 0 i= L ( θ) = L ( ψ u ) p( u ) du integrating out the random effect u L ( ) ( γ γ0 σ ) N( γ + γ0xi + u0 σ ) = param. of random effects θ= ( ψ, ) where ψ =,, all other parameters density of, Li ( ψ u0 ) = for fixed u evaluated at the observed y u ( θ) = L ( θ) by independence among clusters = L. Grilli - Introduction to multilevel analysis - April i Full Information Maximum Likelihood (FIML) Full likelihood, oint estimation of fixed and random parameters Underestimates the random parameters since it treats the fixed parameters as known quantities (ignoring degrees of freedom) Restricted Maximum Likelihood (REML) The random parameters are estimated by maximizing the restricted likelihood, i.e. the density of the residuals The random parameters are well estimated even in small samples L. Grilli - Introduction to multilevel analysis - April 6 84 Dottorato XX ciclo febbraio 6 4

15 Maximum Likelihood Maximum Likelihood To understand why FIML underestimates the random parameters remind what happens in standard linear regression i = β ' Xi + εi Var( εi)= σ εi = ' ( i β X ) i σ FIML = εi σ OLS = εi E σ OLS = σ n n k i If k is large w.r.t. n, then σ FIML is severely downward biased (it does not correct for the degrees of freedom lost in estimating β) In a multilevel model the sample size which is relevant for the estimation of is, i.e. the number of clusters i L. Grilli - Introduction to multilevel analysis - April 6 85 In a two-level model, REML and FIML lead to: Similar estimates for σ Discordant estimates for the parameters of the random effects if is small (in such a case FIML estimates of variances are lower) Unless the main aim is the estimation of the random parameters, FIML is preferred because: FIML estimators have a lower sampling variance (the comparison in terms of MSE is often in favour of FIML estimators) With FIML the LRT (Likelihood Ratio Test) can be used not only to test the random parameters, but also to test the fixed parameters L. Grilli - Introduction to multilevel analysis - April 6 86 FIML algorithms Properties of FIML estimators i) Iterative Generalized Least Squares (Goldstein 986) ii) Fisher Scoring (Longford 987) Under mild regularity conditions FIML estimators have good asymptotic properties: Consistency Normality Efficiency iii) EM (developed by Dempster, Laird and Rubin for models with missing data, later applied to multilevel models since the random effects are, in a sense, missing data) Remark: here asymptotic requires increasing the number of clusters (increasing the cluster sizes is not enough), so is the key quantity for asymptotics L. Grilli - Introduction to multilevel analysis - April 6 87 L. Grilli - Introduction to multilevel analysis - April 6 88 Hypotesis testing on a single fixed parameter Hypotesis testing on a set of fixed parameters Null hypotesis: H 0 : γ h =0 Wald test statistic: γ h T ( γ h) = se..( γ ) approx units)-#(covariates)- if γ h coeff. covariate at level ( γ h) t d.f. {#(level #(clusters)-#(covariates at level )- if γ h coeff. covariate at level A caveat: with few clusters the standard errors are usually underestimated the test reects the null hypothesis too often (i.e. the type I error rate is higher the nominal level) T h L. Grilli - Introduction to multilevel analysis - April 6 89 Null hypotesis: H 0 : C γ = 0 (C = matrix of contrasts with k rows) ˆ T Q( C' γ) = ( C' γˆ) ΣC' γ ( C' γγ ˆ) ˆ Wald test statistic: Q( C' γ) approx χk Remark: with few clusters the F distribution is preferable Alternative: LRT (asymptotically equivalent) L. Grilli - Introduction to multilevel analysis - April 6 90 Dottorato XX ciclo febbraio 6 5

16 Hypotesis testing on the random parameters LRT on a variance component Hypotesis H : T= T vs. H : T= T 0 0 where T 0 is a restricted version of T (e.g. some elements are constrained to 0) Unless the number of clusters is huge, the Wald test should not be used since the sampling distributions of the estimators of the random parameters are highly asymmetric L. Grilli - Introduction to multilevel analysis - April 6 9 D = deviance of the unrestriced model D 0 = deviance of the model with qq =0 #(restrictions) = + #(corresponding covariances) approx 0 #( restrictions) D D χ 0 = χ #( restrictions) with prob. / with prob. / Practical rule: the p-value must be halved! Remark: with REML the LRT can be used only if the fixed part of the model is unchanged! L. Grilli - Introduction to multilevel analysis - April 6 9 Diagnostics based on the residuals Diagnostics based on the residuals Many residuals Level ˆi ε Level (Empirical Bayes) uˆ, u ˆ, 0 Extension of the techniques used in standard regression Purposes: Check the functional form and the distributional assumptions (normality, homoschedasticity, ) Look for influential units To check the normality assumption use an histogram or a Q-Q plot L. Grilli - Introduction to multilevel analysis - April 6 93 L. Grilli - Introduction to multilevel analysis - April 6 94 Diagnostics based on the residuals Diagnostics based on the residuals To locate anomalous units look at the standardized residuals To look for misspecifications in the fixed part and/or heteroschedasticity, plot the residuals one by one against the predicted values of the response standardized u ˆ standardized u ˆ0 L. Grilli - Introduction to multilevel analysis - April 6 95 L. Grilli - Introduction to multilevel analysis - April 6 96 Dottorato XX ciclo febbraio 6 6

17 Comparing the residuals Comparing the residuals The cluster (level ) residuals are predictions of the corresponding random effects u Often the value of u for a single cluster is not of primary interest; rather, it more interesting to compare the value of u for two clusters (e.g. to see if institution A is more effective than institution B) A statistical problem: the random effects of two given clusters are different? L. Grilli - Introduction to multilevel analysis - April 6 97 A common misconception: thinking that two quantities whose 95% intervals are disoint are significantly different at 5% X N( µ X, σ ) X ±.96 σ N( µ, σ ) ±.96 σ Assuming for simplicity that X and are independent X N ( µ µ, σ ) ( X ) ±.96 σ X µ X is significantly different from µ at level 95% if and only if: the distance (in σ units) between X and exceeds.96 =.77 or the univariate intervals of length.77/ =.39 are disoint L. Grilli - Introduction to multilevel analysis - April 6 98 Comparing the residuals The value.39 stems from assumptions (Normal distribution, independence, same variance) which usually are not satisfied -> the value.39 is an approximation and the significance level 5% is an average level Software & Books Graph for pairwise comparisons More details in Goldstein & Healy (995) RSS A Half-interval=.39* s.e. of the residual L. Grilli - Introduction to multilevel analysis - April 6 99 Software for multilevel modelling Specialized software Specialized software Procedures in general purpose software Review: Data and exercises: Multilevel Modeling Resources at UCLA aml (Panis) HLM (Raudenbush) the MIXOR suite (Hedeker) MLwiN (Goldstein) L. Grilli - Introduction to multilevel analysis - April 6 0 L. Grilli - Introduction to multilevel analysis - April 6 0 Dottorato XX ciclo febbraio 6 7

18 Procedures in general purpose software Books on multilevel modelling BMDP EGRET GENSTAT SPSS SAS PROC MIXED and NLMIXED R and S-PLUS STATA xt suite STATA gllamm (Rabe-Hesketh & Skrondal) LISREL (oreskog) M-plus (Muthén) WINBUGS (for Bayesian analysis) Also structural equations L. Grilli - Introduction to multilevel analysis - April 6 03 Sniders & Bosker Raudenbush & Bryk Hox L. Grilli - Introduction to multilevel analysis - April 6 04 Books on multilevel modelling XT commands in Stata Goldstein Skrondal & Rabe-Hesketh L. Grilli - Introduction to multilevel analysis - April 6 05 For fitting random intercept models = β + β x + u + ε i 0 i 0 i Conceived for panel data, where =individual, i=time point (wave) xtreg (Continuous response) be between-effects estimator fe fixed-effects (conditional) estimator From version 9 xtmixed re GLS random-effects estimator pa GEE population-averaged estimator mle maximum-likelihood random-effects estimator xtlogit, xtprobit (Binary response) xtpois (Counts) L. Grilli - Introduction to multilevel analysis - April 6 06 GLLAMM Generalised Linear Latent And Mixed Models GLLAMM: pros and cons Wide class of latent variable models for a (multivariate mixed) response (continuous, binary, ordinal, polytomous, counts, durations). The ingredients of GLLAMM are: The conditional mean of the response given the covariates and the latent variables E( X,u); The conditional distribution of the response given the covariates and the latent variables f( X,u) The structural equations (if any) for the latent variables The distribution of the latent variables Rabe-Hesketh, Skrondal e Pickles developed the corresponding STATA command gllamm See L. Grilli - Introduction to multilevel analysis - April 6 07 Pros Quite flexible (the GLLAMM class is very wide) Good approximations, accurate estimates Cons Computationally intensive because of Gaussian quadrature. Computational time is proportional on Number of observations Square of the number of parameters Product of the number of quadrature points for all the latent variables For models with continuous response is inefficient L. Grilli - Introduction to multilevel analysis - April 6 08 Dottorato XX ciclo febbraio 6 8

19 To know more about multilevel modelling in Stata For xt commands: Stata manuals For gllamm: manual: web site: For both: the recent book by Rabe-Hesketh & Skrondal L. Grilli - Introduction to multilevel analysis - April 6 09 Dottorato XX ciclo febbraio 6 9

multilevel modeling: concepts, applications and interpretations

multilevel modeling: concepts, applications and interpretations multilevel modeling: concepts, applications and interpretations lynne c. messer 27 october 2010 warning social and reproductive / perinatal epidemiologist concepts why context matters multilevel models

More information

Erasmus Teaching staff mobility INTRODUCTION TO MULTILEVEL MODELLING

Erasmus Teaching staff mobility INTRODUCTION TO MULTILEVEL MODELLING Erasmus Teaching staff mobility Konstanz, June 2017 INTRODUCTION TO MULTILEVEL MODELLING Leonardo Grilli Dipartimento di Statistica, Informatica, Applicazioni G. Parenti Email grilli@disia.unifi.it, WEB

More information

Estimation and Centering

Estimation and Centering Estimation and Centering PSYED 3486 Feifei Ye University of Pittsburgh Main Topics Estimating the level-1 coefficients for a particular unit Reading: R&B, Chapter 3 (p85-94) Centering-Location of X Reading

More information

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 217, Chicago, Illinois Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control

More information

A Re-Introduction to General Linear Models (GLM)

A Re-Introduction to General Linear Models (GLM) A Re-Introduction to General Linear Models (GLM) Today s Class: You do know the GLM Estimation (where the numbers in the output come from): From least squares to restricted maximum likelihood (REML) Reviewing

More information

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Multilevel Statistical Models: 3 rd edition, 2003 Contents Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction

More information

Perugia, 13 maggio 2011

Perugia, 13 maggio 2011 Perugia, 13 maggio 2011 DIFFERENTIAL VARIABILITY OF TEST SCORES AMONG SCHOOLS: A MULTILEVEL ANALYSIS OF THE 5 TH GRADE INVALSI TEST USING HETEROSCHEDASTIC RANDOM EFFECTS Leonardo Grilli Claudia Sani Outline

More information

Introduction to. Multilevel Analysis

Introduction to. Multilevel Analysis Introduction to Multilevel Analysis Tom Snijders University of Oxford University of Groningen December 2009 Tom AB Snijders Introduction to Multilevel Analysis 1 Multilevel Analysis based on the Hierarchical

More information

Introduction to Within-Person Analysis and RM ANOVA

Introduction to Within-Person Analysis and RM ANOVA Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides

More information

Exploiting TIMSS and PIRLS combined data: multivariate multilevel modelling of student achievement

Exploiting TIMSS and PIRLS combined data: multivariate multilevel modelling of student achievement Exploiting TIMSS and PIRLS combined data: multivariate multilevel modelling of student achievement Second meeting of the FIRB 2012 project Mixture and latent variable models for causal-inference and analysis

More information

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science. Texts in Statistical Science Generalized Linear Mixed Models Modern Concepts, Methods and Applications Walter W. Stroup CRC Press Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint

More information

Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 2017, Boston, Massachusetts

Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 2017, Boston, Massachusetts Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 217, Boston, Massachusetts Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control

More information

Review of CLDP 944: Multilevel Models for Longitudinal Data

Review of CLDP 944: Multilevel Models for Longitudinal Data Review of CLDP 944: Multilevel Models for Longitudinal Data Topics: Review of general MLM concepts and terminology Model comparisons and significance testing Fixed and random effects of time Significance

More information

Multi-level Models: Idea

Multi-level Models: Idea Review of 140.656 Review Introduction to multi-level models The two-stage normal-normal model Two-stage linear models with random effects Three-stage linear models Two-stage logistic regression with random

More information

A multilevel multinomial logit model for the analysis of graduates skills

A multilevel multinomial logit model for the analysis of graduates skills Stat. Meth. & Appl. (2007) 16:381 393 DOI 10.1007/s10260-006-0039-z ORIGINAL ARTICLE A multilevel multinomial logit model for the analysis of graduates skills Leonardo Grilli Carla Rampichini Accepted:

More information

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Tihomir Asparouhov 1, Bengt Muthen 2 Muthen & Muthen 1 UCLA 2 Abstract Multilevel analysis often leads to modeling

More information

WU Weiterbildung. Linear Mixed Models

WU Weiterbildung. Linear Mixed Models Linear Mixed Effects Models WU Weiterbildung SLIDE 1 Outline 1 Estimation: ML vs. REML 2 Special Models On Two Levels Mixed ANOVA Or Random ANOVA Random Intercept Model Random Coefficients Model Intercept-and-Slopes-as-Outcomes

More information

A multivariate multilevel model for the analysis of TIMMS & PIRLS data

A multivariate multilevel model for the analysis of TIMMS & PIRLS data A multivariate multilevel model for the analysis of TIMMS & PIRLS data European Congress of Methodology July 23-25, 2014 - Utrecht Leonardo Grilli 1, Fulvia Pennoni 2, Carla Rampichini 1, Isabella Romeo

More information

Plausible Values for Latent Variables Using Mplus

Plausible Values for Latent Variables Using Mplus Plausible Values for Latent Variables Using Mplus Tihomir Asparouhov and Bengt Muthén August 21, 2010 1 1 Introduction Plausible values are imputed values for latent variables. All latent variables can

More information

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011

Lecture 2: Linear Models. Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 Lecture 2: Linear Models Bruce Walsh lecture notes Seattle SISG -Mixed Model Course version 23 June 2011 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector

More information

Generalized Linear Models for Non-Normal Data

Generalized Linear Models for Non-Normal Data Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture

More information

Outline. Selection Bias in Multilevel Models. The selection problem. The selection problem. Scope of analysis. The selection problem

Outline. Selection Bias in Multilevel Models. The selection problem. The selection problem. Scope of analysis. The selection problem International Workshop on tatistical Latent Variable Models in Health ciences erugia, 6-8 eptember 6 Outline election Bias in Multilevel Models Leonardo Grilli Carla Rampichini grilli@ds.unifi.it carla@ds.unifi.it

More information

LINEAR MULTILEVEL MODELS. Data are often hierarchical. By this we mean that data contain information

LINEAR MULTILEVEL MODELS. Data are often hierarchical. By this we mean that data contain information LINEAR MULTILEVEL MODELS JAN DE LEEUW ABSTRACT. This is an entry for The Encyclopedia of Statistics in Behavioral Science, to be published by Wiley in 2005. 1. HIERARCHICAL DATA Data are often hierarchical.

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models Introduction to generalized models Models for binary outcomes Interpreting parameter

More information

Confidence intervals for the variance component of random-effects linear models

Confidence intervals for the variance component of random-effects linear models The Stata Journal (2004) 4, Number 4, pp. 429 435 Confidence intervals for the variance component of random-effects linear models Matteo Bottai Arnold School of Public Health University of South Carolina

More information

PS 271B: Quantitative Methods II Lecture Notes

PS 271B: Quantitative Methods II Lecture Notes PS 271B: Quantitative Methods II Lecture Notes (Part 6: Panel/Longitudinal Data; Multilevel/Mixed Effects models) Langche Zeng zeng@ucsd.edu Panel/Longitudinal Data; Multilevel Modeling; Mixed effects

More information

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models EPSY 905: Multivariate Analysis Spring 2016 Lecture #12 April 20, 2016 EPSY 905: RM ANOVA, MANOVA, and Mixed Models

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs) 36-309/749 Experimental Design for Behavioral and Social Sciences Dec 1, 2015 Lecture 11: Mixed Models (HLMs) Independent Errors Assumption An error is the deviation of an individual observed outcome (DV)

More information

Introduction to Random Effects of Time and Model Estimation

Introduction to Random Effects of Time and Model Estimation Introduction to Random Effects of Time and Model Estimation Today s Class: The Big Picture Multilevel model notation Fixed vs. random effects of time Random intercept vs. random slope models How MLM =

More information

Hypothesis Testing for Var-Cov Components

Hypothesis Testing for Var-Cov Components Hypothesis Testing for Var-Cov Components When the specification of coefficients as fixed, random or non-randomly varying is considered, a null hypothesis of the form is considered, where Additional output

More information

Citation for published version (APA): Jak, S. (2013). Cluster bias: Testing measurement invariance in multilevel data

Citation for published version (APA): Jak, S. (2013). Cluster bias: Testing measurement invariance in multilevel data UvA-DARE (Digital Academic Repository) Cluster bias: Testing measurement invariance in multilevel data Jak, S. Link to publication Citation for published version (APA): Jak, S. (2013). Cluster bias: Testing

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Multilevel Modeling: When and Why 1. 1 Why multilevel data need multilevel models

Multilevel Modeling: When and Why 1. 1 Why multilevel data need multilevel models Multilevel Modeling: When and Why 1 J. Hox University of Amsterdam & Utrecht University Amsterdam/Utrecht, the Netherlands Abstract: Multilevel models have become popular for the analysis of a variety

More information

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person

More information

The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs

The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs Chad S. Briggs, Kathie Lorentz & Eric Davis Education & Outreach University Housing Southern Illinois

More information

Regression Models - Introduction

Regression Models - Introduction Regression Models - Introduction In regression models there are two types of variables that are studied: A dependent variable, Y, also called response variable. It is modeled as random. An independent

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors

More information

Prediction in Multilevel Models

Prediction in Multilevel Models Journal of Educational and Behavioral Statistics Spring 005, Vol. 30, No. 1, pp. 00 00 Prediction in Multilevel Models David Afshartous University of Miami Jan de Leeuw University of California, Los Angeles

More information

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL INTRODUCTION TO BASIC LINEAR REGRESSION MODEL 13 September 2011 Yogyakarta, Indonesia Cosimo Beverelli (World Trade Organization) 1 LINEAR REGRESSION MODEL In general, regression models estimate the effect

More information

General structural model Part 2: Categorical variables and beyond. Psychology 588: Covariance structure and factor models

General structural model Part 2: Categorical variables and beyond. Psychology 588: Covariance structure and factor models General structural model Part 2: Categorical variables and beyond Psychology 588: Covariance structure and factor models Categorical variables 2 Conventional (linear) SEM assumes continuous observed variables

More information

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data Today s Class: Review of concepts in multivariate data Introduction to random intercepts Crossed random effects models

More information

INTRODUCTION TO MULTILEVEL MODELLING FOR REPEATED MEASURES DATA. Belfast 9 th June to 10 th June, 2011

INTRODUCTION TO MULTILEVEL MODELLING FOR REPEATED MEASURES DATA. Belfast 9 th June to 10 th June, 2011 INTRODUCTION TO MULTILEVEL MODELLING FOR REPEATED MEASURES DATA Belfast 9 th June to 10 th June, 2011 Dr James J Brown Southampton Statistical Sciences Research Institute (UoS) ADMIN Research Centre (IoE

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Multiple Linear Regression CIVL 7012/8012

Multiple Linear Regression CIVL 7012/8012 Multiple Linear Regression CIVL 7012/8012 2 Multiple Regression Analysis (MLR) Allows us to explicitly control for many factors those simultaneously affect the dependent variable This is important for

More information

Multilevel Analysis of Grouped and Longitudinal Data

Multilevel Analysis of Grouped and Longitudinal Data Multilevel Analysis of Grouped and Longitudinal Data Joop J. Hox Utrecht University Second draft, to appear in: T.D. Little, K.U. Schnabel, & J. Baumert (Eds.). Modeling longitudinal and multiple-group

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

Longitudinal Modeling with Logistic Regression

Longitudinal Modeling with Logistic Regression Newsom 1 Longitudinal Modeling with Logistic Regression Longitudinal designs involve repeated measurements of the same individuals over time There are two general classes of analyses that correspond to

More information

The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen

The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen January 23-24, 2012 Page 1 Part I The Single Level Logit Model: A Review Motivating Example Imagine we are interested in voting

More information

Modelling heterogeneous variance-covariance components in two-level multilevel models with application to school effects educational research

Modelling heterogeneous variance-covariance components in two-level multilevel models with application to school effects educational research Modelling heterogeneous variance-covariance components in two-level multilevel models with application to school effects educational research Research Methods Festival Oxford 9 th July 014 George Leckie

More information

Chapter 6: Linear Regression With Multiple Regressors

Chapter 6: Linear Regression With Multiple Regressors Chapter 6: Linear Regression With Multiple Regressors 1-1 Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS 4. Measures of fit 5. Sampling distribution

More information

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University. Panel GLMs Department of Political Science and Government Aarhus University May 12, 2015 1 Review of Panel Data 2 Model Types 3 Review and Looking Forward 1 Review of Panel Data 2 Model Types 3 Review

More information

2 Bayesian Hierarchical Response Modeling

2 Bayesian Hierarchical Response Modeling 2 Bayesian Hierarchical Response Modeling In the first chapter, an introduction to Bayesian item response modeling was given. The Bayesian methodology requires careful specification of priors since item

More information

Multilevel Models in Matrix Form. Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2

Multilevel Models in Matrix Form. Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Multilevel Models in Matrix Form Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Today s Lecture Linear models from a matrix perspective An example of how to do

More information

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1 Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables /4/04 Structural Equation Modeling and Confirmatory Factor Analysis Advanced Statistics for Researchers Session 3 Dr. Chris Rakes Website: http://csrakes.yolasite.com Email: Rakes@umbc.edu Twitter: @RakesChris

More information

Latent Variable Centering of Predictors and Mediators in Multilevel and Time-Series Models

Latent Variable Centering of Predictors and Mediators in Multilevel and Time-Series Models Latent Variable Centering of Predictors and Mediators in Multilevel and Time-Series Models Tihomir Asparouhov and Bengt Muthén August 5, 2018 Abstract We discuss different methods for centering a predictor

More information

MODEL IMPLIED INSTRUMENTAL VARIABLE ESTIMATION FOR MULTILEVEL CONFIRMATORY FACTOR ANALYSIS. Michael L. Giordano

MODEL IMPLIED INSTRUMENTAL VARIABLE ESTIMATION FOR MULTILEVEL CONFIRMATORY FACTOR ANALYSIS. Michael L. Giordano MODEL IMPLIED INSTRUMENTAL VARIABLE ESTIMATION FOR MULTILEVEL CONFIRMATORY FACTOR ANALYSIS Michael L. Giordano A thesis submitted to the faculty at the University of North Carolina at Chapel Hill in partial

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

Contents. Acknowledgments. xix

Contents. Acknowledgments. xix Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables

More information

Jeffrey M. Wooldridge Michigan State University

Jeffrey M. Wooldridge Michigan State University Fractional Response Models with Endogenous Explanatory Variables and Heterogeneity Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Fractional Probit with Heteroskedasticity 3. Fractional

More information

The Basic Two-Level Regression Model

The Basic Two-Level Regression Model 7 Manuscript version, chapter in J.J. Hox, M. Moerbeek & R. van de Schoot (018). Multilevel Analysis. Techniques and Applications. New York, NY: Routledge. The Basic Two-Level Regression Model Summary.

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Describing Within-Person Change over Time

Describing Within-Person Change over Time Describing Within-Person Change over Time Topics: Multilevel modeling notation and terminology Fixed and random effects of linear time Predicted variances and covariances from random slopes Dependency

More information

Lecture 10 Multiple Linear Regression

Lecture 10 Multiple Linear Regression Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable

More information

Multilevel Modeling: A Second Course

Multilevel Modeling: A Second Course Multilevel Modeling: A Second Course Kristopher Preacher, Ph.D. Upcoming Seminar: February 2-3, 2017, Ft. Myers, Florida What this workshop will accomplish I will review the basics of multilevel modeling

More information

Bayesian Analysis of Latent Variable Models using Mplus

Bayesian Analysis of Latent Variable Models using Mplus Bayesian Analysis of Latent Variable Models using Mplus Tihomir Asparouhov and Bengt Muthén Version 2 June 29, 2010 1 1 Introduction In this paper we describe some of the modeling possibilities that are

More information

INTRODUCTION TO STRUCTURAL EQUATION MODELS

INTRODUCTION TO STRUCTURAL EQUATION MODELS I. Description of the course. INTRODUCTION TO STRUCTURAL EQUATION MODELS A. Objectives and scope of the course. B. Logistics of enrollment, auditing, requirements, distribution of notes, access to programs.

More information

A Practitioner s Guide to Cluster-Robust Inference

A Practitioner s Guide to Cluster-Robust Inference A Practitioner s Guide to Cluster-Robust Inference A. C. Cameron and D. L. Miller presented by Federico Curci March 4, 2015 Cameron Miller Cluster Clinic II March 4, 2015 1 / 20 In the previous episode

More information

Experimental Design and Data Analysis for Biologists

Experimental Design and Data Analysis for Biologists Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1

More information

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model Xiuming Zhang zhangxiuming@u.nus.edu A*STAR-NUS Clinical Imaging Research Center October, 015 Summary This report derives

More information

ECON3150/4150 Spring 2015

ECON3150/4150 Spring 2015 ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2

More information

Lecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012

Lecture 3: Linear Models. Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012 Lecture 3: Linear Models Bruce Walsh lecture notes Uppsala EQG course version 28 Jan 2012 1 Quick Review of the Major Points The general linear model can be written as y = X! + e y = vector of observed

More information

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Examples: Multilevel Modeling With Complex Survey Data CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Complex survey data refers to data obtained by stratification, cluster sampling and/or

More information

MLA Software for MultiLevel Analysis of Data with Two Levels

MLA Software for MultiLevel Analysis of Data with Two Levels MLA Software for MultiLevel Analysis of Data with Two Levels User s Guide for Version 4.1 Frank M. T. A. Busing Leiden University, Department of Psychology, P.O. Box 9555, 2300 RB Leiden, The Netherlands.

More information

Recent Developments in Multilevel Modeling

Recent Developments in Multilevel Modeling Recent Developments in Multilevel Modeling Roberto G. Gutierrez Director of Statistics StataCorp LP 2007 North American Stata Users Group Meeting, Boston R. Gutierrez (StataCorp) Multilevel Modeling August

More information

Model Estimation Example

Model Estimation Example Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions

More information

Strati cation in Multivariate Modeling

Strati cation in Multivariate Modeling Strati cation in Multivariate Modeling Tihomir Asparouhov Muthen & Muthen Mplus Web Notes: No. 9 Version 2, December 16, 2004 1 The author is thankful to Bengt Muthen for his guidance, to Linda Muthen

More information

Bayesian Methods for Machine Learning

Bayesian Methods for Machine Learning Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),

More information

Designing Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D.

Designing Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D. Designing Multilevel Models Using SPSS 11.5 Mixed Model John Painter, Ph.D. Jordan Institute for Families School of Social Work University of North Carolina at Chapel Hill 1 Creating Multilevel Models

More information

An overview of applied econometrics

An overview of applied econometrics An overview of applied econometrics Jo Thori Lind September 4, 2011 1 Introduction This note is intended as a brief overview of what is necessary to read and understand journal articles with empirical

More information

Hierarchical Linear Models. Jeff Gill. University of Florida

Hierarchical Linear Models. Jeff Gill. University of Florida Hierarchical Linear Models Jeff Gill University of Florida I. ESSENTIAL DESCRIPTION OF HIERARCHICAL LINEAR MODELS II. SPECIAL CASES OF THE HLM III. THE GENERAL STRUCTURE OF THE HLM IV. ESTIMATION OF THE

More information

Introducing Generalized Linear Models: Logistic Regression

Introducing Generalized Linear Models: Logistic Regression Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and

More information

Bayesian Methods in Multilevel Regression

Bayesian Methods in Multilevel Regression Bayesian Methods in Multilevel Regression Joop Hox MuLOG, 15 september 2000 mcmc What is Statistics?! Statistics is about uncertainty To err is human, to forgive divine, but to include errors in your design

More information

Measuring relationships among multiple responses

Measuring relationships among multiple responses Measuring relationships among multiple responses Linear association (correlation, relatedness, shared information) between pair-wise responses is an important property used in almost all multivariate analyses.

More information

Tom A.B. Snijders, ICS, University of Groningen Johannes Berkhof, Free University, Amsterdam

Tom A.B. Snijders, ICS, University of Groningen Johannes Berkhof, Free University, Amsterdam Chapter 4 DIAGNOSTIC CHECKS FOR MULTILEVEL MODELS Tom A.B. Sniders, ICS, University of Groningen Johannes Berkhof, Free University, Amsterdam To be published in Jan de Leeuw & Ita Kreft (eds., Handbook

More information

A First Look at Multilevel and Longitudinal Models York University Statistical Consulting Service (Revision 3, March 2, 2006) March 2006

A First Look at Multilevel and Longitudinal Models York University Statistical Consulting Service (Revision 3, March 2, 2006) March 2006 A First Look at Multilevel and Longitudinal Models York University Statistical Consulting Service (Revision 3, March, 006) March 006 Georges Monette with help from Ernest Kwan, Alina Rivilis, Qing Shao

More information

Estimation: Problems & Solutions

Estimation: Problems & Solutions Estimation: Problems & Solutions Edps/Psych/Stat 587 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Fall 2017 Outline 1. Introduction: Estimation of

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies

More information

Generalized Linear Models (GLZ)

Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) are an extension of the linear modeling process that allows models to be fit to data that follow probability distributions other than the

More information

Centering Predictor and Mediator Variables in Multilevel and Time-Series Models

Centering Predictor and Mediator Variables in Multilevel and Time-Series Models Centering Predictor and Mediator Variables in Multilevel and Time-Series Models Tihomir Asparouhov and Bengt Muthén Part 2 May 7, 2018 Tihomir Asparouhov and Bengt Muthén Part 2 Muthén & Muthén 1/ 42 Overview

More information

Economics 308: Econometrics Professor Moody

Economics 308: Econometrics Professor Moody Economics 308: Econometrics Professor Moody References on reserve: Text Moody, Basic Econometrics with Stata (BES) Pindyck and Rubinfeld, Econometric Models and Economic Forecasts (PR) Wooldridge, Jeffrey

More information

Longitudinal Data Analysis

Longitudinal Data Analysis Longitudinal Data Analysis Mike Allerhand This document has been produced for the CCACE short course: Longitudinal Data Analysis. No part of this document may be reproduced, in any form or by any means,

More information

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011) Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October

More information

Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated A Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

More information

ROBUSTNESS OF MULTILEVEL PARAMETER ESTIMATES AGAINST SMALL SAMPLE SIZES

ROBUSTNESS OF MULTILEVEL PARAMETER ESTIMATES AGAINST SMALL SAMPLE SIZES ROBUSTNESS OF MULTILEVEL PARAMETER ESTIMATES AGAINST SMALL SAMPLE SIZES Cora J.M. Maas 1 Utrecht University, The Netherlands Joop J. Hox Utrecht University, The Netherlands In social sciences, research

More information

Introduction to the Analysis of Hierarchical and Longitudinal Data

Introduction to the Analysis of Hierarchical and Longitudinal Data Introduction to the Analysis of Hierarchical and Longitudinal Data Georges Monette, York University with Ye Sun SPIDA June 7, 2004 1 Graphical overview of selected concepts Nature of hierarchical models

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information