An R # Statistic for Fixed Effects in the Linear Mixed Model and Extension to the GLMM
|
|
- Martha Cameron
- 5 years ago
- Views:
Transcription
1 An R Statistic for Fixed Effects in the Linear Mixed Model and Extension to the GLMM Lloyd J. Edwards, Ph.D. UNC-CH Department of Biostatistics Lloyd_Edwards@unc.edu Presented to the Department of Biostatistics, The University of Alabama at Birmingham August 17,
2 1. Introduction 2. The Linear Mixed Model " 3. R : A n R Statistic for Fixed Effects in the Linear Mixed Model - Choice of Null Model - Model R " - Semi-partial R " 4. R : A n R " Statistic for Fixed Effects in the Generalized Linear Mixed Model 5. Software: SAS and R 6. Example Computations and Interpretations - Retrospective Longitudinal Study of Adults with Hypertension 7. Conclusions 2
3 1. Introduction ñ The linear mixed model (LMM) is the most widely used statistical tool for the analysis of longitudinal models with Gaussian errors. ñ The LMM explicitly specifies not only the mean structure, but also the covariance structure. ñ Hence three types of model comparisons can occur. 1. Compare mean models with the same covariance structure. Nested mean models are the most common. 2. Compare covariance models with the same mean structure. Two LMMs may be nested or nonnested in the covariance models. 3. Compare LMMs with different mean and different covariance structures. 3
4 1. Introduction ñ Here we describe an V only for model comparison 1, i.e., comparing nested mean models with the same covariance structure. ñ We use an approximate J statistic for a Wald test of fixed effects, the Kenward-Roger small sample J statistic, to define an V statistic for fixed effects in the linear mixed model. ñ The V statistic measures multivariate association between the repeated outcomes and the fixed effects in the linear mixed model and can also be interpreted as the proportion of variance explained by the fixed effects. ñ Defining V in terms of an J statistic for fixed effects allows computing it with results from fitting only a single model, i.e., there is no need to explicitly fit a null model. ñ The approach necessarily assumes the covariance structure holds for both the model of interest and the implied null model. 4
5 2. The Linear Mixed Model for Longitudinal Data ñ With R independ ent sampling units (often persons in practice), the linear mixed model for person 3 may be written ] œ \ " ^, /. (1) Here, ] 3 is a : " 3 vector of observations on person 3; \ 3 is a : ; 3 known, constant design matrix for person 3, with full column rank ; " is a ; " vector of unknown, constant, population parameters. ^3 is a : 7 3 known, constant design matrix with rank 7for person 3, 3 7 " vector of unknown random effects is a : " vector of unknown random errors. / 3 3 Gaussian, 3 and / 3 are independent with mean! and, 3,3,! i œ / D 7! D 7. (2) 3 /3 / 5
6 2. The Linear Mixed Model for Longitudinal Data ñ Here ið Ñ is the covariance operator, while both D,3 œ D,3 7, and D/3 œ D/3 7/ are positive-definite, symmetric covariance matrices. Therefore ið] Ñmay be written ið] Ñ œ D œ ^ D ^ D w,3 3 /3 We assume that D 3 can be characterized by a finite set of parameters represented by an < " vector 7 which consists of the unique parameters in 7. and 7/. R Throughout 8œ 8. 3œ" 3 ñ We will also need to refer to a stacked data formulation of model (0) given by ] œ \" ^, /, (3) w w w w w with ] œ ] " â] R, \ œ \ " â\ R, ^ œ diag ^" ßâß^R, w w w w w w,œ, " â, R, and /œ / " â/ R. Here, µ ar;! ßD,3 7, ŒMR and / µ a 8! ßD/ for D/ œ diag D/" 7/ ßâßD/R 7/. As a result we have ] µ \ ", D with D œ ið] Ñ œ diag D ßâßD. a 8 " R w 6
7 " 3. R : A n R Model R " Statistic for Fixed Effects in the Linear Mixed Model ñ We consider a LMM that includes an intercept in \ 3, the fixed effects design, and may or may not include an intercept in (the random effects design). ñ The model J ("D s, s) statistic in the LMM corresponds to a comparison between two nested fixed effects models: the model of interest (full or maximum model) and a null model with no covariates except the intercept. ^3 ñ With \ 3 œ B âb and " œ " " â", 1 3 "ß3 ; "ß3! " ; " w 1. Model of Interest: ] œ \ 3 " ^, / Null Model: ] œ 1 " ^, /. 3 3! ñ The model J ("D s, s) statistic uses a single model fit to compare two models differing by the presence or absence of all fixed effect predictors except the intercept. 7
8 " 3. R : A n R Statistic for Fixed Effects in the Linear Mixed Model ñ The model J statistic corresponds to a test of the null hypothesis L À G " œ! for G œ! M! ; " " ; " or equivalently L! À "" œ " œ â œ "; " œ! where ; " is the numerator degrees of freedom (ndf) for full rank G. ñ The model J statistic is given by w " " 1 " s wd s w G G \ \ G G" s J( "D s, s) œ. (6) rank G ñ Approximations for denominator degrees of freedom (ddf) include Kenward- Roger, Satterthwaithe, Containment method, Residual method. 8
9 " 3. R : A n R Statistic for Fixed Effects in the Linear Mixed Model ñ From univariate linear model theory, we have J ( s, VÎ; " s ) " "D œ " V Î/, (7) where V" denotes our proposed V statistic and / is the (approximated) ddf. ñ Solving for V " yields " ; " J s s V" " / ("D, ) œ " ; " / " J( "D s, s). (8) ñ As stated previously, different choices exist for the approximate denominator degrees of freedom (d.f.), /, in the J ("s sd ) statistic for fixed effects. 9
10 " 3. R : A n R Statistic for Fixed Effects in the Linear Mixed Model ñ The natural extension of the univariate linear model uses the residual degrees of freedom, with / œ8 rank \, where 8is the total number of observations. ñ Under REML estimation, the Kenward-Roger J provides the most accurate inference in small samples, while the Satterthwaite method also does well. ñ The statistic reflects our belief that the covariance model for ] 3 should remain the same when the comparison centers on nested fixed effects. V " ñ The fact that J ( s, s "D)! insures!ÿv" ", with V" œ! indicating no multivariate association between ] and \. ñ On the other hand, as V nears 1, then the multivariate association between and \ becomes perfect. " ] 10
11 3. R : " An R Statistic for Fixed Effects in the Linear Mixed Model Semi-partial R " ñ R" is the only R statistic in its class that leads to defining a semi-partial V statistic. ñ Just as for overall regression, the partial J in the LMM defines a partial V " by way of equation, where the ndf, ; 1, is set equal to 1 (7). ñ Testing L! À " 4 œ! for 4 "ßáß; ", i.e., testing a particular fixed effect regression coefficient, gives a partial J statisticþ ñ It measures the marginal contribution of when all the other predictors have \ 5 already been included in the model. 11
12 4. R" : An R Statistic for Fixed Effects in the Generalized Model (GLMM) Linear Mixed ñ The generalized linear mixed model (GLMM) is one of the most commonly used random effects model for discrete outcomes and continuous outcomes with non-normal distributions. ñ We summarize the basic features of the GLMM here with a focus on longitudinal data. ñ With R independ ent sampling units (often persons in practice) and w c onditionally on the random effects, 3 œ, 3" á, 3; (; "), assume that the responses ] 34 of ] 3 ( 73 "), 4œ"ßáß73, 3œ"ßáßRß are independent with density function that is a member of the exponential family. 12
13 4. R" : An R Statistic for Fixed Effects in the Generalized Model (GLMM) ñ The conditional mean satisfies a linear regression model I] ( 34l, 3) œ. 34 œ2( ( 34), w w ( œ1i] [ ( l, )] œb " D, ( l, ) ) Linear Mixed (9) where " (: ") is a vector of unknown fixed effect parameters, B34 (: ") and D34 (; ") are vectors of fixed and random effect explanatory variables (the first element of B 34 is a 1), + 34 is a known constant, and 9 is a dispersion parameter, 1œ2 " is referred to as the link function. 13
14 4. R" : An R Statistic for Fixed Effects in the Generalized Model (GLMM) Linear Mixed ñ The random effects, 3 are assumed to be sampled from a (multivariate) normal distribution with mean 0 (; ") and covariance matrix D, œ D,( ) (; ;) that depends on a vector ( < " ) of unknown variance components. ñ When estimating the parameters in the GLMM, the exact likelihood function involves an intractable high-dimensional integration that is difficult to compute. There are several approximations to the likelihood function and approximate maximum likelihood estimators (MLE) have been proposed in the literature. ñ Among the available estimation techniques, the penalized quasi-likelihood (PQL) by Breslow and Clayton (1993) is the most popular for the GLMM. ñ PQL approximates the high-dimensional integration using the well-known Laplace approximation and the approximated likelihood function has that of a Gaussian distribution. 14
15 4. R" : An R Statistic for Fixed Effects in the Generalized Model (GLMM) Linear Mixed ñ PQL estimation creates a transformed outcome for which the normal theory LMM with REML estimation applies. ñ PQL is attractive to practitioners due to its general availability, ease of use, and computational efficiency. ñ PQL is perhaps the most used estimation technique for the GLMM in commercial software. 15
16 4. R" : An R Statistic for Fixed Effects in the Generalized Model (GLMM) Linear Mixed ñ PQL Summary: Breslow and Clayton (1993) assumes that I( ] 3l, 3) œ. 3 can be derived from a first-order approximation to the hierarchical model that is valid in the limit as the components of dispersion approach zero. If we write the model as ] 3 œ. 3 % 3 3Ñ œ + 3@ (. 3) and, µ a!,, a first-order Taylor series of 2 about "s and s, yields 3 ; D, 3 w w 2Ð( Ñ 2Ð( s Ñ 2 Ð( s Ñ\ (" " s) 2 Ð( s Ñ^ (, s, ) %, (10) w where 2 Ðs Ñ œ `2Ðs w ( 3 ( 3ÑÎ`( 3 œ "Î1 Ð( s3ñ is a diagonal matrix of derivatives of the conditional mean evaluated at "s and ș 3. 16
17 4. R" : An R Statistic for Fixed Effects in the Generalized Model (GLMM) Linear Mixed Define ] 3 as the working vector of pseudo-responses 3 3 w ] œ1ð] Ñœ1Ð. s ÑÐ]. s ( s, (11) where. s œ2ð( s Ñ ( s w w and œ\ "s ^ s, When re-ordering terms in (10) we get ) 3 ] œ \ " ^, 3 / 3, (12) where the working vector of pseudo-responses ] 3 can be specified as an approximate LMM with multivariate normal, 3 and / 3, independent with mean! 3, œ / D -1! [
18 4. R" : An R Statistic for Fixed Effects in the Generalized Model (GLMM) Linear Mixed w -1 The variance/covariance of ] 3 may be written D3 œ ^3 D, ^ 3 [ 3 where w œ diag s 1 s (. )[ (. )]. [ Model fitting is accomplished by iterating between updating the pseudoresponses ] 3 and fitting a LMM until convergence. ñ Recall from Section 3 that the choice of the null model plays a central role in defining a model V statistic. 18
19 4. R" : An R Statistic for Fixed Effects in the Generalized Model (GLMM) Linear Mixed ñ To define the model V statistic for fixed effects in the GLMM, the following two models we compare 1. Model of interest: ( œ \ " ^,, (13) 2. Null Model: or in the transformed LMM via PQL estimation 1. Model of Interest: ( œ " 1 ^, 3! ! ] œ \ 3 " ^, / 2. Null Model: ] œ 1 " ^, /. ñ We define the model R ", an V statistic for fixed effects in the GLMM, as V " for fixed effects in the LMM for the transformed outcome vector ] 3 at convergence of PQL. 19
20 4. R" : An R Statistic for Fixed Effects in the Generalized Model (GLMM) Linear Mixed ñ R " for the GLMM here is defined for a single model and incorporates the null model given by (13). ñ In addition, since V gives rise to a natural definition of a (semi) partial V for the LMM, we now can compute partial V 's for the GLMM using R ". " 20
21 5. Software - (1) SAS: %Glimmix_R2_V3.SAS This SAS macro computes model and semi-partial R statistics for each fixed effect in the model. Due to a lack of the non-central beta distribution in SAS, confidence limits are not currently supplied. The user may specify denominator degrees of freedom estimation. Kenward-Roger (DDF = KR) is recommended, and is fully compatible with the GLMM. (2) R: r2glmm Package This package computes model R and semi-partial R for the LMM and GLMM. Three methods of computation are provided: (1) The Kenward-Roger approach. (2) The method introduced by Nakagawa and Schielzeth (2013), which was later extended by Johnson (2014). (3) An approach using standardized generalized variance (SGV) that can be used for both mean model and covariance model selection. 21
22 Confidence limits for model R and semi-partial R are computed for each of the methods listed. 22
23 6. Example: Blood pressure in adults, continuous and binary outcomes ñ Retrospective longitudinal cohort study of adults with hypertension. ñ University of North Carolina Family Medicine Center ñ 459 hypertensive patients making at least 4 visits to the Center during a two year period, Black=205, White/Asian/Other=254; Female=283, Male=176. Average age 59 years old. ñ Mean number of visits and std dev: 10.4 (5.66). Range: ñ Outcomes: Longitudinal blood pressure level ( continuous SBP and DBP) and dichotomous blood pressure control (<140 systolic and < 90 systolic). Independent variables include age, race, gender, continuity of care, and linear time in years. ñ Both systolic and diastolic BP fell over the two years (systolic 2.2 mmhg/yr and diastolic 2.8 mmhg/yr). Lower systolic blood pressure was not associated with continuity of care or gender. Lower blood pressures were associated with Caucasian race (vs. African American race). 23
24 6. Example: Blood pressure in adults, continuous and binary outcomes ñ Fixed effects for the analysis include Age, Race, Gender, Continuity of Care (COC), and continuous, linear time in years. ñ Random effects included intercept and time with unstructured random effects covariance, D, (2 2). ñ Within-subject error variance D /3 œ 5 M : 3. 24
25 Outcome Effect Estimate (SE) NDF DDF F-value P-value Semi-Partial SBP Intercept (3.03) < Age 0.21 (0.04) < Race 3.97 (1.32) Gender 1.64 (1.35) COC (2.31) Time (0.60) R " < R " DBP Intercept (1.64) < Age (0.02) < Race 1.85 (0.72) Gender (0.73) COC (1.26) Time (0.30) < R " < BP Control Intercept (0.31) (LMM from PQL) Age 0.01 (0.004) Race 0.36 (0.14) Gender 0.11 (0.14) COC (0.24) Time (0.07) < R " < OR (95% CI) BP Control Intercept (0.30) (Final GLMM) Age 0.01 (0.004) (1.00, 1.02) Race 0.35 (0.13) (1.09, 1.84) Gender 0.12 (0.14) (0.86, 1.46) COC (0.23) (0.52, 1.29) Time (0.07) < (0.60, 0.79) 25
26 7. Conclusions ñ Due to the familiarity of the R statistic in the linear univariate model, there is naturally great interest in its extension to the linear mixed model. ñ Unfortunately, the development of R statistics for the linear mixed model has received comparatively little attention. ñ We propose an R statistic for assessing fixed effects in the linear mixed model. ñ We extended the proposed R statistic to the GLMM under PQL estimation for assessing fixed effects. ñ By way of a longitudinal data example, we demonstrated the utility of the proposed R statistic for both continuous and binary outcomes. 26
27 6. Conclusions ñ It should be noted, however, that is based on a marginal J statistic and cannot be used to determine subject-specific goodness-of-fit. R " ñ In addition, was developed for models with nested fixed effects and should R " not be used with non-nested fixed effects. 27
28 References 1. Edwards LJ, Muller KE, Wolfinger RD, Qaqish BF, Schabenberger O (2008). An R statistic for fixed effects in the linear mixed model. Statistics in Medicine, 27(29): Jaeger BC, Edwards LJ, Das K, Sen PK (2016). An R statistic for fixed effects in the generalized linear mixed model. In press, Journal of Applied Statistics. 3. Fisher M, Sloane P, Edwards L, Gamble G (2007). Continuity of care and hypertension control in a university-based practice. Ethnicity and Disease, 17(4): Ballarini N, Jaeger BC (2015). Software in SAS and R programming languages to calculate model R and semi-partial R for fixed effects in the linear and generalized linear mixed model. 28
29 5. B. Jaeger, r2glmm: Computes R squared for mixed (multilevel) models (LMMs and GLMMs), (2016). R package version Available at 6. Nakagawa S, Schielzeth H (2013). A general and simple method for obtaining R from generalized linear mixed-effects models. Methods Ecol. Evolut. 4: Johnson PC (2014). Extension of Nakagawa & Schielzeth s r2glmm to random slopes models. Methods Ecol. Evolut., 5:
Package r2glmm. August 5, 2017
Type Package Package r2glmm August 5, 2017 Title Computes R Squared for Mixed (Multilevel) Models Date 2017-08-04 Version 0.1.2 The model R squared and semi-partial R squared for the linear and generalized
More informationGeneralized Linear Models for Non-Normal Data
Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture
More informationPower and Sample Size for the Most Common Hypotheses in Mixed Models
Power and Sample Size for the Most Common Hypotheses in Mixed Models Anna E. Barón Sarah M. Kreidler Deborah H. Glueck, University of Colorado and Keith E. Muller, University of Florida UNIVERSITY OF COLORADO
More informationA Re-Introduction to General Linear Models (GLM)
A Re-Introduction to General Linear Models (GLM) Today s Class: You do know the GLM Estimation (where the numbers in the output come from): From least squares to restricted maximum likelihood (REML) Reviewing
More informationStatistical Distribution Assumptions of General Linear Models
Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Today s Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors
More informationPQL Estimation Biases in Generalized Linear Mixed Models
PQL Estimation Biases in Generalized Linear Mixed Models Woncheol Jang Johan Lim March 18, 2006 Abstract The penalized quasi-likelihood (PQL) approach is the most common estimation procedure for the generalized
More informationGeneralized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.
Texts in Statistical Science Generalized Linear Mixed Models Modern Concepts, Methods and Applications Walter W. Stroup CRC Press Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint
More informationNon-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models
Optimum Design for Mixed Effects Non-Linear and generalized Linear Models Cambridge, August 9-12, 2011 Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More informationPeng Li * and David T Redden
Li and Redden BMC Medical Research Methodology (2015) 15:38 DOI 10.1186/s12874-015-0026-x RESEARCH ARTICLE Open Access Comparing denominator degrees of freedom approximations for the generalized linear
More informationPackage HGLMMM for Hierarchical Generalized Linear Models
Package HGLMMM for Hierarchical Generalized Linear Models Marek Molas Emmanuel Lesaffre Erasmus MC Erasmus Universiteit - Rotterdam The Netherlands ERASMUSMC - Biostatistics 20-04-2010 1 / 52 Outline General
More informationWU Weiterbildung. Linear Mixed Models
Linear Mixed Effects Models WU Weiterbildung SLIDE 1 Outline 1 Estimation: ML vs. REML 2 Special Models On Two Levels Mixed ANOVA Or Random ANOVA Random Intercept Model Random Coefficients Model Intercept-and-Slopes-as-Outcomes
More informationAnalysis of 24-Hour Ambulatory Blood Pressure Monitoring Data using Orthonormal Polynomials in the Linear Mixed Model
Analysis of 24-Hour Ambulatory Blood Pressure Monitoring Data using Orthonormal Polynomials in the Linear Mixed Model "* # Lloyd J. Edwards and Sean L. Simpson " Department of Biostatistics, University
More informationGeneralized linear models
Generalized linear models Douglas Bates November 01, 2010 Contents 1 Definition 1 2 Links 2 3 Estimating parameters 5 4 Example 6 5 Model building 8 6 Conclusions 8 7 Summary 9 1 Generalized Linear Models
More informationReview of the General Linear Model
Review of the General Linear Model EPSY 905: Multivariate Analysis Online Lecture #2 Learning Objectives Types of distributions: Ø Conditional distributions The General Linear Model Ø Regression Ø Analysis
More informationEstimation in Generalized Linear Models with Heterogeneous Random Effects. Woncheol Jang Johan Lim. May 19, 2004
Estimation in Generalized Linear Models with Heterogeneous Random Effects Woncheol Jang Johan Lim May 19, 2004 Abstract The penalized quasi-likelihood (PQL) approach is the most common estimation procedure
More informationPractical Biostatistics
Practical Biostatistics Clinical Epidemiology, Biostatistics and Bioinformatics AMC Multivariable regression Day 5 Recap Describing association: Correlation Parametric technique: Pearson (PMCC) Non-parametric:
More informationCorrelation and Simple Linear Regression
Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline
More informationStatistics in medicine
Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu
More informationR Package glmm: Likelihood-Based Inference for Generalized Linear Mixed Models
R Package glmm: Likelihood-Based Inference for Generalized Linear Mixed Models Christina Knudson, Ph.D. University of St. Thomas user!2017 Reviewing the Linear Model The usual linear model assumptions:
More informationIntroduction to lnmle: An R Package for Marginally Specified Logistic-Normal Models for Longitudinal Binary Data
Introduction to lnmle: An R Package for Marginally Specified Logistic-Normal Models for Longitudinal Binary Data Bryan A. Comstock and Patrick J. Heagerty Department of Biostatistics University of Washington
More informationGeneralized, Linear, and Mixed Models
Generalized, Linear, and Mixed Models CHARLES E. McCULLOCH SHAYLER.SEARLE Departments of Statistical Science and Biometrics Cornell University A WILEY-INTERSCIENCE PUBLICATION JOHN WILEY & SONS, INC. New
More informationAdditional Notes: Investigating a Random Slope. When we have fixed level-1 predictors at level 2 we show them like this:
Ron Heck, Summer 01 Seminars 1 Multilevel Regression Models and Their Applications Seminar Additional Notes: Investigating a Random Slope We can begin with Model 3 and add a Random slope parameter. If
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Topics: Summary of building unconditional models for time Missing predictors in MLM Effects of time-invariant predictors Fixed, systematically varying,
More informationDesigning Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D.
Designing Multilevel Models Using SPSS 11.5 Mixed Model John Painter, Ph.D. Jordan Institute for Families School of Social Work University of North Carolina at Chapel Hill 1 Creating Multilevel Models
More informationTutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements:
Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported
More informationMixed models in R using the lme4 package Part 5: Generalized linear mixed models
Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates 2011-03-16 Contents 1 Generalized Linear Mixed Models Generalized Linear Mixed Models When using linear mixed
More informationSTAT 526 Advanced Statistical Methodology
STAT 526 Advanced Statistical Methodology Fall 2017 Lecture Note 10 Analyzing Clustered/Repeated Categorical Data 0-0 Outline Clustered/Repeated Categorical Data Generalized Linear Mixed Models Generalized
More informationMixed models in R using the lme4 package Part 5: Generalized linear mixed models
Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates Madison January 11, 2011 Contents 1 Definition 1 2 Links 2 3 Example 7 4 Model building 9 5 Conclusions 14
More informationReview of CLDP 944: Multilevel Models for Longitudinal Data
Review of CLDP 944: Multilevel Models for Longitudinal Data Topics: Review of general MLM concepts and terminology Model comparisons and significance testing Fixed and random effects of time Significance
More informationClass Notes: Week 8. Probit versus Logit Link Functions and Count Data
Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While
More informationThe performance of estimation methods for generalized linear mixed models
University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 2008 The performance of estimation methods for generalized linear
More informationLogistic Regression. Fitting the Logistic Regression Model BAL040-A.A.-10-MAJ
Logistic Regression The goal of a logistic regression analysis is to find the best fitting and most parsimonious, yet biologically reasonable, model to describe the relationship between an outcome (dependent
More informationLecture 18: Simple Linear Regression
Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength
More informationTutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances
Tutorial 4: Power and Sample Size for the Two-sample t-test with Unequal Variances Preface Power is the probability that a study will reject the null hypothesis. The estimated probability is a function
More informationEPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7
Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review
More informationCorrelation and simple linear regression S5
Basic medical statistics for clinical and eperimental research Correlation and simple linear regression S5 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/41 Introduction Eample: Brain size and
More informationMultilevel Models in Matrix Form. Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2
Multilevel Models in Matrix Form Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Today s Lecture Linear models from a matrix perspective An example of how to do
More informationA (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data
A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data Today s Class: Review of concepts in multivariate data Introduction to random intercepts Crossed random effects models
More informationHierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!
Hierarchical Generalized Linear Models ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models Introduction to generalized models Models for binary outcomes Interpreting parameter
More informationReview of Multiple Regression
Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate
More informationTime Invariant Predictors in Longitudinal Models
Time Invariant Predictors in Longitudinal Models Longitudinal Data Analysis Workshop Section 9 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section
More informationDiscrete Multivariate Statistics
Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are
More informationA Re-Introduction to General Linear Models
A Re-Introduction to General Linear Models Today s Class: Big picture overview Why we are using restricted maximum likelihood within MIXED instead of least squares within GLM Linear model interpretation
More informationGeneralized linear mixed models for biologists
Generalized linear mixed models for biologists McMaster University 7 May 2009 Outline 1 2 Outline 1 2 Coral protection by symbionts 10 Number of predation events Number of blocks 8 6 4 2 2 2 1 0 2 0 2
More informationBiostatistics 4: Trends and Differences
Biostatistics 4: Trends and Differences Dr. Jessica Ketchum, PhD. email: McKinneyJL@vcu.edu Objectives 1) Know how to see the strength, direction, and linearity of relationships in a scatter plot 2) Interpret
More information36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)
36-309/749 Experimental Design for Behavioral and Social Sciences Dec 1, 2015 Lecture 11: Mixed Models (HLMs) Independent Errors Assumption An error is the deviation of an individual observed outcome (DV)
More informationCourse Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -
More informationSection 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples
Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means
More informationIntroduction to Within-Person Analysis and RM ANOVA
Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides
More informationLecture 14: Introduction to Poisson Regression
Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why
More informationModelling counts. Lecture 14: Introduction to Poisson Regression. Overview
Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week
More information1 Mixed effect models and longitudinal data analysis
1 Mixed effect models and longitudinal data analysis Mixed effects models provide a flexible approach to any situation where data have a grouping structure which introduces some kind of correlation between
More informationProfile Analysis Multivariate Regression
Lecture 8 October 12, 2005 Analysis Lecture #8-10/12/2005 Slide 1 of 68 Today s Lecture Profile analysis Today s Lecture Schedule : regression review multiple regression is due Thursday, October 27th,
More informationYou can compute the maximum likelihood estimate for the correlation
Stat 50 Solutions Comments on Assignment Spring 005. (a) _ 37.6 X = 6.5 5.8 97.84 Σ = 9.70 4.9 9.70 75.05 7.80 4.9 7.80 4.96 (b) 08.7 0 S = Σ = 03 9 6.58 03 305.6 30.89 6.58 30.89 5.5 (c) You can compute
More informationMLMED. User Guide. Nicholas J. Rockwood The Ohio State University Beta Version May, 2017
MLMED User Guide Nicholas J. Rockwood The Ohio State University rockwood.19@osu.edu Beta Version May, 2017 MLmed is a computational macro for SPSS that simplifies the fitting of multilevel mediation and
More informationEXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY
EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE IN STATISTICS, 009 MODULE 4 : Linear models Time allowed: One and a half hours Candidates should answer THREE questions. Each question carries
More informationIntroduction (Alex Dmitrienko, Lilly) Web-based training program
Web-based training Introduction (Alex Dmitrienko, Lilly) Web-based training program http://www.amstat.org/sections/sbiop/webinarseries.html Four-part web-based training series Geert Verbeke (Katholieke
More informationRegression: Main Ideas Setting: Quantitative outcome with a quantitative explanatory variable. Example, cont.
TCELL 9/4/205 36-309/749 Experimental Design for Behavioral and Social Sciences Simple Regression Example Male black wheatear birds carry stones to the nest as a form of sexual display. Soler et al. wanted
More informationPackage MACAU2. R topics documented: April 8, Type Package. Title MACAU 2.0: Efficient Mixed Model Analysis of Count Data. Version 1.
Package MACAU2 April 8, 2017 Type Package Title MACAU 2.0: Efficient Mixed Model Analysis of Count Data Version 1.10 Date 2017-03-31 Author Shiquan Sun, Jiaqiang Zhu, Xiang Zhou Maintainer Shiquan Sun
More informationLongitudinal Modeling with Logistic Regression
Newsom 1 Longitudinal Modeling with Logistic Regression Longitudinal designs involve repeated measurements of the same individuals over time There are two general classes of analyses that correspond to
More informationIntroducing Generalized Linear Models: Logistic Regression
Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and
More informationLinear Regression. In this lecture we will study a particular type of regression model: the linear regression model
1 Linear Regression 2 Linear Regression In this lecture we will study a particular type of regression model: the linear regression model We will first consider the case of the model with one predictor
More informationBayesian Linear Regression
Bayesian Linear Regression Sudipto Banerjee 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. September 15, 2010 1 Linear regression models: a Bayesian perspective
More information36-309/749 Experimental Design for Behavioral and Social Sciences. Sep. 22, 2015 Lecture 4: Linear Regression
36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 22, 2015 Lecture 4: Linear Regression TCELL Simple Regression Example Male black wheatear birds carry stones to the nest as a form
More informationRon Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)
Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October
More informationOutline. Statistical inference for linear mixed models. One-way ANOVA in matrix-vector form
Outline Statistical inference for linear mixed models Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark general form of linear mixed models examples of analyses using linear mixed
More informationA discussion on multiple regression models
A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value
More informationSAS Syntax and Output for Data Manipulation: CLDP 944 Example 3a page 1
CLDP 944 Example 3a page 1 From Between-Person to Within-Person Models for Longitudinal Data The models for this example come from Hoffman (2015) chapter 3 example 3a. We will be examining the extent to
More informationInvestigating Models with Two or Three Categories
Ronald H. Heck and Lynn N. Tabata 1 Investigating Models with Two or Three Categories For the past few weeks we have been working with discriminant analysis. Let s now see what the same sort of model might
More informationRandom Intercept Models
Random Intercept Models Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline A very simple case of a random intercept
More informationMeta-analysis of epidemiological dose-response studies
Meta-analysis of epidemiological dose-response studies Nicola Orsini 2nd Italian Stata Users Group meeting October 10-11, 2005 Institute Environmental Medicine, Karolinska Institutet Rino Bellocco Dept.
More informationNemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014
Nemours Biomedical Research Statistics Course Li Xie Nemours Biostatistics Core October 14, 2014 Outline Recap Introduction to Logistic Regression Recap Descriptive statistics Variable type Example of
More informationCourse Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012
More informationAcknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression
INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical
More informationMANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA:
MULTIVARIATE ANALYSIS OF VARIANCE MANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA: 1. Cell sizes : o
More informationSTA6938-Logistic Regression Model
Dr. Ying Zhang STA6938-Logistic Regression Model Topic 2-Multiple Logistic Regression Model Outlines:. Model Fitting 2. Statistical Inference for Multiple Logistic Regression Model 3. Interpretation of
More informationBinary Logistic Regression
The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b
More informationMixed models in R using the lme4 package Part 7: Generalized linear mixed models
Mixed models in R using the lme4 package Part 7: Generalized linear mixed models Douglas Bates University of Wisconsin - Madison and R Development Core Team University of
More informationRonald Heck Week 14 1 EDEP 768E: Seminar in Categorical Data Modeling (F2012) Nov. 17, 2012
Ronald Heck Week 14 1 From Single Level to Multilevel Categorical Models This week we develop a two-level model to examine the event probability for an ordinal response variable with three categories (persist
More informationI r j Binom(m j, p j ) I L(, ; y) / exp{ y j + (x j y j ) m j log(1 + e + x j. I (, y) / L(, ; y) (, )
Today I Bayesian analysis of logistic regression I Generalized linear mixed models I CD on fixed and random effects I HW 2 due February 28 I Case Studies SSC 2014 Toronto I March/April: Semi-parametric
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science
UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator
More informationSTAT 5500/6500 Conditional Logistic Regression for Matched Pairs
STAT 5500/6500 Conditional Logistic Regression for Matched Pairs Motivating Example: The data we will be using comes from a subset of data taken from the Los Angeles Study of the Endometrial Cancer Data
More informationSCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models
SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION
More informationIntroduction to logistic regression
Introduction to logistic regression Tuan V. Nguyen Professor and NHMRC Senior Research Fellow Garvan Institute of Medical Research University of New South Wales Sydney, Australia What we are going to learn
More informationModel Estimation Example
Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions
More informationCorrelation and regression
1 Correlation and regression Yongjua Laosiritaworn Introductory on Field Epidemiology 6 July 2015, Thailand Data 2 Illustrative data (Doll, 1955) 3 Scatter plot 4 Doll, 1955 5 6 Correlation coefficient,
More informationLab 11. Multilevel Models. Description of Data
Lab 11 Multilevel Models Henian Chen, M.D., Ph.D. Description of Data MULTILEVEL.TXT is clustered data for 386 women distributed across 40 groups. ID: 386 women, id from 1 to 386, individual level (level
More informationLinear Regression Models P8111
Linear Regression Models P8111 Lecture 25 Jeff Goldsmith April 26, 2016 1 of 37 Today s Lecture Logistic regression / GLMs Model framework Interpretation Estimation 2 of 37 Linear regression Course started
More informationOutline. Mixed models in R using the lme4 package Part 5: Generalized linear mixed models. Parts of LMMs carried over to GLMMs
Outline Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates University of Wisconsin - Madison and R Development Core Team UseR!2009,
More informationVarieties of Count Data
CHAPTER 1 Varieties of Count Data SOME POINTS OF DISCUSSION What are counts? What are count data? What is a linear statistical model? What is the relationship between a probability distribution function
More informationGenerative classifiers: The Gaussian classifier. Ata Kaban School of Computer Science University of Birmingham
Generative classifiers: The Gaussian classifier Ata Kaban School of Computer Science University of Birmingham Outline We have already seen how Bayes rule can be turned into a classifier In all our examples
More informationRepeated ordinal measurements: a generalised estimating equation approach
Repeated ordinal measurements: a generalised estimating equation approach David Clayton MRC Biostatistics Unit 5, Shaftesbury Road Cambridge CB2 2BW April 7, 1992 Abstract Cumulative logit and related
More informationName: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm
Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam June 8 th, 2016: 9am to 1pm Instructions: 1. This is exam is to be completed independently. Do not discuss your work with
More informationLogistic regression: Miscellaneous topics
Logistic regression: Miscellaneous topics April 11 Introduction We have covered two approaches to inference for GLMs: the Wald approach and the likelihood ratio approach I claimed that the likelihood ratio
More informationST3241 Categorical Data Analysis I Multicategory Logit Models. Logit Models For Nominal Responses
ST3241 Categorical Data Analysis I Multicategory Logit Models Logit Models For Nominal Responses 1 Models For Nominal Responses Y is nominal with J categories. Let {π 1,, π J } denote the response probabilities
More informationRon Heck, Fall Week 3: Notes Building a Two-Level Model
Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level
More information