Fisher information for generalised linear mixed models

Similar documents
Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University

Sample size calculations for logistic and Poisson regression models

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

LOGISTIC REGRESSION Joseph M. Hilbe

The equivalence of the Maximum Likelihood and a modified Least Squares for a case of Generalized Linear Model

Lecture 16 Solving GLMs via IRWLS

Using Estimating Equations for Spatially Correlated A

Second-order approximation of dynamic models without the use of tensors

High-dimensional asymptotic expansions for the distributions of canonical correlations

PQL Estimation Biases in Generalized Linear Mixed Models

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

Generalised Regression Models

Selection of Smoothing Parameter for One-Step Sparse Estimates with L q Penalty

Explicit connections between longitudinal data analysis and kernel machines

Linear model A linear model assumes Y X N(µ(X),σ 2 I), And IE(Y X) = µ(x) = X β, 2/52

1 Mixed effect models and longitudinal data analysis

Outline of GLMs. Definitions

Generalized Quasi-likelihood (GQL) Inference* by Brajendra C. Sutradhar Memorial University address:

Some explanations about the IWLS algorithm to fit generalized linear models

Trends in Human Development Index of European Union

STA216: Generalized Linear Models. Lecture 1. Review and Introduction

Mixed models in R using the lme4 package Part 5: Generalized linear mixed models

Adaptive modelling of conditional variance function

ON VARIANCE COVARIANCE COMPONENTS ESTIMATION IN LINEAR MODELS WITH AR(1) DISTURBANCES. 1. Introduction

Generalized Linear Models. Last time: Background & motivation for moving beyond linear

Generalized Linear Models (GLZ)

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014

Approximate Likelihoods

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields

SB1a Applied Statistics Lectures 9-10

Bias Study of the Naive Estimator in a Longitudinal Binary Mixed-effects Model with Measurement Error and Misclassification in Covariates

Longitudinal data analysis using generalized linear models

Sampling distribution of GLM regression coefficients

STAT 135 Lab 13 (Review) Linear Regression, Multivariate Random Variables, Prediction, Logistic Regression and the δ-method.

Mixed models in R using the lme4 package Part 7: Generalized linear mixed models

HOW IS GENERALIZED LEAST SQUARES RELATED TO WITHIN AND BETWEEN ESTIMATORS IN UNBALANCED PANEL DATA?

Generalized Linear Models. Kurt Hornik

Generalized Estimating Equations (gee) for glm type data

Submitted to the Brazilian Journal of Probability and Statistics

Confidence intervals for the variance component of random-effects linear models

Estimation in Generalized Linear Models with Heterogeneous Random Effects. Woncheol Jang Johan Lim. May 19, 2004

Comments. x > w = w > x. Clarification: this course is about getting you to be able to think as a machine learning expert

Mixed models in R using the lme4 package Part 5: Generalized linear mixed models

Outline. Mixed models in R using the lme4 package Part 5: Generalized linear mixed models. Parts of LMMs carried over to GLMMs

,..., θ(2),..., θ(n)

LONGITUDINAL DATA ANALYSIS

Bayesian Regressions in Experimental Design

Discrete Response Multilevel Models for Repeated Measures: An Application to Voting Intentions Data

Rejoinder. 1 Phase I and Phase II Profile Monitoring. Peihua Qiu 1, Changliang Zou 2 and Zhaojun Wang 2

P n. This is called the law of large numbers but it comes in two forms: Strong and Weak.

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit

Generalized Linear Models Introduction

Generalised linear mixed model analysis via sequential Monte Carlo sampling

Computational methods for mixed models

AN EFFICIENT GLS ESTIMATOR OF TRIANGULAR MODELS WITH COVARIANCE RESTRICTIONS*

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model

A Note on Bootstraps and Robustness. Tony Lancaster, Brown University, December 2003.

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon

Chap 2. Linear Classifiers (FTH, ) Yongdai Kim Seoul National University

On Expected Gaussian Random Determinants

Generalized linear models

Repeated ordinal measurements: a generalised estimating equation approach

Quasi-Monte Carlo for Highly Structured Generalised Response Models

MIT Spring 2016

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Corrigendum to Inference on impulse response functions in structural VAR models [J. Econometrics 177 (2013), 1-13]

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses

STA 216: GENERALIZED LINEAR MODELS. Lecture 1. Review and Introduction. Much of statistics is based on the assumption that random

Linear Methods for Prediction

Multivariate spatial models and the multikrig class

Multivariate Analysis and Likelihood Inference

Conditional Inference Functions for Mixed-Effects Models with Unspecified Random-Effects Distribution

Lecture 3 September 1

Generalized Quasi-likelihood versus Hierarchical Likelihood Inferences in Generalized Linear Mixed Models for Count Data

Generalized Linear Models

Lecture Notes. Introduction

Generalized Linear Models

Model Selection for Semiparametric Bayesian Models with Application to Overdispersion

Longitudinal Data Analysis. Michael L. Berbaum Institute for Health Research and Policy University of Illinois at Chicago

f(x θ)dx with respect to θ. Assuming certain smoothness conditions concern differentiating under the integral the integral sign, we first obtain

Sample size determination for logistic regression: A simulation study

Web-Supplement for: Accurate Logistic Variational Message Passing: Algebraic and Numerical Details

MA 575 Linear Models: Cedric E. Ginestet, Boston University Mixed Effects Estimation, Residuals Diagnostics Week 11, Lecture 1

Regression models for multivariate ordered responses via the Plackett distribution

The Multivariate Gaussian Distribution

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands

Corrigendum to Inference on impulse. response functions in structural VAR models. [J. Econometrics 177 (2013), 1-13]

Lecture 4: Exponential family of distributions and generalized linear model (GLM) (Draft: version 0.9.2)

Lecture 13: Simple Linear Regression in Matrix Format. 1 Expectations and Variances with Vectors and Matrices

Augustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models

DISPLAYING THE POISSON REGRESSION ANALYSIS

Solving Non-Homogeneous Coupled Linear Matrix Differential Equations in Terms of Matrix Convolution Product and Hadamard Product

Composite Likelihood Estimation

Efficiency of generalized estimating equations for binary responses

Some Theories about Backfitting Algorithm for Varying Coefficient Partially Linear Model

On Properties of QIC in Generalized. Estimating Equations. Shinpei Imori

Sample Size and Power Considerations for Longitudinal Studies

Optimization. The value x is called a maximizer of f and is written argmax X f. g(λx + (1 λ)y) < λg(x) + (1 λ)g(y) 0 < λ < 1; x, y X.

Transcription:

Journal of Multivariate Analysis 98 2007 1412 1416 www.elsevier.com/locate/jmva Fisher information for generalised linear mixed models M.P. Wand Department of Statistics, School of Mathematics and Statistics, University of New South Wales, Sydney 2052, Australia Received 25 October 2005 Available online 11 January 2007 Abstract The Fisher information for the canonical link exponential family generalised linear mixed model is derived. The contribution from the fixed effects parameters is shown to have a particularly simple form. 2007 Elsevier Inc. All rights reserved. AMS 2000 subject classification: 62J12; 62H12; 62F12 Keywords: Longitudinal data analysis; Semiparametric regression; Vector differential calculus; Variance components 1. Introduction Linear mixed models are an extension of linear models for which covariance structure is based on random effects and their covariance parameters. A further extension is generalised linear mixed models, which specifically cater for non-normal response variables. The use of such models has increased dramatically in the past decade. For example, they are now the main vehicle for analysis of longitudinal data e.g. [1,2]. Their use in semiparametric regression is advocated in some recent literature e.g. [6]. The popularity of linear mixed models has been accompanied by vigorous research on analytic results and computational methods. McCulloch and Searle [5] provides a comprehensive summary of the developments that took place in the 20th Century. In the case of the normal response linear mixed model with variance component structure they give an elegant and succinct expression for the Fisher information matrix of the model parameters. The result is reproduced in Section 2. It is revealing in that asymptotic independence of fixed effects and variance components is apparent and useful in that asymptotic sampling variances may be obtained. In this note, we derive an explicit expression for the Fisher information for generalised linear mixed models for exponential family response variables. Potentially, this analytic result can be E-mail address: wand@maths.unsw.edu.au. 0047-259X/$ - see front matter 2007 Elsevier Inc. All rights reserved. doi:10.1016/j.jmva.2007.01.001

M.P. Wand / Journal of Multivariate Analysis 98 2007 1412 1416 1413 used in the same way that Fisher information matrices are used for standard error approximations in generalised linear models and normal linear mixed models. In Section 2 we briefly review the Fisher information results for normal linear mixed models. New results for generalised mixed models are presented in Section 3 and their ramifications are discussed. 2. Normal linear mixed models The normal linear mixed model is given by y u NXβ + Zu, R. Here β is the vector of fixed effects, with corresponding design matrix X, and u is the vector of random effects, with corresponding design matrix Z. The random effects vector u has density fu and is such that Eu 0. Most commonly, the random effects distribution is Gaussian: u N0, G for some covariance matrix G. More details are given in Chapter 6 of McCulloch and Searle [5]. Regardless of the random effects distribution, the Fisher information matrix of β is X T V 1 X where V covy ZGZ T + R is the covariance matrix of y. Now consider the special case of variance component covariance structure Zu r Z i u i, i1 G blockdiag i I q i and R 0 I n, 1 1 i r where I d denotes the d d identity matrix, q i is the length of u i and n is the length of y. The full parameter vector is β, where 0,...,σ2 r. Let its Fisher information matrix be written as [ ] Iββ Iβ, E{H β, log fy; β, I β } I T, 2 I β where fy; β, denotes the density of y and H β, denotes the Hessian matrix with respect to β,. From above I ββ X T V 1 X. Also, regardless of fu;, I β 0 which indicates an asymptotic independence between the maximum likelihood estimators of β and. In the special case of u being normally distributed the I σ2 block has an explicit expression that results in [ X T Iβ, V 1 ] X 0 1 0 2 [tr{zt i V 1 Z j Z T i V 1 Z j T, 3 }] 0 i,j, r where Z 0 I. This is Eq. 6.62 of McCulloch and Searle [5]. In the ensuing discussion, these authors describe asymptotic sampling variances based on I β, 1 where β and denote the maximum likelihood estimators. 3. Generalised linear mixed models As mentioned previously, generalised linear mixed models extend linear mixed models to nonnormal response situations. Alternatively, generalised linear mixed models extend generalised linear models by allowing for the incorporation of random effects. A useful class of generalised

1414 M.P. Wand / Journal of Multivariate Analysis 98 2007 1412 1416 linear models is that corresponding to the one-parameter exponential family with canonical link as treated in McCullagh and Nelder [4]. If y is a random vector and X is a general design matrix with corresponding parameter vector β then the model may be expressed in terms of the density function of y as fy exp{y T Xβ 1 T bxβ + 1 T cy}, 4 where, for example, bx expx for the Poisson model and bx log1+e x for the Bernoulli model. For a general vector v [v 1,...,v d ] T, bv denotes the vector [bv 1,...,bv d ] T. The mixed model extension of 4 is fy u exp{y T Xβ + Zu 1 T bxβ + Zu + 1 T cy}, where u is the vector of random effects and Z is a corresponding design matrix. As in Section 2 let fu denote the density of u and retain the convention that Eu 0. Result 1. Regardless of the random effects density fu I ββ X T cov[y E{Ey u y}]x. 5 Remark 1. While 5 is exact, calculation of the conditional moments in this expression can be quite difficult. Generally, they involve intractable multivariate integrals although, for some specific cases e.g. random intercept models, some integrals simplify to low-dimensional forms. Algorithms and approximations for dealing with such integrals have been the subject of a great deal of research in the past several years [5, Chapter 10]. Such research is ongoing. Now consider the special case of variance component structure where G G is given by 1. Then the full parameter vector is β, where 1,...,σ2 r. The full Fisher information matrix takes the form of 2 where I ββ is given by Result 1. The contribution from the variance component vector has the following simplified expression: { Dσ I σ [E 2 E 2fu; T { Dfu; σ fu; y} 2 E fu; y} ], 6 where D fu; denotes the derivative vector of fu; with respect to see Appendix. For general fu; no further simplification of 6 is possible. However, in the case of u N0, G an explicit expression can be obtained. It depends on previously defined expressions and the r i1 q i 2 r binary matrix I [I 1... I r ] where I i vec{blockdiago q1... O qi 1 I qi O qi+1... O qr } and O d denotes the d d matrix of zeroes. Result 2. If u N0, G, where G is given by 1, then I 1 4 IT G 1 G 1 cov[vec{euu T y}]g 1 G 1 I. The off-diagonal block is { I β X T Dfu; } E [y E{Ey u y}]e fu; y. 7

M.P. Wand / Journal of Multivariate Analysis 98 2007 1412 1416 1415 In general, this matrix is non-zero. This indicates that the maximum likelihood estimators of fixed effects and variance components are not asymptotically independent in the general exponential family setting. In the special case of normal u we have Result 3. If u N0, G, where G is given by 1, then I β 2 1 XT E [y E{Ey u y}] vec[{g 1 Euu T y I}G 1 ] T I. Remark 2. Approximate standard errors for the maximum likelihood estimates β and can be obtained from the diagonal entries of I β, 1. However, as pointed out in Remark 1, implementation is often hindered by intractable multivariate integrals. Additionally, dependence among the entries of y induced by u means that central limit theorems of the type: I β, 1 { β, β, } converges in distribution to a N0, I random vector, have not been established in general and, hence, interpretation of standard errors is cloudy. Nevertheless, there are many special cases, such as m-dependence when the data are from a longitudinal study, for which central limit theorems can be established. Acknowledgments This article has benefited considerably from the comments of an associate editor. Appendix. Derivations Differential calculus preliminaries: Let f be a scalar-valued function with argument x R d. The derivative vector of f, Dfx, is the 1 d vector whose ith entry is fx/ x i. The Hessian matrix of f is the d d matrix Hfx D{Dfx T }. Magnus and Neudecker [3] and Wand [7] describe techniques for finding derivative vectors and Hessian matrices. For a general smooth log-likelihood lθ with parameter vector θ the Fisher information is Iθ E{ Hlθ} E{Dlθ T Dlθ}. Either expression for Iθ could be used. For generalised linear mixed models the second one leads to more direct derivation of the Fisher information. Derivation of Result 1: The log-likelihood is lβ, log fy, u; β, du, where fy, u; β, exp{y T Xβ + Zu 1 T bxβ + Zu + 1 T cy}fu;. The first differential is fy, u; β, σ d β lβ, 2 [{y b Xβ + Zu} T X dβ] du fy; β, and so [y E{Ey u y}] T X dβ D β lβ, [y E{Ey u y}] T X.

1416 M.P. Wand / Journal of Multivariate Analysis 98 2007 1412 1416 Therefore, I ββ E{D β lβ, T D β lβ, }X T E [y E{Ey u y}] 2 X, where, for a general vector v, v 2 vv T. Result 1 then follows from the fact that E[E{Ey u y}] Ey. Derivation of Result 2: We have fu; 2π q/2 G 1/2 exp 2 1 ut G 1 u and d fu; 2π q/2 2 1 G 3/2 G trg 1 d G exp 2 1 ut G 1 u + 2π q/2 G 1/2 exp 2 1 ut G 1 u 1 2 ut { G 1 d G G 1 }u 2 1 fu; σ2 tr{g 1 uu T IG 1 d G } 2 1 fu; σ2 vec{g 1 uu T IG 1 } T d vecg 2 1 fu; σ2 vec{g 1 uu T IG 1 } T I d courtesy of the identity tra T B veca T vecb and the result vecg I. It follows immediately that, for u N0, G, D fu; fu; and, because of 6, I 1 4 IT E 1 2 vec{g 1 uu T IG 1 } T I 8 vec[{g 1 Euu T y IG 1 }] 2 I. 9 Noting that E{Euu T y}euu T G and the matrix identity vecabcc T B vecb, the expectation in 9 becomes cov[vec{g 1 Euu T yg 1 }] cov[g 1 G 1 vec{euu T y}] G 1 G 1 cov[vec{euu T y}]g 1 G 1. Result 2 then follows immediately from this expression and 9. Derivation of Result 3: This result follows immediately from 7 and 8. References [1] P. Diggle, P. Heagerty, K.-L. Liang, S. Zeger, Analysis of Longitudinal Data, second ed., Oxford University Press, Oxford, 2002. [2] G.M. Fitzmaurice, N.M. Laird, J.H. Ware, Applied Longitudinal Analysis, Wiley, New York, 2004. [3] J.R. Magnus, H. Neudecker, Matrix Differential Calculus with Applications in Statistics and Econometrics, second ed., Wiley, Chichester, 1999. [4] P. McCullagh, J.A. Nelder, Generalized Linear Models, second ed., Chapman & Hall, London, 1990. [5] C.E. McCulloch, S.R. Searle, Generalized, Linear, and Mixed Models, Wiley, New York, 2000. [6] D. Ruppert, M.P. Wand, R.J. Carroll, Semiparametric Regression, Cambridge University Press, New York, 2003. [7] M.P. Wand, Vector differential calculus in statistics, Amer. Statist. 56 2002 55 62.