On Properties of QIC in Generalized. Estimating Equations. Shinpei Imori

Size: px
Start display at page:

Download "On Properties of QIC in Generalized. Estimating Equations. Shinpei Imori"

Transcription

1 On Properties of QIC in Generalized Estimating Equations Shinpei Imori Graduate School of Engineering Science, Osaka University 1-3 Machikaneyama-cho, Toyonaka, Osaka , Japan Abstract: The generalized estimating equations (GEE) approach has attracted considerable interest in analysis of correlated response data. An information criterion based on the quasi-likelihood in the GEE framework, called the quasi-likelihood under the independence model criterion (QIC), is proposed in the past literature. This paper studies the properties of the QIC. We establish a formal derivation of the QIC as an asymptotically unbiased estimator of the prediction risk based on the quasi-likelihood. Especially, when deriving the QIC, we explicitly take into account the effect of estimating the correlation matrix used in the GEE procedure. Furthermore, we discuss an adequacy of the risk function used in the derivation of the QIC. Key words: Generalized estimating equations; Longitudinal data analysis; Quasi-likelihood under the independence model criterion; Variable selection. 1. Introduction 1

2 In analysis of biomedical study, longitudinal data often arise, which data are correlated within individual response. The generalized estimating equations (GEE) approach developed by Liang & Zeger (1986) has been of considerable interest for the parameter estimating in such data. The GEE methodology avoids assuming the simultaneous distribution of observations by only assuming a functional form or the marginal distribution at each time and a correlation structure called working correlation matrix. Furthermore, under some regularity conditions (see, Xie & Yang, 2003; Balan & Schiopu-Kratina, 2005), the GEE estimator has properties which include asymptotic normality and consistency even when the working correlation matrix has been misspecified. From these advantage points, the GEE method is often used to analyze the longitudinal data (e.g., Thall & Vail, 1990; Barnhart & Williamson, 1998; Vens & Ziegler, 2012). As with a common regression analysis, we should select the best model among candidate models in the GEE methodology. Model selection in GEE framework has been extensively discussed in previous literature (e.g., Pan, 2001b, 2002; Cantoni, et al., 2005; Shen & Chen, 2012) In general, many model selection methods have been proposed (for details of statistical model selection; see Burnham & Anderson, 2002). It is famous for a model selection to measure the goodness of fit of the model by the risk function based on the expected Kullback-Leibler (KL) information (Kullback & Leibler, 1951). For actual use, we must estimate the risk function, which depends on unknown parameters. Akaike s information criterion (AIC), which is proposed 2

3 by Akaike (1973, 1974) as an estimator of the risk function based on KL information is used for selecting the best model among the candidate models. Since the AIC can be simply defined as 2 the maximum log-likelihood +2 the number of parameters, the AIC is widely applied in many fields for selecting appropriate models using a set of explanatory variables. It may not be adequate to directly use the AIC since we do not assume the multivariate distribution of observations in the GEE procedure. Pan (2001a) extended the AIC to the GEE method based on quasi-likelihood constructed from the estimating equations (Wedderburn, 1974), which is called Quasi-likelihood under the independence model criterion (QIC). The QIC is often used as an alternative of the AIC in applied longitudinal analysis, and a representative model selector in GEE methodology. On the other hands, the theoretical properties of the QIC have not been discussed until now. However, the QIC was derived by ignoring the calculation of the particular part. The aim of this paper is to study these problems, especially we examine from the viewpoint of estimating the correlation parameter. In the present paper, we establish a formal derivation of the QIC (called formal QIC or fqic) as an asymptotic unbiased estimate of the prediction risk based on the quasi-likelihood. Notably, when deriving the formal QIC, we explicitly take into account the effect of estimating the correlation matrix used in the GEE procedure. Furthermore, we discuss an adequacy of the risk function used in the derivation of the QIC. Concretely, when we extend the independence quasi-likelihood to more general case (i.e., the mul- 3

4 tivariate quasi-likelihood) in order to consider correlations of response, the risk function reduces to the same one used in the derivation of the QIC. For details of the meaning of this adequacy, we show in section 3. The rest of the paper is organized as follows. Section 2 introduces the GEE approach and re-derive the QIC formally. We present the properties of the QIC in section 3. A comparison between the formal QIC and original QIC is given in section 4. In section 5, we conclude this paper. Technical details are provided in the Appendix. 2. Modifications of the original QIC In this section, we present the definition of the GEE and re-derive the QIC, formally. For individuals i = 1,..., n, we have an m-dimensional response vector y i = (y i1,..., y im ), and an m p explanatory variable matrix X i = (x i1,..., x im ). Note that y = (y 1,..., y n). In general, the components of y i are correlated but y 1,..., y n are independent. Furthermore, we do not decide the simultaneous distribution of each y i. In the GEE frameworks, we assume y ij be the generalized linear model (GLM) developed in Nelder & Wedderburn (1972). Hence, the probability density function of y ij is expressed as follows: { } θij y ij b(θ ij ) f(y ij ; θ ij, ϕ) = exp + c(y ij, ϕ), a(ϕ) where a( ), b( ), and c( ) are known functions, θ ij is an unknown location parameter, and ϕ is a known scale parameter. Let us the first and second 4

5 moments for y ij be E[y ij ] = µ ij, Var[y ij ] = σij, 2 respectively. From the GLM properties, we can obtain µ ij = b (θ ij ), σij 2 = σ 2 (µ ij ) = a(ϕ)b (θ ij ). For all θ i = (θ i1,..., θ im ), we would like to note that h(µ ij ) = h(µ(θ ij )) = η ij, where h( ) is a link function, η ij = x ijβ is a linear predictor, and β is an unknown regression parameter. Herein, the GEE for β is expressed as follows: g(β; R, y) = D iv 1 i (y i µ i ) = 0 p. (1) where D i = µ i / β = A i i X i, i = diag( θ i1 / η i1,..., θ im / η im ), A i = a(ϕ)diag{b (θ i1 ),..., b (θ im )}, µ i = (µ i1,..., µ im ), V i = A 1/2 i RA 1/2 i, R = R(α) is a working correlation matrix and α is a q-dimensional parameter, which is referred to as a correlation parameter. We can choose some useful working correlation matrices as the situation demands. For instance, independence (i.e., (R) jk = 0, j k), exchangeable (i.e., (R) jk = α, j k), first-order autoregressive (AR-1) (i.e., (R) jk = (R) kj = α j k, j > k), one-dependent (i.e., (R) j,j+1 = (R) j+1,j = α), and unstructured (i.e., (R) jk = (R) kj = α jk, j > k) are often used. By solving (1), we can obtain ˆβ, which is the GEE estimator of β. In order to guarantee the asymptotic properties, we will assume the regularity appropriate conditions. The risk function based on the independent quasi-likelihood is Risk = E y E y [ 2Q(y ; ˆβ)], (2) 5

6 where y is a future observation and Q(y; β) is the quasi-likelihood proposed by McCullagh & Nelder (1989). That is, Q(y; β) = m µij j=1 y ij y ij t σ 2 (t) dt, where σ 2 ( ) is a variance function such that σ 2 (µ ij ) = a(ϕ)b (θ ij ). Let B be the bias when estimating (2) by 2Q(y; ˆβ). The QIC is defined as QIC = 2Q(y; ˆβ) + ˆB, (3) where ˆB is an estimator of B. We divide B in order to calculate precisely as follows: B = E y E y [ 2Q(y ; ˆβ) + 2Q(y; ˆβ)] = E y E y [ 2Q(y ; ˆβ) + 2Q(y ; β)] (B1) + E y E y [ 2Q(y ; β) + 2Q(y; β)] (B2) + E y [ 2Q(y; β) + 2Q(y; ˆβ)]. (B3) It is obvious (B2)= 0. By applying a Taylor expansion around ˆβ = β to equation Q(y; ˆβ) yields Q(y; ˆβ) = Q(y; β) + Q(y; β) β ( ˆβ β) ( ˆβ β) 2 Q(y; β) β β ( ˆβ β) + o p (1). (4) 6

7 Note that Q(y; β) β = D ia 1 i (y i µ i ), ˆβ = β + 1 n Ω 1 R g(β; R, y) + o p(n 1/2 ), 2 Q(y; β) β β = nω I + o p (n), (5) where Ω R = 1 n D iv 1 i D i, Ω I = 1 n D ia 1 i D i. Substituting (4) and (5) into (B1) and (B3), respectively, we can show that (B1) = tr(v s Ω I ) + o(1), (B3) = 2tr(V a Ω 1 R ) tr(v sω I ) + o(1), (6) where V a = 1 n V s = Ω 1 R D iv 1 i Cov[y i ]A 1 i D i, ( 1 n From (6), an expansion of B is given as follows: ) (7) D iv 1 i Cov[y i ]V 1 i D i Ω 1 R. B = 2tr(V a Ω 1 R ) + o(1). (8) 7

8 By substituting (8) into (3) and ignoring the term of o(1), we can obtain fqic = 2Q(y; ˆβ) + 2tr( ˆV a ˆΩ 1 R ), where ˆV a, ˆVs ˆΩR and ˆΩ I are substituted ˆβ into β in V a, V s, Ω R and Ω I, respectively. On the other hand, the original QIC proposed in Pan (2001a) is an estimator of (2) defined as follows: original QIC = 2Q(y; ˆβ) + 2tr( ˆV s ˆΩI ). We would like to note that V a = V s and Ω R = Ω I when R is independence. Furthermore, when R includes the true correlation structure, Ω I = V a and Ω 1 R = V s are achieved since V i = Cov[y i ] in this situations. Hence, the original QIC is exactly and asymptotically equivalent to the formal QIC when the working correlation matrix is independence and includes the true correlation structure, respectively. More comparisons are given in section Effect of estimating the correlation parameter We present two properties of the formal QIC in this section. Firstly, we mention about the bias of the QIC. In Pan (2001a), the original QIC was derived by only considering the bias that arises when estimating β. However, we would like to note that we need to estimate α when we use the QIC in real 8

9 data analysis. Hence, the QIC can be regarded as a function of β and α and we must use the QIC by substituting each estimator of unknown parameters. Hence, we should also consider the bias that arises when estimating α. According to Liang & Zeger (1986), we can assume that α is defined as a function of the sample correlation matrix R(β), R(β) = 1 n p A 1/2 i (y i µ i )(y i µ i ) A 1/2 i. For example, by assuming the exchangeable covariance structure (i.e., (R) jk = α), α = n p m(m 1) a(ϕ) m y ij b (θ ij ) y ik b (θ ik ). (9) b (θ ij ) 1/2 b (θ ik ) 1/2 j<k When we explicitly take into account the effect of estimating α, the simultaneous estimating equation of β and α is given as follows: α = α(β) = h( R(β)), D iv 1 i (y i µ i ) = 0 p, (10) where h( ) is a q-dimensional vector-valued function. Let ˆβ s and ˆα s be the solution of (10). Under these situations, the risk function (2) is rewritten as E y E y [ 2Q(y ; ˆβ s )], and we can show the following theorem. 9

10 Theorem.1. Let us assume vec{r(α 0 ) 1 }, α(β) = O α β p (1), (11) where α 0 = h(r 0 ) and R 0 is the true correlation matrix. The formal QIC is an asymptotically unbiased estimator of the Risk function even when taking account into the effect of estimating α. That is, E y E y [ 2Q(y ; ˆβ s )] E y [fqic(y; ˆβ s )] = o(1). Details of a proof of theorem 1 are given in Appendix A.1. This theorem leads that the formal QIC is an asymptotic unbiased estimator of the risk function in taking account into the effect of estimating correlation parameter. This is an optimality of the formal QIC. Liang & Zeger (1986) provided ways to estimate that can satisfy the assumption (11). For example, in the case of the exchangeable correlation structure, R(α) 1 = 1 1 α (I 1 m P m ) (m 1)α P m, (12) 10

11 where P m = 1 m 1 m/m. Derivations of (9) and (12) are expressed as follows: α(β) = 1 β (n p) vec{r(α 0 ) 1 } α = 2 1 m(m 1) a(ϕ) m j k [ b (θ ij ) 1/2 + {y ] ij b (θ ij )}b (θ ij ) 2b (θ ij ) 3/2 y ik b (θ ik ) θ ij x b (θ ik ) 1/2 ij, η ij 1 (1 α 0 ) vec(i m 1 2 m P m ) {1 + (m 1)α 0 } vec(p m). 2 Hence, the assumption (11) is satisfied when α 0 1, 1/(m 1), which is usually assumed for the non-singularity of the working correlation matrix. Next, we consider the risk function of the formal QIC. In spite of assuming a correlation structure for response in the GEE procedure, the risk function (2) is based on the independence quasi-likelihood. In order to resolve this contradiction, we attempt to expand the risk function to the quasilikelihood with considering the correlations, which is the multivariate quasilikelihood (McCullagh & Nelder, 1989). As mentioned in Pan (2001a), the GEE methodology is closely related to the multivariate quasi-likelihood. The ith multivariate quasi-likelihood Q m (y i ; M, β) is given as the following differential form: Q m (y i ; M, β) µ i = M(µ i ) 1 (y i µ i ). where M( ) is an m m matrix-valued function. We can get the multivariate 11

12 quasi-likelihood to line integrate as the following expression. Q m (y i ; M, β) = t=µi t=y i (y i t) M(t) 1 dt, (13) where t = (t 1,..., t m ). In general, the value of this integration is misspecified since it depends on the integral path chosen. Such problem had been already discussed. Wang gave an example about the non-integrability of the GEE (see Wang, 1999, Example 2.4). The risk function should be specified because of which is a criteria of statistical models. The following theorem gives a condition about the pathdependence of the multivariate quasi-likelihood. Theorem.2. To assume the independence structure for correlation matrix and/or the constant variance function σ 2 ( ) is a necessary and sufficient condition in order to avoid the path-dependence of the multivariate quasilikelihood in the GEE approach. A proof of theorem 2 is given in Appendix A.2. This theorem suggests an adequacy of the risk function of the formal QIC. It may be possible to decide an ad hoc path of the multivariate quasi-likelihood (for example of the path; see McCullagh & Nelder, 1989). However, it is difficult to derive a new model selector based on the path-decided multivariate quasi-likelihood since the derivation of this multivariate quasi-likelihood become more comparison form than the formal QIC. In order to establish the uniqueness of the risk function 12

13 among the multivariate quasi-likelihood class, we recommend assuming the model independence for the risk function, which leads the same as the risk function of the formal QIC. 4. Comparison of the formal QIC and the original QIC Let us compare the bias term of the formal QIC derived in section 2 with the original QIC through a simulation study. We prepared the four candidate models with 500 samples which is constructed of a 4-dimensional response vector y i = (y i1,..., y i4 ) and an 4 6 explanatory variable matrix X i = (x i1,..., x i4 ). Let x ij = (1, x ij1,..., x i4 ) and x ijk be random variable which are independent and identically distributed as the uniform distribution U(0, 1), k = 1,..., 4. We assume y ij is distributed according to a logistic regression model B(1, p ij ) where p ij = 1/{1+exp( x ijβ)}, β = (1, 1, 0, 0, 0). Let an explanatory variable matrix in the kth candidate model consist of the first k columns of X i, k = 1,..., 4. As mentioned in Section 2, the original QIC is exactly and asymptotically equivalent to the formal QIC when the working correlation matrix is independence and includes the true correlation structure, respectively. However, in the other case, the difference may be greatly. For example, we assume that R is one-dependent and R 0 is exchangeable with correlation parameter α = 0.3. In above situations, we simulated 10,000 repetitions in order to compare the formal QIC and original QIC. We show the average value and bias of the 13

14 Table 1: mean and bias of formal QIC and original QIC in the case 1 candidate model risk function formal QIC mean bias original QIC mean bias Figure 1: value of the fomal QIC and original QIC in the case Risk function Formal QIC Original QIC Model index 14

15 each QIC in table 1. From table 1, we can see that the bias of the original QIC is getting large when the number of the parameter is getting increase even when the formal QIC keeps a stable value in each model. Hence, there may be a non-negligible difference between the formal QIC and original QIC. Therefore, we recommend the use of the formal QIC rather than the original QIC for the model selection in GEE procedure. 5. Conclusions In the present paper, we derive the formal QIC as an asymptotic unbiased estimator of the risk function based on the independent quasi-likelihood. Through the simulation study, we illustrate that the difference between the formal QIC and original QIC may have non-negligible effect for model selection, especially when the true correlation structure completely different from the working correlation structure. Furthermore, we show two theorems regarding the formal QIC. In Theorem 1, we prove the asymptotically unbiasedness of the formal QIC in taking account into the effect of estimating correlation parameter. This result may arise because of the risk function of the formal QIC, which do not include the correlation parameter since it is based on the independent quasi-likelihood. In Theorem 2, we obtain an adequacy of the formal QIC in considering a wide class of the risk function, which is based on the multivariate quasi-likelihood. The unique risk function can be established by assuming the independence structure for mul- 15

16 tivariate quasi-likelihood. These theorems guarantee the adequacy of the formal QIC. Appendix A.1. Proof of Theorem 1 We can re-writing (10) as D ia 1/2 i R(α(β)) 1 A 1/2 i (y i µ i ) = 0 p. (A.1) Under the assumptions of Theorem 1, by applying chain rule into the derivation of (A.1), we are immediately able to show vec {R(α(β)) 1 } = vec {R(α) 1 } α(β) β α β = vec {R(α 0) 1 } α α(β) β + o p (1) = O p (1). (A.2) By combining (A.1) and (A.2), we obtain 1 n D ia 1/2 i R(α(β)) 1 A 1/2 i (y i µ i ) β = Ω R + o p (1). This leads the following result, n( ˆβs β) = 1 n Ω 1 R g(β; R, D) + o p(1). 16

17 The rest of proof is very similar to the derivation of the QIC in section 2. We obtain E y E y [Q(y ; ˆβ)] E y [Q(y ; ˆβ s )] = o(1). A.2. Proof of Theorem 2 Applying the Stokes theorem, which is usually used in differential geometry, we obtain the necessary and sufficient conditions in order to avoid the path-dependence of (13) as follows: m j,k,l (y ij t j ) (M(t) 1 ) jk t l = 0, (A.3) for individuals, i = 1,..., n. Hence, we re-write the condition (A.3) as { m (M(t) 1 ) jk (y ij t i ) t l j=1 (M(t) 1 ) jl t k } = 0, k > l. Since we assume M(t) = A(t) 1/2 RA(t) 1/2 in the GEE methodology, this is equivalent to the following conditions: (y il t l ) σ(t k) 1 σ(t l ) 1 (R 1 ) lk = (y ik t k ) σ(t l) 1 σ(t k ) 1 (R 1 ) kl (A.4) t l t k 17

18 where σ(µ ij ) 2 = a(ϕ)b (θ ij ). If we assume (R) kl 0, (A.4) is equivalent to σ(t l ) 1 = 0. t l From above results, we can see that for all 1 k < l m, (R) kl = 0 or the variance function σ 2 ( ) is constant in order to achieve the expression (A.4). The proof of Theorem 2 is completed. References Akaike, H. (1973). Information theory and an extension of the maximum likelihood principle. In Second International Symposium on Information Theory (eds. Petrov, B. N. & Csáki, F.), , Akadémiai Kiadó, Budapest. Akaike, H. (1974). A new look at the statistical model identification. IEEE Trans. Automatic Control, AC-19, Balan R. M. & Schiopu-Kratina, I. (2005). Asymptotic results with generalized estimating equations for longitudinal data. Ann. Statist., 33, Barnhart, H. X. & Williamson, J. M. (1998). Goodness-of-fit tests for the GEE modeling with binary responses. Biometrics., 54, Burnham, K. P. & Anderson, D. R. (2002). Model selection and multi- 18

19 model inference: A practical information-theoretic approach, 2nd edition. Springer-Verlag, New York. Cantoni, E., Flemming, M. & Ronchetti, E. (2005). Variable selection for marginal longitudinal generalized linear models. Biometrics., 61, Kullback, S. & Leibler, R. (1951). On information and sufficiency. Ann. Math. Statist., 22, Liang, K-Y. & Zeger, S. L. (1986). Longitudinal data analysis using generalized linear models. Biometrika., 73, McCullagh, P. & Nelder, J. A. (1989). Generalized linear models, 2nd edition. Chapman and Hall, London. Nelder, J. A. & Wedderburn, W. M. (1972). Generalized linear models. J. R. Statist. Soc. A., 135, Pan, W. (2001a). Akaike s information criterion in generalized estimating equations. Biometrics., 57, Pan, W. (2001b). Model selection in estimating equations. Biometrics., 57, Pan, W. (2002). Goodness-of-fit tests for GEE with correlated binary data. Scand. J. Statist., 29,

20 R Development Core Team (2012). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. URL Shen, C-W. & Chen, Y-H. (2012). Model selection for Generalized Estimating equations accommodating dropout missingness. Biometrics, 68, Thall, P. F. & Vail, S. C. (1990). Some covariance models for longitudinal count data with overdispersion. Biometrics, 46, Vens, M. & Ziegler, A. (2012). Generalized estimating equations and regression diagnostic for longitudinal controlled trials: A case study. Comput. Statist. Data Anal., 56, Wang, J. (1999). Artificial likelihoods for general nonlinear regressions (in Japanese). Proc. Inst. Statist. Math, 47, Wedderburn, W. M. (1974). Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method. Biometrika., 61, Xie, M. & Yang, Y. (2003). Asymptotics for generalized estimating equations with large cluster sizes. Ann. Statist., 31,

Model Selection for Semiparametric Bayesian Models with Application to Overdispersion

Model Selection for Semiparametric Bayesian Models with Application to Overdispersion Proceedings 59th ISI World Statistics Congress, 25-30 August 2013, Hong Kong (Session CPS020) p.3863 Model Selection for Semiparametric Bayesian Models with Application to Overdispersion Jinfang Wang and

More information

Bias-corrected AIC for selecting variables in Poisson regression models

Bias-corrected AIC for selecting variables in Poisson regression models Bias-corrected AIC for selecting variables in Poisson regression models Ken-ichi Kamo (a), Hirokazu Yanagihara (b) and Kenichi Satoh (c) (a) Corresponding author: Department of Liberal Arts and Sciences,

More information

MODEL SELECTION BASED ON QUASI-LIKELIHOOD WITH APPLICATION TO OVERDISPERSED DATA

MODEL SELECTION BASED ON QUASI-LIKELIHOOD WITH APPLICATION TO OVERDISPERSED DATA J. Jpn. Soc. Comp. Statist., 26(2013), 53 69 DOI:10.5183/jjscs.1212002 204 MODEL SELECTION BASED ON QUASI-LIKELIHOOD WITH APPLICATION TO OVERDISPERSED DATA Yiping Tang ABSTRACT Overdispersion is a common

More information

1. Introduction Over the last three decades a number of model selection criteria have been proposed, including AIC (Akaike, 1973), AICC (Hurvich & Tsa

1. Introduction Over the last three decades a number of model selection criteria have been proposed, including AIC (Akaike, 1973), AICC (Hurvich & Tsa On the Use of Marginal Likelihood in Model Selection Peide Shi Department of Probability and Statistics Peking University, Beijing 100871 P. R. China Chih-Ling Tsai Graduate School of Management University

More information

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses Outline Marginal model Examples of marginal model GEE1 Augmented GEE GEE1.5 GEE2 Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association

More information

Bias Correction of Cross-Validation Criterion Based on Kullback-Leibler Information under a General Condition

Bias Correction of Cross-Validation Criterion Based on Kullback-Leibler Information under a General Condition Bias Correction of Cross-Validation Criterion Based on Kullback-Leibler Information under a General Condition Hirokazu Yanagihara 1, Tetsuji Tonda 2 and Chieko Matsumoto 3 1 Department of Social Systems

More information

Stat 579: Generalized Linear Models and Extensions

Stat 579: Generalized Linear Models and Extensions Stat 579: Generalized Linear Models and Extensions Linear Mixed Models for Longitudinal Data Yan Lu April, 2018, week 15 1 / 38 Data structure t1 t2 tn i 1st subject y 11 y 12 y 1n1 Experimental 2nd subject

More information

Generalized Quasi-likelihood versus Hierarchical Likelihood Inferences in Generalized Linear Mixed Models for Count Data

Generalized Quasi-likelihood versus Hierarchical Likelihood Inferences in Generalized Linear Mixed Models for Count Data Sankhyā : The Indian Journal of Statistics 2009, Volume 71-B, Part 1, pp. 55-78 c 2009, Indian Statistical Institute Generalized Quasi-likelihood versus Hierarchical Likelihood Inferences in Generalized

More information

Selection Criteria Based on Monte Carlo Simulation and Cross Validation in Mixed Models

Selection Criteria Based on Monte Carlo Simulation and Cross Validation in Mixed Models Selection Criteria Based on Monte Carlo Simulation and Cross Validation in Mixed Models Junfeng Shang Bowling Green State University, USA Abstract In the mixed modeling framework, Monte Carlo simulation

More information

The equivalence of the Maximum Likelihood and a modified Least Squares for a case of Generalized Linear Model

The equivalence of the Maximum Likelihood and a modified Least Squares for a case of Generalized Linear Model Applied and Computational Mathematics 2014; 3(5): 268-272 Published online November 10, 2014 (http://www.sciencepublishinggroup.com/j/acm) doi: 10.11648/j.acm.20140305.22 ISSN: 2328-5605 (Print); ISSN:

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

Modfiied Conditional AIC in Linear Mixed Models

Modfiied Conditional AIC in Linear Mixed Models CIRJE-F-895 Modfiied Conditional AIC in Linear Mixed Models Yuki Kawakubo Graduate School of Economics, University of Tokyo Tatsuya Kubokawa University of Tokyo July 2013 CIRJE Discussion Papers can be

More information

LOGISTIC REGRESSION Joseph M. Hilbe

LOGISTIC REGRESSION Joseph M. Hilbe LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of

More information

Generalized Estimating Equations (gee) for glm type data

Generalized Estimating Equations (gee) for glm type data Generalized Estimating Equations (gee) for glm type data Søren Højsgaard mailto:sorenh@agrsci.dk Biometry Research Unit Danish Institute of Agricultural Sciences January 23, 2006 Printed: January 23, 2006

More information

Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated A Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

More information

Model and Working Correlation Structure Selection in GEE Analyses of Longitudinal Data

Model and Working Correlation Structure Selection in GEE Analyses of Longitudinal Data The 3rd Australian and New Zealand Stata Users Group Meeting, Sydney, 5 November 2009 1 Model and Working Correlation Structure Selection in GEE Analyses of Longitudinal Data Dr Jisheng Cui Public Health

More information

GENERAL FORMULA OF BIAS-CORRECTED AIC IN GENERALIZED LINEAR MODELS

GENERAL FORMULA OF BIAS-CORRECTED AIC IN GENERALIZED LINEAR MODELS GENERAL FORMULA OF BIAS-CORRECTED AIC IN GENERALIZED LINEAR MODELS Shinpei Imori, Hirokazu Yanagihara, Hirofumi Wakaki Department of Mathematics, Graduate School of Science, Hiroshima University ABSTRACT

More information

Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University

Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University A SURVEY OF VARIANCE COMPONENTS ESTIMATION FROM BINARY DATA by Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University BU-1211-M May 1993 ABSTRACT The basic problem of variance components

More information

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA JAPANESE BEETLE DATA 6 MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA Gauge Plots TuscaroraLisa Central Madsen Fairways, 996 January 9, 7 Grubs Adult Activity Grub Counts 6 8 Organic Matter

More information

Bootstrap prediction and Bayesian prediction under misspecified models

Bootstrap prediction and Bayesian prediction under misspecified models Bernoulli 11(4), 2005, 747 758 Bootstrap prediction and Bayesian prediction under misspecified models TADAYOSHI FUSHIKI Institute of Statistical Mathematics, 4-6-7 Minami-Azabu, Minato-ku, Tokyo 106-8569,

More information

Generalized Quasi-likelihood (GQL) Inference* by Brajendra C. Sutradhar Memorial University address:

Generalized Quasi-likelihood (GQL) Inference* by Brajendra C. Sutradhar Memorial University  address: Generalized Quasi-likelihood (GQL) Inference* by Brajendra C. Sutradhar Memorial University Email address: bsutradh@mun.ca QL Estimation for Independent Data. For i = 1,...,K, let Y i denote the response

More information

arxiv: v1 [stat.ap] 17 Jun 2013

arxiv: v1 [stat.ap] 17 Jun 2013 Modeling Dependencies in Claims Reserving with GEE Šárka Hudecová a, Michal Pešta a, arxiv:1306.3768v1 [stat.ap] 17 Jun 2013 a Charles University in Prague, Faculty of Mathematics and Physics, Department

More information

Longitudinal analysis of ordinal data

Longitudinal analysis of ordinal data Longitudinal analysis of ordinal data A report on the external research project with ULg Anne-Françoise Donneau, Murielle Mauer June 30 th 2009 Generalized Estimating Equations (Liang and Zeger, 1986)

More information

Pricing and Risk Analysis of a Long-Term Care Insurance Contract in a non-markov Multi-State Model

Pricing and Risk Analysis of a Long-Term Care Insurance Contract in a non-markov Multi-State Model Pricing and Risk Analysis of a Long-Term Care Insurance Contract in a non-markov Multi-State Model Quentin Guibert Univ Lyon, Université Claude Bernard Lyon 1, ISFA, Laboratoire SAF EA2429, F-69366, Lyon,

More information

Bias Study of the Naive Estimator in a Longitudinal Binary Mixed-effects Model with Measurement Error and Misclassification in Covariates

Bias Study of the Naive Estimator in a Longitudinal Binary Mixed-effects Model with Measurement Error and Misclassification in Covariates Bias Study of the Naive Estimator in a Longitudinal Binary Mixed-effects Model with Measurement Error and Misclassification in Covariates by c Ernest Dankwa A thesis submitted to the School of Graduate

More information

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Department of Mathematics Carl von Ossietzky University Oldenburg Sonja Greven Department of

More information

A Study on the Bias-Correction Effect of the AIC for Selecting Variables in Normal Multivariate Linear Regression Models under Model Misspecification

A Study on the Bias-Correction Effect of the AIC for Selecting Variables in Normal Multivariate Linear Regression Models under Model Misspecification TR-No. 13-08, Hiroshima Statistical Research Group, 1 23 A Study on the Bias-Correction Effect of the AIC for Selecting Variables in Normal Multivariate Linear Regression Models under Model Misspecification

More information

Repeated ordinal measurements: a generalised estimating equation approach

Repeated ordinal measurements: a generalised estimating equation approach Repeated ordinal measurements: a generalised estimating equation approach David Clayton MRC Biostatistics Unit 5, Shaftesbury Road Cambridge CB2 2BW April 7, 1992 Abstract Cumulative logit and related

More information

Consistency of Test-based Criterion for Selection of Variables in High-dimensional Two Group-Discriminant Analysis

Consistency of Test-based Criterion for Selection of Variables in High-dimensional Two Group-Discriminant Analysis Consistency of Test-based Criterion for Selection of Variables in High-dimensional Two Group-Discriminant Analysis Yasunori Fujikoshi and Tetsuro Sakurai Department of Mathematics, Graduate School of Science,

More information

Outline of GLMs. Definitions

Outline of GLMs. Definitions Outline of GLMs Definitions This is a short outline of GLM details, adapted from the book Nonparametric Regression and Generalized Linear Models, by Green and Silverman. The responses Y i have density

More information

The perils of quasi-likelihood information criteria

The perils of quasi-likelihood information criteria The perils of quasi-likelihood information criteria Yishu Wang a, Orla Murphy b, Maxime Turgeon a,c,zhuoyuwang a, Sahir R. Bhatnagar a,c, Juliana Schulz b andericae.m.moodie a, Received 15 September 2015;

More information

Variable Selection in Multivariate Linear Regression Models with Fewer Observations than the Dimension

Variable Selection in Multivariate Linear Regression Models with Fewer Observations than the Dimension Variable Selection in Multivariate Linear Regression Models with Fewer Observations than the Dimension (Last Modified: November 3 2008) Mariko Yamamura 1, Hirokazu Yanagihara 2 and Muni S. Srivastava 3

More information

Assessing GEE Models with Longitudinal Ordinal Data by Global Odds Ratio

Assessing GEE Models with Longitudinal Ordinal Data by Global Odds Ratio Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS074) p.5763 Assessing GEE Models wh Longudinal Ordinal Data by Global Odds Ratio LIN, KUO-CHIN Graduate Instute of

More information

On Fitting Generalized Linear Mixed Effects Models for Longitudinal Binary Data Using Different Correlation

On Fitting Generalized Linear Mixed Effects Models for Longitudinal Binary Data Using Different Correlation On Fitting Generalized Linear Mixed Effects Models for Longitudinal Binary Data Using Different Correlation Structures Authors: M. Salomé Cabral CEAUL and Departamento de Estatística e Investigação Operacional,

More information

Order Selection for Vector Autoregressive Models

Order Selection for Vector Autoregressive Models IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 2, FEBRUARY 2003 427 Order Selection for Vector Autoregressive Models Stijn de Waele and Piet M. T. Broersen Abstract Order-selection criteria for vector

More information

Longitudinal data analysis using generalized linear models

Longitudinal data analysis using generalized linear models Biomttrika (1986). 73. 1. pp. 13-22 13 I'rinlfH in flreal Britain Longitudinal data analysis using generalized linear models BY KUNG-YEE LIANG AND SCOTT L. ZEGER Department of Biostatistics, Johns Hopkins

More information

Geographically Weighted Regression as a Statistical Model

Geographically Weighted Regression as a Statistical Model Geographically Weighted Regression as a Statistical Model Chris Brunsdon Stewart Fotheringham Martin Charlton October 6, 2000 Spatial Analysis Research Group Department of Geography University of Newcastle-upon-Tyne

More information

CHOOSING AMONG GENERALIZED LINEAR MODELS APPLIED TO MEDICAL DATA

CHOOSING AMONG GENERALIZED LINEAR MODELS APPLIED TO MEDICAL DATA STATISTICS IN MEDICINE, VOL. 17, 59 68 (1998) CHOOSING AMONG GENERALIZED LINEAR MODELS APPLIED TO MEDICAL DATA J. K. LINDSEY AND B. JONES* Department of Medical Statistics, School of Computing Sciences,

More information

A weighted simulation-based estimator for incomplete longitudinal data models

A weighted simulation-based estimator for incomplete longitudinal data models To appear in Statistics and Probability Letters, 113 (2016), 16-22. doi 10.1016/j.spl.2016.02.004 A weighted simulation-based estimator for incomplete longitudinal data models Daniel H. Li 1 and Liqun

More information

Statistics 203: Introduction to Regression and Analysis of Variance Course review

Statistics 203: Introduction to Regression and Analysis of Variance Course review Statistics 203: Introduction to Regression and Analysis of Variance Course review Jonathan Taylor - p. 1/?? Today Review / overview of what we learned. - p. 2/?? General themes in regression models Specifying

More information

An Unbiased C p Criterion for Multivariate Ridge Regression

An Unbiased C p Criterion for Multivariate Ridge Regression An Unbiased C p Criterion for Multivariate Ridge Regression (Last Modified: March 7, 2008) Hirokazu Yanagihara 1 and Kenichi Satoh 2 1 Department of Mathematics, Graduate School of Science, Hiroshima University

More information

DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS

DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS Ivy Liu and Dong Q. Wang School of Mathematics, Statistics and Computer Science Victoria University of Wellington New Zealand Corresponding

More information

High-dimensional asymptotic expansions for the distributions of canonical correlations

High-dimensional asymptotic expansions for the distributions of canonical correlations Journal of Multivariate Analysis 100 2009) 231 242 Contents lists available at ScienceDirect Journal of Multivariate Analysis journal homepage: www.elsevier.com/locate/jmva High-dimensional asymptotic

More information

Robust covariance estimator for small-sample adjustment in the generalized estimating equations: A simulation study

Robust covariance estimator for small-sample adjustment in the generalized estimating equations: A simulation study Science Journal of Applied Mathematics and Statistics 2014; 2(1): 20-25 Published online February 20, 2014 (http://www.sciencepublishinggroup.com/j/sjams) doi: 10.11648/j.sjams.20140201.13 Robust covariance

More information

PQL Estimation Biases in Generalized Linear Mixed Models

PQL Estimation Biases in Generalized Linear Mixed Models PQL Estimation Biases in Generalized Linear Mixed Models Woncheol Jang Johan Lim March 18, 2006 Abstract The penalized quasi-likelihood (PQL) approach is the most common estimation procedure for the generalized

More information

STAT 526 Advanced Statistical Methodology

STAT 526 Advanced Statistical Methodology STAT 526 Advanced Statistical Methodology Fall 2017 Lecture Note 10 Analyzing Clustered/Repeated Categorical Data 0-0 Outline Clustered/Repeated Categorical Data Generalized Linear Mixed Models Generalized

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

Trends in Human Development Index of European Union

Trends in Human Development Index of European Union Trends in Human Development Index of European Union Department of Statistics, Hacettepe University, Beytepe, Ankara, Turkey spxl@hacettepe.edu.tr, deryacal@hacettepe.edu.tr Abstract: The Human Development

More information

Conditional Inference Functions for Mixed-Effects Models with Unspecified Random-Effects Distribution

Conditional Inference Functions for Mixed-Effects Models with Unspecified Random-Effects Distribution Conditional Inference Functions for Mixed-Effects Models with Unspecified Random-Effects Distribution Peng WANG, Guei-feng TSAI and Annie QU 1 Abstract In longitudinal studies, mixed-effects models are

More information

The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.1/27

The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification p.1/27 The In-and-Out-of-Sample (IOS) Likelihood Ratio Test for Model Misspecification Brett Presnell Dennis Boos Department of Statistics University of Florida and Department of Statistics North Carolina State

More information

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Jeremy S. Conner and Dale E. Seborg Department of Chemical Engineering University of California, Santa Barbara, CA

More information

Generalized Estimating Equations

Generalized Estimating Equations Outline Review of Generalized Linear Models (GLM) Generalized Linear Model Exponential Family Components of GLM MLE for GLM, Iterative Weighted Least Squares Measuring Goodness of Fit - Deviance and Pearson

More information

,..., θ(2),..., θ(n)

,..., θ(2),..., θ(n) Likelihoods for Multivariate Binary Data Log-Linear Model We have 2 n 1 distinct probabilities, but we wish to consider formulations that allow more parsimonious descriptions as a function of covariates.

More information

Sample Size and Power Considerations for Longitudinal Studies

Sample Size and Power Considerations for Longitudinal Studies Sample Size and Power Considerations for Longitudinal Studies Outline Quantities required to determine the sample size in longitudinal studies Review of type I error, type II error, and power For continuous

More information

Comparison of Estimators in GLM with Binary Data

Comparison of Estimators in GLM with Binary Data Journal of Modern Applied Statistical Methods Volume 13 Issue 2 Article 10 11-2014 Comparison of Estimators in GLM with Binary Data D. M. Sakate Shivaji University, Kolhapur, India, dms.stats@gmail.com

More information

An Akaike Criterion based on Kullback Symmetric Divergence in the Presence of Incomplete-Data

An Akaike Criterion based on Kullback Symmetric Divergence in the Presence of Incomplete-Data An Akaike Criterion based on Kullback Symmetric Divergence Bezza Hafidi a and Abdallah Mkhadri a a University Cadi-Ayyad, Faculty of sciences Semlalia, Department of Mathematics, PB.2390 Marrakech, Moroco

More information

Consistency of test based method for selection of variables in high dimensional two group discriminant analysis

Consistency of test based method for selection of variables in high dimensional two group discriminant analysis https://doi.org/10.1007/s42081-019-00032-4 ORIGINAL PAPER Consistency of test based method for selection of variables in high dimensional two group discriminant analysis Yasunori Fujikoshi 1 Tetsuro Sakurai

More information

KULLBACK-LEIBLER INFORMATION THEORY A BASIS FOR MODEL SELECTION AND INFERENCE

KULLBACK-LEIBLER INFORMATION THEORY A BASIS FOR MODEL SELECTION AND INFERENCE KULLBACK-LEIBLER INFORMATION THEORY A BASIS FOR MODEL SELECTION AND INFERENCE Kullback-Leibler Information or Distance f( x) gx ( ±) ) I( f, g) = ' f( x)log dx, If, ( g) is the "information" lost when

More information

A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL

A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL Discussiones Mathematicae Probability and Statistics 36 206 43 5 doi:0.75/dmps.80 A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL Tadeusz Bednarski Wroclaw University e-mail: t.bednarski@prawo.uni.wroc.pl

More information

Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models

Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models Optimum Design for Mixed Effects Non-Linear and generalized Linear Models Cambridge, August 9-12, 2011 Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models

More information

Generalized Linear Models I

Generalized Linear Models I Statistics 203: Introduction to Regression and Analysis of Variance Generalized Linear Models I Jonathan Taylor - p. 1/16 Today s class Poisson regression. Residuals for diagnostics. Exponential families.

More information

Model comparison and selection

Model comparison and selection BS2 Statistical Inference, Lectures 9 and 10, Hilary Term 2008 March 2, 2008 Hypothesis testing Consider two alternative models M 1 = {f (x; θ), θ Θ 1 } and M 2 = {f (x; θ), θ Θ 2 } for a sample (X = x)

More information

Estimating prediction error in mixed models

Estimating prediction error in mixed models Estimating prediction error in mixed models benjamin saefken, thomas kneib georg-august university goettingen sonja greven ludwig-maximilians-university munich 1 / 12 GLMM - Generalized linear mixed models

More information

Generalized Linear Models. Kurt Hornik

Generalized Linear Models. Kurt Hornik Generalized Linear Models Kurt Hornik Motivation Assuming normality, the linear model y = Xβ + e has y = β + ε, ε N(0, σ 2 ) such that y N(μ, σ 2 ), E(y ) = μ = β. Various generalizations, including general

More information

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Michael J. Daniels and Chenguang Wang Jan. 18, 2009 First, we would like to thank Joe and Geert for a carefully

More information

H-LIKELIHOOD ESTIMATION METHOOD FOR VARYING CLUSTERED BINARY MIXED EFFECTS MODEL

H-LIKELIHOOD ESTIMATION METHOOD FOR VARYING CLUSTERED BINARY MIXED EFFECTS MODEL H-LIKELIHOOD ESTIMATION METHOOD FOR VARYING CLUSTERED BINARY MIXED EFFECTS MODEL Intesar N. El-Saeiti Department of Statistics, Faculty of Science, University of Bengahzi-Libya. entesar.el-saeiti@uob.edu.ly

More information

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION

More information

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science. Texts in Statistical Science Generalized Linear Mixed Models Modern Concepts, Methods and Applications Walter W. Stroup CRC Press Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint

More information

MODEL SELECTION FOR CORRELATED DATA WITH DIVERGING NUMBER OF PARAMETERS

MODEL SELECTION FOR CORRELATED DATA WITH DIVERGING NUMBER OF PARAMETERS Statistica Sinica 23 (2013), 901-927 doi:http://dx.doi.org/10.5705/ss.2011.058 MODEL SELECTION FOR CORRELATED DATA WITH DIVERGING NUMBER OF PARAMETERS Hyunkeun Cho and Annie Qu University of Illinois at

More information

Akaike Information Criterion

Akaike Information Criterion Akaike Information Criterion Shuhua Hu Center for Research in Scientific Computation North Carolina State University Raleigh, NC February 7, 2012-1- background Background Model statistical model: Y j =

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2009 Paper 251 Nonparametric population average models: deriving the form of approximate population

More information

Generalized linear models

Generalized linear models Generalized linear models Søren Højsgaard Department of Mathematical Sciences Aalborg University, Denmark October 29, 202 Contents Densities for generalized linear models. Mean and variance...............................

More information

Properties and approximations of some matrix variate probability density functions

Properties and approximations of some matrix variate probability density functions Technical report from Automatic Control at Linköpings universitet Properties and approximations of some matrix variate probability density functions Karl Granström, Umut Orguner Division of Automatic Control

More information

Asymptotic inference for a nonstationary double ar(1) model

Asymptotic inference for a nonstationary double ar(1) model Asymptotic inference for a nonstationary double ar() model By SHIQING LING and DONG LI Department of Mathematics, Hong Kong University of Science and Technology, Hong Kong maling@ust.hk malidong@ust.hk

More information

ATINER's Conference Paper Series STA

ATINER's Conference Paper Series STA ATINER CONFERENCE PAPER SERIES No: LNG2014-1176 Athens Institute for Education and Research ATINER ATINER's Conference Paper Series STA2014-1255 Parametric versus Semi-parametric Mixed Models for Panel

More information

GEE for Longitudinal Data - Chapter 8

GEE for Longitudinal Data - Chapter 8 GEE for Longitudinal Data - Chapter 8 GEE: generalized estimating equations (Liang & Zeger, 1986; Zeger & Liang, 1986) extension of GLM to longitudinal data analysis using quasi-likelihood estimation method

More information

Ken-ichi Kamo Department of Liberal Arts and Sciences, Sapporo Medical University, Hokkaido, Japan

Ken-ichi Kamo Department of Liberal Arts and Sciences, Sapporo Medical University, Hokkaido, Japan A STUDY ON THE BIAS-CORRECTION EFFECT OF THE AIC FOR SELECTING VARIABLES IN NOR- MAL MULTIVARIATE LINEAR REGRESSION MOD- ELS UNDER MODEL MISSPECIFICATION Authors: Hirokazu Yanagihara Department of Mathematics,

More information

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,

More information

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Department of Mathematics Carl von Ossietzky University Oldenburg Sonja Greven Department of

More information

Approximate Likelihoods

Approximate Likelihoods Approximate Likelihoods Nancy Reid July 28, 2015 Why likelihood? makes probability modelling central l(θ; y) = log f (y; θ) emphasizes the inverse problem of reasoning y θ converts a prior probability

More information

Simulating Longer Vectors of Correlated Binary Random Variables via Multinomial Sampling

Simulating Longer Vectors of Correlated Binary Random Variables via Multinomial Sampling Simulating Longer Vectors of Correlated Binary Random Variables via Multinomial Sampling J. Shults a a Department of Biostatistics, University of Pennsylvania, PA 19104, USA (v4.0 released January 2015)

More information

A New Iterative Procedure for Estimation of RCA Parameters Based on Estimating Functions

A New Iterative Procedure for Estimation of RCA Parameters Based on Estimating Functions Applied Mathematical Sciences, Vol. 5, 011, no. 4, 193-0 A New Iterative Procedure for Estimation of RCA Parameters Based on Estimating Functions Norli Anida Abdullah, Ibrahim Mohamed 1 Shelton Peiris

More information

Justine Shults 1,,, Wenguang Sun 1,XinTu 2, Hanjoo Kim 1, Jay Amsterdam 3, Joseph M. Hilbe 4,5 and Thomas Ten-Have 1

Justine Shults 1,,, Wenguang Sun 1,XinTu 2, Hanjoo Kim 1, Jay Amsterdam 3, Joseph M. Hilbe 4,5 and Thomas Ten-Have 1 STATISTICS IN MEDICINE Statist. Med. 2009; 28:2338 2355 Published online 26 May 2009 in Wiley InterScience (www.interscience.wiley.com).3622 A comparison of several approaches for choosing between working

More information

QUANTIFYING PQL BIAS IN ESTIMATING CLUSTER-LEVEL COVARIATE EFFECTS IN GENERALIZED LINEAR MIXED MODELS FOR GROUP-RANDOMIZED TRIALS

QUANTIFYING PQL BIAS IN ESTIMATING CLUSTER-LEVEL COVARIATE EFFECTS IN GENERALIZED LINEAR MIXED MODELS FOR GROUP-RANDOMIZED TRIALS Statistica Sinica 15(05), 1015-1032 QUANTIFYING PQL BIAS IN ESTIMATING CLUSTER-LEVEL COVARIATE EFFECTS IN GENERALIZED LINEAR MIXED MODELS FOR GROUP-RANDOMIZED TRIALS Scarlett L. Bellamy 1, Yi Li 2, Xihong

More information

Covariance function estimation in Gaussian process regression

Covariance function estimation in Gaussian process regression Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian

More information

On the conservative multivariate multiple comparison procedure of correlated mean vectors with a control

On the conservative multivariate multiple comparison procedure of correlated mean vectors with a control On the conservative multivariate multiple comparison procedure of correlated mean vectors with a control Takahiro Nishiyama a a Department of Mathematical Information Science, Tokyo University of Science,

More information

AR-order estimation by testing sets using the Modified Information Criterion

AR-order estimation by testing sets using the Modified Information Criterion AR-order estimation by testing sets using the Modified Information Criterion Rudy Moddemeijer 14th March 2006 Abstract The Modified Information Criterion (MIC) is an Akaike-like criterion which allows

More information

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Institute of Statistics and Econometrics Georg-August-University Göttingen Department of Statistics

More information

Discrete Response Multilevel Models for Repeated Measures: An Application to Voting Intentions Data

Discrete Response Multilevel Models for Repeated Measures: An Application to Voting Intentions Data Quality & Quantity 34: 323 330, 2000. 2000 Kluwer Academic Publishers. Printed in the Netherlands. 323 Note Discrete Response Multilevel Models for Repeated Measures: An Application to Voting Intentions

More information

QIC program and model selection in GEE analyses

QIC program and model selection in GEE analyses The Stata Journal (2007) 7, Number 2, pp. 209 220 QIC program and model selection in GEE analyses James Cui Department of Epidemiology and Preventive Medicine Monash University Melbourne, Australia james.cui@med.monash.edu.au

More information

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate

More information

Covariance modelling for longitudinal randomised controlled trials

Covariance modelling for longitudinal randomised controlled trials Covariance modelling for longitudinal randomised controlled trials G. MacKenzie 1,2 1 Centre of Biostatistics, University of Limerick, Ireland. www.staff.ul.ie/mackenzieg 2 CREST, ENSAI, Rennes, France.

More information

Pearson-type goodness-of-fit test with bootstrap maximum likelihood estimation. Citation Electronic Journal of Statistics, 2013, v. 7, p.

Pearson-type goodness-of-fit test with bootstrap maximum likelihood estimation. Citation Electronic Journal of Statistics, 2013, v. 7, p. Title Pearson-type goodness-of-fit test with bootstrap maximum likelihood estimation Author(s) Yin, G; Ma, Y Citation Electronic Journal of Statistics, 2013, v. 7, p. 412-427 Issued Date 2013 URL http://hdl.handle.net/10722/189457

More information

if n is large, Z i are weakly dependent 0-1-variables, p i = P(Z i = 1) small, and Then n approx i=1 i=1 n i=1

if n is large, Z i are weakly dependent 0-1-variables, p i = P(Z i = 1) small, and Then n approx i=1 i=1 n i=1 Count models A classical, theoretical argument for the Poisson distribution is the approximation Binom(n, p) Pois(λ) for large n and small p and λ = np. This can be extended considerably to n approx Z

More information

Journal of Statistical Software

Journal of Statistical Software JSS Journal of Statistical Software January 2006, Volume 15, Issue 2. http://www.jstatsoft.org/ The R Package geepack for Generalized Estimating Equations Ulrich Halekoh Danish Institute of Agricultural

More information

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Multilevel Statistical Models: 3 rd edition, 2003 Contents Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction

More information

Modeling Longitudinal Count Data with Excess Zeros and Time-Dependent Covariates: Application to Drug Use

Modeling Longitudinal Count Data with Excess Zeros and Time-Dependent Covariates: Application to Drug Use Modeling Longitudinal Count Data with Excess Zeros and : Application to Drug Use University of Northern Colorado November 17, 2014 Presentation Outline I and Data Issues II Correlated Count Regression

More information

Variable Selection in Multivariate Multiple Regression

Variable Selection in Multivariate Multiple Regression Variable Selection in Multivariate Multiple Regression by c Anita Brobbey A thesis submitted to the School of Graduate Studies in partial fulfillment of the requirement for the Degree of Master of Science

More information

GMM Logistic Regression with Time-Dependent Covariates and Feedback Processes in SAS TM

GMM Logistic Regression with Time-Dependent Covariates and Feedback Processes in SAS TM Paper 1025-2017 GMM Logistic Regression with Time-Dependent Covariates and Feedback Processes in SAS TM Kyle M. Irimata, Arizona State University; Jeffrey R. Wilson, Arizona State University ABSTRACT The

More information

DYNAMIC ECONOMETRIC MODELS Vol. 9 Nicolaus Copernicus University Toruń Mariola Piłatowska Nicolaus Copernicus University in Toruń

DYNAMIC ECONOMETRIC MODELS Vol. 9 Nicolaus Copernicus University Toruń Mariola Piłatowska Nicolaus Copernicus University in Toruń DYNAMIC ECONOMETRIC MODELS Vol. 9 Nicolaus Copernicus University Toruń 2009 Mariola Piłatowska Nicolaus Copernicus University in Toruń Combined Forecasts Using the Akaike Weights A b s t r a c t. The focus

More information