Repeated ordinal measurements: a generalised estimating equation approach

David Clayton
MRC Biostatistics Unit, 5 Shaftesbury Road, Cambridge CB2 2BW
April 7, 1992

Abstract

Cumulative logit and related regression models for ordered categorical data may be expressed as generalised linear models for correlated binary responses. These may be fitted using the generalised estimating equation approach of Liang and Zeger (1986), which yields results nearly identical to maximum likelihood while offering further flexibility. The approach also generalises to deal with repeated ordinal measurements on the same subject, such as those commonly observed in medical cross-over experiments.

Keywords: Generalised estimating equations, ordinal data, repeated measures, cross-over trials.

1 Background: generalised estimating equations

In a generalised linear model (GLM), a response vector $y$ of length $N$ has expectation vector $\mu$ whose elements are related to those of a linear predictor $\eta$ by the link function $g(\cdot)$, so that $g(\mu_i) = \eta_i$. The linear predictor is given by the linear model $\eta = X\beta$, where $X$ is a matrix whose rows, $x_i$, are vectors of covariates for each observational unit. In the original formulation of GLMs, the responses are assumed to be independent with variances $\phi V(\mu_i)$, where $V(\cdot)$ is the variance function and $\phi$ the scale factor.

Estimation of the regression coefficients, $\beta$, is by solution of the estimating equations

$$X^{T} W e = 0 \qquad (1)$$

where $e$ is the vector of scaled residuals, $e_i = g'(\mu_i)(y_i - \mu_i)$, and $W$ is a diagonal matrix of weights such that $[W_{ii}]^{-1} = \phi V(\mu_i)\,[g'(\mu_i)]^{2}$. It is well known that these estimating equations lead to maximum likelihood estimates of $\beta$ when the distribution of responses is drawn from the exponential family. In other cases, the solutions are referred to as maximum quasi-likelihood estimates. In either case, under the assumptions set out above, the asymptotic variance of the estimates is estimated by $(X^{T} W X)^{-1}$ (evaluated with $\beta$ at its estimated value, $\hat\beta$).

More recently, it has been recognised that these estimating equations lead to consistent (if not fully efficient) estimates of $\beta$ even when the variance function $V(\cdot)$ is mis-specified. However, in these circumstances the above variance estimate is incorrect and it is necessary to use an alternative robust estimate of the form $(X^{T} W X)^{-1} S (X^{T} W X)^{-1}$. Here $S$ is the sum of squares and products (SSP) matrix for the individual contributions to the estimating equation. More precisely, if $x_i$ is the $i$th row of $X$, the contribution of subject $i$ to the estimating equations is $u_i = W_{ii} e_i x_i$, so that equation (1) may be rewritten as $\sum_i u_i = 0$, and then $S = \sum_i u_i u_i^{T}$.

In the papers by Liang, Zeger and collaborators (for example, Liang and Zeger, 1986) this idea has been further developed. If the vector $y$, of length $NM$, represents $M$ repeated measurements on $N$ different individuals, then the model may be extended to allow for correlation between repeated measurements of the same individual with, say, $\mathrm{Corr}(y_{ij}, y_{ik}) = (\rho_{jk})_i$.
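The naive and robust variance estimates described above can be illustrated with a minimal NumPy sketch. It covers only the simplest special case, a logit link with independent binary responses and scale factor 1, and the function names `fit_logistic` and `naive_and_robust_cov` are illustrative choices rather than anything taken from the paper or its software.

```python
import numpy as np


def fit_logistic(X, y, n_iter=25):
    """Solve X^T W e = 0 for a logit link by Fisher scoring (IRLS)."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        mu = 1.0 / (1.0 + np.exp(-(X @ beta)))
        v = mu * (1.0 - mu)      # binomial variance function V(mu), with phi = 1
        w = v                    # logit link: g'(mu) = 1/V(mu), hence W_ii = V(mu_i)
        e = (y - mu) / v         # scaled residual e_i = g'(mu_i) (y_i - mu_i)
        step = np.linalg.solve(X.T @ (w[:, None] * X), X.T @ (w * e))
        beta = beta + step
    return beta


def naive_and_robust_cov(X, y, beta):
    """Return (naive, robust) covariance estimates evaluated at beta."""
    mu = 1.0 / (1.0 + np.exp(-(X @ beta)))
    w = mu * (1.0 - mu)
    e = (y - mu) / w
    bread = np.linalg.inv(X.T @ (w[:, None] * X))   # naive: (X^T W X)^{-1}
    U = (w * e)[:, None] * X                        # row i is u_i = W_ii e_i x_i
    S = U.T @ U                                     # SSP matrix: sum_i u_i u_i^T
    return bread, bread @ S @ bread                 # robust: sandwich form
```

When the variance function is correct, as it is for independent binary data, the two estimates should be close in large samples; they diverge when responses within a subject are correlated and the contributions $u_i$ are summed within subject before forming $S$.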

(Usually the correlation structure is assumed to be constant across subjects, so that the $i$ subscript may be dropped.) Efficient estimation of $\beta$ may be achieved by solution of estimating equations of the same general form as discussed above, with the modification that the $W$ matrix should now be block-diagonal as a result of the correlation between repeated measures. Since it may be difficult in practice to specify the correct variance and correlation structures, Liang and Zeger recommend the use of a convenient working approximation to $\rho$, and of a robust estimate for the variance of $\hat\beta$. This is obtained in the same way as before, the matrix $S$ now representing the empirical SSP matrix for the $N$ contributions of individual subjects to the estimating equation, $u_i = \sum_j u_{ij}$.

2 Ordered categorical responses

Perhaps the most popular method for the analysis of ordered categorical data is that based upon the cumulative logit regression model. This was first proposed by Snell (1964) and further generalised by McCullagh (1980) to allow link functions other than the logit. McCullagh's description of the model was in terms of an underlying latent continuous response, stratified at unknown cutpoints. For a response with $C$ categories we need $C-2$ parameters to represent these cutpoints (since the boundary between the first two categories can be taken as zero without loss of generality). An alternative view of the class of models is that, over the $C-1$ different ways of collapsing the response into a binary one, the quantal regression equations are unchanged, save in their intercepts. Any of the usual binary regression links (logit, probit, complementary log-log, etc.) is available. With this view of the model, the extra $C-2$ cutpoint parameters represent differences between the intercepts of the $C-1$ binary regressions.
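The link between the latent-variable description and the "collapsed binary regressions" view can be made explicit with a short derivation. The notation below ($Y_i^{*}$, $\varepsilon_i$, $F$, $\tau_j$) and the sign convention are assumptions introduced here for illustration; they are chosen so that a positive coefficient shifts the response towards lower categories.

```latex
% Assumed latent-variable form; F is the distribution function of the error term.
\begin{align*}
Y_i^{*} &= -x_i^{T}\beta + \varepsilon_i, \qquad \varepsilon_i \sim F, \\
Y_i \le j &\iff Y_i^{*} \le \tau_j, \qquad j = 1, \ldots, C-1, \\
\Pr(Y_i \le j \mid x_i) &= \Pr(\varepsilon_i \le \tau_j + x_i^{T}\beta)
                         = F(\tau_j + x_i^{T}\beta), \\
g\{\Pr(Y_i \le j \mid x_i)\} &= \tau_j + x_i^{T}\beta, \qquad g = F^{-1}.
\end{align*}
```

Each of the $C-1$ collapses therefore follows a binary regression with the same slope vector and its own intercept $\tau_j$. Writing $\tau_j = \tau_1 + (\tau_j - \tau_1)$ gives one common intercept plus $C-2$ differences, which is the parameter count stated in the text.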
This latter view of the logit version of the model prompted Clayton (1974) to propose, for the two-sample problem, a modified version of the Mantel-Haenszel estimate of the common odds ratio in a stack of $2 \times 2$ contingency tables. The possible collapses of the ordinal response yield $C-1$ such tables, and these provide $C-1$ correlated estimates of the common odds ratio. Clayton showed that, although the optimal weights for pooling these estimates are rather complicated, use of weights which are optimal under the null hypothesis provides a convenient practical method. This method was based on two main ideas:

1. the treatment of the ordinal response as $C-1$ correlated binary responses, and
2. the use of weights which are locally optimal around the null.

The first (and, to a lesser extent, the second) of these ideas is carried through in the present proposal. Thus, if the ordered categorical response of the $i$th subject is coded $1, \ldots, C$, then we may create an expanded vector of binary responses, $y$, of length $N(C-1)$ and indexed by $i$ and $j$ so that, for $j = 1, \ldots, C-1$,

$$y_{ij} = I(y_i \le j).$$

The Snell-McCullagh model relates the expectation of this vector, $\mu$, via a link function to the linear predictor vector, $\eta$, with elements also indexed by $i$ and $j$:

$$\eta_{ij} = \theta_j + x_i^{T}\beta.$$

This model may be fitted using generalised estimating equations. An expanded design matrix, $X$, is created by repeating each row of the original design matrix $C-1$ times, corresponding to the $C-1$ possible collapses, and appending $C-2$ columns of dummy variables to allow for differences in intercepts of the $C-1$ possible binary regressions. The binomial variance function correctly specifies the variances of the elements of $y$. The correlations of responses are simple functions of their expectations, $\mu$:

$$\mathrm{Corr}(y_{ij}, y_{ik}) = \frac{\mu_{i,\min(j,k)} - \mu_{ij}\mu_{ik}}{\sqrt{\mu_{ij}(1-\mu_{ij})\,\mu_{ik}(1-\mu_{ik})}}.$$

A working correlation matrix, constant for all $i$, is provided by the estimate under the null hypothesis of homogeneity of response. This is obtained by substituting the marginal cumulative proportions for $\mu_{ij}$, $j = 1, \ldots, C-1$, in the above expression. Notice that, in contrast with the method of Clayton (1974), the weighting scheme uses estimates under the null hypothesis only for correlations between the elements of $y$; their variances are dealt with correctly. However, if software allowed, there would be no need even for this inaccuracy, which arises solely from a requirement to specify a common correlation structure across subjects.
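The data expansion and the null working correlation just described can be sketched in a few lines of NumPy. This is an illustration under stated assumptions, not the paper's code: the function names are hypothetical, the cumulative indicators are taken to be $I(y_i \le j)$, the supplied design matrix is assumed to contain its own intercept column, and the first cutpoint is used as the reference level for the dummy variables.

```python
import numpy as np


def expand_ordinal(y, X, C):
    """Expand N ordinal responses (coded 1..C) into N*(C-1) binary responses.

    Returns the binary vector, the expanded design matrix and a subject index,
    all ordered subject by subject and, within subject, by cutpoint j = 1..C-1.
    """
    y = np.asarray(y)
    X = np.asarray(X, dtype=float)
    N = len(y)
    cuts = np.arange(1, C)                                    # j = 1, ..., C-1
    y_bin = (y[:, None] <= cuts[None, :]).astype(int).ravel() # y_ij = I(y_i <= j)
    X_rep = np.repeat(X, C - 1, axis=0)                       # each row repeated C-1 times
    cut_dummies = np.tile(np.eye(C - 1)[:, 1:], (N, 1))       # C-2 dummy columns
    design = np.hstack([X_rep, cut_dummies])
    subject = np.repeat(np.arange(N), C - 1)
    return y_bin, design, subject


def null_working_correlation(p):
    """Correlation of the cumulative indicators given cumulative proportions p."""
    p = np.asarray(p, dtype=float)
    # p is increasing in j, so the proportion at index min(j, k) is min(p_j, p_k).
    cov = np.minimum.outer(p, p) - np.outer(p, p)
    sd = np.sqrt(p * (1.0 - p))
    return cov / np.outer(sd, sd)
```

For example, `null_working_correlation([0.25, 0.5, 0.75])` gives a 3 × 3 matrix with unit diagonal whose off-diagonal entries shrink as the cutpoints move further apart.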

An example

The main purpose of this paper is to exploit the natural generalisation of this approach to deal with repeated ordinal measurements within the same subject. Before proceeding to this, however, a comparison with the method of maximum likelihood in the simpler case serves to demonstrate the efficiency of the method, and some practical advantages.

Table 1 reproduces a dataset which has been analysed elsewhere in the literature (Francom, Chuang and Landis, 1989; Agresti, 1989). The data concern time to falling asleep, coded into 4 ordered categories, for $N = 239$ subjects, half receiving active treatment and half placebo. Measurements were made pre-treatment and on a follow-up occasion after treatment, but for this first analysis only the follow-up data are shown.

Table 1: Time to falling asleep for 239 subjects

                 Time (minutes)
Treatment    < 20    20-30    30-60    > 60
Active         40       49       19      11
Placebo        31       29       35      25

For the GEE analysis, each subject contributes three binary response variables coding whether time to falling asleep was at most (a) 20 minutes, (b) 30 minutes, or (c) 60 minutes. The corresponding marginal proportions are [...], [...] and [...], so that the working correlation matrix is [...]. If treatment is coded into a vector $z$, with $z_i = 0$ indicating placebo and $z_i = 1$ indicating active treatment, the Snell-McCullagh model is

$$\eta_{ij} = \mu + \theta_j + \beta z_i$$

where the cutpoint parameters, $\theta_j$, are subject to a linear constraint such as the corner constraint $\theta_1 = 0$. Alternatively, in the syntax introduced by Wilkinson and Rogers (1973) and further developed in computer programs such as GLIM, this model can be written

    1 + Cutpoint + Treatment
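As a check on the quantities used in this example, the following sketch computes the marginal cumulative proportions and the corresponding null working correlation matrix. The cell counts are an assumption in the sense that they are the ones consistent with the cutpoint-specific odds ratios quoted for this example and with $N = 239$; the function names are illustrative.

```python
import numpy as np

# Follow-up counts by category (< 20, 20-30, 30-60, > 60 minutes), as implied
# by the odds-ratio calculations quoted for this example.
active = np.array([40, 49, 19, 11])
placebo = np.array([31, 29, 35, 25])


def cumulative_proportions(counts):
    """Proportions at or below each of the C-1 cutpoints."""
    counts = np.asarray(counts, dtype=float)
    return np.cumsum(counts)[:-1] / counts.sum()


def working_correlation(p):
    """Null correlation matrix of the cumulative indicators."""
    p = np.asarray(p, dtype=float)
    cov = np.minimum.outer(p, p) - np.outer(p, p)
    sd = np.sqrt(p * (1.0 - p))
    return cov / np.outer(sd, sd)


def cutpoint_odds_ratios(group1, group0):
    """Odds ratio for 'at or below the cutpoint', group1 versus group0."""
    a = np.cumsum(group1)[:-1]          # group1 at or below cutpoint
    b = np.sum(group1) - a              # group1 above cutpoint
    c = np.cumsum(group0)[:-1]          # group0 at or below cutpoint
    d = np.sum(group0) - c              # group0 above cutpoint
    return (a * d) / (b * c)


p = cumulative_proportions(active + placebo)   # pooled over treatment groups
R = working_correlation(p)
odds_ratios = cutpoint_odds_ratios(active, placebo)
print(np.round(p, 4))
print(np.round(R, 4))
print(np.round(odds_ratios, 2))
```

Pooling the two treatment groups before computing `p` corresponds to estimating the correlations under the null hypothesis of homogeneity of response.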

Using GEE with a logistic link and binomial variance function, the treatment effect is estimated as $\hat\beta$ = [...] with asymptotic standard error (ASE) [...] (note that the positive coefficient indicates a shift to the left in the response distribution). Full maximum likelihood yielded $\hat\beta$ = [...] with an ASE of [...].

An unexpected benefit of the GEE approach is that the cutpoint parameters enter simply as terms in the linear model. The assumption of constancy of treatment effect across cutpoints may be tested by inclusion of a Cutpoint × Treatment interaction term in the model. A single degree of freedom test for trend of treatment effect across cutpoints can be carried out by fitting the model

$$\eta_{ij} = \mu + \theta_j + \beta z_i + \gamma j z_i.$$

In the present example, this yields $\hat\gamma$ = [...] with ASE [...]. There is, therefore, some suggestion of failure of the Snell-McCullagh model, with a tendency for the treatment effect to increase with shift of cutpoint to the right. This impression is supported by inspection of the odds ratios for the three cutpoints: cutting at 20 minutes gives an odds ratio of $(40 \times 89)/(31 \times 79) = 1.45$, cutting at 30 minutes gives $(89 \times 60)/(60 \times 30) = 2.97$, and cutting at 60 minutes gives $(108 \times 25)/(95 \times 11) = 2.58$.

The ability to include such interaction terms represents a genuine extension of the Snell-McCullagh approach. Although the representation of treatment effect with a single parameter requires us to assume no interaction between treatment and cutpoint, there is no such requirement for other explanatory variables of less direct interest. Thus, the proportional odds assumption may be maintained for the effect of interest (treatment), but relaxed for the effects of other powerful disturbing influences.

3 Repeated ordinal response data

The extension of the method to deal with repeated ordinal measurements in the same subject is natural.
Such repeated ordinal measurements occur frequently in cross-over trials (see, for example, Jones and Kenward, 1989), and in experiments which incorporate a pre-treatment baseline measurement. The analysis of such data by maximum likelihood is difficult. Incorporation of a random subject effect in the linear model leads to an intractable likelihood, as do other approaches to modelling the association structure.

By contrast, the GEE approach is straightforward. Each ordinal measurement contributes a block of derived binary response variables so that, if there are $R$ repeated measurements, the binary response vector is of length $NR(C-1)$. Explanatory variables may be constant within a subject, in which case each value must be repeated $R(C-1)$ times in the design matrix, or may vary from occasion to occasion, requiring each value to be repeated $C-1$ times. The model will include effects for cutpoint, occasion, other explanatory variables, and (possibly) their interactions.

Two methods have been considered for calculating a working correlation matrix:

1. to calculate working correlations between binary responses representing different cutpoints of the same measurement as in Section 2, and to ignore all others, and
2. to estimate the correlation structure as a free $R(C-1) \times R(C-1)$ matrix.

The second suggestion requires estimation of the correlation structure, and this is an active research area. In this paper the approach suggested by Liang and Zeger (1986) is used. In later work (Liang, Zeger and Qaqish, 1992) this was termed GEE1 to distinguish it from the (rather more efficient) approach of Prentice and Zhao (1991), which they termed GEE2.

An example

Table 2 shows the sleep data in more detail, including both pre-treatment and follow-up measurements. The extended analysis simultaneously models pre-treatment and follow-up responses by expanding each subject's responses into 6 binary indicators. If, as before, we index subjects by $i$ and cutpoints by $j$, and further index pre-treatment and follow-up responses by $t = 0$ and $1$ respectively, then a model for treatment effect is

$$\eta_{ijt} = \mu + \theta_j + \beta z_i + \gamma t + \delta z_i t$$

or, in the Wilkinson and Rogers syntax,

    1 + Cutpoint + Treatment + Occasion + Treatment.Occasion

The parameter of interest in this model is the interaction parameter, $\delta$.
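The expansion for repeated measurements can be sketched as follows. This is a hypothetical helper, not the paper's code: it assumes responses coded $1, \ldots, C$, cumulative indicators of the form $I(y \le j)$, a subject-level 0/1 treatment indicator, and occasion entered as the numeric index $t$ (which matches the two-occasion model above; with more than two occasions it would impose a linear trend).

```python
import numpy as np


def expand_repeated_ordinal(Y, z, C):
    """Expand an (N, R) array of ordinal responses into N*R*(C-1) binary rows.

    Rows are ordered by subject, then occasion t = 0..R-1, then cutpoint
    j = 1..C-1. Design columns: intercept, C-2 cutpoint dummies (cutpoint 1
    as reference), treatment, occasion, treatment x occasion.
    """
    Y = np.asarray(Y)
    z = np.asarray(z, dtype=float)
    N, R = Y.shape
    cuts = np.arange(1, C)
    n_rows = N * R * (C - 1)

    y_bin = (Y[:, :, None] <= cuts).astype(int).reshape(-1)
    subject = np.repeat(np.arange(N), R * (C - 1))
    occasion = np.tile(np.repeat(np.arange(R), C - 1), N).astype(float)
    cut_index = np.tile(cuts, N * R)
    treat = np.repeat(z, R * (C - 1))      # constant within subject: R(C-1) copies

    cut_dummies = (cut_index[:, None] == cuts[1:]).astype(float)
    design = np.column_stack(
        [np.ones(n_rows), cut_dummies, treat, occasion, treat * occasion]
    )
    return y_bin, design, subject
```

With $R = 2$ and $C = 4$ each subject contributes 6 rows, and the last design column carries the interaction whose coefficient is $\delta$.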

Table 2: Time to falling asleep for 239 subjects, pre-treatment and at follow-up (rows: treatment group, Active or Placebo, by category at the initial occasion; columns: category at the follow-up occasion; categories < 20, 20-30, 30-60 and > 60 minutes)

Our first working correlation structure is $\rho_1$ = [...]. The bottom right section of this matrix is the same as that used in Section 2, and the top left section is calculated in the same way from the marginal cumulative proportions for the pre-treatment measurement (0.1088, [...] and [...]). Correlations between pre-treatment and follow-up responses are ignored. In the second approach, the correlation matrix was estimated from the data, as $\rho_2$ = [...]. Note that these two matrices agree quite closely except for those elements set to zero in the former.
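The structure of the first working correlation matrix, with within-occasion blocks and zeros elsewhere, can be sketched as below. The function names are illustrative and the cumulative proportions in the usage lines are hypothetical placeholder values, not the values from the sleep data.

```python
import numpy as np


def cumulative_indicator_correlation(p):
    """Null correlation of cumulative indicators with cumulative proportions p."""
    p = np.asarray(p, dtype=float)
    cov = np.minimum.outer(p, p) - np.outer(p, p)
    sd = np.sqrt(p * (1.0 - p))
    return cov / np.outer(sd, sd)


def block_diagonal_working_correlation(p_by_occasion):
    """One block per occasion; correlations between occasions are set to zero."""
    blocks = [cumulative_indicator_correlation(p) for p in p_by_occasion]
    size = sum(b.shape[0] for b in blocks)
    rho = np.zeros((size, size))
    start = 0
    for b in blocks:
        k = b.shape[0]
        rho[start:start + k, start:start + k] = b
        start += k
    return rho


# Hypothetical cumulative proportions for two occasions (illustration only).
pre_treatment = [0.10, 0.35, 0.65]
follow_up = [0.30, 0.60, 0.85]
rho_1 = block_diagonal_working_correlation([pre_treatment, follow_up])
print(np.round(rho_1, 3))
```

The resulting matrix has the pre-treatment block in the top left and the follow-up block in the bottom right, matching the layout described in the text.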

Table 3: Estimates of the Treatment × Occasion interaction parameter (columns: Method, Estimate, naive ASE, robust ASE; rows: GEE($\rho_1$), GEE($\rho_2$), EWLS)

The estimates of the interaction parameter $\delta$ obtained from these two analyses are given in Table 3. Also shown is the estimate obtained by Agresti (1989), who fitted the same model to these data using empirically weighted least squares (EWLS). For the GEE analyses, two ASEs are given. The first ("naive") estimate is the appropriate diagonal element of $(X^{T} W X)^{-1}$ and requires that the working correlation matrix and the variance function are both correct. The second is the robust estimate, which allows for mis-specification of either or both of these. The variance function cannot fail to be correctly specified since, for any response $y$ taking values 0 or 1, $\mathrm{Var}(y) = E(y)[1 - E(y)]$. It is therefore not surprising that in the second GEE analysis, which estimates the correlation structure from the data, the naive and robust ASEs agree closely. In the first analysis, the naive ASE is incorrect owing to the mis-specification of the working correlation matrix. However, the robust ASE is very close to that obtained in the second analysis. It would seem, therefore, that the loss of efficiency due to using an incorrect working correlation structure is negligible. Agresti's estimates using EWLS were only published to two decimal places, but seem to agree quite closely with the GEE analyses.

In no analysis was the treatment effect estimated more precisely than in our earlier analysis, which discarded the pre-treatment baseline measurement. This, of course, is not surprising if we consider the analogous analysis for measurements on a continuous interval scale. In that case, we only gain from using the baseline data if the between-subject component of variance exceeds the within-subject error variance. In that case, the correlation between pre-treatment and follow-up measurements would exceed

4 Discussion

The generalised estimating equation method proposed by Liang and Zeger provides an invaluable new tool for the applied statistician. The treatment of ordinal response data described here demonstrates the flexibility of the method and its ability to provide a unified approach to seemingly unrelated problems. Now that software is becoming available, it is increasingly attractive to use this general technique in preference to more specialised (and limited) programs. This paper has shown that

1. the Snell-McCullagh model for ordinal response data may be treated as a special instance of marginal models for repeated binary responses,
2. the GEE method of estimation is nearly as efficient as full maximum likelihood,
3. the approach allows extension of the model to include interactions between cutpoints and explanatory variables, and to deal with repeated ordinal measurements.

Some problems remain. In particular, the performance of the method for repeated measurements in small samples requires further investigation, particularly in view of its potential application in cross-over trials. In this context, the adequacy of the robust ASE requires further study. The alternative is to estimate the correlation structure and use the naive ASE, but estimation of a large number of correlations from a small sample is potentially hazardous. A further possibility is to model the correlation structure more parsimoniously in terms of the expected values and, perhaps, one further parameter expressing the strength of association between pre-treatment and follow-up measurements. It must be expected, however, that whatever approach turns out to be preferable, generalised estimating equation methods will prove better in small samples than the empirical weighted least squares approach, which is currently their main competitor.

Software

The computations described in this paper were carried out in S using the gee() function written by Vincent Carey and available on STATLIB.
The maximum likelihood analysis of the follow-up data was carried out using SAS PROC LOGISTIC. Agresti's (1989) analysis used SAS PROC CATMOD.

Acknowledgements

I am grateful to the associate editor and to the referees for their constructive criticism of an earlier version.

References

Agresti, A. (1989) A survey of models for repeated ordered categorical response data. Statistics in Medicine, 8.

Clayton, D.G. (1974) Some odds ratio statistics for the analysis of ordered categorical data. Biometrika, 61.

Francom, S.F., Chuang, C. and Landis, J.R. (1989) A log-linear model for ordinal data to characterize differential change among treatments. Statistics in Medicine, 8.

Jones, B. and Kenward, M.G. (1989) The Design and Analysis of Cross-over Trials. Chapman and Hall, London.

Liang, K.-Y. and Zeger, S.L. (1986) Longitudinal data analysis using generalized linear models. Biometrika, 73.

Liang, K.-Y., Zeger, S.L. and Qaqish, B. (1992) Multivariate regression analyses for categorical data (with discussion). Journal of the Royal Statistical Society, Series B, 54.

McCullagh, P. (1980) Regression models for ordinal data (with discussion). Journal of the Royal Statistical Society, Series B, 42.

Prentice, R.L. and Zhao, L.P. (1991) Estimating equations in means and covariances of multivariate discrete and continuous responses. Biometrics, 47.

Snell, E.J. (1964) A scaling procedure for ordered categorical data. Biometrics, 20.

Wilkinson, G.N. and Rogers, C.E. (1973) Symbolic description of factorial models for analysis of variance. Applied Statistics, 22.


More information

Mantel-Haenszel Test Statistics. for Correlated Binary Data. Department of Statistics, North Carolina State University. Raleigh, NC

Mantel-Haenszel Test Statistics. for Correlated Binary Data. Department of Statistics, North Carolina State University. Raleigh, NC Mantel-Haenszel Test Statistics for Correlated Binary Data by Jie Zhang and Dennis D. Boos Department of Statistics, North Carolina State University Raleigh, NC 27695-8203 tel: (919) 515-1918 fax: (919)

More information

Ronald Heck Week 14 1 EDEP 768E: Seminar in Categorical Data Modeling (F2012) Nov. 17, 2012

Ronald Heck Week 14 1 EDEP 768E: Seminar in Categorical Data Modeling (F2012) Nov. 17, 2012 Ronald Heck Week 14 1 From Single Level to Multilevel Categorical Models This week we develop a two-level model to examine the event probability for an ordinal response variable with three categories (persist

More information

Testing Non-Linear Ordinal Responses in L2 K Tables

Testing Non-Linear Ordinal Responses in L2 K Tables RUHUNA JOURNA OF SCIENCE Vol. 2, September 2007, pp. 18 29 http://www.ruh.ac.lk/rjs/ ISSN 1800-279X 2007 Faculty of Science University of Ruhuna. Testing Non-inear Ordinal Responses in 2 Tables eslie Jayasekara

More information

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014 LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R Liang (Sally) Shan Nov. 4, 2014 L Laboratory for Interdisciplinary Statistical Analysis LISA helps VT researchers

More information

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES REVSTAT Statistical Journal Volume 13, Number 3, November 2015, 233 243 MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES Authors: Serpil Aktas Department of

More information
