Factor Analytic Models of Clustered Multivariate Data with Informative Censoring (refer to Dunson and Perreault, 2001, Biometrics 57, )

Size: px
Start display at page:

Download "Factor Analytic Models of Clustered Multivariate Data with Informative Censoring (refer to Dunson and Perreault, 2001, Biometrics 57, )"

Transcription

1 Factor Analytic Models of Clustered Multivariate Data with Informative Censoring (refer to Dunson and Perreault, 2001, Biometrics 57, ) Consider data in which multiple outcomes are collected for subunits nested within subjects, and some of the observations may be missing. Latent trait models discussed in lecture 16 can be generalized and applied to this setting. We are particularly interested in the case in which missingness may depend on the subject s latent traits, and hence missingness is informative. We will consider Bayesian methods motivated by applications to toxicology and epidemiology studies.

2 Motivation In many reproductive toxicology and epidemiology studies, multiple outcomes are measured for each subunit (e.g. sperm, embryo) that survives to the time of observation. Since subunits within healthier subjects may be more likely to have a variety of favorable outcomes, including survival, censoring due to subunit mortality is potentially informative about differences between subjects in the primary outcomes. Such informative censoring also occurs in longitudinal studies in which subjects who tend to miss visits frequently, perhaps due to illness or death, respond differently than other subjects. Since the censored data are not missing at random (Little and Rubin, 1987), failure to account for within-subject dependency between the missingness indicators and the observed outcomes can result in biased inference.

3 Spermatotoxicity Application Interest is in study effect of agent known to cause infertility at high dosages after short term exposure. A study was designed to detect toxicant-induced changes in sperm motility at relatively low levels of exposure. Adult male rats were randomized to one of four dose groups (0, 8, 24, 72), and there were 10 animals in each group. Rats were dosed daily for 14 days, and sperm were obtained from the proximal cauda epididymidis of each animal on day 15. These sperm had been exposed to the toxicant during their final differentiation in the testis and their maturation in the caput and corpus epididymidis. It is during this time that the capability for progressive motion develops in sperm.

4 Using computer aided sperm analysis (CASA) (reviewed by Boyers, Davis, and Katz, 1989), the proportion of motile (surviving) sperm was recorded for a sample of sperm from each rat, as were the (x, y) coordinates of points (60 per sec) along the path travelled by each surviving sperm. We use three kinematic measures (LDV: linear displacement velocity, LNR: linearity, and PRD: predictability), to quantify the progressive motility of each surviving sperm.

5 Summary of sperm motility data (LDV = linear displacement velocity, LNR = linearity, PRD = predictability). Outcome Dose Mean SD Corr. with % Survival % Survival LDV LNR PRD Statistics are based on average values for each animal

6 There is a decreasing trend in the proportion of surviving sperm with increasing dose There is a decline in LDV, LNR, and PRD with increasing dose; though a conventional MANOVA test for an overall effect of dose was not significant (P = 0.224). Within-animal correlation between each of the kinematic measures and the proportion of surviving sperm tended to decrease from positive values in the unexposed and low dose groups to negative values in the higher dose groups.

7 Such a trend could be due to variability between rats in the stage of the sperm production cycle when treatment occurs combined with differences in the effect of exposure at different stages. We are interested in assessing agent-related changes in sperm survival and in progressive motility (as quantified by the kinematic measures). Conventional analyses that do not adjust for sperm survival can produce biased inferences in the presence of nonignorable missingness due to within-animal dependency in the proportion of surviving sperm and the motility of the survivors.

8 Models for nonignorable missing data Three general classes, depending on how the joint distribution of the primary data (y) and missingness indicators (z) is specified (Little and Rubin, 1987). 1. Selection approaches assume different parameters for the primary data model p(y) and for p(z y). 2. Pattern mixture approaches assume different parameters for p(y z) and for p(z). 3. Shared parameter approaches instead incorporate common parameters into models for p(y) and p(z).

9 Relevant Literature Cowles, Carlin, and Connett (1996) and Bradlow and Zaslavsky (1999) proposed Bayesian selection models for ordinal data with nonignorable missingness. Both approaches assume that there are several independent normal latent variables underlying the observed response. Cowles et al. accounted for nonignorable missingness by placing informative priors directly on the missing data, while Bradlow and Zaslavsky instead consider missingness as a category of the outcome variable. In clustered and longitudinal data settings, in which missingness may be associated with the true underlying response for a subject, it is natural to use a shared parameter model. Several shared parameter approaches have been proposed for analyzing longitudinal data when the time to censoring depends on a subject s underlying rate of change. Wu and Carroll (1988) described a pseudo-maximum likelihood approach under a probit censoring model. Wu and Bailey (1989) approximated this shared parameter model by conditioning on the time to censoring, and Schluchter (1992) and De Gruttola and Tu (1994) developed maximum likelihood procedures.

10 Outline of proposed approach Use a type of Bayesian shared parameter model for clustered multivariate data subject to nonignorable missingness. Instead of following the conventional shared parameter approach of incorporating the same subject-specific parameters into the primary data and missingness models, we instead link latent variables related to censoring to latent variables related to the primary outcomes through a linear model. A factor analytic structure is then used to relate these latent variables to the primary data and missingness indicators. Subunit-specific factors are incorporated to accommodate dependency between the multiple measurements on each subunit (e.g. sperm), and covariate effects are included in each level of the hierarchy.

11 Multilevel Factor Analytic Model Consider a study involving N clusters (e.g. subjects), the ith of which contains n i subunits. For subunit j (j = 1,..., n i ) within cluster i (i = 1,..., N), M measures of health are collected y ij = (y ij1,..., y ijm ) T. Each subunit is either uncensored (z ij = 1), in which case y ij is completely observed, or censored (z ij = 0), in which case y ij is missing.

12 Model for Primary Outcomes: We model the primary outcomes y ij using a multilevel factor analytic model: y ij = βu i1 + Λ 1 V i1 a i1 + Λ 2 V i2 b ij + ɛ ij ; i = 1,..., N; j = 1,..., n i,(1) β is a M r 1 matrix of parameters with row vectors β 1,..., β M, u i1 is a r 1 1 vector of cluster-level covariates, a i1 is a p 1 1 vector of cluster-specific latent variables, Λ 1 is a M p 1 factor loadings matrix, v i1 is a p 1 1 vector of covariates modifying the expression of a i1, V i1 = diag(v i1 ), b ij is a q 1 vector of subunit-specific latent variables, Λ 2 is a M q factor loadings matrix, v i2 is a q 1 vector of covariates modifying the expression of b ij, V i2 = diag(v i2 ), ɛ ij is a M 1 vector of residuals with independently distributed elements ɛ ijk N(0, τ 1 k ), k = 1,..., M, τ = (τ 1,..., τ M ) T are precision parameters. a i and b ij are independent and b ijl N(0, ν 1 l ), l = 1,..., q, where ν = (ν 1,..., ν q ) T are precision parameters.

13 Model for Censoring Process We assume that the censoring probability relates to the latent traits a i2 and to the covariates u i2 through the generalized linear model: h{pr(z ij = 1 a i, u i, v i )} = u T i2γ + λ T 3 V i3 a i2, (2) h( ) is a monotonic link function (e.g. logistic), γ is a r 2 1 vector of parameters, λ 3 is a p 2 1 factor loadings vector, v i3 is a p 2 1 vector of covariates modifying the expression of a i2, and V i3 = diag(v i3 ).

14 Comments Without additional structure, the models for the primary outcomes and censoring mechanism assume an ignorable censoring mechanism. To allow for nonignorable missingness, we could 1. Use cluster-specific summary statistics of y ij as covariates in the censoring model (selection approach) 2. Include the proportion of surviving subunits as a covariate in the primary outcome model (pattern-mixture approach) 3. Let a i2 a i1 (shared parameter approach).

15 Linking the Primary Outcome and Censoring Models We instead specify a two stage hierarchical model for the cluster-level latent variables: a i1 = α 11 x i1 + α 12 w(x i1, a i2 ) + e i1, a i2 = α 2 x i2 + e i2, (3) α 11 is a p 1 s 1 matrix of parameters, x i1 is an s 1 1 covariate vector, α 12 is a p 1 s 2 matrix of parameters, w i = w(x i1, a i2 ) is a s 2 1 vector with elements that are functions of x i1 and a i2, e i1 is a p 1 1 vector of residuals with independently distributed elements e i1l N(0, φ 1 l ), l = 1,..., p 1, φ 1 = (φ 1,..., φ p1 ) T are precision parameters, α 2 is a p 2 t matrix of parameters, x i2 is a t 1 covariate vector, e i2 is a p 2 1 vector of residuals with independently distributed elements e i2l N(0, 1), l = 1,..., p 2.

16 Comments Our model assumes that the data are missing at random (MAR) conditional on the latent variables a i1 and a i2. If the latent variables were observed, expression (3) would be in the form of a pattern mixture model. Since the censoring-related latent variables (a i2 ) are incorporated into the model for the factors (a i1 ) underlying the primary outcomes, our hierarchical model can be viewed as a shared parameter model. Our approach accommodates a broader class of data structures and missing data mechanisms than current univariate shared parameter approaches (e.g. De Gruttola and Tu, 1994) that use the same subject-specific parameters in the primary data and censoring models.

17 The factor analytic form of models (1) and (2) has several advantages over conventional random-effects approaches (Laird and Ware, 1982). By incorporating outcome-specific factor loadings, flexible models can be formulated that include a single latent variable measuring the health of a subject. In sperm motility applications, this latent variable provides a subject-specific motility score that can be used both to identify subjects with high or low motility and to simplify analyses relating sperm motility to other endpoints (e.g. fertility). By including covariate effects on the subject-specific latent variable instead of on each of the measured outcomes, one can reduce the dimensionality of the analysis. The parameters characterizing such effects have appealing interpretations.

Joint Modeling of Longitudinal Item Response Data and Survival

Joint Modeling of Longitudinal Item Response Data and Survival Joint Modeling of Longitudinal Item Response Data and Survival Jean-Paul Fox University of Twente Department of Research Methodology, Measurement and Data Analysis Faculty of Behavioural Sciences Enschede,

More information

STA 216, GLM, Lecture 16. October 29, 2007

STA 216, GLM, Lecture 16. October 29, 2007 STA 216, GLM, Lecture 16 October 29, 2007 Efficient Posterior Computation in Factor Models Underlying Normal Models Generalized Latent Trait Models Formulation Genetic Epidemiology Illustration Structural

More information

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

More information

Sample Size and Power Considerations for Longitudinal Studies

Sample Size and Power Considerations for Longitudinal Studies Sample Size and Power Considerations for Longitudinal Studies Outline Quantities required to determine the sample size in longitudinal studies Review of type I error, type II error, and power For continuous

More information

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model

More information

Bayes methods for categorical data. April 25, 2017

Bayes methods for categorical data. April 25, 2017 Bayes methods for categorical data April 25, 2017 Motivation for joint probability models Increasing interest in high-dimensional data in broad applications Focus may be on prediction, variable selection,

More information

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Tihomir Asparouhov 1, Bengt Muthen 2 Muthen & Muthen 1 UCLA 2 Abstract Multilevel analysis often leads to modeling

More information

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January

More information

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Jonathan Gruhl March 18, 2010 1 Introduction Researchers commonly apply item response theory (IRT) models to binary and ordinal

More information

Using Historical Experimental Information in the Bayesian Analysis of Reproduction Toxicological Experimental Results

Using Historical Experimental Information in the Bayesian Analysis of Reproduction Toxicological Experimental Results Using Historical Experimental Information in the Bayesian Analysis of Reproduction Toxicological Experimental Results Jing Zhang Miami University August 12, 2014 Jing Zhang (Miami University) Using Historical

More information

Default Priors and Effcient Posterior Computation in Bayesian

Default Priors and Effcient Posterior Computation in Bayesian Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature

More information

A Fully Nonparametric Modeling Approach to. BNP Binary Regression

A Fully Nonparametric Modeling Approach to. BNP Binary Regression A Fully Nonparametric Modeling Approach to Binary Regression Maria Department of Applied Mathematics and Statistics University of California, Santa Cruz SBIES, April 27-28, 2012 Outline 1 2 3 Simulation

More information

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Michael J. Daniels and Chenguang Wang Jan. 18, 2009 First, we would like to thank Joe and Geert for a carefully

More information

Bayesian methods for missing data: part 1. Key Concepts. Nicky Best and Alexina Mason. Imperial College London

Bayesian methods for missing data: part 1. Key Concepts. Nicky Best and Alexina Mason. Imperial College London Bayesian methods for missing data: part 1 Key Concepts Nicky Best and Alexina Mason Imperial College London BAYES 2013, May 21-23, Erasmus University Rotterdam Missing Data: Part 1 BAYES2013 1 / 68 Outline

More information

arxiv: v1 [stat.me] 27 Feb 2017

arxiv: v1 [stat.me] 27 Feb 2017 arxiv:1702.08148v1 [stat.me] 27 Feb 2017 A Copula-based Imputation Model for Missing Data of Mixed Type in Multilevel Data Sets Jiali Wang 1, Bronwyn Loong 1, Anton H. Westveld 1,2, and Alan H. Welsh 3

More information

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Multilevel Statistical Models: 3 rd edition, 2003 Contents Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction

More information

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals John W. Mac McDonald & Alessandro Rosina Quantitative Methods in the Social Sciences Seminar -

More information

Optimal rules for timing intercourse to achieve pregnancy

Optimal rules for timing intercourse to achieve pregnancy Optimal rules for timing intercourse to achieve pregnancy Bruno Scarpa and David Dunson Dipartimento di Statistica ed Economia Applicate Università di Pavia Biostatistics Branch, National Institute of

More information

Fitting Multidimensional Latent Variable Models using an Efficient Laplace Approximation

Fitting Multidimensional Latent Variable Models using an Efficient Laplace Approximation Fitting Multidimensional Latent Variable Models using an Efficient Laplace Approximation Dimitris Rizopoulos Department of Biostatistics, Erasmus University Medical Center, the Netherlands d.rizopoulos@erasmusmc.nl

More information

CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity. Outline:

CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity. Outline: CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity Outline: 1. NIEHS Uterine Fibroid Study Design of Study Scientific Questions Difficulties 2.

More information

A Joint Model with Marginal Interpretation for Longitudinal Continuous and Time-to-event Outcomes

A Joint Model with Marginal Interpretation for Longitudinal Continuous and Time-to-event Outcomes A Joint Model with Marginal Interpretation for Longitudinal Continuous and Time-to-event Outcomes Achmad Efendi 1, Geert Molenberghs 2,1, Edmund Njagi 1, Paul Dendale 3 1 I-BioStat, Katholieke Universiteit

More information

Anders Skrondal. Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine. Based on joint work with Sophia Rabe-Hesketh

Anders Skrondal. Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine. Based on joint work with Sophia Rabe-Hesketh Constructing Latent Variable Models using Composite Links Anders Skrondal Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine Based on joint work with Sophia Rabe-Hesketh

More information

Gibbs Sampling in Latent Variable Models #1

Gibbs Sampling in Latent Variable Models #1 Gibbs Sampling in Latent Variable Models #1 Econ 690 Purdue University Outline 1 Data augmentation 2 Probit Model Probit Application A Panel Probit Panel Probit 3 The Tobit Model Example: Female Labor

More information

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data 1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions

More information

A Nonparametric Bayesian Model for Multivariate Ordinal Data

A Nonparametric Bayesian Model for Multivariate Ordinal Data A Nonparametric Bayesian Model for Multivariate Ordinal Data Athanasios Kottas, University of California at Santa Cruz Peter Müller, The University of Texas M. D. Anderson Cancer Center Fernando A. Quintana,

More information

MULTILEVEL IMPUTATION 1

MULTILEVEL IMPUTATION 1 MULTILEVEL IMPUTATION 1 Supplement B: MCMC Sampling Steps and Distributions for Two-Level Imputation This document gives technical details of the full conditional distributions used to draw regression

More information

Bayesian Quantile Regression for Longitudinal Studies with Nonignorable Missing Data

Bayesian Quantile Regression for Longitudinal Studies with Nonignorable Missing Data Biometrics 66, 105 114 March 2010 DOI: 10.1111/j.1541-0420.2009.01269.x Bayesian Quantile Regression for Longitudinal Studies with Nonignorable Missing Data Ying Yuan and Guosheng Yin Department of Biostatistics,

More information

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing Alessandra Mattei Dipartimento di Statistica G. Parenti Università

More information

Some methods for handling missing values in outcome variables. Roderick J. Little

Some methods for handling missing values in outcome variables. Roderick J. Little Some methods for handling missing values in outcome variables Roderick J. Little Missing data principles Likelihood methods Outline ML, Bayes, Multiple Imputation (MI) Robust MAR methods Predictive mean

More information

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation NELS 88 Table 2.3 Adjusted odds ratios of eighth-grade students in 988 performing below basic levels of reading and mathematics in 988 and dropping out of school, 988 to 990, by basic demographics Variable

More information

Bayesian methods for latent trait modeling of longitudinal data

Bayesian methods for latent trait modeling of longitudinal data Bayesian methods for latent trait modeling of longitudinal data DAVID B. DUNSON Biostatistics Branch, National Institute of Environmental Health Sciences MD A3-03, P.O. Box 12233 Research Triangle Park,

More information

Generalized Linear Models for Non-Normal Data

Generalized Linear Models for Non-Normal Data Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture

More information

Bayesian non-parametric model to longitudinally predict churn

Bayesian non-parametric model to longitudinally predict churn Bayesian non-parametric model to longitudinally predict churn Bruno Scarpa Università di Padova Conference of European Statistics Stakeholders Methodologists, Producers and Users of European Statistics

More information

Latent Factor Regression Models for Grouped Outcomes

Latent Factor Regression Models for Grouped Outcomes Latent Factor Regression Models for Grouped Outcomes Dawn Woodard Operations Research and Information Engineering Cornell University with T. M. T. Love, S. W. Thurston, D. Ruppert S. Sathyanarayana and

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs) 36-309/749 Experimental Design for Behavioral and Social Sciences Dec 1, 2015 Lecture 11: Mixed Models (HLMs) Independent Errors Assumption An error is the deviation of an individual observed outcome (DV)

More information

On Estimating the Relationship between Longitudinal Measurements and Time-to-Event Data Using a Simple Two-Stage Procedure

On Estimating the Relationship between Longitudinal Measurements and Time-to-Event Data Using a Simple Two-Stage Procedure Biometrics DOI: 10.1111/j.1541-0420.2009.01324.x On Estimating the Relationship between Longitudinal Measurements and Time-to-Event Data Using a Simple Two-Stage Procedure Paul S. Albert 1, and Joanna

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

A Bayesian multi-dimensional couple-based latent risk model for infertility

A Bayesian multi-dimensional couple-based latent risk model for infertility A Bayesian multi-dimensional couple-based latent risk model for infertility Zhen Chen, Ph.D. Eunice Kennedy Shriver National Institute of Child Health and Human Development National Institutes of Health

More information

Statistical Methods. Missing Data snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23

Statistical Methods. Missing Data  snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23 1 / 23 Statistical Methods Missing Data http://www.stats.ox.ac.uk/ snijders/sm.htm Tom A.B. Snijders University of Oxford November, 2011 2 / 23 Literature: Joseph L. Schafer and John W. Graham, Missing

More information

A Generalized Global Rank Test for Multiple, Possibly Censored, Outcomes

A Generalized Global Rank Test for Multiple, Possibly Censored, Outcomes A Generalized Global Rank Test for Multiple, Possibly Censored, Outcomes Ritesh Ramchandani Harvard School of Public Health August 5, 2014 Ritesh Ramchandani (HSPH) Global Rank Test for Multiple Outcomes

More information

General structural model Part 2: Categorical variables and beyond. Psychology 588: Covariance structure and factor models

General structural model Part 2: Categorical variables and beyond. Psychology 588: Covariance structure and factor models General structural model Part 2: Categorical variables and beyond Psychology 588: Covariance structure and factor models Categorical variables 2 Conventional (linear) SEM assumes continuous observed variables

More information

Joint Longitudinal and Survival-cure Models with Constrained Parameters in Tumour Xenograft Experiments

Joint Longitudinal and Survival-cure Models with Constrained Parameters in Tumour Xenograft Experiments Joint Longitudinal and Survival-cure Models with Constrained Parameters in Tumour Xenograft Experiments Bao, Yanchun and Pan, Jianxin and Dai, Hongsheng and Fang, Hong-Bin 2009 MIMS EPrint: 2009.48 Manchester

More information

Fractional Imputation in Survey Sampling: A Comparative Review

Fractional Imputation in Survey Sampling: A Comparative Review Fractional Imputation in Survey Sampling: A Comparative Review Shu Yang Jae-Kwang Kim Iowa State University Joint Statistical Meetings, August 2015 Outline Introduction Fractional imputation Features Numerical

More information

Chapter 4. Parametric Approach. 4.1 Introduction

Chapter 4. Parametric Approach. 4.1 Introduction Chapter 4 Parametric Approach 4.1 Introduction The missing data problem is already a classical problem that has not been yet solved satisfactorily. This problem includes those situations where the dependent

More information

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Examples: Multilevel Modeling With Complex Survey Data CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Complex survey data refers to data obtained by stratification, cluster sampling and/or

More information

Richard N. Jones, Sc.D. HSPH Kresge G2 October 5, 2011

Richard N. Jones, Sc.D. HSPH Kresge G2 October 5, 2011 Harvard Catalyst Biostatistical Seminar Neuropsychological Proles in Alzheimer's Disease and Cerebral Infarction: A Longitudinal MIMIC Model An Overview of Structural Equation Modeling using Mplus Richard

More information

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE

Biostatistics Workshop Longitudinal Data Analysis. Session 4 GARRETT FITZMAURICE Biostatistics Workshop 2008 Longitudinal Data Analysis Session 4 GARRETT FITZMAURICE Harvard University 1 LINEAR MIXED EFFECTS MODELS Motivating Example: Influence of Menarche on Changes in Body Fat Prospective

More information

6 Pattern Mixture Models

6 Pattern Mixture Models 6 Pattern Mixture Models A common theme underlying the methods we have discussed so far is that interest focuses on making inference on parameters in a parametric or semiparametric model for the full data

More information

A Nonparametric Approach Using Dirichlet Process for Hierarchical Generalized Linear Mixed Models

A Nonparametric Approach Using Dirichlet Process for Hierarchical Generalized Linear Mixed Models Journal of Data Science 8(2010), 43-59 A Nonparametric Approach Using Dirichlet Process for Hierarchical Generalized Linear Mixed Models Jing Wang Louisiana State University Abstract: In this paper, we

More information

Missing covariate data in matched case-control studies: Do the usual paradigms apply?

Missing covariate data in matched case-control studies: Do the usual paradigms apply? Missing covariate data in matched case-control studies: Do the usual paradigms apply? Bryan Langholz USC Department of Preventive Medicine Joint work with Mulugeta Gebregziabher Larry Goldstein Mark Huberman

More information

Default Priors and Efficient Posterior Computation in Bayesian Factor Analysis

Default Priors and Efficient Posterior Computation in Bayesian Factor Analysis Default Priors and Efficient Posterior Computation in Bayesian Factor Analysis Joyee Ghosh Institute of Statistics and Decision Sciences, Duke University Box 90251, Durham, NC 27708 joyee@stat.duke.edu

More information

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS Donald B. Rubin Harvard University 1 Oxford Street, 7th Floor Cambridge, MA 02138 USA Tel: 617-495-5496; Fax: 617-496-8057 email: rubin@stat.harvard.edu

More information

Analysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington

Analysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington Analysis of Longitudinal Data Patrick J Heagerty PhD Department of Biostatistics University of Washington Auckland 8 Session One Outline Examples of longitudinal data Scientific motivation Opportunities

More information

Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University

Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University A SURVEY OF VARIANCE COMPONENTS ESTIMATION FROM BINARY DATA by Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University BU-1211-M May 1993 ABSTRACT The basic problem of variance components

More information

A general mixed model approach for spatio-temporal regression data

A general mixed model approach for spatio-temporal regression data A general mixed model approach for spatio-temporal regression data Thomas Kneib, Ludwig Fahrmeir & Stefan Lang Department of Statistics, Ludwig-Maximilians-University Munich 1. Spatio-temporal regression

More information

Dynamic analysis of binary longitudinal data

Dynamic analysis of binary longitudinal data Dynamic analysis of binary longitudinal data Ørnulf Borgan Department of Mathematics University of Oslo Based on joint work with Rosemeire L. Fiaccone, Robin Henderson and Mauricio L. Barreto 1 Outline:

More information

Generalized linear models with a coarsened covariate

Generalized linear models with a coarsened covariate Appl. Statist. (2004) 53, Part 2, pp. 279 292 Generalized linear models with a coarsened covariate Stuart Lipsitz, Medical University of South Carolina, Charleston, USA Michael Parzen, University of Chicago,

More information

Comparison of multiple imputation methods for systematically and sporadically missing multilevel data

Comparison of multiple imputation methods for systematically and sporadically missing multilevel data Comparison of multiple imputation methods for systematically and sporadically missing multilevel data V. Audigier, I. White, S. Jolani, T. Debray, M. Quartagno, J. Carpenter, S. van Buuren, M. Resche-Rigon

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu

More information

Bayesian Multilevel Latent Class Models for the Multiple. Imputation of Nested Categorical Data

Bayesian Multilevel Latent Class Models for the Multiple. Imputation of Nested Categorical Data Bayesian Multilevel Latent Class Models for the Multiple Imputation of Nested Categorical Data Davide Vidotto Jeroen K. Vermunt Katrijn van Deun Department of Methodology and Statistics, Tilburg University

More information

Generalized Linear Mixed Models for Longitudinal Data with Missing Values: A Monte Carlo EM Approach

Generalized Linear Mixed Models for Longitudinal Data with Missing Values: A Monte Carlo EM Approach International Journal of Probability and Statistics 2016, 5(3): 82-88 DOI: 10.5923/j.ijps.20160503.03 Generalized Linear Mixed Models for Longitudinal Data with Missing Values: A Monte Carlo EM Approach

More information

A Joint Longitudinal-Survival Model with Possible Cure: An Analysis of Patient Outcomes on the Liver Transplant Waiting List

A Joint Longitudinal-Survival Model with Possible Cure: An Analysis of Patient Outcomes on the Liver Transplant Waiting List University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 1-1-2012 A Joint Longitudinal-Survival Model with Possible Cure: An Analysis of Patient Outcomes on the Liver Transplant

More information

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3 University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.

More information

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /rssa.

University of Bristol - Explore Bristol Research. Peer reviewed version. Link to published version (if available): /rssa. Goldstein, H., Carpenter, J. R., & Browne, W. J. (2014). Fitting multilevel multivariate models with missing data in responses and covariates that may include interactions and non-linear terms. Journal

More information

MODEL ASSESSMENT FOR MODELS WITH MISSING DATA. Xiaolei Zhou. Chapel Hill 2015

MODEL ASSESSMENT FOR MODELS WITH MISSING DATA. Xiaolei Zhou. Chapel Hill 2015 MODEL ASSESSMENT FOR MODELS WITH MISSING DATA Xiaolei Zhou A dissertation submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment of the requirements for the

More information

Variable selection and machine learning methods in causal inference

Variable selection and machine learning methods in causal inference Variable selection and machine learning methods in causal inference Debashis Ghosh Department of Biostatistics and Informatics Colorado School of Public Health Joint work with Yeying Zhu, University of

More information

PIRLS 2016 Achievement Scaling Methodology 1

PIRLS 2016 Achievement Scaling Methodology 1 CHAPTER 11 PIRLS 2016 Achievement Scaling Methodology 1 The PIRLS approach to scaling the achievement data, based on item response theory (IRT) scaling with marginal estimation, was developed originally

More information

REGRESSION ANALYSIS IN LONGITUDINAL STUDIES WITH NON-IGNORABLE MISSING OUTCOMES

REGRESSION ANALYSIS IN LONGITUDINAL STUDIES WITH NON-IGNORABLE MISSING OUTCOMES REGRESSION ANALYSIS IN LONGITUDINAL STUDIES WITH NON-IGNORABLE MISSING OUTCOMES by Changyu Shen B.S. in Biology University of Science and Technology of China, 1998 Submitted to the Graduate Faculty of

More information

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person

More information

Bayesian Hypothesis Testing in GLMs: One-Sided and Ordered Alternatives. 1(w i = h + 1)β h + ɛ i,

Bayesian Hypothesis Testing in GLMs: One-Sided and Ordered Alternatives. 1(w i = h + 1)β h + ɛ i, Bayesian Hypothesis Testing in GLMs: One-Sided and Ordered Alternatives Often interest may focus on comparing a null hypothesis of no difference between groups to an ordered restricted alternative. For

More information

MISSING or INCOMPLETE DATA

MISSING or INCOMPLETE DATA MISSING or INCOMPLETE DATA A (fairly) complete review of basic practice Don McLeish and Cyntha Struthers University of Waterloo Dec 5, 2015 Structure of the Workshop Session 1 Common methods for dealing

More information

1 Introduction A common problem in categorical data analysis is to determine the effect of explanatory variables V on a binary outcome D of interest.

1 Introduction A common problem in categorical data analysis is to determine the effect of explanatory variables V on a binary outcome D of interest. Conditional and Unconditional Categorical Regression Models with Missing Covariates Glen A. Satten and Raymond J. Carroll Λ December 4, 1999 Abstract We consider methods for analyzing categorical regression

More information

Latent Variable Model for Weight Gain Prevention Data with Informative Intermittent Missingness

Latent Variable Model for Weight Gain Prevention Data with Informative Intermittent Missingness Journal of Modern Applied Statistical Methods Volume 15 Issue 2 Article 36 11-1-2016 Latent Variable Model for Weight Gain Prevention Data with Informative Intermittent Missingness Li Qin Yale University,

More information

Basics of Modern Missing Data Analysis

Basics of Modern Missing Data Analysis Basics of Modern Missing Data Analysis Kyle M. Lang Center for Research Methods and Data Analysis University of Kansas March 8, 2013 Topics to be Covered An introduction to the missing data problem Missing

More information

Linear Mixed Models for Longitudinal Data with Nonrandom Dropouts

Linear Mixed Models for Longitudinal Data with Nonrandom Dropouts Journal of Data Science 4(2006), 447-460 Linear Mixed Models for Longitudinal Data with Nonrandom Dropouts Ahmed M. Gad and Noha A. Youssif Cairo University Abstract: Longitudinal studies represent one

More information

Nonparametric Bayesian Modeling for Multivariate Ordinal. Data

Nonparametric Bayesian Modeling for Multivariate Ordinal. Data Nonparametric Bayesian Modeling for Multivariate Ordinal Data Athanasios Kottas, Peter Müller and Fernando Quintana Abstract We propose a probability model for k-dimensional ordinal outcomes, i.e., we

More information

Longitudinal analysis of ordinal data

Longitudinal analysis of ordinal data Longitudinal analysis of ordinal data A report on the external research project with ULg Anne-Françoise Donneau, Murielle Mauer June 30 th 2009 Generalized Estimating Equations (Liang and Zeger, 1986)

More information

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised )

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised ) Ronald H. Heck 1 University of Hawai i at Mānoa Handout #20 Specifying Latent Curve and Other Growth Models Using Mplus (Revised 12-1-2014) The SEM approach offers a contrasting framework for use in analyzing

More information

HISTORICAL CONTROL DATA OF SPERM ANALYSES FROM 2-GENERATION AND FERTILITY STUDIES IN HsdRccHan TM : WIST, Wistar Hannover Rats

HISTORICAL CONTROL DATA OF SPERM ANALYSES FROM 2-GENERATION AND FERTILITY STUDIES IN HsdRccHan TM : WIST, Wistar Hannover Rats HISTORICAL CONTROL DATA OF SPERM ANALYSES FROM 2-GENERATION AND FERTILITY STUDIES IN HsdRccHan TM : WIST, Wistar Hannover Rats Compiled from 2-Generation and Fertility Studies performed at RCC Ltd. Itingen/Switzerland

More information

Bayesian Multivariate Logistic Regression

Bayesian Multivariate Logistic Regression Bayesian Multivariate Logistic Regression Sean M. O Brien and David B. Dunson Biostatistics Branch National Institute of Environmental Health Sciences Research Triangle Park, NC 1 Goals Brief review of

More information

Survival Analysis I (CHL5209H)

Survival Analysis I (CHL5209H) Survival Analysis Dalla Lana School of Public Health University of Toronto olli.saarela@utoronto.ca January 7, 2015 31-1 Literature Clayton D & Hills M (1993): Statistical Models in Epidemiology. Not really

More information

Overview and Basic Approaches

Overview and Basic Approaches PART I Overview and Basic Approaches CHAPTER 1 Introduction 1.1. THE PROBLEM OF MISSING DATA Standard statistical methods have been developed to analyze rectangular data sets. Traditionally, the rows

More information

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke

More information

8 Nominal and Ordinal Logistic Regression

8 Nominal and Ordinal Logistic Regression 8 Nominal and Ordinal Logistic Regression 8.1 Introduction If the response variable is categorical, with more then two categories, then there are two options for generalized linear models. One relies on

More information

Longitudinal Modeling with Logistic Regression

Longitudinal Modeling with Logistic Regression Newsom 1 Longitudinal Modeling with Logistic Regression Longitudinal designs involve repeated measurements of the same individuals over time There are two general classes of analyses that correspond to

More information

Experimental Design and Data Analysis for Biologists

Experimental Design and Data Analysis for Biologists Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1

More information

Joint longitudinal and survival-cure models in tumour xenograft experiments

Joint longitudinal and survival-cure models in tumour xenograft experiments Joint longitudinal and survival-cure models in tumour xenograft experiments Jianxin Pan, Yanchun Bao,, Hongsheng Dai,, Hong-Bin Fang, Abstract In tumour xenograft experiments, treatment regimens are administered

More information

Analysing geoadditive regression data: a mixed model approach

Analysing geoadditive regression data: a mixed model approach Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression

More information

A Flexible Bayesian Approach to Monotone Missing. Data in Longitudinal Studies with Nonignorable. Missingness with Application to an Acute

A Flexible Bayesian Approach to Monotone Missing. Data in Longitudinal Studies with Nonignorable. Missingness with Application to an Acute A Flexible Bayesian Approach to Monotone Missing Data in Longitudinal Studies with Nonignorable Missingness with Application to an Acute Schizophrenia Clinical Trial Antonio R. Linero, Michael J. Daniels

More information

Comparison between conditional and marginal maximum likelihood for a class of item response models

Comparison between conditional and marginal maximum likelihood for a class of item response models (1/24) Comparison between conditional and marginal maximum likelihood for a class of item response models Francesco Bartolucci, University of Perugia (IT) Silvia Bacci, University of Perugia (IT) Claudia

More information

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Kosuke Imai Department of Politics Princeton University July 31 2007 Kosuke Imai (Princeton University) Nonignorable

More information

7 Sensitivity Analysis

7 Sensitivity Analysis 7 Sensitivity Analysis A recurrent theme underlying methodology for analysis in the presence of missing data is the need to make assumptions that cannot be verified based on the observed data. If the assumption

More information

A weighted simulation-based estimator for incomplete longitudinal data models

A weighted simulation-based estimator for incomplete longitudinal data models To appear in Statistics and Probability Letters, 113 (2016), 16-22. doi 10.1016/j.spl.2016.02.004 A weighted simulation-based estimator for incomplete longitudinal data models Daniel H. Li 1 and Liqun

More information

Missing Covariate Data in Matched Case-Control Studies

Missing Covariate Data in Matched Case-Control Studies Missing Covariate Data in Matched Case-Control Studies Department of Statistics North Carolina State University Paul Rathouz Dept. of Health Studies U. of Chicago prathouz@health.bsd.uchicago.edu with

More information

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,

More information

Estimating chopit models in gllamm Political efficacy example from King et al. (2002)

Estimating chopit models in gllamm Political efficacy example from King et al. (2002) Estimating chopit models in gllamm Political efficacy example from King et al. (2002) Sophia Rabe-Hesketh Department of Biostatistics and Computing Institute of Psychiatry King s College London Anders

More information

Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements:

Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements: Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported

More information

Longitudinal + Reliability = Joint Modeling

Longitudinal + Reliability = Joint Modeling Longitudinal + Reliability = Joint Modeling Carles Serrat Institute of Statistics and Mathematics Applied to Building CYTED-HAROSA International Workshop November 21-22, 2013 Barcelona Mainly from Rizopoulos,

More information

The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models

The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models John M. Neuhaus Charles E. McCulloch Division of Biostatistics University of California, San

More information

Review Article Analysis of Longitudinal and Survival Data: Joint Modeling, Inference Methods, and Issues

Review Article Analysis of Longitudinal and Survival Data: Joint Modeling, Inference Methods, and Issues Journal of Probability and Statistics Volume 2012, Article ID 640153, 17 pages doi:10.1155/2012/640153 Review Article Analysis of Longitudinal and Survival Data: Joint Modeling, Inference Methods, and

More information