Factor Analytic Models of Clustered Multivariate Data with Informative Censoring (refer to Dunson and Perreault, 2001, Biometrics 57, 302-308) Consider data in which multiple outcomes are collected for subunits nested within subjects, and some of the observations may be missing. Latent trait models discussed in lecture 16 can be generalized and applied to this setting. We are particularly interested in the case in which missingness may depend on the subject s latent traits, and hence missingness is informative. We will consider Bayesian methods motivated by applications to toxicology and epidemiology studies.
Motivation In many reproductive toxicology and epidemiology studies, multiple outcomes are measured for each subunit (e.g. sperm, embryo) that survives to the time of observation. Since subunits within healthier subjects may be more likely to have a variety of favorable outcomes, including survival, censoring due to subunit mortality is potentially informative about differences between subjects in the primary outcomes. Such informative censoring also occurs in longitudinal studies in which subjects who tend to miss visits frequently, perhaps due to illness or death, respond differently than other subjects. Since the censored data are not missing at random (Little and Rubin, 1987), failure to account for within-subject dependency between the missingness indicators and the observed outcomes can result in biased inference.
Spermatotoxicity Application Interest is in study effect of agent known to cause infertility at high dosages after short term exposure. A study was designed to detect toxicant-induced changes in sperm motility at relatively low levels of exposure. Adult male rats were randomized to one of four dose groups (0, 8, 24, 72), and there were 10 animals in each group. Rats were dosed daily for 14 days, and sperm were obtained from the proximal cauda epididymidis of each animal on day 15. These sperm had been exposed to the toxicant during their final differentiation in the testis and their maturation in the caput and corpus epididymidis. It is during this time that the capability for progressive motion develops in sperm.
Using computer aided sperm analysis (CASA) (reviewed by Boyers, Davis, and Katz, 1989), the proportion of motile (surviving) sperm was recorded for a sample of 100-200 sperm from each rat, as were the (x, y) coordinates of points (60 per sec) along the path travelled by each surviving sperm. We use three kinematic measures (LDV: linear displacement velocity, LNR: linearity, and PRD: predictability), to quantify the progressive motility of each surviving sperm.
Summary of sperm motility data (LDV = linear displacement velocity, LNR = linearity, PRD = predictability). Outcome Dose Mean SD Corr. with % Survival % Survival 0 83.6 0.042 1.000 8 80.4 0.065 1.000 24 79.5 0.078 1.000 72 62.2 0.126 1.000 LDV 0 88.4 9.21 0.153 8 76.1 7.54 0.146 24 82.1 15.6 0.053 72 77.2 13.3-0.687 LNR 0 0.219 0.013 0.079 8 0.216 0.013 0.069 24 0.207 0.012-0.043 72 0.206 0.020-0.386 PRD 0 25.5 2.70 0.336 8 22.0 2.52 0.202 24 24.3 5.05 0.131 72 23.6 3.38-0.516 Statistics are based on average values for each animal
There is a decreasing trend in the proportion of surviving sperm with increasing dose There is a decline in LDV, LNR, and PRD with increasing dose; though a conventional MANOVA test for an overall effect of dose was not significant (P = 0.224). Within-animal correlation between each of the kinematic measures and the proportion of surviving sperm tended to decrease from positive values in the unexposed and low dose groups to negative values in the higher dose groups.
Such a trend could be due to variability between rats in the stage of the sperm production cycle when treatment occurs combined with differences in the effect of exposure at different stages. We are interested in assessing agent-related changes in sperm survival and in progressive motility (as quantified by the kinematic measures). Conventional analyses that do not adjust for sperm survival can produce biased inferences in the presence of nonignorable missingness due to within-animal dependency in the proportion of surviving sperm and the motility of the survivors.
Models for nonignorable missing data Three general classes, depending on how the joint distribution of the primary data (y) and missingness indicators (z) is specified (Little and Rubin, 1987). 1. Selection approaches assume different parameters for the primary data model p(y) and for p(z y). 2. Pattern mixture approaches assume different parameters for p(y z) and for p(z). 3. Shared parameter approaches instead incorporate common parameters into models for p(y) and p(z).
Relevant Literature Cowles, Carlin, and Connett (1996) and Bradlow and Zaslavsky (1999) proposed Bayesian selection models for ordinal data with nonignorable missingness. Both approaches assume that there are several independent normal latent variables underlying the observed response. Cowles et al. accounted for nonignorable missingness by placing informative priors directly on the missing data, while Bradlow and Zaslavsky instead consider missingness as a category of the outcome variable. In clustered and longitudinal data settings, in which missingness may be associated with the true underlying response for a subject, it is natural to use a shared parameter model. Several shared parameter approaches have been proposed for analyzing longitudinal data when the time to censoring depends on a subject s underlying rate of change. Wu and Carroll (1988) described a pseudo-maximum likelihood approach under a probit censoring model. Wu and Bailey (1989) approximated this shared parameter model by conditioning on the time to censoring, and Schluchter (1992) and De Gruttola and Tu (1994) developed maximum likelihood procedures.
Outline of proposed approach Use a type of Bayesian shared parameter model for clustered multivariate data subject to nonignorable missingness. Instead of following the conventional shared parameter approach of incorporating the same subject-specific parameters into the primary data and missingness models, we instead link latent variables related to censoring to latent variables related to the primary outcomes through a linear model. A factor analytic structure is then used to relate these latent variables to the primary data and missingness indicators. Subunit-specific factors are incorporated to accommodate dependency between the multiple measurements on each subunit (e.g. sperm), and covariate effects are included in each level of the hierarchy.
Multilevel Factor Analytic Model Consider a study involving N clusters (e.g. subjects), the ith of which contains n i subunits. For subunit j (j = 1,..., n i ) within cluster i (i = 1,..., N), M measures of health are collected y ij = (y ij1,..., y ijm ) T. Each subunit is either uncensored (z ij = 1), in which case y ij is completely observed, or censored (z ij = 0), in which case y ij is missing.
Model for Primary Outcomes: We model the primary outcomes y ij using a multilevel factor analytic model: y ij = βu i1 + Λ 1 V i1 a i1 + Λ 2 V i2 b ij + ɛ ij ; i = 1,..., N; j = 1,..., n i,(1) β is a M r 1 matrix of parameters with row vectors β 1,..., β M, u i1 is a r 1 1 vector of cluster-level covariates, a i1 is a p 1 1 vector of cluster-specific latent variables, Λ 1 is a M p 1 factor loadings matrix, v i1 is a p 1 1 vector of covariates modifying the expression of a i1, V i1 = diag(v i1 ), b ij is a q 1 vector of subunit-specific latent variables, Λ 2 is a M q factor loadings matrix, v i2 is a q 1 vector of covariates modifying the expression of b ij, V i2 = diag(v i2 ), ɛ ij is a M 1 vector of residuals with independently distributed elements ɛ ijk N(0, τ 1 k ), k = 1,..., M, τ = (τ 1,..., τ M ) T are precision parameters. a i and b ij are independent and b ijl N(0, ν 1 l ), l = 1,..., q, where ν = (ν 1,..., ν q ) T are precision parameters.
Model for Censoring Process We assume that the censoring probability relates to the latent traits a i2 and to the covariates u i2 through the generalized linear model: h{pr(z ij = 1 a i, u i, v i )} = u T i2γ + λ T 3 V i3 a i2, (2) h( ) is a monotonic link function (e.g. logistic), γ is a r 2 1 vector of parameters, λ 3 is a p 2 1 factor loadings vector, v i3 is a p 2 1 vector of covariates modifying the expression of a i2, and V i3 = diag(v i3 ).
Comments Without additional structure, the models for the primary outcomes and censoring mechanism assume an ignorable censoring mechanism. To allow for nonignorable missingness, we could 1. Use cluster-specific summary statistics of y ij as covariates in the censoring model (selection approach) 2. Include the proportion of surviving subunits as a covariate in the primary outcome model (pattern-mixture approach) 3. Let a i2 a i1 (shared parameter approach).
Linking the Primary Outcome and Censoring Models We instead specify a two stage hierarchical model for the cluster-level latent variables: a i1 = α 11 x i1 + α 12 w(x i1, a i2 ) + e i1, a i2 = α 2 x i2 + e i2, (3) α 11 is a p 1 s 1 matrix of parameters, x i1 is an s 1 1 covariate vector, α 12 is a p 1 s 2 matrix of parameters, w i = w(x i1, a i2 ) is a s 2 1 vector with elements that are functions of x i1 and a i2, e i1 is a p 1 1 vector of residuals with independently distributed elements e i1l N(0, φ 1 l ), l = 1,..., p 1, φ 1 = (φ 1,..., φ p1 ) T are precision parameters, α 2 is a p 2 t matrix of parameters, x i2 is a t 1 covariate vector, e i2 is a p 2 1 vector of residuals with independently distributed elements e i2l N(0, 1), l = 1,..., p 2.
Comments Our model assumes that the data are missing at random (MAR) conditional on the latent variables a i1 and a i2. If the latent variables were observed, expression (3) would be in the form of a pattern mixture model. Since the censoring-related latent variables (a i2 ) are incorporated into the model for the factors (a i1 ) underlying the primary outcomes, our hierarchical model can be viewed as a shared parameter model. Our approach accommodates a broader class of data structures and missing data mechanisms than current univariate shared parameter approaches (e.g. De Gruttola and Tu, 1994) that use the same subject-specific parameters in the primary data and censoring models.
The factor analytic form of models (1) and (2) has several advantages over conventional random-effects approaches (Laird and Ware, 1982). By incorporating outcome-specific factor loadings, flexible models can be formulated that include a single latent variable measuring the health of a subject. In sperm motility applications, this latent variable provides a subject-specific motility score that can be used both to identify subjects with high or low motility and to simplify analyses relating sperm motility to other endpoints (e.g. fertility). By including covariate effects on the subject-specific latent variable instead of on each of the measured outcomes, one can reduce the dimensionality of the analysis. The parameters characterizing such effects have appealing interpretations.