On the multivariate probit model for exchangeable binary data. with covariates. 1 Introduction. Catalina Stefanescu 1 and Bruce W. Turnbull 2.

Size: px
Start display at page:

Download "On the multivariate probit model for exchangeable binary data. with covariates. 1 Introduction. Catalina Stefanescu 1 and Bruce W. Turnbull 2."

Transcription

1 On the multivariate probit model for exchangeable binary data with covariates Catalina Stefanescu 1 and Bruce W. Turnbull 2 1 London Business School, Regent s Park, London NW1 4SA, UK 2 School of Operations Research, Cornell University, Ithaca, NY 14853, USA Summary This paper considers the use of a multivariate binomial probit model for the analysis of correlated exchangeable binary data. The model can naturally accommodate both cluster and individual level covariates, while keeping a fairly flexible intracluster association structure. We discuss Bayesian estimation when a sample of independent clusters of varying sizes are available, and show how Gibbs sampling may be used to derive the posterior densities of parameters. The methodology is illustrated with two examples: the first involves epidemiological data from a study of familial disease aggregation; the second uses teratological data from a developmental toxicity application. Key words: Exchangeable binary data, multivariate binomial probit, Gibbs sampling, hierarchical Bayesian modelling. 1 Introduction Correlated binary data are a common feature in many applications, such as group randomized clinical trials, cluster sample surveys, or developmental toxicity experiments. In many such studies, it is reasonable to assume that the responses within a cluster are exchangeable. A sequence of binary random variables Corresponding author: cstefanescu@london.edu, Phone: +44 (0) , Fax: +44 (0)

2 2 C. Stefanescu and B. Turnbull: Multivariate Probit Model Y 1, Y 2,... is exchangeable if Pr(Y 1 = y 1,..., Y r = y r ) = Pr(Y 1 = y π(1),..., Y r = y π(r) ) for any r, any (y 1,..., y r ) and any permutation π of 1, 2,..., r. There is a large literature that has been devoted to the analysis of clustered binary data; an extensive review of modelling approaches is provided by Pendergast et al. (1996). However, fewer papers have focused specifically on the exchangeable case, and most of the research on this topic has been devoted to non parametric models and likelihood estimation, e.g. Bowman and George (1995), Stefanescu and Turnbull (2003) and Xu and Prorok (2003). These papers investigate a saturated model in which interactions of all orders are allowed, but treat only homogeneous populations (or subpopulations). See also George and Kodell (1996). However in the heterogeneous case, it may be reasonable to assume that the clustered binary responses are exchangeable only after accounting for the presence of explanatory variables. This is what we will mean by exchangeable binary data with covariates. The inclusion of covariates is an important issue, because their omission may bias the results of the analysis. Also, the effects of covariates on the marginal means may themselves be of interest. Covariates may act either at cluster level or individual level a discussion on separating individual level and cluster level effects is given by Begg and Parides (2003). George and Bowman (1995) propose inclusion of cluster level covariates in the saturated model by means of a folded logistic parametrization. As noted by Aerts et al. (2002, p.55), this parametrization is not invariant to reversal of the binary coding, it is valid only for a restricted range of the regressor function, and the model does not simplify to the binomial model under independence. In addition, the intracluster correlation is severely restricted for large cluster sizes. The q power model developed by Kuk (2004) is based on the same approach as George and Bowman (1995), but is more flexible in the choice of regressor function and does reduce to the binomial model under independence. However the q power model still is not coding invariant. This is an undesirable feature, although Kuk (2004, Sec.3.1) has argued that this asymmetry may be natural to consider in applications to quantitative risk assessment in developmental toxicology. More importantly, neither the folded logistic

3 3 nor the q power family are able to accommodate individual level covariates. Such covariates are present in epidemiological studies where responses from members of the same family can depend on explanatory variables specific to each individual. Also, in teratological studies of risk assessment, the incidence of malformations in litter members is linked not only to the toxicant dose to which the mother has been exposed (a cluster level covariate), but also to individual characteristics such as birth weight (Ryan et al., 1991; Catalano and Ryan, 1992; Chen, 1993). Thus there is a need for richer models which allow individual level covariates, while keeping the intracluster association structure fairly flexible. In this paper we examine in detail the use of a multivariate latent threshold model (Ashford and Sowden, 1970) for the analysis of exchangeable binary data with covariates. The binary responses are seen as indicators of the event that some unobserved latent variables exceed a threshold value of zero. A multivariate normal distribution is assumed for the unobserved variables, and exchangeability is ensured by appropriately parameterizing the variance covariance matrix. This leads to an exchangeable multivariate probit model, whereby both cluster and individual level covariates may be naturally included by assuming a linear form of the multivariate normal mean. The model is flexible enough to allow a reasonably wide range of correlation values, and in particular negative correlation between the binary responses can also be modelled. This allows the presence of either cooperative or competitive effects. Another important advantage of this model is that its parameters retain the same interpretation irrespective of cluster size. This is an important consideration when clusters are of varying sizes the interpretability assumption as discussed by Stefanescu and Turnbull (2003). Likelihood methods are an attractive option for inference (e.g. Ochi and Prentice, 1984; Chan and Kuk, 1997; McCulloch, 1994), but they are computationally difficult due to the intractability of the expressions obtained by integrating out the latent variables (Chib and Greenberg, 1998). As an alternative, we propose the use of a Bayesian framework for estimation of the model parameters. Given the underlying hierarchical structure, this is a natural approach to estimation and may be implemented using data augmentation (Tanner

4 4 C. Stefanescu and B. Turnbull: Multivariate Probit Model and Wong, 1987). Simulating from the posterior distributions of parameters can be accomplished using the Gibbs sampling algorithm, as described in Section 3. For the Bayesian model, Albert and Chib (1993) developed a framework for estimation through data augmentation for independent binary data, of which the univariate probit is a special case. Chib and Greenberg (1998) and Edwards and Allenby (2003) investigate Bayesian estimation of the multivariate probit model with a general correlation matrix. Both of these papers focus on the case when clusters have equal sizes. However, this is often not the case in epidemiological or teratological applications where clusters are, for example, families or litters of varying sizes. It is then necessary to impose some structure on the correlation matrix. One way is to specify a variance components model and Sorensen et al. (1995) applied such a model to an animal breeding study of genetic heritability. The alternative that we consider here consists in specifying an exchangeable structure (after adjustments for covariates), which is a very natural one in many applications. The main contributions of this paper consist in investigating the use of multivariate threshold latent models for the analysis of exchangeable binary data with individual level covariates, and developing a Bayesian framework for estimation of these models with samples of varying cluster sizes. The paper is structured as follows: Section 2 reviews the multivariate probit model for clustered binary variables with covariates. The Bayesian framework for estimation is developed in Section 3. Two applications are presented in Section 4: a study of familial disease aggregation and a teratology experiment concerning malformations in mouse litters. Several directions for future research are outlined in a concluding section. 2 Threshold latent variable models for exchangeable binary data Let (Y 1,..., Y r ) be a cluster of r binary random variables, where Y i = 0 ( failure ) or 1 ( success ), i = 1,..., r, and let x 1,..., x r R p denote corresponding vectors of covariates. Some of these covariates may be cluster level, in which case the corresponding components of x 1,..., x r are all equal. In particular, the first component of each x will typically be one, indicating the presence of an intercept term.

5 5 A flexible class of binary data models may be obtained by assuming that the response Y i (1 i r) is an indicator of the event that some unobserved continuous variable, Z i say, exceeds a threshold which can be taken to be zero without loss of generality. Specifically, let Z 1,..., Z r be latent continuous variables and assume that Y i = I(Z i > 0), for i = 1,..., r, Z = Xβ + ε, ε N r (0, Σ r ) (1) Here X is the r p matrix of covariates with rows x 1,..., x r, the vector of regression parameters is β R p, and ε is the vector of error terms assumed to have an exchangeable multivariate normal distribution with mean 0 and covariance matrix Σ r, given by 1 ρ... ρ ρ 1... ρ Σ r =., ρ 1.. r 1 ρ ρ... 1 (2) For identifiability reasons, we can take the common variance of ε to be one see, for example, Edwards and Allenby (2003). Note that, while the normality assumption for ε is motivated by mathematical convenience, other choices for the error distribution are also possible. For example, ε may have a multivariate t or logistic distribution. The specification (1) leads to a flexible model with the property that the binary responses for individuals with common covariate values are exchangeable. The marginal probability of success with covariate vector x is given by p(x) = E(Y ; x) = Pr(Y = 1; x) = Pr(Z > 0; x) = = Pr(xβ + ε > 0) = Pr(ε > xβ) = 1 Φ( xβ), (3) where Φ( ) is the standard normal cdf. Hence V ar(y ; x) = p(x){1 p(x)} = {1 Φ( xβ)}φ( xβ). (4)

6 6 C. Stefanescu and B. Turnbull: Multivariate Probit Model and E(Y i Y j ; x i, x j ) = Pr(Y i = 1, Y j = 1; x i, x j ) = Pr(Z i > 0, Z j > 0; x i, x j ) = Pr(ε i > x i β, ε j > x j β) = L( x i β, x j β, ρ), (5) say, using the standard notation L(h, k, ρ) for the upper quadrant probability of a standard bivariate normal see e.g. Kotz et al. (2000, p.264). The function L(h, k, ρ) has been extensively tabulated, e.g. U.S. National Bureau of Standards (1959). Alternatively, it can be computed from L(h, k, ρ) = h { ( )} k ρx 1 Φ φ(x) dx, (6) 1 ρ 2 where φ( ) is the standard normal pdf (Kotz et al., 2000, Eqn (46.46)). Following Bahadur (1961), the correlation of order l (l = 2,..., r) is defined by ρ l (x 1,..., x l ) = E( l i=1 Y i) l i=1 p(x i) l. i=1 p(x i){1 p(x i )} Here p(x) is given by (3) and E( l i=1 Y i) is the upper orthant probability of an l variate normal distribution, the obvious generalization of (5). This can be evaluated using the methods of Schervish (1984) or Genz (1992). Of course, by exchangeability, ρ l (x 1,..., x l ) = ρ l (x i1,..., x il ) for any choice of l distinct subscripts from {1,..., r}. In particular for l = 2, the correlation Corr(Y i, Y j ; x i, x j ) can be computed directly from (5), and the model can accommodate some limited negative intracluster dependence as well. In general, as expected with binary variates, their correlation Corr(Y i, Y j ; x i, x j ) depends on the correlation ρ of the latent variables {Z i } but also on their means (x i β, x j β). Table 1 shows how this correlation varies as a function of ρ and xβ for a cluster level covariate x i = x j = x, say. Note that the correlation depends on xβ through its absolute value. Both positive and negative correlation can be exhibited although only ρ values exceeding (r 1) 1 are feasible for clusters of size r. Note also that, in the absence of covariates, Corr(Y i, Y j ; 0, 0) = 2 π arcsin ρ which can take values in [ 2 π arcsin( 1 r 1 ), 1].

7 7 Another measure of association between the pair (Y i, Y j ) is the odds ratio: Ψ(x i, x j ) = Pr(Y i = 1, Y j = 1; x i, x j ) Pr(Y i = 0, Y j = 0; x i, x j ) Pr(Y i = 1, Y j = 0; x i, x j ) Pr(Y i = 0, Y j = 1; x i, x j ) (7) This can be evaluated using (5) and the expressions: Pr(Y i = 0, Y j = 0; x i, x j ) = L( x i β, x j β, ρ) + Φ( x i β) + Φ( x j β) 1 Pr(Y i = 1, Y j = 0; x i, x j ) = 1 Φ( x i β) L( x i β, x j β, ρ) The dependence of the odds ratio on the correlation ρ and mean xβ is also displayed in Table 1. [Table 1] 3 Bayesian estimation Suppose that K independent clusters of varying sizes are available for inference. Let Z k1,..., Z krk denote the latent data and let (Y k1, x k1 ),..., (Y krk, x krk ) be the responses and covariates in the k th cluster. Denote the maximum cluster size by R = max{r k ; 1 k K}. We impose the constraint ρ > 1/(R 1) in order to ensure that Σ r is positive definite for all 1 r R. The likelihood of the parameters β and ρ given the observed and latent data {y k = (y k1,..., y krk ), Z k = (Z k1,..., Z krk )} is a product of K cluster likelihoods: K L(β, ρ {y k }, {x k }, {Z k }) = φ rk (Z k ; X k β, ρ) k=1 K r k k=1 j=1 {I(y kj = 0)I(Z kj 0) + I(y kj = 1)I(Z kj > 0)}, (8) where φ r ( ; µ, ρ) is the density function of the r variate normal distribution with mean µ and covariance matrix Σ r given by (2). The likelihood given the observed data is now obtained by integrating out the latent variables, and its expression involves high dimensional integrals which are not numerically tractable. To alleviate the computational difficulties of the likelihood approach, Chan and Kuk (1997) use the EM algorithm for estimation with non exchangeable data.

8 8 C. Stefanescu and B. Turnbull: Multivariate Probit Model We propose instead the use of data augmentation as advocated by Tanner and Wong (1987) and Albert and Chib (1993). Independent prior densities p(β) and p(ρ) are specified for the parameters, then the joint posterior density p(β, ρ, {Z k } {y k }) of β, ρ and {Z k } is proportional to the product of the prior and the augmented likelihood (8), i.e. : { K } p(β) p(ρ) φ rk (Z k ; X k β, ρ) k=1 K r k k=1 j=1 {I(y kj = 0)I(Z kj 0) + I(y kj = 1)I(Z kj > 0)} (9) It is difficult to sample directly from (9), but the Gibbs sampling algorithm can be used to compute the marginal posterior distributions of β and ρ. The Gibbs sampler (Geman and Geman, 1984; Gelfand and Smith, 1990) is an iterative algorithm for generation of samples from a multivariate distribution. It proceeds by successively updating each variable by sampling from its conditional distribution given current values of all other variables. After a sufficiently large number of iterations, under mild conditions it can be proven that the values of the updated variables so obtained form a sample from the joint distribution see for example, Robert and Casella (1999). The Gibbs sampler requires all the posterior conditional distributions, and for model (1) these can be derived based on the joint posterior (9): The random variables Z 1,..., Z K are independent, and Z k y k, x k, β, ρ has a truncated multivariate normal distribution N rk (X k β, Σ rk ), where Z ki is truncated at the left by 0 if y ki = 1, and it is truncated at the right by 0 if y ki = 0, i = 1,..., r k. The posterior distribution of β given {y k }, {x k }, {Z k } and ρ and under the assumption of a diffuse prior for p(β), is multivariate normal N p ( β Z,Σ, (X Σ 1 X) 1 ), where Z = (Z 1,..., Z k ), X = (X 1,..., X k ), βz,σ = (X Σ 1 X) 1 X Σ 1 Z, and Σ is a block diagonal matrix with blocks Σ rk given by (2), k = 1,..., K. This follows from standard linear model results (Lindley and Smith, 1972).

9 9 The posterior distribution of ρ conditional on {y k }, {x k }, {Z k }, β has density proportional to p(ρ) (1 ρ) 1 2 P K k=1 (r k 1) exp [ K k=1 K {1 + (r k 1)ρ} 1 2 k=1 {1 + (r k 3)ρ}S k2 + ρs 2 k1 2(1 ρ) r k 1 {1 + (r k 1)ρ} ], (10) where S k1 = r k j=1 {Z kj (x k β) kj } and S k2 = r k j=1 {Z kj (x k β) kj } 2 for k = 1,..., K. To see this, note that it follows from (9) that the posterior conditional distribution of ρ is proportional to { K } p(ρ) φ rk (Z k ; X k β, ρ) k=1 p(ρ) K k=1 1 (2π) r k/2 Σ rk { exp 1 } 1/2 2 (Z k X k β) Σ 1 r k (Z k X k β). (11) But Σ r = (1 ρ) r 1 {1 + (r 1)ρ}, and Σ 1 1 r = (1 ρ) r 1 {1 + (r 1)ρ} [I r + ρ{1 r + (r 3)I r }], where I r is the identity matrix of order r and 1 r is the r r matrix of ones. Hence (10) may now be derived from (11). Note that it is easy to sample from the conditional posterior distributions of β and {Z k }. For example, sampling from the truncated normal distribution of Z k may be accomplished through a series of univariate draws using an inverse cdf method (Edwards and Allenby, 2003). The conditional posterior distribution of ρ is more difficult to simulate, however sampling may still be implemented using a griddy Gibbs approach (Ritter and Tanner, 1992). 4 Applications 4.1 Familial aggregation of liver cancer The aggregation of a specific disease within families may indicate potential factors contributing to disease etiology (Khoury, Beaty and Cohen, 1993), and it is therefore of major interest to genetic epidemiologists.

10 10 C. Stefanescu and B. Turnbull: Multivariate Probit Model Studies of familial aggregation of disease commonly use the case control design, and a cluster is formed by members of the same family. The degree of familial association of risk of disease may be quantified either by the correlation of responses within a cluster or by the cluster odds ratio. The threshold model (1) finds a natural application in studies of familial disease aggregation, whenever it is reasonable to assume exchangeable responses within a family after adjusting for individual factors. This is often the assumption underlying case control studies. Using the methods of Section 3, the intracluster correlation ρ Y (x i, x j ) can be estimated together with the marginal probability of becoming diseased p(x) = Pr(Y i = 1; x) and with the odds ratio Ψ(x i, x j ). As an example, we analyze the data from a genetic epidemiologic study on liver cancer in Shanghai reported by Liang and Beaty (1991). The data set comprised 347 relatives of 138 cases of primary hepatic carcinoma. A cluster in this analysis is a family, and Table 2 presents the frequency of liver cancer among families of varying size. This table is a slightly corrected version of the Table II that appears in Liang and Beaty (1991) K-Y. Liang (private communication). The binary response indicates whether each relative of a case had contracted the disease. The covariates available for analysis are gender (coded 1 for males and 0 for females), age (coded 1 if under 40 years and 0 otherwise), presence of corn in the usual diet (1 = yes, 0 = no), the source of drinking water (1 if river, 0 otherwise), and presence of antibodies to the hepatitis B virus (HBV, coded 1 = yes, 0 = no). Some of these covariates act at individual level, others at cluster level, and all are binary. Previous analysis of these data (Liang and Beaty, 1991) used a logistic regression model to account for individual effects on the risk of liver cancer. We fit two models of the form (1) to the liver data using the methods from Section 3. The first model includes all five covariates, the second model only takes into account the gender and age covariates. We have implemented the Gibbs sampling algorithm using the WinBUGS software (Spiegelhalter et al., 2003). [Table 2]

11 11 Diffuse but proper priors were specified for all parameters, however other priors are also possible (Turner, Omar and Thompson, 2001). We chose auniform prior U( 1/5, 1) for the correlation ρ Z, and N(0, σ 2 ) priors for the covariate effects β i. To investigate the impact of the choice of prior variance, we carried out a sensitivity analysis by running the chains with different values for σ 2 ranging between 10 3 and The results did not change significantly, hence we report the summary statistics based on the runs with σ 2 = Three parallel chains were started from different sets of initial values, and the Gibbs samplers were run for iterations with the first 5000 iterations discarded as burn in period. Gelman and Rubin s diagnostic (Cowles and Carlin, 1996) indicated satisfactory convergence of all chains. From the pooled sample of three chains we saved the values with a thinning interval of 10, which resulted in 7500 sampling values available for inference for each parameter. [Table 3] The results of our analysis are summarized in Table 3, which reports the mean, standard deviation and median of the marginal posterior distributions of the parameters, as well as the 95% credible intervals. In the first model where all covariates were included in the analysis, it appears that the HBV, diet and water source variables are not significant. Hence we fitted the second model with only the intercept, age and gender variables retained in the analysis. [Table 4] For this reduced second model, the latent correlation ρ is estimated at 0.77 with a 95% credible interval (0.605, 0.890), providing strong evidence of clustering within family. Table 4 reports the medians of the posterior distributions of p(x), Ψ(x i, x j ) and ρ Y (x i, x j ). The 95% credible intervals for these estimates have been obtained from the percentiles of the posterior distributions. The estimated odds ratios range between and for this data set suggesting that the risk of a family member is greatly increased if his/her relative is diseased rather than disease free. The estimates of the gender effect β 1 and age effect β 2 are negative and significantly different from zero. The analysis therefore provides evidence that there is

12 12 C. Stefanescu and B. Turnbull: Multivariate Probit Model a smaller risk of becoming diseased for males, and is consistent with the previous findings that liver cancer tends to occur in older age. These results are also in agreement with the conclusions of Liang and Beaty, who found a strong degree of familial aggregation and negative regression coefficients for the gender and age covariates in all the logistic regression models considered. 4.2 Developmental toxicology data A typical developmental toxicology study involves randomly assigning pregnant animals to groups which receive different doses of a potential toxic substance. The animals are then sacrificed near term and the fetuses in each litter are examined. One aim of these studies is to assess the relationship between dose level exposure and the incidence of malformations in litters, recorded as clustered binary data. To illustrate the application of the multivariate probit model, we analyze a data set from a study of exposure to the herbicide 2,4,5 trichlorophenoxyacetic acid conducted at the National Center for Toxicological Research. The data are described in Table 1 of Ahn and Chen (1997). There are 389 litters ranging in size from 1 to 13, which have been exposed to one of seven toxicant doses (0, 15, 20, 25, 30, 45, and 60 mg/kg/day). Besides dose, the data recorded include the incidence of cleft palate malformations and the fetal weight for each individual in the litter. In fact, Ahn and Chen (1997) only report fetal weight averaged over each litter, since this is all they used in their fitting of a tree structured logistic model to this data. However Dr. Chen kindly made available to us the full data set which included the weights of the individual fetuses. We fit the multivariate probit model (1) to the teratology data using weight as an individual level covariate, and dose and the indicator that dose is greater than zero as cluster level covariates. The relationship between fetal weight and malformation incidence has been documented empirically (Ryan et al., 1991; Chen, 1993). The inclusion of the indicator variable was prompted by Ahn and Chen s analysis of this data set which suggests that the control group behaves quite differently from the other dose groups, and also by the findings of the analysis of a similar data set in Kuk (2004).

13 13 We chose diffuse and proper priors for all parameters a uniform prior U( 1/12, 1) was specified for the correlation ρ Z, and N(0, σ 2 ) priors for the covariate effects β i. The summary statistics reported in Table 5 are based on prior variance σ 2 = 10 3, however the results are very similar for different choices of prior variances ranging between 10 3 and The chain converged rapidly and the sample autocorrelations were relatively low. The Gibbs sampler was run for 6000 iterations with the first 1000 iterations discarded as burn in period, resulting in 5000 sampling values available for inference for each parameter. The estimates from fitting model (1) with the methods from Section 3 are summarized in Table 5. All covariates have significant impact on the individual probability of malformation which, as expected, increases with dose and decreases with weight. The estimated frailty correlation is ρ = 0.369, suggesting a significant clustering effect of malformation incidence within litters. [Table 5] Ahn and Chen (1997) fit several logistic regression tree models to this data set, using dose as a cluster level covariate. Fetal weight is also included in their analysis. However, instead of considering individual weights as covariates, Ahn and Chen use the average cluster weight (a cluster level measure) which is regressed on dose level. The residuals from this linear regression are then used as predictors in a logistic regression model for the probability of malformation. Table 6 gives the estimated malformation frequencies and rates, using the multivariate probit model (1) and a logistic regression tree model investigated by Ahn and Chen (1997). It can be seen that the predicted rates of the tree-based model are closer to those observed for the lower dose groups, while the reverse is true for the higher doses. This equivocal finding may seem surprising given that the probit model can take advantage of the individual weights information. However it should be realized that the tree structured model is somewhat semi parametric and so it should not be surprising that the splitting procedure can produce predicted rates close to those observed. [Table 6]

14 14 C. Stefanescu and B. Turnbull: Multivariate Probit Model An alternate approach is to view malformation incidence and weight as a bivariate response, and this is also possible via an extension of the multivariate latent variable model. We will not discuss this further here see Regan and Catalano (1999) and Geys et al. (2001) who consider likelihood and GEE approaches respectively, and the comment in the next section. 5 Conclusions In this paper we have investigated the exchangeable multivariate probit model and proposed a Bayesian framework for estimation. The model belongs to the class of cluster specific approaches for modelling correlated data, as opposed to population average approaches of which the most common example are the GEE type methods (Zeger, Liang and Albert, 1988; Liang, Zeger and Qaqish, 1992). This latter class of models focus on the marginal expectation of the response, treat the existence of any random parameters as a nuisance, and thus are useful when interest lies only in the fixed parameters. By contrast, the cluster specific approaches such as the multivariate probit model, focus on the conditional expectation given the cluster specific random effect. The exchangeable multivariate probit model has several attractive features which make it particularly suitable for the analysis of correlated binary data. First, the model is a generalization of the ordinary probit model. As a way of relating stimulus and response, the probit model is a natural choice in situations where such an interpretation for a threshold approach is readily available; examples include attitude measurement, assigning pass/fail gradings for an examination based on a mark cut off, and categorization of illness severity based on an underlying continuous scale (Goldstein, 2003, p.107). Second, the connection to the Gaussian distribution in the specification of the multivariate probit model allows for flexible modelling of the association structure and straightforward interpretation of the parameters. For example, the model is particularly attractive in marketing research of consumer choice, because the latent correlations capture the cross dependencies in latent utilities across different items. Also, within the class of cluster specific approaches, the exchangeable multivariate probit model is more flexible than other fully specified

15 15 models (such as the beta binomial) which use compound distributions to account for overdispersion in the data. This is due to the fact that both underdispersion and overdispersion can be accommodated in the multivariate probit model through the flexible underlying covariance structure. Finally, due to the underlying threshold approach, the multivariate probit model extends to analysis of clustered mixed binary and continuous data, or of multivariate binary data (Ryan and Catalano, 1999; Geys et al., 2001). An extension to multiple threshholds leads to a model for clustered ordinal data (Sorensen et al., 1995). The Bayesian framework is a natural approach for estimation of the exchangeable multivariate probit model. Generic prior distributions may be used to incorporate prior information when this is available for example, it is often the case in clinical studies that some valuable prior information is available on the degree of intracluster association (Turner et al., 2001). The Markov chain Monte Carlo methods used to implement the Bayesian approach are particularly useful in models where some structure is imposed on the covariance matrix, because the Gibbs sampler simulates conditional distributions sequentially and so does not have to adjust to the structure of the model. By contrast, likelihood methods usually involve constructing large constrained variance matrices for these models. Finally, Markov chain Monte Carlo methods yield inferences based upon samples from the full posterior distribution, therefore they can give accurate interval estimates for non Gaussian error distributions as well, and allow exact inference in cases where the likelihood based methods yield approximations. Acknowledgements The authors are very grateful to Professor K-Y Liang for providing the liver cancer data used in Section 4.1, and to Dr. James Chen for the teratology data in Section 4.2. This research was supported in part by grant R01 CA66218 from the U.S. National Institutes of Health and by an RAMD grant from London Business School.

16 16 C. Stefanescu and B. Turnbull: Multivariate Probit Model References Aerts, M., Geys, H., Molenberghs, G. and Ryan, L., eds. (2002). Topics in Modelling of Clustered Data. Chapman and Hall, London. Ahn, H. and Chen, J.J. (1997). Tree structured logistic model for over dispersed binomial data with applications to modeling developmental effects. Biometrics 53, Albert, J.H. and Chib, S. (1997). Bayesian analysis of binary and polychotomous response data. Journal of the American Statistical Association 88, Ashford, J.R. and Sowden, R.R. (1970). Multi variate probit analysis. Biometrics 26, Bahadur, R. R. (1961). A representation of the joint distribution of responses to n dichotomous items. In Studies in Item Analysis and Prediction, H. Solomon (ed), Stanford University Press, California. Begg, M.D. and Parides, M.K. (2003). Separation of individual level and cluster level covariate effects in regression analysis of correlated data. Statistics in Medicine 22, Bowman, D. and George, E.O. (1995). A saturated model for analyzing exchangeable binary data: Applications to clinical and developmental toxicity studies. Journal of the American Statistical Association 90, Catalano, P.J. and Ryan, L.M. (1993). Bivariate latent variable models for clustered discrete and continuous outcomes. Journal of the American Statistical Association 87, Chan, J.S.K. and Kuk, A.Y.C. (1997). Maximum likelihood estimation for probit linear mixed models with correlated random effects. Biometrics 53, Chen, J. (1993). A malformation incidence dose response model incorporating fetal weight and/or litter size as covariates. Risk Analysis 13, Chib, S. and Greenberg, E. (1998). Analysis of multivariate probit models. Biometrika 85,

17 17 Cowles, M.K. and Carlin, B.P. (1996). Markov chain Monte Carlo convergence diagnostics: A comparative review. Journal of the American Statistical Association 91, Edwards, Y.D. and Allenby, G.M. (2003). Multivariate analysis of multiple response data. Journal of Marketing Research 40, Gelfand, A.E. and Smith, A.F.M. (1990). Sampling based approaches to calculating marginal densities. Journal of the American Statistical Association 85, Geman, S. and Geman, D. (1984). Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Transactions on Pattern Analysis and Machine Intelligence 6, George, E.O. and Bowman, D. (1995). A full likelihood procedure for analysing exchangeable binary data. Biometrics 51, George, E.O. and Kodell R.L. (1996). Tests of independence, treatment heterogeneity, and dose related trend with exchangeable binary data. Journal of the American Statistical Society 91, Genz, A. (1992). Numerical computation of the multivariate normal probabilities. Journal of Computational and Graphical Statistics 1, Geys, H., Regan, M.M., Catalano, P.J. and Molenberghs, G. (2001). Two latent variable risk assessment approaches for mixed continuous and discrete outcomes from developmental toxicity data Journal of Agricultural, Biological and Environmental Statistics, 6, Goldstein, H. (2003). Multilevel Statistical Models. 3rd Edition, Arnold, London. Khoury, M.J., Beaty, T.H. and Cohen, B.H. (1993) Fundamentals of Genetic Epidemiology. Oxford University Press, New York. Kotz, S., Balakrishnan, N. and Johnson, N.L. (2000). Continuous Multivariate Distributions. Volume 1: Models and Applications, 2nd Edition. Wiley, New York. Kuk, A.Y.C. (2004). A litter based approach to risk assessment in developmental toxicity studies via a power family of completely monotone functions. Applied Statistics 53,

18 18 C. Stefanescu and B. Turnbull: Multivariate Probit Model Liang, K. Y. and Beaty, T. H. (1991). Measuring familial aggregation by using odds ratio regression models. Genetic Epidemiology 8, Liang, K.Y., Zeger, S.L. and Qaqish, B. (1992). Multivariate regression analyses for categorical data. Journal of the Royal Statistical Society, Series B 54, Lindley, D.V. and Smith, A.F.M. (1972). Bayes estimates for the linear model. Journal of the Royal Statistical Society, Series B 135, McCulloch, C.E. (1994). Maximum likelihood variance components estimation for binary data. Journal of the American Statistical Association 89, Ochi, Y. and Prentice, R.L. (1984). Likelihood inference in a correlated probit regression model. Biometrika 71, Pendergast, J. F., Gange, S. J., Newton, M. A., Lindstrom, M. J., Palta, M. and Fisher, M. R. (1996). A survey of methods for analyzing clustered binary response data. International Statistical Review 64, Regan, M.M. and Catalano, P.J. (1999). Likelihood models for clustered binary and continuous outcomes: application to developmental toxicology. Biometrics 55, Ritter, C. and Tanner, M.A. (1992). The Gibbs stopper and the griddy Gibbs sampler. Journal of the American Statistical Association 87, Robert, C.P. and Casella, G. (1999). Monte Carlo Statistical Methods. Springer, New York. Ryan, L.M., Catalano, P.J., Kimmel, C.A. and Kimmel, G.L. (1991). Relationship between fetal weight and malformation in developmental toxicity studies. Teratology 44, Schervish, M.J. (1984). Algorithm AS 195: Multivariate normal probabilities with error bound. Applied Statistics 33, (Correction 34 (1985), ) Sorensen, D.A., Andersen,S,, Gianola, D. and Korsgaard, I. (1995). Bayesian inference in threshold models using Gibbs sampling. Genetics, Selection, Evolution 27,

19 19 Spiegelhalter, D.J., Thomas, A., Best, N.G. and Lunn, D. (2003). WinBUGS Version 1.4 User Manual. MRC Biostatistics Unit, Cambridge. Stefanescu, C. and Turnbull, B. W. (2003). Likelihood inference for exchangeable binary data with varying cluster sizes. Biometrics 59, Tanner, T.A. and Wong, W.H. (1987). The calculation of posterior distributions by data augmentation. Journal of the American Statistical Association 82, Turner, R.M., Omar, R.Z. and Thompson, S.G. (2001). Bayesian methods of analysis for cluster randomized trials with binary outcome data. Statistics in Medicine 20, U.S. National Bureau of Standards (1959). Tables of the Bivariate Normal Distribution Function and Related Functions. U.S. Government Printing Office, Washington DC. Xu, J.L. and Prorok, P.C. (2003). Modelling and analysing exchangeable binary data with random cluster sizes. Statistics in Medicine 22, Zeger, S.L., Liang, K.Y. and Albert, P.S. (1988). Models for longitudinal data: A generalized estimating equations approach. Biometrics 44,

20 20 C. Stefanescu and B. Turnbull: Multivariate Probit Model Table 1 Values of correlation and odds ratio of binary variates in a multivariate probit model, for selected values of the absolute values of the means x and correlation ρ of the latent variables. Corr(Y i, Y j ; x, x) Ψ(x, x) xβ xβ ρ

21 21 Table 2 Frequency of liver cancer by size of the family (Liang and Beaty, 1991). Family Number of cases of liver cancer size Total Total

22 22 C. Stefanescu and B. Turnbull: Multivariate Probit Model Table 3 Bayesian estimates for liver cancer data. Model 1 Explanatory Mean Median Standard 95% Credible variable error intervals Intercept (β 0 ) ( 1.307, 0.253) Gender (β 1 ) ( 0.670, 0.066) Age (β 2 ) ( 1.018, 0.292) HBV (β 3 ) ( 0.076, 1.100) Diet (β 4 ) ( 0.738, 0.163) Water source (β 5 ) ( 0.567, 0.377) ρ (0.607, 0.897) Model 2 Intercept (β 0 ) ( 0.882, 0.416) Gender (β 1 ) ( 0.676, 0.076) Age (β 2 ) ( 1.002, 0.293) ρ (0.605, 0.890)

23 23 Table 4 Estimates of p(x), Ψ(x i, x j) and ρ Y (x i, x j) with 95% credible intervals. (Model 2) female female male male over 40 yrs under 40 yrs over 40 yrs under 40 yrs x (1 0 0) (1 0 1) (1 1 0) (1 1 1) p(x) (0.189, 0.339) (0.046, 0.178) (0.091, 0.237) (0.018, 0.107) Estimates of Ψ(x i, x j ) (1 0 0) (6.50, 36.97) (8.35, 97.71) (7.30, 52.25) (10.68, ) (1 0 1) (8.98, 56.72) (8.56, 65.92) (10.84, 85.49) (1 1 0) (7.63, 45.61) (10.36, ) (1 1 1) (12.15, 83.81) Estimates of ρ Y (x i, x j ) (1 0 0) (0.39, 0.68) (0.30, 0.58) (0.35, 0.64) (0.20, 0.49) (1 0 1) (0.31, 0.65) (0.31, 0.63) (0.27, 0.60) (1 1 0) (0.35, 0.66) (0.25, 0.55) (1 1 1) 0.44 (0.26, 0.62)

24 24 C. Stefanescu and B. Turnbull: Multivariate Probit Model Table 5 Bayesian estimates for teratology data. Explanatory Mean Median Standard 95% Credible variable error intervals Intercept (β 0 ) (1.034, 2.005) Dose (β 1 ) (0.0445, ) I(dose > 0) (β 2 ) ( 1.032, 0.374) Weight (β 3 ) ( , ) ρ (0.294, 0.441) Table 6 Comparison of multivariate probit model and tree structured model with regard to predicted number of malformations and malformation rates at each dose level for teratology data. Tree model is model 2 from Ahn and Chen (1997, Table 4, last column). Dose Number of malformations Malformation rates Observed Probit Tree model Observed Probit Tree model

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

More information

Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University

Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University A SURVEY OF VARIANCE COMPONENTS ESTIMATION FROM BINARY DATA by Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University BU-1211-M May 1993 ABSTRACT The basic problem of variance components

More information

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns

More information

eqr094: Hierarchical MCMC for Bayesian System Reliability

eqr094: Hierarchical MCMC for Bayesian System Reliability eqr094: Hierarchical MCMC for Bayesian System Reliability Alyson G. Wilson Statistical Sciences Group, Los Alamos National Laboratory P.O. Box 1663, MS F600 Los Alamos, NM 87545 USA Phone: 505-667-9167

More information

STA 216, GLM, Lecture 16. October 29, 2007

STA 216, GLM, Lecture 16. October 29, 2007 STA 216, GLM, Lecture 16 October 29, 2007 Efficient Posterior Computation in Factor Models Underlying Normal Models Generalized Latent Trait Models Formulation Genetic Epidemiology Illustration Structural

More information

Bayesian Multivariate Logistic Regression

Bayesian Multivariate Logistic Regression Bayesian Multivariate Logistic Regression Sean M. O Brien and David B. Dunson Biostatistics Branch National Institute of Environmental Health Sciences Research Triangle Park, NC 1 Goals Brief review of

More information

Gibbs Sampling in Latent Variable Models #1

Gibbs Sampling in Latent Variable Models #1 Gibbs Sampling in Latent Variable Models #1 Econ 690 Purdue University Outline 1 Data augmentation 2 Probit Model Probit Application A Panel Probit Panel Probit 3 The Tobit Model Example: Female Labor

More information

Repeated ordinal measurements: a generalised estimating equation approach

Repeated ordinal measurements: a generalised estimating equation approach Repeated ordinal measurements: a generalised estimating equation approach David Clayton MRC Biostatistics Unit 5, Shaftesbury Road Cambridge CB2 2BW April 7, 1992 Abstract Cumulative logit and related

More information

Multivariate Frailty Models for Exchangeable Survival Data With Covariates

Multivariate Frailty Models for Exchangeable Survival Data With Covariates Multivariate Frailty Models for Exchangeable Survival Data With Covariates Catalina STEFANESCU London Business School London NW1 4SA United Kingdom (cstefanescu@london.edu) Bruce W. TURNBULL School of

More information

Lecture 5: Spatial probit models. James P. LeSage University of Toledo Department of Economics Toledo, OH

Lecture 5: Spatial probit models. James P. LeSage University of Toledo Department of Economics Toledo, OH Lecture 5: Spatial probit models James P. LeSage University of Toledo Department of Economics Toledo, OH 43606 jlesage@spatial-econometrics.com March 2004 1 A Bayesian spatial probit model with individual

More information

Default Priors and Effcient Posterior Computation in Bayesian

Default Priors and Effcient Posterior Computation in Bayesian Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature

More information

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Jonathan Gruhl March 18, 2010 1 Introduction Researchers commonly apply item response theory (IRT) models to binary and ordinal

More information

Bayesian Hypothesis Testing in GLMs: One-Sided and Ordered Alternatives. 1(w i = h + 1)β h + ɛ i,

Bayesian Hypothesis Testing in GLMs: One-Sided and Ordered Alternatives. 1(w i = h + 1)β h + ɛ i, Bayesian Hypothesis Testing in GLMs: One-Sided and Ordered Alternatives Often interest may focus on comparing a null hypothesis of no difference between groups to an ordered restricted alternative. For

More information

Markov Chain Monte Carlo methods

Markov Chain Monte Carlo methods Markov Chain Monte Carlo methods By Oleg Makhnin 1 Introduction a b c M = d e f g h i 0 f(x)dx 1.1 Motivation 1.1.1 Just here Supresses numbering 1.1.2 After this 1.2 Literature 2 Method 2.1 New math As

More information

Bayes methods for categorical data. April 25, 2017

Bayes methods for categorical data. April 25, 2017 Bayes methods for categorical data April 25, 2017 Motivation for joint probability models Increasing interest in high-dimensional data in broad applications Focus may be on prediction, variable selection,

More information

Markov Chain Monte Carlo in Practice

Markov Chain Monte Carlo in Practice Markov Chain Monte Carlo in Practice Edited by W.R. Gilks Medical Research Council Biostatistics Unit Cambridge UK S. Richardson French National Institute for Health and Medical Research Vilejuif France

More information

Multivariate Versus Multinomial Probit: When are Binary Decisions Made Separately also Jointly Optimal?

Multivariate Versus Multinomial Probit: When are Binary Decisions Made Separately also Jointly Optimal? Multivariate Versus Multinomial Probit: When are Binary Decisions Made Separately also Jointly Optimal? Dale J. Poirier and Deven Kapadia University of California, Irvine March 10, 2012 Abstract We provide

More information

Bayesian Inference in the Multivariate Probit Model

Bayesian Inference in the Multivariate Probit Model Bayesian Inference in the Multivariate Probit Model Estimation of the Correlation Matrix by Aline Tabet A THESIS SUBMITTED IN PARTIAL FULFILMENT OF THE REQUIREMENTS FOR THE DEGREE OF Master of Science

More information

A Fully Nonparametric Modeling Approach to. BNP Binary Regression

A Fully Nonparametric Modeling Approach to. BNP Binary Regression A Fully Nonparametric Modeling Approach to Binary Regression Maria Department of Applied Mathematics and Statistics University of California, Santa Cruz SBIES, April 27-28, 2012 Outline 1 2 3 Simulation

More information

MULTILEVEL IMPUTATION 1

MULTILEVEL IMPUTATION 1 MULTILEVEL IMPUTATION 1 Supplement B: MCMC Sampling Steps and Distributions for Two-Level Imputation This document gives technical details of the full conditional distributions used to draw regression

More information

Simulating Longer Vectors of Correlated Binary Random Variables via Multinomial Sampling

Simulating Longer Vectors of Correlated Binary Random Variables via Multinomial Sampling Simulating Longer Vectors of Correlated Binary Random Variables via Multinomial Sampling J. Shults a a Department of Biostatistics, University of Pennsylvania, PA 19104, USA (v4.0 released January 2015)

More information

Research Projects. Hanxiang Peng. March 4, Department of Mathematical Sciences Indiana University-Purdue University at Indianapolis

Research Projects. Hanxiang Peng. March 4, Department of Mathematical Sciences Indiana University-Purdue University at Indianapolis Hanxiang Department of Mathematical Sciences Indiana University-Purdue University at Indianapolis March 4, 2009 Outline Project I: Free Knot Spline Cox Model Project I: Free Knot Spline Cox Model Consider

More information

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University

More information

On a multivariate implementation of the Gibbs sampler

On a multivariate implementation of the Gibbs sampler Note On a multivariate implementation of the Gibbs sampler LA García-Cortés, D Sorensen* National Institute of Animal Science, Research Center Foulum, PB 39, DK-8830 Tjele, Denmark (Received 2 August 1995;

More information

Rank Regression with Normal Residuals using the Gibbs Sampler

Rank Regression with Normal Residuals using the Gibbs Sampler Rank Regression with Normal Residuals using the Gibbs Sampler Stephen P Smith email: hucklebird@aol.com, 2018 Abstract Yu (2000) described the use of the Gibbs sampler to estimate regression parameters

More information

A Bayesian Mixture Model with Application to Typhoon Rainfall Predictions in Taipei, Taiwan 1

A Bayesian Mixture Model with Application to Typhoon Rainfall Predictions in Taipei, Taiwan 1 Int. J. Contemp. Math. Sci., Vol. 2, 2007, no. 13, 639-648 A Bayesian Mixture Model with Application to Typhoon Rainfall Predictions in Taipei, Taiwan 1 Tsai-Hung Fan Graduate Institute of Statistics National

More information

Bayes Estimation in Meta-analysis using a linear model theorem

Bayes Estimation in Meta-analysis using a linear model theorem University of Wollongong Research Online Applied Statistics Education and Research Collaboration (ASEARC) - Conference Papers Faculty of Engineering and Information Sciences 2012 Bayes Estimation in Meta-analysis

More information

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Michael J. Daniels and Chenguang Wang Jan. 18, 2009 First, we would like to thank Joe and Geert for a carefully

More information

The STS Surgeon Composite Technical Appendix

The STS Surgeon Composite Technical Appendix The STS Surgeon Composite Technical Appendix Overview Surgeon-specific risk-adjusted operative operative mortality and major complication rates were estimated using a bivariate random-effects logistic

More information

Factor Analytic Models of Clustered Multivariate Data with Informative Censoring (refer to Dunson and Perreault, 2001, Biometrics 57, )

Factor Analytic Models of Clustered Multivariate Data with Informative Censoring (refer to Dunson and Perreault, 2001, Biometrics 57, ) Factor Analytic Models of Clustered Multivariate Data with Informative Censoring (refer to Dunson and Perreault, 2001, Biometrics 57, 302-308) Consider data in which multiple outcomes are collected for

More information

Marginal Specifications and a Gaussian Copula Estimation

Marginal Specifications and a Gaussian Copula Estimation Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required

More information

Generalized Linear Models for Non-Normal Data

Generalized Linear Models for Non-Normal Data Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture

More information

Reconstruction of individual patient data for meta analysis via Bayesian approach

Reconstruction of individual patient data for meta analysis via Bayesian approach Reconstruction of individual patient data for meta analysis via Bayesian approach Yusuke Yamaguchi, Wataru Sakamoto and Shingo Shirahata Graduate School of Engineering Science, Osaka University Masashi

More information

Variable Selection for Multivariate Logistic Regression Models

Variable Selection for Multivariate Logistic Regression Models Variable Selection for Multivariate Logistic Regression Models Ming-Hui Chen and Dipak K. Dey Journal of Statistical Planning and Inference, 111, 37-55 Abstract In this paper, we use multivariate logistic

More information

Online Appendix to: Marijuana on Main Street? Estimating Demand in Markets with Limited Access

Online Appendix to: Marijuana on Main Street? Estimating Demand in Markets with Limited Access Online Appendix to: Marijuana on Main Street? Estating Demand in Markets with Lited Access By Liana Jacobi and Michelle Sovinsky This appendix provides details on the estation methodology for various speci

More information

Simulation of truncated normal variables. Christian P. Robert LSTA, Université Pierre et Marie Curie, Paris

Simulation of truncated normal variables. Christian P. Robert LSTA, Université Pierre et Marie Curie, Paris Simulation of truncated normal variables Christian P. Robert LSTA, Université Pierre et Marie Curie, Paris Abstract arxiv:0907.4010v1 [stat.co] 23 Jul 2009 We provide in this paper simulation algorithms

More information

Contents. Part I: Fundamentals of Bayesian Inference 1

Contents. Part I: Fundamentals of Bayesian Inference 1 Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian

More information

Hierarchical Linear Models. Jeff Gill. University of Florida

Hierarchical Linear Models. Jeff Gill. University of Florida Hierarchical Linear Models Jeff Gill University of Florida I. ESSENTIAL DESCRIPTION OF HIERARCHICAL LINEAR MODELS II. SPECIAL CASES OF THE HLM III. THE GENERAL STRUCTURE OF THE HLM IV. ESTIMATION OF THE

More information

Figure 36: Respiratory infection versus time for the first 49 children.

Figure 36: Respiratory infection versus time for the first 49 children. y BINARY DATA MODELS We devote an entire chapter to binary data since such data are challenging, both in terms of modeling the dependence, and parameter interpretation. We again consider mixed effects

More information

Part 8: GLMs and Hierarchical LMs and GLMs

Part 8: GLMs and Hierarchical LMs and GLMs Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course

More information

COMPOSITIONAL IDEAS IN THE BAYESIAN ANALYSIS OF CATEGORICAL DATA WITH APPLICATION TO DOSE FINDING CLINICAL TRIALS

COMPOSITIONAL IDEAS IN THE BAYESIAN ANALYSIS OF CATEGORICAL DATA WITH APPLICATION TO DOSE FINDING CLINICAL TRIALS COMPOSITIONAL IDEAS IN THE BAYESIAN ANALYSIS OF CATEGORICAL DATA WITH APPLICATION TO DOSE FINDING CLINICAL TRIALS M. Gasparini and J. Eisele 2 Politecnico di Torino, Torino, Italy; mauro.gasparini@polito.it

More information

The Bayesian Approach to Multi-equation Econometric Model Estimation

The Bayesian Approach to Multi-equation Econometric Model Estimation Journal of Statistical and Econometric Methods, vol.3, no.1, 2014, 85-96 ISSN: 2241-0384 (print), 2241-0376 (online) Scienpress Ltd, 2014 The Bayesian Approach to Multi-equation Econometric Model Estimation

More information

Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements:

Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis. Acknowledgements: Tutorial 6: Tutorial on Translating between GLIMMPSE Power Analysis and Data Analysis Anna E. Barón, Keith E. Muller, Sarah M. Kreidler, and Deborah H. Glueck Acknowledgements: The project was supported

More information

,..., θ(2),..., θ(n)

,..., θ(2),..., θ(n) Likelihoods for Multivariate Binary Data Log-Linear Model We have 2 n 1 distinct probabilities, but we wish to consider formulations that allow more parsimonious descriptions as a function of covariates.

More information

Local Likelihood Bayesian Cluster Modeling for small area health data. Andrew Lawson Arnold School of Public Health University of South Carolina

Local Likelihood Bayesian Cluster Modeling for small area health data. Andrew Lawson Arnold School of Public Health University of South Carolina Local Likelihood Bayesian Cluster Modeling for small area health data Andrew Lawson Arnold School of Public Health University of South Carolina Local Likelihood Bayesian Cluster Modelling for Small Area

More information

The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations

The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations The Mixture Approach for Simulating New Families of Bivariate Distributions with Specified Correlations John R. Michael, Significance, Inc. and William R. Schucany, Southern Methodist University The mixture

More information

PQL Estimation Biases in Generalized Linear Mixed Models

PQL Estimation Biases in Generalized Linear Mixed Models PQL Estimation Biases in Generalized Linear Mixed Models Woncheol Jang Johan Lim March 18, 2006 Abstract The penalized quasi-likelihood (PQL) approach is the most common estimation procedure for the generalized

More information

Bayesian Mixture Modeling

Bayesian Mixture Modeling University of California, Merced July 21, 2014 Mplus Users Meeting, Utrecht Organization of the Talk Organization s modeling estimation framework Motivating examples duce the basic LCA model Illustrated

More information

Bayesian Linear Regression

Bayesian Linear Regression Bayesian Linear Regression Sudipto Banerjee 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. September 15, 2010 1 Linear regression models: a Bayesian perspective

More information

Limited Dependent Variable Panel Data Models: A Bayesian. Approach

Limited Dependent Variable Panel Data Models: A Bayesian. Approach Limited Dependent Variable Panel Data Models: A Bayesian Approach Giuseppe Bruno 1 June 2005 Preliminary Draft 1 Bank of Italy Research Department. e-mail: giuseppe.bruno@bancaditalia.it 1 Introduction

More information

A note on Reversible Jump Markov Chain Monte Carlo

A note on Reversible Jump Markov Chain Monte Carlo A note on Reversible Jump Markov Chain Monte Carlo Hedibert Freitas Lopes Graduate School of Business The University of Chicago 5807 South Woodlawn Avenue Chicago, Illinois 60637 February, 1st 2006 1 Introduction

More information

Approximate Bayesian Computation

Approximate Bayesian Computation Approximate Bayesian Computation Michael Gutmann https://sites.google.com/site/michaelgutmann University of Helsinki and Aalto University 1st December 2015 Content Two parts: 1. The basics of approximate

More information

Obnoxious lateness humor

Obnoxious lateness humor Obnoxious lateness humor 1 Using Bayesian Model Averaging For Addressing Model Uncertainty in Environmental Risk Assessment Louise Ryan and Melissa Whitney Department of Biostatistics Harvard School of

More information

BAYESIAN ESTIMATION OF LINEAR STATISTICAL MODEL BIAS

BAYESIAN ESTIMATION OF LINEAR STATISTICAL MODEL BIAS BAYESIAN ESTIMATION OF LINEAR STATISTICAL MODEL BIAS Andrew A. Neath 1 and Joseph E. Cavanaugh 1 Department of Mathematics and Statistics, Southern Illinois University, Edwardsville, Illinois 606, USA

More information

McGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-675.

McGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-675. McGill University Department of Epidemiology and Biostatistics Bayesian Analysis for the Health Sciences Course EPIB-675 Lawrence Joseph Bayesian Analysis for the Health Sciences EPIB-675 3 credits Instructor:

More information

Factorization of Seperable and Patterned Covariance Matrices for Gibbs Sampling

Factorization of Seperable and Patterned Covariance Matrices for Gibbs Sampling Monte Carlo Methods Appl, Vol 6, No 3 (2000), pp 205 210 c VSP 2000 Factorization of Seperable and Patterned Covariance Matrices for Gibbs Sampling Daniel B Rowe H & SS, 228-77 California Institute of

More information

2 Bayesian Hierarchical Response Modeling

2 Bayesian Hierarchical Response Modeling 2 Bayesian Hierarchical Response Modeling In the first chapter, an introduction to Bayesian item response modeling was given. The Bayesian methodology requires careful specification of priors since item

More information

Bagging During Markov Chain Monte Carlo for Smoother Predictions

Bagging During Markov Chain Monte Carlo for Smoother Predictions Bagging During Markov Chain Monte Carlo for Smoother Predictions Herbert K. H. Lee University of California, Santa Cruz Abstract: Making good predictions from noisy data is a challenging problem. Methods

More information

Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model

Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model UNIVERSITY OF TEXAS AT SAN ANTONIO Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model Liang Jing April 2010 1 1 ABSTRACT In this paper, common MCMC algorithms are introduced

More information

Anders Skrondal. Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine. Based on joint work with Sophia Rabe-Hesketh

Anders Skrondal. Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine. Based on joint work with Sophia Rabe-Hesketh Constructing Latent Variable Models using Composite Links Anders Skrondal Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine Based on joint work with Sophia Rabe-Hesketh

More information

The impact of covariance misspecification in multivariate Gaussian mixtures on estimation and inference

The impact of covariance misspecification in multivariate Gaussian mixtures on estimation and inference The impact of covariance misspecification in multivariate Gaussian mixtures on estimation and inference An application to longitudinal modeling Brianna Heggeseth with Nicholas Jewell Department of Statistics

More information

Bayesian Inference for the Multivariate Normal

Bayesian Inference for the Multivariate Normal Bayesian Inference for the Multivariate Normal Will Penny Wellcome Trust Centre for Neuroimaging, University College, London WC1N 3BG, UK. November 28, 2014 Abstract Bayesian inference for the multivariate

More information

MONTE CARLO METHODS. Hedibert Freitas Lopes

MONTE CARLO METHODS. Hedibert Freitas Lopes MONTE CARLO METHODS Hedibert Freitas Lopes The University of Chicago Booth School of Business 5807 South Woodlawn Avenue, Chicago, IL 60637 http://faculty.chicagobooth.edu/hedibert.lopes hlopes@chicagobooth.edu

More information

Bayesian Methods for Machine Learning

Bayesian Methods for Machine Learning Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),

More information

Longitudinal and Panel Data: Analysis and Applications for the Social Sciences. Table of Contents

Longitudinal and Panel Data: Analysis and Applications for the Social Sciences. Table of Contents Longitudinal and Panel Data Preface / i Longitudinal and Panel Data: Analysis and Applications for the Social Sciences Table of Contents August, 2003 Table of Contents Preface i vi 1. Introduction 1.1

More information

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke

More information

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals John W. Mac McDonald & Alessandro Rosina Quantitative Methods in the Social Sciences Seminar -

More information

Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated A Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

More information

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA JAPANESE BEETLE DATA 6 MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA Gauge Plots TuscaroraLisa Central Madsen Fairways, 996 January 9, 7 Grubs Adult Activity Grub Counts 6 8 Organic Matter

More information

Markov Chain Monte Carlo A Contribution to the Encyclopedia of Environmetrics

Markov Chain Monte Carlo A Contribution to the Encyclopedia of Environmetrics Markov Chain Monte Carlo A Contribution to the Encyclopedia of Environmetrics Galin L. Jones and James P. Hobert Department of Statistics University of Florida May 2000 1 Introduction Realistic statistical

More information

Semiparametric Generalized Linear Models

Semiparametric Generalized Linear Models Semiparametric Generalized Linear Models North American Stata Users Group Meeting Chicago, Illinois Paul Rathouz Department of Health Studies University of Chicago prathouz@uchicago.edu Liping Gao MS Student

More information

Lecture 1 Introduction to Multi-level Models

Lecture 1 Introduction to Multi-level Models Lecture 1 Introduction to Multi-level Models Course Website: http://www.biostat.jhsph.edu/~ejohnson/multilevel.htm All lecture materials extracted and further developed from the Multilevel Model course

More information

Default Priors and Efficient Posterior Computation in Bayesian Factor Analysis

Default Priors and Efficient Posterior Computation in Bayesian Factor Analysis Default Priors and Efficient Posterior Computation in Bayesian Factor Analysis Joyee Ghosh Institute of Statistics and Decision Sciences, Duke University Box 90251, Durham, NC 27708 joyee@stat.duke.edu

More information

arxiv: v2 [stat.me] 27 Aug 2014

arxiv: v2 [stat.me] 27 Aug 2014 Biostatistics (2014), 0, 0, pp. 1 20 doi:10.1093/biostatistics/depcounts arxiv:1305.1656v2 [stat.me] 27 Aug 2014 Markov counting models for correlated binary responses FORREST W. CRAWFORD, DANIEL ZELTERMAN

More information

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling Jon Wakefield Departments of Statistics and Biostatistics University of Washington 1 / 37 Lecture Content Motivation

More information

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Multilevel Statistical Models: 3 rd edition, 2003 Contents Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

(5) Multi-parameter models - Gibbs sampling. ST440/540: Applied Bayesian Analysis

(5) Multi-parameter models - Gibbs sampling. ST440/540: Applied Bayesian Analysis Summarizing a posterior Given the data and prior the posterior is determined Summarizing the posterior gives parameter estimates, intervals, and hypothesis tests Most of these computations are integrals

More information

Session 3A: Markov chain Monte Carlo (MCMC)

Session 3A: Markov chain Monte Carlo (MCMC) Session 3A: Markov chain Monte Carlo (MCMC) John Geweke Bayesian Econometrics and its Applications August 15, 2012 ohn Geweke Bayesian Econometrics and its Session Applications 3A: Markov () chain Monte

More information

Bayesian Meta-analysis with Hierarchical Modeling Brian P. Hobbs 1

Bayesian Meta-analysis with Hierarchical Modeling Brian P. Hobbs 1 Bayesian Meta-analysis with Hierarchical Modeling Brian P. Hobbs 1 Division of Biostatistics, School of Public Health, University of Minnesota, Mayo Mail Code 303, Minneapolis, Minnesota 55455 0392, U.S.A.

More information

Multivariate Survival Analysis

Multivariate Survival Analysis Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in

More information

Random Effects Models for Network Data

Random Effects Models for Network Data Random Effects Models for Network Data Peter D. Hoff 1 Working Paper no. 28 Center for Statistics and the Social Sciences University of Washington Seattle, WA 98195-4320 January 14, 2003 1 Department of

More information

Experimental Design and Data Analysis for Biologists

Experimental Design and Data Analysis for Biologists Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1

More information

The equivalence of the Maximum Likelihood and a modified Least Squares for a case of Generalized Linear Model

The equivalence of the Maximum Likelihood and a modified Least Squares for a case of Generalized Linear Model Applied and Computational Mathematics 2014; 3(5): 268-272 Published online November 10, 2014 (http://www.sciencepublishinggroup.com/j/acm) doi: 10.11648/j.acm.20140305.22 ISSN: 2328-5605 (Print); ISSN:

More information

Outline. Clustering. Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models

Outline. Clustering. Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models Collaboration with Rudolf Winter-Ebmer, Department of Economics, Johannes Kepler University

More information

Bayesian linear regression

Bayesian linear regression Bayesian linear regression Linear regression is the basis of most statistical modeling. The model is Y i = X T i β + ε i, where Y i is the continuous response X i = (X i1,..., X ip ) T is the corresponding

More information

Markov Chain Monte Carlo methods

Markov Chain Monte Carlo methods Markov Chain Monte Carlo methods Tomas McKelvey and Lennart Svensson Signal Processing Group Department of Signals and Systems Chalmers University of Technology, Sweden November 26, 2012 Today s learning

More information

Bayesian Nonparametric Regression for Diabetes Deaths

Bayesian Nonparametric Regression for Diabetes Deaths Bayesian Nonparametric Regression for Diabetes Deaths Brian M. Hartman PhD Student, 2010 Texas A&M University College Station, TX, USA David B. Dahl Assistant Professor Texas A&M University College Station,

More information

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model Xiuming Zhang zhangxiuming@u.nus.edu A*STAR-NUS Clinical Imaging Research Center October, 015 Summary This report derives

More information

CTDL-Positive Stable Frailty Model

CTDL-Positive Stable Frailty Model CTDL-Positive Stable Frailty Model M. Blagojevic 1, G. MacKenzie 2 1 Department of Mathematics, Keele University, Staffordshire ST5 5BG,UK and 2 Centre of Biostatistics, University of Limerick, Ireland

More information

Efficiency of generalized estimating equations for binary responses

Efficiency of generalized estimating equations for binary responses J. R. Statist. Soc. B (2004) 66, Part 4, pp. 851 860 Efficiency of generalized estimating equations for binary responses N. Rao Chaganty Old Dominion University, Norfolk, USA and Harry Joe University of

More information

Analysing geoadditive regression data: a mixed model approach

Analysing geoadditive regression data: a mixed model approach Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression

More information

Statistical Inference for Stochastic Epidemic Models

Statistical Inference for Stochastic Epidemic Models Statistical Inference for Stochastic Epidemic Models George Streftaris 1 and Gavin J. Gibson 1 1 Department of Actuarial Mathematics & Statistics, Heriot-Watt University, Riccarton, Edinburgh EH14 4AS,

More information

Tutorial on Approximate Bayesian Computation

Tutorial on Approximate Bayesian Computation Tutorial on Approximate Bayesian Computation Michael Gutmann https://sites.google.com/site/michaelgutmann University of Helsinki Aalto University Helsinki Institute for Information Technology 16 May 2016

More information

BAYESIAN ANALYSIS OF DOSE-RESPONSE CALIBRATION CURVES

BAYESIAN ANALYSIS OF DOSE-RESPONSE CALIBRATION CURVES Libraries Annual Conference on Applied Statistics in Agriculture 2005-17th Annual Conference Proceedings BAYESIAN ANALYSIS OF DOSE-RESPONSE CALIBRATION CURVES William J. Price Bahman Shafii Follow this

More information

7. Estimation and hypothesis testing. Objective. Recommended reading

7. Estimation and hypothesis testing. Objective. Recommended reading 7. Estimation and hypothesis testing Objective In this chapter, we show how the election of estimators can be represented as a decision problem. Secondly, we consider the problem of hypothesis testing

More information

A general mixed model approach for spatio-temporal regression data

A general mixed model approach for spatio-temporal regression data A general mixed model approach for spatio-temporal regression data Thomas Kneib, Ludwig Fahrmeir & Stefan Lang Department of Statistics, Ludwig-Maximilians-University Munich 1. Spatio-temporal regression

More information

Quantile POD for Hit-Miss Data

Quantile POD for Hit-Miss Data Quantile POD for Hit-Miss Data Yew-Meng Koh a and William Q. Meeker a a Center for Nondestructive Evaluation, Department of Statistics, Iowa State niversity, Ames, Iowa 50010 Abstract. Probability of detection

More information

Discrete Response Multilevel Models for Repeated Measures: An Application to Voting Intentions Data

Discrete Response Multilevel Models for Repeated Measures: An Application to Voting Intentions Data Quality & Quantity 34: 323 330, 2000. 2000 Kluwer Academic Publishers. Printed in the Netherlands. 323 Note Discrete Response Multilevel Models for Repeated Measures: An Application to Voting Intentions

More information

Bayesian Inference: Probit and Linear Probability Models

Bayesian Inference: Probit and Linear Probability Models Utah State University DigitalCommons@USU All Graduate Plan B and other Reports Graduate Studies 5-1-2014 Bayesian Inference: Probit and Linear Probability Models Nate Rex Reasch Utah State University Follow

More information