Regression models for multivariate ordered responses via the Plackett distribution

Similar documents
ARTICLE IN PRESS. Journal of Multivariate Analysis ( ) Contents lists available at ScienceDirect. Journal of Multivariate Analysis

Computational Statistics and Data Analysis. Identifiability of extended latent class models with individual covariates

Smoothness of conditional independence models for discrete data

AN EXTENDED CLASS OF MARGINAL LINK FUNCTIONS FOR MODELLING CONTINGENCY TABLES BY EQUALITY AND INEQUALITY CONSTRAINTS

A class of latent marginal models for capture-recapture data with continuous covariates

A dynamic model for binary panel data with unobserved heterogeneity admitting a n-consistent conditional estimator

Testing order restrictions in contingency tables

2. Basic concepts on HMM models

Repeated ordinal measurements: a generalised estimating equation approach

Stat 5101 Lecture Notes

arxiv: v2 [stat.me] 5 May 2016

Analysing geoadditive regression data: a mixed model approach

COPULA PARAMETERIZATIONS FOR BINARY TABLES. By Mohamad A. Khaled. School of Economics, University of Queensland. March 21, 2014

Invariant HPD credible sets and MAP estimators

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

arxiv: v1 [math.pr] 10 Oct 2017

Dynamic logit model: pseudo conditional likelihood estimation and latent Markov extension

A Fully Nonparametric Modeling Approach to. BNP Binary Regression

Multivariate Versus Multinomial Probit: When are Binary Decisions Made Separately also Jointly Optimal?

MARGINAL MODELS FOR CATEGORICAL DATA. BY WICHER P. BERGSMA 1 AND TAMÁS RUDAS 2 Tilburg University and Eötvös Loránd University

STA 216, GLM, Lecture 16. October 29, 2007

Factor Analysis (10/2/13)

,..., θ(2),..., θ(n)

On prediction and density estimation Peter McCullagh University of Chicago December 2004

Total positivity in Markov structures

A general mixed model approach for spatio-temporal regression data

ECON 594: Lecture #6

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Modulation of symmetric densities

COMPOSITIONAL IDEAS IN THE BAYESIAN ANALYSIS OF CATEGORICAL DATA WITH APPLICATION TO DOSE FINDING CLINICAL TRIALS

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES

Consistent Bivariate Distribution

Semiparametric Generalized Linear Models

Some Results on the Multivariate Truncated Normal Distribution

Goodness-of-Fit Tests for the Ordinal Response Models with Misspecified Links

ECE521 week 3: 23/26 January 2017

Yu Xie, Institute for Social Research, 426 Thompson Street, University of Michigan, Ann

Generalized Linear Models (GLZ)

A-optimal designs for generalized linear model with two parameters

Stat 642, Lecture notes for 04/12/05 96

ASSESSING A VECTOR PARAMETER

8 Nominal and Ordinal Logistic Regression

Fisher information for generalised linear mixed models

Non-linear panel data modeling

Figure 36: Respiratory infection versus time for the first 49 children.

More Powerful Tests for Homogeneity of Multivariate Normal Mean Vectors under an Order Restriction

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

An Introduction to Multivariate Statistical Analysis

Econometric Analysis of Cross Section and Panel Data

Efficiency of Profile/Partial Likelihood in the Cox Model

Log-linear multidimensional Rasch model for capture-recapture

PQL Estimation Biases in Generalized Linear Mixed Models

A new specification of generalized linear models for categorical data

Investigating Models with Two or Three Categories

Describing Contingency tables

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Copula-based Logistic Regression Models for Bivariate Binary Responses

Journal of Multivariate Analysis

FREQUENTIST BEHAVIOR OF FORMAL BAYESIAN INFERENCE

D-optimal Designs for Multinomial Logistic Models

LOGISTIC REGRESSION Joseph M. Hilbe

Tail dependence in bivariate skew-normal and skew-t distributions

Simplified marginal effects in discrete choice models

OPTIMAL DESIGNS FOR GENERALIZED LINEAR MODELS WITH MULTIPLE DESIGN VARIABLES

Simulation-based robust IV inference for lifetime data

2 Dipartimento di Medicina Sperimentale, Sapienza - Universita di Roma

Flat and multimodal likelihoods and model lack of fit in curved exponential families

Introduction to Statistical Analysis

A Two-latent-class Model for Education Transmission

STA 4273H: Statistical Machine Learning

A New Generalized Gumbel Copula for Multivariate Distributions

Robustness of Principal Components

Multistate Modeling and Applications

CS281 Section 4: Factor Analysis and PCA

Bayes methods for categorical data. April 25, 2017

Measuring Social Influence Without Bias

Charles E. McCulloch Biometrics Unit and Statistics Center Cornell University

The Instability of Correlations: Measurement and the Implications for Market Risk

INTRODUCTION TO LOG-LINEAR MODELING

Bayesian Multivariate Logistic Regression

Chap 2. Linear Classifiers (FTH, ) Yongdai Kim Seoul National University

Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach"

Default Priors and Effcient Posterior Computation in Bayesian

Flexible Estimation of Treatment Effect Parameters

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW

Part 6: Multivariate Normal and Linear Models

Limited Dependent Variable Models II

Pseudo-score confidence intervals for parameters in discrete statistical models

MARGINAL MODELLING OF MULTIVARIATE CATEGORICAL DATA

Parametric Identification of Multiplicative Exponential Heteroskedasticity

Models for Longitudinal Analysis of Binary Response Data for Identifying the Effects of Different Treatments on Insomnia

Principle Components Analysis (PCA) Relationship Between a Linear Combination of Variables and Axes Rotation for PCA

Links Between Binary and Multi-Category Logit Item Response Models and Quasi-Symmetric Loglinear Models

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1

Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation.

σ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) =

A Note on Binary Choice Duration Models

INFORMATION THEORY AND STATISTICS

EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING

Transcription:

Journal of Multivariate Analysis 99 (2008) 2472 2478 www.elsevier.com/locate/jmva Regression models for multivariate ordered responses via the Plackett distribution A. Forcina a,, V. Dardanoni b a Dipartimento di Economia, Finanza e Statistica, Unversità di Perugia, via Pascoli, 06100 Perugia, Italy b Dipartimento di Scienze Economiche, Aziendali e Finanziarie, Unversità di Palermo, Viale delle Scienze, 90138 Palermo, Italy Received 19 October 2007 Available online 25 February 2008 Abstract We investigate the properties of a class of discrete multivariate distributions whose univariate marginals have ordered categories, all the bivariate marginals, like in the Plackett distribution, have log-odds ratios which do not depend on cut points and all higher-order interactions are constrained to 0. We show that this class of distributions may be interpreted as a discretized version of a multivariate continuous distribution having univariate logistic marginals. Convenient features of this class relative to the class of ordered probit models (the discretized version of the multivariate normal) are highlighted. Relevant properties of this distribution like quadratic log-linear expansion, invariance to collapsing of adjacent categories, properties related to positive dependence, marginalization and conditioning are discussed briefly. When continuous explanatory variables are available, regression models may be fitted to relate the univariate logits (as in a proportional odds model) and the log-odds ratios to covariates. c 2008 Elsevier Inc. All rights reserved. AMS (2000) subject classifications: 60E15; 62H05; 62J12 Keywords: Plackett distribution; Marginal models; Global logits; Proportional odds; Multivariate ordered regression 1. Introduction The proportional odds model [15] assumes that the global logits of an ordered categorical response are a linear function of cut points and covariate effects; it is well known that Corresponding author. E-mail addresses: forcina@stat.unipg.it (A. Forcina), vdardano@unipa.it (V. Dardanoni). 0047-259X/$ - see front matter c 2008 Elsevier Inc. All rights reserved. doi:10.1016/j.jmva.2008.02.037

A. Forcina, V. Dardanoni / Journal of Multivariate Analysis 99 (2008) 2472 2478 2473 this is equivalent to assuming that the response variable is the discrete approximation of a continuous latent variable for which a linear regression model with logistic errors holds, an interpretation which is very popular in the econometric literature (see for example [12,20]). A completely general class of multivariate distributions whose univariate marginals have ordered categories and whose association structure is based on the multivariate extension of the Plackett distribution [18], have been studied by Molenbergs and Lesaffre [16]. This family of distributions is extremely flexible, at the cost, however, of a parametric complexity which increases rapidly with the number of responses and the number of categories. On the other hand, though the literature on the Plackett distribution seen as a copula is extensive (see for instance [10,17], sec. 3.3.1), its applications are mostly limited to bivariate distributions. Here we focus on a relevant subclass of such distributions which resemble the discretized multivariate normal in the sense that the marginal association between any pair is determined by a single parameter, the global odds ratio, assumed invariant to cut points, as in a discrete analog of the Plackett distribution, with the log-linear interactions of order higher than two constrained to zero. The results presented in this paper exploits recent advances on marginal modeling of discrete distributions (see for example [3] or [6]) which allow simultaneous modeling of several marginal distributions. In the special case of binary responses, some of these results were anticipated by Zhao and Prentice [21] who studied the mapping between the set of canonical parameters related to main effects and bivariate interactions and the set of univariate and bivariate moments; with higher-order interactions treated as nuisance parameters or constrained to 0. When individual covariates are available, our formulation allows for a set of proportional odds models which describe the dependence of each univariate marginal on explanatory variables; in addition, linear regression models may be formulated to allow the global log-odds ratios to depend on covariates. In Section 2 we describe the construction and parametrization of the multivariate discrete Plackett, the construction of regression model and a proof of the existence of an underlying continuous distribution. The main properties of this class of distributions are discussed in Section 3. 2. Definition and construction Below we first give a formal definition of the mapping defined by parametrizing a multivariate discrete distribution with global logits and log-odds ratios and then show that it represents a link function, that is, the mapping is invertible and differentiable. We then describe how these parameters may depend on covariates as in a multivariate generalized linear model. Finally we show that this distribution may be interpreted as the discretized version of a multivariate continuous distribution having logistic univariate marginals and discuss the implications of this result. 2.1. The Plackett link function Let Y 1,..., Y J denote J ordinal categorical response variables. Because categories may be associated to labels which, apart from being ordered, may be arbitrary, there is no loss of generality in assuming that Y j, j = 1,..., J takes values in 1,..., m j. The probabilities which define the joint density of (Y 1,... Y J ) conditionally on a vector of covariates z can be displayed in a table having t = j m j cells. In the following it will be convenient to arrange these probabilities into the vector π(z) by letting variables Y j with a larger index j run faster from 1 to m j. Define the marginal univariate global logits λ j;k (z) = log[pr(y j > k z)/pr(y j k z)], j = 1,..., J, k = 1,..., m j 1,

2474 A. Forcina, V. Dardanoni / Journal of Multivariate Analysis 99 (2008) 2472 2478 and the marginal global log-odds ratios λ j,k; j,k (z) = λ j,k (z; Y j > k) λ j,k (z; Y j k) for 1 j < j J and k < m j, k < m j, where λ j,k (z; Y j > k) denotes the logit of Y j conditionally on z and Y j > k. These parameters may be arranged into the vector λ(z) which has dimension v = j (m j 1)+ j j > j (m j 1) (m j 1) with the logits coming first and k running faster than j, then k runs faster than k within each pair j, j. We call Plackett the simplified model where, for each pair j, j, the log-odds ratios λ j,k; j,k (z) do not depend on cut points k, k ; this is equivalent to imposing a set of j j > j [(m j 1) (m j 1) 1] linear constraints so that the association between each pair is determined by a single parameter. Let L denote the subset of R v such that λ(z) L is Plackett and corresponds to a proper probability distribution π(z) with strictly positive elements. Let also g denote the mapping λ(z) = g[π(z)] and Π L the image of L in R t. Elements of R v may fail to determine a proper probability distribution for two reasons: (i) univariate logits must be strictly decreasing and (ii) log-odds ratios within any subset of three or more Y j s must satisfy consistency constraints which are the analog of the constraint that a covariance matrix must be positive definite. An algorithm for checking whether a vector of marginal parameters is inconsistent has been described by Qaqish and Ivanova [19]. Though the algorithm is limited to binary distributions, due to the Placket constraint, it could be applied to our context by dichotomizing at arbitrary cut points. The mapping from Π L to L may be written in the compact matrix form as λ(z) = C log(mπ(z)), (1) where C is a matrix of row contrasts and M is a matrix of 0 s and 1 s which produce the marginal probabilities required. An algorithm for constructing these matrices is described by Colombi and Forcina ( [7], p. 1018). The log-linear expansion for π(z) may be written as log[π(z)] = Gθ(z) 1 log[1 tgθ(z)], (2) where θ(z) is the vector of log-linear parameters not constrained to 0 and G is an appropriate design matrix of full column rank v. A convenient choice for G is described by [3], p. 699. Proposition 1. The mapping from Π L to L is invertible and differentiable. Proof. A somehow informal proof was given by Glonek [11]; the result is a special case of Theorem 1 in [3], p. 703. Essentially, the proof is based on the properties of the mixed parametrization of Barndorff-Nielsen ( [2], p. 121) which first ensures that the mapping between θ(z) and the vector of expected values of sufficient statistics µ(z) = G π(z) is a diffeomorphism and then that, within each univariate and bivariate marginal, there is again a diffeomorphism between the corresponding subsets of µ(z) and λ(z). Notice that, obviously, λ(z) and θ(z) have the same size v. While π(z) is determined by θ(z) from (2), the inverse of the logistic transformation may be written in explicit form as θ(z) = H log(π(z)), where H is the left inverse of G which is orthogonal to 1. This matrix may be computed as H = (G G G 1 t 1 t G/t) 1 (G G 1 t 1 t /t).

A. Forcina, V. Dardanoni / Journal of Multivariate Analysis 99 (2008) 2472 2478 2475 2.2. A reconstruction algorithm Though for J = 2 there is an explicit formula for reconstructing π(z) from λ(z) (Dale [9]) and an extension to multiway table is described by [16], its actual implementation for J > 2 is rather demanding. An algorithm based on a first-order approximation is equally accurate and faster even for J = 2. Different versions of this algorithm have been proposed by [21] and [11]; the version described here requires the explicit construction of the matrix G and works as follows (the dependence on z is omitted for simplicity): 1. at the initial step choose a value θ (0) such that λ (0) is sufficiently close to λ; 2. at the hth step update the vector of canonical parameters by the first-order approximation θ (h) = θ (h 1) + D[λ λ (h 1) ]; where D = [ λ / θ] 1 is easily computed from (1); 3. iterate until the norm of λ λ (h 1) is close to 0 and then reconstruct π(z) from (2). When J is greater than two the algorithm is efficient because it works in the reduced dimension v. In practice the actual choice of θ (0) has negligible effects on the efficiency of the reconstruction algorithm and may be set to 0 v. A non-iterative algorithm based on the reconstruction of the elements of µ(z) from those of λ(z) and those of π(z) from those of µ(z) is described by [19]. 2.3. Linear models If the number of distinct covariate configurations was very small relative to the number of observations, each vector λ(z) could be estimated within the corresponding strata like in a semi-parametric model. However, in most practical situations the number of distinct covariate configurations is so large that each stratum will contain few (or even just one) observations. In these cases we assume a linear model of the form λ j,k (z) = α j,k + z j β j; j = 1,..., J; k = 1,..., m j 1; (3) where z j denotes the subset of covariates affecting the logits of Y j ; under the Plackett constraint we also assume that λ j,k; j,k (z) = τ j, j + z j, j ξ j, j 1 j < j J, (4) where z j, j denotes the subset of covariates affecting the log-odds ratio between Y j, Y j. These equations may be written in a compact matrix form as λ(z) = Zψ, where Z will be block diagonal, with one block for each set of m j 1 univariate logits and (m j 1)(m j 1) different log-odds ratios, and ψ collects the α s, β s, τ s and ξ s parameters. Note that the Plackett constraint is implicit in the construction of the Z matrix. 2.4. Existence of a continuous multivariate latent distribution In many contexts it may be meaningful to interpret each ordinal response variable considered separately as a discrete approximation of a continuous latent variable which is also related to covariates by some kind of regression model. More formally, the following result holds: Proposition 2. Let U j, j = 1,..., J, denote a continuous latent variable such that U j > α j,k whenever Y j > k; in addition, assume that (5)

2476 A. Forcina, V. Dardanoni / Journal of Multivariate Analysis 99 (2008) 2472 2478 U j = z j β j + ɛ j. Then, if ɛ j is symmetric around 0, it has a standard logistic univariate marginal. Proof. See for example Agresti [1], Sec. 7.2.3 or Wooldridge [20], Sec. 15.10. This result is very popular in econometrics because it provides a justification for the proportional odds model and indicates that its intercepts reversed in sign may be interpreted as cut points on the latent scale; thus they must be strictly increasing in order for π(z) Π L. The previous result holds when each marginal distribution is considered on its own. We now examine whether there exists a multivariate continuous latent distribution having the logistic univariate marginals specified above and satisfying the dependence structure implied by the Plackett constraints. The result contained in the next proposition indicates that the class of multivariate discrete distributions studied in this paper is an alternative to the class of distributions obtained by discretizing a multivariate normal at a given set of cut points defined for each marginal. Proposition 3. For any value of z and ψ there exists a continuous multivariate distribution with cumulative distribution function H(u z; ψ), defined for any u R J, having logistic univariate marginals as stated in Proposition 2 and matching the cumulative distribution at any choice of k( j) < m j, H( α 1,k(1),..., α J,k(J) z; ψ) = pr(y 1 k(1),..., Y J k(j) z). Proof. For any u R J, let c > 0 be an arbitrary constant and define λ(z, u, c) to be the vector of marginal parameters constructed as λ(z), but containing two logits for each variable equal to u j + z j β j and u j + z j β j c and four log-odds ratio for each pair equal to the corresponding elements of λ(z). This is a compatible vector of marginal parameters because the two logits are strictly decreasing and the log-odds ratios are equal to those of the discrete distribution, thus it defines a J-dimensional three-variate distribution. Call π(z, u, c) the corresponding vector of joint probabilities and define H(u z; ψ) to be equal to the first cell of π(z, u, c). Note that, when u j = α j,k( j), H(u z; ψ) equals the cumulative discrete distribution at the corresponding cut points as follows from Proposition 1. Since the elements of π(z, u, c) are strictly positive, it follows that, if an arbitrary subset of cut points u j, say H, is replaced by u j c, the corresponding cumulative distribution must be strictly greater than H(u z; ψ). 3. Main properties 3.1. Marginal and conditional distributions The fact that λ(z) is a set of univariate and bivariate marginal parameters has the advantage that the corresponding parameters for the marginal distribution of a subset of K response variables Y j (1),..., Y j (K ) are equal to the corresponding elements of λ(z) and the Plackett constraint continues to hold. However, the constraint that the log-linear interactions of order higher than two are 0 is no longer true exactly, though it should still hold approximately in most cases. This is so because the log-linear parameters refer to the full joint distribution and thus are not invariant to marginalizations. On the other hand, for any partition of the response variables into two components U and V, the conditional distribution of V U retains the property that the log-linear interactions of order greater than two are 0, but now the Plackett constraint will hold only approximately within conditional distributions.

A. Forcina, V. Dardanoni / Journal of Multivariate Analysis 99 (2008) 2472 2478 2477 3.2. Collapsing of adjacent categories It is easily verified that the distribution obtained by collapsing two or more adjacent categories of any given response variable not only is still Plackett, but it has the same parameters of the original distribution except for the logits determined by the cut points used to separate the collapsed categories which will be dropped. Invariance holds because the global logits and odds ratios depend on the cumulative distribution which is unaffected by collapsing of adjacent categories except that certain intermediate reference points are dropped. 3.3. Quadratic log-linear expansion It may be worth noting that, since log-linear interactions of order greater than two are constrained to 0, by adopting the log-linear parametrization described by [3], Section 3, the columns of the matrix G may be written either as dummies g j;k which are equal to 1 if Y j > k and 0 otherwise, with k < m j for all j, or as dummies g j, j ;k,k which are equal to 1 if both Y j > k and Y j > k. It can be easily verified that in this formulation g j, j ;k,k is the elementwise product of g j;k and g j ;k ; thus Eq. (2), apart from a rescaling factor, may be seen as a quadratic function of the response variables. This extends the similar expression given in Cox and Wermuth ([8], Equation (8)) for binary variables. 3.4. Stochastic orderings A simple approach to the analysis of ordinal response variables is to replace, within each variable, the m j ordinal categories with a suitable set of scores which, apart from being strictly increasing, may be arbitrary. Then it is natural to expect that the substantial properties of the joint distribution should be invariant to the assignment of arbitrary, strictly increasing scores. The following proposition highlights a well-known property of the global logits: Proposition 4. Let z ja and z jb be two distinct values taken by z j ; then the condition that the global logits of Y j z jb are not smaller than the corresponding logits of Y j z ja is equivalent to the condition that E[s(Y j ) z jb ] > E[s(Y j ) z ja ] for any arbitrary strictly increasing set of scores s. Proof. Since global logits are strictly increasing functions of the cumulative distribution, it follows that the global logits condition holds if and only if Y j z jb is stochastically larger than Y j z ja ; the result follows since it is well known (see among others Hadar and Russell [13], Theorems 1 and 2) that Y j z jb is stochastically greater than Y j z ja if and only if E[s(Y j ) z jb ] E[s(Y j ) z ja ] for any arbitrary strictly increasing set of scores s. Regarding the positive dependence between a given pair of response variables, it is well known (see for example [5], p. 1498) that the condition that all the log-odds ratios between a given pair of response variables are non-negative is equivalent to the condition of positive quadrant dependence (PQD), a notion of positive dependence between ordinal variables first introduced by Lehmann [14]. Recall that a pair of response variables Y j, Y j are PQD if pr ( Y j k( j), Y j k ( j ) ) pr ( Y j k( j) ) pr ( Y j k ( j ) ) for all k( j), k( j ), which intuitively means that, relative to independence, small values of one variable are associated with small values of the other. Thus, for the Plackett distribution, λ j, j (z) 0 is equivalent to the condition that Y j, Y j are PQD, in addition we have:

2478 A. Forcina, V. Dardanoni / Journal of Multivariate Analysis 99 (2008) 2472 2478 Proposition 5. Within the Plackett distribution, λ j, j (z) 0 if and only if cov[u(y j ), v(y j ) z] 0 for any arbitrary strictly increasing set of scores u and v. Proof. Since λ j, j (z) 0 is equivalent to PQD, the result follows from [14]. 3.5. Likelihood inference The issue is not discussed here in detail because the theory for fitting regression models similar to those described above to data with individual covariates can be derived from Colombi and Forcina [7] and Bartolucci and Forcina [4] among others. A specific set of MATLAB routines will be available from the website http://www.stat.unipg.it/forcina/plackett. References [1] A. Agresti, Categorical Data Analysis, John Wiley & Sons, New York, 2002. [2] O. Barndorff-Nielsen, Information and Exponential Families in Statistical Theory, John Wiley & Sons, New York, 1978. [3] F. Bartolucci, R. Colombi, A. Forcina, An extended class of marginal link functions for modeling contingency tables by equality and inequality constraints, Statist. Sinica 17 (2007) 691 711. [4] F. Bartolucci, A. Forcina, A class of latent marginal models for capture-recapture data with continuous covariates, J. Amer. Statist. Assoc. 101 (2006) 786 794. [5] F. Bartolucci, A. Forcina, V. Dardanoni, Positive quadrant dependence and marginal modeling in two-way tables with ordered marginals, J. Amer. Statist. Assoc. 96 (2001) 1497 1505. [6] W.P. Bergsma, T. Rudas, Marginal models for categorical data, Ann. Statist. 30 (2002) 140 159. [7] R. Colombi, A. Forcina, Marginal regression models for the analysis of positive association, Biometrika 88 (2001) 1007 1019. [8] D.R. Cox, N. Wermuth, On some models for multivariate binary variables parallel in complexity with the multivariate Gaussian distribution, Biometrika 89 (2002) 462 469. [9] J.R. Dale, Global cross-ratio models for bivariate, discrete, ordered responses, Biometrics 42 (1986) 909 917. [10] D. Ghosh, Semi parametric global cross-ratio models for bivariate censored data, Scand. J. Statist. 33 (2006) 609 619. [11] G. Glonek, A class of regression models for multivariate categorical responses, Biometrika 83 (1996) 15 28. [12] W.H. Greene, Econometric Analysis, Prentice Hall, 2004. [13] J. Hadar, W. Russell, Rules for ordering uncertain prospects, Am. Econ. Rev. 59 (1969) 25 34. [14] E.L. Lehmann, Some concepts of dependence, Ann. Math. Statist. 37 (1966) 1137 1153. [15] P. McCullagh, Regression models for ordinal data, J. R. Statist. Soc. B 42 (1980) 109 142. [16] G. Molenberghs, E. Lesaffre, Marginal modeling of correlated ordinal data using a multivariate Plackett distribution, J. Amer. Statist. Assoc. 89 (1994) 633 644. [17] R.B. Nelsen, An Introduction to Copulas, Springer, New York, 1999. [18] R.L. Plackett, A class of bivariate distribution, J. Amer. Statist. Assoc. 60 (1965) 516 522. [19] B.F. Qaqish, A. Ivanova, Multivariate logistic models, Biometrika 93 (2006) 1011 1017. [20] J. Wooldridge, Econometric Analysis of Cross Section and Panel Data, MIT Press, 2003. [21] L.P. Zhao, R.L. Prentice, Correlated binary regression using a quadratic exponential model, Biometrika 77 (1990) 642 648.