On a connection between the Bradley Terry model and the Cox proportional hazards model

Similar documents
On a connection between the Bradley-Terry model and the Cox proportional hazards model

A note on sufficient dimension reduction

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

β j = coefficient of x j in the model; β = ( β1, β2,

Understanding the Cox Regression Models with Time-Change Covariates

Time-dependent coefficients

Survival Regression Models

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

The coxvc_1-1-1 package

Survival analysis in R

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

Relative-risk regression and model diagnostics. 16 November, 2015

STAT 5500/6500 Conditional Logistic Regression for Matched Pairs

Survival Analysis Math 434 Fall 2011

Multivariate Survival Analysis

Extensions of Cox Model for Non-Proportional Hazards Purpose

Lecture 7. Proportional Hazards Model - Handling Ties and Survival Estimation Statistics Survival Analysis. Presented February 4, 2016

Full likelihood inferences in the Cox model: an empirical likelihood approach

MAS3301 / MAS8311 Biostatistics Part II: Survival

Multistate models and recurrent event models

Lecture 12. Multivariate Survival Data Statistics Survival Analysis. Presented March 8, 2016

Lecture 8 Stat D. Gillen

Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times

A Regression Model For Recurrent Events With Distribution Free Correlation Structure

Lecture 7 Time-dependent Covariates in Cox Regression

Multistate models and recurrent event models

1 Introduction. 2 Residuals in PH model

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

STAT331. Cox s Proportional Hazards Model

Survival analysis in R

EVALUATION OF EIGEN-VALUE METHOD AND GEOMETRIC MEAN BY BRADLEY-TERRY MODEL IN BINARY AHP

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW

Cox s proportional hazards/regression model - model assessment

Tied survival times; estimation of survival probabilities

SAS/STAT 13.1 User s Guide. Introduction to Survey Sampling and Analysis Procedures

SSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that need work

University of California, Berkeley

Survival Analysis for Case-Cohort Studies

SAS/STAT 13.2 User s Guide. Introduction to Survey Sampling and Analysis Procedures

The Proportional Hazard Model and the Modelling of Recurrent Failure Data: Analysis of a Disconnector Population in Sweden. Sweden

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

Chapter 7. Network Flow. Slides by Kevin Wayne. Copyright 2005 Pearson-Addison Wesley. All rights reserved.

Chapter 4 Regression Models

A Survival Analysis of GMO vs Non-GMO Corn Hybrid Persistence Using Simulated Time Dependent Covariates in SAS

Extensions of Cox Model for Non-Proportional Hazards Purpose

Comparison of Hazard, Odds and Risk Ratio in the Two-Sample Survival Problem

Correlation (pp. 1 of 6)

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design

Nonparametric Bayes Estimator of Survival Function for Right-Censoring and Left-Truncation Data

Joint Modeling of Longitudinal Item Response Data and Survival

Philosophy and Features of the mstate package

Time-dependent covariates

Sample selection with probability proportional to size sampling using SAS and R software

ST745: Survival Analysis: Cox-PH!

SAS/STAT 14.2 User s Guide. Introduction to Survey Sampling and Analysis Procedures

On equality and proportionality of ordinary least squares, weighted least squares and best linear unbiased estimators in the general linear model

Estimation of growth convergence using a stochastic production frontier approach

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

A note on Two-warehouse inventory model with deterioration under FIFO dispatch policy

exclusive prepublication prepublication discount 25 FREE reprints Order now!

COMPARING TRANSFORMATIONS USING TESTS OF SEPARATE FAMILIES. Department of Biostatistics, University of North Carolina at Chapel Hill, NC.

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Multivariable Fractional Polynomials

Chapter 7. Network Flow. Slides by Kevin Wayne. Copyright 2005 Pearson-Addison Wesley. All rights reserved.

Learning Binary Classifiers for Multi-Class Problem

GROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin

UNIVERSITY OF CALIFORNIA, SAN DIEGO

Longitudinal + Reliability = Joint Modeling

Package MiRKATS. May 30, 2016

and Comparison with NPMLE

A Note on Item Restscore Association in Rasch Models

Multivariable Fractional Polynomials

ONE MORE TIME ABOUT R 2 MEASURES OF FIT IN LOGISTIC REGRESSION

Misclassification in Logistic Regression with Discrete Covariates

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

A fast routine for fitting Cox models with time varying effects

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.

Sensitivity analysis and distributional assumptions

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

Lecture 11. Interval Censored and. Discrete-Time Data. Statistics Survival Analysis. Presented March 3, 2016

Strati cation in Multivariate Modeling

Technical Report - 7/87 AN APPLICATION OF COX REGRESSION MODEL TO THE ANALYSIS OF GROUPED PULMONARY TUBERCULOSIS SURVIVAL DATA

Spatial autoregression model:strong consistency

A comparison of inverse transform and composition methods of data simulation from the Lindley distribution

Efficiency of Profile/Partial Likelihood in the Cox Model

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm

Tests of transformation in nonlinear regression

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Fitting Cox Regression Models

Unit 9: Inferences for Proportions and Count Data

Lecture 22 Survival Analysis: An Introduction

Analysis of Survival Data Using Cox Model (Continuous Type)

Fitting Semiparametric Additive Hazards Models using Standard Statistical Software

11 Survival Analysis and Empirical Likelihood

Lecture 9. Statistics Survival Analysis. Presented February 23, Dan Gillen Department of Statistics University of California, Irvine

A comparison study of the nonparametric tests based on the empirical distributions

USING MARTINGALE RESIDUALS TO ASSESS GOODNESS-OF-FIT FOR SAMPLED RISK SET DATA

C. J. Skinner Cross-classified sampling: some estimation theory

Transcription:

Statistics & Probability Letters 76 (2006) 698 702 www.elsevier.com/locate/stapro On a connection between the Bradley Terry model and the Cox proportional hazards model Yuhua Su, Mai Zhou Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Received 25 August 2004; received in revised form 29 August 2005 Available online 4 November 2005 Abstract This article addresses a connection between the Bradley and Terry [1952a. The rank analysis of incomplete block designs. I. The method of paired comparisons. Biometrika 39, 324 345] model and Cox [1972. Regression models and lifetables. J. Roy. Statist. Soc. B 24, 187 220 (with discussion)] proportional hazards model. We show that the partial likelihood of random variables that satisfy the stratified proportional hazards assumption of Cox [1972. Regression models and life-tables. J. Roy. Statist. Soc. B 24, 187 220 (with discussion)] coincides with the likelihood given by the Bradley Terry (BT) model for rank order events. Such a connection not only allows many available software for the Cox model to be used for regression analysis based on the BT model, but also enables the new developments to fit certain rank order data by using ideas that stem from the Cox model and its extensions available in many recent literatures from survival analysis. r 2005 Elsevier B.V. All rights reserved. MSC: primary 00A69; secondary 60G25 Keywords: Rank order data; Partial likelihood 1. Introduction rank order data In games such as horse racing and chess, ordered name lists are often of interest and recorded after the comparison of the performance of contestants. Suppose there are N independent (unobservable) random variables T 1 ;...; T N representing the performance of N contestants. If we compare two random variables T i and T j, where 1pi; jpn, then the event ft i 4T j gis called a rankorder event for a paired comparison experiment. In general, if we compare K ð2pkpnþ random variables, T i1 ;...; T ik, where i j are distinct numbers from f1;...; Ng, then the event ft i1 4 4T ik g, is called a rankorder event for a multiple comparison experiment. The rank order data is a collection of outcomes of rank order events. Corresponding author. E-mail address: yuhua.su@yale.edu (Y. Su). 0167-7152/$ - see front matter r 2005 Elsevier B.V. All rights reserved. doi:10.1016/j.spl.2005.10.001

Y. Su, M. Zhou / Statistics & Probability Letters 76 (2006) 698 702 699 Table 1 Game outcomes of American league baseball (K ¼ 2; N ¼ 7) Winning team Losing team Milwaukee Detroit Toronto New York Boston Cleveland Baltimore Milwaukee 7 9 7 7 9 11 Detroit 6 7 5 11 9 9 Toronto 4 6 7 7 8 12 New York 6 8 6 6 7 10 Boston 6 2 6 7 7 12 Cleveland 4 4 5 6 6 6 Baltimore 2 4 1 3 1 7 Example 1 (N ¼ 7; K ¼ 2). Consider the example of paired comparisons in Agresti (2002, p. 438). Table 1 gives the outcomes of the games within the Eastern Division of the American league in the 1987 baseball season. In this example, New York and Baltimore had 13 matches (comparisons) and New York wins 10 of them. From those observations, we may want to estimate the probabilities of rank order events, for example, the event that New York would defeat Baltimore. 2. Model definitions Bradley Terry model is a popular model used to estimate the probabilities of rank order events based on data as described in Section 1. 2.1. The Bradley Terry (BT) models Bradley and Terry (1952a) do not assume specific distributions for T i in the model description but postulate that every subject i can have a parameter p i 40, and assume that the event ft i 4T j g has a probability given by p j =ðp i þ p j Þ. Definition 1 (The BT model (K ¼ 2)). If p i ð40þ is a parameter associated with T i, i ¼ 1;...; N, then the probability of the event ft i 4T j g is given by P B ðt i 4T j Þ¼ p j ; iaj; i; j ¼ 1;...; N. (1) p i þ p j Bradley and Terry (1952b) propose a model for triple comparisons where the events ft i 4T j 4T k g are assumed to have probabilities given by p k =ðp i þ p j þ p k Þp j =ðp i þ p j Þ. From here we can easily generalize the BT model to multiple comparisons. Definition 2 (The BT model (K42)). The BT model of K-subject matches assigns the probabilities by the following rule, P B ðt i1 4 4T ik Þ¼ YK j¼2 p ij Pj r¼1 p i r, (2) where i j s are distinct numbers in f1;...; Ng; p i ð40þ s are parameters associated with T i s, respectively. Estimation of the parameters p i can then be obtained by the maximum likelihood method.

700 Y. Su, M. Zhou / Statistics & Probability Letters 76 (2006) 698 702 2.2. The Cox proportional hazards model In the Cox proportional hazards model, one assumption is the existence of the hazard functions l i ðtþ for random variables T i ; 1pipN. This model, introduced by Cox (1972) presumes that a parameter p i ð40þ of T i affects the hazard function in a multiplicative manner according to l i ðtþ ¼l 0 ðtþp i ; i ¼ 1;...; N (3) where l 0 ðtþ is an unspecified baseline hazard function. In the survival analysis, usually p i ¼ expðz t i bþ, where Z i is a 1 q vector of covariates for subject i and b is a 1 q vector of unknown parameters. There are many extensions to the proportional hazards model (see Therneau and Grambsch, 2000). The one that is relevant to our study here is the stratified proportional hazards model. Basically we shall consider each match/comparison among K contestants to be a distinct stratum. Definition 3 (2pKpN). In a stratified Cox proportional hazards model, a random variable, T ij, in a stratum/ comparison s has the following hazard function: l ij ðtþ ¼l s ðtþp ij ; j ¼ 1;...; K, (4) where i j s are distinct numbers in f1;...; Ng; and l s ðtþ is the common (but un-specified) baseline hazard shared by random variables T i1 ;...; T ik, in this particular comparison. This model allows the baseline hazard l s ðtþ to change from strata to strata. In the analysis of the stratified proportional hazards model, the statistical inference is typically based on the so called partial likelihood. For details please see Therneau and Grambsch (2000, Chapter 3). Example 1 (Continued). Note that every team plays every other team exactly 13 times. There are ð 7 2 Þ¼21 different combinations of pairings between two teams. The total number of pairwise comparisons is 21 13 ¼ 273. Therefore, the stratified Cox model have 21 different baselines here: s ¼ 1; 2;...; 21: Let ði; jþ ð1piojp7þ span over all the combinations of possible pairings. Denote the performance of team q in the comparison ði; jþ by random variable T q;ði;jþ, q 2fi; jg, 1piojp7. The stratified Cox proportional hazards model for this paired comparison assumes that the hazards of those random variables satisfy l q;ði;jþ ðtþ ¼l ði;jþ ðtþp q where q ¼ i or j; 1piojp7. (5) l ð1;2þ ðtþ;...; l ð6;7þ ðtþ are unspecified baseline hazards. They may be different from each other. 3. Main results Now we are ready to describe the main results of this paper. Theorem 1. Suppose that independent random variables T i satisfy the stratified Cox proportional hazards assumption (4) and T i and T j belong to one stratum, then we have (i) PðT i 4T j Þ¼p j =ðp i þ p j Þ; (ii) The contribution to the partial likelihood from the random variables T i and T j with ft i 4T j g is also p j =ðp i þ p j Þ. The proof of the theorem is straightforward and will be omitted. Recall the BT model Definition 1, we immediately conclude that the likelihood of BT model and the partial likelihood of the stratified Cox proportional hazards model are identical. This is an important finding with far reaching consequences: (i) as illustrated in the next section, software developed for stratified Cox proportional hazards model can be used to obtain MLE in BT model since the two estimators are identical; (ii) MLE in BT model can share large sample properties obtained by powerful counting process martingale theory for partial likelihood estimators in Cox model; (iii) likelihood ratio tests are going to be identical in two models, as well as Fisher information matrices; (iv) methods to treat ties proposed in the two different models can borrow strength from each other and give us more options; and (v) more importantly, many extensions to the Cox model, namely handling of right censored observations and use of time dependent covariates, can be introduced to the BT model in analyzing rank order data and give us interesting results. For details please see Su (2003).

In multiple comparisons, we have K subjects in a comparison. Let fi 1 ;...; i K g be the set of indices of the subjects engaged in a comparison. As in the paired comparison case, we have the following theorem. Theorem 2. Let T ij ; 1pjpK be independent random variables with distributions F ij ðt; p ij Þ, respectively. Then, Z 1 Z 1 Z 1 Y. Su, M. Zhou / Statistics & Probability Letters 76 (2006) 698 702 701 PðT i1 4 4T ik Þ¼ df i1 ðt 1 ; p i1 ÞdF ik 1 ðt K 1 ; p ik 1 Þ df ik ðt K ; p ik Þ. 1 t K t 2 If furthermore, T i1 ;...; T ik satisfy the stratified Cox proportional hazards assumption (4) and are in one stratum, then the BT model will assign the probability correctly to the event ft i1 4 4T ik g, i.e. PðT i1 4 4T ik Þ¼ YK j¼2 p ij Pj r¼1 p i r. The contribution to the partial likelihood from observations T i1 ;...; T ik corresponding likelihood given by the BT model in (2). with ft i1 4 4T ik g is the same as the 4. Application Now we illustrate fitting the BT model using the coxph procedure in R. All the observations for a game should be in one stratum. In Example 1, each game yields two observations for two subjects. Both observations should be in a stratum. After entering the data in counting process style (cf. SAS Institute, Inc., 2000), the following output can be obtained. (For complete R and SAS code please see web page www.ms.uky.edu/mai/research/examplebtcox.) Call: coxph(formula ¼ Surv(ta, tp, ev) z2 + z3 + z4 + z5 + z6 + z7 + strata(games), weights ¼ sc, method ¼ efron") coef exp(coef) se(coef) z p z2 0.145 0.865 0.311 0.466 6.4e 01 z3 0.287 0.751 0.310 0.925 3.6e 01 z4 0.334 0.716 0.310 1.076 2.8e 01 z5 0.474 0.623 0.311 1.525 1.3e 01 z6 0.898 0.408 0.317 2.835 4.6e 03 z7 1.581 0.206 0.343 4.607 4.1e 06 Likelihood ratio test ¼ 34.0 on 6 df, p ¼ 6.84e 06 n ¼ 84 The coef for z1 is fixed at zero and thus exp(coef) ¼ 1. The estimates of p s given by the coxph procedure are 1; 0:865; 0:751; 0:716; 0:623; 0:408 and 0:206. They are for Milwaukee, Detroit, Toronto, New York, Boston, Cleveland and Baltimore respectively. Using (1), the predicted probability of the event that New York would defeat Baltimore is 0:716=ð0:716 þ 0:206Þ 0:777. The predicted probability of the event that New York would defeat Boston is 0:716=ð0:716 þ 0:623Þ 0:535, etc. The results agree with Agresti (2002). 5. Discussion ties To cover the possibility of an appreciable amount of ties, Cox (1972) suggests the conditional logistic likelihood. If the continuous time assumption can be made, Therneau and Grambsch (2000) address different methods to handle tied data and their computation issues. Those methods can be easily applied to the rankorder data analysis. Davidson (1970) discusses an extension of the BT model for paired comparisons to situations which allow an expression of no preference, and compared its performance with a model proposed by Rao and Kupper (1967). Beaver and Rao (1973) have developed a model to account for ties in triple

702 Y. Su, M. Zhou / Statistics & Probability Letters 76 (2006) 698 702 comparisons. Unlike the approach of Cox, these extensions of the BT model introduce an additional parameter. 6. Acknowledgements Professors Ali (1977) and Zelterman (2002) kindly offered invaluable suggestions during the preparation of this paper. References Agresti, A., 2002. Categorical Data Analysis, second ed. Wiley, New York. Ali, M., 1977. Probability and utility estimates for racetrack betting. J. Political Economy 85, 803 815. Beaver, R.J., Rao, P.V., 1973. On ties in triple comparisons. Trabajos de Estadistica y de Investigacion Operativa 24, 77 92. Bradley, R.A., Terry, M.E., 1952a. The rank analysis of incomplete block designs. I. The method of paired comparisons. Biometrika 39, 324 345. Bradley, R.A., Terry, M.E., 1952b. The rank analysis of incomplete block designs II. The method for blocks of size three. Appendix A, Bi-annual Report No. 4, Va. Agr. Exp. Sta. Cox, D.R., 1972. Regression models and life-tables. J. Roy. Statist. Soc. B 24, 187 220 (with discussion). Davidson, R.R., 1970. On extending the Bradley Terry model to accommodate ties in paired comparison experiments. J. Amer. Statist. Assoc. 65, 317 328. Rao, P.V., Kupper, L.L., 1967. Ties in paired-comparison experiments: a generalization of the Bradley Terry model. J. Amer. Statist. Assoc. 62, 194 204 Corrigenda 63 (December 1968), 1550. SAS Institute, Inc., 2000. SAS/STAT User s Guide, Version 8. Cary, NC. Su, Y., 2003. Modeling rank-order data the Bradley Terry model and its extensions. Ph.D. Dissertation, University of Kentucky. Therneau, T.M., Grambsch, P.M., 2000. Modeling Survival Data: Extending the Cox Model. Springer, New York. Zelterman, D., 2002. Advanced Log-Linear Models. SAS Institute, Cary, NC.