On a connection between the Bradley-Terry model and the Cox proportional hazards model

Similar documents
On a connection between the Bradley Terry model and the Cox proportional hazards model

β j = coefficient of x j in the model; β = ( β1, β2,

Survival Regression Models

Understanding the Cox Regression Models with Time-Change Covariates

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

Relative-risk regression and model diagnostics. 16 November, 2015

Time-dependent coefficients

Survival analysis in R

Lecture 12. Multivariate Survival Data Statistics Survival Analysis. Presented March 8, 2016

The coxvc_1-1-1 package

Lecture 8 Stat D. Gillen

Multivariate Survival Analysis

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

STAT 5500/6500 Conditional Logistic Regression for Matched Pairs

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

MAS3301 / MAS8311 Biostatistics Part II: Survival

Lecture 7. Proportional Hazards Model - Handling Ties and Survival Estimation Statistics Survival Analysis. Presented February 4, 2016

Survival Analysis Math 434 Fall 2011

Multistate models and recurrent event models

Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times

Full likelihood inferences in the Cox model: an empirical likelihood approach

Lecture 7 Time-dependent Covariates in Cox Regression

Cox s proportional hazards/regression model - model assessment

Multistate models and recurrent event models

Survival analysis in R

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

SAS/STAT 13.1 User s Guide. Introduction to Survey Sampling and Analysis Procedures

The Proportional Hazard Model and the Modelling of Recurrent Failure Data: Analysis of a Disconnector Population in Sweden. Sweden

Extensions of Cox Model for Non-Proportional Hazards Purpose

Correlation (pp. 1 of 6)

Survival Analysis for Case-Cohort Studies

SAS/STAT 13.2 User s Guide. Introduction to Survey Sampling and Analysis Procedures

Philosophy and Features of the mstate package

EVALUATION OF EIGEN-VALUE METHOD AND GEOMETRIC MEAN BY BRADLEY-TERRY MODEL IN BINARY AHP

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW

Tied survival times; estimation of survival probabilities

SSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that need work

Nonparametric Bayes Estimator of Survival Function for Right-Censoring and Left-Truncation Data

Chapter 4 Regression Models

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

A Survival Analysis of GMO vs Non-GMO Corn Hybrid Persistence Using Simulated Time Dependent Covariates in SAS

1 Introduction. 2 Residuals in PH model

STAT331. Cox s Proportional Hazards Model

SAS/STAT 14.2 User s Guide. Introduction to Survey Sampling and Analysis Procedures

Section on Survey Research Methods JSM 2010 STATISTICAL GRAPHICS OF PEARSON RESIDUALS IN SURVEY LOGISTIC REGRESSION DIAGNOSIS

Comparison of Hazard, Odds and Risk Ratio in the Two-Sample Survival Problem

ONE MORE TIME ABOUT R 2 MEASURES OF FIT IN LOGISTIC REGRESSION

University of California, Berkeley

Joint Modeling of Longitudinal Item Response Data and Survival

Extensions of Cox Model for Non-Proportional Hazards Purpose

Multivariable Fractional Polynomials

Sample selection with probability proportional to size sampling using SAS and R software

ST745: Survival Analysis: Cox-PH!

A fast routine for fitting Cox models with time varying effects

Package MiRKATS. May 30, 2016

Multivariable Fractional Polynomials

Modelling geoadditive survival data

Lecture 11. Interval Censored and. Discrete-Time Data. Statistics Survival Analysis. Presented March 3, 2016

Cox s proportional hazards model and Cox s partial likelihood

Strati cation in Multivariate Modeling

Fitting Cox Regression Models

A few Murray connections: Extended Bradley-Terry models. Bradley-Terry model. Pair-comparison studies. Some data. Some data

Longitudinal + Reliability = Joint Modeling

Learning Binary Classifiers for Multi-Class Problem

USING MARTINGALE RESIDUALS TO ASSESS GOODNESS-OF-FIT FOR SAMPLED RISK SET DATA

Lecture 9. Statistics Survival Analysis. Presented February 23, Dan Gillen Department of Statistics University of California, Irvine

Survival Analysis I (CHL5209H)

A Regression Model For Recurrent Events With Distribution Free Correlation Structure

A comparison study of the nonparametric tests based on the empirical distributions

MULTIVARIATE ANALYSIS OF VARIANCE

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.

DISPLAYING THE POISSON REGRESSION ANALYSIS

Time-dependent covariates

\It is important that there be options in explication, and equally important that. Joseph Retzer, Market Probe, Inc., Milwaukee WI

Survival Analysis I (CHL5209H)

Booklet of Code and Output for STAD29/STA 1007 Midterm Exam

Longitudinal Modeling with Logistic Regression

UNIVERSITY OF CALIFORNIA, SAN DIEGO

Technical Report - 7/87 AN APPLICATION OF COX REGRESSION MODEL TO THE ANALYSIS OF GROUPED PULMONARY TUBERCULOSIS SURVIVAL DATA

PENALIZING YOUR MODELS

0.1 weibull: Weibull Regression for Duration Dependent

Introduction to Statistical Analysis

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Reduced-rank hazard regression

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

A Model for Correlated Paired Comparison Data

Lecture 22 Survival Analysis: An Introduction

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm

Handling Ties in the Rank Ordered Logit Model Applied in Epidemiological

Asymptotic equivalence of paired Hotelling test and conditional logistic regression

Chapter 7. Network Flow. Slides by Kevin Wayne. Copyright 2005 Pearson-Addison Wesley. All rights reserved.

Latent Class Analysis for Models with Error of Measurement Using Log-Linear Models and An Application to Women s Liberation Data

Appendix A Summary of Tasks. Appendix Table of Contents

7. Assumes that there is little or no multicollinearity (however, SPSS will not assess this in the [binary] Logistic Regression procedure).

Rank Regression with Normal Residuals using the Gibbs Sampler

Package threg. August 10, 2015

Transcription:

On a connection between the Bradley-Terry model and the Cox proportional hazards model Yuhua Su and Mai Zhou Department of Statistics University of Kentucky Lexington, KY 40506-0027, U.S.A. SUMMARY This article addresses a connection between the Bradley and Terry (1952A) model and Cox (1972) proportional hazards model. We show that the partial likelihood of random variables that satisfy the stratified proportional hazards assumption of Cox (1972) coincides with the likelihood given by the Bradley-Terry (BT) model for rank order events. Such a connection not only allows many available software for the Cox model to be used for regression analysis based on the BT model, but also enables the new developments to fit certain rank order data by using the idea stem from the Cox model and its extensions available in many recent literatures from survival analysis. AMS 2000 Subject Classification: Primary 00A69; secondary 60G25. Key Words and Phrases: Rank order data. Partial likelihood. 1. INTRODUCTION RANK ORDER DATA In games such as horse racing and chess, ordered name lists are often of interest and recorded after the comparison of the performance of contestants. Suppose there are N independent (unobservable) random variables T 1,..., T N representing the performance of N contestants. If we compare two random variables T i and T j, where 1 i, j N, then the event {T i > T j } is called a rank-order event for a paired comparison experiment. In general, if we compare K(2 K N) random variables, T i1,..., T ik, where i j are distinct numbers from {1,..., N}, then the event {T i1 > > T ik }, is called a rank-order event for a multiple comparison experiment. The rank order data is a collection of outcomes of rank order events. Example 1 (N = 7, K = 2) Consider the example of paired comparisons in Agresti (2002), p438. Table 1 gives the outcomes of the games within the Eastern Division of the American league in the 1987 baseball season. In this example New York and Baltimore had 13 matches (comparisons) and New York wins 10 of them. From those observations, we may want to estimate the probabilities of rank order events, for example, the event that New York would defeat Baltimore. 1

Winning Losing Team Team Milwaukee Detroit Toronto New York Boston Cleveland Baltimore Milwaukee - 7 9 7 7 9 11 Detroit 6-7 5 11 9 9 Toronto 4 6-7 7 8 12 New York 6 8 6-6 7 10 Boston 6 2 6 7-7 12 Cleveland 4 4 5 6 6-6 Baltimore 2 4 1 3 1 7 - Table 1: Game Outcomes of American League Baseball (K = 2, N = 7) 2. MODEL DEFINITIONS Bradley-Terry model is a popular model used to estimate the probabilities of rank order events based on data as described in section one. 2.1 The Bradley-Terry (BT) Models Bradley and Terry (1952A) do not assume specific distributions for T i in the model description but postulate that every subject i can have a parameter π i > 0, and assume that the event {T i > T j } has a probability given by π j /(π i + π j ). Definition 1 (The BT model (K = 2)) If π i (> 0) is a parameter associated with T i, i = 1,..., N, then the probability of the event {T i > T j } is given by P B (T i > T j ) = π j π i + π j, i j, i, j = 1,..., N. (1) Bradley and Terry (1952B) propose a model for triple comparisons where the events (T i > T j > π k T k ) are assumed to have probabilities given by π j. From here we can easily π i + π j + π k π i + π j generalize the BT model to multiple comparisons. Definition 2 (The BT model (K > 2)) The BT model of K-subject matches assigns the probabilities by the following rule, P B (T i1 > > T ik ) = K j=2 π ij, (2) j π ir where i j s are distinct numbers in {1,..., N}; π i (> 0) s are parameters associated with T i s, respectively. Estimation of the parameters π i can then be obtained by the maximum likelihood method. r=1 2

2.2 The Cox proportional hazards model In the Cox proportional hazards model, one assumption is the existence of the hazard functions λ i (t) for random variables T i, 1 i N. This model, introduced by Cox (1972) presumes that a parameter π i (> 0) of T i affects the hazard function in a multiplicative manner according to λ i (t) = λ 0 (t)π i, i = 1,..., N (3) where λ 0 (t) is an unspecified baseline hazard function. In the survival analysis, usually π i = exp(zi tβ), where Z i is a 1 q vector of covariates for subject i and β is a 1 q vector of unknown parameters. There are many extensions to the proportional hazards model (see Therneau and Grambsch 2000). The one that is relevant to our study here is the stratified proportional hazards model. Basically we shall consider each match/comparison among K contestants to be a distinct stratum. Definition 3 (2 K N) In a stratified Cox proportional hazards model, a random variable, T ij, in a stratum/comparison s has the following hazard function: λ ij (t) = λ s (t)π ij, j = 1,..., K, (4) where i j s are distinct numbers in {1,..., N}, and λ s (t) is the common (but un-specified) baseline hazard shared by random variables T i1,..., T ik, in this particular comparison. This model allows the baseline hazard λ s (t) to change from strata to strata. In the analysis of the stratified proportional hazards model, the statistical inference is typically based on the so called partial likelihood. For details please see Therneau and Grambsch (2000), chapter three. Example ( ) 1 (Continued) Notice that every team plays every other team exactly 13 times. There 7 are = 21 different combinations of pairings between two teams. The total number of pairwise 2 comparisons is 21 13 = 273. Therefore, the stratified Cox model have 21 different baselines here: s = 1, 2,..., 21. Let (i, j) (1 i < j 7) span over all the combinations of possible parings. Denote the performance of team q in the comparison (i, j) by random variable T q,(i,j), q {i, j}, 1 i < j 7. The Stratified Cox proportional hazards model for this paired comparison assumes that the hazards of those random variables satisfy λ q,(i,j) (t) = λ (i,j) (t)π q, where q = i or j, 1 i < j 7. (5) λ (1,2) (t),..., λ (6,7) (t) are unspecified baseline hazards. They may be different from each other. 3

3. MAIN RESULTS Now we are ready to describe the main results of this paper. Theorem 1 Suppose that independent random variables T i satisfy the stratified Cox proportional hazards assumption (4) and T i and T j belong to one stratum, then we have (i) P (T i > T j ) = π j /(π i + π j ); (ii) The contribution to the partial likelihood from the random variables T i and T j with {T i > T j } is also π j /(π i + π j ). The proof of the theorem is straight forward and will be omitted. Recall the BT model definition (1), we immediately conclude that the likelihood of BT model and the partial likelihood of the stratified Cox proportional hazards model are identical. This is an important finding with far reaching consequences: (i) as illustrated in the next section, software developed for stratified Cox proportional hazards model can be used to obtain MLE in BT model since the two estimators are identical. (ii) MLE in BT model can share large sample properties obtained by powerful counting process martingale theory for partial likelihood estimators in Cox model. (iii) likelihood ratio tests are going to be identical in two models, as well as Fisher information matrices; (iv) methods to treat ties proposed in the two different models can borrow strength from each other and give us more options; and (v) more importantly, many extensions to the Cox model, namely handling of right censored observations and use of time dependent covariates, can be introduced to the BT model in analyzing rank order data and give us interesting results. For details please see Su (2003). In multiple comparisons, we have K subjects in a comparison. Let {i 1,..., i K } be the set of indices of the subjects engaged in a comparison. As in the paired comparison case, we have the following theorem. Theorem 2 Let T ij, 1 j K be independent random variables with distributions F ij (t, π ij ) respectively. Then, P (T i1 > > T ik ) = t K t 2 df i1 (t 1, π i1 ) df ik 1 (t K 1, π ik 1 )df i K (t K, π ik ). If furthermore, T i1,..., T ik satisfy the stratified Cox proportional hazards assumption (4) and are in one stratum, then the BT model will assign the probability correctly to the event {T i1 > > T ik }, i.e. K π ij P (T i1 > > T ik ) =. j j=2 π ir 4 r=1

The contribution to the partial likelihood from observations T i1,..., T ik with (T i1 > > T ik ) is the same as the corresponding likelihood given by the BT model in (2). 4. APPLICATION Now we illustrate fitting the BT model using the coxph procedure in R. All the observations for a game should be in one stratum. In Example 1, each game yields two observations for two subjects. Both observations should be in a stratum. After entering the data in counting process style (cf. SAS Institute, Inc. 2000), the following output can be obtained. (For complete R and SAS code please see web page www.ms.uky.edu/ mai/research/examplebtcox.) Call: coxph(formula = Surv(ta, tp, ev) ~ z2 + z3 + z4 + z5 + z6 + z7 + strata(games), weights = sc, method = "efron") coef exp(coef) se(coef) z p z2-0.145 0.865 0.311-0.466 6.4e-01 z3-0.287 0.751 0.310-0.925 3.6e-01 z4-0.334 0.716 0.310-1.076 2.8e-01 z5-0.474 0.623 0.311-1.525 1.3e-01 z6-0.898 0.408 0.317-2.835 4.6e-03 z7-1.581 0.206 0.343-4.607 4.1e-06 Likelihood ratio test=34.0 on 6 df, p=6.84e-06 n=84 The coef for z1 is fixed at zero and thus exp(coef)=1. The estimates of π s given by the coxph procedure are 1, 0.865, 0.751, 0.716, 0.623, 0.408 and 0.206. They are for Milwaukee, Detroit, Toronto, New York, Boston, Cleveland and Baltimore respectively. Using (1), the predicted probability of the event that New York would defeat Baltimore is 0.716/(0.716+0.206) 0.777. The predicted probability of the event that New York would defeat Boston is 0.716/(0.716+0.623) 0.535, etc. The results agree with Agresti (2002). 5. DISCUSSION TIES To cover the possibility of an appreciable amount of ties, Cox (1972) suggests the conditional logistic likelihood. If the continuous time assumption can be made, Therneau and Grambsch (2000) address 5

different methods to handle tied data and their computation issues. Those methods can be easily applied to the rank-order data analysis. Davidson (1970) discusses an extension of the BT model for paired comparisons to situations which allow an expression of no preference, and compared its performance with a model proposed by Rao and Kupper (1967). Beaver and Rao (1973) have developed a model to account for ties in triple comparisons. Unlike the approach of Cox, these extensions of the BT model introduce an additional parameter. Acknowledgments Professors Ali and Zelterman kindly offered invaluable suggestions during the preparation of this paper. References Agresti, A. (2002) Categorical Data Analysis (Second edition). Wiley, New York. Ali, M. (1977). Probability and Utility Estimates for Racetrack Betting. Journal of Political Economy. 85, p.803-815. Beaver, R. J. and Rao, P.V. (1973). On Ties in Triple Comparisons. Trabajos de Estadistica y de Investigacion Operativa. 24, p.77-92. Bradley, R. A. and Terry, M. E. (1952A). The rank analysis of incomplete block designs. I. The method of paired comparisons. Biometrika. 39, p.324-345. Bradley, R. A. and Terry, M. E. (1952B). The rank analysis of incomplete block designs II. The method for blocks of size three. Appendix A, Bi-annual Report No. 4, Va. Agr. Exp. Sta. Cox, D. R. (1972). Regression Models and Life-tables. J. Roy. Statist. Soc. B. 24, p.187-220 (with discussion). Davidson, R. R. (1970). On Extending the Bradley-Terry Model to Accommodate Ties in Paired Comparison Experiments. Journal of the American Statistical Association. 65, p.317-328. Rao, P.V., and Kupper, L. L. (1967) Ties in Paired-Comparison Experiments: A Generalization of the Bradley-Terry model. Journal of the American Statistical Association. 62 (March 1967), p.194-204. Corrigenda, 63 (December 1968), 1550. SAS Institute, Inc. (2000). SAS/STAT User s Guide, Version 8. Cary, NC. Su, Y. (2003). Modeling rank-order data - The Bradley-Terry model and its extensions. Ph.D. Dissertation, University of Kentucky. Therneau, T. M. and Grambsch, P. M. (2000). Modeling Survival Data: Extending the Cox Model. Springer, New York. Zelterman, D. (2002). Advanced Log-Linear Models. Cary, N.C.: SAS Institute. 6