AFT Models and Empirical Likelihood

Similar documents
EMPIRICAL LIKELIHOOD ANALYSIS FOR THE HETEROSCEDASTIC ACCELERATED FAILURE TIME MODEL

Quantile Regression for Residual Life and Empirical Likelihood

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm

Empirical Likelihood in Survival Analysis

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints

11 Survival Analysis and Empirical Likelihood

Least Absolute Deviations Estimation for the Accelerated Failure Time Model. University of Iowa. *

Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

COMPUTATION OF THE EMPIRICAL LIKELIHOOD RATIO FROM CENSORED DATA. Kun Chen and Mai Zhou 1 Bayer Pharmaceuticals and University of Kentucky

LEAST ABSOLUTE DEVIATIONS ESTIMATION FOR THE ACCELERATED FAILURE TIME MODEL

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints and Its Application to Empirical Likelihood

and Comparison with NPMLE

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky

COMPUTE CENSORED EMPIRICAL LIKELIHOOD RATIO BY SEQUENTIAL QUADRATIC PROGRAMMING Kun Chen and Mai Zhou University of Kentucky

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where

Marginal Screening and Post-Selection Inference

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

University of California, Berkeley

Nonparametric Bayes Estimator of Survival Function for Right-Censoring and Left-Truncation Data

CENSORED QUANTILE REGRESSION WITH COVARIATE MEASUREMENT ERRORS

Tests of independence for censored bivariate failure time data

1 Glivenko-Cantelli type theorems

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models

log T = β T Z + ɛ Zi Z(u; β) } dn i (ue βzi ) = 0,

Local Linear Estimation with Censored Data

A Significance Test for the Lasso

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach

Estimation of the Bivariate and Marginal Distributions with Censored Data

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models

Nonparametric rank based estimation of bivariate densities given censored data conditional on marginal probabilities

A Resampling Method on Pivotal Estimating Functions

1 Introduction. 2 Residuals in PH model

TMA 4275 Lifetime Analysis June 2004 Solution

UNIVERSITÄT POTSDAM Institut für Mathematik

TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOZIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississip

Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods. MIT , Fall Due: Wednesday, 07 November 2007, 5:00 PM

INFERENCE ON QUANTILE REGRESSION FOR HETEROSCEDASTIC MIXED MODELS

MAS3301 / MAS8311 Biostatistics Part II: Survival

Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications

University of California, Berkeley

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Censored partial regression

Modeling Real Estate Data using Quantile Regression

Empirical likelihood for linear models in the presence of nuisance parameters

Empirical Likelihood Methods for Sample Survey Data: An Overview

Median regression using inverse censoring weights

A comparison study of the nonparametric tests based on the empirical distributions

Survival Analysis for Case-Cohort Studies

ICSA Applied Statistics Symposium 1. Balanced adjusted empirical likelihood

Empirical Likelihood Inference for Two-Sample Problems

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Inference For High Dimensional M-estimates: Fixed Design Results

Censored quantile regression with varying coefficients

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

Binary choice 3.3 Maximum likelihood estimation

Targeted Maximum Likelihood Estimation for Adaptive Designs: Adaptive Randomization in Community Randomized Trial

exclusive prepublication prepublication discount 25 FREE reprints Order now!

Nonparametric estimation of linear functionals of a multivariate distribution under multivariate censoring with applications.

Quantile Regression for Recurrent Gap Time Data

A review of some semiparametric regression models with application to scoring

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

Quantile regression and heteroskedasticity

Likelihood Ratio Test in High-Dimensional Logistic Regression Is Asymptotically a Rescaled Chi-Square

Goodness-of-fit test for the Cox Proportional Hazard Model

From semi- to non-parametric inference in general time scale models

Investigation of goodness-of-fit test statistic distributions by random censored samples

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach

Full likelihood inferences in the Cox model: an empirical likelihood approach

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling

ST745: Survival Analysis: Nonparametric methods

Jackknife Empirical Likelihood for the Variance in the Linear Regression Model

Published online: 10 Apr 2012.

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES

Outline of GLMs. Definitions

Partial Rank Estimation of Transformation Models with General forms of Censoring

Issues on quantile autoregression

Negative Multinomial Model and Cancer. Incidence

Power-Transformed Linear Quantile Regression With Censored Data

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters

GOODNESS-OF-FIT TEST FOR RANDOMLY CENSORED DATA BASED ON MAXIMUM CORRELATION. Ewa Strzalkowska-Kominiak and Aurea Grané (1)

Likelihood Ratio Tests. that Certain Variance Components Are Zero. Ciprian M. Crainiceanu. Department of Statistical Science

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Regression and Statistical Inference

Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview

Summer School in Statistics for Astronomers V June 1 - June 6, Regression. Mosuk Chow Statistics Department Penn State University.

Economics 536 Lecture 7. Introduction to Specification Testing in Dynamic Econometric Models

A Bivariate Weibull Regression Model

STT 843 Key to Homework 1 Spring 2018

LSS: An S-Plus/R program for the accelerated failure time model to right censored data based on least-squares principle

Nonparametric Heteroscedastic Transformation Regression Models for Skewed Data, with an Application to Health Care Costs

ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES

A note on L convergence of Neumann series approximation in missing data problems

Computational treatment of the error distribution in nonparametric regression with right-censored and selection-biased data

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon

Transcription:

AFT Models and Empirical Likelihood Mai Zhou Department of Statistics, University of Kentucky Collaborators: Gang Li (UCLA); A. Bathke; M. Kim (Kentucky)

Accelerated Failure Time (AFT) models: Y = log(t ) = Xβ + ɛ. The responses T, log(t ) or Y are subject to right censoring. We study the use of empirical likelihood method (Owen 2001) for inference about β.

Owen (1991) studied the use of empirical likelihood with linear models. (no censoring) He defines the Empirical Likelihood as: EL = n i=1 p i ; where pi = 1; p i 0. Is this a likelihood of the Y s or is this the likelihood for the ɛ s? Either interpretation is plausible here.

Owen (1991) identified two different linear models from the viewpoint of Empirical Likelihood (EL): regression models and correlation models, which came from Freedman (1981) bootstrap study of linear models.

Correlation Model: We observe iid random vectors (Y i, X i ); i = 1,..., n where Y i are one dimensional responses, X i are k-dimensional covariates. Unknown parameter β is the minimizer of g(b) = E Y X T b 2.

Regression Model: The covariates x i ; i = 1,..., n are fixed constants, observable k-dimensional vectors. Y i are independent random variables with distributions having location parameters x t i β.

The EL for the two types of models appears identical in Owen (1991) as given above, but have different interpretations and require different proofs for the generalized Wilks theorem under different assumptions. Owen (1991) showed the EL Wilks Theorem holds for both models: 2 log max EL 1/n χ 2 where the max of EL is taken over p i such that 0 = X i ψ(y i X i β 0 )p i.

Notice for a given (non-censored) data set, under either regression or correlation model the two EL ratios have the same value and the least squares estimator ˆβ is also the same under either model. So the two different models only pertain to two different sets of assumptions, under which the EL Wilks theorem hold.

With censored responses, not all Y i are available. Estimation equations, as well as the empirical likelihood needs to be changed. Censored data: randomly right censoring: Q i = min(y i, C i ) ; δ i = I[Y i C i ] where C i are the censoring times.

We see two different ways of defining the empirical likelihood for censored data, corresponding to the two types of linear model: Define the empirical likelihood based on the censored residuals : e i (b) = Q i X i b; δ i : EL (res) = n i=1 (p i ) δ i 1 p j 1 δi. e j e i Notice the order in the summation is based on the resid-

uals. This empirical likelihood is more suitable for regression models.

The second approach is to consider the empirical likelihood for (censored) vectors: (Q i, δ i, X i ): which we shall call it case-wise EL. EL (case) = n i=1 (p i ) δ i 1 p j 1 δi. Q j Q i Notice the order in the summation of the EL is based on Q s. This empirical likelihood is more suitable for correlation models.

The two definitions of the EL is parallel to the two ways of bootstrapping in linear models: (1) re-sample the residuals e i, for the regression models. (2) re-sample the vectors or cases (Y i, X i ), for the correlation models. See for example Freedman (1981).

We consider two types of estimates ˆβ suitable for the two types of EL and AFT models: (1) Buckley-James (1979) estimate. Rely on the fact that the errors (residuals) are iid. More suitable for the regression models. (2) Case weighted estimators (Koul, Susarla, Van Ryzin (1982) [not 1981!], Zhou (1992), Stute (1993, 1994, 1999); Huang, Ma, Xie (2005) etc.) Idea is based on the

(iid) cases (Y i, X i ) or (Q i, δ i, X i ). correlation models. More suitable for the

Let us first consider the Buckley-James estimator and EL (res). Let e i (b) = Q i X i b, be the (censored) residuals. The Buckley-James estimator of β is the solution to the estimation equation 0 = n X i i=1 δ i e i (b) + (1 δ i ) e j (b) ˆF (e j ) j:e j >e i 1 ˆF (e i ),

where ˆF ( ) is the Kaplan-Meier estimator computed from (e i (b), δ i ). The expression in blue is an estimation of the conditional expectation, Ê(Y i X i b Q i, β = b, δ i = 0). There are two summation signs in the above equation. (one for index i one for index j). Exchange the order of the summations we have 0 = n j=1 δ j e j (b) X j + i:e i <e j,δ i =0 X i ˆF (e j ) 1 ˆF (e i ).

This form of Buckley-James estimation equation will lead to the constraint equations we use with the censored empirical likelihood EL res.

The above defined EL (res) is to be maximized with and without the following added constraint equations (with b = β 0 ): 0 = n j=1 p j δ j e j (b) X j + i:e i <e j,δ i =0 X i ˆF (e j ) 1 ˆF (e i ) 1 ˆF (e j ). Notice the same p i appears in this constraint equations and in the definition of EL (res).

Without this constraint, the EL res is maximized at the Kaplan-Meier estimator, computed from e i (β 0 ), δ i. Under null hypothesis, i.e. when b = β 0, the 2logEL res ratio has asymptotically a chi squared distribution: 2 log max EL res max EL res χ 2 Proof See Zhou and Li (2004).

Simulation confirmation of the chi squared limit: Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Now let us consider the correlation model and the case weighted estimator.

Many authors have proposed and studied the case-weighted estimator, including large sample property of the estimator under various assumptions. The earliest reference I can find is Koul, Susarla and Van Ryzin (1982), the latest reference is Huang, Ma and Xie (2005). In between there are: Zhou (1992), Stute (1993, 1994, 1999), Gross and Lai (1996), and van der Laan and Robins (2003): in their book they call this inverse censoring probability weight : since ˆF (Q i ) = δ i 1 Ĝ(Q i ) = w i

The case weighted estimator is defined by the estimating equations (cf. Zhou 1992) 0 = n i=1 w i δ i X i ψ(q i X i b) where w i is the jump of the Kaplan-Meier estimator at Q i computed from (Q i, δ i ). ψ() is monotone, usually the derivative of ρ(). The estimator ˆβ is easy to obtain, no iteration needed.

The constraint equations to work with the EL case are 0 = n i=1 p i δ i X i ψ(q i X i b). The EL (case) is to be maximized with and with out the above constraint equations. We have a chi square limit theorem (Wilks): Under null hypothesis, we have 2 log max EL case max EL case χ 2

in distribution. In the denominator, the max of EL case occurs when p i = jumps of the Kaplan-Meier estimator based on Q i, δ i.

If we interprets the constraint equations as constraints on the marginal DF of Y, then (since the EL is also on the Y marginal), existing results apply.

The constraint equation can be thought of as a constraint on the marginal CDF of the Y g(y, x)df Y (y) = 0

R Censored Data AFT Models Regression Model Buckley-James Est. EL(residual) Wilks Theorem Correlation Model Weighted Est. EL(case) Wilks Theorem

Notice with censored data, the two estimators are different and the two EL are also different. Yet they both have EL ratio converge to chi-square distribution under certain conditions.

Simulation results for correlation AFT models: Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Less censoring: Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Quantile regression is an important model in econometrics (Koenker 2005) Quantile of Y i = βx i Estimate β by arg min n i=1 ρ(y i βx i ) where ρ is the so called check function.

Estimating equation for β 0 = n i=1 X i ψ(y i βx i ) where ψ needs to be defined carefully.

Median regression: ψ(t) = 1, 0 or 1 depend on t > 0, t = 0 or t < 0. Variance of the ˆβ is hard to estimate. Using EL, we do not need to estimate the variance of ˆβ.

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

25% Quantile regression: ψ(t) = 0.5, 0.5 or 1.5 depend on t > 0, t = 0 or t < 0. 75% Quantile regression: ψ(t) = 1.5, 0.5 or 0.5 depend on t > 0, t = 0 or t < 0. These functions equal to 2 times the derivative of the so called check functions defined in Koenker Quantile Regression. We take this definition so that the median regression ψ is the more commonly used -1/+1 function.

The (heteroscedastic) model we are simulating next is generated by Y = 0.5 + 1.5X + 0.5(1 + X)ɛ where ɛ is iid Normal(0,1). X is uniform (0,1) and Y are right censored by 0.5 + 2exp(1). For 25% quantile regression, the true regression line is Y = 0.162755 + 1.162755X

For 75% quantile regression, the true regression line is Y = 0.837245 + 1.837245X From the plot we see that the 25% quantile regression is less sensitive to the right censoring, compared to median regression and 75% quantile regression.

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Q-Q plot of 2 log ELR

Example: Wei 1995) Smallcell Lung Cancer Data (Ying, Jung & log(t ) = β 0 + X 1 β 1 + X 2 β 2 + σ(x 1, X 2 )ɛ X 1 = indicator of treatment; X 2 = age at entry.

Contour plot of 2 log ELR for β 1 β 2 ; β 0 fixed.

Comments about heteroscedasticity. The correlation model can accommodate some heteroscedasticity. When Y is censored, X can be missing. Censoring: C independent of Y. (stronger than conditional independence usually assumed by regression model and Buckley-James estimator). But Buckley-James assumes iis error.

References Buckley, J. J. and James, I. R. (1979). Linear regression with censored data. Biometrika, 66 429-36. Freedman, D.A. (1981). Bootstrapping regression models. Annals of Statist., 9, 1218-1228. Gross, S. and Lai, T.L. (1996). Nonparametric estimation and regression analysis with left truncated and right censored data. J. Amer. Statist. Assoc. 91, 1166-1180. Huang, J., Ma, S. and Xie, H. (2005) Least Absolute Deviations Estimation for the Accelerated Failure Time Model. Tech Report #350 Univ. of Iowa, Dept. Stat. and Actuarial Science. Koul, H., Susarla, V, and Van Ryzin, J. (1982). Least squares regression analysis with censored survival data. In Topics in Applied Statistics, Ed.Y.P. Chaubey and T.D. Dwivedi, pp 151-165. Marcel Dekker, New York. Laan, M. van der, and Robins, J. M. (2003) Unified Methods for Censored Longitudinal Data and Causality. Springer, New York. Owen, A (1991), Empirical likelihood for linear models. Annals of Statist., 19, 1725-1747. Owen, A. (2001). Empirical likelihood. Chapman & Hall, London. Stute, W. (1999). Nonlinear Censored Regression. Statistica Sinica, 9, 4, 1089 1102.

Jin, Z, Lin, D. Y. and Ying, Z. (2006). On least-squares regression with censored data Biometrika 93 147-161 Zhou, M. (1992). M-estimation in censored linear models. Biometrika, 79, 837-841. Zhou, M. and Li, G. (2004). Empirical likelihood analysis of the Buckley-James estimator.