Empirical Processes & Survival Analysis. The Functional Delta Method

Size: px
Start display at page:

Download "Empirical Processes & Survival Analysis. The Functional Delta Method"

Transcription

1 STAT/BMI 741 University of Wisconsin-Madison Empirical Processes & Survival Analysis Lecture 3 The Functional Delta Method Lu Mao lmao@biostat.wisc.edu 3-1

2 Objectives By the end of this lecture, you will learn the intuitive idea of functional delta method see various examples of functional derivatives be able to apply the functional delta method to survival analysis problems such as estimation of the cumulative incidence function of competing risks (Gray s estimator and tests) The Functional Delta Method 3-2

3 Contents 1.1 von-mises Calculus 1.2 Hadamard Differentiable Functions 1.3 Application: The Cumulative Incidence of Competing Risks The Functional Delta Method 3-3

4 Smooth Functionals Consider a parameter that is defined as a functional of the underlying distribution P : θ(p ). Examples: Mean: θ(p ) = P X Variance: θ(p ) = P (X P X) 2 Quantiles: θ(f ) = inf{ξ : F (ξ) p} The Functional Delta Method 3-4

5 Smooth Functionals The natural estimator is θ(p n ): Examples: Sample mean: θ(p n ) = P n X Sample variance: θ(p n ) = P n (X P n X) 2 Sample quantiles: θ( F n ) = inf{ξ : F n (ξ) p} The Functional Delta Method 3-5

6 Smooth Functionals How to derive the asymptotic distribution of θ(p n )? Note n(θ(pn ) θ(p )) = ) n( θ(p + n 1/2 G n ) θ(p ). (3.1) Now, view G n as if it were a fixed quantity H (not an entirely strange thing to do since G is stabilized over a Donsker class). Then the right hand side (3.1) can be written as for some linear operator θ. θ(p + n 1/2 H) θ(p ) n 1/2 θ(h), The Functional Delta Method 3-6

7 Smooth Functionals The linear operator θ can be computed by θ[h] = θ(p + ɛh). ɛ ɛ= A linear functional θ on the signed measures H can always be represented by θ[h] = HΦ for some function Φ. The Functional Delta Method 3-7

8 Smooth Functionals So n(θ(pn ) θ(p )) = θ[g n ] = G n Φ + o P (1). The influence function usually presents itself during the calculation of θ. The Functional Delta Method 3-8

9 Smooth Functionals Example 1: θ(p ) = P X. θ[h] = (P + ɛh)x = HX. ɛ ɛ= Hence n(pn X P X) = G n X + o p (1). This is actually a quite trivial example where the remainder is obviously zero. The Functional Delta Method 3-9

10 Smooth Functionals Example 2: θ(p ) = P (X P X) 2 =: σ 2. θ[h] = ( (P + ɛh) X (P + ɛh)x ɛ ɛ= = H(X P X) 2 2HP (X P X) = H(X P X) 2. ) 2 Hence n ( P n (X P n X) 2 σ 2) = G n (X P X) 2 + o p (1). The Functional Delta Method 3-1

11 Smooth Functionals Example 3: θ(f ) = F 1 (p) =: ξ p. Re-define θ(p ) such that P 1(X θ(p )) = p. Hence (P + ɛh)1(x θ(p + ɛh)) =. ɛ ɛ= By the chain rule, H1(X θ(p )) + f(θ(p )) θ[h] =. So, θ[h] = H1(X ξ p). f(ξ p ) The Functional Delta Method 3-11

12 Smooth Functionals So n(θ( Fn ) ξ p ) = G n 1(X ξ p ) f(ξ p ) The same result as derived in Example o P (1). The Functional Delta Method 3-12

13 Contents 1.1 von-mises Calculus 1.2 Hadamard Differentiable Functions 1.3 Application: The Cumulative Incidence of Competing Risks The Functional Delta Method 3-13

14 Hadamard Differentiable Functions In the previous section, we have treated G n as if it were fixed. If G n on a Donsker class, then it eventually ranges over a compact set. So for first-order approximation, we need the stronger condition that for every compact set K θ(p + ɛh) θ(p ) sup ɛ H K θ(h). If there exists such a linear function θ, then θ(p ) is said to be Hadamard differentiable at P with derivative θ. The Functional Delta Method 3-14

15 Hadamard Differentiable Functions More generally, let φ(η) be a Hadamard differentiable function on a (functional) parameter η, and let η n be an estimator of η, such that η n η is tight: n( ηn η ) = G n Ψ + o P (1). Let φ be the derivative (a linear operator) of φ at η. Then n(φ( ηn ) φ(η )) = G n φ[ψ] + op (1). The Functional Delta Method 3-15

16 Hadamard Differentiable Functions Many functions are Hadamard differentiable. We omit the technical proofs. Example: φ(f, G) = F dg. φ F,G [h 1, h 2 ] = (F + ɛh 1 )d(g + ɛh 2 ) ɛ ɛ= = h 1 dg + F dh 2. Application: Mann-Whitnet statistic. The Functional Delta Method 3-16

17 Hadamard Differentiable Functions Example 1.3 (Nelsen-Aalen Estimator, cont d) We have shown that the Nelsen-Aalen estimator takes the form Λ(t) = t P n dn(s) P n Y (s) The Functional Delta Method 1-17

18 Hadamard Differentiable Functions Example 1.3 (Nelsen-Aalen Estimator, cont d) Now, view Λ as a function φ of P, where Obviously, and φ(p ) = P dn(s) P Y (s). Λ( ) = φ(p n ), Λ ( ) = φ(p ). The Functional Delta Method 1-18

19 Hadamard Differentiable Functions Example 1.3 (Nelsen-Aalen Estimator, cont d) The functional derivative can be calculated by φ[h] = φ(p + ɛh) ɛ ɛ= = (P + ɛh)dn(s) ɛ ɛ= (P + ɛh)y (s) { = H = H = H dn(s) π(s) dn(s) Y (s)dλ (s) π(s) dm Λ (s). π(s) The same as derived in Example 1.3 of 1. Y (s)p dn(s) } π(s) 2 The Functional Delta Method 1-19

20 Kaplan-Meier as Product Limit Functional We consider the product limit functional mapping the space of cadlag functions on [, τ] into itself: φ(a)(t) = t (1 + da(s)) = lim s= s i s i 1 {1 + A(s i ) A(s i 1 )}, where the second equality is the definition and the limit is over partitions = s < s 1 < < s m = t. with maximum separation decreasing to zero. i The Functional Delta Method 1-2

21 Kaplan-Meier as Product Limit Functional To derive the functional derivative of φ(a), observe φ A (H)(t) = φ(a + ɛh)(t) ɛ ɛ= = lim {1 + A(s i ) A(s i 1 ) + ɛ(h(s i ) H(s i 1 ))} ɛ ɛ= s i s i 1 i = lim (H(s i ) H(s i 1 ) + A(s j ) A(s j 1 )} s i s i 1 j i{1 i = lim {1 + A(s i ) A(s i 1 )} s i s i 1 i i = φ(a)(t) t where A(s) = A(s) A(s ). {1 + A(s)} 1 dh(s), H(s i ) H(s i 1 ) 1 + A(s i ) A(s i 1 ) The Functional Delta Method 1-21

22 Kaplan-Meier as Product Limit Functional Note that the Kaplan-Meier estimator can be expressed as a product limit functional of Nelsen-Aalen estimator: Thus, Ŝ n (t) = φ( Λ n )(t) = n( Ŝ n S )(t) = G n S (t) t = G n S (t) t t (1 d Λ n (s)). s= 1 (1 Λ (s))π(s) dm Λ (s) + o P (1) 1 π(s) dm Λ (s) + o P (1). (when Λ is continuous.) The Functional Delta Method 1-22

23 Contents 1.1 von-mises Calculus 1.2 Hadamard Differentiable Functions 1.3 Application: The Cumulative Incidence of Competing Risks The Functional Delta Method 1-23

24 Competing Risks Competing risks data arise when each subject can experience one and only one of several competing causes of failure. Examples: death from cancer vs death related to treatment (e.g., chemotherapy). The Functional Delta Method 1-24

25 Competing Risks Competing risks data (T, D), T is failure time and D = 1,, J is the cause of failure. We can image (T, D) as arising from a vector of J latent competing failure times, T 1,, T J such that T = min{ T 1,, T J }, and D is the subscript of the first event time among the T j. The Functional Delta Method 1-25

26 Competing Risks However, the joint distribution of ( T 1,, T J ) cannot be identified from the observed data (T, D). Even independence is not identifiable. To make inference on the latent event times one has to make strong and usually unrealistic assumptions such as independence. The alternative is to stick with identifiable quantities based on (T, D). The Functional Delta Method 1-26

27 Competing Risks A popular quantity that is identifiable is the cause-specific hazard: dλ j (t) = Pr(t T < t + dt, D = j T t), i.e., the instantaneous rate for the jth cause of failure given survival to that point. The cause-specific hazard can be estimated by the Nelsen-Aalen estimator treating other causes as censoring. The cause-specific hazard Λ j reduces to the net hazard of T j if all other T k, k j, are independent of T j. The Functional Delta Method 1-27

28 Competing Risks Another identifiable quantity that is often of interest is the sub-distribution: F j (t) = Pr(T t, D = j), i.e., the cumulative incidence of the jth cause of failure in the presence of other causes. The sub-distribution F j (t) is not a functional of the cause-specific hazard Λ j (t); in particular, one cannot use the naive Kaplan-Meier curve (product limit of the Nelsen-Aalen estimator for the cause-specific hazard) to estimate the sub-distribution. The Functional Delta Method 1-28

29 Competing Risks Observe that df j (t) = Pr(t T < t + dt, D = j) = Pr(T t)pr(t T < t + dt, D = j T t) = S(t )dλ j (t), where S(t) = Pr(T > t). So F j (t) = t S(s )dλ j (s). The Functional Delta Method 1-29

30 Competing Risks Hence we can estimate the sub-distribution by F jn (t) = t Ŝ n (s )d Λ jn (s), where Ŝn is the Kaplan-Meier estimator for the overall survival function and Λ jn is the Nelsen-Aalen estimator fort the cause-specific hazard function of the jth cause of failure. The Functional Delta Method 1-3

31 Competing Risks Specifically, let C be the independent censoring time, then the observed data are {T I(T C), DI(T C), I(T C)}. The observed data can also be represented by N(t) = I(T t C), Y (t) = I(T C t), and N j (t) = I(T t C, D = j), j = 1,, J. Denote π(s) = PY (s), M Λ (t) = N(t) t Y (s)dλ(s), M Λj (t) = N j (t) t Y (s)dλ j(s), where Λ is the hazard function for T. The Functional Delta Method 1-31

32 Competing Risks We focus on the estimation of the cumulative incidence of the first cause of failure. We know that and n( Ŝ n S )(t) = G n S (t) t n( Λ1n Λ 1 )(t) = G n t 1 π(s) dm Λ (s) + o P (1), 1 π(s) dm Λ 1 (s) + o P (1) The Functional Delta Method 1-32

33 Competing Risks Since F 1n = φ(ŝn, Λ 1n ) and F 1 = φ(s, Λ 1 ), where φ(s, Λ 1 ) = We know that φ S,Λ 1 [H 1, H 2 ](t) = t S(s )dλ 1 (s). H 1 (s )dλ 1 (s) + t S (s )dh 2 (s). The Functional Delta Method 1-33

34 Competing Risks So, ( ) [ 1 n F1n F 1 (t) = G n φs,λ 1 S ( ) π(s) dm Λ (s), 1 ] π(s) dm Λ 1 (s) (t) + o P (1) t = G n S (u ) u 1 π(s) dm Λ (s)dλ 1 (u) t 1 + G n S (s ) π(s) dm Λ 1 (s) + o P (1). The Functional Delta Method 1-34

35 Competing Risks t To simplify the first term on the right, use integration by parts, u 1 S (u ) π(s) dm Λ (s)dλ 1 (u) = t u = F 1 (u) = t t 1 π(s) dm Λ (s)df 1 (u) 1 π(s) dm Λ (s) t u= u F 1 (s) π(s) dm Λ (s) F 1 (t) F 1 (s) dm Λ (s) π(s) The Functional Delta Method 1-35

36 Competing Risks Therefore, to conclude, ( ) t F 1 (t) F 1 (s) n F1n F 1 (t) = G n dm Λ (s) π(s) t 1 + G n S (s ) π(s) dm Λ 1 (s) + o P (1). The Functional Delta Method 1-36

37 Concluding Remarks The functional delta method is a powerful tool in semiparametric inference, particularly survival analysis. We have provided a few simple examples in this lecture. For a more formal treatment of the functional delta method and more examples refer to van der Vaart (1998, chap 2) and Andersen et al. (1993, II.8). The Functional Delta Method 1-37

38 References - Andersen, P. K., Borgan, O., Gill, R. D., & Keiding, N. (1993). Statistical models based on counting processes. Springer Science & Business Media. - van der Vaart, A. W. (1998). Asymptotic Statistics. Cambridge University Press. The Functional Delta Method 1-38

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics.

Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics. Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics. Dragi Anevski Mathematical Sciences und University November 25, 21 1 Asymptotic distributions for statistical

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

Asymptotic statistics using the Functional Delta Method

Asymptotic statistics using the Functional Delta Method Quantiles, Order Statistics and L-Statsitics TU Kaiserslautern 15. Februar 2015 Motivation Functional The delta method introduced in chapter 3 is an useful technique to turn the weak convergence of random

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

Continuous case Discrete case General case. Hazard functions. Patrick Breheny. August 27. Patrick Breheny Survival Data Analysis (BIOS 7210) 1/21

Continuous case Discrete case General case. Hazard functions. Patrick Breheny. August 27. Patrick Breheny Survival Data Analysis (BIOS 7210) 1/21 Hazard functions Patrick Breheny August 27 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/21 Introduction Continuous case Let T be a nonnegative random variable representing the time to an event

More information

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve

More information

Understanding product integration. A talk about teaching survival analysis.

Understanding product integration. A talk about teaching survival analysis. Understanding product integration. A talk about teaching survival analysis. Jan Beyersmann, Arthur Allignol, Martin Schumacher. Freiburg, Germany DFG Research Unit FOR 534 jan@fdm.uni-freiburg.de It is

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

Statistical Analysis of Competing Risks With Missing Causes of Failure

Statistical Analysis of Competing Risks With Missing Causes of Failure Proceedings 59th ISI World Statistics Congress, 25-3 August 213, Hong Kong (Session STS9) p.1223 Statistical Analysis of Competing Risks With Missing Causes of Failure Isha Dewan 1,3 and Uttara V. Naik-Nimbalkar

More information

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling Supplementary Materials: Proofs of the Main Results S1 Verification of the weight function v i (t) for the lengthbiased

More information

ST745: Survival Analysis: Nonparametric methods

ST745: Survival Analysis: Nonparametric methods ST745: Survival Analysis: Nonparametric methods Eric B. Laber Department of Statistics, North Carolina State University February 5, 2015 The KM estimator is used ubiquitously in medical studies to estimate

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

DAGStat Event History Analysis.

DAGStat Event History Analysis. DAGStat 2016 Event History Analysis Robin.Henderson@ncl.ac.uk 1 / 75 Schedule 9.00 Introduction 10.30 Break 11.00 Regression Models, Frailty and Multivariate Survival 12.30 Lunch 13.30 Time-Variation and

More information

Estimation for Modified Data

Estimation for Modified Data Definition. Estimation for Modified Data 1. Empirical distribution for complete individual data (section 11.) An observation X is truncated from below ( left truncated) at d if when it is at or below d

More information

1 Glivenko-Cantelli type theorems

1 Glivenko-Cantelli type theorems STA79 Lecture Spring Semester Glivenko-Cantelli type theorems Given i.i.d. observations X,..., X n with unknown distribution function F (t, consider the empirical (sample CDF ˆF n (t = I [Xi t]. n Then

More information

Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong Tseng

Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong Tseng Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong seng Reference: 1. Andersen, Borgan, Gill, and Keiding (1993). Statistical Model Based on Counting Processes, Springer-Verlag, p.229-255

More information

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL The Cox PH model: λ(t Z) = λ 0 (t) exp(β Z). How do we estimate the survival probability, S z (t) = S(t Z) = P (T > t Z), for an individual with covariates

More information

Theoretical Statistics. Lecture 19.

Theoretical Statistics. Lecture 19. Theoretical Statistics. Lecture 19. Peter Bartlett 1. Functional delta method. [vdv20] 2. Differentiability in normed spaces: Hadamard derivatives. [vdv20] 3. Quantile estimates. [vdv21] 1 Recall: Delta

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen The Annals of Statistics 21, Vol. 29, No. 5, 1344 136 A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1 By Thomas H. Scheike University of Copenhagen We present a non-parametric survival model

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and

More information

Consistency of bootstrap procedures for the nonparametric assessment of noninferiority with random censorship

Consistency of bootstrap procedures for the nonparametric assessment of noninferiority with random censorship Consistency of bootstrap procedures for the nonparametric assessment of noninferiority with random censorship Gudrun Freitag 1 and Axel Mun Institut für Mathematische Stochasti Georg-August-Universität

More information

Survival Analysis I (CHL5209H)

Survival Analysis I (CHL5209H) Survival Analysis Dalla Lana School of Public Health University of Toronto olli.saarela@utoronto.ca January 7, 2015 31-1 Literature Clayton D & Hills M (1993): Statistical Models in Epidemiology. Not really

More information

Lecture 3. Truncation, length-bias and prevalence sampling

Lecture 3. Truncation, length-bias and prevalence sampling Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in

More information

EMPIRICAL LIKELIHOOD AND DIFFERENTIABLE FUNCTIONALS

EMPIRICAL LIKELIHOOD AND DIFFERENTIABLE FUNCTIONALS University of Kentucky UKnowledge Theses and Dissertations--Statistics Statistics 2016 EMPIRICAL LIKELIHOOD AND DIFFERENTIABLE FUNCTIONALS Zhiyuan Shen University of Kentucky, alanshenpku10@gmail.com Digital

More information

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t)

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t) PhD course in Advanced survival analysis. (ABGK, sect. V.1.1) One-sample tests. Counting process N(t) Non-parametric hypothesis tests. Parametric models. Intensity process λ(t) = α(t)y (t) satisfying Aalen

More information

Exercises. (a) Prove that m(t) =

Exercises. (a) Prove that m(t) = Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for

More information

Analysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time

Analysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time Analysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term

More information

Multi-state models: prediction

Multi-state models: prediction Department of Medical Statistics and Bioinformatics Leiden University Medical Center Course on advanced survival analysis, Copenhagen Outline Prediction Theory Aalen-Johansen Computational aspects Applications

More information

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models NIH Talk, September 03 Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models Eric Slud, Math Dept, Univ of Maryland Ongoing joint project with Ilia

More information

Chapter 4 Fall Notations: t 1 < t 2 < < t D, D unique death times. d j = # deaths at t j = n. Y j = # at risk /alive at t j = n

Chapter 4 Fall Notations: t 1 < t 2 < < t D, D unique death times. d j = # deaths at t j = n. Y j = # at risk /alive at t j = n Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 4 Fall 2012 4.2 Estimators of the survival and cumulative hazard functions for RC data Suppose X is a continuous random failure time with

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information

Lecture 22 Survival Analysis: An Introduction

Lecture 22 Survival Analysis: An Introduction University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which

More information

MODELING THE SUBDISTRIBUTION OF A COMPETING RISK

MODELING THE SUBDISTRIBUTION OF A COMPETING RISK Statistica Sinica 16(26), 1367-1385 MODELING THE SUBDISTRIBUTION OF A COMPETING RISK Liuquan Sun 1, Jingxia Liu 2, Jianguo Sun 3 and Mei-Jie Zhang 2 1 Chinese Academy of Sciences, 2 Medical College of

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

STAT 331. Martingale Central Limit Theorem and Related Results

STAT 331. Martingale Central Limit Theorem and Related Results STAT 331 Martingale Central Limit Theorem and Related Results In this unit we discuss a version of the martingale central limit theorem, which states that under certain conditions, a sum of orthogonal

More information

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Progress, Updates, Problems William Jen Hoe Koh May 9, 2013 Overview Marginal vs Conditional What is TMLE? Key Estimation

More information

A multi-state model for the prognosis of non-mild acute pancreatitis

A multi-state model for the prognosis of non-mild acute pancreatitis A multi-state model for the prognosis of non-mild acute pancreatitis Lore Zumeta Olaskoaga 1, Felix Zubia Olaskoaga 2, Guadalupe Gómez Melis 1 1 Universitat Politècnica de Catalunya 2 Intensive Care Unit,

More information

Appendix. Proof of Theorem 1. Define. [ ˆΛ 0(D) ˆΛ 0(t) ˆΛ (t) ˆΛ. (0) t. X 0 n(t) = D t. and. 0(t) ˆΛ 0(0) g(t(d t)), 0 < t < D, t.

Appendix. Proof of Theorem 1. Define. [ ˆΛ 0(D) ˆΛ 0(t) ˆΛ (t) ˆΛ. (0) t. X 0 n(t) = D t. and. 0(t) ˆΛ 0(0) g(t(d t)), 0 < t < D, t. Appendix Proof of Theorem. Define [ ˆΛ (D) X n (t) = ˆΛ (t) D t ˆΛ (t) ˆΛ () g(t(d t)), t < t < D X n(t) = [ ˆΛ (D) ˆΛ (t) D t ˆΛ (t) ˆΛ () g(t(d t)), < t < D, t where ˆΛ (t) = log[exp( ˆΛ(t)) + ˆp/ˆp,

More information

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky EMPIRICAL ENVELOPE MLE AND LR TESTS Mai Zhou University of Kentucky Summary We study in this paper some nonparametric inference problems where the nonparametric maximum likelihood estimator (NPMLE) are

More information

Quantile Regression for Residual Life and Empirical Likelihood

Quantile Regression for Residual Life and Empirical Likelihood Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

Survival analysis in R

Survival analysis in R Survival analysis in R Niels Richard Hansen This note describes a few elementary aspects of practical analysis of survival data in R. For further information we refer to the book Introductory Statistics

More information

Theoretical Statistics. Lecture 17.

Theoretical Statistics. Lecture 17. Theoretical Statistics. Lecture 17. Peter Bartlett 1. Asymptotic normality of Z-estimators: classical conditions. 2. Asymptotic equicontinuity. 1 Recall: Delta method Theorem: Supposeφ : R k R m is differentiable

More information

A Comparison of Different Approaches to Nonparametric Inference for Subdistributions

A Comparison of Different Approaches to Nonparametric Inference for Subdistributions A Comparison of Different Approaches to Nonparametric Inference for Subdistributions Johannes Mertsching Johannes.Mertsching@gmail.com Master Thesis Supervision: dr. Ronald B. Geskus prof. Chris A.J. Klaassen

More information

Goodness-of-Fit Tests With Right-Censored Data by Edsel A. Pe~na Department of Statistics University of South Carolina Colloquium Talk August 31, 2 Research supported by an NIH Grant 1 1. Practical Problem

More information

Part III Measures of Classification Accuracy for the Prediction of Survival Times

Part III Measures of Classification Accuracy for the Prediction of Survival Times Part III Measures of Classification Accuracy for the Prediction of Survival Times Patrick J Heagerty PhD Department of Biostatistics University of Washington 102 ISCB 2010 Session Three Outline Examples

More information

Estimation of the Bivariate and Marginal Distributions with Censored Data

Estimation of the Bivariate and Marginal Distributions with Censored Data Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the

More information

A Regression Model for the Copula Graphic Estimator

A Regression Model for the Copula Graphic Estimator Discussion Papers in Economics Discussion Paper No. 11/04 A Regression Model for the Copula Graphic Estimator S.M.S. Lo and R.A. Wilke April 2011 2011 DP 11/04 A Regression Model for the Copula Graphic

More information

From semi- to non-parametric inference in general time scale models

From semi- to non-parametric inference in general time scale models From semi- to non-parametric inference in general time scale models Thierry DUCHESNE duchesne@matulavalca Département de mathématiques et de statistique Université Laval Québec, Québec, Canada Research

More information

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University Survival Analysis: Weeks 2-3 Lu Tian and Richard Olshen Stanford University 2 Kaplan-Meier(KM) Estimator Nonparametric estimation of the survival function S(t) = pr(t > t) The nonparametric estimation

More information

Comparing Distribution Functions via Empirical Likelihood

Comparing Distribution Functions via Empirical Likelihood Georgia State University ScholarWorks @ Georgia State University Mathematics and Statistics Faculty Publications Department of Mathematics and Statistics 25 Comparing Distribution Functions via Empirical

More information

Survival Regression Models

Survival Regression Models Survival Regression Models David M. Rocke May 18, 2017 David M. Rocke Survival Regression Models May 18, 2017 1 / 32 Background on the Proportional Hazards Model The exponential distribution has constant

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued

Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations Research

More information

Multistate Modeling and Applications

Multistate Modeling and Applications Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

Nonparametric Model Construction

Nonparametric Model Construction Nonparametric Model Construction Chapters 4 and 12 Stat 477 - Loss Models Chapters 4 and 12 (Stat 477) Nonparametric Model Construction Brian Hartman - BYU 1 / 28 Types of data Types of data For non-life

More information

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data 1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions

More information

Tests of independence for censored bivariate failure time data

Tests of independence for censored bivariate failure time data Tests of independence for censored bivariate failure time data Abstract Bivariate failure time data is widely used in survival analysis, for example, in twins study. This article presents a class of χ

More information

SEMIPARAMETRIC LIKELIHOOD RATIO INFERENCE. By S. A. Murphy 1 and A. W. van der Vaart Pennsylvania State University and Free University Amsterdam

SEMIPARAMETRIC LIKELIHOOD RATIO INFERENCE. By S. A. Murphy 1 and A. W. van der Vaart Pennsylvania State University and Free University Amsterdam The Annals of Statistics 1997, Vol. 25, No. 4, 1471 159 SEMIPARAMETRIC LIKELIHOOD RATIO INFERENCE By S. A. Murphy 1 and A. W. van der Vaart Pennsylvania State University and Free University Amsterdam Likelihood

More information

Empirical Processes: General Weak Convergence Theory

Empirical Processes: General Weak Convergence Theory Empirical Processes: General Weak Convergence Theory Moulinath Banerjee May 18, 2010 1 Extended Weak Convergence The lack of measurability of the empirical process with respect to the sigma-field generated

More information

The International Journal of Biostatistics

The International Journal of Biostatistics The International Journal of Biostatistics Volume 1, Issue 1 2005 Article 3 Score Statistics for Current Status Data: Comparisons with Likelihood Ratio and Wald Statistics Moulinath Banerjee Jon A. Wellner

More information

Multivariate Survival Data With Censoring.

Multivariate Survival Data With Censoring. 1 Multivariate Survival Data With Censoring. Shulamith Gross and Catherine Huber-Carol Baruch College of the City University of New York, Dept of Statistics and CIS, Box 11-220, 1 Baruch way, 10010 NY.

More information

Investigation of goodness-of-fit test statistic distributions by random censored samples

Investigation of goodness-of-fit test statistic distributions by random censored samples d samples Investigation of goodness-of-fit test statistic distributions by random censored samples Novosibirsk State Technical University November 22, 2010 d samples Outline 1 Nonparametric goodness-of-fit

More information

An augmented inverse probability weighted survival function estimator

An augmented inverse probability weighted survival function estimator An augmented inverse probability weighted survival function estimator Sundarraman Subramanian & Dipankar Bandyopadhyay Abstract We analyze an augmented inverse probability of non-missingness weighted estimator

More information

M- and Z- theorems; GMM and Empirical Likelihood Wellner; 5/13/98, 1/26/07, 5/08/09, 6/14/2010

M- and Z- theorems; GMM and Empirical Likelihood Wellner; 5/13/98, 1/26/07, 5/08/09, 6/14/2010 M- and Z- theorems; GMM and Empirical Likelihood Wellner; 5/13/98, 1/26/07, 5/08/09, 6/14/2010 Z-theorems: Notation and Context Suppose that Θ R k, and that Ψ n : Θ R k, random maps Ψ : Θ R k, deterministic

More information

Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes.

Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes. Unit 2: Models, Censoring, and Likelihood for Failure-Time Data Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes. Ramón

More information

Lectures on Survival Analysis

Lectures on Survival Analysis 1 Lectures on Survival Analysis Richard D. Gill Mathematical Institute, University Utrecht, Budapestlaan 6, 3584 CD Utrecht, Netherlands. gill@math.ruu.nl To appear in: Ecole d Eté de Probabilités de Saint

More information

Distance between multinomial and multivariate normal models

Distance between multinomial and multivariate normal models Chapter 9 Distance between multinomial and multivariate normal models SECTION 1 introduces Andrew Carter s recursive procedure for bounding the Le Cam distance between a multinomialmodeland its approximating

More information

Kernel density estimation in R

Kernel density estimation in R Kernel density estimation in R Kernel density estimation can be done in R using the density() function in R. The default is a Guassian kernel, but others are possible also. It uses it s own algorithm to

More information

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Patrick J. Heagerty PhD Department of Biostatistics University of Washington 166 ISCB 2010 Session Four Outline Examples

More information

Goodness-of-fit test for the Cox Proportional Hazard Model

Goodness-of-fit test for the Cox Proportional Hazard Model Goodness-of-fit test for the Cox Proportional Hazard Model Rui Cui rcui@eco.uc3m.es Department of Economics, UC3M Abstract In this paper, we develop new goodness-of-fit tests for the Cox proportional hazard

More information

TMA 4275 Lifetime Analysis June 2004 Solution

TMA 4275 Lifetime Analysis June 2004 Solution TMA 4275 Lifetime Analysis June 2004 Solution Problem 1 a) Observation of the outcome is censored, if the time of the outcome is not known exactly and only the last time when it was observed being intact,

More information

Likelihood ratio confidence bands in nonparametric regression with censored data

Likelihood ratio confidence bands in nonparametric regression with censored data Likelihood ratio confidence bands in nonparametric regression with censored data Gang Li University of California at Los Angeles Department of Biostatistics Ingrid Van Keilegom Eindhoven University of

More information

Resampling methods for randomly censored survival data

Resampling methods for randomly censored survival data Resampling methods for randomly censored survival data Markus Pauly Lehrstuhl für Mathematische Statistik und Wahrscheinlichkeitstheorie Mathematisches Institut Heinrich-Heine-Universität Düsseldorf Wien,

More information

Chapter 2 Inference on Mean Residual Life-Overview

Chapter 2 Inference on Mean Residual Life-Overview Chapter 2 Inference on Mean Residual Life-Overview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Lecture 2: CDF and EDF

Lecture 2: CDF and EDF STAT 425: Introduction to Nonparametric Statistics Winter 2018 Instructor: Yen-Chi Chen Lecture 2: CDF and EDF 2.1 CDF: Cumulative Distribution Function For a random variable X, its CDF F () contains all

More information

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/25 Right censored

More information

Machine Learning. Module 3-4: Regression and Survival Analysis Day 2, Asst. Prof. Dr. Santitham Prom-on

Machine Learning. Module 3-4: Regression and Survival Analysis Day 2, Asst. Prof. Dr. Santitham Prom-on Machine Learning Module 3-4: Regression and Survival Analysis Day 2, 9.00 16.00 Asst. Prof. Dr. Santitham Prom-on Department of Computer Engineering, Faculty of Engineering King Mongkut s University of

More information

A General Kernel Functional Estimator with Generalized Bandwidth Strong Consistency and Applications

A General Kernel Functional Estimator with Generalized Bandwidth Strong Consistency and Applications A General Kernel Functional Estimator with Generalized Bandwidth Strong Consistency and Applications Rafael Weißbach Fakultät Statistik, Universität Dortmund e-mail: Rafael.Weissbach@uni-dortmund.de March

More information

Longitudinal + Reliability = Joint Modeling

Longitudinal + Reliability = Joint Modeling Longitudinal + Reliability = Joint Modeling Carles Serrat Institute of Statistics and Mathematics Applied to Building CYTED-HAROSA International Workshop November 21-22, 2013 Barcelona Mainly from Rizopoulos,

More information

arxiv:submit/ [math.st] 6 May 2011

arxiv:submit/ [math.st] 6 May 2011 A Continuous Mapping Theorem for the Smallest Argmax Functional arxiv:submit/0243372 [math.st] 6 May 2011 Emilio Seijo and Bodhisattva Sen Columbia University Abstract This paper introduces a version of

More information

Sample-weighted semiparametric estimates of cause-specific cumulative incidence using left-/interval censored data from electronic health records

Sample-weighted semiparametric estimates of cause-specific cumulative incidence using left-/interval censored data from electronic health records 1 / 22 Sample-weighted semiparametric estimates of cause-specific cumulative incidence using left-/interval censored data from electronic health records Noorie Hyun, Hormuzd A. Katki, Barry I. Graubard

More information

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks Y. Xu, D. Scharfstein, P. Mueller, M. Daniels Johns Hopkins, Johns Hopkins, UT-Austin, UF JSM 2018, Vancouver 1 What are semi-competing

More information

Semiparametric posterior limits

Semiparametric posterior limits Statistics Department, Seoul National University, Korea, 2012 Semiparametric posterior limits for regular and some irregular problems Bas Kleijn, KdV Institute, University of Amsterdam Based on collaborations

More information

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 7 Fall 2012 Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample H 0 : S(t) = S 0 (t), where S 0 ( ) is known survival function,

More information

Reinforced urns and the subdistribution beta-stacy process prior for competing risks analysis

Reinforced urns and the subdistribution beta-stacy process prior for competing risks analysis Reinforced urns and the subdistribution beta-stacy process prior for competing risks analysis Andrea Arfè 1 Stefano Peluso 2 Pietro Muliere 1 1 Università Commerciale Luigi Bocconi 2 Università Cattolica

More information

Effects of a Misattributed Cause of Death on Cancer Mortality

Effects of a Misattributed Cause of Death on Cancer Mortality Effects of a Misattributed Cause of Death on Cancer Mortality by Jinkyung Ha A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy (Biostatistics) in

More information

Lecture 2: Martingale theory for univariate survival analysis

Lecture 2: Martingale theory for univariate survival analysis Lecture 2: Martingale theory for univariate survival analysis In this lecture T is assumed to be a continuous failure time. A core question in this lecture is how to develop asymptotic properties when

More information

Stat 710: Mathematical Statistics Lecture 31

Stat 710: Mathematical Statistics Lecture 31 Stat 710: Mathematical Statistics Lecture 31 Jun Shao Department of Statistics University of Wisconsin Madison, WI 53706, USA Jun Shao (UW-Madison) Stat 710, Lecture 31 April 13, 2009 1 / 13 Lecture 31:

More information

Nonparametric two-sample tests of longitudinal data in the presence of a terminal event

Nonparametric two-sample tests of longitudinal data in the presence of a terminal event Nonparametric two-sample tests of longitudinal data in the presence of a terminal event Jinheum Kim 1, Yang-Jin Kim, 2 & Chung Mo Nam 3 1 Department of Applied Statistics, University of Suwon, 2 Department

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

Analysis of competing risks data and simulation of data following predened subdistribution hazards

Analysis of competing risks data and simulation of data following predened subdistribution hazards Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013

More information

Multistate models and recurrent event models

Multistate models and recurrent event models Multistate models Multistate models and recurrent event models Patrick Breheny December 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/22 Introduction Multistate models In this final lecture,

More information

Survival analysis in R

Survival analysis in R Survival analysis in R Niels Richard Hansen This note describes a few elementary aspects of practical analysis of survival data in R. For further information we refer to the book Introductory Statistics

More information

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements [Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements Aasthaa Bansal PhD Pharmaceutical Outcomes Research & Policy Program University of Washington 69 Biomarkers

More information

Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½

Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½ University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 1998 Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½ Lawrence D. Brown University

More information