Analysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time

Size: px
Start display at page:

Download "Analysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time"

Transcription

1 Analysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/36

2 The case of complete observations Suppose that we have a single sample of survival times, where none of the observations are censored. Then, the survivor function S(t) can be estimated by the empirical survivor function, given by Ŝ(t) = Number of individuals with survival times > t Number of individuals in the data set. Let t 1, t 2,..., t n be the exact survival times of the n individuals under study. We relabel the n survival times t 1, t 2,..., t n in ascending order such that t (1) t (2)... t (n). Winter term 2018/19 2/36

3 The case of complete observations The survivorship function at t (i) can be estimated as Ŝ(t (i) ) = n i n = 1 i n, where n i is the number of individuals surviving longer than t (i). If two or more t (i) are equal (tied observations), the largest i value is used. This gives a conservative estimate for the tied observations. Since every individual is alive at the beginning of the study and no one survives longer than t (n), Ŝ(t (0) ) = 1 and Ŝ(t (n) ) = 0. Winter term 2018/19 3/36

4 Example: Computation of Ŝ(t) for 10 lung cancer patients Winter term 2018/19 4/36

5 Plot of the empirical survivor function Months Figure: Step function Ŝ(t) of lung cancer patients. Winter term 2018/19 5/36

6 Estimating S(t) in case of censored observations When censored observations are present, a different method of estimating S(t) is required. Before attempting to fit a theoretical distribution to a set of survival data, we will discuss nonparametric methods for estimating S(t). They are also said to be distribution-free, since they do not require specific assumptions to be made about the underlying distribution of the survival times. If the main objective is to find a model for the data, estimates obtained by nonparametric methods and graphical methods can be helpful in choosing a distribution. Winter term 2018/19 6/36

7 Kaplan-Meier estimator Let n be the total number of individuals whose survival times, censored or not, are available. Relabel the n survival times in order of increasing magnitude such that t (1) t (2)... t (n). The Kaplan-Meier (product-limit) estimate of the survivor function is Ŝ(t) = n r n r + 1, t (r) t where r runs through those positive integers for which t (r) t and t (r) is uncensored. Winter term 2018/19 7/36

8 Example Table: Kaplan-Meier estimates Ŝ(t) of remission durations of 10 patients with solid tumors. Remission time Rank r (n r) (n r+1) Ŝ(t) / /7 9/10 6/7 = /6 9/10 6/7 5/6 = /4 9/10 6/7 5/6 3/4 = /2 9/10 6/7 5/6 3/4 1/2 = Winter term 2018/19 8/36

9 Plot of the Kaplan-Meier survival curve s(t) 0> 1.0 c s: "2: 0.8 :::l Cf) c :e g_ 0.4 e Months Figure: Function Ŝ(t) for remission data. Winter term 2018/19 9/36

10 2.2 Nonparametrische Pointwise confidence S(t) intervals und Λ(t) for the Schätzung, survival function Kaplan-Meier-Schätzer Alternative Kaplan-Meier-Schätzer formulation of the (aka. Kaplan-Meier Produkt-Limit-Schätzer estimate Consider Betrachtet m distinct werden failure dietimes, geordneten t (1) < t (2) Ereigniszeitpunkte <... < t (m) [* denotes t (k), k = event]. 1, 2,, m, * * * *..., m n 0 t (1) < t (2) < < t (m 1) < t (m) * Ereignis TheDiskrete Kaplan-Meier Hazardrate estimate für ofis(t) k = [t can (k 1) be, written t (k) ) istas wiederum { λ d k = 1 P (T [t (k 1), for t (k) t ) < t(1) T t (k 1) ) Ŝ(t) = ( ) t Die Wahrscheinlichkeit, (k) t 1 d, k n das k for t t (1) k-te Intervall zu überleben, where gegeben n k the es wurde numbererreicht: of individuals at risk just prior to t (k) and d k the number of failures at t p k = 1 λ d (k) (k = 1..., m). k = P (T t (k) T t (k 1) ) Winter term 2018/19 10/36

11 Greenwood s formula The variance of the Kaplan-Meier estimator is estimated by Greenwood s formula: Var[Ŝ(t)] = [Ŝ(t)] 2 t (k) t d k n k (n k d k ). The standard error of the Kaplan-Meier estimator is given by se{ŝ(t)} = { Var[Ŝ(t)]} 1/2. Winter term 2018/19 11/36

12 Nelson-Aalen estimator The Nelson-Aalen estimator of the cumulative hazard function H(t) is defined as { 0 for t < t(1) H(t) = t (k) t d k. n k for t t (1) The estimated variance of the Nelson-Aalen estimator is given by Var[ H(t)] = d k. n 2 t (k) t k Winter term 2018/19 12/36

13 Breslow estimator Based on the Nelson-Aalen estimator of the cumulative hazard rate, the Breslow estimator of the survival function is given by S(t) = exp[ H(t)]. The estimated variance is obtained as Var[ S(t)] = [ S(t)] 2 d k n 2 t (k) t k. Winter term 2018/19 13/36

14 Example Table: Nelson-Aalen estimates H(t) and S(t) and Kaplan-Meier estimates Ŝ(t) of remission time of 10 patients. t (k) d k n k d k /n k H(t) S(t) Ŝ(t) Winter term 2018/19 14/36

15 Life-table estimate Sterbetafel-Methode (1) Eine der traditionellsten Methoden zur Analyse von Verweildauern und Lebenszeiten. Wird The life-table vorwiegend (or actuarial) in Demographie estimate und of the Lebensversicherungen survivor function angewandt allows to group event times into intervals. Eignet sich auch für Daten in gruppierter Form Diskretisierung Consider a discretization der Zeitachse of the in time q + axis 1 Intervalle in q + 1 adjacent, [a non-overlapping k 1, a k ), k = intervals, 1,..., q + [a k 1 1, wobei, a k ), k a= 0 = 1,. 0.. und, q + a1 q+1 where =. a 0 = 0 and a q+1 =. a 0 = 0 a 1 a 2 a 3... a q Winter term 2018/19 15/36

16 Life-table estimate (2) Notations n: number of individuals at the start of the study. d k : number of deaths in the kth interval. c k : number of censored survival times in the kth interval. n k : number of individuals who are alive at the start of the kth interval. It holds that n k = n k 1 d k 1 c k 1 for k = 2,..., q + 1. n k : Number of individuals at risk of experiencing the event in the kth interval, assuming that censored survival times occur uniformly throughout the kth interval, n k = n k c k /2. Winter term 2018/19 16/36

17 Life-table estimate (3) The conditional proportion of death in the kth interval given exposure to the risk of death in the kth interval is d k /n k. The life-table estimate of S(a k ) is given by ( S (a k ) = S (a k 1 ) 1 d ) ( ) k k n k = 1 d j n j for k = 1,..., q with S (a 0 ) = 1. The estimated variance of the life-table estimate is given by k Var[S (a k )] = [S (a k )] 2 d j n j (n j d, j) for k = 1,..., q with the standard error of S (a 0 ) = 1 being zero. Winter term 2018/19 17/36 j=1 j=1,

18 Example Table: Life-table estimate of the survivor function for some data. k a k 1 n k d k c k n k 1 d k S n (a k k ) Winter term 2018/19 18/36

19 Sterbetafel-Methode (6) Beispiel Plot of the life-table estimate Plot von ˆP k für jedes Intervall. S(t) t Winter term 2018/19 19/36

20 Cohort life tables A cohort is a group of individuals who have some common origin from which the event time will be calculated. They are followed over time and their event time or censoring time is recorded to fall in one of q + 1 adjacent, non-overlapping intervals, I k = [a k 1, a k ), k = 1,..., q + 1 with a 0 = 0 and a q+1 =. A traditional cohort life table presents the actual mortality experience of the cohort from the birth of each individual to the death of the last surviving member of the cohort. Censoring may occur because some individuals may migrate out of the study area or drop out of observation. Winter term 2018/19 20/36

21 Basic construction of a cohort life table 1 The 1st column gives the intervals I k, k = 1,..., q The 2nd column gives the number of subjects n k, entering the kth interval who have not experienced the event. 3 The 3rd column gives the number of censored survival times in the kth interval, c k. 4 The 4th column gives the number of individuals at risk of experiencing the event in the kth interval, n k. 5 The 5th column reports the number of individuals d k who experience the event in the kth interval. 6 The 6th column gives the estimated survival function at the start of the kth interval, S (a k 1 ) with S (a 0 ) = 1. Winter term 2018/19 21/36

22 Basic construction of a cohort life table (2) 7 The 7th column gives the estimated pdf ˆf (a mk ) at the midpoint of the kth interval, a mk = (a k + a k 1 )/2: ˆf (a mk ) = [S (a k 1 ) S (a k )]/(a k a k 1 ). 8 The 8th column gives the estimated hazard rate ĥ(a mk ) at the midpoint of the kth interval, a mk : ĥ(a mk ) = ˆf (a mk )/S (a mk ) = ˆf (a mk )/{S (a k ) + [S (a k 1 ) S (a k )]/2} = 2ˆf (a mk ) [S (a k ) + S (a k 1 )] Note that S (a mk ) is based on a linear approximation between the estimate of the survivor function at the endpoints of the interval. Winter term 2018/19 22/36.

23 Basic construction of a cohort life table (3) 9 The 9th column gives the standard error of survival at the beginning of the kth interval, se{s (a k 1 )} = { Var[S (a k 1 )]} 1/2 for k = 2,..., q + 1 with se{s (a 0 )} = The 10th column shows the standard error of the pdf at the midpoint of the kth interval. 11 The 11th column shows the standard error of the hazard function at the midpoint of the kth interval. Winter term 2018/19 23/36

24 Example of a cohort life table Figure: Life table for weaning example (Klein and Moeschberger 2003, p. 156). Winter term 2018/19 24/36

25 Interval estimates An estimate of the survivor function provides a summary estimate of the mortality experience of a given population. The standard error of an estimate of the survivor function provides some information about the precision of the estimate. We can use these estimators to construct a pointwise confidence interval (CI) for the corresponding value of the survivor function at a fixed time t. The intervals are constructed to assure, with prescribed probability, that the true value of the survival function, at a predetermined time t, falls in the interval we construct. Winter term 2018/19 25/36

26 Standard normal distribution cp(x) 1 - p 1-p 0 Figure: Percentiles of the standard normal distribution. Winter term 2018/19 26/36

27 Confidence interval A pointwise 100(1 α)% confidence interval for S(t), for a given value of t, is given by Ŝ(t) ± z 1 α 2 se{ŝ(t)}, where se{ŝ(t)} = { Var[Ŝ(t)]} 1/2 and z 1 α is the 1 α/2 2 quantile of the standard normal distribution. For all t it holds that ) a P (Ŝ(t) z1 α se{ŝ(t)} S(t) Ŝ(t) + z 2 1 α se{ŝ(t)} 2 1 α for a given confidence level 1 α. Winter term 2018/19 27/36

28 Confidence intervals based on transformations of Ŝ(t) Alternative CIs can be constructed by first transforming Ŝ(t) and obtaining a CI for the transformed value. The resulting confidence limits are then back-transformed to give a confidence interval for S(t) itself. Possible transformations include the log transformation, ln[ŝ(t)], the logistic transformation, ln[ŝ(t)/{1 Ŝ(t)}], and the complementary log-log transformation, ln[ ln{ŝ(t)}]. Such intervals can lead to better coverage probabilities. Winter term 2018/19 28/36

29 Simultaneous confidence bands Pointwise confidence intervals are valid for a single fixed time at which the inference is to be made. For simultaneous confidence bands of the survival function it should hold that P (Ŝ(t) ± cα (t L, t U ) S(t), t [t L, t U ]) a 1 α for a given confidence level 1 α. We present two approaches for constructing confidence bands for S(t). Winter term 2018/19 29/36

30 Equal precision (EP) bands Equal precision (EP) bands are obtained as Ŝ(t) ± c α (a L, a U ) se{ŝ(t)}, with c α (a L, a U ) chosen such that 0 < a L < a U < 1, where and a L = nˆσ2 S (t L) 1 + nˆσ 2 S (t L) ˆσ 2 S(t) = Var(Ŝ(t)) Ŝ(t) 2, a U = nˆσ2 S (t U) 1 + nˆσ 2 S (t U) = t (k) t d k n k (n k d k ). Winter term 2018/19 30/36

31 Equal precision (EP) bands (2) To construct 100(1 α)% confidence bands for S(t) over the range [t L, t U ], we find a confidence coefficient, c α (a L, a U ). We pick t L < t U so that t L is greater than or equal to the smallest observed event time and t U is less than or equal to the largest observed event time. Values of c α (a L, a U ) can be obtained from Table C.3 in Appendix C in Klein and Moeschberger (2003). EP bands are proportional to pointwise confidence intervals. Winter term 2018/19 31/36

32 Hall-Wellner bands An alternate set of confidence bands are Hall-Wellner bands, which are obtained as Ŝ(t) ± k α(a L, a U ) n [1 + nˆσ 2 S(t)]Ŝ(t). To construct a 100(1 α)% confidence band for S(t) over the region [t L, t U ], we find a confidence coefficient, k α (a L, a U ). For these bands, a lower limit, t L, of zero is allowed. Winter term 2018/19 32/36

33 Hall-Wellner bands (2) Values of k α (a L, a U ) can be obtained from Table C.4 in Appendix C in Klein and Moeschberger (2003). Hall-Wellner bands are not proportional to the pointwise confidence intervals. As in the case of pointwise confidence intervals, other forms for the confidence bands based on transformations are available. Winter term 2018/19 33/36

34 Point estimates of quantiles of survival times Recall that the pth quantile (0 p 1) of a random variable T with survival function S(t) is defined as t p = inf{t : S(t) 1 p}. To estimate t p, we find the smallest time ˆt p for which the value of the estimated survivor function is less than or equal to 1 p, that is, ˆt p = inf{t : Ŝ(t) 1 p}. Some software packages use a different estimator. Winter term 2018/19 34/36

35 Interval estimates of quantiles of survival times An estimator of the variance of ˆt p may be obtained from an application of the delta method. The suggested estimator for the variance of the estimator of the pth quantile is Var(ˆt p ) = Var(Ŝ(ˆt p )) [ˆf (ˆt p )] 2. The estimator of the pdf often used is ˆf (ˆt p ) = Ŝ(û p) Ŝ(ˆl p ) ˆl p û p. Winter term 2018/19 35/36

36 Interval estimates of quantiles of survival times (2) The values û p and ˆl p are chosen such that û p < ˆt p < ˆl p and are obtained as û p = max{t : Ŝ(t) 1 p+ɛ} and ˆl p = min{t : Ŝ(t) 1 p ɛ} for small values of ɛ. The endpoints of a 100 (1 α)% confidence interval are where s.e.(ˆt p ) = ˆt p ± z 1 α/2 s.e.(ˆt p ), Var(ˆt p ). Winter term 2018/19 36/36

Chapter 4 Fall Notations: t 1 < t 2 < < t D, D unique death times. d j = # deaths at t j = n. Y j = # at risk /alive at t j = n

Chapter 4 Fall Notations: t 1 < t 2 < < t D, D unique death times. d j = # deaths at t j = n. Y j = # at risk /alive at t j = n Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 4 Fall 2012 4.2 Estimators of the survival and cumulative hazard functions for RC data Suppose X is a continuous random failure time with

More information

Estimation for Modified Data

Estimation for Modified Data Definition. Estimation for Modified Data 1. Empirical distribution for complete individual data (section 11.) An observation X is truncated from below ( left truncated) at d if when it is at or below d

More information

ST745: Survival Analysis: Nonparametric methods

ST745: Survival Analysis: Nonparametric methods ST745: Survival Analysis: Nonparametric methods Eric B. Laber Department of Statistics, North Carolina State University February 5, 2015 The KM estimator is used ubiquitously in medical studies to estimate

More information

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University Survival Analysis: Weeks 2-3 Lu Tian and Richard Olshen Stanford University 2 Kaplan-Meier(KM) Estimator Nonparametric estimation of the survival function S(t) = pr(t > t) The nonparametric estimation

More information

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the

More information

Exercises. (a) Prove that m(t) =

Exercises. (a) Prove that m(t) = Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for

More information

Nonparametric Model Construction

Nonparametric Model Construction Nonparametric Model Construction Chapters 4 and 12 Stat 477 - Loss Models Chapters 4 and 12 (Stat 477) Nonparametric Model Construction Brian Hartman - BYU 1 / 28 Types of data Types of data For non-life

More information

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/25 Right censored

More information

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL The Cox PH model: λ(t Z) = λ 0 (t) exp(β Z). How do we estimate the survival probability, S z (t) = S(t Z) = P (T > t Z), for an individual with covariates

More information

18.465, further revised November 27, 2012 Survival analysis and the Kaplan Meier estimator

18.465, further revised November 27, 2012 Survival analysis and the Kaplan Meier estimator 18.465, further revised November 27, 2012 Survival analysis and the Kaplan Meier estimator 1. Definitions Ordinarily, an unknown distribution function F is estimated by an empirical distribution function

More information

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics

Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/25 Residuals for the

More information

Multistate Modeling and Applications

Multistate Modeling and Applications Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)

More information

Right-truncated data. STAT474/STAT574 February 7, / 44

Right-truncated data. STAT474/STAT574 February 7, / 44 Right-truncated data For this data, only individuals for whom the event has occurred by a given date are included in the study. Right truncation can occur in infectious disease studies. Let T i denote

More information

STAT 6385 Survey of Nonparametric Statistics. Order Statistics, EDF and Censoring

STAT 6385 Survey of Nonparametric Statistics. Order Statistics, EDF and Censoring STAT 6385 Survey of Nonparametric Statistics Order Statistics, EDF and Censoring Quantile Function A quantile (or a percentile) of a distribution is that value of X such that a specific percentage of the

More information

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Mai Zhou 1 University of Kentucky, Lexington, KY 40506 USA Summary. Empirical likelihood ratio method (Thomas and Grunkmier

More information

Multi-state models: prediction

Multi-state models: prediction Department of Medical Statistics and Bioinformatics Leiden University Medical Center Course on advanced survival analysis, Copenhagen Outline Prediction Theory Aalen-Johansen Computational aspects Applications

More information

Lecture 3. Truncation, length-bias and prevalence sampling

Lecture 3. Truncation, length-bias and prevalence sampling Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in

More information

Survival Distributions, Hazard Functions, Cumulative Hazards

Survival Distributions, Hazard Functions, Cumulative Hazards BIO 244: Unit 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction Outline CHL 5225H Advanced Statistical Methods for Clinical Trials: Survival Analysis Prof. Kevin E. Thorpe Defining Survival Data Mathematical Definitions Non-parametric Estimates of Survival Comparing

More information

Quantile Regression for Residual Life and Empirical Likelihood

Quantile Regression for Residual Life and Empirical Likelihood Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu

More information

Lecture 7. Proportional Hazards Model - Handling Ties and Survival Estimation Statistics Survival Analysis. Presented February 4, 2016

Lecture 7. Proportional Hazards Model - Handling Ties and Survival Estimation Statistics Survival Analysis. Presented February 4, 2016 Proportional Hazards Model - Handling Ties and Survival Estimation Statistics 255 - Survival Analysis Presented February 4, 2016 likelihood - Discrete Dan Gillen Department of Statistics University of

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

Survival Regression Models

Survival Regression Models Survival Regression Models David M. Rocke May 18, 2017 David M. Rocke Survival Regression Models May 18, 2017 1 / 32 Background on the Proportional Hazards Model The exponential distribution has constant

More information

Survival Analysis. Stat 526. April 13, 2018

Survival Analysis. Stat 526. April 13, 2018 Survival Analysis Stat 526 April 13, 2018 1 Functions of Survival Time Let T be the survival time for a subject Then P [T < 0] = 0 and T is a continuous random variable The Survival function is defined

More information

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data 1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions

More information

Empirical Processes & Survival Analysis. The Functional Delta Method

Empirical Processes & Survival Analysis. The Functional Delta Method STAT/BMI 741 University of Wisconsin-Madison Empirical Processes & Survival Analysis Lecture 3 The Functional Delta Method Lu Mao lmao@biostat.wisc.edu 3-1 Objectives By the end of this lecture, you will

More information

Survival Analysis. STAT 526 Professor Olga Vitek

Survival Analysis. STAT 526 Professor Olga Vitek Survival Analysis STAT 526 Professor Olga Vitek May 4, 2011 9 Survival Data and Survival Functions Statistical analysis of time-to-event data Lifetime of machines and/or parts (called failure time analysis

More information

Censoring and Truncation - Highlighting the Differences

Censoring and Truncation - Highlighting the Differences Censoring and Truncation - Highlighting the Differences Micha Mandel The Hebrew University of Jerusalem, Jerusalem, Israel, 91905 July 9, 2007 Micha Mandel is a Lecturer, Department of Statistics, The

More information

Linear models and their mathematical foundations: Simple linear regression

Linear models and their mathematical foundations: Simple linear regression Linear models and their mathematical foundations: Simple linear regression Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/21 Introduction

More information

Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics.

Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics. Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics. Dragi Anevski Mathematical Sciences und University November 25, 21 1 Asymptotic distributions for statistical

More information

Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes.

Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes. Unit 2: Models, Censoring, and Likelihood for Failure-Time Data Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes. Ramón

More information

ST495: Survival Analysis: Maximum likelihood

ST495: Survival Analysis: Maximum likelihood ST495: Survival Analysis: Maximum likelihood Eric B. Laber Department of Statistics, North Carolina State University February 11, 2014 Everything is deception: seeking the minimum of illusion, keeping

More information

Credit risk and survival analysis: Estimation of Conditional Cure Rate

Credit risk and survival analysis: Estimation of Conditional Cure Rate Stochastics and Financial Mathematics Master Thesis Credit risk and survival analysis: Estimation of Conditional Cure Rate Author: Just Bajželj Examination date: August 30, 2018 Supervisor: dr. A.J. van

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and

More information

β j = coefficient of x j in the model; β = ( β1, β2,

β j = coefficient of x j in the model; β = ( β1, β2, Regression Modeling of Survival Time Data Why regression models? Groups similar except for the treatment under study use the nonparametric methods discussed earlier. Groups differ in variables (covariates)

More information

arxiv: v3 [stat.me] 24 Mar 2018

arxiv: v3 [stat.me] 24 Mar 2018 Nonparametric generalized fiducial inference for survival functions under censoring Yifan Cui 1 and Jan Hannig 1 1 Department of Statistics and Operations Research, University of North Carolina, Chapel

More information

Lecture 4 - Survival Models

Lecture 4 - Survival Models Lecture 4 - Survival Models Survival Models Definition and Hazards Kaplan Meier Proportional Hazards Model Estimation of Survival in R GLM Extensions: Survival Models Survival Models are a common and incredibly

More information

Constrained estimation for binary and survival data

Constrained estimation for binary and survival data Constrained estimation for binary and survival data Jeremy M. G. Taylor Yong Seok Park John D. Kalbfleisch Biostatistics, University of Michigan May, 2010 () Constrained estimation May, 2010 1 / 43 Outline

More information

Survival Analysis I (CHL5209H)

Survival Analysis I (CHL5209H) Survival Analysis Dalla Lana School of Public Health University of Toronto olli.saarela@utoronto.ca January 7, 2015 31-1 Literature Clayton D & Hills M (1993): Statistical Models in Epidemiology. Not really

More information

Continuous Time Survival in Latent Variable Models

Continuous Time Survival in Latent Variable Models Continuous Time Survival in Latent Variable Models Tihomir Asparouhov 1, Katherine Masyn 2, Bengt Muthen 3 Muthen & Muthen 1 University of California, Davis 2 University of California, Los Angeles 3 Abstract

More information

9 Estimating the Underlying Survival Distribution for a

9 Estimating the Underlying Survival Distribution for a 9 Estimating the Underlying Survival Distribution for a Proportional Hazards Model So far the focus has been on the regression parameters in the proportional hazards model. These parameters describe the

More information

Step-Stress Models and Associated Inference

Step-Stress Models and Associated Inference Department of Mathematics & Statistics Indian Institute of Technology Kanpur August 19, 2014 Outline Accelerated Life Test 1 Accelerated Life Test 2 3 4 5 6 7 Outline Accelerated Life Test 1 Accelerated

More information

Part III Measures of Classification Accuracy for the Prediction of Survival Times

Part III Measures of Classification Accuracy for the Prediction of Survival Times Part III Measures of Classification Accuracy for the Prediction of Survival Times Patrick J Heagerty PhD Department of Biostatistics University of Washington 102 ISCB 2010 Session Three Outline Examples

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS330 / MAS83 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-0 8 Parametric models 8. Introduction In the last few sections (the KM

More information

3003 Cure. F. P. Treasure

3003 Cure. F. P. Treasure 3003 Cure F. P. reasure November 8, 2000 Peter reasure / November 8, 2000/ Cure / 3003 1 Cure A Simple Cure Model he Concept of Cure A cure model is a survival model where a fraction of the population

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Log-linearity for Cox s regression model. Thesis for the Degree Master of Science

Log-linearity for Cox s regression model. Thesis for the Degree Master of Science Log-linearity for Cox s regression model Thesis for the Degree Master of Science Zaki Amini Master s Thesis, Spring 2015 i Abstract Cox s regression model is one of the most applied methods in medical

More information

11 Survival Analysis and Empirical Likelihood

11 Survival Analysis and Empirical Likelihood 11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with

More information

10 Introduction to Reliability

10 Introduction to Reliability 0 Introduction to Reliability 10 Introduction to Reliability The following notes are based on Volume 6: How to Analyze Reliability Data, by Wayne Nelson (1993), ASQC Press. When considering the reliability

More information

Estimating transition probabilities for the illness-death model The Aalen-Johansen estimator under violation of the Markov assumption Torunn Heggland

Estimating transition probabilities for the illness-death model The Aalen-Johansen estimator under violation of the Markov assumption Torunn Heggland Estimating transition probabilities for the illness-death model The Aalen-Johansen estimator under violation of the Markov assumption Torunn Heggland Master s Thesis for the degree Modelling and Data Analysis

More information

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters Communications for Statistical Applications and Methods 2017, Vol. 24, No. 5, 519 531 https://doi.org/10.5351/csam.2017.24.5.519 Print ISSN 2287-7843 / Online ISSN 2383-4757 Goodness-of-fit tests for randomly

More information

Lecture 11. Interval Censored and. Discrete-Time Data. Statistics Survival Analysis. Presented March 3, 2016

Lecture 11. Interval Censored and. Discrete-Time Data. Statistics Survival Analysis. Presented March 3, 2016 Statistics 255 - Survival Analysis Presented March 3, 2016 Motivating Dan Gillen Department of Statistics University of California, Irvine 11.1 First question: Are the data truly discrete? : Number of

More information

Cox s proportional hazards model and Cox s partial likelihood

Cox s proportional hazards model and Cox s partial likelihood Cox s proportional hazards model and Cox s partial likelihood Rasmus Waagepetersen October 12, 2018 1 / 27 Non-parametric vs. parametric Suppose we want to estimate unknown function, e.g. survival function.

More information

Survival Analysis Using S/R

Survival Analysis Using S/R Survival Analysis Using S/R Unterlagen für den Weiterbildungs Lehrgang in angewandter Statistik an der ETH Zürich Professor Mara Tableman. Fariborz Maseeh Department of Mathematics & Statistics Portland

More information

Multistate models in survival and event history analysis

Multistate models in survival and event history analysis Multistate models in survival and event history analysis Dorota M. Dabrowska UCLA November 8, 2011 Research supported by the grant R01 AI067943 from NIAID. The content is solely the responsibility of the

More information

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Guosheng Yin Department of Statistics and Actuarial Science The University of Hong Kong Joint work with J. Xu PSI and RSS Journal

More information

Lecture 22 Survival Analysis: An Introduction

Lecture 22 Survival Analysis: An Introduction University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

Chapter 4 Regression Models

Chapter 4 Regression Models 23.August 2010 Chapter 4 Regression Models The target variable T denotes failure time We let x = (x (1),..., x (m) ) represent a vector of available covariates. Also called regression variables, regressors,

More information

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen Recap of Part 1 Per Kragh Andersen Section of Biostatistics, University of Copenhagen DSBS Course Survival Analysis in Clinical Trials January 2018 1 / 65 Overview Definitions and examples Simple estimation

More information

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where STAT 331 Accelerated Failure Time Models Previously, we have focused on multiplicative intensity models, where h t z) = h 0 t) g z). These can also be expressed as H t z) = H 0 t) g z) or S t z) = e Ht

More information

Parameters Estimation for a Linear Exponential Distribution Based on Grouped Data

Parameters Estimation for a Linear Exponential Distribution Based on Grouped Data International Mathematical Forum, 3, 2008, no. 33, 1643-1654 Parameters Estimation for a Linear Exponential Distribution Based on Grouped Data A. Al-khedhairi Department of Statistics and O.R. Faculty

More information

Double Bootstrap Confidence Interval Estimates with Censored and Truncated Data

Double Bootstrap Confidence Interval Estimates with Censored and Truncated Data Journal of Modern Applied Statistical Methods Volume 13 Issue 2 Article 22 11-2014 Double Bootstrap Confidence Interval Estimates with Censored and Truncated Data Jayanthi Arasan University Putra Malaysia,

More information

Introduction to Statistical Analysis

Introduction to Statistical Analysis Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive

More information

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis CIMAT Taller de Modelos de Capture y Recaptura 2010 Known Fate urvival Analysis B D BALANCE MODEL implest population model N = λ t+ 1 N t Deeper understanding of dynamics can be gained by identifying variation

More information

Package threg. August 10, 2015

Package threg. August 10, 2015 Package threg August 10, 2015 Title Threshold Regression Version 1.0.3 Date 2015-08-10 Author Tao Xiao Maintainer Tao Xiao Depends R (>= 2.10), survival, Formula Fit a threshold regression

More information

Double-Sampling Designs for Dropouts

Double-Sampling Designs for Dropouts Double-Sampling Designs for Dropouts Dave Glidden Division of Biostatistics, UCSF 26 February 2010 http://www.epibiostat.ucsf.edu/dave/talks.html Outline Mbarra s Dropouts Future Directions Inverse Prob.

More information

Comparing Distribution Functions via Empirical Likelihood

Comparing Distribution Functions via Empirical Likelihood Georgia State University ScholarWorks @ Georgia State University Mathematics and Statistics Faculty Publications Department of Mathematics and Statistics 25 Comparing Distribution Functions via Empirical

More information

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes:

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes: Practice Exam 1 1. Losses for an insurance coverage have the following cumulative distribution function: F(0) = 0 F(1,000) = 0.2 F(5,000) = 0.4 F(10,000) = 0.9 F(100,000) = 1 with linear interpolation

More information

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Noname manuscript No. (will be inserted by the editor) A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Mai Zhou Yifan Yang Received: date / Accepted: date Abstract In this note

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

The Relationship Between Confidence Intervals for Failure Probabilities and Life Time Quantiles

The Relationship Between Confidence Intervals for Failure Probabilities and Life Time Quantiles Statistics Preprints Statistics 2008 The Relationship Between Confidence Intervals for Failure Probabilities and Life Time Quantiles Yili Hong Iowa State University, yili_hong@hotmail.com William Q. Meeker

More information

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What? You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?) I m not goin stop (What?) I m goin work harder (What?) Sir David

More information

Empirical Likelihood in Survival Analysis

Empirical Likelihood in Survival Analysis Empirical Likelihood in Survival Analysis Gang Li 1, Runze Li 2, and Mai Zhou 3 1 Department of Biostatistics, University of California, Los Angeles, CA 90095 vli@ucla.edu 2 Department of Statistics, The

More information

STAT Section 2.1: Basic Inference. Basic Definitions

STAT Section 2.1: Basic Inference. Basic Definitions STAT 518 --- Section 2.1: Basic Inference Basic Definitions Population: The collection of all the individuals of interest. This collection may be or even. Sample: A collection of elements of the population.

More information

1 The problem of survival analysis

1 The problem of survival analysis 1 The problem of survival analysis Survival analysis concerns analyzing the time to the occurrence of an event. For instance, we have a dataset in which the times are 1, 5, 9, 20, and 22. Perhaps those

More information

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require Chapter 5 modelling Semi parametric We have considered parametric and nonparametric techniques for comparing survival distributions between different treatment groups. Nonparametric techniques, such as

More information

Confidence Intervals. Contents. Technical Guide

Confidence Intervals. Contents. Technical Guide Technical Guide Confidence Intervals Contents Introduction Software options Directory of methods 3 Appendix 1 Byar s method 6 Appendix χ exact method 7 Appendix 3 Wilson Score method 8 Appendix 4 Dobson

More information

4. Comparison of Two (K) Samples

4. Comparison of Two (K) Samples 4. Comparison of Two (K) Samples K=2 Problem: compare the survival distributions between two groups. E: comparing treatments on patients with a particular disease. Z: Treatment indicator, i.e. Z = 1 for

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 7 Fall 2012 Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample H 0 : S(t) = S 0 (t), where S 0 ( ) is known survival function,

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

Logistic regression model for survival time analysis using time-varying coefficients

Logistic regression model for survival time analysis using time-varying coefficients Logistic regression model for survival time analysis using time-varying coefficients Accepted in American Journal of Mathematical and Management Sciences, 2016 Kenichi SATOH ksatoh@hiroshima-u.ac.jp Research

More information

Package Rsurrogate. October 20, 2016

Package Rsurrogate. October 20, 2016 Type Package Package Rsurrogate October 20, 2016 Title Robust Estimation of the Proportion of Treatment Effect Explained by Surrogate Marker Information Version 2.0 Date 2016-10-19 Author Layla Parast

More information

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Patrick J. Heagerty PhD Department of Biostatistics University of Washington 166 ISCB 2010 Session Four Outline Examples

More information

and Comparison with NPMLE

and Comparison with NPMLE NONPARAMETRIC BAYES ESTIMATOR OF SURVIVAL FUNCTIONS FOR DOUBLY/INTERVAL CENSORED DATA and Comparison with NPMLE Mai Zhou Department of Statistics, University of Kentucky, Lexington, KY 40506 USA http://ms.uky.edu/

More information

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals John W. Mac McDonald & Alessandro Rosina Quantitative Methods in the Social Sciences Seminar -

More information

Likelihood Construction, Inference for Parametric Survival Distributions

Likelihood Construction, Inference for Parametric Survival Distributions Week 1 Likelihood Construction, Inference for Parametric Survival Distributions In this section we obtain the likelihood function for noninformatively rightcensored survival data and indicate how to make

More information

Analysis of competing risks data and simulation of data following predened subdistribution hazards

Analysis of competing risks data and simulation of data following predened subdistribution hazards Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013

More information

Survival Analysis. 732G34 Statistisk analys av komplexa data. Krzysztof Bartoszek

Survival Analysis. 732G34 Statistisk analys av komplexa data. Krzysztof Bartoszek Survival Analysis 732G34 Statistisk analys av komplexa data Krzysztof Bartoszek (krzysztof.bartoszek@liu.se) 10, 11 I 2018 Department of Computer and Information Science Linköping University Survival analysis

More information

POWER AND SAMPLE SIZE DETERMINATIONS IN DYNAMIC RISK PREDICTION. by Zhaowen Sun M.S., University of Pittsburgh, 2012

POWER AND SAMPLE SIZE DETERMINATIONS IN DYNAMIC RISK PREDICTION. by Zhaowen Sun M.S., University of Pittsburgh, 2012 POWER AND SAMPLE SIZE DETERMINATIONS IN DYNAMIC RISK PREDICTION by Zhaowen Sun M.S., University of Pittsburgh, 2012 B.S.N., Wuhan University, China, 2010 Submitted to the Graduate Faculty of the Graduate

More information

A multi-state model for the prognosis of non-mild acute pancreatitis

A multi-state model for the prognosis of non-mild acute pancreatitis A multi-state model for the prognosis of non-mild acute pancreatitis Lore Zumeta Olaskoaga 1, Felix Zubia Olaskoaga 2, Guadalupe Gómez Melis 1 1 Universitat Politècnica de Catalunya 2 Intensive Care Unit,

More information

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t)

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t) PhD course in Advanced survival analysis. (ABGK, sect. V.1.1) One-sample tests. Counting process N(t) Non-parametric hypothesis tests. Parametric models. Intensity process λ(t) = α(t)y (t) satisfying Aalen

More information

Consistency of bootstrap procedures for the nonparametric assessment of noninferiority with random censorship

Consistency of bootstrap procedures for the nonparametric assessment of noninferiority with random censorship Consistency of bootstrap procedures for the nonparametric assessment of noninferiority with random censorship Gudrun Freitag 1 and Axel Mun Institut für Mathematische Stochasti Georg-August-Universität

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information