Nonparametric two-sample tests of longitudinal data in the presence of a terminal event

Size: px
Start display at page:

Download "Nonparametric two-sample tests of longitudinal data in the presence of a terminal event"

Transcription

1 Nonparametric two-sample tests of longitudinal data in the presence of a terminal event Jinheum Kim 1, Yang-Jin Kim, 2 & Chung Mo Nam 3 1 Department of Applied Statistics, University of Suwon, 2 Department of Statistics, Ewha Womans University, 3 Department of Preventive Medicine, Yonsei University College of Medicine August 19, 2009 J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

2 Outline Contents Analysis of longitudinal data Marked point process Proposed tests Simulations An example: CSL 1 data set Concluding remarks J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

3 Background & Review Longitudinal data In longitudinal studies, subjects are measured repeatedly over time For subject i = 1,...,n, observed data are (T ij,z ij,x ij ), j = 1,...,k i, where T ij is a measurement time (observation time), and Z ij and x ij are response and a vector of covariates measured at T ij, respectively In ordinary longitudinal data, measurement times are fixed k i are same for all subjects, i.e., k i = k, i However, longitudinal studies related with human being are not easy to keep this scheme J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

4 Background & Review Statistical modeling of longitudinal data Two objectives for statistical models of longitudinal data To adopt the conventional regression tools, which relate the response variables to the explanatory variables To account for the within-subject correlation In most longitudinal studies, the regression objective is of primary interest the within-subject correlation is essential, but is often of secondary interest Three modeling approaches with longitudinal data (Diggle, Heagerty, Liang & Zegger, 2002) J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

5 Three approaches Background & Review Marginal model approach E(Z ij ) = µ ij is related to x ij by g(µ ij ) = x ij β, where g is a known link function Var(Z ij ) = ν(µ ij )φ, where ν is a known function and φ is the over-dispersion parameter Cov(Z ij, Z ik ) = c(µ ij, µ ik ; α), where c is a known function and α is the additional parameter. Random effects model approach g(e(z ij b i )) = x ij β + w ij b i, where w ij is a vector of covariates whose coefficients vary across subjects; b i F(Θ) with mean zero and covariance matrix Θ, where F is a known distribution function Assume the conditional independence of Z i1,...,z iki given b i J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

6 Three approaches Background & Review Transition model approach E(Z ij Z i,j 1,..., Z i1 ) = µ c ij depends on x ij and past responses, g(µ c ij) = x ijβ + k α k f k (Z i,j 1,..., Z i1 ), where f k, k = 1,... are known functions Var(Z ij Z i,j 1,..., Z i1 ) = ν(µ c ij )φ, where ν is a known function and φ is the over-dispersion parameter J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

7 Background & Review Goal Dynamic model approach proposed by Scheike & Zhang (1998), Martinussen & Scheike (1999, 2000) among others Describe longitudinal data consisting of the triplet (responses, observation times, covariates) using a marked point process Model the conditional mean of the current response given past outcomes, which amount to previously obtained measurements and the times for these measurements ( transition model approach) Propose nonparametric tests whether two groups of longitudinal response data have identical conditional mean functions in the presence of group-specific observation and/or termination times J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

8 Background & Review Marked point process Denote (E, E) by a measurable mark space Let (Z k,k 1) be a sequence of rvs in E and the sequence (T k,k 1) constitutes a counting process N(t) = k I(T k t) The double sequence (T k,z k ) is called a marked point process (MPP) with an associated counting process N(A,t) = k I(Z k A)I(T k t),a E Specially, N(t) = N(E,t) The MPP can be identified with counting measure p(ds dz) defined by p((0,t] A) = N(A,t),A E. Let F Tk = σ(t j,z j,1 j k 1;T k ) J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

9 Marked point process Background & Review The marked point counting process N(A,t) has the intensity function λ t (dz) = λ(t)φ t (dz), where λ(t) is a nonnegative F t predictable process and Φ t is defined as Φ t (A) = Pr(Z(t) A F t ) For a F t predictable process H, the MPP integral H(s,z)p(ds dz) may be decomposed as t 0 t 0 E t λ(s){ H(s,z)Φ s (dz)}ds + H(s, z)q(ds dz) (1) E 0 E where q(dt dz) = p(dt dz) λ(t)φ t (dz)dt is a marked point martingale. J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

10 Proposed tests Data structure & notations Let (R, B) denote a mark space, where B is the Borel σ-field on R = (, ) Let D k,i denote the death time and C k,i the censoring time, where k = 1,2;i = 1,...,n k Assume that C k,i is independent of both D k,i and N k,i (, ) Due to censoring, D k,i and N k,i (, ) may not be fully observed For A B, we observe {(N k,i (A, C k,i ),Z k,i ( C k,i ),X k,i,δ k,i ) k = 1,2;i = 1,...,n k }, where X k,i = D k,i C k,i, and δ k,i = I(D k,i C k,i ). J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

11 Proposed tests Data structure & notations Let Y k,i (t) = I(X k,i t) and Y k (t) = n k i=1 Y k,i(t) For the longitudinal responses, i.e. marks, it is modeled as Z k,i (t) = m k (t) + ǫ k,i, (2) where m k (t) is a smooth mean function and ǫ k,i has mean zero and variance σ 2 k For the observation times, the intensity process λ k,i (t) can be written as λ k,i (t) = α k (t)y k,i (t), (3) where α k (t) is a deterministic function given the accrued information up to time t J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

12 Proposed tests Cumulative mean function Let µ k (t) = t 0 m k(s)ds be the cumulative mean function for group k Under the assumed models (2) and (3), for any fixed t, using the decomposition (1), we have the decomposition zp k,i (dt dz) = α k (t)y k,i (t)dµ k (t) + zq k,i (dt dz) R Estimate µ k (t) by n k ˆµ k (t) = i=1 t 0 R z J k (s)y k,i (s) ˆα k (s)y p k,i(ds dz), k (s) where J k (t) = I {Y k (t) > 0, ˆα k (t) > 0} and ˆα k (t) is the kernel-smoothed estimate of the Nelson-Aalen estimator of A k (t) = t 0 α k(s)ds R J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

13 Proposed tests Weighted cumulative mean function Define a weighted cumulative mean function as ψ k (t) = where S k (t) = Pr(D k,i t) Estimate ψ k (t) by ˆψ k (t) = t 0 t 0 S k (s)dµ k (s), Ŝ k (s)d ˆµ k (s), where Ŝk(t) is the Kaplan-Meier estimator of S k (t) based on {(X k,i,δ k,i ) i = 1,...,n k } J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

14 Proposed tests Asymptotic distribution of ˆψ k (t) Under the regularity conditions, n 1/2 k { ˆψ k (t) ψ k (t)} d U k (t), t [0,τ k ], where U k (t) is a mean zero Gaussian process whose covariance function at (s,t) consistently estimated by ˆξ k (s,t) = n 1 nk ˆΨ k i=1 k,i (s)ˆψ k,i (t), where ˆΨ k,i (t) = t 0 R J Ŝ k (s) k(s) ˆα k (s)y k (s)/n k {Y k,i (s)z ˆm k (s)}p k,i (ds dz) ˆµ k (t) t 0 d ˆM D k,i (s) Y k (s)/n k + t 0 ˆµ k(s) d ˆM D k,i (s) Y k (s)/n k J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

15 Proposed tests Nonparametric two-sample tests To compare mean functions of two groups when two groups have both different distributions for the observation times and/or different intensities for the termination time Hypothesis of interest: H 0 : ψ 1 (t) = ψ 2 (t), t (0,τ], where τ = τ 1 τ 2 Two test statistics Q C = ˆψ 1 (τ) ˆψ 2 (τ) Q LR = τ 0 ˆK LR (s)d{ψ 1 (s) ψ 2 (s)}, where ˆK LR (t) = Y1 (t)y2 (t) Y 1 (t)+y 2 (t) n n 1n 2 J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

16 Proposed tests Asymptotic null distribution (n 1 n 2 /n) 1/2 Q C and (n 1 n 2 /n) 1/2 Q LR converge in distribution to mean zero normal random variables with variances consistently estimated by ˆΣ C = n { 2 n1 τ nn 1 i=1 0 d ˆΨ } 2 1,i (s) + n 1 { n2 τ nn 2 i=1 0 d ˆΨ } 2 2,i (s) and n 2 { n1 τ ˆK nn 1 i=1 0 LR (s)d ˆΨ 1,i (s) respectively } ˆΣ LR = 2+ n 1 nn 2 n2 i=1 { τ 0 ˆK LR (s)d ˆΨ 2,i (s)} 2, J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

17 Design parameters Simulations Gap times(s k,i,j ): Poisson(λ s k = ρλ k) Observation times: t k,i,j = t k,i,j 1 + s k,i,j with t k,i,0 0 Two types of dependency ρ = 1 ρ Gamma(0.2, 5) Two types of mean function v(t) = t w(t) = t + 0.3t 0.5 Error term(ǫ k,i ): N(0,0.7 2 ) Survival times(d k,i,j ): Exp(λ D k ) on (2, ), i.e., Generated from a two-parameter exponential distribution Censoring times(c k,i ): Fixed censoring at 6.7 J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

18 Simulations Setup for parameters For Group 1, λ 1 = 1.0 and λ D 1 = 0.4 For size, m 1 (t) = v(t) = m 2 (t), λ 1 = λ 2, and λ D 1 = λd 2 For power, m 1 (t) = v(t) and m 2 (t) = w(t), i.e., m 1 (t) m 2 (t), λ 1 λ 2, or λ D 1 λd 2 Sample sizes: n 1 = n 2 = 50 or 100 Based on 1,000 samples J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

19 Simulations Size and powers when ρ = 1 m 1 (t) = m 2 (t) m 1 (t) m 2 (t) λ 2 λ D 2 n = 50 n = 100 n = 50 n = J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

20 Simulations Size and powers when ρ Gamma(0, 2, 5) m 1 = m 2 m 1 m 2 λ 2 λ D 2 n = 50 n = 100 n = 50 n = J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

21 Real example CSL 1 data set Data set from 446 patients with liver cirrhosis conducted by the Copenhagen Study Group for Liver Diseases (Schlichting et al., Hepatology, 1983) Outcome: Prothrombin index, a measurement based on a blood test of coagulation factors II, VII, and X produced by the liver Placebo: 257 (165: died, 92: censored), Prednisone-treated: 189 (105: died, 84: censored) Scheduled visiting times: At the entry, three, six and twelve months after treatment and thereafter once a year, BUT the actual visiting times varied considerably around the scheduled times # of visits ranged from 1 to 16 Average visit numbers: 5.45 (Placebo) and 5.72 (Prednisone-treated) Q LR = n 1 n 2 /nq LR = (p value=0.0005) J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

22 Real example Plots of survival curves & Nelson-Aalen estimators of visiting times Survival function of death Nelson Aalen estimator of observation tim Survival function Predniso Placebo Nelson Aalen estimator Predni Placeb Time J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25 Time

23 Real example Plots of cumulative mean functions & weighted cumulative mean functions Cumulative mean function Cumulative weighted mean function Cumulative mean function Prednison Placebo Cumulative weighted mean function Prednison Placebo Time J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25 Time

24 Concluding remarks Conclusion & extension Propose test statistics to compare the mean functions of two groups for longitudinal data with group-specific observation and termination times According to simulations, it controls well a significance level and also its power seems to be reasonable for several combinations of the distribution of the observation times and the intensity of termination time Extend to the longitudinal regression data with covariate-specific observation and termination times J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

25 Thank You! J. Kim (Univ Suwon) Nonparametric two-sample tests ISI / 25

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data 1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions

More information

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen Outline Cox s proportional hazards model. Goodness-of-fit tools More flexible models R-package timereg Forthcoming book, Martinussen and Scheike. 2/38 University of Copenhagen http://www.biostat.ku.dk

More information

Robust estimates of state occupancy and transition probabilities for Non-Markov multi-state models

Robust estimates of state occupancy and transition probabilities for Non-Markov multi-state models Robust estimates of state occupancy and transition probabilities for Non-Markov multi-state models 26 March 2014 Overview Continuously observed data Three-state illness-death General robust estimator Interval

More information

Exercises. (a) Prove that m(t) =

Exercises. (a) Prove that m(t) = Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for

More information

Lecture 3. Truncation, length-bias and prevalence sampling

Lecture 3. Truncation, length-bias and prevalence sampling Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in

More information

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t)

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t) PhD course in Advanced survival analysis. (ABGK, sect. V.1.1) One-sample tests. Counting process N(t) Non-parametric hypothesis tests. Parametric models. Intensity process λ(t) = α(t)y (t) satisfying Aalen

More information

Sample Size and Power Considerations for Longitudinal Studies

Sample Size and Power Considerations for Longitudinal Studies Sample Size and Power Considerations for Longitudinal Studies Outline Quantities required to determine the sample size in longitudinal studies Review of type I error, type II error, and power For continuous

More information

Simple techniques for comparing survival functions with interval-censored data

Simple techniques for comparing survival functions with interval-censored data Simple techniques for comparing survival functions with interval-censored data Jinheum Kim, joint with Chung Mo Nam jinhkim@suwon.ac.kr Department of Applied Statistics University of Suwon Comparing survival

More information

Nonparametric Model Construction

Nonparametric Model Construction Nonparametric Model Construction Chapters 4 and 12 Stat 477 - Loss Models Chapters 4 and 12 (Stat 477) Nonparametric Model Construction Brian Hartman - BYU 1 / 28 Types of data Types of data For non-life

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Progress, Updates, Problems William Jen Hoe Koh May 9, 2013 Overview Marginal vs Conditional What is TMLE? Key Estimation

More information

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 7 Fall 2012 Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample H 0 : S(t) = S 0 (t), where S 0 ( ) is known survival function,

More information

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

Survival models and health sequences

Survival models and health sequences Survival models and health sequences Walter Dempsey University of Michigan July 27, 2015 Survival Data Problem Description Survival data is commonplace in medical studies, consisting of failure time information

More information

Multistate Modeling and Applications

Multistate Modeling and Applications Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)

More information

Package Rsurrogate. October 20, 2016

Package Rsurrogate. October 20, 2016 Type Package Package Rsurrogate October 20, 2016 Title Robust Estimation of the Proportion of Treatment Effect Explained by Surrogate Marker Information Version 2.0 Date 2016-10-19 Author Layla Parast

More information

1 Glivenko-Cantelli type theorems

1 Glivenko-Cantelli type theorems STA79 Lecture Spring Semester Glivenko-Cantelli type theorems Given i.i.d. observations X,..., X n with unknown distribution function F (t, consider the empirical (sample CDF ˆF n (t = I [Xi t]. n Then

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information

Lecture 7 Time-dependent Covariates in Cox Regression

Lecture 7 Time-dependent Covariates in Cox Regression Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the

More information

Residuals and model diagnostics

Residuals and model diagnostics Residuals and model diagnostics Patrick Breheny November 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/42 Introduction Residuals Many assumptions go into regression models, and the Cox proportional

More information

asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data

asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data Xingqiu Zhao and Ying Zhang The Hong Kong Polytechnic University and Indiana University Abstract:

More information

CTDL-Positive Stable Frailty Model

CTDL-Positive Stable Frailty Model CTDL-Positive Stable Frailty Model M. Blagojevic 1, G. MacKenzie 2 1 Department of Mathematics, Keele University, Staffordshire ST5 5BG,UK and 2 Centre of Biostatistics, University of Limerick, Ireland

More information

Models for Multivariate Panel Count Data

Models for Multivariate Panel Count Data Semiparametric Models for Multivariate Panel Count Data KyungMann Kim University of Wisconsin-Madison kmkim@biostat.wisc.edu 2 April 2015 Outline 1 Introduction 2 3 4 Panel Count Data Motivation Previous

More information

Modelling and Analysis of Recurrent Event Data

Modelling and Analysis of Recurrent Event Data Modelling and Analysis of Recurrent Event Data Edsel A. Peña Department of Statistics University of South Carolina Research support from NIH, NSF, and USC/MUSC Collaborative Grants Joint work with Prof.

More information

4. Comparison of Two (K) Samples

4. Comparison of Two (K) Samples 4. Comparison of Two (K) Samples K=2 Problem: compare the survival distributions between two groups. E: comparing treatments on patients with a particular disease. Z: Treatment indicator, i.e. Z = 1 for

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen The Annals of Statistics 21, Vol. 29, No. 5, 1344 136 A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1 By Thomas H. Scheike University of Copenhagen We present a non-parametric survival model

More information

Error distribution function for parametrically truncated and censored data

Error distribution function for parametrically truncated and censored data Error distribution function for parametrically truncated and censored data Géraldine LAURENT Jointly with Cédric HEUCHENNE QuantOM, HEC-ULg Management School - University of Liège Friday, 14 September

More information

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate

More information

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a

More information

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology Group Sequential Tests for Delayed Responses Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj Lisa Hampson Department of Mathematics and Statistics,

More information

Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong Tseng

Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong Tseng Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong seng Reference: 1. Andersen, Borgan, Gill, and Keiding (1993). Statistical Model Based on Counting Processes, Springer-Verlag, p.229-255

More information

Statistical Analysis of Competing Risks With Missing Causes of Failure

Statistical Analysis of Competing Risks With Missing Causes of Failure Proceedings 59th ISI World Statistics Congress, 25-3 August 213, Hong Kong (Session STS9) p.1223 Statistical Analysis of Competing Risks With Missing Causes of Failure Isha Dewan 1,3 and Uttara V. Naik-Nimbalkar

More information

Empirical Processes & Survival Analysis. The Functional Delta Method

Empirical Processes & Survival Analysis. The Functional Delta Method STAT/BMI 741 University of Wisconsin-Madison Empirical Processes & Survival Analysis Lecture 3 The Functional Delta Method Lu Mao lmao@biostat.wisc.edu 3-1 Objectives By the end of this lecture, you will

More information

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL The Cox PH model: λ(t Z) = λ 0 (t) exp(β Z). How do we estimate the survival probability, S z (t) = S(t Z) = P (T > t Z), for an individual with covariates

More information

Estimation for Modified Data

Estimation for Modified Data Definition. Estimation for Modified Data 1. Empirical distribution for complete individual data (section 11.) An observation X is truncated from below ( left truncated) at d if when it is at or below d

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and

More information

Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics

Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/25 Residuals for the

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

11 Survival Analysis and Empirical Likelihood

11 Survival Analysis and Empirical Likelihood 11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with

More information

Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics.

Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics. Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics. Dragi Anevski Mathematical Sciences und University November 25, 21 1 Asymptotic distributions for statistical

More information

Goodness-of-fit test for the Cox Proportional Hazard Model

Goodness-of-fit test for the Cox Proportional Hazard Model Goodness-of-fit test for the Cox Proportional Hazard Model Rui Cui rcui@eco.uc3m.es Department of Economics, UC3M Abstract In this paper, we develop new goodness-of-fit tests for the Cox proportional hazard

More information

Double-Sampling Designs for Dropouts

Double-Sampling Designs for Dropouts Double-Sampling Designs for Dropouts Dave Glidden Division of Biostatistics, UCSF 26 February 2010 http://www.epibiostat.ucsf.edu/dave/talks.html Outline Mbarra s Dropouts Future Directions Inverse Prob.

More information

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation of Conditional

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the

More information

Part III Measures of Classification Accuracy for the Prediction of Survival Times

Part III Measures of Classification Accuracy for the Prediction of Survival Times Part III Measures of Classification Accuracy for the Prediction of Survival Times Patrick J Heagerty PhD Department of Biostatistics University of Washington 102 ISCB 2010 Session Three Outline Examples

More information

TESTINGGOODNESSOFFITINTHECOX AALEN MODEL

TESTINGGOODNESSOFFITINTHECOX AALEN MODEL ROBUST 24 c JČMF 24 TESTINGGOODNESSOFFITINTHECOX AALEN MODEL David Kraus Keywords: Counting process, Cox Aalen model, goodness-of-fit, martingale, residual, survival analysis. Abstract: The Cox Aalen regression

More information

Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D.

Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D. Web Appendix for Hierarchical Adaptive Regression Kernels for Regression with Functional Predictors by D. B. Woodard, C. Crainiceanu, and D. Ruppert A. EMPIRICAL ESTIMATE OF THE KERNEL MIXTURE Here we

More information

DAGStat Event History Analysis.

DAGStat Event History Analysis. DAGStat 2016 Event History Analysis Robin.Henderson@ncl.ac.uk 1 / 75 Schedule 9.00 Introduction 10.30 Break 11.00 Regression Models, Frailty and Multivariate Survival 12.30 Lunch 13.30 Time-Variation and

More information

Log-linearity for Cox s regression model. Thesis for the Degree Master of Science

Log-linearity for Cox s regression model. Thesis for the Degree Master of Science Log-linearity for Cox s regression model Thesis for the Degree Master of Science Zaki Amini Master s Thesis, Spring 2015 i Abstract Cox s regression model is one of the most applied methods in medical

More information

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes:

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes: Practice Exam 1 1. Losses for an insurance coverage have the following cumulative distribution function: F(0) = 0 F(1,000) = 0.2 F(5,000) = 0.4 F(10,000) = 0.9 F(100,000) = 1 with linear interpolation

More information

EEL 5544 Noise in Linear Systems Lecture 30. X (s) = E [ e sx] f X (x)e sx dx. Moments can be found from the Laplace transform as

EEL 5544 Noise in Linear Systems Lecture 30. X (s) = E [ e sx] f X (x)e sx dx. Moments can be found from the Laplace transform as L30-1 EEL 5544 Noise in Linear Systems Lecture 30 OTHER TRANSFORMS For a continuous, nonnegative RV X, the Laplace transform of X is X (s) = E [ e sx] = 0 f X (x)e sx dx. For a nonnegative RV, the Laplace

More information

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen Recap of Part 1 Per Kragh Andersen Section of Biostatistics, University of Copenhagen DSBS Course Survival Analysis in Clinical Trials January 2018 1 / 65 Overview Definitions and examples Simple estimation

More information

Testing Error Correction in Panel data

Testing Error Correction in Panel data University of Vienna, Dept. of Economics Master in Economics Vienna 2010 The Model (1) Westerlund (2007) consider the following DGP: y it = φ 1i + φ 2i t + z it (1) x it = x it 1 + υ it (2) where the stochastic

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

Multi-state models: prediction

Multi-state models: prediction Department of Medical Statistics and Bioinformatics Leiden University Medical Center Course on advanced survival analysis, Copenhagen Outline Prediction Theory Aalen-Johansen Computational aspects Applications

More information

Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs. Christopher Jennison

Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs. Christopher Jennison Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs Christopher Jennison Department of Mathematical Sciences, University of Bath http://people.bath.ac.uk/mascj

More information

CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity. Outline:

CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity. Outline: CASE STUDY: Bayesian Incidence Analyses from Cross-Sectional Data with Multiple Markers of Disease Severity Outline: 1. NIEHS Uterine Fibroid Study Design of Study Scientific Questions Difficulties 2.

More information

SEMIPARAMETRIC METHODS FOR ESTIMATING CUMULATIVE TREATMENT EFFECTS IN THE PRESENCE OF NON-PROPORTIONAL HAZARDS AND DEPENDENT CENSORING

SEMIPARAMETRIC METHODS FOR ESTIMATING CUMULATIVE TREATMENT EFFECTS IN THE PRESENCE OF NON-PROPORTIONAL HAZARDS AND DEPENDENT CENSORING SEMIPARAMETRIC METHODS FOR ESTIMATING CUMULATIVE TREATMENT EFFECTS IN THE PRESENCE OF NON-PROPORTIONAL HAZARDS AND DEPENDENT CENSORING by Guanghui Wei A dissertation submitted in partial fulfillment of

More information

Lectures 16-17: Poisson Approximation. Using Lemma (2.4.3) with θ = 1 and then Lemma (2.4.4), which is valid when max m p n,m 1/2, we have

Lectures 16-17: Poisson Approximation. Using Lemma (2.4.3) with θ = 1 and then Lemma (2.4.4), which is valid when max m p n,m 1/2, we have Lectures 16-17: Poisson Approximation 1. The Law of Rare Events Theorem 2.6.1: For each n 1, let X n,m, 1 m n be a collection of independent random variables with PX n,m = 1 = p n,m and PX n,m = = 1 p

More information

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University Survival Analysis: Weeks 2-3 Lu Tian and Richard Olshen Stanford University 2 Kaplan-Meier(KM) Estimator Nonparametric estimation of the survival function S(t) = pr(t > t) The nonparametric estimation

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

Survival Distributions, Hazard Functions, Cumulative Hazards

Survival Distributions, Hazard Functions, Cumulative Hazards BIO 244: Unit 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution

More information

Nonparametric Inference In Functional Data

Nonparametric Inference In Functional Data Nonparametric Inference In Functional Data Zuofeng Shang Purdue University Joint work with Guang Cheng from Purdue Univ. An Example Consider the functional linear model: Y = α + where 1 0 X(t)β(t)dt +

More information

The Multivariate Normal Distribution. In this case according to our theorem

The Multivariate Normal Distribution. In this case according to our theorem The Multivariate Normal Distribution Defn: Z R 1 N(0, 1) iff f Z (z) = 1 2π e z2 /2. Defn: Z R p MV N p (0, I) if and only if Z = (Z 1,..., Z p ) T with the Z i independent and each Z i N(0, 1). In this

More information

Tests of independence for censored bivariate failure time data

Tests of independence for censored bivariate failure time data Tests of independence for censored bivariate failure time data Abstract Bivariate failure time data is widely used in survival analysis, for example, in twins study. This article presents a class of χ

More information

Frailty Probit model for multivariate and clustered interval-censor

Frailty Probit model for multivariate and clustered interval-censor Frailty Probit model for multivariate and clustered interval-censored failure time data University of South Carolina Department of Statistics June 4, 2013 Outline Introduction Proposed models Simulation

More information

UNIVERSITÄT POTSDAM Institut für Mathematik

UNIVERSITÄT POTSDAM Institut für Mathematik UNIVERSITÄT POTSDAM Institut für Mathematik Testing the Acceleration Function in Life Time Models Hannelore Liero Matthias Liero Mathematische Statistik und Wahrscheinlichkeitstheorie Universität Potsdam

More information

Multivariate Survival Data With Censoring.

Multivariate Survival Data With Censoring. 1 Multivariate Survival Data With Censoring. Shulamith Gross and Catherine Huber-Carol Baruch College of the City University of New York, Dept of Statistics and CIS, Box 11-220, 1 Baruch way, 10010 NY.

More information

GEE for Longitudinal Data - Chapter 8

GEE for Longitudinal Data - Chapter 8 GEE for Longitudinal Data - Chapter 8 GEE: generalized estimating equations (Liang & Zeger, 1986; Zeger & Liang, 1986) extension of GLM to longitudinal data analysis using quasi-likelihood estimation method

More information

Implementing Response-Adaptive Randomization in Multi-Armed Survival Trials

Implementing Response-Adaptive Randomization in Multi-Armed Survival Trials Implementing Response-Adaptive Randomization in Multi-Armed Survival Trials BASS Conference 2009 Alex Sverdlov, Bristol-Myers Squibb A.Sverdlov (B-MS) Response-Adaptive Randomization 1 / 35 Joint work

More information

Spatial Process Estimates as Smoothers: A Review

Spatial Process Estimates as Smoothers: A Review Spatial Process Estimates as Smoothers: A Review Soutir Bandyopadhyay 1 Basic Model The observational model considered here has the form Y i = f(x i ) + ɛ i, for 1 i n. (1.1) where Y i is the observed

More information

Multi-state Models: An Overview

Multi-state Models: An Overview Multi-state Models: An Overview Andrew Titman Lancaster University 14 April 2016 Overview Introduction to multi-state modelling Examples of applications Continuously observed processes Intermittently observed

More information

Score test for random changepoint in a mixed model

Score test for random changepoint in a mixed model Score test for random changepoint in a mixed model Corentin Segalas and Hélène Jacqmin-Gadda INSERM U1219, Biostatistics team, Bordeaux GDR Statistiques et Santé October 6, 2017 Biostatistics 1 / 27 Introduction

More information

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Patrick J. Heagerty PhD Department of Biostatistics University of Washington 166 ISCB 2010 Session Four Outline Examples

More information

Mathematics Qualifying Examination January 2015 STAT Mathematical Statistics

Mathematics Qualifying Examination January 2015 STAT Mathematical Statistics Mathematics Qualifying Examination January 2015 STAT 52800 - Mathematical Statistics NOTE: Answer all questions completely and justify your derivations and steps. A calculator and statistical tables (normal,

More information

Master s Written Examination - Solution

Master s Written Examination - Solution Master s Written Examination - Solution Spring 204 Problem Stat 40 Suppose X and X 2 have the joint pdf f X,X 2 (x, x 2 ) = 2e (x +x 2 ), 0 < x < x 2

More information

Ph.D. course: Regression models. Introduction. 19 April 2012

Ph.D. course: Regression models. Introduction. 19 April 2012 Ph.D. course: Regression models Introduction PKA & LTS Sect. 1.1, 1.2, 1.4 19 April 2012 www.biostat.ku.dk/~pka/regrmodels12 Per Kragh Andersen 1 Regression models The distribution of one outcome variable

More information

Accounting for Baseline Observations in Randomized Clinical Trials

Accounting for Baseline Observations in Randomized Clinical Trials Accounting for Baseline Observations in Randomized Clinical Trials Scott S Emerson, MD, PhD Department of Biostatistics, University of Washington, Seattle, WA 9895, USA August 5, 0 Abstract In clinical

More information

arxiv: v1 [math.st] 22 Oct 2018

arxiv: v1 [math.st] 22 Oct 2018 Subtleties in the interpretation of hazard ratios arxiv:1810.09192v1 [math.st] 22 Oct 2018 Torben Martinussen 1, Stijn Vansteelandt 2 and Per Kragh Andersen 1 1 Department of Biostatistics University of

More information

Model Selection in Bayesian Survival Analysis for a Multi-country Cluster Randomized Trial

Model Selection in Bayesian Survival Analysis for a Multi-country Cluster Randomized Trial Model Selection in Bayesian Survival Analysis for a Multi-country Cluster Randomized Trial Jin Kyung Park International Vaccine Institute Min Woo Chae Seoul National University R. Leon Ochiai International

More information

Spatial Misalignment

Spatial Misalignment Spatial Misalignment Jamie Monogan University of Georgia Spring 2013 Jamie Monogan (UGA) Spatial Misalignment Spring 2013 1 / 28 Objectives By the end of today s meeting, participants should be able to:

More information

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements [Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements Aasthaa Bansal PhD Pharmaceutical Outcomes Research & Policy Program University of Washington 69 Biomarkers

More information

A Joint Model with Marginal Interpretation for Longitudinal Continuous and Time-to-event Outcomes

A Joint Model with Marginal Interpretation for Longitudinal Continuous and Time-to-event Outcomes A Joint Model with Marginal Interpretation for Longitudinal Continuous and Time-to-event Outcomes Achmad Efendi 1, Geert Molenberghs 2,1, Edmund Njagi 1, Paul Dendale 3 1 I-BioStat, Katholieke Universiteit

More information

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018 Mathematics Ph.D. Qualifying Examination Stat 52800 Probability, January 2018 NOTE: Answers all questions completely. Justify every step. Time allowed: 3 hours. 1. Let X 1,..., X n be a random sample from

More information

Ph.D. course: Regression models. Regression models. Explanatory variables. Example 1.1: Body mass index and vitamin D status

Ph.D. course: Regression models. Regression models. Explanatory variables. Example 1.1: Body mass index and vitamin D status Ph.D. course: Regression models Introduction PKA & LTS Sect. 1.1, 1.2, 1.4 25 April 2013 www.biostat.ku.dk/~pka/regrmodels13 Per Kragh Andersen Regression models The distribution of one outcome variable

More information

Chapter 4 Fall Notations: t 1 < t 2 < < t D, D unique death times. d j = # deaths at t j = n. Y j = # at risk /alive at t j = n

Chapter 4 Fall Notations: t 1 < t 2 < < t D, D unique death times. d j = # deaths at t j = n. Y j = # at risk /alive at t j = n Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 4 Fall 2012 4.2 Estimators of the survival and cumulative hazard functions for RC data Suppose X is a continuous random failure time with

More information

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS PROBABILITY: LIMIT THEOREMS II, SPRING 218. HOMEWORK PROBLEMS PROF. YURI BAKHTIN Instructions. You are allowed to work on solutions in groups, but you are required to write up solutions on your own. Please

More information

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples ST3241 Categorical Data Analysis I Generalized Linear Models Introduction and Some Examples 1 Introduction We have discussed methods for analyzing associations in two-way and three-way tables. Now we will

More information

Statistical Methods for Alzheimer s Disease Studies

Statistical Methods for Alzheimer s Disease Studies Statistical Methods for Alzheimer s Disease Studies Rebecca A. Betensky, Ph.D. Department of Biostatistics, Harvard T.H. Chan School of Public Health July 19, 2016 1/37 OUTLINE 1 Statistical collaborations

More information

Group Sequential Designs: Theory, Computation and Optimisation

Group Sequential Designs: Theory, Computation and Optimisation Group Sequential Designs: Theory, Computation and Optimisation Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj 8th International Conference

More information

On prediction and density estimation Peter McCullagh University of Chicago December 2004

On prediction and density estimation Peter McCullagh University of Chicago December 2004 On prediction and density estimation Peter McCullagh University of Chicago December 2004 Summary Having observed the initial segment of a random sequence, subsequent values may be predicted by calculating

More information

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models NIH Talk, September 03 Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models Eric Slud, Math Dept, Univ of Maryland Ongoing joint project with Ilia

More information

Estimation of the Bivariate and Marginal Distributions with Censored Data

Estimation of the Bivariate and Marginal Distributions with Censored Data Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new

More information

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January

More information

Statistical Inference of Interval-censored Failure Time Data

Statistical Inference of Interval-censored Failure Time Data Statistical Inference of Interval-censored Failure Time Data Jinheum Kim 1 1 Department of Applied Statistics, University of Suwon May 28, 2011 J. Kim (Univ Suwon) Interval-censored data Sprring KSS 2011

More information

Chapter 7: Hypothesis testing

Chapter 7: Hypothesis testing Chapter 7: Hypothesis testing Hypothesis testing is typically done based on the cumulative hazard function. Here we ll use the Nelson-Aalen estimate of the cumulative hazard. The survival function is used

More information