Part III Measures of Classification Accuracy for the Prediction of Survival Times

Size: px
Start display at page:

Download "Part III Measures of Classification Accuracy for the Prediction of Survival Times"

Transcription

1 Part III Measures of Classification Accuracy for the Prediction of Survival Times Patrick J Heagerty PhD Department of Biostatistics University of Washington 102 ISCB 2010

2 Session Three Outline Examples Breast Cancer Cytology Data Mayo PBC Data Cystic Fibrosis Foundation Registry Data Previous approaches ROC overview TP, FP for survival outcomes ROC C/D t (p) ROC I/D t (p), AUC(t), and concordance, C τ ROC(p) interpretation and dynamic criteria, c p (t) Further work 103 ISCB 2010

3 Example: Comparing cytometry measures Breast Cancer among Younger Women N = 253 women BC diagnosed aged 20 to 44 Endpoint: time-until-death (any cause) Cytometry measurements: old (S-phase ungated) new (S-phase gated) Goal: compare measures as predictors of mortality Heagerty, Lumley & Pepe (2000) 104 ISCB 2010

4 New versus Old Method %S Gated %S Ungated 105 ISCB 2010

5 Predictive Survival Models Mayo PBC Data N = 312 subjects, 125 deaths, Baseline measurements: bilirubin, prothrombin time, albumin Goal: predict mortality; medical management 106 ISCB 2010

6 107 ISCB 2010

7 Predictive Survival Models Cystic Fibrosis Data N = 23, 530 subjects, 4, 772 deaths, n = 160, 005 longitudinal observations Longitudinal measurements: FEV1, weight, height Goal: predict mortality; transplantation selection 108 ISCB 2010

8 fev fev fev fev fev fev age age age age age age fev fev fev fev fev fev age age age age age age fev fev fev fev fev fev age age age age age age fev fev fev fev fev fev age age age ISCB 2010 age age age

9 Accuracy: Some proposals R 2 Generalizations Korn and Simon (1990) Schemper and Henderson (2000) O Quigley and Xu (2001) TP, FP, ROC Generalizations Etzioni et al (1999); Slate and Turnbull (1999) Heagerty, Lumley, and Pepe (2000) Heagerty and Zheng (2005) 109 ISCB 2010

10 Some Comments Schemper and Henderson (2000), p 249: Consequently, there have been a number of attempts to develop measures akin to R 2 for Cox proportional hazards models [[references]], though as yet, none have been generally accepted These versions of R 2 are not about variance in T They focus on average variances of the counting process: N(t) = 1(T t) 110 ISCB 2010

11 Some Comments Natural to think of survival through counting process N(t) Common to use ROC curves for logistic regression / binary classification Q: Extend classification error rate concepts to survival data? 111 ISCB 2010

12 Components of Accuracy Calibration Bias does observed match predicted? Evaluated graphically and formally Discrimination Does prediction separate subjects with different risks? Evaluated qualitatively based on K-M plots Harrell, Lee and Mark (1996) 112 ISCB 2010

13 BC Survival: New Measurement (gated) (a) Survival for %S, Gated %S: [00, 2 ), N= 86 %S: [ 2, 6 ), N= 86 %S: [ 6, 100), N= ISCB 2010

14 BC Survival: Old Measurement (ungated) (b) Survival for %S, Ungated %S: [00, 11 ), N= 86 %S: [ 11, 35 ), N= 86 %S: [ 35, 100), N= ISCB 2010

15 Binary Classification Sensitivity True Positive BINARY TEST : P (T + D = 1) CONTINUOUS MARKER : P (M > c D = 1) Specificity True Negative BINARY TEST : P (T D = 0) CONTINUOUS MARKER : P (M c D = 0) 115 ISCB 2010

16 ROC Curve An ROC curve plots the True Positive Rate, TP(c), versus the False Positive Rate, FP(c) for all possible cutpoints, c: FP(c) = P (M > c D = 0) TP(c) = P (M > c D = 1) ROC Curve : [ F P (c), T P (c) ] c 116 ISCB 2010

17 Marker versus Disease status ROC curve sensitivity specificity ISCB 2010

18 Marker versus Disease status ROC curve sensitivity specificity ISCB 2010

19 Marker versus Disease status ROC curve sensitivity specificity ISCB 2010

20 Marker versus Disease status ROC curve sensitivity specificity ISCB 2010

21 Marker versus Disease status ROC curve sensitivity specificity ISCB 2010

22 Marker versus Disease status ROC curve sensitivity specificity ISCB 2010

23 Marker versus Disease status ROC curve sensitivity specificity ISCB 2010

24 Marker versus Disease status ROC curve sensitivity specificity ISCB 2010

25 Marker versus Disease status ROC curve sensitivity specificity ISCB 2010

26 Marker versus Disease status ROC curve sensitivity specificity ISCB 2010

27 ROC Curves 1 Compare different markers over full spectrum of error combinations 2 Compare sensitivity when controlling specificity (eg TP when FP=10%) 3 AUC interpretation: For a randomly chosen case and control, the area under the ROC curve is the probability that the marker for the case is greater than the marker for the control 4 AUC is a marker-outcome concordance summary 118 ISCB 2010

28 Does a (repeated) measurement predict onset? Q: Can a measurement accurately predict which CASES will experience an event (soon)? eg FEV1 eg death time Q: Can a measurement be used to accurately guide longitudinal treatment decisions? eg lung transplantation 119 ISCB 2010

29 Classification Errors True Positive Rate P ( high measurement CASE ) False Positive Rate P ( high measurement CONTROL ) 120 ISCB 2010

30 Issues Related to Time Q: When is the measurement taken? At baseline: Y (0) At a follow-up time t: Y (t) Q: What time is used to determine when someone is a CASE or a CONTROL? Case = event (disease, death) before time t Case = event (disease, death) at time t Control = event-free through time t Control = event-free through a large follow-up time t 121 ISCB 2010

31 Our approaches measurement case control Heagerty, Lumley & Pepe (2000) baseline T t T > t Zheng & Heagerty (2004) longitudinal T = t T > t Heagerty & Zheng (2005) baseline T = t T > t Zheng & Heagerty (2007) longitudinal T t T > t Saha & Heagerty (2010) longitudinal T = t, δ = j T > t 122 ISCB 2010

32 Sensitivity and Specificity for Survival Let T denote the survival time, and let N(t) denote the counting process for the uncensored outcome: N(t) = 1(T t) Possible definitions: CASE(t) : CONTROL(t) : Cumulative N(t) = 1 Incident dn(t) = 1 Static N(t ) = 0 Dynamic N(t) = 0 Where t is a fixed large time, t >> t 123 ISCB 2010

33 HLP(2000) Time ZH(2004) Time HZ(2005) Time ZH(2007) Time 124 ISCB 2010

34 [1] Sensitivity and Specificity for Survival Define: Heagerty, Lumley & Pepe (2000) sensitivity C (c, t) : P (M > c T t) P (M > c N(t) = 1) specificity D (c, t) : P (M c T > t) P (M c N(t) = 0) T P C t (c) = P (M > c N(t) = 1) F P D t (c) = P (M > c N(t) = 0) 125 ISCB 2010

35 [1] Time-dependent ROC Curve An Cumulative/Dynamic ROC curve shows the ability of a marker to separate the cumulative cases through time t (eg T t) from the controls at time t (eg T > t) Define curve [ p, ROC C/D t (p) ]: ROC C/D t (p) = T P C { [F P D ] 1 (p) } = T P C (c p ) where c p : p = F P D (c p ) Define AUC as: AU C(t) = ROC C/D t (p) dp 126 ISCB 2010

36 Estimation: NNE for S(m, t) With censored times a valid ROC solution can be provided by using an estimator of the bivariate distribution function for [M, T ] Define: S(c, t) = P [M > c, T > t] S(c, t) = c S(t M = u)df M (u) where F M (u) is the distribution function for M 127 ISCB 2010

37 Akritas (1994): Ŝ λn (c, t) = 1 n Ŝ λn (t M = M i )1(M i > c) i where Ŝλ n (t M = M i ) is a suitable estimator of the conditional survival function characterized by a smoothing parameter λ n 128 ISCB 2010

38 Estimation: NNE for S(m, t) Define: P λn [M > c N(t) = 1] = {[1 F M (c)] Ŝλ n (c, t)} {1 Ŝλ n (t)} P λn [M c N(t) = 0] = 1 Ŝλ n (c, t) Ŝ λn (t) where Ŝλ n (t) = Ŝλ n (, t) For NNE λ n = O(n 1/3 ) sufficient for weak consistency Results from Akritas (1994) and van de Vaart and Wellner (1996) imply that the bootstrap can be used for inference 129 ISCB 2010

39 (a) ROC(40m), NNE (b) ROC(60m), NNE (c) ROC(100m), NNE sensitivity specificity (d) ROC(40m), Kaplan Meier sensitivity sensitivity specificity sensitivity sensitivity specificity (e) ROC(60m), Kaplan Meier (f) ROC(100m), Kaplan Meier sensitivity specificity 1 specificity 1 specificity 130 ISCB 2010

40 Accuracy Comparisons Sensitivity Using gated (new) measurement: P [M 1 54 N(60m) = 0] = 071 P [M 1 > 54 N(60m) = 1] = 082 Using ungated (old) measurement: P [M 2 35 N(60m) = 0] = 071 P [M 2 > 35 N(60m) = 1] = 054 Therefore, controlling M 1 and M 2 to have equal specificity, the new measure, M 1, has a greater sensitivity 131 ISCB 2010

41 Accuracy Comparisons AUC Calculations AUC for gated (new): 080, Bootstrap 95% CI: (072,089) AUC for ungated (old): 068, Bootstrap 95% CI: (056,077) 95% CI for difference in areas (new-old): (003, 026) 132 ISCB 2010

42 Summary Define time-dependent ROC curves based on prospective data and cumulative occurence of events Local survival estimation handles censored event times Tool to evaluate a marker or a survival regression model score R package: survivalroc (see web for doc/validation) Alternative definitions? More general longitudinal marker scenario? Heagerty, Lumley and Pepe (2000) Time-dependent ROC Curves for Censored Survival Data and a Diagnostic Marker Biometrics 133 ISCB 2010

43 [2] Sensitivity and Specificity for Survival Define: Heagerty & Zheng (2005) sensitivity I (c, t) : P (M > c T = t) P (M > c dn(t) = 1) specificity D (c, t) : P (M c T > t) P (M c N(t) = 0) T P I t (c) = P (M > c dn(t) = 1) F P D t (c) = P (M > c N(t) = 0) 134 ISCB 2010

44 [2] Time-dependent ROC Curve An Incident/Dynamic ROC curve shows the ability of a marker to separate the cases (T = t) from the controls (T > t) within a (potential) risk-set (T t) Define curve [ p, ROC I/D t (p) ]: ROC I/D t (p) = T P I { [F P D ] 1 (p) } = T P I (c p ) where c p : p = F P D (c p ) Define AUC as function of time: AU C(t) = ROC I/D t (p) dp 135 ISCB 2010

45 log normal survival example Marker RHO = log(time) 136 ISCB 2010

46 log-normal survival example AT RISK Marker CASE CONTROL log(time) ISCB 2010

47 I/D ROC curve for log normal sensitivity log(t) = specificity 137 ISCB 2010

48 log-normal survival example AT RISK Marker CASE CONTROL log(time) ISCB 2010

49 log-normal survival example AT RISK Marker CASE CONTROL log(time) ISCB 2010

50 log-normal survival example AT RISK Marker CASE CONTROL log(time) ISCB 2010

51 log-normal survival example AT RISK Marker CASE CONTROL log(time) ISCB 2010

52 log-normal survival example AT RISK Marker CASE CONTROL log(time) ISCB 2010

53 I/D ROC curves for log normal sensitivity log(t) = 15 log(t) = 1 log(t) = 05 log(t) = 0 log(t) = 05 log(t) = specificity 138 ISCB 2010

54 AUC(t) and Concordance Q: The I/D ROC curve and AUC(t) provide time-specific summaries of accuracy, but is there a single global summary? Concordance: C = P (M j > M k T j < T k ) C = t AUC(t) w(t) dt with w(t) = 2 f(t) S(t) 139 ISCB 2010

55 AUC(t) and Concordance Time can be restricted to (0, τ) to obtain: C τ = P (M j > M k T j < T k, T j < τ) = τ 0 AUC(t) w τ (t) dt with w τ (t) = w(t)/[1 S 2 (τ)] C is directly related to Kendall s tau, K: Korn and Simon (1990) Harrell, Lee and Mark (1996) C = K/2 + 1/2 140 ISCB 2010

56 AUC(t) curves for log normal AUC(t) w(t) rho = 09, C = 083 rho = 08, C = 078 rho = 07, C = 074 rho = 06, C = time 141 ISCB 2010

57 Estimation: Issues F Pt D (c) = P (M > c T > t) can be estimated non-parametrically for times when i R i(t) moderate-to-large, where R i (t) = 1(Ti > t), at-risk indicator However, estimation of T P I t (c) = P (M > c T = t) requires some sort of smoothing since the observed subset with T i = t may only contain one observation Essentially we are interested in regression quantiles for the marker as a function of time, T = t, but T may be censored (coarsened covariate) The hazard can be used as a bridge to estimate T P I t (p) 142 ISCB 2010

58 Estimation: Proportional Hazards Assume: (M i, T i, i) iid T i = min(t i, C i ), i = 1(T i < C i ) Independent censoring, C i No assumption for marker distribution Hazard Model: λ(t M i ) = λ 0 (t) exp(γ M i ) 143 ISCB 2010

59 Estimation: Proportional Hazards To estimate F P D t (c) we use the empirical distribution for the control set (T > t): F P D t (c) = i 1 n t R i (t+) 1(M i > c) where R i (t+) = 1(T i > t), n t = i R i (t+) 144 ISCB 2010

60 Estimation: Proportional Hazards To estimate T P I t (c) we use the exponential tilt of the empirical distribution for the risk set (T t): T P I t(c) = i π i (t, γ) 1(M i > c) where π i (t, γ) = R i (t) exp(γ M i )/W t W t = i R i (t) exp(γ M i ), R i (t) = 1(T i t) Xu and O Quigley (2000) show that the weights, π i (t, γ), applied to the risk set provide consistent estimation of P (M i > c T i = t) Partial likelihood connections: E(M T = t) 145 ISCB 2010

61 Estimation: Hazard as Bridge A general definition for the hazard is Then using a little algebra yields λ(t M i ) = P (T i = t M i ) P (T i t M i ) P (M i = m T i = t) λ(t M i = m) P (M i = m T i t) 146 ISCB 2010

62 Estimation: Hazard as Bridge A general definition for the hazard is Then using a little algebra yields λ(t M i ) = P (T i = t M i ) P (T i t M i ) P (M i = m T i = t) λ(t M i = m) }{{} P (M i = m T i t) }{{} Estimate = Smooth model + Empirical 146 ISCB 2010

63 Estimation: non-ph Estimation assuming PH can be relaxed using a varying coefficient model: Estimation of γ(t) λ(t M i ) = λ 0 (t) exp[γ(t) M i ] Hastie and Tibshirani (1993) Cai and Sun (2003) [local linear MPLE] simple smoothing of scaled Schoenfeld residuals Only assumes smooth hazard ratios, and linearity in M Linearity in M can be relaxed using functions f(m) 147 ISCB 2010

64 Estimation: non-ph Only the estimate of T P I t (c) is modified: T P I t(c) = i π i [t, γ(t)] 1(M i > c) where π i [t, γ(t)] = R i (t) exp[γ(t) M i ]/W t W t = i R i (t) exp[γ(t) M i ] 148 ISCB 2010

65 Some Simulations Data (M i, log T i ) were generated as bivariate normal with a correlation of ρ = 07 The sample size for each simulated data set was N = 200 The AUC(t) curve and the integrated curve, C τ, was estimated using: maximum likelihood assuming a bivariate normal model Cox model which assumes proportional hazards local maximum partial likelihood for the varying-coefficient model λ(t) = λ 0 (t) exp[γ(t) M i ] local linear smooth of the scaled Schoenfeld residuals to estimate the varying-coefficient model 149 ISCB 2010

66 AUC(t) Estimated Using Local-linear MPL AUC(t) log(time) 150 ISCB 2010

67 40% Censoring MLE Cox model local MPLE residual smooth log time AU C(t) mean (sd) mean (sd) mean (sd) mean (sd) (0019) 0749 (0031) 0859 (0054) 0875 (0048) (0021) 0742 (0029) 0818 (0035) 0827 (0037) (0021) 0732 (0026) 0770 (0035) 0772 (0035) (0020) 0722 (0024) 0724 (0038) 0722 (0039) (0019) 0712 (0024) 0689 (0042) 0687 (0041) (0018) 0702 (0026) 0654 (0045) 0655 (0043) (0016) 0689 (0035) 0633 (0057) 0637 (0048) (0015) 0653 (0055) 0617 (0075) 0614 (0051) (0013) 0560 (0073) 0555 (0075) 0546 (0058) C τ (0017) 0727 (0022) 0740 (0021) 0742 (0021) 151 ISCB 2010

68 Illustration: PBC Data Marker, M i, derived as linear predictor from Cox model: Covariate estimate se Z log(bilirubin) log(prothrombin time) edema albumin age Accuracy evaluated using λ 0 (t) exp[γ(t) M i ] local-linear MPLE for γ(t) (5) predictor model compared to (4) predictor excluding bilirubin 152 ISCB 2010

69 I/D ROC curves for PBC model score sensitivity year = 1 year = 2 year = 3 year = 4 year = specificity ISCB 2010

70 Illustration: PBC Data AUC based on Varying-Coefficient Cox model AUC w(t) 5 covariates: iauc = covariates: iauc = Time (days) 153 ISCB 2010

71 Summary Extension of ROC concepts to risk sets Estimation based on hazard model Varying coefficient simple methods look promising All summaries can be obtained from routine Cox model output Criterion for marker and/or model comparison Separation of marker generation and marker evaluation Note: R 2 for PBC data estimated as 032 (max possible 098) Heagerty and Zheng (2005) Survival Model Predictive Accuracy and ROC Curves Biometrics 154 ISCB 2010

72 Extension to Longitudinal Markers TP, FP based on longitudinal marker T P I t (c) = P [M(t) > c T = t] F P D t (c) = P [M(t) > c T > t] No simple concordance, but AUC(t) w(t)dt Dynamic criterion, c p (t), controls specificity c p (t) : p = P [M(t) > c p (t) T > t] Summary ROC curve shows total percent test positive when controlling FP using dynamic criterion 155 ISCB 2010

73 156 ISCB 2010

74 Illustration: CFF Data and Longitudinal Markers Split sample (development / validation) Time-dependent covariate Cox model Covariate estimate se Z fev fev1(t2) fev1(t3) gender weightz heightz Accuracy evaluated using λ 0 (t) exp[γ(t) M i (t)] Measurement spacing: median = 100, Q1 = 087, Q3 = ISCB 2010

75 CFF Survival Survival Time 158 ISCB 2010

76 Incident / Dynamic ROC for CFF Data True Positive Rate Age 10 Age 15 Age 20 Age 25 Age False Positive Rate 159 ISCB 2010

77 Summary Survival ROC Curve Dynamic criterion Saha and Heagerty (manuscript) c p (t) : p = P [M(t) > c p (t) T > t] Q: If a fixed time-dependent FP rate of p is used then what percent of cases will test positive at the appropriate time? Total Sensitivity = P [M(t) > c p (t) T = t] P [T = t]dt ROC(p) = E T [ ROC I/D T (p) ] t Note: Consider a time lag, L, such that the marker used to model the hazard is M(t L) This leads to test positive L units before failure 160 ISCB 2010

78 Time (years) model score Threshold functions for FP = 001, 005, and ISCB 2010

79 Summary Survival ROC for CFF Data sensitivity fev1, weight weight specificity 162 ISCB 2010

80 AUC(t) for CFF Data AUC(t) Time 163 ISCB 2010

81 Summary Accuracy summary ROC I/D t (p) : vary (M,t) AU C(t) : vary (t) ROC(p) : vary (M) C : global 164 ISCB 2010

82 Summary Longitudinal data as predictors of an event time Accuracy using sequential binary classification Connections to partial likelihood Software R code survivalroc R code risksetroc Now: Inference Future: Relax strong censoring assumption 165 ISCB 2010

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Patrick J. Heagerty PhD Department of Biostatistics University of Washington 166 ISCB 2010 Session Four Outline Examples

More information

Survival Model Predictive Accuracy and ROC Curves

Survival Model Predictive Accuracy and ROC Curves UW Biostatistics Working Paper Series 12-19-2003 Survival Model Predictive Accuracy and ROC Curves Patrick Heagerty University of Washington, heagerty@u.washington.edu Yingye Zheng Fred Hutchinson Cancer

More information

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements [Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements Aasthaa Bansal PhD Pharmaceutical Outcomes Research & Policy Program University of Washington 69 Biomarkers

More information

Lecture 3. Truncation, length-bias and prevalence sampling

Lecture 3. Truncation, length-bias and prevalence sampling Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in

More information

PhD course: Statistical evaluation of diagnostic and predictive models

PhD course: Statistical evaluation of diagnostic and predictive models PhD course: Statistical evaluation of diagnostic and predictive models Tianxi Cai (Harvard University, Boston) Paul Blanche (University of Copenhagen) Thomas Alexander Gerds (University of Copenhagen)

More information

arxiv: v1 [stat.me] 25 Oct 2012

arxiv: v1 [stat.me] 25 Oct 2012 Time-dependent AUC with right-censored data: a survey study Paul Blanche, Aurélien Latouche, Vivian Viallon arxiv:1210.6805v1 [stat.me] 25 Oct 2012 Abstract The ROC curve and the corresponding AUC are

More information

Residuals and model diagnostics

Residuals and model diagnostics Residuals and model diagnostics Patrick Breheny November 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/42 Introduction Residuals Many assumptions go into regression models, and the Cox proportional

More information

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data 1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions

More information

Lecture 7 Time-dependent Covariates in Cox Regression

Lecture 7 Time-dependent Covariates in Cox Regression Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the

More information

Evaluating Predictive Accuracy of Survival Models with PROC PHREG

Evaluating Predictive Accuracy of Survival Models with PROC PHREG Paper SAS462-2017 Evaluating Predictive Accuracy of Survival Models with PROC PHREG Changbin Guo, Ying So, and Woosung Jang, SAS Institute Inc. Abstract Model validation is an important step in the model

More information

Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times

Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times Patrick J. Heagerty PhD Department of Biostatistics University of Washington 1 Biomarkers Review: Cox Regression Model

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview

Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective Anastasios (Butch) Tsiatis and Xiaofei Bai Department of Statistics North Carolina State University 1/35 Optimal Treatment

More information

c Copyright 2015 Chao-Kang Jason Liang

c Copyright 2015 Chao-Kang Jason Liang c Copyright 2015 Chao-Kang Jason Liang Methods for describing the time-varying predictive performance of survival models Chao-Kang Jason Liang A dissertation submitted in partial fulfillment of the requirements

More information

Introduction to Statistical Analysis

Introduction to Statistical Analysis Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive

More information

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL The Cox PH model: λ(t Z) = λ 0 (t) exp(β Z). How do we estimate the survival probability, S z (t) = S(t Z) = P (T > t Z), for an individual with covariates

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the

More information

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation Biost 58 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture 5: Review Purpose of Statistics Statistics is about science (Science in the broadest

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Estimation of the Bivariate and Marginal Distributions with Censored Data

Estimation of the Bivariate and Marginal Distributions with Censored Data Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information

Application of the Time-Dependent ROC Curves for Prognostic Accuracy with Multiple Biomarkers

Application of the Time-Dependent ROC Curves for Prognostic Accuracy with Multiple Biomarkers UW Biostatistics Working Paper Series 4-8-2005 Application of the Time-Dependent ROC Curves for Prognostic Accuracy with Multiple Biomarkers Yingye Zheng Fred Hutchinson Cancer Research Center, yzheng@fhcrc.org

More information

Survival analysis in R

Survival analysis in R Survival analysis in R Niels Richard Hansen This note describes a few elementary aspects of practical analysis of survival data in R. For further information we refer to the book Introductory Statistics

More information

Lecture 22 Survival Analysis: An Introduction

Lecture 22 Survival Analysis: An Introduction University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which

More information

Exercises. (a) Prove that m(t) =

Exercises. (a) Prove that m(t) = Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

The influence of categorising survival time on parameter estimates in a Cox model

The influence of categorising survival time on parameter estimates in a Cox model The influence of categorising survival time on parameter estimates in a Cox model Anika Buchholz 1,2, Willi Sauerbrei 2, Patrick Royston 3 1 Freiburger Zentrum für Datenanalyse und Modellbildung, Albert-Ludwigs-Universität

More information

Analysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington

Analysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington Analysis of Longitudinal Data Patrick J Heagerty PhD Department of Biostatistics University of Washington Auckland 8 Session One Outline Examples of longitudinal data Scientific motivation Opportunities

More information

Semiparametric Estimation of Time-Dependent: ROC Curves for Longitudinal Marker Data

Semiparametric Estimation of Time-Dependent: ROC Curves for Longitudinal Marker Data UW Biostatistics Working Paper Series 12-19-2003 Semiparametric Estimation of Time-Dependent: ROC Curves for Longitudinal Marker Data Yingye Zheng Fred Hutchinson Cancer Research Center, yzheng@fhcrcorg

More information

Distribution-free ROC Analysis Using Binary Regression Techniques

Distribution-free ROC Analysis Using Binary Regression Techniques Distribution-free Analysis Using Binary Techniques Todd A. Alonzo and Margaret S. Pepe As interpreted by: Andrew J. Spieker University of Washington Dept. of Biostatistics Introductory Talk No, not that!

More information

STATISTICAL METHODS FOR EVALUATING BIOMARKERS SUBJECT TO DETECTION LIMIT

STATISTICAL METHODS FOR EVALUATING BIOMARKERS SUBJECT TO DETECTION LIMIT STATISTICAL METHODS FOR EVALUATING BIOMARKERS SUBJECT TO DETECTION LIMIT by Yeonhee Kim B.S. and B.B.A., Ewha Womans University, Korea, 2001 M.S., North Carolina State University, 2005 Submitted to the

More information

Log-linearity for Cox s regression model. Thesis for the Degree Master of Science

Log-linearity for Cox s regression model. Thesis for the Degree Master of Science Log-linearity for Cox s regression model Thesis for the Degree Master of Science Zaki Amini Master s Thesis, Spring 2015 i Abstract Cox s regression model is one of the most applied methods in medical

More information

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen Recap of Part 1 Per Kragh Andersen Section of Biostatistics, University of Copenhagen DSBS Course Survival Analysis in Clinical Trials January 2018 1 / 65 Overview Definitions and examples Simple estimation

More information

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520 REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520 Department of Statistics North Carolina State University Presented by: Butch Tsiatis, Department of Statistics, NCSU

More information

Evaluation of the predictive capacity of a biomarker

Evaluation of the predictive capacity of a biomarker Evaluation of the predictive capacity of a biomarker Bassirou Mboup (ISUP Université Paris VI) Paul Blanche (Université Bretagne Sud) Aurélien Latouche (Institut Curie & Cnam) GDR STATISTIQUE ET SANTE,

More information

Proportional hazards regression

Proportional hazards regression Proportional hazards regression Patrick Breheny October 8 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/28 Introduction The model Solving for the MLE Inference Today we will begin discussing regression

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

Statistical Methods for Alzheimer s Disease Studies

Statistical Methods for Alzheimer s Disease Studies Statistical Methods for Alzheimer s Disease Studies Rebecca A. Betensky, Ph.D. Department of Biostatistics, Harvard T.H. Chan School of Public Health July 19, 2016 1/37 OUTLINE 1 Statistical collaborations

More information

Survival analysis in R

Survival analysis in R Survival analysis in R Niels Richard Hansen This note describes a few elementary aspects of practical analysis of survival data in R. For further information we refer to the book Introductory Statistics

More information

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Progress, Updates, Problems William Jen Hoe Koh May 9, 2013 Overview Marginal vs Conditional What is TMLE? Key Estimation

More information

Extensions of Cox Model for Non-Proportional Hazards Purpose

Extensions of Cox Model for Non-Proportional Hazards Purpose PhUSE 2013 Paper SP07 Extensions of Cox Model for Non-Proportional Hazards Purpose Jadwiga Borucka, PAREXEL, Warsaw, Poland ABSTRACT Cox proportional hazard model is one of the most common methods used

More information

Joint Modeling of Longitudinal Item Response Data and Survival

Joint Modeling of Longitudinal Item Response Data and Survival Joint Modeling of Longitudinal Item Response Data and Survival Jean-Paul Fox University of Twente Department of Research Methodology, Measurement and Data Analysis Faculty of Behavioural Sciences Enschede,

More information

Longitudinal + Reliability = Joint Modeling

Longitudinal + Reliability = Joint Modeling Longitudinal + Reliability = Joint Modeling Carles Serrat Institute of Statistics and Mathematics Applied to Building CYTED-HAROSA International Workshop November 21-22, 2013 Barcelona Mainly from Rizopoulos,

More information

Checking model assumptions with regression diagnostics

Checking model assumptions with regression diagnostics @graemeleehickey www.glhickey.com graeme.hickey@liverpool.ac.uk Checking model assumptions with regression diagnostics Graeme L. Hickey University of Liverpool Conflicts of interest None Assistant Editor

More information

NONPARAMETRIC ADJUSTMENT FOR MEASUREMENT ERROR IN TIME TO EVENT DATA: APPLICATION TO RISK PREDICTION MODELS

NONPARAMETRIC ADJUSTMENT FOR MEASUREMENT ERROR IN TIME TO EVENT DATA: APPLICATION TO RISK PREDICTION MODELS BIRS 2016 1 NONPARAMETRIC ADJUSTMENT FOR MEASUREMENT ERROR IN TIME TO EVENT DATA: APPLICATION TO RISK PREDICTION MODELS Malka Gorfine Tel Aviv University, Israel Joint work with Danielle Braun and Giovanni

More information

Longitudinal breast density as a marker of breast cancer risk

Longitudinal breast density as a marker of breast cancer risk Longitudinal breast density as a marker of breast cancer risk C. Armero (1), M. Rué (2), A. Forte (1), C. Forné (2), H. Perpiñán (1), M. Baré (3), and G. Gómez (4) (1) BIOstatnet and Universitat de València,

More information

DAGStat Event History Analysis.

DAGStat Event History Analysis. DAGStat 2016 Event History Analysis Robin.Henderson@ncl.ac.uk 1 / 75 Schedule 9.00 Introduction 10.30 Break 11.00 Regression Models, Frailty and Multivariate Survival 12.30 Lunch 13.30 Time-Variation and

More information

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University Survival Analysis: Weeks 2-3 Lu Tian and Richard Olshen Stanford University 2 Kaplan-Meier(KM) Estimator Nonparametric estimation of the survival function S(t) = pr(t > t) The nonparametric estimation

More information

Joint longitudinal and time-to-event models via Stan

Joint longitudinal and time-to-event models via Stan Joint longitudinal and time-to-event models via Stan Sam Brilleman 1,2, Michael J. Crowther 3, Margarita Moreno-Betancur 2,4,5, Jacqueline Buros Novik 6, Rory Wolfe 1,2 StanCon 2018 Pacific Grove, California,

More information

Support Vector Hazard Regression (SVHR) for Predicting Survival Outcomes. Donglin Zeng, Department of Biostatistics, University of North Carolina

Support Vector Hazard Regression (SVHR) for Predicting Survival Outcomes. Donglin Zeng, Department of Biostatistics, University of North Carolina Support Vector Hazard Regression (SVHR) for Predicting Survival Outcomes Introduction Method Theoretical Results Simulation Studies Application Conclusions Introduction Introduction For survival data,

More information

Tied survival times; estimation of survival probabilities

Tied survival times; estimation of survival probabilities Tied survival times; estimation of survival probabilities Patrick Breheny November 5 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/22 Introduction Tied survival times Introduction Breslow approximation

More information

TMA 4275 Lifetime Analysis June 2004 Solution

TMA 4275 Lifetime Analysis June 2004 Solution TMA 4275 Lifetime Analysis June 2004 Solution Problem 1 a) Observation of the outcome is censored, if the time of the outcome is not known exactly and only the last time when it was observed being intact,

More information

Survival Analysis. Stat 526. April 13, 2018

Survival Analysis. Stat 526. April 13, 2018 Survival Analysis Stat 526 April 13, 2018 1 Functions of Survival Time Let T be the survival time for a subject Then P [T < 0] = 0 and T is a continuous random variable The Survival function is defined

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

Estimating a time-dependent concordance index for survival prediction models with covariate dependent censoring

Estimating a time-dependent concordance index for survival prediction models with covariate dependent censoring Noname manuscript No. (will be inserted by the editor) Estimating a time-dependent concordance index for survival prediction models with covariate dependent censoring Thomas A. Gerds 1, Michael W Kattan

More information

Partly Conditional Survival Models for Longitudinal Data

Partly Conditional Survival Models for Longitudinal Data UW Biostatistics Working Paper Series 12-19-23 Partly Conditional Survival Models for Longitudinal Data Yingye Zheng Fred Hutchinson Cancer Research Center, yzheng@fhcrc.org Patrick Heagerty University

More information

Quantile Regression for Residual Life and Empirical Likelihood

Quantile Regression for Residual Life and Empirical Likelihood Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu

More information

Bayesian Nonparametric Accelerated Failure Time Models for Analyzing Heterogeneous Treatment Effects

Bayesian Nonparametric Accelerated Failure Time Models for Analyzing Heterogeneous Treatment Effects Bayesian Nonparametric Accelerated Failure Time Models for Analyzing Heterogeneous Treatment Effects Nicholas C. Henderson Thomas A. Louis Gary Rosner Ravi Varadhan Johns Hopkins University September 28,

More information

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation of Conditional

More information

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

The Weibull Distribution

The Weibull Distribution The Weibull Distribution Patrick Breheny October 10 Patrick Breheny University of Iowa Survival Data Analysis (BIOS 7210) 1 / 19 Introduction Today we will introduce an important generalization of the

More information

Survival Analysis I (CHL5209H)

Survival Analysis I (CHL5209H) Survival Analysis Dalla Lana School of Public Health University of Toronto olli.saarela@utoronto.ca January 7, 2015 31-1 Literature Clayton D & Hills M (1993): Statistical Models in Epidemiology. Not really

More information

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology Group Sequential Tests for Delayed Responses Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj Lisa Hampson Department of Mathematics and Statistics,

More information

Machine Learning Linear Classification. Prof. Matteo Matteucci

Machine Learning Linear Classification. Prof. Matteo Matteucci Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)

More information

Estimating Optimum Linear Combination of Multiple Correlated Diagnostic Tests at a Fixed Specificity with Receiver Operating Characteristic Curves

Estimating Optimum Linear Combination of Multiple Correlated Diagnostic Tests at a Fixed Specificity with Receiver Operating Characteristic Curves Journal of Data Science 6(2008), 1-13 Estimating Optimum Linear Combination of Multiple Correlated Diagnostic Tests at a Fixed Specificity with Receiver Operating Characteristic Curves Feng Gao 1, Chengjie

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

Analysing geoadditive regression data: a mixed model approach

Analysing geoadditive regression data: a mixed model approach Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression

More information

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data Malaysian Journal of Mathematical Sciences 11(3): 33 315 (217) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal homepage: http://einspem.upm.edu.my/journal Approximation of Survival Function by Taylor

More information

Müller: Goodness-of-fit criteria for survival data

Müller: Goodness-of-fit criteria for survival data Müller: Goodness-of-fit criteria for survival data Sonderforschungsbereich 386, Paper 382 (2004) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner Goodness of fit criteria for survival data

More information

Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models

Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models Nicholas C. Henderson Thomas A. Louis Gary Rosner Ravi Varadhan Johns Hopkins University July 31, 2018

More information

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen Outline Cox s proportional hazards model. Goodness-of-fit tools More flexible models R-package timereg Forthcoming book, Martinussen and Scheike. 2/38 University of Copenhagen http://www.biostat.ku.dk

More information

Building a Prognostic Biomarker

Building a Prognostic Biomarker Building a Prognostic Biomarker Noah Simon and Richard Simon July 2016 1 / 44 Prognostic Biomarker for a Continuous Measure On each of n patients measure y i - single continuous outcome (eg. blood pressure,

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

Multivariate Survival Analysis

Multivariate Survival Analysis Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in

More information

Ph.D. course: Regression models. Introduction. 19 April 2012

Ph.D. course: Regression models. Introduction. 19 April 2012 Ph.D. course: Regression models Introduction PKA & LTS Sect. 1.1, 1.2, 1.4 19 April 2012 www.biostat.ku.dk/~pka/regrmodels12 Per Kragh Andersen 1 Regression models The distribution of one outcome variable

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu

More information

STAT 526 Spring Final Exam. Thursday May 5, 2011

STAT 526 Spring Final Exam. Thursday May 5, 2011 STAT 526 Spring 2011 Final Exam Thursday May 5, 2011 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will

More information

Faculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics

Faculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics Faculty of Health Sciences Regression models Counts, Poisson regression, 27-5-2013 Lene Theil Skovgaard Dept. of Biostatistics 1 / 36 Count outcome PKA & LTS, Sect. 7.2 Poisson regression The Binomial

More information

Ph.D. course: Regression models. Regression models. Explanatory variables. Example 1.1: Body mass index and vitamin D status

Ph.D. course: Regression models. Regression models. Explanatory variables. Example 1.1: Body mass index and vitamin D status Ph.D. course: Regression models Introduction PKA & LTS Sect. 1.1, 1.2, 1.4 25 April 2013 www.biostat.ku.dk/~pka/regrmodels13 Per Kragh Andersen Regression models The distribution of one outcome variable

More information

Survival Regression Models

Survival Regression Models Survival Regression Models David M. Rocke May 18, 2017 David M. Rocke Survival Regression Models May 18, 2017 1 / 32 Background on the Proportional Hazards Model The exponential distribution has constant

More information

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Statistica Sinica 20 (2010), 441-453 GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Antai Wang Georgetown University Medical Center Abstract: In this paper, we propose two tests for parametric models

More information

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction Outline CHL 5225H Advanced Statistical Methods for Clinical Trials: Survival Analysis Prof. Kevin E. Thorpe Defining Survival Data Mathematical Definitions Non-parametric Estimates of Survival Comparing

More information

Lecture 9. Statistics Survival Analysis. Presented February 23, Dan Gillen Department of Statistics University of California, Irvine

Lecture 9. Statistics Survival Analysis. Presented February 23, Dan Gillen Department of Statistics University of California, Irvine Statistics 255 - Survival Analysis Presented February 23, 2016 Dan Gillen Department of Statistics University of California, Irvine 9.1 Survival analysis involves subjects moving through time Hazard may

More information

General Regression Model

General Regression Model Scott S. Emerson, M.D., Ph.D. Department of Biostatistics, University of Washington, Seattle, WA 98195, USA January 5, 2015 Abstract Regression analysis can be viewed as an extension of two sample statistical

More information

Data splitting. INSERM Workshop: Evaluation of predictive models: goodness-of-fit and predictive power #+TITLE:

Data splitting. INSERM Workshop: Evaluation of predictive models: goodness-of-fit and predictive power #+TITLE: #+TITLE: Data splitting INSERM Workshop: Evaluation of predictive models: goodness-of-fit and predictive power #+AUTHOR: Thomas Alexander Gerds #+INSTITUTE: Department of Biostatistics, University of Copenhagen

More information

Extensions of Cox Model for Non-Proportional Hazards Purpose

Extensions of Cox Model for Non-Proportional Hazards Purpose PhUSE Annual Conference 2013 Paper SP07 Extensions of Cox Model for Non-Proportional Hazards Purpose Author: Jadwiga Borucka PAREXEL, Warsaw, Poland Brussels 13 th - 16 th October 2013 Presentation Plan

More information

Classification. Classification is similar to regression in that the goal is to use covariates to predict on outcome.

Classification. Classification is similar to regression in that the goal is to use covariates to predict on outcome. Classification Classification is similar to regression in that the goal is to use covariates to predict on outcome. We still have a vector of covariates X. However, the response is binary (or a few classes),

More information

Survival Prediction Under Dependent Censoring: A Copula-based Approach

Survival Prediction Under Dependent Censoring: A Copula-based Approach Survival Prediction Under Dependent Censoring: A Copula-based Approach Yi-Hau Chen Institute of Statistical Science, Academia Sinica 2013 AMMS, National Sun Yat-Sen University December 7 2013 Joint work

More information

Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs. Christopher Jennison

Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs. Christopher Jennison Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs Christopher Jennison Department of Mathematical Sciences, University of Bath http://people.bath.ac.uk/mascj

More information

Frailty Modeling for clustered survival data: a simulation study

Frailty Modeling for clustered survival data: a simulation study Frailty Modeling for clustered survival data: a simulation study IAA Oslo 2015 Souad ROMDHANE LaREMFiQ - IHEC University of Sousse (Tunisia) souad_romdhane@yahoo.fr Lotfi BELKACEM LaREMFiQ - IHEC University

More information

Likelihood Construction, Inference for Parametric Survival Distributions

Likelihood Construction, Inference for Parametric Survival Distributions Week 1 Likelihood Construction, Inference for Parametric Survival Distributions In this section we obtain the likelihood function for noninformatively rightcensored survival data and indicate how to make

More information

Section IX. Introduction to Logistic Regression for binary outcomes. Poisson regression

Section IX. Introduction to Logistic Regression for binary outcomes. Poisson regression Section IX Introduction to Logistic Regression for binary outcomes Poisson regression 0 Sec 9 - Logistic regression In linear regression, we studied models where Y is a continuous variable. What about

More information

Analysis of competing risks data and simulation of data following predened subdistribution hazards

Analysis of competing risks data and simulation of data following predened subdistribution hazards Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013

More information

Package Rsurrogate. October 20, 2016

Package Rsurrogate. October 20, 2016 Type Package Package Rsurrogate October 20, 2016 Title Robust Estimation of the Proportion of Treatment Effect Explained by Surrogate Marker Information Version 2.0 Date 2016-10-19 Author Layla Parast

More information

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What? You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?) I m not goin stop (What?) I m goin work harder (What?) Sir David

More information

Ph.D. course: Regression models

Ph.D. course: Regression models Ph.D. course: Regression models Non-linear effect of a quantitative covariate PKA & LTS Sect. 4.2.1, 4.2.2 8 May 2017 www.biostat.ku.dk/~pka/regrmodels17 Per Kragh Andersen 1 Linear effects We have studied

More information