arm, we estimate the time to initial suppression using the nonparametric Kaplan Meier

Similar documents
Multiple imputation to account for measurement error in marginal structural models

Survival Analysis with Time- Dependent Covariates: A Practical Example. October 28, 2016 SAS Health Users Group Maria Eberg

Advantages of Mixed-effects Regression Models (MRM; aka multilevel, hierarchical linear, linear mixed models) 1. MRM explicitly models individual

Extensions of Cox Model for Non-Proportional Hazards Purpose

Extensions of Cox Model for Non-Proportional Hazards Purpose

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Let us use the term failure time to indicate the time of the event of interest in either a survival analysis or reliability analysis.

A Survival Analysis of GMO vs Non-GMO Corn Hybrid Persistence Using Simulated Time Dependent Covariates in SAS

Nonparametric Model Construction

Analysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time

Survival Regression Models

9 Estimating the Underlying Survival Distribution for a

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

MAS3301 / MAS8311 Biostatistics Part II: Survival

Estimation for Modified Data

Missing Data in Longitudinal Studies: Mixed-effects Pattern-Mixture and Selection Models

USING THE SAS SYSTEM TO DEVELOP RISK PREDICTION MODELS FOR PATIENT RE-ADMISSION REDUCTION IN VAMCS. Issac Shams

STAT Sample Problem: General Asymptotic Results

Philosophy and Features of the mstate package

Testing Indirect Effects for Lower Level Mediation Models in SAS PROC MIXED

Multi-state models: prediction

Chapter 4 Regression Models

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Exercises. (a) Prove that m(t) =

Survival Analysis. Stat 526. April 13, 2018

Multivariate Survival Analysis

β j = coefficient of x j in the model; β = ( β1, β2,

4. Comparison of Two (K) Samples

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

Assessing the effect of a partly unobserved, exogenous, binary time-dependent covariate on -APPENDIX-

PRINCIPAL COMPONENTS ANALYSIS (PCA)

Package threg. August 10, 2015

UNIVERSITY OF CALIFORNIA, SAN DIEGO

Kaplan-Meier in SAS. filename foo url " data small; infile foo firstobs=2; input time censor; run;

Product-limit estimators of the survival function with left or right censored data

Mixed Models Lecture Notes By Dr. Hanford page 199 More Statistics& SAS Tutorial at

Beyond GLM and likelihood

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

Chapter 4 Fall Notations: t 1 < t 2 < < t D, D unique death times. d j = # deaths at t j = n. Y j = # at risk /alive at t j = n

Point and Interval Estimation II Bios 662

18.465, further revised November 27, 2012 Survival analysis and the Kaplan Meier estimator

Some Relatively Simple Ways to Get Good Answers

Comparison of Two Population Means

A Regression Model For Recurrent Events With Distribution Free Correlation Structure

A SAS Macro to Find the Best Fitted Model for Repeated Measures Using PROC MIXED

Constrained estimation for binary and survival data

for Time-to-event Data Mei-Ling Ting Lee University of Maryland, College Park

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

Statistical Methods III Statistics 212. Problem Set 2 - Answer Key

Adjusting Analyses of Survey Results using a Predicted Probability of Response Rob Gately, INNOVUS: Epidemiology, Waltham, MA

Estimating Causal Effects of Organ Transplantation Treatment Regimes

Mapping Participants to the Closest Medical Center

S The Over-Reliance on the Central Limit Theorem

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

Chapter 7: Hypothesis testing

Logistic regression model for survival time analysis using time-varying coefficients

Time Series Forecasting Methods

Sample size re-estimation in clinical trials. Dealing with those unknowns. Chris Jennison. University of Kyoto, January 2018

SAS Analysis Examples Replication C11

Results of a simulation of modeling and nonparametric methodology for count data (from patient diary) in drug studies

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Power and Sample Size Bios 662

Statistics 262: Intermediate Biostatistics Regression & Survival Analysis

Survival Analysis. 732G34 Statistisk analys av komplexa data. Krzysztof Bartoszek

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL

Introduction to Statistical Analysis

Lecture 4 - Survival Models

Lecture 9. Statistics Survival Analysis. Presented February 23, Dan Gillen Department of Statistics University of California, Irvine

Models for Ordinal Response Data

Summarizing and Displaying Measurement Data/Understanding and Comparing Distributions

^l,2-, 3. f 2_ ' >v., 4. b/-e. of- fee. -^3-5- I70(fa = )&*>$)

Power and Sample Size Calculations with the Additive Hazards Model

Statistical Methods for Alzheimer s Disease Studies

Survival analysis in R

PASS Sample Size Software. Poisson Regression

Machine Learning. Module 3-4: Regression and Survival Analysis Day 2, Asst. Prof. Dr. Santitham Prom-on

Model Building Chap 5 p251

Pattern Structures for Risk Group Identification

Analysis of Survival Data Using Cox Model (Continuous Type)

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen

Interpolation {x,y} Data with Suavity. Peter K. Ott Forest Analysis & Inventory Branch BC Ministry of FLNRO Victoria, BC

3003 Cure. F. P. Treasure

MAS361. MAS361 1 Turn Over SCHOOL OF MATHEMATICS AND STATISTICS. Medical Statistics

Supply Forecasting via Survival Curve Estimation

Analysis of Incomplete Non-Normal Longitudinal Lipid Data

Statistical Inference and Methods

Survival models and health sequences

Multimodal Deep Learning for Predicting Survival from Breast Cancer

1 The problem of survival analysis

Part III Measures of Classification Accuracy for the Prediction of Survival Times

6 Single Sample Methods for a Location Parameter

Model Selection Procedures

SAS/STAT 13.1 User s Guide. The PHREG Procedure

STA 303H1F: Two-way Analysis of Variance Practice Problems

ST745: Survival Analysis: Cox-PH!

Description Syntax for predict Menu for predict Options for predict Remarks and examples Methods and formulas References Also see

Paired comparisons. We assume that

Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics

Transcription:

Appendix 1: Technical details Let patients be indexed from 1 to 761 and index the treatment arms. is the time from randomization to first viral suppression, is the time from randomization to viral rebound, and is the time from randomization to censoring at loss to follow up, day 1012, or death. Define min, and min,. is an indicator that and is an indicator that. is the number of patients in arm. For each arm, we estimate the time to initial suppression using the nonparametric Kaplan Meier product limit estimator of the survivor function:, 1, /,, where indexes the distinct suppression event times,, is the number of suppression events at time,, and, is the number of patients at risk of initial suppression at time,. Similarly, we estimate the time from randomization to viral rebound as, 1, /,, where indexes the distinct rebound event times,, is the number of viral rebound events at time, and, is the number of patients at risk for viral rebound at time,. The probability that a patient with exposure has a suppressed viral load at time is the difference of these two curves:. Rather than choose a time point at which to estimate this probability, we can summarize this probability over followup interval [0, ] as the mean time spent in a state of suppression, estimated as. The mean time in a state of suppression can be compared between exposure groups as d, which can be calculated as the Riemann sum Δ. 1

where The variance of DS can be estimated as 1, :. 1 1, Where is an indicator for individual of being at risk of suppression at time u, : and is an indicator of being at risk of rebound at time u, :. 2

Appendix 2: SAS code ***************************************************************************** Code for viral suppression manuscript illustrating approach outlined by Gouskova 2014 start with dataset d with the following variables: t1: time from randomization to suppression d1: indicator of suppression at time t1 t2: time from randomization to rebound d2: indicator of rebound at t2 x: exposure indicator ****************************************************************************; *Estimate S(Suppression); proc phreg data=d noprint; model t1*d1(0)=; strata x; baseline out=sup(keep= x t1 s1) survival=s1; data sup; set sup; t=t1; *Estimate S(Rebound); proc phreg data=d noprint; model t2*d2(0)=; strata x; baseline out=reb(keep= x t2 s2) survival=s2; data reb; set reb; t=t2; *Make G(t) = S(Reb,t)-S(Sup,t); proc sort data=sup; by x t; proc sort data=reb; by x t; data times; do x=0 to 1; do t=0 to 1012; end; end; data g; merge sup reb times; by x t; keep x t s1 s2; data g; set g; by x t; retain holdr1 holdr2; if first.x then do; holdr1=0; holdr2=0; end; if s1=. then s1=holdr1; if s2=. then s2=holdr2; delta=s2-s1; holdr1=s1; holdr2=s2; data three; set g; if x=0; delta0=delta; keep t delta0; data four; set g; if x=1; delta1=delta; keep t delta1; data deltag; merge three four; by t; retain sum0 sum1 0; if first.b then do; sum0=0; sum1=0; end; sum0=sum0+delta0; sum1=sum1+delta1; if t=0 then do; g0=0; g1=0; end; if t>0 then do; g0=(sum0/t)*1012; g1=(sum1/t)*1012;end; delta=g1-g0; if t=1012 then proc print data=deltag noobs; var g0 g1 delta; title "Mean time in suppression and difference for x=0 and =1"; *Variance; proc phreg data=d atrisk noprint; model t1*d1(0)=; strata x; 3

output out=ars(keep=x t1 ys) num_left=ys ; baseline out=sup(keep=x t1 s) survival=s; proc sort data=ars; by x t1; data sup; set sup; by x t1; if last.t1;* get rid of duplicate record at t=0; data ars; set ars; by x t1; if first.t1; data shell; do x=0 to 1; do t1=1 to 1012;t2=t1; end; end; data sup2; merge sup ars; by x t1; retain s2; if s ne. then s2=s; else s2=s2; drop s; rename s2=s; data sup2; set sup2; by x t1; retain chm1 0 sm1 1 tm1 0 nsu; if first.x then do; nsu=0; chm1=0; sm1=1; end; if s>0 then ch=-log(s); nsu=round((ch-chm1)*ys,1);*above recreates y because SAS does not sst=s; t=t1; chm1=ch; sm1=s; keep x t nsu ys sst ; proc phreg data=d atrisk noprint; model t2*d2(0)=; strata x; output out=arr(keep=x t2 yr) num_left=yr ; baseline out=reb(keep=x t2 s) survival=s; proc sort data=arr; by x t2; data reb; set reb; by x t2; if last.t2;* get rid of duplicate record at t=0; data arr; set arr; by x t1; if first.t2; data shell; do x=0 to 1; do t1=1 to 1012;t2=t1; t=t1; end; end; data reb2; merge reb arr; by x t2; retain s2; if s ne. then s2=s; else s2=s2; drop s; rename s2=s; data reb2; set reb2; by x t2; retain chm1 0 sm1 1 tm1 0 nru; if first.x then do; nru=0; chm1=0; sm1=1; end; if s>0 then ch=-log(s); nru=round((ch-chm1)*yr,1);* recreates nru because SAS does not ssr=s; t=t2; chm1=ch; sm1=s; keep x t ssr nru yr ; data ltp4; merge reb2 sup2 shell; by x t; retain yr2 nru2 ys2 nsu2 ssr2 sst2; if yr ne. then yr2=yr; else yr2=yr2; 4

if nru ne. then nru2=nru; else nru2=0; if ys ne. then ys2=ys; else ys2=ys2; if nsu ne. then nsu2=nsu; else nsu2=0; if sst ne. then sst2=sst; else sst2=sst2; if ssr ne. then ssr2=ssr; else ssr2=ssr2; if last.t then drop yr nru ys nsu sst ssr t1 t2; rename yr2=yr nru2=nru ys2=ys nsu2=nsu ssr2=ssr sst2=sst; data e; set d; do t=0 to 1012; end; proc sort data=e; by x t; data lte; merge e(in=ine) ltp4 ; by x t ; if ine; proc sort data=lte; by x id t; data lte; set lte; if t<=t1 then ysui=1; else ysui=0; if t<=t2 then yrui=1; else yrui=0; if t1=t and d1=1 then nsi=1; else nsi=0; if t2=t and d2=1 then nri=1; else nri=0; if t<=1012 then *count number in each arm; proc sql; create table counts as select x, count(distinct(id)) as nx from lte group by x;quit; data lte2; merge counts lte; by x; proc sort data=lte2; by x id; *get components of Pi(t); data lte2; set lte2; by x id; q1u=(1/ys)*nsi; q2u=(ysui/ys**2)*nsu; q3u=(1/yr)*nri; q4u=(yrui/yr**2)*nru; *sum components of Pi(t) over time (until time t) and calculate Pi(t); data lte3; set lte2; by x id; retain q1t q2t q3t q4t; if first.id then do; q1t=0; q2t=0; q3t=0; q4t=0; end; q1t=q1t+q1u; q2t=q2t+q2u; q3t=q3t+q3u; q4t=q4t+q4u; pit=nx*sst*(q1t-q2t)-(nx*ssr*(q3t-q4t)); data lte4; set lte3; by x id; retain q6i; if first.id then q6i=0; q6i=pit+q6i; if t>0 then p=(q6i); if last.id then data lte5; set lte4; by x; retain q8; if first.x then q8=0; q7=p**2; q8=q7+q8; 5

if last.x then data lte6; set lte5 ; by x; retain v0 n0; if x=0 then do; v0=(1/nx**2)*q8; n0=nx;end; if x=1 then do; v1=(1/nx**2)*q8; n1=nx;end; if x=1 then keep v0 v1 n0 n1; data lte7; set lte6; se1=sqrt(v1); se0=sqrt(v0); sediff=sqrt(v0+v1); ods listing; *get estimate and 95% CI; data est; merge reb2 sup2 shell ;by x t; data est; set est;by x; retain sr st;if first.x then do; sr=1; st=1; end; if ssr ne. then sr=ssr; if sst ne. then st=sst; data estx1; set est; if x=1 ; gt1=sr-st; data estx0; set est; if x=0 ; gt0=sr-st; data estx; merge estx1 estx0 end=eof; by t; if last.t; retain sum0 0 sum1 0 g0 g1; sum0=sum(gt0,sum0); sum1=sum(gt1,sum1); if t=0 then do; g0=0; g1=0; end; else do; g0=sum0/(t)*1012; g1=sum1/(t)*1012; end; if eof then data estx; merge estx(keep=g0 g1) lte7(keep=sediff se1 se0); gtdiff=g1-g0; lcl=gtdiff-1.96*sediff; ucl=gtdiff+1.96*sediff; proc print data=estx; var g1 g0 gtdiff lcl ucl; title "Mean days suppressed for 3- and 4-drug arms with different and 95% CI"; 6