Double-Sampling Designs for Dropouts
|
|
- Spencer Fowler
- 5 years ago
- Views:
Transcription
1 Double-Sampling Designs for Dropouts Dave Glidden Division of Biostatistics, UCSF 26 February
2 Outline Mbarra s Dropouts Future Directions Inverse Prob. Weights/Horvitz Thompson Double Sampling for Dropouts An IPW Estimator and Variance
3 PEPFAR Massive scale-up of ARV treatment Outcome results essential Required large-scale follow-up 1.3 Million initiated ARV from
4 Mbarara ISS Cohort of 3,340 HIV+ infected individuals Started on ARV from 1/04 to 7/07 Followed through 7/1/07 56 died 2,530 alive as of 7/1/ lost to FU prior to 7/1/07
5 Multiple Events Three possible events Death Dropout Administrative Censoring Competing risks
6 Calendar Time Scale Patient 1 E D 2 E 3 E L A 4 E A Date
7 Follow Up Time Scale Patient D L Administrative Limit Time Since Initiation
8 Notation C i: time to administrative censoring always known L i: time to dropout censored by Ci and Ti T i: time to death censored by Ci and Li Xi: min(ti,ci), i = I(Ti Ci) data in absence of dropouts Robs =0 dropout, R obs =1 non-dropout
9 Administrative Censoring Patients can only dropout if Li < min(ti, Ci) Patients can only die if Di < min(li, Ci) Some people may have dropout later...
10 Administrative Censoring (T,C) are independent very standard assumption violated if demographics change over time Can be relaxed to (T,C) independent given a series of covariates Conditional on Robs, (Ti,Ci) NOT indep example of collider stratification creates dependent censoring
11 Dependent Dropout (T,L) are likely correlated Hastens death Not easily handled Dropout suggest ARV discontinuation What about observing after dropout how about sampling?
12 Sampling Plan 3,340 HIV+ initiated ART Robs =1: n1=2,625 2,569 Alive and in FU as of 7/1/09 56 Died in Follow-Up Robs=0: n 0=715, ñ0 =79 95 sought, 79 vital status ascertained
13 Advantages of Sampling Very flexible Sampling prob can vary by individual using ancillary data Valid framework for dependent censoring in a way that is model-independent no need to specify cor(t,l) will get information on this
14 Horvitz Thompson Have finite population of size n Want to estimate µ = n 1 n i=1 x i ξi=1 indicated if ith person sampled Sample with probability E(ξi=1)=πi HT estimator ˆµ = n 1 n i=1 ξ i π i x i
15 Variance var{ n( µ ˆµ)} n n = n 1 πij π i π j π ij π i π j i=1 j=1 ξ i ξ j x i x j πij is the probability that i and j are selected termed the second-order inclusion probability
16 Simple Sample Total population is n Sample with equal probability πi = ñ/n Sample with replacement chose ñ from n (putting balls back in jar) Sample with quota chose exactly ñ different from n Sample without quota chose ith person with probability ñ/n Same estimate, different variances!!
17 Second-Order Weights under equal probability schemes πii πij w/ replace (ñ/n) 2 (ñ/n) 2 quota ñ/n ñ(ñ-1)/n(n-1) no quota ñ/n (ñ/n) 2
18 Variance for Finite Population under quota sampling var{ˆµ} = (π 1 1)σ 2 n n: size of total popn (1 π)σ2 var{ˆµ} = ñ For the standard sample mean, π is effectively 0 var{ x} = σ2 ñ ñ: size of sampled popn
19 Double Sampling group of i=1,...,n subjects sampled from a population (first level of sampling) some additional data avail. (z1,...,zn) second level of data richer data collection on subset sampling determined by (z1,...,zn) pr(ξi=1 zi)=πi Neyman, 1938
20 Example: Two-Stage Case Control Study Bladder cancer case/control study Exposure: metal working fluids (MWF) Phase 1: Data collected including Q: Have you ever worked in metal industry? Phase II: Detailed work history/exposure collected Metal workers oversampled in Phase II
21 Other Examples Two-phase genotyping studies Nested case-control studies Case-cohort studies With significant cost savings All valid, comparatively efft designs
22 Back to Mbarara
23 Double Sample for Dropouts Robs =1: observed death/admin censoring data already completely observed sampling fraction 1.00 Robs =0: observed dropouts n 0 dropouts sample ñ 0 for vital ascertainment sampling fraction ñ0/n0 = π
24 Observed Data (Xi, i) if ξi=1 ignored otherwise ξ=i if R obs =1 or R obs =0 & sampled pr(sampled R obs =0) = π Data is MAR conditional on Robs
25 Frangakis and Rubin Double sampling estimate of survivor function Ignore data if ξ=0 if dropout & not dble sampled, drop data Constructed survivor estimate Showed that (T,C) not independent conditional on the value of R obs Estimate hazard and transform to survival
26 Frangakis Rubin Estimator consistent with a Gaussian limiting distribution
27 Their representation Cumulative hazard is weighted sum Of crude hazards in 2 groups Weight varies with time Not an intuitive representation why does it work?
28 Double-Sampling Λ(t) Λ(t) True cumulative hazard function Nelson-Aalen estimator if complete data avail on cohort ˆΛ(t) The FR estimator based on dble-sampled data
29 More Notation N(t)=I(X t, Δ=1) Y(t) = I(X t) H 1 : set of people with R obs =1 has size n1, ñ1 sampled (n1 = ñ1) i in H1 πi= 1 H 0 : set of people with R obs =0 has size n0, ñ0 sampled i in H0: πi= ñ0/n0
30 Complete Data on Cohort N g (t) =n 1 g Ȳ g (t) =n 1 g N(t) = g N i (t) i H g Y i (t) i H g n g n N g (t) Ȳ (t) = g n g n Ȳg(t)
31 Other Representation Λ(t) = t 0 N (du) Y(u) as n N(t) N (t) Ȳ (t) Y(t) Λ(t) = t 0 N(du) Ȳ (u)
32 Horvitz Thompson ˆN g (t) =ñ 1 g Ŷ g (t) =ñ 1 g i H g ˆN(t) = g i H g ξ i π i N i (t) ξ i π i Y i (t) n g n ˆN g (t) Ŷ (t) = g n g n Ŷg(t)
33 Approximations ˆN 1 (t) = N 1 (t) Ŷ 1 (t) =Ȳ1(t) ˆN 0 (t) N 0 (t) Ŷ 0 (t) Ȳ0(t)
34 A Natural Estimator ˆΛ(t) = t 0 ˆN(du) Ŷ (u) identical to FR estimator and to ˆΛ(t) = t 0 i i ξ i and has a IPW representation N π i i (du) ξ i Y π i i (u)
35 But what about variance?
36 FR Variance
37 Two-Part Variance n{ˆλ(t) Λ(t)} = n{ˆλ(t) Λ(t)} + n{ Λ(t) Λ(t)} last two terms are independent σ 2 (t) = var{ˆλ(t) Λ(t)} σ1(t) 2 = var{ˆλ(t) Λ(t)} σ2(t) 2 = var{ Λ(t) Λ(t)} σ 2 (t) =σ1(t)+σ 2 2(t) 2
38 Variance Decomposition σ 2 (t) : Total Variance σ 2 2(t) : Variance if full cohort observed Usual variance for Nelson-Aalen estimate Easily estimated σ 2 1(t) : Variance due to double sampling HT type variance Estimation more complicated
39 Nelson-Aalen Variance σ 2 2(t) = ˆσ 2 2(t) = t 0 t 0 = n 0 n = t 0 Λ(du) Y(u) ˆΛ(du) Ŷ (u) t 0 ˆN 0 (du) Ŷ 2 + n 1 (u) n ˆN(du) Ŷ 2 (u) t 0 ˆN 1 (du) Ŷ 2 (u)
40 Double-Sample Variance looong HT-based variance arguments lead to ˆσ 2 1(t) = (π 1 1) n 0 = n 1 n t i:r obs 0 =0 t 0 ˆN 0 (du) Ŷ 2 (u) ξ i (π 1 i π i 1)N i (du) Ỹ 2 (u). = n 1 n i=1 t 0 ξ i (π 1 i π i 1)N i (du) Ŷ 2 (u)
41 The total variance ˆσ 2 (t) =ˆσ 2 1(t)+ˆσ 2 2(t) = n 1 n n 1 i=1 n i=1 = n 1 n i=1 t 0 t 0 t 0 ξ i π i N i (du) Ŷ 2 (u) + ξ i (π 1 i π i ξ i π 1 i π i 1)N i (du) Ŷ 2 (u) N i (du) Ŷ 2 (u)
42 Source of Variance R obs =0 R obs =1 ˆσ 2 1(t) (π 1 1) n 0 n t 0 ˆN 0 (du) Ŷ 2 (u) 0 ˆσ 2 2(t) n 0 n t 0 ˆN 0 (du) Ŷ 2 (u) n 1 n t 0 ˆN 1 (du) Ŷ 2 (u) ˆσ 2 (t) π 1 n 0 n t 0 ˆN 0 (du) Ŷ 2 (u) n 1 n t 0 ˆN 1 (du) Ŷ 2 (u)
43 Some observations Robs =1: no contribution to double-sample variance Robs =0: ratio of NA variance to FR variance = 1/π = n0/ñ0 just the ratio of sample size in Robs=0 between DS data and full data Intuitive look at the variance
44 Variance Estimate Easily computed Demystifies the form Facilitates sample size calculations look at effect of various sample fractions Performs great in simulations
45 Data Example Cohort of 3,340 HIV+ infected individuals Robs =1: n1=2,625 (56 died) Robs=0: n 0=715, ñ0 =79 (26 died) π = 1/9.18 = Rate of death is about 16 times higher in dropouts compared to non-dropouts
46 FR and Naive Survival Proportion surviving Naïve estimate Adjusted estimate Days since initiation of ART
47 Relative Efficiency Trade-off between sampling fractions What is efficiency of sampling ρ dropouts compared to all dropouts Can be consistently estimated Based on Mbarara data
48 Relative Efficiency Relative Efficiency Compared to Complete Cohort % 30% 50% 70% 90% Percentage of Dropouts Sampled
49 Future Directions Apply insights from survey statistics formulae and approximations post-stratification, calibration auxiliary variables => more efficiency Look at using non-sample dropout data
50 Acknowledgement Jeff Martin Elvin Geng Constantin Yiannoutsos Mbarara ISS clinic East Africa Region of IeDEA Chuck McCulloch
Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis
Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve
More informationPart III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data
1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions
More informationYou know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?
You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?) I m not goin stop (What?) I m goin work harder (What?) Sir David
More informationST745: Survival Analysis: Nonparametric methods
ST745: Survival Analysis: Nonparametric methods Eric B. Laber Department of Statistics, North Carolina State University February 5, 2015 The KM estimator is used ubiquitously in medical studies to estimate
More informationMulti-state models: prediction
Department of Medical Statistics and Bioinformatics Leiden University Medical Center Course on advanced survival analysis, Copenhagen Outline Prediction Theory Aalen-Johansen Computational aspects Applications
More informationPerson-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data
Person-Time Data CF Jeff Lin, MD., PhD. Incidence 1. Cumulative incidence (incidence proportion) 2. Incidence density (incidence rate) December 14, 2005 c Jeff Lin, MD., PhD. c Jeff Lin, MD., PhD. Person-Time
More informationMAS3301 / MAS8311 Biostatistics Part II: Survival
MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the
More informationMultistate models and recurrent event models
Multistate models Multistate models and recurrent event models Patrick Breheny December 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/22 Introduction Multistate models In this final lecture,
More informationCensoring and Truncation - Highlighting the Differences
Censoring and Truncation - Highlighting the Differences Micha Mandel The Hebrew University of Jerusalem, Jerusalem, Israel, 91905 July 9, 2007 Micha Mandel is a Lecturer, Department of Statistics, The
More informationPart IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation
Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Patrick J. Heagerty PhD Department of Biostatistics University of Washington 166 ISCB 2010 Session Four Outline Examples
More informationAnalysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time
Analysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term
More informationLecture 7. Proportional Hazards Model - Handling Ties and Survival Estimation Statistics Survival Analysis. Presented February 4, 2016
Proportional Hazards Model - Handling Ties and Survival Estimation Statistics 255 - Survival Analysis Presented February 4, 2016 likelihood - Discrete Dan Gillen Department of Statistics University of
More informationSurvival Analysis. Lu Tian and Richard Olshen Stanford University
1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival
More informationEstimation for Modified Data
Definition. Estimation for Modified Data 1. Empirical distribution for complete individual data (section 11.) An observation X is truncated from below ( left truncated) at d if when it is at or below d
More informationMulti-state Models: An Overview
Multi-state Models: An Overview Andrew Titman Lancaster University 14 April 2016 Overview Introduction to multi-state modelling Examples of applications Continuously observed processes Intermittently observed
More informationJoint Modeling of Longitudinal Item Response Data and Survival
Joint Modeling of Longitudinal Item Response Data and Survival Jean-Paul Fox University of Twente Department of Research Methodology, Measurement and Data Analysis Faculty of Behavioural Sciences Enschede,
More informationExercises. (a) Prove that m(t) =
Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for
More informationMultistate models and recurrent event models
and recurrent event models Patrick Breheny December 6 Patrick Breheny University of Iowa Survival Data Analysis (BIOS:7210) 1 / 22 Introduction In this final lecture, we will briefly look at two other
More information11 November 2011 Department of Biostatistics, University of Copengen. 9:15 10:00 Recap of case-control studies. Frequency-matched studies.
Matched and nested case-control studies Bendix Carstensen Steno Diabetes Center, Gentofte, Denmark http://staff.pubhealth.ku.dk/~bxc/ Department of Biostatistics, University of Copengen 11 November 2011
More informationCox s proportional hazards model and Cox s partial likelihood
Cox s proportional hazards model and Cox s partial likelihood Rasmus Waagepetersen October 12, 2018 1 / 27 Non-parametric vs. parametric Suppose we want to estimate unknown function, e.g. survival function.
More informationA Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks
A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks Y. Xu, D. Scharfstein, P. Mueller, M. Daniels Johns Hopkins, Johns Hopkins, UT-Austin, UF JSM 2018, Vancouver 1 What are semi-competing
More informationNonparametric two-sample tests of longitudinal data in the presence of a terminal event
Nonparametric two-sample tests of longitudinal data in the presence of a terminal event Jinheum Kim 1, Yang-Jin Kim, 2 & Chung Mo Nam 3 1 Department of Applied Statistics, University of Suwon, 2 Department
More informationNonparametric Model Construction
Nonparametric Model Construction Chapters 4 and 12 Stat 477 - Loss Models Chapters 4 and 12 (Stat 477) Nonparametric Model Construction Brian Hartman - BYU 1 / 28 Types of data Types of data For non-life
More informationLecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL
Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL The Cox PH model: λ(t Z) = λ 0 (t) exp(β Z). How do we estimate the survival probability, S z (t) = S(t Z) = P (T > t Z), for an individual with covariates
More informationChapter 4 Fall Notations: t 1 < t 2 < < t D, D unique death times. d j = # deaths at t j = n. Y j = # at risk /alive at t j = n
Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 4 Fall 2012 4.2 Estimators of the survival and cumulative hazard functions for RC data Suppose X is a continuous random failure time with
More informationSample-weighted semiparametric estimates of cause-specific cumulative incidence using left-/interval censored data from electronic health records
1 / 22 Sample-weighted semiparametric estimates of cause-specific cumulative incidence using left-/interval censored data from electronic health records Noorie Hyun, Hormuzd A. Katki, Barry I. Graubard
More informationCase-control studies
Matched and nested case-control studies Bendix Carstensen Steno Diabetes Center, Gentofte, Denmark b@bxc.dk http://bendixcarstensen.com Department of Biostatistics, University of Copenhagen, 8 November
More informationCTDL-Positive Stable Frailty Model
CTDL-Positive Stable Frailty Model M. Blagojevic 1, G. MacKenzie 2 1 Department of Mathematics, Keele University, Staffordshire ST5 5BG,UK and 2 Centre of Biostatistics, University of Limerick, Ireland
More informationMAS3301 / MAS8311 Biostatistics Part II: Survival
MAS330 / MAS83 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-0 8 Parametric models 8. Introduction In the last few sections (the KM
More informationSTAT331. Combining Martingales, Stochastic Integrals, and Applications to Logrank Test & Cox s Model
STAT331 Combining Martingales, Stochastic Integrals, and Applications to Logrank Test & Cox s Model Because of Theorem 2.5.1 in Fleming and Harrington, see Unit 11: For counting process martingales with
More informationPower and Sample Size Calculations with the Additive Hazards Model
Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine
More informationPhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t)
PhD course in Advanced survival analysis. (ABGK, sect. V.1.1) One-sample tests. Counting process N(t) Non-parametric hypothesis tests. Parametric models. Intensity process λ(t) = α(t)y (t) satisfying Aalen
More informationLongitudinal + Reliability = Joint Modeling
Longitudinal + Reliability = Joint Modeling Carles Serrat Institute of Statistics and Mathematics Applied to Building CYTED-HAROSA International Workshop November 21-22, 2013 Barcelona Mainly from Rizopoulos,
More informationPhilosophy and Features of the mstate package
Introduction Mathematical theory Practice Discussion Philosophy and Features of the mstate package Liesbeth de Wreede, Hein Putter Department of Medical Statistics and Bioinformatics Leiden University
More informationSession 9: Introduction to Sieve Analysis of Pathogen Sequences, for Assessing How VE Depends on Pathogen Genomics Part I
Session 9: Introduction to Sieve Analysis of Pathogen Sequences, for Assessing How VE Depends on Pathogen Genomics Part I Peter B Gilbert Vaccine and Infectious Disease Division, Fred Hutchinson Cancer
More informationPackage Rsurrogate. October 20, 2016
Type Package Package Rsurrogate October 20, 2016 Title Robust Estimation of the Proportion of Treatment Effect Explained by Surrogate Marker Information Version 2.0 Date 2016-10-19 Author Layla Parast
More informationSurvival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University
Survival Analysis: Weeks 2-3 Lu Tian and Richard Olshen Stanford University 2 Kaplan-Meier(KM) Estimator Nonparametric estimation of the survival function S(t) = pr(t > t) The nonparametric estimation
More information3003 Cure. F. P. Treasure
3003 Cure F. P. reasure November 8, 2000 Peter reasure / November 8, 2000/ Cure / 3003 1 Cure A Simple Cure Model he Concept of Cure A cure model is a survival model where a fraction of the population
More informationTHESIS for the degree of MASTER OF SCIENCE. Modelling and Data Analysis
PROPERTIES OF ESTIMATORS FOR RELATIVE RISKS FROM NESTED CASE-CONTROL STUDIES WITH MULTIPLE OUTCOMES (COMPETING RISKS) by NATHALIE C. STØER THESIS for the degree of MASTER OF SCIENCE Modelling and Data
More informationAnalysis of Time-to-Event Data: Chapter 6 - Regression diagnostics
Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/25 Residuals for the
More informationMarginal and Conditional Distribution Estimation from Double-Sampled Semi-Competing Risks Data
arginal and Conditional Distribution Estimation from Double-Sampled Semi-Competing Risks Data enggang u 1, and Constantin T. iannoutsos 1 Department of Biostatistics & edical Informatics K6/446 CSC 600
More informationSurvival Distributions, Hazard Functions, Cumulative Hazards
BIO 244: Unit 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution
More informationSTAT Sample Problem: General Asymptotic Results
STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative
More informationRight-truncated data. STAT474/STAT574 February 7, / 44
Right-truncated data For this data, only individuals for whom the event has occurred by a given date are included in the study. Right truncation can occur in infectious disease studies. Let T i denote
More informationSimple techniques for comparing survival functions with interval-censored data
Simple techniques for comparing survival functions with interval-censored data Jinheum Kim, joint with Chung Mo Nam jinhkim@suwon.ac.kr Department of Applied Statistics University of Suwon Comparing survival
More informationSTAT331. Cox s Proportional Hazards Model
STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations
More information9. Estimating Survival Distribution for a PH Model
9. Estimating Survival Distribution for a PH Model Objective: Another Goal of the COX model Estimating the survival distribution for individuals with a certain combination of covariates. PH model assumption:
More informationSurvival Regression Models
Survival Regression Models David M. Rocke May 18, 2017 David M. Rocke Survival Regression Models May 18, 2017 1 / 32 Background on the Proportional Hazards Model The exponential distribution has constant
More information8. Parametric models in survival analysis General accelerated failure time models for parametric regression
8. Parametric models in survival analysis 8.1. General accelerated failure time models for parametric regression The accelerated failure time model Let T be the time to event and x be a vector of covariates.
More informationModelling and Analysis of Recurrent Event Data
Modelling and Analysis of Recurrent Event Data Edsel A. Peña Department of Statistics University of South Carolina Research support from NIH, NSF, and USC/MUSC Collaborative Grants Joint work with Prof.
More informationLecture 3. Truncation, length-bias and prevalence sampling
Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in
More informationAnalysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington
Analysis of Longitudinal Data Patrick J Heagerty PhD Department of Biostatistics University of Washington Auckland 8 Session One Outline Examples of longitudinal data Scientific motivation Opportunities
More informationOptimal Blocking by Minimizing the Maximum Within-Block Distance
Optimal Blocking by Minimizing the Maximum Within-Block Distance Michael J. Higgins Jasjeet Sekhon Princeton University University of California at Berkeley November 14, 2013 For the Kansas State University
More informationResiduals and model diagnostics
Residuals and model diagnostics Patrick Breheny November 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/42 Introduction Residuals Many assumptions go into regression models, and the Cox proportional
More informationChapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample
Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 7 Fall 2012 Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample H 0 : S(t) = S 0 (t), where S 0 ( ) is known survival function,
More information[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements
[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements Aasthaa Bansal PhD Pharmaceutical Outcomes Research & Policy Program University of Washington 69 Biomarkers
More informationLecture 7 Time-dependent Covariates in Cox Regression
Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the
More informationADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables
ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES Cox s regression analysis Time dependent explanatory variables Henrik Ravn Bandim Health Project, Statens Serum Institut 4 November 2011 1 / 53
More informationAsymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics.
Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics. Dragi Anevski Mathematical Sciences und University November 25, 21 1 Asymptotic distributions for statistical
More informationDynamic Models Part 1
Dynamic Models Part 1 Christopher Taber University of Wisconsin December 5, 2016 Survival analysis This is especially useful for variables of interest measured in lengths of time: Length of life after
More informationChapter 7: Hypothesis testing
Chapter 7: Hypothesis testing Hypothesis testing is typically done based on the cumulative hazard function. Here we ll use the Nelson-Aalen estimate of the cumulative hazard. The survival function is used
More informationSSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that need work
SSUI: Presentation Hints 1 Comparing Marginal and Random Eects (Frailty) Models Terry M. Therneau Mayo Clinic April 1998 SSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that
More informationEstimating Causal Effects of Organ Transplantation Treatment Regimes
Estimating Causal Effects of Organ Transplantation Treatment Regimes David M. Vock, Jeffrey A. Verdoliva Boatman Division of Biostatistics University of Minnesota July 31, 2018 1 / 27 Hot off the Press
More informationof being selected and varying such probability across strata under optimal allocation leads to increased accuracy.
5 Sampling with Unequal Probabilities Simple random sampling and systematic sampling are schemes where every unit in the population has the same chance of being selected We will now consider unequal probability
More informationFAILURE-TIME WITH DELAYED ONSET
REVSTAT Statistical Journal Volume 13 Number 3 November 2015 227 231 FAILURE-TIME WITH DELAYED ONSET Authors: Man Yu Wong Department of Mathematics Hong Kong University of Science and Technology Hong Kong
More informationLecture 7. Poisson and lifetime processes in risk analysis
Lecture 7. Poisson and lifetime processes in risk analysis Jesper Rydén Department of Mathematics, Uppsala University jesper.ryden@math.uu.se Statistical Risk Analysis Spring 2014 Example: Life times of
More informationSTAT331 Lebesgue-Stieltjes Integrals, Martingales, Counting Processes
STAT331 Lebesgue-Stieltjes Integrals, Martingales, Counting Processes This section introduces Lebesgue-Stieltjes integrals, and defines two important stochastic processes: a martingale process and a counting
More informationMISSING or INCOMPLETE DATA
MISSING or INCOMPLETE DATA A (fairly) complete review of basic practice Don McLeish and Cyntha Struthers University of Waterloo Dec 5, 2015 Structure of the Workshop Session 1 Common methods for dealing
More informationFrailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.
Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Melanie M. Wall, Bradley P. Carlin November 24, 2014 Outlines of the talk
More informationβ j = coefficient of x j in the model; β = ( β1, β2,
Regression Modeling of Survival Time Data Why regression models? Groups similar except for the treatment under study use the nonparametric methods discussed earlier. Groups differ in variables (covariates)
More informationfor Time-to-event Data Mei-Ling Ting Lee University of Maryland, College Park
Threshold Regression for Time-to-event Data Mei-Ling Ting Lee University of Maryland, College Park MLTLEE@UMD.EDU Outline The proportional hazards (PH) model is widely used in analyzing time-to-event data.
More informationREGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520
REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520 Department of Statistics North Carolina State University Presented by: Butch Tsiatis, Department of Statistics, NCSU
More informationStatistical Analysis of Spatio-temporal Point Process Data. Peter J Diggle
Statistical Analysis of Spatio-temporal Point Process Data Peter J Diggle Department of Medicine, Lancaster University and Department of Biostatistics, Johns Hopkins University School of Public Health
More informationData Integration for Big Data Analysis for finite population inference
for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation
More informationEEL 5544 Noise in Linear Systems Lecture 30. X (s) = E [ e sx] f X (x)e sx dx. Moments can be found from the Laplace transform as
L30-1 EEL 5544 Noise in Linear Systems Lecture 30 OTHER TRANSFORMS For a continuous, nonnegative RV X, the Laplace transform of X is X (s) = E [ e sx] = 0 f X (x)e sx dx. For a nonnegative RV, the Laplace
More informationOne-stage dose-response meta-analysis
One-stage dose-response meta-analysis Nicola Orsini, Alessio Crippa Biostatistics Team Department of Public Health Sciences Karolinska Institutet http://ki.se/en/phs/biostatistics-team 2017 Nordic and
More informationSurvival Analysis for Case-Cohort Studies
Survival Analysis for ase-ohort Studies Petr Klášterecký Dept. of Probability and Mathematical Statistics, Faculty of Mathematics and Physics, harles University, Prague, zech Republic e-mail: petr.klasterecky@matfyz.cz
More informationSTAT 6385 Survey of Nonparametric Statistics. Order Statistics, EDF and Censoring
STAT 6385 Survey of Nonparametric Statistics Order Statistics, EDF and Censoring Quantile Function A quantile (or a percentile) of a distribution is that value of X such that a specific percentage of the
More informationDescription Syntax for predict Menu for predict Options for predict Remarks and examples Methods and formulas References Also see
Title stata.com stcrreg postestimation Postestimation tools for stcrreg Description Syntax for predict Menu for predict Options for predict Remarks and examples Methods and formulas References Also see
More informationBayesian Analysis for Markers and Degradation
Bayesian Analysis for Markers and Degradation Mei-Ling Ting Lee 12, Maria Shubina 1, and Alan Zaslavsky 13 1 Biostatistics Department, Harvard School of Public Health, Boston, USA 2 Channing Laboratory,
More informationMultistate Modeling and Applications
Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)
More informationEstimating Optimal Dynamic Treatment Regimes from Clustered Data
Estimating Optimal Dynamic Treatment Regimes from Clustered Data Bibhas Chakraborty Department of Biostatistics, Columbia University bc2425@columbia.edu Society for Clinical Trials Annual Meetings Boston,
More informationPackage SimSCRPiecewise
Package SimSCRPiecewise July 27, 2016 Type Package Title 'Simulates Univariate and Semi-Competing Risks Data Given Covariates and Piecewise Exponential Baseline Hazards' Version 0.1.1 Author Andrew G Chapple
More informationMeta-analysis of epidemiological dose-response studies
Meta-analysis of epidemiological dose-response studies Nicola Orsini 2nd Italian Stata Users Group meeting October 10-11, 2005 Institute Environmental Medicine, Karolinska Institutet Rino Bellocco Dept.
More informationPh.D. Qualifying Exam Friday Saturday, January 6 7, 2017
Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a
More informationDefinitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen
Recap of Part 1 Per Kragh Andersen Section of Biostatistics, University of Copenhagen DSBS Course Survival Analysis in Clinical Trials January 2018 1 / 65 Overview Definitions and examples Simple estimation
More informationCRISP: Capture-Recapture Interactive Simulation Package
CRISP: Capture-Recapture Interactive Simulation Package George Volichenko Carnegie Mellon University Pittsburgh, PA gvoliche@andrew.cmu.edu December 17, 2012 Contents 1 Executive Summary 1 2 Introduction
More information9 Estimating the Underlying Survival Distribution for a
9 Estimating the Underlying Survival Distribution for a Proportional Hazards Model So far the focus has been on the regression parameters in the proportional hazards model. These parameters describe the
More informationSpatio-Temporal Threshold Models for Relating UV Exposures and Skin Cancer in the Central United States
Spatio-Temporal Threshold Models for Relating UV Exposures and Skin Cancer in the Central United States Laura A. Hatfield and Bradley P. Carlin Division of Biostatistics School of Public Health University
More informationOn Measurement Error Problems with Predictors Derived from Stationary Stochastic Processes and Application to Cocaine Dependence Treatment Data
On Measurement Error Problems with Predictors Derived from Stationary Stochastic Processes and Application to Cocaine Dependence Treatment Data Yehua Li Department of Statistics University of Georgia Yongtao
More informationHacettepe Journal of Mathematics and Statistics Volume 45 (5) (2016), Abstract
Hacettepe Journal of Mathematics and Statistics Volume 45 (5) (2016), 1605 1620 Comparing of some estimation methods for parameters of the Marshall-Olkin generalized exponential distribution under progressive
More informationDynamic analysis of binary longitudinal data
Dynamic analysis of binary longitudinal data Ørnulf Borgan Department of Mathematics University of Oslo Based on joint work with Rosemeire L. Fiaccone, Robin Henderson and Mauricio L. Barreto 1 Outline:
More informationA COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky
A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),
More informationLecture 5 Models and methods for recurrent event data
Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.
More informationEstimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.
Estimating the Mean Response of Treatment Duration Regimes in an Observational Study Anastasios A. Tsiatis http://www.stat.ncsu.edu/ tsiatis/ Introduction to Dynamic Treatment Regimes 1 Outline Description
More informationAnalysis of competing risks data and simulation of data following predened subdistribution hazards
Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013
More informationImproving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates
Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/
More informationBuilding a Prognostic Biomarker
Building a Prognostic Biomarker Noah Simon and Richard Simon July 2016 1 / 44 Prognostic Biomarker for a Continuous Measure On each of n patients measure y i - single continuous outcome (eg. blood pressure,
More informationA Regression Model For Recurrent Events With Distribution Free Correlation Structure
A Regression Model For Recurrent Events With Distribution Free Correlation Structure J. Pénichoux(1), A. Latouche(2), T. Moreau(1) (1) INSERM U780 (2) Université de Versailles, EA2506 ISCB - 2009 - Prague
More informationSampling and Estimation in Network Graphs
Sampling and Estimation in Network Graphs Gonzalo Mateos Dept. of ECE and Goergen Institute for Data Science University of Rochester gmateosb@ece.rochester.edu http://www.ece.rochester.edu/~gmateosb/ March
More information