Childhood Cancer Survivor Study Analysis Concept Proposal

Similar documents
( t) Outline of program: BGC1: Survival and event history analysis Oslo, March-May Recapitulation. The additive regression model

CHAPTER FOUR REPEATED MEASURES IN TOXICITY TESTING

January Examinations 2012

TSS = SST + SSE An orthogonal partition of the total SS

New M-Estimator Objective Function. in Simultaneous Equations Model. (A Comparative Study)

Econ107 Applied Econometrics Topic 5: Specification: Choosing Independent Variables (Studenmund, Chapter 6)

F-Tests and Analysis of Variance (ANOVA) in the Simple Linear Regression Model. 1. Introduction

Robustness Experiments with Two Variance Components

CHAPTER 5: MULTIVARIATE METHODS

Solution in semi infinite diffusion couples (error function analysis)

ACEI working paper series RETRANSFORMATION BIAS IN THE ADJACENT ART PRICE INDEX

CHAPTER 10: LINEAR DISCRIMINATION

Department of Economics University of Toronto

PhD/MA Econometrics Examination. January, 2019

2. SPATIALLY LAGGED DEPENDENT VARIABLES

Standard Error of Technical Cost Incorporating Parameter Uncertainty

( ) () we define the interaction representation by the unitary transformation () = ()

Appendix H: Rarefaction and extrapolation of Hill numbers for incidence data

Analysis And Evaluation of Econometric Time Series Models: Dynamic Transfer Function Approach

In the complete model, these slopes are ANALYSIS OF VARIANCE FOR THE COMPLETE TWO-WAY MODEL. (! i+1 -! i ) + [(!") i+1,q - [(!

Kayode Ayinde Department of Pure and Applied Mathematics, Ladoke Akintola University of Technology P. M. B. 4000, Ogbomoso, Oyo State, Nigeria

UNIVERSITAT AUTÒNOMA DE BARCELONA MARCH 2017 EXAMINATION

Panel Data Regression Models

RELATIONSHIP BETWEEN VOLATILITY AND TRADING VOLUME: THE CASE OF HSI STOCK RETURNS DATA

Chapter 6: AC Circuits

5th International Conference on Advanced Design and Manufacturing Engineering (ICADME 2015)

Anomaly Detection. Lecture Notes for Chapter 9. Introduction to Data Mining, 2 nd Edition by Tan, Steinbach, Karpatne, Kumar

Improvement in Estimating Population Mean using Two Auxiliary Variables in Two-Phase Sampling

Improvement in Estimating Population Mean using Two Auxiliary Variables in Two-Phase Sampling

Variants of Pegasos. December 11, 2009

Dynamic Team Decision Theory. EECS 558 Project Shrutivandana Sharma and David Shuman December 10, 2005

FI 3103 Quantum Physics

J i-1 i. J i i+1. Numerical integration of the diffusion equation (I) Finite difference method. Spatial Discretization. Internal nodes.

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 4

OMXS30 Balance 20% Index Rules

FTCS Solution to the Heat Equation

Linear Response Theory: The connection between QFT and experiments

Introduction to Boosting

Chapter 8 Dynamic Models

Volatility Interpolation

Chapter 5. The linear fixed-effects estimators: matrix creation

Data Collection Definitions of Variables - Conceptualize vs Operationalize Sample Selection Criteria Source of Data Consistency of Data

An introduction to Support Vector Machine

Comparison of Supervised & Unsupervised Learning in βs Estimation between Stocks and the S&P500

Hidden Markov Models Following a lecture by Andrew W. Moore Carnegie Mellon University

Lecture 18: The Laplace Transform (See Sections and 14.7 in Boas)

V.Abramov - FURTHER ANALYSIS OF CONFIDENCE INTERVALS FOR LARGE CLIENT/SERVER COMPUTER NETWORKS

[Link to MIT-Lab 6P.1 goes here.] After completing the lab, fill in the following blanks: Numerical. Simulation s Calculations

Geographically weighted regression (GWR)

DEEP UNFOLDING FOR MULTICHANNEL SOURCE SEPARATION SUPPLEMENTARY MATERIAL

The topology and signature of the regulatory interactions predict the expression pattern of the segment polarity genes in Drosophila m elanogaster

Advanced Machine Learning & Perception

Chapter Lagrangian Interpolation

Fall 2009 Social Sciences 7418 University of Wisconsin-Madison. Problem Set 2 Answers (4) (6) di = D (10)

ELASTIC MODULUS ESTIMATION OF CHOPPED CARBON FIBER TAPE REINFORCED THERMOPLASTICS USING THE MONTE CARLO SIMULATION

Robustness of DEWMA versus EWMA Control Charts to Non-Normal Processes

Mechanics Physics 151

Math 128b Project. Jude Yuen

Bernoulli process with 282 ky periodicity is detected in the R-N reversals of the earth s magnetic field

Lecture 11 SVM cont

. The geometric multiplicity is dim[ker( λi. number of linearly independent eigenvectors associated with this eigenvalue.

Computing Relevance, Similarity: The Vector Space Model

Mechanics Physics 151

Advanced time-series analysis (University of Lund, Economic History Department)

The Finite Element Method for the Analysis of Non-Linear and Dynamic Systems

Comparison of Alternative Labour Force Survey Estimators

Machine Learning Linear Regression

On One Analytic Method of. Constructing Program Controls

CS 536: Machine Learning. Nonparametric Density Estimation Unsupervised Learning - Clustering

2.1 Constitutive Theory

CS286.2 Lecture 14: Quantum de Finetti Theorems II

Clustering (Bishop ch 9)

Tools for Analysis of Accelerated Life and Degradation Test Data

National Exams December 2015 NOTES: 04-BS-13, Biology. 3 hours duration

GENERATING CERTAIN QUINTIC IRREDUCIBLE POLYNOMIALS OVER FINITE FIELDS. Youngwoo Ahn and Kitae Kim

ABSTRACT KEYWORDS. Bonus-malus systems, frequency component, severity component. 1. INTRODUCTION

Robust and Accurate Cancer Classification with Gene Expression Profiling

. The geometric multiplicity is dim[ker( λi. A )], i.e. the number of linearly independent eigenvectors associated with this eigenvalue.

Bayesian Inference of the GARCH model with Rational Errors

PubH 7405: REGRESSION ANALYSIS DIAGNOSTICS IN MULTIPLE REGRESSION

Endogeneity. Is the term given to the situation when one or more of the regressors in the model are correlated with the error term such that

12d Model. Civil and Surveying Software. Drainage Analysis Module Detention/Retention Basins. Owen Thornton BE (Mech), 12d Model Programmer

Cubic Bezier Homotopy Function for Solving Exponential Equations

Optimal environmental charges under imperfect compliance

Comb Filters. Comb Filters

Reactive Methods to Solve the Berth AllocationProblem with Stochastic Arrival and Handling Times

MODELING TIME-VARYING TRADING-DAY EFFECTS IN MONTHLY TIME SERIES

Performance Analysis for a Network having Standby Redundant Unit with Waiting in Repair

Fall 2010 Graduate Course on Dynamic Learning

Tight results for Next Fit and Worst Fit with resource augmentation

Lecture VI Regression

Density Matrix Description of NMR BCMB/CHEM 8190

Lecture 6: Learning for Control (Generalised Linear Regression)

3. OVERVIEW OF NUMERICAL METHODS

Should Exact Index Numbers have Standard Errors? Theory and Application to Asian Growth

Machine Learning 2nd Edition

Efficient Asynchronous Channel Hopping Design for Cognitive Radio Networks

A Novel Iron Loss Reduction Technique for Distribution Transformers. Based on a Combined Genetic Algorithm - Neural Network Approach

This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.

Sampling Procedure of the Sum of two Binary Markov Process Realizations

Transcription:

Chldhood Cancer Survvor Sudy Analyss Concep Proposal 1. Tle: Inverse probably censored weghng (IPCW) o adjus for selecon bas and drop ou n he conex of CCSS analyses 2. Workng group and nvesgaors: Epdemology/Bosascs Workng Group Chongzh D cd@fhcrc.org 206-667-2093 Wendy Lesenrng wlesenr@fhcrc.org 206-667-4374 Kayla Sraon ksraon@fhcrc.org 206-667-7293 Toana Kawashma kawash@fhcrc.org 206-667-7697 Kr Ness kr.ness@sjude.org 901-595-5157 Yuaka Yasu yyasu@ualbera.ca 780-492-4220 Greg Armsrong greg.armsrong@sjude.org 901-495-5892 Ann Merens ann.merens@choa.org 404-785-0691 Aaron McDonald aaron.mcdonald@sjude.org Les Robson les.robson@sjude.org 901-495-5817 3. Background and raonale The Chldhood Cancer Survvor Sudy (CCSS) quered parcpans wh a baselne and several follow-up (we focus on Follow-up 2000, 2003 and 2007 here) quesonnares. As n mos longudnal cohor sudes, here s non-parcpaon a each follow-up quesonnare. Ths may be problemac as dfferenal non-parcpaon, by eher an exposure of neres or a poenal confounder of he assocaon beween he exposure and he oucome of neres, has he poenal o nroduce bas no he esmae of he assocaon beween he exposure and he oucome. Tha s, s unlkely ha he responden cohor members a a gven quesonnare, hose who remaned n he cohor and who compleed he quesonnare, are a represenave sample of he orgnal populaon of neres. Mos analyses n he CCSS have smply omed ndvduals who dd no complee a parcular quesonnare, cng dfferenal non-parcpaon as a poenal lmaon of he sudy. Invesgaors have ndcaed ha dfferenal non-parcpaon may generae based resuls, bu have no quanfed hs poenal bas. Because he cohor connues o age, and because parcpaon declnes gradually over me, s mporan o quanfy hs poenal bas so ha we have a conssen mechansm o examne bas when conducng analyses and wrng manuscrps. Ths concep proposes an nvesgaon o examne poenal bas due o non-parcpaon n he CCSS.

4. Goal We wan o undersand whch characerscs of survvors are assocaed wh nonparcpaon and o wha degree non-parcpaon a follow-up quesonnares are nfluencng analyses (.e. assocaons beween rsk facors and oucomes). If here s an mpac, we wan o adjus for hs ype of selecon bas n fuure analyss, f possble. 5. Analyss framework We plan o conduc analyses usng nverse probably censored weghng (IPCW) mehodology. Ths mehod can be used for mos of he analyses oulned below. When he oucome of neres s from a follow-up quesonnare and he exposures (rsk facors) are from baselne (e.g., dabees a FU 2003 vs. BMI a baselne), we may also explore he augmened IPCW (AIPCW) mehod. When he daa are avalable, he laer mehodology can provde asympocally more effcen esmaes han he former. Analyses for hree follow-ups wll be conduced separaely, each compared o he baselne populaon. In he followng, we use FU2000 as an example, and mehods for FU2003 and FU2007 are he same. IPCW: Inversely wegh regresson analyses by he probably of parcpaon (deermned based on a logsc regresson model for probably of parcpaon gven pas hsory covaraes and oucomes), effecvely nflaes he mpac of underrepresened subjecs, so we can observe assocaons ha would have been observed f all subjecs had sayed n he sudy, assumng he models are correcly specfed. Key references for hs mehodology are: Robns JM, Ronzky A, Zhao LP. Analyss of semparamerc regresson models for repeaed oucomes n he presence of mssng daa. Journal of he Amercan Sascal Assocaon 1995; 90:106 121. Robns JM. Margnal srucural models versus srucural nesed models as ools for causal nference. In Sascal Models n Epdemology: The Envronmen and Clncal Trals, Halloran ME, Berry D (eds). IMA Volume 116, Sprnger-Verlag: New York, 1999; 95 134. Robns JM, Hernan MA, Brumback B. Margnal srucural models and causal nference n epdemology. Epdemology 2000; 11:550 560. Mahemacally,he models are represened as follows: Le Y be he oucome a FU1, X be covaraes (FU1 and/or baselne) for he oucome model, Z be covaraes (baselne) for he mssng mechansm model, R be he mssng ndcaor (R =1 when Y s observed and R =0 f Y s mssng).

Then he probably of mssngness a FU1 can be modeled usng logsc regresson exp( γ 0 + Z γ ) π = P( R = 1 Z ) =, 1+ exp( γ + Z γ ) where he oucome s assumed o follow a generalzed lnear model 1 µ = E ( Y X ) = g ( β0 + X 0 β ), g s a lnk funcon, for nsance deny for connuous oucome and log for bnary oucome. The IPCW esmaon s equvalen o solvng he followng esmang equaons n = 1 R π µ V β 1 ( Y µ ) = 0,where V = var(y ), whch uses nformaon from complee cases only, and gnores nformaon from ncomplee cases. The sandard errors for regresson coeffcens need o be correced by he sandwch varance esmaors. In IPCW analyss, assumng he logsc model for R (.e., facors nfluence log odds of R lnearly) could be quesoned as may or may no approxmae he underlyng mssngness mechansm. Therefore, we wll conduc some sensvy analyss wh respec o he logsc model for mssngness. We wll also consder he possbly of usng nonparamerc models f needed. Augmened IPCW (AIPCW): To mprove effcency, he augmened IPCW mehod (see 4-5) adds an augmenaon erm o he IPCW esmang equaon. Ths mehod nroduces he augmenaon erm for non-parcpans, n conras o IPCW, whch gnores observaons from non-parcpans. However, hs mehod requres ha covaraes for oucome model X are avalable for boh respondens and non-respondens. Thus, Augmened IPCW s applcable n CCSS only f rsk facors are observed a baselne. The mahemacal formula for AIPCW esmang equaon s gven by n n R µ 1 R V ( Y µ ) + (1 ) h( X ) = 0 π β π = 1 = 1 where h s a funcon of X. The augmened erm ncorporaes nformaon from ncomplee cases, and hus poenally mproves effcency compared o IPCW esmaes. The gan n effcency depends on he choce of he funcon h. In pracce, one can choose some smple funconal forms for h (see 4-5).

In addon o possble effcency gans, anoher advanage of AIPCW s ha s doubly robus, n he sense ha yelds conssen resuls f eher he mssngness mechansm or he oucome regresson model s correcly specfed. Algorhm for mplemenaon: 1. Calculae probably weghs: Among subjecs who parcpaed a baselne, and who were elgble (.e alve and elgble) for he relevan FU quesonnare, f a logsc regresson model o predc parcpaon a ha quesonnare, usng covaraes from baselne ha are key predcors of parcpaon (Z defne hese as he se we ve currenly denfed n our parcpaon models) and hose ha are any baselne versons of he curren oucome (D) and rsk facor of neres (E) (f he exac quesons are no avalable, we ry o use any smlar nformaon avalable). Resuls of hs modelng process wll be summarzed o descrbe facors assocaed wh parcpaon a each quesonnare. Calculae predced probables from hs model call hese P. 2. Calculae sablzed versons of probably weghs: A sablzng calculaon nvolves fng he same predcon model as above, bu only wh E as he covarae. Calculae predced probables from hs model call hese S. Then calculae sablzed weghs = SW = S/P. 3. F weghed logsc regresson usng 1/P as weghs: Among subjecs who responded o FU quesonnare of neres, f a logsc regresson model wh D as he oucome, and wh E as he covarae of neres and any oher approprae adjusmen covaraes for ha oucome (would probably do boh unvarae w/ E as well as mulvarae). A samplng wegh of 1/P should be ncorporaed n hs analyss (double check wheher SAS auomacally akes he nverse of P when usng weghs, or wheher you need o gve 1/P. Use robus varances. 4. F weghed logsc regresson usng sablzed weghs: F he same model as above wh sablzed weghs, SW = S/P, nsead of 1/P. Correc sandard errors by sandwch esmaors. 5. F an unweghed verson of he logsc regresson for comparson. Use he same model(s) as above n (3), bu whou any samplng weghs. Repor ORs for E for he hree models, 1) weghed w/ P, 2) weghed wh SW and 3) unweghed.

Proposed Oucome / Rsk facor combnaons o look a, along wh relevan covaraes for probably wegh predcon model. Covaraes for: Quesonnare Oucome (D) Key Rsk Facor (E) Probably wegh Model (1,2) FU 2003 Dabees* BMI (FU 2003) BMI(base), Dabees(base), oher RF from parcpaon model (Z) FU 2007 Dabees* BMI (FU 2007) BMI(base), Dabees(base), oher RF from parcpaon model (Z) Assocaon Models (3, 4, 5, 6) BMI(FU 2003), oher RF from parcpaon model (X) BMI(FU 2007), oher RF from parcpaon model (X) FU 2003 Pan (E21) BSI (scored from FU 2003 G) hree subscales depresson, somazaon, anxey (dchoomzed) BSI (base - J16 J37), RF from parcpaon model (Z) BSI (FU 2003), RF from parcpaon model (X) FU 2003 SF-36 Physcal funcon and General healh subscales Age (FU 2003) Age (Base), Healh a base (N15), RF from parcpaon model (Z) Age (FU 2003), RF from parcpaon model (X) * Use defnon used by Yuaka s group ** We wll also consder analyses wh reamen as a covarae. However, he use of reamen s somewha problemac snce here s a dfferen mssng daa suaon here (mssng covaraes, n conras o mssng oucomes n oher proposed analyses). Yuaka Yasu s group s workng on mssng reamen ssue usng a mulple mpuaon mehodology and as ha daa becomes avalable, we wll evaluae ncorporang he mulply mpued daa no some reamen relaed hypoheses.

Table 1: Saus a each follow-up Baselne FU 2000 FU 2003 FU 2007 Parcpan Non- Parcpan Dead Inelgble Toal

Table 2 Dsrbuon of covaraes a each follow-up or for MRAF avalably Baselne elgble FU 2000 elgble FU 2003 elgble FU 2007 elgble Parcpans Parcpans Parcpans Parcpans Nonparcpans Nonparcpans Nonparcpans Nonparcpans Gender Race Baselne age Dagnoss Age of dagnoss

Table 3: ORs beween parcpaon and subjec characerscs Baselne FU 2000 FU 2003 FU 2007 OR (95% CI) OR (95% CI) OR (95% CI) OR (95% CI) Gender Race Baselne age Dagnoss Age of dagnoss Table 4: ORs beween oucomes and key rsk facors, unadjused and adjused Quesonnare Oucome Key rsk facor ORs (95% CIs) FU 2003 Dabees BMI FU 2003 Dabees BMI (base)? FU 2003 Pan BSI FU 2003 Pan BSI (base)? FU 2003 SF-36 Age Naïve IPCW IPCW (sablzed)

6. References 1. Robns JM, Ronzky A, Zhao LP. Analyss of semparamerc regresson models for repeaed oucomes n he presence of mssng daa. Journal of he Amercan Sascal Assocaon 1995; 90:106 121. 2. Robns JM. Margnal srucural models versus srucural nesed models as ools for causal nference. In Sascal Models n Epdemology: The Envronmen and Clncal Trals, Halloran ME, Berry D (eds). IMA Volume 116, Sprnger-Verlag: New York, 1999; 95 134. 2. Robns JM, Hernan MA, Brumback B. Margnal srucural models and causal nference n epdemology. Epdemology 2000; 11:550 560. 3. Scharfsen DO, Ronzky A, Robns JM. (1999). Adjusng for non-gnorable drop-ou usng semparamerc non-response models. Journal of he Amercan Sascal Assocaon, 94:1096-1120. 4. Ronzky A, Robns JM, Scharfsen D. (1999). Semparamerc regresson for repeaed oucomes wh nongnorable nonresponse. Journal of he Amercan Sascal Assocaon, 93(444):1321-1339.