Authors and Affiliations: Nianbo Dong University of Missouri 14 Hill Hall, Columbia, MO Phone: (573)

Size: px
Start display at page:

Download "Authors and Affiliations: Nianbo Dong University of Missouri 14 Hill Hall, Columbia, MO Phone: (573)"

Transcription

1 Prognostic Propensity Scores: A Method Accounting for the Correlations of the Covariates with Both the Treatment and the Outcome Variables in Matching and Diagnostics Authors and Affiliations: Nianbo Dong University of Missouri 14 Hill Hall, Columbia, MO Phone: (573) dong.nianbo@gmail.com Metin Bulus University of Missouri 14 Hill Hall, Columbia, MO mbnt9@mail.missouri.edu 1

2 Prognostic Propensity Scores: A Method Accounting for the Correlations of the Covariates with Both the Treatment and the Outcome Variables in Matching and Diagnostics Background Propensity scores (PS, Rosenbaum & Rubin, 1983) and prognostic scores (PGS, Hansen, 2006, 2008; Wyss et al., 2015) are two summary scores to address confounding for treatment effect estimation. PS focus on capturing the associations of the covariates and the treatment variable and PGS focus on capturing the associations of the covariates and the outcome variable. Because the associations of the covariates with both the treatment and the outcome variables may affect accuracy of treatment effect estimates, combining PS and PGS may have desirable properties. Hansen (2006) combined PS and PGS into a single index using Mahalanobis distance and used this index for full matching. He concluded that full matching on the combined index resulted in smaller bias and variance compared to matching on PS alone. Kelcey (2013) further compared full matching on the combined index proposed by Hansen (2006) to full matching on PS (0.1 caliper size), and full matching on PS within PGS strata (quintiles). Kelcey (2013) concluded that full matching on PS within PGS strata is superior to full matching on the combined index, and full matching on PS alone. A more comprehensive simulation study was conducted by Leacy and Stuart (2014) to compare full matching on the combined index with full matching on PGS within PS calipers (0.2 caliper size), subclassification based on a grid of 5 x 5 subclasses obtained from quintiles of PS and PGS, full matching on PS or PGS alone, and subclassification (quintiles) on PS or PGS. Leacy and Stuart (2014) concluded that full matching on the combined index and full matching on PGS within PS calipers are superior to other methods. Albeit contradictory in their findings with regard to performance of the combined index, it is difficult to draw any definite conclusion since all three studies only have full matching on PS in common as a comparison method. Nevertheless, results are encouraging and warrant further exploration of combining PS and PGS. Furthermore, the diagnostics for PS that commonly rely on covariate-based absolute standardized difference (ASD) or PS-based standardized difference (PSSD), may not apply to PGS or its joint use with PS. The PGS-based standardized difference (PGSSD) was suggested as a measure of balance for both PS and PGS (Stuart, Lee, & Leacy, 2013). However, diagnostics for the joint use of PS and PGS have not been investigated to date. Purpose The purpose is twofold: (1) We propose an alternative method to combine PS and PGS, and compare its performance to regression adjustment, optimal matching on PS, and optimal matching on PGS, and (2) We propose a correlation-based balance measure for diagnostics. Research Design Let a matrix of covariates be denoted as XX, and define PS as XXXX, where αα being a column vector of standardized regression weights to predict treatment status (T). For the same matrix of XX, let PGS be denoted as XXXX, where ββ being a column vector of standardized regression weights to predict the outcome (Y). Then the prognostic propensity score (PGPS) is defined based on redefined weights as PGPS = XX(dddddddd[ααββ TT ]). 2

3 This combined score (rather than combined index derived from Mahalanobis distance) is used for optimal matching (Ming & Rosenbaum, 2001). We also propose a correlation based covariate balance measure, which can be used to diagnose PS, PGS, and PGPS. Let the correlation between the covariates and the treatment status (TT) be denoted in vector form as cccccccc(xx, TT), the correlation between the covariates and the outcome (YY) in vector form as cccccccc(xx, YY), and number of covariates as NN, then the proposed covariate balance measure is defined as: RR XXXX RR XXXX = [cccccccc(xx, TT) TT cccccccc(xx, YY)]/NN. We conducted Monte Carlo simulation to compare the root mean square error (RMSE) of the treatment effect estimates among several estimation methods: analysis of variance (ANOVA), regression (REG), prognostic score (PGS), propensity score (PS), and prognostic propensity score (PGPS), as well as the doubly robust methods by including covariates in the outcome models (doubly robust prognostic score (DR.PGS), doubly robust prognostic score (DR.PS), and doubly robust prognostic propensity score (DR.PGPS)). Balance measure, RR XXXX RR XXXX, were compared with other balance measures regarding the correlations between the balance measures and bias. Simulation scenarios (n = 576) are adapted from Ali et al. (2014) and Stuart, Lee, and Leacy (2013). See Appendix A for further details. Findings and Discussion This is an ongoing study. We report partial simulation results from 500 replications for each scenario below. Table 1 and Figure 1 present the results of root mean square error (RMSE) of various estimation methods in 576 simulation scenarios. Overall, there is no superiority of PGPS in terms of RMSE. The doubly robust methods (DR.PGS; DR.PS; DR.PGPS) perform similarly better than the methods (PGS; PS; PGPS) without including covariates in the outcome model, and better than regression. Simulation results may guide researchers as to when PGS may be a better option compared to regression adjustment and PS. There are already some studies focusing on these comparisons (Argobast & Ray, 2011; Argobast et al, 2012; Pfeiffer & Riedl, 2015; Schmidt et al, 2016; Wyss et al., 2015; Xu et al., 2016). These studies concluded that PS and PGS have different characteristics that may reduce bias under different circumstances, in particular when treatmentto-control ratio is small. We will identify specific scenarios where some methods are better than the others in the final paper. Finally, this study shows that RR XXXX RR XXXX balance measure have higher correlation with bias and relatively more robust to estimation methods (PS, PGS, and PGPS) (Table 2 and Figure 2). The second best measure, PGSSD is superior to ASD or PSD, confirming Stuart, Lee, and Leacy (2013) findings. Nonetheless, this holds when the outcome model is correctly specified, or when there is no omitted variables in the outcome model. In all other cases, we recommend using RR XXXX RR XXXX balance measure regardless of method because it has the highest correlation when models are miss-specified, and it is more robust to estimation methods. 3

4 References Ali, M. S., Groenwold, R. H., Pestman, W. R., Belitser, S. V., Roes, K. C., Hoes, A. W.,... & Klungel, O. H. (2014). Propensity score balance measures in pharmacoepidemiology: a simulation study. Pharmacoepidemiology and drug safety, 23(8), Arbogast, P. G., & Ray, W. A. (2011). Performance of disease risk scores, propensity scores, and traditional multivariable outcome regression in the presence of multiple confounders. American journal of epidemiology, 174(5), Arbogast, P. G., Seeger, J. D., & DEcIDE Methods Center Summary Variable Working Group. (2012). Summary variables in observational research: propensity scores and disease risk scores. Effective health care program research report, (33). Hansen, B. B. (2006). Bias reduction in observational studies via prognosis scores. Technical report #441, University of Michigan Statistics Department. Hansen, B. B. (2008). The prognostic analogue of the propensity score. Biometrika, 95(2), Ming, K, & Rosenbaum, P. R. (2001). A note on optimal matching with variable controls using the assignment algorithm. Journal of Computational and Graphical Statistics, 10: Pfeiffer, R. M., & Riedl, R. (2015). On the use and misuse of scalar scores of confounders in design and analysis of observational studies. Statistics in medicine, 34(18), Kelcey, B. (2013). Propensity Score Matching within Prognostic Strata. Society for Research on Educational Effectiveness. Leacy, F. P., & Stuart, E. A. (2014). On the joint use of propensity and prognostic scores in estimation of the average treatment effect on the treated: a simulation study. Statistics in medicine, 33(20), Rosenbaum, P. R. & Rubin, D. B. (1983). The central role of the propensity score in observational studies for causal effects. Biometrika 70, Schmidt, A. F., Klungel, O. H., Groenwold, R. H., & GetReal Consortium. (2016). Adjusting for confounding in early post launch settings: Going beyond logistic regression models. Epidemiology, 27(1), Stuart, E. A., Lee, B. K., & Leacy, F. P. (2013). Prognostic score based balance measures can be a useful diagnostic for propensity score methods in comparative effectiveness research. Journal of clinical epidemiology,66(8), S84-S90. Wyss, R., Ellis, A. R., Brookhart, M. A., Jonsson Funk, M., Girman, C. J., Simpson, R. J., & Stürmer, T. (2015). Matching on the disease risk score in comparative effectiveness research of new treatments.pharmacoepidemiology and drug safety, 24(9), Xu, S., Shetterly, S., Cook, A. J., Raebel, M. A., Goonesekera, S., Shoaibi, A.,... & Fireman, B. (2016). Evaluation of propensity scores, disease risk scores, and regression in confounder adjustment for the safety of emerging treatment with group sequential monitoring. Pharmacoepidemiology and drug safety, 25(4),

5 Tables & Figures Table 1 Root Mean Squared Error of Various Estimation Methods in 576 Simulation Scenarios Sample Estimation LCL UCL Mean Method Mean Mean SD Median Min Max Full ANOVA Sample REG DR.PGPS DR.PGS Common DR.PS Support PGPS Sample PGS PS Note. ANOVA: Analysis of Variance. REG: Regression. DR.PGPS: Doubly Robust Prognostic Propensity Score. DR.PGS: Doubly Robust Prognostic Score. DR.PS: Doubly Robust Prognostic Score. PGPS: Prognostic Propensity Score. PGS: Prognostic Score. PS: Propensity Score. 5

6 Table 2 Correlations of Covariate Balance Measure with Bias in 576 Simulation Scenarios Estimation Method Covariate Balance Measure Mean LCL Mean UCL Mean SD Min Max ASD PGPSSD PGPS PGSSD PSSD RxtRxy ASD PGPSSD PGS PGSSD PSSD RxtRxy ASD PGPSSD PS PGSSD PSSD RxtRxy Note. ANOVA: Analysis of Variance. PGPS: Prognostic Propensity Score. PGS: Prognostic Score. PS: Propensity Score. ASD: Absolute Standardized Difference. PSSD: Propensity Score Standardized Difference. PGSSD: Prognostic Score Standardized Difference. PGPSSD: Prognostic Propensity Score Standardized Difference. RxtRxy: Product of correlations between covariate and treatment status, and covariate and outcome. 6

7 Note. DR.PGPS: Doubly Robust Prognostic Propensity Score. DR.PGS: Doubly Robust Prognostic Score. DR.PS: Doubly Robust Prognostic Score. PGPS: Prognostic Propensity Score. PGS: Prognostic Score. PS: Propensity Score. REG: Regression. Figure 1. Optimal Matching within Common Support Sample 7

8 PS MS01 MS02 MS03 MS04 MS05 MS06 MS07 MS08 MS09 MS10 MS11 MS12 MS01 MS02 MS03 MS04 MS05 MS06 MS07 MS08 MS09 MS10 MS11 MS12 MS01 MS02 MS03 MS04 MS05 MS06 MS07 MS08 MS09 MS10 MS11 Model Specification MS12 MS01 MS02 MS03 MS04 MS05 MS06 MS07 MS08 MS09 MS10 MS11 MS12 MS01 MS02 MS03 MS04 MS05 MS06 MS07 MS08 MS09 MS10 MS11 MS12 Estimation Method PGS PGPS ASD PGPSSD PGSSD PSSD RxtRxy Covariate Balance Measure Note. PS: Propensity Score. PGS: Prognostic Score. PGPS: Prognostic Propensity Score. ASD: Absolute Standardized Difference. PSSD: Propensity Score Standardized Difference. PGSSD: Prognostic Score Standardized Difference. PGPSSD: Prognostic Propensity Score Standardized Difference. RxtRxy: Product of correlations between covariate and treatment status, and covariate and outcome. Figure 2. Covariate Balance Measure Correlation with Bias 8

9 Appendix A. Simulation Framework The following figure demonstrates the relationship of covariates and confounders with the treatment and the outcome variables. Unlike Ali et al (2014), X 9 is not in this framework (related to neither treatment nor outcome). We also modified coefficients due to having a continuous outcome instead of a binary outcome. We adapted model specifications from Stuart, Lee and Leacy (2013), and modified them by considering both prognostic score and propensity scores models. X7 and X8 X3 and X6 Treatment Outcome X1, X2, X4 and X5 Figure A1. Association of Covariates and Confounders with Treatment and Outcome True Models: llllllllll(pp tt ) = αα 0 + αα 1 xx 1 + αα 2 xx 2 + αα 3 xx 44 + αα 4 xx 55 + αα 5 xx 7 + αα 6 xx 88 + αα 7 xx 2 xx 44 + αα 8 xx 2 xx 7 + αα 9 xx 7 xx 88 + αα 10 xx 44 xx 55 + αα 11 xx αα 12 xx yy = ββ 0 + γγγγ + ββ 1 xx 1 + ββ 2 xx 2 + ββ 3 xx 3 + ββ 4 xx 44 + ββ 5 xx 55 + ββ 6 xx 66 + ββ 7 xx 2 xx 44 + ββ 8 xx 3 xx 55 + ββ 9 xx 3 xx 66 + ββ 10 xx 44 xx 55 + ββ 11 xx ββ 12 xx Color and Font Codes: X 7, X 8: only treatment X 1, X 2, X 4, X 5: both treatment and outcome X 3, X 6: only outcome X 1, X 2, X 3, X 7: binary {~ Bernoulli (p), where p= (.30,.40,.50,.70)} X 4, X 5, X 6, X 8: continuous {all ~ Normal (0, 1)} In this study, for the true treatment model the intercept value is assigned such that treatment-tocontrol ratio is approximately.30. We further used acceptance-rejection technique and discarded samples having less than.20 and more than.40 treatment-to-control ratio. 9

10 Table A1. Four Scenarios for True Model Coefficients True Models (TM) True Model Coefficients TM1: Propensity score Main effects in propensity score model (PSM) model and outcome model αα 3, αα 4, αα 6 : log(1.5), log(2), log (3) ffffff cccccccc coefficients are both in αα 1, αα 2, αα 5 : log(1.2), log(1.5), log(2) ffffff bbbbbbbbbbbb increasing order Interactions and quadratic terms in PSM αα 7, αα 8, αα 9, αα 10, αα 11, αα 12 : log(1.1), log(1.2), log(1.3), log(1.4), log(1.5), log (1.6) TM2: Propensity score model coefficients are in increasing order when outcome model coefficients are in decreasing order TM3: Propensity score model coefficients are in decreasing order when outcome model coefficients are in increasing order TM4: Propensity score model and outcome model coefficients are both in decreasing order Main effects in outcome model (OM) γγ: 0 oooo 0.5 oooo 1 ββ 4, ββ 5, ββ 6 : 0.40, 0.70, 1 ffffff cccccccc ββ 1, ββ 2, ββ 3 : 0.20, 0.40, 0.70 ffffff bbbbbbbbbbbb Interactions and quadratic terms in OM ββ 7, ββ 8, ββ 9, ββ 10, ββ 11, ββ 12 : 0.1, 0.2, 0.3, 0.4, 0.5, 0.6 Main effects in propensity score model (PSM) αα 3, αα 4, αα 6 : log(1.5), log(2), log (3) ffffff cccccccc αα 1, αα 2, αα 5 : log(1.2), log(1.5), log(2) ffffff bbbbbbbbbbbb Interactions and quadratic terms in PSM αα 7, αα 8, αα 9, αα 10, αα 11, αα 12 : log(1.1), log(1.2), log(1.3), log(1.4), log(1.5), log (1.6) Main effects in outcome model (OM) γγ: 0 oooo 0.5 oooo 1 ββ 4, ββ 5, ββ 6 : 1, 0.70, 0.40 ffffff cccccccc ββ 1, ββ 2, ββ 3 : 0.70, 0.40, 0.20 ffffff bbbbbbbbbbbb Interactions and quadratic terms in OM ββ 7, ββ 8, ββ 9, ββ 10, ββ 11, ββ 12 : 0.6, 0.5, 0.4, 0.3, 0.2, 0.1 Main effects in propensity score model (PSM) αα 3, αα 4, αα 6 : log(3), log(2), log (1.5) ffffff cccccccc αα 1, αα 2, αα 5 : log(2), log(1.5), log(1.2) ffffff bbbbbbbbbbbb Interactions and quadratic terms in PSM αα 7, αα 8, αα 9, αα 10, αα 11, αα 12 : log(1.6), log(1.5), log(1.4), log(1.3), log(1.2), log (1.1) Main effects in outcome model (OM) γγ: 0 oooo 0.5 oooo 1 ββ 4, ββ 5, ββ 6 : 0.40, 0.70, 1 ffffff cccccccc ββ 1, ββ 2, ββ 3 : 0.20, 0.40, 0.70 ffffff bbbbbbbbbbbb Interactions and quadratic terms in OM ββ 7, ββ 8, ββ 9, ββ 10, ββ 11, ββ 12 : 0.1, 0.2, 0.3, 0.4, 0.5, 0.6 Main effects in propensity score model (PSM) αα 3, αα 4, αα 6 : log(3), log(2), log (1.5) ffffff cccccccc αα 1, αα 2, αα 5 : log(2), log(1.5), log(1.2) ffffff bbbbbbbbbbbb Interactions and quadratic terms in PSM αα 7, αα 8, αα 9, αα 10, αα 11, αα 12 : log(1.6), log(1.5), log(1.4), log(1.3), log(1.2), log (1.1) Main effects in outcome model (OM) γγ: 0 oooo 0.5 oooo 1 ββ 4, ββ 5, ββ 6 : 1, 0.70, 0.40 ffffff cccccccc ββ 1, ββ 2, ββ 3 : 0.70, 0.40, 0.20 ffffff bbbbbbbbbbbb Interactions and quadratic terms in OM ββ 7, ββ 8, ββ 9, ββ 10, ββ 11, ββ 12 : 0.6, 0.5, 0.4, 0.3, 0.2,

11 Model Specifications (MS) Propensity Score Model (PSM) Prognostic Score Model (PGSM) Outcome Model (OM) MS1: True treatment model PSM1: llllllllll(pp tt ) = αα 0 + αα 1 xx 1 + αα 2 xx 2 + αα 3 xx 4 + αα 4 xx 5 + αα 5 xx 7 + αα 6 xx 8 + αα 7 xx 2 xx 4 + αα 8 xx 2 xx 7 + αα 9 xx 7 xx 8 + αα 10 xx 4 xx 5 + αα 11 xx αα 12 xx 8 2 PGSM1: yy = ξξ 0 + ξξ 1 xx 1 + ξξ 2 xx 2 + ξξ 3 xx 4 + ξξ 4 xx 5 + ξξ 5 xx 7 + ξξ 6 xx 8 + ξξ 7 xx 2 xx 4 + ξξ 8 xx 2 xx 7 + ξξ 9 xx 7 xx 8 + ξξ 10 xx 4 xx 5 + ξξ 11 xx ξξ 12 xx 8 2 OM1: yy = ββ 0 + γγγγ + ββ 1 xx 1 + ββ 2 xx 2 + ββ 3 xx 4 + ββ 4 xx 5 + ββ 5 xx 7 + ββ 6 xx 8 + ββ 7 xx 2 xx 4 + ββ 8 xx 2 xx 7 + ββ 9 xx 7 xx 8 + ββ 10 xx 4 xx 5 + ββ 11 xx ββ 12 xx 8 2 MS2: True outcome model PSM2: llllllllll(pp tt ) = αα 0 +αα 1 xx 1 + αα 2 xx 2 + αα 3 xx 3 + αα 4 xx 44 + αα 5 xx 55 + αα 6 xx 66 + αα 7 xx 2 xx 44 + αα 8 xx 3 xx 55 + αα 9 xx 3 xx 66 + αα 10 xx 44 xx 55 + αα 11 xx αα 12 xx PGSM2: yy = ξξ 0 + ξξ 1 xx 1 + ξξ 2 xx 2 + ξξ 3 xx 3 + ξξ 4 xx 44 + ξξ 5 xx 55 + ξξ 6 xx 66 + ξξ 7 xx 2 xx 44 + ξξ 8 xx 3 xx 55 + ξξ 9 xx 3 xx 66 + ξξ 10 xx 44 xx 55 + ξξ 11 xx ξξ 12 xx OM2: yy = ββ 0 + γγγγ + ββ 1 xx 1 + ββ 2 xx 2 + ββ 3 xx 3 + ββ 4 xx 44 + ββ 5 xx 55 + ββ 6 xx 66 + ββ 7 xx 2 xx 44 + ββ 8 xx 3 xx 55 + ββ 9 xx 3 xx 66 + ββ 10 xx 44 xx 55 + ββ 11 xx ββ 12 xx MS3: Main effects for all covariates PSM3: llllllllll(pp tt ) = αα 0 + αα 1 xx 1 + αα 2 xx 2 + αα 3 xx 3 + αα 4 xx 4 + αα 5 xx 5 + αα 6 xx 66 + αα 7 xx 7 + αα 8 xx 8 PGSM3: yy = ξξ 0 + ξξ 1 xx 1 + ξξ 2 xx 2 + ξξ 3 xx 3 + ξξ 4 xx 4 + ξξ 5 xx 5 + ξξ 6 xx 66 + ξξ 7 xx 7 + ξξ 8 xx 8 OM3: yy = ββ 0 + γγγγ + ββ 1 xx 1 + ββ 2 xx 2 + ββ 3 xx 3 + ββ 4 xx 4 + ββ 5 xx 5 + ββ 6 xx 66 + ββ 7 xx 7 + ββ 8 xx 8 MS4: Main effects for six covariates related to treatment PSM4: llllllllll(pp tt ) = αα 0 + αα 1 xx 1 + αα 2 xx 2 +αα 3 xx 4 + αα 4 xx 5 + αα 5 xx 7 + αα 6 xx 8 PGSM4: yy = ξξ 0 + ξξ 1 xx 1 + ξξ 2 xx 2 +ξξ 3 xx 4 + ξξ 4 xx 5 + ξξ 5 xx 7 + ξξ 6 xx 8 OM4: yy = ββ 0 + γγγγ + ββ 1 xx 1 + ββ 2 xx 2 +ββ 3 xx 4 + ββ 4 xx 5 + ββ 5 xx 7 + ββ 6 xx 8 MS5: Main effects for six covariates related to outcome PSM5: llllllllll(pp tt ) = αα 0 +αα 1 xx 1 + αα 2 xx 2 + αα 3 xx 3 + αα 4 xx 44 + αα 5 xx 55 + αα 6 xx 66 PGSM5: yy = ξξ 0 +ξξ 1 xx 1 + ξξ 2 xx 2 + ξξ 3 xx 3 + ξξ 4 xx 44 + ξξ 5 xx 55 + ξξ 6 xx 66 OM5: yy = ββ 0 + γγγγ + ββ 1 xx 1 + ββ 2 xx 2 + ββ 3 xx 3 + ββ 4 xx 44 + ββ 5 xx 55 + ββ 6 xx 66 11

12 MS6: Main effects for five covariates related to treatment, omit confounder strongly related to outcome PSM6: llllllllll(pp tt ) = αα 0 + αα 1 xx 1 + αα 2 xx 2 +αα 3 xx 4 + αα 5 xx 7 + αα 6 xx 8 PGSM6: yy = ξξ 0 + ξξ 1 xx 1 + ξξ 2 xx 2 +ξξ 3 xx 4 + ξξ 5 xx 7 + ξξ 6 xx 8 OM6: yy = ββ 0 + γγγγ + ββ 1 xx 1 + ββ 2 xx 2 +ββ 3 xx 4 + ββ 5 xx 7 + ββ 6 xx 8 MS7: Main effects for five covariates related to outcome, omit confounder strongly related to treatment PSM7: llllllllll(pp tt ) = αα 0 + αα 1 xx 1 + αα 2 xx 2 + αα 3 xx 3 + αα 4 xx 44 + αα 6 xx 66 PGSM7: yy = ξξ 0 + ξξ 1 xx 1 + ξξ 2 xx 2 + ξξ 3 xx 3 + ξξ 4 xx 44 + ξξ 6 xx 66 OM7: yy = ββ 0 + γγγγ + ββ 1 xx 1 + ββ 2 xx 2 + ββ 3 xx 3 + ββ 4 xx 44 + ββ 6 xx 66 MS8: Main effects for five covariates related to treatment, omit confounder weakly related to outcome PSM8: llllllllll(pp tt ) = αα 0 + αα 2 xx 2 +αα 3 xx 4 + αα 4 xx 5 + αα 5 xx 7 + αα 6 xx 8 PGSM8: yy = ξξ 0 + ξξ 2 xx 2 +ξξ 3 xx 4 + ξξ 4 xx 5 + ξξ 5 xx 7 + ξξ 6 xx 8 OM8: yy = ββ 0 + γγγγ + ββ 2 xx 2 +ββ 3 xx 4 + ββ 4 xx 5 + ββ 5 xx 7 + ββ 6 xx 8 MS9: Main effects for five covariates related to outcome, omit confounder weakly related to treatment PSM9: llllllllll(pp tt ) = αα 0 + αα 2 xx 2 + αα 3 xx 3 + αα 4 xx 44 + αα 5 xx 55 + αα 6 xx 66 PGSM9: yy = ξξ 0 + ξξ 2 xx 2 + ξξ 3 xx 3 + ξξ 4 xx 44 + ξξ 5 xx 55 + ξξ 6 xx 66 OM9: yy = ββ 0 + γγγγ + ββ 2 xx 2 + ββ 3 xx 3 + ββ 4 xx 44 + ββ 5 xx 55 + ββ 6 xx 66 MS10: Main effects for six covariates related to treatment, add one covariate strongly related to outcome only PSM10: llllllllll(pp tt ) = αα 0 + αα 1 xx 1 + αα 2 xx 2 +αα 3 xx 4 + αα 4 xx 5 + αα 5 xx 7 + αα 6 xx 8 + αα 7 xx 66 PGSM10: yy = ξξ 0 + ξξxx 1 + ξξ 2 xx 2 +ξξ 3 xx 4 + ξξ 4 xx 5 + ξξ 5 xx 7 + ξξxx 8 + ξξ 7 xx 66 OM10: yy = ββ 0 + γγγγ + ββ 1 xx 1 + ββ 2 xx 2 +ββ 3 xx 4 + ββ 4 xx 5 + ββ 5 xx 7 + ββ 6 xx 8 + ββ 7 xx 66 MS11: Main effects for six covariates related to outcome, add one covariate strongly related to treatment only PSM11: llllllllll(pp tt ) = αα 0 +αα 1 xx 1 + αα 2 xx 2 + αα 3 xx 3 + αα 4 xx 44 + αα 5 xx 55 + αα 6 xx 66 + αα 7 xx 88 PGSM11: 12

13 yy = ξξ 0 +ξξ 1 xx 1 + ξξ 2 xx 2 + ξξ 3 xx 3 + ξξ 4 xx 44 + ξξ 5 xx 55 + ξξ 6 xx 66 + ξξ 7 xx 88 OM11: yy = ββ 0 + γγγγ + ββ 1 xx 1 + ββ 2 xx 2 + ββ 3 xx 3 + ββ 4 xx 44 + ββ 5 xx 55 + ββ 6 xx 66 + ββ 7 xx 88 MS12: Main effects for four covariates related to both treatment and outcome PSM12: llllllllll(pp tt ) = αα 0 +αα 1 xx 1 + αα 2 xx 2 + αα 3 xx 44 + αα 4 xx 55 PGSM12: yy = ξξ 0 +ξξ 1 xx 1 + ξξ 2 xx 2 + ξξ 3 xx 44 + ξξ 4 xx 55 OM12: yy = ββ 0 + γγγγ + ββ 1 xx 1 + ββ 2 xx 2 + ββ 3 xx 44 + ββ 4 xx 55 13

14 Table A2 Simulation Scenarios and Labels (Example: TM1GAM1N1) True Model (TM) (1) TM1 (2) TM2 (3) TM3 (4) TM4 Treatment Effect (γγ) (1) 0 (2) 0.5 (3) 1 Sample Size (1) 200 (2) 400 (3) 800 (4) 1600 Estimation Method (1) Regression (2) PS (3) PGS (4) PGPS (5) Doubly-robust PS (6) Doubly-robust PGS (7) Doubly-robust PGPS (8) PS within CS (9) PGS within CS (10) PGPS within CS (11) Doubly-robust PS within CS (12) Doubly-robust PGS within CS (13) Doubly-robust PGPS within CS Model Specification (MS) (1) MS1 (2) MS2 (3) MS3 (4) MS4 (5) MS5 (6) MS6 (7) MS7 (8) MS8 (9) MS9 (10) MS10 (11) MS11 (12) MS12 Balance Measures (1) ASD (2) PSSD (3) PGSSD (4) PGPSSD (5) R XTR XY Note. PS: Propensity Score. PGS: Prognostic Score. PGPS: Prognostic Propensity Score. CS: Common Support. ASD: Absolute Standardized Difference. PSSD: Propensity Score Standardized Difference. PGSSD: Prognostic Score Standardized Difference. PGPSSD: Prognostic Propensity Score Standardized Difference. 14

The Impact of Measurement Error on Propensity Score Analysis: An Empirical Investigation of Fallible Covariates

The Impact of Measurement Error on Propensity Score Analysis: An Empirical Investigation of Fallible Covariates The Impact of Measurement Error on Propensity Score Analysis: An Empirical Investigation of Fallible Covariates Eun Sook Kim, Patricia Rodríguez de Gil, Jeffrey D. Kromrey, Rheta E. Lanehart, Aarti Bellara,

More information

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded 1 Background Latent confounder is common in social and behavioral science in which most of cases the selection mechanism

More information

Propensity Score Methods for Causal Inference

Propensity Score Methods for Causal Inference John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good

More information

Matching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14

Matching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14 STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University Frequency 0 2 4 6 8 Quiz 2 Histogram of Quiz2 10 12 14 16 18 20 Quiz2

More information

An Introduction to Causal Analysis on Observational Data using Propensity Scores

An Introduction to Causal Analysis on Observational Data using Propensity Scores An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut

More information

University of Michigan School of Public Health

University of Michigan School of Public Health University of Michigan School of Public Health The University of Michigan Department of Biostatistics Working Paper Series Year 003 Paper Weighting Adustments for Unit Nonresponse with Multiple Outcome

More information

On the Use of the Bross Formula for Prioritizing Covariates in the High-Dimensional Propensity Score Algorithm

On the Use of the Bross Formula for Prioritizing Covariates in the High-Dimensional Propensity Score Algorithm On the Use of the Bross Formula for Prioritizing Covariates in the High-Dimensional Propensity Score Algorithm Richard Wyss 1, Bruce Fireman 2, Jeremy A. Rassen 3, Sebastian Schneeweiss 1 Author Affiliations:

More information

DATA-ADAPTIVE VARIABLE SELECTION FOR

DATA-ADAPTIVE VARIABLE SELECTION FOR DATA-ADAPTIVE VARIABLE SELECTION FOR CAUSAL INFERENCE Group Health Research Institute Department of Biostatistics, University of Washington shortreed.s@ghc.org joint work with Ashkan Ertefaie Department

More information

Propensity Score Weighting with Multilevel Data

Propensity Score Weighting with Multilevel Data Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative

More information

Estimating the Marginal Odds Ratio in Observational Studies

Estimating the Marginal Odds Ratio in Observational Studies Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios

More information

arxiv: v1 [stat.me] 15 May 2011

arxiv: v1 [stat.me] 15 May 2011 Working Paper Propensity Score Analysis with Matching Weights Liang Li, Ph.D. arxiv:1105.2917v1 [stat.me] 15 May 2011 Associate Staff of Biostatistics Department of Quantitative Health Sciences, Cleveland

More information

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 An Introduction to Causal Mediation Analysis Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 1 Causality In the applications of statistics, many central questions

More information

Tests for the Odds Ratio in a Matched Case-Control Design with a Quantitative X

Tests for the Odds Ratio in a Matched Case-Control Design with a Quantitative X Chapter 157 Tests for the Odds Ratio in a Matched Case-Control Design with a Quantitative X Introduction This procedure calculates the power and sample size necessary in a matched case-control study designed

More information

Propensity Score Analysis with Hierarchical Data

Propensity Score Analysis with Hierarchical Data Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational

More information

Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity. Propensity Score Analysis. Concepts and Issues. Chapter 1. Wei Pan Haiyan Bai

Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity. Propensity Score Analysis. Concepts and Issues. Chapter 1. Wei Pan Haiyan Bai Chapter 1 Propensity Score Analysis Concepts and Issues Wei Pan Haiyan Bai Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity score analysis, research using propensity score analysis

More information

Propensity Score Matching

Propensity Score Matching Methods James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 Methods 1 Introduction 2 3 4 Introduction Why Match? 5 Definition Methods and In

More information

What s New in Econometrics. Lecture 1

What s New in Econometrics. Lecture 1 What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and

More information

PEARL VS RUBIN (GELMAN)

PEARL VS RUBIN (GELMAN) PEARL VS RUBIN (GELMAN) AN EPIC battle between the Rubin Causal Model school (Gelman et al) AND the Structural Causal Model school (Pearl et al) a cursory overview Dokyun Lee WHO ARE THEY? Judea Pearl

More information

Flexible, Optimal Matching for Comparative Studies Using the optmatch package

Flexible, Optimal Matching for Comparative Studies Using the optmatch package Flexible, Optimal Matching for Comparative Studies Using the optmatch package Ben Hansen Statistics Department University of Michigan ben.b.hansen@umich.edu http://www.stat.lsa.umich.edu/~bbh 9 August

More information

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers: Use of Matching Methods for Causal Inference in Experimental and Observational Studies Kosuke Imai Department of Politics Princeton University April 27, 2007 Kosuke Imai (Princeton University) Matching

More information

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing Primal-dual Covariate Balance and Minimal Double Robustness via (Joint work with Daniel Percival) Department of Statistics, Stanford University JSM, August 9, 2015 Outline 1 2 3 1/18 Setting Rubin s causal

More information

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke

More information

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk Causal Inference in Observational Studies with Non-Binary reatments Statistics Section, Imperial College London Joint work with Shandong Zhao and Kosuke Imai Cass Business School, October 2013 Outline

More information

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014 Assess Assumptions and Sensitivity Analysis Fan Li March 26, 2014 Two Key Assumptions 1. Overlap: 0

More information

Targeted Maximum Likelihood Estimation in Safety Analysis

Targeted Maximum Likelihood Estimation in Safety Analysis Targeted Maximum Likelihood Estimation in Safety Analysis Sam Lendle 1 Bruce Fireman 2 Mark van der Laan 1 1 UC Berkeley 2 Kaiser Permanente ISPE Advanced Topics Session, Barcelona, August 2012 1 / 35

More information

Measuring Social Influence Without Bias

Measuring Social Influence Without Bias Measuring Social Influence Without Bias Annie Franco Bobbie NJ Macdonald December 9, 2015 The Problem CS224W: Final Paper How well can statistical models disentangle the effects of social influence from

More information

Summary and discussion of The central role of the propensity score in observational studies for causal effects

Summary and discussion of The central role of the propensity score in observational studies for causal effects Summary and discussion of The central role of the propensity score in observational studies for causal effects Statistics Journal Club, 36-825 Jessica Chemali and Michael Vespe 1 Summary 1.1 Background

More information

Analysis of propensity score approaches in difference-in-differences designs

Analysis of propensity score approaches in difference-in-differences designs Author: Diego A. Luna Bazaldua Institution: Lynch School of Education, Boston College Contact email: diego.lunabazaldua@bc.edu Conference section: Research methods Analysis of propensity score approaches

More information

A Note on Bayesian Inference After Multiple Imputation

A Note on Bayesian Inference After Multiple Imputation A Note on Bayesian Inference After Multiple Imputation Xiang Zhou and Jerome P. Reiter Abstract This article is aimed at practitioners who plan to use Bayesian inference on multiplyimputed datasets in

More information

Covariate Balancing Propensity Score for General Treatment Regimes

Covariate Balancing Propensity Score for General Treatment Regimes Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton University October 14, 2014 Talk at the Department of Psychiatry, Columbia University Joint work with Christian

More information

Matching for Causal Inference Without Balance Checking

Matching for Causal Inference Without Balance Checking Matching for ausal Inference Without Balance hecking Gary King Institute for Quantitative Social Science Harvard University joint work with Stefano M. Iacus (Univ. of Milan) and Giuseppe Porro (Univ. of

More information

Variable selection and machine learning methods in causal inference

Variable selection and machine learning methods in causal inference Variable selection and machine learning methods in causal inference Debashis Ghosh Department of Biostatistics and Informatics Colorado School of Public Health Joint work with Yeying Zhu, University of

More information

Vector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings

Vector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings Vector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings Jessica Lum, MA 1 Steven Pizer, PhD 1, 2 Melissa Garrido,

More information

Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018

Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018 Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018 Schedule and outline 1:00 Introduction and overview 1:15 Quasi-experimental vs. experimental designs

More information

The propensity score with continuous treatments

The propensity score with continuous treatments 7 The propensity score with continuous treatments Keisuke Hirano and Guido W. Imbens 1 7.1 Introduction Much of the work on propensity score analysis has focused on the case in which the treatment is binary.

More information

Propensity Score Matching and Genetic Matching : Monte Carlo Results

Propensity Score Matching and Genetic Matching : Monte Carlo Results Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS060) p.5391 Propensity Score Matching and Genetic Matching : Monte Carlo Results Donzé, Laurent University of Fribourg

More information

Confidence Intervals for the Interaction Odds Ratio in Logistic Regression with Two Binary X s

Confidence Intervals for the Interaction Odds Ratio in Logistic Regression with Two Binary X s Chapter 867 Confidence Intervals for the Interaction Odds Ratio in Logistic Regression with Two Binary X s Introduction Logistic regression expresses the relationship between a binary response variable

More information

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW SSC Annual Meeting, June 2015 Proceedings of the Survey Methods Section ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW Xichen She and Changbao Wu 1 ABSTRACT Ordinal responses are frequently involved

More information

Statistical Practice

Statistical Practice Statistical Practice A Note on Bayesian Inference After Multiple Imputation Xiang ZHOU and Jerome P. REITER This article is aimed at practitioners who plan to use Bayesian inference on multiply-imputed

More information

PROPENSITY SCORE MATCHING. Walter Leite

PROPENSITY SCORE MATCHING. Walter Leite PROPENSITY SCORE MATCHING Walter Leite 1 EXAMPLE Question: Does having a job that provides or subsidizes child care increate the length that working mothers breastfeed their children? Treatment: Working

More information

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census

More information

Propensity Score Analysis Using teffects in Stata. SOC 561 Programming for the Social Sciences Hyungjun Suh Apr

Propensity Score Analysis Using teffects in Stata. SOC 561 Programming for the Social Sciences Hyungjun Suh Apr Propensity Score Analysis Using teffects in Stata SOC 561 Programming for the Social Sciences Hyungjun Suh Apr. 25. 2016 Overview Motivation Propensity Score Weighting Propensity Score Matching with teffects

More information

Causal Inference Basics

Causal Inference Basics Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,

More information

Balancing Covariates via Propensity Score Weighting

Balancing Covariates via Propensity Score Weighting Balancing Covariates via Propensity Score Weighting Kari Lock Morgan Department of Statistics Penn State University klm47@psu.edu Stochastic Modeling and Computational Statistics Seminar October 17, 2014

More information

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers: Use of Matching Methods for Causal Inference in Experimental and Observational Studies Kosuke Imai Department of Politics Princeton University April 13, 2009 Kosuke Imai (Princeton University) Matching

More information

Calculating Effect-Sizes. David B. Wilson, PhD George Mason University

Calculating Effect-Sizes. David B. Wilson, PhD George Mason University Calculating Effect-Sizes David B. Wilson, PhD George Mason University The Heart and Soul of Meta-analysis: The Effect Size Meta-analysis shifts focus from statistical significance to the direction and

More information

Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies.

Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies. Institute for Clinical Evaluative Sciences From the SelectedWorks of Peter Austin 2011 Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions

More information

OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores

OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS 776 1 15 Outcome regressions and propensity scores Outcome Regression and Propensity Scores ( 15) Outline 15.1 Outcome regression 15.2 Propensity

More information

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect

More information

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies Section 9c Propensity scores Controlling for bias & confounding in observational studies 1 Logistic regression and propensity scores Consider comparing an outcome in two treatment groups: A vs B. In a

More information

Gov 2002: 5. Matching

Gov 2002: 5. Matching Gov 2002: 5. Matching Matthew Blackwell October 1, 2015 Where are we? Where are we going? Discussed randomized experiments, started talking about observational data. Last week: no unmeasured confounders

More information

Robust covariance estimator for small-sample adjustment in the generalized estimating equations: A simulation study

Robust covariance estimator for small-sample adjustment in the generalized estimating equations: A simulation study Science Journal of Applied Mathematics and Statistics 2014; 2(1): 20-25 Published online February 20, 2014 (http://www.sciencepublishinggroup.com/j/sjams) doi: 10.11648/j.sjams.20140201.13 Robust covariance

More information

Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated A Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

More information

Previous lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing.

Previous lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing. Previous lecture P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing. Interaction Outline: Definition of interaction Additive versus multiplicative

More information

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007)

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007) Double Robustness Bang and Robins (2005) Kang and Schafer (2007) Set-Up Assume throughout that treatment assignment is ignorable given covariates (similar to assumption that data are missing at random

More information

Discussion of Papers on the Extensions of Propensity Score

Discussion of Papers on the Extensions of Propensity Score Discussion of Papers on the Extensions of Propensity Score Kosuke Imai Princeton University August 3, 2010 Kosuke Imai (Princeton) Generalized Propensity Score 2010 JSM (Vancouver) 1 / 11 The Theme and

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu

More information

New Developments in Nonresponse Adjustment Methods

New Developments in Nonresponse Adjustment Methods New Developments in Nonresponse Adjustment Methods Fannie Cobben January 23, 2009 1 Introduction In this paper, we describe two relatively new techniques to adjust for (unit) nonresponse bias: The sample

More information

Propensity Score for Causal Inference of Multiple and Multivalued Treatments

Propensity Score for Causal Inference of Multiple and Multivalued Treatments Virginia Commonwealth University VCU Scholars Compass Theses and Dissertations Graduate School 2016 Propensity Score for Causal Inference of Multiple and Multivalued Treatments Zirui Gu Virginia Commonwealth

More information

APPENDIX 1 BASIC STATISTICS. Summarizing Data

APPENDIX 1 BASIC STATISTICS. Summarizing Data 1 APPENDIX 1 Figure A1.1: Normal Distribution BASIC STATISTICS The problem that we face in financial analysis today is not having too little information but too much. Making sense of large and often contradictory

More information

SIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS

SIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS SIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS TOMMASO NANNICINI universidad carlos iii de madrid UK Stata Users Group Meeting London, September 10, 2007 CONTENT Presentation of a Stata

More information

University of Michigan School of Public Health

University of Michigan School of Public Health University of Michigan School of Public Health The University of Michigan Department of Biostatistics Working Paper Series Year 2016 Paper 121 A Weighted Instrumental Variable Estimator to Control for

More information

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i. Weighting Unconfounded Homework 2 Describe imbalance direction matters STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University

More information

JOINTLY MODELING CONTINUOUS AND BINARY OUTCOMES FOR BOOLEAN OUTCOMES: AN APPLICATION TO MODELING HYPERTENSION

JOINTLY MODELING CONTINUOUS AND BINARY OUTCOMES FOR BOOLEAN OUTCOMES: AN APPLICATION TO MODELING HYPERTENSION Johns Hopkins University, Dept. of Biostatistics Working Papers 2-19-2008 JOINTLY MODELING CONTINUOUS AND BINARY OUTCOMES FOR BOOLEAN OUTCOMES: AN APPLICATION TO MODELING HYPERTENSION Xianbin Li Department

More information

Indirect estimation of a simultaneous limited dependent variable model for patient costs and outcome

Indirect estimation of a simultaneous limited dependent variable model for patient costs and outcome Indirect estimation of a simultaneous limited dependent variable model for patient costs and outcome Per Hjertstrand Research Institute of Industrial Economics (IFN), Stockholm, Sweden Per.Hjertstrand@ifn.se

More information

Combining multiple observational data sources to estimate causal eects

Combining multiple observational data sources to estimate causal eects Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,

More information

Strategy of Bayesian Propensity. Score Estimation Approach. in Observational Study

Strategy of Bayesian Propensity. Score Estimation Approach. in Observational Study Theoretical Mathematics & Applications, vol.2, no.3, 2012, 75-86 ISSN: 1792-9687 (print), 1792-9709 (online) Scienpress Ltd, 2012 Strategy of Bayesian Propensity Score Estimation Approach in Observational

More information

NISS. Technical Report Number 167 June 2007

NISS. Technical Report Number 167 June 2007 NISS Estimation of Propensity Scores Using Generalized Additive Models Mi-Ja Woo, Jerome Reiter and Alan F. Karr Technical Report Number 167 June 2007 National Institute of Statistical Sciences 19 T. W.

More information

Causal Analysis in Social Research

Causal Analysis in Social Research Causal Analysis in Social Research Walter R Davis National Institute of Applied Statistics Research Australia University of Wollongong Frontiers in Social Statistics Metholodogy 8 February 2017 Walter

More information

Memorial Sloan-Kettering Cancer Center

Memorial Sloan-Kettering Cancer Center Memorial Sloan-Kettering Cancer Center Memorial Sloan-Kettering Cancer Center, Dept. of Epidemiology & Biostatistics Working Paper Series Year 2006 Paper 5 Comparing the Predictive Values of Diagnostic

More information

Passing-Bablok Regression for Method Comparison

Passing-Bablok Regression for Method Comparison Chapter 313 Passing-Bablok Regression for Method Comparison Introduction Passing-Bablok regression for method comparison is a robust, nonparametric method for fitting a straight line to two-dimensional

More information

Imbens/Wooldridge, IRP Lecture Notes 2, August 08 1

Imbens/Wooldridge, IRP Lecture Notes 2, August 08 1 Imbens/Wooldridge, IRP Lecture Notes 2, August 08 IRP Lectures Madison, WI, August 2008 Lecture 2, Monday, Aug 4th, 0.00-.00am Estimation of Average Treatment Effects Under Unconfoundedness, Part II. Introduction

More information

Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary Treatments

Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary Treatments Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary reatments Shandong Zhao Department of Statistics, University of California, Irvine, CA 92697 shandonm@uci.edu

More information

Data Integration for Big Data Analysis for finite population inference

Data Integration for Big Data Analysis for finite population inference for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation

More information

Tests for the Odds Ratio of Two Proportions in a 2x2 Cross-Over Design

Tests for the Odds Ratio of Two Proportions in a 2x2 Cross-Over Design Chapter 170 Tests for the Odds Ratio of Two Proportions in a 2x2 Cross-Over Design Introduction Senn (2002) defines a cross-over design as one in which each subject receives all treatments and the objective

More information

Propensity Score Methods, Models and Adjustment

Propensity Score Methods, Models and Adjustment Propensity Score Methods, Models and Adjustment Dr David A. Stephens Department of Mathematics & Statistics McGill University Montreal, QC, Canada. d.stephens@math.mcgill.ca www.math.mcgill.ca/dstephens/siscr2016/

More information

Dynamics in Social Networks and Causality

Dynamics in Social Networks and Causality Web Science & Technologies University of Koblenz Landau, Germany Dynamics in Social Networks and Causality JProf. Dr. University Koblenz Landau GESIS Leibniz Institute for the Social Sciences Last Time:

More information

Incorporating published univariable associations in diagnostic and prognostic modeling

Incorporating published univariable associations in diagnostic and prognostic modeling Incorporating published univariable associations in diagnostic and prognostic modeling Thomas Debray Julius Center for Health Sciences and Primary Care University Medical Center Utrecht The Netherlands

More information

A Sampling of IMPACT Research:

A Sampling of IMPACT Research: A Sampling of IMPACT Research: Methods for Analysis with Dropout and Identifying Optimal Treatment Regimes Marie Davidian Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

In Praise of the Listwise-Deletion Method (Perhaps with Reweighting)

In Praise of the Listwise-Deletion Method (Perhaps with Reweighting) In Praise of the Listwise-Deletion Method (Perhaps with Reweighting) Phillip S. Kott RTI International NISS Worshop on the Analysis of Complex Survey Data With Missing Item Values October 17, 2014 1 RTI

More information

Journal of Biostatistics and Epidemiology

Journal of Biostatistics and Epidemiology Journal of Biostatistics and Epidemiology Methodology Marginal versus conditional causal effects Kazem Mohammad 1, Seyed Saeed Hashemi-Nazari 2, Nasrin Mansournia 3, Mohammad Ali Mansournia 1* 1 Department

More information

Sensitivity analysis and distributional assumptions

Sensitivity analysis and distributional assumptions Sensitivity analysis and distributional assumptions Tyler J. VanderWeele Department of Health Studies, University of Chicago 5841 South Maryland Avenue, MC 2007, Chicago, IL 60637, USA vanderweele@uchicago.edu

More information

Confidence Intervals for the Odds Ratio in Logistic Regression with Two Binary X s

Confidence Intervals for the Odds Ratio in Logistic Regression with Two Binary X s Chapter 866 Confidence Intervals for the Odds Ratio in Logistic Regression with Two Binary X s Introduction Logistic regression expresses the relationship between a binary response variable and one or

More information

An Empirical Comparison of Multiple Imputation Approaches for Treating Missing Data in Observational Studies

An Empirical Comparison of Multiple Imputation Approaches for Treating Missing Data in Observational Studies Paper 177-2015 An Empirical Comparison of Multiple Imputation Approaches for Treating Missing Data in Observational Studies Yan Wang, Seang-Hwane Joo, Patricia Rodríguez de Gil, Jeffrey D. Kromrey, Rheta

More information

Causal modelling in Medical Research

Causal modelling in Medical Research Causal modelling in Medical Research Debashis Ghosh Department of Biostatistics and Informatics, Colorado School of Public Health Biostatistics Workshop Series Goals for today Introduction to Potential

More information

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai Weighting Methods Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Weighting Methods Stat186/Gov2002 Fall 2018 1 / 13 Motivation Matching methods for improving

More information

Penalized Spline of Propensity Methods for Missing Data and Causal Inference. Roderick Little

Penalized Spline of Propensity Methods for Missing Data and Causal Inference. Roderick Little Penalized Spline of Propensity Methods for Missing Data and Causal Inference Roderick Little A Tail of Two Statisticians (but who s tailing who) Taylor Little Cambridge U 978 BA Math ( st class) 97 BA

More information

Impact of covariate misclassification on the power and type I error in clinical trials using covariate-adaptive randomization

Impact of covariate misclassification on the power and type I error in clinical trials using covariate-adaptive randomization Impact of covariate misclassification on the power and type I error in clinical trials using covariate-adaptive randomization L I Q I O N G F A N S H A R O N D. Y E A T T S W E N L E Z H A O M E D I C

More information

Integrated approaches for analysis of cluster randomised trials

Integrated approaches for analysis of cluster randomised trials Integrated approaches for analysis of cluster randomised trials Invited Session 4.1 - Recent developments in CRTs Joint work with L. Turner, F. Li, J. Gallis and D. Murray Mélanie PRAGUE - SCT 2017 - Liverpool

More information

Ignoring the matching variables in cohort studies - when is it valid, and why?

Ignoring the matching variables in cohort studies - when is it valid, and why? Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association

More information

Exogenous Treatment and Endogenous Factors: Vanishing of Omitted Variable Bias on the Interaction Term

Exogenous Treatment and Endogenous Factors: Vanishing of Omitted Variable Bias on the Interaction Term D I S C U S S I O N P A P E R S E R I E S IZA DP No. 6282 Exogenous Treatment and Endogenous Factors: Vanishing of Omitted Variable Bias on the Interaction Term Olena Nizalova Irina Murtazashvili January

More information

Lecture 15 (Part 2): Logistic Regression & Common Odds Ratio, (With Simulations)

Lecture 15 (Part 2): Logistic Regression & Common Odds Ratio, (With Simulations) Lecture 15 (Part 2): Logistic Regression & Common Odds Ratio, (With Simulations) Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology

More information

Stratified Randomized Experiments

Stratified Randomized Experiments Stratified Randomized Experiments Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Stratified Randomized Experiments Stat186/Gov2002 Fall 2018 1 / 13 Blocking

More information

Matching with Multiple Control Groups, and Adjusting for Group Differences

Matching with Multiple Control Groups, and Adjusting for Group Differences Matching with Multiple Control Groups, and Adjusting for Group Differences Donald B. Rubin, Harvard University Elizabeth A. Stuart, Mathematica Policy Research, Inc. estuart@mathematica-mpr.com KEY WORDS:

More information

Nuisance parameter elimination for proportional likelihood ratio models with nonignorable missingness and random truncation

Nuisance parameter elimination for proportional likelihood ratio models with nonignorable missingness and random truncation Biometrika Advance Access published October 24, 202 Biometrika (202), pp. 8 C 202 Biometrika rust Printed in Great Britain doi: 0.093/biomet/ass056 Nuisance parameter elimination for proportional likelihood

More information

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY Ingo Langner 1, Ralf Bender 2, Rebecca Lenz-Tönjes 1, Helmut Küchenhoff 2, Maria Blettner 2 1

More information

Asymptotic equivalence of paired Hotelling test and conditional logistic regression

Asymptotic equivalence of paired Hotelling test and conditional logistic regression Asymptotic equivalence of paired Hotelling test and conditional logistic regression Félix Balazard 1,2 arxiv:1610.06774v1 [math.st] 21 Oct 2016 Abstract 1 Sorbonne Universités, UPMC Univ Paris 06, CNRS

More information

Controlling for overlap in matching

Controlling for overlap in matching Working Papers No. 10/2013 (95) PAWEŁ STRAWIŃSKI Controlling for overlap in matching Warsaw 2013 Controlling for overlap in matching PAWEŁ STRAWIŃSKI Faculty of Economic Sciences, University of Warsaw

More information

e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls

e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls under the restrictions of the copyright, in particular

More information

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical

More information