Deductive Derivation and Computerization of Semiparametric Efficient Estimation

Similar documents
Semiparametric Estimation in Observational Studies and. Randomized Trials. Tianchen Qian

Combining multiple observational data sources to estimate causal eects

Propensity Score Analysis with Hierarchical Data

A Sampling of IMPACT Research:

Propensity Score Weighting with Multilevel Data

Targeted Maximum Likelihood Estimation in Safety Analysis

Robustness to Parametric Assumptions in Missing Data Models

Causal Inference Basics

SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION

Gov 2002: 5. Matching

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

University of California, Berkeley

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity

What s New in Econometrics. Lecture 1

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Targeted Group Sequential Adaptive Designs

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

University of California, Berkeley

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Propensity Score Methods for Causal Inference

Balancing Covariates via Propensity Score Weighting

Empirical Bayes Moderation of Asymptotically Linear Parameters

Flexible Estimation of Treatment Effect Parameters

University of California, Berkeley

Topics and Papers for Spring 14 RIT

Targeted Maximum Likelihood Estimation for Adaptive Designs: Adaptive Randomization in Community Randomized Trial

Adaptive Trial Designs

Bayesian causal forests: dealing with regularization induced confounding and shrinking towards homogeneous effects

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai

Marginal Screening and Post-Selection Inference

Fair Inference Through Semiparametric-Efficient Estimation Over Constraint-Specific Paths

Ridge regression. Patrick Breheny. February 8. Penalized regression Ridge regression Bayesian interpretation

High Dimensional Propensity Score Estimation via Covariate Balancing

Statistical Inference for Data Adaptive Target Parameters

Covariate Balancing Propensity Score for General Treatment Regimes

Bounds on Causal Effects in Three-Arm Trials with Non-compliance. Jing Cheng Dylan Small

Cross-fitting and fast remainder rates for semiparametric estimation

Balancing Covariates via Propensity Score Weighting: The Overlap Weights

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007)

Variable selection and machine learning methods in causal inference

University of California, Berkeley

University of California, Berkeley

Targeted Learning for High-Dimensional Variable Importance

Identification and Inference in Causal Mediation Analysis

Semi-Nonparametric Inferences for Massive Data

Lecture 8. October 22, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls

Achieving Optimal Covariate Balance Under General Treatment Regimes

Empirical Bayes Moderation of Asymptotically Linear Parameters

IMPROVING POWER IN GROUP SEQUENTIAL, RANDOMIZED TRIALS BY ADJUSTING FOR PROGNOSTIC BASELINE VARIABLES AND SHORT-TERM OUTCOMES

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks

Estimation of Optimal Treatment Regimes Via Machine Learning. Marie Davidian

Stratified Randomized Experiments

5 Methods Based on Inverse Probability Weighting Under MAR

Selection on Observables: Propensity Score Matching.

Single Index Quantile Regression for Heteroscedastic Data

Statistical Applications in Genetics and Molecular Biology

Bootstrapping Sensitivity Analysis

arxiv: v1 [stat.me] 15 May 2011

Using Estimating Equations for Spatially Correlated A

Section 10: Inverse propensity score weighting (IPSW)

Statistical Analysis of Causal Mechanisms

Selective Inference for Effect Modification

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

General Doubly Robust Identification and Estimation

Web-based Supplementary Materials for A Robust Method for Estimating. Optimal Treatment Regimes

Identification Analysis for Randomized Experiments with Noncompliance and Truncation-by-Death

Constrained Maximum Likelihood Estimation for Model Calibration Using Summary-level Information from External Big Data Sources

Potential Outcomes Model (POM)

Introduction to Statistical Inference

Chapter 11. Regression with a Binary Dependent Variable

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

University of California, Berkeley

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Truncated Regression Model and Nonparametric Estimation for Gifted and Talented Education Program

UNIVERSITY OF TORONTO Faculty of Arts and Science

Simulation-based robust IV inference for lifetime data

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Locally Robust Semiparametric Estimation

Statistical Methods for Alzheimer s Disease Studies

Propensity Score Methods, Models and Adjustment

Construction and statistical analysis of adaptive group sequential designs for randomized clinical trials

Single Index Quantile Regression for Heteroscedastic Data

Semiparametric Models and Estimators

University of California, Berkeley

ENHANCED PRECISION IN THE ANALYSIS OF RANDOMIZED TRIALS WITH ORDINAL OUTCOMES

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk

Math 494: Mathematical Statistics

Causal Inference with Big Data Sets

Statistical Analysis of Causal Mechanisms

Econometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction

Lecture 7: Interaction Analysis. Summer Institute in Statistical Genetics 2017

Causal Inference with Measurement Error

Longitudinal + Reliability = Joint Modeling

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

7 Sensitivity Analysis

3 Comparison with Other Dummy Variable Methods

A note on L convergence of Neumann series approximation in missing data problems

Transcription:

Deductive Derivation and Computerization of Semiparametric Efficient Estimation Constantine Frangakis, Tianchen Qian, Zhenke Wu, and Ivan Diaz Department of Biostatistics Johns Hopkins Bloomberg School of Public Health November 10, 2014 Deductive & Compatible Efficient Inference November 10, 2014 1 / 21

Motivation Researchers often seek robust inference for a parameter using semiparametric estimation Example: estimating population mean µ = E[Y ] with self-selective treatments; other model parts unspecified A much-discussed estimator: doubly-robust estimator (Y: outcome; X: covariates; R: binary treatment indicator) 1 Consistency: if [R X ], or [Y R, X ] is correct 2 Locally efficient: has minimum possible asymptotic variance if both [R X ], and [Y R, X ] are correct. Deductive & Compatible Efficient Inference November 10, 2014 2 / 21

Motivation Take a step back: What is the process to arrive at such a doubly robust estimate? 1 Needs to know the closed form of estimating equations. Usually use the efficient influence function (EIF) for minimal asymptotic variance, 2 Identify parts of the model specification that are required in EIF. For example, the propensity score and outcome regression model. 3 Specify and fit the working models as guided by step 2 4 Solve for the zeros of the empirical version of EIF, after plugging in the fitted working models. 5 Hope the working models are close to truth (there are robust estimators in the presence of this deviations, e.g., Cao, Tsiatis, and Davidian, 2009) Deductive & Compatible Efficient Inference November 10, 2014 3 / 21

Motivation QUESTION: What if, in Step 1, the EIF is hard to derive mathematically? This can happen if one is to estimate population median, or other quantiles, or other general wild estimand. Can we specify the model and leave the efficient estimation to computer, like Stan or WinBUGS that takes in model specification, and outputs inferential results, there the posterior? Deductive & Compatible Efficient Inference November 10, 2014 4 / 21

Start with an example: Two-phase design and goal The Doubly-Robust Estimator Two-stage sampling: Collect a set of covariates X i, i = 1,..., n Based on X i, choose a subset of the first-stage subjects to measure their outcomes Y i. R i = 1 for those sampled in the second stage; 0 for those not included in the second stage. Observed data: (X i, Y i R i, R i ), i = 1,..., n, iid Commonly adopted if Y i is very expensive or invasive to measure Goal: Estimate E[Y ] in the presence of covariate-dependent selection into the second-stage samples Deductive & Compatible Efficient Inference November 10, 2014 5 / 21

Example continued 1 We know the mathematical form of EIF, φ, is: φ{d i, (F τ), τ} = R i {Y i y(x i )} e(x i ) 2 Here we directly sove for τ following Steps 2-4: ˆτ nondeductive = 1 n i R i {Y i y w (X i )} e w (X i ) 3 Our motivating question embodied in this problem: What if (1) is unknown to us? + y(x i ) τ, (1) + y w (X i ); (2) Deductive & Compatible Efficient Inference November 10, 2014 6 / 21

A deductive and compatible (DC) efficient estimation algorithm Deductive : use a discrete and finite set of instructions, and, for every input, finish in discrete finite steps. (Turing, 1937) Compatible : ensure estimates lie in correct range, e.g. E[Y ] = Pr(Y = 1) will always be estimated within [0, 1] if Y is binary Efficient : smallest possible asymptotic variance Deductive & Compatible Efficient Inference November 10, 2014 7 / 21

Inspired by EIF for unrestricted problem (e.g. mean outcome estimation without other model specifications) can be written in general as a Gateaux derivative (Hampel, 1974) Reduction to the absurd. Use compatibility constraints to overcome the major problem of numerical computing of EIF that masks EIF s functional dependency on the target parameter. Deductive & Compatible Efficient Inference November 10, 2014 8 / 21

Details of the DC-estimation procedure Step 1 Extend the working distribution F w to a parametric model, say, F w (δ), around F w (i.e., so that F w (0) = F w ), where δ is a finite dimensional vector. In practice, leave those reliable parts unmodifed, e.g., propensity score elicited by physicians. Deductive & Compatible Efficient Inference November 10, 2014 9 / 21

Details of the DC-estimation procedure Step 2 Calculate Gateaux numerical difference derivative Use the Gateaux numerical difference derivative Gateaux{τ, F w (δ), D i, ɛ} := [ τ{f w(di,ɛ)(δ)} τ{f w (δ)} ] /ɛ for a machine-small ɛ, to deduce the value of φ{d i, F w (δ)} for arbitrary δ, and find ˆδ opt that minimizes the empirical variance of τ{f w (ˆδ)} (3) among all roots {ˆδ} that are subject to the condition i [ ] φ{d i, F w (ˆδ)} Gateaux{τ, F w (ˆδ), D i, ɛ} = 0, (4) Deductive & Compatible Efficient Inference November 10, 2014 10 / 21

Details of the DC-estimation procedure Step 3 Calculate the parameter at the EIF-fitted distribution F w (ˆδ) as ˆτ deductive := τ{f w (ˆδ opt )}. (5) Empirical variance estimated by jacknife. Deductive & Compatible Efficient Inference November 10, 2014 11 / 21

Properties 1 Consistent as the usual, non-deductive estimators (e.g. Scharfstein et al., 1999), if a variation independent component of F is specified correctly (i.e., locally semiparametrically efficient) 2 Asymptotically normal under regularity conditions 3 Does not need functional form of efficient influence function, φ, but deduce it by numerical Gateaux derivative 4 Extends working model and performs empirical minimization (similar to Chaffee and van der Laan (2011); Rubin and van der Laan (2008)). Deductive & Compatible Efficient Inference November 10, 2014 12 / 21

Feasibility: the two-phase design problem again Asthma care study (Huang et al, 2005) Mailed survey of patients from 20 California physician groups between July 1998 and Feburary 1999 to assess physician group performance Different distributions of characteristics across groups Outcome (Y ): patient satisfaction for asthma care (binary, yes/no) Covariates (X ): age, gender, race, education, health insurance, drug insurance coverage, asthma severity, number of comorbidities, and SF=36 physical and mental scores. Goal: Compare rates of patient satisfaction for asthma care, accounting for selective behaviors into physician groups Deductive & Compatible Efficient Inference November 10, 2014 13 / 21

Asthma care study continued To estimate the probability of patient satisfaction adjusted for covariates for two physician group pairs. We compare the following three estimates: 1 unadjusted analysis 2 adjusted analysis, non-deductive estimation (this talk) 3 adjusted adjusted, deductive estimation Deductive & Compatible Efficient Inference November 10, 2014 14 / 21

Asthma care study continued Comparisons Deductive & Compatible Efficient Inference November 10, 2014 15 / 21

Implications Usual doubly-robust estimator can be re-expressed compatibly by the set of parameter values derived by the deductive estimator Increased common support gives further evidence that deductive estimator gives similar estimates as from non-deductive estimators No form of efficient influence function necessary, and can be computed in < 1s Deductive & Compatible Efficient Inference November 10, 2014 16 / 21

Extension beyond mean outcome Example: median for continuous outcomes We can as well calculate numerical Gateaux derivative Following the DC-estimation procedure, gives locally semiparametric efficient estimator No other implemented semiparametrically efficient estimator Simulation studies shows consistency Feasible for general estimand Deductive & Compatible Efficient Inference November 10, 2014 17 / 21

Main points once again 1 Proposed a deductive and compatible (DC) procedure to give locally semiparametrically efficient estimates 2 Works for general estimand beyond population mean 3 Relies on numerical methods for differentiation and root finding 4 Save dramatic amounts of human efforts on essentially computerizable processes, and minimize errors. Deductive & Compatible Efficient Inference November 10, 2014 18 / 21

Discussions Extentions to restricted problems: Here, in the model specification, we did not constraint the form of how mean outcome depends on covariates; only working models were used as intermediate tools What if E[Y i X i ] = β X i is true and other model parts are unspecified? Our goal is to infer β When X s are discrete, we have shown that efficiency bound of β can be achieved by 1 Using proposed method for unrestricted problem, and 2 Imposing the linear restrictions numerically Deductive & Compatible Efficient Inference November 10, 2014 19 / 21

Discussions Computation: standard root finding methods may be unstable: use targeted MLE to transform root finding problem to iterative loss minimization problem. Deductive & Compatible Efficient Inference November 10, 2014 20 / 21

Thank you! Deductive & Compatible Efficient Inference November 10, 2014 21 / 21