Randomization-Based Inference With Complex Data Need Not Be Complex!

Similar documents
Assessing Time-Varying Causal Effect. Moderation

Time-Varying Causal. Treatment Effects

Micro-Randomized Trials & mhealth. S.A. Murphy NRC 8/2014

Robustness of the Contextual Bandit Algorithm to A Physical Activity Motivation Effect

Matching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14

Selection on Observables: Propensity Score Matching.

SEQUENTIAL MULTIPLE ASSIGNMENT RANDOMIZATION TRIALS WITH ENRICHMENT (SMARTER) DESIGN

Robust Monte Carlo Methods for Sequential Planning and Decision Making

Robustifying Trial-Derived Treatment Rules to a Target Population

Propensity Score Methods for Causal Inference

AGEC 661 Note Fourteen

An Online Actor Critic Algorithm and a Statistical Decision Procedure for Personalizing Intervention

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Lecture 2: Constant Treatment Strategies. Donglin Zeng, Department of Biostatistics, University of North Carolina

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Core Courses for Students Who Enrolled Prior to Fall 2018

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Assessing Time-Varying Causal Effect Moderation in Mobile Health

Lecture 1 Introduction to Multi-level Models

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

Adaptive Trial Designs

Empirical approaches in public economics

multilevel modeling: concepts, applications and interpretations

Probability and statistics; Rehearsal for pattern recognition

Quantitative Economics for the Evaluation of the European Policy

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL

Lecture 9: Learning Optimal Dynamic Treatment Regimes. Donglin Zeng, Department of Biostatistics, University of North Carolina

2.3 Estimating PDFs and PDF Parameters

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

STATISTICS 4, S4 (4769) A2

Comparative effectiveness of dynamic treatment regimes

Causality and Experiments

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)

PROBABILITY AND STOCHASTIC PROCESSES A Friendly Introduction for Electrical and Computer Engineers

Using Instrumental Variables to Find Causal Effects in Public Health

Technical Track Session I:

Variable selection and machine learning methods in causal inference

Be able to define the following terms and answer basic questions about them:

PHYS 6710: Nuclear and Particle Physics II

Probability, Random Processes and Inference

Empirical Bayes Moderation of Asymptotically Linear Parameters

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

HHS Public Access Author manuscript Stat Med. Author manuscript; available in PMC 2017 May 30.

Estimating Causal Effects of Organ Transplantation Treatment Regimes

Vector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings

Estimation Considerations in Contextual Bandits

Combining multiple observational data sources to estimate causal eects

Simulation. Where real stuff starts

Technical Track Session I: Causal Inference

Estimating Optimal Dynamic Treatment Regimes from Clustered Data

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

The risk of machine learning

Targeted Maximum Likelihood Estimation for Adaptive Designs: Adaptive Randomization in Community Randomized Trial

Discrete Random Variable

Specification Errors, Measurement Errors, Confounding

We would like to describe this population. Central tendency (mean) Variability (standard deviation) ( X X ) 2 N

Reinforcement Learning Wrap-up

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Time Series Methods. Sanjaya Desilva

Math Review Sheet, Fall 2008

CS 4649/7649 Robot Intelligence: Planning

A Measure of Robustness to Misspecification

Discussion of Papers on the Extensions of Propensity Score

A Sampling of IMPACT Research:

Module 9: Sampling IPDET. Sampling. Intro Concepts Types Confidence/ Precision? How Large? Intervention or Policy. Evaluation Questions

STATISTICAL INFERENCE FOR SURVEY DATA ANALYSIS

Semiparametric Generalized Linear Models

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

ARTIFICIAL INTELLIGENCE. Reinforcement learning

Propensity Score Analysis with Hierarchical Data

Lecture 13: Covariance. Lisa Yan July 25, 2018

Machine Learning, Fall 2009: Midterm

Probabilistic Machine Learning

DISCRETE STOCHASTIC PROCESSES Draft of 2nd Edition

Rerandomization to Balance Covariates

Lecture Topic 4: Chapter 7 Sampling and Sampling Distributions

Human-Oriented Robotics. Probability Refresher. Kai Arras Social Robotics Lab, University of Freiburg Winter term 2014/2015

ECON Introductory Econometrics. Lecture 17: Experiments

Output Data Analysis for a Single System

Lecture 25. Ingo Ruczinski. November 24, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University

Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts?

Estimating the Technology of Children s Skill Formation

STAT 302 Introduction to Probability Learning Outcomes. Textbook: A First Course in Probability by Sheldon Ross, 8 th ed.

CSE 546 Final Exam, Autumn 2013

Probability Theory for Machine Learning. Chris Cremer September 2015

Comparing Adaptive Interventions Using Data Arising from a SMART: With Application to Autism, ADHD, and Mood Disorders

What is to be done? Two attempts using Gaussian process priors

Review of probability and statistics 1 / 31

The Supervised Learning Approach To Estimating Heterogeneous Causal Regime Effects

Linear Dynamical Systems

Respecting Markov Equivalence in Computing Posterior Probabilities of Causal Graphical Features

Inference for Average Treatment Effects

Personalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Causal inference in multilevel data structures:

Comments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006

STA414/2104. Lecture 11: Gaussian Processes. Department of Statistics

Fundamental Probability and Statistics

Transcription:

Randomization-Based Inference With Complex Data Need Not Be Complex! JITAIs JITAIs Susan Murphy 07.18.17 HeartSteps JITAI JITAIs

Sequential Decision Making Use data to inform science and construct decision making policies Mobile Health: determine, repeatedly over time, whether a mobile device/wearable should deliver a treatment &, if so, which type of treatment to deliver. 2

Data On each individual:,,,,,,, t: Decision point (possible treatment delivery times) : All (sensor, self report) observations up to and at t th decision point (high dimensional) : Treatment at t th decision point (actions, controls) : Proximal response (e.g., reward, utility) 3

Micro-Randomized Trial 4

Micro-Randomized Trial If a participant is available at each decision point, t, randomize between treatments. Pre-specified algorithm for the randomization probability: P[A t =a H t, I t =1] = p t (H t ) I t =1 if available, I t =0 if not H t denotes data on participant through t 5

Causal Effects Assess causal effects of tailored activity suggestion on subsequent 30 min. step count. Above is a very vague statement! 6

Potential Outcomes =,,, (random treatments), =,,, (realizations of treatments) is one potential proximal response is one potential available for treatment indicator is one potential history vector is a vector of features of history 7

Marginal, Causal Excursion Effects Excursion Effect at decision point t:,1,0 = 1, Effect is marginal over any,,, < not in ---over all variables not in. Effect is conditional on availability; only concerns the subpopulation of individuals available at decision t 8

B,1,0 = 1, = = Consistency & Micro-Randomized A t = 1, = 1, = 0, = 1, = 1,! 1 1! = 1, (! is randomization probability) 9

Marginal Excursion Effect Model Excursion Effect: = 1, = 1, = 0, = 1, = 1, Assume low dimensional structure for above effect: " # We aim to conduct inference about #! is participant s data up to and at time t is a vector of data summaries and time, t, ( ) indicator of availability 10

Centered and Weighted Least Squares Estimation Simple method for complex data Enables unbiased inference for a causal treatment effect (the β s) Inference for treatment effect is not biased by how we use covariates to reduce the noise variance in Y t+1 https://arxiv.org/abs/1601.00237 11

Estimation Steps 1. Select probabilities:!' ( 0,1 2. Form weights: * = +', -, +,., /, +', -, +,., /, 3. Center treatment actions:!' 4. Form working model: = 1, 2 " 3 5. Minimize over #, 3 : b 4 " 2 " 3!' " # 6 * 4 is empirical distribution over individuals. 12

" Minimize 4 7 2 " 3!' 6 " # * 4 is expectation with respect to empirical distribution Incorrect intuition: We are *not* assuming that, = 1,2, = 2 " 3 +!' " # 13

Minimize " 4 7 2 " 3!' 6 " # * If!' depends, at most, on features in and = 1, = 1, = 0, = 1, = 1, = " # < then, under moment conditions, #= is consistent for # < and > #= # < has a limiting normal distribution. 14

Choice of Weights If simple model, " # <, does not hold, then choice of!' determines the estimand. Example: = 1,!' =!'. Resulting #= is an estimator of where " " 7 #? 7 6 6 # = = 1, = 1, = 0, = 1, = 1 15

Gains from Randomization Causal inference for a marginal treatment effect Inference on treatment effect is robust to **working** model: = 1, 2 " 3 2 is a vector of features from the history, 16

On each of n=37 participants: Tailored activity suggestion Provide a suggestion with probability.6 Do nothing with probability=.4 5 times per day * 42 days= 210 randomizations per participant 17

Conceptual Models = 1, = 1, = 0, = 1, = 1 = β 0 = 1, = 1, = 0, = 1, = 1 = β 0 + β 1 d t t=1, T=210 Y t+1 = log-transformed step count in the 30 minutes after the t th decision point, A t = 1 if an activity suggestion is delivered at the t th decision point; A t = 0, otherwise, Z t = log-transformed step count in the 30 minutes prior to the t th decision point, (use to reduce variance) d t =days in study; takes values in (0,1,.,41) 18

Pilot Study Analysis = 1, = 1, = 0, = 1, = 1 = β 0 & = 1, = 1, = 0, = 1, = 1 = β 0 + β 1 d t Causal Effect Term Estimate 95% CI p-value β 0 A t (effect of an activity suggestion) β 0 A t + β 1 A t d t (time trend in effect of an activity suggestion) β@ < =.13 (-0.01, 0.27).06 β@ < =.51 (.20,.81) <.01 β@ = -.02 (-.03, -.01) <.01 19

Non-stationarity The data indicates that there is a causal effect of the activity suggestion vs no activity suggestion on step count in the succeeding 30 minutes. This effect deteriorates with time The walking activity suggestion initially increases step count over succeeding 30 minutes by 171 steps but by day 21 this increase is only 35 steps. 20

Discussion Problematic Analyses GLM & GEE analyses Random effects models & analyses Machine Learning Generalizations: Partially linear, single index models & analysis Varying coefficient models & analysis --These analyses do not take advantage of the microrandomization. Can accidentally eliminate the advantages of randomization for estimating causal effects-- 21

Discussion Randomization enhances: Causal inference based on minimal structural assumptions Challenge: How to include random effects which reflect scientific understanding ( person-specific effects) yet not destroy causal inference? 22

Collaborators! 23