Methods Lunch Talk: Causal Mediation Analysis

Similar documents
Chapter 9: Statistical Inference and the Relationship between Two Variables

Statistics for Economics & Business

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Statistics for Business and Economics

Heterogeneous Treatment Effect Analysis

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

The Ordinary Least Squares (OLS) Estimator

January Examinations 2015

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

Comparison of Regression Lines

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Standardized Simple Mediation Model: A Numerical Example

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Chapter 11: Simple Linear Regression and Correlation

Tests of Exclusion Restrictions on Regression Coefficients: Formulation and Interpretation

Lecture 6: Introduction to Linear Regression

Introduction to Regression

Chapter 5 Multilevel Models

Continuous vs. Discrete Goods

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

Basic Business Statistics, 10/e

Andreas C. Drichoutis Agriculural University of Athens. Abstract

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

III. Econometric Methodology Regression Analysis

Statistical tools to perform Sensitivity Analysis in the Context of the Evaluation of Measurement Uncertainty

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Chapter 15 - Multiple Regression

On Outlier Robust Small Area Mean Estimate Based on Prediction of Empirical Distribution Function

Limited Dependent Variables and Panel Data. Tibor Hanappi

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Uncertainty as the Overlap of Alternate Conditional Distributions

A Simple Inventory System

A Robust Method for Calculating the Correlation Coefficient

Homework Assignment 3 Due in class, Thursday October 15

Motion Perception Under Uncertainty. Hongjing Lu Department of Psychology University of Hong Kong

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling

Advances in Longitudinal Methods in the Social and Behavioral Sciences. Finite Mixtures of Nonlinear Mixed-Effects Models.

DECADAL DECLINE ( )OF LOGGERHEAD SHRIKES ON CHRISTMAS BIRD COUNTS IN ALABAMA, MISSISSIPPI, AND TENNESSEE

The Relationship between Factor Analytic and Item Response Models

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

e i is a random error

x i1 =1 for all i (the constant ).

β0 + β1xi and want to estimate the unknown

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

1 Introduction. Mediation Analysis for Count and Zero-inflated Count Data

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT

Kernel Methods and SVMs Extension

Phillippe G Leite, HDNSP-SSN team Anna Fruttero, LCSHS

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

This column is a continuation of our previous column

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2017 Instructor: Victor Aguirregabiria

Basically, if you have a dummy dependent variable you will be estimating a probability.

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

Multilevel Logistic Regression for Polytomous Data and Rankings

Empirical Methods for Corporate Finance. Identification

Explaining the Stein Paradox

Semiparametric Estimation of Treatment Effect in a Pretest-Posttest Study

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

T E C O L O T E R E S E A R C H, I N C.

Chapter 3. Two-Variable Regression Model: The Problem of Estimation

Linear Regression Analysis: Terminology and Notation

Chapter 13: Multiple Regression

Multinomial logit regression

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

Polynomial Regression Models

Negative Binomial Regression

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

a. (All your answers should be in the letter!

Cokriging Partial Grades - Application to Block Modeling of Copper Deposits

Chapter 8 Indicator Variables

Using Multivariate Rank Sum Tests to Evaluate Effectiveness of Computer Applications in Teaching Business Statistics

Homework 9 STAT 530/J530 November 22 nd, 2005

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

A Comparative Study for Estimation Parameters in Panel Data Model

Lecture 19. Endogenous Regressors and Instrumental Variables

Chapter 7 - Modeling Issues

Limited Dependent Variables

Diagnostics in Poisson Regression. Models - Residual Analysis

Econometrics of Panel Data

Effective plots to assess bias and precision in method comparison studies

Online Appendix to: Axiomatization and measurement of Quasi-hyperbolic Discounting

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

ASYMPTOTIC PROPERTIES OF ESTIMATES FOR THE PARAMETERS IN THE LOGISTIC REGRESSION MODEL

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

Econometrics: What's It All About, Alfie?

Some basic statistics and curve fitting techniques

Linear Approximation with Regularization and Moving Least Squares

experimenteel en correlationeel onderzoek

MLE and Bayesian Estimation. Jie Tang Department of Computer Science & Technology Tsinghua University 2012

STAT 3008 Applied Regression Analysis

Lab 4: Two-level Random Intercept Model

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek

Outline. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture k r Factorial Designs with Replication

Applications of GEE Methodology Using the SAS System

Transcription:

Methods Lunch Talk: Causal Medaton Analyss Taeyong Park Washngton Unversty n St. Lous Aprl 9, 2015 Park (Wash U.) Methods Lunch Aprl 9, 2015 1 / 1

References Baron and Kenny. 1986. The Moderator-Medator Varable Dstncton n Socal Psychologcal Research: Conceptual, Strategc, and Statstcal Consderatons. Journal of Personalty and Socal Psychology. MacKnnon. 2008. Introducton to Statstcal Medaton Analyss. NY: Routledge. Ima et al. 2010a. Identfcaton, Inference, and Senstvty Analyss for Causal Medaton Effects. Socal Scence. Ima et al. 2010b. A General Approach to Causal Medaton Analyss. Psychologcal Methods. Ima et al. 2011. Unpackng the Black Box of Causalty: Learnng about Causal Mechansms from Expermental and Observatonal Studes. Amercan Poltcal Scence Revew. Park (Wash U.) Methods Lunch Aprl 9, 2015 2 / 1

Overvew 1. What does a causal mechansm mean? 2. Methods for nvestgatng causal mechansms Causal medaton analyss based on the structural equaton modelng framework based on the potental outcomes framework 3. The potental outcomes approach: dentfyng and estmatng causal mechansms 4. Emprcal applcatons Park (Wash U.) Methods Lunch Aprl 9, 2015 3 / 1

What does a causal mechansm mean? Researchers are nterested n whether an varable has an mpact on another varable (causal effects) how the mpact operates (causal mechansm) Example: A job tranng program Job seekers mental health How, or why, does the program affect the mental health? One possble explanaton: attendng the program enhances partcpants confdence n ther ablty to search for a job better mental health job search self-effcacy ntermedates between the job program and job seekers mental health Park (Wash U.) Methods Lunch Aprl 9, 2015 4 / 1

What does a causal mechansm mean? Self effcacy Job program Mental health Park (Wash U.) Methods Lunch Aprl 9, 2015 5 / 1

What does a causal mechansm mean? Total effect = Medaton effect + Drect effect Medator Medaton effect Treatment Outcome Drect effect Park (Wash U.) Methods Lunch Aprl 9, 2015 6 / 1

Does a standard multvarate regresson estmate causal mechansms? Y = α + βt + γm + ɛ M γ T β Y Park (Wash U.) Methods Lunch Aprl 9, 2015 7 / 1

Methods for nvestgatng causal mechansms Causal medaton analyss Lnear Structural Equaton Modelng (LSEM) wth a sngle medator Y = α 1 + ζt + ɛ 1, M = α 2 + δt + ɛ 2, Y = α 3 + βt + γm + ɛ 3 Total effect = ζ Drect effect = β Medaton effect = ζ β = δγ Park (Wash U.) Methods Lunch Aprl 9, 2015 8 / 1

Methods for nvestgatng causal mechansms Causal medaton analyss M M δ γ γ T β LSEM Y T β Regresson Y Park (Wash U.) Methods Lunch Aprl 9, 2015 9 / 1

Methods for nvestgatng causal mechansms Causal medaton analyss What assumptons are requred for δγ to be an asymptotcally consstent estmate? Lnearty Zero correlaton between ɛ 2 and ɛ 3 The potental outcomes framework: an alternatve approach to causal medaton analyss (Ima et al. 2011). A general method that s not dependent on any statstcal model applcable to lnear and nonlnear models n the same way. Clarfes the requred assumptons and provdes a senstvty tool. Park (Wash U.) Methods Lunch Aprl 9, 2015 10 / 1

The potental outcomes approach to medaton analyss Potental outcomes (Holland 1986; Neyman 1923; Rubn 1974) Gven a unt and a set of actons ncludng control and treatment T = t, for t = 0 or 1, two outcomes reman potental untl one s actually realzed: Y = Y (t = 0) and Y = Y (t = 1), where Y s the realzed outcome. Y (0) s s mental health that would be realzed f does not attend the job program. Y (1) s s mental health that would be realzed f attend the job program. Park (Wash U.) Methods Lunch Aprl 9, 2015 11 / 1

The potental outcomes approach to medaton analyss Defne the potental outcomes as a functon of treatment and medator such as Y (T = t, M (T ) = m) for ndvdual. Y : an outcome; T : a treatment varable; M : a medator; t and m: specfc values for T and M. For the sake of smplcty, suppose two alternatve values of the treatment. t = 0 represents the control condton and t = 1 represents the treatment condton. The potental medator values for are denoted by M (0) and M (1). For nstance, Y (0, M (1)) refers to a potental outcome that would be realzed under the control condton wth the medator takng a counterfactual value that would be realzed under the treatment condton. Y (0, M (0))? Park (Wash U.) Methods Lunch Aprl 9, 2015 12 / 1

The potental outcomes approach to medaton analyss Identfyng the medaton effect: By holdng the treatment constant and changng only the medator, thereby controllng for the drect causal pathway between the treatment and outcome, we can solate the medaton effect from other alternatve causal mechansms. The medaton effect s a purely counterfactual quantty that s nherently unobservable. Medator Medaton effect Treatment Outcome Drect effect Park (Wash U.) Methods Lunch Aprl 9, 2015 13 / 1

The potental outcomes approach to medaton analyss Explots predcted counterfactual values of the treatment and the medator. Clarfes what assumptons are requred to dentfy the unobservable medaton effect. Average Medaton Effect (AME) ME Y (t, M (1)) Y (t, M (0)), for t = 0 or 1 AME E[Y (t, M (1)) Y (t, M (0))], for t = 0 or 1 How much the outcome would change f the medator changes whle the treatment remans constant. Park (Wash U.) Methods Lunch Aprl 9, 2015 14 / 1

The potental outcomes approach to medaton analyss Average Drect Effect (ADE) DE Y (1, M (t)) Y (0, M (t)), for t = 0 or 1 ADE E[Y (1, M (t)) Y (0, M (t))], for t = 0 or 1 How much the outcome would change f the treatment changes whle the medator remans constant. Average Total Effect (ATE) TE Y (1, M (1)) Y (0, M (0)) ATE E[Y (1, M (1)) Y (0, M (0))] The total effect of the treatment on the outcome. Park (Wash U.) Methods Lunch Aprl 9, 2015 15 / 1

The potental outcomes approach to medaton analyss Requred Assumptons The sequental gnorablty assumpton (Ima et al. 2010a; 2010b; 2011). (1) The treatment s assumed to be ndependent of potental outcomes and potental medators, so that t s gnorable, gven the observed pretreatment covarates. (2) The medator s assumed to be ndependent of potental outcomes, so that t s gnorable, gven the observed pretreatment covarates and the gnorable treatment. Perfect complance wth treatment assgnment. SUTVA (stable unt treatment value assumpton): Non-nterference and excluson restrcton. Park (Wash U.) Methods Lunch Aprl 9, 2015 16 / 1

The potental outcomes approach to medaton analyss Estmaton procedure (Three steps): 1. Fttng models 2. Smulatng potental values of the medator and the outcome 3. Computng the medaton effect and the drect effect 1. Fttng models: Runnng two regressons. A regresson model for the medator as a functon of the treatment and pretreatment covarates f θm (M T, X ) A regresson model for the outcome as a functon of the treatment, medator and pretreatment covarates f θy (Y T, M, X ) That s, M = α 2 + δt + ɛ M Y = α 3 + βt + γm + ɛ Y Park (Wash U.) Methods Lunch Aprl 9, 2015 17 / 1

The potental outcomes approach to medaton analyss 2. Smulatng potental values of the medator and the outcome. Use the estmated parameters from the two regresson models (ˆθ M and ˆθ Y ) to smulate potental values of the medator and the outcome. M (k) Y (k) Y (k) (t X ) from fˆθ(k) (M t, X ) for k = 1, 2,..., K (t, M (k) (t X ) X ) from fˆθ (k) (t, M (k) (t X ) X ) from fˆθ (k) M Y Y (Y t, M (k) (t), X ) for k = 1, 2,..., K (Y t, M (k) (t ), X ) for k = 1, 2,..., K Example: t = 0 or 1 K copes of M (0 X ) and M (1 X ) K copes of Y (0, M (k) (0 X ) X ) and Y (0, M (k) (1 X ) X ) for condton t = 0 K copes of Y (1, M (k) (0 X ) X ) and Y (1, M (k) (1 X ) X ) for condton t = 1 Park (Wash U.) Methods Lunch Aprl 9, 2015 18 / 1

The potental outcomes approach to medaton analyss 3. Computng the medaton and drect effects ÂME = 1 1 2 nk 1 n K [Y (k) t=0 =1 k=1 (t, M (k) (1 X ) X ) Y (k) (t, M (k) (0 X ) X )] 1 2 [{Y (0, M (1 X ) X ) Y (0, M (0 X ) X )} + {Y (1, M (1 X ) X ) Y (1, M (0 X ) X )}] How much the outcome would change f the medator changes whle the treatment remans constant. Park (Wash U.) Methods Lunch Aprl 9, 2015 19 / 1

The potental outcomes approach to medaton analyss 3. Computng the medaton and drect effects ÂDE = 1 1 2 nk 1 n K [Y (k) t=0 =1 k=1 (1, M (k) (t X ) X ) Y (k) (0, M (k) (t X ) X )] How much the outcome would change f the treatment changes whle the medator remans constant. ÂTE = 1 nk n K [Y (k) =1 k=1 (1, M (k) (1 X ) X ) Y (k) (0, M (k) (0 X ) X )] The total effect of the treatment on the outcome. Park (Wash U.) Methods Lunch Aprl 9, 2015 20 / 1

Emprcal Applcaton Local economc condtons Presdental electons n the U.S.? Hypothess: Voters observe local economc condtons (gas prces, foreclosures, etc) shape subjectve evaluatons of the natonal economy use these evaluatons to cast a presdental vote at the ballot box. Subjectve evaluatons of the economy: the medatng role. M: Evaluatons Medaton effect T: Local economy Y: Vote choce Drect effect Park (Wash U.) Methods Lunch Aprl 9, 2015 21 / 1

Emprcal Applcaton How to mplement causal medaton analyss? Use medaton package (STATA, R). Wrte your own code. Contnuous treatment, ordered categorcal medator, bnary outcome wth multlevel data structure. Want to run a fully Bayesan multlevel model. Park (Wash U.) Methods Lunch Aprl 9, 2015 22 / 1

Emprcal Applcaton Predcted Effects of Gas Prce on John McCan's Vote Share n 2008 0.2 0.0 0.2 0.2 0.0 0.2 0.2 0.0 0.2 2 1 0 1 2 ADE AME ATE 0.04 0.00 0.04 0.10 0.00 0.10 0.034 0.011 2 1 0 1 2 [Fgures enlarged: The dotted lnes ndcate the predcted ATE and AME due to an $1 ncrease n gas prce] ATE AME Change n Gas Prce Park (Wash U.) Methods Lunch Aprl 9, 2015 23 / 1

Emprcal Applcaton Predcted Effects of Foreclosure on John McCan's Vote Share n 2008 0.08 0.04 0.00 0.08 0.04 0.00 0.08 0.04 0.00 0 20 40 60 80 100 ATE AME ADE Number of Foreclosures per 1000 Households 0.030 0.015 0.000 0.030 0.015 0.000 0.02 0.011 0 20 40 60 80 100 [Fgures enlarged: The dotted lnes ndcate the predcted ATE and AME due to a 70 unts ncrease n foreclosures per 1000 households] ATE AME Park (Wash U.) Methods Lunch Aprl 9, 2015 24 / 1

Senstvty Analyss (1) An objectve local economc condton s ndependent of potental outcomes for vote choce and subjectve evaluatons of the economy gven the observed pretreatment covarates. tem (2) A subjectve economc evaluaton s ndependent of potental outcomes for vote choce gven the observed pretreatment covarates and the observed values for objectve economc condtons. To see how much a volaton of the requred assumpton would affect the estmates If nference s senstve, a slght volaton of the assumpton may lead to substantvely dfferent conclusons (Ima et al. 2011). ρ = Corr(ɛ M, ɛ Y ) Non-zero values of ρ mply departures from (2). Use medaton package to fgure out the value of ρ that makes our estmates equal to zero or nsgnfcant. Park (Wash U.) Methods Lunch Aprl 9, 2015 25 / 1