Bayesian Causal Forests
|
|
- Edwin Harvey
- 5 years ago
- Views:
Transcription
1 Bayesian Causal Forests for estimating personalized treatment effects P. Richard Hahn, Jared Murray, and Carlos Carvalho December 8, 2016
2 Overview We consider observational data, assuming conditional unconfoundedness, and are interested in individual treatment effects. Three key statistical concepts will be: prediction, regularization, and selection. Our regression model will take the novel form D i = g(x i ) + ɛ i, k Y i = m(x i ) + b j (x)d j i + ν i. j=1 where g, m and b are represented as regression forests. 1
3 Accurate prediction is necessary but not sufficient Higher predictive accuracy is associated with more reliable information about the underlying data mechanism. Weak predictive accuracy can lead to questionable conclusions. Algorithmic models can give better predictive accuracy than data models, and provide better information about the underlying mechanism. Leo Breiman Prediction by itself is only occasionally sufficient.the post office is happy with any method that predicts correct addresses from hand-written scrawls...[but] most statistical surveys have the identification of causal factors as their ultimate goal. Bradley Efron 2
4 Flexible models are necessary Taken from OpenIntro Statistics 2nd Edition. 3
5 Flexible models aren t sufficient An expensive car doesn t cause good teeth. 4
6 Concrete example Suppose we re interested in the treatment effect of dietary kale intake. We want to know how effective it is at lowering cholesterol, which is our outcome variable. Unfortunately, we have only observational data (i.e., not a randomized study). 5
7 Kale intake predicts exercise Only gym-rats seem to eat much kale and exercise is known to lower cholesterol. Y i = β 0 + αd i + ε i, Because cov(d i, ε i ) 0, we can write Y i = β 0 + αd i + ωd i + ε. Since cov(d i, ε i ) = 0, we mis-estimate α as α + ω. But if we include exercise in the model, Conditional on x i, cov(d i, ε i ) = 0. Y i = β 0 + αd i + βx i + ε i. 6
8 Unmeasured confounding and control variables Randomized experiments are the special case where prediction and causation are equivalent. ɛ ν ɛ D Y D Y x Holding x constant, one has experimental variation from which to infer causal impacts. (We have to assume Y doesn t cause D.) 7
9 Regularization Y i = β 0 + αd i + βx i + ε i. a flat prior on the treatment effect: (α, σ 2 ε) 1/σ ε, shrinkage prior on β (e.g., the horseshoe prior). Now turn the Bayesian crank... 8
10 Something goes wrong GoodBayes NaiveBayes OLS Oracle It turns out that this obvious approach is really bad at getting reasonable estimates of the treatment effect α. 9
11 Regularization-induced confounding The bias of the treatment parameter under ridge regression is: bias(ˆα rr ) = ( (D t D) 1 D t X ) (I p + X t (X ˆX D )) 1 β. This is a function of all the other unknown regression parameters. 10
12 Our reparametrization: a latent error approach We reparametrize as α α β + αγ β d. γ β c which gives the new equations Selection Eq.: D = x t β c + ɛ, ɛ N(0, σ 2 ɛ), Response Eq.: Y = α(d x t β c ) + x t β d + ν, ν N(0, σ 2 ν). We may now shrink β d and β c with impunity. 11
13 Control of bias The expression for ridge treatment effect bias is now: bias(ˆα) = ( (R t R) 1 R t X ) (I p + X t (X ˆX R )) 1 β d. where R = D x t β c. By construction, (R t R) 1 R t X will be close to the zero vector, because R is independent of x. 12
14 ρ 2 Bias Coverage I.L. MSE 0.1 New Approach OLS Naive Regularization Oracle OLS New Approach OLS Naive Regularization Oracle OLS New Approach OLS Naive Regularization Oracle OLS New Approach OLS -8e Naive Regularization Oracle OLS New Approach OLS Naive Regularization Oracle OLS Table 1: n = 50, p = 30, k = 3. κ 2 = φ 2 = σ 2 ν =
15 Nonlinear regressions for deconfounding Consider the nonlinear model Y i = f (x i, D i ) + ε i. Now our de-confoundedness condition is stronger D i ε i x i. And our causal estimand is more general α(x i, D i, D i ) = E(Y i x i, D i ) E(Y i x i, D i ), = f (x i, D i ) f (x i, D i ). 14
16 Regression Trees no x 1 < c Tree T h yes g(x, T h, M h ) µ h2 µ h1 x 3 < d no yes x 3 d µ h3 µ h1 µ h2 µ h3 x 1 c Leaf/End node parameters M h = (µ h1, µ h2, µ h3 ) Partition A h = {A h1, A h2, A h3 } g(x, T h, M h ) = µ ht if x A ht (for 1 t b h ). 15
17 Bayesian Additive Regression Trees (BART) Bayesian additive regression trees (BART) (Chipman, George and McCulloch, 2010): y i = f (x i ) + ɛ i, ɛ i N(0, σ 2 ) m f (x) = g(x, T h, M h ) h=1 g are basis functions determined by a binary tree T h and vector of parameters M h. These models were specifically advocated for causal inference in Hill (2012). 16
18 BART exhibits regularization-induced confounding Estimate EYhat4 0 0 EYhat New Causal FLM 2 State of the Art (Hill) EY EY Treatment effect Figure 1: Left: BART predictions. Middle: BCF predictions. Right: treatment effect estimates; BART (blue) and BCF (orange). 17
19 Taylor expanding f (x, D) We expand f (x, D) about the point D = E(D x) = g(x): f (j) (x, g(x))(d g(x)) j f (x, D) = j! j=0 where f (j) denotes the jth derivative of f with respect to D, and reparametrize as f (x, D) = f (x, g(x)) + = m(x, g(x)) + k b j (x, g(x))(d g(x)) j j=1 k b j (x, g(x))(d g(x)) j. j=1 18
20 Taylor Shifts One can transform back via b j (x) = j h k for 0 j k and b 0 (x) := m(x, g(x)). In this parametrization ( ) h b h (x, g(x))g(x) h j. j α(x i, D i, D i ) = f (x i, D i ) f (x i, D i ), k k = b j (x)d j b j (x)d j, = j=0 j=0 k b j (x) ( D j D j). j=0 This suggests k j=0 b j (x) is a nice summary of the causal effect at x. 19
21 Bayesian causal forests Take k <, give g, m and b independent BART priors and assume normal additive errors: D i = g(x i ) + ɛ i, Y i = m(x i, g(x i )) + k b j (x, g(x))(d i g(x i )) j + ν i. j=1 This model allows for the various roles of control variables to be regularized independently: g(x) represents selection, m(x, g(x)) represents direct association, and b j (x, g(x)) represents nonlinearity as it varies over j, b j (x, g(x)) represents heterogeneous effects as it varies in x. Confusion of these roles is confounding. 20
22 Advantages of BCF The analogies with the linear case are clear, D i = g(x i ) + ɛ i, Y i = m(x i, g(x i )) + Advantages are that k b j (x, g(x i ))(D i g(x i )) j + ν i. j=1 Independent regularization of g, m, and b. easy summarization of treatment effects, transparent interpolation/extrapolation. By comparison, BART s regularization is implicit, treatment effects are difficult to extract, and interpolation/extrapolation is poor. 21
23 1987 National Medical Expenditure Survey (NMES) What is the effect of smoking on medical expenditures? outcome variable Y is medical expenses (verified, log transformed), treatment variable D is cigarettes smoked per day, n = 7.7k complete-case analysis for Y > 0, covariates include age at time of the survey (19-94), age started smoking, gender (male, female), race (hispanic, black, other), marriage status (married, widowed, divorced, separated, never married), education level (college graduate, some college, high school graduate, other), census region (Northeast, Midwest, South, West), poverty status (poor, near poor, low income, middle income, high income), and seat belt usage (rarely, sometimes, always/almost always) 22
24 Average treatment effect posterior Frequency Average Treatment effect Figure 2: The average treatment effect is statistically significant, but small. 23
25 Individual treatment effect estimates bhat[ind] Index Figure 3: precision. Individual level treatment effects are difficult to estimate with any 24
26 Seeing the forests through a tree Fitting a single regression tree to posterior summaries of g, m, and k j=0 b j (x) allows us to find interesting sub-groups for follow-up study. Save posterior samples of each subject-specific parameter, compute ĝ, ˆm, and b j, fit single regression trees (with little or no regularization) with these summaries as the outcome, Leaf nodes define subgroups for further examination (such as subgroup average treatment effects). 25
27 Tree fit to π(g i ) yes % RACE3 < 2.5 no % AGESMOKE >= % MALE < % 2.5 6% % % RACE3 < 1.5 LASTAGE < 32 AGESMOKE >= 18 AGESMOKE >= % % % % 3 19% MALE < 0.5 LASTAGE >= 76 LASTAGE < 28 LASTAGE < 38 LASTAGE < % LASTAGE >= % 2.2 6% 2.4 3% 2.3 2% 2.6 4% 2.3 2% % 2.6 6% % 2.7 5% 2.7 2% % 2.8 6% % Figure 4: Age, gender and race appear to be the dominant determinants of treatment/smoking level. 26
28 Tree fit to π(m i ) yes % LASTAGE < 50 no % MALE >= % LASTAGE < % LASTAGE < % LASTAGE < % 6.4 3% % % 7 15% 7.3 7% Figure 5: Age appears to be the dominant driver of medical expenditures. 27
29 Tree fit to π(b i ) yes % RACE3 < 2.5 no % % educate < 1.5 educate < % educate >= % educate < % % % % % % Figure 6: Race and educational level appear to modulate the treatment effect. 28
30 What about BART? 0.10 bhat mhat + bhat * (z - ghat) BART and BCF give similar predictions bartfit$yhat.train.mean bhat_bart Same predictions, different conclusions. 29
31 Summary Regularization-induced confounding can adversely bias treatment effect estimates. Explicitly modeling de-confoundedness allows regularization to be imposed robustly. The intuitive parametrization makes posterior exploration quite convenient. Bayesian causal forests permit finer control of regularization. Easily modified to handle categorical outcomes and treatments. Future work considers non-normal errors. Thank you for your time. 30
32 Some related work Machine learning for causal inference: Hill, Van der Laan, Athey and Imbens, Athey and Wager, Hansen et al. Earlier work looked at joint modeling: Zigler et al (2013), Whitney and Powell (2003), Robins, Mark and Newey (1992), McCandless, Gustafson and Austin (2009). So-called double-robust estimators: Robins, Hernan, and Wasserman (2015). Our work differs from these works in its explicit focus on regularization and its novel forested-linear-model structure. 31
Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity
Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity P. Richard Hahn, Jared Murray, and Carlos Carvalho June 22, 2017 The problem setting We want to estimate
More informationBayesian causal forests: dealing with regularization induced confounding and shrinking towards homogeneous effects
Bayesian causal forests: dealing with regularization induced confounding and shrinking towards homogeneous effects P. Richard Hahn, Jared Murray, and Carlos Carvalho July 29, 2018 Regularization induced
More informationBayesian Additive Regression Tree (BART) with application to controlled trail data analysis
Bayesian Additive Regression Tree (BART) with application to controlled trail data analysis Weilan Yang wyang@stat.wisc.edu May. 2015 1 / 20 Background CATE i = E(Y i (Z 1 ) Y i (Z 0 ) X i ) 2 / 20 Background
More informationSelective Inference for Effect Modification
Inference for Modification (Joint work with Dylan Small and Ashkan Ertefaie) Department of Statistics, University of Pennsylvania May 24, ACIC 2017 Manuscript and slides are available at http://www-stat.wharton.upenn.edu/~qyzhao/.
More informationCausal Inference with General Treatment Regimes: Generalizing the Propensity Score
Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke
More informationAchieving Optimal Covariate Balance Under General Treatment Regimes
Achieving Under General Treatment Regimes Marc Ratkovic Princeton University May 24, 2012 Motivation For many questions of interest in the social sciences, experiments are not possible Possible bias in
More informationItem Response Theory for Conjoint Survey Experiments
Item Response Theory for Conjoint Survey Experiments Devin Caughey Hiroto Katsumata Teppei Yamamoto Massachusetts Institute of Technology PolMeth XXXV @ Brigham Young University July 21, 2018 Conjoint
More informationStructural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall
1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Duke University Medical, Dept. of Biostatistics Joint work
More informationPartial factor modeling: predictor-dependent shrinkage for linear regression
modeling: predictor-dependent shrinkage for linear Richard Hahn, Carlos Carvalho and Sayan Mukherjee JASA 2013 Review by Esther Salazar Duke University December, 2013 Factor framework The factor framework
More informationCausal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk
Causal Inference in Observational Studies with Non-Binary reatments Statistics Section, Imperial College London Joint work with Shandong Zhao and Kosuke Imai Cass Business School, October 2013 Outline
More informationEfficient Identification of Heterogeneous Treatment Effects via Anomalous Pattern Detection
Efficient Identification of Heterogeneous Treatment Effects via Anomalous Pattern Detection Edward McFowland III emcfowla@umn.edu Joint work with (Sriram Somanchi & Daniel B. Neill) 1 Treatment Effects
More informationarxiv: v3 [stat.me] 28 Dec 2016
By P. Richard Hahn, Carlos M. Carvalho, Jingyu He and David Puelz REGULARIZATION AND CONFOUNDING arxiv:1602.02176v3 [stat.me] 28 Dec 2016 IN LINEAR REGRESSION FOR TREATMENT EFFECT ESTIMATION This paper
More informationJob Training Partnership Act (JTPA)
Causal inference Part I.b: randomized experiments, matching and regression (this lecture starts with other slides on randomized experiments) Frank Venmans Example of a randomized experiment: Job Training
More informationStructural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall
1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Dept. of Biostatistics, Duke University Medical Joint work
More informationQuantitative Economics for the Evaluation of the European Policy
Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,
More informationWhat s New in Econometrics. Lecture 1
What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and
More informationTHE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B.
THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B. RUBIN My perspective on inference for causal effects: In randomized
More informationCausal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions
Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census
More informationCausal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies
Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University
More informationChapter 11. Regression with a Binary Dependent Variable
Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score
More informationProgram Evaluation with High-Dimensional Data
Program Evaluation with High-Dimensional Data Alexandre Belloni Duke Victor Chernozhukov MIT Iván Fernández-Val BU Christian Hansen Booth ESWC 215 August 17, 215 Introduction Goal is to perform inference
More informationSelection on Observables: Propensity Score Matching.
Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017
More informationSensitivity checks for the local average treatment effect
Sensitivity checks for the local average treatment effect Martin Huber March 13, 2014 University of St. Gallen, Dept. of Economics Abstract: The nonparametric identification of the local average treatment
More informationGov 2002: 4. Observational Studies and Confounding
Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What
More informationLearning Representations for Counterfactual Inference. Fredrik Johansson 1, Uri Shalit 2, David Sontag 2
Learning Representations for Counterfactual Inference Fredrik Johansson 1, Uri Shalit 2, David Sontag 2 1 2 Counterfactual inference Patient Anna comes in with hypertension. She is 50 years old, Asian
More informationAGEC 661 Note Fourteen
AGEC 661 Note Fourteen Ximing Wu 1 Selection bias 1.1 Heckman s two-step model Consider the model in Heckman (1979) Y i = X iβ + ε i, D i = I {Z iγ + η i > 0}. For a random sample from the population,
More informationIdentification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case
Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College December 2016 Abstract Lewbel (2012) provides an estimator
More informationarxiv: v1 [stat.me] 18 Nov 2018
MALTS: Matching After Learning to Stretch Harsh Parikh 1, Cynthia Rudin 1, and Alexander Volfovsky 2 1 Computer Science, Duke University 2 Statistical Science, Duke University arxiv:1811.07415v1 [stat.me]
More informationBootstrapping Sensitivity Analysis
Bootstrapping Sensitivity Analysis Qingyuan Zhao Department of Statistics, The Wharton School University of Pennsylvania May 23, 2018 @ ACIC Based on: Qingyuan Zhao, Dylan S. Small, and Bhaswar B. Bhattacharya.
More informationIndividualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models
Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models Nicholas C. Henderson Thomas A. Louis Gary Rosner Ravi Varadhan Johns Hopkins University July 31, 2018
More informationGov 2002: 13. Dynamic Causal Inference
Gov 2002: 13. Dynamic Causal Inference Matthew Blackwell December 19, 2015 1 / 33 1. Time-varying treatments 2. Marginal structural models 2 / 33 1/ Time-varying treatments 3 / 33 Time-varying treatments
More informationA spatial causal analysis of wildfire-contributed PM 2.5 using numerical model output. Brian Reich, NC State
A spatial causal analysis of wildfire-contributed PM 2.5 using numerical model output Brian Reich, NC State Workshop on Causal Adjustment in the Presence of Spatial Dependence June 11-13, 2018 Brian Reich,
More informationCh 7: Dummy (binary, indicator) variables
Ch 7: Dummy (binary, indicator) variables :Examples Dummy variable are used to indicate the presence or absence of a characteristic. For example, define female i 1 if obs i is female 0 otherwise or male
More informationECONOMETRIC MODEL WITH QUALITATIVE VARIABLES
ECONOMETRIC MODEL WITH QUALITATIVE VARIABLES How to quantify qualitative variables to quantitative variables? Why do we need to do this? Econometric model needs quantitative variables to estimate its parameters
More informationEstimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing
Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing Alessandra Mattei Dipartimento di Statistica G. Parenti Università
More informationPSC 504: Instrumental Variables
PSC 504: Instrumental Variables Matthew Blackwell 3/28/2013 Instrumental Variables and Structural Equation Modeling Setup e basic idea behind instrumental variables is that we have a treatment with unmeasured
More informationDEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS
DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS Donald B. Rubin Harvard University 1 Oxford Street, 7th Floor Cambridge, MA 02138 USA Tel: 617-495-5496; Fax: 617-496-8057 email: rubin@stat.harvard.edu
More informationBayesian methods for missing data: part 1. Key Concepts. Nicky Best and Alexina Mason. Imperial College London
Bayesian methods for missing data: part 1 Key Concepts Nicky Best and Alexina Mason Imperial College London BAYES 2013, May 21-23, Erasmus University Rotterdam Missing Data: Part 1 BAYES2013 1 / 68 Outline
More informationFlexible Estimation of Treatment Effect Parameters
Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both
More informationCombining multiple observational data sources to estimate causal eects
Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,
More informationPrevious lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing.
Previous lecture P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing. Interaction Outline: Definition of interaction Additive versus multiplicative
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint
More informationVariable selection and machine learning methods in causal inference
Variable selection and machine learning methods in causal inference Debashis Ghosh Department of Biostatistics and Informatics Colorado School of Public Health Joint work with Yeying Zhu, University of
More informationChallenges (& Some Solutions) and Making Connections
Challenges (& Some Solutions) and Making Connections Real-life Search All search algorithm theorems have form: If the world behaves like, then (probability 1) the algorithm will recover the true structure
More informationIdentification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case
Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College Original December 2016, revised July 2017 Abstract Lewbel (2012)
More information41903: Introduction to Nonparametrics
41903: Notes 5 Introduction Nonparametrics fundamentally about fitting flexible models: want model that is flexible enough to accommodate important patterns but not so flexible it overspecializes to specific
More informationPropensity Score Weighting with Multilevel Data
Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative
More informationStatistical Models for Causal Analysis
Statistical Models for Causal Analysis Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 Three Modes of Statistical Inference 1. Descriptive Inference: summarizing and exploring
More informationA Measure of Robustness to Misspecification
A Measure of Robustness to Misspecification Susan Athey Guido W. Imbens December 2014 Graduate School of Business, Stanford University, and NBER. Electronic correspondence: athey@stanford.edu. Graduate
More informationCHAPTER 2 Estimating Probabilities
CHAPTER 2 Estimating Probabilities Machine Learning Copyright c 2017. Tom M. Mitchell. All rights reserved. *DRAFT OF September 16, 2017* *PLEASE DO NOT DISTRIBUTE WITHOUT AUTHOR S PERMISSION* This is
More informationThe Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest
The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest Edward Vytlacil, Yale University Renmin University, Department
More informationSelective Inference for Effect Modification: An Empirical Investigation
Observational Studies () Submitted ; Published Selective Inference for Effect Modification: An Empirical Investigation Qingyuan Zhao Department of Statistics The Wharton School, University of Pennsylvania
More informationCombining Non-probability and Probability Survey Samples Through Mass Imputation
Combining Non-probability and Probability Survey Samples Through Mass Imputation Jae-Kwang Kim 1 Iowa State University & KAIST October 27, 2018 1 Joint work with Seho Park, Yilin Chen, and Changbao Wu
More informationDiscussion: Can We Get More Out of Experiments?
Discussion: Can We Get More Out of Experiments? Kosuke Imai Princeton University September 4, 2010 Kosuke Imai (Princeton) Discussion APSA 2010 (Washington D.C.) 1 / 9 Keele, McConnaughy, and White Question:
More informationCausal Inference with Big Data Sets
Causal Inference with Big Data Sets Marcelo Coca Perraillon University of Colorado AMC November 2016 1 / 1 Outlone Outline Big data Causal inference in economics and statistics Regression discontinuity
More informationLecture 3: Causal inference
MACHINE LEARNING FOR HEALTHCARE 6.S897, HST.S53 Lecture 3: Causal inference Prof. David Sontag MIT EECS, CSAIL, IMES (Thanks to Uri Shalit for many of the slides) *Last week: Type 2 diabetes 1994 2000
More informationGov 2002: 5. Matching
Gov 2002: 5. Matching Matthew Blackwell October 1, 2015 Where are we? Where are we going? Discussed randomized experiments, started talking about observational data. Last week: no unmeasured confounders
More informationLecture 5: ANOVA and Correlation
Lecture 5: ANOVA and Correlation Ani Manichaikul amanicha@jhsph.edu 23 April 2007 1 / 62 Comparing Multiple Groups Continous data: comparing means Analysis of variance Binary data: comparing proportions
More informationSTAT 518 Intro Student Presentation
STAT 518 Intro Student Presentation Wen Wei Loh April 11, 2013 Title of paper Radford M. Neal [1999] Bayesian Statistics, 6: 475-501, 1999 What the paper is about Regression and Classification Flexible
More informationThe risk of machine learning
/ 33 The risk of machine learning Alberto Abadie Maximilian Kasy July 27, 27 2 / 33 Two key features of machine learning procedures Regularization / shrinkage: Improve prediction or estimation performance
More informationMostly Dangerous Econometrics: How to do Model Selection with Inference in Mind
Outline Introduction Analysis in Low Dimensional Settings Analysis in High-Dimensional Settings Bonus Track: Genaralizations Econometrics: How to do Model Selection with Inference in Mind June 25, 2015,
More informationApplied Econometrics (MSc.) Lecture 3 Instrumental Variables
Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Estimation - Theory Department of Economics University of Gothenburg December 4, 2014 1/28 Why IV estimation? So far, in OLS, we assumed independence.
More informationMultiple Regression: Chapter 13. July 24, 2015
Multiple Regression: Chapter 13 July 24, 2015 Multiple Regression (MR) Response Variable: Y - only one response variable (quantitative) Several Predictor Variables: X 1, X 2, X 3,..., X p (p = # predictors)
More informationLongitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois
Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 217, Chicago, Illinois Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control
More information150C Causal Inference
150C Causal Inference Instrumental Variables: Modern Perspective with Heterogeneous Treatment Effects Jonathan Mummolo May 22, 2017 Jonathan Mummolo 150C Causal Inference May 22, 2017 1 / 26 Two Views
More informationAn Introduction to Causal Analysis on Observational Data using Propensity Scores
An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut
More informationLecture 8. Roy Model, IV with essential heterogeneity, MTE
Lecture 8. Roy Model, IV with essential heterogeneity, MTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Heterogeneity When we talk about heterogeneity, usually we mean heterogeneity
More informationApplied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid
Applied Economics Regression with a Binary Dependent Variable Department of Economics Universidad Carlos III de Madrid See Stock and Watson (chapter 11) 1 / 28 Binary Dependent Variables: What is Different?
More informationDeductive Derivation and Computerization of Semiparametric Efficient Estimation
Deductive Derivation and Computerization of Semiparametric Efficient Estimation Constantine Frangakis, Tianchen Qian, Zhenke Wu, and Ivan Diaz Department of Biostatistics Johns Hopkins Bloomberg School
More informationPotential Outcomes Model (POM)
Potential Outcomes Model (POM) Relationship Between Counterfactual States Causality Empirical Strategies in Labor Economics, Angrist Krueger (1999): The most challenging empirical questions in economics
More informationBART: Bayesian additive regression trees
BART: Bayesian additive regression trees Hedibert F. Lopes & Paulo Marques Insper Institute of Education and Research São Paulo, Brazil Most of the notes were kindly provided by Rob McCulloch (Arizona
More informationUnpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies
Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies Kosuke Imai Princeton University February 23, 2012 Joint work with L. Keele (Penn State)
More informationEconometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction
Econometrics I Professor William Greene Stern School of Business Department of Economics 1-1/40 http://people.stern.nyu.edu/wgreene/econometrics/econometrics.htm 1-2/40 Overview: This is an intermediate
More informationPropensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary Treatments
Propensity-Score Based Methods for Causal Inference in Observational Studies with Fixed Non-Binary reatments Shandong Zhao David A. van Dyk Kosuke Imai July 3, 2018 Abstract Propensity score methods are
More informationGroupe de lecture. Instrumental Variables Estimates of the Effect of Subsidized Training on the Quantiles of Trainee Earnings. Abadie, Angrist, Imbens
Groupe de lecture Instrumental Variables Estimates of the Effect of Subsidized Training on the Quantiles of Trainee Earnings Abadie, Angrist, Imbens Econometrica (2002) 02 décembre 2010 Objectives Using
More informationUnpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies
Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies Kosuke Imai Princeton University January 23, 2012 Joint work with L. Keele (Penn State)
More informationCausal Inference. Prediction and causation are very different. Typical questions are:
Causal Inference Prediction and causation are very different. Typical questions are: Prediction: Predict Y after observing X = x Causation: Predict Y after setting X = x. Causation involves predicting
More informationOr How to select variables Using Bayesian LASSO
Or How to select variables Using Bayesian LASSO x 1 x 2 x 3 x 4 Or How to select variables Using Bayesian LASSO x 1 x 2 x 3 x 4 Or How to select variables Using Bayesian LASSO On Bayesian Variable Selection
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint
More informationBayesian Nonparametric Accelerated Failure Time Models for Analyzing Heterogeneous Treatment Effects
Bayesian Nonparametric Accelerated Failure Time Models for Analyzing Heterogeneous Treatment Effects Nicholas C. Henderson Thomas A. Louis Gary Rosner Ravi Varadhan Johns Hopkins University September 28,
More informationStatistical Analysis of List Experiments
Statistical Analysis of List Experiments Graeme Blair Kosuke Imai Princeton University December 17, 2010 Blair and Imai (Princeton) List Experiments Political Methodology Seminar 1 / 32 Motivation Surveys
More informationThe propensity score with continuous treatments
7 The propensity score with continuous treatments Keisuke Hirano and Guido W. Imbens 1 7.1 Introduction Much of the work on propensity score analysis has focused on the case in which the treatment is binary.
More informationECON 5350 Class Notes Functional Form and Structural Change
ECON 5350 Class Notes Functional Form and Structural Change 1 Introduction Although OLS is considered a linear estimator, it does not mean that the relationship between Y and X needs to be linear. In this
More informationCausal inference in multilevel data structures:
Causal inference in multilevel data structures: Discussion of papers by Li and Imai Jennifer Hill May 19 th, 2008 Li paper Strengths Area that needs attention! With regard to propensity score strategies
More informationTargeted Maximum Likelihood Estimation in Safety Analysis
Targeted Maximum Likelihood Estimation in Safety Analysis Sam Lendle 1 Bruce Fireman 2 Mark van der Laan 1 1 UC Berkeley 2 Kaiser Permanente ISPE Advanced Topics Session, Barcelona, August 2012 1 / 35
More informationDiscussion of Papers on the Extensions of Propensity Score
Discussion of Papers on the Extensions of Propensity Score Kosuke Imai Princeton University August 3, 2010 Kosuke Imai (Princeton) Generalized Propensity Score 2010 JSM (Vancouver) 1 / 11 The Theme and
More informationEconometrics I Lecture 7: Dummy Variables
Econometrics I Lecture 7: Dummy Variables Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 27 Introduction Dummy variable: d i is a dummy variable
More informationHigh Dimensional Propensity Score Estimation via Covariate Balancing
High Dimensional Propensity Score Estimation via Covariate Balancing Kosuke Imai Princeton University Talk at Columbia University May 13, 2017 Joint work with Yang Ning and Sida Peng Kosuke Imai (Princeton)
More informationSC705: Advanced Statistics Instructor: Natasha Sarkisian Class notes: Introduction to Structural Equation Modeling (SEM)
SC705: Advanced Statistics Instructor: Natasha Sarkisian Class notes: Introduction to Structural Equation Modeling (SEM) SEM is a family of statistical techniques which builds upon multiple regression,
More informationRandomized Decision Trees
Randomized Decision Trees compiled by Alvin Wan from Professor Jitendra Malik s lecture Discrete Variables First, let us consider some terminology. We have primarily been dealing with real-valued data,
More informationEconometrics Problem Set 6
Econometrics Problem Set 6 WISE, Xiamen University Spring 2016-17 Conceptual Questions 1. This question refers to the estimated regressions shown in Table 1 computed using data for 1988 from the CPS. The
More informationIntroduction to GSEM in Stata
Introduction to GSEM in Stata Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Introduction to GSEM in Stata Boston College, Spring 2016 1 /
More informationGaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012
Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature
More informationPropensity Score Analysis with Hierarchical Data
Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational
More informationECE521 week 3: 23/26 January 2017
ECE521 week 3: 23/26 January 2017 Outline Probabilistic interpretation of linear regression - Maximum likelihood estimation (MLE) - Maximum a posteriori (MAP) estimation Bias-variance trade-off Linear
More informationPSC 504: Dynamic Causal Inference
PSC 504: Dynamic Causal Inference Matthew Blackwell 4/8/203 e problem Let s go back to a problem that we faced earlier, which is how to estimate causal effects with treatments that vary over time. We could
More informationStatistical Consulting Topics Classification and Regression Trees (CART)
Statistical Consulting Topics Classification and Regression Trees (CART) Suppose the main goal in a data analysis is the prediction of a categorical variable outcome. Such as in the examples below. Given
More informationHERITABILITY ESTIMATION USING A REGULARIZED REGRESSION APPROACH (HERRA)
BIRS 016 1 HERITABILITY ESTIMATION USING A REGULARIZED REGRESSION APPROACH (HERRA) Malka Gorfine, Tel Aviv University, Israel Joint work with Li Hsu, FHCRC, Seattle, USA BIRS 016 The concept of heritability
More informationLecture 7: Interaction Analysis. Summer Institute in Statistical Genetics 2017
Lecture 7: Interaction Analysis Timothy Thornton and Michael Wu Summer Institute in Statistical Genetics 2017 1 / 39 Lecture Outline Beyond main SNP effects Introduction to Concept of Statistical Interaction
More information