Causal Inference with Big Data Sets
|
|
- Cameron Jordan
- 6 years ago
- Views:
Transcription
1 Causal Inference with Big Data Sets Marcelo Coca Perraillon University of Colorado AMC November / 1
2 Outlone Outline Big data Causal inference in economics and statistics Regression discontinuity design Examples 1 Five-star ratings of nursing homes 2 Early intervention for very low-birth weight infants 3 Hospital readmission penalties Estimation issues and extensions 2 / 1
3 Big data What is big data? The new big data: remarkable advances in the ability to store and query very large data sets in real time Think of Uber, Google maps, Kayak Also advances in prediction methods: think Netlix recommendations and machine learning That same technology could be applied to decision making tools for physicians in real time through electronic health records Query speed is fundamental 3 / 1
4 Big data Big data in old-fashioned research Not immediate benefit in terms of causal inference and traditional research questions (no hypothesis testing in machine learning) In health economics and health services research, big data has been used for a long time Think of studies using Medicare claims: millions of enrollees with multiple claims per year using multiple years. That s very big data Causal problems are the same regardless of the data size; a lot of bad data is the same a little of bad data However, some observational methods are better suited for large data sets 4 / 1
5 Causal inference Causal inference Causal inference has been a constant in economics for almost 100 years: supply and demand curves (around 1918) New advances in the last 15 years in several fields A new language, notation, and a unified theory in the counterfactual framework (see Imbens and Rubin, 2015) Today: regression discontinuity design (RDD) Not a big data method per se, but in practice one needs large datasets to implement it 5 / 1
6 Regression discontinuity Thistlethwaite and Campbell Studied the impact of merit awards on future academic outcomes Awards allocated based on test scores. A test score greater than c (cutoff point) guaranteed receipt of scholarship Research question: What is the effect of receiving a scholarship on future grades or income? We cannot just compare students who received the scholarship to those who did not But... Thistlethwaite and Campbell realized they could compare individuals just above and just below the cutoff point 6 / 1
7 Regression discontinuity Simple idea Is there much difference among students scoring, say, 720 versus 740? Probably not (given some assumptions) Can be empirically verified: compare measured baseline characteristics of those just above and just below the cutoff point c We cannot compare unmeasured characteristics Perhaps the main reason RDD is becoming popular is that one can see the design working 7 / 1
8 Regression discontinuity Assumptions and validity The assignment mechanism is known and depends on a continuous variable (assignment variable is often called the running or forcing variable) The cutoff point is known (the probability of treatment jumps to 1 if test score > c) Key assumption: individuals cannot manipulate with precision their assignment variable Data can be analyzed as if it were a (conditionally) randomized experiment Limitation: the estimated treatment effect applies to those near the cutoff point (LATE) 8 / 1
9 Regression discontinuity Assumptions and validity Expressing the key assumption of RDD as the absence of manipulation with precision is common in economics because economists model the behavior of agents making choices Some authors write that validity depends on the assignment being arbitrary or that the assignment variable is measured with error (e.g. Moscoe et al, 2015) These are slightly different ways of saying the same but it s somewhat confusing Arbitrary: The cutoff point is not related to outcome it could be any other value (think of weight) Measured with error: If assignment variable is measured with error, then cutoff point c is essentially (somewhat) arbitrary 9 / 1
10 Regression discontinuity Manipulation Manipulation example 1: Test with few questions and plenty of time Manipulation example 2: DMV test to get a driving license Manipulation example 3: Eligibility criteria to obtain some benefit (say, income below 150% of Federal Poverty Level) Consequence: not comparable close to cutoff point (but somewhat testable) Some manipulation does NOT invalidate RDD. Precision and lack of relation between cutoff point and outcome are key to identify causal effects 10 / 1
11 Regression discontinuity Graphical Example Simulated data with c = / 1
12 Regression discontinuity No effect 12 / 1
13 Regression discontinuity Sharp and fuzzy RDD Sharp RDD: Assignment or running variable completely determines treatment. A jump in the probability of treatment before and after cutoff point, from 0 to 1 Fuzzy RDD: Cutoff point increases the probability of treatment but running variable does not completely determines treatment Which brings us back to the world of instrumental variables; think of encouragement designs where treatment assignment is the instrument Unlike most instruments, one can still check if observations are similar near cutoff point Fuzzy RDD designs are underutilized 13 / 1
14 Examples Examples from literature Research question: Do the benefits of additional medical expenditures for at-risk newborns exceed their costs? Almond et al. (2010): Assignment variable is birth weight. Newborns weighting less than 1, 500 grams (about 3 pounds) receive more medical treatment Fuzzy design: low birth weight increases the probability of more medical treatment but does not completely determine treatment (first stage) Mortality as outcome (second stage) Restricted analysis to small window around threshold (85 grams) Clinical guidelines are a good place to find (fuzzy) discontinuities 14 / 1
15 Examples Examples from literature Do consumers respond to composite ratings of quality in nursing homes (Perraillon et al.,2016)? In 2008 the Centers for Medicare & Medicaid (CMS) released composite ratings of quality in nursing homes (1 to 5 stars) Overall stars are assigned based on deficiency data transformed into a points system Percentiles based on this score determine cutoff points to assign stars (running variable) Outcome: new admissions six months after the release of ratings Nursing homes that received an additional start gained more admissions, except for lower-rated nursing homes 15 / 1
16 Examples Assignment of stars based on scores 16 / 1
17 Examples Other examples Hospitals readmission penalties (with Rich Lindrooth) Anderson and Magruder (2012) and Lucas (2012): Yelp.com ratings have an underlying continuous score. Distribution determines cutoff points for 1 to 5 stars. Effect of an extra star on future reservations and revenue Anderson et al. (2012): Young adults lose their health insurance as they age (older than 18 and in college but different after ACA). Age changes the probability of having health insurance (fuzzy design) Card et al (2009): Does Medicare save lives? Older adults can get Medicare after age 65. They compared admissions to ER for people just below and just above 65. Found that patients over 65 get more services (first stage) and that mortality is reduced (second stage) 17 / 1
18 Examples Estimation: Parametric Simplest case is linear relationship between Y and X Y i = β 0 + β 1 T i + β 3 X i + ɛ i T i = 1 if subject i received treatment and T i = 0 otherwise. Often written as T i = 1(X i > c) or T i = 1 [Xi >c] X is the assignment variable Usually centered at cutoff point Y i = β 0 + β 1 T i + β 3 (X i c) + ɛ i. Treatment effect is given by β 1. E[Y T = 1, X = c] = β 0 + β 1 and E[Y T = 0, X = c] = β 0. E[Y T = 1, X = c] E[Y T = 0, X = c] = β 1. What about covariates? 18 / 1
19 Examples Need to model relationship between X and Y correctly What if nonlinear? Could result in a biased treatment effect if one assumes a linear model 19 / 1
20 Examples Main issues in estimation Modeling correctly the functional form of outcome and running variable Paper by Hahn, Todd, and Van der Klaauw (2001) clarified assumptions about RDD and framed estimation as a nonparametric problem Idea is to estimate a model that does not assume a functional form for the relationship between Y and X. The model is something like Y i = f (X i ) + ɛ i Emphasized using weighted local polynomial regression (instead of LOWESS) Problem: difficult to add covariates and outcome may not be normally distributed 20 / 1
21 Examples A window or all data? Should we use all data or just the data near a cutoff point? Typical bias-variance trade-off; the optimal bandwidth literature was never influential Common advice was to use high-order polynomials to control for the running variable until Gelman and Inbems (2014): Why High-order Polynomials Should not be Used in Regression Discontinuity Designs We argue that estimators for causal effects based on [higher order polynomials] can be misleading, and we recommend researchers do not use them, and instead use estimators based on local linear or quadratic polynomials / 1
22 Examples Extension, other issues Statistical tests for manipulation (clustering close to cutoff point) More than one assigning variable (e.g. scholarship based on verbal and quantitative tests) Discontinuities in geography Fuzzy RDD is underutilized 22 / 1
23 Examples Summary of method Developed to estimate causal treatment effects in non-experimental settings Good internal validity; some assumptions can be empirically verified Fairly convincing when it is possible to show balance close to cutoff point Treatment effects are local (LATE); limits external validity Relatively easy to estimate In practice it does require large datasets to have enough observations around a bandwidth Stata code and more slides: 23 / 1
Regression Discontinuity Design
Regression Discontinuity Design Marcelo Coca Perraillon University of Chicago May 13 & 18, 2015 1 / 51 Introduction Plan Overview of RDD Meaning and validity of RDD Several examples from the literature
More informationRegression Discontinuity Designs in Economics
Regression Discontinuity Designs in Economics Dr. Kamiljon T. Akramov IFPRI, Washington, DC, USA Training Course on Applied Econometric Analysis September 13-23, 2016, WIUT, Tashkent, Uzbekistan Outline
More informationMichael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp
page 1 Lecture 7 The Regression Discontinuity Design fuzzy and sharp page 2 Regression Discontinuity Design () Introduction (1) The design is a quasi-experimental design with the defining characteristic
More informationRegression Discontinuity Design Econometric Issues
Regression Discontinuity Design Econometric Issues Brian P. McCall University of Michigan Texas Schools Project, University of Texas, Dallas November 20, 2009 1 Regression Discontinuity Design Introduction
More informationRegression Discontinuity Design
Chapter 11 Regression Discontinuity Design 11.1 Introduction The idea in Regression Discontinuity Design (RDD) is to estimate a treatment effect where the treatment is determined by whether as observed
More informationted: a Stata Command for Testing Stability of Regression Discontinuity Models
ted: a Stata Command for Testing Stability of Regression Discontinuity Models Giovanni Cerulli IRCrES, Research Institute on Sustainable Economic Growth National Research Council of Italy 2016 Stata Conference
More informationThe Economics of European Regions: Theory, Empirics, and Policy
The Economics of European Regions: Theory, Empirics, and Policy Dipartimento di Economia e Management Davide Fiaschi Angela Parenti 1 1 davide.fiaschi@unipi.it, and aparenti@ec.unipi.it. Fiaschi-Parenti
More informationRegression Discontinuity Designs.
Regression Discontinuity Designs. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 31/10/2017 I. Brunetti Labour Economics in an European Perspective 31/10/2017 1 / 36 Introduction
More informationAddressing Analysis Issues REGRESSION-DISCONTINUITY (RD) DESIGN
Addressing Analysis Issues REGRESSION-DISCONTINUITY (RD) DESIGN Overview Assumptions of RD Causal estimand of interest Discuss common analysis issues In the afternoon, you will have the opportunity to
More informationPropensity Score Weighting with Multilevel Data
Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative
More informationEmpirical approaches in public economics
Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental
More informationRegression Discontinuity
Regression Discontinuity Assignment based on a cutoff score Second best design Theory and Unbiased Inference Can be widely used Analysis is tricky Need to use more often Regression Discontinuity Resource
More informationRegression Discontinuity
Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 16, 2018 I will describe the basic ideas of RD, but ignore many of the details Good references
More informationAn Alternative Assumption to Identify LATE in Regression Discontinuity Design
An Alternative Assumption to Identify LATE in Regression Discontinuity Design Yingying Dong University of California Irvine May 2014 Abstract One key assumption Imbens and Angrist (1994) use to identify
More informationESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics
ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.
More informationWeek 3: Simple Linear Regression
Week 3: Simple Linear Regression Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ALL RIGHTS RESERVED 1 Outline
More informationApplied Microeconometrics Chapter 8 Regression Discontinuity (RD)
1 / 26 Applied Microeconometrics Chapter 8 Regression Discontinuity (RD) Romuald Méango and Michele Battisti LMU, SoSe 2016 Overview What is it about? What are its assumptions? What are the main applications?
More informationRegression Discontinuity
Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 24, 2017 I will describe the basic ideas of RD, but ignore many of the details Good references
More informationLecture 1: Introduction to Regression Discontinuity Designs in Economics
Lecture 1: Introduction to Regression Discontinuity Designs in Economics Thomas Lemieux, UBC Spring Course in Labor Econometrics University of Coimbra, March 18 2011 Plan of the three lectures on regression
More informationRegression Discontinuity Designs
Regression Discontinuity Designs Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Regression Discontinuity Design Stat186/Gov2002 Fall 2018 1 / 1 Observational
More informationEconometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous
Econometrics of causal inference Throughout, we consider the simplest case of a linear outcome equation, and homogeneous effects: y = βx + ɛ (1) where y is some outcome, x is an explanatory variable, and
More informationLecture 10 Regression Discontinuity (and Kink) Design
Lecture 10 Regression Discontinuity (and Kink) Design Economics 2123 George Washington University Instructor: Prof. Ben Williams Introduction Estimation in RDD Identification RDD implementation RDD example
More informationAn Alternative Assumption to Identify LATE in Regression Discontinuity Designs
An Alternative Assumption to Identify LATE in Regression Discontinuity Designs Yingying Dong University of California Irvine September 2014 Abstract One key assumption Imbens and Angrist (1994) use to
More informationIdentifying the Effect of Changing the Policy Threshold in Regression Discontinuity Models
Identifying the Effect of Changing the Policy Threshold in Regression Discontinuity Models Yingying Dong and Arthur Lewbel University of California Irvine and Boston College First version July 2010, revised
More informationWeek 2: Review of probability and statistics
Week 2: Review of probability and statistics Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ALL RIGHTS RESERVED
More informationQuantitative Economics for the Evaluation of the European Policy
Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,
More informationMarginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal
Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect
More informationSelection on Observables: Propensity Score Matching.
Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017
More informationEMERGING MARKETS - Lecture 2: Methodology refresher
EMERGING MARKETS - Lecture 2: Methodology refresher Maria Perrotta April 4, 2013 SITE http://www.hhs.se/site/pages/default.aspx My contact: maria.perrotta@hhs.se Aim of this class There are many different
More informationPropensity Score Analysis with Hierarchical Data
Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational
More informationDevelopment. ECON 8830 Anant Nyshadham
Development ECON 8830 Anant Nyshadham Projections & Regressions Linear Projections If we have many potentially related (jointly distributed) variables Outcome of interest Y Explanatory variable of interest
More informationBIOS 6649: Handout Exercise Solution
BIOS 6649: Handout Exercise Solution NOTE: I encourage you to work together, but the work you submit must be your own. Any plagiarism will result in loss of all marks. This assignment is based on weight-loss
More informationRegression Discontinuity: Advanced Topics. NYU Wagner Rajeev Dehejia
Regression Discontinuity: Advanced Topics NYU Wagner Rajeev Dehejia Summary of RD assumptions The treatment is determined at least in part by the assignment variable There is a discontinuity in the level
More informationRegression Discontinuity
Regression Discontinuity STP Advanced Econometrics Lecture Douglas G. Steigerwald UC Santa Barbara March 2006 D. Steigerwald (Institute) Regression Discontinuity 03/06 1 / 11 Intuition Reference: Mostly
More informationFlexible Estimation of Treatment Effect Parameters
Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both
More informationCausal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies
Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed
More informationFinding Instrumental Variables: Identification Strategies. Amine Ouazad Ass. Professor of Economics
Finding Instrumental Variables: Identification Strategies Amine Ouazad Ass. Professor of Economics Outline 1. Before/After 2. Difference-in-difference estimation 3. Regression Discontinuity Design BEFORE/AFTER
More informationComments on Best Quasi- Experimental Practice
Comments on Best Quasi- Experimental Practice Larry V. Hedges Northwestern University Presented at the Future of Implementation Evaluation, National Science Foundation, Arlington, VA, October 28, 2013
More informationTargeted Maximum Likelihood Estimation in Safety Analysis
Targeted Maximum Likelihood Estimation in Safety Analysis Sam Lendle 1 Bruce Fireman 2 Mark van der Laan 1 1 UC Berkeley 2 Kaiser Permanente ISPE Advanced Topics Session, Barcelona, August 2012 1 / 35
More informationExperiments and Quasi-Experiments
Experiments and Quasi-Experiments (SW Chapter 13) Outline 1. Potential Outcomes, Causal Effects, and Idealized Experiments 2. Threats to Validity of Experiments 3. Application: The Tennessee STAR Experiment
More informationThe problem of causality in microeconometrics.
The problem of causality in microeconometrics. Andrea Ichino University of Bologna and Cepr June 11, 2007 Contents 1 The Problem of Causality 1 1.1 A formal framework to think about causality....................................
More informationEstimating the Effect of Financial Aid Offers on College Enrollment: A Regression Discontinuity Approach. Wilbert van der Klaaw
Estimating the Effect of Financial Aid Offers on College Enrollment: A Regression Discontinuity Approach Wilbert van der Klaaw 2002 Regression Discontinuity Design The Treatment Model W i = 1 : Treatment
More informationIntroduction to Econometrics
Introduction to Econometrics STAT-S-301 Experiments and Quasi-Experiments (2016/2017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 Why study experiments? Ideal randomized controlled experiments
More informationBig Data, Machine Learning, and Causal Inference
Big Data, Machine Learning, and Causal Inference I C T 4 E v a l I n t e r n a t i o n a l C o n f e r e n c e I F A D H e a d q u a r t e r s, R o m e, I t a l y P a u l J a s p e r p a u l. j a s p e
More informationEc1123 Section 7 Instrumental Variables
Ec1123 Section 7 Instrumental Variables Andrea Passalacqua Harvard University andreapassalacqua@g.harvard.edu November 16th, 2017 Andrea Passalacqua (Harvard) Ec1123 Section 7 Instrumental Variables November
More informationCausal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD
Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification Todd MacKenzie, PhD Collaborators A. James O Malley Tor Tosteson Therese Stukel 2 Overview 1. Instrumental variable
More informationWhy high-order polynomials should not be used in regression discontinuity designs
Why high-order polynomials should not be used in regression discontinuity designs Andrew Gelman Guido Imbens 6 Jul 217 Abstract It is common in regression discontinuity analysis to control for third, fourth,
More informationTHE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B.
THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B. RUBIN My perspective on inference for causal effects: In randomized
More informationRegression Discontinuity Design. Day 2 of Quasi Experimental Workshop
Regression Discontinuity Design Day 2 of Quasi Experimental Workshop Agenda Design Overview Estimation approaches Addressing threats to validity Example 1: Shadish, Galindo, Wong, V., Steiner, & Cook (under
More informationPropensity Score Matching
Methods James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 Methods 1 Introduction 2 3 4 Introduction Why Match? 5 Definition Methods and In
More informationRegression Discontinuity Design on Model Schools Value-Added Effects: Empirical Evidence from Rural Beijing
Regression Discontinuity Design on Model Schools Value-Added Effects: Empirical Evidence from Rural Beijing Kai Hong CentER Graduate School, Tilburg University April 2010 Abstract In this study we examine
More informationNBER WORKING PAPER SERIES REGRESSION DISCONTINUITY DESIGNS IN ECONOMICS. David S. Lee Thomas Lemieux
NBER WORKING PAPER SERIES REGRESSION DISCONTINUITY DESIGNS IN ECONOMICS David S. Lee Thomas Lemieux Working Paper 14723 http://www.nber.org/papers/w14723 NATIONAL BUREAU OF ECONOMIC RESEARCH 1050 Massachusetts
More informationRegression Discontinuity
Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 9, 2016 I will describe the basic ideas of RD, but ignore many of the details Good references
More informationOne Economist s Perspective on Some Important Estimation Issues
One Economist s Perspective on Some Important Estimation Issues Jere R. Behrman W.R. Kenan Jr. Professor of Economics & Sociology University of Pennsylvania SRCD Seattle Preconference on Interventions
More information14.74 Lecture 10: The returns to human capital: education
14.74 Lecture 10: The returns to human capital: education Esther Duflo March 7, 2011 Education is a form of human capital. You invest in it, and you get returns, in the form of higher earnings, etc...
More informationCausal Modeling in Environmental Epidemiology. Joel Schwartz Harvard University
Causal Modeling in Environmental Epidemiology Joel Schwartz Harvard University When I was Young What do I mean by Causal Modeling? What would have happened if the population had been exposed to a instead
More informationAnalysis of propensity score approaches in difference-in-differences designs
Author: Diego A. Luna Bazaldua Institution: Lynch School of Education, Boston College Contact email: diego.lunabazaldua@bc.edu Conference section: Research methods Analysis of propensity score approaches
More informationBayesian nonparametric predictive approaches for causal inference: Regression Discontinuity Methods
Bayesian nonparametric predictive approaches for causal inference: Regression Discontinuity Methods George Karabatsos University of Illinois-Chicago ERCIM Conference, 14-16 December, 2013 Senate House,
More informationCollaborative Topic Modeling for Recommending Scientific Articles
Collaborative Topic Modeling for Recommending Scientific Articles Chong Wang and David M. Blei Best student paper award at KDD 2011 Computer Science Department, Princeton University Presented by Tian Cao
More informationWhat s New in Econometrics. Lecture 1
What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and
More informationChapter 2 Regression Discontinuity Design: When Series Interrupt
Chapter 2 Regression Discontinuity Design: When Series Interrupt Abstract This chapter introduces the identification and estimation of policy effects when outcome variables can be ordered according to
More informationQuantifying Weather Risk Analysis
Quantifying Weather Risk Analysis Now that an index has been selected and calibrated, it can be used to conduct a more thorough risk analysis. The objective of such a risk analysis is to gain a better
More informationGov 2002: 4. Observational Studies and Confounding
Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What
More informationECO Class 6 Nonparametric Econometrics
ECO 523 - Class 6 Nonparametric Econometrics Carolina Caetano Contents 1 Nonparametric instrumental variable regression 1 2 Nonparametric Estimation of Average Treatment Effects 3 2.1 Asymptotic results................................
More informationAdaptive Trial Designs
Adaptive Trial Designs Wenjing Zheng, Ph.D. Methods Core Seminar Center for AIDS Prevention Studies University of California, San Francisco Nov. 17 th, 2015 Trial Design! Ethical:!eg.! Safety!! Efficacy!
More informationWhy High-Order Polynomials Should Not Be Used in Regression Discontinuity Designs
Why High-Order Polynomials Should Not Be Used in Regression Discontinuity Designs Andrew GELMAN Department of Statistics and Department of Political Science, Columbia University, New York, NY, 10027 (gelman@stat.columbia.edu)
More informationSupplemental Appendix to "Alternative Assumptions to Identify LATE in Fuzzy Regression Discontinuity Designs"
Supplemental Appendix to "Alternative Assumptions to Identify LATE in Fuzzy Regression Discontinuity Designs" Yingying Dong University of California Irvine February 2018 Abstract This document provides
More informationResearch Design: Causal inference and counterfactuals
Research Design: Causal inference and counterfactuals University College Dublin 8 March 2013 1 2 3 4 Outline 1 2 3 4 Inference In regression analysis we look at the relationship between (a set of) independent
More informationStructural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall
1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Dept. of Biostatistics, Duke University Medical Joint work
More informationTreatment Effects. Christopher Taber. September 6, Department of Economics University of Wisconsin-Madison
Treatment Effects Christopher Taber Department of Economics University of Wisconsin-Madison September 6, 2017 Notation First a word on notation I like to use i subscripts on random variables to be clear
More informationVariable selection and machine learning methods in causal inference
Variable selection and machine learning methods in causal inference Debashis Ghosh Department of Biostatistics and Informatics Colorado School of Public Health Joint work with Yeying Zhu, University of
More informationDynamics in Social Networks and Causality
Web Science & Technologies University of Koblenz Landau, Germany Dynamics in Social Networks and Causality JProf. Dr. University Koblenz Landau GESIS Leibniz Institute for the Social Sciences Last Time:
More information6. Assessing studies based on multiple regression
6. Assessing studies based on multiple regression Questions of this section: What makes a study using multiple regression (un)reliable? When does multiple regression provide a useful estimate of the causal
More informationSection 7: Local linear regression (loess) and regression discontinuity designs
Section 7: Local linear regression (loess) and regression discontinuity designs Yotam Shem-Tov Fall 2015 Yotam Shem-Tov STAT 239/ PS 236A October 26, 2015 1 / 57 Motivation We will focus on local linear
More informationEconometrics of Policy Evaluation (Geneva summer school)
Michael Lechner, Slide 1 Econometrics of Policy Evaluation (Geneva summer school) Michael Lechner Swiss Institute for Empirical Economic Research (SEW) University of St. Gallen Switzerland June 2016 Overview
More informationDetermining Sufficient Number of Imputations Using Variance of Imputation Variances: Data from 2012 NAMCS Physician Workflow Mail Survey *
Applied Mathematics, 2014,, 3421-3430 Published Online December 2014 in SciRes. http://www.scirp.org/journal/am http://dx.doi.org/10.4236/am.2014.21319 Determining Sufficient Number of Imputations Using
More informationMeasuring community health outcomes: New approaches for public health services research
Research Brief March 2015 Measuring community health outcomes: New approaches for public health services research P ublic Health agencies are increasingly asked to do more with less. Tough economic times
More informationEconometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction
Econometrics I Professor William Greene Stern School of Business Department of Economics 1-1/40 http://people.stern.nyu.edu/wgreene/econometrics/econometrics.htm 1-2/40 Overview: This is an intermediate
More informationFrom Causality, Second edition, Contents
From Causality, Second edition, 2009. Preface to the First Edition Preface to the Second Edition page xv xix 1 Introduction to Probabilities, Graphs, and Causal Models 1 1.1 Introduction to Probability
More informationThe Essential Role of Pair Matching in. Cluster-Randomized Experiments. with Application to the Mexican Universal Health Insurance Evaluation
The Essential Role of Pair Matching in Cluster-Randomized Experiments, with Application to the Mexican Universal Health Insurance Evaluation Kosuke Imai Princeton University Gary King Clayton Nall Harvard
More informationCS 540: Machine Learning Lecture 1: Introduction
CS 540: Machine Learning Lecture 1: Introduction AD January 2008 AD () January 2008 1 / 41 Acknowledgments Thanks to Nando de Freitas Kevin Murphy AD () January 2008 2 / 41 Administrivia & Announcement
More informationCasual Mediation Analysis
Casual Mediation Analysis Tyler J. VanderWeele, Ph.D. Upcoming Seminar: April 21-22, 2017, Philadelphia, Pennsylvania OXFORD UNIVERSITY PRESS Explanation in Causal Inference Methods for Mediation and Interaction
More informationInference in Regression Discontinuity Designs with a Discrete Running Variable
Inference in Regression Discontinuity Designs with a Discrete Running Variable Michal Kolesár Christoph Rothe June 21, 2016 Abstract We consider inference in regression discontinuity designs when the running
More informationANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS
ANALYTIC COMPARISON of Pearl and Rubin CAUSAL FRAMEWORKS Content Page Part I. General Considerations Chapter 1. What is the question? 16 Introduction 16 1. Randomization 17 1.1 An Example of Randomization
More informationExam ECON5106/9106 Fall 2018
Exam ECO506/906 Fall 208. Suppose you observe (y i,x i ) for i,2,, and you assume f (y i x i ;α,β) γ i exp( γ i y i ) where γ i exp(α + βx i ). ote that in this case, the conditional mean of E(y i X x
More informationINSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN SOLUTIONS
INSTITUTE AND FACULTY OF ACTUARIES Curriculum 09 SPECIMEN SOLUTIONS Subject CSA Risk Modelling and Survival Analysis Institute and Faculty of Actuaries Sample path A continuous time, discrete state process
More informationIntroduction to Econometrics. Assessing Studies Based on Multiple Regression
Introduction to Econometrics The statistical analysis of economic (and related) data STATS301 Assessing Studies Based on Multiple Regression Titulaire: Christopher Bruffaerts Assistant: Lorenzo Ricci 1
More informationWeighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.
Weighting Unconfounded Homework 2 Describe imbalance direction matters STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University
More informationRegression Discontinuity Designs in Stata
Regression Discontinuity Designs in Stata Matias D. Cattaneo University of Michigan July 30, 2015 Overview Main goal: learn about treatment effect of policy or intervention. If treatment randomization
More informationEconometrics. Week 11. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague
Econometrics Week 11 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 30 Recommended Reading For the today Advanced Time Series Topics Selected topics
More informationEcon 2148, spring 2019 Statistical decision theory
Econ 2148, spring 2019 Statistical decision theory Maximilian Kasy Department of Economics, Harvard University 1 / 53 Takeaways for this part of class 1. A general framework to think about what makes a
More informationDefining Statistically Significant Spatial Clusters of a Target Population using a Patient-Centered Approach within a GIS
Defining Statistically Significant Spatial Clusters of a Target Population using a Patient-Centered Approach within a GIS Efforts to Improve Quality of Care Stephen Jones, PhD Bio-statistical Research
More informationEmpirical Methods in Applied Microeconomics
Empirical Methods in Applied Microeconomics Jörn-Ste en Pischke LSE November 2007 1 Nonlinearity and Heterogeneity We have so far concentrated on the estimation of treatment e ects when the treatment e
More informationOutline. Possible Reasons. Nature of Heteroscedasticity. Basic Econometrics in Transportation. Heteroscedasticity
1/25 Outline Basic Econometrics in Transportation Heteroscedasticity What is the nature of heteroscedasticity? What are its consequences? How does one detect it? What are the remedial measures? Amir Samimi
More informationTechnical Track Session I:
Impact Evaluation Technical Track Session I: Click to edit Master title style Causal Inference Damien de Walque Amman, Jordan March 8-12, 2009 Click to edit Master subtitle style Human Development Human
More informationCausal Inference. Prediction and causation are very different. Typical questions are:
Causal Inference Prediction and causation are very different. Typical questions are: Prediction: Predict Y after observing X = x Causation: Predict Y after setting X = x. Causation involves predicting
More information4. Nonlinear regression functions
4. Nonlinear regression functions Up to now: Population regression function was assumed to be linear The slope(s) of the population regression function is (are) constant The effect on Y of a unit-change
More informationSampling and Sample Size. Shawn Cole Harvard Business School
Sampling and Sample Size Shawn Cole Harvard Business School Calculating Sample Size Effect Size Power Significance Level Variance ICC EffectSize 2 ( ) 1 σ = t( 1 κ ) + tα * * 1+ ρ( m 1) P N ( 1 P) Proportion
More informationEcon 2148, fall 2017 Instrumental variables I, origins and binary treatment case
Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Maximilian Kasy Department of Economics, Harvard University 1 / 40 Agenda instrumental variables part I Origins of instrumental
More information1 A Non-technical Introduction to Regression
1 A Non-technical Introduction to Regression Chapters 1 and Chapter 2 of the textbook are reviews of material you should know from your previous study (e.g. in your second year course). They cover, in
More information