Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing
|
|
- Shanon Cornelius Stokes
- 6 years ago
- Views:
Transcription
1 Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing Alessandra Mattei Dipartimento di Statistica G. Parenti Università degli Studi di Firenze
2 Outline 1. Motivation of the study 2. Estimating causal effects through a quasi-experimental approach 3. Estimating propensity scores with incomplete data 4. Estimating the causal effects of a childbearing on economic wellbeing in Indonesia using the Indonesia Family Life Survey (IFLS) 5. Concluding remarks
3 Motivation of the Study We compare three different approaches of handling missing background data in the estimation and use of propensity scores: 1. A complete-case analysis 2. A pattern-mixture model based approach developed by Rosenbaum and Rubin (1984) 3. A multiple imputation approach We make explicit the assumptions underlying each approach by illustrating the interaction between the treatment assignment mechanism and the missing data mechanism We apply these methods to assess the impact of childbearing events on individuals wellbeing in Indonesia, using a sample of women from the Indonesia Family Life Survey
4 The Quasi-Experimental Approach We use appropriate econometric techniques based on longitudinal micro data in order to identify the causal effects of childbearing events on poverty We consider the endogenous variable of interest, here change in fertility, as treatment variable Z, and divides individuals into two groups: those who experienced a childbirth - the treatment group, indicated by Z = T, and those who did not - the control group, indicated by Z = C The outcome variable, say Y, is a measure of wellbeing Strong Ignorability Assumption (Rosenbaum and Rubin, 1983) (i) Z is independent of the potential outcomes (Y (C), Y (T)) conditional on X = x (Unconfoundedness Assumption) (ii) η < Pr(Z = 1 X = x) < 1 η, for some η > 0
5 The Unconfoundedness Assumption The unconfoundedness assumption requires that all variables that affect both outcome and the likelihood of receiving the treatment are observed or that all the others are perfectly collinear with the observed ones This assumption is not testable, it is a very strong assumption, and one that need not generally be applicable Selection may also take place on the basis of unobservable characteristics We view it as a useful starting point for two reasons 1. In our study, we have carefully investigated which variables are most likely to confound any comparison between treated and control units 2. Any alternative assumptions that not rely on unconfoundedness, while allowing for consistent estimation of the causal effects of interest, must make alternative untestable assumptions
6 The Propensity Score The propensity score is the conditional probability of receiving a particular treatment (Z = T) versus control (Z = C) given a vector of observed covariates, X, e = e(x) = Pr (Z = T X) Balancing of pre-treatment variables given the propensity score If e(x) is the propensity score, then Z X e(x) Unconfoundedness given the propensity score Z (Y (C), Y (T)) X = Z (Y (C), Y (T)) e(x)
7 Notation Let the response indicator be 1, if the value of the k covariate for the ith subject is observed R ik = 0, if the value of the k covariate for the ith subject is missing for i = 1,...,N and k = 1,...,K. Let X = (X obs,x mis ), where X obs = {X ik : R ik = 1} e X mis = {X ik : R ik = 0}
8 Estimating Propensity Score with Incomplete Data It is not clear how the propensity score should be estimated when some covariate values are missing The missingness itself may be predictive about which treatment is received Any technique for estimating propensity score in the presence of covariate missing data will have to either make a stronger assumption regarding ignorability of the assignment mechanism or will have to make an assumption about the missing data mechanism In order to have ignorability of the assignment mechanism, for all of the techniques here described, we will maintain the following assumption: Pr (Z X, R, Y (C), Y (T)) = Pr (Z X, R)
9 Complete-Data Analysis A complete-data analysis uses only observations where all variables are observed To make valid causal inferences with this approach we require that data is Missing Completely At Random (MCAR, Little and Rubin): 1987): Pr(R X, Z) = Pr(R) Note that This means that the units removed from the data set, those with missing data, are just a simple random sample of the other Pr (Z X, R, Y (C), Y (T)) = Pr (Z X, R) and Pr(R X, Z) = Pr(R) Pr (Z X, R, Y (C), Y (T)) = Pr (Z X)
10 Rosenbaum - Rubin Approach The Propensity Scores with Incomplete Data The generalized propensity score, which conditions on all of the observed covariate information, is ( ) e = e (X obs, R) = Pr Z = T X obs, R Balancing of pre-treatment variables given the generalized propensity score ( ) Z X obs, R e (X obs, R) Unconfoundedness given the generalized propensity score ( ) Z (Y (C), Y (T)) X obs, R = Z (Y (C), Y (T)) e (X obs, R)
11 Rosenbaum - Rubin Approach Assumptions The Rosenbaum-Rubin method relies on either one of the following assumptions: Pr(Z X, R) Pr(Z X obs,x mis, R) = Pr(Z X obs, R) or Pr(Y (C), Y (T) X, R) Pr(Y (C), Y (T) X obs,x mis, R) = Pr(Y (C), Y (C) X obs, R) The Rosenbaum - Rubin method does not make any assumption about the missing data mechanism
12 Drawbacks Rosenbaum - Rubin Approach The Rosenbaum - Rubin method does assume that either all missing covariate values are independent of the the assignment mechanism conditional on the missing data patterns or or that they are independent of the potential outcomes conditional on observed covariate values and the missing data patterns Since the Rosenbaum - Rubin method specifies one model for both handling missing data and estimating propensity scores, it is not possible to incorporate the outcome variable Y into this model even though it might provide useful information about missing values
13 Multiple Imputation and Propensity Score Methods The latent ignorability of the assignment mechanism Using Multiple Imputation (MI) to handling incomplete data covariates, we essentially assume the latent ignorability of the assignment mechanism Pr(Z X, R, Y (C), Y (T)) = Pr(Z X). In our case, the assignment mechanism is ignorable only conditional on complete covariate data (which includes, of course, values that in practice are missing) Computationally, this is achieved by filling in the missing covariate values using MI
14 Multiple Imputation and Propensity Score Methods Assumptions on the assignment mechanism Imputations may in principle be created under any kind of model for the missing data mechanism, and the resulting inferences will be valid under that mechanism (Rubin, 1987) In our application, MI was performed assuming that the missing observations are Missing At Random (MAR), that is, Pr(R X, Z, Y (C), Y (T)) = Pr(R X obs, Z, Y obs ), where Y obs = [ ] Yi obs n, Y obs i=1 i = I{Z i = T }Y i (T) + I{Z i = C}Y i (C) This MAR assumption involves all the observed variables In our application, we perform MI in two way: including Y in the model, and not including Y in the imputation model
15 Multiple Imputation and Propensity Score Methods Estimators Let ÂTT l and se 2 l denote the point estimate and variance respectively from the lth (l = 1,..., m) dataset. Then, m ÂTT = 1 m ÂTT l ) V ar (ÂTT l=1 = se 2 W + se 2 B ( ) m where se 2 W = 1 m m l=1 se2 l Within-imputation variance se 2 B = 1 m 1 m l=1 (ÂTTl ÂTT ) 2 Between-imputation variance In our application, MI was performed using the mvis module in STATA (Patrick Royston, 2004), which is based on MICE method of multiple multivariate imputation (van Buuren et al., 1999)
16 Matching Estimators of the ATT Effect based on the Propensity Score The Nearest Neighbor Matching Estimator The Kernel Matching Estimator The Stratification Matching Estimator Irrespective of the method of handling missing data, the propensity score analysis is implemented by the use of the pscore module in STATA written by Becker and Ichino (2000)
17 The Indonesia Family Life Survey Data The IFLS consists of three waves (1993, 1997, 2000) plus a special wave (1998), which we will not use in our study We will use a subsample of panel ever-married women age In our study the outcome variable is a measure of monetary wellbeing, given by the annual value of the total household consumption expenditures adjusted for price variability across space and time and household heterogeneity Adjustment for price variability We divided the nominal consumption expenditures by the national consumption price index (IFS, 2002) Adjustment for household heterogeneity We adjust our income-based measure of wellbeing for household heterogeneity by applying the following equivalence scale: Total number of persons in the household
18 The Outcome Variable Descriptive statistics of total net equivalised household consumption expenditures in 2000 (Rupiah in thousands) by number of live births Consumption expenditures (Rupiah in thousands) Live births Obs mean s.d. median At least a live birth , Rupiah = 1 USA$ Note that =
19 Self-Selection of the Treated Units We observe that women who experience a childbearing and women who do not are very different in almost all their characteristics (Details are omitted) Systematic differences between the treatment group and the control group can also occur in the distribution of the missing covariate data 10.7% of the units in the sample presents at least a missing covariate value
20 Self-Selection of the Treated Units Missing-value indicators (proportion observed) Covariate Z = C Z = T Difference (%) Deprivation Index Education level of HH head Yrs of schooling of the HH head Education level Yrs of schooling Activity last week Age at first marriage Islam Parents in HH Years since the last live birth Pregnant Ever used contraceptives Use of contraceptives Total
21 Propensity Score Models for IFLS Data Standardized Differences (in %) and Percent Reduction in Bias for Propensity Scores, before and after matching using each approaches to the missing covariates problem in combination with Nearest Neighbor, Gaussian Kernel, and Stratification Propensity Score Matching Results after matching Nearest Neighbor Kernel Stratification Matching Matching Matching Missing Data Initial Stand. Red. Stand. Red. Stand. Red. Approaches Stand. Diff. Diff. in Bias Diff. in Bias Diff. in Bias (%) (%) (%) (%) (%) (%) (%) Complete-Data Rosenbaum-Rubin MI (without Y ) MI (with Y )
22 Treatment Effects Estimation Complete-Data Analysis Matching Method N T N C ATT S.E. t-value Nearest Neighbor Kernel Stratification The complete-cases analysis gives quite high average treatment effects and quite high standard errors It appears to be very sensitive to the choice of the matching method In our application, the MCAR assumption does not appear plausible; it is more reasonable to believe that the missing data mechanism is either Missing At Random (MAR) or nonignorable
23 Treatment Effects Estimation Rosenbaum-Rubin Model Matching Method N T N C ATT S.E. t-value Nearest Neighbor Kernel Stratification With respect to the complete-data analysis, the Rosenbaum-Rubin model appears to be more robust concerning the choice of the matching method It yields lower average treatment effects and lower standard errors It does not produce an excellent balance in the distribution of the estimated propensity score
24 Treatment Effects Estimation Multiple Imputation (without Y ) Matching Method N T N C ATT S.E. t-value Nearest Neighbor Kernel Stratification Multiple Imputation (with Y ) Matching Method N T N C ATT S.E. t-value Nearest Neighbor Kernel Stratification
25 Advantages of the MI Techniques The two imputation models outperform both of the other two approaches in terms of robustness of the estimates to the choice of the matching method Using different models for imputation and propensity score, the MI approach allows to incorporate model features in one model that might be inappropriate for another MI makes the choice of the propensity model easier The MI approach allows for final analysis of the outcomes (such as covariance adjustment) which include covariates which are not fully observed
26 Concluding Remarks We compared missing completely at random based estimates of propensity scores and the causal effect of interest with estimators based on alternative models for the missing data process: A pattern-mixture model based approach developed by Rosenbaum and Rubin (1984) A combination of propensity score matching with MI We judged the plausibility of these alternative approaches by the balance that the resulting propensity score models produced and the estimands they brought out In our application, the MI models appear to outperform both the complete data analysis and the Rosenbaum-Rubin method The combination of propensity score matching with MI we choose shows evidence that childbearing events reduce consumption levels
Selection on Observables: Propensity Score Matching.
Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017
More informationAssess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014
Assess Assumptions and Sensitivity Analysis Fan Li March 26, 2014 Two Key Assumptions 1. Overlap: 0
More informationWhat s New in Econometrics. Lecture 1
What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and
More informationSIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS
SIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS TOMMASO NANNICINI universidad carlos iii de madrid UK Stata Users Group Meeting London, September 10, 2007 CONTENT Presentation of a Stata
More informationEstimating the causal effect of fertility on economic wellbeing: data requirements, identifying assumptions and estimation methods
Empir Econ (2013) 44:355 385 DOI 10.1007/s00181-010-0356-9 Estimating the causal effect of fertility on economic wellbeing: data requirements, identifying assumptions and estimation methods Bruno Arpino
More informationSome methods for handling missing values in outcome variables. Roderick J. Little
Some methods for handling missing values in outcome variables Roderick J. Little Missing data principles Likelihood methods Outline ML, Bayes, Multiple Imputation (MI) Robust MAR methods Predictive mean
More informationCausal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies
Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed
More informationAn Empirical Comparison of Multiple Imputation Approaches for Treating Missing Data in Observational Studies
Paper 177-2015 An Empirical Comparison of Multiple Imputation Approaches for Treating Missing Data in Observational Studies Yan Wang, Seang-Hwane Joo, Patricia Rodríguez de Gil, Jeffrey D. Kromrey, Rheta
More information(Mis)use of matching techniques
University of Warsaw 5th Polish Stata Users Meeting, Warsaw, 27th November 2017 Research financed under National Science Center, Poland grant 2015/19/B/HS4/03231 Outline Introduction and motivation 1 Introduction
More informationGov 2002: 4. Observational Studies and Confounding
Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What
More informationGov 2002: 5. Matching
Gov 2002: 5. Matching Matthew Blackwell October 1, 2015 Where are we? Where are we going? Discussed randomized experiments, started talking about observational data. Last week: no unmeasured confounders
More informationApplied Microeconometrics (L5): Panel Data-Basics
Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics
More informationMixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina
Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals John W. Mac McDonald & Alessandro Rosina Quantitative Methods in the Social Sciences Seminar -
More informationA SIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS
A SIMULATION-BASED SENSITIVITY ANALYSIS FOR MATCHING ESTIMATORS TOMMASO NANNICINI universidad carlos iii de madrid North American Stata Users Group Meeting Boston, July 24, 2006 CONTENT Presentation of
More informationStatistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes
Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Kosuke Imai Department of Politics Princeton University July 31 2007 Kosuke Imai (Princeton University) Nonignorable
More informationAnalyzing Pilot Studies with Missing Observations
Analyzing Pilot Studies with Missing Observations Monnie McGee mmcgee@smu.edu. Department of Statistical Science Southern Methodist University, Dallas, Texas Co-authored with N. Bergasa (SUNY Downstate
More informationCan a Pseudo Panel be a Substitute for a Genuine Panel?
Can a Pseudo Panel be a Substitute for a Genuine Panel? Min Hee Seo Washington University in St. Louis minheeseo@wustl.edu February 16th 1 / 20 Outline Motivation: gauging mechanism of changes Introduce
More informationPropensity Score Weighting with Multilevel Data
Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative
More informationBayesian methods for missing data: part 1. Key Concepts. Nicky Best and Alexina Mason. Imperial College London
Bayesian methods for missing data: part 1 Key Concepts Nicky Best and Alexina Mason Imperial College London BAYES 2013, May 21-23, Erasmus University Rotterdam Missing Data: Part 1 BAYES2013 1 / 68 Outline
More informationAn Introduction to Causal Analysis on Observational Data using Propensity Scores
An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut
More informationA Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,
A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type
More informationCausal Inference with General Treatment Regimes: Generalizing the Propensity Score
Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke
More informationCombining multiple observational data sources to estimate causal eects
Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,
More informationStatistical Methods. Missing Data snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23
1 / 23 Statistical Methods Missing Data http://www.stats.ox.ac.uk/ snijders/sm.htm Tom A.B. Snijders University of Oxford November, 2011 2 / 23 Literature: Joseph L. Schafer and John W. Graham, Missing
More informationPropensity Score Methods for Causal Inference
John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good
More informationShu Yang and Jae Kwang Kim. Harvard University and Iowa State University
Statistica Sinica 27 (2017), 000-000 doi:https://doi.org/10.5705/ss.202016.0155 DISCUSSION: DISSECTING MULTIPLE IMPUTATION FROM A MULTI-PHASE INFERENCE PERSPECTIVE: WHAT HAPPENS WHEN GOD S, IMPUTER S AND
More informationBootstrapping Sensitivity Analysis
Bootstrapping Sensitivity Analysis Qingyuan Zhao Department of Statistics, The Wharton School University of Pennsylvania May 23, 2018 @ ACIC Based on: Qingyuan Zhao, Dylan S. Small, and Bhaswar B. Bhattacharya.
More informationA Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness
A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model
More informationImbens/Wooldridge, IRP Lecture Notes 2, August 08 1
Imbens/Wooldridge, IRP Lecture Notes 2, August 08 IRP Lectures Madison, WI, August 2008 Lecture 2, Monday, Aug 4th, 0.00-.00am Estimation of Average Treatment Effects Under Unconfoundedness, Part II. Introduction
More informationFlexible Estimation of Treatment Effect Parameters
Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both
More informationPropensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018
Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018 Schedule and outline 1:00 Introduction and overview 1:15 Quasi-experimental vs. experimental designs
More informationQuantitative Economics for the Evaluation of the European Policy
Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,
More informationMatching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14
STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University Frequency 0 2 4 6 8 Quiz 2 Histogram of Quiz2 10 12 14 16 18 20 Quiz2
More informationESTIMATION OF TREATMENT EFFECTS VIA MATCHING
ESTIMATION OF TREATMENT EFFECTS VIA MATCHING AAEC 56 INSTRUCTOR: KLAUS MOELTNER Textbooks: R scripts: Wooldridge (00), Ch.; Greene (0), Ch.9; Angrist and Pischke (00), Ch. 3 mod5s3 General Approach The
More informationTruncation and Censoring
Truncation and Censoring Laura Magazzini laura.magazzini@univr.it Laura Magazzini (@univr.it) Truncation and Censoring 1 / 35 Truncation and censoring Truncation: sample data are drawn from a subset of
More informationIdentification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case
Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College December 2016 Abstract Lewbel (2012) provides an estimator
More informationIntroduction An approximated EM algorithm Simulation studies Discussion
1 / 33 An Approximated Expectation-Maximization Algorithm for Analysis of Data with Missing Values Gong Tang Department of Biostatistics, GSPH University of Pittsburgh NISS Workshop on Nonignorable Nonresponse
More informationNonrespondent subsample multiple imputation in two-phase random sampling for nonresponse
Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse Nanhua Zhang Division of Biostatistics & Epidemiology Cincinnati Children s Hospital Medical Center (Joint work
More informationCausal Inference Lecture Notes: Selection Bias in Observational Studies
Causal Inference Lecture Notes: Selection Bias in Observational Studies Kosuke Imai Department of Politics Princeton University April 7, 2008 So far, we have studied how to analyze randomized experiments.
More informationMISSING or INCOMPLETE DATA
MISSING or INCOMPLETE DATA A (fairly) complete review of basic practice Don McLeish and Cyntha Struthers University of Waterloo Dec 5, 2015 Structure of the Workshop Session 1 Common methods for dealing
More informationA Measure of Robustness to Misspecification
A Measure of Robustness to Misspecification Susan Athey Guido W. Imbens December 2014 Graduate School of Business, Stanford University, and NBER. Electronic correspondence: athey@stanford.edu. Graduate
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies
More informationCausal Inference Basics
Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,
More informationCausal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk
Causal Inference in Observational Studies with Non-Binary reatments Statistics Section, Imperial College London Joint work with Shandong Zhao and Kosuke Imai Cass Business School, October 2013 Outline
More informationImplementing Matching Estimators for. Average Treatment Effects in STATA
Implementing Matching Estimators for Average Treatment Effects in STATA Guido W. Imbens - Harvard University West Coast Stata Users Group meeting, Los Angeles October 26th, 2007 General Motivation Estimation
More informationSelection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models
Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models Massimiliano Bratti & Alfonso Miranda In many fields of applied work researchers need to model an
More informationJob Training Partnership Act (JTPA)
Causal inference Part I.b: randomized experiments, matching and regression (this lecture starts with other slides on randomized experiments) Frank Venmans Example of a randomized experiment: Job Training
More informationPROPENSITY SCORE MATCHING. Walter Leite
PROPENSITY SCORE MATCHING Walter Leite 1 EXAMPLE Question: Does having a job that provides or subsidizes child care increate the length that working mothers breastfeed their children? Treatment: Working
More informationVariable selection and machine learning methods in causal inference
Variable selection and machine learning methods in causal inference Debashis Ghosh Department of Biostatistics and Informatics Colorado School of Public Health Joint work with Yeying Zhu, University of
More informationIdentification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case
Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College Original December 2016, revised July 2017 Abstract Lewbel (2012)
More informationA comparison of fully Bayesian and two-stage imputation strategies for missing covariate data
A comparison of fully Bayesian and two-stage imputation strategies for missing covariate data Alexina Mason, Sylvia Richardson and Nicky Best Department of Epidemiology and Biostatistics, Imperial College
More informationApplied Quantitative Methods II
Applied Quantitative Methods II Lecture 10: Panel Data Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 10 VŠE, SS 2016/17 1 / 38 Outline 1 Introduction 2 Pooled OLS 3 First differences 4 Fixed effects
More informationChapter 1 Introduction. What are longitudinal and panel data? Benefits and drawbacks of longitudinal data Longitudinal data models Historical notes
Chapter 1 Introduction What are longitudinal and panel data? Benefits and drawbacks of longitudinal data Longitudinal data models Historical notes 1.1 What are longitudinal and panel data? With regression
More informationBayesian Inference for Sequential Treatments under Latent Sequential Ignorability. June 19, 2017
Bayesian Inference for Sequential Treatments under Latent Sequential Ignorability Alessandra Mattei, Federico Ricciardi and Fabrizia Mealli Department of Statistics, Computer Science, Applications, University
More informationTHE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B.
THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B. RUBIN My perspective on inference for causal effects: In randomized
More informationImbens/Wooldridge, Lecture Notes 1, Summer 07 1
Imbens/Wooldridge, Lecture Notes 1, Summer 07 1 What s New in Econometrics NBER, Summer 2007 Lecture 1, Monday, July 30th, 9.00-10.30am Estimation of Average Treatment Effects Under Unconfoundedness 1.
More informationA note on multiple imputation for general purpose estimation
A note on multiple imputation for general purpose estimation Shu Yang Jae Kwang Kim SSC meeting June 16, 2015 Shu Yang, Jae Kwang Kim Multiple Imputation June 16, 2015 1 / 32 Introduction Basic Setup Assume
More informationFractional Imputation in Survey Sampling: A Comparative Review
Fractional Imputation in Survey Sampling: A Comparative Review Shu Yang Jae-Kwang Kim Iowa State University Joint Statistical Meetings, August 2015 Outline Introduction Fractional imputation Features Numerical
More informationFractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling
Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction
More informationComparison of multiple imputation methods for systematically and sporadically missing multilevel data
Comparison of multiple imputation methods for systematically and sporadically missing multilevel data V. Audigier, I. White, S. Jolani, T. Debray, M. Quartagno, J. Carpenter, S. van Buuren, M. Resche-Rigon
More informationEconometrics with Observational Data. Introduction and Identification Todd Wagner February 1, 2017
Econometrics with Observational Data Introduction and Identification Todd Wagner February 1, 2017 Goals for Course To enable researchers to conduct careful quantitative analyses with existing VA (and non-va)
More informationWomen. Sheng-Kai Chang. Abstract. In this paper a computationally practical simulation estimator is proposed for the twotiered
Simulation Estimation of Two-Tiered Dynamic Panel Tobit Models with an Application to the Labor Supply of Married Women Sheng-Kai Chang Abstract In this paper a computationally practical simulation estimator
More informationMatching Techniques. Technical Session VI. Manila, December Jed Friedman. Spanish Impact Evaluation. Fund. Region
Impact Evaluation Technical Session VI Matching Techniques Jed Friedman Manila, December 2008 Human Development Network East Asia and the Pacific Region Spanish Impact Evaluation Fund The case of random
More informationGibbs Sampling in Latent Variable Models #1
Gibbs Sampling in Latent Variable Models #1 Econ 690 Purdue University Outline 1 Data augmentation 2 Probit Model Probit Application A Panel Probit Panel Probit 3 The Tobit Model Example: Female Labor
More informationLast lecture 1/35. General optimization problems Newton Raphson Fisher scoring Quasi Newton
EM Algorithm Last lecture 1/35 General optimization problems Newton Raphson Fisher scoring Quasi Newton Nonlinear regression models Gauss-Newton Generalized linear models Iteratively reweighted least squares
More informationBasics of Modern Missing Data Analysis
Basics of Modern Missing Data Analysis Kyle M. Lang Center for Research Methods and Data Analysis University of Kansas March 8, 2013 Topics to be Covered An introduction to the missing data problem Missing
More informationBayesian regression tree models for causal inference: regularization, confounding and heterogeneity
Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity P. Richard Hahn, Jared Murray, and Carlos Carvalho June 22, 2017 The problem setting We want to estimate
More informationA Simulation-Based Sensitivity Analysis for Matching Estimators
A Simulation-Based Sensitivity Analysis for Matching Estimators Tommaso Nannicini Universidad Carlos III de Madrid Abstract. This article presents a Stata program (sensatt) that implements the sensitivity
More informationanalysis of incomplete data in statistical surveys
analysis of incomplete data in statistical surveys Ugo Guarnera 1 1 Italian National Institute of Statistics, Italy guarnera@istat.it Jordan Twinning: Imputation - Amman, 6-13 Dec 2014 outline 1 origin
More informationAlexina Mason. Department of Epidemiology and Biostatistics Imperial College, London. 16 February 2010
Strategy for modelling non-random missing data mechanisms in longitudinal studies using Bayesian methods: application to income data from the Millennium Cohort Study Alexina Mason Department of Epidemiology
More informationCombining Difference-in-difference and Matching for Panel Data Analysis
Combining Difference-in-difference and Matching for Panel Data Analysis Weihua An Departments of Sociology and Statistics Indiana University July 28, 2016 1 / 15 Research Interests Network Analysis Social
More informationA Course in Applied Econometrics. Lecture 2 Outline. Estimation of Average Treatment Effects. Under Unconfoundedness, Part II
A Course in Applied Econometrics Lecture Outline Estimation of Average Treatment Effects Under Unconfoundedness, Part II. Assessing Unconfoundedness (not testable). Overlap. Illustration based on Lalonde
More informationESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics
ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.
More informationUniversity of Pennsylvania and The Children s Hospital of Philadelphia
Submitted to the Annals of Applied Statistics arxiv: arxiv:0000.0000 ESTIMATION OF CAUSAL EFFECTS USING INSTRUMENTAL VARIABLES WITH NONIGNORABLE MISSING COVARIATES: APPLICATION TO EFFECT OF TYPE OF DELIVERY
More informationComparing Group Means When Nonresponse Rates Differ
UNF Digital Commons UNF Theses and Dissertations Student Scholarship 2015 Comparing Group Means When Nonresponse Rates Differ Gabriela M. Stegmann University of North Florida Suggested Citation Stegmann,
More informationWhether to use MMRM as primary estimand.
Whether to use MMRM as primary estimand. James Roger London School of Hygiene & Tropical Medicine, London. PSI/EFSPI European Statistical Meeting on Estimands. Stevenage, UK: 28 September 2015. 1 / 38
More informationLogistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015
Logistic regression: Why we often can do what we think we can do Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015 1 Introduction Introduction - In 2010 Carina Mood published an overview article
More informationGov 2000: 9. Regression with Two Independent Variables
Gov 2000: 9. Regression with Two Independent Variables Matthew Blackwell Fall 2016 1 / 62 1. Why Add Variables to a Regression? 2. Adding a Binary Covariate 3. Adding a Continuous Covariate 4. OLS Mechanics
More informationControlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded
Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded 1 Background Latent confounder is common in social and behavioral science in which most of cases the selection mechanism
More informationEcon 673: Microeconometrics Chapter 12: Estimating Treatment Effects. The Problem
Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects The Problem Analysts are frequently interested in measuring the impact of a treatment on individual behavior; e.g., the impact of job
More informationExercise sheet 6 Models with endogenous explanatory variables
Exercise sheet 6 Models with endogenous explanatory variables Note: Some of the exercises include estimations and references to the data files. Use these to compare them to the results you obtained with
More informationStatistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach
Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Jae-Kwang Kim Department of Statistics, Iowa State University Outline 1 Introduction 2 Observed likelihood 3 Mean Score
More informationMATCHING FOR EE AND DR IMPACTS
MATCHING FOR EE AND DR IMPACTS Seth Wayland, Opinion Dynamics August 12, 2015 A Proposal Always use matching Non-parametric preprocessing to reduce model dependence Decrease bias and variance Better understand
More informationComparison for alternative imputation methods for ordinal data
Comparison for alternative imputation methods for ordinal data Federica Cugnata e Silvia Salini DEMM, Università degli Studi di Milano 22 maggio 2013 Cugnata & Salini (DEMM - Unimi) Imputation methods
More informationIntroduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015
Introduction to causal identification Nidhiya Menon IGC Summer School, New Delhi, July 2015 Outline 1. Micro-empirical methods 2. Rubin causal model 3. More on Instrumental Variables (IV) Estimating causal
More informationTable 1. Answers to income and consumption adequacy questions Percentage of responses: less than adequate more than adequate adequate Total income 68.7% 30.6% 0.7% Food consumption 46.6% 51.4% 2.0% Clothing
More informationSensitivity checks for the local average treatment effect
Sensitivity checks for the local average treatment effect Martin Huber March 13, 2014 University of St. Gallen, Dept. of Economics Abstract: The nonparametric identification of the local average treatment
More informationImplementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston
Implementing Matching Estimators for Average Treatment Effects in STATA Guido W. Imbens - Harvard University Stata User Group Meeting, Boston July 26th, 2006 General Motivation Estimation of average effect
More informationEstimation of average treatment effects based on propensity scores
The Stata Journal (2002) 2, Number 4, pp. 358 377 Estimation of average treatment effects based on propensity scores Sascha O. Becker University of Munich Andrea Ichino EUI Abstract. In this paper, we
More informationWeighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.
Weighting Unconfounded Homework 2 Describe imbalance direction matters STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University
More informationAGEC 661 Note Fourteen
AGEC 661 Note Fourteen Ximing Wu 1 Selection bias 1.1 Heckman s two-step model Consider the model in Heckman (1979) Y i = X iβ + ε i, D i = I {Z iγ + η i > 0}. For a random sample from the population,
More informationLeast Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates
Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates Matthew Harding and Carlos Lamarche January 12, 2011 Abstract We propose a method for estimating
More informationApplied Econometrics Lecture 1
Lecture 1 1 1 Università di Urbino Università di Urbino PhD Programme in Global Studies Spring 2018 Outline of this module Beyond OLS (very brief sketch) Regression and causality: sources of endogeneity
More informationEconometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous
Econometrics of causal inference Throughout, we consider the simplest case of a linear outcome equation, and homogeneous effects: y = βx + ɛ (1) where y is some outcome, x is an explanatory variable, and
More informationNew Developments in Nonresponse Adjustment Methods
New Developments in Nonresponse Adjustment Methods Fannie Cobben January 23, 2009 1 Introduction In this paper, we describe two relatively new techniques to adjust for (unit) nonresponse bias: The sample
More informationDiscussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data
Biometrics 000, 000 000 DOI: 000 000 0000 Discussion of Identifiability and Estimation of Causal Effects in Randomized Trials with Noncompliance and Completely Non-ignorable Missing Data Dylan S. Small
More informationSC705: Advanced Statistics Instructor: Natasha Sarkisian Class notes: Introduction to Structural Equation Modeling (SEM)
SC705: Advanced Statistics Instructor: Natasha Sarkisian Class notes: Introduction to Structural Equation Modeling (SEM) SEM is a family of statistical techniques which builds upon multiple regression,
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors
More informationEconometric Analysis of Cross Section and Panel Data
Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND
More informationThe propensity score with continuous treatments
7 The propensity score with continuous treatments Keisuke Hirano and Guido W. Imbens 1 7.1 Introduction Much of the work on propensity score analysis has focused on the case in which the treatment is binary.
More information