I. Partial identification of causal parameters with multiple proposed instruments
|
|
- Wesley Floyd
- 5 years ago
- Views:
Transcription
1 Supplemental Materials Table of contents of Supplemental Materials: I. Partial identification of causal parameters with multiple proposed instruments II. Data example III. R code I. Partial identification of causal parameters with multiple proposed instruments We first consider bounding the global average causal effect of a dichotomous treatment A on a dichotomous outcome Y: E[Y a=1 ] E[Y a=0 ] where superscripts denote counterfactuals. We consider Z 1,, Z m pre-treatment dichotomous variables as possible instruments. To simplify notation, let LB k and UB k denote the lower and upper Balke-Pearl bounds computed using Z k, respectively. Below we present derivations for bounds for the average causal effect when all proposed instruments are assumed to be valid (Theorem 1) and when at least one proposed instrument is assumed to be valid (Theorem 2). While these marginally-computed summary bounds are valid, they are not sharp for the data generating procedure depicted in the causal diagram in Figure 2. Generally, narrower bounds will be achievable when the proposed instruments jointly satisfy the instrumental conditions: see Richardson and Robins 2014 for expressions for sharp bounds. The difference is in the assumptions being leveraged. The sharp bounds rely on the vector (Z 1,, Z m ) being independent of the counterfactual outcomes, while the marginally-computed bounds rely on Z 1 being marginally independent of the counterfactual outcomes, as well as Z 2,, and Z m. The logic of the proofs for Theorems 1-2 can readily apply to bounds for any causal parameter that can be bounded under a set of assumptions about a proposed instrument whenever the definition of that causal parameter is not dependent on the proposed instrument. For example, extensions immediately follow for: partial identification of a counterfactual risk (e.g., E[Y a=1 ]) and the causal risk ratio for dichotomous outcomes; partial identification of a mean counterfactual outcome and causal mean difference for continuous outcomes; and partial identification strategies combining the instrumental conditions with various sets of additional assumptions (e.g., see Swanson et al for further examples). Analogs of these proofs are not applicable to causal parameters for which the definitions are instrument-dependent (e.g., local average treatment effects), as each marginally-computed bound or point estimate is then targeting a different causal parameter. In principle, bounds can be identified for many permutations of assumed (marginally or jointly defined) instruments. For example, suppose in our dataset we assumed that the true data generating process followed one of the four causal diagrams in efigure 1, but we do not know which one specifically. In that case, we can compute bounds for each scenario assuming that the appropriate subset of proposed instruments jointly satisfies the instrumental conditions, and then take the union as the bounds under our collective assumptions. For a given dataset and research Swanson Supplementary Materials p.1
2 question, investigators could consider all relevant sets of assumptions (perhaps as encoded in a set of causal diagrams) and use the logic presented here to guide analyses. Theorem 1. Suppose Z 1,, Z m each satisfy the instrumental conditions marginally, and the intersection of their respective Balke-Pearl bounds is non-empty. Then E[Y a=1 ] E[Y a=0 ] [max k (LB k ), min k (UB k )]. Proof. For each Z k, the average causal effect is contained within [LB k, UB k ] under our assumption that Z k satisfies the instrumental conditions marginally (Balke & Pearl 1997). Therefore, the average causal effect is contained within all m sets: E[Y a=1 ] E[Y a=0 ] [LB 1, UB 1 ] E[Y a=1 ] E[Y a=0 ] [LB 2, UB 2 ] E[Y a=1 ] E[Y a=0 ] [LB m, UB m ] Taking the intersection of these sets, E[Y a=1 ] E[Y a=0 ] is contained within [max k (LB k ), min k (UB k )]. Note: If the intersection of [LB 1, UB 1 ], [LB 2, UB 2 ],, and [LB m, UB m ] is an empty set, then we have evidence that at least one of the proposed instruments Z 1,, Z m does not satisfy the instrumental conditions. This could be viewed as an alternative to the standard overidentification tests that rely on homogeneity conditions. However, this test would only detect very extreme violations of the instrumental conditions. Theorem 2. If at least one of Z 1,, Z m satisfies the instrumental conditions, then E[Y a=1 ] E[Y a=0 ] [min k (LB k ), max k (UB k )]. Proof. For an arbitrary Z k, if Z k satisfies the instrumental conditions, then the average causal effect lies within [LB k, UB k ] (Balke & Pearl 1997). Therefore, under our assumption that at least one of Z 1,, Z m satisfies the instrumental conditions, the average causal effect is contained within at least one of these m Balke-Pearl bounds. That is, at least one of the following expressions is true: E[Y a=1 ] E[Y a=0 ] [LB 1, UB 1 ] E[Y a=1 ] E[Y a=0 ] [LB 2, UB 2 ] E[Y a=1 ] E[Y a=0 ] [LB m, UB m ] If the average causal effect must be in at least one of these sets, then it must be in the union of the sets. Thus, E[Y a=1 ] E[Y a=0 ] is contained within [min k (LB k ), max k (UB k )]. II. Data example In the main text, bounds and point estimates from a toy dataset were presented for which the various methods relying on homogeneity conditions were biased, while assuming all proposed instruments jointly satisfied the instrumental conditions (without homogeneity) led to point identification. The data example was generated as follows. Swanson Supplementary Materials p.2
3 Z 1 ~ B(0.4) Z 2 ~ B(0.6) Z 3 ~ B(0.6) Z 4 ~ B(0.4) U ~ Multinom(0.1, 0.2, 0.3, 0.4) A = Z 1 U 1 + Z 2 U 2 + Z 3 U 3 + Z 4 U 4 Y a=0 ~ B(0.2U U U U 4 ) Y a=1 ~ B(0.1U U U U 4 ) Note that the four instruments are perfect predictors of treatment in four mutually-exclusive subpopulations with different effect sizes. As such, the homogeneity conditions are not satisfied for any proposed instrument. However, the bounds computed assuming Z 1, Z 2 Z 3, and Z 4 jointly satisfy the instrumental conditions collapse to a single value because it suffices to compare subjects with Z 1 =Z 2 =Z 3 =Z 4 =1 (who all will have A=1) to subjects with Z 1 =Z 2 =Z 3 =Z 4 =0 (who all will have A=0). Obviously, it is unlikely that human biology functions as simply as in the data example presented here. (It is also unlikely that it functions as homogeneously as presumed by pointidentification methods.) That said, the simplicity of the data generating procedure was not important for deriving valid bounds. It is, however, the reason the bounds in this particular dataset are so informative. Of note, the data example was used to illustrate different point and partial identification results, and for didactic purposes we did not consider the role of sampling variability. In practice, each point estimate or set of bounds should be accompanied by confidence intervals. See Imbens and Manski 2004 for some discussion on confidence intervals for partially identified parameters. III. R code ## Generating data ##sample size n, number of IVs k ##n very large to minimize role of sampling variability in illustration n < k <- 4 set.seed(66) ##generating data with four IVs (Z), treatment (A), ##[counterfactual and observed] outcome (Y), and unmeasured confounder (U) z <- matrix(nrow=n,ncol=k) z[1:n,1] <- rbinom(n=n,size=1,prob=0.6) z[1:n,2] <- rbinom(n=n,size=1,prob=0.4) z[1:n,3] <- rbinom(n=n,size=1,prob=0.4) z[1:n,4] <- rbinom(n=n,size=1,prob=0.6) u <- matrix(nrow=n,ncol=4) u <- t(rmultinom(n=n,size=1,prob=c(0.1,0.2,0.3,0.4))) Swanson Supplementary Materials p.3
4 a <- matrix(nrow=n,ncol=1) y <- matrix(nrow=n,ncol=3) for(i in 1:n) { a[i,1] <- min(z[i,1]*u[i,1] + z[i,2]*u[i,2] + z[i,3]*u[i,3] + z[i,4]*u[i,4],1) y[i,1] <- rbinom(n=1,size=1,prob= 0.20*u[i,1]+ 0.10*u[i,2] + 0.3*u[i,3]+ 0.1*u[i,4]) y[i,2] <- rbinom(n=1,size=1,prob= 0.10*u[i,1]+ 0.20*u[i,2] + 0.2*u[i,3]+ 0.9*u[i,4]) y[i,3] <- ifelse(a[i,1]==1,y[i,2],y[i,1]) dat <- cbind(z,u,a,y) dat <- as.data.frame(dat) colnames(dat) <- c(paste('z',1:k,sep=''),paste('u',1:ncol(u),sep=''),'a','y0','y1','y') ## Computing bounds and point estimates ##summary dataset for storing single IV point estimates and bounds summarydat <- matrix(nrow=k,ncol=5) colnames(summarydat) <- c('lb','ub','ivnum','ivden','ivest') ##computing bounds and IV ratio for each Zk ##Pr[Y=y,A=a Z=z]=pya.z dat$set <- dat[,i] p00.0 <- sum(with(dat,set==0 & a==0 & y==0))/sum(with(dat,set==0)) p01.0 <- sum(with(dat,set==0 & a==1 & y==0))/sum(with(dat,set==0)) p10.0 <- sum(with(dat,set==0 & a==0 & y==1))/sum(with(dat,set==0)) p11.0 <- sum(with(dat,set==0 & a==1 & y==1))/sum(with(dat,set==0)) p00.1 <- sum(with(dat,set==1 & a==0 & y==0))/sum(with(dat,set==1)) p01.1 <- sum(with(dat,set==1 & a==1 & y==0))/sum(with(dat,set==1)) p10.1 <- sum(with(dat,set==1 & a==0 & y==1))/sum(with(dat,set==1)) p11.1 <- sum(with(dat,set==1 & a==1 & y==1))/sum(with(dat,set==1)) summarydat[i,1] <- max( p p00.0-1, p p00.1-1, p p p p p10.0, p p p p p10.1, -p p10.1, -p p10.0, p p p p p00.0, p p p p p00.1 ) summarydat[i,2] <- min( 1 - p p10.0, 1 - p p10.1, -p p p p p00.0, -p p p p p00.1, p p00.1, p p00.0, -p p p p p10.0, -p p p p p10.1 ) py1.1 <- sum(with(dat,set==1 & y==1))/sum(with(dat,set==1)) py1.0 <- sum(with(dat,set==0 & y==1))/sum(with(dat,set==0)) pa1.1 <- sum(with(dat,set==1 & a==1))/sum(with(dat,set==1)) pa1.0 <- sum(with(dat,set==0 & a==1))/sum(with(dat,set==0)) summarydat[i,3] <- (py1.1 - py1.0) summarydat[i,4] <- (pa1.1 - pa1.0) summarydat[i,5] <- (py1.1 - py1.0)/(pa1.1 - pa1.0) ##computing joint IV bounds ##for our contrived data example, it is easy to 'see' the relevant strata ##when Z1=Z2=Z3=Z4=1 and when Z1=Z2=Z3=Z4=0 ##in general, see Richardson and Robins 2014 for the full expression ##note: this code is specific to this particular dataset! dat$nzs <- rowsums(dat[,1:k]) Swanson Supplementary Materials p.4
5 py1.zz1 <- sum(with(dat,y==1 & nzs==k))/sum(with(dat,nzs==k)) py1.zz0 <- sum(with(dat,y==1 & nzs==0))/sum(with(dat,nzs==0)) jointp <- py1.zz1 - py1.zz0 ## Table 1 dat$set <- dat[,i] print(i) print(round(sum(with(dat,set==0 & y==1))/sum(with(dat,set==0)),2)) print(round(sum(with(dat,set==1 & y==1))/sum(with(dat,set==1)),2)) print(round(sum(with(dat,set==0 & a==1))/sum(with(dat,set==0)),2)) print(round(sum(with(dat,set==1 & a==1))/sum(with(dat,set==1)),2)) ## Figure 1 plot(0,0,type='n',xlim=c(-1,1),ylim=c(16,0),ylab='',xlab='causal Risk Difference',yaxt='n') abline(v=mean(dat$y1) - mean(dat$y0),lty=3) abline(v=0,lty=1) ##IV ratios text(-1.1,0,bquote(underline('iv ratios')),pos=4) points(summarydat[i,5],i,col='gray',pch=19,lwd=2) text(-1.1,1,expression('z'[1]),pos=4) text(-1.1,2,expression('z'[2]),pos=4) text(-1.1,3,expression('z'[3]),pos=4) text(-1.1,4,expression('z'[4]),pos=4) ##robust methods (simple examples) ##note: see Burgess et al. for further considerations re: use and estimation ##only simplest versions are shown here to demonstrate potential bias due to heterogeneity ##see also Burgess et al. for sensitivity analyses that may inform the use of these methods text(-1.1,5,bquote(underline('examples of robust methods')),pos=4) ##simple median points(median(summarydat[1:k,5]),6,col='gray',pch=19,lwd=2) text(-1.1,6,'median-based',pos=4) ##simple penalization-based method where we leave out the extreme ##(in these data, most obvious outlier to leave out is the IV ratio for Z4) points(mean(summarydat[1:3,5]),7,col='gray',pch=19,lwd=2) text(-1.1,7,'penalization-based',pos=4) ##MR-Egger ##see Burgess et al. for weighting given finite sample estimates ##MR-Egger developed with a linear model (no constraints for our dichotomous Y) points(lm(summarydat[,3]~summarydat[,4])$coeff[2],8,col='gray', pch=19,lwd=2) text(-1.1,8,'egger',pos=4) ##bounds text(-1.1,9,bquote(underline('bounds under sets of assumed IVs')),pos=4) segments(summarydat[i,1],i+9,summarydat[i,2],i+9,col='black',lwd=2) text(-1.1,10,expression('z'[1]),pos=4) text(-1.1,11,expression('z'[2]),pos=4) text(-1.1,12,expression('z'[3]),pos=4) text(-1.1,13,expression('z'[4]),pos=4) segments(max(summarydat[,1],na.rm=true),14,min(summarydat[,2],na.rm=true),14,col='black',lwd=2) text(-1.1,14,expression('z'[1]*',z'[2]*',z'[3]*',z'[4]*' (marginally)'),pos=4) segments(min(summarydat[,1],na.rm=true),15,max(summarydat[,2],na.rm=true),15,col='black',lwd=2) text(-1.1,15,expression('at least one: Z'[1]*',Z'[2]*',Z'[3]*',Z'[4]),pos=4) points(jointp,16,col='black',lwd=2,pch=19) text(-1.1,16,expression('joint: (Z'[1]*',Z'[2]*',Z'[3]*',Z'[4]*')'),pos=4) Swanson Supplementary Materials p.5
6 efigure 1. Causal diagrams depicting examples of data generating procedures in which three of the four proposed instruments are valid. Z 1, Z 2, Z 3, and Z 4 are proposed instruments for the effect of treatment A on outcome Y; U represents unmeasured confounders. Z 1, Z 2, Z 3, and Z 4 do not meet the instrumental conditions in A), B), C), and D), respectively, due to confounding of the proposed instrument. References Balke A, Pearl J. Bounds on treatment effects for studies with imperfect compliance. Journal of the American Statistical Association. 1997;92(439): Imbens GW, Manski CF. Confidence intervals for partially identified parameters. Econometrica. 2004: Richardson TS, Robins JM. ACE Bounds; SEMs with Equilibrium Conditions. Statistical Science. 2014;29(3): Swanson SA, Holme Ø, Løberg M, et al. Bounding the per-protocol effect in randomized trials: an application to colorectal cancer screening. Trials. 2015;16(1):1-11. Swanson Supplementary Materials p.6
Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh
Comparison of Three Approaches to Causal Mediation Analysis Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh Introduction Mediation defined using the potential outcomes framework natural effects
More informationMarginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal
Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect
More informationIDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL
IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL BRENDAN KLINE AND ELIE TAMER Abstract. Randomized trials (RTs) are used to learn about treatment effects. This paper
More informationSensitivity analysis and distributional assumptions
Sensitivity analysis and distributional assumptions Tyler J. VanderWeele Department of Health Studies, University of Chicago 5841 South Maryland Avenue, MC 2007, Chicago, IL 60637, USA vanderweele@uchicago.edu
More informationSupplementary material to: Tolerating deance? Local average treatment eects without monotonicity.
Supplementary material to: Tolerating deance? Local average treatment eects without monotonicity. Clément de Chaisemartin September 1, 2016 Abstract This paper gathers the supplementary material to de
More information6.3 How the Associational Criterion Fails
6.3. HOW THE ASSOCIATIONAL CRITERION FAILS 271 is randomized. We recall that this probability can be calculated from a causal model M either directly, by simulating the intervention do( = x), or (if P
More informationCausal inference in epidemiological practice
Causal inference in epidemiological practice Willem van der Wal Biostatistics, Julius Center UMC Utrecht June 5, 2 Overview Introduction to causal inference Marginal causal effects Estimating marginal
More informationEstimating the Marginal Odds Ratio in Observational Studies
Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios
More informationSTAT 302 Introduction to Probability Learning Outcomes. Textbook: A First Course in Probability by Sheldon Ross, 8 th ed.
STAT 302 Introduction to Probability Learning Outcomes Textbook: A First Course in Probability by Sheldon Ross, 8 th ed. Chapter 1: Combinatorial Analysis Demonstrate the ability to solve combinatorial
More informationIdenti cation of Positive Treatment E ects in. Randomized Experiments with Non-Compliance
Identi cation of Positive Treatment E ects in Randomized Experiments with Non-Compliance Aleksey Tetenov y February 18, 2012 Abstract I derive sharp nonparametric lower bounds on some parameters of the
More informationBehavioral Data Mining. Lecture 19 Regression and Causal Effects
Behavioral Data Mining Lecture 19 Regression and Causal Effects Outline Counterfactuals and Potential Outcomes Regression Models Causal Effects from Matching and Regression Weighted regression Counterfactuals
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2004 Paper 155 Estimation of Direct and Indirect Causal Effects in Longitudinal Studies Mark J. van
More informationExtending causal inferences from a randomized trial to a target population
Extending causal inferences from a randomized trial to a target population Issa Dahabreh Center for Evidence Synthesis in Health, Brown University issa dahabreh@brown.edu January 16, 2019 Issa Dahabreh
More informationRobust instrumental variable methods using multiple candidate instruments with application to Mendelian randomization
Robust instrumental variable methods using multiple candidate instruments with application to Mendelian randomization arxiv:1606.03729v1 [stat.me] 12 Jun 2016 Stephen Burgess 1, Jack Bowden 2 Frank Dudbridge
More informationGraphical Representation of Causal Effects. November 10, 2016
Graphical Representation of Causal Effects November 10, 2016 Lord s Paradox: Observed Data Units: Students; Covariates: Sex, September Weight; Potential Outcomes: June Weight under Treatment and Control;
More informationSensitivity checks for the local average treatment effect
Sensitivity checks for the local average treatment effect Martin Huber March 13, 2014 University of St. Gallen, Dept. of Economics Abstract: The nonparametric identification of the local average treatment
More informationCombining multiple observational data sources to estimate causal eects
Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,
More informationCausal Inference with Counterfactuals
Causal Inference with Counterfactuals Robin Evans robin.evans@stats.ox.ac.uk Hilary 2014 1 Introduction What does it mean to say that a (possibly random) variable X is a cause of the random variable Y?
More informationStatistical Models for Causal Analysis
Statistical Models for Causal Analysis Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 Three Modes of Statistical Inference 1. Descriptive Inference: summarizing and exploring
More informationTechnical Track Session I: Causal Inference
Impact Evaluation Technical Track Session I: Causal Inference Human Development Human Network Development Network Middle East and North Africa Region World Bank Institute Spanish Impact Evaluation Fund
More informationPackage noncompliance
Type Package Package noncompliance February 15, 2016 Title Causal Inference in the Presence of Treatment Noncompliance Under the Binary Instrumental Variable Model Version 0.2.2 Date 2016-02-11 A finite-population
More informationIsoLATEing: Identifying Heterogeneous Effects of Multiple Treatments
IsoLATEing: Identifying Heterogeneous Effects of Multiple Treatments Peter Hull December 2014 PRELIMINARY: Please do not cite or distribute without permission. Please see www.mit.edu/~hull/research.html
More informationPackage gma. September 19, 2017
Type Package Title Granger Mediation Analysis Version 1.0 Date 2018-08-23 Package gma September 19, 2017 Author Yi Zhao , Xi Luo Maintainer Yi Zhao
More informationCMPSCI 240: Reasoning about Uncertainty
CMPSCI 240: Reasoning about Uncertainty Lecture 2: Sets and Events Andrew McGregor University of Massachusetts Last Compiled: January 27, 2017 Outline 1 Recap 2 Experiments and Events 3 Probabilistic Models
More informationThe International Journal of Biostatistics
The International Journal of Biostatistics Volume 7, Issue 1 2011 Article 16 A Complete Graphical Criterion for the Adjustment Formula in Mediation Analysis Ilya Shpitser, Harvard University Tyler J. VanderWeele,
More informationLecture Outline. Biost 518 Applied Biostatistics II. Choice of Model for Analysis. Choice of Model. Choice of Model. Lecture 10: Multiple Regression:
Biost 518 Applied Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University of Washington Lecture utline Choice of Model Alternative Models Effect of data driven selection of
More informationOUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores
OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS 776 1 15 Outcome regressions and propensity scores Outcome Regression and Propensity Scores ( 15) Outline 15.1 Outcome regression 15.2 Propensity
More informationPrinciples Underlying Evaluation Estimators
The Principles Underlying Evaluation Estimators James J. University of Chicago Econ 350, Winter 2019 The Basic Principles Underlying the Identification of the Main Econometric Evaluation Estimators Two
More informationCausal Inference. Miguel A. Hernán, James M. Robins. May 19, 2017
Causal Inference Miguel A. Hernán, James M. Robins May 19, 2017 ii Causal Inference Part III Causal inference from complex longitudinal data Chapter 19 TIME-VARYING TREATMENTS So far this book has dealt
More informationGov 2002: 4. Observational Studies and Confounding
Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What
More informationOnline Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha
Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha January 18, 2010 A2 This appendix has six parts: 1. Proof that ab = c d
More informationReview of Statistics
Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and
More informationStructural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall
1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Dept. of Biostatistics, Duke University Medical Joint work
More informationTrimming for Bounds on Treatment Effects with Missing Outcomes *
CENTER FOR LABOR ECONOMICS UNIVERSITY OF CALIFORNIA, BERKELEY WORKING PAPER NO. 5 Trimming for Bounds on Treatment Effects with Missing Outcomes * David S. Lee UC Berkeley and NBER March 2002 Abstract
More informationUsing Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models
Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models James J. Heckman and Salvador Navarro The University of Chicago Review of Economics and Statistics 86(1)
More informationInstrumental Variables
Instrumental Variables Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Noncompliance in Experiments Stat186/Gov2002 Fall 2018 1 / 18 Instrumental Variables
More informationCasual Mediation Analysis
Casual Mediation Analysis Tyler J. VanderWeele, Ph.D. Upcoming Seminar: April 21-22, 2017, Philadelphia, Pennsylvania OXFORD UNIVERSITY PRESS Explanation in Causal Inference Methods for Mediation and Interaction
More informationCausal Inference with General Treatment Regimes: Generalizing the Propensity Score
Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke
More informationRank preserving Structural Nested Distribution Model (RPSNDM) for Continuous
Rank preserving Structural Nested Distribution Model (RPSNDM) for Continuous Y : X M Y a=0 = Y a a m = Y a cum (a) : Y a = Y a=0 + cum (a) an unknown parameter. = 0, Y a = Y a=0 = Y for all subjects Rank
More informationIdentification and Estimation of Causal Effects from Dependent Data
Identification and Estimation of Causal Effects from Dependent Data Eli Sherman esherman@jhu.edu with Ilya Shpitser Johns Hopkins Computer Science 12/6/2018 Eli Sherman Identification and Estimation of
More informationAssess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014
Assess Assumptions and Sensitivity Analysis Fan Li March 26, 2014 Two Key Assumptions 1. Overlap: 0
More informationLecture Discussion. Confounding, Non-Collapsibility, Precision, and Power Statistics Statistical Methods II. Presented February 27, 2018
, Non-, Precision, and Power Statistics 211 - Statistical Methods II Presented February 27, 2018 Dan Gillen Department of Statistics University of California, Irvine Discussion.1 Various definitions of
More informationExtending the MR-Egger method for multivariable Mendelian randomization to correct for both measured and unmeasured pleiotropy
Received: 20 October 2016 Revised: 15 August 2017 Accepted: 23 August 2017 DOI: 10.1002/sim.7492 RESEARCH ARTICLE Extending the MR-Egger method for multivariable Mendelian randomization to correct for
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint
More informationProbability Theory and Applications
Probability Theory and Applications Videos of the topics covered in this manual are available at the following links: Lesson 4 Probability I http://faculty.citadel.edu/silver/ba205/online course/lesson
More informationBounds on Direct Effects in the Presence of Confounded Intermediate Variables
Bounds on Direct Effects in the Presence of Confounded Intermediate Variables Zhihong Cai, 1, Manabu Kuroki, 2 Judea Pearl 3 and Jin Tian 4 1 Department of Biostatistics, Kyoto University Yoshida-Konoe-cho,
More informationEstimating direct effects in cohort and case-control studies
Estimating direct effects in cohort and case-control studies, Ghent University Direct effects Introduction Motivation The problem of standard approaches Controlled direct effect models In many research
More informationCausal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies
Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed
More informationOutline
2559 Outline cvonck@111zeelandnet.nl 1. Review of analysis of variance (ANOVA), simple regression analysis (SRA), and path analysis (PA) 1.1 Similarities and differences between MRA with dummy variables
More informationAP Statistics Cumulative AP Exam Study Guide
AP Statistics Cumulative AP Eam Study Guide Chapters & 3 - Graphs Statistics the science of collecting, analyzing, and drawing conclusions from data. Descriptive methods of organizing and summarizing statistics
More informationOUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (www.cs.ucla.
OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/) Statistical vs. Causal vs. Counterfactual inference: syntax and semantics
More informationMath-Stat-491-Fall2014-Notes-I
Math-Stat-491-Fall2014-Notes-I Hariharan Narayanan October 2, 2014 1 Introduction This writeup is intended to supplement material in the prescribed texts: Introduction to Probability Models, 10th Edition,
More informationBootstrapping Sensitivity Analysis
Bootstrapping Sensitivity Analysis Qingyuan Zhao Department of Statistics, The Wharton School University of Pennsylvania May 23, 2018 @ ACIC Based on: Qingyuan Zhao, Dylan S. Small, and Bhaswar B. Bhattacharya.
More informationMATH 118 FINAL EXAM STUDY GUIDE
MATH 118 FINAL EXAM STUDY GUIDE Recommendations: 1. Take the Final Practice Exam and take note of questions 2. Use this study guide as you take the tests and cross off what you know well 3. Take the Practice
More informationCAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea)
CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) OUTLINE Inference: Statistical vs. Causal distinctions and mental barriers Formal semantics
More informationAdvanced Statistical Methods for Observational Studies L E C T U R E 0 1
Advanced Statistical Methods for Observational Studies L E C T U R E 0 1 introduction this class Website Expectations Questions observational studies The world of observational studies is kind of hard
More informationReview of Basic Probability
Review of Basic Probability Erik G. Learned-Miller Department of Computer Science University of Massachusetts, Amherst Amherst, MA 01003 September 16, 2009 Abstract This document reviews basic discrete
More informationCausal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD
Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification Todd MacKenzie, PhD Collaborators A. James O Malley Tor Tosteson Therese Stukel 2 Overview 1. Instrumental variable
More informationIdentification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case
Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College Original December 2016, revised July 2017 Abstract Lewbel (2012)
More informationHarvard University. Harvard University Biostatistics Working Paper Series
Harvard University Harvard University Biostatistics Working Paper Series Year 2010 Paper 117 Estimating Causal Effects in Trials Involving Multi-treatment Arms Subject to Non-compliance: A Bayesian Frame-work
More informationIgnoring the matching variables in cohort studies - when is it valid, and why?
Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association
More informationThe GENIUS Approach to Robust Mendelian Randomization Inference
Original Article The GENIUS Approach to Robust Mendelian Randomization Inference Eric J. Tchetgen Tchetgen, BaoLuo Sun, Stefan Walter Department of Biostatistics,Harvard University arxiv:1709.07779v5 [stat.me]
More informationCausal Inference Basics
Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,
More informationIntroduction to Causal Inference. Solutions to Quiz 4
Introduction to Causal Inference Solutions to Quiz 4 Teppei Yamamoto Tuesday, July 9 206 Instructions: Write your name in the space provided below before you begin. You have 20 minutes to complete the
More informationEffect Modification and Interaction
By Sander Greenland Keywords: antagonism, causal coaction, effect-measure modification, effect modification, heterogeneity of effect, interaction, synergism Abstract: This article discusses definitions
More informationAGEC 661 Note Fourteen
AGEC 661 Note Fourteen Ximing Wu 1 Selection bias 1.1 Heckman s two-step model Consider the model in Heckman (1979) Y i = X iβ + ε i, D i = I {Z iγ + η i > 0}. For a random sample from the population,
More informationPropensity Score Weighting with Multilevel Data
Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationA counterfactual approach to bias and effect modification in terms of response types
uzuki et al. BM Medical Research Methodology 2013, 13:101 RARH ARTIL Open Access A counterfactual approach to bias and effect modification in terms of response types tsuji uzuki 1*, Toshiharu Mitsuhashi
More informationIdentification Analysis for Randomized Experiments with Noncompliance and Truncation-by-Death
Identification Analysis for Randomized Experiments with Noncompliance and Truncation-by-Death Kosuke Imai First Draft: January 19, 2007 This Draft: August 24, 2007 Abstract Zhang and Rubin 2003) derives
More informationLearning Objectives for Stat 225
Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:
More informationRatio of Mediator Probability Weighting for Estimating Natural Direct and Indirect Effects
Ratio of Mediator Probability Weighting for Estimating Natural Direct and Indirect Effects Guanglei Hong University of Chicago, 5736 S. Woodlawn Ave., Chicago, IL 60637 Abstract Decomposing a total causal
More information2) There should be uncertainty as to which outcome will occur before the procedure takes place.
robability Numbers For many statisticians the concept of the probability that an event occurs is ultimately rooted in the interpretation of an event as an outcome of an experiment, others would interpret
More informationMichael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp
page 1 Lecture 7 The Regression Discontinuity Design fuzzy and sharp page 2 Regression Discontinuity Design () Introduction (1) The design is a quasi-experimental design with the defining characteristic
More informationEconometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous
Econometrics of causal inference Throughout, we consider the simplest case of a linear outcome equation, and homogeneous effects: y = βx + ɛ (1) where y is some outcome, x is an explanatory variable, and
More informationFront-Door Adjustment
Front-Door Adjustment Ethan Fosse Princeton University Fall 2016 Ethan Fosse Princeton University Front-Door Adjustment Fall 2016 1 / 38 1 Preliminaries 2 Examples of Mechanisms in Sociology 3 Bias Formulas
More informationLecture 4: Two-point Sampling, Coupon Collector s problem
Randomized Algorithms Lecture 4: Two-point Sampling, Coupon Collector s problem Sotiris Nikoletseas Associate Professor CEID - ETY Course 2013-2014 Sotiris Nikoletseas, Associate Professor Randomized Algorithms
More informationAIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)
AIM HIGH SCHOOL Curriculum Map 2923 W. 12 Mile Road Farmington Hills, MI 48334 (248) 702-6922 www.aimhighschool.com COURSE TITLE: Statistics DESCRIPTION OF COURSE: PREREQUISITES: Algebra 2 Students will
More informationDirected acyclic graphs with edge-specific bounds
Biometrika (2012), 99,1,pp. 115 126 doi: 10.1093/biomet/asr059 C 2011 Biometrika Trust Advance Access publication 20 December 2011 Printed in Great Britain Directed acyclic graphs with edge-specific bounds
More informationA Decision Theoretic Approach to Causality
A Decision Theoretic Approach to Causality Vanessa Didelez School of Mathematics University of Bristol (based on joint work with Philip Dawid) Bordeaux, June 2011 Based on: Dawid & Didelez (2010). Identifying
More informationSpecification Errors, Measurement Errors, Confounding
Specification Errors, Measurement Errors, Confounding Kerby Shedden Department of Statistics, University of Michigan October 10, 2018 1 / 32 An unobserved covariate Suppose we have a data generating model
More informationarxiv: v2 [stat.me] 21 Nov 2016
Biometrika, pp. 1 8 Printed in Great Britain On falsification of the binary instrumental variable model arxiv:1605.03677v2 [stat.me] 21 Nov 2016 BY LINBO WANG Department of Biostatistics, Harvard School
More informationDiscussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data
Biometrics 000, 000 000 DOI: 000 000 0000 Discussion of Identifiability and Estimation of Causal Effects in Randomized Trials with Noncompliance and Completely Non-ignorable Missing Data Dylan S. Small
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint
More informationMediation Analysis: A Practitioner s Guide
ANNUAL REVIEWS Further Click here to view this article's online features: Download figures as PPT slides Navigate linked references Download citations Explore related articles Search keywords Mediation
More informationSmall-sample cluster-robust variance estimators for two-stage least squares models
Small-sample cluster-robust variance estimators for two-stage least squares models ames E. Pustejovsky The University of Texas at Austin Context In randomized field trials of educational interventions,
More informationMediation analyses. Advanced Psychometrics Methods in Cognitive Aging Research Workshop. June 6, 2016
Mediation analyses Advanced Psychometrics Methods in Cognitive Aging Research Workshop June 6, 2016 1 / 40 1 2 3 4 5 2 / 40 Goals for today Motivate mediation analysis Survey rapidly developing field in
More informationGeoffrey T. Wodtke. University of Toronto. Daniel Almirall. University of Michigan. Population Studies Center Research Report July 2015
Estimating Heterogeneous Causal Effects with Time-Varying Treatments and Time-Varying Effect Moderators: Structural Nested Mean Models and Regression-with-Residuals Geoffrey T. Wodtke University of Toronto
More informationJournal of Biostatistics and Epidemiology
Journal of Biostatistics and Epidemiology Methodology Marginal versus conditional causal effects Kazem Mohammad 1, Seyed Saeed Hashemi-Nazari 2, Nasrin Mansournia 3, Mohammad Ali Mansournia 1* 1 Department
More informationSimple Sensitivity Analysis for Differential Measurement Error. By Tyler J. VanderWeele and Yige Li Harvard University, Cambridge, MA, U.S.A.
Simple Sensitivity Analysis for Differential Measurement Error By Tyler J. VanderWeele and Yige Li Harvard University, Cambridge, MA, U.S.A. Abstract Simple sensitivity analysis results are given for differential
More informationModule 1. Probability
Module 1 Probability 1. Introduction In our daily life we come across many processes whose nature cannot be predicted in advance. Such processes are referred to as random processes. The only way to derive
More informationCausality II: How does causal inference fit into public health and what it is the role of statistics?
Causality II: How does causal inference fit into public health and what it is the role of statistics? Statistics for Psychosocial Research II November 13, 2006 1 Outline Potential Outcomes / Counterfactual
More informationVariable selection and machine learning methods in causal inference
Variable selection and machine learning methods in causal inference Debashis Ghosh Department of Biostatistics and Informatics Colorado School of Public Health Joint work with Yeying Zhu, University of
More informationGeneralized Do-Calculus with Testable Causal Assumptions
eneralized Do-Calculus with Testable Causal Assumptions Jiji Zhang Division of Humanities and Social Sciences California nstitute of Technology Pasadena CA 91125 jiji@hss.caltech.edu Abstract A primary
More informationMarginal and Nested Structural Models Using Instrumental Variables
Marginal and Nested Structural Models Using Instrumental Variables Zhiqiang TAN The objective of many scientific studies is to evaluate the effect of a treatment on an outcome of interest ceteris paribus.
More informationReview Basic Probability Concept
Economic Risk and Decision Analysis for Oil and Gas Industry CE81.9008 School of Engineering and Technology Asian Institute of Technology January Semester Presented by Dr. Thitisak Boonpramote Department
More informationarxiv: v2 [math.st] 4 Mar 2013
Running head:: LONGITUDINAL MEDIATION ANALYSIS 1 arxiv:1205.0241v2 [math.st] 4 Mar 2013 Counterfactual Graphical Models for Longitudinal Mediation Analysis with Unobserved Confounding Ilya Shpitser School
More informationEstimating and contextualizing the attenuation of odds ratios due to non-collapsibility
Estimating and contextualizing the attenuation of odds ratios due to non-collapsibility Stephen Burgess Department of Public Health & Primary Care, University of Cambridge September 6, 014 Short title:
More informationThe Union and Intersection for Different Configurations of Two Events Mutually Exclusive vs Independency of Events
Section 1: Introductory Probability Basic Probability Facts Probabilities of Simple Events Overview of Set Language Venn Diagrams Probabilities of Compound Events Choices of Events The Addition Rule Combinations
More informationStatistical inference in Mendelian randomization: From genetic association to epidemiological causation
Statistical inference in Mendelian randomization: From genetic association to epidemiological causation Department of Statistics, The Wharton School, University of Pennsylvania March 1st, 2018 @ UMN Based
More information