Bayesian nonparametric predictive approaches for causal inference: Regression Discontinuity Methods
|
|
- Mabel Willis
- 5 years ago
- Views:
Transcription
1 Bayesian nonparametric predictive approaches for causal inference: Regression Discontinuity Methods George Karabatsos University of Illinois-Chicago ERCIM Conference, December, 2013 Senate House, University of London Session ES38: Bayesian Nonparametric Regression Sunday , 08:45-10:25 In collaboration with S.G. Walker Research is supported by NSF-MMS Grant SES
2 Introduction: Outline I. Review causal inference framework (counterfactual) II. Randomized studies and non-randomized studies. The regression discontinuity (RD) design for non-randomized studies. (Thistlewaite & Campbell, 1960; Cook, 2008) Causal Modeling Framework: DAG and extended conditional independence (Dawid, 2002, 2010) III. Issues of current causal models for RD designs. IV. Propose a Bayesian nonparametric regression model for RD designs. Sharp RD design (full treatment compliance among subjects). For a fuzzy RD design (imperfect treatment compliance). V. Illustrate Bayesian nonparametric model on two real data sets. VI. Impact of new teacher education curriculum on student performance Impact of basic skills on teaching ability. Consider more recent work on RD-based causal inference, involving the restricted DP mixture of linear regressions model (Wade, Walker, & Petrone, 2013), which more directly exploits the local randomization feature of RD designs. Time permitting. 2
3 Introduction: Randomized Studies Causal inference: A basic aim of scientific research. Randomized studies: gold standard of causal inference (Rubin, 2008). Randomization ensures that the (pretreatment) covariate distribution does not differ between treatment subjects and non-treatment subjects. Then, any difference in treatment outcomes and non-treatment outcomes are only due to changes in the treatment variable, i.e., is attributable to the causal effect of the treatment on the outcome. A randomized study is often infeasible: financial, ethical, or timeliness. Regression Discontinuity (RD) Design Y Outcome R Assignment Variable A = 1(R > r 0 ) T Treatment assignment indicator Treatment receipt indicator. Sharp RD: Full compliance (A = T). Fuzzy RD: Imperfect compliance. For subjects with R observations located near r 0, treatments are as good as randomly assigned, under mild conditions. 3
4 Outcomes Y Sharp RD Design Illustration Non-treatment (T = 0; R <.6) Treatment (T = 1; R >=.6) At the cutoff of.6, the average jump size of Y from black line (control) to red line (treatment) is Assignment Variable R 4
5 RD Assumptions for Causal Inference Characterizing (RD) assumption: lim r r0 E(T r) lim r r0 E(T r) Sharp RD: f(t r) = Pr(T = t r) = t1(r r 0 ) + (1 t)1(r < r 0 ) Probabilistic DAG for sharp RD: R {, r 0 }, T {, 0, 1}, are intervention parameters. Y : general regime parameter that specifies the circumstances of Y; experimental conditions, environment, kind of subject, etc. DAG implies conditional independence properties: R T, Y R, T ( Y, R ) R, T, Y ( T, R ) R, T, Y Local stability ( SUTVA ): Y Ψ Y R = r 0, T, i.e., f(y r 0, t, ψ R ) = f(y r 0, t). In idle state, i.e., ψ R = ψ T =, joint p.d.f. is left at undisturbed state : f(r, t, y) = f(r)f(t r)f(y r, t). All previous CI assumptions imply a causal property: Y (Ψ T, Ψ R ) R = r 0, T An intervention regime (Ψ R = r 0, Ψ T = t 0 ), t 0 {0,1}, modifies f(r, t, y) to f(y r 0, t 0 ) = f(y r 0, t 0 ). Causal effect: comparison of functionals of f(y r 0, T = 1) and f(y r 0, T = 0). 5
6 RD Assumptions for Causal Inference Causal effect: comparison of functionals of f(y r 0, T = 1) and f(y r 0, T = 0). Conditioning on R = r 0 is motivated by the following assumption. Local Randomization (LR) (Lee, 2008): Each subject, described by all unobserved and observed pre-treatment covariates, W, has "imprecise control" over R, i.e., F R (r w) = Pr(R r w) is continuous in r at r 0, with 0 < F R (r 0 w) < 1. Then the p.d.f. of all observed pretreatment covariates, f(x w), is the same for all subjects just to the left and just to the right of the cutoff r 0. Estimate of causal effect of T on Y: Sharp RD: E(h{Y} r + ) E(h{Y} r ), for any chosen functional h{ }, where r+ denotes setting (R = r 0, 1(R r 0 ) = 1), and r denotes setting (R = r 0, 1(R r 0 ) = 0), as covariates in a regression model. Fuzzy RD (imperfect treatment compliance; f(t r) not point-mass): [E(h{Y} r + ) E(h{Y} r ) ] / [E(T r + ) E(T r ) ] under additional assumption of local exclusion restriction, i.e., conditionally on R = r 0, any effect of A on Y is only through T. 6
7 Standard Models for RD designs A standard model for sharp RD designs (e.g., Bloom, 2012): Y i = (r i ) + 1(r i > r 0 ) + 2 (r i ) 1(r i > r 0 ) + i, i ~ N(0, 2 ) is the average causal effect of the treatment; 1 (r i ) and 2 (r i ) are each linear or polynomial effects of R. Estimate of causal effect ( ) can be easily biased by outliers. Local linear models (Fan & Gijbels, 1996) provide an outlier-resistant alternative (Imbens & Lemieux, 2008). A bandwidth parameter is chosen to assign higher weight to observations that are located close around the cutoff r 0 (Imbens & Lemieux, 2008). The local linear model has been extended to provide quantile regression, to provide causal effects in terms of quantiles (Frandsen et al. 2012). Local linear models can estimate the effect h in either sharp or fuzzy RD. However: Bandwidth choices only have large-sample justifications (Imbens & Kalyanaraman, 2012). Quantile regression method has the quantile-crossing problem. 7
8 Modeling RD designs For RD designs, a regression model is desired: That is flexible enough to make accurate predictions, while being able to capture r 0 -local effects. Accurate estimation of causal effects relies on a predictively-accurate regression model. That can provide coherent inferences of the causal effect of the treatment (versus the non-treatment), on the outcome Y, either in terms of the outcome s mean, variance, chosen quantiles, probability density, of Y. (i.e., for general functionals h{ } of Y). That would involve no quantile crossing problems. 8
9 IPMW Model for Sharp RD Designs f y i r i j j r, r j r r n y i j, j 2 j r i, r i, i 1,,n, r 0 1 r 2 1 r r 0 j 1 r r r exp 0 1 r 2 1 r r 0 1/2 j, j 2 N j, 2 IG j 2 1,b, 2 N 0, 0 2 Un 0,b b,, Ga b a 0,b 0, IPMW: Infinite-Probits Mixture Weights model (Karabatsos & Walker, 2012,EJS). 9
10 Density f(y r) Mixture weight j (r) IPMW Model (Karabatsos-Walker 12) f y r n y j, 2 j j r, r j r 0 1 r 2 1 r r 0 r exp 0 1 r 2 1 r r 0 1/2 1 (r) = 1/20 1 (r) = 1/2 Weights ω j ( (r), (r)) indicate how well r explains Y. (r) controls multimodality. If 2 0 or 2 0, then there is a regression discontinuity causal effect of T on p.d.f. of Y. 1 (r) = 1 1 (r) = Index j Index j Index j Index j y y y y
11 Posterior Predictive Inference Fast MCMC sampling/estimation of posterior Π(ζ Data) i f(y i r i ) ( ), with = ((μ j, j2 ) j Z,, 2, β σ,, ) (Karabatsos & Walker, 2012). Inference focuses on the posterior predictive density: f n (y r, t) = f (y r, t)dπ(ζ Data). Sharp RD design: causal effect estimate: E n (h{y} r + ) E n (h{y} r ) Fuzzy RD design: causal effect estimate: {E n (h{y} r + ) E n (h{y} r )} / {E n (T r + ) E n (T r )} 11
12 Two Data Applications of the IPMW Model for Causal Inference Both data sets involve Sharp RD designs. Prior parameter specification: b = 5, for 2 ~ U( 0, b ). Same priors for all other model parameters, as before. 40K samples retained from 200K MCMC samples and 2K burn-in. For parameters of interest: -- Trace plots showed good mixing of model parameters % Monte Carlo Confidence intervals half-widths were sufficiently small (near.00), according to the sub-sampling batch method (Flegal & Jones, 2011). 12
13 IPMW Data Application #1 A new teacher education curriculum, CTPP (Chicago Teacher Pipeline Partnership), was implemented at one of the four Chicago schools of education, starting the Fall of Data on n = 347 undergraduate math teaching candidates (90% female), who has just completed a course on how to teach algebra. Pre-CTPP and Post-CTPP data (Fall Spring 2013). Dependent variable: Z-score, learning to teach Math assessment. Covariates: TimeF10 = (Year )/10; CTPP = 1({Year }>0). [ is Fall 2010 cutoff]. [treatment assignment indicator] IPMW Model Results: Standardized residuals ranged from.8 to.8. R-squared =.92. Posterior distribution of and slopes, for CTPP, each concentrate around zero. 13
14 Density p.d.f. (-- 95%) IPMW Data Application #1 CTPP = 0 (Blue) vs. 1 (Red) TimeF10 = Z_posttest The new curriculum, compared to the old curriculum, increased the LMT scores, in terms of shifting the density of LMT scores to the right. This shift corresponds to an increase in the mean (from.17 to.20), the 10%ile (-1.43 to -1.35), the median (.07 to.15), and corresponds to a variance decrease (1.78 to 1.69). 14
15 IPMW Data Application #2 Causal link between basic skills and teaching ability? (Gitomer et al. 2011, J Teacher Education). Data on n = 205 undergraduate teaching candidates, under CTPP. Dependent Variable: Haberman Z-score on urban teaching ability (persistence; organization & planning; values student learning; theory to practice; at risk students; approach to students; survive in bureaucracy; explains teacher success; explains student success; fallibility). Covariates: B240d10 = (min[reading, Language, Math, Write] 240) / 10. BasicPass = 1({B240d10 240} > 0) = 1(Pass Reading Test). IPMW Model Results: Standardized residuals ranged from 1.3 to 1.2. R-squared =.99. For BasicPass, posterior mean (s.d.) estimate of slope is 1.49 (s.d. = 1.54), and posterior mean (s.d.) estimate of the slope is -.04 (.49). 15
16 Density p.d.f. (-- 95%) IPMW Data Application #2 BasicPas = 0 (Blue) vs. 1 (Red) B240d10 = Four clusters of students Z_haberman A detailed inspection revealed that passing the basic skills reading test causally increased the Haberman z-score, in terms of the mean (from.31 to.45), 25%ile (-.65 to -.62), 75%ile (1.30 to 1.43), and 95%ile (2.36 to 2.82). 16
17 Conclusions We proposed a Bayesian nonparametric regression model for RD designs. The model provides a way to estimate the causal effect of a treatment (versus non-treatment), in terms of the treatment s regression discontinuity effect on the entire density of the outcome variable. Through the analyses of real data, we showed how the model can be extended to provide a causal analysis of how a treatment variable impacts the full distribution of the outcomes, including mean, variance, quantiles, p.d.f., and so forth... The model can be easily extended for the analysis of discrete-valued or (left- and/or right- censored) outcomes. Manuscript A Bayesian Nonparametric Causal Model for Regression Discontinuity Designs : User-friendly software has been developed for the model. 17
18 18
19 X 1,,X n r, n, j, 2 k j n j 1 Restricted DP (rdp) Mixture model X r i = (1, r i ) j, 2 k j n j 1 ρ n = (s₁,,s n ) r n k n j 1 Normal x i r i j, 2 j i:s i j n k n n k n n kn! j 1 j j 2 Normal j 0, j 2 C 1 j 2 InverseGamma j 2 a,b 1 nj 1 s r 1 s r n observed pre-treatment covariate (or prognostic/propensity score) are vectors of the assignment variables (i = 1,,n); the k n n distinct values of parameters that are assigned to each of the n subjects, with k n random; random partition of the n observations; 1, 1 2,, n, n2 s i = j if ( i, i2 ) = ( j*, j 2* ) and n j = i 1{( i, i2 ) = ( j*, j 2* )} the permutation of the first n integers that rearranges (r₁,,r n ) in increasing order, as r r 1 r r n with corresponding values x r 1,,x r n and s r 1,,s r n of x and s 1,,s n The rdp has precision and Normal-InvGamma baseline distribution. 19
20 Posterior of Random Partition n x,r k n k n! k n 1 n j j 1 C C R j R j b a a n j /2 a b V j 2 /2 a n j/2 1 S r 1 S r n V j 2 r j r j W j r j r j, W j I j R j C R j R j j 1 R j, and r j R j 0, r i is vector of r i, and R j is matrix of r i = (1, r i ), for subjects in cluster j. Posterior is sampled by a RJ-MCMC algorithm, which either splits or merges a randomly-selected cluster. A Causal Inference Strategy for sharp RD: Identify the subject i = i 0 with observed r i nearest to the cutoff r 0. For each draw of the partition n from its posterior ( n x, r), find the cluster where that subject is located, and then within that cluster, use a two-sample test statistic to compare the outcomes (y i ) for treatment subjects (having r i > r 0 ) and the outcomes for non-treatment subjects (having r i < r 0 ). Average two-sample statistics over a large number of RJ-MCMC draws. 20
21 Statistic Non-Treatment Treatment sample size (3, 190) 6.7 (2, 16) mean.37 (.07, 1.55) 1.23 (.97, 1.59) variance.76 (.01, 1.04).47 (.01, 0.85) interquartile range 1.17 (.18, 1.71).90 (.24, 1.41) skewness.11 (-1.26,.71).03 (.63, 0.82) kurtosis 2.69 (1.45, 3.41) 2.06 (1.00, 3.06) 1%ile 1.34 ( 2.20, 1.47).27 ( 0.65, 1.47) 10%ile.77 ( 1.35, 1.47).42 (.01, 1.47) 25%ile.20 ( 0.65, 1.47).74 (.30, 1.47) 50%ile.34 (.18, 1.47) 1.22 (1.00, 1.59) 75%ile.98 (.53, 1.65) 1.65 (1.47, 2.06) 90%ile 1.43 (1.24, 1.71) 2.14 (1.71, 2.77) 99%ile 2.04 (1.71, 2.42) 2.28 (1.71, 2.89) t-statistic 2.02 ( 4.21,.88) p-value:.19 (.00,.91) F test,variance 4.86 (.02, 34.45) p-value:.65 (.05,.98) Pr Y 1 Y 0 C r (.21,.93) Pr Y 1 Y 0 C r (.04,.67) KS test.28 (.05,.98) Basic skills example (again). rdp: = 1, vague N-IG baseline prior. Posterior mean (95% posterior credible interval). for various test statistics, in comparing treatment outcomes (y i ) vs. non-treatment outcomes, for the cluster of subjects around the cutoff r 0. 21
22 References Bloom, H. (2012). Modern regression discontinuity analysis. Journal of Research on Educational Effectiveness, 5, Cattaneo, M., Frandsen, B., and Titiunik, R. (2013). Randomization Inference in the Regression Discontinuity Design: An Application to the Study of Party Advantages in the U.S. Senate. University of Michigan. February 19th. Unpublished manuscript. Cook, T. (2008). Waiting for life to arrive: A history of the regression discontinuity design in psychology, statistics and economics. Journal of Econometrics, 142, Dawid, A. (2000). Causal inference without counterfactuals. Journal of the American Statistical Association, 95, Dawid, A. (2002). Influence diagrams for causal modelling and inference. International Statistical Review, 70,
23 References (continued) Dawid, A. (2010). Beware of the DAG! Journal of Machine Learning Research-Proceedings Track, 6, Fan, J. and Gijbels, I. (1996). Local Polynomial Modelling and Its Applications. London: Chapman and Hall/CRC. Flegal, J.M., and Jones, G.L. (2011). Implementing Markov chain Monte Carlo: Estimating with confidence. In S.P. Brooks and A.E. Gelman and G.L. Jones and X.L. Meng (Eds.), Handbook of Markov Chain Monte Carlo, pp Boca Raton, FL: CRC Press. Frandsen, B., Frölich, M., and Melly, B. (2012). Quantile treatment effects in the regression discontinuity design. Journal of Econometrics, 168, Gitomer, D.H., Brown, T.L., and Bonett, J. (2011). Useful signal or unnecessary obstacle? The role of basic skills tests in teacher preparation. Journal of Teacher Education, 62,
24 References (continued) Hahn, J., Todd, P., and der Klaauw, W. V. (2001). Identification and estimation of treatment effects with a regression-discontinuity design. Econometrica, 69, Imbens, G. and Kalyanaraman, K. (2012). Optimal bandwidth choice for the regression discontinuity estimator. The Review of Economic Studies, 79, Imbens, G. W. and Lemieux, T. (2008). Regression discontinuity designs: A guide to practice. Journal of Econometrics, 142, Kalli, M., Griffin, J., and S.G. Walker (2010). Slice Sampling Mixture Models. Statistics and Computing, 21, Karabatsos, G. and Walker, S. (2012). Adaptive-modal Bayesian nonparametric regression. Electronic Journal of Statistics, 6, Lee, D. (2008). Randomized experiments from non-random selection in U.S. House elections. Journal of Econometrics, 142,
25 References (continued) Lee, D. and Lemieux, T. (2010). Regression Discontinuity Designs in Economics. The Journal of Economic Literature, 48, Rubin, D.B. (2008). For objective causal inference, design trumps analysis. The Annals of Applied Statistics, 2, Thistlewaite, D. and Campbell, D. (1960). Regression-discontinuity analysis: An alternative to the ex-post facto experiment. Journal of Educational Psychology, 51, Wade, S., Walker, S.G., and Petrone, S. (2013, to appear) A Predictive Study of Dirichlet Process Mixture Models for Curve Fitting. Scandinavian Journal of Statistics. Wong, V., Steiner, P., and Cook, T. (2013). Analyzing Regression- Discontinuity Designs With Multiple Assignment Variables: A Comparative Study of Four Estimation Methods. Journal of Educational and Behavioral Statistics, 38,
Supplemental Appendix to "Alternative Assumptions to Identify LATE in Fuzzy Regression Discontinuity Designs"
Supplemental Appendix to "Alternative Assumptions to Identify LATE in Fuzzy Regression Discontinuity Designs" Yingying Dong University of California Irvine February 2018 Abstract This document provides
More informationBayesian Nonparametric Rasch Modeling: Methods and Software
Bayesian Nonparametric Rasch Modeling: Methods and Software George Karabatsos University of Illinois-Chicago Keynote talk Friday May 2, 2014 (9:15-10am) Ohio River Valley Objective Measurement Seminar
More informationAn Alternative Assumption to Identify LATE in Regression Discontinuity Design
An Alternative Assumption to Identify LATE in Regression Discontinuity Design Yingying Dong University of California Irvine May 2014 Abstract One key assumption Imbens and Angrist (1994) use to identify
More informationWhy high-order polynomials should not be used in regression discontinuity designs
Why high-order polynomials should not be used in regression discontinuity designs Andrew Gelman Guido Imbens 6 Jul 217 Abstract It is common in regression discontinuity analysis to control for third, fourth,
More informationWhy High-Order Polynomials Should Not Be Used in Regression Discontinuity Designs
Why High-Order Polynomials Should Not Be Used in Regression Discontinuity Designs Andrew GELMAN Department of Statistics and Department of Political Science, Columbia University, New York, NY, 10027 (gelman@stat.columbia.edu)
More informationThe Economics of European Regions: Theory, Empirics, and Policy
The Economics of European Regions: Theory, Empirics, and Policy Dipartimento di Economia e Management Davide Fiaschi Angela Parenti 1 1 davide.fiaschi@unipi.it, and aparenti@ec.unipi.it. Fiaschi-Parenti
More informationESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics
ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.
More informationMichael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp
page 1 Lecture 7 The Regression Discontinuity Design fuzzy and sharp page 2 Regression Discontinuity Design () Introduction (1) The design is a quasi-experimental design with the defining characteristic
More informationBayesian Nonparametric Meta-Analysis Model George Karabatsos University of Illinois-Chicago (UIC)
Bayesian Nonparametric Meta-Analysis Model George Karabatsos University of Illinois-Chicago (UIC) Collaborators: Elizabeth Talbott, UIC. Stephen Walker, UT-Austin. August 9, 5, 4:5-4:45pm JSM 5 Meeting,
More informationRegression Discontinuity Designs
Regression Discontinuity Designs Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Regression Discontinuity Design Stat186/Gov2002 Fall 2018 1 / 1 Observational
More informationAn Alternative Assumption to Identify LATE in Regression Discontinuity Designs
An Alternative Assumption to Identify LATE in Regression Discontinuity Designs Yingying Dong University of California Irvine September 2014 Abstract One key assumption Imbens and Angrist (1994) use to
More informationANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS
ANALYTIC COMPARISON of Pearl and Rubin CAUSAL FRAMEWORKS Content Page Part I. General Considerations Chapter 1. What is the question? 16 Introduction 16 1. Randomization 17 1.1 An Example of Randomization
More informationCausal Inference with Big Data Sets
Causal Inference with Big Data Sets Marcelo Coca Perraillon University of Colorado AMC November 2016 1 / 1 Outlone Outline Big data Causal inference in economics and statistics Regression discontinuity
More informationApplied Microeconometrics Chapter 8 Regression Discontinuity (RD)
1 / 26 Applied Microeconometrics Chapter 8 Regression Discontinuity (RD) Romuald Méango and Michele Battisti LMU, SoSe 2016 Overview What is it about? What are its assumptions? What are the main applications?
More informationAddressing Analysis Issues REGRESSION-DISCONTINUITY (RD) DESIGN
Addressing Analysis Issues REGRESSION-DISCONTINUITY (RD) DESIGN Overview Assumptions of RD Causal estimand of interest Discuss common analysis issues In the afternoon, you will have the opportunity to
More informationted: a Stata Command for Testing Stability of Regression Discontinuity Models
ted: a Stata Command for Testing Stability of Regression Discontinuity Models Giovanni Cerulli IRCrES, Research Institute on Sustainable Economic Growth National Research Council of Italy 2016 Stata Conference
More informationBayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang
Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January
More informationRegression Discontinuity
Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 24, 2017 I will describe the basic ideas of RD, but ignore many of the details Good references
More informationRegression Discontinuity Design
Chapter 11 Regression Discontinuity Design 11.1 Introduction The idea in Regression Discontinuity Design (RDD) is to estimate a treatment effect where the treatment is determined by whether as observed
More informationLecture 5: Spatial probit models. James P. LeSage University of Toledo Department of Economics Toledo, OH
Lecture 5: Spatial probit models James P. LeSage University of Toledo Department of Economics Toledo, OH 43606 jlesage@spatial-econometrics.com March 2004 1 A Bayesian spatial probit model with individual
More informationEmpirical Validation of the Critical Thinking Assessment Test: A Bayesian CFA Approach
Empirical Validation of the Critical Thinking Assessment Test: A Bayesian CFA Approach CHI HANG AU & ALLISON AMES, PH.D. 1 Acknowledgement Allison Ames, PhD Jeanne Horst, PhD 2 Overview Features of the
More informationRegression Discontinuity
Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 16, 2018 I will describe the basic ideas of RD, but ignore many of the details Good references
More informationBagging During Markov Chain Monte Carlo for Smoother Predictions
Bagging During Markov Chain Monte Carlo for Smoother Predictions Herbert K. H. Lee University of California, Santa Cruz Abstract: Making good predictions from noisy data is a challenging problem. Methods
More informationBayesian Methods for Testing Axioms of Measurement
Bayesian Methods for Testing Axioms of Measurement George Karabatsos University of Illinois-Chicago University of Minnesota Quantitative/Psychometric Methods Area Department of Psychology April 3, 2015,
More informationRegression Discontinuity Designs.
Regression Discontinuity Designs. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 31/10/2017 I. Brunetti Labour Economics in an European Perspective 31/10/2017 1 / 36 Introduction
More informationOptimal Bandwidth Choice for the Regression Discontinuity Estimator
Optimal Bandwidth Choice for the Regression Discontinuity Estimator Guido Imbens and Karthik Kalyanaraman First Draft: June 8 This Draft: September Abstract We investigate the choice of the bandwidth for
More informationA Workshop on Bayesian Nonparametric Regression Analysis
A Workshop on Bayesian Nonparametric Regression Analysis George Karabatsos University of Illinois-Chicago Methodological Illustration Presentation (90 min.) Modern Modeling Methods (M 3 ) Conference University
More informationRegression Discontinuity Designs in Stata
Regression Discontinuity Designs in Stata Matias D. Cattaneo University of Michigan July 30, 2015 Overview Main goal: learn about treatment effect of policy or intervention. If treatment randomization
More informationSection 7: Local linear regression (loess) and regression discontinuity designs
Section 7: Local linear regression (loess) and regression discontinuity designs Yotam Shem-Tov Fall 2015 Yotam Shem-Tov STAT 239/ PS 236A October 26, 2015 1 / 57 Motivation We will focus on local linear
More informationQuantile POD for Hit-Miss Data
Quantile POD for Hit-Miss Data Yew-Meng Koh a and William Q. Meeker a a Center for Nondestructive Evaluation, Department of Statistics, Iowa State niversity, Ames, Iowa 50010 Abstract. Probability of detection
More informationPrerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3
University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.
More informationMarkov Chain Monte Carlo methods
Markov Chain Monte Carlo methods By Oleg Makhnin 1 Introduction a b c M = d e f g h i 0 f(x)dx 1.1 Motivation 1.1.1 Just here Supresses numbering 1.1.2 After this 1.2 Literature 2 Method 2.1 New math As
More informationTruncated Regression Model and Nonparametric Estimation for Gifted and Talented Education Program
Global Journal of Pure and Applied Mathematics. ISSN 0973-768 Volume 2, Number (206), pp. 995-002 Research India Publications http://www.ripublication.com Truncated Regression Model and Nonparametric Estimation
More informationNonparametric Bayesian Methods (Gaussian Processes)
[70240413 Statistical Machine Learning, Spring, 2015] Nonparametric Bayesian Methods (Gaussian Processes) Jun Zhu dcszj@mail.tsinghua.edu.cn http://bigml.cs.tsinghua.edu.cn/~jun State Key Lab of Intelligent
More informationOptimal bandwidth selection for the fuzzy regression discontinuity estimator
Optimal bandwidth selection for the fuzzy regression discontinuity estimator Yoichi Arai Hidehiko Ichimura The Institute for Fiscal Studies Department of Economics, UCL cemmap working paper CWP49/5 Optimal
More informationRegression Discontinuity Design on Model Schools Value-Added Effects: Empirical Evidence from Rural Beijing
Regression Discontinuity Design on Model Schools Value-Added Effects: Empirical Evidence from Rural Beijing Kai Hong CentER Graduate School, Tilburg University April 2010 Abstract In this study we examine
More informationPart 8: GLMs and Hierarchical LMs and GLMs
Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course
More informationPractical Bayesian Quantile Regression. Keming Yu University of Plymouth, UK
Practical Bayesian Quantile Regression Keming Yu University of Plymouth, UK (kyu@plymouth.ac.uk) A brief summary of some recent work of us (Keming Yu, Rana Moyeed and Julian Stander). Summary We develops
More informationDisk Diffusion Breakpoint Determination Using a Bayesian Nonparametric Variation of the Errors-in-Variables Model
1 / 23 Disk Diffusion Breakpoint Determination Using a Bayesian Nonparametric Variation of the Errors-in-Variables Model Glen DePalma gdepalma@purdue.edu Bruce A. Craig bacraig@purdue.edu Eastern North
More informationBayesian Modeling of Conditional Distributions
Bayesian Modeling of Conditional Distributions John Geweke University of Iowa Indiana University Department of Economics February 27, 2007 Outline Motivation Model description Methods of inference Earnings
More informationTobit and Interval Censored Regression Model
Global Journal of Pure and Applied Mathematics. ISSN 0973-768 Volume 2, Number (206), pp. 98-994 Research India Publications http://www.ripublication.com Tobit and Interval Censored Regression Model Raidani
More informationFlexible Estimation of Treatment Effect Parameters
Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both
More informationECO 2403 TOPICS IN ECONOMETRICS
ECO 2403 TOPICS IN ECONOMETRICS Department of Economics. University of Toronto Winter 2019 Instructors: Victor Aguirregabiria Phone: 416-978-4358 Office: 150 St. George Street, Room 309 E-mail: victor.aguirregabiria@utoronto.ca
More informationEconometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction
Econometrics I Professor William Greene Stern School of Business Department of Economics 1-1/40 http://people.stern.nyu.edu/wgreene/econometrics/econometrics.htm 1-2/40 Overview: This is an intermediate
More informationRonald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California
Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University
More informationMatching Techniques. Technical Session VI. Manila, December Jed Friedman. Spanish Impact Evaluation. Fund. Region
Impact Evaluation Technical Session VI Matching Techniques Jed Friedman Manila, December 2008 Human Development Network East Asia and the Pacific Region Spanish Impact Evaluation Fund The case of random
More informationEmpirical approaches in public economics
Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental
More informationStatistical Inference for Stochastic Epidemic Models
Statistical Inference for Stochastic Epidemic Models George Streftaris 1 and Gavin J. Gibson 1 1 Department of Actuarial Mathematics & Statistics, Heriot-Watt University, Riccarton, Edinburgh EH14 4AS,
More informationModeling Mediation: Causes, Markers, and Mechanisms
Modeling Mediation: Causes, Markers, and Mechanisms Stephen W. Raudenbush University of Chicago Address at the Society for Resesarch on Educational Effectiveness,Washington, DC, March 3, 2011. Many thanks
More informationRobust Monte Carlo Methods for Sequential Planning and Decision Making
Robust Monte Carlo Methods for Sequential Planning and Decision Making Sue Zheng, Jason Pacheco, & John Fisher Sensing, Learning, & Inference Group Computer Science & Artificial Intelligence Laboratory
More informationFinding Instrumental Variables: Identification Strategies. Amine Ouazad Ass. Professor of Economics
Finding Instrumental Variables: Identification Strategies Amine Ouazad Ass. Professor of Economics Outline 1. Before/After 2. Difference-in-difference estimation 3. Regression Discontinuity Design BEFORE/AFTER
More information12E016. Econometric Methods II 6 ECTS. Overview and Objectives
Overview and Objectives This course builds on and further extends the econometric and statistical content studied in the first quarter, with a special focus on techniques relevant to the specific field
More informationGaussian kernel GARCH models
Gaussian kernel GARCH models Xibin (Bill) Zhang and Maxwell L. King Department of Econometrics and Business Statistics Faculty of Business and Economics 7 June 2013 Motivation A regression model is often
More informationIntroduction to Statistical Analysis
Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive
More informationPOSTERIOR ANALYSIS OF THE MULTIPLICATIVE HETEROSCEDASTICITY MODEL
COMMUN. STATIST. THEORY METH., 30(5), 855 874 (2001) POSTERIOR ANALYSIS OF THE MULTIPLICATIVE HETEROSCEDASTICITY MODEL Hisashi Tanizaki and Xingyuan Zhang Faculty of Economics, Kobe University, Kobe 657-8501,
More informationBayesian inference for factor scores
Bayesian inference for factor scores Murray Aitkin and Irit Aitkin School of Mathematics and Statistics University of Newcastle UK October, 3 Abstract Bayesian inference for the parameters of the factor
More informationThe propensity score with continuous treatments
7 The propensity score with continuous treatments Keisuke Hirano and Guido W. Imbens 1 7.1 Introduction Much of the work on propensity score analysis has focused on the case in which the treatment is binary.
More informationBayesian Semiparametric GARCH Models
Bayesian Semiparametric GARCH Models Xibin (Bill) Zhang and Maxwell L. King Department of Econometrics and Business Statistics Faculty of Business and Economics xibin.zhang@monash.edu Quantitative Methods
More informationBayesian Methods for Machine Learning
Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),
More informationWeb Appendix for The Dynamics of Reciprocity, Accountability, and Credibility
Web Appendix for The Dynamics of Reciprocity, Accountability, and Credibility Patrick T. Brandt School of Economic, Political and Policy Sciences University of Texas at Dallas E-mail: pbrandt@utdallas.edu
More informationBayesian Semiparametric GARCH Models
Bayesian Semiparametric GARCH Models Xibin (Bill) Zhang and Maxwell L. King Department of Econometrics and Business Statistics Faculty of Business and Economics xibin.zhang@monash.edu Quantitative Methods
More informationSubject CS1 Actuarial Statistics 1 Core Principles
Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and
More informationInfinite-State Markov-switching for Dynamic. Volatility Models : Web Appendix
Infinite-State Markov-switching for Dynamic Volatility Models : Web Appendix Arnaud Dufays 1 Centre de Recherche en Economie et Statistique March 19, 2014 1 Comparison of the two MS-GARCH approximations
More informationBayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence
Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns
More informationLecture 10 Regression Discontinuity (and Kink) Design
Lecture 10 Regression Discontinuity (and Kink) Design Economics 2123 George Washington University Instructor: Prof. Ben Williams Introduction Estimation in RDD Identification RDD implementation RDD example
More informationPIRLS 2016 Achievement Scaling Methodology 1
CHAPTER 11 PIRLS 2016 Achievement Scaling Methodology 1 The PIRLS approach to scaling the achievement data, based on item response theory (IRT) scaling with marginal estimation, was developed originally
More informationVCMC: Variational Consensus Monte Carlo
VCMC: Variational Consensus Monte Carlo Maxim Rabinovich, Elaine Angelino, Michael I. Jordan Berkeley Vision and Learning Center September 22, 2015 probabilistic models! sky fog bridge water grass object
More informationMixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina
Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals John W. Mac McDonald & Alessandro Rosina Quantitative Methods in the Social Sciences Seminar -
More informationRegression Discontinuity Design Econometric Issues
Regression Discontinuity Design Econometric Issues Brian P. McCall University of Michigan Texas Schools Project, University of Texas, Dallas November 20, 2009 1 Regression Discontinuity Design Introduction
More informationOnline Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha
Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha January 18, 2010 A2 This appendix has six parts: 1. Proof that ab = c d
More informationIdentifying the Effect of Changing the Policy Threshold in Regression Discontinuity Models
Identifying the Effect of Changing the Policy Threshold in Regression Discontinuity Models Yingying Dong and Arthur Lewbel University of California Irvine and Boston College First version July 2010, revised
More informationAnalysis of propensity score approaches in difference-in-differences designs
Author: Diego A. Luna Bazaldua Institution: Lynch School of Education, Boston College Contact email: diego.lunabazaldua@bc.edu Conference section: Research methods Analysis of propensity score approaches
More informationUnivariate Normal Distribution; GLM with the Univariate Normal; Least Squares Estimation
Univariate Normal Distribution; GLM with the Univariate Normal; Least Squares Estimation PRE 905: Multivariate Analysis Spring 2014 Lecture 4 Today s Class The building blocks: The basics of mathematical
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 7 Approximate
More informationA Nonparametric Bayesian Methodology for Regression Discontinuity Designs
A Nonparametric Bayesian Methodology for Regression Discontinuity Designs arxiv:1704.04858v4 [stat.me] 14 Mar 2018 Zach Branson Department of Statistics, Harvard University Maxime Rischard Department of
More informationOnline Appendix to: Crises and Recoveries in an Empirical Model of. Consumption Disasters
Online Appendix to: Crises and Recoveries in an Empirical Model of Consumption Disasters Emi Nakamura Columbia University Robert Barro Harvard University Jón Steinsson Columbia University José Ursúa Harvard
More informationCAEPR Working Paper # Identifying Multiple Marginal Effects with a Single Binary Instrument or by Regression Discontinuity
CAEPR Working Paper #205-009 Identifying Multiple Marginal Effects with a Single Binary Instrument or by Regression Discontinuity Carolina Caetano University of Rochester Juan Carlos Escanciano Indiana
More informationA Bayesian Nonparametric Model for Predicting Disease Status Using Longitudinal Profiles
A Bayesian Nonparametric Model for Predicting Disease Status Using Longitudinal Profiles Jeremy Gaskins Department of Bioinformatics & Biostatistics University of Louisville Joint work with Claudio Fuentes
More informationMaster of Science in Statistics A Proposal
1 Master of Science in Statistics A Proposal Rationale of the Program In order to cope up with the emerging complexity on the solutions of realistic problems involving several phenomena of nature it is
More informationProbability and statistics; Rehearsal for pattern recognition
Probability and statistics; Rehearsal for pattern recognition Václav Hlaváč Czech Technical University in Prague Czech Institute of Informatics, Robotics and Cybernetics 166 36 Prague 6, Jugoslávských
More informationSTAT 518 Intro Student Presentation
STAT 518 Intro Student Presentation Wen Wei Loh April 11, 2013 Title of paper Radford M. Neal [1999] Bayesian Statistics, 6: 475-501, 1999 What the paper is about Regression and Classification Flexible
More informationThe STS Surgeon Composite Technical Appendix
The STS Surgeon Composite Technical Appendix Overview Surgeon-specific risk-adjusted operative operative mortality and major complication rates were estimated using a bivariate random-effects logistic
More informationReview: Probabilistic Matrix Factorization. Probabilistic Matrix Factorization (PMF)
Case Study 4: Collaborative Filtering Review: Probabilistic Matrix Factorization Machine Learning for Big Data CSE547/STAT548, University of Washington Emily Fox February 2 th, 214 Emily Fox 214 1 Probabilistic
More informationOctober 11, Keywords: Regression Discontinuity Design, Permutation Test, Induced Order Statistics, R.
RATest: An R package for Randomization Tests with an application to testing the continuity of the baseline covariates in RDD using Approximate Permutation Tests Mauricio Olivares-González Department of
More informationRegression Discontinuity
Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 9, 2016 I will describe the basic ideas of RD, but ignore many of the details Good references
More informationPrinciples of Bayesian Inference
Principles of Bayesian Inference Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry & Department
More informationFlexible Regression Modeling using Bayesian Nonparametric Mixtures
Flexible Regression Modeling using Bayesian Nonparametric Mixtures Athanasios Kottas Department of Applied Mathematics and Statistics University of California, Santa Cruz Department of Statistics Brigham
More informationOptimal Bandwidth Choice for the Regression Discontinuity Estimator
Optimal Bandwidth Choice for the Regression Discontinuity Estimator Guido Imbens and Karthik Kalyanaraman First Draft: June 8 This Draft: February 9 Abstract We investigate the problem of optimal choice
More informationRegression Discontinuity Designs Using Covariates
Regression Discontinuity Designs Using Covariates Sebastian Calonico Matias D. Cattaneo Max H. Farrell Rocío Titiunik May 25, 2018 We thank the co-editor, Bryan Graham, and three reviewers for comments.
More informationQuiz 1. Name: Instructions: Closed book, notes, and no electronic devices.
Quiz 1. Name: Instructions: Closed book, notes, and no electronic devices. 1. What is the difference between a deterministic model and a probabilistic model? (Two or three sentences only). 2. What is the
More informationSupplementary material to: Tolerating deance? Local average treatment eects without monotonicity.
Supplementary material to: Tolerating deance? Local average treatment eects without monotonicity. Clément de Chaisemartin September 1, 2016 Abstract This paper gathers the supplementary material to de
More informationIndependent and conditionally independent counterfactual distributions
Independent and conditionally independent counterfactual distributions Marcin Wolski European Investment Bank M.Wolski@eib.org Society for Nonlinear Dynamics and Econometrics Tokyo March 19, 2018 Views
More informationThe Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest
The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest Edward Vytlacil, Yale University Renmin University, Department
More informationMarginal Specifications and a Gaussian Copula Estimation
Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required
More informationINVERTED KUMARASWAMY DISTRIBUTION: PROPERTIES AND ESTIMATION
Pak. J. Statist. 2017 Vol. 33(1), 37-61 INVERTED KUMARASWAMY DISTRIBUTION: PROPERTIES AND ESTIMATION A. M. Abd AL-Fattah, A.A. EL-Helbawy G.R. AL-Dayian Statistics Department, Faculty of Commerce, AL-Azhar
More informationMeasurement Error and Linear Regression of Astronomical Data. Brandon Kelly Penn State Summer School in Astrostatistics, June 2007
Measurement Error and Linear Regression of Astronomical Data Brandon Kelly Penn State Summer School in Astrostatistics, June 2007 Classical Regression Model Collect n data points, denote i th pair as (η
More informationStat 451 Lecture Notes Markov Chain Monte Carlo. Ryan Martin UIC
Stat 451 Lecture Notes 07 12 Markov Chain Monte Carlo Ryan Martin UIC www.math.uic.edu/~rgmartin 1 Based on Chapters 8 9 in Givens & Hoeting, Chapters 25 27 in Lange 2 Updated: April 4, 2016 1 / 42 Outline
More informationEconometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous
Econometrics of causal inference Throughout, we consider the simplest case of a linear outcome equation, and homogeneous effects: y = βx + ɛ (1) where y is some outcome, x is an explanatory variable, and
More informationAn Alternative Infinite Mixture Of Gaussian Process Experts
An Alternative Infinite Mixture Of Gaussian Process Experts Edward Meeds and Simon Osindero Department of Computer Science University of Toronto Toronto, M5S 3G4 {ewm,osindero}@cs.toronto.edu Abstract
More informationModeling conditional distributions with mixture models: Theory and Inference
Modeling conditional distributions with mixture models: Theory and Inference John Geweke University of Iowa, USA Journal of Applied Econometrics Invited Lecture Università di Venezia Italia June 2, 2005
More information