Exam ECON5106/9106 Fall 2018

Similar documents
ECO Class 6 Nonparametric Econometrics

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

Introduction to Causal Inference. Solutions to Quiz 4

EMERGING MARKETS - Lecture 2: Methodology refresher

150C Causal Inference

Chapter 11. Regression with a Binary Dependent Variable

Linear Regression with Multiple Regressors

Applied Statistics and Econometrics

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

Problem Set 1 ANSWERS

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals

Econometrics I KS. Module 1: Bivariate Linear Regression. Alexander Ahammer. This version: March 12, 2018

ECON Introductory Econometrics. Lecture 16: Instrumental variables

Final Exam. Economics 835: Econometrics. Fall 2010

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Recitation Notes 5. Konrad Menzel. October 13, 2006

ECON3150/4150 Spring 2016

Mostly Dangerous Econometrics: How to do Model Selection with Inference in Mind

Lecture 4: Multivariate Regression, Part 2

Applied Econometrics Lecture 1

Lecture#12. Instrumental variables regression Causal parameters III

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Regression Discontinuity Designs.

Lecture 10: Introduction to Logistic Regression

Statistical Inference with Regression Analysis

Applied Statistics and Econometrics

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD)

PSC 504: Instrumental Variables

A Course in Applied Econometrics. Lecture 5. Instrumental Variables with Treatment Effect. Heterogeneity: Local Average Treatment Effects.

Essential of Simple regression

Chapter 6: Linear Regression With Multiple Regressors

The problem of causality in microeconometrics.

Empirical Application of Simple Regression (Chapter 2)

Lecture 4: Multivariate Regression, Part 2

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler

Section I. Define or explain the following terms (3 points each) 1. centered vs. uncentered 2 R - 2. Frisch theorem -

Impact Evaluation Technical Workshop:

Empirical approaches in public economics

ECO375 Tutorial 9 2SLS Applications and Endogeneity Tests

S o c i o l o g y E x a m 2 A n s w e r K e y - D R A F T M a r c h 2 7,

Homework Solutions Applied Logistic Regression

ECON3150/4150 Spring 2016

ECON Introductory Econometrics. Lecture 17: Experiments

Addressing Analysis Issues REGRESSION-DISCONTINUITY (RD) DESIGN

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

THE MULTIVARIATE LINEAR REGRESSION MODEL

Binary Dependent Variable. Regression with a

A Framework for Eliciting, Incorporating and Disciplining Identification Beliefs in Linear Models

Multivariate Regression: Part I

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Binary Dependent Variables

AGEC 661 Note Fourteen

Introduction to Econometrics. Review of Probability & Statistics

2.1. Consider the following production function, known in the literature as the transcendental production function (TPF).

Exploring Marginal Treatment Effects

Outline for Today. Review of In-class Exercise Bivariate Hypothesis Test 2: Difference of Means Bivariate Hypothesis Testing 3: Correla

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

Ec1123 Section 7 Instrumental Variables

At this point, if you ve done everything correctly, you should have data that looks something like:

ECON3150/4150 Spring 2015

Hypothesis Tests and Confidence Intervals in Multiple Regression

ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics August 2013

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois

On the Use of Linear Fixed Effects Regression Models for Causal Inference

ECON Introductory Econometrics. Lecture 7: OLS with Multiple Regressors Hypotheses tests

Econometrics. 8) Instrumental variables

Sensitivity checks for the local average treatment effect

Recitation Notes 6. Konrad Menzel. October 22, 2006

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS

Ecmt 675: Econometrics I

Econometrics in a nutshell: Variation and Identification Linear Regression Model in STATA. Research Methods. Carlos Noton.

Chapter 7. Hypothesis Tests and Confidence Intervals in Multiple Regression

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

1 Linear Regression Analysis The Mincer Wage Equation Data Econometric Model Estimation... 11

Instrumental Variables

ECON 836 Midterm 2016

Lecture 8: Instrumental Variables Estimation

An Alternative Assumption to Identify LATE in Regression Discontinuity Design

Nonrecursive Models Highlights Richard Williams, University of Notre Dame, Last revised April 6, 2015

An overview of applied econometrics

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

The problem of causality in microeconometrics.

Methods to Estimate Causal Effects Theory and Applications. Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA

SOCY5601 Handout 8, Fall DETECTING CURVILINEARITY (continued) CONDITIONAL EFFECTS PLOTS

General Linear Model (Chapter 4)

Week 3: Simple Linear Regression

ECO375 Tutorial 8 Instrumental Variables

ECO220Y Simple Regression: Testing the Slope

8. Nonstandard standard error issues 8.1. The bias of robust standard errors

Econometric Methods and Applications II Chapter 2: Simultaneous equations. Econometric Methods and Applications II, Chapter 2, Slide 1

Sociology 63993, Exam 2 Answer Key [DRAFT] March 27, 2015 Richard Williams, University of Notre Dame,

Graduate Econometrics Lecture 4: Heteroskedasticity

22s:152 Applied Linear Regression. Example: Study on lead levels in children. Ch. 14 (sec. 1) and Ch. 15 (sec. 1 & 4): Logistic Regression

Varieties of Count Data

Quantitative Economics for the Evaluation of the European Policy

Lecture 7: OLS with qualitative information

Transcription:

Exam ECO506/906 Fall 208. Suppose you observe (y i,x i ) for i,2,, and you assume f (y i x i ;α,β) γ i exp( γ i y i ) where γ i exp(α + βx i ). ote that in this case, the conditional mean of E(y i X x i ) γ i exp(α+βx i ). (a) Write down the log-likelihood function and derive the score function. Explain how can you obtain the MLE. logl i i i log f (y i x i ;α,β) [logγ i + log[exp( γ i y i )]] [α + βx i y i exp(α + βx i )] logl α ( y i exp(α + βx i )) 0 i logl β i (x i x i y i exp(α + βx i )) 0 Setting the first order conditions equal to zero and solving them will give us the MLE. The analytic solution is not available. (b) Can you suggest a GMM estimator for (α,β)? You are supposed to write down the moment conditions and the minimization problem. ote that E[y x] exp(α+βx i ), so we have E[y exp(α + βx i ) ] E XE[y exp[α + βx i ] x] E X[0] 0 E[x(y exp(α + βx i ) )] E XE[x(y exp[α + βx i ] ) x] E X[x 0] 0 Let g(α,β) ( i (y i exp(α+βx i ) ) i x i (y i exp(α+βx i ) ) ) be the sample moment condition. Just identified: the GMM estimator is the unique solution to g(α,β) 0. Or equivalently, the GMM estimator minimizes Q(α,β) g (α,β)g(α,β) (c) Which will you prefer, MLE or GMM estimator, explain briefly why.

Both are consistent. But MLE is more efficient. Cramer-Rao lower bound. (Smallest possible variance). (d) We are interested in the parameter δ α exp(β). Give a consistent estimator of δ based on the MLE estimators ( ˆα ML, ˆβ ML ). Outline how you can construct the standard error for your estimator ˆδ using the bootstrapping. ˆδ ˆα ML exp( ˆβ ML ) i. Take an r-th random sample with replacement from the data ii. Calculate the statistic ˆδ ( r) using MLE on the r-th random sample iii. Repeat steps i. and ii many (R) times The bootstrap estimate of the standard error of ˆδ is ŝe ( ˆδ) R ( R ˆδ ( r) ˆδ ) 2 ( ) r 2. In this problem, you are to investigate the effect of social institutions (having strong modern property rights) on the economic development (GDP per capita) of a country. The data include the following variables for any country i that was previously colonized: Instrument M i {0,}: Settler mortality in the 7th, 8th, and early 9th centuries, 0 if high mortality, if low. Treatment I i {0,}: Modern property rights institutions, 0 if weak, if strong. Outcome Y i : Modern log GDP per capita. In the following, you will estimate the effect of I i on Y i by instrumenting for I i with M i. You will have to use the attached Stata output when answering the questions. (a) ame the assumptions underpinning instrumental variables as a strategy for identifying the effect of an endogenous treatment on the outcome for compliers. Write out each assumption formally in terms of M i, I i, and Y i. In your own words, interpret each assumption with regard to the specific setup above. Finally, discuss the plausibility of each assumption. This is taken from one of the most famous social science articles using instrumental variables, Acemoglu, Johnson, & Robinson s 200 paper The Colonial Origins of Comparative Development: An Empirical Investigation American Economic Review, 9 (5): 369-40. Here, we present only a stylized characterization of the study. 2

ASWER HIT: (a) Random assignment: Y i (i,m);d i (m) M i. That is, instrument is independent to the potential outcome and potential treatment. In this case, knowing the value of M i does not tell us anything about the potential outcomes for log GDP per capita Y i, nor the potential treatment status for property rights strength, I i. Is this plausible? First, note that we typically cannot assess this directly with data. Theoretically, this assumptions seems kind of unlikely. Settler mortality is not assigned randomly, nor in a quasi-random way: it is a function, of, among many other things, the disease environment, the latitude/longitude of an area, natural geography, agricultural feasibility, water access, etc. Many of these factors could simultaneously correlate with both property rights institutions (treatment I i ) or log GDP per capita (outcome Y i ). (b) Exclusion restriction: Y i (I i,) Y i (I i,0) for I i (,0). The exclusion restriction in this case requires that the potential outcomes for log GDP per capita, when property rights institutions are weak (strong), are equal irrespective of the level of settler mortality. That is, there is no effect of settler mortality on log GDP per capita other than through the effect of M i on I i. Theoretically, we can probably imagine a number of potential ways in which M i could affect Y i other than through levels of I i. For instance, higher levels of settler mortality might have induced more conflict over resources, lowering log GDP per capita in the future. (c) First-stage relevance: E(I i () I i (0)] 0. First-stage relevance in this case means that higher levels of settler mortality correlates with (causes?) weaker property rights institutions. The plausibility of this assumption can be tested in data, and seems probable in this case. (d) Monotonicity: I i () I i (0) for all i. This assumption implies that reduce the settler potential mortality will not reduce property rights protection. This assumption does seem reasonably plausible. It s hard to think of a (plausible) reason why some colonies would be defiers and have the opposite behavior. (b) The attached Stata output reports the OLS estimate of the effect of I i on Y i. Interpret the direction and statistical significance of the estimates. Why should you be concerned about whether these are good estimates of the causal quantity of interest. ASWER HIT: Using OLS, we find a positive (and significant) association between average property rights protections and log GDP per capita, significant at 95% LOS. Of course, we should be cautious about interpreting these findings as causal. It seems very likely that there would be omitted variables that simultaneously affect both levels of property rights protection and levels of log GDP per capita. Should also give magnitude of the effect (c) The attached Stata output reports the OLS estimate of the effect of Z i on Y i. Interpret the direction and statistical significance of the estimate. What does this estimator purport to estimate? Under what conditions can we interpret this 3

result as causal? Is that what you want to study? ASWER HIT: Our results show that higher levels of settler mortality are associated with lower GDP per capita. This can be seen as an attempt to estimate the intent-to-treat" effect of settler mortality on log GDP per capita. ot very interesting. It would be nice to reference to assumption A. Should also give magnitude of the effect. (d) The attached Stata output reports the results from the 2SLS approach to estimate the effect of I i on Y i, with M i used as the instrument for I i. Interpret the results. Is M i a weak instrument? ASWER HIT: LAT E, average causal effect of compliers. Stronger property rights protection have a positive effect on levels of log GDP per capita, for compliers. Should also give magnitude of the effect. F t t 3.87 < 0 possibly a weak instrument. (e) Compute the fraction of always takers, never takers and compliers,and the average potential outcome when treated for always-takers and compliers. Based on your results, what can you say about the external validity of your estimator? ASWER HIT: p a Pr(I M 0) 0.52 p n Pr(I 0 M ) 0.24 p c p a p n 0.24 E[y i () Complier].8025 E[y i () AlwaysTaker] 7.558485 Rather different mean potential outcome for different group. May suggest strong heterogeneity. 3. Consider a model in which Assume that Y i β 0 + β T i + µ i + ε i () ε i has mean 0 and is independent of everything else in the model µ i is 0 with probability 0.5 and with probability 0.5. There is a further variable X i which is X i µ i + υ i (2) where ν i is independent of everything else in the model and follows a standard normal distribution. 4

T i is binary and determined according to the following threshold rule When X i < x and µ i 0 then Pr(T i ) 0 When X i x and µ i 0 then Pr(T i ) p 0 > 0 When X i < x and µ i then Pr(T i ) 0 When X i x and µ i then Pr(T i ) p > 0 The econometrician observes X i, T i, and Y i but not the other variables. (a) Suppose you run an OLS regression of Y i on T i. Would ˆ β be a consistent estimator of β? Explain briefly why. ASWER HIT: ot consistent. Because cov(t i, µ i + ε i ) 0. Endogeneity bias. (b) Explain how Regression Discontinuity Design can be used to get a consistent estimate of β. List the necessary assumptions and sketch your estimation procedure. Fuzzy regression discontinuity: Assumption:. that the probability being treated Pr(T i X i x) is not continuous at x 2. The conditional mean of potential outcome E[y i (t) x] is continuous at x. Both assumption satisfied, so we can use a Fuzzy regression discontinuity design. lim E(Y i X i x) lime(y i X i x) βˆ x x x x lim E(T i X i x) lime(t i X i x) x x x x To estimate this, one can either use the Local IV method (2SLS) or the local linear regressions. The first is preferred since the functional forms of the potential outcomes is known. (c) ow suppose the type people (i.e. the people for whom µ i ) have control over their X i. In particular they can change their υ i to υ i + δ at some cost. Assume that type people with X i < x < X i + δ pay this cost and change their X i. How does this affect the estimates from your estimation method above? ASWER HIT: We can no longer use the FRD design: the estimate is no longer consistent. Some people can perfectly manipulate the running variable. The Local randomization assumption of RDD fails. To see this, to the left of the cutoff (x δ, x ), there is only type 0 individuals, while to the right of cutoff, they are both types. It is clear that the ones near the threshold do not represent valid counterfactuals. Mathematically speaking, E(u i X i x) is no longer a continuous function at x so the key assumptions that the conditional mean of potential outcome is a continuous function of x fails. (d) ow suppose people cannot manipulate X i exactly but let X i µ i + Z i + ω i where people can manipulate Z i but not ω i where ω i is independent of every- 5

thing else and normal. How does this affect the estimates from estimation method you suggested in (b)? ASWER HIT: The FRD will still be valid when the agent cannot perfectly control the running variable. 6

Stata Output. su y m i Variable Obs Mean Std. Dev. Min Max - - - - - - - - - - - - -+ - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - y 64 8.062237.043359 6.09248 0.2574 m 64.57825.4977629 0 i 64.65625.478736 0. tab m i i m 0 Total 0 3 4 27 9 28 37 Total 22 42 64. reg y i, robust noheader y Coef. Std. Err. t P > t [95% Conf. Interval ] i.30448.24226 5.28 0.000.702268.55868 _cons 7.32038.50532 48.46 0.000 7.08429 7.62233. reg y m, robust noheader y Coef. Std. Err. t P > t [95% Conf. Interval ] m.234065.204262 6.05 0.000 -.64208 -.826023 _cons 7.348793.29828 56.60 0.000 7.08927 7.60835. reg i m, robust noheader i Coef. Std. Err. t P > t [95% Conf. Interval ] m.2382382.2625.97 0.054 -.4804386.003962 _cons.58585.0976977 5.3 0.000.3232237.73834.. ivregress 2 sls y ( i m ), noheader robust y Coef. Std. Err. z P > z [95% Conf. Interval ] i 5.79964 2.350875 2.20 0.028.5723343 9.787593 _cons 4.662886.63648 2.85 0.004.455444 7.870327 Instrumented : i Instruments : m. gen yi y * i. ivregress 2 sls yi ( i m ), noheader robust yi Coef. Std. Err. z P > z [95% Conf. Interval ] i.8025 2.0066 5.87 0.000 7.869432 5.7507 _cons -2.204622.42848 -.55 0.2-4.99394.582495 Instrumented : i Instruments : m. tabulate m i, summarize ( y ) means Means of y i m 0 Total 0 7.229743 7.558485 7.3487928 7.605529 8.897002 8.5828583 Total 7.320380 8.4508286 8.0622369 7