Quantitative Empirical Methods Exam

Similar documents
EXAMINATION: QUANTITATIVE EMPIRICAL METHODS. Yale University. Department of Political Science

Lawrence D. Brown* and Daniel McCarthy*

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Department of Statistical Science FIRST YEAR EXAM - SPRING 2017

Understanding Ding s Apparent Paradox

Comprehensive Examination Quantitative Methods Spring, 2018

EMERGING MARKETS - Lecture 2: Methodology refresher

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).

What s New in Econometrics. Lecture 1

AP Statistics Semester I Examination Section I Questions 1-30 Spend approximately 60 minutes on this part of the exam.

review session gov 2000 gov 2000 () review session 1 / 38

Lecture 1: OLS derivations and inference

Linear Models in Econometrics

Test Yourself! Methodological and Statistical Requirements for M.Sc. Early Childhood Research

Final Exam. Economics 835: Econometrics. Fall 2010

Lecture 14 Simple Linear Regression

Topic 12 Overview of Estimation

Statistical Inference with Regression Analysis

Research Note: A more powerful test statistic for reasoning about interference between units

ECON 4160, Autumn term Lecture 1

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered)

Problems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons

Machine Learning, Fall 2009: Midterm

First Year Examination Department of Statistics, University of Florida

2.1 Linear regression with matrices

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions

IEOR E4703: Monte-Carlo Simulation

The Slow Convergence of OLS Estimators of α, β and Portfolio. β and Portfolio Weights under Long Memory Stochastic Volatility

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

The exact bootstrap method shown on the example of the mean and variance estimation

Parameter Estimation, Sampling Distributions & Hypothesis Testing

MS&E 226: Small Data

ˆπ(x) = exp(ˆα + ˆβ T x) 1 + exp(ˆα + ˆβ T.

Testing Linear Restrictions: cont.

Bayesian inference for sample surveys. Roderick Little Module 2: Bayesian models for simple random samples

Review of Econometrics

ECO375 Tutorial 4 Introduction to Statistical Inference

exp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1

Linear Regression with one Regressor

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

2013 ~ 2014 AP CALCULUS AB SYLLABUS

Eco517 Fall 2014 C. Sims FINAL EXAM

A Simple, Graphical Procedure for Comparing Multiple Treatment Effects

where x and ȳ are the sample means of x 1,, x n

Gibbs Sampling in Endogenous Variables Models

CS446: Machine Learning Spring Problem Set 4

Econometric Analysis of Panel Data. Final Examination: Spring 2013

Topic 16 Interval Estimation

Problem Selected Scores

6A Lab Post-test. Instructions for printed version. Week 10, Fall 2016

A Practitioner s Guide to Cluster-Robust Inference

Statistical Tests. Matthieu de Lapparent

36. Multisample U-statistics and jointly distributed U-statistics Lehmann 6.1

Applied Statistics Preliminary Examination Theory of Linear Models August 2017

Fall 2013 Northwestern University STAT Section 23 Introduction to Statistics. Final Exam. Name:

ORF 363/COS 323 Final Exam, Fall 2016

An Introduction to Econometrics. A Self-contained Approach. Frank Westhoff. The MIT Press Cambridge, Massachusetts London, England

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

ECONOMET RICS P RELIM EXAM August 19, 2014 Department of Economics, Michigan State University

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

δ -method and M-estimation

8. Instrumental variables regression

ECON Program Evaluation, Binary Dependent Variable, Misc.

Applied Statistics and Econometrics

interval forecasting

PART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable,

Chemical Engineering: 4C3/6C3 Statistics for Engineering McMaster University: Final examination

Econ 583 Final Exam Fall 2008

Spring 1998 Final exam (35%)

Multiple Regression Analysis: The Problem of Inference

2007 ~ 2008 AP CALCULUS AB SYLLABUS

ST430 Exam 1 with Answers

1 Motivation for Instrumental Variable (IV) Regression

Association studies and regression

This chapter follows from the work done in Chapter 4 of the Core topics book involving quadratic equations.

IEOR 165 Lecture 7 1 Bias-Variance Tradeoff

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Working Paper No Maximum score type estimators

2. Linear regression with multiple regressors

Response Surface Methodology: Process and Product Optimization Using Designed Experiments, 3rd Edition

Ability Bias, Errors in Variables and Sibling Methods. James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Massachusetts Institute of Technology Department of Economics Time Series Lecture 6: Additional Results for VAR s

1.4 Properties of the autocovariance for stationary time-series

Correlation and Regression

if n is large, Z i are weakly dependent 0-1-variables, p i = P(Z i = 1) small, and Then n approx i=1 i=1 n i=1


Exam Applied Statistical Regression. Good Luck!

Exam D0M61A Advanced econometrics

Simple Linear Regression: The Model

New Developments in Econometrics Lecture 9: Stratified Sampling

Introduction to Causal Inference. Solutions to Quiz 4

Subject CS1 Actuarial Statistics 1 Core Principles

An Experimental Evaluation of High-Dimensional Multi-Armed Bandits

Transcription:

Quantitative Empirical Methods Exam Yale Department of Political Science, August 2016 You have seven hours to complete the exam. This exam consists of three parts. Back up your assertions with mathematics where appropriate and show your work. Good answers will provide a direct answer that illustrates an understanding of the question, and calculations or statistical arguments to validate the answer. Where applicable, exceptional answers will include all of these as well as proofs that are technically complete, including formally articulating sufficient assumptions and regularity conditions. Questions will not be weighted equally, and a holistic score will be assigned to the exam, and thus it is important to demonstrate your understanding of the material to the best of your ability. Part 1 (Short Answer Section) consists of five short answer questions. Advice: Note there are multiple correct answers to some questions, and we encourage you to give the most complete (but still succinct) solution possible. Do not leave sub-parts of questions unanswered. Part 2 (Essay Section) contains a recent, well-regarded empirical article. We will ask you to offer an evaluation of its methodological approach and presentation of results. In particular, we will advise you to pay particular attention to the identification conditions (either explicit or implicit), the associated estimation strategy, and possible threats to inference. Your response may be anywhere from 500 to 1500 words. The only aids permitted for Parts 1 and 2 are (i) one page of double-sided notes, (ii) a word processor on one of the Statlab computers to write up your answers (you may also write up your answers to Part 1 using pencil/pen and paper). After handing in your answers for Parts 1 and 2 of the exam, you may begin Part 3 (though feel free to look ahead). You may hand in Parts 1 and 2 whenever you wish, but we recommend spending no longer than five hours on Parts 1 and 2. Part 3 (Computer Assisted Section) will involve using statistical software to answer one longer exercise with six associated questions. A complete answer to Part 3 will include code and output, as well as your written answers. Advice: We recommend that you explain what you are trying to do in comments. Even if you are not able to execute your program correctly, you can receive partial credit for explaining clearly what you wanted to do and why. For Part 3, you are permitted (i) unrestricted use of your own computer with access to the internet or (ii) use of a Statlab computer with access to the internet. The only restriction for Part 3 is that you may not interact with anyone, online or otherwise. For Part 3 (Computer Assisted Portion) of the exam, please turn in a hard copy of your code to Colleen, and also email a digital copy of the code to colleen.amaro@yale.edu. 1

1 Short Answer Section 1. Assume that the conditional expectation function of Y given X and Z, E [Y X, Z] = 10 + 10XZ. Further assume that X and Z are independent and each distributed according to the standard uniform distribution U(0, 1). (a) What is the marginal effect of X on Y when X = 0 and Z = 1? (b) What is the marginal effect of Z on Y when X = 0 and Z = 1? (c) What is the marginal effect of X on Y when both X and Z are at their means? (d) What is the average marginal effect of X on Y? 2. Suppose that the data generating process is i.i.d. Y i = a + bx i + u i, with E [u i X i ] = 0, Pr(X i < 0) (0, 1), and Var (X i ) > 0. The researcher observes only values of X i when X i 0. Suppose that the researcher drops all observations of (Y i, X i ) with missing values on X i, and performs an ordinary least squares (OLS) regression on the remaining values of Y i on X i (and a constant). Will the estimated slope from this regression generally be consistent for b? Why or why not? 3. Suppose that you have n mutually independent draws from a normally distributed random variable X with known variance Var (X) = 1, and unknown mean E [X] = µ. (a) What is the maximum likelihood estimate of µ? Denote this ˆµ. (b) What is lim n nvar (ˆµ)? (c) What is lim n ne [ˆµ µ]? (d) What is lim n ne [(ˆµ µ) 2 ]? (e) Suppose that Var (X) were unknown. Would the maximum likelihood estimate of µ differ without knowledge of the variance, Var (X)? (f) Bonus. Suppose that we know that µ is an integer. (You may assume Var (X) = 1.) Propose an estimator that is asymptotically more efficient than ˆµ. You may choose your own definition of asymptotic efficiency. [If you cannot give a formal proof, give an intuition.] 4. Suppose n observations are taken i.i.d. from (Y, D, Z, X), where Y, D, Z, and X are (scalar) random variables. Consider the following procedure: The researcher estimates the following model using OLS: D = α 0 + α 1 Z + α 2 X + U 1. Denote the estimated coefficients from this regression as (ˆα 0, ˆα 1, ˆα 2 ). Denote ˆD = ˆα 0 + ˆα 1 Z + ˆα 2 X. The researcher then estimates the following model using OLS: Y = β 0 + β 1 ˆD + β2 X + U 2. Denote the estimated coefficients from this regression as ( ˆβ 0, ˆβ 1, ˆβ 2 ). 2

Articulate a set of nontrivial conditions under which ˆβ 1 is consistent for a causal effect of D on Y. 5. Suppose that we are trying to conduct inference on 1 ni=1 1 E [X n i ]. Assume that lim ni=1 n E [X n i ] = µ, such that µ is finite. Let ˆµ = 1 ni=1 X n i. Further suppose we computed a Wald-type confidence interval as ˆµ ± 1.96 1 ni=1 (X n n i ˆµ) 2. Give at least one example of a data generating process such that the asymptotic coverage of the resulting confidence interval will not be (at least) 95%. 3

2 Essay section Read the article attached to your exam. Offer a critical evaluation of its methodological approach and presentation of results. Note: critical does not imply that you should only criticize where praise is warranted, or where the authors claims are well-justified, it is recommended that you give credit to the authors when their arguments are convincing and/or novel with respect to standard practice. Your response may be anywhere from 500 to 1500 words. We advise you to pay particular attention to the identification conditions (either explicit or implicit), the associated estimation strategy, and possible threats to inference. Justify each of your claims and, where applicable, suggest ways in which this line of research might be improved. (We do not expect you to have special expertise in the topic area, but we do expect you to bring to bear your general analytical skills as a political scientist). Article: Hainmueller, Jens and Dominik Hangartner. 2013. Who gets a Swiss passport? A natural experiment in immigrant discrimination. American Political Science Review. 159 187. 4

3 Computer Assisted Portion Consider the following data generating process: Y i = α + βx i + u i. Assume that X i is binary with Pr(X i = 1) = 1/4 and E [u i X i ] = 0. The researcher observes n i.i.d. draws from (Y i, X i ). Suppose that the researcher fits an ordinary least squares (OLS) regression of Y i on X i and a constant. Denote the estimated coefficient on X i as ˆβ. We will ask you to conduct a series of simulation studies to assess the behavior of Wald-type normal approximation-based confidence intervals for ˆβ constructed using classical standard errors, robust standard errors and the bootstrap. So as to guarantee non-collinearity in estimation, as you proceed, condition on the event that 0 < n i=1 X i < n. (Throughout, note that we are asking for Wald-type normal approximation-based confidence intervals, so do not use the percentile bootstrap.) Use at least 1000 simulations and at least 500 bootstrap replicates in computing your answers. 1. Assume that u i = U( 1, 1), where U(a, b) denotes the uniform distribution over the interval [a, b]. (a) Suppose that n = 10. What is the coverage of Wald-type 95% confidence intervals for ˆβ using (i) classical (OLS), (ii) robust (Huber-White), and (iii) bootstrap standard errors? (b) How about for n = 100? Repeat for (i)-(iii). (c) And n = 2500? Repeat for (i)-(iii). 2. Assume that u i = interval [a, b]. { U( 1, 1) : Xi = 0 U( 3, 3) : X i = 1, where U(a, b) denotes the uniform distribution over the (a) Suppose that n = 10. What is the coverage of Wald-type 95% confidence intervals for ˆβ using (i) classical (OLS), (ii) robust (Huber-White), and (iii) bootstrap standard errors? (b) How about for n = 100? Repeat for (i)-(iii). (c) And n = 2500? Repeat for (i)-(iii). 3. Throughout, (a) did your answers depend on the true values of α or β? Why or why not? (b) did you find any differences between the results in Q1 and Q2? Why or why not? (c) did your answers depend on n? Were some procedures preferable to others with small n? What practical conclusions might you draw? 5