Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods. MIT , Fall Due: Wednesday, 07 November 2007, 5:00 PM

Similar documents
APEC 8212: Econometric Analysis II

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

Lecture 22 Survival Analysis: An Introduction

Lecture 9: Quantile Methods 2

Econ 273B Advanced Econometrics Spring

Exam D0M61A Advanced econometrics

For right censored data with Y i = T i C i and censoring indicator, δ i = I(T i < C i ), arising from such a parametric model we have the likelihood,

ISQS 5349 Spring 2013 Final Exam

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be

Step-Stress Models and Associated Inference

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

Econometric Analysis of Cross Section and Panel Data

Modelling geoadditive survival data

11. Bootstrap Methods

Quantile regression and heteroskedasticity

Final Exam. Name: Solution:

AFT Models and Empirical Likelihood

Economics 508 Lecture 22 Duration Models

What s New in Econometrics? Lecture 14 Quantile Methods

Robust covariance estimation for quantile regression

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid

Working Paper No Maximum score type estimators

New Developments in Econometrics Lecture 16: Quantile Estimation

Econometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction

Semiparametric Regression

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

Parametric identification of multiplicative exponential heteroskedasticity ALYSSA CARLSON

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1

A Simple Way to Calculate Confidence Intervals for Partially Identified Parameters. By Tiemen Woutersen. Draft, September

Semiparametric Generalized Linear Models

Estimating the Derivative Function and Counterfactuals in Duration Models with Heterogeneity

Parametric Identification of Multiplicative Exponential Heteroskedasticity

Stock Sampling with Interval-Censored Elapsed Duration: A Monte Carlo Analysis

UNIVERSITY OF CALIFORNIA Spring Economics 241A Econometrics

Lecture 7 Time-dependent Covariates in Cox Regression

A Course on Advanced Econometrics

Duration Analysis. Joan Llull

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

ST745: Survival Analysis: Nonparametric methods

Econometrics I G (Part I) Fall 2004

Maximum Likelihood (ML) Estimation

Simulation-based robust IV inference for lifetime data

Post-exam 2 practice questions 18.05, Spring 2014

Likelihood Construction, Inference for Parametric Survival Distributions

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where

Subject CS1 Actuarial Statistics 1 Core Principles

Analysing geoadditive regression data: a mixed model approach

STAT 6350 Analysis of Lifetime Data. Probability Plotting

Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions

Censored quantile instrumental variable estimation with Stata

Unconditional Quantile Regression for Panel Data with Exogenous or Endogenous Regressors

Multilevel Statistical Models: 3 rd edition, 2003 Contents

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

Applied Health Economics (for B.Sc.)

Economics 536 Lecture 7. Introduction to Specification Testing in Dynamic Econometric Models

Stat 5101 Lecture Notes

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Decomposing Changes (or Differences) in Distributions. Thomas Lemieux, UBC Econ 561 March 2016

Practical Bayesian Quantile Regression. Keming Yu University of Plymouth, UK

Multistate Modeling and Applications

Introduction to Reliability Theory (part 2)

ON ILL-POSEDNESS OF NONPARAMETRIC INSTRUMENTAL VARIABLE REGRESSION WITH CONVEXITY CONSTRAINTS

STAT331. Cox s Proportional Hazards Model

Economics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina

Introduction to Econometrics

6.867 Machine Learning

Fall, 2007 Nonlinear Econometrics. Theory: Consistency for Extremum Estimators. Modeling: Probit, Logit, and Other Links.

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

Accelerated Failure Time Models

CALCULATION METHOD FOR NONLINEAR DYNAMIC LEAST-ABSOLUTE DEVIATIONS ESTIMATOR

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

IDENTIFICATION OF THE BINARY CHOICE MODEL WITH MISCLASSIFICATION

6.867 Machine Learning

Comment on A Comparison of Two Business Cycle Dating Methods. James D. Hamilton

A Note on Binary Choice Duration Models

Informational Content in Static and Dynamic Discrete Response Panel Data Models

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Quantile Regression for Panel/Longitudinal Data

Practical Considerations Surrounding Normality

Analogy Principle. Asymptotic Theory Part II. James J. Heckman University of Chicago. Econ 312 This draft, April 5, 2006

On Backtesting Risk Measurement Models

Comparing Predictive Accuracy, Twenty Years Later: On The Use and Abuse of Diebold-Mariano Tests

Multistate models in survival and event history analysis

University of California, Berkeley

1. GENERAL DESCRIPTION

Logit estimates Number of obs = 5054 Wald chi2(1) = 2.70 Prob > chi2 = Log pseudolikelihood = Pseudo R2 =

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

2. Linear regression with multiple regressors

INVERTED KUMARASWAMY DISTRIBUTION: PROPERTIES AND ESTIMATION

Transparent Structural Estimation. Matthew Gentzkow Fisher-Schultz Lecture (from work w/ Isaiah Andrews & Jesse M. Shapiro)

Chapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a

A better way to bootstrap pairs

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models

Flexible Regression Modeling using Bayesian Nonparametric Mixtures

Transcription:

Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods MIT 14.385, Fall 2007 Due: Wednesday, 07 November 2007, 5:00 PM 1 Applied Problems Instructions: The page indications given below give you an upper bound on what you should write. Answers should be very brief. Hand in clear, annotated code. You are strongly encouraged to work in groups, but the write-ups should be individual. 1.1 Bootstrap 1. Explain briefly what the bootstrap is, why it works when it works and how it can fail. (1/2 page max, informal reasoning is fine) 2. Find an empirical paper that uses the bootstrap. Explain why they use the bootstrap. (1/4 page max) Write pseudo-code ( a brief outline of an algorithm) for how you would reproduce the bootstrap used in this paper. (1/2 page max) 3. What is the bootstrap bias correction method? How does it work? State the pseudo-code for bias correction. (1/2 page max, informal reasoning is fine; hint: read Horowitz) 4. Two-step estimators have messy standard errors, which makes bootstrap inference appealing. Consider the censored regression 2-step estimator of PSet 2, Q1.b.iii. Bootstrap the standard errors. Compare these standard errors to those generated for you by the Stata canned command. 1

1.2 Quantile Regression 1. Theory. Briefly answer the following questions. Fundamentals: (a) (1/4 page max) Consider the location-scale model Y = X α + (X γ) u where u is independent of x and has distribution function F u (u) Write down the conditional quantile function of y given x in the form Q Y (τ x) = x β (τ) Characterize β (τ) as a function of τ (e.g. monotonicity if any...). How does the answer change if (x γ) = 1? (b) (1/4 page max) Show that E[Y x] = Q Y (τ x) dτ is of the form x β, where β is what? (c) (1/4 page max) Consider a general linear quantile model, Q Y (τ x) = x β (τ) corresponds to the random coefficient model Y = X β(u). Can we have slope coefficients β (τ) that behave non-monotonically in τ? Recall Doksum s quantile treatment effect example. (d) (1/2 page max) Using (a)-(b), give a brief conceptual evaluation of the linear quantile regression model. Compare it with the classical linear model, and state the advantages and disadvantages of the two. Refer to at least one empirical example discussed in class where the difference in models is important. E.g. refer to Doksum s guinea pig example. (e) (1 page max) Quantile equivariance to monotone transformations Suppose the conditional quantile function of y given x is in the form Q Y (τ x) = x β (τ) Now consider the following models of data transformation 2

i. y = f(y) = max {y, 0} ii. y = f(y) = 1 {y > 0} iii. y = f(y) = exp(y) For each of the above, write down the conditional quantile function for the trasnformed variable conditional on x: Q Y (τ x) = f(x β (τ)) Hint: quantile models have a remarkable property equivariance to monotone transformations, which we (should) observe in these examples. (Useful reference: http://www.econ.uiuc.edu/~roger/research/rq/rq.pdf + lecture notes). Is the same property true of the conditional mean function, i.e. is E[Y x] = f[e[y x]]? Why is this a problem? (f) (1/2 page) For each of the above three cases write down an estimators of β (τ) that use data on (y, x) only. (Hint: Case (i) generates the censored quantile regression estimator, case (ii) the maximum score / perception / single-layer neural network estimator, and case (iii) a nonlinear quantile regression estimator.) (g) (1/4 page maximum) Summarize the findings above very briefly, and say why they are important for applications to duration models and censored regression models. 2. (1 page, excluding figures.). Empirical: Reproduce or closely replicate Figures 1, Table 1, then Figure 2 of Koenker s Vignette (posted in Recitation Materials section on alternatively Roger Koenker s website). You may use any software you like, but is probably just easier to use R (which is a great statistical freeware). As usual, you should briefly explain what the canned commands are doing. The Engel data are available from within R or on the MIT Server. MIT Server 3. For this question, use the data in Penn46.ascii. See Section 5 of Koenker and Billias, Quantile Regression for Duration Data, Empirical Economics, 2001, for a description of the 3

program and the data. For part (a), review W. Newey s lecture note on duration models or Wooldridge 20.1-20.2. (a) (1 page) Conventional duration models usually boil down to an empirical relation of the following kind: h (T i ) = x i β + u i (1) where T i is a survival time (here, the duration of an unemployment spell), h () is monotonic and u i is iid with CDF F u. (i) Show that if we assume a Cox proportional hazard model, we can write the duration of unemployment in the form in (a) above with h (T ) = log Λ 0 (T ) where Λ 0 (t) is the integrated baseline hazard. (ii) Show that in the special case of a Weibull baseline hazard, we have log Λ 0 (T ) = γ log T α (iii) Combine your results from (i) and (ii) to show that, under the assumptions of Cox proportional hazard and Weibull baseline hazard, the log of the unemployment duration satisfies the following equation: log T i = x i β + u i This is a special case of the model called the Accelerated Failure Time (AFT) model. (Note: AFT models do not make parametric assumptions on the error term). (b) (1/2 page) Show that an implication of the model (1) is that the covariates only shift the location of the distribution, not the scale, i.e. the conditional quantile function of 4

h (T ) is Q h(t ) (τ x) = x β + F 1 (τ) u for τ (0, 1). ( ) Show that we can write the conditional quantile function of T as Q T (τ x) = h 1 x 1 β + F u (τ) Do covariates affect the location of T only (but not the scale etc.?) (c) (1/2 page) Now consider a quantile regression alternative to (1). Explain how quantile regression can be used to estimate more general models of the form Q h(t ) (τ x) = x β(τ), and how if we wanted, we could recover an estimate of Q T (τ x) = h 1 (x β (τ)) What are pros and cons of quantile regression approach relative to the approach outlined in (a) and (b)? (d) (1 page, excluding graphs) Using the Penn data, estimate the AFT model by maximum likelihood (using the Weibull distribution state what the likelihood function is) and by OLS, using the following regressors: indicators for the 5 treatment groups, with treatments 4 and 6 pooled; indicators for female, black, and hispanic respondents; number of dependents, with 2 indicating two or more dependents; indicators for the 5 quarters of entry to the experiment; indicator for whether the claimant expected to be recalled ; indicator of whether the respondent was young less than 35 or old greater than 54; indicator for whether claimant was employed in the durable goods industry; indicator for whether the claimant was registered in a low employment district (Coatesville, Reading or Lancaster). (All these are coded for you in the data set and it should be clear from the variable names what to use.) Present your results in a neat, clear format that is easy to read. Give a brief discussion of the key results. 5

(e) (1 page, excluding graphs) Again using the Penn data and the same regressors, estimate the model Q log(t ) (τ x) = x β (τ) by quantile regression for τ = {0.20, 0.25,..., 0.75, 0.80}. Present your results graphically. (f) (1/4 page) What do you make of results in (d) and (e)? 1.3 Bayesian and Quasi-Bayesian Estimators 1. Explain briefly what these estimators do. Give a simple one page explanation. 2. Explain what is (quasi) posterior mean, median, quantile. Are posterior mean and medians inferior estimators compared to the extremum estimates? Are they consistent, asymptotically normal? Do posterior quantiles provide valid inference when generalized information equality holds (satisfied for example when GMM function has optimal weighting matrix). (1/2 page max) 1.4 Introduction to MCMC: 1. Consider doing IV median regression using GMM criterion function for the model: y = βx + u, u N (0, 1), where x = z +v, v N (0, 1) and z N(0, 1). Here z is the instrument, x is the endogenous variable, and y is outcome. Suppose the true value of β is 0. Suppose the sample size is 100. Write an MCMC (Metroppolis) algorithm that produces a posterior mean and median estimates of β. Compute also the confidence intervals for β based on posterior quantiles. Report your estimates and confidence intervals. Briefly explain (one paragraph) how you would set up the objective function to use in the quasi-bayes estimation. Briefly explain (a paragraph) what the code is doing. Below I provide the sample code for this problem, which is written in R. Feel free to use it, but make sure to explain it. Print out the graphs as well. Also, I should note that for future work, that there are several professional implementations of the Metropolis algorithm, for example, you can find one by Charles Geyer at his homepage at the stats department of the University of Minnesota (you do not need it for this problem). 6

2 Theory Problems If you do this option you can skip the bootstrap question (1.1) and the empirical portion of the quantile regression question (but still do the non-empirical parts). You also need to do Bayesian and the MCMC questions. 1. Read parts of Andrew s Handbook chapter (p. 2249 58) that deals with stochastic equicontinuity and its use for deriving the asymptotic properties of estimators. Present the Andrews argument for the asymptotic normality. Explain the role of stochastic equicontinuity in this derivation. Explain how one can verify the stochastic equicontinuity via bracketing (see p. 2276 ). 1 2. Use Andrew s approach to characterize the asymptotic distribution of quantile regression estimator. Rigorously verify the stochastic equicontinuity of a relevant empirical process, using the bracketing entropy approach (you may also use symmetrization approach if you like). 1 Andrews, Donald W. K. Empirical Process Methods in Econometrics. In Handbook of Econometrics, vol. 4, edited by Daniel McFadden and Robert Engle. Amsterdam, The Netherlands: North-Holland Publishing Co., 1994, chap. 37, pp. 2247-2294. ISBN: 9780444887665. 7