Semiparametric Models and Estimators

Similar documents
Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas

The Influence Function of Semiparametric Estimators

Asymmetric least squares estimation and testing

Locally Robust Semiparametric Estimation

Cross-fitting and fast remainder rates for semiparametric estimation

Parametric Identification of Multiplicative Exponential Heteroskedasticity

Lecture 9: Quantile Methods 2

Semiparametric Estimation of a Sample Selection Model in the Presence of Endogeneity

MLE and GMM. Li Zhao, SJTU. Spring, Li Zhao MLE and GMM 1 / 22

GENERALIZEDMETHODOFMOMENTS I

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

Nonseparable multinomial choice models in cross-section and panel data

Nonparametric Econometrics

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2014 Instructor: Victor Aguirregabiria

Chapter 8. Quantile Regression and Quantile Treatment Effects

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be

Nonparametric and Semiparametric Estimation Whitney K. Newey Fall 2007

GMM Estimation and Testing II

Rank Estimation of Partially Linear Index Models

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

Generated Covariates in Nonparametric Estimation: A Short Review.

Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity

What s New in Econometrics? Lecture 14 Quantile Methods

A Local Generalized Method of Moments Estimator

A Note on Demand Estimation with Supply Information. in Non-Linear Models

Econ 582 Nonparametric Regression

ECO Class 6 Nonparametric Econometrics

Binary Models with Endogenous Explanatory Variables

Statistical Properties of Numerical Derivatives

Math 494: Mathematical Statistics

Parametric identification of multiplicative exponential heteroskedasticity ALYSSA CARLSON

Ultra High Dimensional Variable Selection with Endogenous Variables

General Doubly Robust Identification and Estimation

Fall, 2007 Nonlinear Econometrics. Theory: Consistency for Extremum Estimators. Modeling: Probit, Logit, and Other Links.

Semiparametric Estimation of Partially Linear Dynamic Panel Data Models with Fixed Effects

Semiparametric Estimation of Partially Linear Dynamic Panel Data Models with Fixed Effects

An Overview of the Special Regressor Method

Semiparametric modeling and estimation of the dispersion function in regression

Identification and Estimation of Partially Linear Censored Regression Models with Unknown Heteroscedasticity

Asymptotic Statistics-III. Changliang Zou

Flexible Estimation of Treatment Effect Parameters

STAT 512 sp 2018 Summary Sheet

Semiparametric Estimation with Mismeasured Dependent Variables: An Application to Duration Models for Unemployment Spells

General Doubly Robust Identification and Estimation

NBER WORKING PAPER SERIES QUANTILE REGRESSION WITH CENSORING AND ENDOGENEITY. Victor Chernozhukov Iván Fernández-Val Amanda E.

Estimation of Treatment Effects under Essential Heterogeneity

New Developments in Econometrics Lecture 16: Quantile Estimation

Quantile Processes for Semi and Nonparametric Regression

SPRING 2007 EXAM C SOLUTIONS

Modeling Binary Outcomes: Logit and Probit Models

Modelling Non-linear and Non-stationary Time Series

Bootstrap & Confidence/Prediction intervals

Econ 583 Homework 7 Suggested Solutions: Wald, LM and LR based on GMM and MLE

δ -method and M-estimation

Dynamic Discrete Choice Structural Models in Empirical IO

Simple Estimators for Monotone Index Models

L6. NONLINEAR AND BINARY REGRESSION, PREDICTIVE EFFECTS, AND M-ESTIMATION

Single Equation Linear GMM with Serially Correlated Moment Conditions

Greene, Econometric Analysis (6th ed, 2008)

Lecture 32: Asymptotic confidence sets and likelihoods

Spatial Regression. 11. Spatial Two Stage Least Squares. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Location Properties of Point Estimators in Linear Instrumental Variables and Related Models

Nonparametric Identification of a Binary Random Factor in Cross Section Data - Supplemental Appendix

Lecture Notes 15 Prediction Chapters 13, 22, 20.4.

Deductive Derivation and Computerization of Semiparametric Efficient Estimation

Statistics Ph.D. Qualifying Exam: Part II November 9, 2002

Economics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Advanced Econometrics

Chapter 2: Resampling Maarten Jansen

Semiparametric Estimation of Structural Functions in Nonseparable Triangular Models

Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices

Identification and Estimation of Marginal Effects in Nonlinear Panel Models 1

QUANTILE REGRESSION WITH CENSORING AND ENDOGENEITY

Learning 2 -Continuous Regression Functionals via Regularized Riesz Representers

Introduction to Nonparametric and Semiparametric Estimation. Good when there are lots of data and very little prior information on functional form.

1 Outline. 1. Motivation. 2. SUR model. 3. Simultaneous equations. 4. Estimation

Identification and Estimation of Semiparametric Two Step Models

Heteroskedasticity and Autocorrelation Consistent Standard Errors

Bootstrapping a conditional moments test for normality after tobit estimation

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

UNIVERSITY OF CALIFORNIA Spring Economics 241A Econometrics

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data

Economics 582 Random Effects Estimation

5601 Notes: The Sandwich Estimator

ON ILL-POSEDNESS OF NONPARAMETRIC INSTRUMENTAL VARIABLE REGRESSION WITH CONVEXITY CONSTRAINTS

σ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) =

Single Equation Linear GMM with Serially Correlated Moment Conditions

Individual Counterfactuals with Multidimensional Unobserved Heterogeneity

7 Semiparametric Estimation of Additive Models

Comprehensive Examination Quantitative Methods Spring, 2018

Nonlinear Error Correction Model and Multiple-Threshold Cointegration May 23, / 31

Simple Estimators for Semiparametric Multinomial Choice Models

Jinyong Hahn. Department of Economics Tel: (310) Bunche Hall Fax: (310) Professional Positions

Inference on Endogenously Censored Regression Models Using Conditional Moment Inequalities

Introduction. Linear Regression. coefficient estimates for the wage equation: E(Y X) = X 1 β X d β d = X β

A Simple Estimator for Binary Choice Models With Endogenous Regressors

Program Evaluation with High-Dimensional Data

Rockefeller College University at Albany

Transcription:

Semiparametric Models and Estimators Whitney Newey October 2007

Semiparametric Models Data: Z 1,Z 2,... i.i.d. Model: F aset of pdfs. Correct specification: pdf f 0 of Z i in F. Parametric model: F = {f(z θ) :θ Θ}, Θ is finite dimensional. Semiparametric model: F is not finite dimensional but has finite dimensional components.

Ex: Probit, Z = (Y,X), Y {0, 1}, Pr(Y =1 X) =Φ(X 0 β 0 ) F = {Φ(x 0 β) y [1 Φ(x 0 β)] 1 y h(x) :β B, h is pdf of X} β is parametric component, nonparametric component is h(x). Ex: Linear model Z = (Y,X),E[Y X] =X 0 β 0 ; F = {f(z) :E f [Y X] =X 0 β} parametric component is β, everything else nonparametric. β depends on f. Probit pdf is an explicit function of a parametric component β and a nonparametric component h(x). Linear model parameters imposes constraints on pdf.

Binary Choice with Unknown Disturbance Distribution: Z = (Y,X); Let v(x, β) be a known function, Y =1(Y > 0),Y = v(x, β 0 ) ε, ε independent of X, This equation implies that for the CDF G(t) of ε Pr(Y =1 X) =G(v(X, β 0 )), Here the parameter is β and everything else is nonparametric. The model can be written as an explicit function of the parametric component β and two nonparametric components. F =. The v(x, β) notation allows location and scale normalization, e.g. v(x, β) = x 1 + x 0 2 β, x = (x 1,x 0 2 )0, x 1 scalar. Generalization to nonmonotonic G( ).

Censored Regression with Unknown Disturbance Distribution: Z = (Y,X), Y =max{0,y },Y = X 0 β 0 + ε, med(ε X) =0; Parameter β, everything else, including distribution of ε, is nonparametric. F =. Binary choice and censored regression are limited dependent variable models. Semiparametric models are important here because misspecifying the distribution of the disturbances leads to inconsistency of MLE.

Partially Linear Regression: Z = (Y,X,W ), E[Y X, W ] =X 0 β 0 + g 0 (W ). Parameter β, everything else nonparametric, including additive component of regression. Can help with curse of dimensionality, with covariates X entering parametrically. In Hausman and Newey (1995) W is log income and log price, and X includes about 20 time and location dummies. X may be variable of interest and g 0 (Z) some covariates, e.g. sample selection. F =.

Index Regression: Z = (Y,X), v(x, β) aknown function, E[Y X] =τ(v(x, β 0 )), where the function τ( ) is unknown. Binary choice model has E[Y X] =Pr(Y = 1 X) =τ(v(x, β 0 )), with τ( )=G( ). If allow conditional distribution of ε given X to depend (only) on v(x, β 0 ), then binary choice model becomes index model. F =.

Semiparametric Estimators Estimators of β 0. Two kinds; do and do not require nonparametric estimation. Really model specific, but beyond scope to say why. One general kind of estimator: βˆ =arg min nx β B i=1 q(z i,β)/n, β 0 =arg mine[q(z i,β)], β B B set of parameter values. Extremum estimator. Clever choices of q(z, β) in some semiparametric models.

Ex: Censored regression quantiles, Powell (1984, 1986). med(y X) = X 0 β 0. max{0,y} is a monotonic transformation, so med(y X) = max{0,x 0 β 0 }, β 0 =arg mine[ Y i max{0,xi 0 β} ] β Sample analog is nx βˆ =arg min Y i max{0,xi 0 β}. β i=1 Censored least absolute deviations estimator of Powell (1984). Only requires med(y X) = X 0 β 0. Allows for heteroskedasticity.

Consistency and Asymptotic Normality of Minimization Estimators Consistency: If i) E[q(Z, β)] has a unique minimum at β 0, ii) β 0 B and B is compact; iii) q(z, β) is continuous at β with probability one; iv) E[sup β B q(z i,β) ] < ; then βˆ =arg min n β B Pi =1 q(z p i,β) β 0. Well known. Allows for q(z, β) to be discontinuous. Asymptotic Normality: (Van der Vaart, 1995). If βˆ p β 0, β 0 is in theinteriorof B, and i) E[q(Z i,β)] is twice differentiable at β 0 with nonsingular Hessian H; ii) there is d(z) such that E[d(Z) 2 ] exists and for all β, β B, q(z, β ) q(z, β) d(z)kβ βk; iii) with probability one q(z, β) is differentiable at β 0 with derivative m(z), thenfor Σ = E[m(Z)m(Z) 0 ], d n(βˆ β 0 ) N(0,H 1 ΣH 1 ).

Tests and confidence intervals: Either bootstrap or estimate H 1 ΣH 1 Estimating H : Plug into formula for H, or Ĥ is a finite difference approximation to second derivative of P q(z i, βˆ)/n evaluated at βˆ where differences not too small. Estimating Σ : 1 X Σˆ n = m(z i, βˆ)m(z i, βˆ) 0. n i=1

Alternative approach to estimating parameters of semiparametric models is to find g(z, β) with E[g(Z i,β 0 )] = 0, then do GMM. Using objective function is better. Why? Ex: Censored Least Absolute Deviations βˆ =arg min β nx i=1 Y i max{0,xi 0 β}. What is corresponding moment condition? g(z, β) = n When does βˆ solve P i=1 g(z i,β)=0. In some examples we first find g(z, β) andthen"integrate" to get q(z, β).

Estimators with Nonparametric Components Some models require use of nonparametric estimators. Include the partially linear and index regressions. Generalize extremum estimator: There is a function h such that β 0 =arg mine[q(z i,β,h)] β,h Does not work to solve βˆ =arg min nx β,h i=1 q(z i,β,h) In general βˆ not consistent.

General approach is "profile" method or "concentrating out" h. Find h(β) = arg mine[q(z i,β,h)] h Form nonparametric estimator ĥ(β) of h(β) (e.g. locally linear or series regression). Get βˆ =arg min β X i q(z i,β,ĥ(β)).

Ex: Partially linear model (Robinson, 1988). Know E[Y X, W ] minimizes E[(Y H(X, W )) 2 ] over H, sothat (β 0,h 0 ( )) = arg min E[{Y i X 0 i β h(w )} 2 ]. β,h( ) We can use q(z, β, h) = {Y X 0 β h(w )} 2 What would we get if we tried to solve X β =arg min q(z i,β,h)= arg min {Y i X i 0 β h(w i )} 2? β,h i β,h X i Instead we concentrate out h.

h(β) =argmine[{y i X i 0 β h(w i )} 2 ]=? h Get ĥ(β, W) =? Plug this back into sample minimization problem X βˆ = argmin {Y i X i 0 β ĥ(β, W i )} 2 β i = argmin β For series estimator same as X βˆ =argmin {Y i X i 0 β p K (W i ) 0 γ} 2. β,γ i Also, Ĥ(x, w) =x 0 βˆ + ĥ(w) is often of interest. Hausman and Newey (1995) w is ln(price) and ln(income) and Y is ln(demand).

Index regression, as in Ichimura (1993). By E[Y X] = τ 0 (v(x, β 0 )), (β 0,τ 0 ( )) = arg min E[{Y i τ(v(x i,β))} 2 ]. β,τ( ) Why? Concentrating out the τ, τ(x, β) =?. Estimator is βˆ = arg = arg nx min {Y i τˆ(x i,β)} 2 β i=1 min. β

Asymptotic Theory: Difficult because of presence of nonparametric estimator. Need ĥ(β) to converge faster than n 1/4 and other conditions. One helpful result is that estimation of h(β) does not affect asymptotic distribution. Straightforward to estimate asymptotic variance. Ĥ = n 1 X n i=1 q(z i,β, ĥ(β)) β β=βˆ,σˆ = 1 n nx i=1 Vˆ = Ĥ 1 ΣˆĤ 1 q(z i,ˆ β, h(ˆ ˆ β))q(z i, β, ˆ h(ˆ ˆ β)) 0. Under appropriate regularity conditions d ˆ p n(βˆ β 0 ) N(0,V ), V V.

Could also work with moment conditions, where E[g(z, β 0,h)] = 0. For exactly identified case get ĥ(β) and solve nx g(zi,β, ĥ(β)) = 0. i=1 In this case estimation of h does affect asymptotic variance. Formulae available for kernel and series estimates. Simplest for series: Just treat as if it were parametric and calculate two-step adjustment. Ex: Average derivative. h 0 (X) = E[Y X], β 0 = E[ h(x)/ X]; g(z, β, h) = β h(x)/ β. Estimator is 1 X ĥ(x i ) βˆ =. n X