Economics 620, Lecture 19: Introduction to Nonparametric and Semiparametric Estimation

Similar documents
Introduction to Nonparametric and Semiparametric Estimation. Good when there are lots of data and very little prior information on functional form.

Economics 620, Lecture 18: Nonlinear Models

Economics 620, Lecture 9: Asymptotics III: Maximum Likelihood Estimation

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

Economics 620, Lecture 14: Lags and Dynamics

Economics 620, Lecture 4: The K-Varable Linear Model I

Economics 620, Lecture 20: Generalized Method of Moment (GMM)

Economics 620, Lecture 7: Still More, But Last, on the K-Varable Linear Model

13 Endogeneity and Nonparametric IV

Economics 620, Lecture 15: Introduction to the Simultaneous Equations Model

Simple Estimators for Monotone Index Models

Introduction: structural econometrics. Jean-Marc Robin

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas

Rank Estimation of Partially Linear Index Models

i=1 y i 1fd i = dg= P N i=1 1fd i = dg.

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008

7 Semiparametric Methods and Partially Linear Regression

A New Approach to Robust Inference in Cointegration

Optimal Bandwidth Selection in Heteroskedasticity-Autocorrelation Robust Testing

Economics 620, Lecture 13: Time Series I

Nonparametric Identi cation and Estimation of Truncated Regression Models with Heteroskedasticity

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2014 Instructor: Victor Aguirregabiria

MC3: Econometric Theory and Methods. Course Notes 4

POWER MAXIMIZATION AND SIZE CONTROL IN HETEROSKEDASTICITY AND AUTOCORRELATION ROBUST TESTS WITH EXPONENTIATED KERNELS

Nonparametric Econometrics

Day 3B Nonparametrics and Bootstrap

Lecture 4: Linear panel models

Stat 5101 Lecture Notes

Econometrics I. Lecture 10: Nonparametric Estimation with Kernels. Paul T. Scott NYU Stern. Fall 2018

CAE Working Paper # A New Asymptotic Theory for Heteroskedasticity-Autocorrelation Robust Tests. Nicholas M. Kiefer and Timothy J.

Simple Estimators for Semiparametric Multinomial Choice Models

Finansiell Statistik, GN, 15 hp, VT2008 Lecture 15: Multiple Linear Regression & Correlation

Local Rank Estimation of Transformation Models with Functional Coe cients

Lecture Notes on Measurement Error

Nonparametric Regression

Economics 620, Lecture 4: The K-Variable Linear Model I. y 1 = + x 1 + " 1 y 2 = + x 2 + " 2 :::::::: :::::::: y N = + x N + " N

Measuring robustness

The Asymptotic Variance of Semi-parametric Estimators with Generated Regressors

Additive Isotonic Regression

Control Functions in Nonseparable Simultaneous Equations Models 1

Föreläsning /31

Day 4A Nonparametrics

Robust Con dence Intervals in Nonlinear Regression under Weak Identi cation

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2016 Instructor: Victor Aguirregabiria

SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER. Donald W. K. Andrews. August 2011

Economics 620, Lecture 2: Regression Mechanics (Simple Regression)

Serial Correlation Robust LM Type Tests for a Shift in Trend

Lectures 5 & 6: Hypothesis Testing

Econ 582 Nonparametric Regression

Economics 241B Estimation with Instruments

ECON 594: Lecture #6

Identi cation and Estimation of Semiparametric Two Step Models

Confidence intervals for kernel density estimation

ECONOMET RICS P RELIM EXAM August 24, 2010 Department of Economics, Michigan State University

Local regression I. Patrick Breheny. November 1. Kernel weighted averages Local linear regression

Lecture notes to Stock and Watson chapter 12

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

COMP 551 Applied Machine Learning Lecture 20: Gaussian processes

1 A Non-technical Introduction to Regression

1 Outline. Introduction: Econometricians assume data is from a simple random survey. This is never the case.

Identi cation of Positive Treatment E ects in. Randomized Experiments with Non-Compliance

41903: Introduction to Nonparametrics

Assignment 2 (Sol.) Introduction to Machine Learning Prof. B. Ravindran

Supplemental Material for KERNEL-BASED INFERENCE IN TIME-VARYING COEFFICIENT COINTEGRATING REGRESSION. September 2017

11. Bootstrap Methods

1 The Multiple Regression Model: Freeing Up the Classical Assumptions

ECON Introductory Econometrics. Lecture 11: Binary dependent variables

GMM-based inference in the AR(1) panel data model for parameter values where local identi cation fails

Economics 620, Lecture 8: Asymptotics I

Regression, Ridge Regression, Lasso

Finite Sample Performance of Semiparametric Binary Choice Estimators

xtunbalmd: Dynamic Binary Random E ects Models Estimation with Unbalanced Panels

Parametric identification of multiplicative exponential heteroskedasticity ALYSSA CARLSON

Local regression, intrinsic dimension, and nonparametric sparsity

Nonparametric Estimation of Wages and Labor Force Participation

Economics Introduction to Econometrics - Fall 2007 Final Exam - Answers

Comment on HAC Corrections for Strongly Autocorrelated Time Series by Ulrich K. Müller

finite-sample optimal estimation and inference on average treatment effects under unconfoundedness

Empirical Methods in Applied Microeconomics

Lecture 3: Statistical Decision Theory (Part II)

Machine Learning Linear Classification. Prof. Matteo Matteucci

Multivariate Regression: Part I

CAE Working Paper # Fixed-b Asymptotic Approximation of the Sampling Behavior of Nonparametric Spectral Density Estimators

Ultra High Dimensional Variable Selection with Endogenous Variables

Non-parametric Identi cation and Testable Implications of the Roy Model

A Note on the Closed-form Identi cation of Regression Models with a Mismeasured Binary Regressor

Kernel-based density. Nuno Vasconcelos ECE Department, UCSD

Nonparametric Trending Regression with Cross-Sectional Dependence

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS

Chapter 2. Dynamic panel data models

Test for Discontinuities in Nonparametric Regression

LECTURE 2 LINEAR REGRESSION MODEL AND OLS

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

SINGLE-STEP ESTIMATION OF A PARTIALLY LINEAR MODEL

Random Utility Models, Attention Sets and Status Quo Bias

Introduction to the Simultaneous Equations Model

Environmental Econometrics

Models, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley

Estimation of the Binary Response Model using a. Mixture of Distributions Estimator (MOD) Mark Coppejans 1. Department of Economics.

Transcription:

Economics 620, Lecture 19: Introduction to Nonparametric and Semiparametric Estimation Nicholas M. Kiefer Cornell University Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 1 / 18

Good when there are lots of data and very little prior information on functional form. Examples: y = f (x) + " (nonparametric) y = z 0 + f (x) + " (partial linear) y = f (z 0 ) + " (index model) Have to have some restrictions on f to avoid a perfect t. Di erentiability to some order, and bounded derivatives. Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 2 / 18

Estimation Assume the errors are iid and arrange the observations in order of the x i. Consider y = f (x) + ". Moving average estimator: ^f (x i ) = k 1 P y j for k values of j centered on i. Let k increase with the sample size, but more slowly than n. ^f (x i ) = k 1 P y j = k 1 P f (x j ) + k 1 P " j = f (x i ) + f 0 (x i )k 1 P (x j x i ) + 1 2 k 1 f 00 (x i ) P (x j x i ) 2 + k 1 P " j f 0 is multiplied by zero if x is symmetric around x i. Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 3 / 18

Estimation 2 ^f (x i ) = f (x i ) + 24 1 (k=n) 2 f 00 + k 1 P " j approximately. Hence ^f (x i ) = f (x i ) + O((k=n) 2 ) + O p (k 1=2 ): Note these errors are bias and variance (^f (x i ) f (x i )) 2 = O((k=n) 4 ) + O p (k 1 ). Consistent if k! 1 and k=n! 0. Best trades o bias and variance at the same rate: k 1. Or k = O(n 4=5 ), implying (k=n) 4 looks like (^f (x i ) f (x i )) 2 = O p (n 4=5 ): Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 4 / 18

Estimation 3 n 4=5 is the best possible rate. However, this has asymptotic bias (proportional to f 00 ) - so let k go a little slower to in nity. Then the bias term disappears. Interpretation???? JUST A TRICK!!! Generalization: Kernel Regression Really just weighted local averages. ^f (x i ) = P w j (x i )y j : Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 5 / 18

Estimation 4 Kernel: K(u) bounded, symmetric around zero, integrates to 1 (a normalization). Examples: Uniform, Bartlett, Normal (not drawn) Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 6 / 18

Estimation 5 w i (x i ) = K((x j x i )=)( P K((x j x i )=) is like k; in fact k = 2n. Convergence rate is optimized (at n 4=5 ) when = O(n 1=5 ). is the bandwidth. Usually assume a faster rate to eliminate the bias term in constructing con dence intervals. (JUST A TRICK.) Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 7 / 18

Estimation 6 Smoothness Restrictions: Example: jf 0 (x)j < L Solve min P (y i ^y i ) 2 s.t. j(^y i ^y j )=(x i x j )j < L. Adding monotonicity adds the constraint Concavity adds another constraint. ^y i < ^y j for x i < x j : Rates of convergence depend on the dimension of x (here 1) and on the number of derivatives. Maximal rate is n 2m=(2m+d ) where m is # derivatives and d is the dimension of x. Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 8 / 18

Estimation 7 Selection of bandwidth? Try a few and look at the results and residuals!! Formally, use cross valdiation. CV: ^f i (x i ). t ^f i using all data except the ith observation, then predict Then calculate CV () = n 1 P (y i ^f i (x i )) 2 : Choose to minimize this function. Requires a lot of computation. Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 9 / 18

Partial Linear Model y = z + f (x) + " The amazing result is that can be estimated at the parametric rate. y E(yjx) = y E(zjx) f (x) Suggests regressing y Ey jx on z Ezjx. = (z E(zjx)) + ": Estimate these conditional expectations by nonparametric kernel regression. Extends easily to higher dimensional z (estimate many conditional expectation functions and do the regression). Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 10 / 18

Index Models y = f (x 0 ) + " Here x is k-dimensional - the linear index x 0 a ects y nonparametrically. For xed, f can be estimated, for example, with kernel regression, as ^f. Estimate by minimizing n 1 P (y i ^f (x 0 i )) 2 : There is a lot of work on this problem. A basic result is that can be estimated at the usual rate (variance like n 1 ). Binary y generalizes logit, probit. Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 11 / 18

Index Models 2 Identi cation: Note f and are not separately identi ed. (typically one of the = 1). A normalization is necessary To estimate f consistently, at least one of the regressors must be continuous. (Think about it - we will use di erentiability assumptions on f.) Of course, also P x i x 0 i must have full rank. A little more is required. Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 12 / 18

Speci cation Testing NP estimation of residual variance: y i = f (x i ) + " i arranged in order of x, with jf 0 j < L s 2 = 1=2n 1 P (y i y i 1 ) 2 E(s 2 ) = 1=2n 1 P (f (x i ) f (x i 1 )) 2 +1=2En 1 P (" i " i 1 ) 2 First term looks like (f 0 [x i x i 1 ]) 2 < (L=n) 2 (x cont. distributed) Cross product? Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 13 / 18

Speci cation Testing 2 Second term is 2 so the estimator is consisent. Asymptotic distribution: n 1=2 (s 2 2 )! N(0; 4 ): To test against a parametric alternative, calculate s 2 a from the alternative and consider n 1=2 (s 2 a s 2 )=s 2! N(0; 1) reject if large. Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 14 / 18

Higher Dimensions are Problems Suppose y = f (x) = f (x 1 ; x 2 ) + ". Estimate f (x i ) by taking an average of the y in a neighborhood of x i. Suppose the neighborhood is a square? W /uniform x on the unit square, each neighborhood has about 2 n observations. ^f (x i ) = ( 2 n) 1 P y j = ( 2 n) 1 P f (x j ) + ( 2 n) 1 P " j f (x i ) + O( 2 ) + O p (1=(n 1=2 )): Same arguments as before, but now have instead of 1=2 : Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 15 / 18

Higher Dimensions are Problems 2 Consistency requires 0 and n 1=2 1. The optimal rate reduces bias and variance at the same rate. = O(n 1=6 ). Then (^f f ) 2 = O p (n 2=3 ). This implies This rate is optimal and is slower then the rate in the 1-dimensional model. The same arguments work for kernel estimators in higher dimensions. Many variations are available (di erent kernels, bandwidth choices, neighborhoods, etc.). Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 16 / 18

Higher Dimensions are Problems 3 Want, say, 1% of data to form a local average (or local weighted averge w/kernel). Uniform observations, 1 dim, unit interval, local is a.01 length interval. 2 dim, unit square, local is.01 1=2 = :1 unit square - 1/10 the range in each dimension. Generally,.01 1=p where p is the dim. Gets nonlocal fast. Picture? Mean distance to origin increases w/dimension - most points are near the boundary. Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 17 / 18

The source of most of this lecture and a great reference on applied nonparametric and semiparametrics (like the partial linear model) is Adonis Yatchew (2003). Semiparametric Regression for the Applied Econometrician, Cambridge University Press. Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis 18 / 18