Chapter 6 Stochastic Regressors

Similar documents
Chapter 1 Introduction. What are longitudinal and panel data? Benefits and drawbacks of longitudinal data Longitudinal data models Historical notes

Simultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations

Econometric Analysis of Cross Section and Panel Data

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Linear Models in Econometrics

Controlling for Time Invariant Heterogeneity

Applied Microeconometrics (L5): Panel Data-Basics

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

1 Motivation for Instrumental Variable (IV) Regression

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Exogeneity and Causality

Lecture 6: Dynamic panel models 1

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Non-linear panel data modeling

Longitudinal and Panel Data: Analysis and Applications for the Social Sciences. Table of Contents

Potential Outcomes Model (POM)

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Regression with time series

Dealing With Endogeneity

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler

Instrumental Variables

Multiple Linear Regression CIVL 7012/8012

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Applied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid

Linear Regression. Junhui Qian. October 27, 2014

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

1 Estimation of Persistent Dynamic Panel Data. Motivation

Final Exam. Economics 835: Econometrics. Fall 2010

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

ECON3150/4150 Spring 2015

Generalized Method of Moments: I. Chapter 9, R. Davidson and J.G. MacKinnon, Econometric Theory and Methods, 2004, Oxford.

Christopher Dougherty London School of Economics and Political Science

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

An overview of applied econometrics

Econometrics of Panel Data

WISE International Masters

11. Further Issues in Using OLS with TS Data

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

Introduction to Econometrics

Assessing Studies Based on Multiple Regression

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems

Fixed Effects Models for Panel Data. December 1, 2014

Least Squares Estimation-Finite-Sample Properties

Introductory Econometrics

The regression model with one stochastic regressor (part II)

ECON Introductory Econometrics. Lecture 16: Instrumental variables

Econometrics Homework 4 Solutions

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

EFFICIENT ESTIMATION USING PANEL DATA 1. INTRODUCTION

Topic 10: Panel Data Analysis

xtdpdml for Estimating Dynamic Panel Models

1. The Multivariate Classical Linear Regression Model

Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model

4.8 Instrumental Variables

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

Lecture 4: Linear panel models

ECO 310: Empirical Industrial Organization Lecture 2 - Estimation of Demand and Supply

Short T Panels - Review

Linear models. Linear models are computationally convenient and remain widely used in. applied econometric research

Time Series Models and Inference. James L. Powell Department of Economics University of California, Berkeley

Linear Regression with Time Series Data

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

Simple Linear Regression Model & Introduction to. OLS Estimation

ECONOMETFUCS FIELD EXAM Michigan State University May 11, 2007

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

The regression model with one stochastic regressor.

Introductory Econometrics

Environmental Econometrics

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Econometrics. 7) Endogeneity

Freeing up the Classical Assumptions. () Introductory Econometrics: Topic 5 1 / 94

What s New in Econometrics. Lecture 1

Missing dependent variables in panel data models

Dynamic Regression Models (Lect 15)

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Intermediate Econometrics

Advanced Econometrics

Review of Econometrics

Title. Description. var intro Introduction to vector autoregressive models

Testing Random Effects in Two-Way Spatial Panel Data Models

Econometrics Master in Business and Quantitative Methods

LECTURE 2 LINEAR REGRESSION MODEL AND OLS

EMERGING MARKETS - Lecture 2: Methodology refresher

Econometrics in a nutshell: Variation and Identification Linear Regression Model in STATA. Research Methods. Carlos Noton.

The Simple Linear Regression Model

Estimation of Dynamic Nonlinear Random E ects Models with Unbalanced Panels.

Lab 07 Introduction to Econometrics

Economics 113. Simple Regression Assumptions. Simple Regression Derivation. Changing Units of Measurement. Nonlinear effects

Applied Health Economics (for B.Sc.)

Truncation and Censoring

Econometrics (60 points) as the multivariate regression of Y on X 1 and X 2? [6 points]

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates

An Introduction to Structural Equation Modeling with the sem Package for R 1. John Fox

A Course on Advanced Econometrics

Transcription:

Chapter 6 Stochastic Regressors 6. Stochastic regressors in non-longitudinal settings 6.2 Stochastic regressors in longitudinal settings 6.3 Longitudinal data models with heterogeneity terms and sequentially exogenous regressors 6.4 ultivariate responses 6.5 Simultaneous equation models with latent variables Appendix 6A Linear projections

6. Stochastic regressors in nonlongitudinal settings 6.. Endogenous stochastic regressors 6..2 Weak and strong exogeneity 6..3 Causal effects 6..4 Instrumental variable estimation his section introduces stochastic regressors by focusing on purely cross-sectional and purely time series data. It reviews the non-longitudinal setting, to provide a platform for the longitudinal data discussion.

Non-stochastic explanatory variables raditional in the statistics literature otivated by designed experiments X represents the amount of fertilizer applied to a plot of land. However, for survey data, it is natural to think of random regressors. Observational data???? On the one hand, the study of stochastic regressors subsumes that of non-stochastic regressors. With stochastic regressors, we can always adopt the convention that a stochastic quantity with zero variance is simply a deterministic, or non-stochastic, quantity. On the other hand, we may make inferences about population relationships conditional on values of stochastic regressors, essentially treating them as fixed.

Endogenous stochastic regressors An endogenous variable is one that fails an exogeneity requirement more later. It is customary in economics to use the term endogenous to mean a variable that is determined within an economic system whereas an exogenous variable is determined outside the system. hus, the accepted econometric/statistic usage differs from the general economic meaning. If (x i, y i ) are i.i.d, then imposing the conditions E (y i x i ) = x i β and Var (y i x i ) = σ 2 are sufficient to estimate parameters. Define ε i = y i - x i β, and write the first condition as E (ε i x i ) =. Interpret this to mean that ε i and x i are uncorrelated.

Assumptions of the Linear Regression odel with Strictly Exogenous Regressors Wish to analyze the effect of all of the explanatory variables on the responses. hus, define X = (x,, x n ) and require SE. E (y i X) = x i β. SE2. {x,, x n } are stochastic variables. SE3. Var(y i X) = σ 2. SE4. {y i X} are independent random variables. SE5. {y i } is normally distributed, conditional on {X}.

Usual Properties Hold Under SE-SE4, we retain most of the desirable properties of our ordinary least square estimators of β. hese include: the unbiasedness and the Gauss-arkov property of ordinary least square estimators of β. If, in addition, SE5 holds, then the usual t and F statistics have their customary distributions, regardless as to whether or not X is stochastic. Define the disturbance term to be ε i = y i - x i β and write SE as E (ε i X) = is known as strict exogeneity in the econometrics literature.

Some Alternative Assumptions Regressors are said to be predetermined if SEp. E (ε i x i ) = E ( (y i - x i β) x i ) =. he assumption SEp is weaker than SE. SE does not work well with time-series data SEp is sufficient for consistent for consistency, not asymptotic normality. For asymptotic normality, we require a somewhat stronger assumption: SEm. E ( ε i ε i-,, ε, x i,, x ) = for all i. When SEm holds, then {ε i } satisfies the requirements for a martingale difference sequence. Note that SEm implies SEp.

Weak and strong exogeneity For linear model exogeneity We have considered strict exogeneity and predeterminedness. Appropriately done in terms of conditional means. It gives precisely the conditions needed for inference and is directly testable. Now we wish to generalize these concepts to assumptions regarding the entire distribution, not just the mean function. Although stronger than the conditional mean versions, these assumptions are directly applicable to nonlinear models. We now introduce two new kinds of exogeneity, weak and strong exogeneity.

Weak exogeneity A set of variables are said to be weakly exogenous if, when we condition on them, there is no loss of information about the parameters of interest. Weak endogeneity is sufficient for efficient estimation. Suppose that we have random variables (x, y ),, (x, y ) with joint probability density (or mass) function for f(y,, y, x,, x ).. By repeated conditioning, we write this as: f ( y ),..., y, x,..., x = f ( yt, xt y,..., yt, x,..., xt ) t= = { f ( yt y,..., yt, x,..., xt ) f ( xt y,..., yt, x,..., xt ) } t=

Weak exogeneity Suppose that this joint distribution is characterized by vectors of parameters θ and ψ such that ( y,...,, x,..., x ) f y ( y y,..., y, x,..., x, θ) f ( x y,..., y, x,..., x ψ) = f t t t t= t= t t t, We can ignore the second term for inference about θ, treating the x variables as essentially fixed. If this relationship holds, then we say that the explanatory variables are weakly exogenous..

Suppose, in addition, that Strong Exogeneity ( x y,...,, x,..., x, ψ) f ( x x,..., x, ψ) f t y t t = t t that is, conditional on x,, x t-, that the distribution of x t does not depend on past values of y, y,, y t-. hen, we say that {y,, y t- } does not Granger-cause x t. his condition, together with weak exogeneity, suffices for strong exogeneity. his is helpful for prediction purposes.

Causal effects Researchers are interested in causal effects, often more so than measures of association among variables. Statistics has contributed to making causal statements primarily through randomization. Data that arise from this random assignment mechanism are known as experimental. In contrast, most data from the social sciences are observational, where it is not possible to use random mechanisms to randomly allocate observations according to variables of interest. Regression function measures relationships developed through the data gathering mechanism, not necessarily the relationships of interest to researchers.

Structural odels A structural model is a stochastic model representing a causal relationship, as opposed to a relationship that simply captures statistical associations. A sampling based model is derived from our knowledge of the mechanisms used to gather the data. he sampling based model directly generates statistics that can be used to estimate quantities of interest It is also known as an estimable model.

Causal Statements Causal statements are based primarily on substantive hypotheses in which the researcher carefully develops. Causal inference is theoretically driven. Causal processes cannot be demonstrated directly from the data; the data can only present relevant empirical evidence serving as a link in a chain of reasoning about causal mechanisms. Longitudinal data are much more useful in establishing causal relationships than (cross-sectional) regression data because, for most disciplines, the causal variable must precede the effect variables in time. Lazarsfeld and Fiske (938) considered the effect of radio advertising on product sales. raditionally, hearing radio advertisements was thought to increase the likelihood of purchasing a product. Lazarsfeld and Fiske considered whether those that bought the product would be more likely to hear the advertisement, thus positing a reverse in the direction of causality. hey proposed repeatedly interviewing a set of people (the panel ) to clarify the issue.

Instrumental variable estimation Instrumental variable estimation is a general technique to handle problems associated with the disconnect between the structural model and a sampling based model. o illustrate, consider the linear model y i = x i β + ε i, yet not all of the regressors are predetermined, E (ε i x i ). Assume there a set of predetermined variables, w i, where E (ε i w i ) = (predetermined) E (w i w i ) is invertible. An instrumental variable estimator of β is b IV = (X P W X) - X P W y, where P W = W (W W ) - W is a projection matrix and W = (w,, w n ) is the matrix of instrumental variables. Within X P W is X W = x w i i i this sum of cross-products drives the calculation fo the correlation between x and w.

Omitted Variables Application he structural regression function as E (y i x i, u i ) = x i β + γ u i, where u i represents unobserved variables. Example- Card (995) wages in relation to years of education. Additional control variables include years of experience (and its square), regional indicators, racial indicators and so forth. he concern is that the structural model omits an important variable, the man s ability (u), that is correlated with years of education. Card introduces a variable to indicate whether a man grew up in the vicinity of a four-year college as an instrument for years of education. otivation - this variable should be correlated with education yet uncorrelated with ability. Define w i to be the same set of explanatory variables used in the structural equation model but with the vicinity

Instrumental Variables Additional applications include: easurement error problems Endogeneity induced by systems of equations (Section 6.5). he choice of instruments is the most difficult decision faced by empirical researchers using instrumental variable estimation. ry to choose instruments that are highly correlated with the endogeneous explanatory variables. Higher correlation means that the bias as well as standard error of b IV will be lower.

6.2. Stochastic regressors in longitudinal settings his section covers No heterogeneity terms Strictly exogeneous variables Both of these settings are relatively straightforward Without heterogeneity terms, we can use standard (cross-sectional) methods With strictly exogeneous variables, we can directly use the techniques described in Chapters -5

Longitudinal data models without heterogeneity terms Assumptions of the Longitudinal Data odel with Strictly Exogenous Regressors SE. E (y it X) = x it β. SE2. {x it } are stochastic variables. SE3. Var(y i X) = R i. SE4. {y i X} are independent random vectors. SE5. {y i } is normally distributed, conditional on {X}. Recall that X = {X,, X n } is the complete set of regressors over all subjects and time periods.

Longitudinal data models without heterogeneity terms No heterogeneity terms, but one can incorporate dependence among observations from the same subject with the R i matrix (such as an autoregressive model or compound symmetry ). hese strict exogeneity assumptions do not permit lagged dependent variables, a popular approach for incorporating intra-subject relationships among observations. However, one can weaken this to a pre-determined condition such as: SEp. E (ε it x it ) = E ( (y it x it β) x it ) =. Without heterogeneity, longitudinal and panel data models have the same endogeneity concerns as the cross-sectional models.

Longitudinal data models with heterogeneity terms and strictly exogenous regressors From customary usage or a structural modeling viewpoint, it is often important to understand the effects of endogenous regressors when a heterogeneity term α i is present in the model. We consider the linear mixed effects model of the form y it = z it α i + x it β + ε it and its vector version y i = Z i α i + X i β + ε i. Define X* = {X, Z,, X n, Z n } to be the collection of all observed explanatory variables and α = (α,, α n ) to be the collection of all subject-specific terms.

Assumptions of the Linear ixed Effects odel with Strictly Exogenous Regressors Conditional on the Unobserved Effect SEC. E (y i α, X*) = Z i α i + X i β. SEC2. {X*} are stochastic variables. SEC3. Var (y i α, X*) = R i. SEC4. {y i } are independent random vectors, conditional on {α} and {X*}. SEC5. {y i } is normally distributed, conditional on {α} and {X*}. SEC6. E (α i X*) = and Var (α i X*) = D. Further, {α,, α n } are mutually independent, conditional on {X*}. SEC7. {α i } is normally distributed, conditional on {X*}.

Observables Representation of the Linear ixed Effects odel with Strictly Exogenous Regressors Conditional on the Unobserved Effect SE. E (y i X*) = X i β. SE2. {X*} are stochastic variables. SE3a. Var(y i X*) = Z i DZ i + R i. SE4. {y i } are independent random vectors, conditional on {X*}. SE5. {y i } is normally distributed, conditional on {X*}.

Strictly Exogenous Regressors Conditional on the Unobserved Effect hese assumptions are stronger than strict exogeneity. For example, note that E (y i α, X*) = Z i α i + X i β and E (α i X*) = together imply that E (y i X*) = E (E ( y i α, X*) X*) = E (Z i α i + X i β X*) = X i β. hat is, we require strict exogeneity of the disturbances (E (ε i X*) = ) and that the unobserved effects (α) are uncorrelated with the disturbance terms (E (ε i α ) = ).

Example - axpayers Demographic Characteristics S - taxpayer's marital status. HH - head of household DEPEND - number of dependents claimed by the taxpayer. AGE - age 65 or over. Economic Characteristics LNPI - natural logarithm of the sum of all positive income line items on the return, in 983 dollars.. R - marginal tax rate. It is computed on total personal income less exemptions and the standard deduction. EP - Self-employed binary variable. PREP - indicates the presence of a paid preparer. LNAX - natural logarithm of the tax liability, in 983 dollars. his is the response variable of interest.

Example - axpayers Because the data was gathered using a random sampling mechanism, we can interpret the regressors as stochastic. Demographics, and probably EP, can be safely argued as strictly exogenous. LNAX t should not affect LNPI t, because LNPI is the sum of positive income items, not deductions. ax preparer variable (PREP) it may be reasonable to assume that the tax preparer variable is predetermined, although not strictly exogenous. hat is, we may be willing to assume that this year s tax liability does not affect our decision to use a tax preparer because we do not know the tax liability prior to this choice, making the variable predetermined. However, it seems plausible that the prior year s tax liability will affect our decision to retain a tax preparer, thus failing the strict exogeneity test.

axpayer odel -With heterogeneity terms Consider the error components model We interpret the heterogeneity terms to be unobserved subject-specific (taxpayer) characteristics, such as ability, that would influence the expected tax liability. One needs to argue that the disturbances, representing unexpected tax liabilities, are uncorrelated with the unobserved effects. oreover, Assumption SEC6 employs the condition that the unobserved effects are uncorrelated with the observed regressor variables. One may be concerned that individuals with high earnings potential who have historically high levels of tax liability (relative to their control variables) may be more likely to use a tax preparer, thus violating this assumption.

Fixed effects estimation If one is concerned with Assumption SEC6, then a solution may be fixed effects estimation (even when we believe in a random effects model formulation). Intuitively, this is because the fixed effects estimation procedures sweep out the heterogeneity terms they do not rely on the assumption that they are uncorrelated with observed regressors. Some analysts prefer to test the assumption of correlation between unobserved and observed effects by examining the difference between these two estimators Hausman test Section 7.2.

6.3 Longitudinal data models with heterogeneity terms and sequentially exogenous regressors he assumption of strict exogeneity, even when conditioning on unobserved heterogeneity terms, is limiting. Strict exogeneity rules out current values of the response (y it ) feeding back and influencing future values of the explanatory variables (such as x i,t+ ). An alternative assumption introduced by Chamberlain (992) allows for this feedback. We say that the regressors are sequentially exogenous conditional on the unobserved effects if E ( ε it α i, x i,, x it ) =. or (in the error components model) E ( y it α i, x i,, x it ) = α i + x it βfor all i, t. After controlling for α i and x it, no past values of regressors affect the expected value of y it.

Lagged dependent variable model his formulation allows us to consider lagged dependent variables as regressors y it = α i + γ y i,t- + x it β + ε it, his is sequentially exogenous conditional on the unobserved effects o see this, use the set of regressors o it = (, y i,t-, x it ) and E (ε it α i, y i,,, y i,t-, x i,,, x i,t ) =. he explanatory variable y i,t- is not strictly exogenous so that the Section 6.2.2 discussion does not apply.

Estimation difficulties of lagged dependent variable model Estimation of the lagged dependent variable model is difficult because the parameter γ appears in both the mean and variance structure. Cov (y it, y i,t- ) = Cov (α i + γ y i,t- + x it β + ε it, y i,t- ) = Cov (α i, y i,t- ) + γ Var (y i,t- ). and E y it = γ E y i,t- + x it β = γ (γ E y i,t-2 + x i,t- β ) + x it β = = (x it + γ x i,t- + + γ t-2 x i,2 )β + γ t- E y i,. hus, E y it clearly depends on γ. oreover, special estimation techniques are required.

First differencing technique First differencing proves to be a suitable device for handling certain types of endogenous regressors. aking first differences of the lagged dependent variable model yields y it -y i,t- = γ ( y i,t- - y i,t-2 ) + ε it - ε i,t-, eliminating the heterogeneity term. Ordinary least squares estimation using first differences (without an intercept term) yields an unbiased and consistent estimator of γ. First differencing can also fail - see the feedback example.

Example Feedback Consider the error components y it = α i + x it β + ε it where {ε it } are i.i.d. Suppose that the current regressors are influenced by the feedback from the prior period s disturbance through the relation x it = x i,t- + υ i ε i,t-, where {υ i } is an i.i.d. aking differences of the model, we have y it = y it -y i,t- = x it β + ε it where ε it = ε it - ε i,t- and x it = x it - x i,t- = υ i ε i,t-. he ordinary least squares estimator of β are asymptotically biased. Due to the correlation between x it and ε it.

ransform + instrumental variable estimation By a transform, we mean first differencing or fixed effects, to sweep out the heterogeneity. Assume balanced data and that the responses follow the model equation y it = α i + x it β + ε it, yet the regressors are potentially endogenous. Also assume that the current disturbances are uncorrelated with current as well as past instruments. ime-constant heterogeneity parameters are handled via sweeping out their effects, let K be a ( ) upper triangular matrix such that K=.

hus, the transformed system is K y i = K X i β + K ε i, Could use first differences So that = L L O L L K FD = = 2 3 2 2 FD y y y y y y y y y y L L O L L y K

Arrellano and Bover (995) recommend Defining ε i,fod = K FOD ε i, the tth row is hese are known as forward orthogonal deviations. hey are used in time series have slightly better properties. = 2 2 2 2 2 2 2 diag L L O L L L K FOD ( ) + + + = + i t i it FOD it t t t,,,... ε ε ε ε

o define the instrumental variable estimator, let W i be a block diagonal matrix with the tth block given by (w,i w 2,i w 2,it ). hat is, define W * i w =, i w 2, i ( w w w ), i 2, i 2, i2 L L O L ( w ), i w 2, i w 2, i2 L w 2, i, i his implies E W i Kε i =, our sequentially exogeneity assumption.

he estimator We define the instrumental variable estimator as where b IV = ( ) WX Σ IVWX WX ΣIVWy = n = W KX = = W iky WX i i i n Wy i i And IV ( W K ε ε K W ) Σ = E i i i i Σ IV Estimate via two-stage least squares

Feedback Example Recall the relation x it = x i,t- + υ i ε i,t-,. A natural set of instruments is to choose w it = x it. For simplicity, use the first difference transform. With these choices, the tth block of E W i K FD ε i is so the sequentially exogeneity assumption is satisfied. ( ) = + x x t i t i it i,, E ε ε

axpayer Example We suggested that a heterogeneity term may be due to an individual s earning potential this may be correlated with the variable that indicates use of a professional tax preparer. oreover, there was concern that tax liabilities from one year may influence the choice in subsequent tax year s choice of whether or not to use a professional tax preparer. If this is the case, then the instrumental variable estimator provides protection against this sequential endogeneity concern.

6.4 ultivariate responses 6.4. ultivariate regressions 6.4.2 Seemingly unrelated regressions 6.4.3 Simultaneous equations models 6.4.4 Systems of equations with error components

6.5 Simultaneous-Equations odels with Latent Variables 6.5. Cross-Sectional odels 6.5.2 Longitudinal Data Applications