Econ107 Applied Econometrics Topic 5: Specification: Choosing Independent Variables (Studenmund, Chapter 6)

Specification errors that we will deal with: wrong independent variables; wrong functional form. This lecture deals with wrong independent variables, which may be due to omitted variables or redundant (irrelevant) variables.

Use the following example under both types:

$$\ln W_i = \beta_0 + \beta_1 S_i + \beta_2 \text{OJT}_i + \varepsilon_i$$

where
W = wage rate of worker.
S = years of formal education of worker.
OJT = effective years of on-the-job training of worker.

The idea is that we have 2 forms of human capital: general human capital obtained through formal education, and specific human capital obtained through vocational education, apprenticeship programmes, etc. Both may increase wages (i.e., $\beta_1 > 0$ and $\beta_2 > 0$), but not at the same rate (i.e., $\beta_1 \neq \beta_2$).

I. Omitting a Relevant Variable

This is one of the most common problems in regression analysis. It could be due to ignorance on the part of the researcher (i.e., the variable is available, but not used). More likely, the data are unavailable (e.g., Household Economic Survey).

We estimate the following model instead:

$$\ln W_i = \beta_0 + \beta_1 S_i + \varepsilon_i^*$$

so that the true error in the above regression is

$$\varepsilon_i^* = \ln W_i - \beta_0 - \beta_1 S_i = \beta_2 \text{OJT}_i + \varepsilon_i$$

Assumption 2 does not hold because $E(\varepsilon_i^*) = \beta_2 \text{OJT}_i \neq 0$. More importantly, in the case where OJT and S are correlated, Assumption 3 does not hold because $\text{Cov}(\varepsilon^*, S) \neq 0$. As a result, the Gauss-Markov theorem does not apply. In general, the OLS estimate of the regression coefficient is biased, i.e., $E(\hat\beta_1^*) \neq \beta_1$.
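The claim that dropping OJT biases the coefficient on S can be checked with a small Monte Carlo experiment. The sketch below is illustrative only: the parameter values, sample size and the way OJT is made to correlate with S are all assumptions chosen for the demonstration, not values from the lecture. It compares the average slope on S from the full regression and from the regression that omits OJT, and sets the gap against the standard omitted-variable expression $\beta_2 \, \text{Cov}(S, \text{OJT}) / \text{Var}(S)$.

```python
import numpy as np

def ols(X, y):
    """Return OLS coefficients; X must already contain a column of ones."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef

rng = np.random.default_rng(0)
n, reps = 500, 1000
beta0, beta1, beta2 = 1.0, 0.08, 0.05   # hypothetical "true" parameters

long_estimates, short_estimates, implied_bias = [], [], []
for _ in range(reps):
    S = rng.normal(12.0, 2.0, n)
    OJT = 0.5 * S + rng.normal(0.0, 1.0, n)   # assumption: OJT rises with S
    eps = rng.normal(0.0, 0.3, n)
    lnW = beta0 + beta1 * S + beta2 * OJT + eps

    const = np.ones(n)
    # Full model: ln W on S and OJT
    long_estimates.append(ols(np.column_stack([const, S, OJT]), lnW)[1])
    # Misspecified model: OJT omitted
    short_estimates.append(ols(np.column_stack([const, S]), lnW)[1])
    # Sample analogue of beta2 * Cov(S, OJT) / Var(S)
    b12 = np.cov(S, OJT)[0, 1] / np.var(S, ddof=1)
    implied_bias.append(beta2 * b12)

mean_long = float(np.mean(long_estimates))
mean_short = float(np.mean(short_estimates))
mean_implied_bias = float(np.mean(implied_bias))

print("mean slope on S, OJT included:", round(mean_long, 4))
print("mean slope on S, OJT omitted: ", round(mean_short, 4))
print("observed bias:", round(mean_short - beta1, 4),
      "implied bias:", round(mean_implied_bias, 4))
```

Because OJT is built to be positively correlated with S and its coefficient is positive, the slope from the short regression should sit above the true value on average, while the full regression should be centred on it.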

The bias of $\hat\beta_1^*$ is

$$\text{bias}(\hat\beta_1^*) = E(\hat\beta_1^*) - \beta_1 = \beta_2 b_{12}$$

where:

$$b_{12} = \frac{\text{Cov}(S, \text{OJT})}{\text{Var}(S)}$$

Suppose that $b_{12} > 0$ (with $\beta_2 > 0$ as assumed above), then:

$$E(\hat\beta_1^*) > \beta_1$$

and the estimated coefficient is biased upward. The bias is zero when the coefficient of the omitted variable is zero or when the included and omitted variables are uncorrelated.

In addition, the standard errors on these estimated coefficients will be biased. In the misspecified model:

$$\text{Var}(\hat\beta_1^*) = \frac{\sigma^2}{\sum s_i^2}$$

But the variance of the 'true' estimator is:

$$\text{Var}(\hat\beta_1) = \frac{\sigma^2}{\sum s_i^2 \, (1 - r^2)}$$

where $s_i = S_i - \bar{S}$ and r is the correlation coefficient between S and OJT. This means that:

If $r^2 > 0$, then $\text{Var}(\hat\beta_1^*) < \text{Var}(\hat\beta_1)$.

The variance of the estimated coefficient is also biased. We're placing 'too much' confidence in our coefficient estimates. The result is that the t test will be misleading (this is true even if r = 0, because our estimate of $\sigma^2$ will also be biased).

The remedial measure is easy IF we know which variable has been omitted and this omitted variable is available: include it in the model. If the omitted variable is not available, we might try to find a proxy variable that is closely related to the missing variable (e.g., use information on the average OJT for people in a particular industry and occupation). Or at least sign the direction of the bias, and estimate its potential magnitude.

The above remedy works in theory. In practice, it is sometimes difficult to know if a variable has been omitted. To detect the problem of omitting a relevant variable, one common practice is to examine the signs of the estimated coefficients and see if they meet our expectations or economic theory. If not, it is very likely that relevant variables have been omitted. The next step is to use the direction of the bias to look for relevant variables.

II. Including an Irrelevant Variable

Suppose the true model doesn't contain OJT. This is consistent with some theoretical models that predict that this human capital will not affect wages; employers are more likely to pay for it. Thus, the correct regression model is:

$$\ln W_i = \beta_0 + \beta_1 S_i + \varepsilon_i$$

but we estimate:

$$\ln W_i = \beta_0 + \beta_1 S_i + \beta_2 \text{OJT}_i + \varepsilon_i^{**}$$

The problems here are less severe compared to omitting a relevant variable. The true error in the above regression is

$$\varepsilon_i^{**} = \varepsilon_i - \beta_2 \text{OJT}_i$$

If OJT is irrelevant, $\beta_2$ should be zero and hence Assumption 2 holds. Assumption 3 holds too.

What are the properties of the OLS estimates?
(1) Estimated coefficients are unbiased and consistent.
(2) The t test is valid if the correct standard error is used.
(3) The only problem is that the estimated coefficients are inefficient.

Under the 'false' model:

$$E(\hat\beta_1) = \beta_1, \qquad \text{Var}(\hat\beta_1) = \frac{\sigma^2}{\sum s_i^2 \, (1 - r^2)}$$

Under the 'true' model:

$$\text{Var}(\hat\beta_1^*) = \frac{\sigma^2}{\sum s_i^2}$$

Since $\text{Var}(\hat\beta_1^*) < \text{Var}(\hat\beta_1)$ if $r^2 > 0$, we're placing 'too little' confidence in our coefficient estimates (i.e., the standard error on the estimated coefficient is larger than it should be). This makes the t-ratio smaller than it should be, and makes it more likely that we won't be able to reject the null when we should.

This is an easy one to solve in theory. If the variable shouldn't be in the regression, eliminate it from the outset. But in practice, this isn't so easy. The theory in this example says that both specifications might be right. If an independent variable may be relevant, include it.

III. How to Decide Whether to Include a Variable or Not?

1. Graphic method to detect the problem of omitting a relevant variable

Plot the residuals and look for a 'distinct pattern'. Take the earlier example on the functional form of the regression. We estimate:

$$\ln W_i = \beta_0 + \beta_1 S_i + u_i^*$$

but the 'true' model is:

$$\ln W_i = \beta_0 + \beta_1 S_i + \beta_2 S_i^2 + u_i$$

so that

$$u_i^* = \beta_2 S_i^2 + u_i$$

A plot of the residuals against S would produce a 'detectable' pattern (i.e., curved downward).
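The residual pattern described for the omitted squared term can also be checked numerically, without drawing the plot. The following is a minimal sketch on simulated data: the concave 'true' wage equation and all of its coefficients are assumptions made for illustration. It fits the linear-in-S model, sorts the residuals by S, and averages them within the lowest, middle and highest thirds of the schooling distribution.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 1000
S = rng.uniform(6.0, 20.0, n)
u = rng.normal(0.0, 0.1, n)
# Hypothetical true model with a negative coefficient on S squared
lnW = 0.5 + 0.20 * S - 0.005 * S**2 + u

# Misspecified regression: S squared is left out
X = np.column_stack([np.ones(n), S])
coef, *_ = np.linalg.lstsq(X, lnW, rcond=None)
resid = lnW - X @ coef

# Average residual in the low, middle and high thirds of S
order = np.argsort(S)
low, mid, high = np.array_split(resid[order], 3)
bin_means = [float(low.mean()), float(mid.mean()), float(high.mean())]
print("mean residual by tercile of S:", [round(m, 3) for m in bin_means])
```

With a negative coefficient on the omitted squared term, the residuals should be negative on average at both ends of the S range and positive in the middle, which is the numerical counterpart of a residual plot that curves downward.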

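2. Four criteria

Economic theory: is there any sound theory?
Student's t statistic: is it significant in the correct direction?
Has $\bar{R}^2$ improved?
Do other coefficients change sign when a variable is included?

Include the variable if the answers are positive. Don't necessarily drop insignificant variables. An insignificant finding can be an important result.

Example:

$$\widehat{\text{Coffee}} = 9.1 + 7.8 P_{bc} + 2.4 P_t + 0.0035 Y_d$$

Standard errors: (15.6) (1.2) (0.0010)
t = 0.5, 2.0, 3.5
n = 25, $\bar{R}^2$ = 0.60

where
Coffee = demand for Brazilian coffee in US
$P_{bc}$ = price of Brazilian coffee
$P_t$ = price of tea
$Y_d$ = disposable income in US

What happens if you drop $P_{bc}$?

$$\widehat{\text{Coffee}} = 9.3 + 2.6 P_t + 0.0036 Y_d$$

Standard errors: (1.0) (0.0009)
t = 2.6, 4.0
n = 25, $\bar{R}^2$ = 0.61

What happens if you add another variable, the price of Colombian coffee, $P_{cc}$?

$$\widehat{\text{Coffee}} = 10.0 + 8.0 P_{cc} - 5.6 P_{bc} + 2.6 P_t + 0.0030 Y_d$$

Standard errors: (4.0) (2.0) (1.3) (0.0010)
t = 2.0, -2.8, 2.0, 3.0
n = 25, $\bar{R}^2$ = 0.65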
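Three of the four criteria (the t statistic, the change in adjusted R-squared, and sign changes in the other coefficients) are mechanical and can be computed by fitting the equation with and without the candidate variable. The sketch below does this on synthetic data; the helper name `fit_ols`, the data-generating process and all numbers are hypothetical and are not the coffee data. The first criterion, economic theory, cannot be automated.

```python
import numpy as np

def fit_ols(X, y):
    """OLS coefficients, t-ratios and adjusted R-squared.
    X must include a column of ones as its first column."""
    n, k = X.shape
    xtx_inv = np.linalg.inv(X.T @ X)
    coef = xtx_inv @ X.T @ y
    resid = y - X @ coef
    rss = float(resid @ resid)
    sigma2 = rss / (n - k)                      # unbiased error variance
    se = np.sqrt(sigma2 * np.diag(xtx_inv))     # coefficient standard errors
    tss = float(((y - y.mean()) ** 2).sum())
    adj_r2 = 1.0 - (rss / (n - k)) / (tss / (n - 1))
    return coef, coef / se, adj_r2

rng = np.random.default_rng(2)
n = 200
x1 = rng.normal(size=n)
x2 = 0.5 * x1 + rng.normal(size=n)              # candidate variable
y = 1.0 + 2.0 * x1 + 1.0 * x2 + rng.normal(size=n)

const = np.ones(n)
coef_base, t_base, adj_base = fit_ols(np.column_stack([const, x1]), y)
coef_ext, t_ext, adj_ext = fit_ols(np.column_stack([const, x1, x2]), y)

print("t-ratio on candidate variable:", round(float(t_ext[2]), 2))
print("adjusted R2 without / with:", round(adj_base, 3), round(adj_ext, 3))
print("sign of x1 coefficient unchanged:",
      bool(np.sign(coef_base[1]) == np.sign(coef_ext[1])))
```

In this constructed case the candidate variable truly belongs in the equation, so its t-ratio should be large and adjusted R-squared should rise when it is added.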

3. Three incorrect techniques for choosing variables

(1) Data mining: simultaneously try a whole series of possible regression formulations and then choose the equation that conforms the most to what the researcher wants the results to look like. Doing econometrics = making sausages.

(2) Stepwise regression technique: a systematic way of variable selection based on $\bar{R}^2$. The computer program is given a shopping list of possible independent variables, and then builds the equation in steps. It always adds to the regression model the variable which increases $\bar{R}^2$ the most. Problem: independent variables could be correlated.

(3) Sequential specification search: add and drop variables sequentially (i.e., estimate an undisclosed number of regressions but only present a final choice as if it were the only specification estimated). When you test a model, you have a type I error. If you estimate and test too many models, type I errors will accumulate.

IV. Lagged Independent Variables

Consider the following regressions:

$$Y_t = \beta_0 + \beta_1 X_{1t} + \beta_2 X_{2t} + \varepsilon_t \qquad (1)$$

$$Y_t = \beta_0 + \beta_1 X_{1,t-1} + \beta_2 X_{2t} + \varepsilon_t \qquad (2)$$

where t = 1, 2, ..., n. That is, we have a sample of n time-series observations. Note the change of notation from i to t to emphasize time series data.

In equation (1), the effect of $X_1$ on Y is instantaneous. In equation (2), the effect is felt one period later. As long as X is exogenous (not influenced by Y), the lagged structure of the equation poses no problem. Of course, the interpretation of the slope coefficient is different.
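Estimating an equation with a lagged regressor mostly comes down to aligning the series correctly: each value of Y in period t is paired with the first regressor from period t-1, and the first observation is lost. The sketch below shows one way to do this with array slicing. The data are simulated, the coefficients are hypothetical, and the first regressor is assumed to be exogenous.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 400
X1 = rng.normal(size=n)
X2 = rng.normal(size=n)
eps = rng.normal(scale=0.5, size=n)

# Simulated process: Y_t depends on X1 from the previous period and X2 from
# the current period. Period 0 has no lagged X1, so Y starts in period 1.
y = 1.0 + 0.8 * X1[:-1] + 0.3 * X2[1:] + eps[1:]

# Lagged specification: pair Y_t with X1_{t-1} and X2_t
X_lagged = np.column_stack([np.ones(n - 1), X1[:-1], X2[1:]])
coef_lagged, *_ = np.linalg.lstsq(X_lagged, y, rcond=None)

# Contemporaneous specification: pair Y_t with X1_t and X2_t
X_contemp = np.column_stack([np.ones(n - 1), X1[1:], X2[1:]])
coef_contemp, *_ = np.linalg.lstsq(X_contemp, y, rcond=None)

print("slope on lagged X1:         ", round(float(coef_lagged[1]), 3))
print("slope on contemporaneous X1:", round(float(coef_contemp[1]), 3))
```

Since the simulated effect works with a one-period delay and the regressor has no serial correlation, the lagged specification should recover a slope near the value used to generate the data, while the contemporaneous slope should be close to zero.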

V. Akaike's Information Criterion and Schwarz Criterion

In general, the more variables included in the regression, the smaller the RSS will be. But if a variable only contributes marginally to the reduction of the RSS, it should not be included. AIC and SC (also known as BIC) measure the RSS with a penalty for additional parameters. They are defined in regression models as:

$$\text{AIC} = \ln(\text{RSS}/n) + 2(K/n)$$

$$\text{SC} = \ln(\text{RSS}/n) + \ln(n)(K/n)$$

You may select models that minimize the AIC or SC. These are called model selection criteria. Note that $\bar{R}^2$ is also a model selection criterion: you choose the model to maximize $\bar{R}^2$. Compared with the AIC or SC, $\bar{R}^2$ tends to select a model with irrelevant variables.

VI. Questions for Discussion: Q6.3, Q6.9

VII. Computing Exercise: Q6.5 (Johnson, Ch 6, Q6.5, Johnson Ch 6: AIC
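The AIC and SC formulas in Section V can be evaluated directly from the residual sum of squares of each candidate model. The sketch below is a minimal illustration on simulated data. The function name `aic_sc` is hypothetical, and K is taken to be the number of estimated coefficients including the intercept, which is an assumption because the notes do not define K; changing that convention shifts every model's score by the same constant and so does not change the ranking.

```python
import math
import numpy as np

def aic_sc(rss, n, k):
    """AIC and SC as ln(RSS/n) plus a penalty per estimated parameter.
    k = number of estimated coefficients (assumed to include the intercept)."""
    aic = math.log(rss / n) + 2.0 * k / n
    sc = math.log(rss / n) + math.log(n) * k / n
    return aic, sc

def rss_of(X, y):
    """Residual sum of squares from an OLS fit of y on X."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    return float(resid @ resid)

rng = np.random.default_rng(4)
n = 100
x1 = rng.normal(size=n)
x2 = rng.normal(size=n)                  # irrelevant by construction
y = 1.0 + 2.0 * x1 + rng.normal(size=n)

const = np.ones(n)
X_small = np.column_stack([const, x1])
X_big = np.column_stack([const, x1, x2])

rss_small, rss_big = rss_of(X_small, y), rss_of(X_big, y)
aic_small, sc_small = aic_sc(rss_small, n, X_small.shape[1])
aic_big, sc_big = aic_sc(rss_big, n, X_big.shape[1])

print("RSS small / big:", round(rss_small, 2), round(rss_big, 2))
print("AIC small / big:", round(aic_small, 4), round(aic_big, 4))
print("SC  small / big:", round(sc_small, 4), round(sc_big, 4))
```

Adding a regressor can never raise the RSS, so the comparison turns on whether the fall in ln(RSS/n) outweighs the penalty. For n of 8 or more, ln(n) exceeds 2, so SC charges more per extra parameter than AIC and leans toward the smaller model.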