Teaching Undergraduate Econometrics

Similar documents
Econometrics Spring School 2016 Econometric Modelling. Lecture 6: Model selection theory and evidence Introduction to Monte Carlo Simulation

IIF Economic Forecasting Summer School 2018, Boulder Colorado, USA

Semi-automatic Non-linear Model Selection

ECON 4160, Lecture 11 and 12

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

Christopher Dougherty London School of Economics and Political Science

Econ 423 Lecture Notes: Additional Topics in Time Series 1

Lecture 5: Unit Roots, Cointegration and Error Correction Models The Spurious Regression Problem

Lecture 7: Dynamic panel models 2

7. Integrated Processes

7. Integrated Processes

Prof. Dr. Roland Füss Lecture Series in Applied Econometrics Summer Term Introduction to Time Series Analysis

CHAPTER 21: TIME SERIES ECONOMETRICS: SOME BASIC CONCEPTS

ECON 4160, Spring term Lecture 12

Department of Economics. Discussion Paper Series

Introduction to Econometrics

New York University Department of Economics. Applied Statistics and Econometrics G Spring 2013

Cointegration and Error Correction Exercise Class, Econometrics II

TESTING FOR CO-INTEGRATION

Econometrics 2, Class 1

Introduction to Eco n o m et rics

Non-Stationary Time Series and Unit Root Testing

Module 3. Descriptive Time Series Statistics and Introduction to Time Series Models

Chapter 12: An introduction to Time Series Analysis. Chapter 12: An introduction to Time Series Analysis

Introduction to Economic Time Series

1 Regression with Time Series Variables

Multivariate Time Series: VAR(p) Processes and Models

Reliability of inference (1 of 2 lectures)

1 Estimation of Persistent Dynamic Panel Data. Motivation

The Drachma/Deutschemark Exchange Rate, : A Monetary Analysis

Choice of Spectral Density Estimator in Ng-Perron Test: Comparative Analysis

Non-Stationary Time Series and Unit Root Testing

Statistical model selection with Big Data

TESTING FOR CO-INTEGRATION PcGive and EViews 1

Volume 03, Issue 6. Comparison of Panel Cointegration Tests

Univariate linear models

8. Instrumental variables regression

PhD/MA Econometrics Examination January 2012 PART A

Non-Stationary Time Series, Cointegration, and Spurious Regression

Problems in model averaging with dummy variables

Testing for non-stationarity

DEPARTMENT OF ECONOMICS DISCUSSION PAPER SERIES

Lecture 3 Stationary Processes and the Ergodic LLN (Reference Section 2.2, Hayashi)

Problem set 1 - Solutions

Econometric Methods for Panel Data

SUPPLEMENTARY TABLES FOR: THE LIKELIHOOD RATIO TEST FOR COINTEGRATION RANKS IN THE I(2) MODEL

Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods

Nonsense Regressions due to Neglected Time-varying Means

Regression Models with Data-based Indicator Variables

Dynamic Regression Models (Lect 15)

Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8]

DEPARTMENT OF ECONOMICS DISCUSSION PAPER SERIES

Economics 618B: Time Series Analysis Department of Economics State University of New York at Binghamton

EC408 Topics in Applied Econometrics. B Fingleton, Dept of Economics, Strathclyde University

Sample Exam Questions for Econometrics

Decision 411: Class 7

Topic 4 Unit Roots. Gerald P. Dwyer. February Clemson University

Econometrics. Week 11. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Linear Regression with Time Series Data

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Econometric Modelling of Time Series with Outlying Observations

Economics 536 Lecture 7. Introduction to Specification Testing in Dynamic Econometric Models

Bootstrapping the Grainger Causality Test With Integrated Data

Non-Stationary Time Series and Unit Root Testing

SOME BASICS OF TIME-SERIES ANALYSIS

Cointegration Lecture I: Introduction

Macroeconometric Modelling

by Søren Johansen Department of Economics, University of Copenhagen and CREATES, Aarhus University

Model Selection when there are Multiple Breaks

Time Series Econometrics For the 21st Century

1 Phelix spot and futures returns: descriptive statistics

Stationarity Revisited, With a Twist. David G. Tucek Value Economics, LLC

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Econometrics of Panel Data

Linear Models in Econometrics

9. Linear Regression and Correlation

Limited Dependent Variables and Panel Data

Lecture 6a: Unit Root and ARIMA Models

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

Using EViews Vox Principles of Econometrics, Third Edition

This note introduces some key concepts in time series econometrics. First, we

Explaining Cointegration Analysis: Part II

Bootstrapping Long Memory Tests: Some Monte Carlo Results

Forecasting Bangladesh's Inflation through Econometric Models

ECON 4230 Intermediate Econometric Theory Exam

Linear Regression with Time Series Data

BCT Lecture 3. Lukas Vacha.

OAKLYN PUBLIC SCHOOL MATHEMATICS CURRICULUM MAP EIGHTH GRADE

Bootstrapping Long Memory Tests: Some Monte Carlo Results

A Test of Cointegration Rank Based Title Component Analysis.

Arma-Arch Modeling Of The Returns Of First Bank Of Nigeria

ARDL Cointegration Tests for Beginner

Outline. Nature of the Problem. Nature of the Problem. Basic Econometrics in Transportation. Autocorrelation

This note discusses some central issues in the analysis of non-stationary time

This is a repository copy of The Error Correction Model as a Test for Cointegration.

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Non-Parametric Approach of Heteroskedasticity Robust Estimation of Vector-Autoregressive (VAR) Models

The Properties of Automatic Gets Modelling

Empirical Model Discovery

ECON 4160: Econometrics-Modelling and Systems Estimation Lecture 9: Multiple equation models II

Transcription:

Teaching Undergraduate Econometrics David F. Hendry Department of Economics, Oxford University ESRC Funded under Grant RES-062-23-0061 Teaching Undergraduate Econometrics p.1/38

Introduction Perfect time to review teaching undergraduate econometrics. (1) Massive changes in coverage, approach and methods from maths on blackboards in the 1970s. Huge improvements in: hardware, software, data, graphics and methods. (2) Almost 25 years since first version of PcGive and 21 years since Hendry (1986) computer teaching of econometrics via PcGive. (3) Now use Hendry and Nielsen (2007) as the textbook: based on PcGive (Doornik and Hendry, 2006) within OxMetrics (Doornik, 2006) half taught in IT room with continuous computer access. Teaching Undergraduate Econometrics p.2/38

Econometric Modelling Teaching Undergraduate Econometrics p.3/38

PcGive Doornik and Hendry (2006) discuss computer-based teaching of econometrics at all levels from very elementary, through intermediate to advanced, loosely based on Hendry (1986). Cannot cover all the angles here: will rapidly describe early steps in econometrics, commenting on little tricks that help keep student interest. Then turn to model selection in non-stationary data yes, for undergraduates! PPE does not attract most maths oriented: yet we raised option enrollment from 6 to 48 over 8 years. Seek to excite student interest. Teaching Undergraduate Econometrics p.4/38

Background Assume no elementary statistical theory known. Probability; distributional concepts; location and spread; elementary notions of randomness and distributions of statistics all needed. Need to make econometrics exciting, yet comprehensible. Five central themes: likelihood; relevant empirical applications; mastery of software to implement; emphasize graphics; rigorous evaluation of findings. Introduce derivations as following from need to understand properties: upskill maths to understand empirics, concepts, formulations, and interpretations. Teaching Undergraduate Econometrics p.5/38

Theory first steps How do we advance in 16 weeks from IID binary models to selecting cointegrated equations in the face of structural breaks? First steps introduce elementary statistical theory: binary events in a Bernoulli model with random draws. Explain sample and population distributions; then distribution functions and densities. Leads to inference in the Bernoulli model, discuss expectations and variances, and introduce asymptotic theory and inference. Next introduce continuous variables (wages) and regression via simplest case: y i = β + u i. Builds on estimation of mean, yet leads to logit regression; and on to bivariate regression models. Teaching Undergraduate Econometrics p.6/38

Empirical first steps Simultaneously, teach OxMetrics & conduct empirical work. Large bank of long historical data series: offer students choice of modelling one of U r ; p; w p; gdp. First graph levels: Assume choose U r see figure 8. Essential to explain graphs amazingly poor prior skills. Discuss axes, units, data transforms including role of logs and their properties. Then salient features: major events, cycles, trends, and breaks, leading to concept of non-stationarity. Aim for students who can read published empirical findings, and sensibly conduct and interpret their own empirical research so cannot finesse difficulties such as non-stationarity and model selection. Teaching Undergraduate Econometrics p.7/38

Graph of unemployment rate 0.150 U r,t 0.125 0.100 0.075 0.050 0.025 1880 1900 1920 1940 1960 1980 2000 Teaching Undergraduate Econometrics p.8/38

Empirical second steps Relate to their life prospects; and those of fellow citizens. Everyone must make a new comment however simple about some aspect, every time. Many weeks till axes are mentioned first by anyone! Emphasize manifest non-stationarity. Figure 10 shows possible general comments: and fig. 11 adds specific. Now discuss economic theory and institutions for chosen series. Next plot differences graph discuss: hugely different look. Teaching Undergraduate Econometrics p.9/38

Comments about unemployment rate Units = rate 0.150 upward mean shift 0.125 upward mean shift and higher variance 0.100 Business cycle epoch 0.075 0.050 0.025 downward mean shift and dramatically lower variance 1880 1900 1920 1940 1960 1980 2000 Teaching Undergraduate Econometrics p.10/38

All comments about unemployment rate Units rate 0.150 0.125 upward mean shift leave gold standard upward mean shift and higher variance 0.100 Business cycle epoch US crash leave ERM 0.075 0.050 0.025 Boer war WWI Postwar crash WWII downward mean shift and dramatically lower variance Mrs T Oil crisis Post war reconstruction 1880 1900 1920 1940 1960 1980 2000 Teaching Undergraduate Econometrics p.11/38

Changes in unemployment rate 0.08 U r Constant mean of zero 0.06 Are changes random? 0.04 0.02 0.00 0.02 0.04 High variance Low variance 1880 1900 1920 1940 1960 1980 2000 Teaching Undergraduate Econometrics p.12/38

Theory and evidence Relate distributional assumptions to model formulation: conditioning in a bivariate normal=>linear regression. Leads naturally to interpretation of models; modelling and model design; how to judge a model; and hence concepts of congruence and encompassing. Postulate simple model of golden-growth to explain deviations of unemployment from historical mean. Measure steady-state equilibrium path by: R L,t ln P t ln Y t = ry t. Look at graphs of each and their properties. Teaching Undergraduate Econometrics p.13/38

Graphs for U r,t and ry t 0.2 RLr Dgdp 0.1 0.0 0.1 1880 1900 1920 1940 1960 1980 2000 0.15 Ur ry 0.10 0.05 0.00 1880 1900 1920 1940 1960 1980 2000 Teaching Undergraduate Econometrics p.14/38

Regression concepts Once econometric theory explained, do a regression plot; add projections discuss least squares: see fig. 25. Can also explain slope & intercept graphically. Several key concepts underpin regression: exogeneity; IID errors; normality; functional form; parameter constancy. All are aspects of model (mis)specification. Relate maths derivations to graphs as needed to understand properties but always same principles: data suggests putative DGP model of PDF Likelihood function maximize it get statistic then its distribution apply to data interpret evaluate. Teaching Undergraduate Econometrics p.15/38

Regression for U r,t on ry t 0.15 Ur ry 0.10 0.05 0.00 0.20 0.15 0.10 0.05 0.00 0.05 0.10 0.15 0.20 0.15 Ur ry 0.10 0.05 0.00 0.20 0.15 0.10 0.05 0.00 0.05 0.10 0.15 0.20 Teaching Undergraduate Econometrics p.16/38

The way ahead On theory side, reinterpret estimation of a mean as regression on an intercept. Leads to regression in general; apply to fish market data as if cross-section, then will reinterpret as time series. Next treat price and quantity as a system, leading to identification and simultaneity, using structural breaks as instruments; and on to cointegration; picking up model selection en route. Thus, each topic segues smoothly into the next. Teaching Undergraduate Econometrics p.17/38

Regression as non-parametric Can see relation is OK in tails; erratic in middle. Next write signature and run a regression through it! Explain pixels world coordinates=>data; so can regress. See fig. 19. Helps demystify regression analysis: just line fitting. Could join points: Phillips loops, leading to dynamics. Many routes possible concepts, models, evaluation. Here will do several sequential regressions leads to recursive methods for parameter constancy: fig. 20. Note graphs give opportunity to introduce LaTex: invaluable later as models can be output that way. Teaching Undergraduate Econometrics p.18/38

Regression for a signature 0.15 Ur ry 0.10 0.05 0.00 0.20 0.15 0.10 0.05 0.00 0.05 0.10 0.15 0.20 0.15 Ur ry 0.10 0.05 0.00 0.20 0.15 0.10 0.05 0.00 0.05 0.10 0.15 0.20 Teaching Undergraduate Econometrics p.19/38

Ten regressions for U r 0.150 Ur ry 0.125 0.100 0.075 0.050 0.025 0.000 0.20 0.15 0.10 0.05 0.00 0.05 0.10 0.15 0.20 Teaching Undergraduate Econometrics p.20/38

Distributions Also can plot histogram and interpolated density: fig. 22. Could use to explain non-parametric/kernel approaches. Emphasize the very different features: U r,t like a uniform many values similarly likely. U r,t closer to normal with some outliers. Thus, differencing changes distibutional shape explain as unconditional versus conditional on previous value, so latter is deviation from past. Teaching Undergraduate Econometrics p.21/38

Distributions for unemployment rate 15 Density plots U r 10 5 0.02 0.00 0.02 0.04 0.06 0.08 0.10 0.12 0.14 0.16 0.18 40 U r 30 20 10 0.06 0.04 0.02 0.00 0.02 0.04 0.06 0.08 0.10 Teaching Undergraduate Econometrics p.22/38

Time series and randomness Additional key concept of randomness can explain here. Regression subsumes correlation, so can explain correlograms as correlations between successively longer lagged values. See fig. 24. U r,t has many high correlations; almost a trend; U r,t has almost no autocorrelation: changes in U r,t are surprise like: again unconditional versus conditional. Exploit U r ; U r comparison: see fig. 25. Explain congruence as needing all properties of variables matching in a model. Now ready for formal estimation of the graph line. Teaching Undergraduate Econometrics p.23/38

Correlograms for unemployment rate 1.0 ACF U r 0.5 0.0 0.5 1.0 0 5 10 15 20 1.0 0.5 ACF U r 0.0 0.5 1.0 0 5 10 15 20 Teaching Undergraduate Econometrics p.24/38

Unemployment regression on lag 0.15 U r,t U r,t 1 0.10 0.05 0.10 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1 0.11 0.12 0.13 0.14 0.15 U r,t U r,t 1 0.05 0.00 0.05 0.05 0.04 0.03 0.02 0.01 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 Teaching Undergraduate Econometrics p.25/38

On to model estimation Long-run (later cointegrated) relation: U r,t = β 0 + β 1 ry t. Û r,t = 0.0501 (0.0028) + 0.345 (0.052) σ = 0.0315 R 2 = 0.26 F GUM (1, 126) = 44.64 ry t (1) Unemployment rises/falls as real long-run interest rate is above/below real growth rate (i.e., ry t 0). Explain each measure and its invariance; stress assumptions easily tested: F ar (2, 124) = 180.4 F arch = 229.9 χ 2 nd (2) = 15.02 F het = (2, 123) = 2.62 F reset = (1, 125) = 0.33. Explain each, and stress must do for every formulation. Û r,t graphical output visually confirms the tests: thus, successfully applied key concepts to residuals. Teaching Undergraduate Econometrics p.26/38

U r on ry graphical output 0.15 Ur Fitted 3 r:ur (scaled) 2 0.10 1 0.05 0 0.00 1 0.5 1900 1950 2000 Density r:ur N(0,1) 1.0 1900 1950 2000 ACF r:ur 0.4 0.5 0.3 0.0 0.2 0.1 0.5 3 2 1 0 1 2 3 4 0 5 10 Teaching Undergraduate Econometrics p.27/38

Simple dynamic models Explain multiple testing concepts now: each test derived under separate null; any other rejections contradict assumptions of derivations. Clear that model is badly mis-specified: not clear which assumption(s) is invalid, hence no obvious solution leading to general-to-simple. Other simple model of U r,t on U r,t 1 ; U r,t = γ 0 + γ 1 U r,t 1 + ɛ t follows from graph 25 and can also relate to U r. Û r,t = 0.887 U r,t 1 + 0.006 (0.040) (0.003) σ = 0.017 R 2 = 0.79 F GUM 1, 126) = 485.7 F ar (2, 124) = 3.8 F arch (2, 124) = 0.55 χ 2 nd (2) = 33.0 F het = (2, 123) = 0.42 F reset = (1, 125) = 0.01. Improved, but still mis-specified obvious outlier in 1920. Teaching Undergraduate Econometrics p.28/38

U r,t on U r,t 1 graphical output 0.15 Ur Fitted 5.0 r:ur (scaled) 0.10 2.5 0.05 0.0 0.6 1900 1950 2000 Density r:ur N(0,1) 2.5 1.0 0.5 1900 1950 2000 ACF r:ur 0.4 0.0 0.2 0.5 4 2 0 2 4 6 0 5 10 Teaching Undergraduate Econometrics p.29/38

More general models Discuss lags; measures of lag responses; etc. Long-run solution is 5.3% unemployment; cannot reject unit root, so explain rudiments of stochastic trends... Now move to a dynamic model with regressors: U r,t = β 0 + β 1 ry t + β 2 U r,t 1 + β 3 ry t 1. Û r,t = 0.86 U r,t 1 + 0.007 + 0.24 (0.04) (0.002) (0.03) σ = 0.013 R 2 = 0.88 F GUM (3, 123) = 308.2 ry t 0.10 (0.03) F ar (2, 121) = 2.5 F arch (1, 121) = 3.1 χ 2 nd (2) = 7.2 ry t 1 F het = (6, 116) = 4.2 F reset = (1, 122) = 4.2. Not fully congruent, but much better progressive research. Can reject a unit root, so cointegrated : long-run is U r = 0.052 + 1.02ry. Contrast to coefficient of 0.35 in (1) former down biased. (2) Teaching Undergraduate Econometrics p.30/38

U r,t model graphical output 0.15 Ur Fitted r:ur (scaled) 0.10 2 0.05 0 0.00 2 0.6 1900 1950 2000 Density r:ur N(0,1) 1.0 1900 1950 2000 ACF r:ur 0.5 0.4 0.0 0.2 0.5 3 2 1 0 1 2 3 4 0 5 10 Teaching Undergraduate Econometrics p.31/38

Extensions Now confront constancy by formal recursive methods, building on earlier graphs. Figure 33 records: tests do not reject, despite wandering estimates much to discuss as desired. Few outliers but some negative fitted values (try logit?). May also alleviate heteroscedasticity... Challenge students to formulate alternative explanations: test their proposals against the evidence and (2)! Teaching Undergraduate Econometrics p.32/38

U r,t model recursive output 1.00 0.75 0.50 Ur_1 +/ 2SE 0.02 Constant +/ 2SE 0.00 0.75 0.50 0.25 0.025 1900 1920 1940 1960 1980 2000 ry +/ 2SE 1900 1920 1940 1960 1980 2000 Res1Step 0.4 0.2 0.0 0.2 1.0 1900 1920 1940 1960 1980 2000 ry_1 +/ 2SE 1900 1920 1940 1960 1980 2000 1up CHOWs 1% 0.000 0.5 0.025 1.0 1900 1920 1940 1960 1980 2000 Ndn CHOWs 1% 1900 1920 1940 1960 1980 2000 0.5 1900 1920 1940 1960 1980 2000 Teaching Undergraduate Econometrics p.33/38

Model selection Seen dangers of simple approaches: so general-to-specific needs explained. In a general model, cannot know in advance which variables will matter: some will, but some will not. Must confront for competent practitioners: any test+decision entails selection, so ubiquitous. Sketch theory of model selection in simplest case: 2 decisions (keep/delete) and 2 states (relevant/irrelevant). Translate retain irrelevant variables & exclude relevant as akin to probabilities of type I & II errors. Teaching Undergraduate Econometrics p.34/38

Understanding model selection Consider a perfectly orthogonal regression model: y t = N i=1 β iz i,t + ɛ t (3) where E[z i,t z j,t ] = δ i,j λ and T >> N. Order the N sample t 2 -statistics testing H 0 : β j = 0: t 2 (N) t2 (N+1) t2 (1). Cut-off n between included and excluded variables is: t 2 (n) c α > t 2 (n+1). All larger values retained: all others eliminated. Only one decision needed even for N = 1000: repeated testing does not occur. Path search gives impression of repeated testing. Confused with selecting from 2 1000 = 10 301 possible models. Maintain false null retention at one variable by c α = 1/N. Teaching Undergraduate Econometrics p.35/38

Monte Carlo of model selection How to persuade? Everyone generates a PcNaive sample from same DGP which they design as a group, then all apply Autometrics to their sample. Pool class results and relate to delete/keep calculations. Repeat at a looser/tighter selection criterion. Explain key role of marginal decisions: empirical t close to critical value is danger zone. Now they can handle complicated models by automatic modelling; huge improvement in quality of empirical work and in interpretation of what they find. Teaching Undergraduate Econometrics p.36/38

Conclusions Computer-based teaching of econometrics enhances all levels from very elementary, through intermediate to advanced. Key texts are: Hendry, D.F. and Nielsen, B. (2007), Econometric Modeling: A Likelihood Approach, Princeton University Press; and Doornik, J.A. and Hendry, D.F. (2007), Empirical Econometric Modelling using PcGive, Timberlake, which is about to appear. Undergraduates can progress from binary events in a Bernoulli model with random draws to model selection in non-stationary data in a year-long course. And learn to build empirical models of non-stationary data by automatic modelling. Teaching Undergraduate Econometrics p.37/38

Bibliography References Doornik, J. A. (2006). An Introduction to OxMetrics. London: Timberlake Consultants Press. Doornik, J. A. (2007). Autometrics. Working paper, Economics Department, University of Oxford. Doornik, J. A., and Hendry, D. F. (2006). Empirical Econometric Modelling using PcGive: Volume I. London: Timberlake Consultants Press. Hendry, D. F. (1986). Using PC-GIVE in econometrics teaching. Oxford Bulletin of Economics and Statistics, 48, 87 98. Hendry, D. F., Johansen, S., and Santos, C. (2004). Selecting a regression saturated by indicators. Unpublished paper, Economics Department, University of Oxford. Hendry, D. F., and Nielsen, B. (2007). Econometric Modeling: A Likelihood Approach. Princeton: Princeton University Press. Teaching Undergraduate Econometrics p.38/38