LECTURE 9: GENTLE INTRODUCTION TO

Similar documents
Outline. 11. Time Series Analysis. Basic Regression. Differences between Time Series and Cross Section

7 Introduction to Time Series Time Series vs. Cross-Sectional Data Detrending Time Series... 15

7 Introduction to Time Series

Multiple Regression. Midterm results: AVG = 26.5 (88%) A = 27+ B = C =

Introduction to Econometrics

LECTURE 11. Introduction to Econometrics. Autocorrelation

Basic Regression Analysis with Time Series Data

Economics Introduction to Econometrics - Fall 2007 Final Exam - Answers

LECTURE 10: MORE ON RANDOM PROCESSES

The multiple regression model; Indicator variables as regressors

Applied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid

10. Time series regression and forecasting

Making sense of Econometrics: Basics

Wooldridge, Introductory Econometrics, 2d ed. Chapter 10: Basic regression analysis with time series data We now turn to the analysis of time series

Exercises (in progress) Applied Econometrics Part 1

7. Integrated Processes

Time Series. Chapter Time Series Data

7. Integrated Processes

Covers Chapter 10-12, some of 16, some of 18 in Wooldridge. Regression Analysis with Time Series Data

LECTURE 5. Introduction to Econometrics. Hypothesis testing

Exercise sheet 6 Models with endogenous explanatory variables

EC408 Topics in Applied Econometrics. B Fingleton, Dept of Economics, Strathclyde University

Lecture 8a: Spurious Regression

Solutions to Problem Set 6 (Due December 8) Maximum number of points for Problem set 8 is: 220. Problem 10.6

ECON3150/4150 Spring 2016

Lecture 8a: Spurious Regression

Statistics and Quantitative Analysis U4320. Segment 10 Prof. Sharyn O Halloran

Autoregressive models with distributed lags (ADL)

Testing methodology. It often the case that we try to determine the form of the model on the basis of data

Decision 411: Class 7

9) Time series econometrics

Econ 423 Lecture Notes: Additional Topics in Time Series 1

LECTURE 2: SIMPLE REGRESSION I

Lectures 5 & 6: Hypothesis Testing

UNIVERSIDAD CARLOS III DE MADRID ECONOMETRICS FINAL EXAM (Type B) 2. This document is self contained. Your are not allowed to use any other material.

Exercise Sheet 5: Solutions

Announcements. J. Parman (UC-Davis) Analysis of Economic Data, Winter 2011 February 8, / 45

ECON3150/4150 Spring 2015

Topic 4 Unit Roots. Gerald P. Dwyer. February Clemson University

E 4101/5101 Lecture 9: Non-stationarity

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

Inference with Simple Regression

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

13. Time Series Analysis: Asymptotics Weakly Dependent and Random Walk Process. Strict Exogeneity

In order to carry out a study on employees wages, a company collects information from its 500 employees 1 as follows:

Question 1 [17 points]: (ch 11)

TESTING FOR CO-INTEGRATION

L6: Regression II. JJ Chen. July 2, 2015

Decision 411: Class 3

Oil price and macroeconomy in Russia. Abstract

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler

THE SEASONAL UNIT ROOTS IN DEMOGRAPHIC TIME SERIES ANDTHE POSSIBILITIES OF ITS EXACT TESTING

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

FinQuiz Notes

ARDL Cointegration Tests for Beginner

In this chapter, we begin to study the properties of OLS for estimating linear regression

Time Series Analysis -- An Introduction -- AMS 586

Econ 444, class 11. Robert de Jong 1. Monday November 6. Ohio State University. Econ 444, Wednesday November 1, class Department of Economics

REED TUTORIALS (Pty) LTD ECS3706 EXAM PACK

Applied Econometrics. Professor Bernard Fingleton

EMERGING MARKETS - Lecture 2: Methodology refresher

Time Series Methods. Sanjaya Desilva

Estimates of the Sticky-Information Phillips Curve for the USA with the General to Specific Method

UNIVERSIDAD CARLOS III DE MADRID ECONOMETRICS Academic year 2009/10 FINAL EXAM (2nd Call) June, 25, 2010

Will it float? The New Keynesian Phillips curve tested on OECD panel data

Eco and Bus Forecasting Fall 2016 EXERCISE 2

Decision 411: Class 3

ECON 497 Midterm Spring

Econ 423 Lecture Notes

Time-Series Analysis. Dr. Seetha Bandara Dept. of Economics MA_ECON

in the time series. The relation between y and x is contemporaneous.

Lecture 6a: Unit Root and ARIMA Models

Autocorrelation. Think of autocorrelation as signifying a systematic relationship between the residuals measured at different points in time

Problem Set #6: OLS. Economics 835: Econometrics. Fall 2012

CHAPTER 21: TIME SERIES ECONOMETRICS: SOME BASIC CONCEPTS

CHAPTER 6: SPECIFICATION VARIABLES

Introduction to Econometrics. Review of Probability & Statistics

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 17th Class 7/1/10

Exercise sheet 3 The Multiple Regression Model

Univariate linear models


Ordinary Least Squares Regression Explained: Vartanian

The Multiple Regression Model Estimation

E 4160 Autumn term Lecture 9: Deterministic trends vs integrated series; Spurious regression; Dickey-Fuller distribution and test

Lecture on State Dependent Government Spending Multipliers

Economtrics of money and finance Lecture six: spurious regression and cointegration

Chapter 7. Testing Linear Restrictions on Regression Coefficients

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid

Multiple Regression Analysis

Inference. ME104: Linear Regression Analysis Kenneth Benoit. August 15, August 15, 2012 Lecture 3 Multiple linear regression 1 1 / 58

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.

Ref.: Spring SOS3003 Applied data analysis for social science Lecture note

Multiple Regression Analysis

Chapter 7: Correlation and regression

Applied Econometrics. From Analysis of Economic Data, Gary Koop

Are Forecast Updates Progressive?

Describing Associations, Covariance, Correlation, and Causality. Interest Rates and Inflation. Data & Scatter Diagram. Lecture 4

Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity

Practice exam questions

10) Time series econometrics

Transcription:

LECTURE 9: GENTLE INTRODUCTION TO REGRESSION WITH TIME SERIES

From random variables to random processes (cont d) 2 in cross-sectional regression, we were making inferences about the whole population based on a small sample a crucial assumption: random sampling the bridge between population characteristics (distribution of wages in a country) and the probabilistic machinery of random variables (distribution of a wage of a randomly drawn person) unfortunately, with time series, random sampling makes no sense: year GDP inflation unemp 2004 1,957.6 2.6 5.4 2005 2,035.4 2.8 4.5 what would the underlying population be?

From random variables to random processes (cont d) 3 random sampling makes the characteristics of different individuals independent it is difficult to imagine that GDP in 2004 is independent of that in 2005 therefore, we will have to switch to a more advanced theoretical vehicle: random processes those who had taken courses in random processes would tell you it was difficult we will omit many mathematical details, and focus on the intuition

New issues with time series 4 good news: everything we learnt with cross-sectional data will be used in time-series analysis, too bad news: many new pitfalls that can spoil the analysis 1. trends and seasonality: can result in spurious regression (see next slide) 2. lags in economic behaviour: government s expenditure cuts will slowly percolate through the economy lagged effect (the effect of today s cuts will spread over quarters or even years) 3. persistence in time series: governments expenditure itself cannot change too dramatically from one year to another most real-life time series persistent, but the degree differs strong persistence of time series can again produce spurious regression (stationarity, unit-root issues) weak persistence problematic only if applies to u (serial correlation, or autocorrelation of u)

Spurious regression problem 5 both y and x both exhibit a monotonous trend, we will find a relationship even though they have nothing in common Example: Norwegian salmon production vs GDP in the U.S. the data in salmon.gdt contain two annual time series (1983 2011) annual salmon production in Norway GDP in the U.S. (bln. of 2005 dollars) do you think that there is a strong causal relationship? estimated equation in Gretl: ^gdp = 1.34e+04-0.00551*salmon (713) (0.00103) T = 29, R-squared = 0.514 (standard errors in parentheses) Quizz: is salmon significant at the 5 % level? And how about 1 %?

Let s look at the time series first: 6

Fitted values are calculated as: 13,414 0.00551 7

Accounting for trends in the regressions 8 Option 1 detrend all time series, i.e. create new variables where the linear trends have been subtracted two steps involved: (1) regress x t on t (time, values 1,2,,n) (2) save residuals (this is the detrended x t ) salmon series and its linear trend detrended salmon series

Accounting for trends in the regressions (cont d) 9 Option 2: add variable t (time) to the estimated equation Option 1: results ^gdp_detrended = 1.46e-013 + 0.000701*salmon_detrended (52.7) (0.000269) T = 29, R-squared = 0.201 (standard errors in parentheses) Option 2: results ^gdp = 5.14e+03 + 0.000701*salmon + 295*time (304) (0.000274) (9.90) T = 29, R-squared = 0.986 (standard errors in parentheses) salmon coefficient identical, std. errors nearly identical can use both R-squareds different, but most variation explained by time in Option 2 use detrended dependent variable for the R-squared!

10 Frisch-Waugh-Lovell theorem ===================================================================== Dependent variable: ---------------------------------------------------- gdp gdp_detrended gdp (1) (2) (3) (4) --------------------------------------------------------------------- year 294.72010*** 19.29931* (9.90449) (9.90449) salmon 0.00070** 0.00070** (0.00027) (0.00027) salmon_detrended 0.00070** 0.00070 (0.00026) (0.00990) Constant -578,999.30000*** -38,976.06000* (19,909.28000) (19,909.28000) --------------------------------------------------------------------- Observations 29 29 29 29 R2 0.98613 0.20106 0.20106 0.00018 =====================================================================

Using dummies to account for specific events 11 Example: fertility equation frequency: annual data, 1913 1984 gfr = the number of births per 1000 women aged 15 44 ww2 = 1 for years 1941 1945, = 0 otherwise expected fertility β 0 β 1 gfr ww u t 0 1 2t t WW2 period time

Using dummies to account for specific events (cont d) 14 Example: fertility equation 2 pill = 0 before 1963, = 1 afterwards expected fertility β 0 β 1 gfr pill u t 0 1 t t contraceptive pill introduced time

16 gfr β β ww2 β pill u t 0 1 t 2 t t

Seasonality 17 seasonal patterns are noticeable with quarterly, monthly, or daily data note that many time series in the online databases are seasonally adjusted, meaning that specialized algorithms have been used to even out the differences between seasons these series can be used without further ado when using a seasonally unadjusted series, we can still use a simple fix that accounts for the seasonal variation: periodic dummies, i.e. dummy variables that identify individual periods Example: durable goods open durgoods.gdt in Gretl change dataset structure to a quarterly time series (Data Dataset structure) add periodic dummies (Add Periodic dummies) this creates variables dq1,, dq4 (dq1 stands for dummy for quarter 1 )

18

values of the periodic dummies 19

Seasonality (cont d) 20 to describe the seasonal pattern in dishwasher sales, run the regression dish dq1 dq2 dq3 u t 0 1 t 2 t 3 t t the dishwasher time series and the fitted values are shown below, F-test for joint significance: p-value = 0.89 no statistical evidence of seasonality

Seasonality (cont d) 21 with refrigerator series, that s a different story: joint significance: p-value = 0.000 079, strong evidence of seasonality (i.e. we reject the null of no seasonal pattern)

Seasonality (cont d) 22 interpretation: just as with other category dummies we omitted dq4 quarter 4 is the base period e.g., the coefficient on dq1 tells us that in quarter 1, sales are higher by 62,125 than in quarter 4 (on average) Model 2: OLS, using observations 1978:1-1985:4 (T = 32) Dependent variable: frig coefficient std. error t-ratio p-value --------------------------------------------------------- const 1160.00 59.9904 19.34 9.81e-018 *** dq1 62.1250 84.8393 0.7323 0.4701 dq2 307.500 84.8393 3.625 0.0011 *** dq3 409.750 84.8393 4.830 4.42e-05 *** Mean dependent var 1354.844 S.D. dependent var 235.6719 Sum squared resid 806142.4 S.E. of regression 169.6785 R-squared 0.531797 Adjusted R-squared 0.481632 F(3, 28) 10.60102 P-value(F) 0.000079...

Seasonality (cont d) 23 conclusion: with seasonally unadjusted data, it makes sense to add both a time trend and periodic dummies in addition to your independent variables of interest note that this can be done also in case the dependent variable is logged, only the interpretation changes: ^l_frig = 7.05 + 0.0530*dq1 + 0.234*dq2 + 0.304*dq3 (0.0458)(0.0647) (0.0647) (0.0647) T = 32, R-squared = 0.517 (standard errors in parentheses) results imply that in quarter 1, sales increase by 5.3 % compared with the baseline level of quarter 4

^l_frig = 7.30 + 0.183*dq2 + 0.252*dq3-0.0555*dq4-0.0365*time + 0.00113*sq_time (0.0590)(0.0470) (0.0471) (0.0473) (0.00742) (0.000218) T = 32, R-squared = 0.764 Null hypothesis: the regression parameters are zero for the variables dq2, dq3, dq4 Test statistic: F(3, 26) = 19.323, p-value 8.49373e-007

Finite distributed lag (FDL) model 25 Example: fertility equation 3 pe = real dollar value of personal tax exemption gfr pe pe pe pe u t 0 0 t 1 t1 2 t2 3 t3 t here, δ 0 is the impact propensity (= immediate effect) of a unit increase in pe the δ parameters capture the effect of a temporary increase in pe: assume that pe equals c except for period 0, where it increases to c + 1: pe t c 1 2 1 0 1 2 3 4 time

pe t 1 c pe t 1 1 c pe t 2 pe t 3 c c 1 1 a temporary change in pe occurs in subsequent periods in lagged versions of pe gfr t d δ 0 δ 1 δ 2 δ 3 the effect of a temporary change in pe, or the lag distribution 2 1 0 1 2 3 4 time 27

Finite distributed lag (FDL) model (cont d) 29 long-run propensity (LRP): the effect of a permanent unit increase in pe LRP 0 1 2 3 pe t 1 c gfr t d δ 0 δ 1 δ 2 δ 3 the effect of a permanent change in pe, or long-run propensity 2 1 0 1 2 3 4 time

gfr β β ww2 β pill β t δ pe δ pe δ pe δ pe u t 0 1 t 2 t 3 0 t 1 t 1 2 t 2 3 t 3 t

Finite distributed lag (FDL) model (cont d) 32 Estimating LRP a natural estimator of LRP is LRP ˆ ˆ ˆ ˆ 0 1 2 3 so we just add up the coefficients on pe and its lags more work is required in case we need std. errors or 95% CI for LRP we ll use a simple trick: the equation can be rewritten as follows gfr pe pe pe pe u t 0 0 t 1 t1 2 t2 3 t3 t LRP pe ( pe pe ) ( pe pe ) ( pe pe ) u 0 t 1 t1 t 2 t2 t 3 t3 t t A B C this gives us the following procedure: 1. Create variables A, B, and C. in Gretl: Add Define new variable A = pe( 1) pe etc. 2. Regress gfr on pe, A, B and C; now, LRP is the coefficient on pe, and we can read off its std. error and calculate the 95% CI if needed.

LECTURE 9: GENTLE INTRODUCTION TO REGRESSION WITH TIME SERIES