(I AL BL 2 )z t = (I CL)ζ t, where

Similar documents
TAKEHOME FINAL EXAM e iω e 2iω e iω e 2iω

Eco517 Fall 2014 C. Sims MIDTERM EXAM

MID-TERM EXAM ANSWERS. p t + δ t = Rp t 1 + η t (1.1)

Eco517 Fall 2014 C. Sims FINAL EXAM

Eco517 Fall 2004 C. Sims MIDTERM EXAM

ECO 513 Fall 2009 C. Sims HIDDEN MARKOV CHAIN MODELS

Short Questions (Do two out of three) 15 points each

ECO 513 Fall 2008 C.Sims KALMAN FILTER. s t = As t 1 + ε t Measurement equation : y t = Hs t + ν t. u t = r t. u 0 0 t 1 + y t = [ H I ] u t.

CONJUGATE DUMMY OBSERVATION PRIORS FOR VAR S

Regression. ECO 312 Fall 2013 Chris Sims. January 12, 2014

MAXIMUM LIKELIHOOD, SET ESTIMATION, MODEL CRITICISM

Choosing among models

It is easily seen that in general a linear combination of y t and x t is I(1). However, in particular cases, it can be I(0), i.e. stationary.

Bayesian linear regression

Switching Regime Estimation

Model comparison. Christopher A. Sims Princeton University October 18, 2016

A Bayesian perspective on GMM and IV

The Metropolis-Hastings Algorithm. June 8, 2012

EXERCISE ON VAR AND VARMA MODELS

Likelihood-free MCMC

Econ 423 Lecture Notes: Additional Topics in Time Series 1

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016

Multivariate GARCH models.

MODEL COMPARISON CHRISTOPHER A. SIMS PRINCETON UNIVERSITY

Econometría 2: Análisis de series de Tiempo

VAR Model. (k-variate) VAR(p) model (in the Reduced Form): Y t-2. Y t-1 = A + B 1. Y t + B 2. Y t-p. + ε t. + + B p. where:

Dynamic Factor Models and Factor Augmented Vector Autoregressions. Lawrence J. Christiano

Statistics & Data Sciences: First Year Prelim Exam May 2018

MCMC 2: Lecture 3 SIR models - more topics. Phil O Neill Theo Kypraios School of Mathematical Sciences University of Nottingham

FINANCIAL ECONOMETRICS AND EMPIRICAL FINANCE -MODULE2 Midterm Exam Solutions - March 2015

Master s Written Examination

Testing Restrictions and Comparing Models

Bayesian inference. Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark. April 10, 2017

Linear models. Linear models are computationally convenient and remain widely used in. applied econometric research

Problem Set 6 Solution

Labor-Supply Shifts and Economic Fluctuations. Technical Appendix

Lecture 7a: Vector Autoregression (VAR)

Bayesian Semiparametric GARCH Models

Bayesian Semiparametric GARCH Models

COMMENT ON DEL NEGRO, SCHORFHEIDE, SMETS AND WOUTERS COMMENT ON DEL NEGRO, SCHORFHEIDE, SMETS AND WOUTERS

7. MULTIVARATE STATIONARY PROCESSES

Bayesian Linear Models

STA 294: Stochastic Processes & Bayesian Nonparametrics

STA 2101/442 Assignment 3 1

Bayesian Econometrics

Hypothesis Testing. Econ 690. Purdue University. Justin L. Tobias (Purdue) Testing 1 / 33

MEI Exam Review. June 7, 2002

Bayesian Linear Models

Eco517 Fall 2013 C. Sims MCMC. October 8, 2013

Questions and Answers on Unit Roots, Cointegration, VARs and VECMs

Lecture 7a: Vector Autoregression (VAR)

Inference when identifying assumptions are doubted. A. Theory B. Applications

Inference when identifying assumptions are doubted. A. Theory. Structural model of interest: B 1 y t1. u t. B m y tm. u t i.i.d.

Gibbs Sampling in Endogenous Variables Models

... Econometric Methods for the Analysis of Dynamic General Equilibrium Models

E 4160 Autumn term Lecture 9: Deterministic trends vs integrated series; Spurious regression; Dickey-Fuller distribution and test

TESTING FOR CO-INTEGRATION

Fixed Effects, Invariance, and Spatial Variation in Intergenerational Mobility

STAT Financial Time Series

The linear model is the most fundamental of all serious statistical models encompassing:

GARCH Models. Eduardo Rossi University of Pavia. December Rossi GARCH Financial Econometrics / 50

TEORIA BAYESIANA Ralph S. Silva

STA 4322 Exam I Name: Introduction to Statistics Theory

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Bayesian Inference for DSGE Models. Lawrence J. Christiano

1 Teaching notes on structural VARs.

MCMC algorithms for fitting Bayesian models

Final Exam. Economics 835: Econometrics. Fall 2010

First Year Examination Department of Statistics, University of Florida

Gaussian kernel GARCH models

Structural VARs II. February 17, 2016

MAKING MACRO MODELS BEHAVE REASONABLY

Vector Auto-Regressive Models

Online appendix to On the stability of the excess sensitivity of aggregate consumption growth in the US

BAYESIAN ECONOMETRICS

Computational statistics

1 Hypothesis Testing and Model Selection

Economics 536 Lecture 7. Introduction to Specification Testing in Dynamic Econometric Models

VAR Models and Applications

ESTIMATION of a DSGE MODEL

ECON 4160, Spring term Lecture 12

ECON 4160, Autumn term 2017 Lecture 9

Dynamic Factor Models, Factor Augmented VARs, and SVARs in Macroeconomics. -- Part 2: SVARs and SDFMs --

Bayesian Inference for DSGE Models. Lawrence J. Christiano

Econ 583 Final Exam Fall 2008

Regime switching models

Volatility. Gerald P. Dwyer. February Clemson University

1 Using standard errors when comparing estimated values

Dynamic Regression Models (Lect 15)

Advanced Econometrics

Vector Autoregression

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -33 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

MA Advanced Econometrics: Spurious Regressions and Cointegration

STA 302f16 Assignment Five 1

1 Linear Difference Equations

Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8]

An Introduction to Bayesian Linear Regression

Bayesian Estimation of DSGE Models 1 Chapter 3: A Crash Course in Bayesian Inference

27 : Distributed Monte Carlo Markov Chain. 1 Recap of MCMC and Naive Parallel Gibbs Sampling

Transcription:

ECO 513 Fall 2011 MIDTERM EXAM The exam lasts 90 minutes. Answer all three questions. (1 Consider this model: x t = 1.2x t 1.62x t 2 +.2y t 1.23y t 2 + ε t.7ε t 1.9ν t 1 (1 [ εt y t = 1.4y t 1.62y t 2 +.1x t 1.08x t 2 + ν t.6ν t 1 +.2ε t 1 (2 ] {x s, y s, s < t} N(0, I. (3 ν t (a Show that the model above is exactly equivalent in its implications for the distribution of the x, y stochastic process to the one below: x t =.5x t 1.7y t 1 + ε t (4 y t =.8y t 1 +.3x t 1 + ν t, (5 where the assumptions on ε t and ν t are as before. In matrix notation, the first version of the model is A = (I AL BL 2 z t = (I CLζ t, where [ ] [ ] xt εt z t =, ζ y t =, t ν t [ ] [ ] [ ] 1.2.2.62.23.7.9, B =, C =..1 1.4.08.62.2.6 The second, smaller version of the system is in matrix notation (I DLz t = ζ t, where [ ].5.7 D =..3.8 The way two systems like this can be equivalent is through root cancellation, that is through the polynomials on either side of the larger system containing a common factor. That is, we should be able to multiply the smaller system by some first-order matrix polynomial in the lag operator that would give us the larger system. But there is Date: January 12, 2011. c 2011 by Christopher A. Sims. This document is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. http://creativecommons.org/licenses/by-nc-sa/3.0/.

2 MIDTERM EXAM only one first-order polynomial that will convert ζ t in the smaller system to (I CLζ t in the larger system, namely I CL. So we need to check whether (I CL(I DL = I AL BL 2. The rest is just arithmetic. Another way to approach this problem is to observe that the smaller pair of equations can be thought of as expressing ε t and ν t as functions of current and lagged x t and y t. One can then plug in these expressions in place of ε t 1 and ν t 1 in the larger system, to obtain a new system which exactly matches the smaller system. This approach leads you through the same arithmetic as you would carry out in the matrix polynomial approach. (b Is this model consistent with x t, y t being jointly stationary? Explain your answer. We can work with the first-order version of the model. A first order VAR y t = Ay t 1 + ζ t is consistent with stationarity if and only if all the eigenvalues of A are less than one in absolute value. The characteristic equation of A here is (.5 λ(.8 λ +.21, which has roots 1.3 ± 1.69 2.44, 2 i.e. two complex roots whose squared absolute value is 2.44/4 =.61 < 1, so the model is consistent with stationarity. (c Suppose x 3 = 2, x 2 = 1, y 3 = 2, y 2 = 1. What is the conditional expectation of x 5 based on all the data for y and x dated 3 and earlier? [ ] x5 E 3 y 5 so the answer is E 3 [x 5 ] = 1.9. [ ] [ ] = D 2 x3 1.9 =, y 3.08 (2 Suppose you have data on y t, t = 1,..., T and you wish to consider two possible models for its behavior: y t {y t s, s 1} N(γ + ρy t 1, σ 2 (6 log y t {y t s, s 1} N(δ + θ log y t 1, ν 2. (7 Assuming your priors on ρ and θ are the same, make the two parameters a priori independent, and are both flat relative to the likelihood, and that the same holds true for the priors on σ 2 and ν 2 and γ and δ.

( exp ( exp MIDTERM EXAM 3 How would you go about constructing a posterior odds ratio for the two models based on the observed data? Be as explicit as you would have to be in instructing an undergraduate RA who knows matlab but does not know statistics. We would like to integrate the posterior for each model to obtain a posterior odds ratio. The first complication to account for is that the second equation defines a conditional distribution for log y t, not y t, so to make the models apply to the same observed data, we need to recognize the Jacobian from the transformation from log y to y. Second, the problem statement does not tell us how to handle initial conditions. So as in a problem with real data, one has to make decisions about whether to assume stationarity and use the model s implied unconditional distribution for the initial conditions. One reasonable answer would be to use the stationarity assumption, in which case the likelihoods for the two models are (y 1 γ/(1 ρ 2 (1 ρ 2 2σ 2 (log y 1 δ/(1 θ 2 (1 θ 2 2ν 2 (2πσ 2 T/2 (2πν 2 T/2 T exp t=2 T exp t=2 ( (y t ρy t 1 γ 2 2σ 2 and ( (log y t θy t 1 δ 2 In general,model comparisons via marginal likelihood require proper priors, However, in a case like this one, where the parameters in the two models can reasonably be taken to have the same priors, assuming they are both flat relative to the likelihood is OK. The level of the constant pdf one uses to represent a flat prior is arbitrary, yet affects the level of the integrated posterior. But when the parameters in the two models have similar interpretations, assuming that their flat priors have the same height (and thus cancel out is reasonable. Following that course, we have to integrate the two expressions above over ρ, γ, σ 2 and θ, δ, ν 2, respectively. If it were not for the initial conditions component of the likelihood, this could be done analytically, but with that component, some kind of MCMC sampling of the posterior is needed. The brute force approach would be just to start up a Markov chain at the OLS estimates and their associated variance estimates, using random walk Metropolis with normally distributed steps. One could also use the normal-inverse-gamma distribution implied by the likelihood without initial conditions as a proposal distribution in an independence Metropolis-Hastings sampling scheme. Note that the denominator y t s 2ν 2 /y t.

4 MIDTERM EXAM have no effect on the shape of the posterior as a function of the parameters, so the likelihood conditional on initial conditions for that model is still normal-inverse-gamma. The Jacobian terms weights do, though, affect the level of the integrated posterior, and hence are important for the model comparison. I won t go into more detail here about how to do the MCMC, since there are multiple possible approaches, but an A+ answer would have given the kind of detail necessary to instruct the hypothetical undergraduate RA in the question statement. Of course to use the MCMC sample to get an estimate of the integrated posterior, one has to take an average of an appropriate function of the sampled likelihood. Most commonly, people would use a modified harmonic mean approach, i.e. take averages of g(θ j /l(θ j, where g( is a probability density, l( is the unnormalized posterior density, and θ j is an MCMC draw of the parameters. These averages will converge to the inverse of the integrated posterior. Usually people use as g a normal density corresponding to the second order expansion of the log likelihood about its maximum, truncated at some χ 2 value (e.g. 2n, where n is the dimension of θ and rescaled to integrate to one with the truncation. If you answered by treating the initial conditions as uninformative about the parameters, you should have seen that the integrated posteriors can be calculated analytically and explained how to do that integration, rather than suggesting MCMC. Because the likelihood is normal-inverse-gamma, the integral involves integrating the normal in γ, ρ or in θ, δ, followed by integration of the resulting gamma marginal distribution for the variance parameter. Since the likelihood is only proportional to these distributions, you have to keep track of the constants of integration. For the first stage, integrating over ρ and γ, e.g., we need to recognize that the likelihood can be rewritten as (2πσ 2 (T 1/2 exp ( û2 t (β ˆβ X X(β ˆβ 2σ 2, where ˆβ is the OLS estimator of β, which is the vector ρ, γ. X is the (T 1 2 matrix consisting of y 1 to y T 1 in the first column and ones in the second. Then integrating out the normal density in β leaves us with (2πσ 2 (T 3/2 X X ( 1 2 exp û2 t 2σ 2.

MIDTERM EXAM 5 If we treat the prior as flat in 1/σ 2, this is proportional to a Gamma((T 1/2, û 2 t /2 distribution, and when we integrate it we get ( û 2 t 2 (T 1/2 2 1 X X 1 2. For the second model, the calculations are the same, with log y t replacing y t in the definitions of û t and X and θ, δ becoming the β vector. And of course for this model there is also the Jacobian term factor 1/ T 1 1 y t. (3 Suppose we consider a linearized DSGE model of Canadian and US data, where for each country the observables are a short interest rate, real GDP, the GDP deflator, and the wage rate. There are a small number of structural parameters for the US, and a similar small number of structural parameters for Canada, and we treat them as independent in our prior. The linearized model implies a restricted VAR form, with coefficients nonlinear functions of the structural parameters, and has US GDP and GDP deflator entering the equations for the Canadian part of the model, but not the US interest rate or the US wage rate. No Canadian variables appear in the US equations. The residuals in all equations are i.i.d. across equations and zero mean and they span the same space as the variable innovations. (a Does the linearized model imply a Granger causal ordering, with the US first in the ordering? Yes. The criterion for a Granger ordering is that the system can be arranged so that the coefficients on lagged variables are block triangular. That is true here, if we treat the US as the Grangercausally prior block. (b How would you test the Granger causal ordering restriction? Can it be done by considering the US equations alone? As we discussed in class, tests for Granger orderings can be carried out by looking at the restricted subsystem (that for the variables first in the ordering alone, under some conditions. The restrictions implied by the ordering are only on the US equations, which is one of the conditions. But if residuals are correlated across equations, an additional condition is that the other equations of the system are unrestricted, so that if we were to take linear combinations of the equations to make the errors in the two blocks uncorrelated, the transformed version of the Canadian block would be neither more nor less restricted than the original Canadian block. Here, the Canadian

6 MIDTERM EXAM block originally did not contain all the US variables, so an orthogonalizing transformation, by introducing all the lagged US variables, would change it. Therefore testing the Granger ordering requires estimating the full system. Of course, one could relax the restrictions on the US variables in the Canadian equation and test the Granger ordering restriction in that looser model. However the test would give less sharp results if done that way. The problem statement included the assertion that residuals in all equations are i.i.d. across equations. which is unclear at best. What I meant was i.i.d. over time and independent across equations. If interpreted that way, it implies that the linearized system has the form A(L; θy t = ε t, with Var(ε t diagonal. If we treat the US as the second of two blocks, the restriction is that the second block can be written as A 22 (Ly 2t = ε 2t, and it might seem that, since the residuals of this block are assumed independent of those in the first, we could carry out the test using the second block alone. But equations are not directly likelihoods. To carry out the test we need to consider what the test implies for the reduced form of the model, after we have multiplied through by A0 1. With the restriction imposed, the y 2 block involves US-equation parameters alone. But in the unrestricted model, if there are any Canadian variables allowed contemporaneously in the US block, there will be parameters from the first block in the reduced form equation for y 2 and vice versa. Thus in general the fact that the residuals in the two blocks are uncorrelated does not justify testing the restriction by looking at residuals from the second block alone. The special case where this would be justified is where A 0 is block diagonal, so that multiplying through by A0 1 does not mix up the equations.