Non-nested model selection. in unstable environments

Similar documents
Model Comparisons in Unstable Environments

Combining Macroeconomic Models for Prediction

Can News be a Major Source of Aggregate Fluctuations?

Financial Factors in Economic Fluctuations. Lawrence Christiano Roberto Motto Massimo Rostagno

Monitoring Structural Change in Dynamic Econometric Models

Dynamics and Monetary Policy in a Fair Wage Model of the Business Cycle

Estimating and Accounting for the Output Gap with Large Bayesian Vector Autoregressions

A Test for State-Dependent Predictive Ability based on a Markov-Switching Framework

Monitoring Forecasting Performance

What You Match Does Matter: The Effects of Data on DSGE Estimation

Lectures on Structural Change

Warwick Business School Forecasting System. Summary. Ana Galvao, Anthony Garratt and James Mitchell November, 2014

Estimating Macroeconomic Models: A Likelihood Approach

Taylor Rules and Technology Shocks

Forecast combination based on multiple encompassing tests in a macroeconomic DSGE-VAR system

Department of Economics, Vanderbilt University While it is known that pseudo-out-of-sample methods are not optimal for

DSGE Methods. Estimation of DSGE models: GMM and Indirect Inference. Willi Mutschler, M.Sc.

Gaussian Mixture Approximations of Impulse Responses and the Non-Linear Effects of Monetary Shocks

Single Equation Linear GMM with Serially Correlated Moment Conditions

The Natural Rate of Interest and its Usefulness for Monetary Policy

Resolving the Missing Deflation Puzzle. June 7, 2018

Multivariate Out-of-Sample Tests for Granger Causality

Inference when identifying assumptions are doubted. A. Theory B. Applications

Inference when identifying assumptions are doubted. A. Theory. Structural model of interest: B 1 y t1. u t. B m y tm. u t i.i.d.

Identifying the Monetary Policy Shock Christiano et al. (1999)

Gold Rush Fever in Business Cycles

Assessing Structural VAR s

DSGE-Models. Calibration and Introduction to Dynare. Institute of Econometrics and Economic Statistics

Single Equation Linear GMM with Serially Correlated Moment Conditions

Modern Macroeconomics and Regional Economic Modeling

Point, Interval, and Density Forecast Evaluation of Linear versus Nonlinear DSGE Models

DSGE-Models. Limited Information Estimation General Method of Moments and Indirect Inference

Modelling Czech and Slovak labour markets: A DSGE model with labour frictions

Discussion of Riccardo DiCecio \Comovement: It's not a Puzzle" Matteo Iacoviello Boston College

The Return of the Wage Phillips Curve

Vector Autoregressive Model. Vector Autoregressions II. Estimation of Vector Autoregressions II. Estimation of Vector Autoregressions I.

Perceived productivity and the natural rate of interest

Empirical and Policy Performance of a Forward-Looking Monetary Model

Signaling Effects of Monetary Policy

Empirical and Policy Performance of a Forward-Looking Monetary Model

Shrinkage in Set-Identified SVARs

Economics Discussion Paper Series EDP Measuring monetary policy deviations from the Taylor rule

Notes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market

Dynamic Factor Models Cointegration and Error Correction Mechanisms

Agnostic Structural Disturbances (ASDs): Detecting and Reducing Misspecification in Empirical Macroeconomic Models

Inflation Revisited: New Evidence from Modified Unit Root Tests

Lecture 4 The Centralized Economy: Extensions

Optimal Monetary Policy in a Data-Rich Environment

Comparing Predictive Accuracy, Twenty Years Later: On The Use and Abuse of Diebold-Mariano Tests

PANEL DISCUSSION: THE ROLE OF POTENTIAL OUTPUT IN POLICYMAKING

Econometrics of Panel Data

Bayesian Estimation of DSGE Models: Lessons from Second-order Approximations

Aggregate Demand, Idle Time, and Unemployment

Monetary Economics. Lecture 15: unemployment in the new Keynesian model, part one. Chris Edmond. 2nd Semester 2014

Assessing Structural VAR s

Are Structural VARs Useful Guides for Developing Business Cycle Theories? by Larry Christiano

DSGE Model Forecasting

Final Exam November 24, Problem-1: Consider random walk with drift plus a linear time trend: ( t

Aggregate Demand, Idle Time, and Unemployment

Assessing Structural VAR s

Assessing Structural VAR s

Optimal Inflation Stabilization in a Medium-Scale Macroeconomic Model

Econ 583 Final Exam Fall 2008

Interest Rate Determination & the Taylor Rule JARED BERRY & JAIME MARQUEZ JOHNS HOPKINS SCHOOL OF ADVANCED INTERNATIONAL STUDIES JANURY 2017

Smets and Wouters model estimated with skewed shocks - empirical study of forecasting properties

S TICKY I NFORMATION Fabio Verona Bank of Finland, Monetary Policy and Research Department, Research Unit

Sequential Monte Carlo Methods (for DSGE Models)

Gold Rush Fever in Business Cycles

Predicting bond returns using the output gap in expansions and recessions

General Examination in Macroeconomic Theory SPRING 2013

CHANGE DETECTION IN TIME SERIES

Econometría 2: Análisis de series de Tiempo

Macroeconomics II. Dynamic AD-AS model

Dynamic AD-AS model vs. AD-AS model Notes. Dynamic AD-AS model in a few words Notes. Notation to incorporate time-dimension Notes

DSGE models: problems and some personal solutions. Fabio Canova EUI and CEPR. March 2014

The US Phillips Curve and inflation expectations: A State. Space Markov-Switching explanatory model.

ECON 4160, Lecture 11 and 12

Monetary Economics: Problem Set #4 Solutions

Bayesian Inference for DSGE Models. Lawrence J. Christiano

Simple New Keynesian Model without Capital

Assessing Structural Convergence between Romanian Economy and Euro Area: A Bayesian Approach

Chapter 6. Maximum Likelihood Analysis of Dynamic Stochastic General Equilibrium (DSGE) Models

Searching for the Output Gap: Economic Variable or Statistical Illusion? Mark W. Longbrake* J. Huston McCulloch

LECTURE 11. Introduction to Econometrics. Autocorrelation

epub WU Institutional Repository

Are Uncertainty Shocks Aggregate Demand Shocks?

The Price Puzzle: Mixing the Temporary and Permanent Monetary Policy Shocks.

Optimal Simple And Implementable Monetary and Fiscal Rules

WORKING PAPER SERIES NO. 491 / JUNE 2005

Adaptive Learning and Applications in Monetary Policy. Noah Williams

The Smets-Wouters Model

COMMENT ON DEL NEGRO, SCHORFHEIDE, SMETS AND WOUTERS COMMENT ON DEL NEGRO, SCHORFHEIDE, SMETS AND WOUTERS

Assessing Structural VAR s

WORKING PAPER NO DSGE MODEL-BASED FORECASTING OF NON-MODELLED VARIABLES

Are Policy Counterfactuals Based on Structural VARs Reliable?

Stagnation Traps. Gianluca Benigno and Luca Fornaro

On the Fit and Forecasting Performance of New-Keynesian Models

Graduate Macro Theory II: Notes on Quantitative Analysis in DSGE Models

Vector Autoregressions as a Guide to Constructing Dynamic General Equilibrium Models

Indeterminacy and Sunspots in Macroeconomics

Transcription:

Non-nested model selection in unstable environments Raffaella Giacomini UCLA (with Barbara Rossi, Duke)

Motivation The problem: select between two competing models, based on how well they fit thedata Both models are possibly misspecified Widely documented instability in macro and financial time series = the relative performance of the models may also be unstable Existing techniques for model selection compare the average performance of the models over historical samples = loss of information if relative performancevariesovertime Our goal: propose formal techniques to test whether the relative performance oftwomodelsisstable

Example Model 1 is better than model 2 (= higher likelihood) at the beginning of the sample and then the order switches. One would like to choose model 2 at the end of the sample, but a measure of average performance may find that the models are equivalent.

Example (cont.)

Contributions We propose new model selection tests to evaluate whether the relative performance of two competing models varies over time. Two tests: Fluctuationtestto analyze the evolution of the models relative performance over historical samples Sequential test to monitor the models relative performance in real time, as new data becomes available Valid under general conditions: data instability; models can be nonlinear, multivariate and misspecified; ML estimation Empirical application: evaluate time-variation in performance of Smets and Wouters (2003) DSGE model relative to BVARs.

Outline of the talk Fluctuation test Sequential test Monte Carlo evidence Empirical application to DSGE vs. BVAR Conclusion

Related literature Fluctuation test: Vuong, 1989 = test for equal full-sample average relative performance of non-nested, misspecified models (iid data) Rivers and Vuong, 2002 = heterogeneous + dependent data but tests that models are asymptotically equivalent Diebold- Mariano, 1995; West, 1996; McCracken 2000 etc. = test for equal out-of-sample average relative performance of non-nested, misspecified models Giacomini and White, 2006 = test if out-of-sample relative performance is related to explanatory variables

Related literature (cont.) Rossi, 2005 = test if parameters are stable + significant ( nested model selection under instability, but assumes correct specification) We test a different H 0 : the relative performance is equal at each point in time + consider non-nested, possibly misspecified models Tests for parameter instability (Brown, Durbin and Evans, 1975, Andrews, 1993, Bai, 1998 etc. etc.) We borrow from this literature, but focus on stability of relative performance rather than model s parameters (relative performance may be stable even though parameters change)

Related literature Sequential test: Chu, Stinchcombe and White (1996) = real-time parameter stability within one model Inoue and Rossi (2005) = assess if variable has real-time predictive content for another variable We ask a different question (compare non-nested models based on measures of fit) and have more general assumptions (e.g., both models misspecified)

Fluctuation test - Set-up Select a model for vector y t using other variables z t. Let x t =(yt 0,z0 t )0. Historical sample (x 1,...,x T ). Two competing models, estimated by ML. Idea: re-estimate models recursively starting from observation R<T using either an expanding sample ( recursive scheme ) or a rolling samples of size R ( rolling scheme ) At each time t, measure relative performance as Q t ³ˆθ t Qt (ˆγ t ) where Q t ³ˆθ t is the average log-likelihood over the estimation sample

Fluctuation test - Recursive scheme Performance measure for model 1: Q t ³ˆθ t = t 1 P t j=1 ln f(x j ; ˆθ t ) Q R ³ˆθ R QR (ˆγ R ) 1 R ³ˆγR+1 Q R+1 ³ˆθ R+1 QR+1 1 R+1 Q T ³ˆθ T QT (ˆγ T ) 1 T

Fluctuation test - Rolling scheme Performance measure for model 1: Q t ³ˆθ t = R 1 P t j=t R+1 ln f(x j ; ˆθ t ) Q R ³ˆθ R QR (ˆγ R ) 1 R ³ˆγR+1 Q R+1 ³ˆθ R+1 QR+1 2 R+1 T-R+1 Q T ³ˆθ T QT (ˆγ T ) T

Fluctuation test - Null hypothesis Null hypothesis: H 0 : EQ t (θ t ) EQ t (γ t )=0for all t =1,...,T, θ t and γ t are the pseudo-true parameters. E.g., for the recursive scheme, θ t =argmax θ E t 1 t X j=1 ln f ³ x j ; θ Note that the parameters of the models may be unstable under the null hypothesis

Fluctuation test - Implementation Compute sequences of statistics for t = R,..., T Ft rec = ˆσ 1 ³ t t Qt (ˆθ t ) Q t (ˆγ t ) Ft roll = ˆσ 1 ³ R Qt (ˆθ t ) Q t (ˆγ t ), t ˆσ 2 t is a HAC estimator of the asymptotic variance σ2 t = var( t (Q t (θ t ) Q t(γ t ))) DoesthesamplepathofF t significantly deviate from its hypothesized value of 0?

Fluctuation test - Recursive scheme Intuition: if the models are non-nested, for a particular t, Ft rec can be approximated by a N(0, 1) under H 0 = the sample path of Ft rec behaves like a Brownian motion that starts at zero at time t = R. = can derive boundary lines that are crossed with given probability α. For each t = R,..., T and significance level α: s µ c rec α,t = ± kα rec T R 1+2 t R t T R (Same as for the CUSUM test of Brown, Durbin and Evans, 1975) Typical values of (α, k rec α ) are (0.01, 1.143), (0.05, 0.948) and (0.10, 0.850)

Fluctuation test - Rolling scheme For the rolling scheme, the sample path of Ft rol behaves like the increments of a Brownian bridge = can derive boundary lines that are crossed with given probability α by simulation c roll α,t = ± kα roll kα roll R/T depends on R/T. We give a table with k roll α for typical values of a and

Fluctuation test - Assumptions Standard assumptions that guarantee that a FCLT holds for T 1/2 P t j=1 ln f(x j ; θ) Parameter and data instability allowed under H 0 Data have short memory (= no unit roots) σ 2 t is not o(1) (rules out nested models) R/T ρ finite and positive

Fluctuationtestinpractice(recursive)

Outline of the talk Fluctuation test Sequential test Monte Carlo evidence Empirical application to DSGE vs. BVAR Conclusion

Sequential test Monitor the model-selection decision in the post-historical sample period Supposemodel1wasbestoverthehistoricalsampleuptotimeT : EQ T (θ T ) EQ T (γ T ) > 0 Test hypothesis that model 1 continues to be the best for all subsequent periods: H 0 : EQ t (θ t ) EQ t (γ t ) 0 for t = T +1,T +2,..., against H 1 : EQ t (θ t ) EQ t(γ t ) < 0 at some t T. Doing a sequence of full-sample Vuong s (1989) tests rejects too often = find critical values that control the overall size of the procedure

Sequential test Compute sequence of test statistics for t = T +1,T +2,... J t =ˆσ 1 ³ t t Qt (ˆθ t ) Q t (ˆγ t ) As in Chu, Stinchcombe and White (1995), the critical value at time t for a level α test is: q c α,t = k α +ln(t/t ) Typical values of (α, k α ) are (0.05, 2.7955) and (0.10, 2.5003).

Outline of the talk Fluctuation test Sequential test Monte Carlo evidence Empirical application to DSGE vs. BVAR Conclusion

Monte Carlo evidence Compare our test to Vuong s (1989) in-sample test and to West s (1996) outof-sample test DGP with parameter variation: y t = β t x t + γ t z t + ε t, t =1, 2,...T, T =400 β t = 1+β 1(200<t 250) + (1 β) 1(t>250) γ t = 1+γ 1(200<t 250) + (1 γ) 1(t>250). Model 1: y t = βx t + u 1t. Model 2: y t = γz t + u 2t. Size: β = γ =0.5 = models are equally good. Power: β =0.95,γ =0.4 = time variation in relative performance

Monte Carlo evidence Table 2. Empirical rejection frequencies of nominal 5% tests. Ft rec Vuong West (a) Historical sample Size 0.051 0.047 0.044 Power 0.449 0.047 0.026 (b) Post-historical sample t/t J t Vuong 1.5 0.010 0.121 1.75 0.020 0.152 2 0.032 0.179

Outline of the talk Fluctuation test Sequential test Monte Carlo evidence Empirical application to DSGE vs. BVAR Conclusion

Application: SW s DSGE vs. BVAR Smets and Wouters (2003) (SW): An estimated DSGE model of the Euro Area : estimation of a 7-equation linearized DSGE model with sticky prices and wages, habit formation, capital adjustment costs and variable capacity utilization. Their findings: the DSGE model has comparable fit tothatofatheoreticalvars the parameter estimates have the expected sign

Open questions Have the parameters been stable? Perhaps not. Possible structural changes in the economy (European union introduction, productivity changes, etc.) If the parameters have changed = the performance of the DSGE model may have changed too... so SW s result only holds on average Can we say that the DSGE s performance has been stable over time, and significantly better than that of the VAR?

Application: SW s DSGE vs. BVAR SW sample: quarterly data 1970:2-1999:4 on DGP, consumption, investment, prices, real wages, employment, real interest rate Two questions: 1. Is there evidence of parameter variation in the DSGE parameters? 2. Was the DSGE consistently and significantly better than atheoretical Bayesian VARs over the sample?

Q#1: time-variation in DSGE parameters Estimate DSGE model recursively using rolling samples of size R =70and plot posterior mode persistence of shocks standard deviation of shocks monetary policy parameters

Parameter variation - shocks persistence 1 Productivity shock persistence 1 Investment shock persistence 0.95 0.95 0.9 0.85 0.8 0.75 0.9 0.85 0.8 0.7 0.75 0.65 0 10 20 30 40 50 0.7 0 10 20 30 40 50 1 Govt. spending shock persistence 1 Labor supply shock persistence 0.98 0.96 0.9 0.94 0.92 0.8 0.9 0.7 0.88 0.86 0 10 20 30 40 50 0.6 0 10 20 30 40 50

Parameter variation - standard deviation of shocks productivity shock st. dev. investment shock st. dev. 0.9 0.12 0.8 0.1 0.7 0.08 0.6 0.06 0.5 0.04 0.4 0 10 20 30 40 50 0.02 0 10 20 30 40 50 0.4 Govt. spending shock st. dev. 8 Labor supply shock st. dev. 0.38 7 0.36 0.34 0.32 6 5 0.3 4 0.28 0 10 20 30 40 50 3 0 10 20 30 40 50

Parameter variation monetary policy parameters 2 Inflation coeff. 0.3 d(inflation) coeff. 1.8 0.2 0.1 1.6 0 1.4 0 10 20 30 40 50 Lagged interest rate coeff. 1 0.98 0.96 0.94 0 10 20 30 40 50 d(output gap) coeff. 0.25 0.2 0.15 0.1 0.05 0 10 20 30 40 50-0.1 0 10 20 30 40 50 Output gap coeff. 0.4 0.3 0.2 0.1 0 0 10 20 30 40 50 Interest rate shock st. dev. 0.1 0.08 0.06 0.04 0.02 0 10 20 30 40 50

Summary: time-variation in structural parameters Moderate evidence of parameter variation in the DSGE model: persistence of productivity shock persistence and standard deviation of investment shock monetary policy coefficients fairly stable

Q#2: time-varying performance of DSGE vs BVAR Recursive and rolling fluctuation tests. Total sample T =118. Initial estimation sample R =70 DSGE vs. BVAR(1) with Minnesota priors ³ Compute sequences of difference in average log-likelihoods Q b t θ t Qt (bγ t ) for t = R,..., T b θ t is the posterior mode (a consistent estimator of θ t )

Fluctuation test - recursive 3 DSGE vs. BVAR(1) 2 1 0-1 -2-3 -4-5 1988 1990 1992 1994 1996 1998 2000

Fluctuation test - rolling 4 DSGE vs. BVAR(1) 3 2 1 0-1 -2-3 -4 1988 1990 1992 1994 1996 1998 2000

Conclusion and extensions Proposed a formal method for evaluating time-variation in relative performance of misspecified non-nested models Empirical application confirmed SW s result that a DSGE has comparable performance to a BVAR in recent years Extension: optimal fluctuation test (against random walk alternative)