Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, / 1

Similar documents
Visualizing VAR s: Regularization and Network Tools for High-Dimensional Financial Econometrics

Title. Description. var intro Introduction to vector autoregressive models

Factor models. March 13, 2017

1. Introduction. Hang Qian 1 Iowa State University

Rapid Introduction to Machine Learning/ Deep Learning

ECON 4160, Autumn term 2017 Lecture 9

Financial Econometrics

CS Lecture 3. More Bayesian Networks

STA 4273H: Statistical Machine Learning

Labor-Supply Shifts and Economic Fluctuations. Technical Appendix

Chris Bishop s PRML Ch. 8: Graphical Models

Directed Graphical Models

Based on slides by Richard Zemel

CPSC 540: Machine Learning

Factor models. May 11, 2012

Probabilistic Graphical Networks: Definitions and Basic Results

Structural VARs II. February 17, 2016

Bayesian Linear Regression

Computational Macroeconomics. Prof. Dr. Maik Wolters Friedrich Schiller University Jena

Simultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations

Vector Autoregressive Model. Vector Autoregressions II. Estimation of Vector Autoregressions II. Estimation of Vector Autoregressions I.

Sampling Algorithms for Probabilistic Graphical models

Inference in Bayesian Proxy-SVARs

DAG models and Markov Chain Monte Carlo methods a short overview

Summary of the Bayes Net Formalism. David Danks Institute for Human & Machine Cognition

Structural Vector Autoregressions with Markov Switching. Markku Lanne University of Helsinki. Helmut Lütkepohl European University Institute, Florence

Assessing Structural Convergence between Romanian Economy and Euro Area: A Bayesian Approach

Econometría 2: Análisis de series de Tiempo

Lecture 4: Bayesian VAR

New Introduction to Multiple Time Series Analysis

Bayesian Inference for Gaussian Mixed Graph Models

Web Appendix for The Dynamics of Reciprocity, Accountability, and Credibility

Bayesian Inference for Gaussian Mixed Graph Models

STA 4273H: Statistical Machine Learning

A primer on Structural VARs

Identifying the Monetary Policy Shock Christiano et al. (1999)

Structural VAR Models and Applications

Bayesian Machine Learning

Structural VAR and Financial Networks: A Minimum Distance Approach to Spatial Modeling (Job Market Paper)

Estimating Global Bank Network Connectedness

November 2002 STA Random Effects Selection in Linear Mixed Models

Causality in Econometrics (3)

Bayesian model selection in graphs by using BDgraph package

CS839: Probabilistic Graphical Models. Lecture 2: Directed Graphical Models. Theo Rekatsinas

Graphical models and causality: Directed acyclic graphs (DAGs) and conditional (in)dependence

Dynamic Factor Models and Factor Augmented Vector Autoregressions. Lawrence J. Christiano

Lecture 7a: Vector Autoregression (VAR)

Identifying SVARs with Sign Restrictions and Heteroskedasticity

VAR Model. (k-variate) VAR(p) model (in the Reduced Form): Y t-2. Y t-1 = A + B 1. Y t + B 2. Y t-p. + ε t. + + B p. where:

Network Connectivity and Systematic Risk

Identifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data

1. Shocks. This version: February 15, Nr. 1

Bayesian Mixed Frequency VAR s

Dynamic chain graphs for high-dimensional time series: an application to real-time traffic flow forecasting

Recall from last time. Lecture 3: Conditional independence and graph structure. Example: A Bayesian (belief) network.

Bayesian networks. Independence. Bayesian networks. Markov conditions Inference. by enumeration rejection sampling Gibbs sampler

STA 414/2104: Machine Learning

Identification of Economic Shocks by Inequality Constraints in Bayesian Structural Vector Autoregression

1. what conditional independencies are implied by the graph. 2. whether these independecies correspond to the probability distribution

Frontiers in Forecasting, Minneapolis February 21-23, Sparse VAR-Models. Christophe Croux. EDHEC Business School (France)

MCMC algorithms for fitting Bayesian models

Lecture 7a: Vector Autoregression (VAR)

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

Directed and Undirected Graphical Models

1 : Introduction. 1 Course Overview. 2 Notation. 3 Representing Multivariate Distributions : Probabilistic Graphical Models , Spring 2014

Introduction to Probabilistic Graphical Models

The Monte Carlo Method: Bayesian Networks

Chapter 16. Structured Probabilistic Models for Deep Learning

STA 4273H: Statistical Machine Learning

A MCMC Approach for Learning the Structure of Gaussian Acyclic Directed Mixed Graphs

Probabilistic Graphical Models

2 : Directed GMs: Bayesian Networks

Computational Genomics. Systems biology. Putting it together: Data integration using graphical models

Graphical Representation of Causal Effects. November 10, 2016

Generalized Impulse Response Analysis: General or Extreme?

CENTRE FOR CENTRAL BANKING STUDIES

Probabilistic Graphical Models (I)

ST 740: Markov Chain Monte Carlo

COMMENT ON DEL NEGRO, SCHORFHEIDE, SMETS AND WOUTERS COMMENT ON DEL NEGRO, SCHORFHEIDE, SMETS AND WOUTERS

TAKEHOME FINAL EXAM e iω e 2iω e iω e 2iω

A graph contains a set of nodes (vertices) connected by links (edges or arcs)

CS 2750: Machine Learning. Bayesian Networks. Prof. Adriana Kovashka University of Pittsburgh March 14, 2016

Learning in Bayesian Networks

A Brief Introduction to Graphical Models. Presenter: Yijuan Lu November 12,2004

Intelligent Systems: Reasoning and Recognition. Reasoning with Bayesian Networks

Estimating and Accounting for the Output Gap with Large Bayesian Vector Autoregressions

Class 4: VAR. Macroeconometrics - Fall October 11, Jacek Suda, Banque de France

The Origin of Deep Learning. Lili Mou Jan, 2015

Index. Pagenumbersfollowedbyf indicate figures; pagenumbersfollowedbyt indicate tables.

Cross-sectional space-time modeling using ARNN(p, n) processes

Bayesian spatial hierarchical modeling for temperature extremes

Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8]

Nonperforming Loans and Rules of Monetary Policy

Time Series Analysis for Macroeconomics and Finance

Conditional Independence

Part 6: Multivariate Normal and Linear Models

Multivariate Bayesian Linear Regression MLAI Lecture 11

Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, Jeffreys priors. exp 1 ) p 2

Monetary Policy and Exchange Rate Shocks in Brazil: Sign Restrictions versus a New Hybrid Identification Approach *

LECTURE 3 The Effects of Monetary Changes: Statistical Identification. September 5, 2018

Transcription:

Bayesian Graphical Models for Structural Vector Autoregressive Processes Daniel Ahelegbey, Monica Billio, and Roberto Cassin (2014) March 21, 2015 Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 1 / 1

Background This paper is about combining information from Dynamic Networks to inform the causal structure of Structural Vector Autoregressions The paper discusses using networks for estimating both structural and autoregressive coefficients, but my focus will be on the structural components (the autoregressive ones are easier) Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 2 / 1

Background: What is a SVAR? Consider the structural VAR(L) process: Y t = B 0 Y t + B 1 Y t 1 + + B L Y t L + ε t Where ε t N(0, I p ), and B 0 has 0s on the diagonal Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 3 / 1

Background: What is a SVAR? Consider the structural VAR(L) process: Y t = B 0 Y t + B 1 Y t 1 + + B L Y t L + ε t Where ε t N(0, I p ), and B 0 has 0s on the diagonal Rewriting the SVAR into a reduced-form VAR: Y t = A 1 0 B 1Y t 1 + A 1 0 B 2Y t 2 +... A 1 0 B LY t L + A 1 0 ε t where A 0 = I B 0 The problem is that the structural parameter, A 0 is not identified. Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 3 / 1

Background: Identification Issue Y t = A 1 0 B 1Y t 1 + A 1 0 B 2Y t 2 +... A 1 0 B LY t L + A 1 0 ε t Observe that A 1 0 A 1 0 is the covariance of the error term. Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 4 / 1

Background: Identification Issue Y t = A 1 0 B 1Y t 1 + A 1 0 B 2Y t 2 +... A 1 0 B LY t L + A 1 0 ε t Observe that A 1 0 A 1 0 is the covariance of the error term. But, for any orthogonal matrix Q (so QQ = I ), A 1 0 A 1 0 = A 1 0 QQ A 1 0 = (A 1 0 Q)(A 0Q) So the structural parameters, A 0 are not identified Requires making some identification assumptions to perform structural analysis Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 4 / 1

Background: Directed Acyclic Graphs A Directed Acyclic Graph is a collection G = {V, E}, where V is the set of vertices and E is the set of edges. For example: Figure 1 : DAG from Grzegorcyzk (2001) In this graph, node A is the parent of B and C. B and C are child nodes of A. D is the child node of both B and C, and E is the child node of E. A graph is said to be acyclic if no node is a descendant of itself Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 5 / 1

Connecting DAG and SVAR Y t = B 0 Y t + B 1 Y t 1 + + B L Y t L + ε t Then, there is a one-to-one relationship between the regression matrices and Directed Acyclic Graphs, given as: X j t s X i t B s (i, j) 0 Where X j t s X i t means that X j t s causes in some way, X i t. Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 6 / 1

Connecting DAG and SVAR Y t = B 0 Y t + B 1 Y t 1 + + B L Y t L + ε t Then, there is a one-to-one relationship between the regression matrices and Directed Acyclic Graphs, given as: X j t s X i t B s (i, j) 0 Where X j t s X i t means that X j t s causes in some way, X i t. Note that despite the directed aspect, the assumption of causality is not totally innocuous Think about a stock market ascent that may lead to an increase in GDP. The stock market may not cause GDP to increase, it could merely lead it in time. For more information, see Dawid 2008 This is similar to the notion of Granger Causality (with the exception of contemporaneous causation and conditional independence) Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 6 / 1

DAG and Cholesky One identification of the orthogonal shocks in the VAR is to use a Cholesky Decomposition PP = A 1 0 A 1 0 Acyclicality implies a specific ordering of variables For example, if X 1 X 2, then let X 1 be the first variable in the system, and X 2 the second Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 7 / 1

Local Markov Property A graph is said to be posses the Local Markov Property if N P(X 1, X 2,... X n ) = P(X i pa(x i )) i=1 where pa(x i ) is the set of parent nodes for node X i Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 8 / 1

Local Markov Property A graph is said to be posses the Local Markov Property if N P(X 1, X 2,... X n ) = P(X i pa(x i )) i=1 where pa(x i ) is the set of parent nodes for node X i In the example above, the full likelihood of the graph can be simplified into: P(G) = P(A)P(B A)P(C A)P(D {B, C})P(E D) Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 8 / 1

Estimation Define B s = G s Φ s, 0 s p, where is the Hadamarad element-wise matrix multiplication, and Φ s are the reduced form parameters G s are connectivity matrices that indicates dependence Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 9 / 1

Estimation Define B s = G s Φ s, 0 s p, where is the Hadamarad element-wise matrix multiplication, and Φ s are the reduced form parameters G s are connectivity matrices that indicates dependence Thus, the reduced form parameters of the VAR can be written as: A 0 = I G 0 Φ 0 A i = (I G 0 Φ 0 ) 1 (G i Φ i ) Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, 2015 9 / 1

Estimation: Bayesian Paradigm (priors) Y t = B 0 Y t + B 1 Y t 1 + + B L Y t L + ε t, ε t N(0, I ) Likelihood: The data matrix, X N(0, Σ x ) Prior: Define the probability distribution over a graph as: P(G, Θ) = P(G)P(Θ G) Where G is the set of graph structures (nodes, edges and directions), and Θ is the set of parameters. P(G) 1 Bi are distributed normally Conditional on a complete graph, P(Σ G) IW Bayesian Graphical Models for Structural Vector Autoregressive March Processes 21, 2015 10 / 1

Estimation: Bayesian Paradigm (priors) Y t = B 0 Y t + B 1 Y t 1 + + B L Y t L + ε t, ε t N(0, I ) Likelihood: The data matrix, X N(0, Σ x ) Prior: Define the probability distribution over a graph as: P(G, Θ) = P(G)P(Θ G) Where G is the set of graph structures (nodes, edges and directions), and Θ is the set of parameters. P(G) 1 Bi are distributed normally Conditional on a complete graph, P(Σ G) IW Note: This seems a little strange, since they have not specified a prior on the covariance except conditional on a complete graph. If the graph is not complete, the covariance will not be IW distributed Bayesian Graphical Models for Structural Vector Autoregressive March Processes 21, 2015 10 / 1

Marginal Likelihood The marginal likelihood can be factorized P(X G) = P(X G, Σ x )P(Σ x G)dΣ x Under the assumed likelihood/priors, the marginal likelihood has a closed form They estimate a Multivariate-Normal-Inverse-Wishart process and a Minnesota Prior process Bayesian Graphical Models for Structural Vector Autoregressive March Processes 21, 2015 11 / 1

Model Inference Since everything has closed forms, they implement the following Gibbs Sampler: 1 Sample the graph from the conditional posterior using Metropolis Hastings 2 Sample the reduced form parameters directly from their posterior Bayesian Graphical Models for Structural Vector Autoregressive March Processes 21, 2015 12 / 1

Simulations They compare their scheme to a competitor (the PC algorithm) They find that their model does about 10% better for a small-scale VAR (n=5), but comparably well in a larger system (n=10) Bayesian Graphical Models for Structural Vector Autoregressive March Processes 21, 2015 13 / 1

Simulations They compare their scheme to a competitor (the PC algorithm) They find that their model does about 10% better for a small-scale VAR (n=5), but comparably well in a larger system (n=10) An additional advantage of their model over the PC algorithm is that they can do prediction Bayesian Graphical Models for Structural Vector Autoregressive March Processes 21, 2015 13 / 1

Macroeconomic Time Series application Macro-forecasting based on medium size (n=20) VARs Dataset is quarterly observations from 1959Q1-2008Q4 Rolling windows of 14 years are used to estimate the model Bayesian Graphical Models for Structural Vector Autoregressive March Processes 21, 2015 14 / 1

Goodness of Fit Bayesian Graphical Models for Structural Vector Autoregressive March Processes 21, 2015 15 / 1

Big Data Implications It is hard to tell, but it seems like this estimation is extremely computationally intensive Calculating the contemporaneous dependencies are limited to 5-7 variables Time series are limited to to 50 observations They also apply their process to 19 financial sectors to estimate financial interconnectedness, but only estimate the autoregressive component Bayesian Graphical Models for Structural Vector Autoregressive March Processes 21, 2015 16 / 1

Comments This paper was overall very good Concise, Bayesian method for estimating structural parameters of an SVAR However, they do not tie their results into cholesky factorization or SVAR identification at the end Should we do impulse responses with their method or others? Can the efficiency of the inference scheme be improved Bayesian Graphical Models for Structural Vector Autoregressive March Processes 21, 2015 17 / 1