Endogenous Information Choice

Similar documents
An Introduction to Rational Inattention

Rational Inattention

Christian Hellwig 1 Sebastian Kohls 2 Laura Veldkamp 3. May 2012

Information Choice in Macroeconomics and Finance.

SIGNALS, BELIEFS AND UNUSUAL EVENTS ERID LECTURE DUKE UNIVERSITY

Expectation Formation and Rationally Inattentive Forecasters

1 The Basic RBC Model

Macroeconomics Theory II

Stickiness. September 18, 2017

Optimal Monetary Policy with Informational Frictions

Signaling Effects of Monetary Policy

Costly Social Learning and Rational Inattention

Forward Guidance without Common Knowledge

Cross-Country Differences in Productivity: The Role of Allocation and Selection

The New Keynesian Model: Introduction

Open Economy Macroeconomics: Theory, methods and applications

Imperfect Information and Optimal Monetary Policy

A Modern Equilibrium Model. Jesús Fernández-Villaverde University of Pennsylvania

Knowing What Others Know: Coordination Motives in Information Acquisition

Identifying the Monetary Policy Shock Christiano et al. (1999)

Precautionary Demand for Money in a Monetary Business Cycle Model

Stagnation Traps. Gianluca Benigno and Luca Fornaro

Online Appendix for Slow Information Diffusion and the Inertial Behavior of Durable Consumption

Comprehensive Exam. Macro Spring 2014 Retake. August 22, 2014

Players as Serial or Parallel Random Access Machines. Timothy Van Zandt. INSEAD (France)

General Examination in Macroeconomic Theory SPRING 2013

Business Cycle Dynamics under Rational Inattention

Lecture 4 The Centralized Economy: Extensions

1 Bewley Economies with Aggregate Uncertainty

Estimating and Identifying Vector Autoregressions Under Diagonality and Block Exogeneity Restrictions

Phillips curve and the Tinbergen and Theil approach to economic policy

Chapter 6. Maximum Likelihood Analysis of Dynamic Stochastic General Equilibrium (DSGE) Models

4- Current Method of Explaining Business Cycles: DSGE Models. Basic Economic Models

Microeconomic theory focuses on a small number of concepts. The most fundamental concept is the notion of opportunity cost.

MA Advanced Macroeconomics: 7. The Real Business Cycle Model

Y t = log (employment t )

Macroeconomics II. Dynamic AD-AS model

Dynamic AD-AS model vs. AD-AS model Notes. Dynamic AD-AS model in a few words Notes. Notation to incorporate time-dimension Notes

Business Cycle Dynamics under Rational Inattention

The Lucas Imperfect Information Model

Forward Guidance without Common Knowledge

Macroeconomics Theory II

Multivariate Rational Inattention

Strategic Inattention, Inflation Dynamics and the Non-Neutrality of Money

Taylor Rules and Technology Shocks

General Examination in Macroeconomic Theory

Stabilization policy with rational expectations. IAM ch 21.

Inflation traps, and rules vs. discretion

Introduction to Macroeconomics

Overfitting, Bias / Variance Analysis

1 Teaching notes on structural VARs.

problem. max Both k (0) and h (0) are given at time 0. (a) Write down the Hamilton-Jacobi-Bellman (HJB) Equation in the dynamic programming

Indeterminacy and Sunspots in Macroeconomics

Mechanics of linear quadratic Gaussian rational. inattention tracking problems

Business Cycle Dynamics under Rational Inattention

Ergodicity and Non-Ergodicity in Economics

Modelling Czech and Slovak labour markets: A DSGE model with labour frictions

Macroeconomics Theory II

1 Teaching notes on structural VARs.

Monetary Economics. Lecture 15: unemployment in the new Keynesian model, part one. Chris Edmond. 2nd Semester 2014

Dynamics and Monetary Policy in a Fair Wage Model of the Business Cycle

DISCRETE ACTIONS IN INFORMATION-CONSTRAINED DECISION PROBLEMS

FINM6900 Finance Theory Noisy Rational Expectations Equilibrium for Multiple Risky Assets

Fundamental Disagreement

Department of Agricultural Economics. PhD Qualifier Examination. May 2009

Neoclassical Business Cycle Model

Simple New Keynesian Model without Capital

Lars Svensson 2/16/06. Y t = Y. (1) Assume exogenous constant government consumption (determined by government), G t = G<Y. (2)

Business Cycle Dynamics under Rational Inattention

The welfare cost of energy insecurity

A Dynamic Model of Aggregate Demand and Aggregate Supply

Local Interactions in a Market with Heterogeneous Expectations

1. Money in the utility function (start)

Problem Set 4. Graduate Macro II, Spring 2011 The University of Notre Dame Professor Sims

ECOM 009 Macroeconomics B. Lecture 2

In the Name of God. Sharif University of Technology. Microeconomics 1. Graduate School of Management and Economics. Dr. S.

Ambiguous Business Cycles: Online Appendix

Sentiments and Aggregate Fluctuations

A primer on Structural VARs

Design Patent Damages under Sequential Innovation

Learning to Optimize: Theory and Applications

1. Constant-elasticity-of-substitution (CES) or Dixit-Stiglitz aggregators. Consider the following function J: J(x) = a(j)x(j) ρ dj

Lecture 11: Information theory THURSDAY, FEBRUARY 21, 2019

Volume 30, Issue 3. A note on Kalman filter approach to solution of rational expectations models

Short correct answers are sufficient and get full credit. Including irrelevant (though correct) information in an answer will not increase the score.

Lecture 7. The Dynamics of Market Equilibrium. ECON 5118 Macroeconomic Theory Winter Kam Yu Department of Economics Lakehead University

CS 630 Basic Probability and Information Theory. Tim Campbell

Real Business Cycle Model (RBC)

(a) Write down the Hamilton-Jacobi-Bellman (HJB) Equation in the dynamic programming

Solving a Dynamic (Stochastic) General Equilibrium Model under the Discrete Time Framework

Closed economy macro dynamics: AD-AS model and RBC model.

Solving Models with Heterogeneous Expectations

High-dimensional Problems in Finance and Economics. Thomas M. Mertens

Getting to page 31 in Galí (2008)

Macroeconomics Theory II

Monetary Economics: Solutions Problem Set 1

S TICKY I NFORMATION Fabio Verona Bank of Finland, Monetary Policy and Research Department, Research Unit

General Equilibrium and Welfare

Graduate Macroeconomics 2 Problem set Solutions

Competitive Equilibrium

Transcription:

Endogenous Information Choice Lecture 7 February 11, 2015

An optimizing trader will process those prices of most importance to his decision problem most frequently and carefully, those of less importance less so, and most prices not at all. Of the many sources of risk of importance to him, the business cycle and aggregate behavior generally is, for most agents, of no special importance, and there is no reason for traders to specialize their own information systems for diagnosing general movements correctly. Robert E. Lucas (1977)

Endogenous Information Choice Today we will talk mostly about rational inattention Agents can choose what to observe, i.e. choose D and Σ vv in state space system X t = AX t 1 + Cu t Z t = DX t + v t A and C may in equilibrium depend on D and Σ vv For this to be an interesting question, there has to be costs/constraints associated with acquiring more precise information (otherwise D = I and Σ vv = 0)

Rational Inattention What is it? All information is in principle available, but an agent cannot pay attention to all available information Agents have information processing constraints and therefore chooses to observe the most important information Some history of thought: Tools borrowed from information theory (a branch of applied mathematics) Sims (Carnegie-Rochester 1998) introduced it to economists Price setting model of Mackowiak and Wiederholt (AER 2009) probably the highest impact paper using the method

Entropy Entropy is a term from thermodynamics, roughly meaning the (inverse of) degree of order. Shannon (1948) used the term to describe information flows independently of the medium information is transmitted through In this context, entropy means how much information is required on average to describe the outcome of a random variable Entropy is measured in bits

Entropy The entropy H(x) of a random variable x with probability mass function p (x) is given by H (x) = x X p (x) log 2 p (x) The conditional entropy H (x y) H (x y) = p (x, y) log 2 p (x y) x X y Y quantifies how much uncertainty about variable x remains after observing y. If x and y are independent H (x y) = H (x)

Mutual Information The mutual information I (x; y) of x and y is a measure of how much we learn about x given y, and since mutual information is symmetric, i.e. since I (x; y) = I (y; x) it is also how much we learn about y from observing x. Formally, the mutual information of x and y are I (x; y) = H (x) H (x y) = H (y) H (y x) = I (y; x) Mutual information is independent of scale, i.e. is unaffected by units of measurement.

Bits and entropy: Some intuition How many yes-or-no questions are needed on average to determine the value of a random variable? If variable x can take 2 different values, -1 and 1, one question is sufficient. If variable x can take 4 different values, -2, -1, 1 and 2, two questions are sufficient....and so on....but probabilities of outcomes also matter. Entropy is maximized when all outcomes are equally likely

Differential Entropy Differential entropy h(x) is the generalization of entropy to continuous random variables. Differential entropy of the random variable x with density function p (x) is given by h (x) = p (x) log 2 p (x) dx The conditional entropy h (x y) h (x y) = p (x, y) log 2 p (x y) dxdy quantifies how much uncertainty about variable x remains after observing y. If x and y are independent h (x y) = h (x)

Bits, capacity and precision: Some intuition Let s say an agent has a uniform prior about θ U( 100, 100) It takes one bit for to transmit whether θ is positive or negative It takes one more bit for to transmit whether θ is larger or smaller than 50 (-50) It takes one more bit for to transmit whether θ is larger or smaller than 25 (75,-25,-75)...and so on. With infinite capacity, we could partition the states of the world to an infinite precision

Coin flip Coin flip (equal probability binomial) H(x) = 1 ( ) 1 2 ln 1 ( ) 1 2 2 ln 2 = ln(2) where ln(2)=1 if the logarithm is taken w.r.t. base 2.

Uniform Entropy of uniform x U(0, a) is given by a ( ) 1 1 h(x) = 0 a ln dx a ( ) 1 = ln = ln a a The larger the domain, the larger the entropy.

Gaussian entropy The (log) entropy of a Gaussian random vector x N(0, Σ) is given by ln h(x) = n 2 [ln 2π + 1] + 1 ln Σ 2 where n is the dimension of x so the interesting part is the determinant of the covariance matrix Σ

Gaussian entropy For given variances entropy is maximized if x is a vector of uncorrelated variables: [ ] Σ = 1 0 = 1 0 2 0 1 [ ] Σ = 1 a = 1 a 2 a 1 1 < a < 1 = 1 a 2 < 1 Intuition: If a = 1 or a = 1 we can perfectly transmit a two dimensional vector using a one dimensional signal

Gaussian signals and states In economic (Gaussian) applications the constraint often takes the form h (x) h (x z) e 1 2 κ ( n 2 [ln 2π + 1] + 1 ) ( n 2 ln Σ prior 2 [ln 2π + 1] + 1 ) 2 ln Σ post 1 2 κ ln Σ prior ln Σ post κ that is, the decrease in uncertainty is cannot be too large. In a Kalman filter setting, the above constraint would be ln P t t 1 ln P t t κ

The No forgetting constraint Posterior uncertainty cannot be larger than prior uncertainty diag(σ prior ) > diag(σ post ) That is, an agent cannot intentionally increase the variance of the estimation error of one variable in order to get a more precise estimate of another variable.

Rational inattention and entropy in economics Some nice properties: 1. When information processing capacity is large, behavior is close to full information 2. When a decision maker allocates a lot of attention to observing one variable, mistakes in responses to that variable becomes small 3. A decision maker need to allocate more attention to a variable (with given variance) to achieve a given precision if the variable has low serial correlation

A simple example of optimal information choice Utility function FOC: so that U = E [ (1 λ) (a x 1 ) 2 + λ (a x 2 ) 2] : 0 < λ < 1 2 (1 λ) (a E[x 1 ]) + 2λ (a E[x 2 ]) = 0 a = (1 λ) E[x 1 ] + λe[x 2 ] Plugging in a into U gives the expected loss EU = (1 λ) 2 σ 2 1 + λ 2 σ 2 2 where σ1 2 and σ2 2 of x 1 and x 2. are the posterior error variances of the estimates

A simple example of optimal information choice Choose noise in signal Z to maximize expected utility (i.e. minimize expected loss) [ ] x1 Z = + v x 2 subject to ln Σ prior ln Σ post κ

A simple example of optimal information choice For simplicity, we can restrict ourselves to diagonal covariance matrices ln Σ prior ln Σ post κ Σ prior Σ post e κ Σ prior e κ Σ post = σ1σ 2 2 2 Σ prior ( ) σ 2 1 e κ 1 σ2 2 (If Σ prior is diagonal, the optimal Σ post is also diagonal.)

A simple example of optimal information choice Use that inequality will always be binding in an interior solution and plug into expected loss F.o.c. (1 λ) 2 σ 2 1 + λ 2 Σ prior e κ ( σ 2 1 ) 1 (1 λ) 2 λ 2 Σ prior ( ) σ 2 2 e κ 1 = 0 λ (1 λ) Σ prior 1 2 e 1 2 κ = σ1 2 What happens when λ 1? And when λ 0?

Remember the No forgetting constraint! Make sure that by checking that diag(σ prior ) > diag(σ post ) λ (1 λ) Σ prior 1 2 e 1 2 κ < σ 2 1,prior If inequality is violated at marginal condition, set σ 2 1 = σ2 1,prior.

Application: A macro price setting model Mackowiak and Wiederholt (AER 2009): Optimal Sticky Prices under rational Inattention Sets up a macro price setting model with rational inattention Explains large and persistent real effects of shocks to money supply Can match micro data on large magnitude of individual price changes, while aggregate price changes are small

Application: A macro price setting model The optimal price of a good produced by firm i is given by p it = E [p t + ay t + bz it s ti ] (1) where p t is the aggregate price level, y t are aggregate shocks and z jt are firm specific shocks.

Application: A macro price setting model The signal vector s ti is chosen to minimize [ (Xt ) ( ) ] L i = ce X t t Xt X t t c (2) subject to where I (X t ; s ti ) K (3) X t = [ p t y t z it ] (4)

Some important properties of the model Optimal price in increasing in aggregate price level Idiosyncratic shocks are large relative to aggregate shocks Capacity K is set so that pricing errors are quite small Observing endogenous signals (like the price level) also uses up capacity.

Price responses to idiosyncratic and aggregate shocks Figure 1: Impulse responses of an individual price to an innovation in the idiosycratic state variable, benchmark economy 0.12 0.1 Perfect information Rational inattention Response to noise 0.08 0.06 Impulse responses to shocks of one standard deviation 0.04 0.02 0 0.01 0.008 1 3 5 7 9 11 13 15 17 19 21 Figure 2: Impulse responses of an individual price to an innovation in nominal aggregate demand, benchmark economy Perfect information Rational inattention Response to noise at the fixed point with rational inattention 0.006 0.004 0.002 0 1 3 5 7 9 11 13 15 17 19 21 Periods

Pricing errors and the importance of feedback Figure 3: Simulated price set by an individual firm in the benchmark economy 0.6 Rational inattention Profit-maximizing price 0.4 0.2 0-0.2 Log-deviations from the non-stochastic solution -0.4-0.6 0.06 0.04 0.02 10 20 30 40 50 60 70 80 90 Figure 4: Simulated aggregate price level 0-0.02-0.04-0.06 Aggregate price level, rational inattention, benchmark economy Conditional expectation of the aggregate price level, rational inattention, benchmark economy Aggregate price level, perfect information 10 20 30 40 50 60 70 80 90 Periods

Critique, limitations and further issues The rational inattention framework yields nice results, and as a theory of information processing it has many appealing features. However, there are also some unresolved issues: How can the capacity constraint K be calibrated? In engineering, this is usually a physical constraint. What does it mean for humans? Are we Gaussian channels? Can processed information be traded? There appears to be markets for information. How do they fit in? Is it realistic that entropy alone determines how hard a variable is to form an estimate about?

Critique, limitations and further issues Rational inattention makes testable, but as of yet untested, predictions: If policy changes, agents should reallocate attention in a way that is optimal, given the new stochastic environment. This could perhaps be investigated using the monetary policy change in the early 1980 s in the US, or the switch to inflation targeting in several other countries in the early 90 s.

Other approaches Rational inattentiveness: Mankiw and Reis (2002), Reis (2006a,2006b) Agents update information sets infrequently Random (Calvo-type) determination of exactly when updates occur, but frequency can be micro founded with fixed cost of gathering information When agents update, they observe state perfectly Results in hierarchical information sets and keeps solution simple Is it plausible? Actions changes even when information is not updated which implies smooth trajectories with intermittent jumps Since all agents that update in period t will choose the same action, you will have clustered actions which is not what we see in the data But then again, maybe it is a tractable short cut that captures aggregate dynamics well