Slides II - Dynamic Programming

Similar documents
DYNAMIC LECTURE 5: DISCRETE TIME INTERTEMPORAL OPTIMIZATION

ECON 582: Dynamic Programming (Chapter 6, Acemoglu) Instructor: Dmytro Hryshko

Simple Consumption / Savings Problems (based on Ljungqvist & Sargent, Ch 16, 17) Jonathan Heathcote. updated, March The household s problem X

ECOM 009 Macroeconomics B. Lecture 2

In the Ramsey model we maximized the utility U = u[c(t)]e nt e t dt. Now

Competitive Equilibrium and the Welfare Theorems

The representative agent model

Lecture 2 The Centralized Economy: Basic features

An Application to Growth Theory

Topic 2. Consumption/Saving and Productivity shocks

Macroeconomic Theory II Homework 2 - Solution

Dynamic Optimization Problem. April 2, Graduate School of Economics, University of Tokyo. Math Camp Day 4. Daiki Kishishita.

Graduate Macroeconomics 2 Problem set Solutions

Chapter 3. Dynamic Programming

Equilibrium in a Production Economy

Dynamic Optimization Using Lagrange Multipliers

HOMEWORK #1 This homework assignment is due at 5PM on Friday, November 3 in Marnix Amand s mailbox.

Lecture 2 The Centralized Economy

Lecture 9. Dynamic Programming. Randall Romero Aguilar, PhD I Semestre 2017 Last updated: June 2, 2017

1 Jan 28: Overview and Review of Equilibrium

Lecture 6: Discrete-Time Dynamic Optimization

Neoclassical Business Cycle Model

Macroeconomic Theory and Analysis Suggested Solution for Midterm 1

Lecture 4: The Bellman Operator Dynamic Programming

Comprehensive Exam. Macro Spring 2014 Retake. August 22, 2014

Development Economics (PhD) Intertemporal Utility Maximiza

Lecture 7: Stochastic Dynamic Programing and Markov Processes

Lecture 10. Dynamic Programming. Randall Romero Aguilar, PhD II Semestre 2017 Last updated: October 16, 2017

Dynamic Problem Set 1 Solutions

Economics 210B Due: September 16, Problem Set 10. s.t. k t+1 = R(k t c t ) for all t 0, and k 0 given, lim. and

The Real Business Cycle Model

A simple macro dynamic model with endogenous saving rate: the representative agent model

Macroeconomics Qualifying Examination

Stochastic Problems. 1 Examples. 1.1 Neoclassical Growth Model with Stochastic Technology. 1.2 A Model of Job Search

Lecture 3: Hamilton-Jacobi-Bellman Equations. Distributional Macroeconomics. Benjamin Moll. Part II of ECON Harvard University, Spring

Macroeconomics Qualifying Examination

Dynamic Optimization: An Introduction

Macroeconomics Theory II

Lecture 15. Dynamic Stochastic General Equilibrium Model. Randall Romero Aguilar, PhD I Semestre 2017 Last updated: July 3, 2017

University of Warwick, EC9A0 Maths for Economists Lecture Notes 10: Dynamic Programming

Government The government faces an exogenous sequence {g t } t=0

Chapter 4. Applications/Variations

Lecture 3: Dynamics of small open economies

Uncertainty Per Krusell & D. Krueger Lecture Notes Chapter 6

Permanent Income Hypothesis Intro to the Ramsey Model

1. Using the model and notations covered in class, the expected returns are:

slides chapter 3 an open economy with capital

problem. max Both k (0) and h (0) are given at time 0. (a) Write down the Hamilton-Jacobi-Bellman (HJB) Equation in the dynamic programming

Optimization, Part 2 (november to december): mandatory for QEM-IMAEF, and for MMEF or MAEF who have chosen it as an optional course.

ADVANCED MACROECONOMICS 2015 FINAL EXAMINATION FOR THE FIRST HALF OF SPRING SEMESTER

ECON607 Fall 2010 University of Hawaii Professor Hui He TA: Xiaodong Sun Assignment 2

Economics 2010c: Lectures 9-10 Bellman Equation in Continuous Time

Dynamic Programming Theorems

Notes on Alvarez and Jermann, "Efficiency, Equilibrium, and Asset Pricing with Risk of Default," Econometrica 2000

Basic Deterministic Dynamic Programming

Lecture 2. (1) Aggregation (2) Permanent Income Hypothesis. Erick Sager. September 14, 2015

Asymmetric Information in Economic Policy. Noah Williams

Suggested Solutions to Homework #3 Econ 511b (Part I), Spring 2004

Macroeconomic Theory and Analysis V Suggested Solutions for the First Midterm. max

HOMEWORK #3 This homework assignment is due at NOON on Friday, November 17 in Marnix Amand s mailbox.

1 The Basic RBC Model

ADVANCED MACROECONOMIC TECHNIQUES NOTE 3a

Lecture 5: Some Informal Notes on Dynamic Programming

Foundations of Modern Macroeconomics B. J. Heijdra & F. van der Ploeg Chapter 6: The Government Budget Deficit

Advanced Macroeconomics

Macro 1: Dynamic Programming 1

Fluctuations. Shocks, Uncertainty, and the Consumption/Saving Choice

Optimization Over Time

Neoclassical Growth Model / Cake Eating Problem

Lecture 4 The Centralized Economy: Extensions

Lecture 2. (1) Permanent Income Hypothesis (2) Precautionary Savings. Erick Sager. February 6, 2018

STATE UNIVERSITY OF NEW YORK AT ALBANY Department of Economics

Dynamic (Stochastic) General Equilibrium and Growth

Lecture 6: Recursive Preferences

Foundations of Modern Macroeconomics Second Edition

Economics 2010c: Lecture 2 Iterative Methods in Dynamic Programming

Lecture Notes. Econ 702. Spring 2004

Dynamic optimization: a recursive approach. 1 A recursive (dynamic programming) approach to solving multi-period optimization problems:

Monetary Economics: Solutions Problem Set 1

Advanced Economic Growth: Lecture 21: Stochastic Dynamic Programming and Applications

1 Bewley Economies with Aggregate Uncertainty

Economic Growth: Lecture 13, Stochastic Growth

ECON 2010c Solution to Problem Set 1

Lecture notes for Macroeconomics I, 2004

Basic Techniques. Ping Wang Department of Economics Washington University in St. Louis. January 2018

Dynamic stochastic game and macroeconomic equilibrium

Lecture 6: Competitive Equilibrium in the Growth Model (II)

UNIVERSITY OF VIENNA

G Recitation 3: Ramsey Growth model with technological progress; discrete time dynamic programming and applications

Macroeconomics: A Dynamic General Equilibrium Approach

Session 4: Money. Jean Imbs. November 2010

Lecture 4: Dynamic Programming

Learning to Optimize: Theory and Applications

Introduction to Recursive Methods

Dynamic Macroeconomic Theory Notes. David L. Kelly. Department of Economics University of Miami Box Coral Gables, FL

MA Advanced Macroeconomics: Solving Models with Rational Expectations

Lecture 1: Dynamic Programming

Eco504 Spring 2009 C. Sims MID-TERM EXAM

(a) Write down the Hamilton-Jacobi-Bellman (HJB) Equation in the dynamic programming

Introduction to Numerical Methods

Transcription:

Slides II - Dynamic Programming Julio Garín University of Georgia Macroeconomic Theory II (Ph.D.) Spring 2017 Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 1 / 32

Outline 1. Lagrangian Methods and Intertemporal Optimization 1.1 Endowment Economy 1.1.1 Model I: No assets, and no storage 1.1.2 Model II: Riskless interest-bearing asset 1.2 Production Economy 1.2.1 Model III: A Robinson Crusoe Economy 1.2.2 Model IV: Productive Capital and Riskless Asset 2. Dynamic Programming 2.1 Basics 2.2 Finite Horizon 2.2.1 A simple three-period deterministic model 2.2.2 Generalized T 2.3 Infinite Horizon 2.3.1 Principle of Optimality 2.3.2 Operators and Blackwell s sufficient conditions 2.4 Solution Techniques 2.4.1 Value function iteration 2.4.2 Guess and verify 2.4.3 Policy function iteration Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 2 / 32

Review of Two-Period Intertemporal Optimization Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 3 / 32

Model Overview Agents: Forward looking. Rational and their behavior is optimizing. Preferences: Lifetime utility: U = u(c) + βu(c ) Two periods is unrealistic but useful. Slope of the indifference curve. If u cc < 0: Marginal utility is decreasing in consumption. Quasi-concave utility implies convex preferences. Agent would like to smooth consumption. 4 Technology Scenarios: 1. Endowment economy, no assets. 2. Endowment economy, riskless, interest-bearing asset. 3. Robinson Crusoe economy. 4. Productive capital and riskless asset. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 4 / 32

Model I: Endowment Economy, no Assets, and no Storage Assume no assets and no storage. What is consumption equal to? Technology. Production. For now, we will look at an endowment economy. Stream of income given by y and y. How can output be transferred between periods? We start with a one-period riskless bond. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 5 / 32

Model II: Endowment Economy with Riskless Asset Now households have access to a riskless asset, a, that pays r. Amount borrowed limited by a. Household s problem given by: subject to: max U = u(c) + {c,c,a} βu(c ) We can solve using Lagrangian methods. Importance of the Euler equation. a = y c (1) c = (1 + r)a + y (2) a a (3) Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 6 / 32

Model III: A Robinson Crusoe Economy Now, instead of a riskless asset, households can move resources forward in time by buying productive capital. Increasing and concave production function: y = f (k). Household s problem becomes: subject to: max U = u(c) + {c,c,k } βu(c ) c + k = f (k) + (1 δ)k (4) c = f (k ) + (1 δ)k (5) k is given (6) Again, we can use Lagrangian methods and obtain the Euler. Now the rate of return is f k + 1 δ. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 7 / 32

Model IV: Productive Capital and Riskless Asset The agent faces an asset allocation problem. The problem is given by: subject to: max U = u(c) + {c,c,a,k } βu(c ) c + k + a = f (k) + (1 δ)k (7) c = f (k ) + (1 δ)k + (1 + r)a (8) k is given (9) Using Lagrangian methods, in addition to the standard Euler we obtain. f k + 1 δ = 1 + r. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 8 / 32

Dynamic Programming Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 9 / 32

Why Dynamic Programming? [...] What title, what name, could I choose? In the first place I was interested in planning, in decision making, in thinking. But planning, is not a good word for various reasons. I decided therefore to use the word programming. I wanted to get across the idea that this was dynamic, this was multistage, this was time-varying I thought, let s kill two birds with one stone. Let s take a word that has an absolutely precise meaning, namely dynamic, in the classical physical sense. It also has a very interesting property as an adjective, and that is it s impossible to use the word dynamic in a pejorative sense. Try thinking of some combination that will possibly give it a pejorative meaning. It s impossible. Thus, I thought dynamic programming was a good name. It was something not even a Congressman could object to. So I used it as an umbrella for my activities. Richard Bellman Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 10 / 32

Basics Ultimately we would like to think about multi-period models... or even infinite horizon models. Why? Suppose we have the following problem for a household: subject to: max U 0 = {c t,b t β t u(c t ) } t=0 t=0 c t + b t+1 1 + r = b t + y t for t = 0, 1, 2,... b 0 0 given Now bonds are discounted bonds. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 11 / 32

Why Do We Need Another Tool? For a rational, forward-looking household, consumption/saving decisions are intertemporally related. We could use Lagrangian methods to solve this intertemporal optimization problem. System of many non-linear equations in many unknowns. Hard, but feasible. Dynamic programming helps us to transform this problem into something more tractable, especially for stochastic models. We will start with a three-period problem. Think about problems recursively. Introduce the lingo. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 12 / 32

Three-Period Consumption/Saving Problem The problem is: max U 0 = u(c t ) + βu(c t+1 ) + β 2 u(c t+2 ) {c t+j,b t+j +1} 3 j=0 subject to: c t + b t+1 1 + r = b t + y t c t+1 + b t+2 1 + r = b t+1 + y t+1 c t+2 + b t+3 1 + r = b t+2 + y t+2 b t 0 given Let s assume for now that y t+j = 0 for j = 0, 1, 2. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 13 / 32

Solving the Three-Period Problem: t + 2 We solve the problem backwards. Today s decisions are not independent of future outcomes. The Principle of Optimality. At period t + 2 the problem is: subject to: Since u > 0 b t+3 = 0 and max U 0 = u(c t+2 ) {c t+2,b t+3 } c t+2 + b t+3 1 + r = b t+2 c t+2 f t+2 (b t+2 ) = b t+2 Given the state variable, the policy function, f, tells the optimal value for the choice variable. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 14 / 32

Solving the Three-Period Problem: Value Function We can define V t+2 (b t+2 ) U t+2. The value function, V t+2 (b t+2 ), is defined by the following functional equation: subject to: V t+2 (b t+2 ) = max u(c t+2) {c t+2,b t+3 } c t+2 + b t+3 1 + r = b t+2 The transition equation maps, for a given value of the choice variable, the state variable in one period into the next period s state variable. Note what V (b t+2 ) is equal to. We will come back to this. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 15 / 32

Solving the Three-Period Problem: t + 1 Let s keep working backwards: max U t+1 = u(c t+1 ) + βv t+2 (b t+2 ) {c t+1,b t+2 } subject to: c t+1 + b t+2 1 + r = b t+1 The continuation value is V t+2 (b t+2 ) and is taken as given. Decisions today affect future via changes in state variables, not via changes in future policy functions. This time consistency of decision rules is known as the Principle of Optimality. We can transform this problem into an unconstrained one. What is the control variable? What is the state variable? Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 16 / 32

The F.O.C. for the unconstrained problem is: ( u b t+1 b ) t+2 = β(1 + r)v 1 + r t+2(b t+2 ) Intuition? Given b t+1 we have b t+2 = g t+1 (b t+1 ) and from there we can solve for the optimal consumption from the budget constraint: c t+1 = f t+1 (b t+1 ) Note that these policy functions are time-dependent. Write down the value function representation. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 17 / 32

Solving the Three-Period Problem: t subject to: max U 0 = u(c t ) + βv t+1 (b t+1 ) c t,b t+1 c t + b t+1 1 + r = b t The F.O.C. looks familiar. From there we obtain b t+1 and c t. While we solve backwards in time, we now can use the optimal policy functions to move forward, starting from an initial b t. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 18 / 32

Method Comparison: Lagrange vs. Dynamic Programming We will compare the methods with a very simple utility model. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 19 / 32

Simple Model Method Comparison: Lagrange Method Let s solve: max U 0 = ln(c 0 ) + β ln(c 1 ) + β 2 ln(c 2 ) {c 0,c 1,c 2,b 1,b 2 } subject to: c 0 + b 1 1 + r = b 0 c 1 + b 2 1 + r = b 1 c 2 + b 3 1 + r = b 2 b 0 0 given Note that the solution is in recursive form. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 20 / 32

Simple Model Method Comparison: Dynamic Programming As before, we use backward induction. We know what will happen in period 2. We start with subject to: max ln(c 1) + βv 2 (b 2 ) {c 1,b 2 } c 1 + b 2 1 + r = b 1 From there, let s go to period 1 and 0. Clearly, f i = f i and g i = g i for i = 0, 1. Consistent with Principle of Optimality. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 21 / 32

From Period t to t + T If T, the problem subject to: can be expressed as T V t (b t ) = β i u(ct+i) i=0 = max u(c t ) + βv t+1 (b t+1 ) c t,b t+1 c t + b t+1 1 + r = b t subject to: V (b t ) = max c t,b t+1 u(c t ) + βv (b t+1 ) c t + b t+1 1 + r = b t Value functions and policy functions will be time-autonomous. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 22 / 32

Transversality Condition In the finite horizon we implicity ruled out dying with debt. Now we have a similar condition: transversality condition. Let s put the income process back into the problem. After some work, we find that the condition is given by A relatively weak condition. ( ) 1 n lim b t+n = 0 n 1 + r If we let n and we impose the transversality condition we have ( ) 1 i ( ) 1 i c t+i = b t + y t+i i=0 1 + r 1 + r i=0 Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 23 / 32

A General Version We have the problem subject to: max {u t } t=0 t=0 β t r(x t, u t ) x t+1 = g(x t, u t ) x 0 given r(x t, u t ) is a concave return or payoff function. What are the state and the control variables? Condition on transition equation: It must represent a convex and compact set. Mapping a known model into this problem. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 24 / 32

If we didn t know dynamic programming we would have to find the optimal infinite sequence of choice variables. What about a stochastic setting? If we could solve for that sequence, then we could obtain V (x 0 ). Dynamic programming offers an alternative: We seek the optimal time-invariant policy function. With the transition equation, this gives us the sequence {ut } t=0. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 25 / 32

If we knew V (x), we could find h(x) by solving: subject to: max u,x r(x, u) + βv (x ) x = g(x, u) For each x, compute h(x). No time subscripts. If we knew h(x) we could solve the following for V (x): V (x) = r(x, h(x)) + βv (g(x, h(x))) For each x, compute V (x). However... We don t know h(x) nor V (x)! Have to solve for them simultaneously. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 26 / 32

We will solve for h(x) and V (x) simultaneously, by solving the Bellman equation: V (x) = max r(x, u) + βv (g(x, u)) u How do we know that the recursive solution to this problem is the same as the solution to the sequential problem? Under appropriate assumption the operator in the right side is a contraction mapping. The V that solves the Bellman equation is a fixed point of the operator. When is an operator a contraction mapping? Blackwell s sufficient conditions. Why do we care? When operators are contractions, repeated application of the operator will converge to the fixed point. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 27 / 32

Properties of the Bellman Equation Under standard assumptions: 1. There is a unique solution, V, to the Bellman equation and it is strictly concave. 2. As j, V is approached in the limit by iterating on V j+1 (x) = max r(x, u) + βv j (g(x, u)) u 3. There is a unique and time-invariant policy function. 4. Benveniste-Scheinkman envelope condition. V (x) is differentiable. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 28 / 32

Stochastic Dynamic Programming Not too much changes. Add expectation operator. Might add another state variable. The stochastic element may become a state variable. Example: stochastic growth model with persistent productivity shocks: subject to: V (k, z) = max {c,k } u(c) + β E z z V (k, z ) k + c = zf (k) z would be a state variable even if it is i.i.d. What if we had a growth model with stochastic and i.i.d. depreciation? Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 29 / 32

Solution Techniques: Value Function Iteration Inspired on 2. 1. Make an initial guess about V, call it V i. 2. Use this guess on the r.h.s. of the Bellman equation. 3. Maximizing on the r.h.s. yields new V, call it V i+1 4. Use this new V i+1 as the V i on the r.h.s. in the next iteration. 5. Iterate until V i and the resulting V i+1 are nearly equal. This rarely works in closed form. When solving numerically, the key is how to represent functions like V. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 30 / 32

Solution Techniques: Guess and Verify Guess functional form and verify that it solves the Bellman equation. Only works with problems that have a closed-form solution. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 31 / 32

Solution Techniques: Policy Function Iteration Also known as Howard s Improvement Algorithm. Opposite to VFI. 1. Start with a guess for the policy function, h(x). 2. Calculate the value of following h(x) forever. 3. Calculate new policy function by maximizing r.h.s. of the Bellman with the V from the previous step. 4. Iterate until convergence. There are other techniques as well. Macroeconomic Theory II Slides II - Dynamic Programming Spring 2017 32 / 32