Lecture Notes: Estimation of dynamic discrete choice models

Similar documents
Lecture Notes: Estimation of dynamic discrete choice models

Estimating Single-Agent Dynamic Models

Estimating Single-Agent Dynamic Models

Lecture Notes: Brand Loyalty and Demand for Experience Goods

Recent Developments in Empirical IO: Dynamic Demand and Dynamic Games

Dynamic Discrete Choice Structural Models in Empirical IO

Lecture 10 Demand for Autos (BLP) Bronwyn H. Hall Economics 220C, UC Berkeley Spring 2005

Lecture notes: Rust (1987) Economics : M. Shum 1

Estimating Dynamic Programming Models

Dynamic Discrete Choice Structural Models in Empirical IO. Lectures at Universidad Carlos III, Madrid June 26-30, 2017

Revisiting the Nested Fixed-Point Algorithm in BLP Random Coeffi cients Demand Estimation

Consumer heterogeneity, demand for durable goods and the dynamics of quality

ECO 2901 EMPIRICAL INDUSTRIAL ORGANIZATION

Empirical Industrial Organization (ECO 310) University of Toronto. Department of Economics Fall Instructor: Victor Aguirregabiria

A Practitioner s Guide to Bayesian Estimation of Discrete Choice Dynamic Programming Models

A Computational Method for Multidimensional Continuous-choice. Dynamic Problems

A Dynamic Model of Service Usage, Customer Satisfaction, and Retention

Dynamic Random Utility. Mira Frick (Yale) Ryota Iijima (Yale) Tomasz Strzalecki (Harvard)

Single Agent Dynamics: Dynamic Discrete Choice Models

Bresnahan, JIE 87: Competition and Collusion in the American Automobile Industry: 1955 Price War

Numerical Methods-Lecture XIII: Dynamic Discrete Choice Estimation

(a) Write down the Hamilton-Jacobi-Bellman (HJB) Equation in the dynamic programming

Dynamic Structural Models of Industrial Organization

Estimation of demand for differentiated durable goods

Lecture 3 Unobserved choice sets

Demand in Differentiated-Product Markets (part 2)

Estimation of Static Discrete Choice Models Using Market Level Data

Price Discrimination through Refund Contracts in Airlines

Nevo on Random-Coefficient Logit

Constrained Optimization Approaches to Estimation of Structural Models: Comment

Real Business Cycle Model (RBC)

A Robust Approach to Estimating Production Functions: Replication of the ACF procedure

Product innovation and adoption in market equilibrium: The case of digital cameras.

Lecture 7: Stochastic Dynamic Programing and Markov Processes

Estimating the Pure Characteristics Demand Model: A Computational Note

The BLP Method of Demand Curve Estimation in Industrial Organization

An empirical model of firm entry with endogenous product-type choices

Competition Between Networks: A Study in the Market for Yellow Pages Mark Rysman

Berry et al. (2004), Differentiated products demand systems from a combination of micro and macro data: The new car market

Getting to page 31 in Galí (2008)

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

Small Open Economy RBC Model Uribe, Chapter 4

Oblivious Equilibrium: A Mean Field Approximation for Large-Scale Dynamic Games

1 The Basic RBC Model

Lecture XI. Approximating the Invariant Distribution

exp(v j=) k exp(v k =)

Linear IV Regression Estimators for Structural Dynamic Discrete Choice Models

Online Appendix for Slow Information Diffusion and the Inertial Behavior of Durable Consumption

Solving a Dynamic (Stochastic) General Equilibrium Model under the Discrete Time Framework

Consistency and Asymptotic Normality for Equilibrium Models with Partially Observed Outcome Variables

Are Marijuana and Cocaine Complements or Substitutes? Evidence from the Silk Road

Taylor Rules and Technology Shocks

SEQUENTIAL ESTIMATION OF DYNAMIC DISCRETE GAMES. Victor Aguirregabiria (Boston University) and. Pedro Mira (CEMFI) Applied Micro Workshop at Minnesota

Next, we discuss econometric methods that can be used to estimate panel data models.

Uncertainty Per Krusell & D. Krueger Lecture Notes Chapter 6

1 Bewley Economies with Aggregate Uncertainty

Mini Course on Structural Estimation of Static and Dynamic Games

A Model of Human Capital Accumulation and Occupational Choices. A simplified version of Keane and Wolpin (JPE, 1997)

Lecture 2. (1) Permanent Income Hypothesis (2) Precautionary Savings. Erick Sager. February 6, 2018

1 Introduction to structure of dynamic oligopoly models

A Note on Demand Estimation with Supply Information. in Non-Linear Models

Lecture 4 The Centralized Economy: Extensions

problem. max Both k (0) and h (0) are given at time 0. (a) Write down the Hamilton-Jacobi-Bellman (HJB) Equation in the dynamic programming

Lecture 14 More on structural estimation

Identification of Consumer Switching Behavior with Market Level Data

Lecture 5: Estimation of a One-to-One Transferable Utility Matching Model

Lecture 15. Dynamic Stochastic General Equilibrium Model. Randall Romero Aguilar, PhD I Semestre 2017 Last updated: July 3, 2017

The Lucas Imperfect Information Model

A Note on Bayesian analysis of the random coefficient model using aggregate data, an alternative approach

Dynamic stochastic game and macroeconomic equilibrium

Supplement to The cyclical dynamics of illiquid housing, debt, and foreclosures (Quantitative Economics, Vol. 7, No. 1, March 2016, )

1 Differentiated Products: Motivation

1 Why demand analysis/estimation?

Optimal Replacement of GMC Bus Engines: An Empirical Model of Harold Zurcher. John Rust

Graduate Macroeconomics 2 Problem set Solutions

1 Hotz-Miller approach: avoid numeric dynamic programming

The Small-Open-Economy Real Business Cycle Model

14.461: Technological Change, Lecture 4 Competition and Innovation

Field Course Descriptions

Dynamic Models with Serial Correlation: Particle Filter Based Estimation

Do preferences for private labels respond to supermarket loyalty programs?

Industrial Organization II (ECO 2901) Winter Victor Aguirregabiria. Problem Set #1 Due of Friday, March 22, 2013

slides chapter 3 an open economy with capital

R&D Investment, Exporting, and Productivity Dynamics

Empirical Industrial Organization: Models, Methods, and Applications. Victor Aguirregabiria

The Dark Corners of the Labor Market

Part A: Answer question A1 (required), plus either question A2 or A3.

Session 4: Money. Jean Imbs. November 2010

Productivity Losses from Financial Frictions: Can Self-financing Undo Capital Misallocation?

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo

An Instrumental Variable Approach to Dynamic Models

The Real Business Cycle Model

Signal Extraction in Economics

ECOM 009 Macroeconomics B. Lecture 3

MKTG 555: Marketing Models

Department of Agricultural & Resource Economics, UCB

Specification Test on Mixed Logit Models

(Behavioral Model and Duality Theory)

Identifying Dynamic Games with Serially-Correlated Unobservables

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:

Transcription:

Lecture Notes: Estimation of dynamic discrete choice models Jean-François Houde Cornell University November 7, 2016 These lectures notes incorporate material from Victor Agguirregabiria s graduate IO slides at the University of Toronto (http : //individual.utoronto.ca/vaguirre/courses/eco2901/teaching io toronto.html). 1

Key references: Pesendorfer (2002) Demand for storable goods Erdem, Imai, and Keane (2003) Hendel and Nevo (2006a) and Hendel and Nevo (2006b) If consumers can stockpile a storable good, they can benefit from sales : purchase more than consumption when prices are low. This creates a difference between purchasing and consumption elasticity In the short-run, store-level demand elasticity reflect stockpiling behavior In the long-run, store-level demand elasticity reflects consumption decisions Ignoring stockpiling and forward-looking behavior can lead to biases: own and cross price elasticities. If consumers differ in the storage costs firms have an incentive to price discriminate: PD theory of sales. Ketchup pricing example (Pesendorfer 2002): 2

Hendel and Nevo (2006a): Estimate an inventory control model with product differentiation Two challenges: (i) inventories are unobserved (key state variable), and (ii) differentiation creates a dimensionality problem Propose a simplifying assumption that: (i) reduce the computation burden of the model (reduce the dimension), and (ii) simplify the estimation (three-step algorithm). Data: Scanner data: Household panel with purchasing decisions from 1991-1993 Firms: 9 supermarkets (ignore supermarket choice) Store-level data: choice-set and product attributes (detergent), promotions, and prices 3

Market shares and sales: 4

Descriptive evidence (Hendel and Nevo 2006b) Issue: Inventories are unobserved Duration since previous sale promotion is positively correlated with aggregate quantity purchased Proxy for inventory cost (e.g. house size, dog) are negatively correlated with propensity to buy on sale 5

6

7

Model setup: J differentiated brands Weekly decisions: consumption c t, purchases quantity x t {0, 1, 2, 3, 4}, and brand j t Utility function: U(c t, v t, i t+1 ) = γ ln(c t + v t ) [ ] δ 1 i t+1 + δ 2 i 2 t+1 +m }{{} t Inventory cost where m t = j,x d jxt(βa jxt αp jxt + ξ jxt + ɛ jxt ) is the indirect utility from purchasing the outside good plus the value of the chosen brand. Inventory transition: i t+1 = i t c t + x t Note: The function m t implies that taste for brand only enters the purchasing stage, no the consumption stage. That is, the utility of consumption does not depend on the mix of brands in storage. State variables: s t = {i t, v t, p t, a t, ξ t, ɛ t }. Two challenges: 1. Unobserved state variable: i t is unobserved to the econometrician. 2. Curse of dimensionality: {p t, a t, ξ t, ɛ t } is 4 5 J dimension matrix. Note: In Rust (1987), we also had unobserved state variables (i.e. logit errors). However, we assumed that it was conditionally independent of the observed state and separately additive, which allowed us to work with the Emax function. Not the case here: i t is serially correlated and nonseparable. 8

Model choice-probability: Pr(d jxt = 1 s t ) = Where, exp (βa jxt αp jxt + ξ jxt + M(x, s t )) x,j exp (βa j x t αp j x t + ξ j x t + M(x, s t )) M(x, s t ) = max γ ln(c + v t ) [ δ 1 i t+1 + δ 2 it+1] 2 + βe [V (st+1 ) x, c, s t ] c s.t. i t+1 = i t c + x Note that conditional on purchasing size x, the brand choice is a static problem: Pr(d jt = 1 s t, x t = x) = = exp (βa jxt αp jxt + ξ jxt + M(x, s t )) j exp (βa j xt αp j xt + ξ j xt + M(x, s t )) exp (βa jxt αp jxt + ξ jxt ) j exp (βa j xt αp j xt + ξ j xt) This probability can be estimated by MLE: Multinomial logit model. This is the first-step of the estimation procedure. Endogenous prices and advertising? ξ jxt = ξ jx. Brand/size fixed-effects, such that 9

How to deal with the dimensionality problem? Since the brand-choice is static, we can write down the expected utility (i.e. Emax) conditional on size: ( ) ω t (x) = E max βa jxt αp jxt + ξ jxt + ɛ jt j J = ln exp (βa jxt αp jxt + ξ jxt ) j=1 McFadden (1978) calls ω t (x) the inclusive values. It summarizes the information contained in the vector of prices and characteristics available in period t, very much like a quality-adjusted price index. Dynamic programming problem re-defined: V (i t, v t, ω t, ɛ t ) = max γ ln(c v t) C(i t+1 ) + ω t (x) + ɛ xt c,x +βe [V (i t+1, v t+1, ω t+1, ɛ t+1 ) i t, v t, ω t, ɛ t, x, c)] To be able to define the state-vector solely as (i t, v t, ω t, ɛ t ), we need to impose an addition assumption on the transition process of ω t : Pr(ω t+1 a t, p t, ξ t ) = Pr(ω t+1 ω t ) This assumption as been called the Inclusive-value sufficiency assumption (Gowrisankaran and Rysman 2012). 10

Step 2: Estimate the Markov process for ω t (x) 11

Step 3: Dynamic decisions Conditional on buying size x, the optimal consumption path solves the following continuous DP problem: v(x s t ) = max c 0 γ ln(c v t ) C(i t c + x) + βe [V (s t+1 ) s t, x, c)] and the value function takes the familiar form: ( 4 ) V (s t ) = ln exp(v(x s t )) x=0 The CCP used in the estimation of the dynamic purchasing decision model is: Pr(x it s it ) = exp(v(x it s it )) x exp(v(x s it )) Remaining problem: i t is unobserved. Solution: Integrate out the unobserved initial inventory levels. How? Let i i0 = 0. Simulate the model for the first T 0 weeks of the data Record inventory level at T 0 : i T0 (i.e. initial state variable) Evaluate the likelihood from weeks T 0 to T, as if inventory levels were observed Repeat the process S times, and average the likelihood contribution of individual i. Implicit assumption: Stationary process at time T 0. 12

13

Two differences between static and dynamic model: 1. Static model implies larger price coefficient, 2. and ignores the inventory problem Both lead to a larger own price elasticity with the static model, than the long-run own price elasticity with the dynamic model. Point two leads to a lower cross price elasticity (with the static model). Why? In the data the response to sales is mostly coming from people going from not buying to buying the brand on sale. This leads to predict small cross price elasticities with respect to other products, and large cross price elasticity with respect to the outside good. Or more mechanically, in the static model, the choice probability are all relatively small since consumers buy detergent infrequently. The logic cross-partials are equal to the product of the two choice-probabilities: small cross-elasticities. The dynamic model rationalize this phenomenon high unobserved inventories, and conditional choice probabilities (conditional on purchasing). Everybody needs detergent... 14

Demand for durable goods Motivation: In many settings, the timing of purchase is as important as what you purchase. In durable goods markets, this is true because the quality and price of products available at any point in time vary less, than the quality and price of products offer across time. Why? Introduction of new products push the technology frontier upwards, and most firms often respond by using dynamic pricing strategy (i.e. discount products that are about to be discontinued). Challenges: (i) Micro data on product replacement is rare, and (ii) dynamic engine replacement models with product differentiation have very large state space. Examples: Gordon (2010), Gowrisankaran and Rysman (2012) 15

Model: Gowrisankaran and Rysman (2012) Indirect utility over product characteristics (omit time): U ij = β i x j + ξ j + ε ij = δ j + µ ij + ε ij = u ij + ε ij (1) Utility over current product: U i0 = β i x 0 + ε i0 = u i0 + ε i0 (2) Note: product 0 is either the outside option (not using a camcorder), or the last camcorder purchased. GR assume that products do not depreciate, and there are no sunk replacement costs. Bellman equation: Multinomial choice: a i A i = {0, 1,..., J}. Where J is the current number of products available. Common state-space: s i = {u i0, x 1,..., x J, p 1,..., p J } Expected Bellman equation: ( V i (s i ) = E ε [max u i,a α i p ia + ε ia + βe s Vi (s ) s i, a) )] a A i { }] = E ε [max v i (0 s i ) + ε i0, max v i(a s i ) + ε ia a A\0 where p ia = 0 if a = 0, and v i (a s i ) = u ia α i p ia + βe s ( Vi (s ) s i, a) ). Curse of dimensionality: (i) The number of grid points to approximate V i (s i ) grows exponentially with the number of products, and (ii) The calculation of the continuation value involves a s dimension integral (i.e. E s (V x)). 16

Dimension reduction assumption: Recall that if ε ij is distributed according to a type-1 extreme value distribution with location/spread parameters (0, 1), then max a A\0 v i (a x) + ε ia is also EV1 distributed with location parameter: ω i (s i ) = log exp(v i (a s i )) and spread parameter of 1. a A\0 ω i (s i ) is the expected discounted value of buying the preferred camcorder available today, instead of holding to the current one (i.e. outside option). Therefore, we can rewrite the expected Bellman equation as: V i (s i ) = E ε [max {v i (0 s i ) + ε i0, ω i (s i ) + ε i1 }] = log (exp(v i (0 s i )) + exp(ω i (s i ))) So far, we re simply re-writing the problem so that it looks more like a sequential decision problem: (i) replace or not replace, and (ii) which new product to buy conditional on replacing. The value of replacing is ω i (s i ). In order to reduce the dimension of the state space, GR introduce a new assumption: Inclusive value sufficiency: If ω i (s i ) = ω i ( s i ), then g ω (ω i (s ) s i ) = g ω (ω i (s ) s i )). Where g ω (x s i ) is the density of the option value ω. Importantly, this implies that if two different states have the same option value ω, then they also have the same value function. In addition, we assume that ω i evolves over time according to an AR(1) process: ω i,t+1 = ρ i,0 + ρ i,1 ω i,t + η i,t+1 where η i,t is mean zero shock. 17

The IVS assumption, means that consumers can use the scalar ω to forecast the future and evaluate the value relative value of buying now versus waiting. New sufficient state space: s i = (u i0, ω i ). New expected value function: V i (u i0, ω i ) = E ε [ max { ui0 + ε i0 + βe ω [ V i (u i0, ω ) ω i ], ω i + ε i1 }] = log ( exp(u i0 + βe ω [ V i (u i0, ω ) ω]) + exp(ω i ) ) = Γ i (u i0, ω i V i ) where Γ i ( V i ) is a contraction mapping from V i to V i. 18

Equilibrium conditions: 1. Rational expectation: 2. Bellman equation: ω i,t+1 = ρ i,0 + ρ i,1 ω i,t + η i,t+1 V i (u i0, ω i ) = log ( exp(u i0 + βe ω [ V i (u i0, ω ) ω]) + exp(ω i ) ) 3. Consistency of the inclusive value: ω = log exp(u i,a + βe ω [ V i (u i,a, ω ) ω] 4. Aggregate market share: a A\0 ŝ jt = s jt (δ, θ) = 1 S i j 0 P j (u i,j0, ω it α i, β i )w j0 (α i, β i ) where P j (u i,j0, ω it β i, α i ) is the probability of choosing option j conditional on holding option j 0 last period, w j0 (α i, β i ) is the share of consumers of types (α i, β i ) holding option j 0 in period t 1, and S is the number of simulated consumer. In order to estimate θ, we must minimize the following GMM criterium function subject to the equilibrium conditions: min θ s.t. (ξ T Z)Ω 1 (ξ T Z) T δ jt = X jt + ξ jt = s 1 jt (ŝ, θ) ω i,t+1 = ρ i,0 + ρ i,1 ω i,t + η i,t+1 V i (u i0, ω i ) = Γ i (u i0, ω i V ) = log ( exp(u i0 + βe ω [ V i (u i0, ω ) ω]) + exp(ω i ) ) ω i = log exp(u i,a + βe ω [ V i (u i,a, ω i) ω i ] a A\0 19

Solution steps: At every candidate parameter θ 1. Initialization: Sample time-invariant S consumer types: β i State-space grid: (u 0, ω) 2. Outer-loop: Solve δ jt = s 1 jt (ŝ, θ) (a) Starting values: δ 0 jt (b) Inner-loop: Solve transition parameters (ρ i,0, ρ i,1 ), and Bellman equation V i (u 0, ω) 0 i. Initial guesses: V i (u o, ω) and (ρ 0 i,0, ρ0 i,1 ), for all i = 1,..., S ii. Solve inclusive value fixed-point for each t and i: ω i,t = log exp(u i,a + βe ωt+1 [ V i 0 (u i,a, ω t+1 ) ω i,t ] a A\0 iii. Estimate AR(1) process (OLS): ω i,t+1 = ρ 1 i,0 + ρ1 i,1 ω i,t + η i,t+1 iv. Update value function: V i 1 (u 0, ω) = log ( exp(u 0 + βe ω [ V i 0 (u 0, ω ) ω]) + exp(ω) ) v. Convergence check: V 1 V 0 < ɛ 1, ρ 1 i,0 ρ0 i,0 < ɛ 2 and ρ 1 i,1 ρ 0 i,1 < ɛ 3 (c) Calculate predicted market shares: ŝ jt = s jt (δ 0, θ) = 1 P j (u i,j0, ω it α i, β i )w j0 (α i, β i ) S i j 0 where P j (u i,j0, ω it α i, β i ) = exp(v 0 (u i,j0 ))/ (exp(ω i,t ) + exp(v 0 (u i,j0 ))) if j = j 0, and P j (u i,j0, ω it α i, β i ) = exp(ω i,t ) exp(ω i,t ) + exp(v 0 (u i,j0 )) exp(v j,t(u jt, ω t )) exp(ω i,t ) 20

Applications: Video-camcorder market between 2000 and 2006 Main data-set: Market shares and characteristics of 383 models and 11 brands, from March 2000 to May 2006 Auxiliary data: Aggregate penetration and new-sales rates by years 21

Estimation results Implied elasticities: A market-wise temporary price increase of 1% leads to: A contemporaneous decrease in sales of 2.55 percent Most of this decrease is due to delayed purchases: 44 percent of the decrease in sales is recaptured over the following 12 months. The same market-wise permanent price increase leads to a 1.23 percent decrease in demand (permanent). The difference is more modest when we consider a product-level price increase: 2.59 percent versus 2.41 percent (for the leading product in 2003). Why? Consumers substitute to competing brands when the change is permanent in about the same magnitude as the delayed response when the price change is temporary. 22

References Erdem, T., S. Imai, and M. P. Keane (2003). Brand and quantity choice dynamics under price uncertainty. Quantitative Marketing and Economics 1, 5 64. Gordon, B. (2010, September). A dynamic model of consumer replacement cycles in the pc processor industry. Marketing Science 28 (5). Gowrisankaran, G. and M. Rysman (2012). Dynamics of consumer demand for new durable goods. Journal of Political Economy 120, 1173 1219. Hendel, I. and A. Nevo (2006a). Measuring the implications of sales and consumer stockpiling behavior. Econometrica 74 (6), 1637 1673. Hendel, I. and A. Nevo (2006b, Fall). Sales and consumer inventory. Rand Journal of Economics. McFadden, D. (1978). Spatial Interaction theory and residential location, Chapter Modelling the choice of residential location, pp. 531 552. North Holland and Co. Pesendorfer, M. (2002). Retail sales: A study of pricing behavior in supermarkets*. Journal of Business 75 (1), 33 66. Rust, J. (1987). Optimal replacement of gmc bus engines: An empirical model of harold zurcher. Econometrica: Journal of the Econometric Society 55 (5), 999 1033. 23