Identification problems in DSGE models

Identification problems in DSGE models Fabio Canova ICREA-UPF, CREI, AMeN and CEPR August 7 References Canova, F. (99) Sensitivity analysis and model evaluation in dynamic GE economies, International Economic Review. Canova, F. () Validating DSGE models through VARs, CEPR working paper Canova, F. (7) How much structure in empirical models, forthcoming, Palgrave Handbook of Econometrics, volume. Canova, F. and Sala, L. (6) Back to square one: identification issues in DSGE models, ECB working paper Chari, V, Kehoe, P. and McGrattan, E. (7) Business cycle accounting, Econometrica Iskrev N (7) How much do we learn from the estimation of DSGE models - A case study of identification issues in a new Keynesian Business Cycle Model, University of Michigan, manuscript.

DSGE models have become the benchmark for: Understanding business cycles/ transmission of shocks Conduct policy analyses / forecasting exercises E t [A(θ)x t+ + B(θ)x t + C(θ)x t + D(θ)z t+ + F (θ)z t ]= z t+ = G(θ)z t + e t Stationary (log-linearized) RE solution: x t = J(θ)x t + K(θ)e t z t = G(θ)z t + e t Restricted, singular VAR() or state space model.

How are DSGE estimated/evaluated?. Limited information methods i. GMM ii. Indirect Inference: minimum distance estimation matching impulse responses iii. SVAR (magnitude and sign restrictions (Canova ()).. Full Information methods: i. Maximum Likelihood ii. Bayesian methods. Business cycle accounting/calibration Chari et. al. (7)

Matching impulse responses (conditional on some shock j): Model responses: Xt M (θ) =C(θ)( )e j t Data responses: X t = Ŵ ( )e j t (after shock identification). ˆθ = argmin θ g(θ) = X t X M t (θ) W (T ) W (T ) weighting matrix defining distance. ML: ˆθ = argmax θ L(X, θ) Bayesian: ˆθ = R θp(θ X)dθ or θ = argmax L(X, θ)p (θ) (constrained maximum likelihood) θ

Preliminary to estimation: can we recover structural parameters? Identifiability: Mapping from objective function to the parameters well behaved In general need: - Objective function has a unique minimum at θ = θ - Hessian is positive definite and has full rank - Curvature of objective function is sufficient

Difficult to verify in practice because: A) Mapping from structural parameters to solution parameters is unknown (numerical solution) B) Objective function is typically nonlinear function of solution parameters. Different objective functions may have different identification power Standard rank and order conditions can t be used!!!

Definitions i) Solution identification: can we recover structural θ from the aggregate decision rule matrices J(θ),K(θ),G(θ)? ii) Objective function identification: can we recover aggregate decision rule matrices J(θ),K(θ),G(θ) from the objective function? iii) Population identification (convoluting i) and ii)): can we recover the structural parameters from the objective function in population? iv) Sample identification: can we recover structural parameters from the objective function, given a sample of data?

Note: - i) and ii) can occur separately or in conjunction - i) is due to the model specification, ii) may result from the choice of objective function - iv) may occur even if iii) does not - iv) the focus of much of the econometric literature. Here focus on i) and ii). Preview: Problems with DSGE models are in the solution/objective function mapping.

What kind of population problems may DSGE models encounter? Observational equivalence of models. Two models may have the same (minimized) value of the objective function at two different vector of parameters (e.g. a sticky price and a stocky wage model) Observational equivalence within a model. Two vectors of parameters may give the same (minimized) value of the objective function, given a model (e.g. given a sticky price model, get the same responses if Calvo parameteris.or.7). Limited Information identification. A subset of the parameters of the model can t be identified because objective function uses only a portion of the restrictions of the solution.

Partial/under identification within a model. A subset of the structural parameter enter in a particular functional form in the solution/ may disappear from the solution. Weak/asymmetric identification within a model. The population mapping is very flat or asymmetric in some dimension. Local vs. global. Could be due to particular objective function/occur for all objective functions.

Example : Observational equivalence ) x t = λ +λ E t x t+ + λ λ λ +λ x t + v t where: λ λ. ) y t = λ y t + w t ) y t = λ E t y t+ where y t+ = E t y t+ + w t and w t iid (,σ w). Stable RE solution of ) x t = λ x t + λ +λ λ v t Stable RE solution of ) is y t = λ y t + w t. If σ w = λ +λ λ σ v, three processes are indistinguishable from impulse responses. Bayer and Farmer (): Ax t + DE t x t+ = B x t + B E t x t + Cv t. Also: Kim (, JEDC); Ma (, EL); Lubik and Schoefheide (,AER) An and Schorfheide (7,ER).

Example : Under-identification y t = a E t y t+ + a (i t E t π t+ )+v t () π t = a E t π t+ + a y t + v t () i t = a E t π t+ + v t () Solution: y t π t = a a a a v t v t i t v t a,a,a disappear from the solution. Different shocks identify different parameters. ML and distance could have different identification properties.

Example : Weak and partial under-identification max β t X t c φ t φ c t + k t+ = k η t z t +( δ)k t R.E. solution for w t+ =[k t+,c t,y t,z t ]=Aw t + Be t Select β =.98,φ=.,ρ=.9,η =.6,δ =.,z ss = Strategy: simulate data. Compute population objective function. Study its shape and features.

ρ β ρ β ρ φ ρ β - - - -.9.9.8.9.8.8 ρ.8 φ φ.9 -. -. -. -. -. - - - - - - - -6-8 -.99.99.99.99.988.986.98.99...98.98. β δ... δ - -. -. -. -. -. - -.9 -. - -. -.9.9.8.9.8.8 ρ.8 φ φ x -.9.9.9.8.8 ρ.8 φ φ.9.8 -. -. -. -. -. -. -e- -e- -. - -. -. - - - - - -.99 - - -6-8 -.99.99.99.99.988.986.98.99...98.98. β δ... ρ x -.99.99.99.988.986.98.99...98.98. β δ... δ - -. -. -. -. -. -. -. -. -. -. -..9.9.8.8 ρ.8 φ φ.9.9.8 -. -. -. -. -. -. -. -. -. -..99.99.99.99.988.986.98.99...98.98. β δ... δ Figure : Distance surface: Basic, Subset, Matching VAR and Weighted -. -. -. -.

What causes the problems? Lawofmotionofcapitalstockinalmostinvariantto: (a) variations of η and ρ (weak identification) (b) variations of β and δ additive (partial under-identification) Can we reduce problems by: (i) Changing W (T )? (long horizon may have little information) (ii) Matching VAR coefficients? (iii) Altering the objective function? NO

η η φ φ.. Standard solution: Problem!.... β =.98....... β =.99...................... δ..... δ.9.9.8.8...7.6.7.6.......................... δ Figure : Fixing beta...... δ

Identification and objective function What objective function should one use? Likelihood!! It has all the information and can be computed with Kalman filter. What does a prior do? Can help is identification problems are due to small samples but not if due to population problems!!

β β.99 -.99.98 - - - - -.99.99.98.98 β.97...6.8 δ..98.97 - -...6.8. δ x.99.99 - -.98 - -.99.99.98.98 β.97...6.8 δ..98.97 Figure : Likelihood and Posterior -...6.8. δ Posterior not usually updated if likelihood has no information. With constraints, updating is possible (many constraints from the model).

Identification and solution methods An-Schorfheide () Likelihood function better behaved if second order approximation is used. How about distance function? X max E β t [log(c t b c t ) a t N t ] t c t = y t = z t N t, c t external habit; a t stationary labor supply shock; ln( z t z t ) u z t technology shock. Linear solution (only labor supply shocks): ˆN t =(b + ρ) ˆN t bρ ˆN t ( b)û a t () Sargent (978), Kennan (988): b and ρ are not separately identified.

Second order solution (only labor supply shocks): ˆN t = b ˆN t + b(b ) ˆN t ( b)â t ( ( b) + b)â t â t = ρâ t + u a t

Ratio of Curvatures Responses to a labor supply shock.....8...6.8.6.. b ρ Figure : Distance function: linear vs. quadratic

Identification and estimation What if we disregard identification issues and estimate models with a finite sample? y t = π t = h +h y t + +h E ty t+ + φ (i t E t π t+ )+v t ω +ωβ π t + β +ωβ π t+ + (φ +.)( ζβ)( ζ) y t + v t ( + ωβ)ζ i t = λ r i t +( λ r )(λ π π t + λ y y t )+v t h: degree of habit persistence (.8) φ: relative risk aversion () β: discountfactor(.98) ω: degree of price indexation (.) ζ: degree of price stickiness (.68) λ r,λ π,λ y : policy parameters (.,.,.) v t :AR(ρ )(.6);v t :AR(ρ )(.6);v t : i.i.d.

h =.8 ρ =.6 ρ =.6 λ y =. λ π =. λ r =. x - x - x - x - β =.98.98.98.99.98.98.99.98.98.99.98.98.99 φ = ν = ξ =.68..6.7.8.9..6.7.8.9..6.7.8.9..6.7.8.9......................6.8.9.........6.8.9.........6.8.9.........6.8.9........6.6.7.6.6.7.6.6.7.6.6.7.... ω =.7.6...6...6.6.7..6.7.8.9.7.8.9 IS shock.6...6...6.6.7..6.7.8.9.7.8.9 Cost push shock.6...6.6.7..6.7.8.9.6...7.8.9 Monetary policy shock Figure : Distance function shape.6...6...6.6.7..6.7.8.9.7.8.9 All shocks

-. -. -. -. -..8.8.7.7.6.6 Monetary shocks.7 ξ.6 ν 6 8 6 8 ν ξ ξ Cost push shocks - - -.8 -.7 ξ.6 ν 6 8.8.7.7.6.6................7.9...7.9.........7.9.. -. -..8.6.. 6 8 ν -. λ π -...8 λ y. λ π...8 λ y...........8.. λ y.8.6.. λ π......7.9..8.. λ y Figure 6: Distance function and contours plots λ π

8 6 β =.98 φ = 6 ζ =.68.97.98.98.99.99 λ =. r λ =. 6 8 π λ =.. y..6.8...6.8 ρ =.6 6 8 ρ =.6 ω =....6.8 h =.8...6.8...6.8...6.8 Figure 7: Density Estimates, Monetary Shocks

Monetary Cost push IS. Gap. π interest rate..8.6....8.6..... -...... -. -.. -. -. -.....8 -. -. -. -. -. -. -..6.. -. -. -. -. -. -. -.6 Figure 8: Impulse responses, Monetary Shocks

Table : NK model. Matching monetary policy shocks, bias True Population T = T = T= T= wrong β.98..6.7.7.6 φ..7 9. 7.6 8.6 ζ.68. 9. 7...7 λ r..9 7..6.7 9. λ π.. 98.7 78. 7. 7. λ y..9.6 76. 6. 78. ρ.6..... ρ.6.8.9.8.7.7 ω.. 8.9. 98. 8. h.8..9..

Wrong inference = k t+ +( δ)k t + δx t = u t + ψr t = ηδ r x t +( ηδ r )c t ηk t ( η)n t ηu t ez t = R t + φ r R t +( φ r )(φ π π t + φ y y t )+er t = y t + ηk t +( η)n t + ηu t + ez t = N t + k t w t +(+ψ)r t = h E t [ +h c t+ c t + h +h c t h ( + h)ϕ (R t π t+ )] = β E t [ +β x t+ x t + +β x t + χ +β q t + β +β ex t+ +β ex t] = E t [π t+ R t q t + β( δ)q t+ + β rr t+ ] = β E t [ π t+ π t + γ p π t + T p (ηr t +( η)w t ez t + ep t )] +βγ p +βγ p = β E t [ w t+ w t + +βγ p +β w t + β +β π t+ +βγ w +β π t + γ w +βγ w t (w t σn t ϕ h (c t hc t ) ew t )]

δ depreciation rate (.8) λ w wage markup (.) ψ parameter (.6) π steady state π (.6) η share of capital (.9) h habit persistence (.8) ϕ risk aversion (.) σ l inverse elasticity of labor supply (.) β discount factor (.99) χ investment s elasticity to Tobin s q (.) ζ p price stickiness (.887) ζ w wage stickiness (.6) γ p price indexation (.86) γ w wage indexation (.) φ y response to y (.) φ π response to π (.) φ r int. rate smoothing (.779) T p ( βζ p)( ζ p ) T w (+βγ p )ζ p ( βζ w )( ζ w ) (+β)(+(+λ w )σ l λ w )ζ w

x -7 x -7 x -7 x -7 x -7 x -7 9 9 9 9 9 8 8 8 8 8 7 7 7 7 7 6 6 6 6 6.. δ =.8.. η =.9.988.99.99.99 β =.99... h =.8 6 7 χ = 6... φ =. x -7 x -7 x -7 x -7 x -7 x -7 9 9 9 9 9 8 8 8 8 8 7 7 7 7 7 6 6 6 6 6 ν =...6 ψ =.6.8.9 ξ =.887 p.8.9 γ =.86 p.6.7 ξ =.6 w... γ =. w x -7 x -7 x -7 x -7 x -7 9 9 9 9 8 8 8 8 7 7 7 7 6 6 6 6........7.8.98.99 ε =. λ =. λ =. λ =.779 ρ =.997 w y π r z Figure 9: Objective function: monetary and technology shocks

-. distance ξ p - x -.8 -e-.6.. -... ξ p γ...6.8 p γ p distance ξ w distance γ w distance ξ w -e- -. -. -. -. -..8.6.8 -e- -e- -. -e-6.6 -e-6 -e- - - - - - x.8.6..8...6... ξ w γ w...6.8 γ w - -e-6 -e-6 -e-6 -e-7 -e- - - - x.8.6.8.6..8...6... γ w γ p...6.8 γ p - -. -. -..8.6 -. -. -...8...6... ξ w ξ p...6.8 ξ p Figure : Distance surface and Contours Plots

ζ p γ p ζ w γ w Obj.Fun. Baseline.887.86.6. x = lb + std.89.8.6.8e-7 x = lb + std.89.7768.69..7e-7 x = ub - std.88.797.66.6.e-7 x = ub - std.9.77.6 8.7E-7 Case.86.6. x = lb + std..8.6..778e-8 x = lb + std..8.66.79.8e-8 x = ub - std.7..68.979.7e-8 x = ub - std.9.79.68..e-8 Case.6. x = lb + std.96.7.69.6.9e-8 x = lb + std.88.9.6.68.8e-8 x = ub - std.9.77.66.7.e-8 x = ub - std.789.97.6.8.6e-8 Case.86.6 x = lb + std.8.67.9 7.7E-9 x = lb + std.69.7.668.e-6 x = ub - std.6..67.7 7.E-8 x = ub - std.66.67.8.8.6e-6

ζ p γ p ζ w γ w Obj.Fun. Case.887.6.8 x = lb + std.96.7.67.99.6e-7 x = lb + std.976.68.6..e-7 x = ub - std.9.9.677 6.E-7 x = ub - std.96..69..e-7 Case.887. x = lb + std.986.6..7877e-7 x = lb + std.899..6e-7 x = ub - std.9.9..e-7 x = ub - std.9.9. 9.6E-7 Case 6.887. x = lb + std.877..9.7e-6 x = lb + std.899...6e-7 x = ub - std.97.6.. 6.8E-7 x = ub - std.889.99.89.6e-6 Case 7.887. x = lb + std.96.77...6e-6 x = lb + std.9.8..e-7 x = ub - std.96.669...6e-7 x = ub - std.898.9...7e-7

-.. Inflation True Estimated.. Interest rate -.. -.6 -.8 -. -. Real wage Investment -. -. -. - -.6 -. -.8 Consumption. Hours worked -. -. -. -. -. -. -. -. -. output. Capacity utilisation -. -. -. -. -. -. -. -.6 quarters after shock quarters after shock Figure : Impulse responses, Case.

Welfare costs different! L(π,y )=. with true parameters L(π,y )=. with estimated parameters

Detecting identification problems: Ex-ante diagnostics: - Plots/ Preliminary exploration of objective function - Numerical derivatives of the objective function at likely parameter values - Condition number of the Hessian (ratio largest/smallest eigenvalues) Ex-post diagnostics: - Erratic parameter estimates as T increases - Large or non-computable standard errors - Crazy t-test (Choi and Phillips (99), Stock and Wright ()).

Tests: Cragg and Donald (997): Testing rank of Hessian. Under regularity conditions: (vec(ĥ) vec(h)) Ω(vec(Ĥ) vec(h)) χ ((N L )(N L )) N = dim(h),l =rank of H. Anderson (98): Size of characteristic roots of Hessian. Under regularity conditions: P N m i= ˆλ i P Ni= ˆλ i D Normal distribution. Concentration Statistics: C θ (i) = R j6=i g(θ) g(θ )dθ R (θ θ )dθ,i =,... (Stock, Wright and Yogo ()) = measures the global curvature of the objective function around θ.

Difficult to employ: just use as a diagnostic. Applied to last model: rank of H = 6; sum of - characteristics roots is smaller than. of the average root - dimensions of weak or partial identification problems. Which are the parameters is causing problems? β, h, σ l,δ,η,ψ,γ p,γ w,λ w,φ π,φ y,ρ z. Why? Variations of these parameters hardly affect law of motion of states! Almost a rule: for identification need states to react changes in structural parameters.

What to do when identification problems exist? Which type? - If population need respecify the model. - If objective/ limited information use likelihood. - If small sample add information (prior or other data) - Don t proceed as if they do not exist. - Careful with mixed calibration-estimation. Full calibration preferable or Bayesian calibration (Canova (99))

Conclusions: Liu (96), Sims (98): - Traditional models hopelessly under-identified. -Identification often achieved not because we have sufficient information butbecausewewantittobeso. - Proceed with reduced form models

A destructive approach: - Most (large scale) DSGE models are face severe identification problems. - Models are identified not because likelihood (or part of it) is informative, but because we make it informative (via partial calibration or tight priors). - Estimation = confirmatory analysis. - Hard to reject models.

Amoreconstructiveone: (i) Try to respecify the model to get rid of problems (ii) Evaluate numerically the mapping between structural parameters and coefficients of the decision rule. Do extensive exploratory analysis. (iii) Find out what estimation method could work also in presence of identification problems (Stock and Wright (), Rosen ()) (iv) Work out economic reasons for identification problems with submodels or simplified versions of larger ones (v) Be less demanding of your models. semi-structural estimation (e.g. SVARs) Use methodologies why employ