Using data to inform policy
|
|
- Josephine Fitzgerald
- 5 years ago
- Views:
Transcription
1 Using data to inform policy Maximilian Kasy Department of Economics, Harvard University Maximilian Kasy (Harvard) data and policy 1 / 41
2 Introduction The roles of econometrics Forecasting: What will be? (needs no notion of causality) Testing theories: Is statement A correct? (usually involves some notion of causality or structural invariance) Informing policy: What should we do? (requires notion of causality / counterfactuals, and normative evaluation of counterfactuals) Maximilian Kasy (Harvard) data and policy 2 / 41
3 Informing policy Introduction How to use data when choosing tax rates insurance rates, class size, class composition, dosage of a drug, vaccination rates,... Proposed framework data update expectations about social welfare optimal policy: maximize expected social welfare Yields tractable mapping: observed data optimal policy experimental designs which maximize ex-ante expected welfare Maximilian Kasy (Harvard) data and policy 3 / 41
4 Introduction The literatures this talk is based on 1 Statistical decision theory evaluates estimators, tests, experimental designs based on expected loss 2 Optimal policy theory evaluates policy choices based on social welfare 3 this paper policy choice as a statistical decision statistical loss social welfare. objectives: 1 anchoring econometrics in economic policy problems. 2 anchoring policy choices in a principled use of data. Maximilian Kasy (Harvard) data and policy 4 / 41
5 Roadmap Introduction 1 Brief review Statistical decision theory Optimal policy theory 2 General setup 3 Theoretical results Frequentist asymptotics Optimal experimental design 4 Application to the RAND health insurance experiment data Maximilian Kasy (Harvard) data and policy 5 / 41
6 Introduction Decision theory as foundation of econometrics Observed data X A statistical decision a A state of the world θ A loss function L(a, θ) (the negative of utility) A statistical model f (X θ) A decision function a = δ(x ) A risk function R(δ, θ) = E θ [L(δ(X ), θ)] Bayes risk R(δ, π) = R(δ, θ)dπ(θ) Most common applications: 1 Estimation with squared loss: L(a, θ) = (a µ(θ)) 2 2 Testing of H 0 : θ Θ 0 : 1 if a = 1, θ Θ 0 L(a, θ) = c if a = 0, θ / Θ 0 0 else. Maximilian Kasy (Harvard) data and policy 6 / 41
7 Optimal policy problems Optimal insurance and optimal taxation optimal (unemployment) insurance (Baily, 1978; Chetty, 2006) Y {0, 1}: state of an individual, Y = 1 bad state t = x: size of transfer to individuals in the bad state Y x = g(x, ɛ): counterfactual state of an individual m(x) = E[g(x, ɛ)]: share of the population in the bad state, given x λ: marginal value of income to individuals in the bad state, relative to the marginal value of government funds (normative choice!) similar: Optimal top tax rate (Saez, 2001) Y : taxable income t: top tax rate m: tax base Maximilian Kasy (Harvard) data and policy 7 / 41
8 Optimal policy problems Effect of a marginal change dt of t on social welfare: 1 λm dt: mechanical effect on private welfare 2 m dt: mechanical effect on government funds 3 tm (t) dt: behavioral effect on government revenues 4 0: behavioral effect on private welfare (envelope condition) Summing up gives u (t) = (λ 1) m(t) t m (t) = λm(t) (t m(t)). (1) t Integrating t u(t) = λ m(x)dx t m(t). (2) 0 Maximilian Kasy (Harvard) data and policy 8 / 41
9 Optimal policy problems Optimal choice of inputs to a production process Thus: x X R dx : inputs Y = g(x, ɛ): outcome e.g.: Y student long run outcomes, X class size, teacher training,... m(x) = E[g(x, ɛ)]: average production function λ: willingness to pay for Y (normative choice!) objective: max λ E[Y ], net of costs p t u(t) = λ m(t) p t. (3) Maximilian Kasy (Harvard) data and policy 9 / 41
10 General framework The setup 1 policy maker: expected social welfare maximizer 2 u(t): social welfare for policy choice t T R dt u is unknown 3 u = L m + u 0, L and u 0 are known L: linear operator C 1 (X ) C 1 (T ) 4 m(x) = E[g(x, ɛ)] (average structural function) X, Y = g(x, ɛ) observables, ɛ unobserved expectation over distribution of ɛ in target population 5 experimental setting: X ɛ m(x) = E[Y X = x] 6 Gaussian process prior: m GP(µ, C) 7 posterior expectations: best linear predictors in Y Maximilian Kasy (Harvard) data and policy 10 / 41
11 General framework Applied to the two optimal policy problems u = L m + u 0 1 Optimal insurance: u 0 (t) = 0 (Lm)(t) = λ 2 Optimal choice of inputs: t 0 m(x)dx t m(t). (4) u 0 (t) = p t (Lm)(t) = λ m(t). (5) Maximilian Kasy (Harvard) data and policy 11 / 41
12 General framework Questions 1 How to choose t optimally given observations X i, Y i, i = 1,..., n? 2 How to choose design points X i given sample size n? How to choose sample size n? mathematical characterization: 1 How does the optimal choice of t (in a Bayesian sense) behave asymptotically (in a frequentist sense)? (conditional vs. average behavior!) 2 How can we characterize the optimal design? Maximilian Kasy (Harvard) data and policy 12 / 41
13 Theoretical results What we need to derive: 1 posterior expectations 2 optimal policy t = argmax û(t) m(x) := E[m(x) X, Y ] û(t) = E[u(t) X, Y ] 3 optimal design X = argmax υ(x ) where υ(x ) = E[max û] What the paper shows: 1 Frequentist asymptotics asymptotic normality of t estimator of its variance 2 Optimal design effect of perturbations on υ FOC for optimal design asymptotic simplification of FOC Maximilian Kasy (Harvard) data and policy 13 / 41
14 Theoretical results Posterior expectation of m m GP(µ, C) - denote µ i = µ(x i ), C i,j = C(X i, X j ), and C i (x) = C(x, X i ). (6) µ, C, C(x) corresponding vectors / matrix posterior expectations are best linear predictors in Y m(x) :=E[m(x) X, Y ] =E[m(x) X ] + Cov(m(x), Y X ) Var(Y X ) 1 (Y E[Y X ]) =µ(x) + C(x) [C + σ 2 I ] 1 (Y µ). (7) Maximilian Kasy (Harvard) data and policy 14 / 41
15 Theoretical results Alternative representation of m 1 Penalized regression m = argmin ˇm(.) [ 1 σ 2 ] (Y i ˇm(X i )) 2 + ˇm µ 2 C i (8) m µ 2 C : reproducing kernel Hilbert space norm defined by C(x 1,.), C(x 2,.) := C(x 1, x 2 ) 2 Equivalent kernel: approximate discrete F n (x) by continuous F (x) m(x) w 0 (x) + 1 w(x, X i ) Y i n i [ ] 1 w(., x ) = argmin ˇw(x) 2 df(x) + σ2 ˇw(.) 2 2n ˇw 2 C ˇw(x ) (9) Maximilian Kasy (Harvard) data and policy 15 / 41
16 Theoretical results Posterior expectation of u Let D(t, x) : = Cov(u(t), m(x)) = Cov((Lm + u 0 )(t), m(x)) = L x C(x, x). (10) Then û(t) = E[u(t) X, Y ] = E[u(t) X ] + Cov(u(t), Y X ) Var(Y X ) 1 (Y E[Y X ]) = ν(t) + D(t) [C + σ 2 I ] 1 (Y µ). (11) Maximilian Kasy (Harvard) data and policy 16 / 41
17 Optimal policy Theoretical results Expected utility maximization t = t (X, Y ) argmax t First order condition: 0 =! t û(t) = t E[u(t) X, Y ] where û(t) (12) = ν (t) + B(t) [C + σ 2 I ] 1 (Y µ) (13) ( B(t, x) := Cov t = Cov ) u(t), m(x) ( t (Lm + u 0)(t), m(x) = t L x C(x, x). (14) ) Maximilian Kasy (Harvard) data and policy 17 / 41
18 Theoretical results Optimal experimental design maximizes ex-ante expected welfare υ(x ) through choice of design points X assuming that the policy t is chosen (ex-post) as t (X, Y ) where X argmax X υ(x ) (15) υ(x ) := E[max û(t) X ] t = E[û( t ) X ] [ = E max t (ν(t) + D(t) [C + σ 2 I ] 1 (Y µ) )] (16) Maximilian Kasy (Harvard) data and policy 18 / 41
19 Theoretical results Main mathematical results 1 Explicit expression for optimal policy t 2 Frequentist asymptotics of t 1 asymptotically normal, slower than n 2 distribution driven by distribution of û (t ) 3 confidence sets 3 Optimal experimental design design density f (x) increasing in (but less than proportionately) 1 density of t 2 the expected inverse of the curvature u (t) algorithm to choose design based on prior, objective function Maximilian Kasy (Harvard) data and policy 19 / 41
20 Asymptotics Frequentist asymptotics (under regularity conditions) 1 t is asymptotically normal 2 rate of convergence: slower than n 3 asymptotic variance: proportional to 1 asymptotic variance of û (t ) 2 squared inverse of u (t ) 4 confidence sets for t 5 regret u(t ) u( t ): asymptotically generalized χ 2 distribution Remark: By construction: t optimal w.r.t. average risk across m These results: performance of t conditional on m Maximilian Kasy (Harvard) data and policy 20 / 41
21 Outline of proof Asymptotics 1 t as an M-estimator, t = argmax t û(t) Asymptotic normality follows from 1 consistency of û in C 2 2 asymptotic normality of û (t ) 2 Asymptotic normality of û (t ) follows from asymptotic normality of m by linearity of mapping m û similarly for C 2 consistency Maximilian Kasy (Harvard) data and policy 21 / 41
22 Asymptotics 1 t as an M-estimator FOC for t : Thus 2 û and m public finance application: 0 = û ( t ) = û (t ) + û ( t ) ( t t ) production function application: t t = u (t ) 1 û (t ) + t û (t) = (λ 1) m(t) t m (t) (17) û (t) = λ m (t) p (18) in either case, the asymptotic distribution of û is dominated by the distribution of m. Maximilian Kasy (Harvard) data and policy 22 / 41
23 Asymptotics Consistent variance estimator Let ( V = l 2 û ( t ) 1 V n,m û ( t ) 1), where V n,m = i w t ( t, X i ) 2 (Y i m(x i )) 2. Then V / Var( t X, θ) p 1. Maximilian Kasy (Harvard) data and policy 23 / 41
24 Asymptotics Asymptotics of m 1 General conditions for consistency of m: Ghoshal and van der Vaart (2013) 2 m asymptotically equivalent to a nonparametric kernel regression bandwidth parameter that depends on the design density f (x) Silverman (1984); Sollich and Williams (2005) 3 asymptotic normality, rates of convergence for m, and of m : follow from standard results for kernel regression Maximilian Kasy (Harvard) data and policy 24 / 41
25 Optimal design Characterizing the optimal experimental design continuous approximation easier & more elegant to characterize choose a design density f (x), then observe function Y Y (x) = m(x) + σ nf (x) dw (x). dw is standard white noise process independent of m will characterize 1 posterior expectation û = E[u Y ] 2 expected social welfare υ(f ) := E[max t û(t)] 3 how û, υ are affected by a perturbation to the design density f : f + δ f 4 FOC for f which maximizes υ(f ) 5 Asymptotic simplification of this FOC Maximilian Kasy (Harvard) data and policy 25 / 41
26 Optimal design Remarks 1 white noise, Y (.): generalized stochastic processes. Y (x) is not a well defined random variable, but Y (x)g(x)dx is (for g continuous, bounded). 2 posterior expectation of m given Y : Wiener filtering 3 continuous equivalent of discrete setting larger design density increasing number of observations decreasing the variance of noise. 4 continuous model rationalizes the equivalent kernel Maximilian Kasy (Harvard) data and policy 26 / 41
27 Optimal design Operator notation (cf. matrix notation) Identify kernel C(x, x ) with integral operator g Cg, (Cg)(x) := C(x, x )g(x )dx. (19) Similarly for the noise process (Eg)(x) := where F is the operator (Fg)(x) = f (x) g(x). In this notation σ2 σ2 g(x) = nf (x) n F 1, (20) Var(Y ) = C + E Cov(m, Y ) = C Cov(u, Y ) = LC. Maximilian Kasy (Harvard) data and policy 27 / 41
28 Optimal design Posterior means posterior expectation is best linear predictor W is the equivalent kernel. conditions for BLP Thus linearity of expectations m = µ + W (Y µ) (21) Cov ( m m, Y ) =W Var(Y ) Cov(m, Y ) =W (C + E) C = 0 W = C(C + E) 1. (22) û = E[u Y ] = E[Lm + u 0 Y ] = L m + u 0 = ν + LC(C + E) 1 (Y µ). (23) Maximilian Kasy (Harvard) data and policy 28 / 41
29 Optimal design The prior distribution of the posterior mean û Var(Y ) = C + E û = ν + LC(C + E) 1 (Y µ) Therefore û GP(ν, V ), where V := Var(û) = LC(C + E) 1 (LC) t (24) and E = σ2 n F 1. Now consider perturbed design f + δ f where E(δ) = σ2 n (F + δ F ) 1. V (δ) = LC(C + E(δ)) 1 (LC) t (25) Maximilian Kasy (Harvard) data and policy 29 / 41
30 Optimal design Perturbed design continued derivative at δ = 0; denoting W := LW where special case: f unit point mass at x V δ is covariance kernel of where ɛ N(0, 1). V δ = W E δ W t, (26) E δ = σ2 n F 1 F F 1. (27) v(x) := σ nf (x ) w(x, x ) ɛ, (28) Maximilian Kasy (Harvard) data and policy 30 / 41
31 Optimal design Perturbations to the design and expected social welfare Theorem Consider expected social welfare, υ(f + δ f ) at the perturbed design distribution f + δ f, where f is a point mass at the point x. Then δ υ(f + δ f ) = 1 2 nf 2 (x ) E [ tr [ û ( t ) 1 (w t ( t, x )w t ( t, x ) t)]] σ 2 Maximilian Kasy (Harvard) data and policy 31 / 41
32 Optimal design Sketch of proof: 1 Perturbed design equivalent to considering the random function û(δ) := û + δ v, where v(x) = σ nf (x ) w(x, x ) ɛ, (29) and ɛ N(0, 1), ɛ û. 2 Taylor expansions for t and û(δ) yield û(δ, t (δ)) = û(t ) + δv( t ) + δ 1 2 v ( t ) t û ( t ) 1 v ( t ) + o p (δɛ 2 ). (30) 3 Now take expectations. Maximilian Kasy (Harvard) data and policy 32 / 41
33 Optimal design The optimal experimental design f = argmax f υ(f ) Corollary...is characterized by 1 f 2 (x) E [ tr [ û ( t ) 1 (w t ( t, x)w t ( t, x) t)]] = κ. (31) κ is a Lagrange multiplier on the constraint f = 1. Equation (31) holds for all x. Sketch of proof: f solves s.t. max υ(f ) f (x)dx = 1. (32) FOC: δ υ(f + δ f ) = κ f (x)dx = κ. Maximilian Kasy (Harvard) data and policy 33 / 41
34 Application: health insurance The RAND health insurance experiment (cf. Aron-Dine et al., 2013) Between 1974 and 1981 representative sample of 2000 households in six locations across the US families randomly assigned to plans with one of six consumer coinsurance rates 95, 50, 25, or 0 percent 2 more complicated plans (I drop those) Additionally: randomized Maximum Dollar Expenditure limits 5, 10, or 15 percent of family income, up to a maximum of $750 or $1,000 (I pool across those) Maximilian Kasy (Harvard) data and policy 34 / 41
35 Application: health insurance Table : Predicted spending for different coinsurance rates, OLS (1) (2) (3) (4) Share with Spending Share with Spending any in $ any in $ Free Care (0.006) (78.76) (0.006) (72.06) 25% Coinsurance (0.013) (130.5) (0.012) (115.2) 50% Coinsurance (0.018) (273.7) (0.016) (279.6) 95% Coinsurance (0.011) (95.40) (0.009) (88.48) family x month x site X X X X fixed effects covariates X X N Maximilian Kasy (Harvard) data and policy 35 / 41
36 Application: health insurance Assumptions 1 Model: The optimal insurance model as presented before 2 Prior: Gaussian process prior for m, squared exponential in distance, uninformative about level and slope 3 Relative value of funds for sick people vs contributors: λ = 1.5 (crucial parameter!!!) 4 Pooling data: across levels of maximum dollar expenditure Under these assumptions we find: Optimal copay equals 18% (But free care is almost as good) Maximilian Kasy (Harvard) data and policy 36 / 41
37 Application: health insurance Posterior: Spending for different coinsurance rates m m t Maximilian Kasy (Harvard) data and policy 37 / 41
38 Application: health insurance Posterior: Social welfare for different coinsurance rates 600 ûû t Maximilian Kasy (Harvard) data and policy 38 / 41
39 Application: health insurance First order condition for optimal policy; confidence band û t Maximilian Kasy (Harvard) data and policy 39 / 41
40 Summary Application: health insurance This paper combines statistical decision theory and optimal policy theory Model: 1 objective u = L m + u 0 2 m is the ASF m(x) = E[g(x, ɛ)] 3 prior m GP(µ, C) Questions: 1 How to choose t optimally given observations? 2 How to choose design points X i? Main results: 1 Explicit expression for optimal policy t 2 Frequentist asymptotics of t 3 Characterization of optimal experimental design Maximilian Kasy (Harvard) data and policy 40 / 41
41 Application: health insurance Thanks for your time! Maximilian Kasy (Harvard) data and policy 41 / 41
What is to be done? Two attempts using Gaussian process priors
What is to be done? Two attempts using Gaussian process priors Maximilian Kasy Department of Economics, Harvard University Oct 14 2017 1 / 33 What questions should econometricians work on? Incentives of
More informationUsing data to inform policy
Using data to inform policy Maximilian Kasy June 16, 2014 Abstract In this paper, a general framework is proposed for the use of (quasi-) experimental data when choosing policies such as tax rates or the
More informationWhy experimenters should not randomize, and what they should do instead
Why experimenters should not randomize, and what they should do instead Maximilian Kasy Department of Economics, Harvard University Maximilian Kasy (Harvard) Experimental design 1 / 42 project STAR Introduction
More informationOptimal taxation and insurance using machine learning sufficient statistics and beyond
Optimal taxation and insurance using machine learning sufficient statistics and beyond Maximilian Kasy April 2, 218 Abstract How should one use (quasi-)experimental evidence when choosing policies such
More informationEcon 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines
Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines Maximilian Kasy Department of Economics, Harvard University 1 / 37 Agenda 6 equivalent representations of the
More informationEcon 2148, spring 2019 Statistical decision theory
Econ 2148, spring 2019 Statistical decision theory Maximilian Kasy Department of Economics, Harvard University 1 / 53 Takeaways for this part of class 1. A general framework to think about what makes a
More informationEcon 2140, spring 2018, Part IIa Statistical Decision Theory
Econ 2140, spring 2018, Part IIa Maximilian Kasy Department of Economics, Harvard University 1 / 35 Examples of decision problems Decide whether or not the hypothesis of no racial discrimination in job
More informationIdentification in Triangular Systems using Control Functions
Identification in Triangular Systems using Control Functions Maximilian Kasy Department of Economics, UC Berkeley Maximilian Kasy (UC Berkeley) Control Functions 1 / 19 Introduction Introduction There
More informationEmpirical approaches in public economics
Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental
More informationGaussian processes. Chuong B. Do (updated by Honglak Lee) November 22, 2008
Gaussian processes Chuong B Do (updated by Honglak Lee) November 22, 2008 Many of the classical machine learning algorithms that we talked about during the first half of this course fit the following pattern:
More informationMachine learning, shrinkage estimation, and economic theory
Machine learning, shrinkage estimation, and economic theory Maximilian Kasy December 14, 2018 1 / 43 Introduction Recent years saw a boom of machine learning methods. Impressive advances in domains such
More informationSTAT 518 Intro Student Presentation
STAT 518 Intro Student Presentation Wen Wei Loh April 11, 2013 Title of paper Radford M. Neal [1999] Bayesian Statistics, 6: 475-501, 1999 What the paper is about Regression and Classification Flexible
More information4. Distributions of Functions of Random Variables
4. Distributions of Functions of Random Variables Setup: Consider as given the joint distribution of X 1,..., X n (i.e. consider as given f X1,...,X n and F X1,...,X n ) Consider k functions g 1 : R n
More informationLecture notes on statistical decision theory Econ 2110, fall 2013
Lecture notes on statistical decision theory Econ 2110, fall 2013 Maximilian Kasy March 10, 2014 These lecture notes are roughly based on Robert, C. (2007). The Bayesian choice: from decision-theoretic
More informationδ -method and M-estimation
Econ 2110, fall 2016, Part IVb Asymptotic Theory: δ -method and M-estimation Maximilian Kasy Department of Economics, Harvard University 1 / 40 Example Suppose we estimate the average effect of class size
More informationCovariance function estimation in Gaussian process regression
Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian
More informationEcon 2148, fall 2017 Instrumental variables II, continuous treatment
Econ 2148, fall 2017 Instrumental variables II, continuous treatment Maximilian Kasy Department of Economics, Harvard University 1 / 35 Recall instrumental variables part I Origins of instrumental variables:
More informationFinal Exam. Economics 835: Econometrics. Fall 2010
Final Exam Economics 835: Econometrics Fall 2010 Please answer the question I ask - no more and no less - and remember that the correct answer is often short and simple. 1 Some short questions a) For each
More informationDepartment of Agricultural Economics. PhD Qualifier Examination. May 2009
Department of Agricultural Economics PhD Qualifier Examination May 009 Instructions: The exam consists of six questions. You must answer all questions. If you need an assumption to complete a question,
More informationGenerated Covariates in Nonparametric Estimation: A Short Review.
Generated Covariates in Nonparametric Estimation: A Short Review. Enno Mammen, Christoph Rothe, and Melanie Schienle Abstract In many applications, covariates are not observed but have to be estimated
More informationMachine Learning 2017
Machine Learning 2017 Volker Roth Department of Mathematics & Computer Science University of Basel 21st March 2017 Volker Roth (University of Basel) Machine Learning 2017 21st March 2017 1 / 41 Section
More informationHabilitationsvortrag: Machine learning, shrinkage estimation, and economic theory
Habilitationsvortrag: Machine learning, shrinkage estimation, and economic theory Maximilian Kasy May 25, 218 1 / 27 Introduction Recent years saw a boom of machine learning methods. Impressive advances
More informationF9 F10: Autocorrelation
F9 F10: Autocorrelation Feng Li Department of Statistics, Stockholm University Introduction In the classic regression model we assume cov(u i, u j x i, x k ) = E(u i, u j ) = 0 What if we break the assumption?
More informationEconometric Causality
Econometric (2008) International Statistical Review, 76(1):1-27 James J. Heckman Spencer/INET Conference University of Chicago Econometric The econometric approach to causality develops explicit models
More informationECO Class 6 Nonparametric Econometrics
ECO 523 - Class 6 Nonparametric Econometrics Carolina Caetano Contents 1 Nonparametric instrumental variable regression 1 2 Nonparametric Estimation of Average Treatment Effects 3 2.1 Asymptotic results................................
More informationApplied Microeconometrics (L5): Panel Data-Basics
Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics
More informationproblem. max Both k (0) and h (0) are given at time 0. (a) Write down the Hamilton-Jacobi-Bellman (HJB) Equation in the dynamic programming
1. Endogenous Growth with Human Capital Consider the following endogenous growth model with both physical capital (k (t)) and human capital (h (t)) in continuous time. The representative household solves
More informationLecture Notes 1: Decisions and Data. In these notes, I describe some basic ideas in decision theory. theory is constructed from
Topics in Data Analysis Steven N. Durlauf University of Wisconsin Lecture Notes : Decisions and Data In these notes, I describe some basic ideas in decision theory. theory is constructed from The Data:
More informationMCMC Sampling for Bayesian Inference using L1-type Priors
MÜNSTER MCMC Sampling for Bayesian Inference using L1-type Priors (what I do whenever the ill-posedness of EEG/MEG is just not frustrating enough!) AG Imaging Seminar Felix Lucka 26.06.2012 , MÜNSTER Sampling
More informationEndogenous Information Choice
Endogenous Information Choice Lecture 7 February 11, 2015 An optimizing trader will process those prices of most importance to his decision problem most frequently and carefully, those of less importance
More informationGaussian Process Regression
Gaussian Process Regression 4F1 Pattern Recognition, 21 Carl Edward Rasmussen Department of Engineering, University of Cambridge November 11th - 16th, 21 Rasmussen (Engineering, Cambridge) Gaussian Process
More informationEconomics th April 2011
Economics 401 8th April 2011 Instructions: Answer 7 of the following 9 questions. All questions are of equal weight. Indicate clearly on the first page which questions you want marked. 1. Answer both parts.
More informationThe risk of machine learning
/ 33 The risk of machine learning Alberto Abadie Maximilian Kasy July 27, 27 2 / 33 Two key features of machine learning procedures Regularization / shrinkage: Improve prediction or estimation performance
More informationState Space Representation of Gaussian Processes
State Space Representation of Gaussian Processes Simo Särkkä Department of Biomedical Engineering and Computational Science (BECS) Aalto University, Espoo, Finland June 12th, 2013 Simo Särkkä (Aalto University)
More informationThe cover page of the Encyclopedia of Health Economics (2014) Introduction to Econometric Application in Health Economics
PHPM110062 Teaching Demo The cover page of the Encyclopedia of Health Economics (2014) Introduction to Econometric Application in Health Economics Instructor: Mengcen Qian School of Public Health What
More informationThe Bayesian approach to inverse problems
The Bayesian approach to inverse problems Youssef Marzouk Department of Aeronautics and Astronautics Center for Computational Engineering Massachusetts Institute of Technology ymarz@mit.edu, http://uqgroup.mit.edu
More informationBayesian Gaussian / Linear Models. Read Sections and 3.3 in the text by Bishop
Bayesian Gaussian / Linear Models Read Sections 2.3.3 and 3.3 in the text by Bishop Multivariate Gaussian Model with Multivariate Gaussian Prior Suppose we model the observed vector b as having a multivariate
More informationQuantifying mismatch in Bayesian optimization
Quantifying mismatch in Bayesian optimization Eric Schulz University College London e.schulz@cs.ucl.ac.uk Maarten Speekenbrink University College London m.speekenbrink@ucl.ac.uk José Miguel Hernández-Lobato
More informationChapter 2: simple regression model
Chapter 2: simple regression model Goal: understand how to estimate and more importantly interpret the simple regression Reading: chapter 2 of the textbook Advice: this chapter is foundation of econometrics.
More informationECON3150/4150 Spring 2015
ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2
More informationMark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation.
CS 189 Spring 2015 Introduction to Machine Learning Midterm You have 80 minutes for the exam. The exam is closed book, closed notes except your one-page crib sheet. No calculators or electronic items.
More informationProblem Set #6: OLS. Economics 835: Econometrics. Fall 2012
Problem Set #6: OLS Economics 835: Econometrics Fall 202 A preliminary result Suppose we have a random sample of size n on the scalar random variables (x, y) with finite means, variances, and covariance.
More informationIntroduction to second order approximation. SGZ Macro Week 3, Day 4 Lecture 1
Introduction to second order approximation 1 Outline A. Basic concepts in static models 1. What are first and second order approximations? 2. What are implications for accuracy of welfare calculations
More informationg-priors for Linear Regression
Stat60: Bayesian Modeling and Inference Lecture Date: March 15, 010 g-priors for Linear Regression Lecturer: Michael I. Jordan Scribe: Andrew H. Chan 1 Linear regression and g-priors In the last lecture,
More informationMicroeconomic theory focuses on a small number of concepts. The most fundamental concept is the notion of opportunity cost.
Microeconomic theory focuses on a small number of concepts. The most fundamental concept is the notion of opportunity cost. Opportunity Cost (or "Wow, I coulda had a V8!") The underlying idea is derived
More informationQuantitative Economics for the Evaluation of the European Policy
Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,
More informationFixed Effects, Invariance, and Spatial Variation in Intergenerational Mobility
American Economic Review: Papers & Proceedings 2016, 106(5): 400 404 http://dx.doi.org/10.1257/aer.p20161082 Fixed Effects, Invariance, and Spatial Variation in Intergenerational Mobility By Gary Chamberlain*
More informationCourse 1 Solutions November 2001 Exams
Course Solutions November Exams . A For i =,, let R = event that a red ball is drawn form urn i i B = event that a blue ball is drawn from urn i. i Then if x is the number of blue balls in urn, ( R R)
More informationPredicting bond returns using the output gap in expansions and recessions
Erasmus university Rotterdam Erasmus school of economics Bachelor Thesis Quantitative finance Predicting bond returns using the output gap in expansions and recessions Author: Martijn Eertman Studentnumber:
More informationToday. Probability and Statistics. Linear Algebra. Calculus. Naïve Bayes Classification. Matrix Multiplication Matrix Inversion
Today Probability and Statistics Naïve Bayes Classification Linear Algebra Matrix Multiplication Matrix Inversion Calculus Vector Calculus Optimization Lagrange Multipliers 1 Classical Artificial Intelligence
More informationHigh-dimensional Problems in Finance and Economics. Thomas M. Mertens
High-dimensional Problems in Finance and Economics Thomas M. Mertens NYU Stern Risk Economics Lab April 17, 2012 1 / 78 Motivation Many problems in finance and economics are high dimensional. Dynamic Optimization:
More information(a) Write down the Hamilton-Jacobi-Bellman (HJB) Equation in the dynamic programming
1. Government Purchases and Endogenous Growth Consider the following endogenous growth model with government purchases (G) in continuous time. Government purchases enhance production, and the production
More informationEC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)
1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For
More information9. Model Selection. statistical models. overview of model selection. information criteria. goodness-of-fit measures
FE661 - Statistical Methods for Financial Engineering 9. Model Selection Jitkomut Songsiri statistical models overview of model selection information criteria goodness-of-fit measures 9-1 Statistical models
More informationTo Hold Out or Not. Frank Schorfheide and Ken Wolpin. April 4, University of Pennsylvania
Frank Schorfheide and Ken Wolpin University of Pennsylvania April 4, 2011 Introduction Randomized controlled trials (RCTs) to evaluate policies, e.g., cash transfers for school attendance, have become
More informationGaussian processes for inference in stochastic differential equations
Gaussian processes for inference in stochastic differential equations Manfred Opper, AI group, TU Berlin November 6, 2017 Manfred Opper, AI group, TU Berlin (TU Berlin) inference in SDE November 6, 2017
More informationStatistical Machine Learning from Data
Samy Bengio Statistical Machine Learning from Data 1 Statistical Machine Learning from Data Support Vector Machines Samy Bengio IDIAP Research Institute, Martigny, Switzerland, and Ecole Polytechnique
More informationOutline Lecture 2 2(32)
Outline Lecture (3), Lecture Linear Regression and Classification it is our firm belief that an understanding of linear models is essential for understanding nonlinear ones Thomas Schön Division of Automatic
More informationPrimal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing
Primal-dual Covariate Balance and Minimal Double Robustness via (Joint work with Daniel Percival) Department of Statistics, Stanford University JSM, August 9, 2015 Outline 1 2 3 1/18 Setting Rubin s causal
More informationDiscrete Dependent Variable Models
Discrete Dependent Variable Models James J. Heckman University of Chicago This draft, April 10, 2006 Here s the general approach of this lecture: Economic model Decision rule (e.g. utility maximization)
More informationσ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) =
Until now we have always worked with likelihoods and prior distributions that were conjugate to each other, allowing the computation of the posterior distribution to be done in closed form. Unfortunately,
More informationAdditive Isotonic Regression
Additive Isotonic Regression Enno Mammen and Kyusang Yu 11. July 2006 INTRODUCTION: We have i.i.d. random vectors (Y 1, X 1 ),..., (Y n, X n ) with X i = (X1 i,..., X d i ) and we consider the additive
More informationPhysics 403. Segev BenZvi. Parameter Estimation, Correlations, and Error Bars. Department of Physics and Astronomy University of Rochester
Physics 403 Parameter Estimation, Correlations, and Error Bars Segev BenZvi Department of Physics and Astronomy University of Rochester Table of Contents 1 Review of Last Class Best Estimates and Reliability
More informationModelling Czech and Slovak labour markets: A DSGE model with labour frictions
Modelling Czech and Slovak labour markets: A DSGE model with labour frictions Daniel Němec Faculty of Economics and Administrations Masaryk University Brno, Czech Republic nemecd@econ.muni.cz ESF MU (Brno)
More informationEco517 Fall 2004 C. Sims MIDTERM EXAM
Eco517 Fall 2004 C. Sims MIDTERM EXAM Answer all four questions. Each is worth 23 points. Do not devote disproportionate time to any one question unless you have answered all the others. (1) We are considering
More informationNon-Bayesian Classifiers Part II: Linear Discriminants and Support Vector Machines
Non-Bayesian Classifiers Part II: Linear Discriminants and Support Vector Machines Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Fall 2018 CS 551, Fall
More informationSTATE UNIVERSITY OF NEW YORK AT ALBANY Department of Economics
STATE UNIVERSITY OF NEW YORK AT ALBANY Department of Economics Ph. D. Comprehensive Examination: Macroeconomics Fall, 202 Answer Key to Section 2 Questions Section. (Suggested Time: 45 Minutes) For 3 of
More informationCSci 8980: Advanced Topics in Graphical Models Gaussian Processes
CSci 8980: Advanced Topics in Graphical Models Gaussian Processes Instructor: Arindam Banerjee November 15, 2007 Gaussian Processes Outline Gaussian Processes Outline Parametric Bayesian Regression Gaussian
More informationFor more information about how to cite these materials visit
Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/
More informationModel Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao
Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics Jiti Gao Department of Statistics School of Mathematics and Statistics The University of Western Australia Crawley
More informationLabor-Supply Shifts and Economic Fluctuations. Technical Appendix
Labor-Supply Shifts and Economic Fluctuations Technical Appendix Yongsung Chang Department of Economics University of Pennsylvania Frank Schorfheide Department of Economics University of Pennsylvania January
More informationOverfitting, Bias / Variance Analysis
Overfitting, Bias / Variance Analysis Professor Ameet Talwalkar Professor Ameet Talwalkar CS260 Machine Learning Algorithms February 8, 207 / 40 Outline Administration 2 Review of last lecture 3 Basic
More informationLecture : Probabilistic Machine Learning
Lecture : Probabilistic Machine Learning Riashat Islam Reasoning and Learning Lab McGill University September 11, 2018 ML : Many Methods with Many Links Modelling Views of Machine Learning Machine Learning
More informationLeast Squares Regression
E0 70 Machine Learning Lecture 4 Jan 7, 03) Least Squares Regression Lecturer: Shivani Agarwal Disclaimer: These notes are a brief summary of the topics covered in the lecture. They are not a substitute
More informationIncentives and Nutrition for Rotten Kids: Intrahousehold Food Allocation in the Philippines
Incentives and Nutrition for Rotten Kids: Intrahousehold Food Allocation in the Philippines Pierre Dubois and Ethan Ligon presented by Rachel Heath November 3, 2006 Introduction Outline Introduction Modification
More informationInference in non-linear time series
Intro LS MLE Other Erik Lindström Centre for Mathematical Sciences Lund University LU/LTH & DTU Intro LS MLE Other General Properties Popular estimatiors Overview Introduction General Properties Estimators
More informationBayesian Inference for DSGE Models. Lawrence J. Christiano
Bayesian Inference for DSGE Models Lawrence J. Christiano Outline State space-observer form. convenient for model estimation and many other things. Bayesian inference Bayes rule. Monte Carlo integation.
More informationEstimation Tasks. Short Course on Image Quality. Matthew A. Kupinski. Introduction
Estimation Tasks Short Course on Image Quality Matthew A. Kupinski Introduction Section 13.3 in B&M Keep in mind the similarities between estimation and classification Image-quality is a statistical concept
More informationAdvanced Macroeconomics
Advanced Macroeconomics The Ramsey Model Marcin Kolasa Warsaw School of Economics Marcin Kolasa (WSE) Ad. Macro - Ramsey model 1 / 30 Introduction Authors: Frank Ramsey (1928), David Cass (1965) and Tjalling
More information(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1.
Problem 1 (21 points) An economist runs the regression y i = β 0 + x 1i β 1 + x 2i β 2 + x 3i β 3 + ε i (1) The results are summarized in the following table: Equation 1. Variable Coefficient Std. Error
More informationAsymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½
University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 1998 Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½ Lawrence D. Brown University
More informationCOMP90051 Statistical Machine Learning
COMP90051 Statistical Machine Learning Semester 2, 2017 Lecturer: Trevor Cohn 17. Bayesian inference; Bayesian regression Training == optimisation (?) Stages of learning & inference: Formulate model Regression
More informationLeast Absolute Shrinkage is Equivalent to Quadratic Penalization
Least Absolute Shrinkage is Equivalent to Quadratic Penalization Yves Grandvalet Heudiasyc, UMR CNRS 6599, Université de Technologie de Compiègne, BP 20.529, 60205 Compiègne Cedex, France Yves.Grandvalet@hds.utc.fr
More information> DEPARTMENT OF MATHEMATICS AND COMPUTER SCIENCE GRAVIS 2016 BASEL. Logistic Regression. Pattern Recognition 2016 Sandro Schönborn University of Basel
Logistic Regression Pattern Recognition 2016 Sandro Schönborn University of Basel Two Worlds: Probabilistic & Algorithmic We have seen two conceptual approaches to classification: data class density estimation
More informationSemi-Nonparametric Inferences for Massive Data
Semi-Nonparametric Inferences for Massive Data Guang Cheng 1 Department of Statistics Purdue University Statistics Seminar at NCSU October, 2015 1 Acknowledge NSF, Simons Foundation and ONR. A Joint Work
More informationStochastic process for macro
Stochastic process for macro Tianxiao Zheng SAIF 1. Stochastic process The state of a system {X t } evolves probabilistically in time. The joint probability distribution is given by Pr(X t1, t 1 ; X t2,
More informationModel Selection and Geometry
Model Selection and Geometry Pascal Massart Université Paris-Sud, Orsay Leipzig, February Purpose of the talk! Concentration of measure plays a fundamental role in the theory of model selection! Model
More information... Econometric Methods for the Analysis of Dynamic General Equilibrium Models
... Econometric Methods for the Analysis of Dynamic General Equilibrium Models 1 Overview Multiple Equation Methods State space-observer form Three Examples of Versatility of state space-observer form:
More informationIntroduction: structural econometrics. Jean-Marc Robin
Introduction: structural econometrics Jean-Marc Robin Abstract 1. Descriptive vs structural models 2. Correlation is not causality a. Simultaneity b. Heterogeneity c. Selectivity Descriptive models Consider
More informationStatistical techniques for data analysis in Cosmology
Statistical techniques for data analysis in Cosmology arxiv:0712.3028; arxiv:0911.3105 Numerical recipes (the bible ) Licia Verde ICREA & ICC UB-IEEC http://icc.ub.edu/~liciaverde outline Lecture 1: Introduction
More informationGaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012
Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature
More informationRidge regression. Patrick Breheny. February 8. Penalized regression Ridge regression Bayesian interpretation
Patrick Breheny February 8 Patrick Breheny High-Dimensional Data Analysis (BIOS 7600) 1/27 Introduction Basic idea Standardization Large-scale testing is, of course, a big area and we could keep talking
More informationThe multiple regression model; Indicator variables as regressors
The multiple regression model; Indicator variables as regressors Ragnar Nymoen University of Oslo 28 February 2013 1 / 21 This lecture (#12): Based on the econometric model specification from Lecture 9
More informationThe Generalized Roy Model and Treatment Effects
The Generalized Roy Model and Treatment Effects Christopher Taber University of Wisconsin November 10, 2016 Introduction From Imbens and Angrist we showed that if one runs IV, we get estimates of the Local
More informationFinancial Econometrics
Financial Econometrics Estimation and Inference Gerald P. Dwyer Trinity College, Dublin January 2013 Who am I? Visiting Professor and BB&T Scholar at Clemson University Federal Reserve Bank of Atlanta
More informationNeutron inverse kinetics via Gaussian Processes
Neutron inverse kinetics via Gaussian Processes P. Picca Politecnico di Torino, Torino, Italy R. Furfaro University of Arizona, Tucson, Arizona Outline Introduction Review of inverse kinetics techniques
More informationBayesian Econometrics
Bayesian Econometrics Christopher A. Sims Princeton University sims@princeton.edu September 20, 2016 Outline I. The difference between Bayesian and non-bayesian inference. II. Confidence sets and confidence
More informationBayesian Inference for DSGE Models. Lawrence J. Christiano
Bayesian Inference for DSGE Models Lawrence J. Christiano Outline State space-observer form. convenient for model estimation and many other things. Preliminaries. Probabilities. Maximum Likelihood. Bayesian
More informationSimple New Keynesian Model without Capital
Simple New Keynesian Model without Capital Lawrence J. Christiano January 5, 2018 Objective Review the foundations of the basic New Keynesian model without capital. Clarify the role of money supply/demand.
More informationApplied Econometrics - QEM Theme 1: Introduction to Econometrics Chapter 1 + Probability Primer + Appendix B in PoE
Applied Econometrics - QEM Theme 1: Introduction to Econometrics Chapter 1 + Probability Primer + Appendix B in PoE Warsaw School of Economics Outline 1. Introduction to econometrics 2. Denition of econometrics
More information