Bayesian Machine Learning in Finance

Size: px
Start display at page:

Download "Bayesian Machine Learning in Finance"

Transcription

1 Bayesian Machine Learning in Finance Patricia(Ning) Ning Computational Finance & Risk Management Dept. of Applied Math Univ. of Washington, Seattle Talk maily based on paper Multivariate Bayesian Structural Time Series Model Forthcoming Journal of Machine Learning Research

2 AI and Big Data Jobs in Hedge Funds

3 AI and Big Data Jobs in Hedge Funds

4 Bayesian Structural Time Series (BSTS) Model Figure: Steven L. Scott Figure: Hal R Varian

5

6 Bayesian Structural Time Series (BSTS) Model BSTS model is a machine learning technique used for feature selection, time series forecasting, nowcasting, inferring causal impact and other. The model consists of three main parts: 1 Kalman filter: The technique for time series decomposition. In this step, a researcher can add different state variables: trend, seasonality, regression, and others. 2 Spike-and-slab method: In this step, the most important regression predictors are selected. 3 Bayesian model averaging: Combining the results and prediction calculation.

7 Multivariate Bayesian Structural Time Series (MBSTS) Model Structural Time Series Models belong to state space models Observation Equation: ỹ t = Z T t α t + ɛ t, ɛ t N m (0, Σ t ), ỹ t : observations, α t : unobserved latent states Transition Equation: α t+1 = T t α t + R t η t, η t N q (0, Q t ), The model matrices Z t, T t, and R t typically contain unknown parameters.

8 MBSTS Model In general, the model in state space form can be written as: ỹ t = µ t + τ t + ω t + ξ t + ɛ t ɛ t iid Nm (0, Σ ɛ ) t = 1, 2..., n. (1) Based on state space form, α t is the collection of these components, namely α t = [ µ T t, τ T t, ω T t, ξt t ] T.

9 MBSTS Model Trend component µ t+1 = µ t + δ t + ũ t, ũ t iid Nm (0, Σ µ ), (2) δ t+1 = D + ρ( δ t D) + ṽ t, ṽ t iid Nm (0, Σ δ ). (3) Seasonality component S i 2 t+1 = τ (i) (i) t k +w τ (i) k=0 t, w t = [w (1) t,, w (m) t ] T iid Nm (0, Σ τ ), (4)

10 MBSTS Model Cyclical effect component ω t+1 = ϱĉos(λ) ω t + ϱŝin(λ) ω t + κ t, κ t iid Nm (0, Σ ω ), ω t+1 = ϱŝin(λ) ω t + ϱĉos(λ) ω t + κ t, κ t iid Nm (0, Σ ω ), (5) where ϱ, ŝin(λ), ĉos(λ) are m m diagonal matrices with diagonal entries equal to ϱ ii, sin(λ ii ) where λ ii = 2π/q i is the frequency with q i being a period such that 0 < λ ii < π, and cos(λ ii ) respectively.

11 MBSTS Model

12 MBSTS Model Regression component ξ (i) t For target series y (i), the x (i) t = β T i x (i) t. (6) = [x (i) t1,..., x (i) tk i ] T is the pool of all available predictors at time t, and β i = [β i1,..., β ij,..., β iki ] T represent corresponding static regression coefficients.

13 MBSTS Model Spike and Slab Regression: In feature selection, a high degree of sparsity is expected, in the sense that the coefficients for the vast majority of predictors will be zero. A natural way to represent sparsity in the Bayesian paradigm is through the spike and slab coefficients. One advantage of working in a fully Bayesian setting is that we do not need to commit to a fixed set of predictors.

14 MBSTS Model Ỹ = M + T + W + X β + Ẽ (7) where Ỹ = vec(y ), M = vec(m), T = vec(t ), W = vec(w ), Ẽ = vec(e), and X, β are written as: X β 1 0 X β 2 X = β = (8) X m β m where X i being n k i matrix, representing all observations of k i candidate predictors for y (i).

15 MBSTS Model Prior distribution and elicitation We define γ ij = 1 if β ij 0, and γ ij = 0 if β ij = 0. Then γ = [γ 1,..., γ m ], where γ i = [γ i1,..., γ iki ]. The spike prior may be written as: γ m k i i=1 j=1 π γ ij ij (1 π ij ) 1 γ ij (9) where π ij is prior inclusion probability of j th predictor for i th response series.

16 MBSTS Model Prior distribution and elicitation A simple slab prior specification is to make β and Σ ɛ prior independent: p(β, Σ ɛ, γ) = p(β γ)p(σ ɛ γ)p(γ) β γ N K (b γ, A 1 γ ) Σ ɛ γ IW (v 0, V 0 ) (10) where b γ is the vector of prior means with the same dimension as β γ, and A γ is the full-model prior information matrix.

17 MBSTS Model By the law of total probability, the full likelihood function is given by p(ỹ, β, Σ ɛ, γ) = p(ỹ β, Σ ɛ, γ) p(β γ) p(σ ɛ γ) p(γ), (11) ( p(ỹ β, Σ ɛ, γ) Σ ɛ n/2 exp 1 2 (Ỹ X γ β γ ) T (Σ 1 ɛ I n )(Ỹ X γ β γ ) ( (12) p(β γ) A γ 1/2 exp 1 ) 2 (β γ b γ ) T A γ (β γ b γ ), (13) ( p(σ ɛ γ) Σ ɛ (v0+m+1)/2 exp tr( 1 ) 2 V 0Σ 1 ɛ ), (14) where Ỹ = Ỹ M T W is the multiple response series Ỹ with time series components subtracted out.

18 MBSTS Model Posterior Inference β Ŷ, Σ ɛ, γ N K ( β γ, ( ˆX T γ ˆX γ + A γ ) 1 ). (15) Σ ɛ Ỹ, β, γ IW (v 0 + n, E T γ E γ + V 0 ). (16) p(γ Σ ɛ, Ỹ ) =C(Σ ɛ, Ỹ A γ 1/2 p(γ) ) ˆX γ T ˆX γ + A γ 1/2 ( exp 1 ) 2 {bt γ A γ b γ Zγ T ( ˆX γ T ˆX γ + A γ ) 1 Z γ }. (17) Σ µ,δ,τ,ω µ, δ, τ, ω IW (w µ,δ,τ,ω + n, W µ,δ,τ,ω + AA T ). (18)

19 MBSTS Model Markov Chain Monte Carlo MCMC methods are a class of algorithms to sample from a probability distributions ((15), (16), (17) and (18)) based on constructing a Markov chain that has the desired distribution as its equilibrium distribution. The state of the chain after a number of steps is then used as a sample from the desired distribution.

20 Model Training Let θ = (Σ µ, Σ δ, Σ τ, Σ ω ) denote the set of state component parameters. Looping through the five steps yields a sequence of draws ψ = (α, θ, γ, Σ ɛ, β) from a Markov chain with stationary distribution p( ψ Y ), the posterior distribution of ψ given Y.

21 MBSTS Model Training 1: Draw the latent state α = ( µ, δ, τ, ω) from given model parameters and Ỹ, namely p(α Ỹ, θ, γ, Σ ɛ, β), using the posterior simulation algorithm Durbin and Koopman (2002). 2: Draw time series state component parameters θ given α, namely simulating θ p(θ Ỹ, α) based on equation (18). 3: Loop over i in an random order, draw each γ i γ i, Ỹ, α, Σ ɛ, namely simulating γ p(γ Ỹ, Σ ɛ ) one by one based on equation (17), using the stochastic search variable selection (SSVS) algorithm from George and McCulloch (1997). 4: Draw β given Σ ɛ, γ, α and Ỹ, namely simulating β p(β Σ ɛ, γ, Ỹ ) based on equation (15). 5: Draw Σ ɛ given γ, α, β and Ỹ, namely simulating Σ ɛ p(σ ɛ γ, Ỹ, β) based on equation (16).

22 Target Series Forecasting Let Ŷ represents the set of values to be forecast. The posterior predictive distribution of Ŷ can be expressed as follows: p(ŷ Y ) = p(ŷ ψ)p( ψ Y )d ψ, (19) where ψ is the set of all model parameters and latent states randomly drawn from p( ψ Y ), then we can draw samples of Ŷ from p(ŷ ψ).

23 Empirical Analysis Data: Daily stock price of Bank of America (BOA), Capital One Financial Corporation (COF), J.P. Morgan (JPM) and Wells Fargo (WFC). Time horizon: 11/27/ /03/2017 Source: Google Finance. Purpose: Trade when its future price is predicted to vary more than p%. Goal: Forecast the trend of stock movement in the next k(= 5) days.

24 We approximate the daily average price as: Pt = (C t + H t + L t )/3, where C t, H t and L t are the close, high, and low quotes for day t respectively. However, instead of using the arithmetic returns, we are interested in the log return V t defined as V t = {log( P t+j /C t )} k j=1. We consider the indicator variable y t = max{v V t }, the maximum value of log returns over the next k days.

25

26 Fundamental analysis claims that markets may incorrectly price a security in the short run but will eventually correct it. Trend Abbr. Trend Abbr. Advertising & marketing advert Air travel airtvl Auto buyers auto Auto financing autoby Automotive autofi Business & industrial bizind Bankruptcy bnkrpt Commercial Lending comlnd Computers & electronics comput Construction constr Credit cards crcard Durable goods durble Education educat Finance & investing invest Financial planning finpln Furniture furntr Insurance insur Jobs jobs Luxury goods luxury Mobile & wireless mobile Mortgage mtge Real estate rlest Rental rental Shopping shop Small business smallbiz Travel travel Unemployment unempl Table: Google domestic trends

27 Technical analysis claims that useful information is already reflected in the price of a security. We selected a representative set of technical indicators to capture the volatility, momentum and trend, close location value, and potential reversals of each stock. Variable Chaikin volatility Yang and Zhang Volatility historical estimator Arms Ease of Movement Value Moving Average Convergence/Divergence Money Flow Index Aroon Indicator Parabolic Stop-and-Reverse Close Location Value Abbr. ChaVol Vol EMV MACD MFI AROON SAR CLV Table: Stock Technical Predictors

28 Figure: True and fitted values of max log return from 11/27/2006 to 10/20/2017 (BOA)

29

30 Empirical posterior inclusion probability for the most likely predictors of max log return. Figure: Bank of America Corp. Figure: Capital One Financial Corp.

31 Empirical posterior inclusion probability for the most likely predictors of max log return. Figure: JPMorgan Chase & Co. Figure: Wells Fargo & Co.

32 Cumulative absolute one-step-ahead prediction error. Figure: All Predictors Without Deaseasonal Figure: Partial Predictors With Deaseasonal

33 One-step-ahead predction of max log return. Figure: Bank of America Corp. Figure: JPMorgan Chase & Co.

34 One-step-ahead predction of max log return. Figure: JPMorgan Chase & Co. Figure: Wells Fargo & Co.

35 Thank you!

Bayesian Variable Selection for Nowcasting Time Series

Bayesian Variable Selection for Nowcasting Time Series Bayesian Variable Selection for Time Series Steve Scott Hal Varian Google August 14, 2013 What day of the week are there the most searches for [hangover]? 1. Sunday 2. Monday 3. Tuesday 4. Wednesday 5.

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and

More information

Riemann Manifold Methods in Bayesian Statistics

Riemann Manifold Methods in Bayesian Statistics Ricardo Ehlers ehlers@icmc.usp.br Applied Maths and Stats University of São Paulo, Brazil Working Group in Statistical Learning University College Dublin September 2015 Bayesian inference is based on Bayes

More information

Generalized Autoregressive Score Models

Generalized Autoregressive Score Models Generalized Autoregressive Score Models by: Drew Creal, Siem Jan Koopman, André Lucas To capture the dynamic behavior of univariate and multivariate time series processes, we can allow parameters to be

More information

Identifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data

Identifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data Identifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data Michael Ellington and Costas Milas Financial Services, Liquidity and Economic Activity Bank of England May

More information

Calibration of Stochastic Volatility Models using Particle Markov Chain Monte Carlo Methods

Calibration of Stochastic Volatility Models using Particle Markov Chain Monte Carlo Methods Calibration of Stochastic Volatility Models using Particle Markov Chain Monte Carlo Methods Jonas Hallgren 1 1 Department of Mathematics KTH Royal Institute of Technology Stockholm, Sweden BFS 2012 June

More information

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017 Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent

More information

Stochastic Processes

Stochastic Processes Stochastic Processes Stochastic Process Non Formal Definition: Non formal: A stochastic process (random process) is the opposite of a deterministic process such as one defined by a differential equation.

More information

Estimating and Accounting for the Output Gap with Large Bayesian Vector Autoregressions

Estimating and Accounting for the Output Gap with Large Bayesian Vector Autoregressions Estimating and Accounting for the Output Gap with Large Bayesian Vector Autoregressions James Morley 1 Benjamin Wong 2 1 University of Sydney 2 Reserve Bank of New Zealand The view do not necessarily represent

More information

Probabilities & Statistics Revision

Probabilities & Statistics Revision Probabilities & Statistics Revision Christopher Ting Christopher Ting http://www.mysmu.edu/faculty/christophert/ : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 January 6, 2017 Christopher Ting QF

More information

Online appendix to On the stability of the excess sensitivity of aggregate consumption growth in the US

Online appendix to On the stability of the excess sensitivity of aggregate consumption growth in the US Online appendix to On the stability of the excess sensitivity of aggregate consumption growth in the US Gerdie Everaert 1, Lorenzo Pozzi 2, and Ruben Schoonackers 3 1 Ghent University & SHERPPA 2 Erasmus

More information

Model-based trend-cycle decompositions. with time-varying parameters

Model-based trend-cycle decompositions. with time-varying parameters Model-based trend-cycle decompositions with time-varying parameters Siem Jan Koopman Kai Ming Lee Soon Yip Wong s.j.koopman@ klee@ s.wong@ feweb.vu.nl Department of Econometrics Vrije Universiteit Amsterdam

More information

MCMC algorithms for fitting Bayesian models

MCMC algorithms for fitting Bayesian models MCMC algorithms for fitting Bayesian models p. 1/1 MCMC algorithms for fitting Bayesian models Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota MCMC algorithms for fitting Bayesian models

More information

STAT 518 Intro Student Presentation

STAT 518 Intro Student Presentation STAT 518 Intro Student Presentation Wen Wei Loh April 11, 2013 Title of paper Radford M. Neal [1999] Bayesian Statistics, 6: 475-501, 1999 What the paper is about Regression and Classification Flexible

More information

Labor-Supply Shifts and Economic Fluctuations. Technical Appendix

Labor-Supply Shifts and Economic Fluctuations. Technical Appendix Labor-Supply Shifts and Economic Fluctuations Technical Appendix Yongsung Chang Department of Economics University of Pennsylvania Frank Schorfheide Department of Economics University of Pennsylvania January

More information

DSGE Model Forecasting

DSGE Model Forecasting University of Pennsylvania EABCN Training School May 1, 216 Introduction The use of DSGE models at central banks has triggered a strong interest in their forecast performance. The subsequent material draws

More information

Modeling conditional distributions with mixture models: Applications in finance and financial decision-making

Modeling conditional distributions with mixture models: Applications in finance and financial decision-making Modeling conditional distributions with mixture models: Applications in finance and financial decision-making John Geweke University of Iowa, USA Journal of Applied Econometrics Invited Lecture Università

More information

Bayesian Compressed Vector Autoregressions

Bayesian Compressed Vector Autoregressions Bayesian Compressed Vector Autoregressions Gary Koop a, Dimitris Korobilis b, and Davide Pettenuzzo c a University of Strathclyde b University of Glasgow c Brandeis University 9th ECB Workshop on Forecasting

More information

The Metropolis-Hastings Algorithm. June 8, 2012

The Metropolis-Hastings Algorithm. June 8, 2012 The Metropolis-Hastings Algorithm June 8, 22 The Plan. Understand what a simulated distribution is 2. Understand why the Metropolis-Hastings algorithm works 3. Learn how to apply the Metropolis-Hastings

More information

Dynamic Matrix-Variate Graphical Models A Synopsis 1

Dynamic Matrix-Variate Graphical Models A Synopsis 1 Proc. Valencia / ISBA 8th World Meeting on Bayesian Statistics Benidorm (Alicante, Spain), June 1st 6th, 2006 Dynamic Matrix-Variate Graphical Models A Synopsis 1 Carlos M. Carvalho & Mike West ISDS, Duke

More information

Warwick Business School Forecasting System. Summary. Ana Galvao, Anthony Garratt and James Mitchell November, 2014

Warwick Business School Forecasting System. Summary. Ana Galvao, Anthony Garratt and James Mitchell November, 2014 Warwick Business School Forecasting System Summary Ana Galvao, Anthony Garratt and James Mitchell November, 21 The main objective of the Warwick Business School Forecasting System is to provide competitive

More information

On Bayesian Computation

On Bayesian Computation On Bayesian Computation Michael I. Jordan with Elaine Angelino, Maxim Rabinovich, Martin Wainwright and Yun Yang Previous Work: Information Constraints on Inference Minimize the minimax risk under constraints

More information

Probabilistic Graphical Models

Probabilistic Graphical Models 2016 Robert Nowak Probabilistic Graphical Models 1 Introduction We have focused mainly on linear models for signals, in particular the subspace model x = Uθ, where U is a n k matrix and θ R k is a vector

More information

Variational Inference via Stochastic Backpropagation

Variational Inference via Stochastic Backpropagation Variational Inference via Stochastic Backpropagation Kai Fan February 27, 2016 Preliminaries Stochastic Backpropagation Variational Auto-Encoding Related Work Summary Outline Preliminaries Stochastic Backpropagation

More information

Course 495: Advanced Statistical Machine Learning/Pattern Recognition

Course 495: Advanced Statistical Machine Learning/Pattern Recognition Course 495: Advanced Statistical Machine Learning/Pattern Recognition Lecturer: Stefanos Zafeiriou Goal (Lectures): To present discrete and continuous valued probabilistic linear dynamical systems (HMMs

More information

Markov Chain Monte Carlo

Markov Chain Monte Carlo Markov Chain Monte Carlo Recall: To compute the expectation E ( h(y ) ) we use the approximation E(h(Y )) 1 n n h(y ) t=1 with Y (1),..., Y (n) h(y). Thus our aim is to sample Y (1),..., Y (n) from f(y).

More information

Non-linear Supervised High Frequency Trading Strategies with Applications in US Equity Markets

Non-linear Supervised High Frequency Trading Strategies with Applications in US Equity Markets Non-linear Supervised High Frequency Trading Strategies with Applications in US Equity Markets Nan Zhou, Wen Cheng, Ph.D. Associate, Quantitative Research, J.P. Morgan nan.zhou@jpmorgan.com The 4th Annual

More information

A Guide to Modern Econometric:

A Guide to Modern Econometric: A Guide to Modern Econometric: 4th edition Marno Verbeek Rotterdam School of Management, Erasmus University, Rotterdam B 379887 )WILEY A John Wiley & Sons, Ltd., Publication Contents Preface xiii 1 Introduction

More information

Dynamic Factor Models and Factor Augmented Vector Autoregressions. Lawrence J. Christiano

Dynamic Factor Models and Factor Augmented Vector Autoregressions. Lawrence J. Christiano Dynamic Factor Models and Factor Augmented Vector Autoregressions Lawrence J Christiano Dynamic Factor Models and Factor Augmented Vector Autoregressions Problem: the time series dimension of data is relatively

More information

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology FE670 Algorithmic Trading Strategies Lecture 3. Factor Models and Their Estimation Steve Yang Stevens Institute of Technology 09/12/2012 Outline 1 The Notion of Factors 2 Factor Analysis via Maximum Likelihood

More information

Time-varying sparsity in dynamic regression models

Time-varying sparsity in dynamic regression models Time-varying sparsity in dynamic regression models Professor Jim Griffin (joint work with Maria Kalli, Canterbury Christ Church University) University of Kent Regression models Often we are interested

More information

High-dimensional Problems in Finance and Economics. Thomas M. Mertens

High-dimensional Problems in Finance and Economics. Thomas M. Mertens High-dimensional Problems in Finance and Economics Thomas M. Mertens NYU Stern Risk Economics Lab April 17, 2012 1 / 78 Motivation Many problems in finance and economics are high dimensional. Dynamic Optimization:

More information

Modeling conditional distributions with mixture models: Theory and Inference

Modeling conditional distributions with mixture models: Theory and Inference Modeling conditional distributions with mixture models: Theory and Inference John Geweke University of Iowa, USA Journal of Applied Econometrics Invited Lecture Università di Venezia Italia June 2, 2005

More information

Switching Regime Estimation

Switching Regime Estimation Switching Regime Estimation Series de Tiempo BIrkbeck March 2013 Martin Sola (FE) Markov Switching models 01/13 1 / 52 The economy (the time series) often behaves very different in periods such as booms

More information

Supplementary Note on Bayesian analysis

Supplementary Note on Bayesian analysis Supplementary Note on Bayesian analysis Structured variability of muscle activations supports the minimal intervention principle of motor control Francisco J. Valero-Cuevas 1,2,3, Madhusudhan Venkadesan

More information

Bayesian Estimation of DSGE Models

Bayesian Estimation of DSGE Models Bayesian Estimation of DSGE Models Stéphane Adjemian Université du Maine, GAINS & CEPREMAP stephane.adjemian@univ-lemans.fr http://www.dynare.org/stepan June 28, 2011 June 28, 2011 Université du Maine,

More information

Modeling Ultra-High-Frequency Multivariate Financial Data by Monte Carlo Simulation Methods

Modeling Ultra-High-Frequency Multivariate Financial Data by Monte Carlo Simulation Methods Outline Modeling Ultra-High-Frequency Multivariate Financial Data by Monte Carlo Simulation Methods Ph.D. Student: Supervisor: Marco Minozzo Dipartimento di Scienze Economiche Università degli Studi di

More information

ECO 513 Fall 2008 C.Sims KALMAN FILTER. s t = As t 1 + ε t Measurement equation : y t = Hs t + ν t. u t = r t. u 0 0 t 1 + y t = [ H I ] u t.

ECO 513 Fall 2008 C.Sims KALMAN FILTER. s t = As t 1 + ε t Measurement equation : y t = Hs t + ν t. u t = r t. u 0 0 t 1 + y t = [ H I ] u t. ECO 513 Fall 2008 C.Sims KALMAN FILTER Model in the form 1. THE KALMAN FILTER Plant equation : s t = As t 1 + ε t Measurement equation : y t = Hs t + ν t. Var(ε t ) = Ω, Var(ν t ) = Ξ. ε t ν t and (ε t,

More information

Gibbs Sampling in Linear Models #2

Gibbs Sampling in Linear Models #2 Gibbs Sampling in Linear Models #2 Econ 690 Purdue University Outline 1 Linear Regression Model with a Changepoint Example with Temperature Data 2 The Seemingly Unrelated Regressions Model 3 Gibbs sampling

More information

The Effects of Monetary Policy on Stock Market Bubbles: Some Evidence

The Effects of Monetary Policy on Stock Market Bubbles: Some Evidence The Effects of Monetary Policy on Stock Market Bubbles: Some Evidence Jordi Gali Luca Gambetti ONLINE APPENDIX The appendix describes the estimation of the time-varying coefficients VAR model. The model

More information

SUPPLEMENT TO MARKET ENTRY COSTS, PRODUCER HETEROGENEITY, AND EXPORT DYNAMICS (Econometrica, Vol. 75, No. 3, May 2007, )

SUPPLEMENT TO MARKET ENTRY COSTS, PRODUCER HETEROGENEITY, AND EXPORT DYNAMICS (Econometrica, Vol. 75, No. 3, May 2007, ) Econometrica Supplementary Material SUPPLEMENT TO MARKET ENTRY COSTS, PRODUCER HETEROGENEITY, AND EXPORT DYNAMICS (Econometrica, Vol. 75, No. 3, May 2007, 653 710) BY SANGHAMITRA DAS, MARK ROBERTS, AND

More information

Probabilistic Reasoning in Deep Learning

Probabilistic Reasoning in Deep Learning Probabilistic Reasoning in Deep Learning Dr Konstantina Palla, PhD palla@stats.ox.ac.uk September 2017 Deep Learning Indaba, Johannesburgh Konstantina Palla 1 / 39 OVERVIEW OF THE TALK Basics of Bayesian

More information

Linear Dynamical Systems (Kalman filter)

Linear Dynamical Systems (Kalman filter) Linear Dynamical Systems (Kalman filter) (a) Overview of HMMs (b) From HMMs to Linear Dynamical Systems (LDS) 1 Markov Chains with Discrete Random Variables x 1 x 2 x 3 x T Let s assume we have discrete

More information

Introduction to Machine Learning

Introduction to Machine Learning Introduction to Machine Learning Brown University CSCI 1950-F, Spring 2012 Prof. Erik Sudderth Lecture 25: Markov Chain Monte Carlo (MCMC) Course Review and Advanced Topics Many figures courtesy Kevin

More information

BAGUS: Bayesian Regularization for Graphical Models with Unequal Shrinkage

BAGUS: Bayesian Regularization for Graphical Models with Unequal Shrinkage BAGUS: Bayesian Regularization for Graphical Models with Unequal Shrinkage Lingrui Gan, Naveen N. Narisetty, Feng Liang Department of Statistics University of Illinois at Urbana-Champaign Problem Statement

More information

Estimating Macroeconomic Models: A Likelihood Approach

Estimating Macroeconomic Models: A Likelihood Approach Estimating Macroeconomic Models: A Likelihood Approach Jesús Fernández-Villaverde University of Pennsylvania, NBER, and CEPR Juan Rubio-Ramírez Federal Reserve Bank of Atlanta Estimating Dynamic Macroeconomic

More information

The profit function system with output- and input- specific technical efficiency

The profit function system with output- and input- specific technical efficiency The profit function system with output- and input- specific technical efficiency Mike G. Tsionas December 19, 2016 Abstract In a recent paper Kumbhakar and Lai (2016) proposed an output-oriented non-radial

More information

State-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Fin. Econometrics / 53

State-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Fin. Econometrics / 53 State-space Model Eduardo Rossi University of Pavia November 2014 Rossi State-space Model Fin. Econometrics - 2014 1 / 53 Outline 1 Motivation 2 Introduction 3 The Kalman filter 4 Forecast errors 5 State

More information

2.5 Forecasting and Impulse Response Functions

2.5 Forecasting and Impulse Response Functions 2.5 Forecasting and Impulse Response Functions Principles of forecasting Forecast based on conditional expectations Suppose we are interested in forecasting the value of y t+1 based on a set of variables

More information

A short introduction to INLA and R-INLA

A short introduction to INLA and R-INLA A short introduction to INLA and R-INLA Integrated Nested Laplace Approximation Thomas Opitz, BioSP, INRA Avignon Workshop: Theory and practice of INLA and SPDE November 7, 2018 2/21 Plan for this talk

More information

Bayesian Model Comparison:

Bayesian Model Comparison: Bayesian Model Comparison: Modeling Petrobrás log-returns Hedibert Freitas Lopes February 2014 Log price: y t = log p t Time span: 12/29/2000-12/31/2013 (n = 3268 days) LOG PRICE 1 2 3 4 0 500 1000 1500

More information

Multivariate forecasting with VAR models

Multivariate forecasting with VAR models Multivariate forecasting with VAR models Franz Eigner University of Vienna UK Econometric Forecasting Prof. Robert Kunst 16th June 2009 Overview Vector autoregressive model univariate forecasting multivariate

More information

Bayesian Analysis of Risk for Data Mining Based on Empirical Likelihood

Bayesian Analysis of Risk for Data Mining Based on Empirical Likelihood 1 / 29 Bayesian Analysis of Risk for Data Mining Based on Empirical Likelihood Yuan Liao Wenxin Jiang Northwestern University Presented at: Department of Statistics and Biostatistics Rutgers University

More information

State Space Model, Official Statistics, Bayesian Statistics Introduction to Durbin s paper and related topics

State Space Model, Official Statistics, Bayesian Statistics Introduction to Durbin s paper and related topics State Space Model, Official Statistics, Bayesian Statistics Introduction to Durbin s paper and related topics Yasuto Yoshizoe Aoyama Gakuin University yoshizoe@econ.aoyama.ac.jp yasuto yoshizoe@post.harvard.edu

More information

Introduction to Bayesian Inference

Introduction to Bayesian Inference University of Pennsylvania EABCN Training School May 10, 2016 Bayesian Inference Ingredients of Bayesian Analysis: Likelihood function p(y φ) Prior density p(φ) Marginal data density p(y ) = p(y φ)p(φ)dφ

More information

Tools for Google Data. Hal Varian Google July 2015

Tools for Google Data. Hal Varian Google July 2015 Tools for Google Data Hal Varian Google July 2015 1 Google Trends Google Correlate Google Consumer Surveys 2 Searches for [hangover] Which day of the week are there the most searches for [hangover]? 1:

More information

Approximate Bayesian Computation and Particle Filters

Approximate Bayesian Computation and Particle Filters Approximate Bayesian Computation and Particle Filters Dennis Prangle Reading University 5th February 2014 Introduction Talk is mostly a literature review A few comments on my own ongoing research See Jasra

More information

Monetary and Exchange Rate Policy Under Remittance Fluctuations. Technical Appendix and Additional Results

Monetary and Exchange Rate Policy Under Remittance Fluctuations. Technical Appendix and Additional Results Monetary and Exchange Rate Policy Under Remittance Fluctuations Technical Appendix and Additional Results Federico Mandelman February In this appendix, I provide technical details on the Bayesian estimation.

More information

Point, Interval, and Density Forecast Evaluation of Linear versus Nonlinear DSGE Models

Point, Interval, and Density Forecast Evaluation of Linear versus Nonlinear DSGE Models Point, Interval, and Density Forecast Evaluation of Linear versus Nonlinear DSGE Models Francis X. Diebold Frank Schorfheide Minchul Shin University of Pennsylvania May 4, 2014 1 / 33 Motivation The use

More information

Bayesian spatial hierarchical modeling for temperature extremes

Bayesian spatial hierarchical modeling for temperature extremes Bayesian spatial hierarchical modeling for temperature extremes Indriati Bisono Dr. Andrew Robinson Dr. Aloke Phatak Mathematics and Statistics Department The University of Melbourne Maths, Informatics

More information

Estimation of a bivariate hierarchical Ornstein-Uhlenbeck model for longitudinal data

Estimation of a bivariate hierarchical Ornstein-Uhlenbeck model for longitudinal data Estimation of a bivariate hierarchical Ornstein-Uhlenbeck model for longitudinal data Zita Oravecz & Francis Tuerlinckx (E-mail to: ) IAP workshop, Ghent 19th September, 2008 1 2 The latent OU model 3

More information

Bayesian Dynamic Linear Modelling for. Complex Computer Models

Bayesian Dynamic Linear Modelling for. Complex Computer Models Bayesian Dynamic Linear Modelling for Complex Computer Models Fei Liu, Liang Zhang, Mike West Abstract Computer models may have functional outputs. With no loss of generality, we assume that a single computer

More information

Learning Static Parameters in Stochastic Processes

Learning Static Parameters in Stochastic Processes Learning Static Parameters in Stochastic Processes Bharath Ramsundar December 14, 2012 1 Introduction Consider a Markovian stochastic process X T evolving (perhaps nonlinearly) over time variable T. We

More information

Bagging and Forecasting in Nonlinear Dynamic Models

Bagging and Forecasting in Nonlinear Dynamic Models DBJ Discussion Paper Series, No.0905 Bagging and Forecasting in Nonlinear Dynamic Models Mari Sakudo (Research Institute of Capital Formation, Development Bank of Japan, and Department of Economics, Sophia

More information

Information and Credit Risk

Information and Credit Risk Information and Credit Risk M. L. Bedini Université de Bretagne Occidentale, Brest - Friedrich Schiller Universität, Jena Jena, March 2011 M. L. Bedini (Université de Bretagne Occidentale, Brest Information

More information

Nowcasting Norwegian GDP

Nowcasting Norwegian GDP Nowcasting Norwegian GDP Knut Are Aastveit and Tørres Trovik May 13, 2007 Introduction Motivation The last decades of advances in information technology has made it possible to access a huge amount of

More information

Collaborative Nowcasting for Contextual Recommendation

Collaborative Nowcasting for Contextual Recommendation Collaborative for Contextual Recommendation Yu Sun 1, Nicholas Jing Yuan 2, Xing Xie 3, Kieran McDonald 4, Rui Zhang 5 University of Melbourne { 1 sun.y, 5 rui.zhang}@unimelb.edu.au Microsoft Research

More information

Bayesian Inference for DSGE Models. Lawrence J. Christiano

Bayesian Inference for DSGE Models. Lawrence J. Christiano Bayesian Inference for DSGE Models Lawrence J. Christiano Outline State space-observer form. convenient for model estimation and many other things. Bayesian inference Bayes rule. Monte Carlo integation.

More information

Bayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework

Bayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework HT5: SC4 Statistical Data Mining and Machine Learning Dino Sejdinovic Department of Statistics Oxford http://www.stats.ox.ac.uk/~sejdinov/sdmml.html Maximum Likelihood Principle A generative model for

More information

Bayesian Linear Regression

Bayesian Linear Regression Bayesian Linear Regression Sudipto Banerjee 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. September 15, 2010 1 Linear regression models: a Bayesian perspective

More information

Applications of Random Matrix Theory to Economics, Finance and Political Science

Applications of Random Matrix Theory to Economics, Finance and Political Science Outline Applications of Random Matrix Theory to Economics, Finance and Political Science Matthew C. 1 1 Department of Economics, MIT Institute for Quantitative Social Science, Harvard University SEA 06

More information

Identifying Financial Risk Factors

Identifying Financial Risk Factors Identifying Financial Risk Factors with a Low-Rank Sparse Decomposition Lisa Goldberg Alex Shkolnik Berkeley Columbia Meeting in Engineering and Statistics 24 March 2016 Outline 1 A Brief History of Factor

More information

Default Priors and Effcient Posterior Computation in Bayesian

Default Priors and Effcient Posterior Computation in Bayesian Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature

More information

Approximating high-dimensional posteriors with nuisance parameters via integrated rotated Gaussian approximation (IRGA)

Approximating high-dimensional posteriors with nuisance parameters via integrated rotated Gaussian approximation (IRGA) Approximating high-dimensional posteriors with nuisance parameters via integrated rotated Gaussian approximation (IRGA) Willem van den Boom Department of Statistics and Applied Probability National University

More information

Lecture 4: Dynamic models

Lecture 4: Dynamic models linear s Lecture 4: s Hedibert Freitas Lopes The University of Chicago Booth School of Business 5807 South Woodlawn Avenue, Chicago, IL 60637 http://faculty.chicagobooth.edu/hedibert.lopes hlopes@chicagobooth.edu

More information

Measuring interconnectedness between financial institutions with Bayesian time-varying VARs

Measuring interconnectedness between financial institutions with Bayesian time-varying VARs Measuring interconnectedness between financial institutions with Bayesian time-varying VARs Financial Risk & Network Theory Marco Valerio Geraci 1,2 Jean-Yves Gnabo 2 1 ECARES, Université libre de Bruxelles

More information

Kazuhiko Kakamu Department of Economics Finance, Institute for Advanced Studies. Abstract

Kazuhiko Kakamu Department of Economics Finance, Institute for Advanced Studies. Abstract Bayesian Estimation of A Distance Functional Weight Matrix Model Kazuhiko Kakamu Department of Economics Finance, Institute for Advanced Studies Abstract This paper considers the distance functional weight

More information

Research Division Federal Reserve Bank of St. Louis Working Paper Series

Research Division Federal Reserve Bank of St. Louis Working Paper Series Research Division Federal Reserve Bank of St Louis Working Paper Series Kalman Filtering with Truncated Normal State Variables for Bayesian Estimation of Macroeconomic Models Michael Dueker Working Paper

More information

Conjugate Analysis for the Linear Model

Conjugate Analysis for the Linear Model Conjugate Analysis for the Linear Model If we have good prior knowledge that can help us specify priors for β and σ 2, we can use conjugate priors. Following the procedure in Christensen, Johnson, Branscum,

More information

Regression: Ordinary Least Squares

Regression: Ordinary Least Squares Regression: Ordinary Least Squares Mark Hendricks Autumn 2017 FINM Intro: Regression Outline Regression OLS Mathematics Linear Projection Hendricks, Autumn 2017 FINM Intro: Regression: Lecture 2/32 Regression

More information

Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, / 1

Bayesian Graphical Models for Structural Vector AutoregressiveMarch Processes 21, / 1 Bayesian Graphical Models for Structural Vector Autoregressive Processes Daniel Ahelegbey, Monica Billio, and Roberto Cassin (2014) March 21, 2015 Bayesian Graphical Models for Structural Vector AutoregressiveMarch

More information

Ergodicity in data assimilation methods

Ergodicity in data assimilation methods Ergodicity in data assimilation methods David Kelly Andy Majda Xin Tong Courant Institute New York University New York NY www.dtbkelly.com April 15, 2016 ETH Zurich David Kelly (CIMS) Data assimilation

More information

Hierarchical Modeling for Univariate Spatial Data

Hierarchical Modeling for Univariate Spatial Data Hierarchical Modeling for Univariate Spatial Data Geography 890, Hierarchical Bayesian Models for Environmental Spatial Data Analysis February 15, 2011 1 Spatial Domain 2 Geography 890 Spatial Domain This

More information

Bayesian Inference. Chapter 4: Regression and Hierarchical Models

Bayesian Inference. Chapter 4: Regression and Hierarchical Models Bayesian Inference Chapter 4: Regression and Hierarchical Models Conchi Ausín and Mike Wiper Department of Statistics Universidad Carlos III de Madrid Master in Business Administration and Quantitative

More information

ABC methods for phase-type distributions with applications in insurance risk problems

ABC methods for phase-type distributions with applications in insurance risk problems ABC methods for phase-type with applications problems Concepcion Ausin, Department of Statistics, Universidad Carlos III de Madrid Joint work with: Pedro Galeano, Universidad Carlos III de Madrid Simon

More information

Machine Learning for OR & FE

Machine Learning for OR & FE Machine Learning for OR & FE Hidden Markov Models Martin Haugh Department of Industrial Engineering and Operations Research Columbia University Email: martin.b.haugh@gmail.com Additional References: David

More information

Combining Macroeconomic Models for Prediction

Combining Macroeconomic Models for Prediction Combining Macroeconomic Models for Prediction John Geweke University of Technology Sydney 15th Australasian Macro Workshop April 8, 2010 Outline 1 Optimal prediction pools 2 Models and data 3 Optimal pools

More information

Accounting for Complex Sample Designs via Mixture Models

Accounting for Complex Sample Designs via Mixture Models Accounting for Complex Sample Designs via Finite Normal Mixture Models 1 1 University of Michigan School of Public Health August 2009 Talk Outline 1 2 Accommodating Sampling Weights in Mixture Models 3

More information

Econ671 Factor Models: Principal Components

Econ671 Factor Models: Principal Components Econ671 Factor Models: Principal Components Jun YU April 8, 2016 Jun YU () Econ671 Factor Models: Principal Components April 8, 2016 1 / 59 Factor Models: Principal Components Learning Objectives 1. Show

More information

Forecast combination and model averaging using predictive measures. Jana Eklund and Sune Karlsson Stockholm School of Economics

Forecast combination and model averaging using predictive measures. Jana Eklund and Sune Karlsson Stockholm School of Economics Forecast combination and model averaging using predictive measures Jana Eklund and Sune Karlsson Stockholm School of Economics 1 Introduction Combining forecasts robustifies and improves on individual

More information

Bayesian Inference. Chapter 4: Regression and Hierarchical Models

Bayesian Inference. Chapter 4: Regression and Hierarchical Models Bayesian Inference Chapter 4: Regression and Hierarchical Models Conchi Ausín and Mike Wiper Department of Statistics Universidad Carlos III de Madrid Advanced Statistics and Data Mining Summer School

More information

Outline. Clustering. Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models

Outline. Clustering. Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models Collaboration with Rudolf Winter-Ebmer, Department of Economics, Johannes Kepler University

More information

Density Estimation. Seungjin Choi

Density Estimation. Seungjin Choi Density Estimation Seungjin Choi Department of Computer Science and Engineering Pohang University of Science and Technology 77 Cheongam-ro, Nam-gu, Pohang 37673, Korea seungjin@postech.ac.kr http://mlg.postech.ac.kr/

More information

Ruin, Operational Risk and How Fast Stochastic Processes Mix

Ruin, Operational Risk and How Fast Stochastic Processes Mix Ruin, Operational Risk and How Fast Stochastic Processes Mix Paul Embrechts ETH Zürich Based on joint work with: - Roger Kaufmann (ETH Zürich) - Gennady Samorodnitsky (Cornell University) Basel Committee

More information

Paul Karapanagiotidis ECO4060

Paul Karapanagiotidis ECO4060 Paul Karapanagiotidis ECO4060 The way forward 1) Motivate why Markov-Chain Monte Carlo (MCMC) is useful for econometric modeling 2) Introduce Markov-Chain Monte Carlo (MCMC) - Metropolis-Hastings (MH)

More information

Bayesian Modeling of Conditional Distributions

Bayesian Modeling of Conditional Distributions Bayesian Modeling of Conditional Distributions John Geweke University of Iowa Indiana University Department of Economics February 27, 2007 Outline Motivation Model description Methods of inference Earnings

More information

The Bias-Variance dilemma of the Monte Carlo. method. Technion - Israel Institute of Technology, Technion City, Haifa 32000, Israel

The Bias-Variance dilemma of the Monte Carlo. method. Technion - Israel Institute of Technology, Technion City, Haifa 32000, Israel The Bias-Variance dilemma of the Monte Carlo method Zlochin Mark 1 and Yoram Baram 1 Technion - Israel Institute of Technology, Technion City, Haifa 32000, Israel fzmark,baramg@cs.technion.ac.il Abstract.

More information

Modeling Real Estate Data using Quantile Regression

Modeling Real Estate Data using Quantile Regression Modeling Real Estate Data using Semiparametric Quantile Regression Department of Statistics University of Innsbruck September 9th, 2011 Overview 1 Application: 2 3 4 Hedonic regression data for house prices

More information

Random Effects Models for Network Data

Random Effects Models for Network Data Random Effects Models for Network Data Peter D. Hoff 1 Working Paper no. 28 Center for Statistics and the Social Sciences University of Washington Seattle, WA 98195-4320 January 14, 2003 1 Department of

More information

Katsuhiro Sugita Faculty of Law and Letters, University of the Ryukyus. Abstract

Katsuhiro Sugita Faculty of Law and Letters, University of the Ryukyus. Abstract Bayesian analysis of a vector autoregressive model with multiple structural breaks Katsuhiro Sugita Faculty of Law and Letters, University of the Ryukyus Abstract This paper develops a Bayesian approach

More information