Deep Learning in Asset Pricing
|
|
- Junior Morris
- 5 years ago
- Views:
Transcription
1 Deep Learning in Asset Pricing Luyang Chen 1 Markus Pelger 1 Jason Zhu 1 1 Stanford University November 17th 2018 Western Mathematical Finance Conference 2018
2 Motivation Hype: Machine Learning in Investment Same reporter 3 weeks later Efficient markets: Asset returns dominated by unforecastable news Financial return data has very low signal-to noise ratio This paper: Including financial constraints (no-arbitrage) in learning algorithm significantly improves signal 1
3 Motivation Motivation: Asset Pricing The Challenge of Asset Pricing One of the most important questions in finance: Why are asset prices different for different assets? No-Arbitrage Pricing Theory: Stochastic discount factor SDF (also called pricing kernel or equivalent martingale measure) explains differences in risk and asset prices Fundamental question: What is the SDF? Challenges SDF should depend on all available economic information: Very large set of variables Functional form of SDF unknown and likely complex SDF needs to capture time-variation in economic conditions Risk premium in stock returns has a low signal-to-noise ratio 2
4 Motivation This paper Goals of this paper: General non-linear asset pricing model and optimal portfolio design Deep-neural networks applied to all U.S. equity data and large sets of macroeconomic and firm-specific information. Why is it important? 1 Stochastic discount factor (SDF) generates tradeable portfolio with highest risk-adjusted return (Sharpe-ratio=expected excess return/standard deviation) 2 Arbitrage opportunities Find underpriced assets and earn alpha 3 Risk management Understand which information and how it drives the SDF Manage risk exposure of financial assets 3
5 Motivation Contribution of this paper Contribution This Paper: Estimate the SDF with deep neural networks Crucial innovation: Include no-arbitrage condition in the neural network algorithm and combine four neural networks in a novel way Key elements of estimator: 1 Non-linearity: Feed-forward network captures non-linearities 2 Time-variation: Recurrent (LSTM) network finds a small set of economic state processes 3 Pricing all assets: Generative adversarial network identifies the states and portfolios with most unexplained pricing information 4 Dimension reduction: Regularization through no-arbitrage condition 5 Signal-to-noise ratio: No-arbitrage conditions increase the signal to noise-ratio General model that includes all existing models as a special case 4
6 Motivation Contribution of this paper Empirical Contributions Empirically outperforms all benchmark models. Optimal portfolio has out-of-sample annual Sharpe ratio of 2.1. Non-linearities and interaction between firm information matters. Most relevant firm characteristics are price trends, profitability, and capital structure variables. 5
7 Motivation Literature (partial list) Deep-learning for predicting asset prices Gu, Kelly and Xiu (2018) Feng, Polson and Xu (2018) Messmer (2017) Predicting future asset returns with feed forward network Linear or kernel methods for asset pricing of large data sets Lettau and Pelger (2018): Risk-premium PCA Feng, Giglio and Xu (2017): Risk-premium lasso Freyberger, Neuhierl and Weber (2017): Group lasso Kelly, Pruitt and Su (2018): Instrumented PCA 6
8 Model The Model No-arbitrage pricing Ri,t+1 e = excess return (return minus risk-free rate) at time t + 1 for asset i = 1,..., N Fundamental no-arbitrage condition: for all t = 1,..., T and i = 1,..., N E t [M t+1 R e i,t+1] = 0 E t [.] expected value conditioned on information set at time t M t+1 stochastic discount factor SDF at time t + 1. Conditional moments imply infinitely many unconditional moments for any F t -measurable variable I t E[M t+1 R e t+1,ii t ] = 0 7
9 Model The Model No-arbitrage pricing Without loss of generality SDF is projection on the return space M t+1 = 1 + N w i,t Ri,t+1 e i=1 Optimal portfolio N Sharpe-ratio i=1 w i,tr e i,t+1 has highest conditional Portfolio weights w i,t are a general function of macro-economic information I t and firm-specific characteristics I i,t : w i,t = w(i t, I i,t ), Need non-linear estimator with many explanatory variables! Use a feed forward network to estimate w i,t 8
10 Model The Model Equivalent factor model representation No-arbitrage condition is equivalent to with factor F t = M t 1 E t [R e i,t+1] = cov t(r e i,t+1, F t+1) var t (F t+1 ) = β i,t µ F t E t [F t+1 ] Without loss of generality we have a factor representation R e t+1 = β t F t+1 + ɛ t+1 9
11 Model The Model Objects of Interest We use different approaches to estimate: The SDF factor F t The risk loadings β t The unexplained residual ê t = ( I N β t 1 (β t 1 β t 1) 1 β t 1) R e t Asset Pricing Performance Measure Sharpe ratio of SDF factor SR = E[F ] Var(F ) Unexplained variation 1 T T t=1 1 N N t i=1 têi,t 2 Average pricing errors 1 ( N ) 2 1 T N i=1 T i t=1 iê i,t also time weighted 1 ( N N i=1 T i T 1 T i T t=1 iê i,t ) 2 10
12 Estimation Loss Function Objective Function for Estimation Estimate SDF portfolio weights w(.) to minimize the no-arbitrage moment conditions For a set of conditioning variables Î t the loss function is L(Ît) = 1 N N T ( i 1 T i 2. M t+1 R e T T i,t+1ît) i i=1 t=1 Allows unbalanced panel. How can we choose the conditioning variables Ît = f (I t, I i,t ) as general functions of the macroeconomic and firm-specific information? Generative Adversarial Network (GAN) chooses Ît! 11
13 Estimation Generative Adversarial Network (GAN) Determining Moment Conditions Two networks play zero-sum game: 1 one network creates the SDF M t+1 2 other network creates the conditioning variables Î t Iteratively update the two networks: 1 for a given Ît the SDF network minimizes the loss 2 for a given SDF the conditional networks finds Ît with the largest loss (most mispricing) Intuition: find the economic states and assets with the most pricing information 12
14 Estimation Recurrent Neural Network (RNN) Transforming Macroeconomic Time-Series Problems with economic time-series data Time-series data is often non-stationary transformation necessary Asset prices depend on economic states simple differencing of non-stationary data not sufficient Solution: Recurrent Neural Network (RNN) with Long-Short-Term Memory (LSTM) cells Transform all macroeconomic time-series into a low dimensional vector of stationary state variables 13
15 Estimation Example: Non-stationary Macroeconomic Variables Macroeconomic Variables log RPI log S&P : Log RPI : Log S&P500 14
16 Estimation Macroeconomic state processes Figure: Macroeconomic state processes (LSTM Outputs) based on 178 macroeconomic time-series. 15
17 Estimation Neural Networks Model Architecture SDF Network: Update parameters to minimize loss State RNN '! $ Feed Forward Network ) ",$ Construct SDF! %,,! $! ",$ * $+% Moment RNN /! $ Conditional Network: Update parameters to maximize loss Feed Forward Network (! $ Loss Calculation -, $+%. Iterative Optimizer with GAN 16
18 Data Data Data 50 years of monthly observations: 01/ /2016. Monthly stock returns for all U.S. securities from CRSP (around 31,000 stocks) 46 firm-specific characteristics for each stock and every month (usual suspects) I i,t normalized to cross-sectional quantiles 178 macroeconomic variables (124 from FRED, 46 cross-sectional median time-series for characteristics, 8 from Goyal-Welch) I t T-bill rates from Kenneth-French website Training/validation/test split is 20y/5y/25y 17
19 Data Benchmark models Benchmark models 1 Linear factor models (CAPM, Fama-French 5 factors) 2 Instrumented PCA (Kelly et al. (2018): estimate SDF as linear function of characteristics: w i,t = θ I i,t 3 Deep learning return forecasting (Gu et al. (2018)): Predict conditional expected returns E t [R i,t+1 ] Empirical loss function for prediction 1 NT N i=1 t=1 T (R i,t+1 g(i t, I i,t )) 2 Use only simple feedforward network for forecasting Conditional means proportional to β t obtain SDF factor and residual through projections on conditional means 18
20 Results Results - Sharpe Ratio Sharpe Ratios of Benchmark Models Model SR (Train) SR (Valid) SR (Test) FF FF IPCA RtnFcst
21 Results Results - Sharpe Ratio Table: Performances of our approach sorted by validation Sharpe ratio SR SR SR SMV CSMV HL CHL HU CHU LR (Train) (Valid) (Test) (10 4 ) Optimal model: 4 moments, 4 macro states, 4 layers, 64 hidden units, 32 moments 20
22 Results Optimal Portfolio Performance 400 IPCA RtnFcst SDF Cumulated Excess Return Figure: Cumulated Normalized SDF Portfolio. 21
23 Results Results - Sharpe Ratio for Forecasting Approach Performances with Return Forecast Approach Macro Neurons SR (Train) SR (Valid) SR (Test) Y [32, 16, 8] Y [128, 128] N [32, 16, 8] N [128, 128]
24 Results IPCA: Number of Factors Sharpe Ratio train valid test K Figure: Sharpe ratio as a function of the number of factors for IPCA 23
25 Results Results - Asset Pricing Performance Asset Pricing Performance Unexplained Variation Train Valid Test Forecasting Ensemble GAN Fama-McBeth Type Alpha (10 3 ) Train Valid Test Forecasting Ensemble GAN Fama-McBeth Type Alpha (Weighted, 10 4 ) Forecasting Ensemble GAN
26 Results Results - Sharpe Ratio Performance of Benchmark Models Table: SDF Portfolio vs. Fama-French 5 Factors Mkt-RF SMB HML RMW CMA intercept coefficient correlation Conventional factors do no span SDF 25
27 Results Results - Risk of Optimal Portfolios Max One Month Loss and Max Drawdown Max 1 Month Loss Max Drawdown IPCA RtnFcst Ensemble GAN Ensemble GAN less risky 26
28 Results Results - Variable Importance Variables Ranked by Average Absolute Gradient for SDF network ST_REV r12_2 SUV D2P NOA D2A CF2P r12_7 RNA Resid_Var MktBeta BEME OA E2P Investment AT Variance PM NI Q LTurnover LT_Rev Beta SGA2S LME AC Spread PROF IdioVol DPI2A r2_1 ATO S2P FC2Y OL Lev ROE C A2ME PCM ROA CTO CF Rel2High OP r36_
29 Results Results - Variable Importance Variables Ranked by Reduction in R 2 for RtnFcst ST_REV SUV r12_2 LME r2_1 IdioVol Beta CF2P Resid_Var Rel2High FC2Y r12_7 E2P PM Spread ROA AT SGA2S LTurnover OL OP Variance DPI2A A2ME LT_Rev NI S2P NOA PCM Investment MktBeta CF PROF CTO ROE Q ATO D2P BEME r36_13 D2A AC Lev OA C RNA
30 Results Results - Variable Importance Variables Ranked by Reduction in R 2 for IPCA ST_REV SUV Beta r36_13 r2_1 PCM r12_2 D2P C LME AT NI PM D2A MktBeta BEME CF Rel2High Lev OL CTO LT_Rev CF2P NOA ROA ROE E2P DPI2A ATO Investment Variance RNA SGA2S Q LTurnover OP Resid_Var FC2Y r12_7 OA A2ME AC S2P Spread PROF IdioVol
31 Results Size Effect weight LME Figure: SDF weight and market capitalization in test data 30
32 Non-linearities Results - SDF Weights Relationship between Weights and Characteristics weight weight LME BEME Figure: Weight as a function of size (LME) and book-to-market (BEME). Value have close to linear effect but size non-linear 31
33 Non-linearities Results - SDF Weights Relationship between Weights and Characteristics weight weight Investment LME Figure: Weight as a function of investment or profitability Non-linear effect! 32
34 Non-linearities Results - SDF Weights Relationship between Weights and Characteristics BEME weight LME Figure: Weight as a function of size (LME) and book-to-market (BEME). Size and value have non-linear interaction! 33
35 Non-linearities Results - SDF Weights Relationship between Weights and Characteristics ST_REV weight LME BEME Figure: Weight as a function of size, book-to-market and ST-reversal. Complex interaction between multiple variables! 34
36 Non-linearities Results - IPCA Weights Relationship between Weights and Characteristics BEME weight LME Figure: Weight as a function of size (LME) and book-to-market (BEME). Size and value can only be modeled linearly! 35
37 Non-linearities Results - Return Forecasting Weights Relationship between Weights and Characteristics BEME conditional return LME Figure: Weight as a function of size (LME) and book-to-market (BEME). Size and value only loads on extreme values 36
38 Simulation Simulation Setup Consider a single factor model R i,t+1 = β i,t F t+1 + ε i,t+1 The only factor is sampled from N (µ F, σf 2 ). The loadings are β i,t = C i,t with C i,t i.i.d N (0, 1). The residuals are i.i.d N (0, 1). N = 500 and T = 600. Define training/validation/test = 250, 100, 250. Consider σf 2 {0.01, 0.05, 0.1}. Sharpe Ratio of the factor SR = µ F /σ F = 0.3 or SR = 1. 37
39 Simulation Simulation Results: Intuition Intuition: Better noise diversification with our approach Simple return prediction SDF estimator 1 N 1 TN N N ( 1 T i=1 i=1 t=1 N (Ri,t+1 e f (I t)) 2 T t=1 ) 2 Ri,t+1M e t+1g(c i,t ) SDF estimator averages out the noise over the time-series 38
40 Simulation Simulation Results Sharpe Ratio on Test Dataset σ 2 F RtnFcst SDF estimator SR= SR=
41 Simulation Simulation Results Estimated loadings and SDF weights : Our SDF estimator : Return forecasting Our approach detects SDF and loading structure. Simple forecasting approach fails. 40
42 Conclusion Conclusion Methodology Novel combination of deep-neural networks to estimate the pricing kernel Key innovation: Use no-arbitrage condition as criterion function Time-variation explained by macroeconomic states and firm characteristics General asset pricing model that includes all other models as special cases Empirical Results Outperforms benchmark models Non-linearities and interactions are important 41
43 IPCA: Number of Factors Table: Performance with IPCA Number of Factors SR (Train) SR (Valid) SR (Test) A 1
44 Hyper-Parameter Search Search Space CV number of conditional variables: 4, 8, 16, 32 SMV number of macroeconomic state variables: 4, 8, 16, 32 HL number of fully-connected layers: 2, 3, 4 HU number of hidden units in fully-connected layers: 32, 64, 128 D dropout rate (keep probability): 0.9, 0.95 LR learning rate: 0.001, , , Choose best configuration of all possible combinations (1152) of hyper-parameters on validation set. Use ReLU activation function ReLU(x) i = max(x i, 0). A 2
45 Economic Significance of Variables Sensitivity We define the sensitivity of a particular variable as the magnitude of the derivative of weight w with respect to this variable (averaged over the data): Sensitivity(x j ) = 1 C N w(ĩ t, I i,t ) (1) x j i=1 t with C a normalization constant. The analysis is performed with the feed-forward network and we only consider the sensitivity of firm characteristics and state macro variables. A sensitivity of value z for a given variable means that the weight w will approximately change (in magnitude) by z if that variable is changed by a small amount. A 3
46 Interactions between Variables Significance of Interactions We might also want to understand how the output simultaneously depends upon multiple variables. We can measure the economic significance of the interaction between variables x i and x j by the derivative: Sensitivity(x i, x j ) = 1 C N 2 w(ĩt, I i,t ) (2) x i x j i=1 This derivative can be generalized to measure higher-order interactions. t A 4
47 Finite Difference Schemes First-Order and Second-Order Finite Difference Schemes Suppose we have some multivariate function f. Without actually measuring gradients, we can approximate them with finite difference methods f f (x j + ) f (x j ) x j 2 f f (x i +, x j + ) f (x i +, x j ) f (x i, x j + ) + f (x i, x j ) x i x j 2 (4) (3) A 5
48 Feed Forward Network Figure: Feed Forward Network with Dropout A 6
49 Feed Forward Network Network Structure The input layer accepts the raw predictors (or features). h 0 = x i,t = [I i,t, Ĩ t] (5) Each hidden layer takes the output from the previous layer and transforms it into an output as h k = f (h k 1 W k + b k ) k = 1,..., K (6) In our implementation, we use ReLU activation function. ReLU(x) i = max(x i, 0) (7) The output layer is simply a linear transformation of the output from the last hidden layer to a scaler w i,t = h K W K+1 + b K+1 (8) A 7
50 Feed Forward Network Model Complexity Number of hidden layers: K. Let s denote p k to be the number of neurons (or hidden units) in the layer k. The parameters in the layer k are W k R p k 1 p k and b k R p k (9) with p 0 = dim(i i,t ) + dim(ĩt) and p K+1 = 1. Number of parameters: K+1 k=1 (p k 1 + 1)p k. e.g. A 4-hidden-layer network with hidden units [128, 128, 64, 64] has parameters. A 8
51 State RNN Two Reasons for RNN Instead of directly passing macro variables I t as features to the feed forward network, we apply a nonlinear transformation to them with an RNN. Many macro variables themselves are not stationary and have trends. Necessary transformations of I t are essential in generating a statble model. Using RNN allows us to encode all historical information of the macro economy. Intuitively, RNN summarizes all historical macro information into a low dimensional vector of state variables in a data-driven way. A 9
52 Transform Macro Variables via RNN Properties of RNN For any F t -measurable sequence I t, the output sequence Ĩ t is again F t -measurable. The transformation creates no look-ahead bias. Ĩ t contains all the macro information in the past, while I t only uses current information. RNN helps create a stationary macro inputs for the feed forward network. A 10
53 Recurrent Network Recurrent Network with RNN Cell Recurrent Network with LSTM Cell A 11
54 Recurrent Network RNN Cell RNN Cell Structure A vanilla RNN model takes the current input variable x t = I t and the previous hidden state h t 1 and performs a nonlinear transformation to get the current hidden state h t h t = f (h t 1 W h + x t W x ). (10) A 12
55 Recurrent Network LSTM Cell A 13
56 Recurrent Network LSTM Cell Structure An LSTM model creates a new memory cell c t with current input x t and previous hidden state h t 1: c t = tanh(h t 1W (c) h + x tw (c) x ). (11) An input gate i t and a forget gate f t are created to control the final memory cell: i t = σ(h t 1W (i) h + x tw x (i) ) (12) f t = σ(h t 1W (f ) h + x tw x (f ) ) (13) c t = f t c t 1 + i t c t. (14) Finally, an output gate o t is used to control the amount of information stored in the hidden state: o t = σ(h t 1W (o) h + x tw x (o) ) (15) h t = o t tanh(c t). (16) A 14
Regression: Ordinary Least Squares
Regression: Ordinary Least Squares Mark Hendricks Autumn 2017 FINM Intro: Regression Outline Regression OLS Mathematics Linear Projection Hendricks, Autumn 2017 FINM Intro: Regression: Lecture 2/32 Regression
More informationFactor Investing using Penalized Principal Components
Factor Investing using Penalized Principal Components Markus Pelger Martin Lettau 2 Stanford University 2 UC Berkeley February 8th 28 AI in Fintech Forum 28 Motivation Motivation: Asset Pricing with Risk
More informationInference on Risk Premia in the Presence of Omitted Factors
Inference on Risk Premia in the Presence of Omitted Factors Stefano Giglio Dacheng Xiu Booth School of Business, University of Chicago Center for Financial and Risk Analytics Stanford University May 19,
More informationR = µ + Bf Arbitrage Pricing Model, APM
4.2 Arbitrage Pricing Model, APM Empirical evidence indicates that the CAPM beta does not completely explain the cross section of expected asset returns. This suggests that additional factors may be required.
More informationFactors that Fit the Time Series and Cross-Section of Stock Returns
Factors that Fit the Time Series and Cross-Section of Stock Returns Martin Lettau Markus Pelger UC Berkeley Stanford University November 3th 8 NBER Asset Pricing Meeting Motivation Motivation Fundamental
More informationRoss (1976) introduced the Arbitrage Pricing Theory (APT) as an alternative to the CAPM.
4.2 Arbitrage Pricing Model, APM Empirical evidence indicates that the CAPM beta does not completely explain the cross section of expected asset returns. This suggests that additional factors may be required.
More informationMonitoring Forecasting Performance
Monitoring Forecasting Performance Identifying when and why return prediction models work Allan Timmermann and Yinchu Zhu University of California, San Diego June 21, 2015 Outline Testing for time-varying
More informationFactor Models for Asset Returns. Prof. Daniel P. Palomar
Factor Models for Asset Returns Prof. Daniel P. Palomar The Hong Kong University of Science and Technology (HKUST) MAFS6010R- Portfolio Optimization with R MSc in Financial Mathematics Fall 2018-19, HKUST,
More informationLinear Factor Models and the Estimation of Expected Returns
Linear Factor Models and the Estimation of Expected Returns Cisil Sarisoy a,, Peter de Goeij b, Bas J.M. Werker c a Department of Finance, CentER, Tilburg University b Department of Finance, Tilburg University
More informationEquity risk factors and the Intertemporal CAPM
Equity risk factors and the Intertemporal CAPM Ilan Cooper 1 Paulo Maio 2 1 Norwegian Business School (BI) 2 Hanken School of Economics BEROC Conference, Minsk Outline 1 Motivation 2 Cross-sectional tests
More informationFinancial Risk and Returns Prediction with Modular Networked Learning
arxiv:1806.05876v1 [cs.lg] 15 Jun 2018 Financial Risk and Returns Prediction with Modular Networked Learning Carlos Pedro Gonçalves June 18, 2018 University of Lisbon, Instituto Superior de Ciências Sociais
More informationFinancial Econometrics Lecture 6: Testing the CAPM model
Financial Econometrics Lecture 6: Testing the CAPM model Richard G. Pierse 1 Introduction The capital asset pricing model has some strong implications which are testable. The restrictions that can be tested
More informationMultivariate GARCH models.
Multivariate GARCH models. Financial market volatility moves together over time across assets and markets. Recognizing this commonality through a multivariate modeling framework leads to obvious gains
More informationASSET PRICING MODELS
ASSE PRICING MODELS [1] CAPM (1) Some notation: R it = (gross) return on asset i at time t. R mt = (gross) return on the market portfolio at time t. R ft = return on risk-free asset at time t. X it = R
More information17 Factor Models and Principal Components
17 Factor Models and Principal Components 17.1 Dimension Reduction High-dimensional data can be challenging to analyze. They are difficult to visualize, need extensive computer resources, and often require
More informationNetwork Connectivity and Systematic Risk
Network Connectivity and Systematic Risk Monica Billio 1 Massimiliano Caporin 2 Roberto Panzica 3 Loriana Pelizzon 1,3 1 University Ca Foscari Venezia (Italy) 2 University of Padova (Italy) 3 Goethe University
More informationEcon671 Factor Models: Principal Components
Econ671 Factor Models: Principal Components Jun YU April 8, 2016 Jun YU () Econ671 Factor Models: Principal Components April 8, 2016 1 / 59 Factor Models: Principal Components Learning Objectives 1. Show
More informationIdentifying Financial Risk Factors
Identifying Financial Risk Factors with a Low-Rank Sparse Decomposition Lisa Goldberg Alex Shkolnik Berkeley Columbia Meeting in Engineering and Statistics 24 March 2016 Outline 1 A Brief History of Factor
More informationOn Generalized Arbitrage Pricing Theory Analysis: Empirical Investigation of the Macroeconomics Modulated Independent State-Space Model
On Generalized Arbitrage Pricing Theory Analysis: Empirical Investigation of the Macroeconomics Modulated Independent State-Space Model Kai-Chun Chiu and Lei Xu Department of Computer Science and Engineering,
More informationErrata for Campbell, Financial Decisions and Markets, 01/02/2019.
Errata for Campbell, Financial Decisions and Markets, 01/02/2019. Page xi, section title for Section 11.4.3 should be Endogenous Margin Requirements. Page 20, equation 1.49), expectations operator E should
More informationLinear Factor Models and the Estimation of Expected Returns
Linear Factor Models and the Estimation of Expected Returns Cisil Sarisoy, Peter de Goeij and Bas Werker DP 03/2016-020 Linear Factor Models and the Estimation of Expected Returns Cisil Sarisoy a,, Peter
More informationMiloš Kopa. Decision problems with stochastic dominance constraints
Decision problems with stochastic dominance constraints Motivation Portfolio selection model Mean risk models max λ Λ m(λ r) νr(λ r) or min λ Λ r(λ r) s.t. m(λ r) µ r is a random vector of assets returns
More informationFinancial Econometrics Return Predictability
Financial Econometrics Return Predictability Eric Zivot March 30, 2011 Lecture Outline Market Efficiency The Forms of the Random Walk Hypothesis Testing the Random Walk Hypothesis Reading FMUND, chapter
More informationIntroduction to Algorithmic Trading Strategies Lecture 3
Introduction to Algorithmic Trading Strategies Lecture 3 Pairs Trading by Cointegration Haksun Li haksun.li@numericalmethod.com www.numericalmethod.com Outline Distance method Cointegration Stationarity
More informationCross-Sectional Regression after Factor Analysis: Two Applications
al Regression after Factor Analysis: Two Applications Joint work with Jingshu, Trevor, Art; Yang Song (GSB) May 7, 2016 Overview 1 2 3 4 1 / 27 Outline 1 2 3 4 2 / 27 Data matrix Y R n p Panel data. Transposable
More informationComplex Systems Workshop Lecture III: Behavioral Asset Pricing Model with Heterogeneous Beliefs
Complex Systems Workshop Lecture III: Behavioral Asset Pricing Model with Heterogeneous Beliefs Cars Hommes CeNDEF, UvA CEF 2013, July 9, Vancouver Cars Hommes (CeNDEF, UvA) Complex Systems CEF 2013, Vancouver
More informationModel Comparison with Sharpe Ratios
Model Comparison with Sharpe Ratios Francisco Barillas, Raymond Kan, Cesare Robotti, and Jay Shanken Barillas is from Emory University. Kan is from the University of Toronto. Robotti is from the University
More informationDeep learning / Ian Goodfellow, Yoshua Bengio and Aaron Courville. - Cambridge, MA ; London, Spis treści
Deep learning / Ian Goodfellow, Yoshua Bengio and Aaron Courville. - Cambridge, MA ; London, 2017 Spis treści Website Acknowledgments Notation xiii xv xix 1 Introduction 1 1.1 Who Should Read This Book?
More informationUniversity of Pretoria Department of Economics Working Paper Series
University of Pretoria Department of Economics Working Paper Series Predicting Stock Returns and Volatility Using Consumption-Aggregate Wealth Ratios: A Nonlinear Approach Stelios Bekiros IPAG Business
More informationOn the Directional Predictability of Eurostoxx 50
On the Directional Predictability of Eurostoxx 50 Democritus University of Thrace Department of Economics Pragidis I., Plakandaras, V., Karapistoli E. Motivation Eurostoxx is an under-investigated index/concept
More informationInformation Choice in Macroeconomics and Finance.
Information Choice in Macroeconomics and Finance. Laura Veldkamp New York University, Stern School of Business, CEPR and NBER Spring 2009 1 Veldkamp What information consumes is rather obvious: It consumes
More informationModel Comparison with Sharpe Ratios
Model Comparison with Sharpe Ratios Abstract We show how to conduct asymptotically valid tests of model comparison when the extent of model mispricing is gauged by the squared Sharpe ratio improvement
More informationFunctional Coefficient Models for Nonstationary Time Series Data
Functional Coefficient Models for Nonstationary Time Series Data Zongwu Cai Department of Mathematics & Statistics and Department of Economics, University of North Carolina at Charlotte, USA Wang Yanan
More informationEstimating Latent Asset-Pricing Factors
Estimating Latent Asset-Pricing Factors Martin Lettau Haas School of Business, University of California at Berkeley, Berkeley, CA 947, lettau@berkeley.edu,nber,cepr Markus Pelger Department of Management
More informationFinancial Econometrics Short Course Lecture 3 Multifactor Pricing Model
Financial Econometrics Short Course Lecture 3 Multifactor Pricing Model Oliver Linton obl20@cam.ac.uk Renmin University Financial Econometrics Short Course Lecture 3 MultifactorRenmin Pricing University
More informationEcon 423 Lecture Notes: Additional Topics in Time Series 1
Econ 423 Lecture Notes: Additional Topics in Time Series 1 John C. Chao April 25, 2017 1 These notes are based in large part on Chapter 16 of Stock and Watson (2011). They are for instructional purposes
More informationProbabilities & Statistics Revision
Probabilities & Statistics Revision Christopher Ting Christopher Ting http://www.mysmu.edu/faculty/christophert/ : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 January 6, 2017 Christopher Ting QF
More informationLinear Factor Models and the Estimation of Expected Returns
Linear Factor Models and the Estimation of Expected Returns Abstract Standard factor models in asset pricing imply a linear relationship between expected returns on assets and their exposures to one or
More informationMFE Financial Econometrics 2018 Final Exam Model Solutions
MFE Financial Econometrics 2018 Final Exam Model Solutions Tuesday 12 th March, 2019 1. If (X, ε) N (0, I 2 ) what is the distribution of Y = µ + β X + ε? Y N ( µ, β 2 + 1 ) 2. What is the Cramer-Rao lower
More informationForecasting demand in the National Electricity Market. October 2017
Forecasting demand in the National Electricity Market October 2017 Agenda Trends in the National Electricity Market A review of AEMO s forecasting methods Long short-term memory (LSTM) neural networks
More informationEmpirical Methods in Finance Section 2 - The CAPM Model
Empirical Methods in Finance Section 2 - The CAPM Model René Garcia Professor of finance EDHEC Nice, March 2010 Estimation and Testing The CAPM model is a one-period model E [Z i ] = β im E [Z m] with
More informationLecture Notes in Empirical Finance (PhD): Linear Factor Models
Contents Lecture Notes in Empirical Finance (PhD): Linear Factor Models Paul Söderlind 8 June 26 University of St Gallen Address: s/bf-hsg, Rosenbergstrasse 52, CH-9 St Gallen, Switzerland E-mail: PaulSoderlind@unisgch
More informationForecasting the term structure interest rate of government bond yields
Forecasting the term structure interest rate of government bond yields Bachelor Thesis Econometrics & Operational Research Joost van Esch (419617) Erasmus School of Economics, Erasmus University Rotterdam
More informationMonetary policy at the zero lower bound: Theory
Monetary policy at the zero lower bound: Theory A. Theoretical channels 1. Conditions for complete neutrality (Eggertsson and Woodford, 2003) 2. Market frictions 3. Preferred habitat and risk-bearing (Hamilton
More informationEstimating Latent Asset-Pricing Factors
Estimating Latent Asset-Pricing Factors Martin Lettau Haas School of Business, University of California at Berkeley, Berkeley, CA 947, lettau@berkeley.edu,nber,cepr Markus Pelger Department of Management
More informationZHAW Zurich University of Applied Sciences. Bachelor s Thesis Estimating Multi-Beta Pricing Models With or Without an Intercept:
ZHAW Zurich University of Applied Sciences School of Management and Law Bachelor s Thesis Estimating Multi-Beta Pricing Models With or Without an Intercept: Further Results from Simulations Submitted by:
More informationCSC321 Lecture 15: Exploding and Vanishing Gradients
CSC321 Lecture 15: Exploding and Vanishing Gradients Roger Grosse Roger Grosse CSC321 Lecture 15: Exploding and Vanishing Gradients 1 / 23 Overview Yesterday, we saw how to compute the gradient descent
More informationSpecification Test Based on the Hansen-Jagannathan Distance with Good Small Sample Properties
Specification Test Based on the Hansen-Jagannathan Distance with Good Small Sample Properties Yu Ren Department of Economics Queen s University Katsumi Shimotsu Department of Economics Queen s University
More informationFinancial Econometrics
Financial Econometrics Nonlinear time series analysis Gerald P. Dwyer Trinity College, Dublin January 2016 Outline 1 Nonlinearity Does nonlinearity matter? Nonlinear models Tests for nonlinearity Forecasting
More informationThe Conditional Pricing of Systematic and Idiosyncratic Risk in the UK Equity Market. Niall O Sullivan b Francesco Rossi c. This version: July 2014
The Conditional Pricing of Systematic and Idiosyncratic Risk in the UK Equity Market John Cotter a Niall O Sullivan b Francesco Rossi c This version: July 2014 Abstract We test whether firm idiosyncratic
More informationModelling Time Series with Neural Networks. Volker Tresp Summer 2017
Modelling Time Series with Neural Networks Volker Tresp Summer 2017 1 Modelling of Time Series The next figure shows a time series (DAX) Other interesting time-series: energy prize, energy consumption,
More informationUniform Inference for Conditional Factor Models with Instrumental and Idiosyncratic Betas
Uniform Inference for Conditional Factor Models with Instrumental and Idiosyncratic Betas Yuan Xiye Yang Rutgers University Dec 27 Greater NY Econometrics Overview main results Introduction Consider a
More informationLinear Regression. Aarti Singh. Machine Learning / Sept 27, 2010
Linear Regression Aarti Singh Machine Learning 10-701/15-781 Sept 27, 2010 Discrete to Continuous Labels Classification Sports Science News Anemic cell Healthy cell Regression X = Document Y = Topic X
More informationTime Series Models for Measuring Market Risk
Time Series Models for Measuring Market Risk José Miguel Hernández Lobato Universidad Autónoma de Madrid, Computer Science Department June 28, 2007 1/ 32 Outline 1 Introduction 2 Competitive and collaborative
More informationDynamic Matrix-Variate Graphical Models A Synopsis 1
Proc. Valencia / ISBA 8th World Meeting on Bayesian Statistics Benidorm (Alicante, Spain), June 1st 6th, 2006 Dynamic Matrix-Variate Graphical Models A Synopsis 1 Carlos M. Carvalho & Mike West ISDS, Duke
More informationEmpirical properties of large covariance matrices in finance
Empirical properties of large covariance matrices in finance Ex: RiskMetrics Group, Geneva Since 2010: Swissquote, Gland December 2009 Covariance and large random matrices Many problems in finance require
More informationThe Slow Convergence of OLS Estimators of α, β and Portfolio. β and Portfolio Weights under Long Memory Stochastic Volatility
The Slow Convergence of OLS Estimators of α, β and Portfolio Weights under Long Memory Stochastic Volatility New York University Stern School of Business June 21, 2018 Introduction Bivariate long memory
More informationBond Risk Premia with Machine Learning
Bond Risk Premia with Machine Learning Daniele Bianchi Matthias Büchner Andrea Tamoni First draft: December 2017. This draft: September 29, 2018 Abstract We propose, compare, and evaluate a variety of
More informationDeep Reinforcement Learning SISL. Jeremy Morton (jmorton2) November 7, Stanford Intelligent Systems Laboratory
Deep Reinforcement Learning Jeremy Morton (jmorton2) November 7, 2016 SISL Stanford Intelligent Systems Laboratory Overview 2 1 Motivation 2 Neural Networks 3 Deep Reinforcement Learning 4 Deep Learning
More informationVolatility. Gerald P. Dwyer. February Clemson University
Volatility Gerald P. Dwyer Clemson University February 2016 Outline 1 Volatility Characteristics of Time Series Heteroskedasticity Simpler Estimation Strategies Exponentially Weighted Moving Average Use
More informationTTIC 31230, Fundamentals of Deep Learning David McAllester, April Vanishing and Exploding Gradients. ReLUs. Xavier Initialization
TTIC 31230, Fundamentals of Deep Learning David McAllester, April 2017 Vanishing and Exploding Gradients ReLUs Xavier Initialization Batch Normalization Highway Architectures: Resnets, LSTMs and GRUs Causes
More informationWarwick Business School Forecasting System. Summary. Ana Galvao, Anthony Garratt and James Mitchell November, 2014
Warwick Business School Forecasting System Summary Ana Galvao, Anthony Garratt and James Mitchell November, 21 The main objective of the Warwick Business School Forecasting System is to provide competitive
More informationTime-varying parameters: New test tailored to applications in finance and macroeconomics. Russell Davidson and Niels S. Grønborg
Time-varying parameters: New test tailored to applications in finance and macroeconomics Russell Davidson and Niels S. Grønborg CREATES Research Paper 2018-22 Department of Economics and Business Economics
More informationPortfolio Optimization Using a Block Structure for the Covariance Matrix
Journal of Business Finance & Accounting Journal of Business Finance & Accounting, 39(5) & (6), 806 843, June/July 202, 0306-686X doi: 0/j468-595720202279x Portfolio Optimization Using a Block Structure
More informationFORECASTING OF INFLATION IN BANGLADESH USING ANN MODEL
FORECASTING OF INFLATION IN BANGLADESH USING ANN MODEL Rumana Hossain Department of Physical Science School of Engineering and Computer Science Independent University, Bangladesh Shaukat Ahmed Department
More informationThe Instability of Correlations: Measurement and the Implications for Market Risk
The Instability of Correlations: Measurement and the Implications for Market Risk Prof. Massimo Guidolin 20254 Advanced Quantitative Methods for Asset Pricing and Structuring Winter/Spring 2018 Threshold
More informationECONOMICS 7200 MODERN TIME SERIES ANALYSIS Econometric Theory and Applications
ECONOMICS 7200 MODERN TIME SERIES ANALYSIS Econometric Theory and Applications Yongmiao Hong Department of Economics & Department of Statistical Sciences Cornell University Spring 2019 Time and uncertainty
More informationAn economic application of machine learning: Nowcasting Thai exports using global financial market data and time-lag lasso
An economic application of machine learning: Nowcasting Thai exports using global financial market data and time-lag lasso PIER Exchange Nov. 17, 2016 Thammarak Moenjak What is machine learning? Wikipedia
More informationLinear Programming: Chapter 1 Introduction
Linear Programming: Chapter 1 Introduction Robert J. Vanderbei September 16, 2010 Slides last edited on October 5, 2010 Operations Research and Financial Engineering Princeton University Princeton, NJ
More informationRECURRENT NETWORKS I. Philipp Krähenbühl
RECURRENT NETWORKS I Philipp Krähenbühl RECAP: CLASSIFICATION conv 1 conv 2 conv 3 conv 4 1 2 tu RECAP: SEGMENTATION conv 1 conv 2 conv 3 conv 4 RECAP: DETECTION conv 1 conv 2 conv 3 conv 4 RECAP: GENERATION
More informationEE-559 Deep learning LSTM and GRU
EE-559 Deep learning 11.2. LSTM and GRU François Fleuret https://fleuret.org/ee559/ Mon Feb 18 13:33:24 UTC 2019 ÉCOLE POLYTECHNIQUE FÉDÉRALE DE LAUSANNE The Long-Short Term Memory unit (LSTM) by Hochreiter
More informationHomogeneity Pursuit. Jianqing Fan
Jianqing Fan Princeton University with Tracy Ke and Yichao Wu http://www.princeton.edu/ jqfan June 5, 2014 Get my own profile - Help Amazing Follow this author Grace Wahba 9 Followers Follow new articles
More informationSample of Ph.D. Advisory Exam For MathFinance
Sample of Ph.D. Advisory Exam For MathFinance Students who wish to enter the Ph.D. program of Mathematics of Finance are required to take the advisory exam. This exam consists of three major parts. The
More informationComparing Asset Pricing Models: Distance-based Metrics and Bayesian Interpretations. Zhongzhi (Lawrence) He * This version: March 2018.
Comparing Asset Pricing Models: Distance-based Metrics and Bayesian Interpretations Zhongzhi (Lawrence) He * This version: March 2018 Abstract In light of the power problems of statistical tests and undisciplined
More informationCheng Soon Ong & Christian Walder. Canberra February June 2018
Cheng Soon Ong & Christian Walder Research Group and College of Engineering and Computer Science Canberra February June 2018 Outlines Overview Introduction Linear Algebra Probability Linear Regression
More informationCS 229 Project Final Report: Reinforcement Learning for Neural Network Architecture Category : Theory & Reinforcement Learning
CS 229 Project Final Report: Reinforcement Learning for Neural Network Architecture Category : Theory & Reinforcement Learning Lei Lei Ruoxuan Xiong December 16, 2017 1 Introduction Deep Neural Network
More informationMachine Learning for Large-Scale Data Analysis and Decision Making A. Neural Networks Week #6
Machine Learning for Large-Scale Data Analysis and Decision Making 80-629-17A Neural Networks Week #6 Today Neural Networks A. Modeling B. Fitting C. Deep neural networks Today s material is (adapted)
More informationConditional time series forecasting with convolutional neural networks
Conditional time series forecasting with convolutional neural networks Anastasia Borovykh Sander Bohte Cornelis W. Oosterlee arxiv:1703.04691v4 [stat.ml] 8 Jun 2018 This version: June 11, 2018 Abstract
More informationBond Risk Premia with Machine Learning
Bond Risk Premia with Machine Learning Daniele Bianchi Matthias Büchner Andrea Tamoni First draft: December 2017. This draft: November 15, 2018 Abstract We propose, compare, and evaluate a variety of machine
More informationDynamic Financial Index Models: Modeling Conditional Dependencies via Graphs
Bayesian Analysis (211) 6, Number, pp. 639 66 Dynamic Financial Index Models: Modeling Conditional Dependencies via Graphs Hao Wang, Craig Reeson and Carlos M. Carvalho Abstract. We discuss the development
More informationORTHOGO ALIZED EQUITY RISK PREMIA SYSTEMATIC RISK DECOMPOSITIO. Rudolf F. Klein a,* and K. Victor Chow b,* Abstract
ORTHOGO ALIZED EQUITY RIS PREMIA A D SYSTEMATIC RIS DECOMPOSITIO Rudolf F. lein a,* and. Victor Chow b,* Abstract To solve the dependency problem between factors, in the context of linear multi-factor
More informationECON4515 Finance theory 1 Diderik Lund, 5 May Perold: The CAPM
Perold: The CAPM Perold starts with a historical background, the development of portfolio theory and the CAPM. Points out that until 1950 there was no theory to describe the equilibrium determination of
More information14 Week 5 Empirical methods notes
14 Week 5 Empirical methods notes 1. Motivation and overview 2. Time series regressions 3. Cross sectional regressions 4. Fama MacBeth 5. Testing factor models. 14.1 Motivation and Overview 1. Expected
More informationSpeculation and the Bond Market: An Empirical No-arbitrage Framework
Online Appendix to the paper Speculation and the Bond Market: An Empirical No-arbitrage Framework October 5, 2015 Part I: Maturity specific shocks in affine and equilibrium models This Appendix present
More informationA Wavelet Neural Network Forecasting Model Based On ARIMA
A Wavelet Neural Network Forecasting Model Based On ARIMA Wang Bin*, Hao Wen-ning, Chen Gang, He Deng-chao, Feng Bo PLA University of Science &Technology Nanjing 210007, China e-mail:lgdwangbin@163.com
More informationTime-varying coefficient estimation in SURE models. Application to portfolio management. Isabel Casas, Eva Ferreira and Susan Orbe
Time-varying coefficient estimation in SURE models. Application to portfolio management Isabel Casas, Eva Ferreira and Susan Orbe CREATES Research Paper 2017-33 Department of Economics and Business Economics
More informationLinear Programming: Chapter 1 Introduction
Linear Programming: Chapter 1 Introduction Robert J. Vanderbei October 17, 2007 Operations Research and Financial Engineering Princeton University Princeton, NJ 08544 http://www.princeton.edu/ rvdb Resource
More informationRecurrent neural networks
12-1: Recurrent neural networks Prof. J.C. Kao, UCLA Recurrent neural networks Motivation Network unrollwing Backpropagation through time Vanishing and exploding gradients LSTMs GRUs 12-2: Recurrent neural
More informationThe Bond Pricing Implications of Rating-Based Capital Requirements. Internet Appendix. This Version: December Abstract
The Bond Pricing Implications of Rating-Based Capital Requirements Internet Appendix This Version: December 2017 Abstract This Internet Appendix examines the robustness of our main results and presents
More informationNeural Networks with Applications to Vision and Language. Feedforward Networks. Marco Kuhlmann
Neural Networks with Applications to Vision and Language Feedforward Networks Marco Kuhlmann Feedforward networks Linear separability x 2 x 2 0 1 0 1 0 0 x 1 1 0 x 1 linearly separable not linearly separable
More informationNowcasting Norwegian GDP
Nowcasting Norwegian GDP Knut Are Aastveit and Tørres Trovik May 13, 2007 Introduction Motivation The last decades of advances in information technology has made it possible to access a huge amount of
More informationBeta Is Alive, Well and Healthy
Beta Is Alive, Well and Healthy Soosung Hwang * Cass Business School, UK Abstract In this study I suggest some evidence that the popular cross-sectional asset pricing test proposed by Black, Jensen, and
More informationClassification goals: Make 1 guess about the label (Top-1 error) Make 5 guesses about the label (Top-5 error) No Bounding Box
ImageNet Classification with Deep Convolutional Neural Networks Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton Motivation Classification goals: Make 1 guess about the label (Top-1 error) Make 5 guesses
More informationDetermining and Forecasting High-Frequency Value-at-Risk by Using Lévy Processes
Determining and Forecasting High-Frequency Value-at-Risk by Using Lévy Processes W ei Sun 1, Svetlozar Rachev 1,2, F rank J. F abozzi 3 1 Institute of Statistics and Mathematical Economics, University
More informationLecture 5: Unit Roots, Cointegration and Error Correction Models The Spurious Regression Problem
Lecture 5: Unit Roots, Cointegration and Error Correction Models The Spurious Regression Problem Prof. Massimo Guidolin 20192 Financial Econometrics Winter/Spring 2018 Overview Defining cointegration Vector
More informationA new test on the conditional capital asset pricing model
Appl. Math. J. Chinese Univ. 2015, 30(2): 163-186 A new test on the conditional capital asset pricing model LI Xia-fei 1 CAI Zong-wu 2,1 REN Yu 1, Abstract. Testing the validity of the conditional capital
More informationAn Introduction to Independent Components Analysis (ICA)
An Introduction to Independent Components Analysis (ICA) Anish R. Shah, CFA Northfield Information Services Anish@northinfo.com Newport Jun 6, 2008 1 Overview of Talk Review principal components Introduce
More informationPredicting bond returns using the output gap in expansions and recessions
Erasmus university Rotterdam Erasmus school of economics Bachelor Thesis Quantitative finance Predicting bond returns using the output gap in expansions and recessions Author: Martijn Eertman Studentnumber:
More information