Bayesian Optimization

Similar documents
MFM Practitioner Module: Risk & Asset Allocation. John Dodson. February 18, 2015

MFM Practitioner Module: Risk & Asset Allocation. John Dodson. January 25, 2012

Outline. Motivation Contest Sample. Estimator. Loss. Standard Error. Prior Pseudo-Data. Bayesian Estimator. Estimators. John Dodson.

MFM Practitioner Module: Risk & Asset Allocation. John Dodson. January 28, 2015

Fundamentals in Optimal Investments. Lecture I

MFM Practitioner Module: Risk & Asset Allocation. John Dodson. February 3, 2010

FE670 Algorithmic Trading Strategies. Stevens Institute of Technology

Attilio Meucci. Issues in Quantitative Portfolio Management: Handling Estimation Risk

MFM Practitioner Module: Risk & Asset Allocation. John Dodson. February 4, 2015

A Dynamic Model for Investment Strategy

Optimizing allocations

Aggregate Risk. MFM Practitioner Module: Quantitative Risk Management. John Dodson. February 6, Aggregate Risk. John Dodson.

Performance Measures for Ranking and Selection Procedures

MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 7, 2011

Markowitz Efficient Portfolio Frontier as Least-Norm Analytic Solution to Underdetermined Equations

Market Risk. MFM Practitioner Module: Quantitiative Risk Management. John Dodson. February 8, Market Risk. John Dodson.

Particle swarm optimization approach to portfolio optimization

Miloš Kopa. Decision problems with stochastic dominance constraints

F & B Approaches to a simple model

ECO 317 Economics of Uncertainty Fall Term 2009 Slides to accompany 13. Markets and Efficient Risk-Bearing: Examples and Extensions

Model Selection and Geometry

Statistical Inference

Bargaining, Contracts, and Theories of the Firm. Dr. Margaret Meyer Nuffield College

Birgit Rudloff Operations Research and Financial Engineering, Princeton University

Robust Efficient Frontier Analysis with a Separable Uncertainty Model

Online Question Asking Algorithms For Measuring Skill

Computational Connections Between Robust Multivariate Analysis and Clustering

CSC321 Lecture 18: Learning Probabilistic Models

Lecture Notes 1: Decisions and Data. In these notes, I describe some basic ideas in decision theory. theory is constructed from

Parameter Estimation

Information Choice in Macroeconomics and Finance.

Portfolios that Contain Risky Assets 17: Fortune s Formulas

Predictive Hypothesis Identification

Decision theory. 1 We may also consider randomized decision rules, where δ maps observed data D to a probability distribution over

Model comparison and selection

A Very Brief Summary of Statistical Inference, and Examples

Robustness and bootstrap techniques in portfolio efficiency tests

ORIGINS OF STOCHASTIC PROGRAMMING

Multivariate GARCH models.

ROBUST ESTIMATOR FOR MULTIPLE INLIER STRUCTURES

Sequential Monitoring of Clinical Trials Session 4 - Bayesian Evaluation of Group Sequential Designs

Parametric Techniques Lecture 3

Portfolios that Contain Risky Assets 15: Central Limit Theorem Objectives

A Minimax Theorem with Applications to Machine Learning, Signal Processing, and Finance

Robust Optimization: Applications in Portfolio Selection Problems

Modern Portfolio Theory with Homogeneous Risk Measures

A Very Brief Summary of Bayesian Inference, and Examples

Parametric Techniques

Frequentist-Bayesian Model Comparisons: A Simple Example

Testing Restrictions and Comparing Models

ECON4515 Finance theory 1 Diderik Lund, 5 May Perold: The CAPM

Optimal Investment Strategies: A Constrained Optimization Approach

Bios 6649: Clinical Trials - Statistical Design and Monitoring

Z = (X Loc X) (DisSq X) 1 (X Loc X).

IEOR E4703: Monte-Carlo Simulation

Lecture 20 May 18, Empirical Bayes Interpretation [Efron & Morris 1973]

Convex optimization problems. Optimization problem in standard form

Robust Two Step Confidence Sets, and the Trouble with the First Stage F Statistic

COS513 LECTURE 8 STATISTICAL CONCEPTS

Semidefinite and Second Order Cone Programming Seminar Fall 2012 Project: Robust Optimization and its Application of Robust Portfolio Optimization

Technical Appendix Information sales and strategic trading

Bandit Algorithms. Zhifeng Wang ... Department of Statistics Florida State University

Filling in the Blanks: Network Structure and Systemic Risk

Prior Information: Shrinkage and Black-Litterman. Prof. Daniel P. Palomar

Minimax Estimation of Kernel Mean Embeddings

The Behaviour of the Akaike Information Criterion when Applied to Non-nested Sequences of Models

OPTIMAL DESIGN INPUTS FOR EXPERIMENTAL CHAPTER 17. Organization of chapter in ISSO. Background. Linear models

Introduction & Random Variables. John Dodson. September 3, 2008

Noise Fit, Estimation Error and a Sharpe Information Criterion: Linear Case

Advanced Statistics I : Gaussian Linear Model (and beyond)

Maximum Likelihood, Logistic Regression, and Stochastic Gradient Training

4. Convex optimization problems

An Estimation Based Allocation Rule with Super-linear Regret and Finite Lock-on Time for Time-dependent Multi-armed Bandit Processes

Lawrence D. Brown* and Daniel McCarthy*

Chapter 8 - Statistical intervals for a single sample

ECON4510 Finance Theory Lecture 2

AN ADAPTIVE PROCEDURE FOR ESTIMATING COHERENT RISK MEASURES BASED ON GENERALIZED SCENARIOS. Vadim Lesnevski Barry L. Nelson Jeremy Staum

Statistical Approaches to Learning and Discovery. Week 4: Decision Theory and Risk Minimization. February 3, 2003

Statistical Inference of Covariate-Adjusted Randomized Experiments

SOME HISTORY OF STOCHASTIC PROGRAMMING

4. Convex optimization problems

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

For more information about how to cite these materials visit

Statistical Data Mining and Machine Learning Hilary Term 2016

Decision Models Lecture 5 1. Lecture 5. Foreign-Currency Trading Integer Programming Plant-location example Summary and Preparation for next class

Regularization of Portfolio Allocation

Expected Shortfall is not elicitable so what?

Estimating Covariance Using Factorial Hidden Markov Models

Sensor Tasking and Control

BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation

1. Fisher Information

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline.

Statistics: Learning models from data

Evaluating allocations

Speculation and the Bond Market: An Empirical No-arbitrage Framework

Lecture: Convex Optimization Problems

Mathematics for Economics and Finance

Gaussian Estimation under Attack Uncertainty

LECTURE 5. Introduction to Econometrics. Hypothesis testing

ECE 275A Homework 7 Solutions

Transcription:

Practitioner Course: Portfolio October 15, 2008

No introduction to portfolio optimization would be complete without acknowledging the significant contribution of the Markowitz mean-variance efficient frontier concept which lies at the heart of the two-step procedure Nonetheless, the mean-variance analysis has two major shortcomings which subsequent researchers and practitioners have tried to address: not averse to estimation or model risk no clear accommodation for manager advantage N.B.: These were not issues for Markowitz original application, because he argued that all investors were attempting to solve the same problem with all available information, and therefore a broad capitalization-weighted index should serve as a proxy for α SR for all 1. 1 The author argues that in fact α MV is the implicit benchmark.

Refinements The author discusses four approaches to these shortcomings. Coherent Allocations It may be valuable to re-cast the optimization problem in an explicitly setting. utility maximization optimization In the former case we use subjective probabilities; in the latter case we introduce a risk functional. Heuristic Techniques These are becoming more common in practice. re-sampling subjective characterization These address each of the mean-variance shortcomings in turn and can be applied separately or together within the two-step procedure.

Utility Maximization We have already discussed the limitations of expected utility as an index of satisfaction; in particular objective support and treatment of diversification. One advantage that utility has is it readily accommodates estimation risk. As before, denote the unknown parameters of the market vector by θ. As long as the constraints do not depend upon on these parameters, we can write arg max CE θ(α) α C ( ( )) = arg max E u Ψ θ α α C = arg max u ( α m ) f M θ (m) f θ YN (θ) dθ dm α C where f θ YN ( ) is posterior parameter density, based on the prior density f θ ( ) and the data y N = (x 1,..., x N ).

Utility Maximization Two-Step Procedure As long as the utility u( ) is increasing, this index is consistent with weak stochastic dominance, so the integral needs to be evaluated only along the efficient frontier based upon the predictive characterization with E M = m f M θ (m) f θ YN (θ) dθ dm cov M = m m f M θ (m) f θ YN (θ) dθ dm E M E M N.B.: Often the two-step procedure can be complicated by constraints that depend upon the parameters of the market vector. In this case, the approach will probably fail because we may not be able to pull the maximization out of the outer integral.

optimization is based the concept of opportunity cost. ( ) OC θ (α) = max S θ (β) S θ (α) 0 β C θ In classical optimization, θ is known with certainty and we can choose the allocation with zero opportunity cost. When we only have an estimate for θ, we can obtain a more robust result using a minimax criterion. Minimax Criterion Say we can be confident that the unknown true value of the market vector parameters lies in some range θ Θ. If the allocation constraints depend upon θ, we can define the smallest possible feasible set for this range, C Θ, and choose the optimal allocation as α Θ = arg min α C Θ max θ Θ OC θ(α)

There is a trade-off in selecting the range for the market vector parameters, θ Θ. large enough so that you can be confident that it includes the true value small enough so that the actual opportunity cost incurred from sub-optimality is not too large The author proposes the minimum-measure ellipsoid at some level of confidence based on the posterior characterization of the market vector parameters. Aversion to Market & Estimation Risk The problem for the mean-variance efficient frontier can be reduced to second-order cone programming. arg max α C α ˆµ γ m α ˆΣ α γ e α T α for M µ N (µ, ˆΣ) and µ N (ˆµ, T ).

re-sampling 2 is neither motivated nor justified by probability or decision theory (nor is it re-sampling in the statistical sense). It is a heuristic intended to directly address the instability of optimal portfolios. Simulations Once the parameters for the market vector have been estimated from y N, instead of using ˆθ to maximize S θ (α), 1. use them to generate a large number { of random samples } of the invariants each of length N, y (1) N,..., y (Q) N 2. use the new samples } to re-estimate the parameters, {ˆθ (1),..., ˆθ (Q) 3. use the re-estimated parameters to re-solve the optimization problem, { α (1),..., α (Q)} report back the mean allocation, α RS = 1 Q Q q=1 α(q) 2 The author acknowledges relevant intellectual property rights.

Presumably an argument based upon the Law of Large Numbers can be made to show that this procedure converges. But the author points out several considerations: If the constraints are not linear in the allocations, there is nothing to prevent the procedure from converging to an infeasible portfolio. We know nothing about the relationship between the resampled allocation and the true, unobserved optimal allocation. This technique is comparable to other heuristics for stabilizing the optimal portfolio such as the inclusion of transactions penalties or exogenous diversification rules. Pseudo-Data Interpretation My own speculation is that one could develop a more coherent version of this technique with the pseudo-data interpretation of the conjugate prior, where the pseudo-data vector of length N Q is drawn from a prior based on ˆθ.

The procedure can be interpreted as a technique for specifying a prior on the market vector parameters based on manager advantage. It can be generalized, but was originally presented and is most easily understood in terms of linear views on a normal model for the market invariants. Objective Market Model We assume that the invariants are distributed according to X N (µ, Σ) where µ and Σ are given (often µ = 0 or µ = r f τ 1 based upon the risk-free rate of interest). Manager Expertise The manager s expertise may be limited to a subset of the market, defined by a picks matrix P that defines the linear combinations of factors for which the manager is prepared to provide a view.

Manager View The manager expresses his view, v, on the picks and a scalar confidence, 0 c 1 ( ( ) ) 1 V P X N v, c 1 P Σ P Subjective Measure Updating the objective characterization with the view leads to the (posterior) subjective characterization X V N (µ BL, Σ BL ) with µ BL = µ + c Σ P (P Σ P ) 1 (v P µ) Σ BL = Σ c Σ P (P Σ P ) 1 P Σ

We can then proceed with classical equivalent optimization using the subjective characterization of the market vector M = a + B g(x ). Consistency of The Subjective Measure It may be difficult to quantify the manager s confidence, c. To that end, it is useful to measure how consistent the manager s subjective characterization is with the objective characterization. Focusing on the subjective location, the author notes that Mahalanobis distance (squared) for a normal is chi-squared. He defines the consistency between µ BL and µ as C = 1 F χ 2 N ( (µbl µ) Σ 1 (µ BL µ) ) where χ 2 N G(N, 1). A low level of consistency reflects bold views.