Introduction & Random Variables. John Dodson. September 3, 2008

Similar documents
MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 7, 2011

MFM Practitioner Module: Quantitative Risk Management. John Dodson. September 23, 2015

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Brief Review of Probability

TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1

MFM Practitioner Module: Risk & Asset Allocation. John Dodson. February 4, 2015

1: PROBABILITY REVIEW

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline.

STA2603/205/1/2014 /2014. ry II. Tutorial letter 205/1/

STATISTICS SYLLABUS UNIT I

Probability Distributions Columns (a) through (d)

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable

Week 1 Quantitative Analysis of Financial Markets Distributions A

MFM Practitioner Module: Risk & Asset Allocation. John Dodson. February 3, 2010

Math Review Sheet, Fall 2008

Twelfth Problem Assignment

Multivariate Distributions

Order Statistics and Distributions

Random Variables and Their Distributions

Introduction and Overview STAT 421, SP Course Instructor

CS Lecture 19. Exponential Families & Expectation Propagation

Market Risk. MFM Practitioner Module: Quantitiative Risk Management. John Dodson. February 8, Market Risk. John Dodson.

Probability and Statistics

Stat 5101 Lecture Notes

MFM Practitioner Module: Risk & Asset Allocation. John Dodson. January 28, 2015

Department of Statistics Gauhati University B.Sc. General Syllabus: Statistics (Semester System)

MFM Practitioner Module: Risk & Asset Allocation. John Dodson. January 25, 2012

Subject CS1 Actuarial Statistics 1 Core Principles

Essential Maths 1. Macquarie University MAFC_Essential_Maths Page 1 of These notes were prepared by Anne Cooper and Catriona March.

Question Points Score Total: 76

3 Continuous Random Variables

HANDBOOK OF APPLICABLE MATHEMATICS

Introduction to Probability and Statistics (Continued)

2. Variance and Higher Moments

IEOR 3106: Introduction to Operations Research: Stochastic Models. Professor Whitt. SOLUTIONS to Homework Assignment 2

MATH : EXAM 2 INFO/LOGISTICS/ADVICE

Continuous random variables and probability distributions

A Few Special Distributions and Their Properties

6.1 Moment Generating and Characteristic Functions

Week 2. Review of Probability, Random Variables and Univariate Distributions

Probability. Machine Learning and Pattern Recognition. Chris Williams. School of Informatics, University of Edinburgh. August 2014

Mathematical Statistical Physics Solution Sketch of the math part of the Exam

IAM 530 ELEMENTS OF PROBABILITY AND STATISTICS LECTURE 3-RANDOM VARIABLES

Fundamentals in Optimal Investments. Lecture I

STAT 302 Introduction to Probability Learning Outcomes. Textbook: A First Course in Probability by Sheldon Ross, 8 th ed.

Chapter 1. Probability, Random Variables and Expectations. 1.1 Axiomatic Probability

Random variables. DS GA 1002 Probability and Statistics for Data Science.

Elementary Probability. Exam Number 38119

Stat 2300 International, Fall 2006 Sample Midterm. Friday, October 20, Your Name: A Number:

cf. The Fields medal carries a portrait of Archimedes.

Topic 3: The Expectation of a Random Variable

Chapter 5. Means and Variances

Expectation. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

Module 3. Function of a Random Variable and its distribution

Stat 5101 Lecture Slides Deck 4. Charles J. Geyer School of Statistics University of Minnesota

System Simulation Part II: Mathematical and Statistical Models Chapter 5: Statistical Models

X 1 ((, a]) = {ω Ω : X(ω) a} F, which leads us to the following definition:

Summary of basic probability theory Math 218, Mathematical Statistics D Joyce, Spring 2016

MATHEMATICS (MATH) Calendar

Quiz 1. Name: Instructions: Closed book, notes, and no electronic devices.

Statistics 3657 : Moment Generating Functions

DS-GA 1002 Lecture notes 2 Fall Random variables

MAS113 Introduction to Probability and Statistics

Algebra I+ Pacing Guide. Days Units Notes Chapter 1 ( , )

Some Aspects of Universal Portfolio

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY (formerly the Examinations of the Institute of Statisticians) HIGHER CERTIFICATE IN STATISTICS, 1996

Solutions to Math 41 First Exam October 15, 2013

Probability: Handout

Metric spaces and metrizability

Lecture 1: Probability Fundamentals

Generalized Hypothesis Testing and Maximizing the Success Probability in Financial Markets

Expectation. DS GA 1002 Probability and Statistics for Data Science. Carlos Fernandez-Granda

ELEG 3143 Probability & Stochastic Process Ch. 2 Discrete Random Variables


Foundations of Analysis. Joseph L. Taylor. University of Utah

Will Murray s Probability, XXXII. Moment-Generating Functions 1. We want to study functions of them:

GB2 Regression with Insurance Claim Severities

Statistics for scientists and engineers

STATISTICS ANCILLARY SYLLABUS. (W.E.F. the session ) Semester Paper Code Marks Credits Topic

(6, 4) Is there arbitrage in this market? If so, find all arbitrages. If not, find all pricing kernels.

STATISTICAL METHODS FOR SIGNAL PROCESSING c Alfred Hero

1 Review of di erential calculus

Introduction to the Mathematical and Statistical Foundations of Econometrics Herman J. Bierens Pennsylvania State University

Aggregate Risk. MFM Practitioner Module: Quantitative Risk Management. John Dodson. February 6, Aggregate Risk. John Dodson.

ARE211 FINAL EXAM DECEMBER 12, 2003

Probability Distributions for Continuous Variables. Probability Distributions for Continuous Variables

STA205 Probability: Week 8 R. Wolpert

Dr. Maddah ENMG 617 EM Statistics 10/15/12. Nonparametric Statistics (2) (Goodness of fit tests)

Statistical Modeling and Cliodynamics

Slides 8: Statistical Models in Simulation

Multivariate Normal-Laplace Distribution and Processes

Contents 1. Contents

1.6 Families of Distributions

Math 493 Final Exam December 01

Assessing financial model risk

Week Topics of study Home/Independent Learning Assessment (If in addition to homework) 7 th September 2015

Review. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

Grade Math (HL) Curriculum

We are going to discuss what it means for a sequence to converge in three stages: First, we define what it means for a sequence to converge to zero

Foundations of Probability and Statistics

Transcription:

& September 3, 2008 Statistics

Statistics Statistics

Course Fall sequence modules portfolio optimization Bill Barr fixed-income markets Chris Bemis calibration & simulation introductions flashcards how to contact, memorable fact module syllabus office hours evaluations & grading module text required Meucci recommended DeGroot, Bertsimas Statistics

Module Goal My goal is to introduce modern concepts to describe uncertainty in the financial markets and how to convert investment manager expertise and investor preferences into optimal quantitative investment strategies. Out of Scope I will not be talking about the economics of financial markets, fundamental or technical investment analysis, or the origin of investment manager expertise. Statistics

Meucci s Program for Asset Allocation detect market invariance select the invariants estimate the market specify the distribution of the invariants decide on approach to dealing with heteroskedasticity model the market project the invariants to the horizon map the invariants back to market prices define investor optimality objective satisfaction constraints collect manager experience prior on the market vector parameters determine optimal allocation two-step approach Statistics

Definitions In practice, a random variable is a quantity that does not have a value known to us right now but will have a value at some point in the future. These are represented as measurable functions of a sample space and usually denoted by upper-case Roman letters. A particular value obtained by a random variable is often denoted by the corresponding lower-case letter. There must be a probability associated with every possible set of outcomes. Such sets are called events and might consist of intervals or points or any combination thereof. The corresponding probability is called the probability measure of the event. Statistics

This structure lends itself to a calculus interpretation, where the probability associated with a set is simply the integral of the probability density over that set. P {X (a, b)} = b a f X (x) dx If X ranges over the real numbers R, then f X ( ) must have certain properties. In particular, it must be a non-negative (generalized) function and lim f X (x) = 0 x f X (x) dx = 1 lim f X (x) = 0 x Statistics

In addition to the density function, there are at least three other characterizations of a random variable distribution function F X (x) = x f X (x ) dx quantile function Q X (p) = F 1 X (p) characteristic function φ X (t) = ei t x f X (x) dx This last is based on the Fourier transform, where i 2 = 1. While the density function is the most common, these four representations are all equivalent and we will be working with each. They can be distinguished by the nature of their arguments; resp. the value of an outcome, the upper range on a set of outcomes, a probability, and a frequency. Statistics

If we have a characterization of a random variable X, it is natural to ask if we can derive the characterization of a function of that variable Y = h(x ), or of a sample, {X 1, X 2,..., X n }, Y = h (X 1, X 2,...). This is in general difficult, but there are some notable easy cases. f, F, and Q for an increasing, invertible function f Y (y) = f ( X h 1 (y) ) h (h 1 (y)) ( F Y (y) = F X h 1 (y) ) Q Y (p) = h (Q X (p)) φ for the mean of a sample, Y n = 1 n n j=1 X j φ Yn (t) = [ ( t )] n φ X n Statistics

We will talk in detail about the topic of estimation later, but if you were asked to give a best guess about the value of random variable, the expected value would be a natural answer. The expected is value is a density-weighted average of all possible values EX = = = 1 0 x f X (x) dx 1 2 F X (x) dx Q X (p) dp = i φ (0) Statistics

While it may not be possible to evaluate the characterization of function of a random variable, it is generally possible to evaluate the expected value of a function of a random variable. Eh(X ) = h(x) f X (x) dx The probability measure of an event A is the expectation of the indicator function for the event P {X A} = E1 A (X ) In general, the expected value of a function of a random variable is not just value of the function at the expected value of the random variable. Eh(X ) h (EX ) Statistics Do not make this mistake!

A foundational result connects the expectation of a random variable with the sample mean. 1 lim n n n X j = EX j=1 almost surely We will demonstrate the weak version of this result using the characteristic function and then look at some numerical results. The key to the derivation is to recognize that ( φ Yn (t) = 1 + i t EX ( t ) ) n + o e i t EX n n Statistics

The LLN, combined with information technology, brought about a revolution in applied mathematics last century, introducing a completely novel way to evaluate integrals. integrals are expectations of functions of random variables h(x) dx = h(x) f X (x) f X (x) dx = E h(x ) f X (X ) expectations are means of random samples E h(x ) a.s. 1 = lim f X (X ) n n n j=1 h(x j ) f X (x j ) you can evaluate an arbitrarily complicated integral if you can identify an appropriate random variable generate a very large sample of random variates cheaply evaluate the integrand and density function Statistics

Statistics Affine equivariance guides us in defining measures for the location and dispersion of a random variable. In particular, we should expect Loc {a X + b} = a Loc {X } + b Disp {a X + b} = a Disp {X } Location The expected value Loc {X } def = EX is a natural candidate for a measure of location. Dispersion The standard deviation is a natural candidate for a measure of dispersion. Disp {X } def = E (X 2 ) (EX ) 2 Statistics

Statistics Other candidates for a measure of location include mode arg max f X median Q X ( 1 2 ) Alternate measures of dispersion include ) 1/2 modal dispersion ( 2 log f arg X max fx inter-quartile range Q X ( 3 4 ) Q X ( 1 4 ) absolute deviation E X EX Standard deviation as a measure of dispersion is justified by an important general result: Chebyshev (Qebyx v) inequality For any r.v. X with a finite standard deviation and k > 0, { } P X EX < k E (X 2 ) (EX ) 2 > 1 1 k 2 Statistics

Denote the expected value and standard deviation of a random variable X by µ and σ. The standardized transformation of X is Z = X µ σ Its characteristic function is ( φ Z (t) = e i µ t ) σ t φ X σ The moments of Z measure the skewness, kurtosis, etc. of X. The easiest way to evaluate these is to note that E (Z n ) = ( i) n φ (n) Z (0) N.B.: Moments do not always exist, and they are generally not adequate to characterize a random variable. Statistics

The most important distribution for X R is the normal or gaussian distribution. It takes two parameters, and is denoted X N ( µ, σ 2). µ is often called the mean. f X (x) = 1 2π e 1 x µ 2 ( σ ) 2 1 ( σ F X (x) = 1 2 erfc 1 2 x µ ) σ Q X (p) = µ 2 σ erfc 1 (2 p) φ X (t) = e i µ t 1 2 σ2 t 2 Statistics density

A foundational result connects the normal distribution with the. For any random variable X with finite expected value µ and standard deviation σ, n lim n 1 n n X j µ N ( 0, σ 2) in distribution j=1 That is, the deviation between the sample mean and the expected value is approximately normal, with standard deviation equal to the standard deviation of the random variable divided by the square root of the sample size. This result can be demonstrated by considering the characteristic function of the RHS above. Statistics

Recall, a random variable is a measurable function of the sample space with respect to a probability measure. It can be useful to work with the same random variable under an alternate probability measure. Radon-Nikodym theorem If P and P are equivalent measures, then there is a random variable Z 0 such that for any random variable X E X = E (Z X ) For example, say that X N (µ, σ 2 ) under P, but X N (µ, σ 2 ) under P. Then the Radon-Nikodym derivative is Z = e X σ ( µ σ µ σ ) + 1 2 ( µ σ ) 2 1 2 ( µ ) 2 σ Statistics and this same Z would apply to any function of X.

The entropy of a distribution is a measure of how much information we do not know about the random variable. H X = E log f X (X ) A technique for increasing the entropy associated with a random variable is to introduce randomness into the characterization of the random variable. For example, X µ N ( µ, σ 2) µ N ( ν, τ 2) If we knew the mean, the r.v. would be normal with a given standard deviation. But we don t know the mean. By letting the mean be random, the entropy is increased. ( σ 2 + τ 2) 1 + log 2π + log σ 2 > 1 H X 1 + log 2π + log = H X µ Statistics

Common s Below is a taxonomy of common distributions, classified primarily by the topology of the support. finite Dirac Bernoulli binomial countable geometric Poisson compact uniform beta semi-compact exponential Lévy Gamma unbounded normal Cauchy stable transforms log-normal chi-squared inverse-gamma Pareto mixtures Student-t negative-binomial non-parametric empirical Statistics