Copulas. MOU Lili. December, 2014

Similar documents
Probability Distribution And Density For Functional Random Variables

Multivariate Non-Normally Distributed Random Variables

Multivariate Distribution Models

On the Choice of Parametric Families of Copulas

Gaussian Process Vine Copulas for Multivariate Dependence

Financial Econometrics and Volatility Models Copulas

A Brief Introduction to Copulas

Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011

Robustness of a semiparametric estimator of a copula

Multivariate Statistics

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Multiple Random Variables

Estimation of Copula Models with Discrete Margins (via Bayesian Data Augmentation) Michael S. Smith

Using copulas to model time dependence in stochastic frontier models

Estimation of multivariate critical layers: Applications to rainfall data

Lecture 5: GPs and Streaming regression

MAXIMUM ENTROPIES COPULAS

Variational Inference with Copula Augmentation

PARSIMONIOUS MULTIVARIATE COPULA MODEL FOR DENSITY ESTIMATION. Alireza Bayestehtashk and Izhak Shafran

Modelling Dropouts by Conditional Distribution, a Copula-Based Approach

Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution

1 Joint and marginal distributions

Information geometry for bivariate distribution control

Probability Distributions and Estimation of Ali-Mikhail-Haq Copula

The Instability of Correlations: Measurement and the Implications for Market Risk

The Origin of Deep Learning. Lili Mou Jan, 2015

A New Generalized Gumbel Copula for Multivariate Distributions

Introduction to Normal Distribution

Multivariate Random Variable

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

12 - Nonparametric Density Estimation

Chapter 5 continued. Chapter 5 sections

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Approximation of multivariate distribution functions MARGUS PIHLAK. June Tartu University. Institute of Mathematical Statistics

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER.

Modeling Dependence of Daily Stock Prices and Making Predictions of Future Movements

On tail dependence coecients of transformed multivariate Archimedean copulas

Non parametric estimation of Archimedean copulas and tail dependence. Paris, february 19, 2015.

Simulating Exchangeable Multivariate Archimedean Copulas and its Applications. Authors: Florence Wu Emiliano A. Valdez Michael Sherris

Explicit Bounds for the Distribution Function of the Sum of Dependent Normally Distributed Random Variables

Trivariate copulas for characterisation of droughts

Lecture 25: Review. Statistics 104. April 23, Colin Rundel

4. Distributions of Functions of Random Variables

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Introduction Outline Introduction Copula Specification Heuristic Example Simple Example The Copula perspective Mutual Information as Copula dependent

Research Projects. Hanxiang Peng. March 4, Department of Mathematical Sciences Indiana University-Purdue University at Indianapolis

Copula Based Independent Component Analysis

EE4601 Communication Systems

Partial Correlation with Copula Modeling

A Goodness-of-fit Test for Copulas

Calibration Estimation for Semiparametric Copula Models under Missing Data

Introduction to Probability and Stocastic Processes - Part I

GAUSSIAN PROCESS REGRESSION

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

Exam 2. Jeremy Morris. March 23, 2006

A Probability Distribution Of Functional Random Variable With A Functional Data Analysis Application

STT 843 Key to Homework 1 Spring 2018

ECE 4400:693 - Information Theory

Lecture Quantitative Finance Spring Term 2015

EEL 5544 Noise in Linear Systems Lecture 30. X (s) = E [ e sx] f X (x)e sx dx. Moments can be found from the Laplace transform as

Machine learning - HT Maximum Likelihood

Estimation Under Multivariate Inverse Weibull Distribution

Marginal Specifications and a Gaussian Copula Estimation

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline.

Hybrid Censoring; An Introduction 2

Continuous Random Variables

Goodness-of-Fit Testing with Empirical Copulas

E X A M. Probability Theory and Stochastic Processes Date: December 13, 2016 Duration: 4 hours. Number of pages incl.

EE/Stats 376A: Homework 7 Solutions Due on Friday March 17, 5 pm

Nonparameteric Regression:

Bivariate Rainfall and Runoff Analysis Using Entropy and Copula Theories

Lecture 3. Conditional distributions with applications

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur

Bivariate Degradation Modeling Based on Gamma Process

Correlation: Copulas and Conditioning

STA 2201/442 Assignment 2

Modelling Dependence with Copulas and Applications to Risk Management. Filip Lindskog, RiskLab, ETH Zürich

Imputation Algorithm Using Copulas

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

matrix-free Elements of Probability Theory 1 Random Variables and Distributions Contents Elements of Probability Theory 2

Chapter 3 sections. SKIP: 3.10 Markov Chains. SKIP: pages Chapter 3 - continued

Copulas and Measures of Dependence

Statistical Data Mining and Machine Learning Hilary Term 2016

Quick Tour of Basic Probability Theory and Linear Algebra

Contents 1. Contents

Using copulas to deal with endogeneity

Bayesian estimation of the discrepancy with misspecified parametric models

SOLUTIONS TO MATH68181 EXTREME VALUES AND FINANCIAL RISK EXAM

Time Varying Hierarchical Archimedean Copulae (HALOC)

Correlation & Dependency Structures

EVANESCE Implementation in S-PLUS FinMetrics Module. July 2, Insightful Corp

Semi-parametric predictive inference for bivariate data using copulas

Number Theory and Probability

Convexity of chance constraints with dependent random variables: the use of copulae

Using copulas to model time dependence in stochastic frontier models Preliminary and Incomplete Please do not cite

Simulating Realistic Ecological Count Data

1. Point Estimators, Review

σ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) =

Notes on Random Vectors and Multivariate Normal

Identification of marginal and joint CDFs using Bayesian method for RBDO

Transcription:

Copulas MOU Lili December, 2014

Outline Preliminary Introduction Formal Definition Copula Functions Estimating the Parameters Example Conclusion and Discussion

Preliminary MOU Lili SEKE Team 3/30

Probability on Continous Variables Probability density function (pdf) f f(x) = Cumulative distribution function (cdf) F P (x ɛ X x + ɛ) 2ɛ F (x) = P (X x) Relationship between pdf and cdf f(x) = F (x) = x F (x) x f(t)dt MOU Lili SEKE Team 4/30

Introduction MOU Lili SEKE Team 5/30

From Marginal to Joint Probability We have random variables X = (X 1, X 2,, X d ) If we know the marginal distribution F 1(x 1),, F d (x d ), Say X i U[0, 1] What is the joint distribution? Is the joint distribution unique? No, because we do not know the dependencies among the variables. MOU Lili SEKE Team 6/30

Basic Idea Life will be easier, if... we have some effective way of modeling dependencies between variables. (Copula) Copula + Marginal distribution Joint distribution MOU Lili SEKE Team 7/30

Formal Definition MOU Lili SEKE Team 8/30

Copulas Let U = (U 1,, U d ) be d dimensional random variables, with marginal distribution U i U[0, 1] Define a copula function as a joint cdf on U C(u 1,, u d ) = P (U 1 u 1,, U d u d ) Essentially, a copula is defined as a joint distribution on a unit hyper-cube. MOU Lili SEKE Team 9/30

Sklar s Theorem Theorem For a multivariate distribution F (x 1,, x n), with marginal distribution F i(x i), a copula C s.t. C(F 1(x 1),, F d (x d )) = F (x 1,, x d ) Furthermore, if the marginal distributions are continuous, the copula is unique. MOU Lili SEKE Team 10/30

Proof 1. To prove F (X) U[0, 1], Let y = F X(x) and x = F 1 X (y) = H(y) f Y (y) = H (y) f X[H(y)] = dh(y) f X[H(y)] = dx dy df fx(x) = 1 X(x) 2. By the definition of copula C(u 1,, u d ) = P (U 1 u 1,, U d u d ), we have C(F 1(x 1),, F d (x d )) = P (F 1(X 1) F 1(x 1),, F d (X d ) F d (X d )) = P (X 1 x 1,, X d x d ) = F X (x 1,, x d ) MOU Lili SEKE Team 11/30

Joint Probability Density Function Through differentiation of C(F 1(x 1),, F d (x d )) = F (x 1,, x d ) We obtain d f(x 1,, x n) = c(f 1(x 1),, F d (x d )) f i(x i) i=1 where c(u 1,, u d ) = d C(u 1,, u d ) u 1 u d MOU Lili SEKE Team 12/30

Importance of Sklar s Theorem Instead of specifying directly an odd joint distribution on X 1,, X d, we Model the marginal distribution, which is easy, and Specify a coplua C, modeling nontrivial dependencies between variables. Sklar s Theorem guarantees: Copula(Marginal)=Joint MOU Lili SEKE Team 13/30

Copula Functions MOU Lili SEKE Team 14/30

Archimedean Copula Family C φ (u 1, u d ) = φ 1 ( d i=1 φ(u i) ) Constraints on Φ Verify the marginal distribution F (u 1) = C φ (u 1, +,, + ) = φ 1 φ(u 1) = u 1 U i U[0, 1] MOU Lili SEKE Team 15/30

Examples Glayton, φ(x) = x α 1, α > 0, Gumbel, ( log(x)) α, α > 1, Frank, φ(x) = log ( ) exp( αx) 1, α (, + ), α 0 exp( α) 1 The parameters are learned with training examples. Specially, if α = 1 in Gumbel copula, then variables are independent. MOU Lili SEKE Team 16/30

Glayton φ(x) = x α 1, α > 0 MOU Lili SEKE Team 17/30

"Correct" Copula Substitute u i = F i(x i) into C(F 1(x 1),, F d (x d )) = F (x 1,, x d ) We obtain C(u i,, u d ) = F (F 1 1 (u 1), F 1 d (u d ) Note that for any valid multivariate cumulative function C is valid regardless of F and F i. C(u 1, 1,, 1) = F (F 1 1 (u 1),,, ) = P (F 1 1 (U 1) < F 1 1 (u 1)) = P (U 1 < u 1) = u 1 Let F i to be standard norm distribution, and F to be norm with 0 mean and covariance matrix Σ. MOU Lili SEKE Team 18/30

Example ρ =.4 [Source: http://en.wikipedia.org/wiki/copula_(probability_theory)] MOU Lili SEKE Team 19/30

Estimating the Parameters MOU Lili SEKE Team 20/30

Semi-Parametric Estimation Log-likelihood l(α, θ 1,, θ d ) = ( ) n ( ) n d log c α F θ1 (x (j) 1 ),, F θ d (x (j) d ) + log(f θ1 (x (j) i )) j=1 j=1 i=1 Inference for Margins (IFM), 2 two-stage procedure: 1. Estimate marginal distributions 2. Estimate copula parameters l(α) = log ( c α ( ˆF1(x (j) 1 ),, ˆF d (x (j) d ) )) MOU Lili SEKE Team 21/30

Semi-Parametric Estimation The marginal distribution is estimated empirically. F i(y) = P (X i y) 1 n n j=1 1{x (j) i y} The log-likelihood ( ( l(α) = log c α F1(x (j) 1 ),, F )) d (x (j) d ) MOU Lili SEKE Team 22/30

Example MOU Lili SEKE Team 23/30

Teany Tiny Baby Example Bivariate normal distribution µ = (00) T Σ = ( 1.25 0.43 0.43 1.75 ) 1000 training data samples Frank Archimedean copula semiparametric version of IFM MOU Lili SEKE Team 24/30

Results: Density MOU Lili SEKE Team 25/30

Results: Squared Error MOU Lili SEKE Team 26/30

Results: Marginal Distribution MOU Lili SEKE Team 27/30

Conclusion and Discussion MOU Lili SEKE Team 28/30

Conclusion A copula is defined as a joint cdf on a unit hypercude with marginal distributions U[0, 1]. Copulas link joint distribution with marginal distributions. Copula ( Marginals ) = Joint We can specify copulars explicitly, or via a (joint, marginals) pair Parameters are learned by MLE or its variations. + Copulas provide a means of modeling non-trivial dependencies. - Choosing copulas is a $64,000,000 question. Inappropriate copulas, which fail to model true dependencies in data, may also cause considerable financial losses. MOU Lili SEKE Team 29/30

Discussion Q&A Thanks for listening! MOU Lili SEKE Team 30/30