Multivariate Non-Normally Distributed Random Variables

Similar documents
Financial Econometrics and Volatility Models Copulas

Multivariate Non-Normally Distributed Random Variables in Climate Research Introduction to the Copula Approach

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

Copulas. MOU Lili. December, 2014

MAXIMUM ENTROPIES COPULAS

The Instability of Correlations: Measurement and the Implications for Market Risk

Trivariate copulas for characterisation of droughts

A copula goodness-of-t approach. conditional probability integral transform. Daniel Berg 1 Henrik Bakken 2

Multivariate Statistics

A Goodness-of-fit Test for Copulas

Lecture Quantitative Finance Spring Term 2015

How to select a good vine

PROBABILITY DISTRIBUTIONS. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception

Gaussian Process Vine Copulas for Multivariate Dependence

On the Systemic Nature of Weather Risk

Ensemble Copula Coupling (ECC)

Semi-parametric predictive inference for bivariate data using copulas

Multivariate Distributions

Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011

COPULA BASED STOCHASTIC WEATHER GENERATOR AS AN APPLICATION FOR CROP GROWTH MODELS AND CROP INSURANCE. A Dissertation MIRIAM JUAREZ TORRES

A New Generalized Gumbel Copula for Multivariate Distributions

Bayesian Model Averaging for Multivariate Extreme Values

Copulas and dependence measurement

Marginal Specifications and a Gaussian Copula Estimation

Probability Distribution And Density For Functional Random Variables

Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business

Multivariate Distribution Models

Robustness of a semiparametric estimator of a copula

Markov Switching Regular Vine Copulas

Two practical tools for rainfall weather generators

Multivariate Capability Analysis Using Statgraphics. Presented by Dr. Neil W. Polhemus

Construction and estimation of high dimensional copulas

Modelling Dependence with Copulas and Applications to Risk Management. Filip Lindskog, RiskLab, ETH Zürich

Ensemble Copula Coupling: Towards Physically Consistent, Calibrated Probabilistic Forecasts of Spatio-Temporal Weather Trajectories

EVANESCE Implementation in S-PLUS FinMetrics Module. July 2, Insightful Corp

A simple graphical method to explore tail-dependence in stock-return pairs

Semi-parametric estimation of non-stationary Pickands functions

Bayesian Inference for Pair-copula Constructions of Multiple Dependence

Lifetime Dependence Modelling using a Generalized Multivariate Pareto Distribution

Copula-based top-down approaches in financial risk aggregation

Copulas, a novel approach to model spatial and spatio-temporal dependence

VaR vs. Expected Shortfall

Bivariate Rainfall and Runoff Analysis Using Entropy and Copula Theories

Lecture 3. Probability - Part 2. Luigi Freda. ALCOR Lab DIAG University of Rome La Sapienza. October 19, 2016

Goodness-of-Fit Testing with Empirical Copulas

Tail negative dependence and its applications for aggregate loss modeling

Copulas, Higher-Moments and Tail Risks

CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS

Contents 1. Coping with Copulas. Thorsten Schmidt 1. Department of Mathematics, University of Leipzig Dec 2006

Deep Variational Inference. FLARE Reading Group Presentation Wesley Tansey 9/28/2016

Pair-copula constructions of multiple dependence

Non parametric estimation of Archimedean copulas and tail dependence. Paris, february 19, 2015.

Bayesian linear regression

Probability and Stochastic Processes

Copula-based approximation of particle breakage as link between DEM and PBM

Statistical Analysis of Financial Data

Copulas. Mathematisches Seminar (Prof. Dr. D. Filipovic) Di Uhr in E

8 Copulas. 8.1 Introduction

Recovering Copulae from Conditional Quantiles

An empirical analysis of multivariate copula models

Behaviour of multivariate tail dependence coefficients

Statistics Ph.D. Qualifying Exam: Part I October 18, 2003

Modelling and Estimation of Stochastic Dependence

Extreme Value Analysis and Spatial Extremes

Correlation & Dependency Structures

Tail Dependence of Multivariate Pareto Distributions

The Use of Copula Functions in Pricing Weather Contracts for the California Wine Industry

Calibration Estimation for Semiparametric Copula Models under Missing Data

Evolution of copulas: Continuous, D Title Application to Quantitative Risk Ma

HANDBOOK OF APPLICABLE MATHEMATICS

Package CopulaRegression

Contents 1. Contents

Modelling Dropouts by Conditional Distribution, a Copula-Based Approach

Reducing Model Risk With Goodness-of-fit Victory Idowu London School of Economics

Bayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework

Hidden Markov Models for precipitation

Parametric estimation methods of multivariate data for multi-component image segmentation

DETERMINATION OF COMBINED PROBABILITIES OF OCCURRENCE OF WATER LEVELS AND WAVE HEIGHTS FOR THE SAFTEY VALIDATION OF DIKES AT THE BALTIC SEA

Estimation of multivariate critical layers: Applications to rainfall data

Pattern Recognition. Parameter Estimation of Probability Density Functions

A measure of radial asymmetry for bivariate copulas based on Sobolev norm

Parameter estimation of a Lévy copula of a discretely observed bivariate compound Poisson process with an application to operational risk modelling

Journal of Statistical Software

Vine copulas with asymmetric tail dependence and applications to financial return data 1. Abstract

TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1

Dynamic D-Vine Copula Model with Applications to Value-at-Risk (VaR)

Songklanakarin Journal of Science and Technology SJST R1 Sukparungsee

Simulating Exchangeable Multivariate Archimedean Copulas and its Applications. Authors: Florence Wu Emiliano A. Valdez Michael Sherris

Supplementary Material for On the evaluation of climate model simulated precipitation extremes

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Generating gridded fields of extreme precipitation for large domains with a Bayesian hierarchical model

Estimation theory. Parametric estimation. Properties of estimators. Minimum variance estimator. Cramer-Rao bound. Maximum likelihood estimators

Introduction to Probability and Statistics (Continued)

Multivariate verification: Motivation, Complexity, Examples

Using copulas to model time dependence in stochastic frontier models

Bayesian Models in Machine Learning

Interpolation of daily mean air temperature data via spatial and non-spatial copulas

Statistics: Learning models from data

Research Projects. Hanxiang Peng. March 4, Department of Mathematical Sciences Indiana University-Purdue University at Indianapolis

Efficient estimation of a semiparametric dynamic copula model

Transcription:

Multivariate Non-Normally Distributed Random Variables An Introduction to the Copula Approach Workgroup seminar on climate dynamics Meteorological Institute at the University of Bonn 18 January 2008, Bonn Christian Schölzel 1(2) and Petra Friederichs 1 1) Meteorological Institute at the University of Bonn Bonn, Germany 2) Laboratoire des Sciences du Climat et l Environnement (LSCE) Gif-sur-Yvette, France

Motivation Multivariate non-normally distributed random variables Interest Estimating multivariate probability density functions Problem At least one component might be precipitation wind speed and direction relative humidity cloud cover... extremes Question How to describe random vectors with non-normal marginals?

Background General problem with multivariate random variables Standard textbooks Storch, H. v. and Zwiers, F. W. (1999): Statistical Analysis in Climate Research Wilks, D. S. (2005): Statistical Methods in the Atmospheric Sciences multivariate normal distribution Marginal distributions numerous parametric descriptions Dependence allows for more complexity degrees-of-freedom no canonical extension

Related Approaches How has this problem been addressed so far? Multivariate normal distribution* *) Not an option True/native multivariate distribution Best option, but often restricted in modeling marginals and dependence Mixture models Flexible w.r.t. covariance structure, complex in higher dimensions, no account for the original marginal distribution Marginal and conditional pdf Complexity of the estimation process grows exponentially Multivariate KDE* *) Not a parametric description Johnson distribution Simple concept, limited dependence strucure (precursor & special case)

The copula approach Basics The copula approach A brief (and incomplete) historical overview... 1949: johnson distribution and other precursors 1959: the term copula has been introdced (Sklar) 2000: applications in econometry 2007: applications in hydrology

Copula Definition Sklar s therorm (1959) Assume a m-dimensional random vector X with marginal cumulative distribution functions F X1,..., F Xm Definition copula A copula C X is a multivariate distribution with standard uniform marginal distributions Sklar s theorem (1959) Every joint distribution F X can be written as a function of its marginal distributions F X ( x) = C X (F X1 (x 1 ),..., F Xm (x m)) Note: C X is a copula, since U j = F Xj (X j ) = U j U(0, 1)

Definition Copula Copula density Copula density Furthermore, C X is unique and can be expressed by with u j = F Xj (x j ) u 1 u m Z Z C X (u 1,..., u m) = c X (u 1,..., u m) du 1... u m {z } 0 0 copula density Consequence of Sklar s theorem Every joint probability density can be written as f X ( x) = f X1 (x 1 ) f Xm (x m) c X (u 1,..., u m)

Bivariate Examples Normal-gamma and beta-beta marginals

Copula Families Elliptical and Archimedian copulas family Copula Families Two major classes are Elliptical copulas and Archimedian copulas Elliptical copulas Derived from elliptical distributions Normal (Gaussian) copula * Student s t-copula Archimedian copulas Base on so called generator functions C X (u 1,..., u m) = φ 1 (φ(u 1 ) + + φ(u m)) Clayton copula: φ C (u) = u θ C 1 Frank copula: φ F (u) = log( eθ F u 1 e θ F 1 ) Gumbel copula: φ G (u) = ( log u) θ G

Normal Copulas Derivation based on Sklar s theorem Additional transformation U j = F X (X j ) U(0, 1) Z j = F 1 N (0,1) (U j) N (0, 1) Z = (Z 1,..., Z m) T N (0, Σ) Normal copula Sklar s theorem leads to the copula C X (u 1,..., u m) = F N (0,Σ)`F 1 N (0,1) (u 1),..., F 1 N (0,1) (um) and the copula density c X (u 1,..., u m) = u 1 = f N (0,Σ) C X (u 1,..., u m) u m 1 `F N (0,1) (u 1),..., F (um) 1 N (0,1) Q (um) m 1 j=1 f N (0,1)`F N (0,1)

Title Subtitile Z j = F 1 N (0,1) (U j) U j = F Xj (X j ) X j f 1 N (Σ,1) c X f X

Student t-copula A more flexible case of elliptical copulas Student t-copula Based on an extension of the multivariate t-distribution C X (u 1,..., u m) = F t(ν,σ)`f 1 t(ν) (u 1),..., F 1 t(ν) (um), Tail dependence Definition of upper tail dependence (lower analogously) λ up = lim P X 1 > F 1 X u 1 1 (u) X 2 > F 1 X 2 (u)

Different Copulas Dependence structure and upper/lower tail dependence

Estimation The aim is to fit the whole multivariate distribution Full maximum likelihood (ML) Maximize the log-likelihood functions for the full pdf f X consistent estimates for all parameters often only numerically feasible Inference functions for margins (IFM) Two-stage estimation process (1. margins, 2. copula) computationally more efficient than ML predefined estimators for margins and copula no independent parameter estimates Canonical maximum likelihood (CML) Like IFM but use the empirical CDF of each margin instead consistent estimates of the copula parameters practical use questionable (see discussion)

Goodness-of-Fit The probability integral transform (PIT) Dimension reduction Transform X = (X 1,..., X m) T to a set of independent, uniform variables The probability integral transform (PIT) The PIT of X is defined as T 1 (X 1 ) = F X1 (X 1 ) T 2 (X 2 ) = F X2 X 1 (X 2 X 1 ). T m(x m) = F Xm X 1,...,X m 1 (X m X 1,..., X m 1 ) Hypothesis H 0 X comes from the specified multivariate model (F X ) = Z j = T j (X j ) iid U(0, 1)

Goodness-of-Fit The probability integral transform (PIT) Dimension reduction With the multivariate random variable (above) Z j = T j (X j ) iid U(0, 1) it follows that Y = mx j=1 F 1 N (0,1) (Z j ) has a χ 2 -distribution with m degrees of freedom, so that W = F χ 2 m (Y ) should be a univariate random variable from U(0, 1) univariate GoF tests

Applications Precipitation and minimum temperature at stations Berlin and Potsdam Example A: different random variables Example B: different locations

Multivariate Extremes How are copulas related to multivariate extremes? Copulas are related to multivariate extremes (later)

Capabilities What are copulas good for? The copula approach Simple and straight forward method to find parametric descriptions of multivariate non-normally distributed random variables Characteristics Catch different dependence structures Keep proper parametric descriptions of the margins Low number of parameters Mixture models (alternative and enhancement) More degrees-of-freedom Allow only approximative descriptions of the marginal distributions Limited in modelling tail dependence Based on the assumption of different populations mixture models and copulas can be combined

Limitations What are typical pitfalls? Critics: Mikosch, T. (2006), Copulas: Tales and facts, Extremes, 9, 3 20, Limitations Limited to the existing copula families No general procedure for selecting the copula class Pitfalls Questionable practices in finance, risk management, or insurance Theoretical value of the copula must not be exaggerated There is no dependence separately from the marginal distributions Copulas do not solve the problem of dimensionality

Outlook Where to go? Outlook After numerous successful applications in risk management, financial research and more recently in hydrology, it is very likely that copulas will have growing impact in meteorology and climate research

Navigation An Introduction to the Copula Approach 1 Introduction Motivation Background and Related Work 2 The Copula Approach Basics Copula Families Estimation Goodness-of-Fit 3 Applications 4 Multivariate Extremes 5 Discussion Capabilities Limitations Outlook Appendix