Copula modeling for discrete data

Size: px
Start display at page:

Download "Copula modeling for discrete data"

Transcription

1 Copula modeling for discrete data Christian Genest & Johanna G. Nešlehová in collaboration with Bruno Rémillard McGill University and HEC Montréal ROBUST, September 11, 2016

2 Main question Suppose (X 1, Y 1 ),..., (X n, Y n ) are from a discrete distribution H. More specifically, assume X, Y {0, 1,...}. Can we still do copula modeling?

3 Lack of uniqueness of the copula In the continuous case, there is a unique function C : [0, 1] 2 [0, 1] such that H(x, y) = C{F (x), G(y)}, x, y R. In the discrete case, there are several functions A : [0, 1] 2 [0, 1] such that H(x, y) = A{F (x), G(y)}, x, y R. This class of functions is denoted A, but note that not all its members are copulas!

4 Lack of uniqueness of the copula In the continuous case, C is the distribution function of the pair (U, V ) = (F (X ), G(Y )), i.e., C(u, v) = Pr(U u, V v), u, v (0, 1). In the discrete case, D(u, v) = Pr(U u, V v), u, v (0, 1). is a distribution function too, but it is not a copula! As a consolation, D A.

5 Many paradoxical results follow... As soon as F or G are discrete, the set C H of admissible copulas is infinitely large (though its bounds can be identified). Members of C H can embody completely different types of dependence. Measures of association, dependence concepts, and orderings become margin dependent. Genest & Nešlehová (2007), ASTIN Bulletin.

6 Saving the connection between copulas and dependence? 1. H defines a contingency table. 2. Spread the mass uniformly in each cell. 3. Call the resulting copula C C H the bilinear extension copula. Illustration for Bernoulli variates X and Y :

7 Our main discovery C is the best possible candidate if you want to think of the copula associated with a discrete H, because... C is an absolutely continuous copula. X Y C (u, v) = uv. For any concordance measure, κ(h) = κ(c ). If ( X, Ỹ ) is distributed as C, then DEP(X, Y ) DEP( X, Ỹ ). Here, DEP can refer to PQD, LTD, RTI, SI, LRD.

8 Are copula models for discrete data of interest? A copula model for H, viz. H(x, y) = C{F (x), G(y)} with F (F α ), G (G β ) and C (C θ ) is a perfectly valid construction, even if F and G are discrete.

9 Yes, they are! Y Clayton copula with Kendall's tau 0.5 Gumbel copula with Kendall's tau Y Y Gauss copula with Kendall's tau X X X Y t copula with 4 dof and Kendall's tau -0.5 Clayton copula with Kendall's tau Y Y Frank copula and Kendall's tau X X X X P(5) and Y G(0.6) with various copulas and positive (top) and negative (bottom) association.

10 Theoretical back-up H often inherits dependence properties from C because DEP(U, V ) DEP(X, Y ), where (U, V ) C and DEP means PQD, LTD, RTI, SI, LRD. θ can continue to govern association between X and Y, viz. C θ PQD C θ H θ PQD H θ.

11 Can we do inference? Assume (X 1, Y 1 ),..., (X n, Y n ) is an iid sample from with F and G discrete. H θ (x, y) = C θ {F (x), G(y)} How can one fit and validate such a model? The quest for the answer is the subject of our ongoing research.

12 Naïve suggestion Draw 10,000 samples (X 1, Y 1 ),..., (X n, Y n ) of size n = 100 from H θ (x, y) = C θ {F (x), G(y)}, where C θ is a Clayton copula and F, G are discrete distributions. Since τ = θ/(θ + 2), pick ˆτ {τ n, τ a,n, τ b,n } and let ˆθ = 2 ˆτ 1 ˆτ.

13 Illustration: Poisson margins Take θ = 2 and Poisson margins with E(X ) = 1 and E(Y ) = 2. ˆθ based on τ n ˆθ based on τa,n ˆθ based on τb,n

14 What is going on? It can be seen that τ n is an unbiased estimator of τ(h) = τ(c ). BUT: C C θ for most copula families except at independence. In general, τ a,n and τ b,n are biased estimators of τ(c θ ) because X i = F 1 (U i ) and Y i = G 1 (V i ) (F (X i ), G(Y i )) C θ. In short, the discretization of (U i, V i ) is irreversible.

15 Can t we just randomize? 1. Take a sample (X 1, Y 1 ),..., (X n, Y n ) from H whose margins are count distributions. 2. Add an independent noise to each component of the pair (X i, Y i ), viz. X i = X i + U i 1, Ỹ i = Y i + V i 1, where U 1,..., U n and V 1,..., V n are independent samples from the standard uniform distribution on (0, 1). 3. The randomized sample ( X 1, Ỹ 1 ),..., ( X n, Ỹ n ) then stems from a distribution whose margins are continuous.

16 Two additional estimators 4. Compute τ n ( X, Ỹ ), the sample version of Kendall s tau based on the randomized sample ( X 1, Ỹ 1 ),..., ( X n, Ỹ n ). This gives a moment-estimate of θ, viz. ˆθ = g 1 ( τ n ). 5. Alternatively, one can compute the pseudo-likelihood estimate of θ based on the randomized sample. 6. To eliminate the uncertainty induced by randomization, repeat the previous steps N times and compute the average of the values of the estimate of θ.

17 ... that do not work either. Average and st. deviation of six estimates of θ in the Illustration: Estimate of θ based on τ n(u, V ) τ n(x, Y ) τ a,n(x, Y ) τ b,n (X, Y ) τ n( X, Ỹ ) MLE( X, Ỹ ) Av S.d Randomization is bound to fail, because the copula of ( X, Ỹ ) is C. However, remember that C C θ.

18 The empirical bilinear extension copula Compute the empirical cdf H n corresponding to the sample (X 1, Y 1 ),..., (X n, Y n ). Denote its bilinear extension copula by C n. C n is explicit; its density is given by n ij cn (u, v) = n n i n j for all u (F n (i 1), F n (i)], v (G n (j 1), G n (j)], i, j N. Observe that C n is rank-based.

19 Wait a minute... A sample (X 1, Y 1 ),..., (X n, Y n ) defines a contingency table. Pearson s chi-squared statistic for testing independence χ 2 = I J i=1 j=1 (n ij n i n j /n) 2 n i n j /n Spearman s mid-rank coefficient for testing monotone trend { n } ρ n = 12 (R n 3 i R)(S i S) i=1 Kendall s coefficient for testing monotone trend τ n = 2 n 2 {#(concordant pairs) #(discordant pairs)}

20 Surprise! It can be seen that χ 2 = n ρ n = τ n = {c n (u, v) 1} 2 du dv, {C n (u, v) uv}du dv, C n (u, v) dc n (u, v).

21 Here s an idea In the continuous case, many inferential procedures derive from the limiting behavior of the empirical copula process C n = n (C n C). In the discrete case, one can investigate the asymptotic behavior of the empirical Maltese copula process C n = n (C n C ), hoping that it would be as useful as in the continuous case.

22 Known margins in the continuous case When F and G are known and continuous, C can be estimated by the empirical distribution function B n of the sample (F (X i ), G(Y i )), 1 i n. It is well-known that in this case, B n = n (B n C) converges weakly in C[0, 1] 2 to a C-Brownian sheet B C, i.e., to a centered Gaussian process with covariance function cov{b C (u, v), B C (w, z)} = C(u w, v z) C(u, v)c(w, z).

23 Known margins in the discrete case When F and G are known and supported on N, C can be estimated by bilinear interpolation Bn of the empirical distribution function of the sample (F (X i ), G(Y i )), 1 i n. Theorem As n, the process B n = n (Bn C ) converges weakly in C[0, 1] 2 to a centered Gaussian process B C. Here, B C is no longer a C -Brownian sheet, but a bilinear interpolation thereof.

24 Illustration The limiting process B C is illustrated below in the univariate case when F is binomial with p = 0.4 and N = 3, 10 and 100. Displayed are ten realizations of B n when n = process C_n process C_n process C_n u u u

25 Unknown margins in the continuous case Under suitable regularity conditions, the process C n = n (C n C) converges weakly in C[0, 1] 2 to a centered Gaussian process C, C(u, v) = B C (u, v) u C(u, v) B C (u, 1) v C(u, v) B C (1, v).

26 Bad news in the discrete case Suppose that H is a bivariate Bernoulli distribution with F (0) = p, G(0) = q, H(0, 0) = r, where p, q (0, 1) and r = C(p, q) for some copula C. It can be established that the finite-dimensional margins of C n converge in law, although the limit may not be Gaussian. However, the sequence C n probability unless r = pq. is not asymptotically equicontinuous in In other words, C n does not converge in C[0, 1] 2 unless r = pq.

27 What went wrong? To illustrate, consider a similar process in the univariate case. Take F Bernoulli with F (0) (0, 1) and let F n be its empirical counterpart based on a sample of size n from F. Set F (0) = p and F n (0) = p n and consider { { u, u [0, pn ] u, u [0, p] E n (u) = (1 u) 1 p n p n, u [p n, 1], E(u) = (1 u). 1 p p, u [p, 1] Then E n = n (E n E) does not converge in law in C[0, 1] even though its finite-dimensional margins converge.

28 Illustration n= Frequency Frequency u p= p=0.5 Ten realizations of the process E n for the Bernoulli distribution with p = 0.4 and sample size 1000 (left). Histograms of 5, 000 realizations of E n (u) when u = 0.4 (middle) and u = 0.5 (right) based on samples of size n = 10, 000.

29 All s well that ends well! Consider the set O = ( ) ( ) F (k 1), F (k) G(l 1), G(l). (k,l) N 2 Theorem Let K be an arbitrary compact subset of O. Then C n converges weakly on C(K) as n to C given for every u, v O by B C (u, v) u C (u, v) B C (u, 1) v C (u, v) B C (1, v), where B C is the weak limit of B n.

30 Example: Spearman s rho Consider the non-normalized version of Spearman s rho, viz. ρ = ρ(h) = ρ(c ). Its consistent estimator is given by ρ n = 12 n 3 n (R i R)(S i S) = ρ(cn ), i=1 where R i and S i are the componentwise mid-ranks. Consequently, n {ρ n ρ(h)} = C n (u, v) du dv.

31 It works! Because [0, 1] 2 \ O has Lebesgue measure zero, C n (u, v) du dv = 12 C n (u, v) du dv. O Furthermore, O can be approximated arbitrarily closely by compact sets. This lies at the heart of the following result: Theorem As n, n {ρ n ρ(h)} 12 C (u, v) du dv. O

32 References C. Genest & J. Nešlehová (2007). A primer on copulas for count data. The ASTIN Bulletin, 37, C. Genest & J.G. Nešlehová & B. Rémillard (2014). On the empirical multilinear copula process for count data. Bernoulli, 20, C. Genest & J.G. Nešlehová & B. Rémillard (2014). Asymptotics of the empirical multilinear copula process. Submitted.

33 Research funded by

A measure of radial asymmetry for bivariate copulas based on Sobolev norm

A measure of radial asymmetry for bivariate copulas based on Sobolev norm A measure of radial asymmetry for bivariate copulas based on Sobolev norm Ahmad Alikhani-Vafa Ali Dolati Abstract The modified Sobolev norm is used to construct an index for measuring the degree of radial

More information

Accounting for extreme-value dependence in multivariate data

Accounting for extreme-value dependence in multivariate data Accounting for extreme-value dependence in multivariate data 38th ASTIN Colloquium Manchester, July 15, 2008 Outline 1. Dependence modeling through copulas 2. Rank-based inference 3. Extreme-value dependence

More information

Modelling Dependent Credit Risks

Modelling Dependent Credit Risks Modelling Dependent Credit Risks Filip Lindskog, RiskLab, ETH Zürich 30 November 2000 Home page:http://www.math.ethz.ch/ lindskog E-mail:lindskog@math.ethz.ch RiskLab:http://www.risklab.ch Modelling Dependent

More information

Modelling Dependence with Copulas and Applications to Risk Management. Filip Lindskog, RiskLab, ETH Zürich

Modelling Dependence with Copulas and Applications to Risk Management. Filip Lindskog, RiskLab, ETH Zürich Modelling Dependence with Copulas and Applications to Risk Management Filip Lindskog, RiskLab, ETH Zürich 02-07-2000 Home page: http://www.math.ethz.ch/ lindskog E-mail: lindskog@math.ethz.ch RiskLab:

More information

Financial Econometrics and Volatility Models Copulas

Financial Econometrics and Volatility Models Copulas Financial Econometrics and Volatility Models Copulas Eric Zivot Updated: May 10, 2010 Reading MFTS, chapter 19 FMUND, chapters 6 and 7 Introduction Capturing co-movement between financial asset returns

More information

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline. Practitioner Course: Portfolio Optimization September 10, 2008 Before we define dependence, it is useful to define Random variables X and Y are independent iff For all x, y. In particular, F (X,Y ) (x,

More information

Optimal exact tests for complex alternative hypotheses on cross tabulated data

Optimal exact tests for complex alternative hypotheses on cross tabulated data Optimal exact tests for complex alternative hypotheses on cross tabulated data Daniel Yekutieli Statistics and OR Tel Aviv University CDA course 29 July 2017 Yekutieli (TAU) Optimal exact tests for complex

More information

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a

More information

MULTIDIMENSIONAL POVERTY MEASUREMENT: DEPENDENCE BETWEEN WELL-BEING DIMENSIONS USING COPULA FUNCTION

MULTIDIMENSIONAL POVERTY MEASUREMENT: DEPENDENCE BETWEEN WELL-BEING DIMENSIONS USING COPULA FUNCTION Rivista Italiana di Economia Demografia e Statistica Volume LXXII n. 3 Luglio-Settembre 2018 MULTIDIMENSIONAL POVERTY MEASUREMENT: DEPENDENCE BETWEEN WELL-BEING DIMENSIONS USING COPULA FUNCTION Kateryna

More information

Clearly, if F is strictly increasing it has a single quasi-inverse, which equals the (ordinary) inverse function F 1 (or, sometimes, F 1 ).

Clearly, if F is strictly increasing it has a single quasi-inverse, which equals the (ordinary) inverse function F 1 (or, sometimes, F 1 ). APPENDIX A SIMLATION OF COPLAS Copulas have primary and direct applications in the simulation of dependent variables. We now present general procedures to simulate bivariate, as well as multivariate, dependent

More information

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline.

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline. MFM Practitioner Module: Risk & Asset Allocation September 11, 2013 Before we define dependence, it is useful to define Random variables X and Y are independent iff For all x, y. In particular, F (X,Y

More information

Construction and estimation of high dimensional copulas

Construction and estimation of high dimensional copulas Construction and estimation of high dimensional copulas Gildas Mazo PhD work supervised by S. Girard and F. Forbes Mistis, Inria and laboratoire Jean Kuntzmann, Grenoble, France Séminaire Statistiques,

More information

Bivariate Rainfall and Runoff Analysis Using Entropy and Copula Theories

Bivariate Rainfall and Runoff Analysis Using Entropy and Copula Theories Entropy 2012, 14, 1784-1812; doi:10.3390/e14091784 Article OPEN ACCESS entropy ISSN 1099-4300 www.mdpi.com/journal/entropy Bivariate Rainfall and Runoff Analysis Using Entropy and Copula Theories Lan Zhang

More information

Estimation of Copula Models with Discrete Margins (via Bayesian Data Augmentation) Michael S. Smith

Estimation of Copula Models with Discrete Margins (via Bayesian Data Augmentation) Michael S. Smith Estimation of Copula Models with Discrete Margins (via Bayesian Data Augmentation) Michael S. Smith Melbourne Business School, University of Melbourne (Joint with Mohamad Khaled, University of Queensland)

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

Gaussian Process Vine Copulas for Multivariate Dependence

Gaussian Process Vine Copulas for Multivariate Dependence Gaussian Process Vine Copulas for Multivariate Dependence José Miguel Hernández-Lobato 1,2 joint work with David López-Paz 2,3 and Zoubin Ghahramani 1 1 Department of Engineering, Cambridge University,

More information

Approximation of multivariate distribution functions MARGUS PIHLAK. June Tartu University. Institute of Mathematical Statistics

Approximation of multivariate distribution functions MARGUS PIHLAK. June Tartu University. Institute of Mathematical Statistics Approximation of multivariate distribution functions MARGUS PIHLAK June 29. 2007 Tartu University Institute of Mathematical Statistics Formulation of the problem Let Y be a random variable with unknown

More information

Unit 14: Nonparametric Statistical Methods

Unit 14: Nonparametric Statistical Methods Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based

More information

Modelling and Estimation of Stochastic Dependence

Modelling and Estimation of Stochastic Dependence Modelling and Estimation of Stochastic Dependence Uwe Schmock Based on joint work with Dr. Barbara Dengler Financial and Actuarial Mathematics and Christian Doppler Laboratory for Portfolio Risk Management

More information

Multivariate Distribution Models

Multivariate Distribution Models Multivariate Distribution Models Model Description While the probability distribution for an individual random variable is called marginal, the probability distribution for multiple random variables is

More information

Copulas and dependence measurement

Copulas and dependence measurement Copulas and dependence measurement Thorsten Schmidt. Chemnitz University of Technology, Mathematical Institute, Reichenhainer Str. 41, Chemnitz. thorsten.schmidt@mathematik.tu-chemnitz.de Keywords: copulas,

More information

Bivariate Paired Numerical Data

Bivariate Paired Numerical Data Bivariate Paired Numerical Data Pearson s correlation, Spearman s ρ and Kendall s τ, tests of independence University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html

More information

On the empirical multilinear copula process for count data

On the empirical multilinear copula process for count data Bernoulli 20(3), 2014, 1344 1371 DOI: 10.3150/13-BEJ524 arxiv:1407.1200v1 [math.st] 4 Jul 2014 On the empirical multilinear copula process for count data CHRISTIAN GENEST 1,*,**, JOHANNA G. NEŠLEHOVÁ1,,

More information

Copula-based tests of independence for bivariate discrete data. Orla Aislinn Murphy

Copula-based tests of independence for bivariate discrete data. Orla Aislinn Murphy Copula-based tests of independence for bivariate discrete data By Orla Aislinn Murphy A thesis submitted to McGill University in partial fulfillment of the requirements for the degree of Masters of Science

More information

Copulas. Mathematisches Seminar (Prof. Dr. D. Filipovic) Di Uhr in E

Copulas. Mathematisches Seminar (Prof. Dr. D. Filipovic) Di Uhr in E Copulas Mathematisches Seminar (Prof. Dr. D. Filipovic) Di. 14-16 Uhr in E41 A Short Introduction 1 0.8 0.6 0.4 0.2 0 0 0.2 0.4 0.6 0.8 1 The above picture shows a scatterplot (500 points) from a pair

More information

Semi-parametric predictive inference for bivariate data using copulas

Semi-parametric predictive inference for bivariate data using copulas Semi-parametric predictive inference for bivariate data using copulas Tahani Coolen-Maturi a, Frank P.A. Coolen b,, Noryanti Muhammad b a Durham University Business School, Durham University, Durham, DH1

More information

Identification of marginal and joint CDFs using Bayesian method for RBDO

Identification of marginal and joint CDFs using Bayesian method for RBDO Struct Multidisc Optim (2010) 40:35 51 DOI 10.1007/s00158-009-0385-1 RESEARCH PAPER Identification of marginal and joint CDFs using Bayesian method for RBDO Yoojeong Noh K. K. Choi Ikjin Lee Received:

More information

Application of Copulas as a New Geostatistical Tool

Application of Copulas as a New Geostatistical Tool Application of Copulas as a New Geostatistical Tool Presented by Jing Li Supervisors András Bardossy, Sjoerd Van der zee, Insa Neuweiler Universität Stuttgart Institut für Wasserbau Lehrstuhl für Hydrologie

More information

Forecasting time series with multivariate copulas

Forecasting time series with multivariate copulas Depend. Model. 2015; 3:59 82 Research Article Open Access Clarence Simard* and Bruno Rémillard Forecasting time series with multivariate copulas DOI 10.1515/demo-2015-0005 5 Received august 17, 2014; accepted

More information

Non parametric estimation of Archimedean copulas and tail dependence. Paris, february 19, 2015.

Non parametric estimation of Archimedean copulas and tail dependence. Paris, february 19, 2015. Non parametric estimation of Archimedean copulas and tail dependence Elena Di Bernardino a and Didier Rullière b Paris, february 19, 2015. a CNAM, Paris, Département IMATH, b ISFA, Université Lyon 1, Laboratoire

More information

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Statistica Sinica 20 (2010), 441-453 GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Antai Wang Georgetown University Medical Center Abstract: In this paper, we propose two tests for parametric models

More information

A copula goodness-of-t approach. conditional probability integral transform. Daniel Berg 1 Henrik Bakken 2

A copula goodness-of-t approach. conditional probability integral transform. Daniel Berg 1 Henrik Bakken 2 based on the conditional probability integral transform Daniel Berg 1 Henrik Bakken 2 1 Norwegian Computing Center (NR) & University of Oslo (UiO) 2 Norwegian University of Science and Technology (NTNU)

More information

On consistency of Kendall s tau under censoring

On consistency of Kendall s tau under censoring Biometria (28), 95, 4,pp. 997 11 C 28 Biometria Trust Printed in Great Britain doi: 1.193/biomet/asn37 Advance Access publication 17 September 28 On consistency of Kendall s tau under censoring BY DAVID

More information

Trivariate copulas for characterisation of droughts

Trivariate copulas for characterisation of droughts ANZIAM J. 49 (EMAC2007) pp.c306 C323, 2008 C306 Trivariate copulas for characterisation of droughts G. Wong 1 M. F. Lambert 2 A. V. Metcalfe 3 (Received 3 August 2007; revised 4 January 2008) Abstract

More information

Contents 1. Coping with Copulas. Thorsten Schmidt 1. Department of Mathematics, University of Leipzig Dec 2006

Contents 1. Coping with Copulas. Thorsten Schmidt 1. Department of Mathematics, University of Leipzig Dec 2006 Contents 1 Coping with Copulas Thorsten Schmidt 1 Department of Mathematics, University of Leipzig Dec 2006 Forthcoming in Risk Books Copulas - From Theory to Applications in Finance Contents 1 Introdcution

More information

Lecture 2: Repetition of probability theory and statistics

Lecture 2: Repetition of probability theory and statistics Algorithms for Uncertainty Quantification SS8, IN2345 Tobias Neckel Scientific Computing in Computer Science TUM Lecture 2: Repetition of probability theory and statistics Concept of Building Block: Prerequisites:

More information

Properties of Hierarchical Archimedean Copulas

Properties of Hierarchical Archimedean Copulas SFB 649 Discussion Paper 9-4 Properties of Hierarchical Archimedean Copulas Ostap Okhrin* Yarema Okhrin** Wolfgang Schmid*** *Humboldt-Universität zu Berlin, Germany **Universität Bern, Switzerland ***Universität

More information

Copulas and Measures of Dependence

Copulas and Measures of Dependence 1 Copulas and Measures of Dependence Uttara Naik-Nimbalkar December 28, 2014 Measures for determining the relationship between two variables: the Pearson s correlation coefficient, Kendalls tau and Spearmans

More information

Math 494: Mathematical Statistics

Math 494: Mathematical Statistics Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/

More information

A PRIMER ON COPULAS FOR COUNT DATA ABSTRACT KEYWORDS

A PRIMER ON COPULAS FOR COUNT DATA ABSTRACT KEYWORDS A PRIMER ON COPULAS FOR COUNT DATA BY CHRISTIAN GENEST AND JOHANNA NESLEHOVA ABSTRACT The authors review various facts about copulas linking discrete distributions. They show how the possibility of ties

More information

Probability Distributions and Estimation of Ali-Mikhail-Haq Copula

Probability Distributions and Estimation of Ali-Mikhail-Haq Copula Applied Mathematical Sciences, Vol. 4, 2010, no. 14, 657-666 Probability Distributions and Estimation of Ali-Mikhail-Haq Copula Pranesh Kumar Mathematics Department University of Northern British Columbia

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION CHAPTER 1 INTRODUCTION 1.0 Discrete distributions in statistical analysis Discrete models play an extremely important role in probability theory and statistics for modeling count data. The use of discrete

More information

Simulating Realistic Ecological Count Data

Simulating Realistic Ecological Count Data 1 / 76 Simulating Realistic Ecological Count Data Lisa Madsen Dave Birkes Oregon State University Statistics Department Seminar May 2, 2011 2 / 76 Outline 1 Motivation Example: Weed Counts 2 Pearson Correlation

More information

A note about the conjecture about Spearman s rho and Kendall s tau

A note about the conjecture about Spearman s rho and Kendall s tau A note about the conjecture about Spearman s rho and Kendall s tau V. Durrleman Operations Research and Financial Engineering, Princeton University, USA A. Nikeghbali University Paris VI, France T. Roncalli

More information

First steps of multivariate data analysis

First steps of multivariate data analysis First steps of multivariate data analysis November 28, 2016 Let s Have Some Coffee We reproduce the coffee example from Carmona, page 60 ff. This vignette is the first excursion away from univariate data.

More information

Semiparametric Gaussian Copula Models: Progress and Problems

Semiparametric Gaussian Copula Models: Progress and Problems Semiparametric Gaussian Copula Models: Progress and Problems Jon A. Wellner University of Washington, Seattle 2015 IMS China, Kunming July 1-4, 2015 2015 IMS China Meeting, Kunming Based on joint work

More information

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA JAPANESE BEETLE DATA 6 MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA Gauge Plots TuscaroraLisa Central Madsen Fairways, 996 January 9, 7 Grubs Adult Activity Grub Counts 6 8 Organic Matter

More information

Semiparametric Gaussian Copula Models: Progress and Problems

Semiparametric Gaussian Copula Models: Progress and Problems Semiparametric Gaussian Copula Models: Progress and Problems Jon A. Wellner University of Washington, Seattle European Meeting of Statisticians, Amsterdam July 6-10, 2015 EMS Meeting, Amsterdam Based on

More information

On a simple construction of bivariate probability functions with fixed marginals 1

On a simple construction of bivariate probability functions with fixed marginals 1 On a simple construction of bivariate probability functions with fixed marginals 1 Djilali AIT AOUDIA a, Éric MARCHANDb,2 a Université du Québec à Montréal, Département de mathématiques, 201, Ave Président-Kennedy

More information

Application of copula-based BINAR models in loan modelling

Application of copula-based BINAR models in loan modelling Application of copula-based BINAR models in loan modelling Andrius Buteikis 1, Remigijus Leipus 1,2 1 Faculty of Mathematics and Informatics, Vilnius University, Naugarduko 24, Vilnius LT-03225, Lithuania

More information

Copula-based Logistic Regression Models for Bivariate Binary Responses

Copula-based Logistic Regression Models for Bivariate Binary Responses Journal of Data Science 12(2014), 461-476 Copula-based Logistic Regression Models for Bivariate Binary Responses Xiaohu Li 1, Linxiong Li 2, Rui Fang 3 1 University of New Orleans, Xiamen University 2

More information

8 Copulas. 8.1 Introduction

8 Copulas. 8.1 Introduction 8 Copulas 8.1 Introduction Copulas are a popular method for modeling multivariate distributions. A copula models the dependence and only the dependence between the variates in a multivariate distribution

More information

Binary response data

Binary response data Binary response data A Bernoulli trial is a random variable that has two points in its sample space. The two points may be denoted success/failure, heads/tails, yes/no, 0/1, etc. The probability distribution

More information

Approximate Bayesian inference in semiparametric copula models

Approximate Bayesian inference in semiparametric copula models Approximate Bayesian inference in semiparametric copula models Clara Grazian 1 and Brunero Liseo 2 1 Nuffield Department of Medicine, University of Oxford arxiv:1503.02912v4 [stat.me] 16 Jul 2017 2 MEMOTEF,

More information

On Rank Correlation Measures for Non-Continuous Random Variables

On Rank Correlation Measures for Non-Continuous Random Variables On Rank Correlation Measures for Non-Continuous Random Variables Johanna Nešlehová RiskLab, Department of Mathematics, ETH Zürich, 809 Zürich, Switzerland Abstract For continuous random variables, many

More information

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data Outline Frailty modelling of Multivariate Survival Data Thomas Scheike ts@biostat.ku.dk Department of Biostatistics University of Copenhagen Marginal versus Frailty models. Two-stage frailty models: copula

More information

The Influence Function of the Correlation Indexes in a Two-by-Two Table *

The Influence Function of the Correlation Indexes in a Two-by-Two Table * Applied Mathematics 014 5 3411-340 Published Online December 014 in SciRes http://wwwscirporg/journal/am http://dxdoiorg/10436/am01451318 The Influence Function of the Correlation Indexes in a Two-by-Two

More information

Learning Objectives for Stat 225

Learning Objectives for Stat 225 Learning Objectives for Stat 225 08/20/12 Introduction to Probability: Get some general ideas about probability, and learn how to use sample space to compute the probability of a specific event. Set Theory:

More information

Hybrid Copula Bayesian Networks

Hybrid Copula Bayesian Networks Kiran Karra kiran.karra@vt.edu Hume Center Electrical and Computer Engineering Virginia Polytechnic Institute and State University September 7, 2016 Outline Introduction Prior Work Introduction to Copulas

More information

Correlation: Copulas and Conditioning

Correlation: Copulas and Conditioning Correlation: Copulas and Conditioning This note reviews two methods of simulating correlated variates: copula methods and conditional distributions, and the relationships between them. Particular emphasis

More information

Robustness of a semiparametric estimator of a copula

Robustness of a semiparametric estimator of a copula Robustness of a semiparametric estimator of a copula Gunky Kim a, Mervyn J. Silvapulle b and Paramsothy Silvapulle c a Department of Econometrics and Business Statistics, Monash University, c Caulfield

More information

EXPECTED VALUE of a RV. corresponds to the average value one would get for the RV when repeating the experiment, =0.

EXPECTED VALUE of a RV. corresponds to the average value one would get for the RV when repeating the experiment, =0. EXPECTED VALUE of a RV corresponds to the average value one would get for the RV when repeating the experiment, independently, infinitely many times. Sample (RIS) of n values of X (e.g. More accurately,

More information

Package HMMcopula. December 1, 2018

Package HMMcopula. December 1, 2018 Type Package Package HMMcopula December 1, 2018 Title Markov Regime Switching Copula Models Estimation and Goodness of Fit Version 1.0.3 Author Mamadou Yamar Thioub , Bouchra

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur Lecture No. # 29 Introduction to Copulas Hello and welcome to this

More information

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) 1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For

More information

1 Exercises for lecture 1

1 Exercises for lecture 1 1 Exercises for lecture 1 Exercise 1 a) Show that if F is symmetric with respect to µ, and E( X )

More information

STATISTICS SYLLABUS UNIT I

STATISTICS SYLLABUS UNIT I STATISTICS SYLLABUS UNIT I (Probability Theory) Definition Classical and axiomatic approaches.laws of total and compound probability, conditional probability, Bayes Theorem. Random variable and its distribution

More information

Probability and Distributions

Probability and Distributions Probability and Distributions What is a statistical model? A statistical model is a set of assumptions by which the hypothetical population distribution of data is inferred. It is typically postulated

More information

Understand the difference between symmetric and asymmetric measures

Understand the difference between symmetric and asymmetric measures Chapter 9 Measures of Strength of a Relationship Learning Objectives Understand the strength of association between two variables Explain an association from a table of joint frequencies Understand a proportional

More information

Chapter 5 continued. Chapter 5 sections

Chapter 5 continued. Chapter 5 sections Chapter 5 sections Discrete univariate distributions: 5.2 Bernoulli and Binomial distributions Just skim 5.3 Hypergeometric distributions 5.4 Poisson distributions Just skim 5.5 Negative Binomial distributions

More information

ECON 4160, Autumn term Lecture 1

ECON 4160, Autumn term Lecture 1 ECON 4160, Autumn term 2017. Lecture 1 a) Maximum Likelihood based inference. b) The bivariate normal model Ragnar Nymoen University of Oslo 24 August 2017 1 / 54 Principles of inference I Ordinary least

More information

Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business

Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business Michal Pešta Charles University in Prague Faculty of Mathematics and Physics Ostap Okhrin Dresden University of Technology

More information

Linear Models and Estimation by Least Squares

Linear Models and Estimation by Least Squares Linear Models and Estimation by Least Squares Jin-Lung Lin 1 Introduction Causal relation investigation lies in the heart of economics. Effect (Dependent variable) cause (Independent variable) Example:

More information

STAT 512 sp 2018 Summary Sheet

STAT 512 sp 2018 Summary Sheet STAT 5 sp 08 Summary Sheet Karl B. Gregory Spring 08. Transformations of a random variable Let X be a rv with support X and let g be a function mapping X to Y with inverse mapping g (A = {x X : g(x A}

More information

Chapter 3: Maximum Likelihood Theory

Chapter 3: Maximum Likelihood Theory Chapter 3: Maximum Likelihood Theory Florian Pelgrin HEC September-December, 2010 Florian Pelgrin (HEC) Maximum Likelihood Theory September-December, 2010 1 / 40 1 Introduction Example 2 Maximum likelihood

More information

Frequentist-Bayesian Model Comparisons: A Simple Example

Frequentist-Bayesian Model Comparisons: A Simple Example Frequentist-Bayesian Model Comparisons: A Simple Example Consider data that consist of a signal y with additive noise: Data vector (N elements): D = y + n The additive noise n has zero mean and diagonal

More information

arxiv: v5 [stat.ml] 6 Oct 2018

arxiv: v5 [stat.ml] 6 Oct 2018 Copula Index for Detecting Dependence and Monotonicity between Stochastic Signals Kiran Karra a,, Lamine Mili a a Virginia Tech Department of Electrical and Computer Engineering 900 N. Glebe Rd., Arlington,

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Contents 1. Contents

Contents 1. Contents Contents 1 Contents 6 Distributions of Functions of Random Variables 2 6.1 Transformation of Discrete r.v.s............. 3 6.2 Method of Distribution Functions............. 6 6.3 Method of Transformations................

More information

A Goodness-of-fit Test for Semi-parametric Copula Models of Right-Censored Bivariate Survival Times

A Goodness-of-fit Test for Semi-parametric Copula Models of Right-Censored Bivariate Survival Times A Goodness-of-fit Test for Semi-parametric Copula Models of Right-Censored Bivariate Survival Times by Moyan Mei B.Sc. (Honors), Dalhousie University, 2014 Project Submitted in Partial Fulfillment of the

More information

Chapter 2: Fundamentals of Statistics Lecture 15: Models and statistics

Chapter 2: Fundamentals of Statistics Lecture 15: Models and statistics Chapter 2: Fundamentals of Statistics Lecture 15: Models and statistics Data from one or a series of random experiments are collected. Planning experiments and collecting data (not discussed here). Analysis:

More information

Sections 2.3, 2.4. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis 1 / 21

Sections 2.3, 2.4. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis 1 / 21 Sections 2.3, 2.4 Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 21 2.3 Partial association in stratified 2 2 tables In describing a relationship

More information

Chapter 13 Correlation

Chapter 13 Correlation Chapter Correlation Page. Pearson correlation coefficient -. Inferential tests on correlation coefficients -9. Correlational assumptions -. on-parametric measures of correlation -5 5. correlational example

More information

Vine Copulas. Spatial Copula Workshop 2014 September 22, Institute for Geoinformatics University of Münster.

Vine Copulas. Spatial Copula Workshop 2014 September 22, Institute for Geoinformatics University of Münster. Spatial Workshop 2014 September 22, 2014 Institute for Geoinformatics University of Münster http://ifgi.uni-muenster.de/graeler 1 spatio-temporal data Typically, spatio-temporal data is given at a set

More information

MAS223 Statistical Inference and Modelling Exercises

MAS223 Statistical Inference and Modelling Exercises MAS223 Statistical Inference and Modelling Exercises The exercises are grouped into sections, corresponding to chapters of the lecture notes Within each section exercises are divided into warm-up questions,

More information

Local sensitivity analyses of goodness-of-fit tests for copulas

Local sensitivity analyses of goodness-of-fit tests for copulas Local sensitivity analyses of goodness-of-fit tests for copulas DANIEL BERG University of Oslo and The Norwegian Computing Center JEAN-FRANÇOIS QUESSY Département de mathématiques et d informatique, Université

More information

Algorithms for Uncertainty Quantification

Algorithms for Uncertainty Quantification Algorithms for Uncertainty Quantification Tobias Neckel, Ionuț-Gabriel Farcaș Lehrstuhl Informatik V Summer Semester 2017 Lecture 2: Repetition of probability theory and statistics Example: coin flip Example

More information

Machine learning - HT Maximum Likelihood

Machine learning - HT Maximum Likelihood Machine learning - HT 2016 3. Maximum Likelihood Varun Kanade University of Oxford January 27, 2016 Outline Probabilistic Framework Formulate linear regression in the language of probability Introduce

More information

Vine copulas with asymmetric tail dependence and applications to financial return data 1. Abstract

Vine copulas with asymmetric tail dependence and applications to financial return data 1. Abstract *Manuscript Vine copulas with asymmetric tail dependence and applications to financial return data 1 Aristidis K. Nikoloulopoulos 2, Harry Joe 3 and Haijun Li 4 Abstract In Aas et al. (2009) and Aas and

More information

Counts using Jitters joint work with Peng Shi, Northern Illinois University

Counts using Jitters joint work with Peng Shi, Northern Illinois University of Claim Longitudinal of Claim joint work with Peng Shi, Northern Illinois University UConn Actuarial Science Seminar 2 December 2011 Department of Mathematics University of Connecticut Storrs, Connecticut,

More information

Marginal Specifications and a Gaussian Copula Estimation

Marginal Specifications and a Gaussian Copula Estimation Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required

More information

Lecture 25: Review. Statistics 104. April 23, Colin Rundel

Lecture 25: Review. Statistics 104. April 23, Colin Rundel Lecture 25: Review Statistics 104 Colin Rundel April 23, 2012 Joint CDF F (x, y) = P [X x, Y y] = P [(X, Y ) lies south-west of the point (x, y)] Y (x,y) X Statistics 104 (Colin Rundel) Lecture 25 April

More information

Testing Statistical Hypotheses

Testing Statistical Hypotheses E.L. Lehmann Joseph P. Romano Testing Statistical Hypotheses Third Edition 4y Springer Preface vii I Small-Sample Theory 1 1 The General Decision Problem 3 1.1 Statistical Inference and Statistical Decisions

More information

Chapter 5. Chapter 5 sections

Chapter 5. Chapter 5 sections 1 / 43 sections Discrete univariate distributions: 5.2 Bernoulli and Binomial distributions Just skim 5.3 Hypergeometric distributions 5.4 Poisson distributions Just skim 5.5 Negative Binomial distributions

More information

Risk Aggregation with Dependence Uncertainty

Risk Aggregation with Dependence Uncertainty Introduction Extreme Scenarios Asymptotic Behavior Challenges Risk Aggregation with Dependence Uncertainty Department of Statistics and Actuarial Science University of Waterloo, Canada Seminar at ETH Zurich

More information

Songklanakarin Journal of Science and Technology SJST R1 Sukparungsee

Songklanakarin Journal of Science and Technology SJST R1 Sukparungsee Songklanakarin Journal of Science and Technology SJST-0-0.R Sukparungsee Bivariate copulas on the exponentially weighted moving average control chart Journal: Songklanakarin Journal of Science and Technology

More information

STA 2201/442 Assignment 2

STA 2201/442 Assignment 2 STA 2201/442 Assignment 2 1. This is about how to simulate from a continuous univariate distribution. Let the random variable X have a continuous distribution with density f X (x) and cumulative distribution

More information

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems Review of Basic Probability The fundamentals, random variables, probability distributions Probability mass/density functions

More information

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes TTU, October 26, 2012 p. 1/3 Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes Hao Zhang Department of Statistics Department of Forestry and Natural Resources Purdue University

More information

Lehrstuhl für Statistik und Ökonometrie. Diskussionspapier 87 / Some critical remarks on Zhang s gamma test for independence

Lehrstuhl für Statistik und Ökonometrie. Diskussionspapier 87 / Some critical remarks on Zhang s gamma test for independence Lehrstuhl für Statistik und Ökonometrie Diskussionspapier 87 / 2011 Some critical remarks on Zhang s gamma test for independence Ingo Klein Fabian Tinkl Lange Gasse 20 D-90403 Nürnberg Some critical remarks

More information