Approximating the Integrated Tail Distribution

Similar documents
Large deviations for random walks under subexponentiality: the big-jump domain

The Fundamentals of Heavy Tails Properties, Emergence, & Identification. Jayakrishnan Nair, Adam Wierman, Bert Zwart

Quantile-quantile plots and the method of peaksover-threshold

1 The Glivenko-Cantelli Theorem

Modèles de dépendance entre temps inter-sinistres et montants de sinistre en théorie de la ruine

Efficient rare-event simulation for sums of dependent random varia

Reduced-load equivalence for queues with Gaussian input

Rare Events in Random Walks and Queueing Networks in the Presence of Heavy-Tailed Distributions

On Upper Bounds for the Tail Distribution of Geometric Sums of Subexponential Random Variables

SOME INDICES FOR HEAVY-TAILED DISTRIBUTIONS

IE 303 Discrete-Event Simulation

Chapter 5. Statistical Models in Simulations 5.1. Prof. Dr. Mesut Güneş Ch. 5 Statistical Models in Simulations

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

Operational Risk and Pareto Lévy Copulas

Introduction to Rare Event Simulation

Rare event simulation for the ruin problem with investments via importance sampling and duality

General Glivenko-Cantelli theorems

Randomly Weighted Sums of Conditionnally Dependent Random Variables

Modelling the risk process

Introduction to Algorithmic Trading Strategies Lecture 10

Feuille d exercices n 1 : Poisson processes. 1. Give a necessary and su cient condition for having

Homework # , Spring Due 14 May Convergence of the empirical CDF, uniform samples

ABC methods for phase-type distributions with applications in insurance risk problems

Math 328 Course Notes

Lecture 2. Distributions and Random Variables

Operational Risk and Pareto Lévy Copulas

Lecture Notes 3 Convergence (Chapter 5)

Glossary availability cellular manufacturing closed queueing network coefficient of variation (CV) conditional probability CONWIP

Multivariate Operational Risk: Dependence Modelling with Lévy Copulas

Poisson Processes. Stochastic Processes. Feb UC3M

The Subexponential Product Convolution of Two Weibull-type Distributions

Multivariate Risk Processes with Interacting Intensities

Asymptotic Tail Probabilities of Sums of Dependent Subexponential Random Variables

IEOR 8100: Topics in OR: Asymptotic Methods in Queueing Theory. Fall 2009, Professor Whitt. Class Lecture Notes: Wednesday, September 9.

Estimation de mesures de risques à partir des L p -quantiles

Ruin Probabilities of a Discrete-time Multi-risk Model

arxiv:math/ v2 [math.pr] 9 Oct 2007

Lifetime Dependence Modelling using a Generalized Multivariate Pareto Distribution

Shape of the return probability density function and extreme value statistics

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions

Practical conditions on Markov chains for weak convergence of tail empirical processes

Parameter Estimation

State-dependent Limited Processor Sharing - Approximations and Optimal Control

The Ruin Probability of a Discrete Time Risk Model under Constant Interest Rate with Heavy Tails

Readings: Finish Section 5.2

Fluid Heuristics, Lyapunov Bounds and E cient Importance Sampling for a Heavy-tailed G/G/1 Queue

Quantitative Modeling of Operational Risk: Between g-and-h and EVT

Research Article Extended Precise Large Deviations of Random Sums in the Presence of END Structure and Consistent Variation

Extreme Value Analysis and Spatial Extremes

University Of Calgary Department of Mathematics and Statistics

Extreme L p quantiles as risk measures

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 22 12/09/2013. Skorokhod Mapping Theorem. Reflected Brownian Motion

large number of i.i.d. observations from P. For concreteness, suppose

WEIGHTED SUMS OF SUBEXPONENTIAL RANDOM VARIABLES AND THEIR MAXIMA

University of Tartu DISTRIBUTIONS IN FINANCIAL MATHEMATICS (MTMS )

Heavy Tails: The Origins and Implications for Large Scale Biological & Information Systems

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr.

Root test. Root test Consider the limit L = lim n a n, suppose it exists. L < 1. L > 1 (including L = ) L = 1 the test is inconclusive.

1 Glivenko-Cantelli type theorems

Asymptotics of random sums of heavy-tailed negatively dependent random variables with applications

Multivariate Distributions

Nonparametric estimation of extreme risks from heavy-tailed distributions

BIRTH DEATH PROCESSES AND QUEUEING SYSTEMS

Economics 241B Review of Limit Theorems for Sequences of Random Variables

3 Continuous Random Variables

Asymptotic Statistics-VI. Changliang Zou

Economics 583: Econometric Theory I A Primer on Asymptotics

Quantifying Stochastic Model Errors via Robust Optimization

Multivariate Normal-Laplace Distribution and Processes

Technical Report No. 03/05, August 2005 THE TAIL PROBABILITY OF DISCOUNTED SUMS OF PARETO-LIKE LOSSES IN INSURANCE Marc J. Goovaerts, Rob Kaas, Roger

Stochastic Simulation

VaR vs. Expected Shortfall

E cient Simulation and Conditional Functional Limit Theorems for Ruinous Heavy-tailed Random Walks

Extreme value theory and high quantile convergence

On the inefficiency of state-independent importance sampling in the presence of heavy tails

Contents LIST OF TABLES... LIST OF FIGURES... xvii. LIST OF LISTINGS... xxi PREFACE. ...xxiii

Kernel density estimation for heavy-tailed distributions...

arxiv: v1 [math.pr] 7 Aug 2014

Analysis methods of heavy-tailed data

arxiv: v2 [math.pr] 22 Apr 2014 Dept. of Mathematics & Computer Science Mathematical Institute

Chapter Learning Objectives. Probability Distributions and Probability Density Functions. Continuous Random Variables

Advanced Statistics II: Non Parametric Tests

Modelling Operational Risk Using Bayesian Inference

Computer simulation on homogeneity testing for weighted data sets used in HEP

Research Reports on Mathematical and Computing Sciences

Statistics 581 Revision of Section 4.4: Consistency of Maximum Likelihood Estimates Wellner; 11/30/2001

Normal Probability Plot Probability Probability

Scandinavian Actuarial Journal. Asymptotics for ruin probabilities in a discrete-time risk model with dependent financial and insurance risks

Stat 5101 Notes: Brand Name Distributions

Asymptotic Ruin Probabilities for a Bivariate Lévy-driven Risk Model with Heavy-tailed Claims and Risky Investments

Operational risk modeled analytically II: the consequences of classification invariance. Vivien BRUNEL

Generalized quantiles as risk measures

Stability and Rare Events in Stochastic Models Sergey Foss Heriot-Watt University, Edinburgh and Institute of Mathematics, Novosibirsk

There recently has been great interest in the performance of queues

H 2 : otherwise. that is simply the proportion of the sample points below level x. For any fixed point x the law of large numbers gives that

Characterization through Hazard Rate of heavy tailed distributions and some Convolution Closure Properties

Functional Limit Theorems for A New Class of Non-Stationary Shot Noise Processes

Sampling Distributions and Asymptotics Part II

A polynomial expansion to approximate ruin probabilities

Subexponential Tails of Discounted Aggregate Claims in a Time-Dependent Renewal Risk Model

Transcription:

Approximating the Integrated Tail Distribution Ants Kaasik & Kalev Pärna University of Tartu VIII Tartu Conference on Multivariate Statistics 26th of June, 2007

Motivating example Let W be a steady-state waiting time of a M/G/1 queue process with service time distribution B µ :=mean service time, ν :=mean interarrival time, µ < ν < B I (x) := x 0 B(y)dy/µ, where B = 1 B, is the integrated tail distribution

Motivating example Let W be a steady-state waiting time of a M/G/1 queue process with service time distribution B µ :=mean service time, ν :=mean interarrival time, µ < ν < B I (x) := x 0 B(y)dy/µ, where B = 1 B, is the integrated tail distribution Then W D = N Y i, Y i are IID with distribution B I and independent of N which has a geometric distribution (the parameter of which can be expressed in terms of µ and ν). i=1

The simulation approach When B is known, P(W > u) can be estimated with excellent precision regardless how big u is (this is true regardless of the tail of B).

The simulation approach When B is known, P(W > u) can be estimated with excellent precision regardless how big u is (this is true regardless of the tail of B). Thus we have the following (simulation) approach: data (1) estimate of B (2) estimate of P(W > u)

The simulation approach When B is known, P(W > u) can be estimated with excellent precision regardless how big u is (this is true regardless of the tail of B). Thus we have the following (simulation) approach: data (1) estimate of B (2) estimate of P(W > u) The estimation process (1) can be the cause of a major error! (we do not know even the right class of distributions)

The simulation approach When B is known, P(W > u) can be estimated with excellent precision regardless how big u is (this is true regardless of the tail of B). Thus we have the following (simulation) approach: data (1) estimate of B (2) estimate of P(W > u) The estimation process (1) can be the cause of a major error! (we do not know even the right class of distributions) It would be better to nd P(W > u) directly as B is of no interest

Subexponential distributions Random variable X has a subexponential distribution i X : Ω (0, ) and F 2 (x) F (x) 2, x, where F is the cdf of X. Important members of the subexponential family of distributions are distributions with a regularly varying tail, lognormal distribution and Weibull distribution.

The proposed approach Let B I be subexponential. Then P(W > u) EN B I (u), where a(x) b(x) i lim a(x)/b(x) = 1. x

The proposed approach Let B I be subexponential. Then P(W > u) EN B I (u), where a(x) b(x) i lim a(x)/b(x) = 1. x Idea: µ µ n (LLN) B(y) B n (y) (Glivenko-Cantelli) thus B I (x) = x B(y)dy/µ? x B 0 0 n (y)dy/µ n

Main result Let X n be a sequence of IID positive random variables with a nite mean µ and cumulative distribution function B with B n its empirical counterpart. Also denote the sample mean with µ n = (X 1 +... + X n )/n. Then the following result holds P ( sup x x 0 B x n (y)dy 0 µ n B(y)dy µ n 0 ) = 1.

Simulation study (1) The theorem does not say anything about the rate of convergence. We studied it for the Pareto ( B(x) = (1 + x) α, α > 1) and Weibull ( B(x) = e x β, 0 < β < 1) case with the help of simulations. Let ɛ n be "half-width of the condence interval of B" i.e. P(sup B I n(x) B I (x) > ɛ n ) = 0.05

Simulation study (1) The theorem does not say anything about the rate of convergence. We studied it for the Pareto ( B(x) = (1 + x) α, α > 1) and Weibull ( B(x) = e x β, 0 < β < 1) case with the help of simulations. Let ɛ n be "half-width of the condence interval of B" i.e. P(sup B I n(x) B I (x) > ɛ n ) = 0.05 How big is the quantile ɛ n? (initial width) How does the ratio ɛ 2n /ɛ n behave? (rate of convergence) How big is E(sup B I n(x) B I (x) )? (average supremum error of the estimate)

Simulation study (2) n α=2 α=3 100 0.2564 0.1769 1000 0.1192 0.0636 10000 0.0470 0.0212 100000 0.0181 0.0068 Table: Half-width of the 95%-condence interval for the Pareto case n β=1/3 β=1/2 100 0.3675 0.2135 1000 0.1463 0.0704 10000 0.0481 0.0224 100000 0.0154 0.0071 Table: Half-width of the 95%-condence interval for the Weibull case

Simulation study (3) n α=2 α=3 100 0.7970 (0.7835;0.8094) 0.7459 (0.7342;0.7570) 1000 0.7593 (0.7424;0.7752) 0.7265 (0.7158;0.7373) 10000 0.7541 (0.7382;0.7706) 0.7092 (0.6995;0.7183) 100000 0.7350 (0.7201;0.7476) 0.7131 (0.7039;0.7222) Table: Quantile ratio with 95%-condence intervals for the Pareto case n β=1/3 β=1/2 100 0.7688 (0.7612;0.7774) 0.7259 (0.7168;0.7351) 1000 0.7305 (0.7212;0.7399) 0.7046 (0.6952;0.7132) 10000 0.7100 (0.7017;0.7183) 0.7115 (0.7033;0.7195) 100000 0.7128 (0.7039;0.7215) 0.7009 (0.6922;0.7101) Table: Quantile ratio with 95%-condence intervals for the Weibull case

Simulation study (4) n α=2 α=3 100 0.1319 0.0855 1000 0.0572 0.0306 10000 0.0230 0.0103 100000 0.0087 0.0033 Table: Mean supremum absolute error for the Pareto case n β=1/3 β=1/2 100 0.1885 0.1027 1000 0.0700 0.0342 10000 0.0236 0.0110 100000 0.0076 0.0035 Table: Mean supremum absolute error for the Weibull case

Onwards... The asymptotic equivalence P(W > u) EN B I (u) remains valid for the case of a GI /G/1 queue (we need an additional assumption that also B is subexponential). There is also a dual problem in insurance risk context (the ultimate ruin probability for a company dealing with subexponential claims).

Onwards... The asymptotic equivalence P(W > u) EN B I (u) remains valid for the case of a GI /G/1 queue (we need an additional assumption that also B is subexponential). There is also a dual problem in insurance risk context (the ultimate ruin probability for a company dealing with subexponential claims). Our proposed approach has an obvious shortfall in practice: support of the approximating distribution is a nite interval.

Onwards... The asymptotic equivalence P(W > u) EN B I (u) remains valid for the case of a GI /G/1 queue (we need an additional assumption that also B is subexponential). There is also a dual problem in insurance risk context (the ultimate ruin probability for a company dealing with subexponential claims). Our proposed approach has an obvious shortfall in practice: support of the approximating distribution is a nite interval. The way onwards is clear: sample data must be used to t a generalized Pareto distribution to the tail, so that we have an approximating distribution support of which is (0, ).

Thank you for listening!