On the Importance of Dispersion Modeling for Claims Reserving: Application of the Double GLM Theory

Similar documents
Claims Reserving under Solvency II

Prediction Uncertainty in the Bornhuetter-Ferguson Claims Reserving Method: Revisited

DELTA METHOD and RESERVING

Chain ladder with random effects

CHAIN LADDER FORECAST EFFICIENCY

Peng Shi. Actuarial Research Conference August 5-8, 2015

Double generalized linear compound poisson models to insurance claims data

A Loss Reserving Method for Incomplete Claim Data

Analytics Software. Beyond deterministic chain ladder reserves. Neil Covington Director of Solutions Management GI

STA216: Generalized Linear Models. Lecture 1. Review and Introduction

Stochastic Incremental Approach for Modelling the Claims Reserves

GREG TAYLOR ABSTRACT

Solutions to the Spring 2015 CAS Exam ST

A HOMOTOPY CLASS OF SEMI-RECURSIVE CHAIN LADDER MODELS

Bootstrapping the triangles

1/15. Over or under dispersion Problem

arxiv: v1 [stat.ap] 17 Jun 2013

UNIVERSITY OF TORONTO Faculty of Arts and Science

Introduction to General and Generalized Linear Models

Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business

Generalized Linear Models (GLZ)

Ratemaking with a Copula-Based Multivariate Tweedie Model

Non-Life Insurance: Mathematics and Statistics

GARY G. VENTER 1. CHAIN LADDER VARIANCE FORMULA

A few basics of credibility theory

Some explanations about the IWLS algorithm to fit generalized linear models

Generalized linear mixed models (GLMMs) for dependent compound risk models

Calendar Year Dependence Modeling in Run-Off Triangles

Estimating prediction error in mixed models

Credibility Theory for Generalized Linear and Mixed Models

Generalized Linear Models. Last time: Background & motivation for moving beyond linear

Generalized linear mixed models for dependent compound risk models

Spring 2012 Math 541A Exam 1. X i, S 2 = 1 n. n 1. X i I(X i < c), T n =

Chapter 5: Generalized Linear Models

Solutions to the Spring 2018 CAS Exam MAS-1

Modeling Overdispersion

Introduction to Rare Event Simulation

Generalized Linear Models. Kurt Hornik

Problem Selected Scores

Synchronous bootstrapping of loss reserves

Stat 710: Mathematical Statistics Lecture 12

Likelihood-Based Methods

Linear Regression Models P8111

STA 216: GENERALIZED LINEAR MODELS. Lecture 1. Review and Introduction. Much of statistics is based on the assumption that random

First Year Examination Department of Statistics, University of Florida

Stochastic Loss Reserving with the Collective Risk Model

Multivariate negative binomial models for insurance claim counts

Institute of Actuaries of India

Lecture 8. Poisson models for counts

36-463/663: Multilevel & Hierarchical Models

Individual reserving a survey. R in Insurance, Paris 08-June-2017 Alexandre Boumezoued, joint work with Laurent Devineau

Mathematical statistics

Neural networks (not in book)

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

Mathematical statistics

arxiv: v1 [stat.me] 25 Jan 2015 Table of Contents

Reserving for multiple excess layers

Now consider the case where E(Y) = µ = Xβ and V (Y) = σ 2 G, where G is diagonal, but unknown.

DEPARTMENT OF ACTUARIAL STUDIES RESEARCH PAPER SERIES

Solutions to the Spring 2016 CAS Exam S

STAT5044: Regression and Anova

Applying the proportional hazard premium calculation principle

Multilevel Models in Matrix Form. Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2

Generalized Linear Models I

Information geometry for bivariate distribution control

Generalized Linear Models

Standard Error of Technical Cost Incorporating Parameter Uncertainty

On the Distribution of Discounted Loss Reserves Using Generalized Linear Models

New Statistical Methods That Improve on MLE and GLM Including for Reserve Modeling GARY G VENTER

Parameter Estimation and Statistical Test in Modeling Geographically Weighted Poisson Inverse Gaussian Regression

2.1 Linear regression with matrices

Two Hours. Mathematical formula books and statistical tables are to be provided THE UNIVERSITY OF MANCHESTER. 26 May :00 16:00

Poisson regression: Further topics

Generalized Linear Models Introduction

Tail Conditional Expectations for. Extended Exponential Dispersion Models

STA 2201/442 Assignment 2

Reparametrization of COM-Poisson Regression Models with Applications in the Analysis of Experimental Count Data

GB2 Regression with Insurance Claim Severities

Lecture 9 STK3100/4100

Ph.D. Qualifying Exam Monday Tuesday, January 4 5, 2016

Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014

Lecture 34: Properties of the LSE

ABSTRACT 1. INTRODUCTION

SPH 247 Statistical Analysis of Laboratory Data. April 28, 2015 SPH 247 Statistics for Laboratory Data 1

if n is large, Z i are weakly dependent 0-1-variables, p i = P(Z i = 1) small, and Then n approx i=1 i=1 n i=1

Generalized Linear Models 1

Genetic Networks. Korbinian Strimmer. Seminar: Statistical Analysis of RNA-Seq Data 19 June IMISE, Universität Leipzig

Solutions to the Fall 2018 CAS Exam MAS-1

Individual loss reserving with the Multivariate Skew Normal framework

On the existence of maximum likelihood estimators in a Poissongamma HGLM and a negative binomial regression model

The finite-time Gerber-Shiu penalty function for two classes of risk processes

Standard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j

Normal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification,

Generalized linear mixed models (GLMMs) for dependent compound risk models

Poisson Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

High-Throughput Sequencing Course

Computational methods for mixed models

Triangles in Life and Casualty

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

CSE 312 Final Review: Section AA

Transcription:

On the Importance of Dispersion Modeling for Claims Reserving: Application of the Double GLM Theory Danaïl Davidov under the supervision of Jean-Philippe Boucher Département de mathématiques Université du Québec à Montréal August 2009 D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 1 / 37

Table of Contents 1 Introduction 2 Tweedie GLM Model Definition Log-likelihood Overdispersion 3 Variance Modeling Variance in regressions Dispersion Models Double GLMs 4 Swiss Motor Industry Example Analysis of results 5 Conclusion D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 2 / 37

Introduction Definition A loss reserve is a provision for an insurer s liability for claims Notes D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 3 / 37

Introduction Definition Notes A loss reserve is a provision for an insurer s liability for claims A stochastic model uses random variables in a regression framework D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 3 / 37

Introduction Definition Notes A loss reserve is a provision for an insurer s liability for claims A stochastic model uses random variables in a regression framework Claims reserves models presented here use GLM theory as introduced in England, Verrall 2002 D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 3 / 37

Model Definition Notations C i,j Incremental payments D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 4 / 37

Model Definition Notations C i,j Incremental payments w i,j Exposure D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 4 / 37

Model Definition Notations C i,j Incremental payments w i,j Exposure r i,j Incremental number of payments D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 4 / 37

Model Definition Notations C i,j Incremental payments w i,j Exposure r i,j Incremental number of payments Y i,j Normalized incremental payments D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 4 / 37

Model Definition Notations C i,j Incremental payments w i,j Exposure r i,j Incremental number of payments Y i,j Normalized incremental payments Y i,j = C i,j w i,j D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 4 / 37

Model Definition Hypotheses C i,j is a compound Poisson-Gamma distribution D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 5 / 37

Model Definition Hypotheses C i,j is a compound Poisson-Gamma distribution Frequency Poisson with mean ϑ i,j w i,j and variance ϑ i,j w i,j D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 5 / 37

Model Definition Hypotheses C i,j is a compound Poisson-Gamma distribution Frequency Poisson with mean ϑ i,j w i,j and variance ϑ i,j w i,j Severity Gamma with mean τ i,j and variance ντ 2 i,j D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 5 / 37

Model Definition Hypotheses C i,j is a compound Poisson-Gamma distribution Frequency Poisson with mean ϑ i,j w i,j and variance ϑ i,j w i,j Severity Gamma with mean τ i,j and variance ντ 2 i,j Using the following parametrisation p = ν + 2 ν + 1 µ = ϑτ φ = ϑ1 p τ 2 p (2 p), p (1, 2) D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 5 / 37

Model Definition Tweedie Model Y i,j Tweedie(µ i,j, p, φ, w i,j ) D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 6 / 37

Model Definition Tweedie Model Y i,j Tweedie(µ i,j, p, φ, w i,j ) µ i,j = e Xβ D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 6 / 37

Model Definition Tweedie Model Y i,j Tweedie(µ i,j, p, φ, w i,j ) µ i,j = e Xβ E[Y i,j ] = µ i,j, Var[Y i,j ] = φ w i,j µ p i,j D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 6 / 37

Log-likelihood function Tweedie Model l = i,j r i,j log ( ) (wi,j /φ) ν+1 yi,j ν (p 1) ν (2 p) log ( r i,j!γ ( r i,j ν ) y i,j ) + w i,j φ µ (y 1 p i,j i,j 1 p µ 2 p ) i,j 2 p D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 7 / 37

Parameters FIGURE: Parameter main influence D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 8 / 37

Parameter p Tweedie Model Can be estimated only when the number of payments is known D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 9 / 37

Parameter p Tweedie Model Can be estimated only when the number of payments is known Otherwise, it s supposed fixed and known D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 9 / 37

Parameter p Tweedie Model Can be estimated only when the number of payments is known Otherwise, it s supposed fixed and known p and φ need both to be estimated at the same time D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 9 / 37

Parameter φ Optimizing φ using the likelihood principle φ p = i,j w i,j (y i,j µ 1 p i,j 1 p µ2 p i,j 2 p (1 + ν) i,j r i,j ) D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 10 / 37

Parameter φ FIGURE: Optimizing p using the likelihood principle for φ D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 11 / 37

Parameter φ Optimizing φ using the deviance principle φ p = i,j 2 y (y 1 p i,j µ 1 p i,j i,j N Q 1 p y 2 p i,j µ 2 p i,j 2 p ) D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 12 / 37

Parameter φ FIGURE: Optimizing p using the deviance principle for φ D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 13 / 37

Parameter w i,j Note The exposure has been incorporated in the begining, within the initial hypothesis D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 14 / 37

Parameter w i,j Note The exposure has been incorporated in the begining, within the initial hypothesis Different ways of incorporating the exposure within the initial hypothesis would lead to different models D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 14 / 37

Frequency vs Severity Compound Poisson Model Y = N k=1 X i Case 1 Case 2 Case 3 E[N] 10 20 10 Var[N] 10 20 10 E[X] 10 10 20 Var[X] 100 100 400 E[Y] 100 200 200 Var[Y] 2000 4000 8000 TABLE: Mean and variance of total costs for various situations D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 15 / 37

Frequency vs Severity Typical situation in a long-tail business Decreasing average frequency D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 16 / 37

Frequency vs Severity Typical situation in a long-tail business Decreasing average frequency Increasing average severity D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 16 / 37

Frequency vs Severity Typical situation in a long-tail business Decreasing average frequency Increasing average severity Increasing variance in the severity D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 16 / 37

Impact of the Distribution FIGURE: Fitted curve for Normal, Poisson and Gamma models D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 17 / 37

Dispersion Models Model Definition Y i,j Tweedie(µ i,j, p, φ i,j, w i,j ) D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 18 / 37

Dispersion Models Model Definition Y i,j Tweedie(µ i,j, p, φ i,j, w i,j ) µ i,j = e Xβ D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 18 / 37

Dispersion Models Model Definition Y i,j Tweedie(µ i,j, p, φ i,j, w i,j ) µ i,j = e Xβ E[Y i,j ] = µ i,j, Var[Y i,j ] = φ i,j µ p i,j D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 18 / 37

Dispersion Models Model Definition Y i,j Tweedie(µ i,j, p, φ i,j, w i,j ) µ i,j = e Xβ E[Y i,j ] = µ i,j, Var[Y i,j ] = φ i,j µ p i,j φ i,j = e V γ D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 18 / 37

Dispersion Models Log-likelihood l = ( ) (wi,j /φ i,j ) ν+1 yi,j ν r i,j log (p 1) ν log ( r i,j!γ ( r i,j ν ) ) y i,j (2 p) i,j + w i,j µ (y 1 p i,j i,j φ i,j 1 p µ 2 p ) i,j 2 p D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 19 / 37

Dispersion Models Notes The p parameter is optimized at the same time as all other parameters using implicitly the likelihood principle D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 20 / 37

Dispersion Models Notes The p parameter is optimized at the same time as all other parameters using implicitly the likelihood principle Accident years do not have a significant impact on the dispersion parameter D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 20 / 37

Dispersion Models Notes The p parameter is optimized at the same time as all other parameters using implicitly the likelihood principle Accident years do not have a significant impact on the dispersion parameter Due to lack of data in the last column, the model was build so that the last two columns have the same dispersion parameter D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 20 / 37

Dispersion Models Notes The p parameter is optimized at the same time as all other parameters using implicitly the likelihood principle Accident years do not have a significant impact on the dispersion parameter Due to lack of data in the last column, the model was build so that the last two columns have the same dispersion parameter Possibility to incorporate trends in the dispersion parameter by using the Hoerl s curve parametrisation D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 20 / 37

Double GLMs Algorithm 1 Start with the initial exposure and find the normalized incremental payments D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 21 / 37

Double GLMs Algorithm 1 Start with the initial exposure and find the normalized incremental payments 2 Find the deviance between fitted and observed values D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 21 / 37

Double GLMs Algorithm 1 Start with the initial exposure and find the normalized incremental payments 2 Find the deviance between fitted and observed values 3 Model the deviance D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 21 / 37

Double GLMs Algorithm 1 Start with the initial exposure and find the normalized incremental payments 2 Find the deviance between fitted and observed values 3 Model the deviance 4 Establish the new exposure and start over D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 21 / 37

Double GLMs FIGURE: Inter-relationship between the two sub-models D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 22 / 37

Iterative Weighted Least Squares Algorithm additional specifications Analogous to Fisher s weighted scoring method for optimization D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 23 / 37

Iterative Weighted Least Squares Algorithm additional specifications Analogous to Fisher s weighted scoring method for optimization IWLS implicitly uses the deviance principle for estimating φ D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 23 / 37

Restricted Maximum Likelihood Notes Maximum likelihood estimators are biased downwards when the number of estimators is large compared to the number of observations D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 24 / 37

Restricted Maximum Likelihood Notes Maximum likelihood estimators are biased downwards when the number of estimators is large compared to the number of observations REML produces estimators which are approximately and sometimes exactly unbiased D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 24 / 37

Restricted Maximum Likelihood Notes Maximum likelihood estimators are biased downwards when the number of estimators is large compared to the number of observations REML produces estimators which are approximately and sometimes exactly unbiased Approximately maximizes the penalized log-likelihood l p (y; γ; p) = l(y; β γ ; γ; p) + 1 X 2 log T WX D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 24 / 37

Optimizing p Algorithm 1 Suppose p fixed and known D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 25 / 37

Optimizing p Algorithm 1 Suppose p fixed and known 2 Evaluate all the other parameters using the DGLM D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 25 / 37

Optimizing p Algorithm 1 Suppose p fixed and known 2 Evaluate all the other parameters using the DGLM 3 Evaluate the penalized log-likelihood D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 25 / 37

Optimizing p Algorithm 1 Suppose p fixed and known 2 Evaluate all the other parameters using the DGLM 3 Evaluate the penalized log-likelihood 4 Start over with different values for p and compare the penalized log-likelihood D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 25 / 37

Reserve Variability Mean Square Error of Prediction Dispersion Models : overdispersion included implicitly in the parameter covariance matrix D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 26 / 37

Reserve Variability Mean Square Error of Prediction Dispersion Models : overdispersion included implicitly in the parameter covariance matrix GLMs : overdispersion is included manually in the parameter covariance matrix D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 26 / 37

Data Analyzed D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 27 / 37

Data Analyzed D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 28 / 37

Data Analyzed D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 29 / 37

Optimizing p FIGURE: Restricted log-likelihood for various p in a DGLM D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 30 / 37

Analysis of Results D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 31 / 37

Analysis of Results D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 32 / 37

Analysis of Results FIGURE: Adjusting the most deviant observations has a bigger influence on the deviance principle D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 33 / 37

Analysis of Results FIGURE: Contribution of each cell to the dispersion. p = 1.1741, w i,j 1 D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 34 / 37

Conclusion Further Discussion Lack of observations and abundance of parameters is a hostile environment for DGLMs D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 35 / 37

Conclusion Further Discussion Lack of observations and abundance of parameters is a hostile environment for DGLMs The deviance cannot be estimated in the last column for a DGLM and is hence ignored when estimating φ D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 35 / 37

Conclusion Further Discussion Lack of observations and abundance of parameters is a hostile environment for DGLMs The deviance cannot be estimated in the last column for a DGLM and is hence ignored when estimating φ The algorithm does not converge for p fixed when there are more than 7 parameters D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 35 / 37

Conclusion Further Discussion Lack of observations and abundance of parameters is a hostile environment for DGLMs The deviance cannot be estimated in the last column for a DGLM and is hence ignored when estimating φ The algorithm does not converge for p fixed when there are more than 7 parameters The algorithm does converge for p fixed for one parameter, but p cannot be optimized D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 35 / 37

Conclusion Further Discussion Lack of observations and abundance of parameters is a hostile environment for DGLMs The deviance cannot be estimated in the last column for a DGLM and is hence ignored when estimating φ The algorithm does not converge for p fixed when there are more than 7 parameters The algorithm does converge for p fixed for one parameter, but p cannot be optimized The "bounds" of convergence need to be explored further D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 35 / 37

References England, PD and Verrall, RJ (2002). Stochastic Claims Reserving in General Insurance. Institute of Actuaries and Faculty of Actuaries Smyth, G. and Jorgensen, B. (2002). Fitting Tweedie s Compound Poisson Model to Insurance Claims Data : Dispersion Modelling. ASTIN Bulletin, 1, 143 157 Wüthrich, MV (2003). Claims Reserving Using Tweedie s Compound Poisson Model. ASTIN Bulletin, 331 346 D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 36 / 37

The end D. Davidov (UQAM) Loss Reserves Variance Modeling August 2009 37 / 37