GARY G. VENTER 1. CHAIN LADDER VARIANCE FORMULA

Similar documents
Prediction Uncertainty in the Bornhuetter-Ferguson Claims Reserving Method: Revisited

CHAIN LADDER FORECAST EFFICIENCY

Triangles in Life and Casualty

Claims Reserving under Solvency II

Chain ladder with random effects

Stochastic Incremental Approach for Modelling the Claims Reserves

A HOMOTOPY CLASS OF SEMI-RECURSIVE CHAIN LADDER MODELS

Various Extensions Based on Munich Chain Ladder Method

Analytics Software. Beyond deterministic chain ladder reserves. Neil Covington Director of Solutions Management GI

A Loss Reserving Method for Incomplete Claim Data

Calendar Year Dependence Modeling in Run-Off Triangles

Identification of the age-period-cohort model and the extended chain ladder model

Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business

ONE-YEAR AND TOTAL RUN-OFF RESERVE RISK ESTIMATORS BASED ON HISTORICAL ULTIMATE ESTIMATES

New Statistical Methods That Improve on MLE and GLM Including for Reserve Modeling GARY G VENTER

Greg Taylor. Taylor Fry Consulting Actuaries University of Melbourne University of New South Wales. CAS Spring meeting Phoenix Arizona May 2012

Various extensions based on Munich Chain Ladder method

Bootstrapping the triangles

GREG TAYLOR ABSTRACT

DEPARTMENT OF ACTUARIAL STUDIES RESEARCH PAPER SERIES

Individual loss reserving with the Multivariate Skew Normal framework

ABSTRACT 1. INTRODUCTION

The Chain Ladder Reserve Uncertainties Revisited

On the Importance of Dispersion Modeling for Claims Reserving: Application of the Double GLM Theory

DELTA METHOD and RESERVING

Reserving for multiple excess layers

Stochastic Loss Reserving with the Collective Risk Model

arxiv: v1 [stat.ap] 17 Jun 2013

Individual reserving a survey. R in Insurance, Paris 08-June-2017 Alexandre Boumezoued, joint work with Laurent Devineau

Regression Models - Introduction

Chapter 1 Statistical Inference

A Multivariate Bayesian Claim Count Development Model With Closed Form Posterior and Predictive Distributions

arxiv: v1 [stat.ap] 19 Jun 2013

Forecasting with the age-period-cohort model and the extended chain-ladder model

Synchronous bootstrapping of loss reserves

Multivariate Chain Ladder

New Perspectives and Methods in Loss Reserving Using Generalized Linear Models

Regression Models - Introduction

ECO 513 Fall 2009 C. Sims HIDDEN MARKOV CHAIN MODELS

Generalized Mack Chain-Ladder Model of Reserving with Robust Estimation

Recent Development in Claims Reserving

EFFECTS OF VARIATIONS FROM GAMMA-POISSON ASSUMPTIONS. Abstract

A Comparison of Resampling Methods for Bootstrapping Triangle GLMs

Varieties of Count Data

On the Distribution of Discounted Loss Reserves Using Generalized Linear Models

1. The OLS Estimator. 1.1 Population model and notation

ASA Section on Survey Research Methods

ORF 245 Fundamentals of Engineering Statistics. Final Exam

Ridge Regression. Summary. Sample StatFolio: ridge reg.sgp. STATGRAPHICS Rev. 10/1/2014

Confidence Bounds for Discounted Loss Reserves

Regression Analysis Tutorial 34 LECTURE / DISCUSSION. Statistical Properties of OLS

arxiv: v1 [stat.me] 25 Jan 2015 Table of Contents

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

Consistency Under Sampling of Exponential Random Graph Models

EC408 Topics in Applied Econometrics. B Fingleton, Dept of Economics, Strathclyde University

Currie, Iain Heriot-Watt University, Department of Actuarial Mathematics & Statistics Edinburgh EH14 4AS, UK

MATH2206 Prob Stat/20.Jan Weekly Review 1-2

Estimating and Testing the US Model 8.1 Introduction

NUCLEAR SAFETY AND RELIABILITY WEEK 3

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

Supplementary Note on Bayesian analysis

The distribution of Γ A Fourier decomposing Jeffery Lewins, January 2014

Generalized Linear Models for Non-Normal Data

Neural, Parallel, and Scientific Computations 18 (2010) A STOCHASTIC APPROACH TO THE BONUS-MALUS SYSTEM 1. INTRODUCTION

Mustafa H. Tongarlak Bruce E. Ankenman Barry L. Nelson

Prompt. Commentary. Mathematical Foci

Statistics 572 Semester Review

Estimation and sample size calculations for correlated binary error rates of biometric identification devices

Modules 1-2 are background; they are the same for regression analysis and time series.

DEPARTMENT MATHEMATIK SCHWERPUNKT MATHEMATISCHE STATISTIK UND STOCHASTISCHE PROZESSE

Lack-of-fit Tests to Indicate Material Model Improvement or Experimental Data Noise Reduction

THE N-VALUE GAME OVER Z AND R

Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

Notes on Random Variables, Expectations, Probability Densities, and Martingales

Regression. Oscar García

Regression Clustering

EXAMINATION: QUANTITATIVE EMPIRICAL METHODS. Yale University. Department of Political Science

Midterm. Introduction to Machine Learning. CS 189 Spring Please do not open the exam before you are instructed to do so.

Localization and Correlation in Ensemble Kalman Filters

MIDTERM EXAMINATION (Spring 2011) STA301- Statistics and Probability

Module 1 Linear Regression

Chapter 3: Regression Methods for Trends

Chapter 1. Linear Regression with One Predictor Variable

6 Evolution of Networks

Midterm 2 V1. Introduction to Artificial Intelligence. CS 188 Spring 2015

Everything Old Is New Again: Connecting Calculus To Algebra Andrew Freda

Chapter 3. Introduction to Linear Correlation and Regression Part 3

A Rejoinder to Mackintosh and some Remarks on the. Concept of General Intelligence

Assessing Structural VAR s

Comparing IRT with Other Models

11. Generalized Linear Models: An Introduction

CREDIBILITY IN THE REGRESSION CASE REVISITED (A LATE TRIBUTE TO CHARLES A. HACHEMEISTER) H.BUHLMANN. ETH Zurich. A. GlSLER. Winterhur- Versicherungen

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Section 1.1: A Preview of Calculus When you finish your homework, you should be able to

Non-Life Insurance: Mathematics and Statistics

Loss Reserving Using Estimation Methods Designed for Error Reduction

Diagnostics of Linear Regression

Signalized Intersection Delay Models

ECON3327: Financial Econometrics, Spring 2016

Transcription:

DISCUSSION OF THE MEAN SQUARE ERROR OF PREDICTION IN THE CHAIN LADDER RESERVING METHOD BY GARY G. VENTER 1. CHAIN LADDER VARIANCE FORMULA For a dozen or so years the chain ladder models of Murphy and Mack have been known to give the same reserve estimates but differ slightly in their assumptions and in their calculations of the reserve variance. Up until now no one had provided a detailed analysis of the source of the difference in variance. The current (BBMW) paper shows that this arises from a slight difference in the estimates of the parameter variance piece, and this in turn comes from the fact that for Murphy the errors in the estimates of the development factors are independent, while under Mack s assumptions they are merely uncorrelated. This would suggest that any given data could be tested for the difference in assumptions by computing correlations of functions of the estimated factors. For instance a finding that some functions of the factors are correlated but the factors themselves are uncorrelated would support Mack s model over Murphy s. Now the Mack-Quarg-Braun (MQB) discussion has short-cut this approach by showing that under the basic chain ladder assumptions of Mack or BBMW, the squares of consecutive development factors are negatively correlated. This would seem to rule out the independence assumption and Murphy s formula. The BBMW model has the same formulas as Murphy s. However it is not entirely clear if they have the same problem with independence of factor estimates. Mack showed that the estimates are conditionally uncorrelated given the portion of the triangle available at the relevant points. BBMW introduce a resampling methodology which yields hypothetical new factors identical in distribution to the old ones and which are conditionally independent given the entire original triangle. (Conditional on the whole triangle the original factors are deterministic.) They then use this conditional independence to derive Murphy s variance result for the new factors. The result holds for the original factors because they have the same distribution as the new factors. Thus BBMW assert that they are not assuming independence. This is a novel and interesting approach but could use some more vetting to be sure it is valid. However independence of factors seems to creep in anyway, although as a conclusion rather than an assumption. MQB s result of non-independence of adjacent factor estimates seems to arise naturally from the denominator of one factor being almost the numerator of the next, and it is not clear that resampling avoids this. Astin Bulletin 36 (2), 566-571. doi: 10.2143/AST.36.2.2017940 2006 by Astin Bulletin. All rights reserved.

DISCUSSION OF THE MEAN SQUARE ERROR OF PREDICTION 567 In fact Mack s original finding that the factor estimates are uncorrelated is somewhat surprising in this context. 2. APPLICABILITY OF CHAIN LADDER TO THE EXAMPLE Many actuaries have interpreted the existence of models of the chain ladder as support for using it. A more nuanced view is that having a model provides a method for testing applicability of the chain ladder to a given data set 1. BBMW note that they are not representing that either model applies to the data in their example. This leaves the applicability open. Some testing methodology is reviewed below and applied to the example development triangle. The assumptions of both of the chain ladder models are found to be violated by this data. The models of Mack and Murphy-BBMW express the expected incremental loss at one lag as a factor times the cumulative loss at the previous lag. Whether or not this holds for a given loss triangle is easy to test by regression methods. The incremental loss at lag d + 1 (call this y) can be estimated as a linear regression on the cumulative loss at lag d (called x). If the model applies, the additive constant from the regression should not be significantly different from zero, but the factor should be. Because the variance is assumed to be proportional to x, both y and x should be divided by x before doing the regression. Figure 1. 1 See for instance Mack(1994), Venter (1998), Barnett and Zehnwirth (2000).

568 G.G. VENTER The regressions were carried out for the first five columns of the sample array. The combination of factor significant and constant not fails for all of them. The only significant regression is in fact 1 st to 2 nd, but the constant there is also significant, with higher losses in the first period predicting lower losses emerging in the second. The sample points are graphed in Figure 1. Points from a factor model would look like a straight line through the origin, with some randomness around it, which this does not. Another assumption of the chain-ladder models is that the development factors are not correlated. This is violated for the 3 rd to 4 th and 4 th to 5 th factors, which are significantly negatively correlated. Independence of the factor estimates, which is required for Murphy-BBMW but not Mack, is also violated. This can be checked by looking at the correlation of squared deviations of the individual factors from the column mean factors. These are significantly positive for the 3 rd to 4 th factor with both the 2 nd to 3 rd and 4 th to 5 th factors. The chain-ladder independence assumptions also imply that there are no calendar-year (diagonal) effects in the triangle. This implication can be violated for instance by accelerated or stalled claim department activity in a calendar year. Such a departure would often be made up for in a later year or years, so more than one diagonal can be affected. A similar pattern can arise from inflation operating on calendar years. Inflation operating on year of origin is built into the factor approach, as each year gets its own level. But there can appear to be inflation by year of origin that is actually generated by calendar year inflation. If the latter varies by year, a pattern of high and low residuals can show up by diagonal. A large variation in residuals among diagonals could suggest that either calendar year inflation or claim department instability is affecting the data. Table 1 shows the average residual and fraction of positive residuals for the last 3 diagonals of the example. Only the second to last is comfortably random, suggesting that there are diagonal effects that have not been picked up by the model. TABLE 1 Average residual Fraction positive (160,758) 1 of 7 22,930 3 of 8 74,814 6 of 8 3. MULTIPLICATIVE FIXED-EFFECTS MODEL COMPARISON The differences among the chain-ladder variances discussed so far are relatively minor. There is a quite different model which gives the same reserves as the chain-ladder but different fitted values and a different variance. Instead of

DISCUSSION OF THE MEAN SQUARE ERROR OF PREDICTION 569 an incremental cell s expected losses being a factor times the previous cumulative, it is a percentage of ultimate losses. E.g., in a paid loss triangle expected paids follow a payout pattern based on ultimate losses even though the ultimate is not known. Each cell is modeled as a random innovation plus the product of a row and column parameter (e.g., ultimate and fraction paid), so this can be called a multiplicative fixed-effects model. It agrees with the chain-ladder estimates when losses follow an aggregate distribution consisting of a Poisson frequency and a Constant Severity (PCS). That is, each cell of the incremental triangle has a Poisson loss frequency and all cells have the same constant loss severity. This is of course a gross over-simplification of the loss generation process, but nevertheless it might give a reasonable account of the statistical behavior of some loss triangles. Denoting b as the constant severity, and x as the aggregate loss for a cell, the distributional assumption is that x/b is Poisson. The resulting aggregate loss distribution has also been called the over-dispersed Poisson. It has mean bl and variance b 2 l, and so variance-to-mean ratio b, which does not have to be above 1. The interest here is that if the row and column factors are estimated by maximum likelihood, then the reserve calculation is the chain-ladder algorithm. For the pure Poisson case this was shown by Hachemeister and Stanard (1975) although it was not published formally until Kremer (1985) in German and Mack (1991) in English. The extension to the PCS model was by Renshaw and Verrall (1998). A good presentation is Clark (2003), who assumes a parameterized distribution for the payout pattern, but a discrete distribution is a special case. He also discusses the Cape Cod model, which is PCS with all accident years at the same level. None of the cited papers compare the resulting variance to that from Mack or Murphy, however. The PCS model estimates the expected value of the observed cells of the triangle by the last diagonal backed down with the development factors. From the chain ladder viewpoint this is predicting past losses from later information. But the PCS is not the chain ladder. The paradigm of loss emergence as a fixed payout pattern can be traced back at least to the Bornheutter-Ferguson (BF) model, so PCS is more like a parameterized version of BF than of the chain ladder. Backing down from the latest diagonal often gives better predictions of observed incremental losses, and so may give an advantage in goodness of fit to the PCS. But this fit has to be penalized due to the extra parameters. It is actually not easy to compare fits, because the chain-ladder models are conditional given the triangle, whereas PCS uses the data in the triangle to fit unobserved parameters unconditionally. Perhaps the best way to compare the fits is to calculate the variance of the estimated reserves for each model, including parameter uncertainty, and interpret the model with the lower variance as having the best fit. This credits the PCS if it has the better fit but penalizes it for having more parameters. Clark shows how to measure the PCS variance with the information matrix from MLE and the delta method, which gives the parameter variance for a

570 G.G. VENTER Figure 2. function of MLE-estimated parameters. In this case the function is the estimated reserve. For the triangle in the BBMW paper, the PCS prediction standard error can be shown to be 2,827,000 compared to 2,447,000 for Mack s model. Here the better fit from backing down from the latest diagonal does not compensate for the extra parameter variance of the PCS. The process standard deviation reduced by almost 50%, but the increase in the parameter standard deviation was nearly 70%. Of course the PCS itself has assumptions that need to be checked. The analogue to comparing incremental losses to previous cumulative is to compare them to estimated ultimate, because PCS postulates that expected incrementals are factors times ultimate losses. Figure 2 graphs the 1 st to 2 nd incrementals vs. expected ultimate. This looks more plausible as a straight line through the origin than were the same incrementals as a function of losses at 1 st, shown in Figure 1. The PCS model has other key assumptions, like constant variance-to-mean ratio and no diagonal effects. Without going into detail, the constant ratio has support in this triangle, but there is still a problem with diagonal effects. The PCS actually assumes all cells are independent, so any correlation of any functions of rows, columns, or diagonals would be problematic. As noted above, some such were found for this triangle, so the PCS model assumptions are also violated.

DISCUSSION OF THE MEAN SQUARE ERROR OF PREDICTION 571 3. CONCLUSIONS The BBMW paper provides some clarity as to why the Mack and Murphy models have different variance results, coming down to the issue of independent vs. uncorrelated factor estimates. However the work of the MQB reviewers supports non-correlation but not independence, and so the Mack formula. Neither the Mack nor Murphy-BBMW chain ladder assumptions are met by the data in the example triangle. The PCS multiplicative fixed-effects model gives the same reserve estimate and provides a significantly better fit but a higher variance estimate for this data, and its assumptions are not satisfied either. There are numerous other models in the literature that could be tried. However the negative correlation of the 3 rd and 4 th developments could be a problem for many of them. REFERENCES BARNETT, G. and ZEHNWIRTH, B. (2000) Best Estimates for Reserves, CAS Proceedings, 245-321. CLARK, D.R. (2003) LDF Curve-Fitting and Stochastic Reserving: A Maximum Likelihood Approach. CAS Forum (Fall): 41-92. HACHEMEISTER, C. and STANARD, J. (1975) IBNR Claims Count Estimation With Static Lag Functions. Presented at 1975 ASTIN Colloquium, Portimao, Portugal. KREMER, E. (1985) Einfuhrung in die Versicherungsmathematik, Vandenhoek & Ruprecht, Gottingen. MACK, T. (1991) A simple parametric model for rating automobile insurance or estimating IBNR claims reserves. ASTIN Bulletin 21(1), 93-109. MACK, T. (1994) Measuring the Variability of Chain Ladder Reserve Estimates, CAS Forum (Spring): 101-182. MURPHY, D. (1994) Unbiased Loss Development Factors, CAS Proceedings LXXXI: 154-222. RENSHAW, A. and VERRALL, R.J. (1998) A Stochastic Model Underlying the Chain Ladder Technique. British Actuarial Journal 4, 903-23. VENTER, G. (1998) Testing the Assumptions of Age-to-Age Factors. CAS Proceedings, LXXXV: 807-47.