Synchronous bootstrapping of loss reserves

Similar documents
Greg Taylor. Taylor Fry Consulting Actuaries University of Melbourne University of New South Wales. CAS Spring meeting Phoenix Arizona May 2012

CHAIN LADDER FORECAST EFFICIENCY

Claims Reserving under Solvency II

Bootstrapping the triangles

Chain ladder with random effects

Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business

On the Importance of Dispersion Modeling for Claims Reserving: Application of the Double GLM Theory

Calendar Year Dependence Modeling in Run-Off Triangles

A HOMOTOPY CLASS OF SEMI-RECURSIVE CHAIN LADDER MODELS

The Algebra of the Kronecker Product. Consider the matrix equation Y = AXB where

Prediction Uncertainty in the Bornhuetter-Ferguson Claims Reserving Method: Revisited

DELTA METHOD and RESERVING

Various Extensions Based on Munich Chain Ladder Method

Stochastic Incremental Approach for Modelling the Claims Reserves

Reserving for multiple excess layers

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley

GARY G. VENTER 1. CHAIN LADDER VARIANCE FORMULA

The Chain Ladder Reserve Uncertainties Revisited

Analytics Software. Beyond deterministic chain ladder reserves. Neil Covington Director of Solutions Management GI

A few basics of credibility theory

Various extensions based on Munich Chain Ladder method

DEPARTMENT OF ACTUARIAL STUDIES RESEARCH PAPER SERIES

A Loss Reserving Method for Incomplete Claim Data

Combining data from two independent surveys: model-assisted approach

Minwise hashing for large-scale regression and classification with sparse data

The Fuzziness in Regression Models

LECTURE 2 LINEAR REGRESSION MODEL AND OLS

Variance Reduction and Ensemble Methods

Bagging and Other Ensemble Methods

AGEC 661 Note Fourteen

ONE-YEAR AND TOTAL RUN-OFF RESERVE RISK ESTIMATORS BASED ON HISTORICAL ULTIMATE ESTIMATES

Bootstrap, Jackknife and other resampling methods

Institute of Actuaries of India

arxiv: v1 [stat.ap] 19 Jun 2013

1. The OLS Estimator. 1.1 Population model and notation

Individual loss reserving with the Multivariate Skew Normal framework

arxiv: v1 [stat.ap] 17 Jun 2013

y it = α i + β 0 ix it + ε it (0.1) The panel data estimators for the linear model are all standard, either the application of OLS or GLS.

Block Bootstrap Prediction Intervals for Vector Autoregression

E 4160 Autumn term Lecture 9: Deterministic trends vs integrated series; Spurious regression; Dickey-Fuller distribution and test

Generalized Mack Chain-Ladder Model of Reserving with Robust Estimation

Delta Boosting Machine and its application in Actuarial Modeling Simon CK Lee, Sheldon XS Lin KU Leuven, University of Toronto

Heteroskedasticity in Panel Data

MS&E 226: Small Data

Heteroskedasticity in Panel Data

Cross validation of prediction models for seasonal time series by parametric bootstrapping

Weighted Least Squares

Classification: Logistic Regression from Data

Lecture 13: Simple Linear Regression in Matrix Format. 1 Expectations and Variances with Vectors and Matrices

Instrumental Variables, Simultaneous and Systems of Equations

Classification: Logistic Regression from Data

Multivariate Regression

In the bivariate regression model, the original parameterization is. Y i = β 1 + β 2 X2 + β 2 X2. + β 2 (X 2i X 2 ) + ε i (2)

Triangles in Life and Casualty

Introduction to Machine Learning

Cohort Effect Structure in the Lee-Carter Residual Term. Naoki Sunamoto, FIAJ. Fukoku Mutual Life Insurance Company

Financial Econometrics

G E INTERACTION USING JMP: AN OVERVIEW

Variance. Standard deviation VAR = = value. Unbiased SD = SD = 10/23/2011. Functional Connectivity Correlation and Regression.

Fractional Imputation in Survey Sampling: A Comparative Review

Forecasting with ARMA Models

Recent Development in Claims Reserving

Basics of Multivariate Modelling and Data Analysis

Bootstrap tests of multiple inequality restrictions on variance ratios

A Short Course in Basic Statistics

Kevin Ewans Shell International Exploration and Production

Ridge Estimation and its Modifications for Linear Regression with Deterministic or Stochastic Predictors

Multivariate Random Variable

PSC 504: Differences-in-differeces estimators

at least 50 and preferably 100 observations should be available to build a proper model

Econ 424 Time Series Concepts

Research Article Investigating Mortality Uncertainty Using the Block Bootstrap

Mortality projection based on the Wang transform

ECON 4160, Spring term Lecture 12

A New Combined Approach for Inference in High-Dimensional Regression Models with Correlated Variables

A Non-parametric bootstrap for multilevel models

EL1820 Modeling of Dynamical Systems

Dimension Reduction in Abundant High Dimensional Regressions

Regression: Ordinary Least Squares

Peter Hoff Linear and multilinear models April 3, GLS for multivariate regression 5. 3 Covariance estimation for the GLM 8

Chapter 2: Unit Roots

Zellner s Seemingly Unrelated Regressions Model. James L. Powell Department of Economics University of California, Berkeley

Time Series Models and Inference. James L. Powell Department of Economics University of California, Berkeley

Combining information from different sources: A resampling based approach

Dynamic Regression Models (Lect 15)

Lecture 13 Family Selection. Bruce Walsh lecture notes Synbreed course version 4 July 2013

Automatic Autocorrelation and Spectral Analysis

Prof. Dr. Roland Füss Lecture Series in Applied Econometrics Summer Term Introduction to Time Series Analysis

Bumpbars: Inference for region detection. Yuval Benjamini, Hebrew University

A Modified Fractionally Co-integrated VAR for Predicting Returns

Principles of forecasting

Econ 623 Econometrics II Topic 2: Stationary Time Series

2.5 Forecasting and Impulse Response Functions

. a m1 a mn. a 1 a 2 a = a n

Econometrics 2, Class 1

GENERALIZED LINEAR MODELING APPROACH TO STOCHASTIC WEATHER GENERATORS

Regression Analysis Chapter 2 Simple Linear Regression

Cross-Validation with Confidence

OMISSION OF RELEVANT EXPLANATORY VARIABLES IN SUR MODELS: OBTAINING THE BIAS USING A TRANSFORMATION

Structural VAR Models and Applications

Transcription:

Synchronous bootstrapping of loss reserves Greg Taylor Taylor Fry Consulting Actuaries University of Melbourne University of New South Wales Gráinne McGuire Taylor Fry Consulting Actuaries ASTIN Colloquium Zurich 4-7 September 2005 1

Overview We shall be concerned with forecasting There are multiple data sets There is dependency (correlation) between them (Seemingly Unrelated Regressions) We require estimation of forecast error Separately by data set For aggregated data sets Forecast error to be estimated by bootstrapping 2

Overview We shall be concerned with forecasting (loss reserving) There are multiple data sets (lines of business (LoBs)) There is dependency (correlation) between them (Seemingly Unrelated Regressions) We require estimation of forecast error Separately by data set (LoB) For aggregated data sets (LoBs) Forecast error to be estimated by bootstrapping 3

Statement of problem Consider an insurance portfolio consisting of LoBs labelled i=1,2,,i Let Z i denote some technical liability associated with LoB i e.g. loss reserve The Z i are not necessarily stochastically independent Let Z=Σ i Z i = Total Liability across all LoBs Estimate the distribution of Z 4

Data and model set-up LoB 1 Y 1 =g 1 (X 1,β 1 )+ε 1 Y 1 =g 1 (X 1,β 1 )+ε 1 LoB 2 Correlated Y 2 =g 2 (X 2,β 2 )+ε 2 Z 1 =α 1 (U 1,β 1 )+η 1 Y 2 =g 2 (X 2,β 2 )+ε 2 Z 2 =α 2 (U 2,β 2 )+η 2 5

Sources of correlation 6

Correlated noise Y i = µ i + ε i = g i (X i,β i ) + ε i Y j = µ j + ε j = g j (X j,β j ) + ε j C ij = Corr(Y i,y j ) = Corr(ε i,ε j ) 7

Correlated noise (cont d) LoB 1 LoB 2 µ 1ad + ε 1ad expected noise µ 2ad + ε 2ad expected noise CORRELATED 8

Parameter correlation Assumed model Y i = µ i + ε i = g i (X i,β i ) + ε i BUT model mis-specified. True model is Y i = µ + i + ε + i = g + i(x + i,β i,γ i ) + ε + i where γ i is an additional vector of unrecognised parameters g + i, X + i are a function and design matrix that accommodate the unrecognised parameters ε + i and ε + j are independent Note that ε i = ε + i + b i where b i = g + i(x + i,β i,γ i ) - g i (X i,β i ) [bias] Covariance E[ε i ε jt ] = b i b j T Mis-specification creates bias and correlation 9

Parameter correlation (cont d) Shared row parameters With unrecognised variation between rows LoB1 Level parameter α 1 LoB2 Level parameter α 4 Uncorrelated noise If parameter variation by row modelled, then no correlation If not modelled, correlation created 10

Conventional bootstrap re-visited Data Model Fitted Standardised Residuals 11

Conventional bootstrap re-visited Data Model REAL DATA PSEUDO DATA Standardised Residuals (permuted) Fitted New Data (pseudo data) New Model Standardised Residuals New fitted 12

Conventional bootstrap re-visited Data Model REAL DATA PSEUDO DATA Standardised Residuals (permuted) FUTURE Forecast mean values Fitted New Data (pseudo data) New Model Standardised Residuals (resampled again) Standardised Residuals New fitted Process error Replicate many times Bootstrap realisation 13

Conventional bootstrap re-visited Data Model REAL DATA PSEUDO DATA Standardised Residuals (permuted) FUTURE Forecast mean values Fitted New Data (pseudo data) New Model Standardised Residuals (resampled again) Standardised Residuals New fitted Process error Replicate many times Bootstrap realisation 14

Conventional bootstrap re-visited Example Standardised Residuals (permuted) 15

Conventional bootstrap of two data sets DATA SET 1 DATA SET 2 Standardised Residuals (permuted) Standardised Residuals (permuted) Any correlation lost due to independent permutation of residuals 16

Synchronous bootstrapping Conventional (independent) bootstrapping of correlated data sets destroys the correlations The solution is to synchronise the permutations applied to the residuals of the different data sets Kirschner, Kerley & Isaacs (CAS, 2002) The form of synchronisation depends on the assumed form of correlation 17

Synchronous bootstrap correlated noise DATA SET 1 DATA SET 2 Standardised Residuals Standardised Residuals (permuted) (permuted) Assume: Background correlation between noise terms of non-corresponding cells Different correlation for corresponding cells 18

Synchronous bootstrap correlated (shared) row parameters LoB 1 LoB 2 Level parameter α 1 Level parameter α 4 Uncorrelated noise Parameter variation by row not recognised, creating correlation between corresponding rows of different data sets 19

Synchronous bootstrap correlated (shared) row parameters Synchronise by restricting permutations to within rows in each data set (specific permutations need not be synchronised) High (low) residuals in LoB 1 will tend to be associated with high (low) residuals in LoB 2 LoB 1 LoB 2 Level parameter α 1 Assume: Level parameter α 4 Background correlation between observations in noncorresponding rows (e.g. zero) Different correlation between observations in corresponding rows 20

Numerical results 21

Point-wise bootstrap 3 triangles Each 20x20 Triangles have identical expectations Rows within triangles have identical expectations Each row s expectation follows a Hoerl curve (PPCI) Individual cells gamma distributed about expectations Gamma distributions for corresponding cells of different triangles subject to correlation of about 80% Otherwise independent (within and between triangles) Correlated noise Point-wise synchronised bootstrap 22

Point-wise bootstrap (cont d) Basis of estimation True (simulation) Independent bootstrap Synchronous pointwise bootstrap Pair-wise correlation of LoB loss reserves CoV of aggregate loss reserve across 3 LoBs 0.81 5.4% -0.00 3.0% 0.79 5.0% 23

Row-wise bootstrap Same data generation (of 3 data sets) as before except that No correlated noise Expected level of Hoerl curve follows geometric random walk through accident periods Same level for each data set for given accident period Model specification deliberately overlooks variation in row parameter (Row) parameter correlation Row-wise synchronised bootstrap 24

Row-wise bootstrap examples of data sets Sampled triples of data triangles 1000% Level (relative to first accident period 100% 10% 1 3 5 7 9 11 13 15 17 19 Accident period Sample 1 Sample 2 Sample 3 Sample 4 Sample 5 25

Row-wise bootstrap efficiency measurement Descriptor of sample = ratio R =Var[Σ i Z i ] / Σ i Var[Z i ] = 1 for independent Z i = 3 for fully correlated Z i Efficiency measure = (Estimated R 1) (True R 1) 26

Row-wise bootstrap (cont d) Sample Descriptor Efficiency measure 1 2.98 73% 2 2.48 60% 3 2.77 24% 4 2.71 26% 5 2.86 103% 27