Pricing and Risk Analysis of a Long-Term Care Insurance Contract in a non-markov Multi-State Model

Similar documents
Long-Term Care with Multi-State Models

Laboratoire SAF 50 Avenue Tony Garnier Lyon cedex 07

Helms, Czado, Gschlößl: Calculation of LTC Premiums based on direct estimates of transition probabilities

Regression analysis of interval censored competing risk data using a pseudo-value approach

Technische Universität München. Estimating LTC Premiums using. GEEs for Pseudo-Values

Robust estimates of state occupancy and transition probabilities for Non-Markov multi-state models

EXPERIENCE-BASED LONGEVITY ASSESSMENT

Multi-state models: prediction

A multistate additive relative survival semi-markov model

Multistate Modeling and Applications

Multi-state Models: An Overview

On Properties of QIC in Generalized. Estimating Equations. Shinpei Imori

Multilevel Statistical Models: 3 rd edition, 2003 Contents

I N S T I T U T D E S T A T I S T I Q U E B I O S T A T I S T I Q U E E T S C I E N C E S A C T U A R I E L L E S (I S B A)

A multi-state model for the prognosis of non-mild acute pancreatitis

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes:

arxiv: v1 [stat.ap] 17 Jun 2013

Nonparametric Model Construction

Analysing geoadditive regression data: a mixed model approach

INVERSE PROBLEMS FOR MARKOV MODELS IN HEALTH INSURANCE. Semyen I. Spivak. The Bashkir State University, Ufa, Russia

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Estimation for Modified Data

Bayesian Nonparametric Regression for Diabetes Deaths

University of California, Berkeley

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Stat 5101 Lecture Notes

Homogeneous Backward Semi-Markov Reward Models for Insurance Contracts

Lecture 5 Models and methods for recurrent event data

Statistical Inference and Methods

Quantile Regression for Residual Life and Empirical Likelihood

Exercises. (a) Prove that m(t) =

Using Estimating Equations for Spatially Correlated A

Understanding product integration. A talk about teaching survival analysis.

Multivariate Survival Data With Censoring.

Lifetime Dependence Modelling using a Generalized Multivariate Pareto Distribution

Subject CS1 Actuarial Statistics 1 Core Principles

Marginal Specifications and a Gaussian Copula Estimation

Model and Working Correlation Structure Selection in GEE Analyses of Longitudinal Data

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

Fall 2003 Society of Actuaries Course 3 Solutions = (0.9)(0.8)(0.3) + (0.5)(0.4)(0.7) (0.9)(0.8)(0.5)(0.4) [1-(0.7)(0.3)] = (0.

UNIVERSITY OF CALIFORNIA, SAN DIEGO

A multi-state model for the prognosis of non-mild acute pancreatitis

Product-limit estimators of the gap time distribution of a renewal process under different sampling patterns

Cox s proportional hazards model and Cox s partial likelihood

MLC Fall 2015 Written Answer Questions. Model Solutions

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

A note on the decomposition of number of life years lost according to causes of death

May 2013 MLC Solutions

A simple bivariate count data regression model. Abstract

Error distribution function for parametrically truncated and censored data

RMSC 2001 Introduction to Risk Management

Statistical Analysis of Competing Risks With Missing Causes of Failure

On the existence of maximum likelihood estimators in a Poissongamma HGLM and a negative binomial regression model

Generalized Estimating Equations (gee) for glm type data

HANDBOOK OF APPLICABLE MATHEMATICS

A Study of the Impact of a Bonus-Malus System in Finite and Continuous Time Ruin Probabilities in Motor Insurance

Duration-Based Volatility Estimation

Lecture 3. Truncation, length-bias and prevalence sampling

SPRING 2005 EXAM M SOLUTIONS. = When (as given in the problem), (x) dies in the second year from issue, the curtate future lifetime x + 1 is 0, so

Integrated approaches for analysis of cluster randomised trials

11 Survival Analysis and Empirical Likelihood

Dependence structures with applications to actuarial science

Counts using Jitters joint work with Peng Shi, Northern Illinois University

Statistics in medicine

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen

University of California, Berkeley

RMSC 2001 Introduction to Risk Management

A Guide to Modern Econometric:

Testing for Regime Switching in Singaporean Business Cycles

Modeling the scale parameter ϕ A note on modeling correlation of binary responses Using marginal odds ratios to model association for binary responses

MULTIDIMENSIONAL SMOOTHING BY ADAPTIVE LOCAL

Survival Analysis I (CHL5209H)

Assessing the effect of a partly unobserved, exogenous, binary time-dependent covariate on -APPENDIX-

Institute of Actuaries of India

Currie, Iain Heriot-Watt University, Department of Actuarial Mathematics & Statistics Edinburgh EH14 4AS, UK

Bayesian non-parametric model to longitudinally predict churn

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Stochastic Modelling Unit 1: Markov chain models

FUNCTIONAL DATA ANALYSIS. Contribution to the. International Handbook (Encyclopedia) of Statistical Sciences. July 28, Hans-Georg Müller 1

Yuxin Zhang. September 21st, 2018

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm

Estimation of cumulative distribution function with spline functions

Survival Analysis Math 434 Fall 2011

Research Statement. Zhongwen Liang

Correlation: Copulas and Conditioning

Longitudinal analysis of ordinal data

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements

Visualizing multiple quantile plots

Subject CT4 Models. October 2015 Examination INDICATIVE SOLUTION

Model Assumptions; Predicting Heterogeneity of Variance

A Practitioner s Guide to Generalized Linear Models

Frailty Models and Copulas: Similarities and Differences

Inference Methods for the Conditional Logistic Regression Model with Longitudinal Data Arising from Animal Habitat Selection Studies

The Bayesian Approach to Multi-equation Econometric Model Estimation

Philosophy and Features of the mstate package

Chapter 1. Bayesian Inference for D-vines: Estimation and Model Selection

Econ 582 Nonparametric Regression

Experience Rating in General Insurance by Credibility Estimation

Multivariate Assays With Values Below the Lower Limit of Quantitation: Parametric Estimation By Imputation and Maximum Likelihood

Transcription:

Pricing and Risk Analysis of a Long-Term Care Insurance Contract in a non-markov Multi-State Model Quentin Guibert Univ Lyon, Université Claude Bernard Lyon 1, ISFA, Laboratoire SAF EA2429, F-69366, Lyon, France Prim Act, Paris, France Email: q.guibert@hotmail.fr 2017 PARTY Winter School 8-13 January 2017, Ascona, Switzerland Joint work with F. Planchet (ISFA and Prim Act)

Context Multi-state models are the most natural tools for pricing and reserving Long Term Care (LTC) insurance guarantees. In literature, a Markov model is generally used may be too strong. Lack of (detailed) national data. It s hard to have precise and accurate insurance data. Classical inference methodologies: are based on crude intensities, use GLM Poisson models that depend on age or duration time (e.g. Haberman and Pitacco, 1998; Pritchard, 2006; Levantesi and Menzietti, 2012; Fong et al., 2015). Guibert and Planchet 2017 PARTY, 8-13 January 2017 2/20

Aims Helms et al. (2005) use a direct regression model for annual transition probabilities with a Markov model based on pseudo-values (Andersen et al., 2003). This allows to measure the effect of observed covariates. However, this framework requires that the Markov assumption to be satisfied. We propose dynamic regression methods using directly the transition probabilities of a non-markov illness-death model. Interests for pricing: Flexible and takes into account the effect of covariates over time. Offers a way to avoid or reduce the bias induced by the Markov assumption. Guibert and Planchet 2017 PARTY, 8-13 January 2017 3/20

Long-Term Care Insurance Model Consider an illness-death model X with only one heavy dependency state. Transition probabilities h j between s and t with individual covariates Z p hj (s, t; Z (s)) = P (X (t) = j X (s) = h, Z (s)). A very simple annuity guarantee paid in dependency. State 1. Health p 12 (s, t) 2. Disability 3 p 13 (s, t) p 23 (s, t) 2 3. Death 1 Premium π S1 Time Guibert and Planchet 2017 PARTY, 8-13 January 2017 4/20

Long-Term Care Insurance Model Consider an illness-death model X with only one heavy dependency state. Transition probabilities h j between s and t with individual covariates Z p hj (s, t; Z (s)) = P (X (t) = j X (s) = h, Z (s)). A very simple annuity guarantee paid in dependency. State 1. Health p 12 (s, t) 2. Disability 3 p 13 (s, t) p 23 (s, t) 2 Annuity B 3. Death 1 Premium π S1 Time Guibert and Planchet 2017 PARTY, 8-13 January 2017 4/20

Long-Term Care Insurance Model Consider an illness-death model X with only one heavy dependency state. Transition probabilities h j between s and t with individual covariates Z p hj (s, t; Z (s)) = P (X (t) = j X (s) = h, Z (s)). A very simple annuity guarantee paid in dependency. State 1. Health p 12 (s, t) 2. Disability 3 p 13 (s, t) p 23 (s, t) 2 Annuity B 3. Death 1 Premium π S1 S2 Time Guibert and Planchet 2017 PARTY, 8-13 January 2017 4/20

Long-Term Care Insurance Model Actuarial calculation in discrete time at the valuation time 0 for a s-year old insured. Actuarial value of premium payments a 1 (s; Z (s)) = ω s 1 τ=0 v τ p 11 (s, s + τ; Z (s)) π. Actuarial values of annuity payments in state h = 1, 2 A h (s; Z (s)) = ω s 1 τ=0 v τ p h2 (s, s + τ; Z (s)) B. Under the Markov assumption, we can estimate p hj (t) = p hj (t 1, t), and then use the Chapman-Kolmogorov equations. Calculating these probabilities is not simple without the Markov assumption. For state 1, we suppose that it s verified (no previous state). Guibert and Planchet 2017 PARTY, 8-13 January 2017 5/20

Long-Term Care Insurance Model Actuarial calculation in discrete time at the valuation time 0 for a s-year old insured. Actuarial value of premium payments a 1 (s; Z (s)) = ω s 1 τ=0 v τ p 11 (s, s + τ; Z (s)) π. Actuarial values of annuity payments in state h = 1, 2 A h (s; Z (s)) = ω s 1 τ=0 v τ p h2 (s, s + τ; Z (s)) B. Under the Markov assumption, we can estimate p hj (t) = p hj (t 1, t), and then use the Chapman-Kolmogorov equations. Calculating these probabilities is not simple without the Markov assumption. For state 1, we suppose that it s verified (no previous state). Guibert and Planchet 2017 PARTY, 8-13 January 2017 5/20

Long-Term Care Insurance Model Actuarial calculation in discrete time at the valuation time 0 for a s-year old insured. Actuarial value of premium payments a 1 (s; Z (s)) = ω s 1 τ=0 v τ p 11 (s, s + τ; Z (s)) π. Actuarial values of annuity payments in state h = 1, 2 A h (s; Z (s)) = ω s 1 τ=0 v τ p h2 (s, s + τ; Z (s)) B. Under the Markov assumption, we can estimate p hj (t) = p hj (t 1, t), and then use the Chapman-Kolmogorov equations. Calculating these probabilities is not simple without the Markov assumption. For state 1, we suppose that it s verified (no previous state). Guibert and Planchet 2017 PARTY, 8-13 January 2017 5/20

Non-Parametric Transition Probabilities Based on de Uña-Álvarez and Meira-Machado (2015) and Guibert and Planchet (2016) Introduce S, the lifetime in healthy state, and T, the overall lifetime. Let C and L be a right-censoring and a left-truncation variables. Truncation only occurs in state 1. Suppose (C, L) = (S, T, Z) and C L. If L S, we observe { S = min (S, C) and γ = 1 {S C}, T = min (T, C) and δ = 1 {T C}. = Consistent estimators of transition probabilities { } By selecting individuals in L 1 s = i : S i > s, we have p 11 (s, t) = 1 Ĥ 1 s (t), p 12 (s, t) = Ĥ 1 s (t) F 1 s (t), with Ĥ 1 s (t) and F 1 s (t) be the Kaplan-Meier c.d.f of S and T on L 1 s (Tsai et al., 1987). Guibert and Planchet 2017 PARTY, 8-13 January 2017 6/20

How Calculate Jackknife Pseudo-Values? Let n 1 s, the number of individuals alive and not truncated at time s. 1 For each individual i { 1,..., n 1 s }, remove i from the sample 2 Compute probabilities of interest on this subsample of cardinal ( n 1 s 1 ) p ( i) 11 (s, t) and p ( i) 12 (s, t) 3 Compute their jackknife pseudo-values ( ) n 1 s p 11 (s, t) n 1 s 1 p ( i) 11 (s, t) and ( ) n 1 s p 12 (s, t) n 1 s 1 p ( i) 12 (s, t) Guibert and Planchet 2017 PARTY, 8-13 January 2017 7/20

Dynamic Pseudo-Values Define a set of landmark time points S = {s 1,..., s K} and, for each s k, a grid of points T sk. Let be a (multivariate) parameter θ = E [f (S, T)] for some function f. For varying landmark time points s and for several prediction points t, our aim is to construct a regression GLM model of the form ( ) θ i = E [f (S i, T i) Z i] = g 1 β Z i, with g, an invertible link function. For each landmark s, define the jackknife pseudo-values for individual i at each time t T s responses in the regression model. h j fst hj Pseudo-values Selection p 11 (s, t) 1 θ11 {S>t} ist = n 1 s p 11 (s, t) ( n 1 s 1 ) p ( i) 11 (s, t) L 1 s p 12 (s, t) 1 {S t} 1 θ12 {T t} ist = n 1 s p 12 (s, t) ( n 1 s 1 ) p ( i) 12 (s, t) L 1 s Stacked models for each transition h j: hj index is suppressed to simplify. Guibert and Planchet 2017 PARTY, 8-13 January 2017 8/20

Regression Approach Idea: If landmark times are close enough, the effect of covariates should vary smoothly over s. Our approach is inspired by the dynamic regression model developed by Nicolaie et al. (2013). Consider f = (f st, s S, t T s), θ i = (θ ist, s S, t T s) and a link ( ) θ ist = g 1 β (s) Z it (s). Each component l of the vector β (s) can be written as a polynomial function β l (s) = β l q l (s). We consider simultaneously all landmarks, which can now interact with covariates. Polynomial function can depend on time s and time t. Guibert and Planchet 2017 PARTY, 8-13 January 2017 9/20

Regression Approach The vector β, such that β (s) = Q (s) β, is estimated using the GEE approach (Liang and Zeger, 1986), i.e. by resolving the score equation U (β (s)) = ( ) ) β (s) θis M 1 i ( θis θ is = 0 i A working covariance matrix M i should be fixed (e.g. independance, AR1). To have consistent results, we follow Nicolaie et al. (2013) and choose an independence working correlation matrix. Model selection can be done using the QIC criteria (Pan, 2001). Variance of β (s) is obtained by using a sandwich estimator. Convergence and asymptotic normality of β (s) is not formally demonstrated here, but is undoubtedly satisfy (Graw et al., 2009). However, performances should be investigated. Guibert and Planchet 2017 PARTY, 8-13 January 2017 10/20

The approach of Helms et al. (2005) Annual transition probabilities p 11 (t), p 12 (t), and p 23 (t) are estimated based on the Aalen-Johansen estimator (Aalen and Johansen, 1978) for Markov models. Compute the jackknife pseudo-values for these probabilities. Define a regression GLM model for their jackknife pseudo-values. Parameters are also estimated using the GEE approach. Guibert and Planchet 2017 PARTY, 8-13 January 2017 11/20

Insurance Data Individual insurance data with left-truncation (only in healthy state) and right-censoring. 15,900 (63% of women) contracts observed on a 13-years period and almost 47% are censored in healthy state only an extract of the database to have acceptable computation times. Age period considered: 65-90. Mortality varies a lot with pathologies causing entry into dependency. Guibert and Planchet 2017 PARTY, 8-13 January 2017 12/20

Estimated Transition Probabilities 1.00 0.75 p11(s,t) 0.50 0.25 Age (s) 65 69 75 79 85 70 80 90 Age (t) Figure: p 11 (s, t) Guibert and Planchet 2017 PARTY, 8-13 January 2017 13/20

Estimated Transition Probabilities 0.15 Age (s) 65 69 75 79 85 0.10 p12(s,t) 0.05 0.00 70 80 90 Age (t) Figure: p 12 (s, t) Guibert and Planchet 2017 PARTY, 8-13 January 2017 13/20

Premiums Calculation - Markov vs. non-markov estimation Principle of equivalence for a s-year old insured a 1 (s; Z (s)) = A 1 (s; Z (s)) Annual payments. B = 1 Eur, v = (1 + 0.01) 1, Age limit= 90. 0.07 Approach Markov Non Markov 0.06 Price 0.05 0.04 65 70 75 80 85 Age Figure: Comparison of Premiums. Guibert and Planchet 2017 PARTY, 8-13 January 2017 14/20

Results for p 11 (s, t) and p 12 (s, t) with non-markov specification S = {65, 67,..., 85} and, for each s, T s = {s + 1, s + 2,..., 90}. Link tested for the GLM specification: identity, logit, clog-log. Set of factors tested: the arrival age t, t 2 and t 3, the gender, the generation, and their interactions with age s and s 2. Model selection: QIC, log-lik and the number of parameters. Validation: Graphical analysis of residuals. Best model specifications p 11 (s, t): clog-log (θ ist) s ( t + t 2 + Gender ) + s 2 (t + Gender) + s t 3 1 {t s>3}. p 12 (s, t): θ ist s t + t Gender + s 2. The gender effect is important and increases with age. Surprisingly, the generation has no real effect. Possible explanation: the length of our data. Guibert and Planchet 2017 PARTY, 8-13 January 2017 15/20

Fitted values for p 12 (s, t) Age (s) = 67 Age (s) = 71 0.075 0.050 0.025 0.000 p12(s,t) Age (s) = 77 Age (s) = 81 0.075 0.050 0.025 Sex F 0.000 M 70 75 80 85 90 70 75 80 85 90 Age (t) Figure: p 12 (s, t) - Predicted transition probabilities for men and women with associated 95% prediction intervals. Guibert and Planchet 2017 PARTY, 8-13 January 2017 16/20

Results for p 11 (t), p 12 (t) and p 23 (t) with Markov specification t {65, 66,..., 90}. Working covariance matrix: independence and autoregressive(ar-i) Same analysis and set of factors. Best model specifications p 11 (t): clog-log (θ it) t + t 2 + t 3 + Gender. p 12 (t): θ it t + t 2 + t 3. p 23 (t): clog-log (θ it) t + t 2 + t 3 + t 4 + t 5 + Gender. The models estimated with independent and autoregression working covariance matrices give similar results. The gender effect is significant, but remains very small, whereas it s seems to be higher if we consider residuals for p 11 (t) and p 23 (t). The generation has no real effect. Guibert and Planchet 2017 PARTY, 8-13 January 2017 17/20

Actuarial Application - Premiums Comparison 0.06 Premium 0.05 0.04 Gender F M 65 70 75 80 85 Age Figure: Comparison of Premiums for men and women. Solid line: Markov. Dotted line: Non-Markov. Guibert and Planchet 2017 PARTY, 8-13 January 2017 18/20

Conclusion New dynamic regression approach based on pseudo-values for the transition probabilities of a non-markov illness-death model. flexible methodologies, the effects of covariates are directly estimated on pseudo-values more interpretable. Measure the bias induced by the use of a Markov specification. New researches are need to check models performances and analyze the asymptotic properties of our estimators. Guibert and Planchet 2017 PARTY, 8-13 January 2017 19/20

References I Aalen, O. O. and Johansen, S. (1978). An Empirical Transition Matrix for Non-Homogeneous Markov Chains Based on Censored Observations. Scandinavian Journal of Statistics, 5(3), 141 150. Andersen, P. K., Klein, J. P., and Rosthøj, S. (2003). Generalised linear models for correlated pseudo-observations with applications to multi-state models. Biometrika, 90(1), 15 27. de Uña-Álvarez, J. and Meira-Machado, L. (2015). Nonparametric estimation of transition probabilities in the non-markov illness-death model: A comparative study. Biometrics. Fong, J. H., Shao, A. W., and Sherris, M. (2015). Multistate Actuarial Models of Functional Disability. North American Actuarial Journal, 19(1), 41 59. Graw, F., Gerds, T. A., and Schumacher, M. (2009). On pseudo-values for regression analysis in competing risks models. Lifetime Data Analysis, 15(2), 241 255. Guibert, Q. and Planchet, F. (2016). Non-Parametric Inference of Transition Probabilities Based on Aalen-Johansen Integral Estimators for Acyclic Multi-State Models: Application to LTC Insurance. Submit. Haberman, S. and Pitacco, E. (1998). Actuarial Models for Disability Insurance. Chapman and Hall/CRC, 1 edition. Helms, F., Czado, C., and Gschlößl, S. (2005). Calculation of LTC Premiums Based on Direct Estimates of Transition Probabilities. ASTIN Bulletin: The Journal of the International Actuarial Association, 35(02), 455 469. Levantesi, S. and Menzietti, M. (2012). Managing longevity and disability risks in life annuities with long term care. Insurance: Mathematics and Economics, 50(3), 391 401. Liang, K.-Y. and Zeger, S. L. (1986). Longitudinal data analysis using generalized linear models. Biometrika, 73(1), 13 22. Nicolaie, M. A., van Houwelingen, J. C., de Witte, T. M., and Putter, H. (2013). Dynamic Pseudo-Observations: A Robust Approach to Dynamic Prediction in Competing Risks. Biometrics, 69(4), 1043 1052. Pan, W. (2001). Akaike s information criterion in generalized estimating equations. Biometrics, 57(1), 120 125. Pritchard, D. J. (2006). Modeling disability in long-term care insurance. North American Actuarial Journal, 10(4), 48 75. Tsai, W.-Y., Jewell, N. P., and Wang, M.-C. (1987). A note on the product-limit estimator under right censoring and left truncation. Biometrika, 74(4), 883 886. Guibert and Planchet 2017 PARTY, 8-13 January 2017 20/20