On the existence of maximum likelihood estimators in a Poissongamma HGLM and a negative binomial regression model

Similar documents
Unconditional Distributions Obtained from Conditional Specification Models with Applications in Risk Theory

Counts using Jitters joint work with Peng Shi, Northern Illinois University

On the Importance of Dispersion Modeling for Claims Reserving: Application of the Double GLM Theory

Simulation of truncated normal variables. Christian P. Robert LSTA, Université Pierre et Marie Curie, Paris

Multivariate negative binomial models for insurance claim counts

Pricing and Risk Analysis of a Long-Term Care Insurance Contract in a non-markov Multi-State Model

Parameter Estimation

Comparison of Approximations for Compound Poisson Processes

Invariant HPD credible sets and MAP estimators

Credibility Theory for Generalized Linear and Mixed Models

Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

PQL Estimation Biases in Generalized Linear Mixed Models

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes:

DEPARTMENT MATHEMATIK SCHWERPUNKT MATHEMATISCHE STATISTIK UND STOCHASTISCHE PROZESSE

f(x θ)dx with respect to θ. Assuming certain smoothness conditions concern differentiating under the integral the integral sign, we first obtain

GB2 Regression with Insurance Claim Severities

Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session CPS014) p.4149

Multivariate count data generalized linear models: Three approaches based on the Sarmanov distribution. Catalina Bolancé & Raluca Vernic

Consistent Bivariate Distribution

Least Absolute Shrinkage is Equivalent to Quadratic Penalization

AN EXTREME-VALUE ANALYSIS OF THE LIL FOR BROWNIAN MOTION 1. INTRODUCTION

The Credibility Estimators with Dependence Over Risks

Applying the proportional hazard premium calculation principle

Lawrence D. Brown* and Daniel McCarthy*

H-LIKELIHOOD ESTIMATION METHOOD FOR VARYING CLUSTERED BINARY MIXED EFFECTS MODEL

Lecture 1 October 9, 2013

arxiv: v2 [stat.me] 1 Jan 2018

Poisson Regression. Ryan Godwin. ECON University of Manitoba

On the Irreducibility of the Commuting Variety of the Symmetric Pair so p+2, so p so 2

A finite mixture of bivariate Poisson regression models with an application to insurance ratemaking

Monotonicity and Aging Properties of Random Sums

Estimation of parametric functions in Downton s bivariate exponential distribution

Using Estimating Equations for Spatially Correlated A

ABC methods for phase-type distributions with applications in insurance risk problems

Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed

Summary of Extending the Rank Likelihood for Semiparametric Copula Estimation, by Peter Hoff

Generalized Linear Models for a Dependent Aggregate Claims Model

Graduate Econometrics I: Maximum Likelihood I

SPRING 2007 EXAM C SOLUTIONS

Effective Sample Size in Spatial Modeling

Type II Bivariate Generalized Power Series Poisson Distribution and its Applications in Risk Analysis

Focusing on structural assumptions in regression on functional variable.

Tail Dependence of Multivariate Pareto Distributions

On Simulations form the Two-Parameter. Poisson-Dirichlet Process and the Normalized. Inverse-Gaussian Process

36-720: The Rasch Model

Frailty Models and Copulas: Similarities and Differences

Chapter 4: Asymptotic Properties of the MLE

Ph.D. Qualifying Exam Monday Tuesday, January 4 5, 2016

arxiv: v1 [math.st] 26 Jun 2011

Simulation of Tail Dependence in Cot-copula

On a simple construction of bivariate probability functions with fixed marginals 1

ON THE BIVARIATE GENERALIZED POISSON DISTRIBUTION. University "Ovidius" Constanta, Romania

Supplement to A Hierarchical Approach for Fitting Curves to Response Time Measurements

Asymptotics of random sums of heavy-tailed negatively dependent random variables with applications

Generalized linear mixed models (GLMMs) for dependent compound risk models

P n. This is called the law of large numbers but it comes in two forms: Strong and Weak.

Neural, Parallel, and Scientific Computations 18 (2010) A STOCHASTIC APPROACH TO THE BONUS-MALUS SYSTEM 1. INTRODUCTION

Ratemaking with a Copula-Based Multivariate Tweedie Model

A MIXTURE INTEGER-VALUED GARCH MODEL

Bonus Malus Systems in Car Insurance

1. (Regular) Exponential Family

Inference for the Positive Stable Laws Based on a Special Quadratic Distance

STA 2201/442 Assignment 2

A Study of the Impact of a Bonus-Malus System in Finite and Continuous Time Ruin Probabilities in Motor Insurance

A MIXTURE INTEGER-VALUED GARCH MODEL

LINEAR FLOW IN POROUS MEDIA WITH DOUBLE PERIODICITY

Introduction to Probabilistic Machine Learning

Course 4 Solutions November 2001 Exams

On the coverage probability of the Clopper-Pearson confidence interval

Polynomial approximation of mutivariate aggregate claim amounts distribution

Rank Regression with Normal Residuals using the Gibbs Sampler

Statistics 135 Fall 2008 Final Exam

Estimating the efficient price from the order flow : a Brownian Cox process approach

Subject CS1 Actuarial Statistics 1 Core Principles

Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011

Bayesian Nonparametric Regression for Diabetes Deaths

arxiv:math/ v1 [math.nt] 27 Feb 2004

Maximum Likelihood Estimation

An adaptive kriging method for characterizing uncertainty in inverse problems

Convergence at first and second order of some approximations of stochastic integrals

Peng Shi. Actuarial Research Conference August 5-8, 2015

Model Selection for Semiparametric Bayesian Models with Application to Overdispersion

ISSN X Quadratic loss estimation of a location parameter when a subset of its components is unknown

Randomly Weighted Sums of Conditionnally Dependent Random Variables

Statistics Ph.D. Qualifying Exam: Part I October 18, 2003

Clinical Trial Design Using A Stopped Negative Binomial Distribution. Michelle DeVeaux and Michael J. Kane and Daniel Zelterman

COS513 LECTURE 8 STATISTICAL CONCEPTS

Principal Component Analysis Applied to Polytomous Quadratic Logistic

Parameter Estimation

Linear Models A linear model is defined by the expression

arxiv: v1 [stat.ap] 16 Nov 2015

Statistics 3858 : Maximum Likelihood Estimators

Flexible Spatio-temporal smoothing with array methods

MINIMUM EXPECTED RISK PROBABILITY ESTIMATES FOR NONPARAMETRIC NEIGHBORHOOD CLASSIFIERS. Maya Gupta, Luca Cazzanti, and Santosh Srivastava

Joint work with Nottingham colleagues Simon Preston and Michail Tsagris.

Generalized Linear Models (GLZ)

RMSC 2001 Introduction to Risk Management

Approximating the Conway-Maxwell-Poisson normalizing constant

Maximum likelihood estimation in the logistic regression model with a cure fraction

Transcription:

Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin Session CPS027) p.6433 On the existence of maximum likelihood estimators in a Poissongamma HGLM and a negative binomial regression model GNING Lucien Diégane Laboratoire d Etudes et de Recherches en Statistiques et Développement LERSAD), UFR Sciences Appliquées et echnologies, Université Gaston Berger. Saint-Louis B.P. 234), Sénégal E-mail: luciendesgning@yahoo.fr Pierre-Loti-Viaud, Daniel Universit Pierre et Marie Curie, LSA 4 place Jussieu Paris F-75252, Cedex 05), France E-mail: daniel.pierre-loti-viaud@upmc.fr Introduction Poisson-gamma HGLM are members of the hierarchical generalized linear model family see [8]), which is an extension of generalized linear models and generalized linear mixed models. HGLM are applied particularly to longitudinal data that are generally correlated. An important motivation comes from the use of Poisson-gamma HGLM to describe numbers of claim in order to evaluate an a posteriori premium in non-life insurance for more details see [2] or [3]). It is then important to establish the existence of MLE and possibly the uniqueness for a credible foundation for the method. In this paper, we are interested in the existence of MLE of parameters in a Poissongamma HGLM where the regressors are not dependent of time. In this sense, our best achievement is the finding of a sufficient condition for the existence of MLE in this type of Poisson-gamma HGLM. hen, we remark that this model is linked to a negative binomial regression model in which the index parameter is unknown. By doing so, the condition obtained is also sufficient for the existence of MLE of parameters in the former as well as in the latter model. A necessary and sufficient condition for the existence and uniqueness of MLE of the index parameter in a negative binomial sample has been established in [9], we show that our condition is a natural generalization of this latter. he remainder of this paper is organized as follows. In the first section, we introduce Poisson-gamma HGLM and define notation that will be useful later. In the second section, we give our main result and preliminary results that we will be used to prove the main result. he paper concludes by proposing a conjecture. Indeed, it seems quite likely that our condition is also a necessary and sufficient condition for the existence and the uniqueness of MLE in both models considered, the Poisson-gamma HGLM where the regressors are not dependent of time and the negative binomial regression in the case of an unkwown index parameter. Poisson-gamma HGLM We may work here with an insurance portfolio composed of n policyholders or insureds, each insured being observed during periods. Let Y kt, be the number of reported claims for insured k during period t, and x 1,..., x J, be the covariates which represent the observable non random characteristics of insureds, taking x ktj, k = 1,..., n, t = 1,...,, j = 1,..., J, as the observed value of the j th covariate for the insured k at the period t. Furthermore, let for each insured k, Θ k be a non-observed real random characteristic. We suppose that:

Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin Session CPS027) p.6434 H1) each insured is represented by the random vector Θ k Y k1... Y k ), and these vectors are independent; H2) given Θ k, the numbers of claims Y k1,..., Y k are independent, and satisfy: 1) L Y kt / Θk =θ k ) = Pλkt θ k ), k = 1,..., n, t = 1,...,, where log λ kt = x kt β, and β = β 1... β J ) is the regression parameters vector for explanatory variables x kt = x kt1... x ktj ). Moreover, we suppose that: H3) LΘ k ) = γa, a), k = 1,..., n. At the following of this work, we take x kt =x k and λ kt =λ k for k = 1,..., n, t = 1,...,. Now, let X R, be the n J matrix defined by: X R = x 1 x 2. x n, and F R the subspace ImX R ). We suppose that X R is a full rank matrix. After that, we note that the defined model satisfies to following constraint: 2) log λ F R, where λ = λ 1... λ n ). Existence results for MLE In this section, we state a sufficient condition for the existence of MLE of the parameters in a Poissongamma HGLM where λ kt = λ k. As we will prove it, this condition is also sufficient for the existence of MLE of the parameters in a negative binomial model in which the index parameter is unknown. First, we give the expression of the log-likelihood function of the model. Next, we give preliminary results that we will use to prove the already mentioned condition. Log-likelihood function Given Θ k, the periodical numbers of claims Y k1,..., Y k are independent. he joint probability function of Y k1,..., Y k is thus given by: PY k1 =y k1,..., Y k =y k )= = + 0 + 0 PY k1 =y k1,..., Y k =y k /Θ k = θ k )fθ k )dθ k PYkt =y kt /Θ k = θ k ) ) fθ k )dθ k, t=1 where f is the density function of the gamma distribution. With straightforward calculation, we deduce from this expression that the likelihood function of the model is given by: 3) L y a, λ) = n ) Γa+sk ) a a ) sk ) λk Γa) t=1 y, kt! a+ λ k a+ λ k

Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin Session CPS027) p.6435 where s k = t=1 y kt, k = 1,..., n. As a result, the log-likelihood function of the model minus some constant is given by: l y a, β) = s k x k β ) a + s k log a + e x β) ) k + a log a + ) Γa + sk ) log. Γa) herefore, using the properties of the function Γ see [1]), we rewrite this log-likelihood function as: l y a, β)= s k x k β ) a + s k log a + e x β) ) k + a log a + s k 1 j=0 loga + j), where, for k = 1,..., n, s k 1 j=0 loga + j) = 0, if s k =0. Let now r = 1 a, and Φ the function defined on ]0, + [ by: 4) Φr, β) = l y 1 r, β) = s k x k β 1 r + s ) k log 1 + r e x β)) k + s k 1 j=0 log 1 + rj ). In the following of this paper we take S k = t=1 Y kt, k = 1,..., n, S = S 1... S n ) and we denote diagu k ), the diagonal matrix with elements the u k, k = 1,..., n. Preliminary results We obtain the following results. Lemma 1 Let K be a compact space in R J, then Φr, β) converges uniformly on K to the function β Φ0, β) defined by: 5) Φ0, β) = sk x k β e x k β), as r 0. Furthermore, the function β Φ0, β) is the log-likelihood function minus some constant of a Poisson regression model for clustered data. Lemma 2 he function β Φ0, β) is strictly concave on R J, and if there exists δ F R S k + δ k > 0 for each k = 1,..., n, then there exists a unique ˆβ0) R J satisfying: such that Φ0, ˆβ0)) = max Φ0, β). β RJ Lemma 3 For r > 0, the function β Φr, β) is strictly concave on R J, and if there exists δ F R such that S k + δ k > 0 for each k = 1,..., n, then there exists a unique ˆβr) R J satisfying: Φr, ˆβr)) = max Φr, β). β RJ Lemma 4 If there exists δ F such that S R k +δ k > 0 for each k = 1,..., n, then for all neighborhood V of ˆβ0), and for enough small r, the function β Φr, β) has a local maximum on V which is ˆβr). heorem 1 If there exists δ F R such that S k + δ k > 0 for each k = 1,..., n, then: lim ˆβr) = ˆβ0). r 0

Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin Session CPS027) p.6436 heorem 2 If there exists δ F R neighborhood of 0, we have: such that S k + δ k > 0 for each k = 1,..., n, then for r > 0 in a ˆβr) = ˆβ0) + rd ˆβ0) + or), where: D ˆβ0) = X diag ) ex k ˆβ0) 1 )X R R X diag 2 e 2x k ˆβ0) )1 R n X diag ex k ˆβ0) )S ), R and 1 n = 1... 1) R n. Main result Now, we state in the theorem below the sufficient condition for the existence of MLE of the model. We denote by: ˆλ k 0) = e x k ˆβ0), k = 1,..., n, when ˆβ0) exists. heorem 3 If the following conditions C1) and C2) are satisfied, then the MLE of the parameters a and β exists. C1): If there exists δ F such that S R k + δ k > 0 for each k = 1,..., n. C2): 1 n n 1 t=1 Y kt ˆλ k 0) ) 2 1 1 n n 1 t=1 Y kt) ) > 0. Remark 1 If =1, then the random variables are Y kt i.i.d. In this case C2) become the empirical variance is strictly greater than the empirical mean which is a necessary and sufficient condition for the existence and the uniqueness of the MLE of the index parameter in a negative binomial sample see [9]). Remark 2 he condition C2) is interpreted as the difference between the residual variance of the Poisson regression model which is not hierarchical) associated with our model, and the empirical mean divided by the number of periods. Link to the negative binomial regression model We complete the presentation of our results by presenting the link between a Poisson-gamma HGLM where λ kt = λ k and a negative binomial regression model. Proposition 1 he Poisson-gamma HGLM with λ kt = λ k, corresponds to a negative binomial regression model associated at the variables S k, k = 1,..., n with a logarithmic link and an unknown index parameter. Corollary 1 It comes from the proposition 1 that, if C1) and C2) are satisfied, then MLEs of all parameters including the index parameter ) of a negative binomial regression model with logarithmic link exists. Conclusion he Remark 1 and the Corollary 1 encourage us to declare the following conjecture.

Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin Session CPS027) p.6437 Conjecture. he MLE s of the parameters a and β of the Poisson-gamma HGLM model withλ kt = λ k, and that of the negative binomial regression, exists and are unique, if and only if C1) and C2) are satisfied. References [1] Abramowitz M., Stegun I. 1972). Handbook of Mathematical Functions. Dover Publications, New York. [2] Boucher J., Denuit M., Guilln M.2008). Models of Insurance claim counts with time dependance based on generalization of Poisson and negative binomial distributions. Casuality Actuarial Society, 21), 136 161. [3] Dionne G., Vanasse Ch. 1989). A generalization of automobile insurance rating models: the negative binomial binomial distribution with a regression component. Astin Bulletin 192), 199 212. [4] Gning L., Utilisation des mlanges de lois de Poisson et de lois binomiales ngatives pour tablir des tarifs a priori et a posteriori en assurance non vie. hse de mathmatiques en prparation. [5] Gning L., Pierre-Loti-Viaud, D. 2010). Sur les estimateurs du maximum de vraisemblance dans les modles multiplicatifs de Poisson et binomiale ngative. Afrika Statistika 513), 297 305. [6] Hilbe J.M. 2007). Negative Binomial Regression. Cambridge University Press, Cambridge. [7] Johnson N., Kotz S., Balakrishnan N. 1996). Discrete Multivariate Distributions 2 e dition).wiley, New York. [8] Lee Y., Nelder J. 1996). Hierarchical Generalized Linear Models. Journal of Royal Statistical Society, Series B, Methodological) 58, 619 678. [9] Levin B., Reeds J. 1977). Compound multinomial likelihoods functions are unimodal: proof a conjecture of I.J.Good. he Annals of Statistics. 5, 79 87. [10] Santner J., Duffy E. 1989). he Statistical Analysis of Discrete Data. Springer-Verlag, New York. [11] Spivak M.1979). A comprehensive introduction to differential geometry. Publish or Perish deuxime dition, Houston. [12] Zeger S., Liang K. 1986). Longitudinal Data Analysis using generalized linear models. Biometrika, 73, 13 22. ABSRAC We are interested in the existence of maximum likelihood estimators MLE) for all parameters in a hierarchical generalized linear model HGLM) of Poisson-gamma type, and in a negative binomial regression model. hese models are widely used in pricing non-life insurance see [2], [3], [6]). However, for these models, it seems that the existence of MLE has not been studied in literature. In this paper, we address this issue for first time for a particular sub-model, indeed, we give a sufficient condition for the existence of MLE of parameters in the Poisson-gamma HGLM where the regressors are not dependent of time. We also show that the condition obtained is sufficient for the existence of MLE of parameters in the negative binomial regression model in the case of an unknown index parameter. In this latter case, our condition appears as a natural extension of the necessary and sufficient condition known for the existence and uniqueness of MLE of the index parameter for a sample of negative binomial distirbution.