The LmB Conferences on Multivariate Count Analysis
|
|
- Zoe White
- 5 years ago
- Views:
Transcription
1 The LmB Conferences on Multivariate Count Analysis Title: On Poisson-exponential-Tweedie regression models for ultra-overdispersed count data Rahma ABID, C.C. Kokonendji & A. Masmoudi Address: Besançon:
2 Some related works: Abid et al. (2018a). Geometric dispersion models with quadratic v-functions. Submission Needing Revision. Abid et al. (2018b). Geometric Tweedie regression models for continuous and semicontinuous data with variation phenomenon. Submitted for publication. Abid et al. (2018c). On Poisson-exponential-Tweedie regression models for ultra-overdispersed count data. To submit asap. Abid et al. (2018d). Multivariate Poisson-exponential-Tweedie regression models for the analysis of maintenance building data. Work in progress. Rahma ABID, Poisson-exponential-Tweedie regression models 2
3 Outline: 1 Introduction: Motivations 2 Poisson-exponential-Tweedie (PET) models (= Geometric Poisson-Tweedie) 3 PET regression models 4 Simulation studies and applications 5 Conclusion & Perspectives Rahma ABID, Poisson-exponential-Tweedie regression models 3
4 1. Introduction: Motivations The overdispersion phenomenon is frequent and can be induced by the zero-inflation ones, these phenomena are defined w.r. to Poisson; see, e.g., ab c d. Count models have been built through compounding and mixture of Poisson distribution. 1) How to do when the degree or level of overdispersion is very high? 2) Should we relativize its measure with respect to another reference count distribution than Poisson? 3) How to built a new family of ultra-overdispersed count models? a Hinde, J. and Demétrio, C.G.B. (1998). Overdispersion: Models and Estimation. Associacao Brasileira de Estatistica, Sao Paulo. b Kokonendji, C.C., Dossou-Gbete, S. and Demétrio, C.G.B. (2004). Some discrete exponential dispersion models: Poisson-Tweedie and Hinde-Demetrio classes. Statistics and Operations Research Transactions, 28, c Kokonendji, C.C., Demétrio, C.G.B. and Zocchi, S.S. (2007). On Hinde-Demétrio regression models for overdispersed count data. Statistical Methodology 4, d Bonat, W.H., Jørgensen, B., Kokonendji, C.C., Hinde, J. & Demétrio, C.G.B. (2018). Extended Poisson-Tweedie: properties and regression models for count data. Statistical Modelling 18: Rahma ABID, Poisson-exponential-Tweedie regression models 4
5 Example of application Application to the repairable systems in reliability e : Data of number of buildings subject to maintenance (e.g., plumbing, roof, heating, cooling system, etc.) until the year The buildings are belonging to one system being 46 years old: (Y 1,..., Y i,..., Y 46 ). Dispersion index = / = : ultra-overdispersed. Some relative references: - Jørgensen, B. & Kokonendji, C.C. (2011). Dispersion models for geometric sums. Brazilian Journal of Probability and Statistics 25: Jørgensen, B. and Kokonendji, C.C. (2016). Discrete dispersion models and their Tweedie asymptotics. AStA Advances in Statistical Analyses 100: e Yeoeman, A. (1987). Forecasting Building Maintenance Using The Weibull Process, M.S.Thesis, University of Missouri-Rolla, United States. Rahma ABID, Poisson-exponential-Tweedie regression models 5
6 Geometric sums of count random variables A geometric sum of Poisson-Tweedie models defined by Y = G PT l, where PT 1, PT 2,... are i.i.d. such a Poisson-Tweedie PT random variable and G Geom(q). Decomposition of number of buildings maintenance actions Y i having age i as a geometric sum of maintenance actions per building. l=1 Rahma ABID, Poisson-exponential-Tweedie regression models 6
7 Geometric sums of count random variables A geometric sum of Poisson-Tweedie models defined by Y = G PT l, where PT 1, PT 2,... are i.i.d. such a Poisson-Tweedie PT random variable and G Geom(q). Decomposition of number of buildings maintenance actions Y i having age i as a geometric sum of maintenance actions per building. For p {0} [1, ), the class of Poisson-Tweedie (PTw p ( m, φ)): l=1 Z Tw p ( m, φ) and PT Z Poisson(Z) PT PTw p ( m, φ) has moments EPT = m and VarPT = m + φ m p. Given the expectation m = EY, its variance is of the form VarY = m + m 2 + φm p. Rahma ABID, Poisson-exponential-Tweedie regression models 6
8 2.Poisson-exponential-Tweedie (PET) models (= Geometric Poisson-Tweedie) The class of Exponential-Poisson-Tweedie: X Exp(1), [Y X] Z Poisson(Z) and Z Tw p (Xm, X 1 p φ). (1) The class of Poisson-exponential-Tweedie (PETw p (m, φ)): Y Z Poisson(Z), Z Tw p (Xm, X 1 p φ) and X Exp(1). (2) Proposition (Abid et al., 2018c) Let Y 1 and Y 2 two random variables defined by (1) and (2), respectively. Then (i) Y 1 and Y 2 have the same distributions. (ii) VarY 1 = m + m 2 + φm p. Rahma ABID, Poisson-exponential-Tweedie regression models 7
9 Table: Summary of PET models with support S p = N of distributions. Type(s) of PET = Geometric PT p Type(s) of Tweedie Geometric Hermite p = 0 Gaussian [Do not exist] 0 < p < 1 [Do not exist] Geometric Neyman Type A p = 1 Poisson Geometric Poisson compound Poisson 1 < p < 2 Gamma compound Poisson Geometric Pólya-Aeppli p = 3/2 Non-central gamma Geometric negative binomial p = 2 Gamma Geometric Poisson positive stable p > 2 Positive stable Geometric Poisson-inverse Gaussian p = 3 Inverse Gaussian Rahma ABID, Poisson-exponential-Tweedie regression models 8
10 Table: Summary of PET models with support S p = N of distributions. Type(s) of PET = Geometric PT p Type(s) of Tweedie Geometric Hermite p = 0 Gaussian [Do not exist] 0 < p < 1 [Do not exist] Geometric Neyman Type A p = 1 Poisson Geometric Poisson compound Poisson 1 < p < 2 Gamma compound Poisson Geometric Pólya-Aeppli p = 3/2 Non-central gamma Geometric negative binomial p = 2 Gamma Geometric Poisson positive stable p > 2 Positive stable Geometric Poisson-inverse Gaussian p = 3 Inverse Gaussian Y PETw p (m, φ) has pmf P(Y = y) = 0 0 exp{ z x}z y Tw p (mx, φx 1 p )(z)dzdx. y! No closed-form available Approximation by Monte Carlo integration and Tweedie simulations rtweedie() in R (Dunn, 2013). Estimation and inference based on the likelihood function is difficult Model selection: estimation of parameters by regression. Rahma ABID, Poisson-exponential-Tweedie regression models 8
11 3. PET regression models Dispersion and zero-inflation indexes w.r. to Poisson: P-DI = VarY EY andf P-ZI = EY + log P(Y = 0). f Other definitions: P-DI = (VarY EY)/EY and P-ZI = 1 + log P(Y = 0)/EY. Rahma ABID, Poisson-exponential-Tweedie regression models 9
12 3. PET regression models Dispersion and zero-inflation indexes w.r. to Poisson: P-DI = VarY EY andf P-ZI = EY + log P(Y = 0). Dispersion and zero-inflation indexes w.r. to negative binomial: NB-DI = VarY EY + (EY) 2 and NB-ZI = log(1 + EY) + log P(Y = 0). Heavy tail index is independent of the reference model: HT = P(Y = y + 1) P(Y = y) for y. Proposition (Abid et al., 2018c) PET is overdispersed and zero-inflated w.r. to P and NB, respectively. f Other definitions: P-DI = (VarY EY)/EY and P-ZI = 1 + log P(Y = 0)/EY. Rahma ABID, Poisson-exponential-Tweedie regression models 9
13 Dispersion indexes of PET w.r. to Poisson and NB Figure: Dispersion indexes of PET distribution as a function of m by dispersion and power parameters. Rahma ABID, Poisson-exponential-Tweedie regression models 10
14 Zero-inflation indexes of PET w.r. to Poisson and NB Figure: Zero-inflation indexes of PET distribution as a function of m by dispersion and power parameters. Rahma ABID, Poisson-exponential-Tweedie regression models 11
15 Estimation and inference: Quasi likelihood approach Consider a cross-sectional data set, (y i, x i ), i = 1,..., n, where y i are i.i.d realizations of Y i PETw p (m i, φ), x i and β are (Q 1) vectors of known covariates and unknown regression parameters, respectively. EY i = m i = exp(x i β) VarY i = m i + m 2 i + φm p i = V i. Rahma ABID, Poisson-exponential-Tweedie regression models 12
16 Estimation and inference: Quasi likelihood approach Consider a cross-sectional data set, (y i, x i ), i = 1,..., n, where y i are i.i.d realizations of Y i PETw p (m i, φ), x i and β are (Q 1) vectors of known covariates and unknown regression parameters, respectively. EY i = m i = exp(x i β) VarY i = m i + m 2 i + φm p i = V i. Models with m 2 p m 1 p < φ < 0 are permitted May be no specific probability distribution Rahma ABID, Poisson-exponential-Tweedie regression models 12
17 Dispersion indexes of PET w.r. to Poisson and NB for φ < 0 Figure: Dispersion indexes for PET distribution by negative dispersion and power parameters. Rahma ABID, Poisson-exponential-Tweedie regression models 13
18 Dominant features of PET models Table: Reference models and dominant features by dispersion and power parameter values in respect to the Poisson and negative binomial models. Reference PET Dominant features Dispersion Power Poisson/negative binomial Equi/Equi - Geometric Hermite Over, under φ 0 p = 0 Geometric Neyman Type A Over, under, ZI φ 0 p = 1 Geometric Poisson compound Poisson Over, under, ZI φ 0 1 < p < 2 Geometric Pólya-Aeppli Over, under, ZI φ 0 p = 1.5 Geometric negative binomial Over, under φ 0 p = 2 Geometric Poisson positive stable Over, HT φ > 0 p > 2 Geometric Poisson-inverse Gaussian Over, HT φ > 0 p = 3 Rahma ABID, Poisson-exponential-Tweedie regression models 14
19 Estimating function approach The quasi-score function for β: n m i ψ β (β, γ) = V 1 i (y i m i ),..., β 1 i=1 n m i V 1 i (y i m i ) β. Q The Pearson estimating function for variance parameters γ = (φ, p): n ψ γ (β, γ) = V 1 n i φ {(y V 1 i m i ) 2 i V i }, p {(y i m i ) 2 V i }. i=1 The chaser algorithm (Jørgensen & Knudsen, 2004) is defined by i=1 i=1 β (i+1) = β (i) S 1 β ψ β(β (i), φ (i) ) with S βjk γ (i+1) = γ (i) αs 1 γ ψ γ (β (i+1), γ (i) ) = E ( ψ β βj (β, φ) ) and S γjk = n k i=1 V 1 i γ j V V i 1 i V γ i k Rahma ABID, Poisson-exponential-Tweedie regression models 15
20 4.1 Simulation studies The expectation and the variance of the PET random variable are given by m i = exp(β 0 + β 1 x 1i + β 2 x 2i ) and V i = m i + m 2 i + φm p, i where x 1 and x 2 are sequences from 1 to 1 with length equals to the sample size. The regression coefficients were fixed at the values, β 0 = 1, β 1 = 1 and β 2 = 0.9. We use different sample sizes (n = 500, 1000 and 5000) generating 1000 data sets in each case. We considered three values of the Tweedie power parameter p = 1.5, 2, 3 combined with three values of the dispersion parameter φ = 0.5, 1, 1.5. Rahma ABID, Poisson-exponential-Tweedie regression models 16
21 Average bias for parameters Figure: Average bias for each parameter by sample size and simulation scenarios. Rahma ABID, Poisson-exponential-Tweedie regression models 17
22 Confidence intervals for parameters Figure: Confidence intervals for each parameter by simulation scenarios. Rahma ABID, Poisson-exponential-Tweedie regression models 18
23 4.2 Three Applications Accidents of private cars in Switzerland (Klugman, 2004): Data analysed in Aryuyuen and Bodhisuwan (2013) using the NB-GE distribution. P-DI = 1.154; NB-DI = P-overdispersed; NB-equidispersed. P-ZI = 0.154; NB-ZI = P-zero-inflated; NB-zero-deflated. Rahma ABID, Poisson-exponential-Tweedie regression models 19
24 4.2 Three Applications Accidents of private cars in Switzerland (Klugman, 2004): Data analysed in Aryuyuen and Bodhisuwan (2013) using the NB-GE distribution. P-DI = 1.154; NB-DI = P-overdispersed; NB-equidispersed. P-ZI = 0.154; NB-ZI = P-zero-inflated; NB-zero-deflated. Table: Parameter estimates for different models. Number of Observed Fitting distributions accidents frequencies Poisson NB NB-GE PT PET Parameters λ = r = r = p = p = estimates q = α = φ = φ = β = m = m = Chi-squares p-value < Rahma ABID, Poisson-exponential-Tweedie regression models 19
25 Accident occurrence in car insurance on Tunisian data: Data from an insurer who operates in the market for automobile insurance in Tunisia. P-DI = 9.100; NB-DI = NB-overdispersed. P-ZI = 0.268; NB-ZI = NB-zero-inflated. Table: Parameter estimates and standard errors (SE) for PET and PT models; paic for models. Parameter PET PT Intercept (0.111) (0.111) Car age (0.003) (0.003) Car power (0.009) (0.009) Driver age (0.002) (0.002) φ (0.093) (0.096) p (0.149) (0.075) paic Rahma ABID, Poisson-exponential-Tweedie regression models 20
26 Buildings maintenance data (Yeoeman, 1987): Data on the number of occurrences of repairs for buildings. The number of maintenance for all buildings is available during four years: 1982, 1983, 1984, For a given year i, the total number of buildings maintenance Y i in the i-th time frame follows the PET model PETw p (m i, φ), i = 1,..., 46. Table: Estimated dispersion and zero-inflation indexes of datasets. Dataset P-DI NB-DI P-ZI NB-ZI No 1 (1982) No 2 (1983) No 3 (1984) No 4 (1985) Rahma ABID, Poisson-exponential-Tweedie regression models 21
27 Table: Parameter estimates and standard errors (SE) for PET and PT indicated by italic symbols; paic for models. Parameter No 1 (1982) No 2 (1983) No 3 (1984) No 4 (1985) Intercept (0.299) (0.048) (0.298) (0.300) (0.299) (0.477) (0.298) (0.300) Age (0.010) (0.058) (0.111) (0.111) (0.010) (0.144) (0.110) (0.111) φ (0.000) (0.001) (0.001) (0.000) (0.312) (1.336) (0.372) (0.131) p (0.000) (0.035) (0.004) (0.000) (0.007) (0.014) (0.002) (0.000) paic Rahma ABID, Poisson-exponential-Tweedie regression models 22
28 5. Conclusion & Perspectives 1 Model selection in the PET to deal with count ultra-overdispersed data. 2 Negative binomial dispersion and zero-inflation indexes relativize the ultra-overdispersion and the excess of zeros. Rahma ABID, Poisson-exponential-Tweedie regression models 23
29 5. Conclusion & Perspectives 1 Model selection in the PET to deal with count ultra-overdispersed data. 2 Negative binomial dispersion and zero-inflation indexes relativize the ultra-overdispersion and the excess of zeros. 3 Multivariate version of PET sums: G 1 (Q) G k (Q) S (Q; PT) = PT l1,..., PT lk, l=1 l=1 where Q = {q ij } k is a suitable matrix of parameters. i,j=1 Independent or correlated components ( G j (Q) ) k j=1? 4 Given Y 1,..., Y n be a n-variate response vector on N d, d 1, EY i = m i = G 1 (X i β), cov(y i, Y j ) = Σ 1/2 i (ρ ij I d ) Σ 1/2 j, Σ i = diag d (m i ) + diag d (m 2 i ) + diag d (mp i ) 1/2 diag d (φ)diag d (m p i ) 1/2. Rahma ABID, Poisson-exponential-Tweedie regression models 23
30 Further references 1 Abid, R., Kokonendji, C.C. and Masmoudi, A. (2018a). Geometric dispersion models with quadratic v-functions. Submission Needing Revision. 2 Abid, R., Kokonendji, C.C. and Masmoudi, A. (2018b). Geometric Tweedie regression models for continuous and semicontinuous data with variation phenomenon. Submitted for publication. 3 Abid, R., Kokonendji, C.C. and Masmoudi, A. (2018c). On Poisson-exponential-Tweedie regression models for ultra-overdispersed count data. To submit asap. 4 Dunn, P.K. (2013). Tweedie exponential family models. version R package URL 5 Jørgensen, B. and Knudsen, S.J. (2004). Parameter orthogonality and bias adjustment for estimating functions. Scandinavian Journal of Statistics 31: Klugman, S.A., Panger, H.H. and Willmot, G.E. (2004). Loss Models: From Data to Decisions, 2nd edn. Wiley, Hoboken, NJ. 7 Tweedie, M.C.K. (1984). An index which distinguishes between some important exponential families. In Statistics: Applications and New Directions. Proceedings of the Indian Statistical Institute Golden Jubilee International Conference (J. K. Ghosh and J. Roy, eds.), pp , Indian Statistical Institute, Calcutta.... Thank You Rahma ABID, Poisson-exponential-Tweedie regression models 24
31 Geometric dispersion models (0) Let µ be a probability measure. (1) The geometric cumulant function of µ (Jørgensen & Kokonendji, 2011) is C µ (θ) = 1 1 L µ (θ) on (µ) = { θ R; 0 < L µ (θ) < }. (2) In general, the application θ C µ(θ) is not strictly monotone on (µ). -Let (µ) (µ) be an interval for which C µ is strictly monotone on (µ). -The application θ C µ(θ) is a diffeomorphism between (µ) and C µ ( (µ)) =: Φ µ. Denote ϕ µ := (C µ) 1. (3) v-function: m V µ (m) = C µ (ϕ µ )(m) on Φ µ. -Denote Φ µ := C µ( (µ)) and Φ + µ := C µ( + (µ)). Then, V µ (m) < 0 on Φ µ and V µ (m) > 0 on Φ + µ. Note: V µ = Var E 2. Rahma ABID, Poisson-exponential-Tweedie regression models 25
32 How to derive GDMs from EDMs with (µ) = (µ)? Let µ be a probability measure. (1) If there exists a probability measure ν such that C µ (θ) = K ν (θ), then (µ) = {θ (µ); C µ (θ) = K ν (θ) > 0} = (µ). ν(µ) : Prop. below. (2) If there exists a probability measure ν such that C µ (θ) = K ν (θ), then (µ) = {θ (µ); C µ (θ) = K (θ) < 0} = (µ). ν(µ)?! ν Proposition (Exponential mixtures distributions) Let µ be a probability measure and ν an infinitely divisible σ-finite positive measure. Consider F(ν) = {P(θ, ν); θ Θ(ν)}. The following assertions are equivalent: (i) For all m Φ + µ, V µ (m) = V F(ν) (m). (ii) The measure µ is an exponential mixture measure µ(dy) = e x P( γ, ν x )(dy)dx, (3) 0 with γ Θ(ν) and ν x denotes the x-th convolution of ν, that is L ν x (θ) = (L ν (θ)) x. Note: Under assumption of infinite divisibility, the corresponding exponential mixture has v-function identical to the variance function V F(ν). Rahma ABID, Poisson-exponential-Tweedie regression models 26
Extended Poisson-Tweedie: properties and regression models for count data
Extended Poisson-Tweedie: properties and regression models for count data arxiv:1608.06888v2 [stat.me] 11 Sep 2016 Wagner H. Bonat and Bent Jørgensen and Célestin C. Kokonendji and John Hinde and Clarice
More informationExtended Poisson Tweedie: Properties and regression models for count data
Extended Poisson Tweedie: Properties and regression models for count data Wagner H. Bonat 1,2, Bent Jørgensen 2,Célestin C. Kokonendji 3, John Hinde 4 and Clarice G. B. Demétrio 5 1 Laboratory of Statistics
More informationEmpirical Comparison of ML and UMVU Estimators of the Generalized Variance for some Normal Stable Tweedie Models: a Simulation Study
Applied Mathematical Sciences, Vol. 10, 2016, no. 63, 3107-3118 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ams.2016.69238 Empirical Comparison of and Estimators of the Generalized Variance for
More informationParameters Estimation Methods for the Negative Binomial-Crack Distribution and Its Application
Original Parameters Estimation Methods for the Negative Binomial-Crack Distribution and Its Application Pornpop Saengthong 1*, Winai Bodhisuwan 2 Received: 29 March 2013 Accepted: 15 May 2013 Abstract
More informationReparametrization of COM-Poisson Regression Models with Applications in the Analysis of Experimental Count Data
Reparametrization of COM-Poisson Regression Models with Applications in the Analysis of Experimental Count Data Eduardo Elias Ribeiro Junior 1 2 Walmes Marques Zeviani 1 Wagner Hugo Bonat 1 Clarice Garcia
More informationOn Equi-/Over-/Underdispersion. and Related Properties of Some. Classes of Probability Distributions. Vladimir Vinogradov
On Equi-/Over-/Underdispersion and Related Properties of Some Classes of Probability Distributions Vladimir Vinogradov (Ohio University, on leave at the Fields Institute, University of Toronto and York
More informationMultivariate Regression Models in R: The mcglm package
Multivariate Regression Models in R: The mcglm package Prof. Wagner Hugo Bonat R Day Laboratório de Estatística e Geoinformação - LEG Universidade Federal do Paraná - UFPR 15 de maio de 2018 Introduction
More informationOn Lévy measures for in nitely divisible natural exponential families
On Lévy measures for in nitely divisible natural exponential families Célestin C. Kokonendji a;, a University of Pau - LMA & IUT STID, Pau, France Mohamed Khoudar b b University of Pau - IUT STID, Pau,
More informationUsing Estimating Equations for Spatially Correlated A
Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship
More informationTABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1
TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1 1.1 The Probability Model...1 1.2 Finite Discrete Models with Equally Likely Outcomes...5 1.2.1 Tree Diagrams...6 1.2.2 The Multiplication Principle...8
More informationParametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1
Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson
More informationLecture 1: August 28
36-705: Intermediate Statistics Fall 2017 Lecturer: Siva Balakrishnan Lecture 1: August 28 Our broad goal for the first few lectures is to try to understand the behaviour of sums of independent random
More informationGauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA
JAPANESE BEETLE DATA 6 MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA Gauge Plots TuscaroraLisa Central Madsen Fairways, 996 January 9, 7 Grubs Adult Activity Grub Counts 6 8 Organic Matter
More informationThe combined model: A tool for simulating correlated counts with overdispersion. George KALEMA Samuel IDDI Geert MOLENBERGHS
The combined model: A tool for simulating correlated counts with overdispersion George KALEMA Samuel IDDI Geert MOLENBERGHS Interuniversity Institute for Biostatistics and statistical Bioinformatics George
More informationIrr. Statistical Methods in Experimental Physics. 2nd Edition. Frederick James. World Scientific. CERN, Switzerland
Frederick James CERN, Switzerland Statistical Methods in Experimental Physics 2nd Edition r i Irr 1- r ri Ibn World Scientific NEW JERSEY LONDON SINGAPORE BEIJING SHANGHAI HONG KONG TAIPEI CHENNAI CONTENTS
More informationGeneralized Quasi-likelihood versus Hierarchical Likelihood Inferences in Generalized Linear Mixed Models for Count Data
Sankhyā : The Indian Journal of Statistics 2009, Volume 71-B, Part 1, pp. 55-78 c 2009, Indian Statistical Institute Generalized Quasi-likelihood versus Hierarchical Likelihood Inferences in Generalized
More informationA Note on Weighted Count Distributions
Journal of Statistical Theory and Applications Volume 11, Number 4, 2012, pp. 337-352 ISSN 1538-7887 A Note on Weighted Count Distributions Célestin C. Kokonendji and Marta Pérez-Casany Abstract As particular
More informationZero inflated negative binomial-generalized exponential distribution and its applications
Songklanakarin J. Sci. Technol. 6 (4), 48-491, Jul. - Aug. 014 http://www.sst.psu.ac.th Original Article Zero inflated negative binomial-generalized eponential distribution and its applications Sirinapa
More informationLecture 2: Repetition of probability theory and statistics
Algorithms for Uncertainty Quantification SS8, IN2345 Tobias Neckel Scientific Computing in Computer Science TUM Lecture 2: Repetition of probability theory and statistics Concept of Building Block: Prerequisites:
More informationUnconditional Distributions Obtained from Conditional Specification Models with Applications in Risk Theory
Unconditional Distributions Obtained from Conditional Specification Models with Applications in Risk Theory E. Gómez-Déniz a and E. Calderín Ojeda b Abstract Bivariate distributions, specified in terms
More informationInstitute of Actuaries of India
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2018 Examinations Subject CT3 Probability and Mathematical Statistics Core Technical Syllabus 1 June 2017 Aim The
More informationDELTA METHOD and RESERVING
XXXVI th ASTIN COLLOQUIUM Zurich, 4 6 September 2005 DELTA METHOD and RESERVING C.PARTRAT, Lyon 1 university (ISFA) N.PEY, AXA Canada J.SCHILLING, GIE AXA Introduction Presentation of methods based on
More informationAlgorithms for Uncertainty Quantification
Algorithms for Uncertainty Quantification Tobias Neckel, Ionuț-Gabriel Farcaș Lehrstuhl Informatik V Summer Semester 2017 Lecture 2: Repetition of probability theory and statistics Example: coin flip Example
More informationCHAPTER 1 INTRODUCTION
CHAPTER 1 INTRODUCTION 1.0 Discrete distributions in statistical analysis Discrete models play an extremely important role in probability theory and statistics for modeling count data. The use of discrete
More informationMultivariate Normal-Laplace Distribution and Processes
CHAPTER 4 Multivariate Normal-Laplace Distribution and Processes The normal-laplace distribution, which results from the convolution of independent normal and Laplace random variables is introduced by
More informationEstimating function analysis for a class of Tweedie regression models
Title Estimating function analysis for a class of Tweedie regression models Author Wagner Hugo Bonat Deartamento de Estatística - DEST, Laboratório de Estatística e Geoinformação - LEG, Universidade Federal
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationI I FINAL, 01 Jun 8.4 to 31 May TITLE AND SUBTITLE 5 * _- N, '. ', -;
R AD-A237 850 E........ I N 11111IIIII U 1 1I!til II II... 1. AGENCY USE ONLY Leave 'VanK) I2. REPORT DATE 3 REPORT TYPE AND " - - I I FINAL, 01 Jun 8.4 to 31 May 88 4. TITLE AND SUBTITLE 5 * _- N, '.
More informationHigh-Throughput Sequencing Course
High-Throughput Sequencing Course DESeq Model for RNA-Seq Biostatistics and Bioinformatics Summer 2017 Outline Review: Standard linear regression model (e.g., to model gene expression as function of an
More informationMonte Carlo Studies. The response in a Monte Carlo study is a random variable.
Monte Carlo Studies The response in a Monte Carlo study is a random variable. The response in a Monte Carlo study has a variance that comes from the variance of the stochastic elements in the data-generating
More informationPart 8: GLMs and Hierarchical LMs and GLMs
Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course
More information1/15. Over or under dispersion Problem
1/15 Over or under dispersion Problem 2/15 Example 1: dogs and owners data set In the dogs and owners example, we had some concerns about the dependence among the measurements from each individual. Let
More informationDeccan Education Society s FERGUSSON COLLEGE, PUNE (AUTONOMOUS) SYLLABUS UNDER AUTOMONY. SECOND YEAR B.Sc. SEMESTER - III
Deccan Education Society s FERGUSSON COLLEGE, PUNE (AUTONOMOUS) SYLLABUS UNDER AUTOMONY SECOND YEAR B.Sc. SEMESTER - III SYLLABUS FOR S. Y. B. Sc. STATISTICS Academic Year 07-8 S.Y. B.Sc. (Statistics)
More informationPreface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of
Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of Probability Sampling Procedures Collection of Data Measures
More informationTesting Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata
Maura Department of Economics and Finance Università Tor Vergata Hypothesis Testing Outline It is a mistake to confound strangeness with mystery Sherlock Holmes A Study in Scarlet Outline 1 The Power Function
More informationON GENERALIZED VARIANCE OF NORMAL-POISSON MODEL AND POISSON VARIANCE ESTIMATION UNDER GAUSSIANITY
ON GENERALIZED VARIANCE OF NORMAL-POISSON MODEL AND POISSON VARIANCE ESTIMATION UNDER GAUSSIANITY Khoirin Nisa 1, Célestin C. Kokonendji 2, Asep Saefuddin 3, AjiHamim Wigena 3 and I WayanMangku 4 1 Department
More informationOn Hinde-Demetrio Regression Models for Overdispersed Count Data
On Hinde-Demetrio Regression Models for Overdispersed Count Data Célestin Kokonendji, Clarice G.B. Demétrio, Silvio S. Zocchi To cite this version: Célestin Kokonendji, Clarice G.B. Demétrio, Silvio S.
More informationPoisson Regression. Ryan Godwin. ECON University of Manitoba
Poisson Regression Ryan Godwin ECON 7010 - University of Manitoba Abstract. These lecture notes introduce Maximum Likelihood Estimation (MLE) of a Poisson regression model. 1 Motivating the Poisson Regression
More informationPRINCIPLES OF STATISTICAL INFERENCE
Advanced Series on Statistical Science & Applied Probability PRINCIPLES OF STATISTICAL INFERENCE from a Neo-Fisherian Perspective Luigi Pace Department of Statistics University ofudine, Italy Alessandra
More informationStatistics: A review. Why statistics?
Statistics: A review Why statistics? What statistical concepts should we know? Why statistics? To summarize, to explore, to look for relations, to predict What kinds of data exist? Nominal, Ordinal, Interval
More informationTail negative dependence and its applications for aggregate loss modeling
Tail negative dependence and its applications for aggregate loss modeling Lei Hua Division of Statistics Oct 20, 2014, ISU L. Hua (NIU) 1/35 1 Motivation 2 Tail order Elliptical copula Extreme value copula
More informationContents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1
Contents Preface to Second Edition Preface to First Edition Abbreviations xv xvii xix PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1 1 The Role of Statistical Methods in Modern Industry and Services
More informationIntroduction to Reliability Theory (part 2)
Introduction to Reliability Theory (part 2) Frank Coolen UTOPIAE Training School II, Durham University 3 July 2018 (UTOPIAE) Introduction to Reliability Theory 1 / 21 Outline Statistical issues Software
More informationNonparametric estimation of the number of zeros in truncated count distributions
Nonparametric estimation of the number of zeros in truncated count distributions Célestin C. KOKONENDJI University of Franche-Comté, France Laboratoire de Mathématiques de Besançon - UMR 6623 CNRS-UFC
More informationPetr Volf. Model for Difference of Two Series of Poisson-like Count Data
Petr Volf Institute of Information Theory and Automation Academy of Sciences of the Czech Republic Pod vodárenskou věží 4, 182 8 Praha 8 e-mail: volf@utia.cas.cz Model for Difference of Two Series of Poisson-like
More informationMODELING COUNT DATA Joseph M. Hilbe
MODELING COUNT DATA Joseph M. Hilbe Arizona State University Count models are a subset of discrete response regression models. Count data are distributed as non-negative integers, are intrinsically heteroskedastic,
More informationBayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework
HT5: SC4 Statistical Data Mining and Machine Learning Dino Sejdinovic Department of Statistics Oxford http://www.stats.ox.ac.uk/~sejdinov/sdmml.html Maximum Likelihood Principle A generative model for
More informationMultivariate negative binomial models for insurance claim counts
Multivariate negative binomial models for insurance claim counts Peng Shi (Northern Illinois University) and Emiliano A. Valdez (University of Connecticut) 9 November 0, Montréal, Quebec Université de
More informationMIXED POISSON DISTRIBUTIONS ASSOCIATED WITH HAZARD FUNCTIONS OF EXPONENTIAL MIXTURES
MIXED POISSON DISTRIBUTIONS ASSOCIATED WITH HAZARD FUNCTIONS OF EXPONENTIAL MIXTURES Moses W. Wakoli and Joseph A. M. Ottieno 2 Abstra The hazard funion of an exponential mixture charaerizes an infinitely
More informationMathematical statistics
October 1 st, 2018 Lecture 11: Sufficient statistic Where are we? Week 1 Week 2 Week 4 Week 7 Week 10 Week 14 Probability reviews Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation
More informationCovariance function estimation in Gaussian process regression
Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian
More informationIntroduction to Algorithmic Trading Strategies Lecture 10
Introduction to Algorithmic Trading Strategies Lecture 10 Risk Management Haksun Li haksun.li@numericalmethod.com www.numericalmethod.com Outline Value at Risk (VaR) Extreme Value Theory (EVT) References
More informationProbability Distributions Columns (a) through (d)
Discrete Probability Distributions Columns (a) through (d) Probability Mass Distribution Description Notes Notation or Density Function --------------------(PMF or PDF)-------------------- (a) (b) (c)
More informationMixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data
Mixtures of Negative Binomial distributions for modelling overdispersion in RNA-Seq data Cinzia Viroli 1 joint with E. Bonafede 1, S. Robin 2 & F. Picard 3 1 Department of Statistical Sciences, University
More informationClassification. Chapter Introduction. 6.2 The Bayes classifier
Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode
More informationChapter 5. Statistical Models in Simulations 5.1. Prof. Dr. Mesut Güneş Ch. 5 Statistical Models in Simulations
Chapter 5 Statistical Models in Simulations 5.1 Contents Basic Probability Theory Concepts Discrete Distributions Continuous Distributions Poisson Process Empirical Distributions Useful Statistical Models
More informationModeling Longitudinal Count Data with Excess Zeros and Time-Dependent Covariates: Application to Drug Use
Modeling Longitudinal Count Data with Excess Zeros and : Application to Drug Use University of Northern Colorado November 17, 2014 Presentation Outline I and Data Issues II Correlated Count Regression
More informationIntroduction to Bayesian Inference
University of Pennsylvania EABCN Training School May 10, 2016 Bayesian Inference Ingredients of Bayesian Analysis: Likelihood function p(y φ) Prior density p(φ) Marginal data density p(y ) = p(y φ)p(φ)dφ
More informationZERO INFLATED POISSON REGRESSION
STAT 6500 ZERO INFLATED POISSON REGRESSION FINAL PROJECT DEC 6 th, 2013 SUN JEON DEPARTMENT OF SOCIOLOGY UTAH STATE UNIVERSITY POISSON REGRESSION REVIEW INTRODUCING - ZERO-INFLATED POISSON REGRESSION SAS
More informationGeneralized Linear Models. Last time: Background & motivation for moving beyond linear
Generalized Linear Models Last time: Background & motivation for moving beyond linear regression - non-normal/non-linear cases, binary, categorical data Today s class: 1. Examples of count and ordered
More informationIntroduction to Rare Event Simulation
Introduction to Rare Event Simulation Brown University: Summer School on Rare Event Simulation Jose Blanchet Columbia University. Department of Statistics, Department of IEOR. Blanchet (Columbia) 1 / 31
More informationGeneralized linear mixed models (GLMMs) for dependent compound risk models
Generalized linear mixed models (GLMMs) for dependent compound risk models Emiliano A. Valdez joint work with H. Jeong, J. Ahn and S. Park University of Connecticut 52nd Actuarial Research Conference Georgia
More informationOn the Importance of Dispersion Modeling for Claims Reserving: Application of the Double GLM Theory
On the Importance of Dispersion Modeling for Claims Reserving: Application of the Double GLM Theory Danaïl Davidov under the supervision of Jean-Philippe Boucher Département de mathématiques Université
More informationReview. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda
Review DS GA 1002 Statistical and Mathematical Models http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall16 Carlos Fernandez-Granda Probability and statistics Probability: Framework for dealing with
More informationGeneralized linear mixed models for dependent compound risk models
Generalized linear mixed models for dependent compound risk models Emiliano A. Valdez joint work with H. Jeong, J. Ahn and S. Park University of Connecticut ASTIN/AFIR Colloquium 2017 Panama City, Panama
More informationp y (1 p) 1 y, y = 0, 1 p Y (y p) = 0, otherwise.
1. Suppose Y 1, Y 2,..., Y n is an iid sample from a Bernoulli(p) population distribution, where 0 < p < 1 is unknown. The population pmf is p y (1 p) 1 y, y = 0, 1 p Y (y p) = (a) Prove that Y is the
More informationReading Material for Students
Reading Material for Students Arnab Adhikari Indian Institute of Management Calcutta, Joka, Kolkata 714, India, arnaba1@email.iimcal.ac.in Indranil Biswas Indian Institute of Management Lucknow, Prabandh
More informationStatistical techniques for data analysis in Cosmology
Statistical techniques for data analysis in Cosmology arxiv:0712.3028; arxiv:0911.3105 Numerical recipes (the bible ) Licia Verde ICREA & ICC UB-IEEC http://icc.ub.edu/~liciaverde outline Lecture 1: Introduction
More informationLocal Mixtures and Exponential Dispersion Models
Local Mixtures and Exponential Dispersion Models by Paul Marriott Department of Statistics & Applied Probability, National University of Singapore, 3 Science Drive, Singapore 117543. email: stapkm@nus.edu.sg
More informationA Practitioner s Guide to Generalized Linear Models
A Practitioners Guide to Generalized Linear Models Background The classical linear models and most of the minimum bias procedures are special cases of generalized linear models (GLMs). GLMs are more technically
More informationResearch Article The Laplace Likelihood Ratio Test for Heteroscedasticity
International Mathematics and Mathematical Sciences Volume 2011, Article ID 249564, 7 pages doi:10.1155/2011/249564 Research Article The Laplace Likelihood Ratio Test for Heteroscedasticity J. Martin van
More informationNumerical Analysis for Statisticians
Kenneth Lange Numerical Analysis for Statisticians Springer Contents Preface v 1 Recurrence Relations 1 1.1 Introduction 1 1.2 Binomial CoefRcients 1 1.3 Number of Partitions of a Set 2 1.4 Horner's Method
More informationApplied Probability and Stochastic Processes
Applied Probability and Stochastic Processes In Engineering and Physical Sciences MICHEL K. OCHI University of Florida A Wiley-Interscience Publication JOHN WILEY & SONS New York - Chichester Brisbane
More informationSTA216: Generalized Linear Models. Lecture 1. Review and Introduction
STA216: Generalized Linear Models Lecture 1. Review and Introduction Let y 1,..., y n denote n independent observations on a response Treat y i as a realization of a random variable Y i In the general
More informationProduct Held at Accelerated Stability Conditions. José G. Ramírez, PhD Amgen Global Quality Engineering 6/6/2013
Modeling Sub-Visible Particle Data Product Held at Accelerated Stability Conditions José G. Ramírez, PhD Amgen Global Quality Engineering 6/6/2013 Outline Sub-Visible Particle (SbVP) Poisson Negative Binomial
More information1 Presessional Probability
1 Presessional Probability Probability theory is essential for the development of mathematical models in finance, because of the randomness nature of price fluctuations in the markets. This presessional
More informationSubject CS1 Actuarial Statistics 1 Core Principles
Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and
More informationLattice Data. Tonglin Zhang. Spatial Statistics for Point and Lattice Data (Part III)
Title: Spatial Statistics for Point Processes and Lattice Data (Part III) Lattice Data Tonglin Zhang Outline Description Research Problems Global Clustering and Local Clusters Permutation Test Spatial
More informationDouble generalized linear compound poisson models to insurance claims data
Syddansk Universitet Double generalized linear compound poisson models to insurance claims data Andersen, Daniel Arnfeldt; Bonat, Wagner Hugo Published in: Electronic Journal of Applied Statistical Analysis
More informationBrief Review on Estimation Theory
Brief Review on Estimation Theory K. Abed-Meraim ENST PARIS, Signal and Image Processing Dept. abed@tsi.enst.fr This presentation is essentially based on the course BASTA by E. Moulines Brief review on
More informationDependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.
Practitioner Course: Portfolio Optimization September 10, 2008 Before we define dependence, it is useful to define Random variables X and Y are independent iff For all x, y. In particular, F (X,Y ) (x,
More informationIE 303 Discrete-Event Simulation
IE 303 Discrete-Event Simulation 1 L E C T U R E 5 : P R O B A B I L I T Y R E V I E W Review of the Last Lecture Random Variables Probability Density (Mass) Functions Cumulative Density Function Discrete
More informationStandard Error of Technical Cost Incorporating Parameter Uncertainty
Standard Error of Technical Cost Incorporating Parameter Uncertainty Christopher Morton Insurance Australia Group This presentation has been prepared for the Actuaries Institute 2012 General Insurance
More informationStudent-t Process as Alternative to Gaussian Processes Discussion
Student-t Process as Alternative to Gaussian Processes Discussion A. Shah, A. G. Wilson and Z. Gharamani Discussion by: R. Henao Duke University June 20, 2014 Contributions The paper is concerned about
More informationSCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models
SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION
More informationABC methods for phase-type distributions with applications in insurance risk problems
ABC methods for phase-type with applications problems Concepcion Ausin, Department of Statistics, Universidad Carlos III de Madrid Joint work with: Pedro Galeano, Universidad Carlos III de Madrid Simon
More informationLecture-19: Modeling Count Data II
Lecture-19: Modeling Count Data II 1 In Today s Class Recap of Count data models Truncated count data models Zero-inflated models Panel count data models R-implementation 2 Count Data In many a phenomena
More informationSome Theoretical Properties and Parameter Estimation for the Two-Sided Length Biased Inverse Gaussian Distribution
Journal of Probability and Statistical Science 14(), 11-4, Aug 016 Some Theoretical Properties and Parameter Estimation for the Two-Sided Length Biased Inverse Gaussian Distribution Teerawat Simmachan
More informationMAS223 Statistical Inference and Modelling Exercises
MAS223 Statistical Inference and Modelling Exercises The exercises are grouped into sections, corresponding to chapters of the lecture notes Within each section exercises are divided into warm-up questions,
More informationKernel families of probability measures. Saskatoon, October 21, 2011
Kernel families of probability measures Saskatoon, October 21, 2011 Abstract The talk will compare two families of probability measures: exponential, and Cauchy-Stjelties families. The exponential families
More informationSemiparametric Generalized Linear Models
Semiparametric Generalized Linear Models North American Stata Users Group Meeting Chicago, Illinois Paul Rathouz Department of Health Studies University of Chicago prathouz@uchicago.edu Liping Gao MS Student
More informationDiscrete Choice Modeling
[Part 6] 1/55 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent Class 11 Mixed Logit 12 Stated Preference
More informationFrailty Modeling for clustered survival data: a simulation study
Frailty Modeling for clustered survival data: a simulation study IAA Oslo 2015 Souad ROMDHANE LaREMFiQ - IHEC University of Sousse (Tunisia) souad_romdhane@yahoo.fr Lotfi BELKACEM LaREMFiQ - IHEC University
More informationStat 5101 Notes: Brand Name Distributions
Stat 5101 Notes: Brand Name Distributions Charles J. Geyer September 5, 2012 Contents 1 Discrete Uniform Distribution 2 2 General Discrete Uniform Distribution 2 3 Uniform Distribution 3 4 General Uniform
More informationSimulating Realistic Ecological Count Data
1 / 76 Simulating Realistic Ecological Count Data Lisa Madsen Dave Birkes Oregon State University Statistics Department Seminar May 2, 2011 2 / 76 Outline 1 Motivation Example: Weed Counts 2 Pearson Correlation
More informationRMSC 2001 Introduction to Risk Management
RMSC 2001 Introduction to Risk Management Tutorial 4 (2011/12) 1 February 20, 2012 Outline: 1. Failure Time 2. Loss Frequency 3. Loss Severity 4. Aggregate Claim ====================================================
More informationOutline of GLMs. Definitions
Outline of GLMs Definitions This is a short outline of GLM details, adapted from the book Nonparametric Regression and Generalized Linear Models, by Green and Silverman. The responses Y i have density
More informationStat 535 C - Statistical Computing & Monte Carlo Methods. Arnaud Doucet.
Stat 535 C - Statistical Computing & Monte Carlo Methods Arnaud Doucet Email: arnaud@cs.ubc.ca 1 CS students: don t forget to re-register in CS-535D. Even if you just audit this course, please do register.
More informationExtreme Value Analysis and Spatial Extremes
Extreme Value Analysis and Department of Statistics Purdue University 11/07/2013 Outline Motivation 1 Motivation 2 Extreme Value Theorem and 3 Bayesian Hierarchical Models Copula Models Max-stable Models
More informationTesting Statistical Hypotheses
E.L. Lehmann Joseph P. Romano Testing Statistical Hypotheses Third Edition 4y Springer Preface vii I Small-Sample Theory 1 1 The General Decision Problem 3 1.1 Statistical Inference and Statistical Decisions
More information