Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Similar documents
The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

Linear Approximation with Regularization and Moving Least Squares

Non-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2)

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

Estimation: Part 2. Chapter GREG estimation

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

Chat eld, C. and A.J.Collins, Introduction to multivariate analysis. Chapman & Hall, 1980

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

x = , so that calculated

Statistics for Economics & Business

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

Chapter 13: Multiple Regression

Comparison of Regression Lines

Negative Binomial Regression

Chapter 15 - Multiple Regression

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Homework Assignment 3 Due in class, Thursday October 15

Random Partitions of Samples

Conjugacy and the Exponential Family

Simulation and Probability Distribution

QUASI-LIKELIHOOD APPROACH TO RATER AGREEMENT PLUS LINEAR BY LINEAR ASSOCIATION MODEL FOR ORDINAL CONTINGENCY TABLES

Learning from Data 1 Naive Bayes

A Robust Method for Calculating the Correlation Coefficient

NUMERICAL DIFFERENTIATION

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

STAT 511 FINAL EXAM NAME Spring 2001

Hidden Markov Models & The Multivariate Gaussian (10/26/04)

Andreas C. Drichoutis Agriculural University of Athens. Abstract

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

Lecture 10 Support Vector Machines II

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE

Expectation Maximization Mixture Models HMMs

First Year Examination Department of Statistics, University of Florida

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Hidden Markov Models

Statistics for Business and Economics

Primer on High-Order Moment Estimators

Composite Hypotheses testing

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2017 Instructor: Victor Aguirregabiria

Lecture 3: Probability Distributions

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Limited Dependent Variables

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification

Chapter 11: Simple Linear Regression and Correlation

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Games of Threats. Elon Kohlberg Abraham Neyman. Working Paper

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal

Lecture Notes on Linear Regression

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek

Applications of GEE Methodology Using the SAS System

On mutual information estimation for mixed-pair random variables

Numerical Heat and Mass Transfer

Basic Business Statistics, 10/e

CIS526: Machine Learning Lecture 3 (Sept 16, 2003) Linear Regression. Preparation help: Xiaoying Huang. x 1 θ 1 output... θ M x M

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

STAT 3008 Applied Regression Analysis

Lecture 12: Classification

Chapter 8 Indicator Variables

A Comparative Study for Estimation Parameters in Panel Data Model

Efficient nonresponse weighting adjustment using estimated response probability

Modeling and Simulation NETW 707

Marginal Effects in Probit Models: Interpretation and Testing. 1. Interpreting Probit Coefficients

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling

A note on regression estimation with unknown population size

U-Pb Geochronology Practical: Background

Polynomial Regression Models

However, since P is a symmetric idempotent matrix, of P are either 0 or 1 [Eigen-values

Gaussian Mixture Models

Gaussian process classification: a message-passing viewpoint

Statistical inference for generalized Pareto distribution based on progressive Type-II censored data with random removals

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE

Supplementary material: Margin based PU Learning. Matrix Concentration Inequalities

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

Analysis of Discrete Time Queues (Section 4.6)

Lecture 6: Introduction to Linear Regression

e i is a random error

An (almost) unbiased estimator for the S-Gini index

Chapter 20 Duration Analysis

Notes on Frequency Estimation in Data Streams

a. (All your answers should be in the letter!

Statistical Hypothesis Testing for Returns to Scale Using Data Envelopment Analysis

Lecture 3 Stat102, Spring 2007

A New Method for Estimating Overdispersion. David Fletcher and Peter Green Department of Mathematics and Statistics

Convergence of random processes

Transcription:

Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore, Maryland 21250, U.S.A. jnew1@math.umbc.edu, nagaraj@math.umbc.edu AND J. G. MOREL The Proctor & Gamble Company, Bometrcs and Statstcal Scences Department, Wnton Hll Busness Center, 6280 Center Hll Avenue, Cncnnat, Oho 45224, U.S.A. morel.jg@pg.com SUMMARY Computatonal detals and dervatons of hgher order moments from two multnomal overdsperson models are presented n ths report. In addton, we provde the analytcal forms for all frst through fourth order moments and ther matrx forms. Some key words: Drchlet-Multnomal dstrbuton; Fnte-Mxture dstrbuton; Moment calculatons; Multnomal overdsperson. 1. INTRODUCTION A key assumpton of multnomal data s that each count s comprsed of ndependent observatons. In many practcal stuatons however, the ndvdual observatons makng up the counts are correlated or clustered. Ths causes the data to exhbt more varablty than can be explaned by the multnomal model. Ths phenomenon s known as extra-varaton or overdsperson. There are several ways of analyzng overdspersed multnomal data. Lkelhood approaches nclude the Drchlet-Multnomal dstrbuton by Mosmann (1962 and the Fnte-Mxture dstrbuton by Morel & Nagaraj (1993 and Neerchal & Morel (1998. Ths report provdes computatonal detals and dervatons of hgher order moments from these two multnomal overdsperson models. Snce dervatons of the moments from these two dstrbutons requre the moments from a multnomal dstrbuton, we also provde the frst through fourth order moments from the multnomal dstrbuton. The computaton of these moments requre nontrval algebra and result n complcated expressons. Complete dervatons can be found n Newcomer (2008. 2. THE MULTINOMIAL DISTRIBUTION Suppose T = (T 1, T 2,...,T s a k-dmensonal multnomal random varable wth parameters p = (p 1, p 2,...,p and cluster sze m. Let x (a = x(x 1 (x a + 1. Mosmann (1962 shows that the jont factoral moments of the multnomal dstrbuton are gven by [ ] µ (a1,a 2,...,a = E T (a 1 1 T (a 2 2 T (a ( a p a 1 1 pa 2 2 pa. (1 1

2 J. T. NEWCOMER, N. K. NEERCHAL AND J. G. MOREL Usng ths expresson, t can be shown that the frst through fourth order moments are gven by: ( E (T p, = 1, 2,...,k 1. ( E (T T j (2 p p j, j. ( E ( T 2 (2 p 2 + mp, = 1, 2,...,k 1. (v E (T T j T l (3 p p j p l, j l. (v E ( T 2 T j (3 p 2 p j + m (2 p p j, j. (v E ( T 3 (3 p 3 + 3m (2 p 2 + mp, = 1, 2,...,k 1. (v E (T T j T l T r (4 p p j p l p r, j l r. (v E ( T 2 T j T l (4 p 2 p j p l + m (3 p p j p l, j l. (x E ( T 2 Tj 2 (4 p 2 p 2 j + m (3 ( p 2 p j + p p 2 j + m (2 p p j, j. (x E ( T 3 T j (4 p 3 p j + 3m (3 p 2 p j + m (2 p p j, j. (x E ( T 4 (4 p 4 + 6m (3 p 3 + 7m (2 p 2 + mp, = 1, 2,...,k 1. 3. THE FINITE-MIXTURE DISTRIBUTION The Fnte-Mxture Dstrbuton by Morel & Nagaraj (1993 provdes a way to model categorcal data exhbtng overdsperson when the overdsperson s beleved to arse due to clumped multnomal samplng. Ths model can be generated as follows. Let Y, Y1 0, Y 2 0,...,Y m 0 be ndependent and dentcally dstrbuted k-dmensonal multnomal random varables wth parameters π = (π 1, π 2,...,π and cluster sze 1. Also let U 1, U 2,...,U m be ndependent and dentcally dstrbuted unform(0,1 random varables and let ρ be a real number, 0 < ρ < 1. We defne the random varable T as T = Y m I (U ρ + m Y 0 I (U > ρ, (2 where I( s the ndcator functon. Sectons 3 1 and 3 2 provde the frst through fourth order moments, ther matrx forms, and the dervatons for ths dstrbuton. 3 1. Fnte-Mxture Moments Suppose T = (T 1, T 2,...,T s dstrbuted as a Fnte-Mxture random varable wth parameters π = (π 1, π 2,...,π and ρ and cluster sze m. Let x (a = x(x 1 (x a + 1. The frst through fourth order moments are gven by: ( E (T π, = 1, 2,...,k 1. ( E (T T j (2 (1 ρ(1 + ρπ π j, j.

Multnomal Overdsperson Moments 3 ( E ( T 2 (2 (1 ρ(1 + ρπ 2 + m [ 1 + (m 1ρ 2] π, = 1, 2,...,k 1. (v E (T T j T l (3 (1 ρ 2 (1 + 2ρπ π j π l, j l. (v E ( T 2 T j (3 (1 ρ 2 (1 + 2ρπ 2 π j + m (2 (1 ρ [ 1 + ρ + (m 2ρ 2] π π j, j. (v E ( T 3 (3 (1 ρ 2 (1 + 2ρπ 3 + 3m (2 (1 ρ [ 1 + ρ + (m 2ρ 2] π 2 + m [ 1 + 3(m 1ρ 2 + (m 1(m 2ρ 3] π, = 1, 2,...,k 1. (v E (T T j T l T r (4 (1 ρ 3 (1 + 3ρπ π j π l π r, j l r. (v E ( T 2 T j T l (4 (1 ρ 3 (1 + 3ρπ 2 π j π l + m (3 (1 ρ 2 [ 1 + 2ρ + (m 3ρ 2] π π j π l, j l. (x E ( T 2 Tj 2 (4 (1 ρ 3 (1 + 3ρπ 2 πj 2 + m (3 (1 ρ 2 [ 1 + 2ρ + (m 3ρ 2] ( π πj 2 + π 2 π j + m (2 (1 ρ [ 1 + ρ + 2(m 2ρ 2] π π j, j. (x E ( T 3 T j (4 (1 ρ 3 (1 + 3ρπ 3 π j + 3m (3 (1 ρ 2 [ 1 + 2ρ + (m 3ρ 2] π 2 π j + m (2 (1 ρ [ 1 + ρ + 3(m 1ρ 2 + (m 1(m 2ρ 3] π π j, j. (x E ( T 4 (4 (1 ρ 3 (1 + 3ρπ 4 + 6m (3 (1 ρ 2 [ 1 + 2ρ + (m 3ρ 2] π 3 and + m (2 (1 ρ [ 1 + 7ρ + 18(m 2ρ 2 + 4(m 2(m 3ρ 3] π 2 + m [ 1 + 7(m 1ρ 2 + 6(m 1(m 2ρ 3 + (m 1(m 2(m 3ρ 4] π, = 1, 2,...,k 1. The expressons above allow us to wrte these moments n matrx form. Frst, defne T 2 = (T 2 1, T 2 2,...,T 2 T = (T 1 T 2, T 1 T 3, T 2 T 3,...,T 1 T,...,T k 2 T. Here T 2 s a vector of the squared counts and T s a vector contanng the unque off dagonal terms of TT. Smlarly defne, π 2 = (π 2 1, π 2 2,...,π 2, π = (π 1 π 2, π 1 π 3, π 2 π 3,...,π 1 π,...,π k 2 π, and π 3 = (π 3 1, π 3 2,...,π 3, π 4 = (π 4 1, π 4 2,...,π 4.

4 J. T. NEWCOMER, N. K. NEERCHAL AND J. G. MOREL Also letπ,π (2, andπ (3 be defned recursvely as follows. LetΠ 2 = (π 1 π 2, π 1 π 2, Π (2 2 = (π1 2π 2, π 1 π2 2, and Π (3 2 = π1 2π 2 + π 1 π2 2. The update equatons are gven by, ( Π Π r 1 π r dag(π r 1 r = π r π r 1, and Π (2 r = ( Π (3 (3 Π r = r 1 0 (r 1(r 2 2 ( (2 Π r 1 π r dag(πr 1 2 πrπ 2 r 1, 0 (r 1(r 2 2 π r Π r 1 r 1 π r π r 1 π r 1 + π2 rdag(π r 1 Π where 0 s s a s-dmensonal column vector of zeros, π r 1 denotes the frst r 1 elements of the vector π, and dag(x denotes a dagonal matrx wth the elements of x on the dagonals. The matrx forms of the frst through fourth order moments can be represented by: ( V ar (T [ 1 + (m 1ρ 2] dag(π ππ. ( Cov ( T,T 2 = 2m (2 (1 ρ [ 1 + ρ + (m 2ρ 2] dag(π 2 ππ 2 + m [ 1 + 3(m 1ρ 2 + (m 1(m 2ρ 3] dag(π ππ. ( V ar ( T 2 (4 (1 ρ 3 (1 + 3ρπ 2 π 2 + m (2 (1 ρ [ 1 + ρ + 2(m 2ρ 2] ππ + m (3 (1 ρ 2 [ 1 + 2ρ + (m 3ρ 2] ( ππ 2 + π 2 π + 4m (3 (1 ρ 2 [ 1 + 2ρ + (m 3ρ 2] dag(π 3 + 2m (2 (1 ρ [ 3ρ + 8(m 2ρ 2 + 2(m 2(m 3ρ 3] dag(π 2 + m [ 1 + 7(m 1ρ 2 + 6(m 1(m 2ρ 3 + (m 1(m 2(m 3ρ 4] dag(π (m (2 (1 ρ 2 2 π 2 π 2 + m 2 (1 + (m 1ρ 2 2 ππ +m 2 (m 1(1 ρ 2 (1 + (m 1ρ 2 ( ππ 2 + π 2 π., (v (v (v Cov ( T,T (2 (1 ρ [ 1 + ρ + (m 2ρ 2] Π 2ππ. Cov ( T 2, T = 2m (3 (1 ρ 2 [ 1 + 2ρ + (m 3ρ 2] Π (2 + m (2 (1 ρ [ 1 + ρ + 3(m 1ρ 2 + (m 1(m 2ρ 3] Π 2m (2 (1 ρ 2 [ (2m 3 + 2(2m 3ρ + (2m 2 8m + 9ρ 2] π 2 π 2m (2 (1 ρ [ 1 + ρ + (3m + 5ρ 2 + (m 2 3m + 3ρ 3] ππ. V ar ( T (3 (1 ρ 2 [ 1 + 2ρ + (m 3ρ 2] Π (3 + m (2 (1 ρ [ 1 + ρ + 2(m 2ρ 2] dag(π 2m (2 (1 ρ 2 [ (2m 3 + 2(2m 3ρ + (2m 2 8m + 9ρ 2] π π.

Multnomal Overdsperson Moments 5 3 2. Dervaton For the Fnte-Mxture dstrbuton, Morel & Nagaraj (1993 show that the dstrbuton of T s a fnte mxture of multnomals. Ths allows us to wrte the probablty mass functon of T as P(T = t = k π P(X = t, where X ( = 1,...,k 1 s a multnomal random varable wth parameters p = (1 ρπ + ρe and cluster sze m, X k s a multnomal random varable wth parameters p k = (1 ρπ and cluster sze m, and e s the th column of the (k 1 (k 1 dentty matrx. Usng ths representaton, we can wrte the moment generatng functon of T as φ T (z = k π φ X (z, (3 where φ X (z s the moment generatng functon of X ( = 1,...,k whch are defned as above. Usng equaton (3 we can defne the mxed moments of T as E ( T a 1 1 T a = k ( π E X a 1 1 Xa (. (4 Therefore, the moments of T from a Fnte-Mxture dstrbuton can be computed usng equaton (4 and the moments from a multnomal dstrbuton, where X ( = 1,...,k are defned above. We provde the examples below to demonstrate the algebrac technques requred to compute these moments analytcally. As noted above, complete dervatons can be found n Newcomer (2008. ( E (T T j = = s j k π s E (X s X sj π s m (2 (1 ρπ (1 ρπ j + π m (2 [(1 ρπ + ρ](1 ρπ j + π j m (2 (1 ρπ [(1 ρπ j + ρ] ( + 1 π s m (2 (1 ρπ (1 ρπ j = π s m (2 (1 ρ 2 π π j + 2m (2 ρ(1 ρπ π j + m (2 (1 ρ 2 π π j π s m (2 (1 ρ 2 π π j (2 (1 ρ(1 + ρπ π j, j.

6 J. T. NEWCOMER, N. K. NEERCHAL AND J. G. MOREL (x E ( T 2 T 2 j = s j = k π s E ( XsX 2 sj 2 π s m (4 (1 ρ 4 π 2 πj 2 + m (3 (1 ρ 3 [ π 2 π j + π πj 2 ] + m (2 (1 ρ 2 π π j + π m (4 [(1 ρπ + ρ] 2 [(1 ρπ j ] 2 + m (3 [(1 ρπ + ρ] 2 (1 ρπ j + π m (3 [(1 ρπ + ρ] [(1 ρπ j ] 2 + m (2 [(1 ρπ + ρ](1 ρπ j + π j m (4 [(1 ρπ ] 2 [(1 ρπ j + ρ] 2 + m (3 [(1 ρπ ] 2 [(1 ρπ j + ρ] + π j m (3 (1 ρπ [(1 ρπ j + ρ] 2 + m (2 (1 ρπ [(1 ρπ j + ρ] ( + 1 π s m (4 (1 ρ 4 π 2 πj 2 + m (3 (1 ρ 3 [ π 2 π j + π πj] 2 + +m (2 (1 ρ 2 π π j = 4m (4 ρ(1 ρ 3 π 2 π 2 j + m (4 ρ 2 (1 ρ 2 π 2 π j + m (4 ρ 2 (1 ρ 2 π π 2 j+ + 3m (3 ρ(1 ρ 2 π 2 π j + 3m (3 ρ(1 ρ 2 π π 2 j + 2m (3 ρ 2 (1 ρπ π j + + 2m (2 ρ(1 ρπ π j + m (4 (1 ρ 4 π 2 π 2 j + m (3 (1 ρ 3 [ π 2 π j + π π 2 j] + + m (2 (1 ρ 2 π π j (4 (1 ρ 3 (1 + 3ρπ 2 π 2 j + m (3 (1 ρ 2 [ 1 + 2ρ + (m 3ρ 2] ( π π 2 j + π 2 π j, j. 4. THE DIRICHLET-MULTINOMIAL DISTRIBUTION The Drchlet-Multnomal dstrbuton, derved by Mosmann (1962, provdes another model for multnomal data exhbtng overdsperson. Here however, the cause of the overdsperson arses when the cell probabltes, P = (P 1, P 2,...,P, vary randomly accordng to a contnuous jont dstrbuton f(p. Condtonally, gven P, we assume T P s dstrbuted as a k- dmensonal multnomal random varable. The uncondtonal probablty mass functon of T s gven by P(T = t = m! t 1! t k! k j=1 p t j j f(pdp. (5 If we assume that f(p follows a Drchlet dstrbuton wth parameters (Cπ 1, Cπ 2..., Cπ, where C C(ρ = (1 ρ2 or ρ = C ρ 2 C+1, then t can be shown that (5 becomes k m! Γ(C j=1 P(T = t = Γ(t + Cπ t 1! t k! Γ(m + C k j=1 Γ(Cπ. (6 Sectons 4 1 and 4 2 provde the frst through fourth order moments, ther matrx forms, and the dervatons for ths dstrbuton.

Multnomal Overdsperson Moments 7 4 1. Drchlet-Multnomal Moments Suppose T = (T 1, T 2,...,T s a k-dmensonal Drchlet-Multnomal random varable wth parameters ρ and π = (π 1, π 2,...,π and cluster sze m. Also, we let x (a = x(x 1 (x a + 1 and defne C C(ρ = (1 ρ2. The frst through fourth order moments are ρ 2 then gven by: ( E (T π, = 1, 2,...,k 1. ( E (T T j (2 C (C + 1 π π j, j. ( E ( T 2 (2 (Cπ + 1 (C + 1 π + mπ, = 1, 2,...,k 1. (v E (T T j T l (3 C 2 (C + 1 (C + 2 π π j π l, j l. (v E ( T 2 T j (3 C (Cπ + 1 (C + 1 (C + 2 π π j + m (2 C C + 1 π π j, j. (v E ( T 3 (3 (Cπ + 1(Cπ + 2 π + 3m (2(Cπ + 1 (C + 1(C + 2 C + 1 π + mπ = 1, 2,...,k 1. (v E (T T j T l T r (4 C 3 (C + 1(C + 2 (C + 3 π π j π l π r, j l r. (v E ( T 2 T j T l (4 C 2 (Cπ + 1 (C + 1 (C + 2(C + 3 π π j π l + + m (3 C 2 (C + 1 (C + 2 π π j π l, j l. (x E ( T 2 Tj 2 (4 C (Cπ + 1 (Cπ j + 1 (C + 1(C + 2(C + 3 π π j + + m (3C (Cπ + Cπ j + 2 (C + 1(C + 2 π π j + m (2 C C + 1 π π j, j. (x E ( T 3 T j (4 C (Cπ + 1(Cπ + 2 (C + 1 (C + 2(C + 3 π π j + 3m (3 C (Cπ + 1 (C + 1 (C + 2 π π j + + m (2 C C + 1 π π j, j. (x E ( T 4 (4 (Cπ + 1(Cπ + 2(Cπ + 3 π + (C + 1 (C + 2 (C + 3 + 6m (3(Cπ + 1(Cπ + 2 π + 7m (2(Cπ + 1 (C + 1(C + 2 C + 1 π + mπ, = 1,...,k 1.

8 J. T. NEWCOMER, N. K. NEERCHAL AND J. G. MOREL The matrx forms of these moments are represented by: ( [ dag(π V ar (T 1 + (m 1 C+1] C ππ. ( Cov ( T,T 2 = 2m(2 C C + 2 + m(c + 1 C + 2 [ 1 + ] (m 1 dag(π 2 ππ 2 (C + 1 [ (3m 2 (m 1(m 2 1 + + (C + 1 (C + 1 2 ] dag(π ππ. ( V ar ( T 2 = + + m (4 C 3 (C + 1(C + 2(C + 3 π2 π 2 m (3 C 2 (m + C (C + 1(C + 2(C + 3 ππ 2 + π 2 π 4m (3 C 3 (C + 1(C + 2(C + 3 dag(π3 + m(2 C [(m 2(m 3 + 2(m 2(C + 3 + (C + 2(C + 3] ππ (C + 1(C + 2(C + 3 + m(2 C [10(m 2(m 3 + 16(m 2(C + 3 + 6(C + 2(C + 3] dag(π 2 (C + 1(C + 2(C + 3 m[6(m 1(m 2(m 3 + 12(m 1(m 2(C + 3] + dag(π (C + 1(C + 2(C + 3 7m(m 1(C + 2(C + 3 + (C + 1(C + 2(C + 3 + dag(π (C + 1(C + 2(C + 3 (m (2 (1 ρ 2 2 π 2 π 2 + m 2 (1 + (m 1ρ 2 2 ππ +m 2 (m 1(1 ρ 2 (1 + (m 1ρ 2 ( ππ 2 + π 2 π. (v Cov ( T,T (2 C(m + C Π (C + 1(C + 2 2ππ. (v Cov ( T 2, T = 2m (3 C 2 (m + C (C + 1(C + 2(C + 3 Π (2 + m(2 C [2(m 2(m 3 + 3(m 2(C + 3 + (C + 2(C + 3] Π (C + 1(C + 2(C + 3 2m(2 C 2 (m + C[2mC + 3(m C 1] (C + 1 2 π 2 π (C + 2(C + 3 2m(2 C(m + C[C(C + 1 + m(2c + 3] (C + 1 2 ππ. (C + 2(C + 3

(v V ar ( T = Multnomal Overdsperson Moments 9 m (3 C 2 (m + C (C + 1(C + 2(C + 3 Π (3 + m(2 C [(m 2(m 3 + 2(m 2(C + 3 + (C + 2(C + 3] dag(π (C + 1(C + 2(C + 3 2m(2 C 2 (m + C[2mC + 3(m C 1] (C + 1 2 π π. (C + 2(C + 3 where C = (1 ρ2 ρ 2 and Π, Π (2, and Π (3 are defned as above. 4 2. Dervaton In the case of the Drchlet-Multnomal dstrbuton, T P s dstrbuted as a k-dmensonal Multnomal random varable wth parameters P = (P 1,...,P and P vares randomly accordng to a Drchlet dstrbuton wth parameters (Cπ 1,...,Cπ, where C = ( 1 ρ 2 /ρ 2. Suppose P = (P 1,...,P s dstrbuted as a Drchlet random varable wth parameters Cπ 1,...,Cπ. The moments of P are gven by E ( P a 1 1 P a B(Cπ + a =, (7 B(Cπ where, for some vector α = (α 1,...,α, j=1 Γ(α j B(α = (. Γ j=1 α j Usng a condtonng argument, we can derve the formula for the generalzed factoral moments from a Drchlet-Multnomal dstrbuton from the generalzed factoral moments of a multnomal dstrbuton, ( ( E T (a 1 1 T (a a E ( P a 1 1 P a. (8 Equatons (7 and (8 allow us to compute the moments from a Drchlet-Multnomal dstrbuton. As before, we provde the followng examples to demonstrate the algebrac technques requred to compute these moments analytcally. (v E ( T 3 = E (T (3 (3 E(P 3 + 3 + 3E ( T 2 2E (T m (2(Cπ + 1 (C + 1 π + mπ 2mπ (3(Cπ + 1(Cπ + 2 π + 3m (2(Cπ + 1 (C + 1(C + 2 C + 1 π + mπ, = 1, 2,...,k 1. (v E (T T j T l T r (4 E(P P j P l P r (4 C 3 j l r. (C + 1(C + 2(C + 3 π π j π l π r,

10 J. T. NEWCOMER, N. K. NEERCHAL AND J. G. MOREL (v E ( T 2 ( T j T l = E T (2 T j T l + E (T T j T l = (4 E(P 2 P j P l + m (3 C 2 (C + 1 (C + 2 π π j π l (4 C 2 (Cπ + 1 (C + 1 (C + 2(C + 3 π π j π l + + m (3 C 2 (C + 1(C + 2 π π j π l,, j l. 5. COMPARISON OF MOMENTS FROM TWO MULTINOMIAL OVERDISPERSION MODELS Ths secton compares the thrd and fourth order moments from the fnte-mxture and drchlet-multnomal dstrbutons. Suppose T = (T 1, T 2, T 3 s a 4-category multnomal overdsperson model wth category probabltes π = (π 1 = 0.1, π 2 = 0.2, π 3 = 0.3. We assume that T follows ether a fnte-mxture or drchlet-multnomal dstrbuton. Fgure 1 depcts varous moments, varances, and covarances for T as a functon of the dsperson parameter ρ for these two dstrbutons. It s clear that the hgher order moments are dfferent for these two models. These dfferences can be a vtal tool n determnng whch dstrbuton fts a partcular dataset better, especally snce the frst two moments of these dstrbutons are the same. It s therefore possble that goodness-of-ft statstcs could be developed based on these hgher order moments. In addton, snce the frst two moments are the same, estmaton methods based on the frst two moments, such as generalzed estmatng equatons (GEE, provde the same estmate regardless of whch dstrbuton s thought to be the workng model. Newcomer et al. (2008 propose a novel extenson of GEE whch ncorporates the thrd and fourth order moments. Ths method therefore allows one to choose between the two models as a workng model and parameter estmates wll be dfferent based on the choce made. By ncorporatng ths addtonal nformaton nto the estmaton procedure we are able to obtan more effcent estmates than those from the usual GEE procedure. REFERENCES MOREL, J. & NAGARAJ, N. (1993. A fnte mxture dstrbuton for modelng multnomal extra varaton. Bometrka 80, 363 371. MOSIMANN, J. E. (1962. On the compound multnomal dstrbuton, the multvarate β-dstrbuton, and correlaton among proportons. Bometrka 49, 65 82. NEERCHAL, N. & MOREL, J. (1998. Large cluster results for two parametrc multnomal extra varaton models. Journal of the Amercan Statstcal Assocaton 93, 1078 1087. NEWCOMER, J. (2008. Estmaton procedures for multnomal models wth overdsperson. Ph.D. Thess, Unversty of Maryland, Baltmore County. NEWCOMER, J., NEERCHAL, N. & MOREL, J. (2008. An effcent alternatve to GEE for multnomal overdspersed data. In preparaton for Bometrka.

Multnomal Overdsperson Moments 11 E(T 1 4 0 100000 200000 Correlaton (ρ (a E(T 1 2 T2 2 0 500 1500 2500 Correlaton (ρ (b E(T 1 2 T 2T3 0 500 1000 1500 0 50000 150000 Correlaton (ρ (d 50000 30000 10000 Var(T 1 4 Correlaton (ρ (e Cov(T 1 2,T 2T3 3500 2000 500 Cov(T 1 2,T2 2 Correlaton (ρ (c Correlaton (ρ (e Fg. 1. Plots of varous thrd and fourth order moments as a functon of ρ for the fnte-mxture (red and drchletmultnomal (blue dstrbutons.