CHAPTER 4 BIVARIATE DISTRIBUTION EXTENSION

Similar documents
A proposed discrete distribution for the statistical modeling of

Chapter 6 Principles of Data Reduction

Properties and Hypothesis Testing

Lecture 7: Properties of Random Samples

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

( θ. sup θ Θ f X (x θ) = L. sup Pr (Λ (X) < c) = α. x : Λ (x) = sup θ H 0. sup θ Θ f X (x θ) = ) < c. NH : θ 1 = θ 2 against AH : θ 1 θ 2

Random Variables, Sampling and Estimation

The Sampling Distribution of the Maximum. Likelihood Estimators for the Parameters of. Beta-Binomial Distribution

Exponential Families and Bayesian Inference

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

Topic 9: Sampling Distributions of Estimators

R. van Zyl 1, A.J. van der Merwe 2. Quintiles International, University of the Free State

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

Chapter 13, Part A Analysis of Variance and Experimental Design

Probability and statistics: basic terms

Provläsningsexemplar / Preview TECHNICAL REPORT INTERNATIONAL SPECIAL COMMITTEE ON RADIO INTERFERENCE

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

CS284A: Representations and Algorithms in Molecular Biology

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.

Describing the Relation between Two Variables

1 Inferential Methods for Correlation and Regression Analysis

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

G. R. Pasha Department of Statistics Bahauddin Zakariya University Multan, Pakistan

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Confidence interval for the two-parameter exponentiated Gumbel distribution based on record values

Bayesian Methods: Introduction to Multi-parameter Models

Problems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:

A statistical method to determine sample size to estimate characteristic value of soil parameters

1 Models for Matched Pairs


NCSS Statistical Software. Tolerance Intervals

Power and Type II Error

Output Analysis (2, Chapters 10 &11 Law)

Chapter 11 Output Analysis for a Single Model. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation


Q-BINOMIALS AND THE GREATEST COMMON DIVISOR. Keith R. Slavin 8474 SW Chevy Place, Beaverton, Oregon 97008, USA.

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

The target reliability and design working life

UCLA STAT 110B Applied Statistics for Engineering and the Sciences

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Introducing Sample Proportions

Modeling and Estimation of a Bivariate Pareto Distribution using the Principle of Maximum Entropy

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

Algebra of Least Squares

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

4. Partial Sums and the Central Limit Theorem

Problem Set 4 Due Oct, 12

Control Charts for Mean for Non-Normally Correlated Data

1 Review of Probability & Statistics

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

On a Smarandache problem concerning the prime gaps

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

The z-transform. 7.1 Introduction. 7.2 The z-transform Derivation of the z-transform: x[n] = z n LTI system, h[n] z = re j

A General Family of Estimators for Estimating Population Variance Using Known Value of Some Population Parameter(s)

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

Mathematical Modeling of Optimum 3 Step Stress Accelerated Life Testing for Generalized Pareto Distribution

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

f(x i ; ) L(x; p) = i=1 To estimate the value of that maximizes L or equivalently ln L we will set =0, for i =1, 2,...,m p x i (1 p) 1 x i i=1

Introducing Sample Proportions

An Introduction to Randomized Algorithms

Chapter Vectors

arxiv: v1 [math.nt] 16 Nov 2009

Chapter 10: Power Series

Similarity Solutions to Unsteady Pseudoplastic. Flow Near a Moving Wall

Section 14. Simple linear regression.

It should be unbiased, or approximately unbiased. Variance of the variance estimator should be small. That is, the variance estimator is stable.

Basis for simulation techniques

Optimally Sparse SVMs

Stat 319 Theory of Statistics (2) Exercises

Estimation for Complete Data

Polynomials with Rational Roots that Differ by a Non-zero Constant. Generalities

Stat 421-SP2012 Interval Estimation Section

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

(7 One- and Two-Sample Estimation Problem )

Elementary Statistics

ECON 3150/4150, Spring term Lecture 3

Expectation and Variance of a random variable

Maximum Likelihood Estimation

Roger Apéry's proof that zeta(3) is irrational

(6) Fundamental Sampling Distribution and Data Discription

Chapter 8: Estimating with Confidence

Lecture 18: Sampling distributions

BIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov

Infinite Sequences and Series

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics

Transcription:

CHAPTER 4 BIVARIATE DISTRIBUTION EXTENSION 4. Itroductio Numerous bivariate discrete distributios have bee defied ad studied (see Mardia, 97 ad Kocherlakota ad Kocherlakota, 99) based o various methods of costructio. For example, Famoye ad Cosul (995) proposed a bivariate geeralized Poisso distributio (BGPD) by usig the method of trivariate reductio. However, the rage of the correlatio is restricted ad it oly permits positive correlatio betwee the two radom variables, X ad X. The bivariate egative biomial distributio (BNBD) defied by Lee (999) based o copula fuctio is very complicated ad oly be used for over dispersed data. A bivariate Poisso distributio was proposed by Lakshamiaraya et al. (999) with a flexible correlatio that varies over the full rage (, ). The model is formulated as a product of Poisso margials with a multiplicative factor. Famoye () adopted the approach of Lakshamiaraya et al. to costruct a ew bivariate geeralized Poisso distributio (BGPD) which allows a more flexible correlatio structure compared to the existig BGPD. This chapter is a extesio of Chapter where two ew bivariate GIT, distributios are defied ad examied. We shall itroduce the ew bivariate distributios as the alterative bivariate discrete distributios that allow more flexibility i modellig ad less limitatio o the correlatio betwee the two radom variables. 5

I sectio, we defie two ew bivariate GIT, distributio (BGITD) based o the covolutio of two bivariate distributios ad the classical trivariate reductio method. The some properties of these two BGITD are preseted i sectio, ad i sectio 4 the characteristic of the distributios are studied. Next, we discuss the estimatio of the parameters of BGITD by maximum likelihood estimatio ad the estimatio based o the probability geeratig fuctio (pgf) i sectio 5. I sectio 6, we illustrate these two methods of estimatio with three real life data together with test of goodess-of-fit. 4. Formulatio of Bivariate GIT, distributios Two formulatios of the bivariate GIT, distributios are cosidered: (a) covolutio of bivariate biomial ad bivariate egative biomial distributios (Type I) ad (b) the method of trivariate reductio (Type II). 4.. Covolutio (Type I) The pgf of GIT, distributio ca be writte as p + pt p pt p ϕ( t) = = + p t p + p p + p p t (4.) If p =, it reduces to the pgf of the biomial distributio ad whe p =, it is the egative biomial pgf. Thus, the GIT, distributio is a covolutio of the biomial p B, p + p ad egative biomial NB (, ) p distributios. I a similar maer, we defie a BGITD (Type I BGITD) as the mixture of bivariate biomial ad bivariate egative biomial distributios. 5

Joit probability geeratig fuctio of Type I BGITD The joit pgf is give by q ϕ( t, t) = ( p + pt + pt ) qt qt (4.) where p + p + p = q + q + q = ad, = positive iteger. Joit probability mass fuctio From (4.), we solve for the joit pmf as metioed i (.5) x x u v x x x x! ( u+ v+ )! q q u+ v = u= v= ( x u)!( x v)!( x x u v)! u! v!( )! p p f ( x, x ) q p p p p + + where max( x + x u v),,, = positive iteger ad p + p + p = q + q + q = (4.) 4.. Trivariate Reductio (Type II) The Type II BGITD is formulated by the method of trivariate reductio (Mardia, 97). As explaied i sectio (.4.), we have X = ( Y + Y ) X = ( Y + Y ) where Y, Y ad Y are idepedet GIT, radom variables. Now let ϕ, ϕ ad ϕ be the pgf of Y, Y ad Y respectively, that is p + pt q + qt r + rt ϕ = ; ϕ = ; ϕ =. pt qt rt 5

Joit probability geeratig fuctio By defiitio, the joit pgf of ( X, X ) is writte as ϕ( t, t ) p + p ( tt ) q + qt r + rt = p( tt ) qt rt (4.4) where p + p + p = q + q + q = r + r + r = ad,, = positive iteger. Joit probability mass fuctio From (4.4), the joit pmf is derived as + k + x k + x k k k k p p q r F, k; k+ ; mi( x k x k x k, x ) p p x x f ( x, x ) = p q r q r k= q r F, ( x k); ( x k) + ; F, ( x k); ( x k) + ; where p + p + p = q + q + q = r + r + r = ad,, = positive iteger. qq rr (4.5) 4. Properties of the Bivariate Distributios 4.. Properties of Type I BGITD Margial distributios Usig (.) ad (.), the margial pgf of X i, i =, ca easily be foud as q ϕ X ( t) = ( p ) i i + pit q + qi ( t) (4.6) Margial meas ad variaces Usig (4.), we ca fid the expected meas ad variaces of the margial X ad X as follows. For i =, 5

E X q = + p i ( i ) q q ( q + q ) Var( X ) = + p ( p ) i i i i i q i Covariace ad correlatio The margial meas ad variaces are used to yield the covariace, Cov( X, X ) ad the correlatio, ρ ( X, X ) of X ad X. q q Cov( X, X ) = p p (4.7) q ρ( X, X ) = q q p p q (( q ( q ) + p ( p ) q )( q ( q ) + p ( p ) q )) (4.8) To further ivestigate the rage of correlatio, we look ito differet parameter sets of Type I BGITD. As illustrated i Table 4., five cases are cosidered. Note that from Table 4., the covariace ad the correlatio of Type I BGITD ca be either positive or egative. The correlatio provides a full rage value which is from - to. Table 4.: Numerical examples case 4 5 p.84.59..84. p.5.4..5. p...98..98 q.84.84.84.. q.5.5.5.. q....98.98 54

Table 4.: Covariace ad correlatio of Type I BGITD case 4 5 = = Cov( X, X ) 59.87 59.76 6. -.6.4 ρ ( X, X ).96.96.96 -.89.4 =5 > = Cov( X, X ) 59.7 58.8 6. -.6 -.4 ρ ( X, X ).96.959.96 -.947 -.66 = < =5 Cov( X, X ) 699.87 699.76 6. -.5.4 ρ ( X, X ).96.96.96 -.688.68 From (4.), the first few probabilities are f (,) = p q ( + ) f (,) = f (,) p p q p f (,) = f (,) ( p + p q ) p ( ( p pq + p ( p + pq )) p p f (,) = f (,) ( + ) qq + + p p Coditioal distributio Next, we determie the coditioal distributio o X give X = x by () Applyig the derivative product rule () Successive differetiatio. Firstly, settig ϕ ( t, t) = j( t, t) g( t, t) as a covolutio of two idepedet bivariate radom variables, where j( t, t ) ad g( t, t ) are respectively the pgfs of the biomial ad egative biomial distributios. By defiitio, the pgf of the coditioal distributio of X give X = x is 55

ϕ ϕ ( t y) = ϕ (, x ) X (, x ) ( t,) (,) where ϕ ( x, x ) x + x ( t, t ) = ϕ( t, t ) x x t t t = u, t = v Usig the derivative product rule, we have ϕ x x (, x ) k x k ( t, t) = ϕ( t, t) = x j g t t k = k = t = x (4.9) where ( ) j p p t t = = + (4.) i i i ( ) i j = p p + pt ( j), i t = j= g t = q = qt + i i i qt j= (4.) (4.) i g = q q ( + j), i t = (4.) With the substitutio of equatios (4.), (4.), (4.) ad (4.) ito equatio (4.9), the umerator part of equatio (.4) ca be obtaied. A further substitutio of t= ito equatio (4.9) will lead us to solve for the deomiator part of equatio (.4). The, the coditioal distributio of X give X = x ca be determied. Secodly, by successive differetiatio with respect to t ad settig t =, we have ϕ ( t x ) X p + pt q = p + p qt x x x x q p ( + i) + ( j) qt i= p pt j= k x k y k x k y p q + ( j) ( + i) k= k p pt qt j= i= x x x x x q p ( + i) + ( j) q i= p p j= x x x k k x k x p q + ( j) ( + i) k= k p p q j= i= x, x 56

ad for x =, we have ϕ ( t x ) X 4.. Properties of Type II BGITD p + pt q = p + p qt Margial distributios Similarly, usig (.) ad (.), the margial pgf of X ad X are give as ϕ ( t) X p + pt q + qt r + r = pt qt r ϕ ( t) X p + pt q + q r + rt = pt q rt Margial meas ad variaces From (4.4), the meas ad variaces of the margial X ad X are obtaied as p q E( X) = + p q p r E( X ) = + p r ( p )( p ) ( q )( q ) Var( X ) = + ( p) ( q) ( p )( p ) ( r )( r ) Var( X ) = + ( p) ( r ) Covariace ad correlatio The covariace ad the correlatio of X ad X are give as ( p )( p ) Cov( X, X ) = ( p) (4.4) ρ( X, X ) = ( p )( p )( q + q )( r + r ) ( ( p )( p)( q + q) + ( q )( q)( p + p) ) ( ( p )( p)( r + r ) + ( r )( r )( p + p) ) (4.5) 57

Sice ( p )( p) >, it follows that for this model, the correlatio of X ad X are always positive. The first few probabilities are give by f (,) = p q r ( )( ) (,) (,) f f q = q q ( )( ) (,) (,) f f r = r r ( p + p p) ( q + qq )( r + r r ) f (,) = f (,) + p qr Coditioal distributio The coditioal distributio o X give X = x is determied by usig the derivative product rule as the successive differetiatio of equatio (4.4) is tedious. Settig q ϕ( t, t ) e( t, t ) h( t, t ) + q t = qt (4.6) where e( t, t ) p + p t t = ptt ad h( t, t ) r + r t = rt By defiitio, ϕ (, x ) x ( t, t ) = ϕ( t, t ) x t t = x + qt e( t, t) h( t, t) x qt t t = q = x q + q x t k = e h q t k x k k = t = 58

The successive differetiatio of e( t, t ) ad h( t, t ) with respect to t gives us e t = ( p ) = (4.7) i k i i i ( i )! k i k k e = t p p ( p p p) ( j), i t = + (4.8) k = k ( k )! j= h = r (4.9) t = ( ) i k i i ( i )! k i k k h = r r ( r r r ) ( j), i t = + k = k ( k )! j= (4.) (, x ) Similarly, by usig equatios (4.7), (4.8), (4.9) ad (4.), ϕ ( t, t ) ca be solved. The coditioal distributio of X give X = x ca the be easily derived. 4. Characteristics of the Distributios To explore the structure of the distributios graphically, a few figures are preseted by varyig the parameters of the Type I ad Type II BGITD. Note that all the figures are scaled by multiplyig the y-axis with the value. Let the joit pmf be pr( x, x ) with x =,,,..., r ad x =,,,..., s. 4.. Characteristics of Type I BGITD To study the effect of chages of the parameters ad of the Type I BGITD, we set r =8 ad s =8 ad = r + s =6 (as max( x + x) from equatio (4.)) ad = (Figure 4.). I Figure 4., the value of is icreased with fixed whereas i Figure 4. we icrease the value of but fix the value of (same as the value of i Figure 4.). 59

It is clear from Figures 4. ad 4. that icreasig the value of shifts the local mode away from the origi, (, ) ad decreases the height of the peak. By comparig Figures 4. ad 4., it is see that the local modes are flatter ad located further away. I coclusio, if the data are located close to the origi ad the local mode is sharp, small positive itegers value of ad are suggested. Larger value of is suggested if the value of the mode is low ad situated slightly away from the origi. Meawhile, large positive itegers for both ad are recommeded oly whe the local mode ad the data are placed quite far away from the origi ad the data form a ice bell-shaped. This observatio will be useful i the empirical modellig of data. 8 6 4 4 5 6 7 8 9 S S5 S9 S9 S8 S7 S6 S5 S4 S S S 4 5 6 7 8 9 A local mode at (,) with value 94. Figure 4.: p =., p =. q =., q =., = 6, = 5 S S9 4 S6 5 9 7 S S 4 7 6 9 S S S7 S4 S A local mode at (4,4) with value 4. Figure 4.: p =., p =. q =., q =., =4, = 6

S S9 5 S6 S S 5 5 9 7 S S 4 7 6 9 S7 S4 S Local modes are (7, 7), (7,8) ad (8,7) with value. Figure 4.: p =., p =. q =., q =., =4, = 4.. Characteristics of Type II BGITD Figures 4.4 to 4. are displayed i order to uderstad the behaviour of Type II BGIT with differet combiatio of the positive itegers, ad. We examie the chages of, ad as give i Table 4.. It is easily see that by varyig, ad, differet characteristics of Type II BGIT models are displayed. I Figure 4.4, where, ad are set at, we achieve a very sharp ad high value of the local mode. However, whe is icreasig ( ad fixed) as i Figure 4.5, the cotours chage to ellipses. Furthermore, the height of its local mode is decreased ad situated further from the origi i compariso with Figure 4.4. I Figure 4.6, is icreased but = = ad it is foud that a flatter mode is formed; the cotour plots show a shift to the x-axis. O the other had, whe = = ad is icreased, Figure 4.7 is see to be a reflectio of Figure 4.6 through the origi, with the heights of the modes uchaged. Figures 4.8 to 4. show the effect of icreasig two parameters ad fixig the remaiig oe. It is observed that i Figure 4.8 the local mode has the same locatio as the oe i Figure 4.5 but the peak of the local mode is much lower ad the elliptic 6

cotours are broader. The plot i Figure 4.9 is similar with Figure 4.6 while Figures 4. ad 4.7 are similar (with reflectio). The plots idicate the flexibility of the shape of the Type II BGIT ad small, ad give local modes situated very close to the origi. To apply Type II BGIT i data aalysis the modes of the data will idicate the values of, ad. Table 4.: Chage of the positive itegers, ad Figure 4.4 4.5 4.6 4.7 4.8 4.9 4. 5 5 5 5 5 5 5 5 5 S 8 6 4 4 7 6 9 S S S9 4 7 6 9 S9 S7 S5 S S S9 S7 S5 S S A local mode at (,) with value 98. Figure 4.4: p =., p =.4, q =., q =.4, r =., r =.4, = = = 4 5 5 5 5 7 9 5 S S6 S 5 9 7 5 9 S S8 S5 S S9 S6 S S S7 S4 S A local mode at (9,9) with value 6. Figure 4.5: p =., p =.4, q =., q =.4, r =., r =.4, = 5, = = 6

S 5 5 5 5 6 6 7 6 S7 S4 S 5 9 7 5 9 S8 S5 S S9 S6 S S S7 S4 S Local modes at (8,) ad (9,) with value. Figure 4.6: p =., p =.4, q =., q =.4, r =., r =.4, =, = 5, = S 5 5 5 5 6 6 6 S7 S4 S 5 9 7 5 9 S8 S5 S S9 S6 S S S7 S4 S Local modes at (,8) ad (,9) with value. Figure 4.7: p =., p =.4, q =., q =.4, r =., r =.4, =, =, = 5 S 6 S8 S5 4 S S9 8 6 S6 S S S7 4 S46 S S6 4 7 6 9 5 8 S4 S 9 5 7 4 S A local mode at (9,9) with value 5. Figure 4.8: p =., p =.4, q =., q =.4, r =., r =.4, =, = 5, = 5 6

7 S 6 S8 4 S5 8 6 4 9 5 7 4 S46 S S6 S 4 7 6 9 5 8 S S9 S6 S S S7 S4 S Local modes at (5,9), (6,9) ad (6,) with value 5. Figure 4.9: p =., p =.4, q =., q =.4, r =., r =.4, = 5, = 5, = S 6 4 S8 S5 S S9 S6 8 6 4 7 S S S46 S S6 4 7 6 9 5 8 S7 S4 S 9 5 7 4 S Local modes at (9,5), (9,6) ad (,6) with value 5. Figure 4.: p =., p =.4, q =., q =.4, r =., r =.4, = 5, =, = 5 4.4 Parameter Estimatio I this sectio, two methods of parameter estimatio are cosidered. Due to the complicated joit pmf, simulated aealig is proposed for the umerical optimizatio uder these two estimatios. For the Type I BGITD distributio, we shall cosider the case where the parameters ad are fixed ad for the Type II BGITD distributio, the parameters, ad are fixed. 4.4. Maximum Likelihood Estimatio (MLE) For the probability fuctio based estimatio method, we cosider maximum likelihood estimatio where the log-likelihood fuctio 64

r s = ij i= j= l L π l f ( i, j) is maximized, where π ij is the observed frequecy i the (i, j) cell for i =,,,, r ; j =,,,, s. 4.4. The Pgf-based Miimum Helliger-type Distace Estimatio (PGFBE) The use of pgf is popular as geerally it has a simpler form tha the probability mass fuctio ad this leads to simpler statistical iferece procedures which shorte computatio times. From Sim ad Og (), the uivariate pgf-based estimator is exteded to bivariate distributio as follows ( ) T = ψ ( t, t ) ϕ( t, t ) dt dt where k k i j ijtt i j ψ ( t, t ) = is the empirical pgf ad ϕ ( t, t) is the pgf of the distributio. 4.5 Numerical examples To illustrate the applicatio i data fittig, three real life data sets are cosidered: a) Number of accidets sustaied by experieced shuters over successive periods of time. (Arbous ad Kerrich, 95) b) Number of patiets i two boxes i a room of the critical care ad emergecy service i the Sa Agusti Hospital (Liares, Spai). (Rodriguez et al., 6). c) Number of times baco ad eggs were purchased o four cosecutive shoppig trips. (Peter ad Bruce, 5) These data are fitted by the bivariate egative biomial distributio (Subrahmaiam, 966), Type I ad Type II BGITD by MLE ad the pgf-based 65

miimum Helliger-type distace estimatio. The performace of these parameter estimatio methods will be discussed ad the expected frequecies are displayed i Tables 4.4, 4.6 ad 4.8 respectively. No groupig has bee doe ad the chi-square goodess-of-fit (GOF) tests are carried out to ivestigate how well the observed distributio fits the estimated distributio. The parameters, ad are chose which provide the lowest chi-square value for the fit. For Type I BGITD, the iitial parameter estimate of correspodig to the bivariate egative biomial compoet is first treated as a real umber. The, we choose the positive itegers of parameters ad based o the lowest chi-square value as metioed above. For type II BGITD, Table 4. is used to assist us to choose the suitable positive itegers. I Table 4.4 (Arbous ad Kerrich, 95) X represets the umber of accidets i 5- year period 94-47 ad X is the umber of accidets i 6- year period 97-4. From the data, we fid that the sample mea ad variace for X are.9754 ad.969 respectively. Meawhile, for X, the sample mea is x =.75 ad sample variace is ˆ x σ =.655. The sample correlatio is foud to be.585. I Table 4.6, the data were obtaied from a room of Critical Care ad Emergecy Service i Sa Agustí Hospital (Liares, Spai) where the referece populatio is more tha 5, people. Patiets are dealt with i oe of two boxes where they ca stay a maximum of 4 hours without preferece (radom). The hadlig of patiets i shifts worked i ad is cosidered ad the observed frequecies of patiets dealt with i each box are show i Table 4.7. Similarly, we compute the sample mea ad variace for X ad X : x =.66, ˆ x σ =.4768, 66

x =.9868 ad ˆ x σ =.47. The sample correlatio is foud to be slightly egative which is -.9. Hece, we fit the data with the Type I BGITD. Table 4.8 exhibits the purchase of baco ad eggs from the Iformatio Resources, Ic., a cosumer pael based i a large U.S. city. A sample of 548 households over four cosecutive store trips was tracked. For each household, a total umber of baco purchases i their four shoppig trips ad the total umber of egg purchases for the same trip were couted. From the data, the sample mea ad variace for X ad X are x =.956, sample correlatio is.. ˆ x ˆ x σ =.44, x =.747 ad σ =.574. I additio, the Example Table 4.4: Number of accidets sustaied by experieced shuters over successive periods of time X 4 5 6 Total 8 8 5 BNBD*.9 6.67 7.98.7.4..9 5.8 BNBD^. 6.97 8.4.95.9.6.7 5.5 BGITD*.45 8.8 8.89.9.8.4. 5.4 BGITD^.79 7.77 8.48.7.7.5. 5.74 BGITD*.8 7.4 6.89.77..45.8 49.7 BGITD^.56 6.86 7.89.87.89.5.6 5.8 4 4 4 BNBD*.5.8 8.6.77.5.5.8 9.75 BNBD^.5 4. 8.56.84.4.46.4 4.94 BGITD*.5.6 7.47.4.4.5.8 9.4 BGITD^..8 7.9.4.9.55. 8.78 BGITD*. 5.6 8.9.58.44.58. 4.67 BGITD^.6 5. 8.96.75.8.8. 4.8 4 5 4 7 BNBD* 4.5 6.6 4.54.5.6.47.7 9.44 BNBD^ 4.8 6. 4.79.58..4.4 9.74 BGITD* 4.88 5.67.88...44.9 8.8 BGITD^ 4.84 5.6.9.9.9.5. 8. BGITD*.77 6.5 5.4.44.98.4.6 9.9 BGITD^ 4.5 6.4 4.9.4.9..9 9.8 X 67

Table 4.4, cotiued. 9 BNBD*...9..65.. 7.6 BNBD^.8.9.9..6.6. 7.5 BGITD*.44..66.7.6..4 7. BGITD^.4..7.5.67.5.7 7.5 BGITD*.5.6.88.9.49..8 7.6 BGITD^.4.4.88.6.45.6.5 7.6 4 BNBD*..64.66.49.9.5.7.6 BNBD^.7.57.6.45.6..5. BGITD*.9.65.64.48..7.9.74 BGITD^.9.68.68.5.5...97 BGITD*.4.7.6.9.9.8..45 BGITD^.49.78.66.9.8.7..59 5 BNBD*.8.7..7..6..8 BNBD^.6.4.7.4.9.5..66 BGITD*.....4.9.5. BGITD^...5..7..7.5 BGITD*.4.4...7...8 BGITD^.7.7..4.6...9 6 BNBD*..4.6.5.4...4 BNBD^...4.4....7 BGITD*..6.8.8.6.4..8 BGITD^..7.9.9.7.5.4.44 BGITD*.5.8.7.4....7 BGITD^.6..8.5.... 7 BNBD*.......8.45 BNBD^.......4.8 BGITD*.......68.8 BGITD^.......9.5 BGITD*.......5.6 BGITD^.......8.9 * = MLE, ^ = PGF based estimator, BGITD = type I BGITD, BGITD = type II BGITD ML estimates: BNBD: ˆp =.964, ˆp =.475, ˆp =.4, ˆv =.876. Type I BGITD: ˆp =.48, ˆp =.75, ˆq =., ˆq =.64, =, =. 68

Type II BGITD: ˆp =.9855, ˆp =., ˆq =449, ˆq =.7, ˆr =.555, ˆr =.9, =, =, =. PGF-based estimates: BNBD: ˆp =59, ˆp =., ˆp =.9, ˆv =5.89 Type I BGITD: ˆp =.8, ˆp =.64, ˆq =.6, ˆq =.689, =, =. Type II BGITD: ˆp =.987, ˆp =., ˆq =455, ˆq =.59, ˆr =.549, ˆr =.54, =, =, =. By usig the estimated parameters as show above, we compute the correlatio for the estimated distributios. All of the estimated correlatio values are close to the sample correlatio which is.585. MLE PGFBE Model ρ ( X, X ) Model ρ ( X, X ) BNBD*.8 BNBD^.8 BGITD*.8 BGITD^. BGITD*. BGITD^. Chi-square GOF test A summary of the show i the followig table. χ values, the full model log-likelihood values ad the P-values are Table 4.5: Summary statistics for Table 4.4 Log-likelihood Degree of Model χ value P-value freedom BNBD*.7-4.6. 5 BGITD* 75.4-4.5.59 5 BGITD* 6.4-4.48.74 49 It is to be observed that i Table 4.5, the fit by Type I ad Type II BGITD are sigificatly better tha the BNBD based upo the χ values. The Type II BGITD is preferred over the Type I BGITD as its P-value is higher ad the result is sigificat. 69

Example Table 4.6: Number of patiets i two boxes i a room of the critical care ad emergecy service i the Sa Agusti Hospital (Liares, Spai) Box, X Box, X 4 5 6 Total BGITD*.4..6..4.9.5 4.46 BGITD^..5.45.8.96.8.5.7 4 5 4 BGITD*..4.7 4.58 4.56.7.58 8.4 BGITD^.5.9.4.89 4.7.4.68 6.5 4 BGITD*.6.9 6.9 9.7 7.98 4.76.97 4.5 BGITD^.45.4 5.88 8.4 7.9 5.9.8.45 4 5 45 BGITD*.4 4.64 9..7 7.99.97. 8.9 BGITD^.8.95 8.48.6 8.55 4.59.64 8.66 4 7 5 BGITD*.7 4.65 8.9 8.4 5..99.5 9.4 BGITD^. 4.7 8. 8.6 5.79.49.67.86 5 4 4 4 4 BGITD*.94.6 4.86 4...6. 5.78 BGITD^.86.4 5. 4.67.5.8.5 7.44 6 BGITD*.55.6..4.5. 4..47 BGITD^.54.75.5.68.68.5 5.5.5 *= MLE, ^= PGF based estimator, BGITD=type I BGITD, BGITD=type II BGITD ML estimates: Type I BGITD: ˆp =.488, ˆp =.55, ˆq =., ˆq =., =, =. PGF-based estimates: Type I BGITD: ˆp =.58, ˆp =.6, ˆq =., ˆq =., =, =. 7

The correlatio for the estimated distributio is compute as follows. Model ρ ( X, X ) BGITD* -. BGITD^ -.5 By comparig the sample correlatio, the estimated correlatio from type I BGITD seems to overestimate the correlatio of the data i Table 4.6. Chi square GOF test A summary of the χ values, log-likelihood values ad the P-values are show below. Table 4.7: Summary statistics for Table 4.6 Log-likelihood Degree of Model χ value P-value freedom BGIT*.46-5.75.876 44 Table 4.7 shows that the fit of type I BGIT is sigificat where it provides low value of χ ad a high P-value. 7

Example : Table 4.8: Number of times baco ad eggs were purchased o four cosecutive shoppig trips 4 Total 54 5 4 6 4 BNBD* 4.8 5.5 4.6.6.68 45.5 BNBD^ 5.9 5.84 9.59..4 44.5 BGITD* 45..9 9.7..5 4.79 BGITD^ 5.84 6.6 4.6.4 4.9 46.9 BGITD* 4.8 5.5 4.6.6.68 45.5 BGITD^ 5.9 5.84 9.59..4 44.5 4 9 6 6 86 BNBD* 4.65 4.49 6.86 5.65.7 5.7 BNBD^ 4.4 4.55 7.8 5.67.69.5 BGITD* 4.54.65 5. 6.6. 96.66 BGITD^ 7.98 9. 5. 6.84.84 9.8 BGITD* 4.65 4.49 6.86 5.65.7 5.7 BGITD^ 4.4 4.55 7.8 5.67.69.5 8 8 BNBD* 6.76 6.76 4.6.8.59.55 BNBD^ 4.64 5.6 4.9.96.64 7.79 BGITD* 6.7 6.98 4.4..97.6 BGITD^ 5.4 5.9 4.9..6 8.87 BGITD* 6.76 6.76 4.6.8.59.55 BGITD^ 4.64 5.6 4.9.96.64 7.79 4 6 BNBD*...77.46.7.65 BNBD^.6.76.67.5..78 BGITD*.8.4..67.4 4.6 BGITD^.77.9.94.6.7.8 BGITD*...77.46.7.65 BGITD^.6.76.67.5..78 4 BNBD*.9.9..8.6.9 BNBD^.8..9.7.4.77 BGITD*.7.8.6.8.78 4.67 BGITD^..9..5 5.68 6. BGITD*.9.9..8.6.9 BGITD^.8..9.7.4.77 *= MLE, ^= PGF based estimator, BGITD=type I BGITD, BGITD=type II BGITD ML estimates: BNBD, ˆp =.659, ˆp =.78, ˆp =.44, ˆv =.674. Type I BGITD, ˆp =.64, ˆp =., ˆq =.798, ˆq =.6, =8, =. 7

Type II BGITD, ˆp =.9, ˆp =., ˆq =764, ˆq =.4, ˆr =.88, ˆr =., =, =, =. PGF-based estimates: BNBD, ˆp =.699, ˆp =.895, ˆp =., ˆv =.669. Type I BGITD, ˆp =.66, ˆp =., ˆq =.6, ˆq =.4, =8, =. Type II BGITD, ˆp =.898, ˆp =., ˆq =.776, ˆq =., ˆr =.864, ˆr =., =, =, =. The correlatios for the estimated distributios are compute as follows. MLE PGFBE Model ρ ( X, X ) Model ρ ( X, X ) BNBD*. BNBD^.6 BGITD*.4 BGITD^.6 BGITD*.9 BGITD^.4 All of the estimated correlatio values appear to be slightly lower tha the sample correlatio which is.. Chi-square GOF test The summary of the χ, the log-likelihood ad the P- values are show below. Table 4.9: Summary statistics for Table 4.8 Log-likelihood Degree of Model χ value P-value freedom BNBD* 4.54 -.4.4 BGITD*.57-999.8.6 BGITD* 44.87 -.9.4 8 All of the estimated distributios performed well as the P-value, Type I BGITD fits better tha BNBD ad Type II BGITD. χ values are low. Based o the 7