On Extreme Bernoulli and Dependent Families of Bivariate Distributions

Similar documents
Copyright c 2006 Jason Underdown Some rights reserved. choose notation. n distinct items divided into r distinct groups.

Part IA Probability. Definitions. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

Write your Registration Number, Test Centre, Test Code and the Number of this booklet in the appropriate places on the answersheet.

2 (Statistics) Random variables

STA2603/205/1/2014 /2014. ry II. Tutorial letter 205/1/

Notes 12 Autumn 2005

MATH c UNIVERSITY OF LEEDS Examination for the Module MATH2715 (January 2015) STATISTICAL METHODS. Time allowed: 2 hours

EE4601 Communication Systems

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

Part IA Probability. Theorems. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

4. Distributions of Functions of Random Variables

Lecture 11. Probability Theory: an Overveiw

Formulas for probability theory and linear models SF2941

Math 3215 Intro. Probability & Statistics Summer 14. Homework 5: Due 7/3/14

Marshall-Olkin Bivariate Exponential Distribution: Generalisations and Applications

Theoretical Properties of Weighted Generalized. Rayleigh and Related Distributions

Chapter 7. Basic Probability Theory

CS 591, Lecture 2 Data Analytics: Theory and Applications Boston University

Asymptotic Statistics-III. Changliang Zou

Bivariate distributions

Random Variables and Their Distributions

Economics 573 Problem Set 5 Fall 2002 Due: 4 October b. The sample mean converges in probability to the population mean.

18 Bivariate normal distribution I

Statistics for Economists Lectures 6 & 7. Asrat Temesgen Stockholm University

TAMS39 Lecture 2 Multivariate normal distribution

Mathematical Statistics 1 Math A 6330

Lecture 2: Repetition of probability theory and statistics

Exam 1 - Math Solutions

Final Exam # 3. Sta 230: Probability. December 16, 2012

University of Illinois ECE 313: Final Exam Fall 2014

1 Presessional Probability

Algorithms for Uncertainty Quantification

Multivariate probability distributions and linear regression

MAS113 Introduction to Probability and Statistics. Proofs of theorems

UNIT-2: MULTIPLE RANDOM VARIABLES & OPERATIONS

Machine learning - HT Maximum Likelihood

Probability and Statistics Notes

Random Variables. Lecture 6: E(X ), Var(X ), & Cov(X, Y ) Random Variables - Vocabulary. Random Variables, cont.

Lecture 1: August 28

Lecture 2: Review of Probability

Statistics for Managers Using Microsoft Excel/SPSS Chapter 4 Basic Probability And Discrete Probability Distributions

Practice Problem - Skewness of Bernoulli Random Variable. Lecture 7: Joint Distributions and the Law of Large Numbers. Joint Distributions - Example

Lecture 16 : Independence, Covariance and Correlation of Discrete Random Variables

Chapter 5 continued. Chapter 5 sections

STAT Chapter 5 Continuous Distributions

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Lecture 13. Poisson Distribution. Text: A Course in Probability by Weiss 5.5. STAT 225 Introduction to Probability Models February 16, 2014

Discrete Distributions

Introduction to Statistical Inference Self-study

STAT 302 Introduction to Probability Learning Outcomes. Textbook: A First Course in Probability by Sheldon Ross, 8 th ed.

Bivariate Distributions

Week 2. Review of Probability, Random Variables and Univariate Distributions

MATH Notebook 5 Fall 2018/2019

EEL 5544 Noise in Linear Systems Lecture 30. X (s) = E [ e sx] f X (x)e sx dx. Moments can be found from the Laplace transform as

for valid PSD. PART B (Answer all five units, 5 X 10 = 50 Marks) UNIT I

Chapter 5. Chapter 5 sections

Let X and Y denote two random variables. The joint distribution of these random

Random Vectors, Random Matrices, and Matrix Expected Value

Perhaps the simplest way of modeling two (discrete) random variables is by means of a joint PMF, defined as follows.

Discrete Random Variables

Dennis Bricker Dept of Mechanical & Industrial Engineering The University of Iowa

Statistics for Economists. Lectures 3 & 4

HW4 : Bivariate Distributions (1) Solutions

A Probability Primer. A random walk down a probabilistic path leading to some stochastic thoughts on chance events and uncertain outcomes.

Measure-theoretic probability

MULTINOMIAL PROBABILITY DISTRIBUTION

Gaussian vectors and central limit theorem

ACM 116: Lectures 3 4

CSC 411: Lecture 09: Naive Bayes

Conditional distributions (discrete case)

A BIVARIATE MARSHALL AND OLKIN EXPONENTIAL MINIFICATION PROCESS

Covariance. Lecture 20: Covariance / Correlation & General Bivariate Normal. Covariance, cont. Properties of Covariance

Math Bootcamp 2012 Miscellaneous

Basics on Probability. Jingrui He 09/11/2007

Generating and characteristic functions. Generating and Characteristic Functions. Probability generating function. Probability generating function

Continuous Random Variables

Review of Probabilities and Basic Statistics

Bivariate scatter plots and densities

Probability Models. 4. What is the definition of the expectation of a discrete random variable?

Probability reminders

EXAMINATIONS OF THE HONG KONG STATISTICAL SOCIETY

BASICS OF PROBABILITY

1 Random Variable: Topics

MAS113 Introduction to Probability and Statistics

Random Variables. P(x) = P[X(e)] = P(e). (1)

Summary of basic probability theory Math 218, Mathematical Statistics D Joyce, Spring 2016

A Marshall-Olkin Gamma Distribution and Process

The Canonical Gaussian Measure on R

E X A M. Probability Theory and Stochastic Processes Date: December 13, 2016 Duration: 4 hours. Number of pages incl.

Section 8.1. Vector Notation

Relationship between probability set function and random variable - 2 -

Test Problems for Probability Theory ,

Class 8 Review Problems solutions, 18.05, Spring 2014

Preliminary statistics

CS37300 Class Notes. Jennifer Neville, Sebastian Moreno, Bruno Ribeiro

Multivariate Gaussian Distribution. Auxiliary notes for Time Series Analysis SF2943. Spring 2013

Topic 2: Probability & Distributions. Road Map Probability & Distributions. ECO220Y5Y: Quantitative Methods in Economics. Dr.

STAT/MATH 395 PROBABILITY II

(c) Find the product moment correlation coefficient between s and t.

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Transcription:

Int J Contemp Math Sci, Vol 3, 2008, no 23, 1103-1112 On Extreme Bernoulli and Dependent Families of Bivariate Distributions Broderick O Oluyede Department of Mathematical Sciences Georgia Southern University, Statesboro, GA 30460, USA Boluyede@GeorgiaSouthernedu Mavis Pararai Department of Mathematics Indiana University of Pennsylvania Indiana, PA 15705, USA Abstract The obective and purpose of this note is to generate bivariate distributions via extreme Bernoulli distributions and obtain results on positively and negatively dependent families of bivariate binomial distributions generated by extreme Bernoulli distributions Some distributional properties and results are presented The factorial moment generating functions, correlation functions, conditional distributions and the regression functions are given Mathematics Subect Classifications: 60E05; 60F99 Keywords: Extreme Bernoulli distributions; Conditional distributions; Bivariate binomial distributions 1 Introduction Bernoulli distribution is an important distribution that arises in many applications in probability and statistics There are several distributions and models that take into account the concept of dependence in the parameters for certain classes of distributions Lancaster 1958), Hamdan and Jensen 1976), Takeuchi and Takemura 1987) obtained expressions for association correlation) functions in the form of series bilinear in suitable orthogonal polynomials

1104 B O Oluyede and M Pararai Oluyede 1994) studied a family of bivariate binomial distributions generated by extreme Bernoulli distributions, see also references therein, including results by Marshall and Olkin 1985) on bivariate distributions generated by the Bernoulli distribution Hamdan 1972) provided applications of canonical expansion of the bivariate binomial distribution In this paper, we present new probability distributions generated by extreme Bernoulli distributions and discuss the properties of these distributions including the factorial moment generating functions, correlation functions, conditional distributions and the regression functions In section 2, some basic results leading to the generated distributions are presented In section 3, distributional results including their properties and some comparisons are presented Section 4 offers useful results on families of positive and negative dependent distributions including some comparisons In Section 5, a brief discussion and concluding remarks are given 2 Extreme Bernoulli Distributions and Some Basic Results Let P p i ) and Q q i )ber c probability matrices with r r p i q i i1 i1 c c for 1, 2,,c, p i q i for i 1, 2,, r 1 1 Then P maorizes Q, denoted by P > m Q if k h1 p h) k h1 q h), k 1, 2,,rc 1, where p h) and q h), h 1, 2,,k, are elements of P and Q respectively, arranged in a nondecreasing order Now let p 11, p 10, p 01 and p 00 be the probabilities of classification of individuals into one of four mutually exclusive classes AB, AB, AB, AB A and B denotes negation) An example will be to classify patients as diseased or healthy using standard diagnostic as opposed to surgical procedures The number of occurrences of A and B, denoted by X and Y are random variables having the binomial distributions with parameters n, p 1+ ) and n, ) respectively, where p 1+ p 11 + p 10 and p 11 + p 01 The oint probability function of X and Y is given by Aitken and Gonin 1935) The distribution is given by P X x, Y y) n! s s!x s)!y s)!n x y + s! ps 11 px s 10 py s 01 p n x y+s 00, 1) for x, y 0, 1, 2,, n When sampling without replacement from a finite population, a bivariate hypergeometric distribution is obtained leading to a bivariate binomial distribution in the limit

Extreme Bernoulli distributions 1105 The extreme Bernoulli distributions can be displayed as follows [ ] p E 1 1+ 0 if p p 1+ 1 p 1+ ; +1 [ ] 0 p1+ E 2 if p 1 p 1+ p 1+ 1 +1 2 ; [ ] p+1 p E 3 1+ if p 0 1 p 1+ ; 1+ and [ ] p1+ + p E 4 +1 1 1 if 1 1 p 1+ 0 2 p 1+ 2) The extreme Bernoulli distributions E k, k 1, 2, 3, 4 can be obtained as follows Let p i ), i 0, 1 0, 1, be a probability matrix with p 1+ p 11 +p 10, p 11 + p 01 If p 1+, then p 1+, 0) > m p 11,p 10 ) and p 1+, 1 ) > m p 01,p 00 ) Similarly, if p 1+, then,p 1+ ) > m p 11,p 10 ) and 1 p 1+, 0) > m p 01,p 00 ) The results now follows from the fact that max0,p 1+ + p 1+ 1) p 11 minp 1+, ) The bivariate distributions generated to E k, k 1, 2, 3, 4 are as follows Suppose n independent drawings are made from these populations and let X i, i, 0, 1 be the number of occurrences of event whose cell probabilities are given by E k, k 1, 2, 3, 4 For E 1, P E1 X 11 n 1,X 10 n 2,X 01 n 3 ) n!pn 1 1+ p 1+ ) n 3 1 ) n n 1+n 3 ), n 1!n 3!n n 1 + n 3 ))! 3) where n i 0, i 1,2,3,andn 1 + n 2 + n 3 n With X X 11 + X 10 and Y X 11 + X 01, the oint distribution of X and Y is given by n!p x 1+ p 1+ ) y x 1 ) n y, n y x 0 P E1 X x, Y y) x!y x)!n y)! 4) 0, otherwise For E 2, n!p x 1+ py +1 1 p 1+ ) n x y 0 x, y n, P E2 X x, Y y) x!y!n x y)! x + y n 0, otherwise For E 3, n!p y +1 p 1+ ) x y 1 p 1+ ) n x, n x y 0 P E3 X x, Y y) x!x y)!n x)! 0 otherwise 5) 6)

1106 B O Oluyede and M Pararai For E 4, n!p 1+ + 1) x 1 p 1+ ) n y 1 ) n x 0 x, y n, P E4 X x, Y y) x!n y)!n x)! x + y n 0, otherwise 7) We now examine obtain the regression functions EY X x) and EX Y y) for the distributions given in 1) and those generated by E 1 and E 2 respectively The regression functions for the distributions generated by E 3 and E 4 can be obtained similarly obtained For the distribution given by 1), after some simplification, the conditional distribution of X given Y y is P X Y y) minx,y) max0,x+y n) n y x ) y ) ) ) y ) x ) n x y+ p11 p01 p10 p00 8) This is the distribution of the sum of two random variables One distributed as binomial with parameters y and p 11, and the other as binomial with parameters n y and p 10 The regression function is given by EX Y y) n ) p10 + y p11 p ) 10 9) For the bivariate distribution generated by E 1, the regression function is EX Y y) y p 1+ ) and for E2, the corresponding regression function is given by EY X x) n x) p+1 1 p 1+ 3 Some Distributions and Properties ) 10) In this section, we examine some distributions and their properties including those generated by αi +1 α)e i,i1,2,3,4,0 α 1, where I is the fourfold independence table with entries p hk p h+ p +k,h,k0, 1 First, consider the convex combination of P E1 x, y) and P E2 x, y) The corresponding distribution P E x, y) has the covariance function CovX, Y )np 1+ α ) The regression function is given by EY X x) n α ) + xαp 1+ ) 11) 1 p 1+ 1 p 1+ Now consider the family generated by a linear combination of the independence model I and E i, i 1, 2, 3, 4 Here, we consider the family generated

Extreme Bernoulli distributions 1107 by A i 1 α)i + α)e i, 0 α 1, i 1, 2, 3, 4 Suppose n independent drawings are made from the population A 1 1 α)i + αe 1, and let X hk, h, k 0, 1, be the number of occurrences of the event whose cell probabilities are given by A 1 Then, the oint distribution of X and Y is given by P A1 X x, Y Y ) minx,y) max0,x+y n) n! [1 α)p 1+ + αp 1+ ] [αp 1+ ] x!x )! [1 α)p 0+ + α p 1+ )] y [1 α) p 0+ + α1 )] n x y+ y )!n x y + )! x, y 0, 1, 2,, n, 0 α 1, p 1+ 12) The conditional distribution of X given Y y is given by P A1 X x Y y) minx,y) max0,x+y n) y ) n y x ) 1 α)p1+ + αp 1+ ) ) y 1 α)p0+ + α p 1+ ) αp1+ ) n x y+ 1 α)p0+ + α1 p 1+ ) ) x This is the distribution of the sum of two random variables One distributed as binomial ) y, 1 α)p 1+ +αp 1+ and the other as binomial n y, αp1+ ) The regression function is EX Y y) nαp 1+ + y 1 α)p 1+ + αp 1+ αp 1+ 13) Also, the variance of X given Y y is given by VarX Y y) n y) αp 1+ + y 1 α)p 1+ + αp 1+ αp 1+ 14) For the extreme Bernoulli distribution generated by A 2 1 α)i + αe 2, suppose n independent drawings are made from this population and let X hk, h, k 0, 1, be the number of occurrences of the event whose cell probabilities are given by A 2 Then the oint distribution of X and Y is given by

1108 B O Oluyede and M Pararai P A2 X x, Y Y ) minx,y) max0,x+y n) n! [1 α)p 1+ ] [1 α)p 1+ + αp 1+ ] x!x )!y )!n x y + )! [1 α)p 0+ + α ] y [1 α) p 0+ + α1 )] n x y+ x, y 0, 1, 2,, n, 0 α 1 15) The corresponding conditional distribution of X given Y y is given by P A2 X x Y y) minx,y) max0,x+y n) y ) n y x ) 1 α)p1+ ) ) y 1 α)p0+ + α αp1+ +1 α)p 1+ ) n x y+ 1 α)p0+ + α1 p 1+ ) The regression function reduces to EX Y y) n [ 1 α)p 1+ + αp 1+ The factorial moment generating function is given by ] ) x y αp 1+ 16) Mt 1,t 2 ) [1 α)p 0+ + α1 p 1+ ) + 1 α)p 1+ + αp 1+ )t 1 from which the correlation +1 α)p 0+ + α )t 2 +1 α)p 1+ t 1 t 2 ] n, CorrX, Y ) α ) 1/2 p1+ p 0+ Proposition For the bivariate binomial distribution generated by A 2 1 α)i + E 2, α CorrX, Y ) 0, if p 1+ + 1 17)

Extreme Bernoulli distributions 1109 4 Positive and Negative Dependent Families of Distributions In this section, we present positive and negative dependent families of bivariate binomial distributions generated by the extreme Bernoulli distributions discussed in section 2 For the distributions generated by {Γ :p 11 p 1+ }, we have, P 2 X x, Y Y ) minx,y) max0,x+y n) n! [1 α)p 1+ + αp 1+ ] [αp 1+ ] x!x )!y )!n x y + )! [1 α) p 1+ )+α ] y [α p 0+ +1 α)1 )] n x y+ x, y 0, 1, 2,, n, p 1+, 0 α 1 18) The corresponding conditional distribution of X given Y y is given by P 2 X x Y y) minx,y) max0,x+y n) y ) n y x ) 1 α)p1+ + αp 1+ ) y 1 α)p+1 )+αp 0+ αp1+ The regression function is given by ) n x y+ 1 α)p0+ + α1 ) ) x EX Y y) nαp 1+ + y αp 1+ +1 α)p 1+ αp 1+ 19) The factorial moment generating function is Mt 1,t 2 )[αp 0+ +1 α)1 )+αp 1+ t 1 +αp 0+ +1 α) p 1+ ))t 2 +αp 1+ +1 α) )t 1 t 2 ] n The correlation CorrX, Y ) is given by ) 1/2 p1+ CorrX, Y )1 α) 0 p 0+ Proposition For the bivariate binomial distribution generated by A 2 1 α)i + E 2, 0 CorrX, Y ) 1 α), if p 1+ + 1 20) )

1110 B O Oluyede and M Pararai The bivariate binomial distributions generated by {Δ :p 11 p 1+ }, is P 3 X x, Y Y ) minx,y) max0,x+y n) n![αp 1+ ] [αp 1+ +1 α)p 1+ ] x!x )!y )!n x y + )! [1 α) + αp 0+ ] y [α p 0+ +1 α)1 p 1+ )] n x y+ x, y 0, 1, 2,, n, p 1+, 0 α 1 21) The corresponding conditional distribution of X given Y y is given by P 3 X x Y y) minx,y) max0,x+y n) y ) n y x ) ) αp1+ ) y 1 α)p+1 + αp 0+ αp1+ +1 α)p 1+ αp0+ +1 α)1 p 1+ ) The regression function is given by ) n x y+ ) x EX Y y) n αp 0+ +1 α)p 1+ + y αp 1+ +1 α)p 1+ + αp 0+ 22) Proposition For the bivariate binomial distribution generated by A 2 1 α)i + E 2, CorrX, Y ) 1 α) ) 1/2 p1+ 0 23) p 0+ Let P C X x, Y y) be the bivariate binomial distribution generated by αe 1 +1 α)e 2, 0 α 1 If y<xand x + y n, then the oint probability function is given by P αe1 +1 α)e 2 X x, Y y) P C X x, Y y) n x ) p x1+1 p 1+ ) n x minx,y) max0,x+y n) n x y ) x ) ) αp1+ 1 αp ) x ) y 1+ p+1 αp 1+ 1 p ) n x y+ +1 αp 1+, p 1+ p 0+ p 0+ x, y 0, 1, 2,, n, 0 α 1 24) p 1+

Extreme Bernoulli distributions 1111 The following result provides a lower bound for the oint cumulative distribution function P C X x, Y y) in terms of the bivariate binomial distributions generated by E 1 and E 2 respectively Theorem For y x and p 1+, P C X x, Y y) P E1 X x, Y y)p E2 X x, Y y) provided α Proof: For y x, and p 1+, P C X x, Y y) P E1 X x, Y y)p E2 X x, Y y)k x, y) where { n n k0 h0 } τ n k, h) φ k x, p 1+ )φ h y, ), k!h! 0 if k h αp 1+ p 1+ ) τ n k, h) ) k, if k h, n k [ K y x)!x! n x y)!y! 1 p 1+ ) n x ][ ] n x y 1 x, y), y! n x)! p 1+ ) y x p x 1+ 1 p 1+ and φ k x, p 1+ ) is the kth Krawchouk polynomial Note that { [ ] n y x)!x! n x y)!y! 1 p 1+ lim n K x, y) n lim y! n x)! 1 p 1+ [ ] x [ ] y } p+1 p 1+ )1 p 1+ ) 1 p1+ 1, p 1+ 1 p 1+ ) p 1+ so that P C X x, Y y) P E1 x, y)p E2 x, y) { n } n τ n k, h) φ k x, p 1+ )φ h y, ) k!h! k0 h0 Consequently, for α, P C X x, Y y) P E1 X x, Y y)p E2 X x, Y y) Corollary Fory x, p 1+, and α, P C X x, Y y) P E1 X x, Y y)p E2 X x, Y y)

1112 B O Oluyede and M Pararai 5 Concluding Remarks Several bivariate binomial distributions generated by extreme Bernoulli distributions are derived Important properties of these distributions including factorial moment generating functions and the regression functions are presented Families of positive and negative dependent bivariate binomial distributions generated by the extreme Bernoulli distributions as well as their properties are presented The representations of convex combinations of these extreme distributions are given in terms of Krawchouk polynomials Some comparisons and dependence of the random variables X and Y are explored Also, bounds on certain oint probabilities are given ACKNOWLEDGEMENT: The authors expresses their gratitude to the editor and referee for many useful comments and suggestions References [1] A C Aitken, and H T Gonin, On fourfold sampling with and without replacement, Proceedings of Royal Society of Edinburg, 55, 1935), 114-125 [2] M A Hamdan, Canonical expansion of the bivariate binomial distribution with unequal marginal index, International Statistical Review, 40, 1972), 277-280 [3] M A Hamdan, and D R Jensen, A bivariate binomial distribution with some applications, Australian Journal of Statistics, 18, 1976), 163-169 [4] H O Lancaster, The structure of bivariate distributions, Annals of Mathematical Statistics, 29, 1958), 719-736 [5] A W Marshall, and I Olkin, A family of bivariate distributions generated by Bernoulli distribution, Journal of the American Statistical Association, 80, 1985), 332-338 [6] B O Oluyede, A family of bivariate binomial distributions generated by extreme Bernoulli distributions, Communications in Statistics, 235), 1994), 1531-1547 [7] K Takeuchi, and A Takemura, On sum of 0-1 random variables multivariate) Annals of the Institute of Statistical Mathematics, 39, 1987), 307-324 Received: December 10, 2007