arxiv: v1 [math.st] 24 Oct 2016

Similar documents
STK4011 and STK9011 Autumn 2016

Point Estimation: definition of estimators

Chapter 4 Multiple Random Variables

ρ < 1 be five real numbers. The

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Chapter 5 Properties of a Random Sample

Lecture 3 Probability review (cont d)

3. Basic Concepts: Consequences and Properties

Lecture 3. Sampling, sampling distributions, and parameter estimation

Special Instructions / Useful Data

Functions of Random Variables

Chapter 4 Multiple Random Variables

Point Estimation: definition of estimators

Summary of the lecture in Biostatistics

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

X ε ) = 0, or equivalently, lim

Some Statistical Inferences on the Records Weibull Distribution Using Shannon Entropy and Renyi Entropy

VOL. 3, NO. 11, November 2013 ISSN ARPN Journal of Science and Technology All rights reserved.

Estimation of Stress- Strength Reliability model using finite mixture of exponential distributions

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULE 2

Estimation of the Loss and Risk Functions of Parameter of Maxwell Distribution

A new Family of Distributions Using the pdf of the. rth Order Statistic from Independent Non- Identically Distributed Random Variables

CHAPTER VI Statistical Analysis of Experimental Data

Bayes Interval Estimation for binomial proportion and difference of two binomial proportions with Simulation Study

Chapter 3 Sampling For Proportions and Percentages

Journal of Mathematical Analysis and Applications

Bayes Estimator for Exponential Distribution with Extension of Jeffery Prior Information

Random Variables. ECE 313 Probability with Engineering Applications Lecture 8 Professor Ravi K. Iyer University of Illinois

Module 7: Probability and Statistics

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then

Parameter, Statistic and Random Samples

Study of Correlation using Bayes Approach under bivariate Distributions

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

Chain Rules for Entropy

Q-analogue of a Linear Transformation Preserving Log-concavity

Random Variate Generation ENM 307 SIMULATION. Anadolu Üniversitesi, Endüstri Mühendisliği Bölümü. Yrd. Doç. Dr. Gürkan ÖZTÜRK.

ENGI 3423 Simple Linear Regression Page 12-01

Lecture Notes Types of economic variables

THE ROYAL STATISTICAL SOCIETY 2010 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULE 2 STATISTICAL INFERENCE

Chapter 14 Logistic Regression Models

A NEW MODIFIED GENERALIZED ODD LOG-LOGISTIC DISTRIBUTION WITH THREE PARAMETERS

Comparison of Parameters of Lognormal Distribution Based On the Classical and Posterior Estimates

MAX-MIN AND MIN-MAX VALUES OF VARIOUS MEASURES OF FUZZY DIVERGENCE

1 Solution to Problem 6.40

22 Nonparametric Methods.

CHAPTER 3 POSTERIOR DISTRIBUTIONS

Comparing Different Estimators of three Parameters for Transmuted Weibull Distribution

Simulation Output Analysis

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

Statistics MINITAB - Lab 5

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes

Non-uniform Turán-type problems

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5

Some identities involving the partial sum of q-binomial coefficients

Random Variables and Probability Distributions

IFYMB002 Mathematics Business Appendix C Formula Booklet

Generating Multivariate Nonnormal Distribution Random Numbers Based on Copula Function

The Generalized Inverted Generalized Exponential Distribution with an Application to a Censored Data

Bayesian Inferences for Two Parameter Weibull Distribution Kipkoech W. Cheruiyot 1, Abel Ouko 2, Emily Kirimi 3

BAYESIAN ESTIMATOR OF A CHANGE POINT IN THE HAZARD FUNCTION

Minimax Estimation of the Parameter of the Burr Type Xii Distribution

A NEW LOG-NORMAL DISTRIBUTION

MEASURES OF DISPERSION

Research Article A New Iterative Method for Common Fixed Points of a Finite Family of Nonexpansive Mappings

A be a probability space. A random vector

Assignment 5/MATH 247/Winter Due: Friday, February 19 in class (!) (answers will be posted right after class)

Modified Moment Estimation for a Two Parameter Gamma Distribution

Law of Large Numbers

Confidence Intervals for Double Exponential Distribution: A Simulation Approach

9.1 Introduction to the probit and logit models

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

Interval Estimation of a P(X 1 < X 2 ) Model for Variables having General Inverse Exponential Form Distributions with Unknown Parameters

BAYESIAN INFERENCES FOR TWO PARAMETER WEIBULL DISTRIBUTION

M2S1 - EXERCISES 8: SOLUTIONS

Chapter 8: Statistical Analysis of Simulated Data

A Markov Chain Competition Model

Bounds on the expected entropy and KL-divergence of sampled multinomial distributions. Brandon C. Roy

. The set of these sums. be a partition of [ ab, ]. Consider the sum f( x) f( x 1)

A New Measure of Probabilistic Entropy. and its Properties

Simple Linear Regression

MYUNG HWAN NA, MOON JU KIM, LIN MA

Qualifying Exam Statistical Theory Problem Solutions August 2005

A New Family of Transformations for Lifetime Data

Homework 1: Solutions Sid Banerjee Problem 1: (Practice with Asymptotic Notation) ORIE 4520: Stochastics at Scale Fall 2015

Bayes (Naïve or not) Classifiers: Generative Approach

Unimodality Tests for Global Optimization of Single Variable Functions Using Statistical Methods

STATISTICAL INFERENCE

h-analogue of Fibonacci Numbers

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Part 4b Asymptotic Results for MRR2 using PRESS. Recall that the PRESS statistic is a special type of cross validation procedure (see Allen (1971))

Entropy ISSN by MDPI

Analysis of Variance with Weibull Data

Solution of General Dual Fuzzy Linear Systems. Using ABS Algorithm

18.413: Error Correcting Codes Lab March 2, Lecture 8

The Mathematics of Portfolio Theory

4. Standard Regression Model and Spatial Dependence Tests

Class 13,14 June 17, 19, 2015

Goodness of Fit Test for The Skew-T Distribution

Transcription:

arxv:60.07554v [math.st] 24 Oct 206 Some Relatoshps ad Propertes of the Hypergeometrc Dstrbuto Peter H. Pesku, Departmet of Mathematcs ad Statstcs York Uversty, Toroto, Otaro M3J P3, Caada E-mal: pesku@pascal.math.yorku.ca Abstract The bomal ad Posso dstrbutos have terestg relatoshps wth the beta ad gamma dstrbutos, respectvely, whch volve ther cumulatve dstrbuto fuctos ad the use of cojugate prors Bayesa statstcs. We brefly dscuss these relatoshps ad some propertes resultg from them whch play a mportat role the costructo of exact ested two-sded cofdece tervals ad the computato of two-taled P-values. The purpose of ths artcle s to show that such relatoshps also exst betwee the hypergeometrc dstrbuto ad a specal case of the Polya (or beta-bomal dstrbuto, ad to derve some propertes of the hypergeometrc dstrbuto resultg from these relatoshps. KEY WORDS: Beta, bomal, gamma, Posso, ad Polya (or beta-bomal dstrbutos; Cojugate pror dstrbuto; Cumulatve dstrbuto fucto; Posteror dstrbuto.. INTRODUCTION The bomal ad Posso dstrbutos have terestg relatoshps wth the beta ad gamma dstrbutos, respectvely, whch volve ther cumulatve dstrbuto fuctos ad the use of cojugate prors Bayesa statstcs. We wll brefly dscuss these relatoshps ad some propertes resultg from them Sectos 2 ad 3 for the bomal ad Posso dstrbutos, respectvely. The resultg propertes play a mportat role the costructo of exact ested two-sded bomal ad Posso cofdece tervals, ad the computato of exact two-taled bomal ad Posso P-values. The purpose of ths artcle s to show that such relatoshps also exst betwee the hypergeometrc dstrbuto ad a specal case of the Polya (or beta-bomal dstrbuto, ad to derve some propertes of the hypergeometrc dstrbuto resultg from these relatoshps. We shall do ths Secto 4. 2. RELATIONSHIPS AND PROPERTIES OF THE BINOMIAL DISTRIBUTION Suppose that radom varable X has a bomal dstrbuto wth parameters ad p, deoted by X BIN(,p, where s a postvetegerad 0 p. The, foragvead for 0 < p <, the probablty mass fucto (pmf of X, deoted by f X (x p, s ( f X (x p P(X x p p x (p x, x 0,,...,, x 0, otherwse,

ad f X (0 0 f X (. Suppose that radom varable Y has a beta dstrbuto wth parameters α > 0 ad β > 0, deoted by Y BETA(α,β. The the probablty desty fucto (pdf of Y, deoted by f Y (y α,β, s x0 f Y (y α,β Γ(αβ Γ(αΓ(β yα (y β, 0 y, 0, otherwse, where the gamma fucto Γ(κ t κ e t dt for all κ > 0. 0 Successve tegrato by parts leads to a relatoshp betwee the cumulatve dstrbuto fuctos (cdf s of the bomal ad beta dstrbutos. If X BIN(,p ad Y BETA(, for teger, 0, the ( p x (p x! p t (t dt. ( x!(! That s, F X ( p P(X p P(Y p, F Y (p,. For fxed teger, 0, t follows from equato ( that the fucto P(X p s cotuous ad decreasg p; for fxed teger j, j, P(X j p P(X j p s cotuous ad creasg p; ad for fxed tegers ad j, j, P( X j p s cotuous, ad creasg for 0 p < p (,j ad decreasg for p (,j p wth maxmum at p p (,j {[( (j/j ] /(j }. Also, p (0,j 0 for 0 j ad p (, for. Suppose that the bomal parameter p s ukow ad we wsh to estmate t. I Bayesa statstcs, formato obtaed from the data x, a realzato of X BIN(, p, s combed wth pror formato about p that s specfed a pror dstrbuto wth pdf g(p ad summarzed a posterordstrbuto wth pdf h(p x whch s dervedfrom the jot dstrbuto f X (x pg(p, ad accordg to Bayes formula s h(p x 0 f X (x pg(p 0 f X(x pg(pdp. (2 Because h(p x s geerally ot avalable closed form, the favoured types of prors utl the troducto of Markov cha Mote Carlo methods have bee those allowg explct computatos, amely cojugate prors. These are pror dstrbutos for whch the correspodg posteror dstrbutos are themselves members of the orgal pror famly, the Bayesa updatg beg accomplshed through updatg of parameters. For a realzato x of X BIN(,p, a famly of cojugate prors s the famly of beta dstrbutos BETA(α, β where we ote from equato (2 that for x 0,,...,, ( x p x (p x Γ(αβ Γ(αΓ(β pα (p β h(p x ( 0 x px (p x Γ(αβ Γ(αΓ(β pα (p β dp Γ(αβ Γ(αxΓ(β x pαx (p βx, 0 p, 0, otherwse. 2

That s, the posteror dstrbuto s also beta wth updated parameters αx ad β x. 3. RELATIONSHIPS AND PROPERTIES OF THE POISSON DISTRIBUTION Suppose that radom varable X has a Posso dstrbuto wth parameter λ 0, deoted by X POI(λ. The, for λ > 0, the pmf of X, deoted by f X (x λ, s f X (x λ P(X x λ eλ λ x, x 0,,2,..., x! 0, otherwse, ad f X (0 0. Suppose radom varable Y has a gammadstrbuto wth parametersα > 0 ad β > 0, deoted by Y GAM(α,β. The the pdf of Y, deoted by f Y (y α,β, s f Y (y α,β β α Γ(α yα e y/β, y > 0, 0, otherwse. Successve tegrato by parts leads to a relatoshp betwee the cdf s of the Posso ad gamma dstrbutos. If X POI(λ ad Y GAM(,2 for oegatve teger, the e λ λ x x0 x! 2! 2λ 0 t e t/2 dt. (3 That s, F X ( λ P(X λ P(Y 2λ,2 F Y (2λ,2. For fxed oegatve teger, t follows from equato (3 that the fucto P(X λ s cotuous ad decreasg λ; for postve teger j, P(X j λ P(X j λ s cotuous ad creasg λ; ad for j, P( X j λ s cotuous, ad creasg for 0 λ < λ(,j ad decreasg for λ λ(,j wth maxmum at λ λ(,j ( j /(j. Also, λ(0,j 0 for j 0. Suppose that the Posso parameter λ s ukow ad we wsh to estmate t usg Bayesa methods. For a realzato x of X POI(λ, a famly of cojugate prors s the famly of gamma dstrbutos GAM(α,β where for x 0,,2,, the pdf h(λ x of the posteror dstrbuto s gve by h(λ x 0 e λ λ x x! e λ λ x x! β α Γ(α λα e λ/β β α Γ(α λα e λ/β dλ [β/(β] αx Γ(αx λαx e λ/[β/(β], λ > 0, 0, otherwse. That s, the posteror dstrbuto s also gamma wth updated parameters αx ad β/(β. 3

4. RELATIONSHIPS AND PROPERTIES OF THE HYPERGEOMETRIC DISTRIBUTION Suppose that teger-valued radom varable X has a hypergeometrc dstrbuto wth parameters, M, ad N, deoted by X HYP(,M,N, where, M, ad N are tegers wth N ad 0 M N. The, for gve ad N, ad for 0 < M < N, the pmf of X, deoted by f X (x M, s ( M NM f X (x M P(X x M x( x ( N, max(0,n M x m(,m, 0, otherwse, (4 ad f X (0 0 f X ( N. Suppose that radom varable Y has a specally defed dscrete dstrbuto wth parameters a, b, ad c, deoted by Y ABC(a,b,c, where a, b, ad c are oegatve tegers. The, for c > 0, the pmf of Y, deoted by f Y (y a,b,c, s f Y (y a,b,c P(Y y a,b,c ( ay ( bcy a b ( abc ab 0, otherwse,, y 0,,...,c, ad f Y (0 a,b,0. We ote that formula (2.6 of Feller (968, p.65 ca be used to prove that c ( ( ay bcy a b y0 ( abc. ab We also ote that the ABC dstrbuto s just a specal case of the Polya (or beta-bomal dstrbuto (Dyer ad Perce, 993, p.230. From equato (4, t easly follows that P(X M for 0 M N. For 0 < N ad 0 M N, we have from equato (4 that ( N ( ( M N M P(X M x x x0 ( [( ( ] M N M N M x x x x0 ( ( M N M ( ( M N M x x x x x0 x0 ( ( M N M ( ( M N M x x x x x x0 4

( M ( M ( M ( N M [( M x x0 ( N M ( N M ( M x ( ( M N M ]( N M x ( ( M N M x x x0 ( N P(X M, (5 where by defto ( ( M 0, M ( 0 f M <, ad NM 0 f M > N. Furthermore, from the recurso relatoshp equato (5, t follows that P(X M N km N km ( k ( N k /( N ( ( k N k M k0 /( N ( ( k N k /( N. (6 That s, f X HYP(,M,N ad Y ABC(,,N for teger, 0 < N, the F X ( M P(X M P(Y M,,N F Y (M,,N where, partcular, P(X M, f 0 M, 0, f N < M N. (7 For 0 < j < N ad 0 M N, we have from equato (5 that ( ( ( N N N P( X j M P(X j M P(X M ( ( ( M N M N P(X j M j j ( ( ( M N M N P(X M ( ( ( ( M N M M N M j j ( N P( X j M. (8 5

Smlar to the determato of equato (6, t follows from the recurso relatoshp equato (8 that P( X j M where, partcular, Nj km N kmj M l0 ( k j ( N k j /( N N lm ( l ( ( /( j k j N k N j j N lm ( ( /( l N l N ( ( l N l Mj k0 /( N ( ( j k j N k j j /( N ( N l /( N P( X j M 0, f ether 0 M < or N j < M N. (0 We ote equato (8 that the dfferece ( ( ( ( ( M N M M N M N < 0, f M, j j ( N j > 0, f M N j, ( j ad for M < N j, the same dfferece ( ( ( ( M N M M N M j j M! (N M! j!(m j! (j!(n M j! M! (N M! (!(M! (!(N M! M!(N M! (!(M j!(j!(n M! [ (j (N M j (N M ] (2 (M (M j ( (j where as M creases, the term /(N M j (N M creases ad the term /(M (M j decreases so that as M creases betwee ad N j, the dfferece ( ( M NM ( j j M ( NM goes from beg egatve to beg postve ad stayg postve. (9 6

I summary, P(X M equals for 0 M N, ad for fxed teger, 0 < N, we see from equatos (6 ad (7 that P(X M equals for 0 M, s decreasg for < M N, ad equals 0 for N < M N; P(X M equals 0 for 0 M N, ad for fxed teger j, j N, P(X j M P(X j M equals 0 for0 M j, s creasgforj < M Nj, adequalsfornj < M N; ad we see from equatos (8 to (2 that for fxed tegers ad j, 0 < j < N where we defe M,N (,j m{m M N j ad ( M j ( NM ( j M ( NM P( X j M equals 0 for 0 M <, s creasg for M < M,N (,j, s decreasg for M,N (,j < M N j, ad equals 0 for N j < M N wth maxmum at ether M,N (,j f ( ( M NM ( j j > M ( NM for M M,N (,j so that P( X j M (,N (,j > P( X j M,N (,j or maxmum at both M,N (,j ad M,N (,j f M ( NM ( j j M ( NM for M M,N (,j so that P( X j M,N (,j P( X j M,N (,j. Suppose that the hypergeometrc parameters ad N are kow but M s ot ad we wsh to estmate t usg Bayesamethods. For a realzato x of X HYP(,M,N, a famly of cojugate prors for M x s the famly of dscrete dstrbutos ABC(a,b,N where for x 0,,...,, the pmf h(m x of the posteror dstrbuto for M s gve by h(m x ( M x( NM x ( am a ( bnm b ( N ( abn ab Nx ( M x( NM x ( am a ( bnm b Mx ( N ( abn ab ( am ( bnm ax bx ( abn ab, x M N x, 0, otherwse, (3 from whch t easly follows that the pmf h(m x x of the posteror dstrbuto for M x s gve by h(m x x ( axmx ( bxnmx ax bx ( axbxn axbx, 0 M x N, 0, otherwse. (4 That s, the posteror dstrbuto for M x s also ABC wth updated parameters ax, bx, ad N. Fally, we ote that as a famly of cojugate prors for the hypergeometrc dstrbuto HYP(,M,N, the famly of dscrete dstrbutos ABC(a,b,N has, addto to umodal members, strctly creasg members ABC(a, 0, N, strctly decreasg members ABC(0, b, N, ad the dscrete uform dstrbuto ABC(0, 0, N. }, 7

REFERENCES Dyer, D. ad Perce, R. L. (993, O the choce of the pror dstrbuto hypergeometrc samplg, Commucatos Statstcs - Theory ad Methods, 22(8, 225-246. Feller, W. (968, A Itroducto to Probablty Theory ad Its Applcatos, Vol., (3rd ed., Joh Wley & Sos, Ic. 8