arXiv:1610.07554v1 [math.ST] 24 Oct 2016

Some Relationships and Properties of the Hypergeometric Distribution

Peter H. Peskun, Department of Mathematics and Statistics, York University, Toronto, Ontario M3J 1P3, Canada
E-mail: peskun@pascal.math.yorku.ca

Abstract

The binomial and Poisson distributions have interesting relationships with the beta and gamma distributions, respectively, which involve their cumulative distribution functions and the use of conjugate priors in Bayesian statistics. We briefly discuss these relationships and some properties resulting from them which play an important role in the construction of exact nested two-sided confidence intervals and the computation of two-tailed P-values. The purpose of this article is to show that such relationships also exist between the hypergeometric distribution and a special case of the Polya (or beta-binomial) distribution, and to derive some properties of the hypergeometric distribution resulting from these relationships.

KEY WORDS: Beta, binomial, gamma, Poisson, and Polya (or beta-binomial) distributions; Conjugate prior distribution; Cumulative distribution function; Posterior distribution.

1. INTRODUCTION

The binomial and Poisson distributions have interesting relationships with the beta and gamma distributions, respectively, which involve their cumulative distribution functions and the use of conjugate priors in Bayesian statistics. We will briefly discuss these relationships and some properties resulting from them in Sections 2 and 3 for the binomial and Poisson distributions, respectively. The resulting properties play an important role in the construction of exact nested two-sided binomial and Poisson confidence intervals, and the computation of exact two-tailed binomial and Poisson P-values. The purpose of this article is to show that such relationships also exist between the hypergeometric distribution and a special case of the Polya (or beta-binomial) distribution, and to derive some properties of the hypergeometric distribution resulting from these relationships. We shall do this in Section 4.

2. RELATIONSHIPS AND PROPERTIES OF THE BINOMIAL DISTRIBUTION

Suppose that random variable $X$ has a binomial distribution with parameters $n$ and $p$, denoted by $X \sim \mathrm{BIN}(n, p)$, where $n$ is a positive integer and $0 \le p \le 1$.
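Then, for a given $n$ and for $0 < p < 1$, the probability mass function (pmf) of $X$, denoted by $f_X(x \mid p)$, is
$$f_X(x \mid p) = P(X = x \mid p) = \begin{cases} \binom{n}{x} p^x (1-p)^{n-x}, & x = 0, 1, \ldots, n, \\ 0, & \text{otherwise}, \end{cases}$$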
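and $f_X(0 \mid 0) = 1 = f_X(n \mid 1)$.

Suppose that random variable $Y$ has a beta distribution with parameters $\alpha > 0$ and $\beta > 0$, denoted by $Y \sim \mathrm{BETA}(\alpha, \beta)$. Then the probability density function (pdf) of $Y$, denoted by $f_Y(y \mid \alpha, \beta)$, is
$$f_Y(y \mid \alpha, \beta) = \begin{cases} \dfrac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}\, y^{\alpha-1} (1-y)^{\beta-1}, & 0 \le y \le 1, \\ 0, & \text{otherwise}, \end{cases}$$
where the gamma function $\Gamma(\kappa) = \int_0^\infty t^{\kappa-1} e^{-t}\, dt$ for all $\kappa > 0$.

Successive integration by parts leads to a relationship between the cumulative distribution functions (cdfs) of the binomial and beta distributions. If $X \sim \mathrm{BIN}(n, p)$ and $Y \sim \mathrm{BETA}(n-i, i+1)$ for integer $i$, $0 \le i \le n-1$, then
$$\sum_{x=0}^{i} \binom{n}{x} p^x (1-p)^{n-x} = \frac{n!}{i!\,(n-i-1)!} \int_0^{1-p} t^{n-i-1} (1-t)^{i}\, dt. \tag{1}$$
That is, $F_X(i \mid p) = P(X \le i \mid p) = P(Y \le 1-p \mid n-i, i+1) = F_Y(1-p \mid n-i, i+1)$.

For fixed integer $i$, $0 \le i \le n-1$, it follows from equation (1) that the function $P(X \le i \mid p)$ is continuous and decreasing in $p$; for fixed integer $j$, $1 \le j \le n$, $P(X \ge j \mid p) = 1 - P(X \le j-1 \mid p)$ is continuous and increasing in $p$; and for fixed integers $i$ and $j$, $1 \le i \le j \le n-1$, $P(i \le X \le j \mid p)$ is continuous, and increasing for $0 \le p < p(i,j)$ and decreasing for $p(i,j) \le p \le 1$ with maximum at
$$p = p(i,j) = \left\{ 1 + \left[ \frac{(n-i) \cdots (n-j)}{j \cdots i} \right]^{1/(j-i+1)} \right\}^{-1}.$$
Also, $p(0,j) = 0$ for $0 \le j \le n-1$ and $p(i,n) = 1$ for $1 \le i \le n$.

Suppose that the binomial parameter $p$ is unknown and we wish to estimate it. In Bayesian statistics, information obtained from the data $x$, a realization of $X \sim \mathrm{BIN}(n, p)$, is combined with prior information about $p$ that is specified in a prior distribution with pdf $g(p)$ and summarized in a posterior distribution with pdf $h(p \mid x)$, which is derived from the joint distribution $f_X(x \mid p)\, g(p)$ and, according to Bayes' formula, is
$$h(p \mid x) = \frac{f_X(x \mid p)\, g(p)}{\int_0^1 f_X(x \mid p)\, g(p)\, dp}. \tag{2}$$
Because $h(p \mid x)$ is generally not available in closed form, the favoured types of priors until the introduction of Markov chain Monte Carlo methods have been those allowing explicit computations, namely conjugate priors. These are prior distributions for which the corresponding posterior distributions are themselves members of the original prior family, the Bayesian updating being accomplished through updating of parameters. For a realization $x$ of $X \sim \mathrm{BIN}(n, p)$, a family of conjugate priors is the family of beta distributions $\mathrm{BETA}(\alpha, \beta)$, where we note from equation (2) that for $x = 0, 1, \ldots, n$,
$$h(p \mid x) = \frac{\binom{n}{x} p^x (1-p)^{n-x}\, \frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}\, p^{\alpha-1} (1-p)^{\beta-1}}{\int_0^1 \binom{n}{x} p^x (1-p)^{n-x}\, \frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}\, p^{\alpha-1} (1-p)^{\beta-1}\, dp} = \begin{cases} \dfrac{\Gamma(\alpha+\beta+n)}{\Gamma(\alpha+x)\,\Gamma(\beta+n-x)}\, p^{\alpha+x-1} (1-p)^{\beta+n-x-1}, & 0 \le p \le 1, \\ 0, & \text{otherwise}. \end{cases}$$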
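That is, the posterior distribution is also beta with updated parameters $\alpha + x$ and $\beta + n - x$.

3. RELATIONSHIPS AND PROPERTIES OF THE POISSON DISTRIBUTION

Suppose that random variable $X$ has a Poisson distribution with parameter $\lambda \ge 0$, denoted by $X \sim \mathrm{POI}(\lambda)$. Then, for $\lambda > 0$, the pmf of $X$, denoted by $f_X(x \mid \lambda)$, is
$$f_X(x \mid \lambda) = P(X = x \mid \lambda) = \begin{cases} \dfrac{e^{-\lambda} \lambda^x}{x!}, & x = 0, 1, 2, \ldots, \\ 0, & \text{otherwise}, \end{cases}$$
and $f_X(0 \mid 0) = 1$.

Suppose random variable $Y$ has a gamma distribution with parameters $\alpha > 0$ and $\beta > 0$, denoted by $Y \sim \mathrm{GAM}(\alpha, \beta)$. Then the pdf of $Y$, denoted by $f_Y(y \mid \alpha, \beta)$, is
$$f_Y(y \mid \alpha, \beta) = \begin{cases} \dfrac{1}{\beta^{\alpha}\, \Gamma(\alpha)}\, y^{\alpha-1} e^{-y/\beta}, & y > 0, \\ 0, & \text{otherwise}. \end{cases}$$

Successive integration by parts leads to a relationship between the cdfs of the Poisson and gamma distributions. If $X \sim \mathrm{POI}(\lambda)$ and $Y \sim \mathrm{GAM}(i+1, 2)$ for nonnegative integer $i$, then
$$\sum_{x=0}^{i} \frac{e^{-\lambda} \lambda^x}{x!} = 1 - \frac{1}{2^{i+1}\, i!} \int_0^{2\lambda} t^{i} e^{-t/2}\, dt. \tag{3}$$
That is, $F_X(i \mid \lambda) = P(X \le i \mid \lambda) = 1 - P(Y \le 2\lambda \mid i+1, 2) = 1 - F_Y(2\lambda \mid i+1, 2)$.

For fixed nonnegative integer $i$, it follows from equation (3) that the function $P(X \le i \mid \lambda)$ is continuous and decreasing in $\lambda$; for positive integer $j$, $P(X \ge j \mid \lambda) = 1 - P(X \le j-1 \mid \lambda)$ is continuous and increasing in $\lambda$; and for $1 \le i \le j$, $P(i \le X \le j \mid \lambda)$ is continuous, and increasing for $0 \le \lambda < \lambda(i,j)$ and decreasing for $\lambda \ge \lambda(i,j)$ with maximum at
$$\lambda = \lambda(i,j) = (i \cdots j)^{1/(j-i+1)}.$$
Also, $\lambda(0,j) = 0$ for $j \ge 0$.

Suppose that the Poisson parameter $\lambda$ is unknown and we wish to estimate it using Bayesian methods. For a realization $x$ of $X \sim \mathrm{POI}(\lambda)$, a family of conjugate priors is the family of gamma distributions $\mathrm{GAM}(\alpha, \beta)$, where for $x = 0, 1, 2, \ldots$, the pdf $h(\lambda \mid x)$ of the posterior distribution is given by
$$h(\lambda \mid x) = \frac{\frac{e^{-\lambda} \lambda^x}{x!}\, \frac{1}{\beta^{\alpha} \Gamma(\alpha)}\, \lambda^{\alpha-1} e^{-\lambda/\beta}}{\int_0^\infty \frac{e^{-\lambda} \lambda^x}{x!}\, \frac{1}{\beta^{\alpha} \Gamma(\alpha)}\, \lambda^{\alpha-1} e^{-\lambda/\beta}\, d\lambda} = \begin{cases} \dfrac{1}{[\beta/(\beta+1)]^{\alpha+x}\, \Gamma(\alpha+x)}\, \lambda^{\alpha+x-1} e^{-\lambda/[\beta/(\beta+1)]}, & \lambda > 0, \\ 0, & \text{otherwise}. \end{cases}$$
That is, the posterior distribution is also gamma with updated parameters $\alpha + x$ and $\beta/(\beta+1)$.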
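As a numerical illustration of the cdf identities in equations (1) and (3) and of the maximizers $p(i,j)$ and $\lambda(i,j)$, the following Python sketch may be helpful. It is not part of the original derivation: the function names (`simpson`, `beta_side`, `gamma_side`, `p_max`, `lam_max`) are hypothetical, and the integrals are approximated with a composite Simpson rule rather than evaluated in closed form.

```python
import math

def simpson(f, a, b, m=2000):
    """Composite Simpson rule on [a, b] with m (even) subintervals."""
    h = (b - a) / m
    s = f(a) + f(b)
    for k in range(1, m):
        s += (4 if k % 2 else 2) * f(a + k * h)
    return s * h / 3

def binom_cdf(i, n, p):
    """Left-hand side of equation (1): P(X <= i | p) for X ~ BIN(n, p)."""
    return sum(math.comb(n, x) * p**x * (1 - p)**(n - x) for x in range(i + 1))

def beta_side(i, n, p):
    """Right-hand side of equation (1), for 0 <= i <= n - 1."""
    c = math.factorial(n) / (math.factorial(i) * math.factorial(n - i - 1))
    return c * simpson(lambda t: t**(n - i - 1) * (1 - t)**i, 0.0, 1 - p)

def poisson_cdf(i, lam):
    """Left-hand side of equation (3): P(X <= i | lam) for X ~ POI(lam)."""
    return sum(math.exp(-lam) * lam**x / math.factorial(x) for x in range(i + 1))

def gamma_side(i, lam):
    """Right-hand side of equation (3)."""
    c = 1 / (2**(i + 1) * math.factorial(i))
    return 1 - c * simpson(lambda t: t**i * math.exp(-t / 2), 0.0, 2 * lam)

def p_max(i, j, n):
    """Maximizer p(i, j) of P(i <= X <= j | p), for 1 <= i <= j <= n - 1."""
    ratio = math.prod(n - k for k in range(i, j + 1)) / math.prod(range(i, j + 1))
    return 1 / (1 + ratio ** (1 / (j - i + 1)))

def lam_max(i, j):
    """Maximizer lambda(i, j) of P(i <= X <= j | lam), for 1 <= i <= j."""
    return math.prod(range(i, j + 1)) ** (1 / (j - i + 1))

print(binom_cdf(3, 10, 0.3), beta_side(3, 10, 0.3))
print(poisson_cdf(4, 2.5), gamma_side(4, 2.5))
```

The two numbers on each printed line should agree up to the quadrature error, and the interval probabilities evaluated at `p_max(i, j, n)` and `lam_max(i, j)` should not be exceeded at nearby parameter values.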
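4. RELATIONSHIPS AND PROPERTIES OF THE HYPERGEOMETRIC DISTRIBUTION

Suppose that integer-valued random variable $X$ has a hypergeometric distribution with parameters $n$, $M$, and $N$, denoted by $X \sim \mathrm{HYP}(n, M, N)$, where $n$, $M$, and $N$ are integers with $1 \le n \le N$ and $0 \le M \le N$. Then, for given $n$ and $N$, and for $0 < M < N$, the pmf of $X$, denoted by $f_X(x \mid M)$, is
$$f_X(x \mid M) = P(X = x \mid M) = \begin{cases} \dfrac{\binom{M}{x} \binom{N-M}{n-x}}{\binom{N}{n}}, & \max(0, n-N+M) \le x \le \min(n, M), \\ 0, & \text{otherwise}, \end{cases} \tag{4}$$
and $f_X(0 \mid 0) = 1 = f_X(n \mid N)$.

Suppose that random variable $Y$ has a specially defined discrete distribution with parameters $a$, $b$, and $c$, denoted by $Y \sim \mathrm{ABC}(a, b, c)$, where $a$, $b$, and $c$ are nonnegative integers. Then, for $c > 0$, the pmf of $Y$, denoted by $f_Y(y \mid a, b, c)$, is
$$f_Y(y \mid a, b, c) = P(Y = y \mid a, b, c) = \begin{cases} \dfrac{\binom{a+y}{a} \binom{b+c-y}{b}}{\binom{a+b+c+1}{a+b+1}}, & y = 0, 1, \ldots, c, \\ 0, & \text{otherwise}, \end{cases}$$
and $f_Y(0 \mid a, b, 0) = 1$. We note that formula (12.16) of Feller (1968, p. 65) can be used to prove that
$$\sum_{y=0}^{c} \binom{a+y}{a} \binom{b+c-y}{b} = \binom{a+b+c+1}{a+b+1}.$$
We also note that the ABC distribution is just a special case of the Polya (or beta-binomial) distribution (Dyer and Pierce, 1993, p. 2130).

From equation (4), it easily follows that $P(X \le n \mid M) = 1$ for $0 \le M \le N$. For $0 \le i < n \le N$ and $0 \le M \le N-1$, we have from equation (4) that
$$\begin{aligned}
\binom{N}{n} P(X \le i \mid M) &= \sum_{x=0}^{i} \binom{M}{x} \binom{N-M}{n-x} \\
&= \sum_{x=0}^{i} \binom{M}{x} \left[ \binom{N-M-1}{n-x} + \binom{N-M-1}{n-x-1} \right] \\
&= \sum_{x=0}^{i} \binom{M}{x} \binom{N-M-1}{n-x} + \sum_{x=0}^{i} \binom{M}{x} \binom{N-M-1}{n-x-1} \\
&= \sum_{x=0}^{i} \binom{M}{x} \binom{N-M-1}{n-x} + \sum_{x=0}^{i} \binom{M}{x-1} \binom{N-M-1}{n-x} + \binom{M}{i} \binom{N-M-1}{n-i-1} \\
&= \binom{M}{i} \binom{N-M-1}{n-i-1} + \sum_{x=0}^{i} \left[ \binom{M}{x} + \binom{M}{x-1} \right] \binom{N-M-1}{n-x} \\
&= \binom{M}{i} \binom{N-M-1}{n-i-1} + \sum_{x=0}^{i} \binom{M+1}{x} \binom{N-M-1}{n-x} \\
&= \binom{M}{i} \binom{N-M-1}{n-i-1} + \binom{N}{n} P(X \le i \mid M+1),
\end{aligned} \tag{5}$$
where by definition $\binom{M}{-1} = 0$, $\binom{M}{i} = 0$ if $M < i$, and $\binom{N-M-1}{n-i-1} = 0$ if $M > N-n+i$.

Furthermore, from the recursion relationship in equation (5), it follows that
$$\begin{aligned}
P(X \le i \mid M) &= \sum_{k=M}^{N-n+i} \binom{k}{i} \binom{N-k-1}{n-i-1} \bigg/ \binom{N}{n} \\
&= 1 - \sum_{k=i}^{M-1} \binom{k}{i} \binom{N-k-1}{n-i-1} \bigg/ \binom{N}{n} \\
&= 1 - \sum_{k=0}^{M-i-1} \binom{i+k}{i} \binom{N-i-k-1}{n-i-1} \bigg/ \binom{N}{n}.
\end{aligned} \tag{6}$$
That is, if $X \sim \mathrm{HYP}(n, M, N)$ and $Y \sim \mathrm{ABC}(i, n-i-1, N-n)$ for integer $i$, $0 \le i < n \le N$, then
$$F_X(i \mid M) = P(X \le i \mid M) = 1 - P(Y \le M-i-1 \mid i, n-i-1, N-n) = 1 - F_Y(M-i-1 \mid i, n-i-1, N-n),$$
where, in particular,
$$P(X \le i \mid M) = \begin{cases} 1, & \text{if } 0 \le M \le i, \\ 0, & \text{if } N-n+i < M \le N. \end{cases} \tag{7}$$

For $0 < i \le j < n \le N$ and $0 \le M \le N-1$, we have from equation (5) that
$$\begin{aligned}
\binom{N}{n} P(i \le X \le j \mid M) &= \binom{N}{n} P(X \le j \mid M) - \binom{N}{n} P(X \le i-1 \mid M) \\
&= \binom{M}{j} \binom{N-M-1}{n-j-1} + \binom{N}{n} P(X \le j \mid M+1) \\
&\quad - \binom{M}{i-1} \binom{N-M-1}{n-i} - \binom{N}{n} P(X \le i-1 \mid M+1) \\
&= \binom{M}{j} \binom{N-M-1}{n-j-1} - \binom{M}{i-1} \binom{N-M-1}{n-i} + \binom{N}{n} P(i \le X \le j \mid M+1).
\end{aligned} \tag{8}$$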
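The relationship between the hypergeometric and ABC cdfs in equations (6) and (7), together with the recursion in equation (5), can be checked with exact rational arithmetic. The following Python sketch is offered only as an illustration; the function names (`hyp_pmf`, `hyp_cdf`, `abc_pmf`, `abc_cdf`, `hyp_cdf_via_abc`) are hypothetical and do not appear in the original article.

```python
from fractions import Fraction
from math import comb

def hyp_pmf(x, n, M, N):
    """Hypergeometric pmf of equation (4); zero outside the support."""
    if x < max(0, n - N + M) or x > min(n, M):
        return Fraction(0)
    return Fraction(comb(M, x) * comb(N - M, n - x), comb(N, n))

def hyp_cdf(i, n, M, N):
    """P(X <= i | M), summed directly from the pmf."""
    return sum((hyp_pmf(x, n, M, N) for x in range(i + 1)), Fraction(0))

def abc_pmf(y, a, b, c):
    """pmf of ABC(a, b, c); zero outside y = 0, 1, ..., c."""
    if y < 0 or y > c:
        return Fraction(0)
    return Fraction(comb(a + y, a) * comb(b + c - y, b),
                    comb(a + b + c + 1, a + b + 1))

def abc_cdf(y, a, b, c):
    """P(Y <= y | a, b, c); equals 0 for y < 0 and 1 for y >= c."""
    return sum((abc_pmf(k, a, b, c) for k in range(min(y, c) + 1)), Fraction(0))

def hyp_cdf_via_abc(i, n, M, N):
    """P(X <= i | M) = 1 - F_Y(M - i - 1 | i, n - i - 1, N - n), 0 <= i < n."""
    return 1 - abc_cdf(M - i - 1, i, n - i - 1, N - n)

# Example with N = 12, n = 5, i = 2: the two columns agree for every M.
for M in range(13):
    print(M, hyp_cdf(2, 5, M, 12), hyp_cdf_via_abc(2, 5, M, 12))
```

Because `Fraction` is exact, the agreement is an identity check rather than a floating-point comparison; the boundary cases of equation (7) appear as the values 1 (for $M \le i$) and 0 (for $M > N-n+i$).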
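Similar to the determination of equation (6), it follows from the recursion relationship in equation (8) that
$$\begin{aligned}
P(i \le X \le j \mid M) &= \sum_{k=M}^{N-n+j} \binom{k}{j} \binom{N-k-1}{n-j-1} \bigg/ \binom{N}{n} - \sum_{l=M}^{N-n+i-1} \binom{l}{i-1} \binom{N-l-1}{n-i} \bigg/ \binom{N}{n} \\
&= \sum_{l=i-1}^{M-1} \binom{l}{i-1} \binom{N-l-1}{n-i} \bigg/ \binom{N}{n} - \sum_{k=j}^{M-1} \binom{k}{j} \binom{N-k-1}{n-j-1} \bigg/ \binom{N}{n} \\
&= \sum_{l=0}^{M-i} \binom{i-1+l}{i-1} \binom{N-i-l}{n-i} \bigg/ \binom{N}{n} - \sum_{k=0}^{M-j-1} \binom{j+k}{j} \binom{N-j-k-1}{n-j-1} \bigg/ \binom{N}{n},
\end{aligned} \tag{9}$$
where, in particular,
$$P(i \le X \le j \mid M) = 0, \quad \text{if either } 0 \le M < i \text{ or } N-n+j < M \le N. \tag{10}$$

We note in equation (8) that the difference
$$\binom{M}{j} \binom{N-M-1}{n-j-1} - \binom{M}{i-1} \binom{N-M-1}{n-i} = \begin{cases} -\dbinom{N-i}{n-i} < 0, & \text{if } M = i-1, \\[2ex] \dbinom{N-n+j}{j} > 0, & \text{if } M = N-n+j, \end{cases} \tag{11}$$
and for $i \le M < N-n+j$, the same difference
$$\begin{aligned}
&\binom{M}{j} \binom{N-M-1}{n-j-1} - \binom{M}{i-1} \binom{N-M-1}{n-i} \\
&\quad = \frac{M!}{j!\,(M-j)!} \cdot \frac{(N-M-1)!}{(n-j-1)!\,(N-M-n+j)!} - \frac{M!}{(i-1)!\,(M-i+1)!} \cdot \frac{(N-M-1)!}{(n-i)!\,(N-M-n+i-1)!} \\
&\quad = \frac{M!\,(N-M-1)!}{(i-1)!\,(M-j)!\,(n-j-1)!\,(N-M-n+i-1)!} \\
&\qquad \times \left[ \frac{1}{j \cdots i\,(N-M-n+j) \cdots (N-M-n+i)} - \frac{1}{(M-i+1) \cdots (M-j+1)\,(n-i) \cdots (n-j)} \right],
\end{aligned} \tag{12}$$
where as $M$ increases, the term $1/[(N-M-n+j) \cdots (N-M-n+i)]$ increases and the term $1/[(M-i+1) \cdots (M-j+1)]$ decreases, so that as $M$ increases between $i-1$ and $N-n+j$, the difference $\binom{M}{j} \binom{N-M-1}{n-j-1} - \binom{M}{i-1} \binom{N-M-1}{n-i}$ goes from being negative to being positive and staying positive.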
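The recursion in equation (8), the closed form in equation (9), and the sign change of the difference in equations (11) and (12) can be illustrated numerically. The Python sketch below is only a check under stated assumptions: the function names are hypothetical, and the example values $N = 12$, $n = 5$, $i = 1$, $j = 2$ are chosen here for illustration and do not come from the article.

```python
from fractions import Fraction
from math import comb

def interval_prob(i, j, n, M, N):
    """P(i <= X <= j | M), summed directly from the pmf in equation (4).
    Assumes 1 <= i <= j <= n - 1 and 0 <= M <= N."""
    lo, hi = max(i, n - N + M), min(j, M)
    total = sum(comb(M, x) * comb(N - M, n - x) for x in range(lo, hi + 1))
    return Fraction(total, comb(N, n))

def interval_prob_eq9(i, j, n, M, N):
    """The last expression in equation (9); empty sums count as zero."""
    first = sum(comb(i - 1 + l, i - 1) * comb(N - i - l, n - i)
                for l in range(M - i + 1))
    second = sum(comb(j + k, j) * comb(N - j - k - 1, n - j - 1)
                 for k in range(M - j))
    return Fraction(first - second, comb(N, n))

def difference(i, j, n, M, N):
    """The difference in equations (8), (11) and (12), for 0 <= M <= N - 1.
    By equation (8) it equals C(N, n) * [P(.|M) - P(.|M + 1)]."""
    return (comb(M, j) * comb(N - M - 1, n - j - 1)
            - comb(M, i - 1) * comb(N - M - 1, n - i))

def first_nonnegative(i, j, n, N):
    """Smallest M in [i, N - n + j] at which the difference is >= 0."""
    return min(M for M in range(i, N - n + j + 1)
               if difference(i, j, n, M, N) >= 0)

N, n, i, j = 12, 5, 1, 2
for M in range(i - 1, N - n + j + 1):
    print(M, difference(i, j, n, M, N), interval_prob(i, j, n, M, N))
print("first M with nonnegative difference:", first_nonnegative(i, j, n, N))
```

In this example the printed differences are negative up to $M = 2$ and positive from $M = 3$ onward, so the interval probability rises up to $M = 3$ and falls afterwards.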
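In summary, $P(X \le n \mid M)$ equals 1 for $0 \le M \le N$, and for fixed integer $i$, $0 \le i < n \le N$, we see from equations (6) and (7) that $P(X \le i \mid M)$ equals 1 for $0 \le M \le i$, is decreasing for $i < M \le N-n+i$, and equals 0 for $N-n+i < M \le N$; $P(X \ge n+1 \mid M)$ equals 0 for $0 \le M \le N$, and for fixed integer $j$, $1 \le j \le n \le N$, $P(X \ge j \mid M) = 1 - P(X \le j-1 \mid M)$ equals 0 for $0 \le M \le j-1$, is increasing for $j-1 < M \le N-n+j-1$, and equals 1 for $N-n+j-1 < M \le N$; and we see from equations (8) to (12) that for fixed integers $i$ and $j$, $0 < i \le j < n \le N$, where we define
$$M_{n,N}(i,j) = \min\left\{ M : i \le M \le N-n+j \text{ and } \binom{M}{j} \binom{N-M-1}{n-j-1} \ge \binom{M}{i-1} \binom{N-M-1}{n-i} \right\},$$
$P(i \le X \le j \mid M)$ equals 0 for $0 \le M < i$, is increasing for $i \le M < M_{n,N}(i,j)$, is decreasing for $M_{n,N}(i,j) + 1 < M \le N-n+j$, and equals 0 for $N-n+j < M \le N$, with maximum at either $M_{n,N}(i,j)$, if $\binom{M}{j} \binom{N-M-1}{n-j-1} > \binom{M}{i-1} \binom{N-M-1}{n-i}$ for $M = M_{n,N}(i,j)$ so that $P(i \le X \le j \mid M_{n,N}(i,j)) > P(i \le X \le j \mid M_{n,N}(i,j)+1)$, or maximum at both $M_{n,N}(i,j)$ and $M_{n,N}(i,j)+1$, if $\binom{M}{j} \binom{N-M-1}{n-j-1} = \binom{M}{i-1} \binom{N-M-1}{n-i}$ for $M = M_{n,N}(i,j)$ so that $P(i \le X \le j \mid M_{n,N}(i,j)) = P(i \le X \le j \mid M_{n,N}(i,j)+1)$.

Suppose that the hypergeometric parameters $n$ and $N$ are known but $M$ is not and we wish to estimate it using Bayesian methods. For a realization $x$ of $X \sim \mathrm{HYP}(n, M, N)$, a family of conjugate priors for $M$ is the family of discrete distributions $\mathrm{ABC}(a, b, N)$, where for $x = 0, 1, \ldots, n$, the pmf $h(M \mid x)$ of the posterior distribution for $M$ is given by
$$h(M \mid x) = \frac{\dfrac{\binom{M}{x} \binom{N-M}{n-x}}{\binom{N}{n}} \cdot \dfrac{\binom{a+M}{a} \binom{b+N-M}{b}}{\binom{a+b+N+1}{a+b+1}}}{\displaystyle\sum_{M=x}^{N-n+x} \dfrac{\binom{M}{x} \binom{N-M}{n-x}}{\binom{N}{n}} \cdot \dfrac{\binom{a+M}{a} \binom{b+N-M}{b}}{\binom{a+b+N+1}{a+b+1}}} = \begin{cases} \dfrac{\binom{a+M}{a+x} \binom{b+N-M}{b+n-x}}{\binom{a+b+N+1}{a+b+n+1}}, & x \le M \le N-n+x, \\ 0, & \text{otherwise}, \end{cases} \tag{13}$$
from which it easily follows that the pmf $h(M-x \mid x)$ of the posterior distribution for $M - x$ is given by
$$h(M-x \mid x) = \begin{cases} \dfrac{\binom{a+x+M-x}{a+x} \binom{b+n-x+N-n-(M-x)}{b+n-x}}{\binom{a+x+b+n-x+N-n+1}{a+x+b+n-x+1}}, & 0 \le M-x \le N-n, \\ 0, & \text{otherwise}. \end{cases} \tag{14}$$
That is, the posterior distribution for $M - x$ is also ABC with updated parameters $a + x$, $b + n - x$, and $N - n$.

Finally, we note that as a family of conjugate priors for the hypergeometric distribution $\mathrm{HYP}(n, M, N)$, the family of discrete distributions $\mathrm{ABC}(a, b, N)$ has, in addition to unimodal members, strictly increasing members $\mathrm{ABC}(a, 0, N)$, strictly decreasing members $\mathrm{ABC}(0, b, N)$, and the discrete uniform distribution $\mathrm{ABC}(0, 0, N)$.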
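The conjugate updating in equations (13) and (14) can be verified by computing the posterior for $M$ directly from Bayes' formula and comparing it with the closed form. The Python sketch below is illustrative only; the function names and the example values ($N = 10$, $n = 4$, $a = 2$, $b = 1$, $x = 3$) are assumptions made here and are not taken from the article.

```python
from fractions import Fraction
from math import comb

def hyp_pmf(x, n, M, N):
    """Hypergeometric pmf of equation (4); zero outside the support."""
    if x < max(0, n - N + M) or x > min(n, M):
        return Fraction(0)
    return Fraction(comb(M, x) * comb(N - M, n - x), comb(N, n))

def abc_pmf(y, a, b, c):
    """pmf of ABC(a, b, c); zero outside y = 0, 1, ..., c."""
    if y < 0 or y > c:
        return Fraction(0)
    return Fraction(comb(a + y, a) * comb(b + c - y, b),
                    comb(a + b + c + 1, a + b + 1))

def posterior_by_bayes(x, n, N, a, b):
    """Posterior pmf of M over 0..N: likelihood times ABC(a, b, N) prior,
    normalized by brute force."""
    joint = [hyp_pmf(x, n, M, N) * abc_pmf(M, a, b, N) for M in range(N + 1)]
    total = sum(joint, Fraction(0))
    return [w / total for w in joint]

def posterior_closed_form(M, x, n, N, a, b):
    """Equation (14): M - x follows ABC(a + x, b + n - x, N - n)."""
    return abc_pmf(M - x, a + x, b + n - x, N - n)

N, n, a, b, x = 10, 4, 2, 1, 3
brute = posterior_by_bayes(x, n, N, a, b)
for M in range(N + 1):
    print(M, brute[M], posterior_closed_form(M, x, n, N, a, b))
```

The two printed columns coincide for every $M$, with positive mass only on $x \le M \le N-n+x$, which is the support stated in equation (13).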
REFERENCES

Dyer, D. and Pierce, R. L. (1993), On the choice of the prior distribution in hypergeometric sampling, Communications in Statistics - Theory and Methods, 22(8), 2125-2146.

Feller, W. (1968), An Introduction to Probability Theory and Its Applications, Vol. 1, (3rd ed.), John Wiley & Sons, Inc.