Research Article Metric Divergence Measures and Information Value in Credit Scoring

Size: px
Start display at page:

Download "Research Article Metric Divergence Measures and Information Value in Credit Scoring"

Transcription

1 Joural of Mathematics Volume 013, Article ID 84871, 10 pages Research Article Metric Divergece Measures ad Iformatio Value i Credit Scorig Guopig Zeg Thik Fiace, 4150 Iteratioal Plaza, Fort Worth, TX 76109, USA Correspodece should be addressed to Guopig Zeg; guopigtx@yahoocom Received 13 August 013; Accepted 4 September 013 Academic Editor: Baodig Liu Copyright 013 Guopig Zeg This is a ope access article distributed uder the Creative Commos Attributio Licese, which permits urestricted use, distributio, ad reproductio i ay medium, provided the origial work is properly cited Recetly, a series of divergece measures have emerged from iformatio theory ad statistics ad umerous iequalities have bee established amog them However, oe of them are a metric i topology I this paper, we propose a class of metric divergece measures, amely, L p (P Q), P 1,ad study their mathematical properties We the study a importat divergece measure widely used i credit scorig, called iformatio value I particular, we explore the mathematical reasoig of weight of evidece ad suggest a better alterative to weight of evidece Fially, we propose usig L p (P Q) as alteratives to iformatio value to overcome its disadvatages 1 Itroductio The iformatio measure is a importat cocept i Iformatio theory ad statistics It is related to the system of measuremet of iformatio or the amout of iformatiobasedotheprobabilitiesoftheevetsthatcovey iformatio Divergece measures are a importat type of iformatio measures They are commoly used to fid appropriate distace or differece betwee two probability distributios Let Δ ={P=(p 1,p,,p ) p i 0, p i =1},, be the set of fiite discrete probability distributios as i [1] For all P, Q Δ, the followig divergece measures are well kow i the literature of iformatio theory ad statistics Helliger Discrimiatio [] h (P Q) = 1 (1) ( p i ) () Shao s Etropy [3] H (P) = p i log (p i ), (3) which is sometimes referred to as measure of ucertaity The etropy H(P) of a discrete radom variable is defied i terms of its probability distributio P ad is a good measure of radomess or ucertaity Note that i the origial defiitio of Shao s Etropy thelogistothebaseadetropyisexpressedibitsi iformatio theory The log ca be ay other bases ad the etropy will be a costat factor of the oe i base by the chage-base formula of the logarithm fuctio Hece, without loss of geerality, we will assume all the logs are atural logarithms Kullback ad Leibler s Relative Iformatio [4] D (P Q) = p i l ( p i ) (4) Its symmetric form is the well-kow J-divergece

2 Joural of Mathematics J-Divergece (Jeffreys [5], Kullback, ad Leibler [4]) J (P Q) =D(P Q) +D(Q P) = Triagular Discrimiatio [6] Δ (P Q) = (p i ) (p i ) l ( p i ) (5) p i + (6) Symmetric Chi-Square Divergece (Dragomir et al [7]) Oe has ψ (P Q) =χ (P Q) +χ (Q P) = (p i ) (p i + ) p i, where χ (P Q) = ((p i ) /p i ) is the well-kow χ - divergece (Pearso [8]) Jese-Shao Divergece (Sibso [9], Burbea, ad Rao [10, 11]) I (P Q) = 1 [ p i l ( p i )+ q p i + l ( )] i p i + Arithmetic-Geometric Divergece (Taeja [1]) Moreover T (P Q) = p i + Taeja s Divergece (Taeja [1]) Oe has d (P Q) =1 ( p i + (7) (8) l ( p i + p i ) (9) )( p i + ) (10) The iformatio measures J(P Q), I(P Q) ad T(P Q) cabewritteas J (P Q) =4[I (P Q) +T(P Q)], I (P Q) = 1 T (P Q) = 1 [D (P P+Q [D (P+Q )+D(Q P+Q )], P)+D( P+Q Q)] (11) Relative Iformatio of Type s Cressie ad Read [13] cosidered the oe-parametric geeralizatio of iformatio measure D(P Q), called the relative iformatio of type s give by D s (P Q) = [s (s 1)] 1 [ p s i q1 s i 1], s=0,1 (1) It also has some special cases: (i) lim s 0 D s (P Q) = D(Q P), (ii) lim s 1 D s (P Q)=D(P Q), (iii) D 1 (P Q) = (1/)ψ (Q P), (iv) D 1/ (P Q)=4h(P Q), (v) D (P Q) = (1/)ψ (P Q), (vi) D (P Q)=D 1 (Q P), (vii) D 1 (P Q)=D 0 (Q P) It is show that D s (P Q)is oegative ad covex i P ad Q i [1] J-Divergece of Type s [14] V s (P Q) =D s (P Q) +D s (Q P) = [s (s 1)] 1 [ (p s i q1 s i It admits the followig particular cases: +q s i p1 s i ) ], s =0,1 (13) (i) lim s 0 V s (P Q) = lim s 1 V s (P Q)=J(P Q), (ii) lim s 1 V s (P Q)=J(P Q), (iii) V 1 (P Q)=V (P Q) = (1/)Ψ(P Q), (iv) V 0 (P Q)=V 1 (P Q)=J(P Q), (v) V 1/ (P Q)=8h(P Q) Uified Geeralizatio of Jese-Shao Divergece ad Arithmetic-Geometry Mea Divergece [14] W s (P Q) = 1 [D s ( P+Q P)+D s ( P+Q It admits the followig particular cases: (i) W 1 (P Q) = (1/4)Δ(P Q), (ii) W 0 (P Q)=I(P Q), (iii) W 1/ (P Q)=4d(P Q), (iv) W 1 (P Q)=T(P Q), (v) W (P Q)=(1/16)Ψ(P Q) Q)] (14) Taeja proved [14]thatallthe3s-type iformatio measures D s (P Q), V s (P Q), adw s (P Q) are oegative adcovexithepair(p, Q) He also obtaied iequalities regardig the various divergece measures: 1 Δ (P Q) I(P Q) h(p Q) 4d(P Q) J (P Q) T(P Q) 1 (15) 16 ψ (P Q)

3 Joural of Mathematics 3 Here, we observe that Δ(P Q)>0Hece,fromtheabove iequalities we see that I(P Q), h(p Q), d(p Q), J(P Q), T(P Q), adψ(p Q)are all positive We also ote that all p i i the origial defiitio of Δ i [1] are required to be positive Yet, i realty some p i may be 0 I this case I(P Q), T(P Q), ψ(p Q), adδ(p Q) will be udefied We have exteded the defiitio of Δ to iclude the cases whe p i =0 We assume that 0 l(0) = 0, which is easily justified by cotiuity sice x l(x) 0 as x 0 For coveiece, we also assume 0 l(0/0) = 0 ad t l(t/0) = 0 for t>0 A problem with the above divergece measures is that oe of them are a real distace, that is, a metric, i topology I this paper, we will study a class of metric divergece measures L p (P Q) We the study the uderlyig mathematics of a special divergece measure called iformatio value, which is widely used i credit scorig We propose usig L p (P Q) as alteratives to IV i order to overcome the disadvatages of iformatio value The rest of this paper is orgaized as follows I Sectio, after reviewig the metric space, we disprove that the above divergece measures are metrics We the study a class of metric divergece measures L p (P Q) Sectio 3 is cocered with iformatio value We examie a rule of thumb ad weight of evidece ad suggest a better alterative to weight of evidece We the propose usig L p (P Q) as alteratives of IV to overcome the disadvatages of iformatio value Sectio 4 presets some umerical results Fially, the paper is cocluded i Sectio 5 Metric Divergece Measures 1 Review of Metric Space Defiitio 3 Two metrics d 1 ad d are equivalet if there exist positive costats α ad β such that α d (x, y) d 1 (x, y) β d (x, y) If two metrics d 1 ad d are equivalet, they will have the same covergece Nometric Divergece Measures Propositio 4 Noe of the divergece measures Δ(P Q), I(P Q), h(p Q), d(p Q), J(P Q), T(P Q), ad ψ(p Q)areametricitopologyIdeed,oeofthemsatisfy thetriagleiequality Proof We disprove them either umerically or aalytically by costructig couter examples i Δ (a) Let p = (0, 1), q = (1, 0), ad r = (05, 05) The Δ(P Q) =, Δ(P R) = Δ(R Q) = /3, ad Δ(P R)+Δ(R Q)=4/3<=Δ(P Q) (b) Let p = (05, 05), q = (0, 08), adr = (03, 07) The I(P Q) = , I(R Q) = , I(P R) = , adi(p R) + I(R Q) < I(P Q) (c) Let p = (0, 1), q = (1, 0), ad r = (05, 05) The h(p, Q) = 1, h(p R) = h(r Q) = (1/)( ),adh(p R) + h(r Q) = <1=h(P Q) (d) Let p = (0, 08), q = (04, 06), adr = (03, 07) The Defiitio 1 Suppose a real valued fuctio d:m M R ad that for all x, y, z of set M (1) d(x, y) 0 (oegative), () d(x, x) = 0 if ad oly if x=y(idetity), (3) d(x, y) = d(y, x) (symmetry), (4) d(x, z) d(x, y) + d(y, z) (triagle iequality) Such a distace fuctio d is called a metric o M,adthe pair (D, M) is called a metric space If d satisfies (1) (3) but ot ecessarily (4), it is called a semimetric Ametricspaceisatopologicalspaceiaaturalmaer, ad therefore all defiitios ad theorems about geeral topological spaces also apply to a metric space For istace, i a metric space oe ca defie ope ad closed sets, covergece of sequeces of poits, compact space, ad coected space D (P R) +D(R Q) D(P Q) = 0 l l l l l 1 08 l 4 3 = 0 (l 3 l 1 ) + 08 (l 8 7 l 4 3 ) + 03 l l 7 6 = 0 l l l l 7 6 = 0 l l l l 6 7 (16) Defiitio Ametricd 1 is said to be upper bouded by aother metric d if there exists a positive costat c such that d 1 (x, y) c d (x, y) for all x, y MIthiscase, d is said to be lower bouded by d 1 If d 1 is upper bouded by d, the the covergece i the metric space (M, d ) implies the covergece i the metric space (M, d 1 ) = 01 l l 6 7 = 01 (l 6 7 l 4 9 ) = 01 l <01 l 1= Hece, D(P R)+D(R Q)<D(P Q) Ideed, D(P Q)is ot symmetric either (see [15])

4 4 Joural of Mathematics (e) Let p = (0, 08), q = (04, 06), r = (03, 07)The Whe p=,weobtaitheeuclideadistace: J (P R) +J(R Q) = (01 l l 8 7 ) + (01 l l 7 6 ) = 01 l 8 3, (17) L (P Q) = Whe p,wehave (p i ) () L (P Q) = max 1 i p i (3) J (P Q) = 0 l + 0 l 4 3 = 0 l 8 3 Hece, J(P R)+J(R Q)<J(P Q) (f) Let p = (05, 05), q = (0, 08), ad r = (03, 07) The T(P Q) = , T(P R) = , T(R Q) = , adt(p R) + T(R Q) < T(P Q) (g) Let p = (05, 05), q = (0, 08), r = (03, 07) The, ψ(p Q) = 369/400, ψ(q R) = 185/1680, ψ(p R) = 368/1050,adψ(P R)+ψ(R Q) < ψ(p Q) 3 A Natural Metric Divergece Measure If we pick up the commo part of D(P Q)ad J(P Q),wewillobtaia metric divergece measure Z (P Q) = l (p i) l ( ) = l (p i ) (18) Sice p i 1ad p i 1for all 1 i,bothd(p Q) ad J(P Q) are upper bouded by Z(P Q);thatis, D(P Q) Z(P Q), J(P Q) Z(P Q) 4 L p -Divergece Recall that for a real umber p 1, the l p -orm of vector x = (x 1,x,,x ) R is defied by x p =( x i p ) (19) We will apply the l p -metric from the l p -orm to divergece measures to obtai l p -divergece For coveiece, we will use the upper case otatio Defiitio 5 For two probability distributios P ad Q,oe defies their L p -divergece as L p (P Q) = p p i p (0) Here, p 1is used for superscript, subscript, ad radial root It should ot be cofused with the vector P I particular, whe p=1,wehavethel 1 distace: L 1 (P Q) = p i (1) It is kow that l p -orms are decreasig i p Moreover, all l p metrics are equivalet Lemma 6 If r>p 1, the the l p -orms i R satisfy x r x p ( 1/r) x r (4) Corollary 7 If r>p 1, the the L p -divergeces satisfy L r (P Q) L p (P Q) ( 1/r) L r (P Q) (5) Theorem 8 L p -divergeces are all bouded by costat for p 1;thatis,L p (P Q) I particular, L (P Q) 1 ad L (P Q) Proof We first prove the geeral case From Corollary 7, it is sufficiet to prove that the L 1 -divergece is bouded by Let P = (p 1,p,,p ) ad Q = (q 1,q,,q ) be two probability distributios Without loss of geerality, let us assume that p 1 q 1, p q,,p k q k,ad p k+1 < q K+1,,p <q The,wehave L 1 (P Q) = = p i k k (p i )+ p i + ( p i ) p i + =1+1= (6) Notig that p i 1for 1 i,wehave(p i ) p i 1Hece, L (P Q) = L (P Q) = max 1 i p i 1, (p i ) p i = L 1 (P Q) Therefore, we have proved the particular cases (7) The followig result shows that the relative etropy D(P Q) islowerboudedbythesquareofthe L 1 (P Q)Itsproof ca be foud at [15]

5 Joural of Mathematics 5 Lemma 9 D (P Q) 1 (L 1 (P Q)) (8) Theorem 10 The square root of J(P Q) is lower bouded by L 1 (P Q);thatis, J (P Q) L 1 (P Q) (9) Proof Applyig lemma to D(P Q) ad D(Q P), we obtai D (P Q) +D(Q P) 1 (L 1 (P Q)) + 1 (L 1 (Q P)) =(L 1 (P Q)) (30) Note that the left had side is othig but J(P Q) The proof is completed by takig the square root o both sides Remark 11 J(P Q) ad hece J(P Q)are ubouded by ay costats This ca be see by takig P = (s, 1 s) ad Q = (1 s, s), 0 < s < 1,adtakiglimits 0 Sice L p -divergeces are all bouded by costat for p 1, J(P Q) ad hece J(P Q) are ot equivalet to L p -divergeces We ow establish the covexity property for L p - divergece, which is useful i optimizatio Theorem 1 L p (P Q) is covex i the pair (p, q), thatis, if (P 1,Q 1 ),ad(p,q ) are two pairs of probability distributios, the ( +( =(λ λ(p 1i q 1i ) p ) (1 λ) (p i q i ) p ) (p 1i q 1i ) p ) + (1 λ) ( (p i q i ) p ) =λl p (P 1 Q 1 )+(1 λ) L p (P Q ) (3) Here, the first iequality is from the well-kow Mikowski s Iequality It follows from the followig results that we ca geerate ifiitely may metric divergece measures usig the existig oes Propositio 13 If d 1 (P Q) ad d (P Q) are two metric divergece measures, so are the followig 3 measures: (1) α d 1 (P Q) + β d (P Q) for all α 0, β 0ad α+β>0, () max(d 1 (P Q), d (P Q)), (3) (d 1 (P Q)) +(d (P Q)) Proof The proof of (1) ad () is trivial ad hece will be omitted As for (3), it is sufficiet to verify the triagle iequality sice oegative, idetity, ad symmetry are all easy to verify To begi with, let us first prove a iequality: for ay oegative a, b, c, d, L p (λp 1 + (1 λ) P λq 1 + (1 λ) Q ) λl p (P 1 Q 1 )+(1 λ) L p (P Q ) (31) a +b +c +d + ac + bd a +b + c +d (33) for all 0 λ 1 Proof Let P 1 = (p 11,p 1,,p 1 ), P = (p 1,p,,p ), Q 1 =(q 11,q 1,,q 1 ),adq =(q 1,q,,q )The L p (λp 1 + (1 λ) P λq 1 + (1 λ) Q ) =( =( λp 1i + (1 λ) p i λq 1i (1 λ) q i p ) λ(p 1i q 1i )+(1 λ) (p i q i ) p ) It is easy to see that iequality (33) isequivalettothe followig iequality: ac + bd (a +b )(c +d ) (34) Iequality (34) is equivalet to the followig iequality: abcd a d +b c (35) Iequality (35) is equivalet to the followig iequality: (ad bc) 0 (36) Sice iequality (36) is always true, iequality (34)istrue

6 6 Joural of Mathematics Now, let us assume P, Q, R are 3 arbitrary probability distributios Sice d 1 (P Q) ad d (P Q) satisfy the triagle iequality, we have (d 1 (P Q)) +(d (P Q)) ([d 1 (P R) +d 1 (R Q)] +[d (P R) +d (R Q)] ) 1/ =(d 1 (P R) +d 1 (P R) d 1 (R Q) +d 1 (R Q) +d (P R) +d (P R) d (R Q) +d (R Q))1/ =([d 1 (P R) +d (P R)] +[d 1 (R Q) +d (R Q)] +d 1 (P R) d 1 (R Q) +d (P R) d (R Q)) 1/ d 1 (P R) +d (P R) + d 1 (R Q) +d (R Q) The last iequality results from iequality (33) (37) Remark 14 da Costa ad Taeja [16] showthat W s (P Q) ad V s (P Q)are metrics divergece measures for all s Sice Δ(P Q), I(P Q), h(p Q), d (P Q), J(P Q), T(P Q), ad ψ(p Q) are all costat factors of special cases of W s (P Q) or V s (P Q),theyareallmetric divergece measures by Propositio 13 Yet,da Costa ad Taeja did ot disprove or discuss ay applicatios of these divergece measures 3 Iformatio Value i Credit Scorig Iformatio value, or IV i short, is a widely used measure i credit scorig i the fiacial idustry It is a umerical value to quatify the predictive power of a idepedet cotiuous variable x i capturig the biary depedet variable y Mathematically, it is defied as [17] IV = ( g i g b i b ) l (g i/g ), (38) b i /b where istheumberofbisorgroupsofvari-ablex, g i ad b i are the umbers of good ad bad accouts with bi i,adg ad b are the total umber of good accouts ad bad accouts i the populatio Hece, g i /g ad b i /b are distributios of good accouts ad bad accouts Therefore, g i g = b i =1 (39) b Usually, good meas y=0ad bad meas y=1itcould be the other way, sice IV is symmetric about good ad bad If g i /g = b i /b for all,,,theiv=0;thatis,x has o iformatio o y IV is maily used to reduce the umber of variables as the iitial step i the logistic regressio, especially i big data with may variables IV is based o a aalysis of each idividual predictor i tur without takig ito accout the other predictors 31 IV ad WOE Oe advatage of IV is its close tie with weight of evidece (WOE), defied by l((g i /g)/(b i /b)) WOE measures the stregth of each grouped attribute i separatig good ad bad accouts Accordig to [17], WOE is the log of odds ratio, which measures odds of beig good Moreover, WOE is mootoic ad liear Yet, WOE is ot a accurate measure i that it is ot the log of odds ratio ad hece its liearity is ot guarateed Ideed, g i /g ad b i /b are from two differet probability distributios They represet the umber of good accouts i bi i divided by the total umber of good accouts i the populatio ad the umber of bad accouts i bi i divided bythetotalumberofbadaccoutsithepopulatio, respectively I geeral, g i /g+b i /b =1 ascabeseefrom Exhibit 6 i [17] To make WOE a log of odds, let us chage its defiitio to WOE1 =l ( g i/ i )=l ( g i )= l ( b i ) (40) b i / i b i g i ad deote it by WOE1 The cacelled i = g i +b i is the umber of accouts i bi i,adsog i / i +b i / i =1 As is well kow, the logistic regressio models the log odds, expressed i coditioal probabilities, as a liear fuctio of the idepedet variable; that is, P (Y =1 x) l ( P (Y =0 x) )=β 0 +β 1 x (41) Whe x falls ito bi i, l(p(y = 1 x)/p(y = 0 x)) becomes l(b i /g i )= l(g i /b i ) Hece, the WOE1 values are either cotiuously icreasig or cotiuously decreasig i aliearfashio IV ad WOE1 ca be used together to select idepedet variables for logistic regressio Whe a cotiuous variable x has a large IV, we make it a cadidate variable for logistic regressio if WOE1 values are liear It is commo to plot the WOE1 values versus the mea values of x at bi i 3ARuleofThumbofIVItuitively, the larger the IV, the more predictive the idepedet variable However, if IV is too large, it should be checked for over predictig For istace, x maybeapostkowledgevariable To quatify IV, a rule of thumb is proposed i [17, 18]: (i) less tha 00: upredictive, (ii) 00 to 01: weak, (iii) 01 to 03: medium, (iv) 03+: strog

7 Joural of Mathematics 7 I additio, mathematical reasoig of the rule of thumb isgivei[18] I more detail, IV ca be expressed as the average of likelihood ratio test statistics G(P, Q) ad G(Q, P) of Chi-square distributios with ( 1) degrees of freedom: IV = p i l ( p i )+q l ( ) i p i =G(P, Q) +G(Q, P) (4) The close relatioship betwee IV ad the likelihood ratio test allows usig the Chi-square distributio to assig a sigificace level However, this is doubtful O the oe had, G(P, Q) ad G(Q, P) are ot ecessarily idepedet O the other had, eve if they are idepedet, it is ot eough Let us assume that IV follows a Chi-square distributio with ( 1) degrees of freedom Yet, the critical values of the Chi-square distributio are too large compared with the values i the rule of thumb, as ca be see from the Chi-square table i may books about Probability, say [19] We oly list the first several rows of Table 1 Whe Table 1 growsasdficreases,thevaluesieachcolumwill icrease Oe may use the Excel fuctio CHIINV(p, df) or its ewer ad more accurate versio CHISQINVRT(p, df) to build Table 1, which returs the iverse of the right-tailed probability 1 pof the Chi-square distributio with df degrees of freedom Thecriticalvaluesareassmallasthevaluesitheruleof thumb oly whe the degrees of freedom are as small as 6 For istace, there is a probability of = 0995 that a Chi-square distributio with 6 degrees of freedom will be larger tha or equal to 068, that is, CHIINV (0995, 6) = 068 (43) Yet, there is a probability of 0995 that a Chi-square radom variable with 10 degrees of freedom will be larger tha or equal to 16 There is a probability of 0995 that a Chi-square radom variable with 18 degrees of freedom will be larger tha or equal to 66 O the basis of the above, the rule of thumb is more or less empirical 33 Calculatio of IV The calculatio of IV is simple oce biig is doe I this sese, IV is a subjective measure It depeds o how the biig is doe ad how may bis are used Differet biig methods may result i differet IV values, whereas the logistic regressio i the later stages will ot use the iformatio of these bis I practice, 10 or 0 bis are used The more the bis, the better the chace the good accouts will be separated from the bad accouts Yet, we caot divide the values of x idefiitely sice we may ot avoid 0 good accout or 0 bad accout i some bis To overcome the limitatio of the logarithm fuctio i the J-divergece, the biig should avoid 0 good accout or 0 bad accout i ay bis The idea of biig is to assig values of x with similar behaviors to the same group or bi I particular, the same values of x must fall ito the same bi A atural way of biig is to sort the data first ad the divide them ito bis with a equal umber of observatios (the last bi may have less umber of observatios) This works well if x has o repeatig values at all I reality, x ofte has repeatig values (called the tied values i statistics), which may cause problems whe the tied values of x fall ito differet bis Proc Rak i SAS serves, a good cadidate for biig (as opposed to fuctio cut i R) Whe there are o tied values i x, it simply divides the values of x ito bis Whe there are tied values i x, it treats the tied values by its optio TIES Proc Rak begis with sortig the values of x withi a BY group It the assigs each omissig value a ordial umber that idicates its rak or positio i the sequece I case of ties, optio TIES will be used Depedig o whether TIES = LOW, HIGH, or MEAN (default oe), the lowest rak, highest rak, or the average rak will be assiged to all the tied values Next, the followig formula is used to calculate thebiigvalueofeachomissigvalueofx: rak, (44) m+1 where is the floor fuctio, rak the value s rak, the umber of bis, ad m the umber of omissig observatios Note that the rage of the biig values is from 0 to 1 Fially, all the values of x arebiedaccordigto theirbiigvaluesicaseoebihaslesstha5%ofthe populatio, we may combie this bi with its eighborig bi To illustrate the use of Proc Rak with = 10 ad TIES=MEAN,letuslookataimagiarydatasetwithoe variable age ad 100 observatios Assume this dataset has bee sorted ad has fifty observatios with a value of 10, thirty observatios with a value of 0, te observatios with avalueof30,ieobservatioswithavalueof40,adoe observatiowithavalueof50 The first 50 observatios have a tied value of 10 Each of them will be assiged a average rak of k = 55 ad hece a biig value of (55 10)/101 = The ext 30 observatios have a tied value of 0 Each of them will be assiged a average rak of ( )/30 = 655 ad hece a biig value of (655 10)/101 = 6 The ext 10 observatios have a tied value of 30 Each of them will be assiged a average rak of ( )/10 = 855 ad hece a biig value of (855 10)/101 = 8 The ext 9 observatios have a tied value of 40 Each of them will be assiged a average rak of ( )/9 = 95 ad hece a biig value of (95 10)/101 = 9 Thelast observatio has a rak of 100 ad hece will be assiged a biig value of (100 10)/101 = 9 Isummary,the100 observatios are divided ito 4 bis: the first 50 observatios, the ext 30 observatios, the ext 10 observatios, ad the last 10 observatios Remark 15 Missig values are ot raked ad are left missig i Proc Rak Yet, they may be kept i a separate bi by meas of Proc Summary or Proc Meas i the calculatio of IV Remark 16 If x has less tha k differet values, the umber of bis by Proc Rak will be less that k

8 8 Joural of Mathematics Table 1: Chi-square table DF P = After biig is doe for x, asimplesasprogramca be writte to calculate IV Meawhile, WOE1 are calculated per bi as for WOE i [17] If IV is less tha 00, we will throw this idepedet variable If IV is large tha 03, over predictig will be checked If IV is betwee 00 ad 03 ad WOE1 are liear, we will iclude this idepedet variable as a cadidate variable i logistic regressio If IV is betwee 00 ad 03 but WOE1 are ot liear, we may make trasformatios of the idepedet variable to make WOE1 more liear If a trasformatio ca preserve the rak of the origial idepedet variable, the biig by Proc Rak will be preserved Therefore, we have obtaied the followig result Propositio 17 IV, whe biig by Proc Rak, is ivariat uder ay strictly mootoic trasformatios 34 Mathematical Properties of IV IV is the iformatio statistic for the differece betwee the iformatio i the good accouts ad the iformatio i the bad accouts Ideed, IV is the J-divergece with distributios of good accouts ad bad accouts Thus, IV is lower bouded by the square of L 1 (P Q) by Theorem 10 Property 1 IV (L 1 (P Q)) Property IV satisfiesthe iequalities (15); that is, 1 Δ (P Q) I(P Q) h(p Q) 4 4d(P Q) IV T(P Q) 8 1 ψ (P Q) 16 I particular, IV is upper bouded by ψ(p Q): IV 1 (45) (p i ) (p i + ) p i (46) Note that there is a direct proof of the above iequality, which is much easier tha that i [14] Let us assume without loss of geerality that p 1 q 1, p q,,p k q k,adp k+1 < q K+1,,p <q Proof Usig the idetity p i / = 1 + ((p i )/q 1 ) ad makig Taylor s expasio of fuctio l(p i / ) aroud 1 for,,,k,weobtai l ( p i )= l (1) + 1 p i 1 (1+ξ) (p i ) 1 (47) p i Multiplyig p i 0 ad summig up from 1 to k, we obtai k (p i ) l ( p i ) 1 k (p i ) 1 (p i ) (48) Similarly, l( /p i ) (1/)(( p i )/p i ) Multiplyig p i 0 ad summig up from k to,weobtai ( p i ) l ( ) 1 p i 1 The proof is completed by otig that (p i ) l ( p i ) = = k k (p i ) l ( p i )+ (p i ) l ( p i )+ ( p i ) p i (p i ) p i (p i ) l ( p i ) ( p i ) l ( p i ) (49) (50)

9 Joural of Mathematics 9 Table : Outlier domiatio to IV Bi Cout Goods Distr good Bads Distr bad IV cotr % % % % Total % % Theorem 18 IV is covex i the pair (p, q) Proof Applyig Theorem 7 from [15] to both D(P Q) ad D(Q P),weobtai J(λP 1 + (1 λ) P λq 1 + (1 λ) Q ) Weight of evidece Age Figure 1: Logical Tred of WOE1 for Variable Age =D(λP 1 + (1 λ) P λq 1 + (1 λ) Q ) +D(λQ 1 + (1 λ) Q λp 1 + (1 λ) P ) λd(p 1 Q 1 )+(1 λ) D(P Q ) +λd(q 1 P 1 )+(1 λ) D(Q P ) =λ(d(p 1 Q 1 )+D(Q 1 P 1 )) + (1 λ)(d (P Q )+D(Q P )) =λj(p 1 Q 1 )+(1 λ) J(P Q ) (51) cabealterativestoiviparticular, L p (P Q)(p 1)are good alteratives to IV They overcome all the 3 shortcomigs of IV: (1) L p (P Q)are all metrics; () they allow bis to have 0 bad accouts or 0 good accouts; ad (3) They all have a much arrow rage, from 0 to While L P (P Q)dootseemtohaveatiewithweight of evidece, they ca be as quatifiable as IV For istace, we may adopt a rule of thumb for L P (P Q): (i) weak: 6% of its upper boud, (ii) medium: 6% to 30% of its upper boud, (iii) strog: Larger tha 30% of its upper boud Remark 19 J(P Q)is ot covex albeit a metric Property 3 If more tha 95% of populatio of x have the same value, the IV = 0 I particular, if x has just oe value, the IV = 0 Proof Assume more tha 95% of populatio of x have the same value x 0 The,allthepopulatiowithvaluex 0 will fall ito the same bi, called the majority bi The rest of populatio whose values are differet from x 0 will be combied ito the majority bi Thus, there will be oly oe bi for all the values of x Therefore g 1 =g, b 1 =b, ad hece IV = 0 Remark 0 Ifthepopulatiowhosevaluesareot x 0 are ot combied ito the majority bi, the IV could be larger tha 00 As show i Table, x has observatios, where 958% or 9580 observatios have the same value, say, ad the rest of 40 observatios have aother value, say 4 Both bis cotribute a value larger tha 00 to IV Statistically, 4% of the populatio are outliers ad ca be eglected Hece, it is more meaigful to say x has o iformatio to y 35 Alteratives to IV As we have see above, IV has 3 shortcomigs: (1) it is ot a metric; () o groups are allowed to have 0 bad accouts or 0 good accouts; ad (3) its rage is too broad, from 0 to Theoretically, ay divergece measures of the differece or distace betwee good distributios ad bad distributios I particular, (i) weak: 01 for L 1 (P Q), 0085forL (P Q), ad 006 for L (P Q), (ii) medium: 01 to 060 for L 1 (P Q),0085to044for L (P Q), ad 006 to 030 for L (P Q), (iii) strog: 060+ for L 1 (P Q), 044+forL (P Q), 03+ for L (P Q) Remark 1 If L 1 (P Q) > 01,theIV> The lower boud 01 of L 1 (P Q) cabeadjusted as eeded It ca also be combied with IV to ehace the accuracy For istace, for the umber of idepedet variables is large eough, we may select oly those which satisfy both lower bouds of L 1 (P Q)ad IV 4 Numerical Results To illustrate our results, we use Exhibit 6 i [17]butaddoe colum for WOE1 We use the real WOE, ot its more userfriedly form 100 times WOE From Table 3,weseethatL <L <L 1 < IV From Figure 1, we also see that the WOE1 for omissig values has alieartredforvariableage 5 Coclusios I this paper, we have proposed a class of metric divergece measures, amely, L p (P Q), p 1, ad studied their mathematical properties We studied iformatio value, a

10 10 Joural of Mathematics Table 3: Calculatio of IV ad WOE Age Cout Tot distr Goods Distr good Bads Distr bad WOE WOE1 Missig % % % % % 960 5% % % % % % % % % % % 00 51% % % % Total % % % IV = 06681,ad IV = L 1 =06684,L =0987,adL =01659 importat divergece measure widely used i credit scorig After explorig the mathematical reasoig of a rule of thumb ad weight of evidece, we suggested a alterative to weight of evidece Fially, we proposed usig L p (P Q) as alteratives to iformatio value to overcome its disadvatages Refereces [1] I J Taeja, Geeralized relative iformatio ad iformatio iequalities, Joural of Iequalities i Pure ad Applied Mathematics,vol5,o1,pp1 19,004 [] E Helliger, Neue begrüdug der theorie der quadratische forme vo uedliche viele veräderliche, Joural Für Die Reie ud Agewadte Mathematik, vol136,pp10 71, 1909 [3] C E Shao, A mathematical theory of commuicatio, The Bell System Techical Joural,vol7,pp379 43,1948 [4] S Kullback ad R A Leibler, O iformatio ad sufficiecy, Aals of Mathematical Statistics,vol,pp79 86,1951 [5] H Jeffreys, A ivariat form for the prior probability i estimatio problems, Proceedigs of the Royal Society,vol186, pp ,1946 [6] F Topsøe, Some iequalities for iformatio divergece ad related measures of discrimiatio, Istitute of Electrical ad Electroics Egieers, vol 46, o 4, pp , 000 [7] S S Dragomir, J Šude, ad C Buşe, New iequalities for Jeffreys divergece measure, Tamsui Oxford Joural of Mathematical Scieces,vol16,o,pp95 309,000 [8] K Pearso, O the criterio that a give system of deviatios from the probable i the case of correlated system of variables is such that it ca be reasoable supposed to have arise from radom samplig, Philosophical Magazie, vol 50, pp , 199 [9] R Sibso, Iformatio radius, Zeitschrift für Wahrscheilichkeitstheorie ud Verwadte Gebiete, vol14,o,pp , 1969 [10] J Burbea ad C R Rao, Etropy differetial metric, distace ad divergece measures i probability spaces: a uified approach, Joural of Multivariate Aalysis, vol 1, o 4, pp , 198 [11] J Burbea ad C R Rao, O the covexity of some divergece measures based o etropy fuctios, Istitute of Electrical ad Electroics Egieers,vol8,o3,pp ,198 [1] I J Taeja, New developmets i geeralized iformatio measures, i Advaces i Imagig ad Electro Physics, P W Hawkes,Ed,vol91,pp37 136,1995 [13] N Cressie ad T R C Read, Multiomial goodess-of-fit tests, JouraloftheRoyalStatisticalSocietyB, vol 46, o 3, pp , 1984 [14] I J Taeja, Geeralized Symmetric Divergece Measures ad Iequalities, RGMIA Research Report Collectio, vol 7, o 4, 004 [15] TMCoveradJAThomas,Elemets of IformatioTheory, Joh Wiley & Sos, New York, NY, USA, 1991 [16] G A T F da Costa ad I J Taeja, Geeralized Symmetric Divergece Measures ad Metric Spaces, ComputigResearch Repository, 011 [17] N Siddiqi, Credit Risk Scorecards Developig ad Implemetig Itelliget Credit Scorig,JohWiley&Sos,006 [18] N Siddiqi, Credit Risk Scorecards Developmet ad Implemetatio Usig SAS, LULU, 011 [19] D Dowig ad J Clark, Barro s E-Z Statistics, Barro s Educatioal Series, 009

11 Advaces i Operatios Research Volume 014 Advaces i Decisio Scieces Volume 014 Joural of Applied Mathematics Algebra Volume 014 Joural of Probability ad Statistics Volume 014 The Scietific World Joural Volume 014 Iteratioal Joural of Differetial Equatios Volume 014 Volume 014 Submit your mauscripts at Iteratioal Joural of Advaces i Combiatorics Mathematical Physics Volume 014 Joural of Complex Aalysis Volume 014 Iteratioal Joural of Mathematics ad Mathematical Scieces Mathematical Problems i Egieerig Joural of Mathematics Volume Volume 014 Volume Volume 014 Discrete Mathematics Joural of Volume Discrete Dyamics i Nature ad Society Joural of Fuctio Spaces Abstract ad Applied Aalysis Volume Volume Volume 014 Iteratioal Joural of Joural of Stochastic Aalysis Optimizatio Volume 014 Volume 014

Research Article A Unified Weight Formula for Calculating the Sample Variance from Weighted Successive Differences

Research Article A Unified Weight Formula for Calculating the Sample Variance from Weighted Successive Differences Discrete Dyamics i Nature ad Society Article ID 210761 4 pages http://dxdoiorg/101155/2014/210761 Research Article A Uified Weight Formula for Calculatig the Sample Variace from Weighted Successive Differeces

More information

Research Article Approximate Riesz Algebra-Valued Derivations

Research Article Approximate Riesz Algebra-Valued Derivations Abstract ad Applied Aalysis Volume 2012, Article ID 240258, 5 pages doi:10.1155/2012/240258 Research Article Approximate Riesz Algebra-Valued Derivatios Faruk Polat Departmet of Mathematics, Faculty of

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3 MATH 337 Sequeces Dr. Neal, WKU Let X be a metric space with distace fuctio d. We shall defie the geeral cocept of sequece ad limit i a metric space, the apply the results i particular to some special

More information

Research Article Bounds on Nonsymmetric Divergence Measure in terms of Other Symmetric and Nonsymmetric Divergence Measures

Research Article Bounds on Nonsymmetric Divergence Measure in terms of Other Symmetric and Nonsymmetric Divergence Measures Iteratioal Scholarly Research Notices Volume 04, Article ID 8037, 9 pages http://dx.doi.org/0./04/8037 Research Article Bouds o Nosymmetric Divergece Measure i terms of Other Symmetric ad Nosymmetric Divergece

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Convergence of random variables. (telegram style notes) P.J.C. Spreij Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space

More information

Research Article Some E-J Generalized Hausdorff Matrices Not of Type M

Research Article Some E-J Generalized Hausdorff Matrices Not of Type M Abstract ad Applied Aalysis Volume 2011, Article ID 527360, 5 pages doi:10.1155/2011/527360 Research Article Some E-J Geeralized Hausdorff Matrices Not of Type M T. Selmaogullari, 1 E. Savaş, 2 ad B. E.

More information

Lecture 3 The Lebesgue Integral

Lecture 3 The Lebesgue Integral Lecture 3: The Lebesgue Itegral 1 of 14 Course: Theory of Probability I Term: Fall 2013 Istructor: Gorda Zitkovic Lecture 3 The Lebesgue Itegral The costructio of the itegral Uless expressly specified

More information

Chapter 6 Infinite Series

Chapter 6 Infinite Series Chapter 6 Ifiite Series I the previous chapter we cosidered itegrals which were improper i the sese that the iterval of itegratio was ubouded. I this chapter we are goig to discuss a topic which is somewhat

More information

Research Article Nonexistence of Homoclinic Solutions for a Class of Discrete Hamiltonian Systems

Research Article Nonexistence of Homoclinic Solutions for a Class of Discrete Hamiltonian Systems Abstract ad Applied Aalysis Volume 203, Article ID 39868, 6 pages http://dx.doi.org/0.55/203/39868 Research Article Noexistece of Homocliic Solutios for a Class of Discrete Hamiltoia Systems Xiaopig Wag

More information

Research Article A Note on Ergodicity of Systems with the Asymptotic Average Shadowing Property

Research Article A Note on Ergodicity of Systems with the Asymptotic Average Shadowing Property Discrete Dyamics i Nature ad Society Volume 2011, Article ID 360583, 6 pages doi:10.1155/2011/360583 Research Article A Note o Ergodicity of Systems with the Asymptotic Average Shadowig Property Risog

More information

Research Article Moment Inequality for ϕ-mixing Sequences and Its Applications

Research Article Moment Inequality for ϕ-mixing Sequences and Its Applications Hidawi Publishig Corporatio Joural of Iequalities ad Applicatios Volume 2009, Article ID 379743, 2 pages doi:0.55/2009/379743 Research Article Momet Iequality for ϕ-mixig Sequeces ad Its Applicatios Wag

More information

Entropy Rates and Asymptotic Equipartition

Entropy Rates and Asymptotic Equipartition Chapter 29 Etropy Rates ad Asymptotic Equipartitio Sectio 29. itroduces the etropy rate the asymptotic etropy per time-step of a stochastic process ad shows that it is well-defied; ad similarly for iformatio,

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Sequences. Notation. Convergence of a Sequence

Sequences. Notation. Convergence of a Sequence Sequeces A sequece is essetially just a list. Defiitio (Sequece of Real Numbers). A sequece of real umbers is a fuctio Z (, ) R for some real umber. Do t let the descriptio of the domai cofuse you; it

More information

Lecture 7: October 18, 2017

Lecture 7: October 18, 2017 Iformatio ad Codig Theory Autum 207 Lecturer: Madhur Tulsiai Lecture 7: October 8, 207 Biary hypothesis testig I this lecture, we apply the tools developed i the past few lectures to uderstad the problem

More information

Comparison Study of Series Approximation. and Convergence between Chebyshev. and Legendre Series

Comparison Study of Series Approximation. and Convergence between Chebyshev. and Legendre Series Applied Mathematical Scieces, Vol. 7, 03, o. 6, 3-337 HIKARI Ltd, www.m-hikari.com http://d.doi.org/0.988/ams.03.3430 Compariso Study of Series Approimatio ad Covergece betwee Chebyshev ad Legedre Series

More information

5. Likelihood Ratio Tests

5. Likelihood Ratio Tests 1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,

More information

MAT1026 Calculus II Basic Convergence Tests for Series

MAT1026 Calculus II Basic Convergence Tests for Series MAT026 Calculus II Basic Covergece Tests for Series Egi MERMUT 202.03.08 Dokuz Eylül Uiversity Faculty of Sciece Departmet of Mathematics İzmir/TURKEY Cotets Mootoe Covergece Theorem 2 2 Series of Real

More information

Fall 2013 MTH431/531 Real analysis Section Notes

Fall 2013 MTH431/531 Real analysis Section Notes Fall 013 MTH431/531 Real aalysis Sectio 8.1-8. Notes Yi Su 013.11.1 1. Defiitio of uiform covergece. We look at a sequece of fuctios f (x) ad study the coverget property. Notice we have two parameters

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

7 Sequences of real numbers

7 Sequences of real numbers 40 7 Sequeces of real umbers 7. Defiitios ad examples Defiitio 7... A sequece of real umbers is a real fuctio whose domai is the set N of atural umbers. Let s : N R be a sequece. The the values of s are

More information

Sequences and Series of Functions

Sequences and Series of Functions Chapter 6 Sequeces ad Series of Fuctios 6.1. Covergece of a Sequece of Fuctios Poitwise Covergece. Defiitio 6.1. Let, for each N, fuctio f : A R be defied. If, for each x A, the sequece (f (x)) coverges

More information

ECE 901 Lecture 14: Maximum Likelihood Estimation and Complexity Regularization

ECE 901 Lecture 14: Maximum Likelihood Estimation and Complexity Regularization ECE 90 Lecture 4: Maximum Likelihood Estimatio ad Complexity Regularizatio R Nowak 5/7/009 Review : Maximum Likelihood Estimatio We have iid observatios draw from a ukow distributio Y i iid p θ, i,, where

More information

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i

More information

Math 113 Exam 3 Practice

Math 113 Exam 3 Practice Math Exam Practice Exam will cover.-.9. This sheet has three sectios. The first sectio will remid you about techiques ad formulas that you should kow. The secod gives a umber of practice questios for you

More information

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014. Product measures, Toelli s ad Fubii s theorems For use i MAT3400/4400, autum 2014 Nadia S. Larse Versio of 13 October 2014. 1. Costructio of the product measure The purpose of these otes is to preset the

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

Distribution of Random Samples & Limit theorems

Distribution of Random Samples & Limit theorems STAT/MATH 395 A - PROBABILITY II UW Witer Quarter 2017 Néhémy Lim Distributio of Radom Samples & Limit theorems 1 Distributio of i.i.d. Samples Motivatig example. Assume that the goal of a study is to

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

sin(n) + 2 cos(2n) n 3/2 3 sin(n) 2cos(2n) n 3/2 a n =

sin(n) + 2 cos(2n) n 3/2 3 sin(n) 2cos(2n) n 3/2 a n = 60. Ratio ad root tests 60.1. Absolutely coverget series. Defiitio 13. (Absolute covergece) A series a is called absolutely coverget if the series of absolute values a is coverget. The absolute covergece

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

Introduction to Extreme Value Theory Laurens de Haan, ISM Japan, Erasmus University Rotterdam, NL University of Lisbon, PT

Introduction to Extreme Value Theory Laurens de Haan, ISM Japan, Erasmus University Rotterdam, NL University of Lisbon, PT Itroductio to Extreme Value Theory Laures de Haa, ISM Japa, 202 Itroductio to Extreme Value Theory Laures de Haa Erasmus Uiversity Rotterdam, NL Uiversity of Lisbo, PT Itroductio to Extreme Value Theory

More information

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function. MATH 532 Measurable Fuctios Dr. Neal, WKU Throughout, let ( X, F, µ) be a measure space ad let (!, F, P ) deote the special case of a probability space. We shall ow begi to study real-valued fuctios defied

More information

MAS111 Convergence and Continuity

MAS111 Convergence and Continuity MAS Covergece ad Cotiuity Key Objectives At the ed of the course, studets should kow the followig topics ad be able to apply the basic priciples ad theorems therei to solvig various problems cocerig covergece

More information

Commutativity in Permutation Groups

Commutativity in Permutation Groups Commutativity i Permutatio Groups Richard Wito, PhD Abstract I the group Sym(S) of permutatios o a oempty set S, fixed poits ad trasiet poits are defied Prelimiary results o fixed ad trasiet poits are

More information

Rates of Convergence by Moduli of Continuity

Rates of Convergence by Moduli of Continuity Rates of Covergece by Moduli of Cotiuity Joh Duchi: Notes for Statistics 300b March, 017 1 Itroductio I this ote, we give a presetatio showig the importace, ad relatioship betwee, the modulis of cotiuity

More information

ENGI Series Page 6-01

ENGI Series Page 6-01 ENGI 3425 6 Series Page 6-01 6. Series Cotets: 6.01 Sequeces; geeral term, limits, covergece 6.02 Series; summatio otatio, covergece, divergece test 6.03 Stadard Series; telescopig series, geometric series,

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 3 9/11/2013. Large deviations Theory. Cramér s Theorem

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 3 9/11/2013. Large deviations Theory. Cramér s Theorem MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/5.070J Fall 203 Lecture 3 9//203 Large deviatios Theory. Cramér s Theorem Cotet.. Cramér s Theorem. 2. Rate fuctio ad properties. 3. Chage of measure techique.

More information

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

A Relationship Between the One-Way MANOVA Test Statistic and the Hotelling Lawley Trace Test Statistic

A Relationship Between the One-Way MANOVA Test Statistic and the Hotelling Lawley Trace Test Statistic http://ijspccseetorg Iteratioal Joural of Statistics ad Probability Vol 7, No 6; 2018 A Relatioship Betwee the Oe-Way MANOVA Test Statistic ad the Hotellig Lawley Trace Test Statistic Hasthika S Rupasighe

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Ma 530 Infinite Series I

Ma 530 Infinite Series I Ma 50 Ifiite Series I Please ote that i additio to the material below this lecture icorporated material from the Visual Calculus web site. The material o sequeces is at Visual Sequeces. (To use this li

More information

Math 113 Exam 3 Practice

Math 113 Exam 3 Practice Math Exam Practice Exam 4 will cover.-., 0. ad 0.. Note that eve though. was tested i exam, questios from that sectios may also be o this exam. For practice problems o., refer to the last review. This

More information

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution Iteratioal Mathematical Forum, Vol., 3, o. 3, 3-53 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/.9/imf.3.335 Double Stage Shrikage Estimator of Two Parameters Geeralized Expoetial Distributio Alaa M.

More information

ECE 901 Lecture 13: Maximum Likelihood Estimation

ECE 901 Lecture 13: Maximum Likelihood Estimation ECE 90 Lecture 3: Maximum Likelihood Estimatio R. Nowak 5/7/009 The focus of this lecture is to cosider aother approach to learig based o maximum likelihood estimatio. Ulike earlier approaches cosidered

More information

Lecture 13: Maximum Likelihood Estimation

Lecture 13: Maximum Likelihood Estimation ECE90 Sprig 007 Statistical Learig Theory Istructor: R. Nowak Lecture 3: Maximum Likelihood Estimatio Summary of Lecture I the last lecture we derived a risk (MSE) boud for regressio problems; i.e., select

More information

Chapter 5. Inequalities. 5.1 The Markov and Chebyshev inequalities

Chapter 5. Inequalities. 5.1 The Markov and Chebyshev inequalities Chapter 5 Iequalities 5.1 The Markov ad Chebyshev iequalities As you have probably see o today s frot page: every perso i the upper teth percetile ears at least 1 times more tha the average salary. I other

More information

f n (x) f m (x) < ɛ/3 for all x A. By continuity of f n and f m we can find δ > 0 such that d(x, x 0 ) < δ implies that

f n (x) f m (x) < ɛ/3 for all x A. By continuity of f n and f m we can find δ > 0 such that d(x, x 0 ) < δ implies that Lecture 15 We have see that a sequece of cotiuous fuctios which is uiformly coverget produces a limit fuctio which is also cotiuous. We shall stregthe this result ow. Theorem 1 Let f : X R or (C) be a

More information

CHAPTER 10 INFINITE SEQUENCES AND SERIES

CHAPTER 10 INFINITE SEQUENCES AND SERIES CHAPTER 10 INFINITE SEQUENCES AND SERIES 10.1 Sequeces 10.2 Ifiite Series 10.3 The Itegral Tests 10.4 Compariso Tests 10.5 The Ratio ad Root Tests 10.6 Alteratig Series: Absolute ad Coditioal Covergece

More information

Research Article A New Second-Order Iteration Method for Solving Nonlinear Equations

Research Article A New Second-Order Iteration Method for Solving Nonlinear Equations Abstract ad Applied Aalysis Volume 2013, Article ID 487062, 4 pages http://dx.doi.org/10.1155/2013/487062 Research Article A New Secod-Order Iteratio Method for Solvig Noliear Equatios Shi Mi Kag, 1 Arif

More information

Math 113 Exam 4 Practice

Math 113 Exam 4 Practice Math Exam 4 Practice Exam 4 will cover.-.. This sheet has three sectios. The first sectio will remid you about techiques ad formulas that you should kow. The secod gives a umber of practice questios for

More information

Lecture Notes for Analysis Class

Lecture Notes for Analysis Class Lecture Notes for Aalysis Class Topological Spaces A topology for a set X is a collectio T of subsets of X such that: (a) X ad the empty set are i T (b) Uios of elemets of T are i T (c) Fiite itersectios

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

CS322: Network Analysis. Problem Set 2 - Fall 2009

CS322: Network Analysis. Problem Set 2 - Fall 2009 Due October 9 009 i class CS3: Network Aalysis Problem Set - Fall 009 If you have ay questios regardig the problems set, sed a email to the course assistats: simlac@staford.edu ad peleato@staford.edu.

More information

Bayesian Methods: Introduction to Multi-parameter Models

Bayesian Methods: Introduction to Multi-parameter Models Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested

More information

Self-normalized deviation inequalities with application to t-statistic

Self-normalized deviation inequalities with application to t-statistic Self-ormalized deviatio iequalities with applicatio to t-statistic Xiequa Fa Ceter for Applied Mathematics, Tiaji Uiversity, 30007 Tiaji, Chia Abstract Let ξ i i 1 be a sequece of idepedet ad symmetric

More information

Optimally Sparse SVMs

Optimally Sparse SVMs A. Proof of Lemma 3. We here prove a lower boud o the umber of support vectors to achieve geeralizatio bouds of the form which we cosider. Importatly, this result holds ot oly for liear classifiers, but

More information

Review Article Incomplete Bivariate Fibonacci and Lucas p-polynomials

Review Article Incomplete Bivariate Fibonacci and Lucas p-polynomials Discrete Dyamics i Nature ad Society Volume 2012, Article ID 840345, 11 pages doi:10.1155/2012/840345 Review Article Icomplete Bivariate Fiboacci ad Lucas p-polyomials Dursu Tasci, 1 Mirac Ceti Firegiz,

More information

An Introduction to Randomized Algorithms

An Introduction to Randomized Algorithms A Itroductio to Radomized Algorithms The focus of this lecture is to study a radomized algorithm for quick sort, aalyze it usig probabilistic recurrece relatios, ad also provide more geeral tools for aalysis

More information

INFINITE SEQUENCES AND SERIES

INFINITE SEQUENCES AND SERIES 11 INFINITE SEQUENCES AND SERIES INFINITE SEQUENCES AND SERIES 11.4 The Compariso Tests I this sectio, we will lear: How to fid the value of a series by comparig it with a kow series. COMPARISON TESTS

More information

1 Convergence in Probability and the Weak Law of Large Numbers

1 Convergence in Probability and the Weak Law of Large Numbers 36-752 Advaced Probability Overview Sprig 2018 8. Covergece Cocepts: i Probability, i L p ad Almost Surely Istructor: Alessadro Rialdo Associated readig: Sec 2.4, 2.5, ad 4.11 of Ash ad Doléas-Dade; Sec

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak

More information

A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence

A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece,, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet as

More information

MA131 - Analysis 1. Workbook 9 Series III

MA131 - Analysis 1. Workbook 9 Series III MA3 - Aalysis Workbook 9 Series III Autum 004 Cotets 4.4 Series with Positive ad Negative Terms.............. 4.5 Alteratig Series.......................... 4.6 Geeral Series.............................

More information

Series III. Chapter Alternating Series

Series III. Chapter Alternating Series Chapter 9 Series III With the exceptio of the Null Sequece Test, all the tests for series covergece ad divergece that we have cosidered so far have dealt oly with series of oegative terms. Series with

More information

Lecture 8: Convergence of transformations and law of large numbers

Lecture 8: Convergence of transformations and law of large numbers Lecture 8: Covergece of trasformatios ad law of large umbers Trasformatio ad covergece Trasformatio is a importat tool i statistics. If X coverges to X i some sese, we ofte eed to check whether g(x ) coverges

More information

Bounds for the Extreme Eigenvalues Using the Trace and Determinant

Bounds for the Extreme Eigenvalues Using the Trace and Determinant ISSN 746-7659, Eglad, UK Joural of Iformatio ad Computig Sciece Vol 4, No, 9, pp 49-55 Bouds for the Etreme Eigevalues Usig the Trace ad Determiat Qi Zhog, +, Tig-Zhu Huag School of pplied Mathematics,

More information

Complex Analysis Spring 2001 Homework I Solution

Complex Analysis Spring 2001 Homework I Solution Complex Aalysis Sprig 2001 Homework I Solutio 1. Coway, Chapter 1, sectio 3, problem 3. Describe the set of poits satisfyig the equatio z a z + a = 2c, where c > 0 ad a R. To begi, we see from the triagle

More information

Recurrence Relations

Recurrence Relations Recurrece Relatios Aalysis of recursive algorithms, such as: it factorial (it ) { if (==0) retur ; else retur ( * factorial(-)); } Let t be the umber of multiplicatios eeded to calculate factorial(). The

More information

Stochastic Matrices in a Finite Field

Stochastic Matrices in a Finite Field Stochastic Matrices i a Fiite Field Abstract: I this project we will explore the properties of stochastic matrices i both the real ad the fiite fields. We first explore what properties 2 2 stochastic matrices

More information

Once we have a sequence of numbers, the next thing to do is to sum them up. Given a sequence (a n ) n=1

Once we have a sequence of numbers, the next thing to do is to sum them up. Given a sequence (a n ) n=1 . Ifiite Series Oce we have a sequece of umbers, the ext thig to do is to sum them up. Give a sequece a be a sequece: ca we give a sesible meaig to the followig expressio? a = a a a a While summig ifiitely

More information

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015 ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],

More information

Chapter 10: Power Series

Chapter 10: Power Series Chapter : Power Series 57 Chapter Overview: Power Series The reaso series are part of a Calculus course is that there are fuctios which caot be itegrated. All power series, though, ca be itegrated because

More information

Advanced Analysis. Min Yan Department of Mathematics Hong Kong University of Science and Technology

Advanced Analysis. Min Yan Department of Mathematics Hong Kong University of Science and Technology Advaced Aalysis Mi Ya Departmet of Mathematics Hog Kog Uiversity of Sciece ad Techology September 3, 009 Cotets Limit ad Cotiuity 7 Limit of Sequece 8 Defiitio 8 Property 3 3 Ifiity ad Ifiitesimal 8 4

More information

If a subset E of R contains no open interval, is it of zero measure? For instance, is the set of irrationals in [0, 1] is of measure zero?

If a subset E of R contains no open interval, is it of zero measure? For instance, is the set of irrationals in [0, 1] is of measure zero? 2 Lebesgue Measure I Chapter 1 we defied the cocept of a set of measure zero, ad we have observed that every coutable set is of measure zero. Here are some atural questios: If a subset E of R cotais a

More information

Sequences A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence

Sequences A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece 1, 1, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet

More information

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + 62. Power series Defiitio 16. (Power series) Give a sequece {c }, the series c x = c 0 + c 1 x + c 2 x 2 + c 3 x 3 + is called a power series i the variable x. The umbers c are called the coefficiets of

More information

Discrete-Time Systems, LTI Systems, and Discrete-Time Convolution

Discrete-Time Systems, LTI Systems, and Discrete-Time Convolution EEL5: Discrete-Time Sigals ad Systems. Itroductio I this set of otes, we begi our mathematical treatmet of discrete-time s. As show i Figure, a discrete-time operates or trasforms some iput sequece x [

More information

10.6 ALTERNATING SERIES

10.6 ALTERNATING SERIES 0.6 Alteratig Series Cotemporary Calculus 0.6 ALTERNATING SERIES I the last two sectios we cosidered tests for the covergece of series whose terms were all positive. I this sectio we examie series whose

More information

Random Models. Tusheng Zhang. February 14, 2013

Random Models. Tusheng Zhang. February 14, 2013 Radom Models Tusheg Zhag February 14, 013 1 Radom Walks Let me describe the model. Radom walks are used to describe the motio of a movig particle (object). Suppose that a particle (object) moves alog the

More information

Axioms of Measure Theory

Axioms of Measure Theory MATH 532 Axioms of Measure Theory Dr. Neal, WKU I. The Space Throughout the course, we shall let X deote a geeric o-empty set. I geeral, we shall ot assume that ay algebraic structure exists o X so that

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

arxiv: v1 [math.fa] 3 Apr 2016

arxiv: v1 [math.fa] 3 Apr 2016 Aticommutator Norm Formula for Proectio Operators arxiv:164.699v1 math.fa] 3 Apr 16 Sam Walters Uiversity of Norther British Columbia ABSTRACT. We prove that for ay two proectio operators f, g o Hilbert

More information

Probability and statistics: basic terms

Probability and statistics: basic terms Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample

More information

Review Article Complete Convergence for Negatively Dependent Sequences of Random Variables

Review Article Complete Convergence for Negatively Dependent Sequences of Random Variables Hidawi Publishig Corporatio Joural of Iequalities ad Applicatios Volume 010, Article ID 50793, 10 pages doi:10.1155/010/50793 Review Article Complete Covergece for Negatively Depedet Sequeces of Radom

More information

MA131 - Analysis 1. Workbook 3 Sequences II

MA131 - Analysis 1. Workbook 3 Sequences II MA3 - Aalysis Workbook 3 Sequeces II Autum 2004 Cotets 2.8 Coverget Sequeces........................ 2.9 Algebra of Limits......................... 2 2.0 Further Useful Results........................

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

The Random Walk For Dummies

The Random Walk For Dummies The Radom Walk For Dummies Richard A Mote Abstract We look at the priciples goverig the oe-dimesioal discrete radom walk First we review five basic cocepts of probability theory The we cosider the Beroulli

More information

Definition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4.

Definition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4. 4. BASES I BAACH SPACES 39 4. BASES I BAACH SPACES Sice a Baach space X is a vector space, it must possess a Hamel, or vector space, basis, i.e., a subset {x γ } γ Γ whose fiite liear spa is all of X ad

More information

Lecture 12: November 13, 2018

Lecture 12: November 13, 2018 Mathematical Toolkit Autum 2018 Lecturer: Madhur Tulsiai Lecture 12: November 13, 2018 1 Radomized polyomial idetity testig We will use our kowledge of coditioal probability to prove the followig lemma,

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

The natural exponential function

The natural exponential function The atural expoetial fuctio Attila Máté Brookly College of the City Uiversity of New York December, 205 Cotets The atural expoetial fuctio for real x. Beroulli s iequality.....................................2

More information

Mathematical Methods for Physics and Engineering

Mathematical Methods for Physics and Engineering Mathematical Methods for Physics ad Egieerig Lecture otes Sergei V. Shabaov Departmet of Mathematics, Uiversity of Florida, Gaiesville, FL 326 USA CHAPTER The theory of covergece. Numerical sequeces..

More information

Lesson 10: Limits and Continuity

Lesson 10: Limits and Continuity www.scimsacademy.com Lesso 10: Limits ad Cotiuity SCIMS Academy 1 Limit of a fuctio The cocept of limit of a fuctio is cetral to all other cocepts i calculus (like cotiuity, derivative, defiite itegrals

More information