BAYESIAN NETWORK REASONING WITH UNCERTAIN EVIDENCES


International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems
Vol. 18, No. 5 (2010)
World Scientific Publishing Company
DOI: 0.42/S

YUN PENG
University of Maryland Baltimore County, Computer Science and Electrical Engineering, 1000 Hilltop Circle, Baltimore, MD 21250, USA
ypeng@umbc.edu

SHENYONG ZHANG
University of Science and Technology of China, Hefei, Anhui, China
University of Maryland Baltimore County, Computer Science and Electrical Engineering, 1000 Hilltop Circle, Baltimore, MD 21250, USA

RONG PAN
NexTag Inc., San Mateo, CA 94402, USA

Received 3 October 2009
Revised 2 June 2010

This paper investigates the problem of belief update in Bayesian networks (BN) with uncertain evidence. Two types of uncertain evidences are identified: virtual evidence (reflecting the uncertainty one has about a reported observation) and soft evidence (reflecting the uncertainty of an event one observes). Each of the two types of evidence has its own characteristics and obeys a belief update rule that is different from hard evidence, and different from each other. The particular emphasis is on belief update with multiple uncertain evidences. Efficient algorithms for BN reasoning with consistent and inconsistent uncertain evidences are developed, and their convergences analyzed. These algorithms can be seen as combining the techniques of traditional BN reasoning, Pearl's virtual evidence method, Jeffrey's rule, and the iterative proportional fitting procedure.

Keywords: Bayesian networks; belief update; probabilistic reasoning; uncertain evidence.

1. Introduction

This paper considers the problem of probabilistic reasoning with uncertain evidences. A regular evidence, called hard evidence in the literature, is an observation of a random variable, say X, having a particular value (or being in a particular state), say a, represented as an instantiation X = a. However, it is not always possible to observe the value a variable has in a particular case, or to have complete trust in a claimed observation, thus bringing uncertainty to the evidences.
This paper focuses on two types of uncertain evidences. The first type, called soft evidence as suggested by others (Ref. 9), can be interpreted as evidence of uncertainty, and is represented as a probability distribution of one or more variables. The second type, called virtual evidence, can be interpreted as evidence with uncertainty, and is represented as a likelihood ratio (Ref. 6). These two types of evidences reflect different kinds of uncertainty and each obeys a belief update rule that is different from hard evidence, and different from each other.

Based on an in-depth examination of these two types of uncertain evidences, we have developed efficient algorithms for belief update in Bayesian networks (BN) with such evidences. We focus on BN because of its popularity in intelligent systems and its time and space efficiency in representing and reasoning with probabilistic information (Ref. 5). However, many theoretical results we obtained hold for belief update of joint distributions that are not represented by BNs.

Related existing work can be found in Refs. 6, 9, 3, 20 and 2. Pearl was among the first to raise the issue of uncertain evidence and proposed the virtual evidence method (Ref. 6). However, as can be seen in Sec. 3, this method is not directly applicable to the situation in which multiple soft evidences are presented. Chan and Darwiche provided a thorough analysis that connects Pearl's virtual evidence method and Jeffrey's rule for both general joint distributions and BNs (Ref. 3). They also showed that a soft evidence can be converted into a virtual evidence and, as a result, belief update with a single soft evidence can be carried out by Pearl's virtual evidence method for both BN and joint distributions. They argued that multiple uncertain evidences should not be allowed for belief update at the same time. Vomlel, on the other hand, argued that multiple uncertain evidences, even if they are inconsistent with each other, should be allowed, and developed an algorithm, named GEMA, for such purpose (Ref. 2). However, GEMA was devised for general joint distributions, not for BNs. Valtorta et al. proposed to extend the iterative proportional fitting procedure (IPFP) for BN belief update with multiple consistent soft evidences (Ref. 9).

Our research extends these works in a number of significant ways. The results presented in this paper can be summarized as follows. (1) We formally established the equivalence of Jeffrey's rule, I-projection (a central operation of IPFP), and the virtual evidence method, when dealing with a single uncertain evidence.
We also established that Pearl's virtual evidence method works for multiple virtual evidences but not for multiple soft evidences. (2) We, for the first time, proved that I-projection and IPFP, which are known to minimize the I-divergence (or Kullback-Leibler distance), also minimize the total variation between the source and the projected distributions. (3) We developed BN-IPFP, an efficient algorithm that combines Pearl's virtual evidence method and IPFP for BN belief update with multiple consistent soft evidences, and proved its convergence. (4) We developed SMOOTH, an algorithm for belief update with inconsistent soft evidences, and proved its convergence for the case of two evidences. SMOOTH can be easily incorporated into BN-IPFP for BN update with inconsistent evidences.

The rest of the paper is organized as follows. Section 2 provides technical preliminaries with brief introductions to Jeffrey's rule, I-projection, and IPFP. Section 3 analyzes the two types of uncertain evidences. Section 4 develops two versions of algorithm BN-IPFP. Section 5 discusses issues related to inconsistent evidences and develops algorithm SMOOTH. Section 6 concludes with a discussion on evidential reasoning in which different types of evidences (hard, virtual, and soft) are given either sequentially or at the same time, followed by the directions of future research.

For presentational clarity, proofs of theorems of our own (Theorems 2, 4, 6, 7) are given in the Appendix. We re-state some theorems of others that are of immediate relevance to this work; for their proofs the reader is referred to the original publications.

A number of computer experiments with artificial data were conducted to validate our results and to compare the performances of different methods. All experiments were run on an Intel Core 2 CPU of 2.40 GHz with 2.0 GB maximum memory for the JVM (Java Virtual Machine). The Netica^a Java API and its junction tree based inference engine were used for standard BN inference.

2. Preliminaries

Throughout this paper, we use upper-case $X = (X_1, X_2, \ldots, X_n)$ for the set of all random variables of interest and $X_i$ for individual random variables; lower-case $x$ and $x_i$ denote particular and arbitrary instantiation(s) of the respective variable(s); and bold upper-case $\mathbf{X}$, $\mathbf{X}_i$ denote the sets of all possible instantiations. $Y$, $Y^1$, $Y^2$, ... are used for subsets of $X$, with $y$ and $\mathbf{Y}$ for their instantiations similarly. Upper-case $P, Q, R, S, T$ are reserved for probability distributions; $P(X)$ indicates a joint distribution; and $Q(Y)$ denotes the marginal distribution of $Q(X)$ over a subset of variables $Y$. Bold upper-case $\mathbf{P}, \mathbf{Q}, \mathbf{R}, \mathbf{S}, \mathbf{T}$ are reserved for sets of distributions. In particular, $\mathbf{P}_{R(Y)} = \{P(X) \mid P(Y) = R(Y)\}$ denotes the set of all distributions over $X$ whose marginals over $Y \subseteq X$ equal $R(Y)$.

2.1. Jeffrey's rule and I-projection

How to update a distribution $P(X)$ by another lower dimensional distribution $R(Y)$, $Y \subseteq X$, has been debated for a long time in the mathematics and philosophy communities (Refs. 2, 6, 3). One of the difficulties stems from the fact that Bayes' rule cannot be applied directly here because $R(y)$, although acting as a condition for the update, is itself not an event. One approach, proposed by R. Jeffrey (Ref. 2), is based on two principles: the new, posterior distribution $Q(X)$ should 1) satisfy $R(Y)$ (i.e., $Q(Y) = R(Y)$) and 2) keep the conditional distribution of $X$, given $Y \subseteq X$, unchanged (e.g., $Q(X \setminus Y \mid Y) = P(X \setminus Y \mid Y)$).
The second principle, known as probability kinematics, has the effect of keeping the change in the update minimum. Then for a given $R(Y)$ and $Z \subseteq X \setminus Y$, we can compute the probabilities

$$Q(z) = \sum_{y \in \mathbf{Y}} P(z \mid y)\, R(y) = \sum_{y \in \mathbf{Y}} \frac{P(z, y)}{P(y)}\, R(y) \qquad (1)$$

where $y \in \mathbf{Y}$ indicates that the summation is over all instantiations of $Y$. Equation (1) is known as Jeffrey's rule (Ref. 2) or J-conditioning. From (1), let $Z = X \setminus Y$; then, for any instantiation $x$ (with $y$ denoting its restriction to $Y$) we have the updated distribution

^a Netica: Bayesian network tool from Norsys Software Corp.

$$Q(x) = \begin{cases} P(x)\,\dfrac{R(y)}{P(y)} & \text{if } P(y) > 0 \\ 0 & \text{otherwise} \end{cases} \qquad (2)$$

Two functions have been used widely to measure the distance or difference between two distributions over $X$. Their definitions are given below.

Definition 1 (Ref. 20). The I-divergence (also known as Kullback-Leibler distance and relative entropy) between $P(X)$ and $Q(X)$ is given by

$$I(P \| Q) = \begin{cases} \displaystyle\sum_{x:\, P(x) > 0} P(x) \log \frac{P(x)}{Q(x)} & \text{if } P \ll Q \\ +\infty & \text{otherwise} \end{cases} \qquad (3)$$

where $P \ll Q$ denotes that $P$ is dominated by $Q$, i.e., $\{x \mid P(x) > 0\} \subseteq \{x \mid Q(x) > 0\}$. Note that $I(P \| Q) \ge 0$ for all $P$ and $Q$, and the equality holds only if $P = Q$. Also note that in general $I(P \| Q) \ne I(Q \| P)$, so I-divergence is not a true distance metric.

Definition 2. The total variation between $P(X)$ and $Q(X)$ is defined as

$$\delta(P, Q) = \sum_{x \in \mathbf{X}} |P(x) - Q(x)| \qquad (4)$$

Now we define I-projection, one of the central concepts for our work.

Definition 3. $Q(x)$ is said to be an I-projection^b of $P(x)$ on a convex set of distributions $S$ if

$$I(Q \| P) = \min_{\tilde{Q} \in S} I(\tilde{Q} \| P) \qquad (5)$$

It has been shown that, because of the convexity of $S$, the I-projection is unique (Ref. 6). We are particularly interested in I-projections on $\mathbf{P}_{R(Y)}$, the set of distributions whose marginals over $Y$ equal $R(Y)$. $\mathbf{P}_{R(Y)}$ is known to be convex, and the I-projection of $P(x)$ on $\mathbf{P}_{R(Y)}$ can be calculated by (Ref. 20)

$$Q(x) = \begin{cases} P(x)\,\dfrac{R(y)}{P(y)} & \text{if } P(y) > 0 \\ 0 & \text{otherwise} \end{cases} \qquad (6)$$

Note that (6) is exactly the same as (2). This proves the following theorem.

Theorem 1. Let $Q(X)$ be the distribution resulting from updating $P(X)$ by $R(Y)$, $Y \subseteq X$, using Jeffrey's rule of (2). Then $Q(X)$ is the I-projection of $P(X)$ on $\mathbf{P}_{R(Y)}$.

^b The I-projection defined here is also called $I_1$-projection in the literature. Since I-divergence is not symmetric, another projection, namely the $I_2$-projection $Q'$ on $\mathbf{Q}$, is defined that minimizes the I-divergence $I(P \| Q')$. Unlike the $I_1$-projection, the $I_2$-projection in general is not unique. In this paper, all I-projections refer to $I_1$-projections.
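As an illustration of Jeffrey's rule (2) and of Definitions 1 and 2, the following is a minimal sketch on a small joint distribution stored as a NumPy array with one axis per variable. The function names, the array layout, and the numbers in the 2x2 joint are assumptions made only for this example; they do not come from the paper.

```python
import numpy as np

def marginal(joint, axes):
    """Marginal over the variables on `axes`, kept broadcastable against `joint`."""
    other = tuple(a for a in range(joint.ndim) if a not in axes)
    return joint.sum(axis=other, keepdims=True)

def jeffrey_update(p, r, axes):
    """Jeffrey's rule (2): Q(x) = P(x) * R(y) / P(y), and 0 where P(y) = 0.

    `axes` must be listed in increasing order so that `r` lines up with them.
    """
    p_y = marginal(p, axes)
    r = np.asarray(r, dtype=float).reshape(p_y.shape)
    ratio = np.divide(r, p_y, out=np.zeros_like(p_y), where=p_y > 0)
    return p * ratio

def i_divergence(p, q):
    """I(P || Q) of Definition 1; +inf when P is not dominated by Q."""
    p, q = np.ravel(p), np.ravel(q)
    support = p > 0
    if np.any(q[support] == 0):
        return np.inf
    return float(np.sum(p[support] * np.log(p[support] / q[support])))

def total_variation(p, q):
    """delta(P, Q) of Definition 2."""
    return float(np.abs(np.asarray(p) - np.asarray(q)).sum())

# Hypothetical joint over (Y, Z); rows index Y, columns index Z.
p = np.array([[0.1, 0.3],
              [0.4, 0.2]])
q = jeffrey_update(p, [0.7, 0.3], axes=(0,))

print(q.sum(axis=1))            # marginal over Y after the update
print(i_divergence(q, p))       # I(Q || P), the quantity minimized in (5)
print(total_variation(p, q))    # delta(P, Q)
```

In this toy case the total variation between P and Q equals the total variation between the marginals P(Y) and R(Y), which is the smallest value any distribution with marginal R(Y) can achieve.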

Next we show that I-projection by (6) not only minimizes the I-divergence, but also the total variation.

Theorem 2. Let $Q(X)$ be the I-projection of $P(x)$ on $\mathbf{P}_{R(Y)}$. Then

$$\delta(P, Q) = \min_{\tilde{Q} \in \mathbf{P}_{R(Y)}} \delta(P, \tilde{Q}).$$

2.2. IPFP

For a single constraint $R(Y)$, the I-projection of $P(X)$ on $\mathbf{P}_{R(Y)}$ finds a distribution that satisfies this constraint and is closest to $P(X)$ (in terms of I-divergence), provided $R(Y) \ll P(Y)$. The iterative proportional fitting procedure (IPFP) extends this idea to modify $P(X)$ with multiple constraints by continuously projecting the distribution resulting from the previous iteration to $\mathbf{P}_{R(Y^j)}$ of the next constraint $R(Y^j)$. This procedure is formally defined as follows.

Definition 4. Let $R = (R(Y^1), \ldots, R(Y^m))$ be a set of constraints and $Q_0(X)$ the initial distribution. Then for $k = 1, 2, \ldots$, $j = 1 + (k - 1) \bmod m$, and $R(Y^j) \ll Q_{k-1}(Y^j)$ for all $k, j$, IPFP is defined by

$$Q_k(x) = \begin{cases} Q_{k-1}(x)\,\dfrac{R(y^j)}{Q_{k-1}(y^j)} & \text{if } Q_{k-1}(y^j) > 0 \\ 0 & \text{otherwise} \end{cases} \qquad (7)$$

In (7), $m$ is the number of constraints, $k$ is the iteration index, and $j$ determines the constraint used at step $k$. For clarity, in the rest of this paper, we write (7) as

$$Q_k(x) = Q_{k-1}(x)\,\frac{R(y^j)}{Q_{k-1}(y^j)} \qquad (7\text{-}1)$$

with the understanding that $Q_k(x) = 0$ when $Q_{k-1}(y^j) = 0$.

IPFP first appeared in the literature in Ref. 3, and shortly after was used as a procedure to estimate cell frequencies in contingency tables under some marginal constraints (Ref. 8). IPFP was extended in Refs. and 5 to also allow conditional distributions as constraints (conditional or C-IPFP). The convergence of IPFP was studied in Refs. 7, 10, and 17 with proofs under different conditions; the convergence of C-IPFP can be found in Ref. 5. For our purpose, we cite a result from Ref. 20 in the theorem below, which is based on the I-divergence geometry developed in Ref. 7.

Theorem 3. Let $R = (R(Y^1), \ldots, R(Y^m))$ be a set of constraints. If $S = \bigcap_{j=1}^{m} \mathbf{P}_{R(Y^j)} \ne \emptyset$, then IPFP of (7) converges and the converging distribution $Q^*(X)$ is the I-projection of $Q_0(X)$ on $S$.

If $S \ne \emptyset$, these constraints are said to be consistent with each other, and each distribution in $S$ satisfies all constraints in $R$.
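The iteration (7-1) can be sketched as follows for a joint distribution held in a NumPy array. This is only an illustrative sketch under stated assumptions: the function names, the stopping rule (total change over one full cycle of constraints below a tolerance), and the random initial distribution are all choices made here, not part of the definition above.

```python
import numpy as np

def marginal(joint, axes):
    """Marginal over the variables on `axes`, kept broadcastable against `joint`."""
    other = tuple(a for a in range(joint.ndim) if a not in axes)
    return joint.sum(axis=other, keepdims=True)

def ipfp(q0, constraints, tol=1e-10, max_sweeps=1000):
    """Cycle through the constraints, applying (7-1) once per constraint.

    `constraints` is a list of (axes, R) pairs, with `axes` in increasing order.
    One sweep covers all m constraints, i.e. m iterations in the sense of (7).
    """
    q = np.array(q0, dtype=float)
    for _ in range(max_sweeps):
        q_prev = q.copy()
        for axes, r in constraints:
            q_y = marginal(q, axes)
            r = np.asarray(r, dtype=float).reshape(q_y.shape)
            # Entries with Q_{k-1}(y) = 0 stay 0, as required by (7).
            q = q * np.divide(r, q_y, out=np.zeros_like(q_y), where=q_y > 0)
        if np.abs(q - q_prev).sum() < tol:
            break
    return q

# Hypothetical initial distribution over three binary variables.
rng = np.random.default_rng(0)
q0 = rng.random((2, 2, 2))
q0 /= q0.sum()

# Two consistent single-variable constraints (illustrative values).
constraints = [((0,), [0.7, 0.3]), ((1,), [0.3, 0.7])]
q_star = ipfp(q0, constraints)

print(q_star.sum(axis=(1, 2)))  # marginal of the first variable
print(q_star.sum(axis=(0, 2)))  # marginal of the second variable
```

Every entry of the joint table is touched in each iteration, which is the cost that motivates working on a smaller table or on likelihood ratios when the distribution is represented as a BN.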
Therefore, at convergence, $Q^*(X)$, as the

I-projection on $S$, has the minimum I-divergence among those that satisfy all constraints in $R$. Next we show that IPFP also minimizes the total variation in the next two theorems.

Theorem 4. Consider an initial distribution $Q_0(X)$ and a set of consistent constraints $R = (R(Y^1), \ldots, R(Y^m))$. Let $Q^*(X)$ be the converging distribution when applying IPFP on $Q_0(X)$ using constraints in $R$, let $Y = Y^1 \cup Y^2 \cup \cdots \cup Y^m$, and let $Q^*(Y)$ be the converging distribution when applying IPFP on $Q_0(Y)$ using constraints in $R$. Then

$$Q^*(x) = Q_0(x)\,\frac{Q^*(y)}{Q_0(y)}. \qquad (8)$$

Comparing Theorem 4 and (7-1), IPFP on $Q_0(X)$ with $m$ constraints is equivalent to modifying $Q_0(X)$ by a single constraint $Q^*(Y)$. That is, $Q^*(X)$ is the I-projection of $Q_0(X)$ on $\mathbf{P}_{Q^*(Y)}$. This, together with Theorem 2, leads to the following theorem.

Theorem 5. Let $Q^*(X)$ be the converging distribution using IPFP with an initial distribution $Q_0(X)$ and a set of constraints $R = (R(Y^1), \ldots, R(Y^m))$ with $S = \bigcap_{j=1}^{m} \mathbf{P}_{R(Y^j)} \ne \emptyset$. Then

$$\delta(P, Q^*) = \min_{\tilde{Q} \in S} \delta(P, \tilde{Q}).$$

To the best of our knowledge, Theorems 2, 4, and 5 are original results which have not been reported in the literature before.

IPFP bears a great resemblance to another family of procedures known as alternating projection, which finds a point in the intersection of several convex sets by a sequence of projections onto these sets. Alternating projection has been widely used as an optimization method in areas of sampling theory, signal processing, and neural networks. A comprehensive review of this method can be found in Ref. 4. The difference from IPFP is that alternating projection is primarily for Euclidean spaces and tends to minimize the square distances, while IPFP is for probability spaces and minimizes I-divergence (and the total variation by our result in Theorem 5) but not the square distances (Ref. 10). Several IPFP-based algorithms we will discuss, especially those for inconsistent evidences, can find their counterparts in alternating projection procedures.

3. Uncertain Evidences

Evidences presented for belief update may be uncertain for various reasons.
A reported observation may not be totally trusted due to errors or noise in the observation or reporting process; it may be biased due to the observer's preference; it may not hold when the time or location is different. Among all types of uncertain evidences, this paper concentrates on two of them, named virtual evidence and soft evidence.

3.1. Virtual evidences

Pearl (Ref. 6) proposed the virtual evidence method to deal with BN belief update when one is uncertain about a claim of a hard evidence (i.e., an event), say, $X = a$. Suppose we believe with probability $p$ that this claim is actually due to the occurrence of $X = a$; then the probability that it is not occurring is $1 - p$. The virtual evidence method requires this uncertainty information be given as a likelihood ratio $L(X) = p : (1 - p)$, not necessarily

the specific probabilities. To reason with virtual evidence in a BN, Pearl's method extends the given BN by creating a binary virtual node $U$, with state $u$ standing for the event that $X = a$ is claimed to have occurred. The virtual node $U$ has $X$ as its only parent, and its conditional probability table (CPT) satisfies $P(u \mid X = a) : P(u \mid X \ne a) = L(X)$. Then the belief update (with the claimed observation and the uncertainty about this claim in the form of the likelihood ratio $L$) can be done by instantiating $U$ to $u$ (i.e., treating $u$ as a hard evidence). Many BN engines accept a likelihood ratio as input for the update without explicitly introducing the virtual node.

This method is generalized in Ref. 3 to any arbitrary set of mutually exclusive and exhaustive events and the associated likelihood ratio, and from BN to any joint distributions. Under this generalization, virtual evidence on $Y \subseteq X$ is represented as a likelihood ratio

$$L(Y) = P(ob(y_{(1)}) \mid y_{(1)}) : P(ob(y_{(2)}) \mid y_{(2)}) : \cdots : P(ob(y_{(s)}) \mid y_{(s)}),$$

where $y_{(1)}, y_{(2)}, \ldots, y_{(s)} \in \mathbf{Y}$ are all instantiations of $Y$, $ob(y_{(i)})$ denotes the event that we observed that $Y = y_{(i)}$ is true, and $P(ob(y_{(i)}) \mid y_{(i)})$ is interpreted as the probability that we observe $Y = y_{(i)}$ if $Y$ is indeed in state $y_{(i)}$.

3.2. Soft evidences

Soft evidence, named by Valtorta (Ref. 9), is given as a distribution $R(Y)$, $Y \subseteq X$. This kind of evidence can be seen in many places. For example, one may not be able to observe the precise state of a variable for a given case but may know its distribution. Also, sometimes it is more important to know the distribution of a variable than its precise state at a given moment. When two BNs (or some other data and knowledge sources of probabilistic or statistical nature) interact with each other, the information exchanged between them is often in the form of probability distributions of shared variables. For a given soft evidence, say $R(X)$, even though we are uncertain about the specific state $X$ is in, we are certain about its distribution.
In other words, $R(X)$ is a true (and certain) observation, and this distribution should be preserved in the updated joint distribution $Q^*$ (i.e., $Q^*(X) = R(X)$). In this sense, soft evidences should be treated the same as hard evidence. In fact, a hard evidence, say $X = a$, is a special case of soft evidence ($R(X = a) = 1$, $R(X = b) = 0$ for all states $b \ne a$).

As suggested in Ref. 3, Jeffrey's rule of (2) is a natural choice for updating a joint distribution $P(X)$ by a soft evidence $R(Y)$, $Y \subseteq X$, because the updated distribution preserves $R(Y)$ while making minimum changes to the original distribution. However, Jeffrey's rule cannot be applied directly when the joint distribution is represented as a BN. This can be overcome by converting a soft evidence to a virtual evidence, as suggested by Ref. 3. Consider a distribution $P(X)$ and a soft evidence $R(Y)$, $Y \subseteq X$. All possible instantiations of $Y$, $y_{(1)}, y_{(2)}, \ldots, y_{(l)} \in \mathbf{Y}$, form a mutually exclusive and exhaustive set of events. $R(Y)$ then can be converted to a virtual evidence with the likelihood ratio

$$L(Y) = \frac{R(y_{(1)})}{P(y_{(1)})} : \frac{R(y_{(2)})}{P(y_{(2)})} : \cdots : \frac{R(y_{(l)})}{P(y_{(l)})} \qquad (9)$$

As shown in Theorem 5 of Ref. 3, when this virtual evidence is applied to $P(X)$, the new distribution is exactly the same as the one obtained by applying $R(Y)$ using Jeffrey's rule of (2).

3.3. Multiple uncertain evidences

Like hard evidences, multiple uncertain evidences can arrive at the same time or in a sequence. There is no problem for belief update by multiple virtual evidences, because what is required is that the updated distribution preserves the given likelihoods. Update can be done by simply treating each virtual evidence as a hard evidence on the virtual node and instantiating that node to true. Note that, since a virtual node $U$ is independent of all other virtual nodes given the parent of $U$ (i.e., they are d-separated), the likelihood ratio reflected on $U$ will not be affected by the belief update operations with other virtual (and hard) evidences.

However, this is not the case when updating by two soft evidences $se_1 = R(Y^1)$ and $se_2 = R(Y^2)$. To satisfy both $se_1$ and $se_2$, the updated distribution $Q$ is required to have its marginals $Q(Y^1) = R(Y^1)$ and $Q(Y^2) = R(Y^2)$. Update cannot be done by first converting $se_1$ and $se_2$ to two virtual evidences and then applying the virtual evidence method with these two virtual evidences. This is because, after applying the first evidence, there is no way to hold $Q(Y^1) = R(Y^1)$ when the second evidence is applied. Furthermore, as can be seen in the example below, when the soft evidences are presented in different orders or altogether, different update results will be generated. This problem, known as the commutativity of iterated revisions, has been viewed as a problem for Jeffrey's rule (Refs. 3, 22).

Example 1. As depicted in Fig. 1, we are given a BN of four binary variables A, B, C, and D and two soft evidences $se_1$: R(B) = (0.7, 0.3) and $se_2$: R(C) = (0.3, 0.7). To convert them to virtual evidences, we first compute from the BN the marginals P(B) = (0.44, 0.56) and P(C) = (0.45, 0.55), then compute the likelihood ratios by (9) as L(B) = 0.7/0.44 : 0.3/0.56 = 1.5909 : 0.5357 and similarly L(C) = 0.3/0.45 : 0.7/0.55 = 0.6667 : 1.2727.

[Fig. 1. An example BN of 4 variables.]

As can be seen in rows 2 and 3 of Table 1 below, when the two virtual evidences are applied separately, the updated beliefs satisfy the corresponding $se_1$ and $se_2$ (beliefs on B = 1 and C = 1 are updated to 0.7 and 0.3, respectively). Rows 4 and 5 show the update results when these two virtual evidences are applied together and in a sequence, respectively. It is no surprise that the results are the same, since, as mentioned earlier, belief update with multiple virtual evidences is equivalent to belief update with multiple hard evidences on the virtual evidence nodes. Let $U_1$ and $U_2$ be the two virtual evidence nodes. It can be verified that $P(u_1 \mid B = 1, u_2) : P(u_1 \mid B = 0, u_2) = L(B)$ and $P(u_2 \mid C = 1, u_1) : P(u_2 \mid C = 0, u_1) = L(C)$, i.e., the likelihood ratios are preserved when the other evidence is presented. However, as can be seen in rows 4 and 5, neither of these two soft evidences is satisfied by the resulting distributions.

To deal with this problem, one may suggest that, before applying $se_2$, we first recalculate a new likelihood ratio $L'(C)$ for $se_2$ using the distribution updated by $se_1$ (row 2). By (9), we have $L'(C)$ = 0.3/0.425 : 0.7/0.575 = 0.7059 : 1.2174. Row 6 shows the update result, where $se_2$ is satisfied but the belief on B = 1 is moved away from what is required by $se_1$ (from to 0.70).

Table 1. Belief update on BN of Example 1.

  Evidences                   Belief on B = 1    Belief on C = 1
  1. original
  2. using L(B)
  3. using L(C)
  4. L(B) and L(C)
  5. L(B) then L(C)
  6. L(B) then L'(C)

Some argued, based on the "all things considered" interpretation of soft evidence, that belief update with such evidences should not be commutative (Ref. 3). In contrast, we argue that soft evidences are true observations of distributions of some events and, as such, they all should be preserved in the updated posterior distribution; also that, if one or more such distributions exist, the one with the minimum I-divergence to the original distribution can be found by IPFP, using these evidences as constraints. However, IPFP works on full joint distributions, and thus is not directly applicable to belief update in BNs.
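The conversion (9) and the order problem discussed in Example 1 can be reproduced in a few lines. The sketch below uses the marginals P(B), P(C) and the soft evidences R(B), R(C) given in Example 1, but the 2x2 joint over (B, C) is a hypothetical stand-in chosen only to have those marginals; it is not the BN of Fig. 1, so the resulting beliefs will differ from Table 1. Function names are likewise assumptions for illustration.

```python
import numpy as np

def likelihood_ratio(r, p):
    """Eq. (9): L(y) = R(y) / P(y), meaningful up to a positive constant."""
    return np.asarray(r, dtype=float) / np.asarray(p, dtype=float)

def apply_virtual(joint, lik, axis):
    """Virtual-evidence update on a joint table: Q(x) proportional to P(x) * L(y)."""
    shape = [1] * joint.ndim
    shape[axis] = -1
    q = joint * np.asarray(lik, dtype=float).reshape(shape)
    return q / q.sum()

# Likelihood ratios from the marginals stated in Example 1.
L_B = likelihood_ratio([0.7, 0.3], [0.44, 0.56])
L_C = likelihood_ratio([0.3, 0.7], [0.45, 0.55])

# Hypothetical joint over (B, C) with P(B) = (0.44, 0.56), P(C) = (0.45, 0.55).
joint = np.array([[0.25, 0.19],
                  [0.20, 0.36]])

q_b = apply_virtual(joint, L_B, axis=0)    # se1 alone: B marginal becomes R(B)
q_bc = apply_virtual(q_b, L_C, axis=1)     # L(B) then L(C)
q_cb = apply_virtual(apply_virtual(joint, L_C, axis=1), L_B, axis=0)  # L(C) then L(B)

# Recompute the ratio for se2 from the distribution already updated by se1.
L_C_new = likelihood_ratio([0.3, 0.7], q_b.sum(axis=0))
q_b_then_c = apply_virtual(q_b, L_C_new, axis=1)

print(L_B, L_C)
print(q_bc.sum(axis=1), q_bc.sum(axis=0))              # neither R(B) nor R(C) holds
print(q_b_then_c.sum(axis=1), q_b_then_c.sum(axis=0))  # R(C) holds, R(B) does not
```

The two virtual evidences commute (`q_bc` equals `q_cb`), which is consistent with treating them as hard evidences on separate virtual nodes, yet the resulting marginals match neither soft evidence.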
In the next section, we develop algorithm BN-IPFP for BN belief update with multiple soft evidences. This algorithm first converts all soft evidences to virtual evidence form and then iterates in IPFP style to update the BN until it settles down to a distribution that satisfies all given soft evidences.

Another issue that arises with multiple soft evidences is that these evidences may not be consistent with each other, i.e., there is no distribution that satisfies all given evidences. This problem is dealt with in Sec. 5.

4. BN-IPFP

The problem is stated as follows. We are given a BN on variables $X = (X_1, X_2, \ldots, X_n)$ with the joint probability $P(X) = \prod_i P(X_i \mid \pi_i)$, where $P(X_i \mid \pi_i)$ is the CPT for

variable $X_i$, and a set of soft evidences $R = (R(Y^1), \ldots, R(Y^m))$ where $Y^1, Y^2, \ldots, Y^m \subseteq X$. Suppose the constraints in $R$ 1) are consistent, and 2) satisfy the dominance condition: for all $R(Y^j) \in R$, $R(Y^j) \ll P(Y^j)$, in the sense of Definition 1 and as required for the I-projection in Sec. 2.2. Then the belief update of the given BN by $R$ is to find $Q^*(X)$ which 1) satisfies all evidences in $R$; and 2) has minimum I-divergence to $P(X)$.

For small BNs, one can explicitly generate the full joint distribution $P(X)$ from the given BN and then apply IPFP using the soft evidences in $R$ as constraints to update the distribution. This, however, is infeasible for large BNs, because the distribution would be prohibitively large and IPFP would be computationally extremely expensive, as it needs to literally modify each entry of the joint distribution table in each iteration.

To address this problem, Valtorta, Kim and Vomlel have devised a variation of the Junction-Tree (JT) algorithm based on IPFP (Ref. 9) that utilizes the interdependencies captured in the BN structure. One version of this algorithm works in the situation where all variables in each $Y^j$ are contained in one clique $C_j$ in the JT. Then the belief update goes iteratively over the evidences in cycle: in each iteration, $Q(C_j)$ is updated by the corresponding $R(Y^j)$ and then the change of $Q(C_j)$ is propagated to the rest of the JT by the regular JT method. The general situation, where a soft evidence may involve variables in more than one clique, is dealt with by another version called the Big Clique algorithm. In this algorithm, when constructing the JT, all soft evidence nodes (i.e., those variables that are involved in any of the soft evidences) are fully connected with each other by additional undirected edges. After triangulation, all soft evidence nodes appear in a single clique (the Big Clique). The belief update is done by first updating the big clique using all evidences in $R$ by running IPFP to convergence and then propagating the resulting distribution of this clique to the rest of the JT. The Big Clique algorithm becomes time and space inefficient when the size of the big clique itself becomes large.
Both versions are shown to converge, and the converging joint distribution satisfies all evidences in $R$, provided these constraints are consistent with each other.

One limitation of these JT-based belief update algorithms is that they cannot be easily adopted by those using inference mechanisms other than JT. Also, they require incorporating IPFP operations into the JT procedure, causing re-coding of the existing JT inference engine. The authors of Ref. 9 mentioned the possibility of implementing the first version of their algorithm as a wrapper around the Hugin shell or other JT engines, but no suggestion of how this can be done was given.

To address these issues, we propose two new algorithms for BN inference with multiple soft evidences. Both algorithms utilize IPFP, although in quite different ways. The first algorithm combines the idea of IPFP and the encoding of soft evidence by virtual evidence of (9). The second algorithm is based on Theorem 4; it is similar to the Big Clique algorithm but it decouples the IPFP from JT (or any specific BN inference engine). These two algorithms are presented in the next two subsections.

4.1. BN-IPFP-1

As shown earlier, although a single soft evidence can be applied to BN belief update by first converting it to a virtual evidence, this approach does not work with multiple

evidences. As can be seen in Example 1 at the end of the last section, after updating by $se_2$, the distribution no longer satisfies $se_1$. What is needed is a method that can convert soft evidences in $R$ to one or more likelihood ratios which, when applied as virtual evidences to the BN, preserve the marginal distributions specified in every soft evidence. Algorithm BN-IPFP-1 presented below accomplishes this by combining the idea of IPFP and the virtual evidence method.

Roughly speaking, this algorithm goes as follows. Like IPFP, it is an iterative process, starting with $Q_0(X) = P(X)$, and one soft evidence $R(Y^j)$ is considered at each iteration. If the marginal $Q_{k-1}(Y^j)$ of the current distribution equals $R(Y^j)$, then it does nothing; otherwise, a new virtual evidence (in the form of a likelihood ratio) is created based on the current $Q_{k-1}(Y^j)$ and $R(Y^j)$ according to (9) and applied to modify $Q_{k-1}(Y^j)$. The algorithm is given below.

Algorithm BN-IPFP-1. Consider a BN with prior distribution $P(X)$, and a set of consistent soft evidences $R = (R(Y^1), \ldots, R(Y^m))$. We use the following iterative procedure for belief update:

1. $Q_0(X) = P(X)$; $k = 1$;
2. Repeat the following until convergence;
   2.1 $j = 1 + (k - 1) \bmod m$; $l = 1 + \lfloor (k - 1)/m \rfloor$;
   2.2 Construct virtual evidence with likelihood ratio
       $$L_{j,l}(Y^j) = \frac{R(y^j_{(1)})}{Q_{k-1}(y^j_{(1)})} : \frac{R(y^j_{(2)})}{Q_{k-1}(y^j_{(2)})} : \cdots : \frac{R(y^j_{(s)})}{Q_{k-1}(y^j_{(s)})}$$
       where $y^j_{(1)}, y^j_{(2)}, \ldots, y^j_{(s)} \in \mathbf{Y}^j$ are state configurations of $Y^j$;
   2.3 Obtain $Q_k(X)$ by updating $Q_{k-1}(X)$ with $L_{j,l}(Y^j)$ using Pearl's virtual evidence method;
   2.4 $k = k + 1$;

The core of this algorithm is Step 2.2, which adds a new virtual evidence with likelihood ratio $L_{j,l}(Y^j)$, where the second subscript, $l$, is the number of virtual evidences created for $R(Y^j)$, incremented every $m$ iterations. Note that the sequence of likelihood ratios for each $R(Y^j)$ can be accumulated as a single one, $L_j(Y^j) = \prod_l L_{j,l}(Y^j)$.

4.2. BN-IPFP-2

BN-IPFP-1 may become expensive when the given BN is large because it computes the marginal $Q_{k-1}(Y^j)$ (Step 2.2) and updates the beliefs of the entire BN (Step 2.3) in each iteration.
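The steps of Algorithm BN-IPFP-1 can be sketched as follows. This is an illustrative sketch under a strong simplifying assumption: the BN inference engine is replaced by an explicit joint table held in a NumPy array, so "updating by virtual evidence" is just a multiplication followed by normalization. With a real BN engine, Steps 2.2 and 2.3 would be a marginal query and a likelihood (virtual evidence) finding, respectively. All function names, the 0-based index `j`, and the stopping test are hypothetical choices for this example.

```python
import numpy as np

def marginal(joint, axes):
    """Marginal over the variables on `axes`, kept broadcastable against `joint`."""
    other = tuple(a for a in range(joint.ndim) if a not in axes)
    return joint.sum(axis=other, keepdims=True)

def bn_ipfp_1(prior, evidences, tol=1e-10, max_iters=10_000):
    """Return the updated joint and the accumulated likelihood ratio per evidence.

    `evidences` is a list of (axes, R) pairs with `axes` in increasing order.
    """
    q = np.array(prior, dtype=float)                      # Step 1: Q_0 = P
    m = len(evidences)
    acc = [np.ones_like(marginal(q, axes)) for axes, _ in evidences]
    for k in range(1, max_iters + 1):                     # Step 2
        j = (k - 1) % m                                   # Step 2.1 (0-based j)
        axes, r = evidences[j]
        q_y = marginal(q, axes)
        r = np.asarray(r, dtype=float).reshape(q_y.shape)
        # Step 2.2: likelihood ratio R(y) / Q_{k-1}(y), as in (9).
        lik = np.divide(r, q_y, out=np.zeros_like(q_y), where=q_y > 0)
        # Step 2.3: stand-in for a virtual-evidence update by the BN engine.
        q = q * lik
        q /= q.sum()
        acc[j] = acc[j] * lik                             # L_j = product over l of L_{j,l}
        err = max(
            np.abs(marginal(q, a) - np.reshape(t, marginal(q, a).shape)).max()
            for a, t in evidences
        )
        if err < tol:                                     # all soft evidences satisfied
            break
    return q, acc

# Hypothetical prior over three binary variables and two soft evidences.
rng = np.random.default_rng(1)
prior = rng.random((2, 2, 2))
prior /= prior.sum()
evidences = [((0,), [0.7, 0.3]), ((1,), [0.3, 0.7])]

q_star, acc = bn_ipfp_1(prior, evidences)

# Applying the accumulated ratios to the prior in one shot gives the same result.
rebuilt = prior * acc[0] * acc[1]
rebuilt /= rebuilt.sum()
print(np.abs(rebuilt - q_star).max())
```

The final check reflects the remark after the algorithm: only one accumulated likelihood ratio per soft evidence needs to be kept, not the whole sequence.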
Algorithm BN-IPFP-2 avoids repeated BN computation by first constructing a single virtual evidence node from the marginal $P(Y)$, where $Y$ contains all variables in all of the given soft evidences, and then updating the BN by this virtual evidence.

Algorithm BN-IPFP-2. Consider a BN with prior distribution $P(X)$, and a set of consistent soft evidences $R = (R(Y^1), \ldots, R(Y^m))$. Let $Y = Y^1 \cup Y^2 \cup \cdots \cup Y^m$. We use the following procedure for belief update:

1. Use any BN inference method to obtain $P(Y)$ from $P(X)$.
2. Update $P(Y)$ by IPFP using $R = (R(Y^1), \ldots, R(Y^m))$ as constraints until converging to $Q^*(Y)$.
3. Construct a virtual evidence with likelihood ratio $L(Y)$ computed from $Q^*(Y)$ and $P(Y)$ by (9).
4. Apply $L(Y)$ as a single virtual evidence to update $P(X)$.

The convergence and correctness of both BN-IPFP algorithms are established in Theorem 6 below.

Theorem 6. If the soft evidences in $R = (R(Y^1), \ldots, R(Y^m))$ are consistent with each other and $R(Y^j) \ll P(Y^j)$ for all $R(Y^j) \in R$, then both algorithms BN-IPFP-1 and BN-IPFP-2 converge to the same distribution, which is the I-projection of $P(X)$ on $S = \bigcap_{j=1}^{m} \mathbf{P}_{R(Y^j)}$.

[Fig. 2. Running results of Example 1 with BN-IPFP-1 and 2. Panels: (a) BN-IPFP-1; (b) BN-IPFP-2.]

Figure 2 shows the running results of BN-IPFP-1 and 2 for the example BN given in Fig. 1. The two virtual evidence nodes VE0 and VE1 in Fig. 2(a) are generated by BN-IPFP-1 for the two soft evidences R(B) and R(C); the virtual evidence VE in Fig. 2(b) is created from R(B) and R(C) according to BN-IPFP-2. Both algorithms converge in 4 iterations to the same distribution that satisfies both constraints R(B) and R(C). The final combined likelihood ratios at convergence are L*(B) = (1.0 : 0.354) and L*(C) = (0.578 : 1.0) for BN-IPFP-1 and L*(B, C) = (0.578 : 1.0 : 0.205 : 0.354) for BN-IPFP-2.

4.3. Time and space performance

The iterations of BN-IPFP-1, BN-IPFP-2 and the Big Clique algorithm all converge to the same distribution. At each iteration, the Big Clique algorithm updates beliefs of the joint

probabilities of the big clique $C$, BN-IPFP-2 updates the joint distribution of $Y$, and BN-IPFP-1 updates the belief of the whole BN, i.e., all variables in $X$. Clearly, $Y \subseteq C \subseteq X$. However, the time complexity for one iteration of Big Clique is $O(2^{|C|})$, and $O(2^{|Y|})$ for the IPFP step of BN-IPFP-2, because both require modifying a joint distribution table. On the other hand, the time complexity of BN-IPFP-1 is equal to the complexity of the BN inference algorithm it uses for belief update, which is often more efficient than modifying the joint distribution. For example, if we use JT, the time complexity for one iteration of BN-IPFP-1 is exponential in the size of the largest clique in the JT of the original BN, which may be smaller than $|C|$ and $|Y|$, especially for sparse BNs.

Both Big Clique and BN-IPFP-2 are space inefficient; they need exponential space for the joint potential of $C$ and the joint distribution of $Y$, respectively. In contrast, BN-IPFP-1 only needs additional space for virtual evidence, which is $O(\sum_{j=1}^{m} 2^{|Y^j|})$. BN-IPFP-2 is thus more suitable for problems with a large BN but a small number of soft evidence variables, and BN-IPFP-1 is more efficient when the number of soft evidence variables is large.

Also, both BN-IPFP-1 and 2 have the advantage that users do not have to stick to junction tree and modify the JT related procedures in the inference engine. They can be easily implemented as wrappers on any BN inference engine.

To empirically evaluate our algorithms and to get a sense of how expensive these two algorithms may be, we have conducted some experiments with artificially constructed BNs of different sizes and with different constraint sets. The reported memory consumption does not include the memory used by the JT-based inference engine of Netica, but the reported running time is the total running time.

Experiment 4-1 compares the algorithms' performance with varying numbers of soft evidences. It used a BN of 5 variables and three sets of 2, 4, 8 soft evidences each. One half of these evidences involved 2 variables, and the other half involved 1 variable. The experiment results are given in Table 2.
It can be seen that, when the number of evidences increases, both the time and memory consumptions for BN-IPFP-1 increase at much slower rates than those for BN-IPFP-2.

Table 2. Experiment 4-1: performance with different numbers of soft evidences.

  # of evidences | # Iterations (BN-IPFP-1, BN-IPFP-2) | Exec. Time (BN-IPFP-1, BN-IPFP-2) | Memory (BN-IPFP-1, BN-IPFP-2)
  s  0.62s  590,  ,
  s  0.83s  726,  ,
  s  5.34s  926,896  2,544,536

Experiment 4-2 compares the algorithms' performances with different sizes of BN. Four BNs of 30, 60, 120, and 240 binary variables were used, each of which was updated by the same set of 4 soft evidences involving a total of 6 variables. For each algorithm, the experimental runs for the four BNs all converged after the same number of iterations (43 for BN-IPFP-1 and 4 for BN-IPFP-2).
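For reference, the four steps of Algorithm BN-IPFP-2 compared in these experiments can be sketched as below. As before, this is a sketch under the assumption that an explicit joint table stands in for the BN and its inference engine; with a real BN only Steps 1 and 4 would touch the network. Function names and the random prior are hypothetical. The last lines compare the result with IPFP run on the full joint, which is the equivalence stated in Theorem 4.

```python
import numpy as np

def marginal(joint, axes):
    """Marginal over the variables on `axes`, kept broadcastable against `joint`."""
    other = tuple(a for a in range(joint.ndim) if a not in axes)
    return joint.sum(axis=other, keepdims=True)

def ipfp(q0, constraints, tol=1e-12, max_sweeps=10_000):
    """Plain IPFP of (7-1), cycling through (axes, R) constraints."""
    q = np.array(q0, dtype=float)
    for _ in range(max_sweeps):
        q_prev = q.copy()
        for axes, r in constraints:
            q_y = marginal(q, axes)
            r = np.asarray(r, dtype=float).reshape(q_y.shape)
            q = q * np.divide(r, q_y, out=np.zeros_like(q_y), where=q_y > 0)
        if np.abs(q - q_prev).sum() < tol:
            break
    return q

def bn_ipfp_2(prior, y_axes, constraints):
    """BN-IPFP-2 on a joint table; `y_axes` lists all soft-evidence variables."""
    prior = np.asarray(prior, dtype=float)
    p_y = marginal(prior, y_axes)                 # Step 1: P(Y), singleton axes kept
    q_y = ipfp(p_y, constraints)                  # Step 2: IPFP on the small table
    # Step 3: single likelihood ratio L(Y) = Q*(Y) / P(Y), as in (9).
    lik = np.divide(q_y, p_y, out=np.zeros_like(p_y), where=p_y > 0)
    q = prior * lik                               # Step 4: one virtual-evidence update
    return q / q.sum(), lik

# Hypothetical prior over three binary variables; evidence on variables 0 and 1.
rng = np.random.default_rng(2)
prior = rng.random((2, 2, 2))
prior /= prior.sum()
constraints = [((0,), [0.7, 0.3]), ((1,), [0.3, 0.7])]

q_star, lik = bn_ipfp_2(prior, y_axes=(0, 1), constraints=constraints)
q_full = ipfp(prior, constraints)                 # IPFP on the full joint, for comparison
print(np.abs(q_star - q_full).max())
```

The IPFP loop here runs on a table of size 2^|Y| regardless of how many other variables the joint contains, which matches the observation that the IPFP time of BN-IPFP-2 depends on Y rather than on the network size.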

Table 3. Experiment 4-2: performance of BN with different sizes.

  Size of BN | # of Iterations (BN-IPFP-1, BN-IPFP-2) | Exec. Time (BN-IPFP-1, BN-IPFP-2) | Memory (BN-IPFP-1, BN-IPFP-2)
  s  0.67s (0.64s)  72,848  69,
  s  0.69s (0.66s)  723,944  69,
  s  0.72s (0.66s)  726,904  69,
  s  3.3s (0.72s)  726,  ,842

From Table 3 we can see that when the number of soft evidences is fixed, the running time of BN-IPFP-2 increases slowly with the increase of the network size. In particular, the time for IPFP on $P(Y)$ (the time in parentheses) increases only slightly. This is because computing the single constraint $Q^*(Y)$ (Step 2) is the most expensive step in BN-IPFP-2 and $Y$ is fixed. On the other hand, the execution time for BN-IPFP-1 increases at a much faster pace (roughly exponentially). This is because each iteration requires updating the entire BN. These experimental results confirm our theoretical analysis of the proposed algorithms.

5. Inconsistent Soft Evidences

A set of soft evidences or constraints $R = (R(Y^1), \ldots, R(Y^m))$ is said to be inconsistent if $S = \bigcap_{j=1}^{m} \mathbf{P}_{R(Y^j)} = \emptyset$. Since there does not exist a distribution that satisfies all constraints in $R$, IPFP or methods based on IPFP, such as those we developed in the previous section, will not converge. Instead, the update will go into cycles around several distributions (Ref. 2), and the specific distributions it cycles around may be different, depending on the order in which the constraints are presented (Ref. 4).

Several approaches to this problem based on IPFP have been suggested in the literature. A simple approach is to first run IPFP until it goes into a cycle of $Q^*_1(X), Q^*_2(X), \ldots, Q^*_m(X)$, each of which satisfies one of the given constraints, and then take the average of these distributions, $Q^*(X) = \sum_{j=1}^{m} Q^*_j(X) / m$, as the solution. Several disadvantages can be seen for this simple approach. The result may be different when these constraints are presented in different orders; there is not much we can say about $Q^*(X)$ except that it is somewhere in the middle of $Q^*_1(X), Q^*_2(X), \ldots, Q^*_m(X)$.
Moreover, this approach is hard to apply to BNs because it operates on full joint distributions. Another approach modifies the IPFP of (7-1) as follows:²⁰

$$Q_k(x) = (1 - \alpha_k)\, Q_{k-1}(x) + \alpha_k\, Q_{k-1}(x)\, \frac{R(y^j)}{Q_{k-1}(y^j)} \tag{10}$$

where $0 < \alpha_k < 1$. This approach will be referred to as SR-IPFP, as it can be seen to be analogous to the serial relaxation method of alternating projection that can be used to find an approximate solution when the solution set S is empty (see Eq. (38) of Ref. 4). This method converges with constant $\alpha_k = \alpha$ when R is consistent; it converges when R is inconsistent if $\alpha_k$ gradually decreases toward 0. To allow each constraint to take effect, $\alpha_k$ needs to start with a value very close to 1 and to decrease very slowly.
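The relaxed update (10) can be sketched on a small explicit joint distribution. This is only an illustration under stated assumptions: the joint is stored as a Python dictionary from assignment tuples to probabilities, and the helper names `marginal` and `sr_ipfp_step` are hypothetical, not taken from the cited work.

```python
def marginal(q, idx):
    """Marginal of a joint q (dict: assignment tuple -> prob) on positions idx."""
    out = {}
    for x, p in q.items():
        y = tuple(x[i] for i in idx)
        out[y] = out.get(y, 0.0) + p
    return out


def sr_ipfp_step(q, idx, r, alpha):
    """One relaxed step of Eq. (10): mix Q_{k-1} with its I-projection onto R."""
    qy = marginal(q, idx)
    out = {}
    for x, p in q.items():
        y = tuple(x[i] for i in idx)
        ratio = r[y] / qy[y] if qy[y] > 0 else 0.0
        out[x] = (1.0 - alpha) * p + alpha * p * ratio
    return out


# Toy data (assumed for illustration): uniform joint over two binary variables
# and one constraint on the first variable.
q0 = {(a, b): 0.25 for a in (0, 1) for b in (0, 1)}
r = {(0,): 0.8, (1,): 0.2}

q_full = sr_ipfp_step(q0, (0,), r, alpha=1.0)  # alpha = 1: plain IPFP step
q_half = sr_ipfp_step(q0, (0,), r, alpha=0.5)  # alpha = 0.5: marginal moves halfway
```

With `alpha=1.0` the marginal on the first variable matches the constraint exactly; with `alpha=0.5` it lands halfway between the old marginal (0.5) and the constraint (0.8), which is the sense in which a small $\alpha_k$ weakens each individual constraint.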

However, if the decreasing rate is too small, convergence will take too many iterations; on the other hand, if the rate is too big, the process will be biased in favor of earlier constraints.

A more principled method, named GEMA (Generalized EM Algorithm), was proposed in Ref. 2. GEMA assigns a weight $w_j$ to each constraint $R(Y^j) \in R$, with $\sum_{j=1}^{m} w_j = 1$, which can be understood as the credibility one has for the evidence. The update is again an iterative process, and it takes two steps in each iteration. As an example, consider the k-th iteration, which starts with $Q_{k-1}(X)$. In Step 1, it first uses (7-1) to compute the I-projections of $Q_{k-1}(X)$ to $P_{R(Y^j)}$ for each $R(Y^j)$, denoted $\tilde{Q}_{k,j}(X)$, and then takes a weighted sum of these m I-projections to obtain a distribution $\tilde{Q}_k(X) = \sum_{j=1}^{m} w_j \tilde{Q}_{k,j}(X)$. In Step 2, GEMA first computes the marginals $\tilde{R}(Y^j) = \tilde{Q}_k(Y^j)$, then performs m steps of the standard IPFP on $Q_{k-1}(X)$ using these $\tilde{R}(Y^j)$ as constraints to obtain $Q_k(X)$. Note that these new constraints are consistent with each other since they are marginals of the same distribution $\tilde{Q}_k(X)$. It has been shown that GEMA converges to a distribution that has the minimum I-aggregate $\Psi$, the weighted sum of I-divergences to all of the original constraints in R:

$$\Psi\big(Q(X) \mid R(Y^1), \ldots, R(Y^m)\big) = \sum_{j=1}^{m} w_j\, I\big(R(Y^j) \,\|\, Q(y^j)\big). \tag{11}$$

GEMA can be seen as analogous to a parallel method of alternating projection that can be used to find an approximate solution when the solution set is empty (see Eq. (35) of Ref. 4). Our experiments (see Subsection 5.3) show that the time performance of GEMA is very sensitive to the data. For some combinations of $Q_0(X) = P(X)$ and R, it converges within a few hundred iterations, but for other combinations of similar size, millions of iterations are needed.

5.1. Algorithm SMOOTH

One thing that GEMA and the SR-IPFP of (10) have in common is that both of them only modify the joint distribution $Q_{k-1}(X)$ while keeping the constraints unchanged through the iterations.
Alternatively, one can make the modification bi-directional: at each iteration, not only are the joint distributions pulled closer to the constraints, but the constraints are also pulled towards the joint distributions. By doing so, the inconsistency among the constraints is gradually reduced, or smoothed, which may lead to faster convergence. Based on this idea we developed our new method SMOOTH.

The procedure of SMOOTH consists of two phases. Phase 1 performs the standard IPFP using all of the original constraints in R. It stops when the process converges (for consistent constraints) or starts to cycle (for inconsistent constraints). Phase 2, executed only when a cycle is detected at the end of Phase 1, differs from Phase 1 in that at each iteration, not only is the current distribution $Q_{k-1}(X)$ modified by the chosen constraint $R(Y^j)$, but $R(Y^j)$ itself is also modified by $Q_{k-1}(X)$. Specifically, we denote the modified constraints as $R_l(Y^j)$, with $R_0(Y^j) = R(Y^j)$ and $l = 1 + \lfloor (k-1)/m \rfloor$. At iteration k, first the constraint is modified by

$$R_l(Y^j) = \alpha\, R_{l-1}(Y^j) + (1 - \alpha)\, Q_{k-1}(Y^j) \tag{12}$$

where $\alpha \in (0,1)$ is the smooth factor, which controls the speed of smoothing. From (12) we can see that the modified constraint $R_l(Y^j)$ is a mixture of the previous constraint $R_{l-1}(Y^j)$ and the marginal of the current distribution $Q_{k-1}(X)$. Since $Q_{k-1}(X)$ has been modified by all other constraints, (12) has the effect of pulling $P_{R_l(Y^j)}$ closer to the sets $P_{R_l(Y^i)}$ of the other constraints ($i \neq j$), thus reducing or smoothing the inconsistency among the constraints. To ensure that the smoothing is unbiased, $\alpha$ should be chosen very close to 1. Then $Q_{k-1}$ is modified by the new constraint:

$$Q_k(x) = Q_{k-1}(x)\, \frac{R_l(y^j)}{Q_{k-1}(y^j)} \tag{13}$$

By (12), (13) can be rewritten as

$$Q_k(x) = Q_{k-1}(x)\, \frac{R_l(y^j)}{Q_{k-1}(y^j)} = Q_{k-1}(x)\, \frac{\alpha R_{l-1}(y^j) + (1-\alpha)\, Q_{k-1}(y^j)}{Q_{k-1}(y^j)} = \alpha\, Q_{k-1}(x)\, \frac{R_{l-1}(y^j)}{Q_{k-1}(y^j)} + (1-\alpha)\, Q_{k-1}(x) \tag{13-1}$$

Equation (13-1) is very similar to (10) for SR-IPFP. The difference is that (10) always uses the original constraints, while in (13-1) a changed constraint is used at each iteration. It is this difference that makes SMOOTH converge with constant $\alpha$ when the constraints are inconsistent. The algorithm SMOOTH is given below.

Algorithm SMOOTH. Consider an initial distribution P(x) and a set of soft evidences $R = (R(Y^1), \ldots, R(Y^m))$. SMOOTH consists of the following two phases:
Phase 1: do the standard IPFP using all constraints in R until it converges or goes into cycles; if convergence is reached then exit;
Phase 2:
1. for j = 1 to m, $R_0(y^j) = R(y^j)$;
2. k = 1;
3. repeat the following until converging
   3.1 $j = 1 + (k-1) \bmod m$; $l = 1 + \lfloor (k-1)/m \rfloor$;
   3.2 $R_l(y^j) = \alpha R_{l-1}(y^j) + (1-\alpha)\, Q_{k-1}(y^j)$;
   3.3 $Q_k(x) = Q_{k-1}(x) \cdot R_l(y^j) / Q_{k-1}(y^j)$;
   3.4 k = k + 1;
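The two phases above can be sketched on a small explicit joint distribution as follows. This is a minimal illustration, not the authors' implementation, and it rests on several assumptions: the joint is a dictionary from assignment tuples to probabilities; cycle detection in Phase 1 is replaced by a fixed budget of sweeps (`phase1_sweeps`); Phase 2 stops when the joint changes by less than `tol` over a full sweep; and all function and parameter names are hypothetical.

```python
def marginal(q, idx):
    """Marginal of a joint q (dict: assignment tuple -> prob) on positions idx."""
    out = {}
    for x, p in q.items():
        y = tuple(x[i] for i in idx)
        out[y] = out.get(y, 0.0) + p
    return out


def project(q, idx, r):
    """Step 3.3: Q_k(x) = Q_{k-1}(x) * R(y) / Q_{k-1}(y)."""
    qy = marginal(q, idx)
    out = {}
    for x, p in q.items():
        y = tuple(x[i] for i in idx)
        out[x] = p * r[y] / qy[y] if p > 0 else 0.0
    return out


def violation(q, constraints):
    """Largest absolute gap between a constraint and the matching marginal of q."""
    return max(abs(marginal(q, idx).get(y, 0.0) - ry)
               for idx, r in constraints for y, ry in r.items())


def smooth(p, constraints, alpha=0.99, tol=1e-10,
           phase1_sweeps=200, max_sweeps=100000):
    q = dict(p)
    # Phase 1: standard IPFP with the original constraints.
    # Assumption: a fixed sweep budget stands in for explicit cycle detection.
    for _ in range(phase1_sweeps):
        for idx, r in constraints:
            q = project(q, idx, r)
        if violation(q, constraints) < tol:
            return q  # consistent constraints: plain IPFP has converged
    # Phase 2: smooth each constraint toward the current marginal, then apply it.
    current = [(idx, dict(r)) for idx, r in constraints]
    for _ in range(max_sweeps):
        q_before = q
        for n, (idx, r) in enumerate(current):
            qy = marginal(q, idx)
            r_new = {y: alpha * ry + (1.0 - alpha) * qy.get(y, 0.0)
                     for y, ry in r.items()}          # Step 3.2, Eq. (12)
            current[n] = (idx, r_new)
            q = project(q, idx, r_new)                # Step 3.3, Eq. (13)
        if max(abs(q[x] - q_before[x]) for x in q) < tol:
            break
    return q


# Toy data (assumed): two binary variables, two conflicting constraints on the first.
p0 = {(a, b): 0.25 for a in (0, 1) for b in (0, 1)}
inconsistent = [((0,), {(0,): 0.8, (1,): 0.2}),
                ((0,), {(0,): 0.4, (1,): 0.6})]
q_star = smooth(p0, inconsistent)

consistent = [((0,), {(0,): 0.8, (1,): 0.2}),
              ((1,), {(0,): 0.3, (1,): 0.7})]
q_cons = smooth(p0, consistent)
```

In the inconsistent toy case the marginal of `q_star` on the first variable settles between the two conflicting values 0.8 and 0.4, close to their midpoint; in the consistent case only Phase 1 runs and both constraints are met exactly.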

Note that SMOOTH is exactly the same as the standard IPFP except that in Phase 2 it uses modified constraints, not the original ones, to update the current $Q_k$. This makes SMOOTH directly applicable to BN belief update in the BN-IPFP style. The only thing that needs to be changed when applying SMOOTH to a BN is to replace the I-projection operation (Step 3.3 in Phase 2) by the virtual evidence method of BN-IPFP-1 (Steps 2.2 and 2.3) of Sec. 4. Next we investigate the convergence of SMOOTH.

5.2. Convergence and performance of SMOOTH

According to the algorithm, when the set of constraints is consistent, SMOOTH reduces to the standard IPFP and converges in Phase 1. Next we discuss what happens when the constraints are not consistent. Figure 3 shows an example involving four constraints (m = 4), where $S_{l,j}(X) = P_{R_l(Y^j)}(X)$ is the set of all distributions whose marginal on $Y^j$ equals $R_l(Y^j)$ (with $R_0(Y^j) = R(Y^j)$). At the end of Phase 1, a cycle (solid lines) is formed through $Q_{0,1}, Q_{0,2}, Q_{0,3}, Q_{0,4}$. In the first iteration of Phase 2, constraint $R_0(Y^1) = R(Y^1)$ is modified to $R_1(Y^1)$ by (12). This changes $S_{0,1}(X)$ to $S_{1,1}(X)$, which is closer to $Q_{0,4}$ than $S_{0,1}(X)$ is. As the process continues, the sets $S_{l,j}(X)$ move closer to each other, and the cycles (dotted lines) formed by the resulting distributions become smaller and smaller until they merge into a single distribution.

Fig. 3. Example showing the convergence of SMOOTH.

We formally establish the convergence of SMOOTH for m = 2 in the next theorem.

Theorem 7. For an initial distribution P(X), two inconsistent soft evidences $R(Y^1)$, $R(Y^2)$, and $\alpha \in (0,1)$, Phase 2 of SMOOTH converges.

Experiments show that Phase 2 of SMOOTH converges for m > 2, and that when $\alpha \to 1$ the converging distribution $Q^*$ minimizes the sum of distances, in both I-divergence and total variation, to all constraints in R. We leave this general claim as a conjecture.
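As an informal illustration of Theorem 7 (not the proof, and only for an assumed special case), suppose both constraints are on the same variables, $Y^1 = Y^2 = Y$, and Phase 1 stops right after $R(Y^2)$ has been applied. Write $a_l = R_l(Y^1)$ and $b_l = R_l(Y^2)$ as probability vectors; these symbols are introduced here for the example only. After each step (13) the marginal of $Q$ on $Y$ equals the constraint just applied, so (12) gives:

```latex
\begin{align*}
a_{l} &= \alpha\, a_{l-1} + (1-\alpha)\, b_{l-1}, \\
b_{l} &= \alpha\, b_{l-1} + (1-\alpha)\, a_{l}, \\
a_{l} - b_{l} &= \alpha\,(a_{l} - b_{l-1}) = \alpha^{2}\,(a_{l-1} - b_{l-1}), \\
\lim_{l\to\infty} a_l = \lim_{l\to\infty} b_l &= b_0 + \frac{\alpha}{1+\alpha}\,(a_0 - b_0).
\end{align*}
```

In this special case the gap between the two modified constraints shrinks by a factor $\alpha^2$ per sweep, so Phase 2 converges geometrically for any $\alpha \in (0,1)$, and the common limit approaches the midpoint of the two original constraints as $\alpha \to 1$. For example, with $a_0 = (0.8, 0.2)$, $b_0 = (0.4, 0.6)$ and $\alpha = 0.99$, the first component of the limit is $0.4 + 0.4 \cdot 0.99/1.99 \approx 0.599$.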

The time performance of SMOOTH, like that of all IPFP-based methods, depends on the number of iterations it takes to reach convergence. Experiments show that SMOOTH moves towards the convergence point fairly fast at the beginning, even with $\alpha$ very close to 1. However, it slows down drastically at the end, forming a long and flat tail (see Fig. 4, where 90% of the time is spent bringing the flat tail to the convergence point). As discussed before, keeping $\alpha$ large at the beginning ensures that information in the original constraints is not lost too soon by smoothing before it gets a chance to be absorbed. When the process gets closer to the convergence point, we can afford to use a smaller $\alpha$, since most of the information in the original constraints that can be absorbed has already been absorbed. By (12), a smaller $\alpha$ pulls the constraints toward the current $Q_k$ faster, leading to faster convergence at the end. We have experimented with a number of schedules for reducing $\alpha$. The one that performed best is the sigmoid function

$$\alpha_k = \frac{\exp(A - k/B)}{1 + \exp(A - k/B)} \tag{14}$$

where k is the iteration step of Phase 2. It can be seen from (14) that with a large positive A, $\alpha_k$ is close to 1 at the beginning (when k is small) and close to 0 when k becomes very large, and that $\alpha_k$ decreases very slowly at the two ends but fast in the middle. Parameter A controls how long $\alpha_k$ remains large (longer for larger A), and B controls how fast $\alpha_k$ decreases in the middle (faster for smaller B). If the desired initial value $\alpha_0$ is given, then A can be determined by $\alpha_0 = 1/(1 + \exp(-A))$. For example, $\alpha_0 = 0.99$ corresponds to $A = \ln 99 \approx 4.6$. We call SMOOTH using (14) to reduce $\alpha_k$ Accelerated SMOOTH (A-SMOOTH for short). Replacing $\alpha$ by $\alpha_k$ in (13-1), when $k \to \infty$ we have $\alpha_k \to 0$, so $Q_k(x) \to Q_{k-1}(x)$; therefore, A-SMOOTH converges.

5.3. Experiments

To empirically validate algorithm SMOOTH and to get a sense of how well it performs in comparison to the existing methods, we have conducted computer experiments with different initial distributions and different constraints.
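The sigmoid schedule (14) used by A-SMOOTH in Subsection 5.2 is easy to tabulate. The sketch below is illustrative only: the function names are hypothetical, the value of A is derived from $\alpha_0 = 0.99$ through the relation $\alpha_0 = 1/(1 + \exp(-A))$, and the sample iteration indices are arbitrary choices. The two-branch form is an assumption made here to avoid floating-point overflow for very large k.

```python
import math


def alpha_schedule(k, A, B):
    """Eq. (14): alpha_k = exp(A - k/B) / (1 + exp(A - k/B)), computed stably."""
    z = A - k / B
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)  # z < 0, so this cannot overflow
    return ez / (1.0 + ez)


def offset_for_initial_alpha(alpha0):
    """Solve alpha0 = 1 / (1 + exp(-A)) for A."""
    return math.log(alpha0 / (1.0 - alpha0))


A = offset_for_initial_alpha(0.99)   # equals ln(99)
B = 50.0
samples = [alpha_schedule(k, A, B) for k in (0, 100, 230, 500, 2000)]
midpoint = alpha_schedule(A * B, A, B)  # alpha_k crosses 1/2 at k = A * B
```

The crossing point k = A·B shows the roles of the two parameters: a larger A delays the drop, while a smaller B makes the drop steeper.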
The algorithms compared in the experiments are: (1) GEMA, (2) SR-IPFP, (3) SMOOTH, and (4) A-SMOOTH. For SR-IPFP, we use $\alpha_k = 1/(1 + k)$ in (10), which is the fastest schedule for reducing $\alpha_k$ suggested by the authors.²⁰ For SMOOTH we set $\alpha = 0.99$ in Phase 2, and for A-SMOOTH we set A = … and B = 50. We use the number of I-projections instead of the number of iterations to measure the time performance of an algorithm, because an iteration may involve a different number of I-projections for different algorithms. For example, the number of I-projections in one iteration is 1 for our SMOOTH and 2m for GEMA (m for each of the two steps). In all our experiments, convergence is considered reached if, at iteration $k = l \cdot m$ (the end of a full pass over the m constraints), the sum of total variations over the m constraints is within a given error bound.

Experiment 5-1 uses the data taken from Ref. 2, involving three variables $X_1, X_2, X_3$. The initial joint distribution JPD1 is a uniform distribution of the three variables. Three constraints, each a distribution of two variables, are generated according to the scheme in Table 4. These constraints are consistent with each other when ε = 4/20 (called CONS0) and inconsistent when ε = 3/20 (called CONS1).

Table 4. Constraint generator.
P, =, 2 X X = 0 = X = 0 X = + + / 2 ε ε ε / 2 ε = 0 X = P X X = 0 = X ε / 2 ε / 2 ε ε

The experimental results for the consistent constraints CONS0 are given in Table 5. All three algorithms converged to the same I-projection on $S = P_{R(y^1)}(x) \cap P_{R(y^2)}(x) \cap P_{R(y^3)}(x)$. SMOOTH is significantly faster than the other two. This is because for consistent constraints SMOOTH reduces to the standard IPFP (only Phase 1 is executed).

Table 5. Experiment 5-1 results for CONS0 (ε = 4/20).
Algorithm | GEMA | SR-IPFP | SMOOTH
# projections |
I-divergence |

Experiment 5-2 compares performance with the inconsistent constraints CONS1, in which every two constraints are consistent with each other, but together they are inconsistent with the third one. Besides JPD1, another initial joint distribution, JPD2, is also used. The experimental results are given in Table 6, where for the two versions of SMOOTH the numbers of I-projections for both phases are given. It can be seen from the I-divergences of the converging distributions to the initial distributions and from the I-aggregates that GEMA, SMOOTH, and A-SMOOTH converge to distributions that are very close to each other, with A-SMOOTH significantly faster than the others (SR-IPFP was stopped when the limit of 10 million I-projections was reached before convergence).

Table 6. Experiment 5-2 results for inconsistent CONS1 (ε = 3/20).
Algorithm | JPD | # projections | I-divergence | I-aggregate
GEMA | JPD-1 | 7,744,
GEMA | JPD-2 | 9,064,
SR-IPFP | JPD-1 | >10,000,000
SR-IPFP | JPD-2 | >10,000,000
SMOOTH | JPD-1 |
SMOOTH | JPD-2 |
A-SMOOTH | JPD-1 |
A-SMOOTH | JPD-2 |
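The I-aggregate column reported for these experiments is the quantity $\Psi$ of (11). A minimal sketch of how it can be evaluated for an explicit joint distribution is given below; the dictionary representation, the function names, and the toy numbers are assumptions made for illustration and are not the experimental data.

```python
import math


def marginal(q, idx):
    """Marginal of a joint q (dict: assignment tuple -> prob) on positions idx."""
    out = {}
    for x, p in q.items():
        y = tuple(x[i] for i in idx)
        out[y] = out.get(y, 0.0) + p
    return out


def i_divergence(r, q):
    """I(r || q) = sum_y r(y) * log(r(y) / q(y)), with 0 * log(0) taken as 0."""
    total = 0.0
    for y, ry in r.items():
        if ry > 0:
            qy = q.get(y, 0.0)
            if qy == 0:
                return math.inf  # r puts mass where q has none
            total += ry * math.log(ry / qy)
    return total


def i_aggregate(q, constraints, weights):
    """Eq. (11): weighted sum of I-divergences from each constraint to q's marginal."""
    return sum(w * i_divergence(r, marginal(q, idx))
               for (idx, r), w in zip(constraints, weights))


# Toy data (assumed): two conflicting constraints on the first of two binary variables.
constraints = [((0,), {(0,): 0.8, (1,): 0.2}),
               ((0,), {(0,): 0.4, (1,): 0.6})]
weights = [0.5, 0.5]

uniform = {(a, b): 0.25 for a in (0, 1) for b in (0, 1)}
blended = {(0, 0): 0.3, (0, 1): 0.3, (1, 0): 0.2, (1, 1): 0.2}  # marginal (0.6, 0.4)

psi_uniform = i_aggregate(uniform, constraints, weights)
psi_blended = i_aggregate(blended, constraints, weights)
```

Here `psi_blended` is smaller than `psi_uniform`: a distribution whose marginal sits between the two conflicting constraints is closer, in the weighted I-divergence sense, to the constraint set as a whole.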

We plot the I-aggregates of all four algorithms for JPD1 in Fig. 4. The plot starts at the 78th I-projection, which is the beginning of Phase 2 of SMOOTH and A-SMOOTH, and ends at the 4200th I-projection. It is clear that the I-aggregate decreases fastest for A-SMOOTH, followed by SMOOTH, with SR-IPFP the slowest.

Fig. 4. Plot of I-aggregates of the four algorithms.

Experiment 5-3. To see that GEMA is data sensitive, we generated another set of 3 constraints (CONS2), each of which also involves two of the three variables $X_1, X_2, X_3$. Unlike CONS1 shown in Table 4, CONS2 is pair-wise inconsistent. The results using CONS2 against JPD1 and JPD2 are given in Table 7. It can be seen from Tables 6 and 7 that GEMA is very slow for three of the four combinations of JPDs and constraints but very fast (780 I-projections) for one combination (JPD1 + CONS2). Similar phenomena have also been observed in some of our other experiments. On the other hand, both versions of SMOOTH have uniform performance for all combinations.

Table 7. Experiment 5-3 results for CONS2.
Algorithm | GEMA | SR-IPFP | SMOOTH
JPD1 | 780 | >10,000,000 |
JPD2 | 2,400,542 | >10,000,000 |

Experiment 5-4 tests the scalability of these algorithms with larger JPDs of 8 and 5 variables. The results shown in Tables 8 and 9 are consistent with those reported earlier for the smaller JPDs. For these experiments we did not run SR-IPFP because it took too much time to get close to convergence.

Table 8. Result for JPD of 8 variables and 4 inconsistent constraints.
Algorithm | # projections | I-divergence | I-aggregate
GEMA |
SMOOTH |
A-SMOOTH |

Table 9. Result for JPD of 5 variables and 4 inconsistent constraints.
Algorithm | # projections | I-divergence | I-aggregate
GEMA |
SMOOTH |
A-SMOOTH |

Finally, we conducted an experiment to compare the performance of belief updates on full joint distributions and on BNs. The experiment reported in Table 10 used a BN of 14 binary variables and 4 inconsistent constraints involving a total of 7 distinct variables. Both GEMA and SMOOTH were run on the full joint distribution (of $10^4$ entries) generated from this BN. The SMOOTH version of BN-IPFP-1 was run directly on the BN. As can be seen in Table 10, belief updates on the full JPD are several orders of magnitude slower than those on the BN. When these constraints were modified to be consistent, the convergence time for the standard IPFP on the full JPD was 27 seconds, while the time for BN-IPFP-1 was only 1 second.

Table 10. Result for inconsistent constraints: Full JPD vs BN.
Algorithms | # projections | Time
Full JPD using GEMA | | s
Full JPD using SMOOTH | | s
SMOOTH on BN-IPFP-1 | | s

Recall (Table 6) that GEMA took more than 7 million I-projections to converge in Experiment 5-2 to modify the belief in a tiny JPD of only three variables. We applied the SMOOTH version of BN-IPFP-1 to the same task after first converting the original JPD to a BN of three nodes; and, much to our surprise, it took only 02 I-projections to converge! Although anecdotal, these results clearly demonstrate the significant computational advantages of using BNs to represent joint distributions and the practical value of BN-based belief update methods such as the algorithms we developed in this work.

6. Conclusions

In this paper we presented our results on Bayesian network belief update with uncertain evidences. We defined two types of uncertain evidences. The virtual evidence, given as a likelihood ratio, represents the uncertainty one has about an observation, and it requires the

likelihood ratio be preserved in the updated BN. The soft evidence, given as a distribution over one or more variables, represents the uncertainty of an event one is observing, and it requires that this distribution be preserved in the updated BN. After establishing the close relations between Pearl's virtual evidence method, Jeffrey's rule, and the I-projection, we developed efficient algorithms for BN belief update with multiple soft evidences. One advantage of BN-IPFP-1, in contrast to some existing methods, is that it can easily work with any BN inference engine. BN-IPFP-2 can provide efficient computation when the number of variables involved in the soft evidences is small. Algorithm SMOOTH was developed by modifying the standard IPFP to support belief update with inconsistent evidences. The convergence of these algorithms was analyzed, and experiments of limited scale were conducted to validate them and to demonstrate their effectiveness. In addition, we formally established, for the first time, that Equation (6), which is used to compute the I-projection in IPFP, minimizes not only the I-divergence but also the total variation between the source and the projected distributions.

BN belief update may be subject to multiple evidences of different types (hard, virtual, and soft), and these evidences may arrive at the same time or at different times. Our BN-IPFP-1 is flexible enough to support such inference. When all evidences arrive at the same time, or hard and virtual evidences arrive before soft evidences, one can first update the beliefs with the given hard and virtual evidences using conventional BN inference methods and then apply BN-IPFP-1 to the updated BN. A hard or virtual evidence arriving after soft evidences have been absorbed will change the beliefs in the BN. If this change causes $Q(Y^j) \neq R(Y^j)$ for any soft evidence $R(Y^j)$ (i.e., the likelihood ratio $L(Y^j)$ in Step 2.2 of BN-IPFP-1 is no longer $1 : 1 : \cdots : 1$), then BN-IPFP-1 is activated and the iterations are renewed until convergence.
As mentioned earlier, one can use virtual evidence to represent the doubt one has about a hard evidence; this can also be applied when one is in doubt about a soft evidence. Recall that in our approach, a soft evidence R(Y) is first converted into a virtual evidence with a virtual node U. If our doubt about R(Y) can be represented as a likelihood L(U), then we can create another virtual node V with U as its only parent and its CPT determined by L(U). Instantiating V to true will then apply R(Y) with the uncertainty of L(U) to the BN.

We are continuing our research effort in this fruitful area along several directions. Our proof of convergence of SMOOTH covers only the case of two inconsistent constraints; we are actively working on generalizing it to an arbitrary number of constraints. Our experiments show that SMOOTH has a uniform time performance, while GEMA is data sensitive and sometimes converges much faster than SMOOTH. We are examining the factors that may cause these performance differences, hoping to use some of the findings to improve efficiency. We realized that GEMA, although originally devised for general joint distributions, may be adapted to BNs, and we are working on developing a BN version of the GEMA algorithm. In this work, we considered constraints $R(y^j)$ as soft evidences that modify the current beliefs. These low-dimensional distributions can also be pieces of new knowledge that are more up-to-date, more accurate, or more location specific, and absorbing them into a larger distribution is a process of knowledge integration or knowledge-base update. In the


More information

Temperature. Chapter Heat Engine

Temperature. Chapter Heat Engine Chapter 3 Temperature In prevous chapters of these notes we ntroduced the Prncple of Maxmum ntropy as a technque for estmatng probablty dstrbutons consstent wth constrants. In Chapter 9 we dscussed the

More information

Section 8.3 Polar Form of Complex Numbers

Section 8.3 Polar Form of Complex Numbers 80 Chapter 8 Secton 8 Polar Form of Complex Numbers From prevous classes, you may have encountered magnary numbers the square roots of negatve numbers and, more generally, complex numbers whch are the

More information

Fermi-Dirac statistics

Fermi-Dirac statistics UCC/Physcs/MK/EM/October 8, 205 Fer-Drac statstcs Fer-Drac dstrbuton Matter partcles that are eleentary ostly have a type of angular oentu called spn. hese partcles are known to have a agnetc oent whch

More information

Lectures - Week 4 Matrix norms, Conditioning, Vector Spaces, Linear Independence, Spanning sets and Basis, Null space and Range of a Matrix

Lectures - Week 4 Matrix norms, Conditioning, Vector Spaces, Linear Independence, Spanning sets and Basis, Null space and Range of a Matrix Lectures - Week 4 Matrx norms, Condtonng, Vector Spaces, Lnear Independence, Spannng sets and Bass, Null space and Range of a Matrx Matrx Norms Now we turn to assocatng a number to each matrx. We could

More information

Finite Fields and Their Applications

Finite Fields and Their Applications Fnte Felds and Ther Applcatons 5 009 796 807 Contents lsts avalable at ScenceDrect Fnte Felds and Ther Applcatons www.elsever.co/locate/ffa Typcal prtve polynoals over nteger resdue rngs Tan Tan a, Wen-Feng

More information

Fall 2012 Analysis of Experimental Measurements B. Eisenstein/rev. S. Errede. ) with a symmetric Pcovariance matrix of the y( x ) measurements V

Fall 2012 Analysis of Experimental Measurements B. Eisenstein/rev. S. Errede. ) with a symmetric Pcovariance matrix of the y( x ) measurements V Fall Analyss o Experental Measureents B Esensten/rev S Errede General Least Squares wth General Constrants: Suppose we have easureents y( x ( y( x, y( x,, y( x wth a syetrc covarance atrx o the y( x easureents

More information

Outline. Bayesian Networks: Maximum Likelihood Estimation and Tree Structure Learning. Our Model and Data. Outline

Outline. Bayesian Networks: Maximum Likelihood Estimation and Tree Structure Learning. Our Model and Data. Outline Outlne Bayesan Networks: Maxmum Lkelhood Estmaton and Tree Structure Learnng Huzhen Yu janey.yu@cs.helsnk.f Dept. Computer Scence, Unv. of Helsnk Probablstc Models, Sprng, 200 Notces: I corrected a number

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

PGM Learning Tasks and Metrics

PGM Learning Tasks and Metrics Probablstc Graphcal odels Learnng Overvew PG Learnng Tasks and etrcs Learnng doan epert True dstrbuton P* aybe correspondng to a PG * dataset of nstances D{d],...d]} sapled fro P* elctaton Network Learnng

More information

Introducing Entropy Distributions

Introducing Entropy Distributions Graubner, Schdt & Proske: Proceedngs of the 6 th Internatonal Probablstc Workshop, Darstadt 8 Introducng Entropy Dstrbutons Noel van Erp & Peter van Gelder Structural Hydraulc Engneerng and Probablstc

More information

Lecture Space-Bounded Derandomization

Lecture Space-Bounded Derandomization Notes on Complexty Theory Last updated: October, 2008 Jonathan Katz Lecture Space-Bounded Derandomzaton 1 Space-Bounded Derandomzaton We now dscuss derandomzaton of space-bounded algorthms. Here non-trval

More information

PROBABILITY AND STATISTICS Vol. III - Analysis of Variance and Analysis of Covariance - V. Nollau ANALYSIS OF VARIANCE AND ANALYSIS OF COVARIANCE

PROBABILITY AND STATISTICS Vol. III - Analysis of Variance and Analysis of Covariance - V. Nollau ANALYSIS OF VARIANCE AND ANALYSIS OF COVARIANCE ANALYSIS OF VARIANCE AND ANALYSIS OF COVARIANCE V. Nollau Insttute of Matheatcal Stochastcs, Techncal Unversty of Dresden, Gerany Keywords: Analyss of varance, least squares ethod, odels wth fxed effects,

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

The Order Relation and Trace Inequalities for. Hermitian Operators

The Order Relation and Trace Inequalities for. Hermitian Operators Internatonal Mathematcal Forum, Vol 3, 08, no, 507-57 HIKARI Ltd, wwwm-hkarcom https://doorg/0988/mf088055 The Order Relaton and Trace Inequaltes for Hermtan Operators Y Huang School of Informaton Scence

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Determination of the Confidence Level of PSD Estimation with Given D.O.F. Based on WELCH Algorithm

Determination of the Confidence Level of PSD Estimation with Given D.O.F. Based on WELCH Algorithm Internatonal Conference on Inforaton Technology and Manageent Innovaton (ICITMI 05) Deternaton of the Confdence Level of PSD Estaton wth Gven D.O.F. Based on WELCH Algorth Xue-wang Zhu, *, S-jan Zhang

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

Chapter 12 Lyes KADEM [Thermodynamics II] 2007

Chapter 12 Lyes KADEM [Thermodynamics II] 2007 Chapter 2 Lyes KDEM [Therodynacs II] 2007 Gas Mxtures In ths chapter we wll develop ethods for deternng therodynac propertes of a xture n order to apply the frst law to systes nvolvng xtures. Ths wll be

More information

Power law and dimension of the maximum value for belief distribution with the max Deng entropy

Power law and dimension of the maximum value for belief distribution with the max Deng entropy Power law and dmenson of the maxmum value for belef dstrbuton wth the max Deng entropy Bngy Kang a, a College of Informaton Engneerng, Northwest A&F Unversty, Yanglng, Shaanx, 712100, Chna. Abstract Deng

More information

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography CSc 6974 and ECSE 6966 Math. Tech. for Vson, Graphcs and Robotcs Lecture 21, Aprl 17, 2006 Estmatng A Plane Homography Overvew We contnue wth a dscusson of the major ssues, usng estmaton of plane projectve

More information

,..., k N. , k 2. ,..., k i. The derivative with respect to temperature T is calculated by using the chain rule: & ( (5) dj j dt = "J j. k i.

,..., k N. , k 2. ,..., k i. The derivative with respect to temperature T is calculated by using the chain rule: & ( (5) dj j dt = J j. k i. Suppleentary Materal Dervaton of Eq. 1a. Assue j s a functon of the rate constants for the N coponent reactons: j j (k 1,,..., k,..., k N ( The dervatve wth respect to teperature T s calculated by usng

More information

Small-Sample Equating With Prior Information

Small-Sample Equating With Prior Information Research Report Sall-Saple Equatng Wth Pror Inforaton Sauel A Lvngston Charles Lews June 009 ETS RR-09-5 Lstenng Learnng Leadng Sall-Saple Equatng Wth Pror Inforaton Sauel A Lvngston and Charles Lews ETS,

More information

What is LP? LP is an optimization technique that allocates limited resources among competing activities in the best possible manner.

What is LP? LP is an optimization technique that allocates limited resources among competing activities in the best possible manner. (C) 998 Gerald B Sheblé, all rghts reserved Lnear Prograng Introducton Contents I. What s LP? II. LP Theor III. The Splex Method IV. Refneents to the Splex Method What s LP? LP s an optzaton technque that

More information

princeton univ. F 17 cos 521: Advanced Algorithm Design Lecture 7: LP Duality Lecturer: Matt Weinberg

princeton univ. F 17 cos 521: Advanced Algorithm Design Lecture 7: LP Duality Lecturer: Matt Weinberg prnceton unv. F 17 cos 521: Advanced Algorthm Desgn Lecture 7: LP Dualty Lecturer: Matt Wenberg Scrbe: LP Dualty s an extremely useful tool for analyzng structural propertes of lnear programs. Whle there

More information

Notes on Frequency Estimation in Data Streams

Notes on Frequency Estimation in Data Streams Notes on Frequency Estmaton n Data Streams In (one of) the data streamng model(s), the data s a sequence of arrvals a 1, a 2,..., a m of the form a j = (, v) where s the dentty of the tem and belongs to

More information

ASYMMETRIC TRAFFIC ASSIGNMENT WITH FLOW RESPONSIVE SIGNAL CONTROL IN AN URBAN NETWORK

ASYMMETRIC TRAFFIC ASSIGNMENT WITH FLOW RESPONSIVE SIGNAL CONTROL IN AN URBAN NETWORK AYMMETRIC TRAFFIC AIGNMENT WITH FLOW REPONIVE IGNAL CONTROL IN AN URBAN NETWORK Ken'etsu UCHIDA *, e'ch KAGAYA **, Tohru HAGIWARA *** Dept. of Engneerng - Hoado Unversty * E-al: uchda@eng.houda.ac.p **

More information

Markov Chain Monte Carlo Lecture 6

Markov Chain Monte Carlo Lecture 6 where (x 1,..., x N ) X N, N s called the populaton sze, f(x) f (x) for at least one {1, 2,..., N}, and those dfferent from f(x) are called the tral dstrbutons n terms of mportance samplng. Dfferent ways

More information

Handling Overload (G. Buttazzo, Hard Real-Time Systems, Ch. 9) Causes for Overload

Handling Overload (G. Buttazzo, Hard Real-Time Systems, Ch. 9) Causes for Overload PS-663: Real-Te Systes Handlng Overloads Handlng Overload (G Buttazzo, Hard Real-Te Systes, h 9) auses for Overload Bad syste desgn eg poor estaton of worst-case executon tes Sultaneous arrval of unexpected

More information

3.1 ML and Empirical Distribution

3.1 ML and Empirical Distribution 67577 Intro. to Machne Learnng Fall semester, 2008/9 Lecture 3: Maxmum Lkelhood/ Maxmum Entropy Dualty Lecturer: Amnon Shashua Scrbe: Amnon Shashua 1 In the prevous lecture we defned the prncple of Maxmum

More information

MMA and GCMMA two methods for nonlinear optimization

MMA and GCMMA two methods for nonlinear optimization MMA and GCMMA two methods for nonlnear optmzaton Krster Svanberg Optmzaton and Systems Theory, KTH, Stockholm, Sweden. krlle@math.kth.se Ths note descrbes the algorthms used n the author s 2007 mplementatons

More information

Maximizing the number of nonnegative subsets

Maximizing the number of nonnegative subsets Maxmzng the number of nonnegatve subsets Noga Alon Hao Huang December 1, 213 Abstract Gven a set of n real numbers, f the sum of elements of every subset of sze larger than k s negatve, what s the maxmum

More information

Introduction to Vapor/Liquid Equilibrium, part 2. Raoult s Law:

Introduction to Vapor/Liquid Equilibrium, part 2. Raoult s Law: CE304, Sprng 2004 Lecture 4 Introducton to Vapor/Lqud Equlbrum, part 2 Raoult s Law: The smplest model that allows us do VLE calculatons s obtaned when we assume that the vapor phase s an deal gas, and

More information

Chapter Newton s Method

Chapter Newton s Method Chapter 9. Newton s Method After readng ths chapter, you should be able to:. Understand how Newton s method s dfferent from the Golden Secton Search method. Understand how Newton s method works 3. Solve

More information

Lecture 12: Discrete Laplacian

Lecture 12: Discrete Laplacian Lecture 12: Dscrete Laplacan Scrbe: Tanye Lu Our goal s to come up wth a dscrete verson of Laplacan operator for trangulated surfaces, so that we can use t n practce to solve related problems We are mostly

More information

On Syndrome Decoding of Punctured Reed-Solomon and Gabidulin Codes 1

On Syndrome Decoding of Punctured Reed-Solomon and Gabidulin Codes 1 Ffteenth Internatonal Workshop on Algebrac and Cobnatoral Codng Theory June 18-24, 2016, Albena, Bulgara pp. 35 40 On Syndroe Decodng of Punctured Reed-Soloon and Gabduln Codes 1 Hannes Bartz hannes.bartz@tu.de

More information

Lecture 19 of 42. MAP and MLE continued, Minimum Description Length (MDL)

Lecture 19 of 42. MAP and MLE continued, Minimum Description Length (MDL) Lecture 19 of 4 MA and MLE contnued, Mnu Descrpton Length (MDL) Wednesday, 8 February 007 Wlla H. Hsu, KSU http://www.kddresearch.org Readngs for next class: Chapter 5, Mtchell Lecture Outlne Read Sectons

More information

LECTURE :FACTOR ANALYSIS

LECTURE :FACTOR ANALYSIS LCUR :FACOR ANALYSIS Rta Osadchy Based on Lecture Notes by A. Ng Motvaton Dstrbuton coes fro MoG Have suffcent aount of data: >>n denson Use M to ft Mture of Gaussans nu. of tranng ponts If

More information

Robust Algorithms for Preemptive Scheduling

Robust Algorithms for Preemptive Scheduling DOI 0.007/s00453-0-978-3 Robust Algorths for Preeptve Schedulng Leah Epsten Asaf Levn Receved: 4 March 0 / Accepted: 9 Noveber 0 Sprnger Scence+Busness Meda New York 0 Abstract Preeptve schedulng probles

More information

CHAPTER 10 ROTATIONAL MOTION

CHAPTER 10 ROTATIONAL MOTION CHAPTER 0 ROTATONAL MOTON 0. ANGULAR VELOCTY Consder argd body rotates about a fxed axs through pont O n x-y plane as shown. Any partcle at pont P n ths rgd body rotates n a crcle of radus r about O. The

More information

Uncertainty and auto-correlation in. Measurement

Uncertainty and auto-correlation in. Measurement Uncertanty and auto-correlaton n arxv:1707.03276v2 [physcs.data-an] 30 Dec 2017 Measurement Markus Schebl Federal Offce of Metrology and Surveyng (BEV), 1160 Venna, Austra E-mal: markus.schebl@bev.gv.at

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

International Journal of Mathematical Archive-9(3), 2018, Available online through ISSN

International Journal of Mathematical Archive-9(3), 2018, Available online through   ISSN Internatonal Journal of Matheatcal Archve-9(3), 208, 20-24 Avalable onlne through www.ja.nfo ISSN 2229 5046 CONSTRUCTION OF BALANCED INCOMPLETE BLOCK DESIGNS T. SHEKAR GOUD, JAGAN MOHAN RAO M AND N.CH.

More information

Centroid Uncertainty Bounds for Interval Type-2 Fuzzy Sets: Forward and Inverse Problems

Centroid Uncertainty Bounds for Interval Type-2 Fuzzy Sets: Forward and Inverse Problems Centrod Uncertanty Bounds for Interval Type-2 Fuzzy Sets: Forward and Inverse Probles Jerry M. Mendel and Hongwe Wu Sgnal and Iage Processng Insttute Departent of Electrcal Engneerng Unversty of Southern

More information

Slobodan Lakić. Communicated by R. Van Keer

Slobodan Lakić. Communicated by R. Van Keer Serdca Math. J. 21 (1995), 335-344 AN ITERATIVE METHOD FOR THE MATRIX PRINCIPAL n-th ROOT Slobodan Lakć Councated by R. Van Keer In ths paper we gve an teratve ethod to copute the prncpal n-th root and

More information

Reliability estimation in Pareto-I distribution based on progressively type II censored sample with binomial removals

Reliability estimation in Pareto-I distribution based on progressively type II censored sample with binomial removals Journal of Scentfc esearch Developent (): 08-3 05 Avalable onlne at wwwjsradorg ISSN 5-7569 05 JSAD elablty estaton n Pareto-I dstrbuton based on progressvely type II censored saple wth bnoal reovals Ilhan

More information

AGC Introduction

AGC Introduction . Introducton AGC 3 The prmary controller response to a load/generaton mbalance results n generaton adjustment so as to mantan load/generaton balance. However, due to droop, t also results n a non-zero

More information

Generalized Linear Methods

Generalized Linear Methods Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set

More information

COMP th April, 2007 Clement Pang

COMP th April, 2007 Clement Pang COMP 540 12 th Aprl, 2007 Cleent Pang Boostng Cobnng weak classers Fts an Addtve Model Is essentally Forward Stagewse Addtve Modelng wth Exponental Loss Loss Functons Classcaton: Msclasscaton, Exponental,

More information

Canonical transformations

Canonical transformations Canoncal transformatons November 23, 2014 Recall that we have defned a symplectc transformaton to be any lnear transformaton M A B leavng the symplectc form nvarant, Ω AB M A CM B DΩ CD Coordnate transformatons,

More information

Chapter - 2. Distribution System Power Flow Analysis

Chapter - 2. Distribution System Power Flow Analysis Chapter - 2 Dstrbuton System Power Flow Analyss CHAPTER - 2 Radal Dstrbuton System Load Flow 2.1 Introducton Load flow s an mportant tool [66] for analyzng electrcal power system network performance. Load

More information

The Geometry of Logit and Probit

The Geometry of Logit and Probit The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

Physics 5153 Classical Mechanics. Principle of Virtual Work-1

Physics 5153 Classical Mechanics. Principle of Virtual Work-1 P. Guterrez 1 Introducton Physcs 5153 Classcal Mechancs Prncple of Vrtual Work The frst varatonal prncple we encounter n mechancs s the prncple of vrtual work. It establshes the equlbrum condton of a mechancal

More information

Assortment Optimization under MNL

Assortment Optimization under MNL Assortment Optmzaton under MNL Haotan Song Aprl 30, 2017 1 Introducton The assortment optmzaton problem ams to fnd the revenue-maxmzng assortment of products to offer when the prces of products are fxed.

More information