On Quantifying the Accuracy of Maximum Likelihood Estimation of Participant Reliability in Social Sensing
|
|
- Albert Randall
- 5 years ago
- Views:
Transcription
1 On Quantfyng the Accuracy of Maxmum Lkelhood Estmaton of Partcpant Relablty n Socal Sensng Dong Wang, Tarek Abdelzaher Department of Computer Scence Unversty of Illnos Urbana, IL dwang24@llnos.edu, zaher@llnos.edu Lance Kaplan Charu C. Aggarwal Networked Sensng & Fuson IBM Research Branch Yorktown Heghts, NY US Army Research Laboratory charu@us.bm.com Adelph, MD lance.m.kaplan@us.army.ml ABSTRACT Ths paper presents a confdence nterval quantfcaton of maxmum lkelhood estmaton of partcpant relablty n socal sensng applcatons. The work s motvated by the emergence of socal sensng as a data collecton paradgm, where humans perform the data collecton tasks. A key challenge n socal sensng applcatons les n the uncertan nature of human measurements. Unlke well-calbrated and well-tested nfrastructure sensors, humans are less relable, and the lkelhood that partcpants measurements are correct s often unknown a pror. Hence, t s hard to estmate the accuracy of conclusons made based on socal sensng data. In prevous work, we developed a maxmum lkelhood estmator of relablty of both partcpants and facts concluded from the data. Ths paper presents an analytcally-founded bound that quantfes the accuracy of such maxmum lkelhood estmaton n socal sensng. A confdence nterval s derved by leveragng the asymptotc normalty of maxmum lkelhood estmaton and computng the approxmaton of Cramer-Rao bound (CRB) for the estmaton parameters. The proposed quantfcaton approach s emprcally valdated and shown to accurately bound the actual estmaton error gven suffcent number of partcpants under dfferent sensng topologes. 1. INTRODUCTION Socal sensng, where ndvduals act as sensors, s a key emergng category of sensng applcatons. Yet, quantfyng the relablty of data collected from human sources remans one of the man challenges n utlzng socal sensng n msson-crtcal systems. Ths relablty problem has long been known n mltary scenaros and s becomng ncreasngly mportant n commercal and cvl settngs as well. The man research queston s one of quantfyng a level of con- Permsson to make dgtal or hard copes of all or part of ths work for personal or classroom use s granted wthout fee provded that copes are not made or dstrbuted for proft or commercal advantage and that copes bear ths notce and the full ctaton on the frst page. To copy otherwse, to republsh, to post on servers or to redstrbute to lsts, requres pror specfc permsson and/or a fee. Ths artcle was presented at: 8th Internatonal Workshop on Data Management for Sensor Networks (DMSN 2011) Copyrght fdence n nformaton reported by a group of human observers. Ths paper quantfes confdence n socal sensng observatons by computng a confdence nterval of a maxmum lkelhood estmator of partcpant relablty from socal sensng data. The confdence nterval s computed based on the approxmaton of Cramer-Rao bound (CRB) [4, whch quantfes the varance of a mnmum-varance unbased estmator. Ths bound approxmates the CRB by leveragng observatons from partcpants and knowledge of truthfulness of facts estmated from an maxmum lkelhood estmator of socal sensng [14. We consder a sensng applcaton n whch data are collected from a large populaton, where the relablty (.e., the probablty of correctness) of ndvdual partcpants, and hence observatons made by them, s not known a pror. We am to derve the confdence nterval n the maxmum lkelhood estmaton of partcpant and observaton relablty gven no pror knowledge other than the nformaton descrbng who reported whch observatons. The maxmum lkelhood estmaton problem tself s not the topc of ths paper. An approach based on expectaton maxmzaton [5 s descrbed n pror work [14. Ths paper quantfes the confdence approxmately n the answer reported by such an estmator. We concern ourselves wth bnary measurements only; for example, reportng whether or not a gven person or object was seen at a gven locaton. Our dervaton leverages the asymptotc normalty of maxmum lkelhood estmaton and computes the approxmaton of Cramer-Rao bound (CRB) of the estmaton parameters used n an expectaton maxmzaton scheme. It s shown that the probablty of the estmated partcpant relablty fallng nto the derved confdence nterval s always greater than the gven confdence level, as long as enough partcpants report suffcent observatons, and as long as some partcpants make the same observaton. In other words, t s shown that our confdence wndow correctly bounds estmaton error. The rest of ths paper s organzed as follows: In Secton 2, we revew a maxmum lkelhood estmaton approach for socal sensng applcatons and formulate the problem of dervng the confdence nterval of partcpant relablty. The dervaton of the proposed approxmaton for CRB quantfcaton on maxmum lkelhood estmaton s dscussed n Secton 3. Evaluaton results are presented n Secton 4. We
2 revew related work n Secton 5. Fnally, we conclude the paper n Secton PROBLEM FORMULATION Frst, we revew the maxmum lkelhood estmaton approach for truth dscovery n socal sensng applcatons. We consder a socal sensng applcaton model where a group of M partcpants, S 1,..., S M, make ndvdual observatons about a set of N measured varables C 1,..., C N n ther envronment. Each measured varable denotes the exstence or lack thereof of certan phenomenon of applcaton s nterests. In ths effort, we consder only bnary varables and assume, wthout loss of generalty, that ther normal state s negatve. Hence, partcpants report only when a postve value s encountered. Each partcpant generally observes only a subset of all varables. Our goal s to determne the confdence nterval of the partcpant relablty maxmum lkelhood estmaton for a gven confdence level based only on the nformaton of whch observatons are reported by whch partcpant. Let S C j denote an observaton reported by partcpant S clamng that C j s true. Let P (Cj) t and P (C f j ) denote the probablty that the actual varable C j s ndeed true and false, respectvely. Dfferent partcpants may make dfferent numbers of observatons. Let the probablty that partcpant S makes an observaton be s. Further, let the probablty that partcpant S s rght be t and the probablty that t s wrong be 1 t. Note that, ths probablty depends on the partcpant s relablty, whch s not known a pror. Formally, t s defned as: t = P (C t j S C j) (1) Let us also defne a as the (unknown) probablty that partcpant S reports a varable to be true when t s ndeed true, and b as the (unknown) probablty that partcpant S reports a varable to be true when t s n realty false. Formally, a and b are defned as follows: a = P (S C j C t j) b = P (S C j C f j ) (2) From the defnton of t, a and b, we can determne ther relatonshp usng the Bayesan theorem: a = P (S C j Cj) t = P (SCj, Ct j) P (Cj t) = P (Ct j S C j)p (S C j) P (Cj t) b = P (S C j C f j ) = P (SCj, Cf j ) P (C f j ) = P (C f j SCj)P (SCj) P (C f j ) (3) The key nput to the maxmum lkelhood estmator algorthm s a matrx SC, where S C j = 1 when partcpant S reports that C j s true, and S C j = 0 otherwse. Let us call t the observaton matrx. For ntalzaton, we also defne the background bas d to be the overall pror probablty that a randomly chosen measured varable s true. Note that, ths value can be known from past statstcs. It does not ndcate, however, whether any partcular measured varable s true or not. To ntalze the algorthm, we set P (C t j) = d and set P (S C j) = s. Pluggng these, together wth t nto the defnton of a and b, we get the ntal values: t s a = d (1 t) s b = 1 d The best (n the sense of maxmum lkelhood) estmate ˆt of the relablty of each partcpant S can be obtaned by usng the Expectaton Maxmzaton (EM) algorthm [5. In the EM algorthm, a latent varable Z s ntroduced for each measured varable to ndcate whether t s true or not (.e., z j s 1 when the measured varable C j s true and 0 otherwse). The observaton matrx SC s treated as the observed data X, and θ = (a 1, a 2,...a M ; b 1, b 2,...b M ) s the parameter vector of the model we want to estmate. The EM algorthm teratvely performs an expectaton step (Estep) and a maxmzaton step (M-step) to compute the best estmate of the parameter θ that maxmzes the expected logarthm lkelhood functon. The lkelhood functon used by EM scheme s gven by: L(θ; X, Z) = p(x, Z θ) { N M = a S C j (1 a ) (1 S C j ) d z j + j=1 M =1 =1 b S C j (1 b ) (1 S C j ) (1 d) (1 z j) An output of the EM algorthm s the maxmum lkelhood estmaton of each partcpant s relablty, whch s most consstent wth the observaton matrx SC. However, an mportant problem that remans unanswered from the maxmum lkelhood estmaton of the EM scheme s: what s the confdence nterval of the resultng partcpant relablty estmaton? Only by answerng ths queston, can we completely characterze estmaton performance, and hence partcpant relablty n socal sensng applcatons. The goal of ths paper s to demonstrate, n an analytcally founded manner, how to compute the confdence nterval of each partcpant s relablty. Formally, ths s gven by: (ˆt MLE } (4) (5) c lower p, ˆt MLE + c upper p ) c% (6) where c% s the confdence level of the estmaton nterval, c lower p and c upper p represent the lower and upper bound on the estmaton devaton from the maxmum lkelhood estmaton ˆt MLE respetvely. We target to fnd c lower p and for a gven c% and an observaton Matrx SC. c upper p 3. RELIABILITY DERIVATION In ths secton, we derve a confdence nterval based on the approxmaton of Cramer-Rao Bound and the aforementoned formulaton of maxmum lkelhood estmaton of partcpant relablty. The log-lkelhood functon (or logprobablty densty functon) of the maxmum lkelhood estmaton we get from EM can be expressed as: l em(x; θ) = log p em(x; θ) { [ N M = z j (S C j log a + (1 S C j) log(1 a ) + log d) j=1 =1 + (1 z j) [ M } (S C j log b + (1 S C j) log(1 b ) + log(1 d)) =1 (7)
3 The lkelhood (or probablty densty functon) s: p em(x; θ) = exp(l em(x; θ)) (8) The goal here s to show that the confdence nterval of the estmated parameter θ can be asymptotcally characterzed by the approxmaton of Cramer-Rao bound (CRB) gven the observaton matrx as well as estmated truthfulness of each measured varable from EM scheme. In statstc mathematcs and nformaton theory, the Fsher nformaton s a way of measurng the amount of nformaton that an observable random varable X carres about an estmated parameter θ upon whch the probablty of X depends. The partal dervatve of the log-lkelhood functon l(x; θ) wth respect to θ s called the score. A score vector ψ(x; θ) for a k 1 estmaton vector θ = [θ 1, θ 2,..., θ k T s defned as: l(x; θ) l(x; θ) l(x; θ) ψ(x; θ) = [,,..., T (9) θ 1 θ 2 θ k The Fsher nformaton s defned as the second moment of the score vector: I(θ) = E X[ψ(X; θ)ψ(x; θ) T (10) where the expectaton s taken over all values for X wth respect to the probablty functon p(x; θ) for any gven value of θ. Hence, the Fsher nformaton for the above estmaton vector θ takes the form of an k k matrx, the Fsher Informaton Matrx, wth the representatve element: l(x; θ) l(x; θ) (I(θ)),j = E X[( )( ) (11) θ θ j If l(x; θ) s twce dfferentable wth respect to θ (whch happens to be the case for EM model), under certan regularty condtons, the Fsher Informaton Matrx may also be wrtten as [10: (I(θ)),j = E X[ 2 l(x; θ) θ θ j (12) In estmaton theory and statstcs, the Cramer-Rao bound (CRB) expresses a lower bound on the varance of estmators of a determnstc parameter. In ts smplest form, the bound states the varance of any unbased estmator s at least as hgh as the nverse of the Fsher nformaton [10. The estmator that reaches ths lower bound s sad to be effcent. The maxmum lkelhood estmator posses a number of attractve asymptotc propertes. One of them s called asymptotc normalty, whch bascally states the MLE estmator s asymptotcally dstrbuted wth Gaussan behavor as the data sample sze goes up, n partcular[3: n(ˆθmle θ 0) d N(0, I 1 (ˆθ MLE)) (13) where n s the sample sze, θ 0 and ˆθ MLE are the true value and the maxmum lkelhood estmaton of the parameter θ respectvely. The Fsher nformaton at the MLE s used to to estmate ts true value [10. Hence, the asymptotc normalty property means that n a regular case of estmaton and n the dstrbuton lmtng sense, the maxmum lkelhood estmator ˆθ MLE s unbased and ts covarance reaches the Cramer-Rao bound (.e., an effcent estmator). Snce the estmator we obtan from the EM algorthm s a maxmum lkelhood estmator of the parameter θ, we now show how to leverage the asymptotc normalty and the approxmaton of Cramer-Rao bound to derve a confdence nterval that quantfes the estmaton accuracy of the estmated parameter θ for the model of the EM scheme. We frst compute the approxmaton of Fsher Informaton Matrx from the log-lkelhood functon gven by Equaton (7). Note that ths computaton utlzes the estmated truthfulness of each measured varable from EM scheme, hence offers approxmated results. Accordng to pror work [14, the maxmum lkelhood estmator ˆθ MLE s gven by: â MLE = ˆbMLE j SJ Z c j N j=1 Zc j = K j SJ Z c j N N j=1 Zc j (14) where SJ s the set of measured varables reported by partcpant S and Zj c s the converged value of Z(t, j) (.e., p(z j = 1 X j, θ (t) )) from EM algorthm. Observe that each â MLE ˆb MLE or s computed from N ndependent samples (.e., measured varables). The Fsher nformaton n a random sample of sze n s n tmes the Fsher nformaton n one observaton [10, and hence I n(θ) = ni(θ) (15) Pluggng l em(x; θ) gven by Equaton (7) nto the Fsher Informaton Matrx defned n Equaton (12), we have: (I(ˆθ MLE)),j (16) 0 j [ = E 1 2 l em(x;a ) X N a =â MLE = j [1, M [ E 1 X N a 2 2 l em(x;b ) b 2 b =ˆb MLE = j (M, 2M Observe that the Fsher Informaton Matrx of the maxmumlkelhood estmator from the EM scheme s a dagonal matrx, hence the nverse of ths matrx s: (I 1 (ˆθ MLE)),j (17) 0 j [ E X = [ E X N 2 lem(x;a ) a 2 N 2 lem(x;b ) b 2 a =â MLE b =ˆb MLE = j [1, M = j (M, 2M From the asymptotc normalty of the maxmum lkelhood estmator specfed by Equaton (13), we know that (ˆθ MLE θ 0) d N(0, 1 N I 1 (ˆθ MLE)). Therefore, substtutng (I 1 (ˆθ MLE)) by Equaton (17) nto Equaton (13), we obtan the covarance matrx Cov(ˆθ MLE) of the asymptotc normal dstrbuton for the maxmum lkelhood estmaton of EM scheme, whch s gven by: (Cov(ˆθ MLE)),j (18) 0 j [ 1 E X = 2 lem(x;a ) a =â MLE = j [1, M a 2 [ 1 E X = j (M, 2M 2 lem(x;b ) b 2 b =ˆb MLE Usng the converged log-lkelhood functon of Equaton (7) and substtutng Equaton (14) nto Equaton (18), the above covarance matrx can be further wrtten as:
4 (Cov(ˆθ MLE)),j (19) 0 j â = MLE (1 â MLE ) = j [1, M N d ˆbMLE (1 ˆb MLE ) = j (M, 2M N (1 d) Note that, the actual CRB bound s a functon of both M and N. However, the approxmaton CRB bound derved s ndependent of M. Let us denote the varance of estmaton error on parameter a as V ar(â MLE ). Recall the relaton between partcpant relablty and estmaton parameter a s a = t s. For a gven topology, s d and d are known constants, (ˆt MLE t 0 ) also follows a norm dstrbuton wth 0 mean and varance gven by: V ar(ˆt MLE ) = ( d s ) 2 V ar(â MLE ) (20) Hence, we are able to derve the confdence nterval that can be used to quantfy the estmaton accuracy of the maxmum lkelhood estmaton from the EM scheme. The confdence nterval of the relablty estmaton of partcpant S (.e., ˆt MLE (ˆt MLE ) at confdence level p s gven by: c p V ar(ˆt MLE ), ˆt MLE + c p V ar(ˆt MLE )) (21) where c p s the standard score (z-score) of the confdence level p. For example, for the 95% confdence level, c p = Note that the derved confdence nterval of the partcpant relablty maxmum lkelhood estmator can be computed by smply usng the converged maxmum leklhood estmaton of the EM scheme. Ths completes the dervaton. 4. EVALUATION In ths secton, we carry out smulaton experments to evaluate the performance of the computed confdence nterval of partcpant relablty n socal sensng. We bult a smulator n Matlab that generates a random number of partcpants and measured varables. A random probablty P s assgned to each partcpant S representng hs/her relablty (.e., the ground truth probablty that they report correct observatons). For each partcpant S, L observatons are generated. Each observaton has a probablty P of beng true (.e., reportng a varable as true correctly) and a probablty 1 P of beng false (reportng a varable as true when t s not). One can thnk of these varables as observed problems. Partcpants do not report lack of problems. Hence, they never report a varable to be false. We let P be unformly dstrbuted between 0.5 and 1 n our experments 1. We evaluate the derved confdence nterval on partcpant relablty over three dfferent observaton matrx scales: small, medum and large. The smulaton parameters of the three observaton matrx scales are lsted n Table 1. The average observatons reported by each partcpant s set to 100. For each observaton matrx scale, we run the EM algorthm and compute the confdence nterval on partcpant relablty based on Equaton (21). We repeat the experments In prncple, there s no ncentve for a partcpant to le more than 50% of the tme, snce negatng ther statements would then gve a more accurate truth tmes for each observaton matrx scale and call the experments wth the actual estmaton error fallng outsde the confdence nterval outlers. We choose three representatve confdence levels (.e., 68%, 90%, 95% 2 ), respectvely. For a gven confdence level, we further defne the partcpant who has the largest number of outlers over all experments as the worst-case partcpant. Hence, we record the number of ourlters of every worst-case partcpant for a gven confdence level and compare t wth the theoretcal maxmum number of outlers. Observaton Number Number of Number of Matrx of Partcpantsured True Mea- False Measured Scale Var- ables Var- ables Small Medum Large Table 1: Parameters of Three Typcal Observaton Matrx Scale Fgure 1 shows the confdence nterval bounds on the partcpant relablty estmaton error wth three dfferent confdence levels for the small observaton matrx. Note that the CRB s smply a functon of the ground truth parameter values. However, t s reasonable to substtute the true (but unknown) parameter values wth ther ML estmates [10. Ths s the reason that bounds n the fgure appear to fluctuate rather than beng flat. Observe that the actual estmaton error on partcpant relablty s well bounded by ts correspondng confdence nterval. Specally, the numbers of outlers for the worst-case partcpant at confdence levels 68%, 90% and 95% are 24, 8 and 5 out of 100 experments. They are less than the theorectal maxmum number of outlers for the three confdence levels (.e., 32, 10 and 5 out of 100 experments, predcted usng our derved bound). Smlar results are observed for the medum and large observaton matrces as well, whch are shown n Fgure 2 and Fgure 3. A summary of the comparson between confdence nterval bounds n estmatng partcpant relablty and the theoretcal results s shown n Table 2. We observe that the probablty of the estmated partcpant relablty fallng nto the derved confdence nterval s always greater than the correspondng confdence level. Snce the derved bound s an approxmaton of true CRB (.e., t depends on the correct estmaton of truthfulness of measured varables from EM scheme), we study the condtons when such approxmated bound fals to bound the actual error of the estmaton parameters. We fx the true and false measured varables to be 1000 respectvely, the average observatons per partcpant s set to 100. We vary the number of partcpants from very small (.e., 5) to large (.e., 205). Reported results are averaged over 100 experments. Fgure 4 shows the square root of the average MSE (mean squared error) of 3 confdence nterval bounds on the estmated parameter a and b when the number of partcpants vares. Observe that the hgh confdence bounds (.e., 95% or 90% ) fal to bound the root of MSE on a or b only when the number of partcpants (M) s very small. Ths s due to 2 They correspond to one, two and three tmes standard devaton confdence ntervals of normal dstrbuton
5 (a) 68% Confdence Bound (b) 90% Confdence Bound (c) 95% Confdence Bound Fgure 1: CRB Confdence Bounds on Partcpant Relablty for Small Observaton Matrx (a) 68% Confdence Bound (b) 90% Confdence Bound (c) 95% Confdence Bound Fgure 2: CRB Confdence Bounds on Partcpant Relablty for Medum Observaton Matrx (a) 68% Confdence Bound (b) 90% Confdence Bound (c) 95% Confdence Bound Fgure 3: CRB Confdence Bounds on Partcpant Relablty for Large Observaton Matrx (a) Confdence Bound on a (b) Confdence Bound on b Fgure 4: CRB Confdence Bound on a and b versus Vayng M the poor estmaton results of measured varbles when too few partcpants report ther observatons. However, when M s reasonablly suffcent (e.g., 25 n the experment), hgh confdence bounds always bound the square root of the average MSE on the estmated parameters correctly. 5. RELATED WORK Socal sensng has receved sgnfcant attenton due to the great ncrease n the number of moble sensors owned by ndvduals and the prolferaton of Internet connectvty. To assess the credblty of partcpants and facts reported n partcpatory sensng and other socal sensng applcatons, a relevant body of work, called fact-fnders, n the machne learnng and data mnng communtes performs trust analyss. The basc fact-fnders nclude Hubs and Authortes [11,
6 Observaton Matrx Scale Small Medum Large Confdence Level of Estmaton Theoretcal Maxmum Outlers/Total Experments 68% 32/100 24/100 90% 10/100 8/100 95% 5/100 5/100 68% 32/100 25/100 90% 10/100 6/100 95% 5/100 4/100 68% 32/100 25/100 90% 10/100 9/100 95% 5/100 2/100 CRB Bound Worst Case Outlers/Total Experments Table 2: CRB Bound on Partcpant Probablty versus Theoretcal Results Average.Log [12, and TruthFnder [15. Other fact-fnders enhance the basc framework by ncorporatng analyss on propertes or dependences wthn assertons or sources [9, 2, 8, 7, 6. Fact-fndng n the case of socal sensng s more challengng due to the hghly dynamc nature of socal sensng applcatons [1. Moreover, the outputs of fact-fnders are generally rankngs of credblty values of partcpants and facts. Such rankngs cannot be used to drectly quantfy the partcpant relablty or fact correctness. Recent work presented a Bayesan Interpretaton scheme [13 representng an ntal effort to provde a probablty nterpretaton of rankng outputs from fact-fnders. However, t remans an approxmaton approach n whch the accuracy of truth estmaton s very senstve to ntal condtons of teratons. Due to ths lmtaton, a maxmum lkelhood estmaton approach usng EM algorthm s proposed to provde the frst optmal soluton to the truth dscovery problem n socal sensng [14. The EM scheme was shown to outperform Bayesan nterpretaton and other state-of-art factfnders. However, only average estmaton accuraces were reported n both of above schemes. The confdence nterval of the estmaton accuracy has not been found. In contrast, ths paper derves, for the frst tme, the confdence nterval of partcpant relablty based on the approxmaton of CRB, hence completes the quantfcaton of partcpant relablty estmaton n socal sensng. In estmaton theory and statstcs, the Cramer-Rao bound refers to a lower bound on the varance of estmators of a determnstc parameter [4. The bound states the varance of any unbased estmator s lower-bounded by the nverse of Fsher nformaton [10. One of the key propertes of maxmum lkelhood estmaton s asymptotc normalty. An EM scheme provdes maxmum lkelhood estmaton of partcpant relablty for socal sensng applcatons. Ths paper provdes the frst quantfcaton approach to compute the confdence nterval for partcpant relablty maxmumlkelhood estmaton based on the approxmaton of CRB by leveragng results from estmaton and nformaton theory. 6. CONCLUSION Ths paper descrbed a quantfcaton approach to compute the confdence nterval of the maxmum lkelhood estmaton on partcpant relablty based on the approxmaton of CRB n socal sensng applcatons. Ths quantfcaton approach completely characterzes the estmaton performance of partcpant relablty wthout knowng the trustworthness of partcpants a pror. The derved confdence nterval s obtaned by leveragng the asymptotc normalty of maxmum lkelhood estmaton and can be easly computed from the approxmated Fsher nformaton contaned n partcpants relablty estmatons. Evaluaton results show that the error n the estmated partcpant relablty s well bounded by the computed bound. In future work, a tghter CRB bound can probably be derved wthout knowng the truthfulness of each measured varable. Ths actual CRB bound s expected to better track the actual MSE of the estmated parameters. Acknowledgements Research reported n ths paper was sponsored by the Army Research Laboratory and was accomplshed under Cooperatve Agreement Number W911NF The vews and conclusons contaned n ths document are those of the authors and should not be nterpreted as representng the offcal polces, ether expressed or mpled, of the Army Research Laboratory or the U.S. Government. The U.S. Government s authorzed to reproduce and dstrbute reprnts for Government purposes notwthstandng any copyrght notaton here on. 7. REFERENCES [1 C. Aggarwal and T. Abdelzaher. Integratng sensors and socal networks. Socal Network Data Analytcs, Sprnger, expected n [2 L. Bert-Equlle, A. D. Sarma, X. Dong, A. Maran, and D. Srvastava. Salng the nformaton ocean wth awareness of currents: Dscovery and applcaton of source dependence. In CIDR 09, [3 G. Casella and R. Berger. Statstcal Inference. Duxbury Press, [4 H. Cramer. Mathematcal Methods of Statstcs. Prnceton Unv. Press., [5 A. P. Dempster, N. M. Lard, and D. B. Rubn. Maxmum lkelhood from ncomplete data va the em algorthm. JOURNAL OF THE ROYAL STATISTICAL SOCIETY, SERIES B, 39(1):1 38, [6 X. Dong, L. Bert-Equlle, Y. Hu, and D. Srvastava. Global detecton of complex copyng relatonshps between sources. PVLDB, 3(1): , [7 X. Dong, L. Bert-Equlle, and D. Srvastava. Truth dscovery and copyng detecton n a dynamc world. VLDB, 2(1): , [8 X. L. Dong, L. Bert-Equlle, and D. Srvastava. Integratng conflctng data: the role of source dependence. Proc. VLDB Endow., 2: , August [9 A. Galland, S. Abteboul, A. Maran, and P. Senellart. Corroboratng nformaton from dsagreeng vews. In WSDM, pages , [10 R. V. Hogg and A. T. Crag. Introducton to mathematcal statstcs. Prentce Hall, [11 J. M. Klenberg. Authortatve sources n a hyperlnked envronment. Journal of the ACM, 46(5): , [12 J. Pasternack and D. Roth. Knowng what to beleve (when you already know somethng). In Internatonal Conference on Computatonal Lngustcs (COLING), [13 D. Wang, T. Abdelzaher, H. Ahmad, J. Pasternack, D. Roth, M. Gupta, J. Han, O. Fatemeh, and H. Le. On bayesan nterpretaton of fact-fndng n nformaton networks. In 14th Internatonal Conference on Informaton Fuson (Fuson 2011), [14 D. Wang, T. Abdelzaher, and L. Kaplan. On truth dscovery n socal sensng: A maxmum lkelhood estmaton approach. UIUC Techncal Report, [15 X. Yn, J. Han, and P. S. Yu. Truth dscovery wth multple conflctng nformaton provders on the web. IEEE Trans. on Knowl. and Data Eng., 20: , June 2008.
ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)
Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of
More informationComputing MLE Bias Empirically
Computng MLE Bas Emprcally Kar Wa Lm Australan atonal Unversty January 3, 27 Abstract Ths note studes the bas arses from the MLE estmate of the rate parameter and the mean parameter of an exponental dstrbuton.
More informationOn Truth Discovery in Social Sensing: A Maximum Likelihood Estimation Approach
On Truth Dscovery n Socal Sensng: A Maxmum Lkelhood Estmaton Approach Dong Wang, Tarek Abdelzaher Department of Computer Scence Unversty of Illnos Urbana, IL 6181 dwang24@llnos.edu, zaher@cs.llnos.edu
More informationEcon107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)
I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes
More informationOn an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1
On an Extenson of Stochastc Approxmaton EM Algorthm for Incomplete Data Problems Vahd Tadayon Abstract: The Stochastc Approxmaton EM (SAEM algorthm, a varant stochastc approxmaton of EM, s a versatle tool
More informationComposite Hypotheses testing
Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter
More informationUsing T.O.M to Estimate Parameter of distributions that have not Single Exponential Family
IOSR Journal of Mathematcs IOSR-JM) ISSN: 2278-5728. Volume 3, Issue 3 Sep-Oct. 202), PP 44-48 www.osrjournals.org Usng T.O.M to Estmate Parameter of dstrbutons that have not Sngle Exponental Famly Jubran
More information2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification
E395 - Pattern Recognton Solutons to Introducton to Pattern Recognton, Chapter : Bayesan pattern classfcaton Preface Ths document s a soluton manual for selected exercses from Introducton to Pattern Recognton
More information1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands
Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of
More informationLinear Approximation with Regularization and Moving Least Squares
Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...
More informationChapter 13: Multiple Regression
Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to
More informationStatistics II Final Exam 26/6/18
Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the
More informationPredictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore
Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.
More informationModule 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur
Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:
More informationStatistics for Economics & Business
Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable
More informationGlobal Sensitivity. Tuesday 20 th February, 2018
Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values
More informationParametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010
Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton
More information4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA
4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected
More informationECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics
ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott
More informationTracking with Kalman Filter
Trackng wth Kalman Flter Scott T. Acton Vrgna Image and Vdeo Analyss (VIVA), Charles L. Brown Department of Electrcal and Computer Engneerng Department of Bomedcal Engneerng Unversty of Vrgna, Charlottesvlle,
More informationLecture Notes on Linear Regression
Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume
More informationSTAT 3008 Applied Regression Analysis
STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,
More informationx = , so that calculated
Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to
More informationChapter 11: Simple Linear Regression and Correlation
Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests
More informationJoint Statistical Meetings - Biopharmaceutical Section
Iteratve Ch-Square Test for Equvalence of Multple Treatment Groups Te-Hua Ng*, U.S. Food and Drug Admnstraton 1401 Rockvlle Pke, #200S, HFM-217, Rockvlle, MD 20852-1448 Key Words: Equvalence Testng; Actve
More informationStat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors
Stat60: Bayesan Modelng and Inference Lecture Date: February, 00 Reference Prors Lecturer: Mchael I. Jordan Scrbe: Steven Troxler and Wayne Lee In ths lecture, we assume that θ R; n hgher-dmensons, reference
More informationDepartment of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6
Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.
More informationUncertainty and auto-correlation in. Measurement
Uncertanty and auto-correlaton n arxv:1707.03276v2 [physcs.data-an] 30 Dec 2017 Measurement Markus Schebl Federal Offce of Metrology and Surveyng (BEV), 1160 Venna, Austra E-mal: markus.schebl@bev.gv.at
More information18.1 Introduction and Recap
CS787: Advanced Algorthms Scrbe: Pryananda Shenoy and Shjn Kong Lecturer: Shuch Chawla Topc: Streamng Algorthmscontnued) Date: 0/26/2007 We contnue talng about streamng algorthms n ths lecture, ncludng
More informationLOW BIAS INTEGRATED PATH ESTIMATORS. James M. Calvin
Proceedngs of the 007 Wnter Smulaton Conference S G Henderson, B Bller, M-H Hseh, J Shortle, J D Tew, and R R Barton, eds LOW BIAS INTEGRATED PATH ESTIMATORS James M Calvn Department of Computer Scence
More informationA Robust Method for Calculating the Correlation Coefficient
A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal
More informationPower law and dimension of the maximum value for belief distribution with the max Deng entropy
Power law and dmenson of the maxmum value for belef dstrbuton wth the max Deng entropy Bngy Kang a, a College of Informaton Engneerng, Northwest A&F Unversty, Yanglng, Shaanx, 712100, Chna. Abstract Deng
More informationUncertainty as the Overlap of Alternate Conditional Distributions
Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant
More informationNotes on Frequency Estimation in Data Streams
Notes on Frequency Estmaton n Data Streams In (one of) the data streamng model(s), the data s a sequence of arrvals a 1, a 2,..., a m of the form a j = (, v) where s the dentty of the tem and belongs to
More informationLecture 3 Stat102, Spring 2007
Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture
More informationLimited Dependent Variables
Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages
More informationA Comparative Study for Estimation Parameters in Panel Data Model
A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and
More informationASYMPTOTIC PROPERTIES OF ESTIMATES FOR THE PARAMETERS IN THE LOGISTIC REGRESSION MODEL
Asymptotc Asan-Afrcan Propertes Journal of Estmates Economcs for and the Econometrcs, Parameters n Vol. the Logstc, No., Regresson 20: 65-74 Model 65 ASYMPTOTIC PROPERTIES OF ESTIMATES FOR THE PARAMETERS
More informationThe Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction
ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also
More informationEstimation: Part 2. Chapter GREG estimation
Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the
More information3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X
Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number
More informationU-Pb Geochronology Practical: Background
U-Pb Geochronology Practcal: Background Basc Concepts: accuracy: measure of the dfference between an expermental measurement and the true value precson: measure of the reproducblty of the expermental result
More informationprinceton univ. F 13 cos 521: Advanced Algorithm Design Lecture 3: Large deviations bounds and applications Lecturer: Sanjeev Arora
prnceton unv. F 13 cos 521: Advanced Algorthm Desgn Lecture 3: Large devatons bounds and applcatons Lecturer: Sanjeev Arora Scrbe: Today s topc s devaton bounds: what s the probablty that a random varable
More informationSee Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)
Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes
More informationTime-Varying Systems and Computations Lecture 6
Tme-Varyng Systems and Computatons Lecture 6 Klaus Depold 14. Januar 2014 The Kalman Flter The Kalman estmaton flter attempts to estmate the actual state of an unknown dscrete dynamcal system, gven nosy
More informationStatistical inference for generalized Pareto distribution based on progressive Type-II censored data with random removals
Internatonal Journal of Scentfc World, 2 1) 2014) 1-9 c Scence Publshng Corporaton www.scencepubco.com/ndex.php/ijsw do: 10.14419/jsw.v21.1780 Research Paper Statstcal nference for generalzed Pareto dstrbuton
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models
More informationComparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method
Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method
More informationCOMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS
Avalable onlne at http://sck.org J. Math. Comput. Sc. 3 (3), No., 6-3 ISSN: 97-537 COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS
More informationEconomics 130. Lecture 4 Simple Linear Regression Continued
Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do
More informationParametric fractional imputation for missing data analysis
Secton on Survey Research Methods JSM 2008 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Wayne Fuller Abstract Under a parametrc model for mssng data, the EM algorthm s a popular tool
More informationChapter Newton s Method
Chapter 9. Newton s Method After readng ths chapter, you should be able to:. Understand how Newton s method s dfferent from the Golden Secton Search method. Understand how Newton s method works 3. Solve
More informationMotion Perception Under Uncertainty. Hongjing Lu Department of Psychology University of Hong Kong
Moton Percepton Under Uncertanty Hongjng Lu Department of Psychology Unversty of Hong Kong Outlne Uncertanty n moton stmulus Correspondence problem Qualtatve fttng usng deal observer models Based on sgnal
More informationLinear Regression Analysis: Terminology and Notation
ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented
More informationMATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2)
1/16 MATH 829: Introducton to Data Mnng and Analyss The EM algorthm (part 2) Domnque Gullot Departments of Mathematcal Scences Unversty of Delaware Aprl 20, 2016 Recall 2/16 We are gven ndependent observatons
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation
Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear
More informationMaximum Likelihood Estimation
Maxmum Lkelhood Estmaton INFO-2301: Quanttatve Reasonng 2 Mchael Paul and Jordan Boyd-Graber MARCH 7, 2017 INFO-2301: Quanttatve Reasonng 2 Paul and Boyd-Graber Maxmum Lkelhood Estmaton 1 of 9 Why MLE?
More informationBasic Business Statistics, 10/e
Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours
UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x
More informationGoodness of fit and Wilks theorem
DRAFT 0.0 Glen Cowan 3 June, 2013 Goodness of ft and Wlks theorem Suppose we model data y wth a lkelhood L(µ) that depends on a set of N parameters µ = (µ 1,...,µ N ). Defne the statstc t µ ln L(µ) L(ˆµ),
More information2 STATISTICALLY OPTIMAL TRAINING DATA 2.1 A CRITERION OF OPTIMALITY We revew the crteron of statstcally optmal tranng data (Fukumzu et al., 1994). We
Advances n Neural Informaton Processng Systems 8 Actve Learnng n Multlayer Perceptrons Kenj Fukumzu Informaton and Communcaton R&D Center, Rcoh Co., Ltd. 3-2-3, Shn-yokohama, Yokohama, 222 Japan E-mal:
More information2016 Wiley. Study Session 2: Ethical and Professional Standards Application
6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton
More informationKernel Methods and SVMs Extension
Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general
More informationx i1 =1 for all i (the constant ).
Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by
More informationTransfer Functions. Convenient representation of a linear, dynamic model. A transfer function (TF) relates one input and one output: ( ) system
Transfer Functons Convenent representaton of a lnear, dynamc model. A transfer functon (TF) relates one nput and one output: x t X s y t system Y s The followng termnology s used: x y nput output forcng
More informationCS 2750 Machine Learning. Lecture 5. Density estimation. CS 2750 Machine Learning. Announcements
CS 750 Machne Learnng Lecture 5 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 539 Sennott Square CS 750 Machne Learnng Announcements Homework Due on Wednesday before the class Reports: hand n before
More informationParameters Estimation of the Modified Weibull Distribution Based on Type I Censored Samples
Appled Mathematcal Scences, Vol. 5, 011, no. 59, 899-917 Parameters Estmaton of the Modfed Webull Dstrbuton Based on Type I Censored Samples Soufane Gasm École Supereure des Scences et Technques de Tuns
More informationChapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems
Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons
More informationHidden Markov Models & The Multivariate Gaussian (10/26/04)
CS281A/Stat241A: Statstcal Learnng Theory Hdden Markov Models & The Multvarate Gaussan (10/26/04) Lecturer: Mchael I. Jordan Scrbes: Jonathan W. Hu 1 Hdden Markov Models As a bref revew, hdden Markov models
More informationGrover s Algorithm + Quantum Zeno Effect + Vaidman
Grover s Algorthm + Quantum Zeno Effect + Vadman CS 294-2 Bomb 10/12/04 Fall 2004 Lecture 11 Grover s algorthm Recall that Grover s algorthm for searchng over a space of sze wors as follows: consder the
More informationMarkov Chain Monte Carlo (MCMC), Gibbs Sampling, Metropolis Algorithms, and Simulated Annealing Bioinformatics Course Supplement
Markov Chan Monte Carlo MCMC, Gbbs Samplng, Metropols Algorthms, and Smulated Annealng 2001 Bonformatcs Course Supplement SNU Bontellgence Lab http://bsnuackr/ Outlne! Markov Chan Monte Carlo MCMC! Metropols-Hastngs
More informationLecture 12: Classification
Lecture : Classfcaton g Dscrmnant functons g The optmal Bayes classfer g Quadratc classfers g Eucldean and Mahalanobs metrcs g K Nearest Neghbor Classfers Intellgent Sensor Systems Rcardo Guterrez-Osuna
More informationECE559VV Project Report
ECE559VV Project Report (Supplementary Notes Loc Xuan Bu I. MAX SUM-RATE SCHEDULING: THE UPLINK CASE We have seen (n the presentaton that, for downlnk (broadcast channels, the strategy maxmzng the sum-rate
More informationLECTURE 9 CANONICAL CORRELATION ANALYSIS
LECURE 9 CANONICAL CORRELAION ANALYSIS Introducton he concept of canoncal correlaton arses when we want to quantfy the assocatons between two sets of varables. For example, suppose that the frst set of
More informationj) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1
Random varables Measure of central tendences and varablty (means and varances) Jont densty functons and ndependence Measures of assocaton (covarance and correlaton) Interestng result Condtonal dstrbutons
More informationSmall Area Interval Estimation
.. Small Area Interval Estmaton Partha Lahr Jont Program n Survey Methodology Unversty of Maryland, College Park (Based on jont work wth Masayo Yoshmor, Former JPSM Vstng PhD Student and Research Fellow
More informationThe Order Relation and Trace Inequalities for. Hermitian Operators
Internatonal Mathematcal Forum, Vol 3, 08, no, 507-57 HIKARI Ltd, wwwm-hkarcom https://doorg/0988/mf088055 The Order Relaton and Trace Inequaltes for Hermtan Operators Y Huang School of Informaton Scence
More informationEcon Statistical Properties of the OLS estimator. Sanjaya DeSilva
Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate
More informationComparison of Regression Lines
STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence
More informationApplication of B-Spline to Numerical Solution of a System of Singularly Perturbed Problems
Mathematca Aeterna, Vol. 1, 011, no. 06, 405 415 Applcaton of B-Splne to Numercal Soluton of a System of Sngularly Perturbed Problems Yogesh Gupta Department of Mathematcs Unted College of Engneerng &
More informationEEE 241: Linear Systems
EEE : Lnear Systems Summary #: Backpropagaton BACKPROPAGATION The perceptron rule as well as the Wdrow Hoff learnng were desgned to tran sngle layer networks. They suffer from the same dsadvantage: they
More informationThe EM Algorithm (Dempster, Laird, Rubin 1977) The missing data or incomplete data setting: ODL(φ;Y ) = [Y;φ] = [Y X,φ][X φ] = X
The EM Algorthm (Dempster, Lard, Rubn 1977 The mssng data or ncomplete data settng: An Observed Data Lkelhood (ODL that s a mxture or ntegral of Complete Data Lkelhoods (CDL. (1a ODL(;Y = [Y;] = [Y,][
More informationMaximum Likelihood Estimation (MLE)
Maxmum Lkelhood Estmaton (MLE) Ken Kreutz-Delgado (Nuno Vasconcelos) ECE 175A Wnter 01 UCSD Statstcal Learnng Goal: Gven a relatonshp between a feature vector x and a vector y, and d data samples (x,y
More informationSolutions Homework 4 March 5, 2018
1 Solutons Homework 4 March 5, 018 Soluton to Exercse 5.1.8: Let a IR be a translaton and c > 0 be a re-scalng. ˆb1 (cx + a) cx n + a (cx 1 + a) c x n x 1 cˆb 1 (x), whch shows ˆb 1 s locaton nvarant and
More informationENG 8801/ Special Topics in Computer Engineering: Pattern Recognition. Memorial University of Newfoundland Pattern Recognition
EG 880/988 - Specal opcs n Computer Engneerng: Pattern Recognton Memoral Unversty of ewfoundland Pattern Recognton Lecture 7 May 3, 006 http://wwwengrmunca/~charlesr Offce Hours: uesdays hursdays 8:30-9:30
More informationSingular Value Decomposition: Theory and Applications
Sngular Value Decomposton: Theory and Applcatons Danel Khashab Sprng 2015 Last Update: March 2, 2015 1 Introducton A = UDV where columns of U and V are orthonormal and matrx D s dagonal wth postve real
More informationConvergence of random processes
DS-GA 12 Lecture notes 6 Fall 216 Convergence of random processes 1 Introducton In these notes we study convergence of dscrete random processes. Ths allows to characterze phenomena such as the law of large
More informationHomework Assignment 3 Due in class, Thursday October 15
Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.
More informationNON-CENTRAL 7-POINT FORMULA IN THE METHOD OF LINES FOR PARABOLIC AND BURGERS' EQUATIONS
IJRRAS 8 (3 September 011 www.arpapress.com/volumes/vol8issue3/ijrras_8_3_08.pdf NON-CENTRAL 7-POINT FORMULA IN THE METHOD OF LINES FOR PARABOLIC AND BURGERS' EQUATIONS H.O. Bakodah Dept. of Mathematc
More informationUSE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE
STATISTICA, anno LXXV, n. 4, 015 USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE Manoj K. Chaudhary 1 Department of Statstcs, Banaras Hndu Unversty, Varanas,
More informationCHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE
CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE Analytcal soluton s usually not possble when exctaton vares arbtrarly wth tme or f the system s nonlnear. Such problems can be solved by numercal tmesteppng
More informationTopic- 11 The Analysis of Variance
Topc- 11 The Analyss of Varance Expermental Desgn The samplng plan or expermental desgn determnes the way that a sample s selected. In an observatonal study, the expermenter observes data that already
More informationRetrieval Models: Language models
CS-590I Informaton Retreval Retreval Models: Language models Luo S Department of Computer Scence Purdue Unversty Introducton to language model Ungram language model Document language model estmaton Maxmum
More informationChapter 4: Regression With One Regressor
Chapter 4: Regresson Wth One Regressor Copyrght 2011 Pearson Addson-Wesley. All rghts reserved. 1-1 Outlne 1. Fttng a lne to data 2. The ordnary least squares (OLS) lne/regresson 3. Measures of ft 4. Populaton
More informationInterval Estimation in the Classical Normal Linear Regression Model. 1. Introduction
ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model
More informationLecture 10 Support Vector Machines II
Lecture 10 Support Vector Machnes II 22 February 2016 Taylor B. Arnold Yale Statstcs STAT 365/665 1/28 Notes: Problem 3 s posted and due ths upcomng Frday There was an early bug n the fake-test data; fxed
More informationRelevance Vector Machines Explained
October 19, 2010 Relevance Vector Machnes Explaned Trstan Fletcher www.cs.ucl.ac.uk/staff/t.fletcher/ Introducton Ths document has been wrtten n an attempt to make Tppng s [1] Relevance Vector Machnes
More informationPopulation Design in Nonlinear Mixed Effects Multiple Response Models: extension of PFIM and evaluation by simulation with NONMEM and MONOLIX
Populaton Desgn n Nonlnear Mxed Effects Multple Response Models: extenson of PFIM and evaluaton by smulaton wth NONMEM and MONOLIX May 4th 007 Carolne Bazzol, Sylve Retout, France Mentré Inserm U738 Unversty
More informationHidden Markov Models
Hdden Markov Models Namrata Vaswan, Iowa State Unversty Aprl 24, 204 Hdden Markov Model Defntons and Examples Defntons:. A hdden Markov model (HMM) refers to a set of hdden states X 0, X,..., X t,...,
More informationExpectation Maximization Mixture Models HMMs
-755 Machne Learnng for Sgnal Processng Mture Models HMMs Class 9. 2 Sep 200 Learnng Dstrbutons for Data Problem: Gven a collecton of eamples from some data, estmate ts dstrbuton Basc deas of Mamum Lelhood
More information