On Quantifying the Accuracy of Maximum Likelihood Estimation of Participant Reliability in Social Sensing

Size: px
Start display at page:

Download "On Quantifying the Accuracy of Maximum Likelihood Estimation of Participant Reliability in Social Sensing"

Transcription

1 On Quantfyng the Accuracy of Maxmum Lkelhood Estmaton of Partcpant Relablty n Socal Sensng Dong Wang, Tarek Abdelzaher Department of Computer Scence Unversty of Illnos Urbana, IL dwang24@llnos.edu, zaher@llnos.edu Lance Kaplan Charu C. Aggarwal Networked Sensng & Fuson IBM Research Branch Yorktown Heghts, NY US Army Research Laboratory charu@us.bm.com Adelph, MD lance.m.kaplan@us.army.ml ABSTRACT Ths paper presents a confdence nterval quantfcaton of maxmum lkelhood estmaton of partcpant relablty n socal sensng applcatons. The work s motvated by the emergence of socal sensng as a data collecton paradgm, where humans perform the data collecton tasks. A key challenge n socal sensng applcatons les n the uncertan nature of human measurements. Unlke well-calbrated and well-tested nfrastructure sensors, humans are less relable, and the lkelhood that partcpants measurements are correct s often unknown a pror. Hence, t s hard to estmate the accuracy of conclusons made based on socal sensng data. In prevous work, we developed a maxmum lkelhood estmator of relablty of both partcpants and facts concluded from the data. Ths paper presents an analytcally-founded bound that quantfes the accuracy of such maxmum lkelhood estmaton n socal sensng. A confdence nterval s derved by leveragng the asymptotc normalty of maxmum lkelhood estmaton and computng the approxmaton of Cramer-Rao bound (CRB) for the estmaton parameters. The proposed quantfcaton approach s emprcally valdated and shown to accurately bound the actual estmaton error gven suffcent number of partcpants under dfferent sensng topologes. 1. INTRODUCTION Socal sensng, where ndvduals act as sensors, s a key emergng category of sensng applcatons. Yet, quantfyng the relablty of data collected from human sources remans one of the man challenges n utlzng socal sensng n msson-crtcal systems. Ths relablty problem has long been known n mltary scenaros and s becomng ncreasngly mportant n commercal and cvl settngs as well. The man research queston s one of quantfyng a level of con- Permsson to make dgtal or hard copes of all or part of ths work for personal or classroom use s granted wthout fee provded that copes are not made or dstrbuted for proft or commercal advantage and that copes bear ths notce and the full ctaton on the frst page. To copy otherwse, to republsh, to post on servers or to redstrbute to lsts, requres pror specfc permsson and/or a fee. Ths artcle was presented at: 8th Internatonal Workshop on Data Management for Sensor Networks (DMSN 2011) Copyrght fdence n nformaton reported by a group of human observers. Ths paper quantfes confdence n socal sensng observatons by computng a confdence nterval of a maxmum lkelhood estmator of partcpant relablty from socal sensng data. The confdence nterval s computed based on the approxmaton of Cramer-Rao bound (CRB) [4, whch quantfes the varance of a mnmum-varance unbased estmator. Ths bound approxmates the CRB by leveragng observatons from partcpants and knowledge of truthfulness of facts estmated from an maxmum lkelhood estmator of socal sensng [14. We consder a sensng applcaton n whch data are collected from a large populaton, where the relablty (.e., the probablty of correctness) of ndvdual partcpants, and hence observatons made by them, s not known a pror. We am to derve the confdence nterval n the maxmum lkelhood estmaton of partcpant and observaton relablty gven no pror knowledge other than the nformaton descrbng who reported whch observatons. The maxmum lkelhood estmaton problem tself s not the topc of ths paper. An approach based on expectaton maxmzaton [5 s descrbed n pror work [14. Ths paper quantfes the confdence approxmately n the answer reported by such an estmator. We concern ourselves wth bnary measurements only; for example, reportng whether or not a gven person or object was seen at a gven locaton. Our dervaton leverages the asymptotc normalty of maxmum lkelhood estmaton and computes the approxmaton of Cramer-Rao bound (CRB) of the estmaton parameters used n an expectaton maxmzaton scheme. It s shown that the probablty of the estmated partcpant relablty fallng nto the derved confdence nterval s always greater than the gven confdence level, as long as enough partcpants report suffcent observatons, and as long as some partcpants make the same observaton. In other words, t s shown that our confdence wndow correctly bounds estmaton error. The rest of ths paper s organzed as follows: In Secton 2, we revew a maxmum lkelhood estmaton approach for socal sensng applcatons and formulate the problem of dervng the confdence nterval of partcpant relablty. The dervaton of the proposed approxmaton for CRB quantfcaton on maxmum lkelhood estmaton s dscussed n Secton 3. Evaluaton results are presented n Secton 4. We

2 revew related work n Secton 5. Fnally, we conclude the paper n Secton PROBLEM FORMULATION Frst, we revew the maxmum lkelhood estmaton approach for truth dscovery n socal sensng applcatons. We consder a socal sensng applcaton model where a group of M partcpants, S 1,..., S M, make ndvdual observatons about a set of N measured varables C 1,..., C N n ther envronment. Each measured varable denotes the exstence or lack thereof of certan phenomenon of applcaton s nterests. In ths effort, we consder only bnary varables and assume, wthout loss of generalty, that ther normal state s negatve. Hence, partcpants report only when a postve value s encountered. Each partcpant generally observes only a subset of all varables. Our goal s to determne the confdence nterval of the partcpant relablty maxmum lkelhood estmaton for a gven confdence level based only on the nformaton of whch observatons are reported by whch partcpant. Let S C j denote an observaton reported by partcpant S clamng that C j s true. Let P (Cj) t and P (C f j ) denote the probablty that the actual varable C j s ndeed true and false, respectvely. Dfferent partcpants may make dfferent numbers of observatons. Let the probablty that partcpant S makes an observaton be s. Further, let the probablty that partcpant S s rght be t and the probablty that t s wrong be 1 t. Note that, ths probablty depends on the partcpant s relablty, whch s not known a pror. Formally, t s defned as: t = P (C t j S C j) (1) Let us also defne a as the (unknown) probablty that partcpant S reports a varable to be true when t s ndeed true, and b as the (unknown) probablty that partcpant S reports a varable to be true when t s n realty false. Formally, a and b are defned as follows: a = P (S C j C t j) b = P (S C j C f j ) (2) From the defnton of t, a and b, we can determne ther relatonshp usng the Bayesan theorem: a = P (S C j Cj) t = P (SCj, Ct j) P (Cj t) = P (Ct j S C j)p (S C j) P (Cj t) b = P (S C j C f j ) = P (SCj, Cf j ) P (C f j ) = P (C f j SCj)P (SCj) P (C f j ) (3) The key nput to the maxmum lkelhood estmator algorthm s a matrx SC, where S C j = 1 when partcpant S reports that C j s true, and S C j = 0 otherwse. Let us call t the observaton matrx. For ntalzaton, we also defne the background bas d to be the overall pror probablty that a randomly chosen measured varable s true. Note that, ths value can be known from past statstcs. It does not ndcate, however, whether any partcular measured varable s true or not. To ntalze the algorthm, we set P (C t j) = d and set P (S C j) = s. Pluggng these, together wth t nto the defnton of a and b, we get the ntal values: t s a = d (1 t) s b = 1 d The best (n the sense of maxmum lkelhood) estmate ˆt of the relablty of each partcpant S can be obtaned by usng the Expectaton Maxmzaton (EM) algorthm [5. In the EM algorthm, a latent varable Z s ntroduced for each measured varable to ndcate whether t s true or not (.e., z j s 1 when the measured varable C j s true and 0 otherwse). The observaton matrx SC s treated as the observed data X, and θ = (a 1, a 2,...a M ; b 1, b 2,...b M ) s the parameter vector of the model we want to estmate. The EM algorthm teratvely performs an expectaton step (Estep) and a maxmzaton step (M-step) to compute the best estmate of the parameter θ that maxmzes the expected logarthm lkelhood functon. The lkelhood functon used by EM scheme s gven by: L(θ; X, Z) = p(x, Z θ) { N M = a S C j (1 a ) (1 S C j ) d z j + j=1 M =1 =1 b S C j (1 b ) (1 S C j ) (1 d) (1 z j) An output of the EM algorthm s the maxmum lkelhood estmaton of each partcpant s relablty, whch s most consstent wth the observaton matrx SC. However, an mportant problem that remans unanswered from the maxmum lkelhood estmaton of the EM scheme s: what s the confdence nterval of the resultng partcpant relablty estmaton? Only by answerng ths queston, can we completely characterze estmaton performance, and hence partcpant relablty n socal sensng applcatons. The goal of ths paper s to demonstrate, n an analytcally founded manner, how to compute the confdence nterval of each partcpant s relablty. Formally, ths s gven by: (ˆt MLE } (4) (5) c lower p, ˆt MLE + c upper p ) c% (6) where c% s the confdence level of the estmaton nterval, c lower p and c upper p represent the lower and upper bound on the estmaton devaton from the maxmum lkelhood estmaton ˆt MLE respetvely. We target to fnd c lower p and for a gven c% and an observaton Matrx SC. c upper p 3. RELIABILITY DERIVATION In ths secton, we derve a confdence nterval based on the approxmaton of Cramer-Rao Bound and the aforementoned formulaton of maxmum lkelhood estmaton of partcpant relablty. The log-lkelhood functon (or logprobablty densty functon) of the maxmum lkelhood estmaton we get from EM can be expressed as: l em(x; θ) = log p em(x; θ) { [ N M = z j (S C j log a + (1 S C j) log(1 a ) + log d) j=1 =1 + (1 z j) [ M } (S C j log b + (1 S C j) log(1 b ) + log(1 d)) =1 (7)

3 The lkelhood (or probablty densty functon) s: p em(x; θ) = exp(l em(x; θ)) (8) The goal here s to show that the confdence nterval of the estmated parameter θ can be asymptotcally characterzed by the approxmaton of Cramer-Rao bound (CRB) gven the observaton matrx as well as estmated truthfulness of each measured varable from EM scheme. In statstc mathematcs and nformaton theory, the Fsher nformaton s a way of measurng the amount of nformaton that an observable random varable X carres about an estmated parameter θ upon whch the probablty of X depends. The partal dervatve of the log-lkelhood functon l(x; θ) wth respect to θ s called the score. A score vector ψ(x; θ) for a k 1 estmaton vector θ = [θ 1, θ 2,..., θ k T s defned as: l(x; θ) l(x; θ) l(x; θ) ψ(x; θ) = [,,..., T (9) θ 1 θ 2 θ k The Fsher nformaton s defned as the second moment of the score vector: I(θ) = E X[ψ(X; θ)ψ(x; θ) T (10) where the expectaton s taken over all values for X wth respect to the probablty functon p(x; θ) for any gven value of θ. Hence, the Fsher nformaton for the above estmaton vector θ takes the form of an k k matrx, the Fsher Informaton Matrx, wth the representatve element: l(x; θ) l(x; θ) (I(θ)),j = E X[( )( ) (11) θ θ j If l(x; θ) s twce dfferentable wth respect to θ (whch happens to be the case for EM model), under certan regularty condtons, the Fsher Informaton Matrx may also be wrtten as [10: (I(θ)),j = E X[ 2 l(x; θ) θ θ j (12) In estmaton theory and statstcs, the Cramer-Rao bound (CRB) expresses a lower bound on the varance of estmators of a determnstc parameter. In ts smplest form, the bound states the varance of any unbased estmator s at least as hgh as the nverse of the Fsher nformaton [10. The estmator that reaches ths lower bound s sad to be effcent. The maxmum lkelhood estmator posses a number of attractve asymptotc propertes. One of them s called asymptotc normalty, whch bascally states the MLE estmator s asymptotcally dstrbuted wth Gaussan behavor as the data sample sze goes up, n partcular[3: n(ˆθmle θ 0) d N(0, I 1 (ˆθ MLE)) (13) where n s the sample sze, θ 0 and ˆθ MLE are the true value and the maxmum lkelhood estmaton of the parameter θ respectvely. The Fsher nformaton at the MLE s used to to estmate ts true value [10. Hence, the asymptotc normalty property means that n a regular case of estmaton and n the dstrbuton lmtng sense, the maxmum lkelhood estmator ˆθ MLE s unbased and ts covarance reaches the Cramer-Rao bound (.e., an effcent estmator). Snce the estmator we obtan from the EM algorthm s a maxmum lkelhood estmator of the parameter θ, we now show how to leverage the asymptotc normalty and the approxmaton of Cramer-Rao bound to derve a confdence nterval that quantfes the estmaton accuracy of the estmated parameter θ for the model of the EM scheme. We frst compute the approxmaton of Fsher Informaton Matrx from the log-lkelhood functon gven by Equaton (7). Note that ths computaton utlzes the estmated truthfulness of each measured varable from EM scheme, hence offers approxmated results. Accordng to pror work [14, the maxmum lkelhood estmator ˆθ MLE s gven by: â MLE = ˆbMLE j SJ Z c j N j=1 Zc j = K j SJ Z c j N N j=1 Zc j (14) where SJ s the set of measured varables reported by partcpant S and Zj c s the converged value of Z(t, j) (.e., p(z j = 1 X j, θ (t) )) from EM algorthm. Observe that each â MLE ˆb MLE or s computed from N ndependent samples (.e., measured varables). The Fsher nformaton n a random sample of sze n s n tmes the Fsher nformaton n one observaton [10, and hence I n(θ) = ni(θ) (15) Pluggng l em(x; θ) gven by Equaton (7) nto the Fsher Informaton Matrx defned n Equaton (12), we have: (I(ˆθ MLE)),j (16) 0 j [ = E 1 2 l em(x;a ) X N a =â MLE = j [1, M [ E 1 X N a 2 2 l em(x;b ) b 2 b =ˆb MLE = j (M, 2M Observe that the Fsher Informaton Matrx of the maxmumlkelhood estmator from the EM scheme s a dagonal matrx, hence the nverse of ths matrx s: (I 1 (ˆθ MLE)),j (17) 0 j [ E X = [ E X N 2 lem(x;a ) a 2 N 2 lem(x;b ) b 2 a =â MLE b =ˆb MLE = j [1, M = j (M, 2M From the asymptotc normalty of the maxmum lkelhood estmator specfed by Equaton (13), we know that (ˆθ MLE θ 0) d N(0, 1 N I 1 (ˆθ MLE)). Therefore, substtutng (I 1 (ˆθ MLE)) by Equaton (17) nto Equaton (13), we obtan the covarance matrx Cov(ˆθ MLE) of the asymptotc normal dstrbuton for the maxmum lkelhood estmaton of EM scheme, whch s gven by: (Cov(ˆθ MLE)),j (18) 0 j [ 1 E X = 2 lem(x;a ) a =â MLE = j [1, M a 2 [ 1 E X = j (M, 2M 2 lem(x;b ) b 2 b =ˆb MLE Usng the converged log-lkelhood functon of Equaton (7) and substtutng Equaton (14) nto Equaton (18), the above covarance matrx can be further wrtten as:

4 (Cov(ˆθ MLE)),j (19) 0 j â = MLE (1 â MLE ) = j [1, M N d ˆbMLE (1 ˆb MLE ) = j (M, 2M N (1 d) Note that, the actual CRB bound s a functon of both M and N. However, the approxmaton CRB bound derved s ndependent of M. Let us denote the varance of estmaton error on parameter a as V ar(â MLE ). Recall the relaton between partcpant relablty and estmaton parameter a s a = t s. For a gven topology, s d and d are known constants, (ˆt MLE t 0 ) also follows a norm dstrbuton wth 0 mean and varance gven by: V ar(ˆt MLE ) = ( d s ) 2 V ar(â MLE ) (20) Hence, we are able to derve the confdence nterval that can be used to quantfy the estmaton accuracy of the maxmum lkelhood estmaton from the EM scheme. The confdence nterval of the relablty estmaton of partcpant S (.e., ˆt MLE (ˆt MLE ) at confdence level p s gven by: c p V ar(ˆt MLE ), ˆt MLE + c p V ar(ˆt MLE )) (21) where c p s the standard score (z-score) of the confdence level p. For example, for the 95% confdence level, c p = Note that the derved confdence nterval of the partcpant relablty maxmum lkelhood estmator can be computed by smply usng the converged maxmum leklhood estmaton of the EM scheme. Ths completes the dervaton. 4. EVALUATION In ths secton, we carry out smulaton experments to evaluate the performance of the computed confdence nterval of partcpant relablty n socal sensng. We bult a smulator n Matlab that generates a random number of partcpants and measured varables. A random probablty P s assgned to each partcpant S representng hs/her relablty (.e., the ground truth probablty that they report correct observatons). For each partcpant S, L observatons are generated. Each observaton has a probablty P of beng true (.e., reportng a varable as true correctly) and a probablty 1 P of beng false (reportng a varable as true when t s not). One can thnk of these varables as observed problems. Partcpants do not report lack of problems. Hence, they never report a varable to be false. We let P be unformly dstrbuted between 0.5 and 1 n our experments 1. We evaluate the derved confdence nterval on partcpant relablty over three dfferent observaton matrx scales: small, medum and large. The smulaton parameters of the three observaton matrx scales are lsted n Table 1. The average observatons reported by each partcpant s set to 100. For each observaton matrx scale, we run the EM algorthm and compute the confdence nterval on partcpant relablty based on Equaton (21). We repeat the experments In prncple, there s no ncentve for a partcpant to le more than 50% of the tme, snce negatng ther statements would then gve a more accurate truth tmes for each observaton matrx scale and call the experments wth the actual estmaton error fallng outsde the confdence nterval outlers. We choose three representatve confdence levels (.e., 68%, 90%, 95% 2 ), respectvely. For a gven confdence level, we further defne the partcpant who has the largest number of outlers over all experments as the worst-case partcpant. Hence, we record the number of ourlters of every worst-case partcpant for a gven confdence level and compare t wth the theoretcal maxmum number of outlers. Observaton Number Number of Number of Matrx of Partcpantsured True Mea- False Measured Scale Var- ables Var- ables Small Medum Large Table 1: Parameters of Three Typcal Observaton Matrx Scale Fgure 1 shows the confdence nterval bounds on the partcpant relablty estmaton error wth three dfferent confdence levels for the small observaton matrx. Note that the CRB s smply a functon of the ground truth parameter values. However, t s reasonable to substtute the true (but unknown) parameter values wth ther ML estmates [10. Ths s the reason that bounds n the fgure appear to fluctuate rather than beng flat. Observe that the actual estmaton error on partcpant relablty s well bounded by ts correspondng confdence nterval. Specally, the numbers of outlers for the worst-case partcpant at confdence levels 68%, 90% and 95% are 24, 8 and 5 out of 100 experments. They are less than the theorectal maxmum number of outlers for the three confdence levels (.e., 32, 10 and 5 out of 100 experments, predcted usng our derved bound). Smlar results are observed for the medum and large observaton matrces as well, whch are shown n Fgure 2 and Fgure 3. A summary of the comparson between confdence nterval bounds n estmatng partcpant relablty and the theoretcal results s shown n Table 2. We observe that the probablty of the estmated partcpant relablty fallng nto the derved confdence nterval s always greater than the correspondng confdence level. Snce the derved bound s an approxmaton of true CRB (.e., t depends on the correct estmaton of truthfulness of measured varables from EM scheme), we study the condtons when such approxmated bound fals to bound the actual error of the estmaton parameters. We fx the true and false measured varables to be 1000 respectvely, the average observatons per partcpant s set to 100. We vary the number of partcpants from very small (.e., 5) to large (.e., 205). Reported results are averaged over 100 experments. Fgure 4 shows the square root of the average MSE (mean squared error) of 3 confdence nterval bounds on the estmated parameter a and b when the number of partcpants vares. Observe that the hgh confdence bounds (.e., 95% or 90% ) fal to bound the root of MSE on a or b only when the number of partcpants (M) s very small. Ths s due to 2 They correspond to one, two and three tmes standard devaton confdence ntervals of normal dstrbuton

5 (a) 68% Confdence Bound (b) 90% Confdence Bound (c) 95% Confdence Bound Fgure 1: CRB Confdence Bounds on Partcpant Relablty for Small Observaton Matrx (a) 68% Confdence Bound (b) 90% Confdence Bound (c) 95% Confdence Bound Fgure 2: CRB Confdence Bounds on Partcpant Relablty for Medum Observaton Matrx (a) 68% Confdence Bound (b) 90% Confdence Bound (c) 95% Confdence Bound Fgure 3: CRB Confdence Bounds on Partcpant Relablty for Large Observaton Matrx (a) Confdence Bound on a (b) Confdence Bound on b Fgure 4: CRB Confdence Bound on a and b versus Vayng M the poor estmaton results of measured varbles when too few partcpants report ther observatons. However, when M s reasonablly suffcent (e.g., 25 n the experment), hgh confdence bounds always bound the square root of the average MSE on the estmated parameters correctly. 5. RELATED WORK Socal sensng has receved sgnfcant attenton due to the great ncrease n the number of moble sensors owned by ndvduals and the prolferaton of Internet connectvty. To assess the credblty of partcpants and facts reported n partcpatory sensng and other socal sensng applcatons, a relevant body of work, called fact-fnders, n the machne learnng and data mnng communtes performs trust analyss. The basc fact-fnders nclude Hubs and Authortes [11,

6 Observaton Matrx Scale Small Medum Large Confdence Level of Estmaton Theoretcal Maxmum Outlers/Total Experments 68% 32/100 24/100 90% 10/100 8/100 95% 5/100 5/100 68% 32/100 25/100 90% 10/100 6/100 95% 5/100 4/100 68% 32/100 25/100 90% 10/100 9/100 95% 5/100 2/100 CRB Bound Worst Case Outlers/Total Experments Table 2: CRB Bound on Partcpant Probablty versus Theoretcal Results Average.Log [12, and TruthFnder [15. Other fact-fnders enhance the basc framework by ncorporatng analyss on propertes or dependences wthn assertons or sources [9, 2, 8, 7, 6. Fact-fndng n the case of socal sensng s more challengng due to the hghly dynamc nature of socal sensng applcatons [1. Moreover, the outputs of fact-fnders are generally rankngs of credblty values of partcpants and facts. Such rankngs cannot be used to drectly quantfy the partcpant relablty or fact correctness. Recent work presented a Bayesan Interpretaton scheme [13 representng an ntal effort to provde a probablty nterpretaton of rankng outputs from fact-fnders. However, t remans an approxmaton approach n whch the accuracy of truth estmaton s very senstve to ntal condtons of teratons. Due to ths lmtaton, a maxmum lkelhood estmaton approach usng EM algorthm s proposed to provde the frst optmal soluton to the truth dscovery problem n socal sensng [14. The EM scheme was shown to outperform Bayesan nterpretaton and other state-of-art factfnders. However, only average estmaton accuraces were reported n both of above schemes. The confdence nterval of the estmaton accuracy has not been found. In contrast, ths paper derves, for the frst tme, the confdence nterval of partcpant relablty based on the approxmaton of CRB, hence completes the quantfcaton of partcpant relablty estmaton n socal sensng. In estmaton theory and statstcs, the Cramer-Rao bound refers to a lower bound on the varance of estmators of a determnstc parameter [4. The bound states the varance of any unbased estmator s lower-bounded by the nverse of Fsher nformaton [10. One of the key propertes of maxmum lkelhood estmaton s asymptotc normalty. An EM scheme provdes maxmum lkelhood estmaton of partcpant relablty for socal sensng applcatons. Ths paper provdes the frst quantfcaton approach to compute the confdence nterval for partcpant relablty maxmumlkelhood estmaton based on the approxmaton of CRB by leveragng results from estmaton and nformaton theory. 6. CONCLUSION Ths paper descrbed a quantfcaton approach to compute the confdence nterval of the maxmum lkelhood estmaton on partcpant relablty based on the approxmaton of CRB n socal sensng applcatons. Ths quantfcaton approach completely characterzes the estmaton performance of partcpant relablty wthout knowng the trustworthness of partcpants a pror. The derved confdence nterval s obtaned by leveragng the asymptotc normalty of maxmum lkelhood estmaton and can be easly computed from the approxmated Fsher nformaton contaned n partcpants relablty estmatons. Evaluaton results show that the error n the estmated partcpant relablty s well bounded by the computed bound. In future work, a tghter CRB bound can probably be derved wthout knowng the truthfulness of each measured varable. Ths actual CRB bound s expected to better track the actual MSE of the estmated parameters. Acknowledgements Research reported n ths paper was sponsored by the Army Research Laboratory and was accomplshed under Cooperatve Agreement Number W911NF The vews and conclusons contaned n ths document are those of the authors and should not be nterpreted as representng the offcal polces, ether expressed or mpled, of the Army Research Laboratory or the U.S. Government. The U.S. Government s authorzed to reproduce and dstrbute reprnts for Government purposes notwthstandng any copyrght notaton here on. 7. REFERENCES [1 C. Aggarwal and T. Abdelzaher. Integratng sensors and socal networks. Socal Network Data Analytcs, Sprnger, expected n [2 L. Bert-Equlle, A. D. Sarma, X. Dong, A. Maran, and D. Srvastava. Salng the nformaton ocean wth awareness of currents: Dscovery and applcaton of source dependence. In CIDR 09, [3 G. Casella and R. Berger. Statstcal Inference. Duxbury Press, [4 H. Cramer. Mathematcal Methods of Statstcs. Prnceton Unv. Press., [5 A. P. Dempster, N. M. Lard, and D. B. Rubn. Maxmum lkelhood from ncomplete data va the em algorthm. JOURNAL OF THE ROYAL STATISTICAL SOCIETY, SERIES B, 39(1):1 38, [6 X. Dong, L. Bert-Equlle, Y. Hu, and D. Srvastava. Global detecton of complex copyng relatonshps between sources. PVLDB, 3(1): , [7 X. Dong, L. Bert-Equlle, and D. Srvastava. Truth dscovery and copyng detecton n a dynamc world. VLDB, 2(1): , [8 X. L. Dong, L. Bert-Equlle, and D. Srvastava. Integratng conflctng data: the role of source dependence. Proc. VLDB Endow., 2: , August [9 A. Galland, S. Abteboul, A. Maran, and P. Senellart. Corroboratng nformaton from dsagreeng vews. In WSDM, pages , [10 R. V. Hogg and A. T. Crag. Introducton to mathematcal statstcs. Prentce Hall, [11 J. M. Klenberg. Authortatve sources n a hyperlnked envronment. Journal of the ACM, 46(5): , [12 J. Pasternack and D. Roth. Knowng what to beleve (when you already know somethng). In Internatonal Conference on Computatonal Lngustcs (COLING), [13 D. Wang, T. Abdelzaher, H. Ahmad, J. Pasternack, D. Roth, M. Gupta, J. Han, O. Fatemeh, and H. Le. On bayesan nterpretaton of fact-fndng n nformaton networks. In 14th Internatonal Conference on Informaton Fuson (Fuson 2011), [14 D. Wang, T. Abdelzaher, and L. Kaplan. On truth dscovery n socal sensng: A maxmum lkelhood estmaton approach. UIUC Techncal Report, [15 X. Yn, J. Han, and P. S. Yu. Truth dscovery wth multple conflctng nformaton provders on the web. IEEE Trans. on Knowl. and Data Eng., 20: , June 2008.

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Computing MLE Bias Empirically

Computing MLE Bias Empirically Computng MLE Bas Emprcally Kar Wa Lm Australan atonal Unversty January 3, 27 Abstract Ths note studes the bas arses from the MLE estmate of the rate parameter and the mean parameter of an exponental dstrbuton.

More information

On Truth Discovery in Social Sensing: A Maximum Likelihood Estimation Approach

On Truth Discovery in Social Sensing: A Maximum Likelihood Estimation Approach On Truth Dscovery n Socal Sensng: A Maxmum Lkelhood Estmaton Approach Dong Wang, Tarek Abdelzaher Department of Computer Scence Unversty of Illnos Urbana, IL 6181 dwang24@llnos.edu, zaher@cs.llnos.edu

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1

On an Extension of Stochastic Approximation EM Algorithm for Incomplete Data Problems. Vahid Tadayon 1 On an Extenson of Stochastc Approxmaton EM Algorthm for Incomplete Data Problems Vahd Tadayon Abstract: The Stochastc Approxmaton EM (SAEM algorthm, a varant stochastc approxmaton of EM, s a versatle tool

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family IOSR Journal of Mathematcs IOSR-JM) ISSN: 2278-5728. Volume 3, Issue 3 Sep-Oct. 202), PP 44-48 www.osrjournals.org Usng T.O.M to Estmate Parameter of dstrbutons that have not Sngle Exponental Famly Jubran

More information

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification E395 - Pattern Recognton Solutons to Introducton to Pattern Recognton, Chapter : Bayesan pattern classfcaton Preface Ths document s a soluton manual for selected exercses from Introducton to Pattern Recognton

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Tracking with Kalman Filter

Tracking with Kalman Filter Trackng wth Kalman Flter Scott T. Acton Vrgna Image and Vdeo Analyss (VIVA), Charles L. Brown Department of Electrcal and Computer Engneerng Department of Bomedcal Engneerng Unversty of Vrgna, Charlottesvlle,

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Joint Statistical Meetings - Biopharmaceutical Section

Joint Statistical Meetings - Biopharmaceutical Section Iteratve Ch-Square Test for Equvalence of Multple Treatment Groups Te-Hua Ng*, U.S. Food and Drug Admnstraton 1401 Rockvlle Pke, #200S, HFM-217, Rockvlle, MD 20852-1448 Key Words: Equvalence Testng; Actve

More information

Stat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors

Stat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors Stat60: Bayesan Modelng and Inference Lecture Date: February, 00 Reference Prors Lecturer: Mchael I. Jordan Scrbe: Steven Troxler and Wayne Lee In ths lecture, we assume that θ R; n hgher-dmensons, reference

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Uncertainty and auto-correlation in. Measurement

Uncertainty and auto-correlation in. Measurement Uncertanty and auto-correlaton n arxv:1707.03276v2 [physcs.data-an] 30 Dec 2017 Measurement Markus Schebl Federal Offce of Metrology and Surveyng (BEV), 1160 Venna, Austra E-mal: markus.schebl@bev.gv.at

More information

18.1 Introduction and Recap

18.1 Introduction and Recap CS787: Advanced Algorthms Scrbe: Pryananda Shenoy and Shjn Kong Lecturer: Shuch Chawla Topc: Streamng Algorthmscontnued) Date: 0/26/2007 We contnue talng about streamng algorthms n ths lecture, ncludng

More information

LOW BIAS INTEGRATED PATH ESTIMATORS. James M. Calvin

LOW BIAS INTEGRATED PATH ESTIMATORS. James M. Calvin Proceedngs of the 007 Wnter Smulaton Conference S G Henderson, B Bller, M-H Hseh, J Shortle, J D Tew, and R R Barton, eds LOW BIAS INTEGRATED PATH ESTIMATORS James M Calvn Department of Computer Scence

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

Power law and dimension of the maximum value for belief distribution with the max Deng entropy

Power law and dimension of the maximum value for belief distribution with the max Deng entropy Power law and dmenson of the maxmum value for belef dstrbuton wth the max Deng entropy Bngy Kang a, a College of Informaton Engneerng, Northwest A&F Unversty, Yanglng, Shaanx, 712100, Chna. Abstract Deng

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

Notes on Frequency Estimation in Data Streams

Notes on Frequency Estimation in Data Streams Notes on Frequency Estmaton n Data Streams In (one of) the data streamng model(s), the data s a sequence of arrvals a 1, a 2,..., a m of the form a j = (, v) where s the dentty of the tem and belongs to

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

ASYMPTOTIC PROPERTIES OF ESTIMATES FOR THE PARAMETERS IN THE LOGISTIC REGRESSION MODEL

ASYMPTOTIC PROPERTIES OF ESTIMATES FOR THE PARAMETERS IN THE LOGISTIC REGRESSION MODEL Asymptotc Asan-Afrcan Propertes Journal of Estmates Economcs for and the Econometrcs, Parameters n Vol. the Logstc, No., Regresson 20: 65-74 Model 65 ASYMPTOTIC PROPERTIES OF ESTIMATES FOR THE PARAMETERS

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Estimation: Part 2. Chapter GREG estimation

Estimation: Part 2. Chapter GREG estimation Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

U-Pb Geochronology Practical: Background

U-Pb Geochronology Practical: Background U-Pb Geochronology Practcal: Background Basc Concepts: accuracy: measure of the dfference between an expermental measurement and the true value precson: measure of the reproducblty of the expermental result

More information

princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 3: Large deviations bounds and applications Lecturer: Sanjeev Arora

princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 3: Large deviations bounds and applications Lecturer: Sanjeev Arora prnceton unv. F 13 cos 521: Advanced Algorthm Desgn Lecture 3: Large devatons bounds and applcatons Lecturer: Sanjeev Arora Scrbe: Today s topc s devaton bounds: what s the probablty that a random varable

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Time-Varying Systems and Computations Lecture 6

Time-Varying Systems and Computations Lecture 6 Tme-Varyng Systems and Computatons Lecture 6 Klaus Depold 14. Januar 2014 The Kalman Flter The Kalman estmaton flter attempts to estmate the actual state of an unknown dscrete dynamcal system, gven nosy

More information

Statistical inference for generalized Pareto distribution based on progressive Type-II censored data with random removals

Statistical inference for generalized Pareto distribution based on progressive Type-II censored data with random removals Internatonal Journal of Scentfc World, 2 1) 2014) 1-9 c Scence Publshng Corporaton www.scencepubco.com/ndex.php/ijsw do: 10.14419/jsw.v21.1780 Research Paper Statstcal nference for generalzed Pareto dstrbuton

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS Avalable onlne at http://sck.org J. Math. Comput. Sc. 3 (3), No., 6-3 ISSN: 97-537 COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Parametric fractional imputation for missing data analysis

Parametric fractional imputation for missing data analysis Secton on Survey Research Methods JSM 2008 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Wayne Fuller Abstract Under a parametrc model for mssng data, the EM algorthm s a popular tool

More information

Chapter Newton s Method

Chapter Newton s Method Chapter 9. Newton s Method After readng ths chapter, you should be able to:. Understand how Newton s method s dfferent from the Golden Secton Search method. Understand how Newton s method works 3. Solve

More information

Motion Perception Under Uncertainty. Hongjing Lu Department of Psychology University of Hong Kong

Motion Perception Under Uncertainty. Hongjing Lu Department of Psychology University of Hong Kong Moton Percepton Under Uncertanty Hongjng Lu Department of Psychology Unversty of Hong Kong Outlne Uncertanty n moton stmulus Correspondence problem Qualtatve fttng usng deal observer models Based on sgnal

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2)

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2) 1/16 MATH 829: Introducton to Data Mnng and Analyss The EM algorthm (part 2) Domnque Gullot Departments of Mathematcal Scences Unversty of Delaware Aprl 20, 2016 Recall 2/16 We are gven ndependent observatons

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Maxmum Lkelhood Estmaton INFO-2301: Quanttatve Reasonng 2 Mchael Paul and Jordan Boyd-Graber MARCH 7, 2017 INFO-2301: Quanttatve Reasonng 2 Paul and Boyd-Graber Maxmum Lkelhood Estmaton 1 of 9 Why MLE?

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

Goodness of fit and Wilks theorem

Goodness of fit and Wilks theorem DRAFT 0.0 Glen Cowan 3 June, 2013 Goodness of ft and Wlks theorem Suppose we model data y wth a lkelhood L(µ) that depends on a set of N parameters µ = (µ 1,...,µ N ). Defne the statstc t µ ln L(µ) L(ˆµ),

More information

2 STATISTICALLY OPTIMAL TRAINING DATA 2.1 A CRITERION OF OPTIMALITY We revew the crteron of statstcally optmal tranng data (Fukumzu et al., 1994). We

2 STATISTICALLY OPTIMAL TRAINING DATA 2.1 A CRITERION OF OPTIMALITY We revew the crteron of statstcally optmal tranng data (Fukumzu et al., 1994). We Advances n Neural Informaton Processng Systems 8 Actve Learnng n Multlayer Perceptrons Kenj Fukumzu Informaton and Communcaton R&D Center, Rcoh Co., Ltd. 3-2-3, Shn-yokohama, Yokohama, 222 Japan E-mal:

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Transfer Functions. Convenient representation of a linear, dynamic model. A transfer function (TF) relates one input and one output: ( ) system

Transfer Functions. Convenient representation of a linear, dynamic model. A transfer function (TF) relates one input and one output: ( ) system Transfer Functons Convenent representaton of a lnear, dynamc model. A transfer functon (TF) relates one nput and one output: x t X s y t system Y s The followng termnology s used: x y nput output forcng

More information

CS 2750 Machine Learning. Lecture 5. Density estimation. CS 2750 Machine Learning. Announcements

CS 2750 Machine Learning. Lecture 5. Density estimation. CS 2750 Machine Learning. Announcements CS 750 Machne Learnng Lecture 5 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 539 Sennott Square CS 750 Machne Learnng Announcements Homework Due on Wednesday before the class Reports: hand n before

More information

Parameters Estimation of the Modified Weibull Distribution Based on Type I Censored Samples

Parameters Estimation of the Modified Weibull Distribution Based on Type I Censored Samples Appled Mathematcal Scences, Vol. 5, 011, no. 59, 899-917 Parameters Estmaton of the Modfed Webull Dstrbuton Based on Type I Censored Samples Soufane Gasm École Supereure des Scences et Technques de Tuns

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

Hidden Markov Models & The Multivariate Gaussian (10/26/04)

Hidden Markov Models & The Multivariate Gaussian (10/26/04) CS281A/Stat241A: Statstcal Learnng Theory Hdden Markov Models & The Multvarate Gaussan (10/26/04) Lecturer: Mchael I. Jordan Scrbes: Jonathan W. Hu 1 Hdden Markov Models As a bref revew, hdden Markov models

More information

Grover s Algorithm + Quantum Zeno Effect + Vaidman

Grover s Algorithm + Quantum Zeno Effect + Vaidman Grover s Algorthm + Quantum Zeno Effect + Vadman CS 294-2 Bomb 10/12/04 Fall 2004 Lecture 11 Grover s algorthm Recall that Grover s algorthm for searchng over a space of sze wors as follows: consder the

More information

Markov Chain Monte Carlo (MCMC), Gibbs Sampling, Metropolis Algorithms, and Simulated Annealing Bioinformatics Course Supplement

Markov Chain Monte Carlo (MCMC), Gibbs Sampling, Metropolis Algorithms, and Simulated Annealing Bioinformatics Course Supplement Markov Chan Monte Carlo MCMC, Gbbs Samplng, Metropols Algorthms, and Smulated Annealng 2001 Bonformatcs Course Supplement SNU Bontellgence Lab http://bsnuackr/ Outlne! Markov Chan Monte Carlo MCMC! Metropols-Hastngs

More information

Lecture 12: Classification

Lecture 12: Classification Lecture : Classfcaton g Dscrmnant functons g The optmal Bayes classfer g Quadratc classfers g Eucldean and Mahalanobs metrcs g K Nearest Neghbor Classfers Intellgent Sensor Systems Rcardo Guterrez-Osuna

More information

ECE559VV Project Report

ECE559VV Project Report ECE559VV Project Report (Supplementary Notes Loc Xuan Bu I. MAX SUM-RATE SCHEDULING: THE UPLINK CASE We have seen (n the presentaton that, for downlnk (broadcast channels, the strategy maxmzng the sum-rate

More information

LECTURE 9 CANONICAL CORRELATION ANALYSIS

LECTURE 9 CANONICAL CORRELATION ANALYSIS LECURE 9 CANONICAL CORRELAION ANALYSIS Introducton he concept of canoncal correlaton arses when we want to quantfy the assocatons between two sets of varables. For example, suppose that the frst set of

More information

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1 Random varables Measure of central tendences and varablty (means and varances) Jont densty functons and ndependence Measures of assocaton (covarance and correlaton) Interestng result Condtonal dstrbutons

More information

Small Area Interval Estimation

Small Area Interval Estimation .. Small Area Interval Estmaton Partha Lahr Jont Program n Survey Methodology Unversty of Maryland, College Park (Based on jont work wth Masayo Yoshmor, Former JPSM Vstng PhD Student and Research Fellow

More information

The Order Relation and Trace Inequalities for. Hermitian Operators

The Order Relation and Trace Inequalities for. Hermitian Operators Internatonal Mathematcal Forum, Vol 3, 08, no, 507-57 HIKARI Ltd, wwwm-hkarcom https://doorg/0988/mf088055 The Order Relaton and Trace Inequaltes for Hermtan Operators Y Huang School of Informaton Scence

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Application of B-Spline to Numerical Solution of a System of Singularly Perturbed Problems

Application of B-Spline to Numerical Solution of a System of Singularly Perturbed Problems Mathematca Aeterna, Vol. 1, 011, no. 06, 405 415 Applcaton of B-Splne to Numercal Soluton of a System of Sngularly Perturbed Problems Yogesh Gupta Department of Mathematcs Unted College of Engneerng &

More information

EEE 241: Linear Systems

EEE 241: Linear Systems EEE : Lnear Systems Summary #: Backpropagaton BACKPROPAGATION The perceptron rule as well as the Wdrow Hoff learnng were desgned to tran sngle layer networks. They suffer from the same dsadvantage: they

More information

The EM Algorithm (Dempster, Laird, Rubin 1977) The missing data or incomplete data setting: ODL(φ;Y ) = [Y;φ] = [Y X,φ][X φ] = X

The EM Algorithm (Dempster, Laird, Rubin 1977) The missing data or incomplete data setting: ODL(φ;Y ) = [Y;φ] = [Y X,φ][X φ] = X The EM Algorthm (Dempster, Lard, Rubn 1977 The mssng data or ncomplete data settng: An Observed Data Lkelhood (ODL that s a mxture or ntegral of Complete Data Lkelhoods (CDL. (1a ODL(;Y = [Y;] = [Y,][

More information

Maximum Likelihood Estimation (MLE)

Maximum Likelihood Estimation (MLE) Maxmum Lkelhood Estmaton (MLE) Ken Kreutz-Delgado (Nuno Vasconcelos) ECE 175A Wnter 01 UCSD Statstcal Learnng Goal: Gven a relatonshp between a feature vector x and a vector y, and d data samples (x,y

More information

Solutions Homework 4 March 5, 2018

Solutions Homework 4 March 5, 2018 1 Solutons Homework 4 March 5, 018 Soluton to Exercse 5.1.8: Let a IR be a translaton and c > 0 be a re-scalng. ˆb1 (cx + a) cx n + a (cx 1 + a) c x n x 1 cˆb 1 (x), whch shows ˆb 1 s locaton nvarant and

More information

ENG 8801/ Special Topics in Computer Engineering: Pattern Recognition. Memorial University of Newfoundland Pattern Recognition

ENG 8801/ Special Topics in Computer Engineering: Pattern Recognition. Memorial University of Newfoundland Pattern Recognition EG 880/988 - Specal opcs n Computer Engneerng: Pattern Recognton Memoral Unversty of ewfoundland Pattern Recognton Lecture 7 May 3, 006 http://wwwengrmunca/~charlesr Offce Hours: uesdays hursdays 8:30-9:30

More information

Singular Value Decomposition: Theory and Applications

Singular Value Decomposition: Theory and Applications Sngular Value Decomposton: Theory and Applcatons Danel Khashab Sprng 2015 Last Update: March 2, 2015 1 Introducton A = UDV where columns of U and V are orthonormal and matrx D s dagonal wth postve real

More information

Convergence of random processes

Convergence of random processes DS-GA 12 Lecture notes 6 Fall 216 Convergence of random processes 1 Introducton In these notes we study convergence of dscrete random processes. Ths allows to characterze phenomena such as the law of large

More information

Homework Assignment 3 Due in class, Thursday October 15

Homework Assignment 3 Due in class, Thursday October 15 Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.

More information

NON-CENTRAL 7-POINT FORMULA IN THE METHOD OF LINES FOR PARABOLIC AND BURGERS' EQUATIONS

NON-CENTRAL 7-POINT FORMULA IN THE METHOD OF LINES FOR PARABOLIC AND BURGERS' EQUATIONS IJRRAS 8 (3 September 011 www.arpapress.com/volumes/vol8issue3/ijrras_8_3_08.pdf NON-CENTRAL 7-POINT FORMULA IN THE METHOD OF LINES FOR PARABOLIC AND BURGERS' EQUATIONS H.O. Bakodah Dept. of Mathematc

More information

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE STATISTICA, anno LXXV, n. 4, 015 USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE Manoj K. Chaudhary 1 Department of Statstcs, Banaras Hndu Unversty, Varanas,

More information

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE Analytcal soluton s usually not possble when exctaton vares arbtrarly wth tme or f the system s nonlnear. Such problems can be solved by numercal tmesteppng

More information

Topic- 11 The Analysis of Variance

Topic- 11 The Analysis of Variance Topc- 11 The Analyss of Varance Expermental Desgn The samplng plan or expermental desgn determnes the way that a sample s selected. In an observatonal study, the expermenter observes data that already

More information

Retrieval Models: Language models

Retrieval Models: Language models CS-590I Informaton Retreval Retreval Models: Language models Luo S Department of Computer Scence Purdue Unversty Introducton to language model Ungram language model Document language model estmaton Maxmum

More information

Chapter 4: Regression With One Regressor

Chapter 4: Regression With One Regressor Chapter 4: Regresson Wth One Regressor Copyrght 2011 Pearson Addson-Wesley. All rghts reserved. 1-1 Outlne 1. Fttng a lne to data 2. The ordnary least squares (OLS) lne/regresson 3. Measures of ft 4. Populaton

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

Lecture 10 Support Vector Machines II

Lecture 10 Support Vector Machines II Lecture 10 Support Vector Machnes II 22 February 2016 Taylor B. Arnold Yale Statstcs STAT 365/665 1/28 Notes: Problem 3 s posted and due ths upcomng Frday There was an early bug n the fake-test data; fxed

More information

Relevance Vector Machines Explained

Relevance Vector Machines Explained October 19, 2010 Relevance Vector Machnes Explaned Trstan Fletcher www.cs.ucl.ac.uk/staff/t.fletcher/ Introducton Ths document has been wrtten n an attempt to make Tppng s [1] Relevance Vector Machnes

More information

Population Design in Nonlinear Mixed Effects Multiple Response Models: extension of PFIM and evaluation by simulation with NONMEM and MONOLIX

Population Design in Nonlinear Mixed Effects Multiple Response Models: extension of PFIM and evaluation by simulation with NONMEM and MONOLIX Populaton Desgn n Nonlnear Mxed Effects Multple Response Models: extenson of PFIM and evaluaton by smulaton wth NONMEM and MONOLIX May 4th 007 Carolne Bazzol, Sylve Retout, France Mentré Inserm U738 Unversty

More information

Hidden Markov Models

Hidden Markov Models Hdden Markov Models Namrata Vaswan, Iowa State Unversty Aprl 24, 204 Hdden Markov Model Defntons and Examples Defntons:. A hdden Markov model (HMM) refers to a set of hdden states X 0, X,..., X t,...,

More information

Expectation Maximization Mixture Models HMMs

Expectation Maximization Mixture Models HMMs -755 Machne Learnng for Sgnal Processng Mture Models HMMs Class 9. 2 Sep 200 Learnng Dstrbutons for Data Problem: Gven a collecton of eamples from some data, estmate ts dstrbuton Basc deas of Mamum Lelhood

More information