Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks

Size: px
Start display at page:

Download "Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks"

Transcription

1 Combiig Microarrays ad Biological Kowledge for Estimatig Gee Networks via Bayesia Networks Seiya Imoto 1, Tomoyuki Higuchi 2, Takao Goto 1, Kousuke Tashiro 3, Satoru Kuhara 3 ad Satoru Miyao 1 1 Huma Geome Ceter, Istitute of Medical Sciece, Uiversity of Tokyo Shirokaedai, Miato-ku, Tokyo, , Japa fimoto, takao, miyaog@ims.u-tokyo.ac.jp 2 The Istitute of Statistical Mathematics, 4-6-7, Miami-Azabu, Miato-ku, Tokyo, , Japa higuchi@ism.ac.jp 3 Graduate School of Geetic Resources Techology, Kyushu Uiversity Hakozaki, Higashi-ku, Fukuoka, , Japa fktashiro, kuharag@grt.kyushu-u.ac.jp Abstract We propose a statistical method for estimatig a gee etwork based o Bayesia etworks from microarray gee expressio data together with biological kowledge icludig protei-protei iteractios, protei-dna iteractios, bidig site iformatio, existig literature ad so o. Ufortuately, microarray data do ot cotai eough iformatio for costructig gee etworks accurately i may cases. Our method adds biological kowledge to the estimatio method of gee etworks uder a Bayesia statistical framework, ad also cotrols the trade-off betwee microarray iformatio ad biological kowledge automatically. We coduct Mote Carlo simulatios to show the effectiveess of the proposed method. We aalyze Saccharomyces cerevisiae gee expressio data as a applicatio. 1. Itroductio I recet years, a large amout of gee expressio data has bee collected ad estimatig a gee etwork has become oe of the cetral topics i the field of bioiformatics. Several methodologies have bee proposed for costructig a gee etwork based o gee expressio data, such as Boolea etworks [1, 2, 32, 42], differetial equatio models [7, 10, 11, 32] ad Bayesia etworks [13, 14, 17, 18, 20, 22, 23, 37]. Mai drawback for the gee etwork costructio from microarray data is that while the gee etwork cotais a large umber of gees, the iformatio cotaied i gee expressio data is limited by the umber of microarrays, their quality, the experimetal desig, oise, ad measuremet errors. Therefore, estimated gee etworks cotai some icorrect gee regulatios, which caot be evaluated from a biology viewpoit. I particular, the directio of gee regulatio is difficult to decide usig gee expressio data oly. Hece, the use of biological kowledge, icludig protei-protei ad protei-dna iteractios [3, 5, 16, 21, 25], sequeces of the bidig site of the gees cotrolled by trascriptio regulators [31, 40, 47], literature ad so o, are cosidered to be a key for microarray data aalysis. The use of biological kowledge has previously received cosiderable attetio for extractig more iformatio from microarray data [4, 6, 18, 33, 36, 38, 41]. I this paper, we provide a geeral framework for combiig microarray data ad biological kowledge aimed at estimatig a gee etwork by usig a Bayesia etwork model. If the gee regulatio mechaisms are completely kow, we ca model the gee etwork easily. However, may parts of the true gee etwork are still ukow ad eed to be estimated from data. Hece, it is ecessary to costruct a suitable criterio for evaluatig estimated gee

2 etworks i order to obtai a optimal etwork. While criteria proposed previously for evaluatig a Bayesia etwork model oly measure the closeess betwee a model ad microarray data, we derive a criterio for selectig etworks based o microarray data ad biological kowledge. The proposed criterio is coducted by two compoets: Oe shows the fitess of the model to the microarray data ad the other reflects biological kowledge, which is modeled uder a probabilistic framework. Our proposed method automatically tues the balace betwee the biological kowledge ad microarray data based o our criterio ad estimates a gee etwork from the combied data. I Sectio 2.1, we describe our statistical model for costructig gee etworks ad itroduce a criterio for evaluatig etworks i Sectio 2.2. A statistical framework for represetig biological kowledge is described i Sectio 2.3. I Sectio 2.4, we illustrate how to model various types of biological kowledge i practice. Mote Carlo simulatios, i Sectio 3.1, are coducted to show the effectiveess of the proposed method. We apply our method to Saccharomyces cerevisiae gee expressio data i Sectio Method for Estimatig Gee Networks 2.1. Bayesia etwork ad oparametric heteroscedastic regressio model Bayesia etworks [26] are a type of graphical models for capturig complex relatioships amog a large amout of radom variables by the directed acyclic graph ecodig the Markov assumptio. I the cotext of Bayesia etworks, a gee correspods to a radom variable show as a ode, while gee regulatios are show by directed edges. Thus gee iteractios are modeled by the coditioal distributio of each gee. We use Bayesia etwork ad oparametric heteroscedastic regressio models [23] for costructig gee etworks from microarray data. Suppose that we have sets of microarrays fx 1 ;:::;x g of p gees, where x i = (x i1 ; :::; x ip ) T is a p dimesioal gee expressio vector obtaied by ith microarray. Here, x ij is a expressio value of jth gee, deoted by gee j, measured by ith microarray after required ormalizatios ad trasformatio [39]. Ordiary, x ij is give by log 2 (R ij =G ij ),wherer ij ad G ij are ormalized itesities of Cy5 ad Cy3 for gee j measured by ith microarray. The iteractio betwee gee j ad its parets is modeled by the oparametric additive regressio model [19] with heterogeeous error variaces x ij = m j1 (p (j) i1 )+ + m jqj (p(j) iqj )+" ij; where p (j) ik is the expressio value of kth paret of gee j measured by ith microarray ad " ij depeds idepedetly ad ormally o mea 0 ad variace ff 2 ij. Here, m jk ( ) is a smooth fuctio costructed by B-splies [9, 12, 24] of the form Mjk X m jk (p (j) ik )= m=1 fl (j) mk b(j) mk (p(j) ik ); where fb (j) 1k ( );:::;b(j) Mjk;k ( )g is a prescribed set of B- splies ad fl (j) mk are parameters. Hece, a Bayesia etwork ad oparametric heteroscedastic regressio model ca be represeted as f(x i ; G )= py j=1 f j (x ij jp ij ; j ) for i = 1;:::;,where G is a parameter vector ad f j (x ij jp ij ; j ) is a desity of Gaussia distributio with mea m j1 (p (j) i1 )+ + m jqj (p(j) iqj ) ad variace ff2 ij. If gee j has o paret gees, we use μ j ad ffj 2 istead of m j1 (p (j) i1 )+ + m jqj (p(j) iqj ) ad ff2 ij, respectively. This model has several advatages. Ulike Boolea etworks ad discrete Bayesia etworks [13, 14, 17, 18, 20, 37], o discretizatio of gee expressio data, which leads to iformatio loss, is required. Secod, eve oliear relatioships betwee gees are automatically extracted based o gee expressio data Criterio for evaluatig etworks Some gee etworks are partially kow, but may mechaisms of gee regulatios are still ukow. Therefore we eed to estimate ukow structures of the gee etwork from the data. Hece, the costructio of a suitable criterio for measurig the closeess betwee a estimated gee etwork ad the true oe is a essetial problem for statistical gee etwork modelig. Followig the result of Imoto et al. [23], a criterio for evaluatig a estimated gee etwork ca be derived from Bayes approach. At first, we briefly itroduce the derivatio of their criterio. We the explai how exted their criterio for combiig microarray data ad biological kowledge. Whe we costruct a gee etwork G by usig a Bayesia etwork model, the posterior probability of the etwork is obtaied as the product of prior probability of the etwork ß(G) ad the margial likelihood divided by the ormalizig costat. After droppig the ormalizig costat, the posterior probability of the etwork is proportioal to ß(G) Z Y i=1 f (x i ; G )ß( G j )d G ;

3 where ß( G j ) is a prior distributio o the parameter vector G with hyperparameter vector satisfyig log ß( G j ) =O(). The essetial problem for costructig a criterio based o the posterior probability of the etwork is how to compute the margial likelihood give by a high dimesioal itegral. Imoto et al. [23] used the Laplace approximatio for itegrals [8, 30, 45] ad derived a criterio, amed BNRC hetero (Bayesia etwork ad Noparametric heteroscedastic Regressio Criterio), of the form where BNRChetero(G) = 2logß(G) +log l ( G jx) = 1 fi fi fi fi fi 2ß J (^ G ) fi 2l (^ G jx); X i=1 J ( G fl ( G T G log f(x i ; G )+ 1 log ß( Gj ); ad ^ G is the mode of l ( G jx). Suppose that the prior distributio ß( G j ) is factorized as ß( G j ) = Y j;k ß jk (fl jk j jk ); where fl jk =(fl (j) 1k ;:::;fl(j) Mjk;k )T is a parameter vector ad jk is a hyperparameter. We use a sigular M jk variate ormal distributio as the prior distributio o fl jk, 2ß ß jk (fl jk j jk )= jk jk j jk (Mjk 2)=2 1=2 + exp jk 2 flt jkk jk fl jk ; where K jk is a M jk M jk symmetric positive P semidefiite matrix satisfyig fl T jk K Mjk jkfl jk = ff=3 (fl(j) ffk 2fl (j) ff 1;k + fl(j) ff 2;k )2. The we P have the decompositio p BNRChetero = 2logß(G) + j=1 BNRC(j) hetero.here BNRC (j) hetero is a score for gee j ad give by BNRC (j) hetero + X qjx i=1 k=1 = ( qjx k=1 M jk + 1) log( 2ß ) log w ij + log(2ß ^ff 2 j )+ flog jλjkj M jk log(^ff 2 j )g L 3 ={1} gee gee 1 2 L 4 ={2} U 13 gee 3 gee 4 L 5 ={3,4} U 35 gee 5 U 24 U 45 Figure 1. A gee etwork ad its eergy. The idex sets L 3, L 4 ad L 5 are illustrated ad L 1 ad L 2 are defied by empty sets. The local eergies are E 3 = U 13, E 4 = U 24 ad E 5 = U 35 + U 45. The total eergy of this etwork is E= E 3 + E 4 + E 5 = U 13 + U 24 + U 35 + U 45. log(2^ff 2 j ) log jk jkj + Xqj + k=1 f(m jk 2) log ψ! 2ß ^ff j 2 fi jk fi jk + ^fl T ^ff jkk j 2 jk ^fl jk g; where w ij ;i = 1; :::; are weights of the heterogeeous error variace ffij 2 = w ij 1 ff2 j ad Λj = Bjk T W jb jk +fi jk K jk with B jk = (b jk (p (j) 1k ); :::; b jk(p (j) k ))T, b jk (p (j) ik ) = (b(j) 1k (p(j) ik ); :::; b(j) Mjk;k (p(j) ik ))T, W j = diag(w 1j ; :::; w j ) ad fi jk = ffj 2 jk. The details of the parameter estimatio are described i Imoto et al. [23] Addig biological kowledge The criterio BNRC hetero (G), itroduced i the previous sectio, cotais two quatities: the prior probability ß(G) of the etwork, ad the margial likelihood of the data. The margial likelihoodshows the fitess of the model to the microarray data. The biological kowledge ca the be added ito the prior probability of the etwork ß(G). Let U ij be the iteractio eergy of the edge from gee i to gee j ad let U ij be categorized ito I values, H 1 ; :::; H I, based o biological kowledge. For example, if we kow a priori gee i regulates gee j,wesetu ij = H 1. However, if we do ot kow whether gee k regulates gee l or ot, we set U kl = H 2. Note that 0 <H 1 <H 2. The

4 total eergy of the etwork G ca the be defied as E(G) = X fi;jg2g U ij ; where the sum is take over the existig edges i the etwork G. Uder the Bayesia etwork framework, the total eergy ca be decomposed ito the sum of the local eergies px X px E(G) = U ij = j=1 i2lj j=1 E j ; (1) where L j is a idex set of parets of gee j ad E j = P i2lj U ij is a local eergy defied by gee j ad its parets. Figure 1 shows a example of a gee etwork ad its eergy. The probability of a etwork G, ß(G), is aturally modeled by the Gibbs distributio [15] ß(G) =Z 1 expf E(G)g; (2) where (> 0) is a hyperparameter ad Z is a ormalizig costat called the partitio fuctio X Z = expf E(G)g: G2G Here G is the set of possible etworks. By replacig H 1 ; :::; H I with 1 ;:::; I, respectively, the ormalizig costat Z is a fuctio of 1 ; :::; I. We call j a iverse ormalized temperature. By substitutig (1) ito (2), we have ß(G) = Z 1 py j=1 = Z 1 py expf E j g Y exp( ff(i;j) ); j=1 i2lj with ff(i; j) = k for U ij = H k. Hece, by addig biological kowledge ito the prior probability of the etwork, BNRC hetero ca be rewritte as BNRChetero(G; 1 ; :::; I )=2logZ px X + f2 j=1 i2lj ff(i;j) +BNRC (j) heterog: (3) We ca choose a optimal etwork uder the give 1 ; :::; I. Also the optimal values of 1 ; :::; I are obtaied as the miimizer of (3). Therefore, we ca represet a algorithm for estimatig a gee etwork from microarray data ad biological kowledge as follows: Step1: Set the values 1 ; :::; I. Step2: Estimate a gee etwork by miimizig BNRChetero(G) uder the give 1 ;:::; I. Step3: Repeat Step1 ad Step2 agaist the cadidate values of 1 ; :::; I. Step4: A optimal gee etwork is obtaied from the cadidate etworks obtaied i Step3. I Step2, we use the greedy hill-climbig algorithm for learig etworks. The details are show i Imoto et al. [23]. Note that the proposed prior probabilityof the etwork ca be used for other types of Bayesia etwork models, such as discrete Bayesia etworks ad dyamic Bayesia etworks [29, 34, 36, 43]. The computatio of partitio fuctio, Z, is itractable eve for moderately sized gee etworks. To avoid this problem, we compute upper ad lower bouds of the partial fuctio ad use them for choosig the optimal values of 1 ;:::; I. A upper boud is obtaied by directed graphs, which are allowed to cotai cyclic graphs. Thus the true value of the partitio fuctio is ot greater tha the upper boud. A lower boud is computed by multi-level directed graphs with followig assumptios: (A1) There is oe top gee ad (A2) Gees at the same level have a commo paret gee that is located o oe upper level of them. We also cosider joied graphs of some multi-level directed graphs satisfyig (A1) ad (A2). Sice the umber of possible graphs is much larger tha those icluded i the computatio, the true value of the partitio fuctio should be greater tha the lower boud. Sice the optimizatio of the etwork structure for fixed 1 ; :::; I does ot deped o the value of the partitio fuctio, our method works well i practice. Of course, whe the umber of gees is small, we ca perform a exhaustive search ad compute the partitio fuctio completely. However, we thik that the developmet of a effective algorithm to eumerate all possible etworks or approximate the partitio fuctio is a importat problem Prior desig for various biological kowledge I this subsectio, we show some examples of biological kowledge ad how to iclude them ito the prior probability i practice. We cosider usig two values 1 ad 2 satisfyig 0 < 1 < 2 for represetig biological kowledge. Basically, we allocate 1 to a kow relatioship ad 2 otherwise. The prior iformatio ca be summarized as a p p matrix U whose (i; j) elemet, u ij, correspods to 1 or 2. Protei-protei iteractios The umber of kow protei-protei iteractios is rapidly icreasig ad kept i some public databases such

5 g 1 = " 1 ; g 2 = :7g 1 + " 2 g 5 = :7g 1 + " 5 ; g 10 =1=f1 + exp( 4g 3 )g + " 10 g 3 = ( 1 +" 3 (g 1» :5) g 1 + " 3 (jg 1 j <:5) 1+" 3 (g 1 :5) g 6 = ( :8g 3 + " 6 (g 3» 1) (g 3 +1) 1:5 + " 6 ( 1 < jg 3 j < 0) 1+" 6 (g 3 1) g 4 = :4g1 +1+" 4 (jg 1 j»:3) (g 1 +1) 2 + " 4 (jg 1 j <:3) g 8 = :2g3 1+" 8 (g 3» :2) 1:4g 3 + " 8 (g 3 >:2) g 11 = :7g 6 + " 11 ; g 14 = :7g 6 + " 14 ; g 15 =1=f1 +exp( 4g 8 )g + " 15 :4g3 +1+" 9 (jg 3 j»:3) g 9 = (g 3 +1) 1:2 + " 9 (jg 3 j <:3) g 12 = ( 1 +" 12 (g 6 < :5) g 6 + " 12 (jg 6 j»:5) 1+" 12 (jg 6 j >:5) g 13 = :4g6 +1+" 13 (jg 6 j»:3) (g 6 +1) 2 + " 13 (jg 6 j <:3) g 16 = :8g 8 + " 16 g 19 =1=f1 + exp( 4g 10 )g + " 19 g 20 =1:1g 10 + " 20 (a) g 17 = :2g8 1+" 17 (g 8» :2) 1:4g 8 + " 17 (g 8 >:2) g 18 = :4g8 +1(jg 8 j >:3 (g 8 +1) 1:2 (g 8» :3) (b) Figure 2. Artificial gee etwork ad fuctioal structures betwee odes. as GRID [16] ad BIND [3, 5]. Protei-protei iteractios show at least two proteis form a complex. Therefore, represetig protei-protei iteractios by a directed graph is ot suitable. However, they ca be icluded i our method. If we kow gee i ad gee j create a protei-protei iteractio, we set u ij = u ji = 1.Isuch a case, we will decide whether we make a virtual ode correspodig to a protei complex theoretically [35]. Protei-DNA iteractios Protei-DNA iteractios show gee regulatios by trascriptio factors ad ca be modeled more easily tha protei-protei iteractios. Whe gee i is a trascriptio regulator ad cotrols gee j,wesetu ij = 1 ad u ji = 2. Sequeces Gees that are cotrolled by a trascriptio regulator might have a cosesus motif i their promoter DNA sequeces. If gee j1,...,gee j have a cosesus motif ad are cotrolled by gee i,wesetu ij1 = = u ij = 1 ad u j1i = = u j i = 2. Previously, cosesus motifs were ofte used for the evaluatio of estimated gee etworks from a biological viewpoit. This iformatio, however, ca be itroduced directly ito our method. Oe straightforward way is the use of kow regulatory motifs kept i public databases such as SCPD [40] ad YTF [47]. As for a advaced method, Tamada et al. [44] proposed a method for simultaeously estimatig a gee etwork ad detectig regulatory motifs based o our method, ad succeeded i estimatig a accurate gee etwork ad detectig a true regulatory motif. Gee etworks ad pathways The iformatio of gee etworks ca be itroduced directly ito our method by trasformig the prescribed etworkstructuresitothe matrix U. We ca the estimate ageeetworkbasedou ad microarray data. Our method also ca use gee etworks estimated by other techiques such as boolea etworks, differetial equatio models, ad so o. Also, some databases, such as KEGG [28], cotai several kow gee etworks ad pathways. This iformatio ca be used similarly. Literature Some research has bee performed to extract iformatio from a huge amout of literature [27]. Literature cotai various kids of iformatio icludig biological kowledge described above. So we ca model literature iformatio i the same way. 3. Computatioal Experimets 3.1. Mote Carlo simulatios Before aalyzig real gee expressio data, we perform Mote Carlo simulatios to examie the properties of the proposed method. We assume a artificial etwork with 20 odes show i Figure 2 (a). The fuctioal relatioships betwee odes are listed i Figure 2 (b). A etwork will be rebuilt from simulated data cosistig of 50 or 100 observatios, which correspods to 50 or 100 microarrays. As for the biological kowledge, we tried the followig situatios: (Case 1) we kow some gee regulatios (100%, 75%, 50% or 25% out of 19 edges show i Figure 2 (a)) ad (Case 2) we kow some gee regulatios, but some (1, 2, or 3) icorrect edges are kept i the database. The cadidate values of 1 ad 2 are f0:5; 1:0g ad f 1 ; 2:5; 5:0; 7:5; 10:0g, respectively.

6 With kowledge Without kowledge Appear i both methods True edge 1 BNRC hetero Figure 3. The behavior of BNRC hetero whe 1 = 0.5. We ca fid out the optimal iverse ormalized temperature 2 is 5.0. Figure 4. A example of resultig etworks based o 100 samples. We used 1 =0.5ad 2 = 5.0 that are selected by our criterio (see Figure 3). Figure 4 shows two estimated etworks: Oe is estimated by 100 observatios (microarrays) aloe. We use 1 = 2 = 0:5, i.e. we did ot use ay kowledge (we deote this etwork by N 0 for coveiece). The other is estimated by 100 observatios ad prior iformatio of 75% gee regulatios, i.e. we kow 14 correct relatios out of the all 19 correct edges (we deote this etwork by N 1 ). Edges appearig i both etworks are colored gree, while edges appearig i N 0 or N 1 oly are colored blue ad red, respectively. By addig prior kowledge, it is clear that we succeeded i reducig the umber of false positives. We also fid additioal four correct relatioships. Figure 3 shows the behavior of BNRC hetero whe 1 =0:5. Wefid that the optimal value of 2 is 5.0. From the Mote Carlo simulatios, we observed that 2 ca be selected by usig middle values (depicted by a blue lie) of upper ad lower bouds or upper bouds i practice. For the selectio of 1, we use the middle value of the upper ad lower bouds of the score of our criterio. The results of the Mote Carlo simulatios are summarized as follows: I (Case 1), we obtaied etworks more accurately as log as we add correct kowledge. We observed that the umber of false positives decreased drastically. We presume the reaso is the ature of directed acyclic graphs. Sice a Bayesia etwork model is a directed acyclic graph, oe icorrect estimate may affect the relatios i its eighborhood. However, by addig some correct kowledge, we ca restrict the search space of the Bayesia etwork model learig effectively. I (Case 2), the results deped o the type of icorrect kowledge. (i) If we use misdirected relatios, e.g. gee 8! gee 3,as prior kowledge, serious problems occur. Sice microarray data to some degree support the misdirected relatios, they ted to receive a better criterio score. (ii) If we add idirect relatios such as gee 1! gee 8,we observed that our method cotrolled the balace betwee this prior iformatio ad microarray data ad could decide whether the prior relatio is true. (iii) If irrelevat relatios such as gee 20! gee 5 are added as prior iformatio, we observed that our method could reject these prior iformatio, because, the microarray data do ot support these relatios Example usig experimetal data I this subsectio, we demostrate our method by aalyzig Saccharomyces cerevisiae gee expressio data obtaied by disruptig 100 gees, which are almost all trascriptio factors. We focus o five gees, MCM1, SWI5, ACE2, SNF2 ad STE12 (see Table 1) ad extract gees that are regulated by these 5 gees from the Yeast Proteome Database [46]. Thus, we costruct a prior etwork show i Figure 5, based o the database iformatio. We iclude the prior etwork i our Bayesia etwork estimatio method.

7 MCM1 : trascriptio factor of the MADS box family MET14, CDC6, MET2, CDC5, MET6, SIC1, STE6, CLN2, PCL2, STE2, ACE2, MET16, MET3, MET4, CAR1, SWI5, PCL9, CLB1, MET17, EGT2, ARG5,6, PMA1, RME1, CLB2 SWI5 : trascriptio factor CDC6, SIC1, CLN2, PCL2, PCL9, EGT2, RME1, CTS1, HO ACE2 : metallothioei expressio activator SNF2 : CLN2, EGT2, HO, CTS1, RME1 compoet of SWI/SNF global trascriptio activator complex CTS1, HO STE12 : trascriptioal activator STE6, FAR1, KAR3, SST2, FUS1, STE2, BAR1, AGA1, AFR1, CIK1 Table 1. Five trascriptio factors ad their regulatig gees. SWI5 YDR146C PCL9 YDL179W PCL2 YDL127W SIC1 YLR079W MCM1 YMR043W SWI5 YDR146C PCL9 YDL179W PCL2 YDL127W SIC1 YLR079W MCM1 YMR043W ACE2 YLR131C RME1 YGR044C CLN2 YPL256C EGT2 YNL327W CDC6 YJL194W CTS1 YLR286C HO YDL227C CAR1 YPL111W MET14 YKL001C MET2 YNL277W ARG5,6 YER069W CDC5 YMR001C PMA1 YGL008C CLB1 YGR108W SNF2 YOR290C CLB2 YPR119W BAR1 YIL015W FUS1 YCL027W MET3 YJR010W MET16 YPR167C AGA1 YNR044W MET6 YER091C MET4 YNL103W MET17 YLR303W FAR1 YJL157C KAR3 YPR141C AFR1 YDR085C CIK1 YMR198W STE2 YFL026W STE6 YKL209C STE12 YHR084W CLN2 YPL256C EGT2 RME1 YNL327W YGR044C CDC6 YJL194W ACE2 YLR131C CTS1 YLR286C HO YDL227C MET2 MET17 YNL277W YLR303W MET3 YJR010W CDC5 YMR001C CAR1 MET16 YPL111W YPR167C CLB1 MET4 YGR108W YNL103W PMA1 YGL008C MET14 YKL001C ARG5,6 YER069W CLB2 MET6 YPR119W YER091C AGA1 YNR044W FAR1 BAR1 YJL157C YIL015W KAR3 SNF2 YPR141C YOR290C FUS1 CIK1 YCL027W YMR198W AFR1 YDR085C STE2 YFL026W STE6 YKL209C STE12 YHR084W Figure 5. Prior kowledge etwork. The gees that are i each shadowed circle are regulated by the paret gees. Figure 6. Resultig etwork based o microarray oly. That is, the purpose of this aalysis is to estimate the gee etwork cotaiig above 36 gees from microarray data together with the prior etwork. Figure 6 shows the estimated gee etwork usig microarray data oly. There are may o-prior edges ad may of them are probably false positives. I additio, we fid three misdirected relatios: SWI5! MCM1, HO! ACE2 ad STE6! STE12. By addig the prior etwork, we obtai the gee etwork show i Figure 8. As for the iverse ormalized temperatures 1 ad 2,weset 1 = 0:5 ad choose the optimal value of 2. We also estimated a gee etwork based o 1 =1ad foud the results described below to be essetially uchaged. Figure 7 shows the behavior of BNRC hetero with respect to 2. We fid that the optimal value of 2 is 2.5. Figure 8 shows the resultig etwork based o microarray data ad the biological kowledge represeted by the prior etwork i Figure 5. We show the edges that correspod to the prior kowledge i black. The edges betwee gees that are regulated by the same trascriptio factor i the prior etwork are show i blue. The red edges do ot correspod to the prior kowledge. I particular, we fid that the relatioships aroud MCM1 improve drastically. The etwork based o microarray oly (Figure 6) idicates that oly SIC1 ad ACE2 are regulated by MCM1. Note that the uderlied gees correspod to the prior etwork iformatio. After addig the prior kowledge ad optimizig the iverse ormalized temperatures, we fid that 10 gees out of 24 gees that are listed as co-regulated gees of MCM1 i Table 1 are extracted. Also, the relatioships aroud STE12

8 BNRC hetero SWI5 YDR146C ACE2 YLR131C RME1 YGR044C CLN2 YPL256C CDC6 YJL194W PCL9 YDL179W PCL2 YDL127W SIC1 YLR079W EGT2 YNL327W CTS1 YLR286C HO YDL227C CAR1 YPL111W MET14 YKL001C PMA1 YGL008C SNF2 YOR290C CLB2 YPR119W MET2 YNL277W ARG5,6 YER069W CDC5 YMR001C CLB1 YGR108W BAR1 YIL015W MET3 YJR010W MET16 YPR167C AGA1 YNR044W FUS1 YCL027W MET17 YLR303W MET6 YER091C MET4 YNL103W FAR1 YJL157C KAR3 YPR141C AFR1 YDR085C CIK1 YMR198W MCM1 YMR043W STE2 YFL026W STE6 YKL209C STE12 YHR084W Figure 7. Optimizatio of 2. We ca fid out that the optimal value of 2 is 2.5. Figure 8. Resultig etwork based o microarray data ad biological kowledge. The iverse ormalized temperatures are selected by our criterio ( 1 =0.5, 2 =2.5). become clearer. Before addig prior kowledge, the estimated etwork i Figure 6 suggests FUS1, AFR1, KAR3, BAR1, MET4, MET16 ad MCM1 are regulated by STE12, while STE12 is cotrolled by HO, STE6 ad MET3. O the other had, the etwork i Figure 8 shows that STE12 regulates FUS1, AFR1, KAR3, CIK1, STE2, STE6, HO ad MCM1. Note that the three misdirected relatios described above are corrected i Figure 8. The differece betwee the iverse ormalized temperatures 1 = 0:5 ad 2 = 2:5 is small, because the score of the criterio is added as 2 1 or 2 2, whe we add a edge that is listed or ot listed i the prior etwork, respectively. Therefore, microarray data cotai this iformatio ad we succeeded i extractig this iformatio with the slight help of the prior etwork. We optimized the iverse ormalized temperature 2 based o the proposed criterio. From the etwork based o the optimal iverse ormalized temperatures, we ca fid the gap betwee microarray data ad biological kowledge. By comparig Figure 6 with Figure 8, we fid that the microarray data reflect the relatioship betwee seve gees (CLN2, RME1, CDC6, EGT2, PCL2, PCL9 ad SIC1) ad two trascriptio factors (MCM1 ad SWI5). O the other had, we fid that there are somewhat large differeces betwee microarray data ad the prior etwork for the relatioship betwee MCM1 ad the thirtee gees that are i the biggest circle. 4. Discussio I this paper we proposed a geeral framework for combiig microarray data ad biological kowledge aimed at estimatig a gee etwork. A advatage of our method is the balace betwee microarray iformatio ad biological kowledge is optimized by the proposed criterio. By addig biological kowledge ito our Bayesia etwork estimatio method, we succeeded i extractig more iformatio from microarray data ad estimatig the gee etwork more accurately. We believe that the combiatio of microarray data ad biological kowledge gives a ew perspective for uderstadig the systems of livig creatures. We cosider the followig problems as our future works: (1) I the real applicatio, we demostrated how to use the gee etwork that is obtaied biologically as a prior kowledge. There are various types of biological kowledge we listed i Sectio 2.4. It is a very importat problem how to use such kowledge together with microarray data i practice. (2) From biological kowledge, we determiistically decided the category to which edges belog, e.g. u 11 = 1, u 12 = 2, ad so o. However, biological kowledge cotais some errors. I fact, u ij ca be viewed as a radom variable, ad a statistical model ca be costructed for u ij. I that sese, our method ca be exteded as a Bayesia etwork estimatio method with a self-repairig database mechaism. We would like to ivestigate these problems i a future paper.

9 Refereces [1] T. Akutsu, S. Miyao ad S. Kuhara. Idetificatio of geetic etworks from a small umber of gee expressio patters uder the Boolea etwork model. Pacific Symposium o Biocomputig, 4,17-28,1999. [2] T. Akutsu, S. Miyao ad S. Kuhara. Iferrig qualitative relatios i geetic etworks ad metabolic pathways. Bioiformatics, 16, ,2000 [3] G.D. Bader, I. Doaldso, C. Woltig, B.F.F. Ouellette, T. Pawso ad C.W.V. Hogue. BIND-The biomolecular iteractio etwork database. Nucleic Acids Research, 29, , [4] H. Baai, S. Ieaga, A. Shiohara, M. Takeda ad S. Miyao. A strig patter regressio algorithm ad its applicatio to patter discovery i log itros. Geome Iformatics, 13,3-11,2002. [5] BIND [6] H.J. Bussemaker,H. Li ad E.D. Siggia. Regulatory elemet detectio usig correlatio with expressio. Nature Geetics, 27, ,2001. [7] T. Che, H. He ad G. Church. Modelig gee expressio with differetial equatios. Pacific Symposium o Biocomputig, 4,29-40,1999. [8] A.C. Daviso. Approximate predictive likelihood. Biometrika, 73, , [9] C. de Boor. A Practical Guide to Splies. Spriger, Berli [10] M.J.L. de Hoo, S. Imoto ad S. Miyao. Iferrig gee regulatory etworks from time-ordered gee expressio data usig differetial equatios. Proc. 5th Iteratioal Coferece o Discovery Sciece, Lecture Note i Artificial Itelligece, 2534, Spriger-Verlag, , [11] M.J.L. de Hoo, S. Imoto, K. Kobayashi, N. Ogasawara ad S. Miyao. Iferrig gee regulatory etworks from timeordered gee expressio data of Bacillus subtilis usig differetial equatios. Pacific Symposium o Biocomputig, 8, 17-28, [12] P.H.C. Eilers ad B. Marx. Flexible smoothig with B- splies ad pealties (with discussio). Statistical Sciece, 11, , [13] N. Friedma ad M. Goldszmidt. Learig Bayesia etworks with local structure. i M.I. Jorda ed., Kluwer Academic Publisher, , [14] N. Friedma, M. Liial, I. Nachma ad D. Pe er. Usig Bayesia etwork to aalyze expressio data. J. Comp. Biol., 7, , [15] S. Gema ad D. Gema. Stochastic relaxatio, Gibbs distributio ad the Bayesia restoratios. IEEE Trasactios o Patter Aalysis ad Machie Itelligece, 6, , [16] GRID [17] A.J. Hartemik, D.K. Gifford, T.S. Jaakkola ad R.A. Youg. Usig graphical models ad geomic expressio data to statistically validate models of geetic regulatory etworks. Pacific Symposium o Biocomputig, 6, , [18] A.J. Hartemik, D.K. Gifford, T.S. Jaakkola ad R.A. Youg. Combiig locatio ad expressio data for pricipled discovery of geetic regulatory etwork models. Pacific Symposium o Biocomputig, 7, ,2002. [19] T. Hastie ad R. Tibshirai. Geeralized Additive Models. Chapma & Hall, [20] D. Heckerma. A tutorial o learig with Bayesia etworks. i M.I. Jorda ed., Kluwer Academic Publisher, , [21] T.Ideker,O.Ozier,B.SchwikowskiadA.F.Siegel.Discoverig regulatory ad sigallig circuits i molecular iteractio etworks. Bioiformatics, 18 (ISMB 2002), S233- S240, [22] S. Imoto, T. Goto ad S. Miyao. Estimatio of geetic etworks ad fuctioal structures betwee gees by usig Bayesia etworks ad oparametric regressio. Pacific Symposium o Biocomputig, 7, ,2002. [23] S. Imoto, S. Kim, T. Goto, S. Aburatai, K. Tashiro, S. Kuhara ad S. Miyao. Bayesia etwork ad oparametric heteroscedastic regressio for oliear modelig of geetic etwork. Joural of Bioiformatics ad Computatioal Biology, i press. (Prelimiary versio has appeared i Proc. 1st IEEE Computer Society Bioiformatics Coferece, , 2002). [24] S. Imoto ad S. Koishi. Selectio of smoothig parameters i B-splie oparametric regressio models usig iformatio criteria. A. Ist. Statist. Math., i press. [25] T. Ito, T. Chiba, R. Ozawa, M. Yoshida, M. Hattori ad Y. Sakaki. A comprehesive two-hybrid aalysis to explore the yeast protei iteractome. Proc. Natl. Acad. Sci. USA, 97, , [26] F.V. Jese. A itroductio to Bayesia Networks.Uiversity College Lodo Press, [27] T.-K. Jesse, A. L greid. J. Komorowski ad E. Hovig. A literature etwork of huma gees for high-throughput aalysis of gee expressio.nature Geetics, 28, [28] KEGG

10 [29] S. Kim, S. Imoto ad S. Miyao. Dyamic Bayesia etwork ad oparametric regressio for oliear modelig of gee etworks from time series gee expressio data. Proc. 1st Iteratioal Workshop o Computatioal Methods i Systems Biology, Lecture Note i Computer Sciece, 2602, Spriger-Verlag, , [30] S. Koishi, T. Ado ad S. Imoto. Bayesia iformatio criteria ad smoothig parameter selectio i radial basis fuctio etworks. submitted for publicatio. [31] T.I. Lee, N.J. Rialdi, F. Robert, D.T. Odom, Z. Bar-Joseph, G.K. Gerber, N.M. Haett, C.T. Harbiso, C.M. Thompso, I. Simo, J. Zeitliger, E.G. Jeigs, H.L. Murray, D.B. Gordo, B.Re, J.J. Wyrick, J-B. Tage, T.L. Volkert, E. Fraekel, D.K. Gifford ad R.A. Youg. Trascriptioal regulatory etworks i Saccharomyces cerevisiae. Sciece, 298, , 2002 [32] Y. Maki, D. Tomiaga, M. Okamoto, S. Wataabe ad Y. Eguchi. Developmet of a system for the iferece of large scale geetic etworks. Pacific Symposium o Biocomputig, 6, , [42] I. Shmulevich, E.R. Dougherty, S. Kim ad W. Zhag. Probabilistic Boolea etworks: a rule-based ucertaity model for gee regulatory etworks. Bioiformatics, 18, , [43] V.A. Smith, E.D. Jarvis ad A.J. Hartemik. Evaluatig fuctioal etwork iferece usig simulatios of complex biological systems. Bioiformatics, 18 (ISMB 2002), S216- S224, [44] Y. Tamada, S. Kim, H. Baai, S. Imoto K. Tashiro, S. Kuhara ad S. Miyao. Estimatig gee etworks from gee expressio data by combiig Bayesia etwork model with promoter elemet detectio. Bioiformatics, (ECCB 2003), i press. [45] L. Tierey ad J.B. Kadae. Accurate approximatios for posterior momets ad margial desities. J. Amer. Statist. Assoc., 81, 82-86, [46] YPD databases/ypd.shtml [47] YTF [33] D.R. Masys. Likig microarray data to the literature. Nature Geetics, 28, 9-10, [34] K. Murphy ad S. Mia. Modellig gee expressio data usig dyamic Bayesia etworks. Techical report, Computer Sciece Divisio, Uiversity of Califoria, Berkeley, CA [35] N. Nariai, S. Kim, S. Imoto ad S. Miyao. Usig proteiprotei iteractios for refiig gee etworks estimated from microarray data by Bayesia etworks. uder preparatio. [36] I.M. Og, J.D. Glaser ad D. Page. Modellig regulatory pathways i E. coli from time series expressio profiles. Bioiformatics, 18 (ISMB2002), S241-S248, [37] D. Pe er, A. Regev, G. Elida ad N. Friedma. Iferrig subetworks from perturbed expressio profiles. Bioiformatics, 17 (ISMB 2001), S215-S224, [38] Y. Pilpel, P. Sudarsaam ad G.M. Church. Idetifyig regulatory etworks by combiatorial aalysis of promoter elemets. Nature Geetics, 29,153-9,2001. [39] J. Quackebush. Microarray data ormalizatio ad trasformatio. Nature Geetics, 32, ,2002. [40] SCPD [41] E. Segal, Y. Barash, I. Simo, N. Friedma ad D. Koller. From promoter sequece to expressio: a probabilistic framework. Proc. 6th Aual Iteratioal Coferece o Research i Computatioal Molecular Biology (RECOMB 2002), , 2002.

Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks

Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks Seiya Imoto 1, Tomoyuki Higuchi 2, Takao Goto 1, Kousuke Tashiro 3, Satoru Kuhara 3 and Satoru Miyano 1

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

Clustering. CM226: Machine Learning for Bioinformatics. Fall Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar.

Clustering. CM226: Machine Learning for Bioinformatics. Fall Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar. Clusterig CM226: Machie Learig for Bioiformatics. Fall 216 Sriram Sakararama Ackowledgmets: Fei Sha, Ameet Talwalkar Clusterig 1 / 42 Admiistratio HW 1 due o Moday. Email/post o CCLE if you have questios.

More information

Study the bias (due to the nite dimensional approximation) and variance of the estimators

Study the bias (due to the nite dimensional approximation) and variance of the estimators 2 Series Methods 2. Geeral Approach A model has parameters (; ) where is ite-dimesioal ad is oparametric. (Sometimes, there is o :) We will focus o regressio. The fuctio is approximated by a series a ite

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

Lecture 7: Density Estimation: k-nearest Neighbor and Basis Approach

Lecture 7: Density Estimation: k-nearest Neighbor and Basis Approach STAT 425: Itroductio to Noparametric Statistics Witer 28 Lecture 7: Desity Estimatio: k-nearest Neighbor ad Basis Approach Istructor: Ye-Chi Che Referece: Sectio 8.4 of All of Noparametric Statistics.

More information

A Note on Effi cient Conditional Simulation of Gaussian Distributions. April 2010

A Note on Effi cient Conditional Simulation of Gaussian Distributions. April 2010 A Note o Effi ciet Coditioal Simulatio of Gaussia Distributios A D D C S S, U B C, V, BC, C April 2010 A Cosider a multivariate Gaussia radom vector which ca be partitioed ito observed ad uobserved compoetswe

More information

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS

A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS J. Japa Statist. Soc. Vol. 41 No. 1 2011 67 73 A RANK STATISTIC FOR NON-PARAMETRIC K-SAMPLE AND CHANGE POINT PROBLEMS Yoichi Nishiyama* We cosider k-sample ad chage poit problems for idepedet data i a

More information

Statistical Pattern Recognition

Statistical Pattern Recognition Statistical Patter Recogitio Classificatio: No-Parametric Modelig Hamid R. Rabiee Jafar Muhammadi Sprig 2014 http://ce.sharif.edu/courses/92-93/2/ce725-2/ Ageda Parametric Modelig No-Parametric Modelig

More information

Introductory statistics

Introductory statistics CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key

More information

Information-based Feature Selection

Information-based Feature Selection Iformatio-based Feature Selectio Farza Faria, Abbas Kazeroui, Afshi Babveyh Email: {faria,abbask,afshib}@staford.edu 1 Itroductio Feature selectio is a topic of great iterest i applicatios dealig with

More information

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss ECE 90 Lecture : Complexity Regularizatio ad the Squared Loss R. Nowak 5/7/009 I the previous lectures we made use of the Cheroff/Hoeffdig bouds for our aalysis of classifier errors. Hoeffdig s iequality

More information

Element sampling: Part 2

Element sampling: Part 2 Chapter 4 Elemet samplig: Part 2 4.1 Itroductio We ow cosider uequal probability samplig desigs which is very popular i practice. I the uequal probability samplig, we ca improve the efficiecy of the resultig

More information

On an Application of Bayesian Estimation

On an Application of Bayesian Estimation O a Applicatio of ayesia Estimatio KIYOHARU TANAKA School of Sciece ad Egieerig, Kiki Uiversity, Kowakae, Higashi-Osaka, JAPAN Email: ktaaka@ifokidaiacjp EVGENIY GRECHNIKOV Departmet of Mathematics, auma

More information

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution Iteratioal Mathematical Forum, Vol., 3, o. 3, 3-53 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/.9/imf.3.335 Double Stage Shrikage Estimator of Two Parameters Geeralized Expoetial Distributio Alaa M.

More information

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015 ECE 8527: Itroductio to Machie Learig ad Patter Recogitio Midterm # 1 Vaishali Ami Fall, 2015 tue39624@temple.edu Problem No. 1: Cosider a two-class discrete distributio problem: ω 1 :{[0,0], [2,0], [2,2],

More information

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator Ecoomics 24B Relatio to Method of Momets ad Maximum Likelihood OLSE as a Maximum Likelihood Estimator Uder Assumptio 5 we have speci ed the distributio of the error, so we ca estimate the model parameters

More information

CSE 527, Additional notes on MLE & EM

CSE 527, Additional notes on MLE & EM CSE 57 Lecture Notes: MLE & EM CSE 57, Additioal otes o MLE & EM Based o earlier otes by C. Grat & M. Narasimha Itroductio Last lecture we bega a examiatio of model based clusterig. This lecture will be

More information

Chapter 12 EM algorithms The Expectation-Maximization (EM) algorithm is a maximum likelihood method for models that have hidden variables eg. Gaussian

Chapter 12 EM algorithms The Expectation-Maximization (EM) algorithm is a maximum likelihood method for models that have hidden variables eg. Gaussian Chapter 2 EM algorithms The Expectatio-Maximizatio (EM) algorithm is a maximum likelihood method for models that have hidde variables eg. Gaussia Mixture Models (GMMs), Liear Dyamic Systems (LDSs) ad Hidde

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

PAijpam.eu ON TENSOR PRODUCT DECOMPOSITION

PAijpam.eu ON TENSOR PRODUCT DECOMPOSITION Iteratioal Joural of Pure ad Applied Mathematics Volume 103 No 3 2015, 537-545 ISSN: 1311-8080 (prited versio); ISSN: 1314-3395 (o-lie versio) url: http://wwwijpameu doi: http://dxdoiorg/1012732/ijpamv103i314

More information

Machine Learning Brett Bernstein

Machine Learning Brett Bernstein Machie Learig Brett Berstei Week 2 Lecture: Cocept Check Exercises Starred problems are optioal. Excess Risk Decompositio 1. Let X = Y = {1, 2,..., 10}, A = {1,..., 10, 11} ad suppose the data distributio

More information

Chapter 6 Principles of Data Reduction

Chapter 6 Principles of Data Reduction Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a

More information

R. van Zyl 1, A.J. van der Merwe 2. Quintiles International, University of the Free State

R. van Zyl 1, A.J. van der Merwe 2. Quintiles International, University of the Free State Bayesia Cotrol Charts for the Two-parameter Expoetial Distributio if the Locatio Parameter Ca Take o Ay Value Betwee Mius Iity ad Plus Iity R. va Zyl, A.J. va der Merwe 2 Quitiles Iteratioal, ruaavz@gmail.com

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

Overview of Estimation

Overview of Estimation Topic Iferece is the problem of turig data ito kowledge, where kowledge ofte is expressed i terms of etities that are ot preset i the data per se but are preset i models that oe uses to iterpret the data.

More information

Journal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula

Journal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula Joural of Multivariate Aalysis 102 (2011) 1315 1319 Cotets lists available at ScieceDirect Joural of Multivariate Aalysis joural homepage: www.elsevier.com/locate/jmva Superefficiet estimatio of the margials

More information

Algebra of Least Squares

Algebra of Least Squares October 19, 2018 Algebra of Least Squares Geometry of Least Squares Recall that out data is like a table [Y X] where Y collects observatios o the depedet variable Y ad X collects observatios o the k-dimesioal

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Minimum Dominating Set Approach to Analysis and Control of Biological Networks

Minimum Dominating Set Approach to Analysis and Control of Biological Networks Miimum Domiatig Set Approach to Aalysis ad Cotrol of Biological Networks Tatsuya Akutsu Bioiformatics Ceter Istitute for Chemical Research, Kyoto Uiversity Joit work with Jose Nacher i Toho Uiversity Motivatio:

More information

Exponential Families and Bayesian Inference

Exponential Families and Bayesian Inference Computer Visio Expoetial Families ad Bayesia Iferece Lecture Expoetial Families A expoetial family of distributios is a d-parameter family f(x; havig the followig form: f(x; = h(xe g(t T (x B(, (. where

More information

Correlation Regression

Correlation Regression Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother

More information

Discrete Orthogonal Moment Features Using Chebyshev Polynomials

Discrete Orthogonal Moment Features Using Chebyshev Polynomials Discrete Orthogoal Momet Features Usig Chebyshev Polyomials R. Mukuda, 1 S.H.Og ad P.A. Lee 3 1 Faculty of Iformatio Sciece ad Techology, Multimedia Uiversity 75450 Malacca, Malaysia. Istitute of Mathematical

More information

n n i=1 Often we also need to estimate the variance. Below are three estimators each of which is optimal in some sense: n 1 i=1 k=1 i=1 k=1 i=1 k=1

n n i=1 Often we also need to estimate the variance. Below are three estimators each of which is optimal in some sense: n 1 i=1 k=1 i=1 k=1 i=1 k=1 MATH88T Maria Camero Cotets Basic cocepts of statistics Estimators, estimates ad samplig distributios 2 Ordiary least squares estimate 3 3 Maximum lielihood estimator 3 4 Bayesia estimatio Refereces 9

More information

6.867 Machine learning, lecture 7 (Jaakkola) 1

6.867 Machine learning, lecture 7 (Jaakkola) 1 6.867 Machie learig, lecture 7 (Jaakkola) 1 Lecture topics: Kerel form of liear regressio Kerels, examples, costructio, properties Liear regressio ad kerels Cosider a slightly simpler model where we omit

More information

BIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov

BIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov Microarray Ceter BIOSTATISTICS Lecture 5 Iterval Estimatios for Mea ad Proportio dr. Petr Nazarov 15-03-013 petr.azarov@crp-sate.lu Lecture 5. Iterval estimatio for mea ad proportio OUTLINE Iterval estimatios

More information

Summary and Discussion on Simultaneous Analysis of Lasso and Dantzig Selector

Summary and Discussion on Simultaneous Analysis of Lasso and Dantzig Selector Summary ad Discussio o Simultaeous Aalysis of Lasso ad Datzig Selector STAT732, Sprig 28 Duzhe Wag May 4, 28 Abstract This is a discussio o the work i Bickel, Ritov ad Tsybakov (29). We begi with a short

More information

REGRESSION WITH QUADRATIC LOSS

REGRESSION WITH QUADRATIC LOSS REGRESSION WITH QUADRATIC LOSS MAXIM RAGINSKY Regressio with quadratic loss is aother basic problem studied i statistical learig theory. We have a radom couple Z = X, Y ), where, as before, X is a R d

More information

Advanced Stochastic Processes.

Advanced Stochastic Processes. Advaced Stochastic Processes. David Gamarik LECTURE 2 Radom variables ad measurable fuctios. Strog Law of Large Numbers (SLLN). Scary stuff cotiued... Outlie of Lecture Radom variables ad measurable fuctios.

More information

DISTRIBUTION LAW Okunev I.V.

DISTRIBUTION LAW Okunev I.V. 1 DISTRIBUTION LAW Okuev I.V. Distributio law belogs to a umber of the most complicated theoretical laws of mathematics. But it is also a very importat practical law. Nothig ca help uderstad complicated

More information

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random Part III. Areal Data Aalysis 0. Comparative Tests amog Spatial Regressio Models While the otio of relative likelihood values for differet models is somewhat difficult to iterpret directly (as metioed above),

More information

Expectation-Maximization Algorithm.

Expectation-Maximization Algorithm. Expectatio-Maximizatio Algorithm. Petr Pošík Czech Techical Uiversity i Prague Faculty of Electrical Egieerig Dept. of Cyberetics MLE 2 Likelihood.........................................................................................................

More information

Research Article A Unified Weight Formula for Calculating the Sample Variance from Weighted Successive Differences

Research Article A Unified Weight Formula for Calculating the Sample Variance from Weighted Successive Differences Discrete Dyamics i Nature ad Society Article ID 210761 4 pages http://dxdoiorg/101155/2014/210761 Research Article A Uified Weight Formula for Calculatig the Sample Variace from Weighted Successive Differeces

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Preponderantly increasing/decreasing data in regression analysis

Preponderantly increasing/decreasing data in regression analysis Croatia Operatioal Research Review 269 CRORR 7(2016), 269 276 Prepoderatly icreasig/decreasig data i regressio aalysis Darija Marković 1, 1 Departmet of Mathematics, J. J. Strossmayer Uiversity of Osijek,

More information

An Introduction to Randomized Algorithms

An Introduction to Randomized Algorithms A Itroductio to Radomized Algorithms The focus of this lecture is to study a radomized algorithm for quick sort, aalyze it usig probabilistic recurrece relatios, ad also provide more geeral tools for aalysis

More information

THE SYSTEMATIC AND THE RANDOM. ERRORS - DUE TO ELEMENT TOLERANCES OF ELECTRICAL NETWORKS

THE SYSTEMATIC AND THE RANDOM. ERRORS - DUE TO ELEMENT TOLERANCES OF ELECTRICAL NETWORKS R775 Philips Res. Repts 26,414-423, 1971' THE SYSTEMATIC AND THE RANDOM. ERRORS - DUE TO ELEMENT TOLERANCES OF ELECTRICAL NETWORKS by H. W. HANNEMAN Abstract Usig the law of propagatio of errors, approximated

More information

Chapter 10: Power Series

Chapter 10: Power Series Chapter : Power Series 57 Chapter Overview: Power Series The reaso series are part of a Calculus course is that there are fuctios which caot be itegrated. All power series, though, ca be itegrated because

More information

11 Hidden Markov Models

11 Hidden Markov Models Hidde Markov Models Hidde Markov Models are a popular machie learig approach i bioiformatics. Machie learig algorithms are preseted with traiig data, which are used to derive importat isights about the

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

Lecture 33: Bootstrap

Lecture 33: Bootstrap Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece

More information

Kolmogorov-Smirnov type Tests for Local Gaussianity in High-Frequency Data

Kolmogorov-Smirnov type Tests for Local Gaussianity in High-Frequency Data Proceedigs 59th ISI World Statistics Cogress, 5-30 August 013, Hog Kog (Sessio STS046) p.09 Kolmogorov-Smirov type Tests for Local Gaussiaity i High-Frequecy Data George Tauche, Duke Uiversity Viktor Todorov,

More information

A Note on Box-Cox Quantile Regression Estimation of the Parameters of the Generalized Pareto Distribution

A Note on Box-Cox Quantile Regression Estimation of the Parameters of the Generalized Pareto Distribution A Note o Box-Cox Quatile Regressio Estimatio of the Parameters of the Geeralized Pareto Distributio JM va Zyl Abstract: Makig use of the quatile equatio, Box-Cox regressio ad Laplace distributed disturbaces,

More information

Lecture 11 and 12: Basic estimation theory

Lecture 11 and 12: Basic estimation theory Lecture ad 2: Basic estimatio theory Sprig 202 - EE 94 Networked estimatio ad cotrol Prof. Kha March 2 202 I. MAXIMUM-LIKELIHOOD ESTIMATORS The maximum likelihood priciple is deceptively simple. Louis

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Optimization Methods MIT 2.098/6.255/ Final exam

Optimization Methods MIT 2.098/6.255/ Final exam Optimizatio Methods MIT 2.098/6.255/15.093 Fial exam Date Give: December 19th, 2006 P1. [30 pts] Classify the followig statemets as true or false. All aswers must be well-justified, either through a short

More information

THE KALMAN FILTER RAUL ROJAS

THE KALMAN FILTER RAUL ROJAS THE KALMAN FILTER RAUL ROJAS Abstract. This paper provides a getle itroductio to the Kalma filter, a umerical method that ca be used for sesor fusio or for calculatio of trajectories. First, we cosider

More information

EE / EEE SAMPLE STUDY MATERIAL. GATE, IES & PSUs Signal System. Electrical Engineering. Postal Correspondence Course

EE / EEE SAMPLE STUDY MATERIAL. GATE, IES & PSUs Signal System. Electrical Engineering. Postal Correspondence Course Sigal-EE Postal Correspodece Course 1 SAMPLE STUDY MATERIAL Electrical Egieerig EE / EEE Postal Correspodece Course GATE, IES & PSUs Sigal System Sigal-EE Postal Correspodece Course CONTENTS 1. SIGNAL

More information

GUIDELINES ON REPRESENTATIVE SAMPLING

GUIDELINES ON REPRESENTATIVE SAMPLING DRUGS WORKING GROUP VALIDATION OF THE GUIDELINES ON REPRESENTATIVE SAMPLING DOCUMENT TYPE : REF. CODE: ISSUE NO: ISSUE DATE: VALIDATION REPORT DWG-SGL-001 002 08 DECEMBER 2012 Ref code: DWG-SGL-001 Issue

More information

w (1) ˆx w (1) x (1) /ρ and w (2) ˆx w (2) x (2) /ρ.

w (1) ˆx w (1) x (1) /ρ and w (2) ˆx w (2) x (2) /ρ. 2 5. Weighted umber of late jobs 5.1. Release dates ad due dates: maximimizig the weight of o-time jobs Oce we add release dates, miimizig the umber of late jobs becomes a sigificatly harder problem. For

More information

Lecture 9: September 19

Lecture 9: September 19 36-700: Probability ad Mathematical Statistics I Fall 206 Lecturer: Siva Balakrisha Lecture 9: September 9 9. Review ad Outlie Last class we discussed: Statistical estimatio broadly Pot estimatio Bias-Variace

More information

Economics Spring 2015

Economics Spring 2015 1 Ecoomics 400 -- Sprig 015 /17/015 pp. 30-38; Ch. 7.1.4-7. New Stata Assigmet ad ew MyStatlab assigmet, both due Feb 4th Midterm Exam Thursday Feb 6th, Chapters 1-7 of Groeber text ad all relevat lectures

More information

Statistical Inference Based on Extremum Estimators

Statistical Inference Based on Extremum Estimators T. Rotheberg Fall, 2007 Statistical Iferece Based o Extremum Estimators Itroductio Suppose 0, the true value of a p-dimesioal parameter, is kow to lie i some subset S R p : Ofte we choose to estimate 0

More information

Bayesian Control Charts for the Two-parameter Exponential Distribution

Bayesian Control Charts for the Two-parameter Exponential Distribution Bayesia Cotrol Charts for the Two-parameter Expoetial Distributio R. va Zyl, A.J. va der Merwe 2 Quitiles Iteratioal, ruaavz@gmail.com 2 Uiversity of the Free State Abstract By usig data that are the mileages

More information

Bayesian Methods: Introduction to Multi-parameter Models

Bayesian Methods: Introduction to Multi-parameter Models Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested

More information

Lecture 24: Variable selection in linear models

Lecture 24: Variable selection in linear models Lecture 24: Variable selectio i liear models Cosider liear model X = Z β + ε, β R p ad Varε = σ 2 I. Like the LSE, the ridge regressio estimator does ot give 0 estimate to a compoet of β eve if that compoet

More information

Department of Mathematics

Department of Mathematics Departmet of Mathematics Ma 3/103 KC Border Itroductio to Probability ad Statistics Witer 2017 Lecture 19: Estimatio II Relevat textbook passages: Larse Marx [1]: Sectios 5.2 5.7 19.1 The method of momets

More information

Estimation of Gumbel Parameters under Ranked Set Sampling

Estimation of Gumbel Parameters under Ranked Set Sampling Joural of Moder Applied Statistical Methods Volume 13 Issue 2 Article 11-2014 Estimatio of Gumbel Parameters uder Raked Set Samplig Omar M. Yousef Al Balqa' Applied Uiversity, Zarqa, Jorda, abuyaza_o@yahoo.com

More information

Modeling and Estimation of a Bivariate Pareto Distribution using the Principle of Maximum Entropy

Modeling and Estimation of a Bivariate Pareto Distribution using the Principle of Maximum Entropy Sri Laka Joural of Applied Statistics, Vol (5-3) Modelig ad Estimatio of a Bivariate Pareto Distributio usig the Priciple of Maximum Etropy Jagathath Krisha K.M. * Ecoomics Research Divisio, CSIR-Cetral

More information

Output Analysis and Run-Length Control

Output Analysis and Run-Length Control IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%

More information

ADVANCED SOFTWARE ENGINEERING

ADVANCED SOFTWARE ENGINEERING ADVANCED SOFTWARE ENGINEERING COMP 3705 Exercise Usage-based Testig ad Reliability Versio 1.0-040406 Departmet of Computer Ssciece Sada Narayaappa, Aeliese Adrews Versio 1.1-050405 Departmet of Commuicatio

More information

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1 EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum

More information

Monte Carlo Integration

Monte Carlo Integration Mote Carlo Itegratio I these otes we first review basic umerical itegratio methods (usig Riema approximatio ad the trapezoidal rule) ad their limitatios for evaluatig multidimesioal itegrals. Next we itroduce

More information

CS284A: Representations and Algorithms in Molecular Biology

CS284A: Representations and Algorithms in Molecular Biology CS284A: Represetatios ad Algorithms i Molecular Biology Scribe Notes o Lectures 3 & 4: Motif Discovery via Eumeratio & Motif Represetatio Usig Positio Weight Matrix Joshua Gervi Based o presetatios by

More information

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function. MATH 532 Measurable Fuctios Dr. Neal, WKU Throughout, let ( X, F, µ) be a measure space ad let (!, F, P ) deote the special case of a probability space. We shall ow begi to study real-valued fuctios defied

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information

ECE 901 Lecture 13: Maximum Likelihood Estimation

ECE 901 Lecture 13: Maximum Likelihood Estimation ECE 90 Lecture 3: Maximum Likelihood Estimatio R. Nowak 5/7/009 The focus of this lecture is to cosider aother approach to learig based o maximum likelihood estimatio. Ulike earlier approaches cosidered

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

Testing Statistical Hypotheses for Compare. Means with Vague Data

Testing Statistical Hypotheses for Compare. Means with Vague Data Iteratioal Mathematical Forum 5 o. 3 65-6 Testig Statistical Hypotheses for Compare Meas with Vague Data E. Baloui Jamkhaeh ad A. adi Ghara Departmet of Statistics Islamic Azad iversity Ghaemshahr Brach

More information

x a x a Lecture 2 Series (See Chapter 1 in Boas)

x a x a Lecture 2 Series (See Chapter 1 in Boas) Lecture Series (See Chapter i Boas) A basic ad very powerful (if pedestria, recall we are lazy AD smart) way to solve ay differetial (or itegral) equatio is via a series expasio of the correspodig solutio

More information

Exercises Advanced Data Mining: Solutions

Exercises Advanced Data Mining: Solutions Exercises Advaced Data Miig: Solutios Exercise 1 Cosider the followig directed idepedece graph. 5 8 9 a) Give the factorizatio of P (X 1, X 2,..., X 9 ) correspodig to this idepedece graph. P (X) = 9 P

More information

Section 14. Simple linear regression.

Section 14. Simple linear regression. Sectio 14 Simple liear regressio. Let us look at the cigarette dataset from [1] (available to dowload from joural s website) ad []. The cigarette dataset cotais measuremets of tar, icotie, weight ad carbo

More information

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation Cofidece Iterval for tadard Deviatio of Normal Distributio with Kow Coefficiets of Variatio uparat Niwitpog Departmet of Applied tatistics, Faculty of Applied ciece Kig Mogkut s Uiversity of Techology

More information

Support vector machine revisited

Support vector machine revisited 6.867 Machie learig, lecture 8 (Jaakkola) 1 Lecture topics: Support vector machie ad kerels Kerel optimizatio, selectio Support vector machie revisited Our task here is to first tur the support vector

More information

DETERMINATION OF MECHANICAL PROPERTIES OF A NON- UNIFORM BEAM USING THE MEASUREMENT OF THE EXCITED LONGITUDINAL ELASTIC VIBRATIONS.

DETERMINATION OF MECHANICAL PROPERTIES OF A NON- UNIFORM BEAM USING THE MEASUREMENT OF THE EXCITED LONGITUDINAL ELASTIC VIBRATIONS. ICSV4 Cairs Australia 9- July 7 DTRMINATION OF MCHANICAL PROPRTIS OF A NON- UNIFORM BAM USING TH MASURMNT OF TH XCITD LONGITUDINAL LASTIC VIBRATIONS Pavel Aokhi ad Vladimir Gordo Departmet of the mathematics

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Lecture 19: Convergence

Lecture 19: Convergence Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may

More information

Riesz-Fischer Sequences and Lower Frame Bounds

Riesz-Fischer Sequences and Lower Frame Bounds Zeitschrift für Aalysis ud ihre Aweduge Joural for Aalysis ad its Applicatios Volume 1 (00), No., 305 314 Riesz-Fischer Sequeces ad Lower Frame Bouds P. Casazza, O. Christese, S. Li ad A. Lider Abstract.

More information

ON POINTWISE BINOMIAL APPROXIMATION

ON POINTWISE BINOMIAL APPROXIMATION Iteratioal Joural of Pure ad Applied Mathematics Volume 71 No. 1 2011, 57-66 ON POINTWISE BINOMIAL APPROXIMATION BY w-functions K. Teerapabolar 1, P. Wogkasem 2 Departmet of Mathematics Faculty of Sciece

More information

ANOTHER WEIGHTED WEIBULL DISTRIBUTION FROM AZZALINI S FAMILY

ANOTHER WEIGHTED WEIBULL DISTRIBUTION FROM AZZALINI S FAMILY ANOTHER WEIGHTED WEIBULL DISTRIBUTION FROM AZZALINI S FAMILY Sulema Nasiru, MSc. Departmet of Statistics, Faculty of Mathematical Scieces, Uiversity for Developmet Studies, Navrogo, Upper East Regio, Ghaa,

More information

Spectral Partitioning in the Planted Partition Model

Spectral Partitioning in the Planted Partition Model Spectral Graph Theory Lecture 21 Spectral Partitioig i the Plated Partitio Model Daiel A. Spielma November 11, 2009 21.1 Itroductio I this lecture, we will perform a crude aalysis of the performace of

More information

Geometry of LS. LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT

Geometry of LS. LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT OCTOBER 7, 2016 LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT Geometry of LS We ca thik of y ad the colums of X as members of the -dimesioal Euclidea space R Oe ca

More information

Gamma Distribution and Gamma Approximation

Gamma Distribution and Gamma Approximation Gamma Distributio ad Gamma Approimatio Xiaomig Zeg a Fuhua (Frak Cheg b a Xiame Uiversity, Xiame 365, Chia mzeg@jigia.mu.edu.c b Uiversity of Ketucky, Leigto, Ketucky 456-46, USA cheg@cs.uky.edu Abstract

More information

Pattern Classification, Ch4 (Part 1)

Pattern Classification, Ch4 (Part 1) Patter Classificatio All materials i these slides were take from Patter Classificatio (2d ed) by R O Duda, P E Hart ad D G Stork, Joh Wiley & Sos, 2000 with the permissio of the authors ad the publisher

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics 8.2 Testig a Proportio Math 1 Itroductory Statistics Professor B. Abrego Lecture 15 Sectios 8.2 People ofte make decisios with data by comparig the results from a sample to some predetermied stadard. These

More information

IP Reference guide for integer programming formulations.

IP Reference guide for integer programming formulations. IP Referece guide for iteger programmig formulatios. by James B. Orli for 15.053 ad 15.058 This documet is iteded as a compact (or relatively compact) guide to the formulatio of iteger programs. For more

More information

Bayesian and E- Bayesian Method of Estimation of Parameter of Rayleigh Distribution- A Bayesian Approach under Linex Loss Function

Bayesian and E- Bayesian Method of Estimation of Parameter of Rayleigh Distribution- A Bayesian Approach under Linex Loss Function Iteratioal Joural of Statistics ad Systems ISSN 973-2675 Volume 12, Number 4 (217), pp. 791-796 Research Idia Publicatios http://www.ripublicatio.com Bayesia ad E- Bayesia Method of Estimatio of Parameter

More information

Power Weighted Quantile Regression and Its Application

Power Weighted Quantile Regression and Its Application Joural of Data Sciece 12(2014), 535-544 Power Weighted Quatile Regressio ad Its Applicatio Xue J. Ma 1 ad Feg X. He 2 1 Remi Uiversity of Chia 2 North Chia Electric Power Uiversity Abstract: I the paper,

More information

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y

More information

Reliability and Queueing

Reliability and Queueing Copyright 999 Uiversity of Califoria Reliability ad Queueig by David G. Messerschmitt Supplemetary sectio for Uderstadig Networked Applicatios: A First Course, Morga Kaufma, 999. Copyright otice: Permissio

More information