Analysis and visualization of protein-protein interactions. Olga Vitek Assistant Professor Statistics and Computer Science

Size: px
Start display at page:

Download "Analysis and visualization of protein-protein interactions. Olga Vitek Assistant Professor Statistics and Computer Science"

Transcription

1 1 Analysis and visualization of protein-protein interactions Olga Vitek Assistant Professor Statistics and Computer Science

2 2 Outline 1. Protein-protein interactions 2. Using graph structures to study protein-protein interactions 3. Clustering of graphs 4. Evaluation of clusters

3 3 An Introduction to Bioinformatics Algorithms "#$%&$'"()%*"+,%-$.. A cell is a smallest structural unit of an organism that is capable of independent functioning All cells have some common features

4 4 An Introduction to Bioinformatics Algorithms "#$%&"#$%'()%*+,%-./0%/1 2(1/34'*5/( Replication Transcription Translation This model is known as the central dogma

5 Why should we study proteins? Ubiquitin-conjugating enzyme E2 G1 (PDB entry 2AWF) 5 Proteins: large molecules made up of amino acids accomplish most of the function of the living cells by interacting (i.e. entering in physical contact) with other molecules linear structures fold into 3-dimensional shapes the structure is used to accomplish the function

6 6 Proteins accomplish function by forming complexes A protein complex is a group of tightly interacting proteins also called functional module protein interactions within the complex help accomplish its function Example: exosome a complex of 11 proteins degrades RNA molecules ring structure ensures the function

7 7 Discovery of the complex helps understanding its function Example: exosome first discovered in yeast helped discovering an equivalent complex in humans has clinical implications target of autoimmune disease chemotherapies for cancer block its activity Knowledge of protein complexes speeds up biological and clinical research

8 8 Complexity of a bacterial cell Often study simpler model organisms to gain insight into the function of the cell

9 9 Modern technologies determine protein interactions on large scale New terms Proteome: all proteins that exist in an organism Interactome: all protein-protein interactions New questions Interactions of individual proteins Network-wide patterns of interactions New challenges Large, complex, noisy datasets Computational approaches are key

10 10 B I N D DNA damage response and repair DNA metabolism DNA recombination DNA replication RNA localization and processing autophagy budding carbohydrate metabolism cell cycle cell growth and/or maintenance cell organization and biogenesis cell shape and cell size control general metabolism mating nucleolar and ribosome biogenesis protein amino acid phosphorylation/desphosphorylation protein biosynthesis protein degradation protein metabolism and modification protein transport signal transduction sporulation stress response transcription transport Kss1 Bck2 Ste7 Dig1 Ste12 Ste11 Bem3 Ylr154c Rvb1 Aco1 Rvs167 Ded1 Gyp5 Ecm29 Hom6 Ilv5 Lys12 Pho84 Pre10 Ubp7 Ura7 Met18 Ylr243w Ubi4 Idh1 Apg12 Crm1 Fet3 Kap122 Kgd1 Met10 Pfk1 Rpn1 Car1 Hyp2 Ipp1 Mdh1 Cnb1 Cmp2 Cns1 Ecm10 Yhb1 Cof1 Crn1 Srv2 Hta1 Hhf1 Hir2 Htb1 Kap114 Kri1 Nap1 Nop4 Ret1 Rpc82 Rpo31 Spt16 Rrp13 Ylr222c Cct2 Cys4 Dig2 Pim1 Rpa135 Rpn6 Tec1 Yer093c Ygl245w Yjr072c Yol078w Lsm8 Apa1 Gar1 Lsm2 Qcr2 Rpn12 Rpn8 Rrp42 Smb1 Tif6 Ygl117w Mig1 Mss116 Nop12 Nop2 Brx1 Ckb1 Cox6 Fet4 Nsa2 Nip7 Nmd3 Nop1 Rrp1 Sik1 Nug1 Nsa1 Noc2 Ypl009c Rpc19 Rpn5 Emp24 Kre6 Rpn9 Rpt1 Rpt2 Sip2 Arc35 Gal83 Idh2 Sec53 Snf1 Snf4 Tcp1 Smt3 Cpr1 Gph1 Pst2 Rod1 Sip1 Yor267c Spo12 Pse1 Tem1 Cdc15 Gcd11 Mcx1 Sar1 Ybr281c Yhr033w Ynk1 Ubc6 Atp4 Gcn1 Los1 Pol5 Sec7 Uba1 Ybl004w Ykl056c Ypt1 Yju2 Cor1 Ded81 Prp19 Dss4 Gdi1 Mrs6 Sec4 Cdc53 Por1 Skp1 Ela1 Loc1 Yra1 Cic1 Glc7 Glc8 Mhp1 Reg1 Kap104 Ynl035c Dim1 Enp1 Nab2 Hrp1 Mrt4 Prp6 Hsl7 Rpp0 Ylr287c Fyv14 Krr1 Yjr041c Rpf2 Kre33 Tsr1 Tif4631 Drs1 Erb1 Ydr131c Yrb2 Yhr197w Arf1 Arf2 Cmd1 Cmk2 Ede1 Myo2 Myo4 Myo5 Pgm2 Vas1 Vps13 Ils1 Sod1 Hch1 She3 Dmc1 Est1 Puf6 Rrp12 Cbf5 Dbp7 Hsh49 Pet56 Pdi1 Pwp1 Yer077c Yjl109c Ykl014c Fpr1 Hom3 Gsp1 Gsp2 Srm1 Rna1 Mog1 Rse1 Ist3 Bud13 Car2 Ssz1 Ydr341c Sah1 Lsm7 Pat1 Ade5 Dhh1 Lsm4 Prp24 Met30 Rub1 Sis1 Tef4 Met4 Met31 Nop13 Ebp2 Imd2 Imd3 Rrp5 Rad10 Rad1 Sod2 Ubc1 Adk1 Yhr115c Ynl311c Ypt6 Ric1 Cdc11 Cdc3 Cdc10 Cdc12 Tif4632 Ctf13 Hrt1 Rtt101 Sec27 Guf1 Yll034c Tps1 Ubp15 Rpc40 Rpa190 Sgn1 Ygr250c Spt2 Cka1 Cka2 Ygr052w His4 Gpi16 Cdc42 Bem4 San1 Gbp2 Hpr1 Sub2 Thp2 Mft1 Rlr1 Rho2 Ubc12 Ula1 Clb2 Cdc28 Dia2 Cks1 Hat1 Vps21 Ypt10 Ypt52 Ypt32 Ypt31 Ypt7 Gpa2 Ymr029c Grr1 Pfk2 Pfs2 Rpt3 Ygl004c Rpn10 Snp1 Bcy1 Tpk3 Tpk1 Sti1 Ypt53 Cce1 Rnr3 Cmk1 Vph2 Cpr6 Qns1 Trr1 Caf120 Ptc4 Ydr247w Gin4 Ydr071c Mas1 Mas2 Gis4 Prb1 Sxm1 Ecm1 Lhp1 Yjl149w Cdc13 Fun11 Clu1 Tif2 Gnd1 Kip3 Hef3 Inp52 Sap190 Nta1 Hsp104 Vma4 Tfp1 Mge1 Ptc5 Yal027w Spo13 Ydr219c Yhr122w Ydr306c Yku80 Msu1 Ylr352w Yol128c Oye2 Bur2 Cln1 Clb5 Clb3 Yer138c Ydr170wa Nup85 Hem15 Ymr209c Cbp3 Nup84 Seh1 Pre1 Pup2 Pup3 Ylr199c Pre5 Pre2 Pre3 Pre6 Pre7 Scl1 Ykl206c Pre9 Pre8 Rsp5 Rvs161 Bop2 Sgt1 Ufo1 Cdc4 Yil007c Yta6 Ygr086c Ypl004c Top2 Cdh1 Cct3 Cat5 Yjl068c Gyp6 Trs120 Trs130 Rpb5 Rpa43 Ybr203w Yck1 Yck2 Ypr015c Bud20 Ydr101c Prp43 Rsm25 Afg2 Fyv4 Ypl013c Rsm24 Yjl122w Ybl044w Nog1 Ydr036c Htb2 Mam33 Ygl068w Cdc23 Swm1 Cdc5 Mcd1 Smc1 Gsy1 Fpr3 Sds22 Fin1 Gpa1 Hex3 Ssk1 Hrr25 Prp3 Prp4 Ymr226c Apc1 Nop58 Ygr145w Kap95 Cbp2 She4 Mlc1 Nuf1 Hul5 Myo3 Faa4 Rnq1 Ydr279w Ado1 Lsm1 Ygr173w Ydr152w Ykl078w Ylr097c Adh2 Imd4 Pet127 Tis11 Ytm1 Yrb1 Gal3 Bbc1 Cdc39 Mms1 Pma2 Yar009c Ygp1 Ylr035ca Ylr106c Adr1 Mkt1 Dur1 Ecm33 Hsp12 Sec26 Qri8 Ahp1 Dop1 Sry1 Rvb2 Npl3 Pub1 Fun12 Ste23 Ssf1 Aac3 Kre31 Rli1 Srp1 Sup45 Ygr090w Tpt1 Apc2 Mak5 Ynl116w Dbf2 Mob1 Ess1 Tfg1 Rpo21 Tom1 Spt5 Hxt6 Rpb3 Ynl253w Isu1 Nfs1 Lys1 Fol2 Tal1 Cct5 Cct6 Cft1 Hgh1 Mer1 Lhs1 Rpn3 Rpn7 Rpt5 Nas6 Arp2 Rpt4 Rpn11 Cpr3 Cdc33 Dut1 Ygr066c Thi22 Tpk2 Ybr028c Yck3 Ygr154c Ssn8 Sno2 Ygr111w Yjl207c Yil113w Slt2 Ynl260c Asi3 Ypl170w Pma1 Ynl227c Cdc14 Mcr1 Dpm1 Atp7 Atp5 Fur1 Hms1 Ydr453c Atp3 Spe3 Aut2 Spc72 Ydr229w Irr1 Smc3 Faa1 Cyr1 Dbf20 Ala1 Egd2 Axl1 Rpb10 Dbp8 Cpa2 Rnr1 Rnr2 Ydl086w Pdc6 Pdc5 Ynl157w Prs5 Dia4 Bud32 Grx4 Ykr038c Yml036w Isw2 Isw1 Kns1 Msg5 Fus3 Msh1 Pph22 Cdc55 Ppe1 Rts1 Tpd3 Ygr161c Hta2 Ygl121c Rpb11 Tap42 Pro1 Arc1 Fum1 Rad51 Mlh1 Tos3 Ykr096w Yak1 Dnm1 Vps1 Ylr270w Ypl247c Mpc54 Yhr112c Pyc2 Ybl108w Oye3 Ycr079w Vid31 Cdc60 Pro2 Pyc1 Ypl110c Ykl215c Emg1 Yer030w Ynl099c Siw14 Ypk2 Pet112 Ygr016w Yel023c Yor154w Yor220w Rad3 Dun1 Far1 Gpd1 Gpd2 Sen1 Ste20 Sec6 Ylr368w Rad26 Ach1 Adh4 Bio3 Erg20 Yhr076w Ymr318c Rhr2 Ydr326c Ras2 Ras1 Acc1 Gfa1 Isa2 Ngl2 Rpa49 Rpc25 Rpo26 Tbs1 Yfl042c Rpc34 Sen15 Egd1 Erg13 Erg6 Frs2 Gnd2 Grx1 Aat2 Afr1 Lro1 Met6 Ntf2 Prm2 Rsn1 Scp160 Ses1 Snu13 Ths1 Vma5 Wtm1 Ybr025c Dog1 Has1 Prp8 Sap185 Sit4 Ylr386w Yhr214wa Ydl025c Yal049c Yhr009c Apt1 Aro1 Ydr128w Ylr238w Erg1 Cdc7 Bfr2 Bir1 Nut1 Thi3 Ylr231c Ylr331c Bms1 Cdc46 Ctf4 Dbp10 Lst4 Mcm2 Mcm3 Spb1 Ssf2 Ybl104c Pph3 Ybl046w Ynl201c Prp11 Cop1 Nan1 Rex2 Shm2 Ssk2 Bck1 Gal7 Kic1 Kin2 Mkk2 Smk1 Ydr214w Ylr187w Gdb1 Rad50 Ubr1 Ylr241w Yor173w Ykl161c Ynl056w Yol045w Fun30 Fun31 Aac1 Apg17 Cvt9 Ppx1 REP1 Sec18 Tyr1 Yhr020w Ynl208w Yor086c Grx3 Idp2 Pho81 Sec23 Yor073w Dog2 Msk1 Rgd1 Vma22 Aip1 Arp7 Mse1 Ydr239c Ypr115w Rad54 Ynl134c Cdc9 Dbp9 Pol30 Yor378w Sld2 Trl1 Lif1 Dnl4 Mec3 Mak16 Ydr198c Ygl146c Nop15 Mlh3 Pso2 Mgm101 Rad2 Pex15 Mre11 Vma8 Xrs2 Rad7 Elc1 Rfa2 Hir3 Ubc13 Aro9 Mms2 Rad18 Whi2 Csr2 Tdp1 Shp1 Yen1 Fpr4 Lcd1 Adh5 Mag1 Ai1 Hho1 Hht1 Mph1 Msh2 Yor155c Mus81 Anc1 Cdc16 Mes1 Mms4 Nhp2 Rad53 Rpc10 Yer078c Ntg1 Rfc2 Tif34 Rad24 Rfc3 Rfc5 Ylr413w Asf1 Ptc2 Swi4 Tbf1 Ymr135c Yta7 Rad59 Bem2 Hor2 Ilv2 Opy1 Pgm1 Ptc3 Rad52 Rad9 Rfa1 Rfa3 Hxt7 Yjr141w Brr2 Rfc4 Rom2 Vac8 Sml1 Adh3 Nat1 Dpb11 Hpr5 Phr1 Msd1 Pol4 Rho5 Rad16 Htz1 Pdx1 Rad30 Map2 Ycl042w Efd1 Rfc1 Rhc18 Imd1 Set1 Bre2 Trf4 Mtr4 Ydl175c Yil079c Ypl146c Ddc1 Suv3 Rad17 Mgt1 Rip1 Sof1 Ira2 Rad14 Rad4 Rad28 Ccl1 Kin28 Lsc1 Tfb3 Ybr184w Ald5 Bul1 Lsb1 Ykr018c Ylr392c Sir3 Gas1 Sir1 Ubp8 Sir4 Blm3 Sir2 Yfl006w Yku70 Pex19 Arc15 Arc18 Arc19 Arc40 Arp3 Puf3 Fhl1 Fkh1 Fkh2 Gcn3 Hmo1 Ceg1 Ckb2 Fyv8 Gcd2 Gcd7 Mbp1 Net1 Sec2 Sin3 Sui2 Sui3 Ubp12 Ure2 Ygr017w Ymr144w Ino80 Lap4 Ams1 Bik1 Pib1 Pph21 Prk1 Abp1 Akl1 Dis3 Eaf3 Fip1 Nam8 Nup1 Nup2 Nup60 Pap1 Pct1 Reb1 Rnt1 Rrp4 Rrp43 Rrp6 Rtt103 Sif2 Snu56 Sto1 Tra1 Ume1 Ypr090w Fal1 Gcd1 Gcd6 Ubc4 Qcr7 Ufd4 Ydl100c Ybr014c Yer083c Ygl020c Ydr200c Yfr008w Yol054w Gac1 Pob3 Ycr030c Yor056c Ypr093c Rpb9 Rgp1 Hts1 Apm3 Apl5 Las17 Bzz1 Gal2 Pep1 Sla1 Sqt1 Vma6 Vrp1 Yhm2 Ynr065c Pds1 Pkh2 Ygr033c Cef1 Clf1 Snt309 Ygr205w Sat4 Pho85 Ydr516c Ygr165w Num1 Sef1 Skt5 Syf1 Ygl081w Nip1 Gsy2 Ylr016c Smc4 Ynl094w Ypl150w Aro4 Cbk1 Sgt2 Ssd1 Gip2 Hal5 Itr2 Kin82 Ynr047w Ksp1 Chs1 Pri2 Yhr186c Lem3 Nmd5 Yhr199c Ylr326w Ppq1 Yhl010c Nha1 Ybl049w Ykr017c Trx2 Usa1 Pex6 Chk1 Ctr1 Ssk22 Bud7 Mae1 Bfa1 Nup53 Ybr063c Dpb2 Lst8 Pho86 Srp54 Ykr051w Ylr271w Yml020w Ypr003c Rrp3 Slc1 Tfc7 Ygr266w Nog2 Met16 Slx1 Ybt1 Ynl040w Bmh1 Bnr1 Boi2 Kcs1 Nth1 Yfr017c Yil028w Cyk3 Sok1 Stu1 Svl3 Caf4 Ent2 Ybl029w Osh7 Sap1 Pex7 Fzo1 New1 Rrp9 Rtf1 Spt7 Ate1 Epl1 Mpt1 Vid21 Arp4 Esa1 Tuf1 Sdf1 Yel064c Snt1 Trm3 Yil112w Ylr409c Ymr155w Yrf1-3 Zds2 Wtm2 Swd3 Sfp1 Sgs1 Ydr316w Bud9 Dak2 Kin1 Ynl182c Q0032 Bub3 Q0092 Rpf1 Ypd1 Vps35 Ybr242w Yil137c Sec28 Ypl222w Bud3 Cin8 Hir1 Mak11 Mck1 Pnt1 Yil105c Msi1 Set3 Pkh1 Prp46 Cdc54 Ypl113c Sap155 Eap1 Ybr187w Ycr076c Ykr007w Btn2 Hxt5 Swe1 Ahc1 Kel1 Tep1 Mlp2 Tup1 Cyc8 Ssy5 Sph1 Ydl156w Rpl23b Bcp1 Ypl208w Rpn13 Pol2 Ygl104c Noc3 Yhl035c Hot1 Ydr116c Ykl082c Cna1 Ygr263c Ctk1 Cdc37 Hrb1 Elp2 Elp3 Jip1 Iki3 Zms1 Hat2 Hif1 Hog1 Rck2 Pac11 Ybl064c Dyn2 Pbs2 Pps1 Ade13 Ppz2 Yor054c Pwp2 Ygr210c Sec13 Sec31 Nup133 Yhl039w Las1 Yor283w Ycl039w Ydr255c Vid28 Fyv10 Vid30 Yer066ca Ygr223c Erg10 Yjr061w Kkq8 Yjl069c Arp10 Dip2 Dip5 Mum2 Yil055c Bmh2 Cln2 Gcn2 Ynl213c Pop2 Tfc4 Kel2 Bud14 Mih1 Pac2 Ydr449c Ygr130c Nsr1 Sul2 Ydr267c Ubp9 Yol111c Ygl131c Yor353c Ctk3 Stb3 Fap1 Mek1 Pcl9 Psr1 Ser1 Ufd2 Tsl1 Dsk2 Hmf1 Ydr049w Npl4 Rad23 Ylr247c Exo70 Cin5 Ymr291w Vps33 Ypl236c Hym1 Yfr039c Ltp1 Mot1 Mob2 Pcl6 Rpg1 Top1 Yfr011c Yol087c Tif35 Yor227w Aut10 Apg2 Ptc1 Ris1 Apg7 Ape3 Yml072c Cpa1 Prp28 Rpd3 Ydr266c Ymr093w Rok1 Are2 Bre1 Yhr149c Ypl055c Ime4 Pep3 Ymr086w Sum1 Lsb3 Yfr024c Prp12 Ylr422w Yor042w Yjl045w Ygr280c Cik1 Elm1 Scd5 Ydr412w Ptp2 Ydl063c Dps1 Msn5 Gal11 Prp31 Skm1 Hmg2 Swd1 Ygr067c Yir003w Rlp24 Gpi15 Yhr046c Bub2 Ism1 Ylr152c Dcp2 Crz1 Dcp1 Sfb3 Tgl1 Ybr225w Sgm1 Mds3 Pin4 Sas10 Yel015w Ynl207w Rpm2 Cdc25 Ltv1 Yor215c Mec1 Rad27 Yhr196w Tps2 Scs2 Mms22 Esc4 Gdh2 Arl3 Chd1 Mlc2 Prs3 Ira1 Rpa12 Yer067w Yhr087w Ura3 Isa1 Ygr150c Ygr198w Red1 Rim11 Gcr2 Yjr028w Cki1 Pmd1 Prs2 Yer160c Yjr027w Ybr094w Ybr267w Ydr339c Pmc1 Ydr365c Ynr054c Ycr087w Ydr102c Yfr003c Yfr016c Cap2 Cap1 Ylr427w Are1 Msc3 Grs1 Swi1 Kex2 Tao3 Ccr4 Cdc36 Yer084w Dbp2 Osh3 Ptp3 Swi5 Stb4 Faa2 Hfi1 Ygr002c Tel1 Ybr280c Ybr139w Aah1 Ycr001w Yfl034w Ypr143w Yhr105w Ssq1 Mub1 Spb4 Ubr2 Sth1 Ysc84 Scp1 Sps1 Apm1 Shs1 Gcn5 Ada2 Taf60 Sgf29 Hap2 Cis1 Ykl214c Ypr085c Hap5 Ypl166w Nop16 Ynl063w Grh1 Hap3 Mdh2 Exg1 Yil108w Med4 Zrg17 Psr2 Ssl2 Rad55 Rck1 Rim15 Rlf2 Ygl060w Ski8 Ski2 Ski3 Sln1 Taf90 Prp40 Ykl099c Yml093w Ykr060w Yor145c Lcp5 Caf20 Cdc20 Mad3 Cse2 Ime2 Sdh2 Ydr372c Ykr046c Ste4 Ybl036c Ydl193w Ydr482c Ygr054w Ino4 Ydr324c Msh3 Ygl220w Yll029w Yjr110w Rgr1 Yfl030w Pcl7 Efb1 Kgd2 Krs1 Pmi40 Lpd1 Arg1 Trp5 Ara1 Thr4 Sac6 Ape2 Pab1 Mdh3 Acs2 Hom2 Ydl124w Bat1 Gcy1 Ade6 Cys3 Ade3 Ura1 Rho4 Msn4 Van1 Mnn9 Bni1 Cdc47 Yjr029w Trp3 Imh1 Pdx3 Thi21 Ade17 Ymr145c Ktr3 Nop14 Mdj1 Cvt19 Dld3 Cdc123 Sui1 Arg4 Rho1 Yfr044c Ilv3 Apg5 Ymr315w Ymr102c Afg3 Tfg2 Bgl2 Cbp6 Psd1 Oac1 Pet9 Ayr1 Pro3 Scw4 Msh6 Yil104c Ymr323w Pda1 Nbp2 Ppz1 Snu66 Sks1 Nup145 Ctf19 Ypl181w Gsf2 Spc24 Glt1 Spc25 Ymr196w Rmt2 Yjr070c Sen2 Asc1 Trp2 Rcl1 Apl2 Ylr328w Apl4 Pib2 Rlp7 Apm4 Rad6 Yjl107c Sec21 Erg27 Gpt2 Ynl181w Ydl204w Om45 Ret2 Yer049w Ydr398w Sgd1 Pox1 Npr2 Yer182w Vps8 Ydr233c Vid24 Yor172w Cac2 Ydl113c Ynl124w Sac1 Wbp1 Hsm3 Rpl5 Nup120 Vps41 Crc1 Figure S3: View of the entire HMS-PCI Dataset. Thick blue lines represent literature-derived interactions from PreBIND+MIPS in the HMS-PCI dataset. Thin orange lines represent potential novel interactions. Arrows point from bait to associated protein. Functional annotation derived from Gene Ontology.( Ho et al., Nature, 2002 New technologies determine proteinprotein interactions on a large scale Such datasets are being increasingly produced, and are publicly available

11 11 Outline 1. Protein-protein interactions 2. Using graph structures to study protein-protein interactions 3. Clustering of graphs 4. Evaluation of clusters

12 12 Representing protein-protein interactions using graphs Interaction attributes: type confidence direction experimentally determined interaction protein A protein B Protein attributes: name function quantitative data Protein attributes: name function quantitative data

13 13 Representing protein-protein interactions using graphs Interaction attributes: type confidence direction experimentally determined interaction Experimental artifact protein A protein B Protein attributes: name function quantitative data Protein attributes: name function quantitative data

14 14 Representing protein-protein interactions using graphs Use graphs to represent the large-scale information on proteins, interactions and their attributes

15 Graph-based representation 15 of protein-protein interactions Ste4 Wtm1 Akl1 Psr1 Ynl094w Cdh1 Nip1 Aat vt19 9c Cln1 Yol111c Rgd1 Rrp42 Ilv3 Ubp7 Ste12 Apa1 Gis4 Aro1 Arp3 Ybr280c Pcl9 Sph1 Arc35 Rpa49 Ynl201c Bul1 Tup1 Ppq1 Yjr141w Ski8 Met10 Ski2 Yfr044c Caf20 Trp2 Nsr1 Esc4 Arf1 Cdc3 ce DNA DNA met DNA recomb DNA replication general metabolism mating nucleolar and ribosome bio protein amino acid phosphorylat protein biosynthesis protein degradation Ubr1 Cdc23 Ach Ygr130c Bck1 Pkh2 Yil113w Pdx3 Whi2 Mms22 Rad9 R View data as a graph Proteins are nodes and interactions are edges Nodes have attributes e.g. known function Directed edges experimental artifact

16 The interactions are determined 16 by tag-affinity purification (TAP) a 1 Tag b c Bait Isolate protein complex 5 SDS PAGE Affinity column A protein ( bait ) is labeled by a chemical The bait forms its interactions (collects prey ) The bait, and all other proteins in the complex are isolated d Protein 1 Protein 2 Protein 3 Protein 4 Protein 5 Excise bands Digest with trypsin Analyse by mass spectrometry and bioinformatics All components of the complex are identified by mass spectrometry Kumar & Snyder, Nature, 2002

17 The interactions are determined 17 by tag-affinity purification (TAP) prey Tag bait KLNFMTP... protein complex PNGFLKK... SRKNFSL... KFWQTY... KKRLMTP... cell 1. Tagging 2. Purification 3. Identification

18 The technology yields false positive and false negative interactions 18 tag Can not distinguish between various types of complexes chain star complete graph Use the spoke model to represent results of experiments directed edges from bait to prey multiple proteins in a complex can be used as a bait direction of edges reflects experimental design, but not the underlying biology bait spoke prey

19 19 Global graph-based summaries: degree of a node Each node is labeled with its degree Degree of a node: the number of edges that the node has to other nodes degree distribution: fraction of nodes in the network with a different degree mean degree: average degree over all nodes

20 20 Degree distribution: Gavin et al., 2002 Node Count Degree Only a few nodes have a large number of edges

21 21 Global graph-based summaries: clustering coefficient Clustering coefficient of a node: the fraction of the neighbors of a node that are also neighbors Clustering coefficient of a network: average clustering coefficient over all neighbors

22 22 Mean degree vs clustering coefficient of experimental networks Krogan2004 Clustering coefficients Tong2002 Hazbun2003 Ho2002 Uetz2000 Li2004 Ito2001 Stelzl2005 Gavin2002 Gavin2006 Krogan Mean degree APMS Y2H Two technologies: tag affinity purification yeast-twohybrid

23 23 Conclusion from these summaries for protein interaction networks: Most nodes have a low degree (i.e. few neighbors) Some nodes have a high clustering coefficient (i.e. their neighbors are also neighbors) Of interest are protein clusters (i.e. groups of proteins that interact with each other more closely than outside the group) close interactions can help infer biological function challenge: large and noisy datasets

24 24 Outline 1. Protein-protein interactions 2. Using graph structures to study protein-protein interactions 3. Clustering of graphs 4. Evaluation of clusters

25 Our goal: find protein clusters in the 25 large and noisy interaction graph Gavin et al., Nature, 2002

26 26 Step 1: de-noise the interaction graph We are more confident in protein interactions if they are determined using multiple baits remove isolated subgraphs determine connected components subgraphs where there is a directed path from each protein to every other protein Gavin et al., Nature, 2002

27 27 Step 1: de-noise the interaction graph We are more confident in protein interactions if they are determined using multiple baits remove isolated subgraphs determine connected components subgraphs where there is a directed path from each protein to every other protein Isolated component Gavin et al., Nature, 2002

28 Step 2: based on the graph topology, 28 find protein clusters in the connected components Finding clusters ignore directions of edges use Markov Cluster (MCL) algorithm for clustering The output are sets of closely interacting proteins Not every protein is expected to cluster Gavin et al., Nature, 2002

29 Step 2: based on the graph topology, 29 find protein clusters in the connected components Output of a clustering procedure Exosome example: additional proteins were found by clustering the network Gavin et al., Nature, 2002 Gavin et al., Nature, 2006

networks? networks are spaces generated by interaction

networks? networks are spaces generated by interaction networks? networks are spaces generated by interaction DNA damage response and repair DNA metabolism DNA recombination DNA replication RNA localization and processing autophagy budding carbohydrate metabolism

More information

CELL CYCLE RESPONSE STRESS AVGPCC. YER179W DMC1 meiosis-specific protein unclear

CELL CYCLE RESPONSE STRESS AVGPCC. YER179W DMC1 meiosis-specific protein unclear ORFNAME LOCUS DESCRIPTION DATE HUBS: ESSENTIAL k AVGPCC STRESS RESPONSE CELL CYCLE PHEROMONE TREATMENT UNFOLDED PROTEIN RESPONSE YER179W DMC1 meiosis-specific protein 9-0.132-0.228-0.003-0.05 0.138 0.00

More information

Identification of Topological Network Modules in Perturbed Protein Interaction Networks. Supplementary Figure S1: Relative abundance of INO80 subunits

Identification of Topological Network Modules in Perturbed Protein Interaction Networks. Supplementary Figure S1: Relative abundance of INO80 subunits Supplementary Information For: Identification of Topological Network Modules in Perturbed Protein Interaction Networks Mihaela E. Sardiu 1#, Joshua M. Gilmore 1#, Brad Groppe 2, Laurence Florens 1, and

More information

GENETICS. Supporting Information

GENETICS. Supporting Information GENETICS Supporting Information http://www.genetics.org/cgi/content/full/genetics.110.117655/dc1 Trivalent Arsenic Inhibits the Functions of Chaperonin Complex XuewenPan,StefanieReissman,NickR.Douglas,ZhiweiHuang,DanielS.Yuan,

More information

Cy, Nu. Pre9 Rpn10 Rpn2 Rpn8 141: Pre10 Pre2 Pre3 Pre6 Pup3 Scl1

Cy, Nu. Pre9 Rpn10 Rpn2 Rpn8 141: Pre10 Pre2 Pre3 Pre6 Pup3 Scl1 Supplementary table S2. List of protein es. The first column provides the identifier and the second the name of the, if known. The third column contains the core proteins (i.e. those present in 2/3 rd

More information

Using graphs to relate expression data and protein-protein interaction data

Using graphs to relate expression data and protein-protein interaction data Using graphs to relate expression data and protein-protein interaction data R. Gentleman and D. Scholtens October 31, 2017 Introduction In Ge et al. (2001) the authors consider an interesting question.

More information

Alterations in DNA Replication and Histone Levels Promote Histone Gene Amplification in Saccharomyces cerevisiae

Alterations in DNA Replication and Histone Levels Promote Histone Gene Amplification in Saccharomyces cerevisiae Supporting Information http://www.genetics.org/cgi/content/full/genetics.109.113662/dc1 Alterations in DNA Replication and Histone Levels Promote Histone Gene Amplification in Saccharomyces cerevisiae

More information

MRP4 MRPS5. function. Their functional link can be checked by double or triple deletion experiments. YDR036C

MRP4 MRPS5. function. Their functional link can be checked by double or triple deletion experiments. YDR036C Table 5. The list of 202 derived functional modules MRP4 and MRPS5 are structural constituents of mitochondrial small ribosomal subunit involved in protein biosynthesis. Therefore, yet unannotated YDR036C

More information

Hierarchical modelling of automated imaging data

Hierarchical modelling of automated imaging data Darren Wilkinson http://tinyurl.com/darrenjw School of Mathematics & Statistics, Newcastle University, UK Theory of Big Data 2 UCL, London 6th 8th January, 2016 Overview Background: Budding yeast as a

More information

Introduction to Microarray Data Analysis and Gene Networks lecture 8. Alvis Brazma European Bioinformatics Institute

Introduction to Microarray Data Analysis and Gene Networks lecture 8. Alvis Brazma European Bioinformatics Institute Introduction to Microarray Data Analysis and Gene Networks lecture 8 Alvis Brazma European Bioinformatics Institute Lecture 8 Gene networks part 2 Network topology (part 2) Network logics Network dynamics

More information

From protein networks to biological systems

From protein networks to biological systems FEBS 29314 FEBS Letters 579 (2005) 1821 1827 Minireview From protein networks to biological systems Peter Uetz a,1, Russell L. Finley Jr. b, * a Research Center Karlsruhe, Institute of Genetics, P.O. Box

More information

A Re-annotation of the Saccharomyces cerevisiae Genome

A Re-annotation of the Saccharomyces cerevisiae Genome Comparative and Functional Genomics Comp Funct Genom 2001; 2: 143 154. DOI: 10.1002 / cfg.86 Research Article A Re-annotation of the Saccharomyces cerevisiae Genome V. Wood*, K. M. Rutherford, A Ivens,

More information

The yeast interactome (unit: g303204)

The yeast interactome (unit: g303204) The yeast interactome (unit: g303204) Peter Uetz 1 & Andrei Grigoriev 2 1 Institute of Genetics (ITG), Forschungszentrum Karlsruhe, Karlsruhe, Germany 2 GPC Biotech, Martinsried, Germany Addresses 1 Institute

More information

Linking the Signaling Cascades and Dynamic Regulatory Networks Controlling Stress Responses

Linking the Signaling Cascades and Dynamic Regulatory Networks Controlling Stress Responses Linking the Signaling Cascades and Dynamic Regulatory Networks Controlling Stress Responses Anthony Gitter, Miri Carmi, Naama Barkai, and Ziv Bar-Joseph Supplementary Information Supplementary Results

More information

Defining the Budding Yeast Chromatin Associated Interactome

Defining the Budding Yeast Chromatin Associated Interactome Supplementary Data Defining the Budding Yeast Chromatin Associated Interactome Jean-Philippe Lambert 1, Jeffrey Fillingham 2,3, Mojgan Siahbazi 1, Jack Greenblatt 3, Kristin Baetz 1, Daniel Figeys 1 1

More information

Modeling and Visualizing Uncertainty in Gene Expression Clusters using Dirichlet Process Mixtures

Modeling and Visualizing Uncertainty in Gene Expression Clusters using Dirichlet Process Mixtures JOURNAL OF LATEX CLASS FILES, VOL. 1, NO. 8, AUGUST 2002 1 Modeling and Visualizing Uncertainty in Gene Expression Clusters using Dirichlet Process Mixtures Carl Edward Rasmussen, Bernard J de la Cruz,

More information

Bayesian inference for stochastic kinetic models. intracellular reaction networks

Bayesian inference for stochastic kinetic models. intracellular reaction networks for stochastic models of intracellular reaction networks Darren Wilkinson School of Mathematics & Statistics and Centre for Integrated Systems Biology of Ageing and Nutrition Newcastle University, UK IMA

More information

Signal recognition YKL122c An01g02800 strong similarity to signal recognition particle 68K protein SRP68 - Canis lupus

Signal recognition YKL122c An01g02800 strong similarity to signal recognition particle 68K protein SRP68 - Canis lupus Supplementary Table 16 Components of the secretory pathway Aspergillus niger A.niger orf A.niger gene Entry into ER Description of putative Aspergillus niger gene Best homolog to putative A.niger gene

More information

Hotspots and Causal Inference For Yeast Data

Hotspots and Causal Inference For Yeast Data Hotspots and Causal Inference For Yeast Data Elias Chaibub Neto and Brian S Yandell October 24, 2012 Here we reproduce the analysis of the budding yeast genetical genomics data-set presented in Chaibub

More information

Ascospore Formation in the Yeast Saccharomyces cerevisiae

Ascospore Formation in the Yeast Saccharomyces cerevisiae MICROBIOLOGY AND MOLECULAR BIOLOGY REVIEWS, Dec. 2005, p. 565 584 Vol. 69, No. 4 1092-2172/05/$08.00 0 doi:10.1128/mmbr.69.4.565 584.2005 Copyright 2005, American Society for Microbiology. All Rights Reserved.

More information

Supplementary Information

Supplementary Information Supplementary Information For the article "The organization of transcriptional activity in the yeast, S. cerevisiae" by I. J. Farkas, H. Jeong, T. Vicsek, A.-L. Barabási, and Z. N. Oltvai For the Referees

More information

Systems biology and biological networks

Systems biology and biological networks Systems Biology Workshop Systems biology and biological networks Center for Biological Sequence Analysis Networks in electronics Radio kindly provided by Lazebnik, Cancer Cell, 2002 Systems Biology Workshop,

More information

Adaptation of Saccharomyces cerevisiae to high hydrostatic pressure causing growth inhibition

Adaptation of Saccharomyces cerevisiae to high hydrostatic pressure causing growth inhibition FEBS 29557 FEBS Letters 579 (2005) 2847 2852 Adaptation of Saccharomyces cerevisiae to high hydrostatic pressure causing growth inhibition Hitoshi Iwahashi a, *, Mine Odani b, Emi Ishidou a, Emiko Kitagawa

More information

Detecting topological patterns in complex networks

Detecting topological patterns in complex networks Detecting topological patterns in complex networks Sergei Maslov Brookhaven National Laboratory Networks in complex systems Complex systems Large number of components They interact with each other All

More information

Phylogenetic classification of transporters and other membrane proteins from Saccharomyces cerevisiae

Phylogenetic classification of transporters and other membrane proteins from Saccharomyces cerevisiae Funct Integr Genomics (2002) 2:154 170 DOI 10.1007/s10142-002-0060-8 REVIEW Benoît De Hertogh Elvira Carvajal Emmanuel Talla Bernard Dujon Philippe Baret André Goffeau Phylogenetic classification of transporters

More information

The Repetitive Sequence Database and Mining Putative Regulatory Elements in Gene Promoter Regions ABSTRACT

The Repetitive Sequence Database and Mining Putative Regulatory Elements in Gene Promoter Regions ABSTRACT JOURNAL OF COMPUTATIONAL BIOLOGY Volume 9, Number 4, 2002 Mary Ann Liebert, Inc. Pp. 621 640 The Repetitive Sequence Database and Mining Putative Regulatory Elements in Gene Promoter Regions JORNG-TZONG

More information

(Fold Change) acetate biosynthesis ALD6 acetaldehyde dehydrogenase 3.0 acetyl-coa biosynthesis ACS2 acetyl-coa synthetase 2.5

(Fold Change) acetate biosynthesis ALD6 acetaldehyde dehydrogenase 3.0 acetyl-coa biosynthesis ACS2 acetyl-coa synthetase 2.5 Supplementary Table 1 Overexpression strains that showed resistance or hypersensitivity to rapamycin. Strains that were more than two-fold enriched on average (and at least 1.8-fold enriched in both experiment

More information

ActiveNetworks Cross-Condition Analysis of Functional Genomic Data

ActiveNetworks Cross-Condition Analysis of Functional Genomic Data ActiveNetworks Cross-Condition Analysis of Functional Genomic Data T. M. Murali April 18, 2006 Motivation: Manual Systems Biology Biologists want to study a favourite stress, e.g., oxidative stress or

More information

DISCOVERING PROTEIN COMPLEXES IN DENSE RELIABLE NEIGHBORHOODS OF PROTEIN INTERACTION NETWORKS

DISCOVERING PROTEIN COMPLEXES IN DENSE RELIABLE NEIGHBORHOODS OF PROTEIN INTERACTION NETWORKS 1 DISCOVERING PROTEIN COMPLEXES IN DENSE RELIABLE NEIGHBORHOODS OF PROTEIN INTERACTION NETWORKS Xiao-Li Li Knowledge Discovery Department, Institute for Infocomm Research, Heng Mui Keng Terrace, 119613,

More information

Causal Model Selection Hypothesis Tests in Systems Genetics: a tutorial

Causal Model Selection Hypothesis Tests in Systems Genetics: a tutorial Causal Model Selection Hypothesis Tests in Systems Genetics: a tutorial Elias Chaibub Neto and Brian S Yandell July 2, 2012 1 Motivation Current efforts in systems genetics have focused on the development

More information

Screening the yeast deletant mutant collection for hypersensitivity and hyper-resistance to sorbate, a weak organic acid food preservative

Screening the yeast deletant mutant collection for hypersensitivity and hyper-resistance to sorbate, a weak organic acid food preservative Yeast Yeast 2004; 21: 927 946. Published online in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/yea.1141 Yeast Functional Analysis Report Screening the yeast deletant mutant collection

More information

Supplemental Table S2. List of 166 Transcription Factors Deleted in Mutants Assayed in this Study ORF Gene Description

Supplemental Table S2. List of 166 Transcription Factors Deleted in Mutants Assayed in this Study ORF Gene Description Supplemental Table S2. List of 166 Transcription Factors Deleted in Mutants Assayed in this Study ORF Gene Description YER045C ACA1* a Basic leucine zipper (bzip) transcription factor of the ATF/CREB family,

More information

Table S1, Yvert et al.

Table S1, Yvert et al. Table S1, Yvert et al. List of 593 clusters of genes showing correlated expression in the cross. For each gene, 90 expression values (BY: 2 independent cultures, RM: 2 independent cultures, 86 segregants:

More information

Bayesian networks for reconstructing transcriptional regulatory networks

Bayesian networks for reconstructing transcriptional regulatory networks Bayesian networks for reconstructing transcriptional regulatory networks Prof Su-In Lee Computer Science & Genome Sciences University of Washington, Seattle GENOME 541 Introduction to Computational Molecular

More information

Finding molecular complexes through multiple layer clustering of protein interaction networks. Bill Andreopoulos* and Aijun An

Finding molecular complexes through multiple layer clustering of protein interaction networks. Bill Andreopoulos* and Aijun An Int. J. Bioinformatics Research and Applications, Vol. x, No. x, xxxx 1 Finding molecular complexes through multiple layer clustering of protein interaction networks Bill Andreopoulos* and Aijun An Department

More information

Genome-wide expression screens indicate a global role for protein kinase CK2 in chromatin remodeling

Genome-wide expression screens indicate a global role for protein kinase CK2 in chromatin remodeling JCS epress online publication date 4 March 2003 Research Article 1563 Genome-wide expression screens indicate a global role for protein kinase CK2 in chromatin remodeling Thomas Barz, Karin Ackermann,

More information

Causal Graphical Models in Systems Genetics

Causal Graphical Models in Systems Genetics 1 Causal Graphical Models in Systems Genetics 2013 Network Analysis Short Course - UCLA Human Genetics Elias Chaibub Neto and Brian S Yandell July 17, 2013 Motivation and basic concepts 2 3 Motivation

More information

Yeast require an Intact Tryptophan Biosynthesis Pathway and Exogenous Tryptophan for Resistance to Sodium Dodecyl Sulfate

Yeast require an Intact Tryptophan Biosynthesis Pathway and Exogenous Tryptophan for Resistance to Sodium Dodecyl Sulfate Yeast require an Intact Tryptophan Biosynthesis Pathway and Exogenous Tryptophan for Resistance to Sodium Dodecyl Sulfate Laura M. Ammons 1,2, Logan R. Bingham 1,2, Sarah Callery 1,2, Elizabeth Corley

More information

Discussion of Function talk IV

Discussion of Function talk IV Discussion of Function talk IV 1. Enzyme Seq-Fun 2. Annotation 3. Integration Sequence and Function relationship taking one example: Enzymes well known functionality is defines conserved essential tree

More information

Genome-wide protein interaction screens reveal functional networks involving Sm-like proteins

Genome-wide protein interaction screens reveal functional networks involving Sm-like proteins Yeast Yeast 2000; 17: 95±110. Research Article Genome-wide protein interaction screens reveal functional networks involving Sm-like proteins Micheline Fromont-Racine 1{, Andrew E. Mayes 2{, Adeline Brunet-Simon

More information

From Networks through Genes to Mechanisms: Understanding Robustness. Animesh Ray Keck Graduate Institute

From Networks through Genes to Mechanisms: Understanding Robustness. Animesh Ray Keck Graduate Institute From Networks through Genes to Mechanisms: Understanding Robustness Animesh Ray Keck Graduate Institute Our Aim: Is modularity a design principle in the evolution of cells? Mechanism Organism Network Modules

More information

Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks

Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks Seiya Imoto 1, Tomoyuki Higuchi 2, Takao Goto 1, Kousuke Tashiro 3, Satoru Kuhara 3 and Satoru Miyano 1

More information

Inferring Transcriptional Regulatory Networks from Gene Expression Data II

Inferring Transcriptional Regulatory Networks from Gene Expression Data II Inferring Transcriptional Regulatory Networks from Gene Expression Data II Lectures 9 Oct 26, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday

More information

Matrix-based pattern matching

Matrix-based pattern matching Regulatory sequence analysis Matrix-based pattern matching Jacques van Helden Jacques.van-Helden@univ-amu.fr Aix-Marseille Université, France Technological Advances for Genomics and Clinics (TAGC, INSERM

More information

Comparative Methods for the Analysis of Gene-Expression Evolution: An Example Using Yeast Functional Genomic Data

Comparative Methods for the Analysis of Gene-Expression Evolution: An Example Using Yeast Functional Genomic Data Comparative Methods for the Analysis of Gene-Expression Evolution: An Example Using Yeast Functional Genomic Data Todd H. Oakley,* 1 Zhenglong Gu,* Ehab Abouheif, 3 Nipam H. Patel, 2 and Wen-Hsiung Li*

More information

Research Article Predicting Protein Complexes in Weighted Dynamic PPI Networks Based on ICSC

Research Article Predicting Protein Complexes in Weighted Dynamic PPI Networks Based on ICSC Hindawi Complexity Volume 2017, Article ID 4120506, 11 pages https://doi.org/10.1155/2017/4120506 Research Article Predicting Protein Complexes in Weighted Dynamic PPI Networks Based on ICSC Jie Zhao,

More information

DNA microarrays simultaneously monitor the expression

DNA microarrays simultaneously monitor the expression Transitive functional annotation by shortest-path analysis of gene expression data Xianghong Zhou*, Ming-Chih J. Kao*, and Wing Hung Wong* *Department of Biostatistics, Harvard School of Public Health,

More information

MINIREVIEW. Eric M. Rubenstein and Martin C. Schmidt*

MINIREVIEW. Eric M. Rubenstein and Martin C. Schmidt* EUKARYOTIC CELL, Apr. 2007, p. 571 583 Vol. 6, No. 4 1535-9778/07/$08.00 0 doi:10.1128/ec.00026-07 Copyright 2007, American Society for Microbiology. All Rights Reserved. MINIREVIEW Mechanisms Regulating

More information

ppendix E - Growth Rates of Individual Genes on Various Non-Fementable Carbon So ORF Gene Lactate Glycerol EtOH Description

ppendix E - Growth Rates of Individual Genes on Various Non-Fementable Carbon So ORF Gene Lactate Glycerol EtOH Description ppendix E - Growth Rates of Individual Genes on Various Non-Fementable Carbon So ORF Gene Lactate Glycerol EtOH Description YAL1C MDM1 2 2 Mitochondrial outer membrane protein involved in mitochondrial

More information

Electronic Supplementary Material (ESI) for Integrative Biology This journal is The Royal Society of Chemistry Figure S1

Electronic Supplementary Material (ESI) for Integrative Biology This journal is The Royal Society of Chemistry Figure S1 Electronic Supplementary Material (ESI) for Integrative iology Figure S1 Electronic Supplementary Material (ESI) for Integrative iology SMNet specificity in permuted networks SMNet sensitivity in permuted

More information

Ross E Curtis 1,2, Seyoung Kim 2, John L Woolford Jr 3, Wenjie Xu 3 and Eric P Xing 4*

Ross E Curtis 1,2, Seyoung Kim 2, John L Woolford Jr 3, Wenjie Xu 3 and Eric P Xing 4* Curtis et al. BMC Genomics 2013, 14:196 RESEARCH ARTICLE Open Access Structured association analysis leads to insight into Saccharomyces cerevisiae gene regulation by finding multiple contributing eqtl

More information

Detecting temporal protein complexes from dynamic protein-protein interaction networks

Detecting temporal protein complexes from dynamic protein-protein interaction networks Detecting temporal protein complexes from dynamic protein-protein interaction networks Le Ou-Yang, Dao-Qing Dai, Xiao-Li Li, Min Wu, Xiao-Fei Zhang and Peng Yang 1 Supplementary Table Table S1: Comparative

More information

Robust Sparse Estimation of Multiresponse Regression and Inverse Covariance Matrix via the L2 distance

Robust Sparse Estimation of Multiresponse Regression and Inverse Covariance Matrix via the L2 distance Robust Sparse Estimatio of Multirespose Regressio ad Iverse Covariace Matrix via the L2 distace Aurélie C. Lozao IBM Watso Research Ceter Yorktow Heights, New York aclozao@us.ibm.com Huijig Jiag IBM Watso

More information

Case story: Analysis of the Cell Cycle

Case story: Analysis of the Cell Cycle DNA microarray analysis, January 2 nd 2006 Case story: Analysis of the Cell Cycle Center for Biological Sequence Analysis Outline Introduction Cell division and cell cycle regulation Experimental studies

More information

Intersection of RNA Processing and Fatty Acid Synthesis and Attachment in Yeast Mitochondria

Intersection of RNA Processing and Fatty Acid Synthesis and Attachment in Yeast Mitochondria Intersection of RNA Processing and Fatty Acid Synthesis and Attachment in Yeast Mitochondria Item Type text; Electronic Dissertation Authors Schonauer, Melissa Publisher The University of Arizona. Rights

More information

Analysis and visualization of protein-protein interactions. Olga Vitek Assistant Professor Statistics and Computer Science

Analysis and visualization of protein-protein interactions. Olga Vitek Assistant Professor Statistics and Computer Science 1 Analysis and visualization of protein-protein interactions Olga Vitek Assistant Professor Statistics and Computer Science 2 Outline 1. Protein-protein interactions 2. Using graph structures to study

More information

Clustering gene expression data using graph separators

Clustering gene expression data using graph separators 1 Clustering gene expression data using graph separators Bangaly Kaba 1, Nicolas Pinet 1, Gaëlle Lelandais 2, Alain Sigayret 1, Anne Berry 1. LIMOS/RR-07-02 13/02/2007 - révisé le 11/05/2007 1 LIMOS, UMR

More information

Received 17 January 2005/Accepted 22 February 2005

Received 17 January 2005/Accepted 22 February 2005 EUKARYOTIC CELL, May 2005, p. 849 860 Vol. 4, No. 5 1535-9778/05/$08.00 0 doi:10.1128/ec.4.5.849 860.2005 Copyright 2005, American Society for Microbiology. All Rights Reserved. A Two-Hybrid Screen of

More information

Constraint-Based Workshops

Constraint-Based Workshops Constraint-Based Workshops 2. Reconstruction Databases November 29 th, 2007 Defining Metabolic Reactions ydbh hslj ldha 1st level: Primary metabolites LAC 2nd level: Neutral Formulas C 3 H 6 O 3 Charged

More information

Corrections. MEDICAL SCIENCES. For the article Inhibitors of soluble epoxide

Corrections. MEDICAL SCIENCES. For the article Inhibitors of soluble epoxide Corrections MEDICAL SCIENCES. For the article Inhibitors of soluble epoxide hydrolase attenuate vascular smooth muscle cell proliferation, by Benjamin B. Davis, David A. Thompson, Laura L. Howard, Christophe

More information

Rule learning for gene expression data

Rule learning for gene expression data Rule learning for gene expression data Stefan Enroth Original slides by Torgeir R. Hvidsten The Linnaeus Centre for Bioinformatics Predicting biological process from gene expression time profiles Papers:

More information

A network of protein protein interactions in yeast

A network of protein protein interactions in yeast A network of protein protein interactions in yeast Benno Schwikowski 1,2 *, Peter Uetz 3, and Stanley Fields 3,4 1 The Institute for Systems Biology, 4225 Roosevelt Way NE, Suite 200, Seattle, WA 98105.

More information

PROTEINS are the building blocks of all the organisms and

PROTEINS are the building blocks of all the organisms and IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, VOL. 9, NO. 3, MAY/JUNE 2012 717 A Coclustering Approach for Mining Large Protein-Protein Interaction Networks Clara Pizzuti and Simona

More information

Discovering modules in expression profiles using a network

Discovering modules in expression profiles using a network Discovering modules in expression profiles using a network Igor Ulitsky 1 2 Protein-protein interactions (PPIs) Low throughput measurements: accurate, scarce High throughput: more abundant, noisy Large,

More information

Bioinformatics. Transcriptome

Bioinformatics. Transcriptome Bioinformatics Transcriptome Jacques.van.Helden@ulb.ac.be Université Libre de Bruxelles, Belgique Laboratoire de Bioinformatique des Génomes et des Réseaux (BiGRe) http://www.bigre.ulb.ac.be/ Bioinformatics

More information

Dissection of transcriptional regulation networks and prediction of gene functions in Saccharomyces cerevisiae Boorsma, A.

Dissection of transcriptional regulation networks and prediction of gene functions in Saccharomyces cerevisiae Boorsma, A. UvA-DARE (Digital Academic Repository) Dissection of transcriptional regulation networks and prediction of gene functions in Saccharomyces cerevisiae Boorsma, A. Link to publication Citation for published

More information

NETWORK CLUSTERING METHODS

NETWORK CLUSTERING METHODS 1 NETWORK CLUSTERING METHODS Dr. Alioune Ngom School of Computer Science University of Windsor angom@uwindsor.ca Winter 2013 Why clustering? 2 A cluster is a group of related objects In biological nets,

More information

Large-scale phenotypic analysis reveals identical contributions to cell functions of known and unknown yeast genes

Large-scale phenotypic analysis reveals identical contributions to cell functions of known and unknown yeast genes Yeast Yeast 2001; 18: 1397 1412. DOI: 10.1002 /yea.784 Yeast Functional Analysis Report Large-scale phenotypic analysis reveals identical contributions to cell functions of known and unknown yeast genes

More information

Predicting Protein Functions and Domain Interactions from Protein Interactions

Predicting Protein Functions and Domain Interactions from Protein Interactions Predicting Protein Functions and Domain Interactions from Protein Interactions Fengzhu Sun, PhD Center for Computational and Experimental Genomics University of Southern California Outline High-throughput

More information

Cytoscape An open-source software platform for the exploration of molecular interaction networks

Cytoscape An open-source software platform for the exploration of molecular interaction networks Cytoscape An open-source software platform for the exploration of molecular interaction networks Systems Biology Group UP Biologie Systémique Institut Pasteur, Paris Overview 1. Molecular interaction networks

More information

Genetic basis of mitochondrial function and morphology in Saccharomyces. Kai Stefan Dimmer, Stefan Fritz, Florian Fuchs, Marlies Messerschmitt, Nadja

Genetic basis of mitochondrial function and morphology in Saccharomyces. Kai Stefan Dimmer, Stefan Fritz, Florian Fuchs, Marlies Messerschmitt, Nadja MBC in Press, published on February 4, 2002 as 10.1091/mbc.01-12-0588 Genetic basis of mitochondrial function and morphology in Saccharomyces cerevisiae Kai Stefan Dimmer, Stefan Fritz, Florian Fuchs,

More information

Active Interaction Mapping Reveals the Hierarchical Organization of Autophagy

Active Interaction Mapping Reveals the Hierarchical Organization of Autophagy Resource Active Interaction Mapping Reveals the Hierarchical Organization of Autophagy Graphical Abstract Authors Michael H. Kramer, Jean-Claude Farré, Koyel Mitra,..., J. Michael Cherry, Suresh Subramani,

More information

Using Networks to Integrate Omic and Semantic Data: Towards Understanding Protein Function on a Genome Scale

Using Networks to Integrate Omic and Semantic Data: Towards Understanding Protein Function on a Genome Scale Using Networks to Integrate Omic and Semantic Data: Towards Understanding Protein Function on a Genome Scale Biomarker Data Analysis.10.01, 9:25-9:50 Mark B Gerstein Yale (Comp. Bio. & Bioinformatics)

More information

Compositional Correlation for Detecting Real Associations. Among Time Series

Compositional Correlation for Detecting Real Associations. Among Time Series Compositional Correlation for Detecting Real Associations Among Time Series Fatih DIKBAS Civil Engineering Department, Pamukkale University, Turkey Correlation remains to be one of the most widely used

More information

Spatio-temporal dynamics of yeast mitochondrial biogenesis: transcriptional and post-transcriptional. mrna oscillatory modules.

Spatio-temporal dynamics of yeast mitochondrial biogenesis: transcriptional and post-transcriptional. mrna oscillatory modules. Spatio-temporal dynamics of yeast mitochondrial biogenesis: transcriptional and post-transcriptional mrna oscillatory modules. Gaëlle Lelandais, Yann Saint-Georges, Colette Geneix, Liza Al-Shikhley, Geneviève

More information

Supplemental Material can be found at:

Supplemental Material can be found at: Supplemental Material can be found at: http://www.jbc.org/cgi/content/full/m705570200/dc1 THE JOURNAL OF BIOLOGICAL CHEMISTRY VOL. 283, NO. 13, pp. 8318 8330, March 28, 2008 2008 by The American Society

More information

A L A BA M A L A W R E V IE W

A L A BA M A L A W R E V IE W A L A BA M A L A W R E V IE W Volume 52 Fall 2000 Number 1 B E F O R E D I S A B I L I T Y C I V I L R I G HT S : C I V I L W A R P E N S I O N S A N D TH E P O L I T I C S O F D I S A B I L I T Y I N

More information

ACCEPTED. Title: Increased respiration in the sch9 mutant is required for increasing chronological life span but not replicative life span

ACCEPTED. Title: Increased respiration in the sch9 mutant is required for increasing chronological life span but not replicative life span EC Accepts, published online ahead of print on 9 May 2008 Eukaryotic Cell doi:10.1128/ec.00330-07 Copyright 2008, American Society for Microbiology and/or the Listed Authors/Institutions. All Rights Reserved.

More information

Protein complexes identification based on go attributed network embedding

Protein complexes identification based on go attributed network embedding Xu et al. BMC Bioinformatics (2018) 19:535 https://doi.org/10.1186/s12859-018-2555-x RESEARCH ARTICLE Protein complexes identification based on go attributed network embedding Bo Xu 1,2*,KunLi 1, Wei Zheng

More information

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics GENOME Bioinformatics 2 Proteomics protein-gene PROTEOME protein-protein METABOLISM Slide from http://www.nd.edu/~networks/ Citrate Cycle Bio-chemical reactions What is it? Proteomics Reveal protein Protein

More information

Honors Thesis: Supplementary Material. LCC 4700: Undergraduate Thesis Writing

Honors Thesis: Supplementary Material. LCC 4700: Undergraduate Thesis Writing Honors Thesis: Supplementary Material LCC 4700: Undergraduate Thesis Writing The Evolutionary Impact of Functional RNA Secondary Structures within Protein- Coding Regions in Yeast Charles Warden Advisor:

More information

Inferring Transcriptional Regulatory Networks from High-throughput Data

Inferring Transcriptional Regulatory Networks from High-throughput Data Inferring Transcriptional Regulatory Networks from High-throughput Data Lectures 9 Oct 26, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20

More information

Consistent dissection of the protein interaction network by combining global and local metrics

Consistent dissection of the protein interaction network by combining global and local metrics Genome Biology This Provisional PDF corresponds to the article as it appeared upon acceptance. Copyedited and fully formatted PDF and full text (HTML) versions will be made available soon. Consistent dissection

More information

Growth of Yeast, Saccharomyces cerevisiae, under Hypergravity Conditions

Growth of Yeast, Saccharomyces cerevisiae, under Hypergravity Conditions Syracuse University SURFACE Syracuse University Honors Program Capstone Projects Syracuse University Honors Program Capstone Projects Spring 5-1-2011 Growth of Yeast, Saccharomyces cerevisiae, under Hypergravity

More information

The Yeast Nuclear Pore Complex: Composition, Architecture, and Transport Mechanism

The Yeast Nuclear Pore Complex: Composition, Architecture, and Transport Mechanism The Yeast Nuclear Pore Complex: Composition, Architecture, and Transport Mechanism Michael P. Rout,* John D. Aitchison, Adisetyantari Suprapto,* Kelly Hjertaas, Yingming Zhao,* and Brian T. Chait* *The

More information

Bacterial genome chimaerism and the origin of mitochondria

Bacterial genome chimaerism and the origin of mitochondria 49 Bacterial genome chimaerism and the origin of mitochondria Ankur Abhishek, Anish Bavishi, Ashay Bavishi, and Madhusudan Choudhary Abstract: Many studies have sought to determine the origin and evolution

More information

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it?

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it? Proteomics What is it? Reveal protein interactions Protein profiling in a sample Yeast two hybrid screening High throughput 2D PAGE Automatic analysis of 2D Page Yeast two hybrid Use two mating strains

More information

Received 29 May 2006/Accepted 8 June 2006

Received 29 May 2006/Accepted 8 June 2006 EUKARYOTIC CELL, Aug. 2006, p. 1388 1398 Vol. 5, No. 8 1535-9778/06/$08.00 0 doi:10.1128/ec.00154-06 Copyright 2006, American Society for Microbiology. All Rights Reserved. Heterologous Expression Implicates

More information

Clustering Mixed Numerical and Low Quality Categorical Data: Significance Metrics on a Yeast Example

Clustering Mixed Numerical and Low Quality Categorical Data: Significance Metrics on a Yeast Example Clustering Mixed Numerical and Low Quality Categorical Data: Significance Metrics on a Yeast Example Bill Andreopoulos Department of Computer Science, York University, Toronto, Canada, M3J1P3 billa@cs.yorku.ca

More information

networks in molecular biology Wolfgang Huber

networks in molecular biology Wolfgang Huber networks in molecular biology Wolfgang Huber networks in molecular biology Regulatory networks: components = gene products interactions = regulation of transcription, translation, phosphorylation... Metabolic

More information

Inferring gene regulatory relationships from gene expression data

Inferring gene regulatory relationships from gene expression data Bachelor thesis Computer Science Radboud University Inferring gene regulatory relationships from gene expression data Author: T.A. (Tom) van Bussel s4221435 First supervisor/assessor: Dr. ir. Tom Claassen

More information

Discovering molecular pathways from protein interaction and ge

Discovering molecular pathways from protein interaction and ge Discovering molecular pathways from protein interaction and gene expression data 9-4-2008 Aim To have a mechanism for inferring pathways from gene expression and protein interaction data. Motivation Why

More information

Genome Evolution Greg Lang, Department of Biological Sciences

Genome Evolution Greg Lang, Department of Biological Sciences Genome Evolution Greg Lang, Department of Biological Sciences BioS 010: Bioscience in the 21st Century Mechanisms of genome evolution Gene Duplication Genome Rearrangement Whole Genome Duplication Gene

More information

Mg 2 Deprivation Elicits Rapid Ca 2 Uptake and Activates Ca 2 /Calcineurin Signaling in Saccharomyces cerevisiae

Mg 2 Deprivation Elicits Rapid Ca 2 Uptake and Activates Ca 2 /Calcineurin Signaling in Saccharomyces cerevisiae EUKARYOTIC CELL, Apr. 2007, p. 592 599 Vol. 6, No. 4 1535-9778/07/$08.00 0 doi:10.1128/ec.00382-06 Copyright 2007, American Society for Microbiology. All Rights Reserved. Mg 2 Deprivation Elicits Rapid

More information

Genome-Scale Gene Function Prediction Using Multiple Sources of High-Throughput Data in Yeast Saccharomyces cerevisiae ABSTRACT

Genome-Scale Gene Function Prediction Using Multiple Sources of High-Throughput Data in Yeast Saccharomyces cerevisiae ABSTRACT OMICS A Journal of Integrative Biology Volume 8, Number 4, 2004 Mary Ann Liebert, Inc. Genome-Scale Gene Function Prediction Using Multiple Sources of High-Throughput Data in Yeast Saccharomyces cerevisiae

More information

ESTIMATION OF GENETIC NETWORKS AND FUNCTIONAL STRUCTURES BETWEEN GENES BY USING BAYESIAN NETWORKS AND NONPARAMETRIC REGRESSION

ESTIMATION OF GENETIC NETWORKS AND FUNCTIONAL STRUCTURES BETWEEN GENES BY USING BAYESIAN NETWORKS AND NONPARAMETRIC REGRESSION ESTIMATION OF GENETIC NETWORKS AND FUNCTIONAL STRUCTURES BETWEEN GENES BY USING BAYESIAN NETWORKS AND NONPARAMETRIC REGRESSION SEIYA IMOTO, TAKAO GOTO and SATORU MIYANO Human Genome Center, Institute of

More information

Dynamic modular architecture of protein-protein interaction networks beyond the dichotomy of date and party hubs

Dynamic modular architecture of protein-protein interaction networks beyond the dichotomy of date and party hubs Dynamic modular architecture of protein-protein interaction networks beyond the dichotomy of date and party hubs Xiao Chang 1,#, Tao Xu 2,#, Yun Li 3, Kai Wang 1,4,5,* 1 Zilkha Neurogenetic Institute,

More information

Mechanism of Metabolic Control: Target of Rapamycin Signaling Links Nitrogen Quality to the Activity of the Rtg1 and Rtg3 Transcription Factors

Mechanism of Metabolic Control: Target of Rapamycin Signaling Links Nitrogen Quality to the Activity of the Rtg1 and Rtg3 Transcription Factors Mechanism of Metabolic Control: Target of Rapamycin Signaling Links Nitrogen Quality to the Activity of the Rtg1 and Rtg3 Transcription Factors Arash Komeili,* Karen P. Wedaman, Erin K. O Shea,* and Ted

More information

TABLE 1: Mutants sensitive to sodium arsenite

TABLE 1: Mutants sensitive to sodium arsenite TABLE 1: Mutants sensitive to sodium arsenite Transcription SRB8 REF2 PGD1 SMI1 SNF6 SWI3 SWI6 PHO2 MSN5 GLO3 PHO4 DNA-directed RNA polymerase II holoenzyme and Srb10 CDK subcomplex subunit RNA 3'-end

More information

Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks

Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks Combiig Microarrays ad Biological Kowledge for Estimatig Gee Networks via Bayesia Networks Seiya Imoto 1, Tomoyuki Higuchi 2, Takao Goto 1, Kousuke Tashiro 3, Satoru Kuhara 3 ad Satoru Miyao 1 1 Huma Geome

More information