Hierarchical modelling of automated imaging data

Size: px
Start display at page:

Download "Hierarchical modelling of automated imaging data"

Transcription

1 Darren Wilkinson School of Mathematics & Statistics, Newcastle University, UK Theory of Big Data 2 UCL, London 6th 8th January, 2016

2 Overview Background: Budding yeast as a model for genetics High-throughput robotic genetic experiments Image analysis and data processing (Stochastic) modelling of growth curves Hierarchical modelling of genetic interaction Joint work with Jonathan Heydari, Keith Newman, Conor Lawless and David Lydall (and others in the Lydall lab )

3 Saccharomyces cerevisiae Budding yeast biology and genetics Synthetic genetic array Telomeres HTP yeast SGA robotic screens Saccharomyces cerevisiae, often known as budding yeast, and sometimes as brewer s yeast or baker s yeast, is a single-celled eukaryotic organism Eukaryotic cells contain a nucleus (and typically other organelles, such as mitochondria) It is useful as a model for higher eukaryotes, having a great deal of biological function conserved with humans It is the most heavily studied and well-characterised model organism in biology (eg. first fully sequenced eukaryote)

4 Synthetic Genetic Array (SGA) Budding yeast biology and genetics Synthetic genetic array Telomeres HTP yeast SGA robotic screens Possible to obtain a library of around 4,500 mutant strains, each of which has one of the non-essential genes silenced through insertion of a (kanmx) antibiotic resistance cassette and tagged with a unique DNA barcode These strains (stored frozen in 96-well plates) can be manipulated by robots in 96-well plates (8 12), or on solid agar in 96, 384 or 1536-spot format Synthetic Genetic Array (SGA) is a clever genetic procedure using robots to systematically introduce an additional mutation into each strain in the library by starting from a specially constructed query strain containing the new mutation

5 Telomeres Yeast Budding yeast biology and genetics Synthetic genetic array Telomeres HTP yeast SGA robotic screens The ends of linear chromosomes require special protection in order not to be targeted by DNA damage repair machinery (bacteria often avoid this problem by having just one chromosome arranged in a single loop) Telomeres are the ends of the chromosomal DNA (which have a special sequence), bound with special telomere-capping proteins that protect the telomeres CDC13 is an essential telomere-capping protein in yeast cdc13-1 is a point-mutation of cdc13, encoding a temperature-sensitive protein which functions similarly to wild-type CDC13 below around 25 C, and leads to telomere-uncapping above this temperature

6 Basic structure of an experiment Budding yeast biology and genetics Synthetic genetic array Telomeres HTP yeast SGA robotic screens 1 Introduce a mutation (such as cdc13-1) into an SGA query strain, and then use SGA technology (and a robot) to cross this strain with the single deletion library in order to obtain a new library of double mutants 2 Inoculate the strains into liquid media, grow up to saturation then spot back on to solid agar 4 times 3 Incubate the 4 different copies at different temperatures (treatments), and image the plates multiple times to see how quickly the different strains are growing 4 Repeat steps 2 and 3 four times (to get some idea of experimental variation) 5 Repeat steps 2 to 4 with a control library that does not include the query mutation

7 Budding yeast biology and genetics Synthetic genetic array Telomeres HTP yeast SGA robotic screens Some numbers relating to an experiment Initial SGA work (introducing mutations into the query and the library) takes around 1 month of calendar time, and several days of robot time The inoculation, spotting and imaging of the 8 repeats takes 1 month of calendar time, and around 2 weeks of robot time The experiment uses around 3,000 of consumables (plastics and media) The library is distributed across well plates or 18 solid agar plates (in 384 format, or 1536 in quadruplicate) If each plate is imaged 30 times, there will be around 35k high-resolution photographs of plates in 384 format, corresponding to around 13 million colony growth measurements (400k time series) This is big data!

8 pipeline pipeline Image processing (from images to colony size measurements) Fitness modelling (from colony size growth curves to strain fitness measures) (from strain fitness measures to identification of genetically interacting strains, ranked by effect size) Possible to carry out three stages separately, but benefits to joint modelling through borrowed strength and proper propagation of uncertainty. Not practical to integrate image processing step into the joint model, but possible to jointly model second two stages.

9 Yeast pipeline Automated image analysis (Colonyzer)

10 pipeline Growth curve A B Normalised cell density (AU) his3δ htz1δ Time since inoculation (h)

11 pipeline We want something between a simple smoothing of the data and a detailed model of yeast cell growth and division Logistic growth models are ideal simple semi-mechanistic models with interpretable parameters related to strain fitness Basic deterministic model: dx dt = rx(1 x/k), subject to initial condition x = P at t = 0 r is the growth rate and K is the carrying capacity Analytic solution: x(t) = KP e rt K + P (e rt 1)

12 Statistical model Yeast pipeline Model observational measurements {Y t1, Y t2,...} with Y ti = x ti + ε ti Can fit to observed data y ti using non-linear least squares or MCMC Can fit all (400k) time courses simultaneously in a large hierarchical model which effectively borrows strength, especially across repeats, but also across genes

13 Growth curve model Yeast pipeline l m n ŷ lmn y lmn Time Point Repeat K lm r lm orf τ K l τ K l ν l K o l r o l σ K,o σ τ,k σ r,o σ τ,r σ ν Population K p τ K,p r p τ r,p P ν p

14 Colony fitness Yeast pipeline The results of model fitting are estimates (or posterior distributions) of r and K for each yeast colony, and also the corresponding gene level parameters Both r and K are indicative of colony fitness keep separate where possible Often useful to have a scalar measure of fitness many possibilities, including rk, r log K, or MDR MDP, where MDR is the maximal doubling rate and MDP is the maximal doubling potential Statistical summaries can be fed in as data to the next level of analysis (or, ultimately, modelled jointly as a giant hierarchical model)

15 Epistasis Yeast pipeline From Wikipedia: Epistasis is the interaction between genes. Epistasis takes place when the action of one gene is modified by one or several other genes, which are sometimes called modifier genes. The gene whose phenotype is expressed is said to be epistatic, while the phenotype altered or suppressed is said to be hypostatic. Epistasis and genetic interaction refer to the same phenomenon; however, epistasis is widely used in population genetics and refers especially to the statistical properties of the phenomenon.

16 Multiplicative model Yeast pipeline Consider two genes with alleles a/a and b/b with a and b representing wild type (note that A and B could potentially represent knock-outs of a and b) Four genotypes: aa, Ab, ab, AB. Use [ ] to denote some quantitative phenotypic measure (eg. fitness ) for each genotype Multiplicative model of genetic independence: [AB] [ab] = [Ab] [ab] no epistasis [AB] [ab] > [Ab] [ab] synergistic epistasis [AB] [ab] < [Ab] [ab] antagonistic epistasis Perhaps simpler if re-written in terms of relative fitness: [AB] [ab] = [Ab] [ab] [ab] [ab]

17 pipeline Genetic independence and HTP data Suppose that we have scaled our data so that it is consistent with a multiplicative model what do we expect to see? The independence model [AB] [ab] = [Ab] [ab] translates to [query : abc ] [wt] = [query] [abc ] In other words [query : abc ] = [query] [wt] [abc ] That is, the double-mutant differs from the single-deletion by a constant multiplicative factor that is independent of the particular single-deletion ie. a scatter-plot of double against single will show them all lying along a straight line

18 Statistical modelling Yeast pipeline Assume that F clm is the fitness measurement for repeat m of gene deletion l in condition c (c = 1 for the single deletion and c = 2 for the corresponding double-mutant) Model: F clm N( ˆF cl, 1/ν cl ) log ˆF cl = α c + Z l + δ l γ cl δ l Bern(p) δ l is a variable selection indicator of genetic interaction Then usual Bayesian hierarchical stuff...

19 Genetic interaction model pipeline c α c l n ˆF clm F clm Repeat σ γ c γ cl ν cl Condition orf Z l δ l Z p σz ν p σ ν Population

20 pipeline Genetic interaction results VPS8 BUD14 KIN3 MUM2 UBC4 CHK1 RIF1 :::MRC1 MRC1 MAK31 PTC6 RPN4 RPL13A YDL109C YDL118W YDL119C RPP1B RPL35B RPL35A ARX1 MTC5 SAN1 RAD9 SWM1 YDR269C IPK1 SAC7 RPL27B RPL37B MAK10 NOP16 TMA20 YER119C A SCS2 FTR1 RAD24 BMH1 BUD27 RPO41 UBP6 RPL2A PIB2 RPL24A DBP3 RPL9A ARO2 TOS3 ARO8 YGL217C UPF3 RPL11B PPT1 PHB2 RPL8A SSF1 NMD2 EST3 YIL055C FYV10 DPH1 MPH1 PET130 TMA22 GEF1 MRT4 ELM1 HAP4 ZRT3 LST4 DPH2 DOA1 MEH1 UTH1 BAS1 RIC1 ERG3 YLR111W SRN2 RPL37A EST1 YLR261C TAL1 SUR4 REH1 VIP1 RIF2 OGG1 VPS9 GTR1 NAM7 ECM5 YMR206W YKU70 YNL011C RPL16B ESBP6 RAD50 MCK1 KRE1 YNR005C STI1 EXO1 SHE4 STD1 CKA2 HIS3 RAD17 HAT1 PNG1 DDC1 YPR044C TKL1 CLB2 VPS4 EDE1 YBL104C PAT1 CYK3 EBS1 RPP2B BST1 LRP1 YIL057C MNI1 FKH1 VPS51 YPT6 RPL6B YML010C B RPL42A MNT4 RTC1 RIM1 DIE2 COX23 CST6 JJJ3 RPL43A LTE1 SLA1 ARA1 UFD2 RVS167 POT1 CCW12 SUB1 YMR057C FET3 NPT1 CLB5 HMT1 YDL012C PHO2 RPL24B YPS6 IRC21 YPL080C ELP3 MNI2 RPL8B CKB2 NUP2 HSP26 GUP1 AIR1 SNX4 YJR154W CDC73 MFT1 QCR9 THP2 RPL16A HPR5 DID2 STM1 PGM2 MRE11 CCC2 UFD4 RPL36A MET18 LDB19 PHO13 OCA6 PPH3 VID28 RAD23 SKI8RPL34A BNR1 YMR193C A PET122 VPS35 POL32 YLR402W RPL29 DBF2 HGH1 ELP2 BUD28 PPQ1 GPH1 AVT5 MDM34 CTP1 BEM2 RMD11 TEL1 CRD1 CRP1 UPS1 LIA1 COX12 ICT1 GUF1 YMR153C A MSS18 SYS1 DPH5 COX7 RPL4A RRD1 UBX4 OAC1 YPL062W NAT1 DPB4 SIM1 HCM1 RHR2 YER093C A GPD2 VPS21 :::RTT103 KAP122 SER2 KEX1 XBP1 BCK1 VPS24 PFK26 VAM10 AYR1 YIL161W TNA1 CBP4 HSP104 VPS60 YGL218W COQ10 YDR271C CLG1 DYN3 ALD6 CAP2 SIW14 NAP1 YNL226W SRB2 YLR218C YLR290C MON1 YPR050C EOS1 OST3 YDL176W PUS7 IMG2 GZF3 FKH2 YKE4 TRP1 VRP1 DCC1 IRA2 TIP41 BUB2 RPL26A SLT2 YLR338W DPB3 FEN1 FCY2 RPS4A RPL43B IKI3 NUP53 PUF6 FUS3 SKI2 NPR1 RPL21B UBX7 YGR259C MCX1 YKR035C ZWF1 PUF4 ARO1 ARP1 NUP133 YJL185C YOR251C VID24 RNR3 YPT7 PTC2 AEP2 PER1 SAS4 YBR266C CBF1 DRS2 YLR143W OST4 GIM3 ALG3 PTC1 ELP4 CKB1 YML013C A DYN1 YBL083C STP1 PSD1 TRM44 ARC18 YDR262W ALG12 RPL40B SAS5 BMH2 MAK3 KTI12 YGL042C JNM1 CAT5 GDH1 YTA7 TEF4 CTF4 YKR074W VPS1 ELG1 NRG2 KNS1 VPS5 GPA2 FPS1 PEX32 VID30 VPS17 YBR277C MCM16 YBL059W PHO80 BEM4 EMI5 SBP1 OTU2 LSM6 RPL23A VPS38 UBP3 GID8 DBR1 RPL17B SWF1 MNE1 YMR074C VHS2 SPT2 SWI4 FIT2 CYT2 JJJ1 CBT1 PAC1 SKI7 MBP1 RPL33B PAN2 YNL120C NPL4 EAF3 YOR309C SFH5 CSN12 IMP2' SHE1 FMP35 YDL050C YBR144C PBP2 YLR184W YGL149W RPL9B INP52 YGL057C NTO1 QCR2 RTF1 DOT1 OMS1 VAM7 CYT1 EMI1 NBP2 VAC14 RAV1 YLR407W RPP1A CAC2 MRPL1 LAC1 HPT1 XRS2 YJL206C KEX2 MDM38 MMM1 ERG24 ACE2 ALD3 CTK1 VAM3 RPE1 OYE2 ERJ5 RVS161 PKR1 ERG6 OCA4 YBR028C GOS1 MRN1 HAP5 MGR2 SAS2 AAH1 CYC2 CHL1 TOM70 RSM25 RBS1 TMA19 SWC5 UBA4 HSE1 ELP6 MTC3 HAP2 UBP2 HSP12 YAP1 SET2 APQ12 APT1 PMS1 BFA1 YUR1 LEU3 VAM6 YJL120W GRS1 HAP3 ASC1 TOP1 DIP5 YNR029C UME1 ALG9 RMD5 SIR3 BTS1 YDR203W TPS3 LDB18 ERD1 ALG6 URM1 PHO87 DSK2 GET1 CKA1 MLH1 YPR098C DEP1 YPR039W GEM1 YDR266C REI1 RNH202 AKL1 MET16 YDR049W YJL211C PEX8 PHB1 MDM10 YLR217W LRG1 YBR025C EFT2 YBR246W SNF1 YDR149C BRE5 PEX15 PAN3 PAH1 FYV1 YHL005C MIR1 VTS1 RPL22A YPL102C IRC3 PUF3 YDR348C TOM7 THR4 PEX2 YAP1801 PEX13 PMP3 MNN11 QCR6 SYT1 YLR091W YPR084W YGL024W BUL1 ALG5 PPZ1 KAR9 HSP82 TCM62 CPS1 CCZ1 AZF1 SYF2 YML090W OCA5 YDR537C CYM1 ATP10 URE2 KRE11 YBL071C NBA1 YPL035C PEP8 RGS2 PTP3 YOR052C BNA2 YBR232C YKL121W NCS2 YBR226C DEG1 CSM3 TOM6 MMS22 SPE2 YMR310C HIR3 RTT109 YBR099C MIS1 ASR1 RTC4 IMP2 RAD27 YPR004C Fitness (F) of double mutants at 27 C (doublings / day) 2 double mutants at 27 C (doublings / day) 2 Fitness (F) of orfδ ura3δ orfδ cdc13-1

21 pipeline Joint modelling of growth curves and genetic interaction We can integrate together the hierarchical growth curve model and the genetic interaction model into a combined joint model This has usual advantages of properly borrowing strength, proper propagation of uncertainty, etc. Also very convenient to avoid requiring a scalar measure of fitness If y clmn is the colony size at time point n in repeat m of gene l in condition c, then y clmn N(ŷ clmn, 1/ν cl ) ŷ clmn = X(t clmn ; K clm, r clm, P ) log K clm N(α c + K o l + δ l γ cl, 1/τ K cl ) log r clm N(β c + r o l + δ lω cl, 1/τ r cl )

22 Joint model Yeast pipeline c l m n ŷclmn y clmn Time Point σ γ c γ cl K clm r clm Repeat α c Condition σ ω c ω cl β c ν cl τ K l τ r l orf K o l δ l r o l σ K,o σ τ,k σ r,o σ τ,r σ ν Population K p r p τ K,p τ r,p P ν p

23 pipeline Joint model results VPS8 LTE1 BUD14 KIN3 SLA1 FUS3 EDE1 YBL059W ALG3 YBL083C RPL23A TEL1 AVT5 YBL104C YBR025C RPL4A HMT1 MUM2 AKL1 HSP26 UBC4 VID24 CCZ1 ARA1 PEX32 RPS9B MCX1 YBR266C UBX7 CHK1 RIF1 YBR277C DPB3 SNF5 CTP1 SGF29 :::MRC1 MRC1 MAK31 RIM1 FEN1 BUD31 HCM1 PAT1 PTC6 PTC1 YDL012C RPN4 NAT1 MBP1 RPP1A RPL13A PHO2 YDL109C CYK3 YDL118W YDL119C RPP1B RPL35B CRD1 YDL176W UFD2 RPL35A OST4 PHO13 TRP1 OCA6 PPH3 VPS41 ARX1 DPB4 ARO1 MTC5 SAN1 SAS4 EBS1 RAD9 SWM1 YDR262W YDR269C CCC2 YDR271C :::RTT103 IPK1 MSN5 RPP2B EFT2 RVS167 SAC7 HPT1 DOT1 STP1 RPL27B VPS60 PUF6 RPL37B EMI1 RAD23 MAK10 NOP16 TMA20 FCY2 RPL34A PTC2 YER093C A SWI4 YER119C A SCS2 FTR1 PET122 BEM2 RAD24 BMH1 BUD27 BST1 STE2 RPO41 UBP6 RPL2A RPL29 PUF4 KAP122 CKB1 PIB2 YGL024W RPL24A YGL042C DBP3 GUP1 MON1 RPL9A ARO2 YGL149W KEM1 TOS3 ARO8 KEX1 VAM7 SKI8 CLG1 YGL217C YGL218W MDM34 UPF3 RPL11B DBF2 PCP1 PPT1 RPL24B QCR9 HGH1 ELP2 DIE2 PHB2 YGR259C TNA1 YTA7 OTU2 OPI1 RMD11 RPL8A SLT2 SSF1 NMD2 LRP1 COX23 ARP1 THP2 EST3 VID28 YKE4 CAP2 CST6 RHR2 YIL055C YIL057C AIR1 FYV10 XBP1 DPH1 PFK26 MNI1 SIM1 AYR1 MET18 FKH1 RPL16A RRD1 IMP2' BNR1 POT1 YIL161W MPH1 YPS6 SYS1 PET130 SNX4 HPR5 BCK1 GSH1 GZF3 YJL120W VPS35 RPL17B YJL185C YJL211C TMA22 GEF1 POL32 CBF1 LIA1 CSN12 GRR1 RPL43B JJJ3 CPA2 RPS4A YJR154W MRT4 UFD4 VPS24 ELM1 TEF4 HAP4 KTI12 OAC1 ZRT3 LST4 DPH2 DOA1 MEH1 VPS51 YKR035C DID2 UTH1 NAP1 NUP133 RPL40B BAS1 KNS1 HSP104 FPS1 RPL8B COX12 RIC1 ERG3 RPL22A BUD28 ICT1 CCW12 YLR111W SRN2 YLR143W STM1 DPH5 RPL37A UPS1 YLR218C EST1 YLR261C YPT6 GUF1 NUP2 VRP1 YLR338W RPL26A TAL1 SUR4 IKI3 REH1 SKI2 YLR402W VIP1 CDC73 RPL6B RIF2 YPT7 ERG6 YML010C B YML013C A YMD8 OGG1 MFT1 VPS9 GTR1 SUB1 CSM3 BUB2 YMR057C FET3 UBX4 IRC21 NAM7 PGM2 PKR1 YMR153C A ALD3 ECM5 YMR193C A RPL36A YMR206W MRE11 COX7 AEP2 YKU70 JNM1 DYN3 YNL011C SIW14 FKH2 RPL16B EOS1 YNL120C ESBP6 RPL42A PSD1 YNL226W ZWF1 RAD50 ERG24 MCK1 KRE1 YNR005C CSE2 MNT4 PHO80 COQ10 GSH2 GPD2 EMI5 IRA2 RTC1 STI1 EXO1 SHE4 CKB2 STD1 CKA2 VAM10 VPS5 OST3 VPS21 CAT5 VPS17 ELG1 HIS3 NPT1 SAS5 RPL33B PUS7 PAC1 YOR309C RPL20B LDB19 RAD17 GDH1 HAT1 TRM44 ALD6 YPL062W BTS1 RPL21B YPL080C ELP3 PNG1 BEM4 PPQ1 MRN1 DDC1 YPL205C HSP82 NTO1 TIP41 RPL43A YPR044C MNI2 MCM16 SPE3 TKL1 CLB2 CLB5 MSS18 GPH1 VPS4 YBR226C PBP2 YDL050C BMH2 SWF1 YDR266C IES5 UBP3 CBP4 CRP1 RNR3 RPE1 SFH5 RAV1 CTK1 VPS1 PEX13 VPS38 ARC18 VAC14 YLR407W PPZ1 CAC2 YMR310C ALG12 MDM12 SKI7 YOR251C MNE1 OMS1 KRE28 TRP5 YGL057C VID30 SBP1 UBA4 YJL206C CYT2 YKR074W LDB18 YLR290C TPS3 PMS1 HAP5 ALG5 QCR2 DRS2 HAP3 YBR144C RMD5 RNH202 SSD1 OCA5 GOS1 VHS2 DBR1 ASC1 NUP53 RPL9B JJJ1 YPL102C RVS161 NCS6 SER2 YMR074C UBP2 EAF3 REI1 PER1 IMG2 OYE2 CBT1 YAP1 YKU80 SAS2 AAH1 ALG9 YDR203W RPL14A BCH2 MDM38 AKR2 YOR052C DCC1 ERD1 GET1 MMM1 YNR029C YPR050C GEM1 NBP2 CPS1 GID8 NPR1 YNR020C CTF4 FMP35 OCA4 PEX8 PEP8 YUR1 RPS1B FIT2 MRPL1 GPA2 MTC3 SET2 YLL029W TOM7 ALG6 MGR2 SLX9 YLR091W YDR049W PMP3 YAP1801 YNL109W VAM3 MIS1 HSE1 CYT1 RTF1 SPT2 ACE2 IMP2 SPE2 AHA1 YDR537C PAN2 CPR7 LEU3 SIN4 LEO1 DYN1 ATP10 BUL1 :::GAL11 HSP12 DSK2 MAK3 SRB2 MOG1 MDM10 RBS1 BNA2 ELP4 PHO87 LRG1 YLR184W PAN3 LRE1 ARC1 PAH1 AZF1 NMD4 ERJ5 YKL121W PEX15 LSM1 NRG2 YDR348C IOC3 HAP2 POC4 VAM6 DCW1 SPE1 YOR131C YBR028C APT1 BRE5 IBD2 PTH1 RSM25 YPR039W GRS1 LHP1 YOR008C A TOP1 KRE11 MET16 PEX2 TOM70 RGA1 VPS30 GIM3 CYC2 VTS1 MDL1PKH3 MLH1 LAC1 RPS24A STE4 PEX14 FYV1 MKS1 KEX2 TMA16 CHL1 PUF3 YDR029W RTC4 TLG2 TMA19 CYM1 BUB3 YBR232C ARL3 HCR1 STB5 MNN11 MID2 RRN10 FKS1 OCA1 MMS2 URE2 YML108W ARP8 APL4 BFA1 LSM6 SFL1 APL1 APC9 SNF1 NCS2 DEG1 TPM1 RTG2 IES2 RHO5 MIR1 TCM62 FMP36 PEX10 SSA1 TGS1 YAL004W UBC13 DIP5 MFB1 MVB12 ICE2 BUD19 PHB1 ROT2 YBR285W SIF2 YML102C A MFA1 PEX6 YPR098C XRS2 TOM6 IRC3 DEP1 SHE1 MGA2 YLR217W LAS21 CUE3 YML053C ISU2 INP52 PTP3 ASH1 QCR6 NIF3 NBA1 YPL041C COX8 ALG8 NFI1 MRPS9 TOS1 IOC4 RPL41B YDR467C RAD27 ARL1 TSA1 YPR097W BUB1 RAX2 FIS1 MMS22 TPS1 GDS1 RSM28 UME1 CKA1 PDE2 RCE1 ASM4 VIK1 YPS7 CWH43 VPS27 MEP3 ABP1 SWC5 STE50 RPS27A ELF1 YPL035C VPS75 MSC6 :::GUK1 YNL198C SKG3 SER1 VPS53 YNL171C PEX12 YNL105W YDR506C KAR9 YPR004C YBR224W RIT1 CHS3 YOR082C RBL2 PHO88 RNH203 VPS29 CTF18 PET494 YBL071C IXR1 PRS5 IRC15 ILV6 APE2 GGA2 YCF1 YML090W YBR246W YLR118C SYC1 YPR096C MSH2 RTG1 MET3 DIA2 OCA2 STB1 YIL166C SPO14 SWA2 YER077C SRL1 YPL105C GIS4 ADE1 YBR134W YDR149C YBR099C OST5 EAF6 LEM3 NPL4 YAF9 GAS1 ASR1 YJL215C YLR282C RPP2A YHL005C SOH1 NHP10 YMR144W RTC6 AHP1 RNH201 YPL225W PSR2 THR4 WWM1 BUD13 URM1 JSN1 KES1 YPR084W MMS1 ELP6 DCR2 YPR148C OSH3 SNC2 SKI3 MPA43 PEX5 JID1 YGR242W PET127 PIN4 COX5B YGL235W YGL046W CWH41 IOC2 AUA1 COX5A YDL041W YKL158W INP53 STE11 ADY4 KIP2 RGS2 BEM3 MLH2 YAL058C A YPR090W YOL079W CHZ1 OPI10 RPS30B YCR051W YLR334C APP1 SCD6 COG7 PRE9 YLR404W PAM17 YSA1 RPA14 PEX4 NRM1 ERF2 YML119W CCS1 REX4 RHO2 RFM1 FYV12 GCV2 AAD3 CTI6 HMX1 PRM4 YOR022C ATG27 ITR1 SKT5 HOS2 SCS7 YIL064W ARF1 SYT1 ARR1 RPS23B YNR004W YHL044W APL3 YBR238C GTR2 MAD1 VTC4 CSR2 KAR3 RTG3 SYF2 SPO11 PUS4 PRX1 MRP49 INP2 YDL071C NDL1 ZRT1 SRF6 YIA6 HUL5 OM45 FMP25 YER078C APS2 COG8 GIS3 MCR1 SLM1 GIM4 TFP1 NUP60 PBY1 SBA1 PMT2 IRC8 YPL183C RTT102 RPL19A YER066W RAD52 SEM1 YNL140C SHE9 ECM22 BSP1 ALB1 YOL153C YGR226C MLS1 KEL1 SRB8 MGR1 MNN2 BUG1 SIR4 ERV14 FSF1 YMC2 PIH1 AGA1 PEX25 YKL077W UBA3 NCE102 VPS13 YNG1 HST2 YHM2 ENT2 THR1 YDR445C PEX30 YGL132W YPT32 BIK1 POM34 ATH1 YLR434C MED1 MET31 ATP18 HPC2 LYS20 RPS14A HOM2 SRF4 SSK1 YER084W IMD4 SSH1 PEX3 YIL102C PCT1 PHO86 BUD21 ATG5 MMR1 double mutants at 27 C (doublings / day) 2 Fitness (F) of double mutants at 27 C (doublings / day) 2 Fitness (F) of orfδ ura3δ orfδ cdc13-1

24 Summary References Big data issues Understanding conflict between model and data in big data contexts does more data demand more complex models? Model simplifications and improvements to MCMC (block updates and proposals, reparameterisations, GVS, etc.) Basic parallelisation strategies (parallel chains, parallellelised single chain) Investigation of data parallel strategies (consensus Monte Carlo, etc.) Novel representations and implementations of Bayesian hierarchical models using functional programming languages

25 Summary References Summary Modern bioscience is generating large, complex data sets which require sophisticated modelling in order to answer questions of scientific interest Big data forces trade-offs between statistical accuracy and computational tractability Stochastic dynamic models are much more flexible than deterministic models, but come at a computational cost the LNA can sometimes represent an excellent compromise Notions of genetic interaction translate directly to statistical models of interaction Big hierarchical variable selection models are useful in genomics, but can be computationally challenging

26 Summary References Funding acknowledgements Major funders of this work: BBSRC MRC Wellcome Trust Cancer Research UK

27 References Yeast Summary References Addinall, S. G., Holstein, E., Lawless, C., Yu, M., Chapman, K., Taschuk, M., Young, A., Ciesiolka, A., Lister, A., Wipat, A., Wilkinson, D. J., Lydall, D. A. (2011) Quantitative fitness analysis shows that NMD proteins and many other protein complexes suppress or enhance distinct telomere cap defects. PLoS Genetics, 7:e Heydari, J. J., Lawless, C., Lydall, D. A., Wilkinson, D. J. (2016) Bayesian hierarchical modelling for inferring genetic interactions in yeast, Journal of the Royal Statistical Society, Series C, early view. Heydari, J. J., Lawless, C., Lydall, D. A., Wilkinson, D. J. (2014) Fast Bayesian parameter estimation for stochastic logistic growth models, BioSystems, 122: Lawless, C., Wilkinson, D. J., Addinall, S. G., Lydall, D. A. (2010) Colonyzer: automated quantification of characteristics of microorganism colonies growing on solid agar, BMC Bioinformatics, 11:287. Wilkinson, D. J. (2009) Stochastic modelling for quantitative description of heterogeneous biological systems, Nature Reviews Genetics. 10(2): Wilkinson, D. J. (2011) Stochastic Modelling for Systems Biology, second edition. Chapman & Hall/CRC Press.

GENETICS. Supporting Information

GENETICS. Supporting Information GENETICS Supporting Information http://www.genetics.org/cgi/content/full/genetics.110.117655/dc1 Trivalent Arsenic Inhibits the Functions of Chaperonin Complex XuewenPan,StefanieReissman,NickR.Douglas,ZhiweiHuang,DanielS.Yuan,

More information

Alterations in DNA Replication and Histone Levels Promote Histone Gene Amplification in Saccharomyces cerevisiae

Alterations in DNA Replication and Histone Levels Promote Histone Gene Amplification in Saccharomyces cerevisiae Supporting Information http://www.genetics.org/cgi/content/full/genetics.109.113662/dc1 Alterations in DNA Replication and Histone Levels Promote Histone Gene Amplification in Saccharomyces cerevisiae

More information

Analysis and visualization of protein-protein interactions. Olga Vitek Assistant Professor Statistics and Computer Science

Analysis and visualization of protein-protein interactions. Olga Vitek Assistant Professor Statistics and Computer Science 1 Analysis and visualization of protein-protein interactions Olga Vitek Assistant Professor Statistics and Computer Science 2 Outline 1. Protein-protein interactions 2. Using graph structures to study

More information

Using graphs to relate expression data and protein-protein interaction data

Using graphs to relate expression data and protein-protein interaction data Using graphs to relate expression data and protein-protein interaction data R. Gentleman and D. Scholtens October 31, 2017 Introduction In Ge et al. (2001) the authors consider an interesting question.

More information

CELL CYCLE RESPONSE STRESS AVGPCC. YER179W DMC1 meiosis-specific protein unclear

CELL CYCLE RESPONSE STRESS AVGPCC. YER179W DMC1 meiosis-specific protein unclear ORFNAME LOCUS DESCRIPTION DATE HUBS: ESSENTIAL k AVGPCC STRESS RESPONSE CELL CYCLE PHEROMONE TREATMENT UNFOLDED PROTEIN RESPONSE YER179W DMC1 meiosis-specific protein 9-0.132-0.228-0.003-0.05 0.138 0.00

More information

networks? networks are spaces generated by interaction

networks? networks are spaces generated by interaction networks? networks are spaces generated by interaction DNA damage response and repair DNA metabolism DNA recombination DNA replication RNA localization and processing autophagy budding carbohydrate metabolism

More information

Cy, Nu. Pre9 Rpn10 Rpn2 Rpn8 141: Pre10 Pre2 Pre3 Pre6 Pup3 Scl1

Cy, Nu. Pre9 Rpn10 Rpn2 Rpn8 141: Pre10 Pre2 Pre3 Pre6 Pup3 Scl1 Supplementary table S2. List of protein es. The first column provides the identifier and the second the name of the, if known. The third column contains the core proteins (i.e. those present in 2/3 rd

More information

Introduction to Microarray Data Analysis and Gene Networks lecture 8. Alvis Brazma European Bioinformatics Institute

Introduction to Microarray Data Analysis and Gene Networks lecture 8. Alvis Brazma European Bioinformatics Institute Introduction to Microarray Data Analysis and Gene Networks lecture 8 Alvis Brazma European Bioinformatics Institute Lecture 8 Gene networks part 2 Network topology (part 2) Network logics Network dynamics

More information

Bayesian inference for stochastic kinetic models. intracellular reaction networks

Bayesian inference for stochastic kinetic models. intracellular reaction networks for stochastic models of intracellular reaction networks Darren Wilkinson School of Mathematics & Statistics and Centre for Integrated Systems Biology of Ageing and Nutrition Newcastle University, UK IMA

More information

Ascospore Formation in the Yeast Saccharomyces cerevisiae

Ascospore Formation in the Yeast Saccharomyces cerevisiae MICROBIOLOGY AND MOLECULAR BIOLOGY REVIEWS, Dec. 2005, p. 565 584 Vol. 69, No. 4 1092-2172/05/$08.00 0 doi:10.1128/mmbr.69.4.565 584.2005 Copyright 2005, American Society for Microbiology. All Rights Reserved.

More information

From protein networks to biological systems

From protein networks to biological systems FEBS 29314 FEBS Letters 579 (2005) 1821 1827 Minireview From protein networks to biological systems Peter Uetz a,1, Russell L. Finley Jr. b, * a Research Center Karlsruhe, Institute of Genetics, P.O. Box

More information

Hotspots and Causal Inference For Yeast Data

Hotspots and Causal Inference For Yeast Data Hotspots and Causal Inference For Yeast Data Elias Chaibub Neto and Brian S Yandell October 24, 2012 Here we reproduce the analysis of the budding yeast genetical genomics data-set presented in Chaibub

More information

Causal Graphical Models in Systems Genetics

Causal Graphical Models in Systems Genetics 1 Causal Graphical Models in Systems Genetics 2013 Network Analysis Short Course - UCLA Human Genetics Elias Chaibub Neto and Brian S Yandell July 17, 2013 Motivation and basic concepts 2 3 Motivation

More information

A Re-annotation of the Saccharomyces cerevisiae Genome

A Re-annotation of the Saccharomyces cerevisiae Genome Comparative and Functional Genomics Comp Funct Genom 2001; 2: 143 154. DOI: 10.1002 / cfg.86 Research Article A Re-annotation of the Saccharomyces cerevisiae Genome V. Wood*, K. M. Rutherford, A Ivens,

More information

Modeling and Visualizing Uncertainty in Gene Expression Clusters using Dirichlet Process Mixtures

Modeling and Visualizing Uncertainty in Gene Expression Clusters using Dirichlet Process Mixtures JOURNAL OF LATEX CLASS FILES, VOL. 1, NO. 8, AUGUST 2002 1 Modeling and Visualizing Uncertainty in Gene Expression Clusters using Dirichlet Process Mixtures Carl Edward Rasmussen, Bernard J de la Cruz,

More information

Supplementary Information

Supplementary Information Supplementary Information For the article "The organization of transcriptional activity in the yeast, S. cerevisiae" by I. J. Farkas, H. Jeong, T. Vicsek, A.-L. Barabási, and Z. N. Oltvai For the Referees

More information

Linking the Signaling Cascades and Dynamic Regulatory Networks Controlling Stress Responses

Linking the Signaling Cascades and Dynamic Regulatory Networks Controlling Stress Responses Linking the Signaling Cascades and Dynamic Regulatory Networks Controlling Stress Responses Anthony Gitter, Miri Carmi, Naama Barkai, and Ziv Bar-Joseph Supplementary Information Supplementary Results

More information

Defining the Budding Yeast Chromatin Associated Interactome

Defining the Budding Yeast Chromatin Associated Interactome Supplementary Data Defining the Budding Yeast Chromatin Associated Interactome Jean-Philippe Lambert 1, Jeffrey Fillingham 2,3, Mojgan Siahbazi 1, Jack Greenblatt 3, Kristin Baetz 1, Daniel Figeys 1 1

More information

The yeast interactome (unit: g303204)

The yeast interactome (unit: g303204) The yeast interactome (unit: g303204) Peter Uetz 1 & Andrei Grigoriev 2 1 Institute of Genetics (ITG), Forschungszentrum Karlsruhe, Karlsruhe, Germany 2 GPC Biotech, Martinsried, Germany Addresses 1 Institute

More information

Identification of Topological Network Modules in Perturbed Protein Interaction Networks. Supplementary Figure S1: Relative abundance of INO80 subunits

Identification of Topological Network Modules in Perturbed Protein Interaction Networks. Supplementary Figure S1: Relative abundance of INO80 subunits Supplementary Information For: Identification of Topological Network Modules in Perturbed Protein Interaction Networks Mihaela E. Sardiu 1#, Joshua M. Gilmore 1#, Brad Groppe 2, Laurence Florens 1, and

More information

Bayesian hierarchical modelling for inferring genetic interactions in yeast

Bayesian hierarchical modelling for inferring genetic interactions in yeast Appl. Statist. (2016) 65, Part 3, pp. 367 393 Bayesian hierarchical modelling for inferring genetic interactions in yeast Jonathan Heydari, Conor Lawless, David A. Lydall and Darren J. Wilkinson Newcastle

More information

Signal recognition YKL122c An01g02800 strong similarity to signal recognition particle 68K protein SRP68 - Canis lupus

Signal recognition YKL122c An01g02800 strong similarity to signal recognition particle 68K protein SRP68 - Canis lupus Supplementary Table 16 Components of the secretory pathway Aspergillus niger A.niger orf A.niger gene Entry into ER Description of putative Aspergillus niger gene Best homolog to putative A.niger gene

More information

Causal Model Selection Hypothesis Tests in Systems Genetics: a tutorial

Causal Model Selection Hypothesis Tests in Systems Genetics: a tutorial Causal Model Selection Hypothesis Tests in Systems Genetics: a tutorial Elias Chaibub Neto and Brian S Yandell July 2, 2012 1 Motivation Current efforts in systems genetics have focused on the development

More information

Screening the yeast deletant mutant collection for hypersensitivity and hyper-resistance to sorbate, a weak organic acid food preservative

Screening the yeast deletant mutant collection for hypersensitivity and hyper-resistance to sorbate, a weak organic acid food preservative Yeast Yeast 2004; 21: 927 946. Published online in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/yea.1141 Yeast Functional Analysis Report Screening the yeast deletant mutant collection

More information

The Repetitive Sequence Database and Mining Putative Regulatory Elements in Gene Promoter Regions ABSTRACT

The Repetitive Sequence Database and Mining Putative Regulatory Elements in Gene Promoter Regions ABSTRACT JOURNAL OF COMPUTATIONAL BIOLOGY Volume 9, Number 4, 2002 Mary Ann Liebert, Inc. Pp. 621 640 The Repetitive Sequence Database and Mining Putative Regulatory Elements in Gene Promoter Regions JORNG-TZONG

More information

Table S1, Yvert et al.

Table S1, Yvert et al. Table S1, Yvert et al. List of 593 clusters of genes showing correlated expression in the cross. For each gene, 90 expression values (BY: 2 independent cultures, RM: 2 independent cultures, 86 segregants:

More information

MRP4 MRPS5. function. Their functional link can be checked by double or triple deletion experiments. YDR036C

MRP4 MRPS5. function. Their functional link can be checked by double or triple deletion experiments. YDR036C Table 5. The list of 202 derived functional modules MRP4 and MRPS5 are structural constituents of mitochondrial small ribosomal subunit involved in protein biosynthesis. Therefore, yet unannotated YDR036C

More information

Phylogenetic classification of transporters and other membrane proteins from Saccharomyces cerevisiae

Phylogenetic classification of transporters and other membrane proteins from Saccharomyces cerevisiae Funct Integr Genomics (2002) 2:154 170 DOI 10.1007/s10142-002-0060-8 REVIEW Benoît De Hertogh Elvira Carvajal Emmanuel Talla Bernard Dujon Philippe Baret André Goffeau Phylogenetic classification of transporters

More information

Supplemental Table S2. List of 166 Transcription Factors Deleted in Mutants Assayed in this Study ORF Gene Description

Supplemental Table S2. List of 166 Transcription Factors Deleted in Mutants Assayed in this Study ORF Gene Description Supplemental Table S2. List of 166 Transcription Factors Deleted in Mutants Assayed in this Study ORF Gene Description YER045C ACA1* a Basic leucine zipper (bzip) transcription factor of the ATF/CREB family,

More information

DISCOVERING PROTEIN COMPLEXES IN DENSE RELIABLE NEIGHBORHOODS OF PROTEIN INTERACTION NETWORKS

DISCOVERING PROTEIN COMPLEXES IN DENSE RELIABLE NEIGHBORHOODS OF PROTEIN INTERACTION NETWORKS 1 DISCOVERING PROTEIN COMPLEXES IN DENSE RELIABLE NEIGHBORHOODS OF PROTEIN INTERACTION NETWORKS Xiao-Li Li Knowledge Discovery Department, Institute for Infocomm Research, Heng Mui Keng Terrace, 119613,

More information

Ross E Curtis 1,2, Seyoung Kim 2, John L Woolford Jr 3, Wenjie Xu 3 and Eric P Xing 4*

Ross E Curtis 1,2, Seyoung Kim 2, John L Woolford Jr 3, Wenjie Xu 3 and Eric P Xing 4* Curtis et al. BMC Genomics 2013, 14:196 RESEARCH ARTICLE Open Access Structured association analysis leads to insight into Saccharomyces cerevisiae gene regulation by finding multiple contributing eqtl

More information

Yeast require an Intact Tryptophan Biosynthesis Pathway and Exogenous Tryptophan for Resistance to Sodium Dodecyl Sulfate

Yeast require an Intact Tryptophan Biosynthesis Pathway and Exogenous Tryptophan for Resistance to Sodium Dodecyl Sulfate Yeast require an Intact Tryptophan Biosynthesis Pathway and Exogenous Tryptophan for Resistance to Sodium Dodecyl Sulfate Laura M. Ammons 1,2, Logan R. Bingham 1,2, Sarah Callery 1,2, Elizabeth Corley

More information

ActiveNetworks Cross-Condition Analysis of Functional Genomic Data

ActiveNetworks Cross-Condition Analysis of Functional Genomic Data ActiveNetworks Cross-Condition Analysis of Functional Genomic Data T. M. Murali April 18, 2006 Motivation: Manual Systems Biology Biologists want to study a favourite stress, e.g., oxidative stress or

More information

Adaptation of Saccharomyces cerevisiae to high hydrostatic pressure causing growth inhibition

Adaptation of Saccharomyces cerevisiae to high hydrostatic pressure causing growth inhibition FEBS 29557 FEBS Letters 579 (2005) 2847 2852 Adaptation of Saccharomyces cerevisiae to high hydrostatic pressure causing growth inhibition Hitoshi Iwahashi a, *, Mine Odani b, Emi Ishidou a, Emiko Kitagawa

More information

Finding molecular complexes through multiple layer clustering of protein interaction networks. Bill Andreopoulos* and Aijun An

Finding molecular complexes through multiple layer clustering of protein interaction networks. Bill Andreopoulos* and Aijun An Int. J. Bioinformatics Research and Applications, Vol. x, No. x, xxxx 1 Finding molecular complexes through multiple layer clustering of protein interaction networks Bill Andreopoulos* and Aijun An Department

More information

The geneticist s questions. Deleting yeast genes. Functional genomics. From Wikipedia, the free encyclopedia

The geneticist s questions. Deleting yeast genes. Functional genomics. From Wikipedia, the free encyclopedia From Wikipedia, the free encyclopedia Functional genomics..is a field of molecular biology that attempts to make use of the vast wealth of data produced by genomic projects (such as genome sequencing projects)

More information

Genome-wide expression screens indicate a global role for protein kinase CK2 in chromatin remodeling

Genome-wide expression screens indicate a global role for protein kinase CK2 in chromatin remodeling JCS epress online publication date 4 March 2003 Research Article 1563 Genome-wide expression screens indicate a global role for protein kinase CK2 in chromatin remodeling Thomas Barz, Karin Ackermann,

More information

(Fold Change) acetate biosynthesis ALD6 acetaldehyde dehydrogenase 3.0 acetyl-coa biosynthesis ACS2 acetyl-coa synthetase 2.5

(Fold Change) acetate biosynthesis ALD6 acetaldehyde dehydrogenase 3.0 acetyl-coa biosynthesis ACS2 acetyl-coa synthetase 2.5 Supplementary Table 1 Overexpression strains that showed resistance or hypersensitivity to rapamycin. Strains that were more than two-fold enriched on average (and at least 1.8-fold enriched in both experiment

More information

Systems biology and biological networks

Systems biology and biological networks Systems Biology Workshop Systems biology and biological networks Center for Biological Sequence Analysis Networks in electronics Radio kindly provided by Lazebnik, Cancer Cell, 2002 Systems Biology Workshop,

More information

Detecting temporal protein complexes from dynamic protein-protein interaction networks

Detecting temporal protein complexes from dynamic protein-protein interaction networks Detecting temporal protein complexes from dynamic protein-protein interaction networks Le Ou-Yang, Dao-Qing Dai, Xiao-Li Li, Min Wu, Xiao-Fei Zhang and Peng Yang 1 Supplementary Table Table S1: Comparative

More information

Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks

Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks Seiya Imoto 1, Tomoyuki Higuchi 2, Takao Goto 1, Kousuke Tashiro 3, Satoru Kuhara 3 and Satoru Miyano 1

More information

Intersection of RNA Processing and Fatty Acid Synthesis and Attachment in Yeast Mitochondria

Intersection of RNA Processing and Fatty Acid Synthesis and Attachment in Yeast Mitochondria Intersection of RNA Processing and Fatty Acid Synthesis and Attachment in Yeast Mitochondria Item Type text; Electronic Dissertation Authors Schonauer, Melissa Publisher The University of Arizona. Rights

More information

Detecting topological patterns in complex networks

Detecting topological patterns in complex networks Detecting topological patterns in complex networks Sergei Maslov Brookhaven National Laboratory Networks in complex systems Complex systems Large number of components They interact with each other All

More information

Using Networks to Integrate Omic and Semantic Data: Towards Understanding Protein Function on a Genome Scale

Using Networks to Integrate Omic and Semantic Data: Towards Understanding Protein Function on a Genome Scale Using Networks to Integrate Omic and Semantic Data: Towards Understanding Protein Function on a Genome Scale Biomarker Data Analysis.10.01, 9:25-9:50 Mark B Gerstein Yale (Comp. Bio. & Bioinformatics)

More information

Comparative Methods for the Analysis of Gene-Expression Evolution: An Example Using Yeast Functional Genomic Data

Comparative Methods for the Analysis of Gene-Expression Evolution: An Example Using Yeast Functional Genomic Data Comparative Methods for the Analysis of Gene-Expression Evolution: An Example Using Yeast Functional Genomic Data Todd H. Oakley,* 1 Zhenglong Gu,* Ehab Abouheif, 3 Nipam H. Patel, 2 and Wen-Hsiung Li*

More information

Spatio-temporal dynamics of yeast mitochondrial biogenesis: transcriptional and post-transcriptional. mrna oscillatory modules.

Spatio-temporal dynamics of yeast mitochondrial biogenesis: transcriptional and post-transcriptional. mrna oscillatory modules. Spatio-temporal dynamics of yeast mitochondrial biogenesis: transcriptional and post-transcriptional mrna oscillatory modules. Gaëlle Lelandais, Yann Saint-Georges, Colette Geneix, Liza Al-Shikhley, Geneviève

More information

Growth of Yeast, Saccharomyces cerevisiae, under Hypergravity Conditions

Growth of Yeast, Saccharomyces cerevisiae, under Hypergravity Conditions Syracuse University SURFACE Syracuse University Honors Program Capstone Projects Syracuse University Honors Program Capstone Projects Spring 5-1-2011 Growth of Yeast, Saccharomyces cerevisiae, under Hypergravity

More information

Large-scale phenotypic analysis reveals identical contributions to cell functions of known and unknown yeast genes

Large-scale phenotypic analysis reveals identical contributions to cell functions of known and unknown yeast genes Yeast Yeast 2001; 18: 1397 1412. DOI: 10.1002 /yea.784 Yeast Functional Analysis Report Large-scale phenotypic analysis reveals identical contributions to cell functions of known and unknown yeast genes

More information

Case story: Analysis of the Cell Cycle

Case story: Analysis of the Cell Cycle DNA microarray analysis, January 2 nd 2006 Case story: Analysis of the Cell Cycle Center for Biological Sequence Analysis Outline Introduction Cell division and cell cycle regulation Experimental studies

More information

Bayesian networks for reconstructing transcriptional regulatory networks

Bayesian networks for reconstructing transcriptional regulatory networks Bayesian networks for reconstructing transcriptional regulatory networks Prof Su-In Lee Computer Science & Genome Sciences University of Washington, Seattle GENOME 541 Introduction to Computational Molecular

More information

Constraint-Based Workshops

Constraint-Based Workshops Constraint-Based Workshops 2. Reconstruction Databases November 29 th, 2007 Defining Metabolic Reactions ydbh hslj ldha 1st level: Primary metabolites LAC 2nd level: Neutral Formulas C 3 H 6 O 3 Charged

More information

Inferring Transcriptional Regulatory Networks from Gene Expression Data II

Inferring Transcriptional Regulatory Networks from Gene Expression Data II Inferring Transcriptional Regulatory Networks from Gene Expression Data II Lectures 9 Oct 26, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday

More information

Genetic basis of mitochondrial function and morphology in Saccharomyces. Kai Stefan Dimmer, Stefan Fritz, Florian Fuchs, Marlies Messerschmitt, Nadja

Genetic basis of mitochondrial function and morphology in Saccharomyces. Kai Stefan Dimmer, Stefan Fritz, Florian Fuchs, Marlies Messerschmitt, Nadja MBC in Press, published on February 4, 2002 as 10.1091/mbc.01-12-0588 Genetic basis of mitochondrial function and morphology in Saccharomyces cerevisiae Kai Stefan Dimmer, Stefan Fritz, Florian Fuchs,

More information

Received 17 January 2005/Accepted 22 February 2005

Received 17 January 2005/Accepted 22 February 2005 EUKARYOTIC CELL, May 2005, p. 849 860 Vol. 4, No. 5 1535-9778/05/$08.00 0 doi:10.1128/ec.4.5.849 860.2005 Copyright 2005, American Society for Microbiology. All Rights Reserved. A Two-Hybrid Screen of

More information

ppendix E - Growth Rates of Individual Genes on Various Non-Fementable Carbon So ORF Gene Lactate Glycerol EtOH Description

ppendix E - Growth Rates of Individual Genes on Various Non-Fementable Carbon So ORF Gene Lactate Glycerol EtOH Description ppendix E - Growth Rates of Individual Genes on Various Non-Fementable Carbon So ORF Gene Lactate Glycerol EtOH Description YAL1C MDM1 2 2 Mitochondrial outer membrane protein involved in mitochondrial

More information

Hierarchical Modularity and the Evolution of Genetic Interactomes across Species

Hierarchical Modularity and the Evolution of Genetic Interactomes across Species Resource Hierarchical Modularity and the Evolution of Genetic Interactomes across Species Colm J. Ryan, 1,4,5,17 Assen Roguev, 1,4,17 Kristin Patrick, 1,2,4 Jiewei Xu, 1,4 Harlizawati Jahari, 1,4,7 Zongtian

More information

Corrections. MEDICAL SCIENCES. For the article Inhibitors of soluble epoxide

Corrections. MEDICAL SCIENCES. For the article Inhibitors of soluble epoxide Corrections MEDICAL SCIENCES. For the article Inhibitors of soluble epoxide hydrolase attenuate vascular smooth muscle cell proliferation, by Benjamin B. Davis, David A. Thompson, Laura L. Howard, Christophe

More information

Rule learning for gene expression data

Rule learning for gene expression data Rule learning for gene expression data Stefan Enroth Original slides by Torgeir R. Hvidsten The Linnaeus Centre for Bioinformatics Predicting biological process from gene expression time profiles Papers:

More information

Genome-wide protein interaction screens reveal functional networks involving Sm-like proteins

Genome-wide protein interaction screens reveal functional networks involving Sm-like proteins Yeast Yeast 2000; 17: 95±110. Research Article Genome-wide protein interaction screens reveal functional networks involving Sm-like proteins Micheline Fromont-Racine 1{, Andrew E. Mayes 2{, Adeline Brunet-Simon

More information

7.06 Problem Set #4, Spring 2005

7.06 Problem Set #4, Spring 2005 7.06 Problem Set #4, Spring 2005 1. You re doing a mutant hunt in S. cerevisiae (budding yeast), looking for temperaturesensitive mutants that are defective in the cell cycle. You discover a mutant strain

More information

Predicting Protein Functions and Domain Interactions from Protein Interactions

Predicting Protein Functions and Domain Interactions from Protein Interactions Predicting Protein Functions and Domain Interactions from Protein Interactions Fengzhu Sun, PhD Center for Computational and Experimental Genomics University of Southern California Outline High-throughput

More information

Matrix-based pattern matching

Matrix-based pattern matching Regulatory sequence analysis Matrix-based pattern matching Jacques van Helden Jacques.van-Helden@univ-amu.fr Aix-Marseille Université, France Technological Advances for Genomics and Clinics (TAGC, INSERM

More information

TABLE 1: Mutants sensitive to sodium arsenite

TABLE 1: Mutants sensitive to sodium arsenite TABLE 1: Mutants sensitive to sodium arsenite Transcription SRB8 REF2 PGD1 SMI1 SNF6 SWI3 SWI6 PHO2 MSN5 GLO3 PHO4 DNA-directed RNA polymerase II holoenzyme and Srb10 CDK subcomplex subunit RNA 3'-end

More information

Inferring gene regulatory relationships from gene expression data

Inferring gene regulatory relationships from gene expression data Bachelor thesis Computer Science Radboud University Inferring gene regulatory relationships from gene expression data Author: T.A. (Tom) van Bussel s4221435 First supervisor/assessor: Dr. ir. Tom Claassen

More information

Electronic Supplementary Material (ESI) for Integrative Biology This journal is The Royal Society of Chemistry Figure S1

Electronic Supplementary Material (ESI) for Integrative Biology This journal is The Royal Society of Chemistry Figure S1 Electronic Supplementary Material (ESI) for Integrative iology Figure S1 Electronic Supplementary Material (ESI) for Integrative iology SMNet specificity in permuted networks SMNet sensitivity in permuted

More information

How many protein-coding genes are there in the Saccharomyces cerevisiae genome?

How many protein-coding genes are there in the Saccharomyces cerevisiae genome? Yeast Yeast 2002; 19: 619 629. Published online in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002 / yea.865 Research Article How many protein-coding genes are there in the Saccharomyces

More information

DNA microarrays simultaneously monitor the expression

DNA microarrays simultaneously monitor the expression Transitive functional annotation by shortest-path analysis of gene expression data Xianghong Zhou*, Ming-Chih J. Kao*, and Wing Hung Wong* *Department of Biostatistics, Harvard School of Public Health,

More information

Bioinformatics. Transcriptome

Bioinformatics. Transcriptome Bioinformatics Transcriptome Jacques.van.Helden@ulb.ac.be Université Libre de Bruxelles, Belgique Laboratoire de Bioinformatique des Génomes et des Réseaux (BiGRe) http://www.bigre.ulb.ac.be/ Bioinformatics

More information

The geneticist s questions

The geneticist s questions The geneticist s questions a) What is consequence of reduced gene function? 1) gene knockout (deletion, RNAi) b) What is the consequence of increased gene function? 2) gene overexpression c) What does

More information

Research Article Predicting Protein Complexes in Weighted Dynamic PPI Networks Based on ICSC

Research Article Predicting Protein Complexes in Weighted Dynamic PPI Networks Based on ICSC Hindawi Complexity Volume 2017, Article ID 4120506, 11 pages https://doi.org/10.1155/2017/4120506 Research Article Predicting Protein Complexes in Weighted Dynamic PPI Networks Based on ICSC Jie Zhao,

More information

Received 29 May 2006/Accepted 8 June 2006

Received 29 May 2006/Accepted 8 June 2006 EUKARYOTIC CELL, Aug. 2006, p. 1388 1398 Vol. 5, No. 8 1535-9778/06/$08.00 0 doi:10.1128/ec.00154-06 Copyright 2006, American Society for Microbiology. All Rights Reserved. Heterologous Expression Implicates

More information

Robust Sparse Estimation of Multiresponse Regression and Inverse Covariance Matrix via the L2 distance

Robust Sparse Estimation of Multiresponse Regression and Inverse Covariance Matrix via the L2 distance Robust Sparse Estimatio of Multirespose Regressio ad Iverse Covariace Matrix via the L2 distace Aurélie C. Lozao IBM Watso Research Ceter Yorktow Heights, New York aclozao@us.ibm.com Huijig Jiag IBM Watso

More information

Genetics 275 Notes Week 7

Genetics 275 Notes Week 7 Cytoplasmic Inheritance Genetics 275 Notes Week 7 Criteriafor recognition of cytoplasmic inheritance: 1. Reciprocal crosses give different results -mainly due to the fact that the female parent contributes

More information

Computational Genomics. Systems biology. Putting it together: Data integration using graphical models

Computational Genomics. Systems biology. Putting it together: Data integration using graphical models 02-710 Computational Genomics Systems biology Putting it together: Data integration using graphical models High throughput data So far in this class we discussed several different types of high throughput

More information

Systematic identification of gene annotation errors in the widely used yeast mutation collections

Systematic identification of gene annotation errors in the widely used yeast mutation collections Systematic identification of gene annotation errors in the widely used yeast mutation collections Taly Ben-Shitrit 1,6, Nir Yosef 2,3,6, Keren Shemesh 1, Roded Sharan 4, Eytan Ruppin 4,5 & Martin Kupiec

More information

Eukaryotic Gene Expression

Eukaryotic Gene Expression Eukaryotic Gene Expression Lectures 22-23 Several Features Distinguish Eukaryotic Processes From Mechanisms in Bacteria 123 Eukaryotic Gene Expression Several Features Distinguish Eukaryotic Processes

More information

Dissection of transcriptional regulation networks and prediction of gene functions in Saccharomyces cerevisiae Boorsma, A.

Dissection of transcriptional regulation networks and prediction of gene functions in Saccharomyces cerevisiae Boorsma, A. UvA-DARE (Digital Academic Repository) Dissection of transcriptional regulation networks and prediction of gene functions in Saccharomyces cerevisiae Boorsma, A. Link to publication Citation for published

More information

Localizing proteins in the cell from their phylogenetic profiles

Localizing proteins in the cell from their phylogenetic profiles Localizing proteins in the cell from their phylogenetic profiles Edward M. Marcotte*, Ioannis Xenarios*, Alexander M. van der Bliek*, and David Eisenberg* *Molecular Biology Institute, University of California

More information

MINIREVIEW. Eric M. Rubenstein and Martin C. Schmidt*

MINIREVIEW. Eric M. Rubenstein and Martin C. Schmidt* EUKARYOTIC CELL, Apr. 2007, p. 571 583 Vol. 6, No. 4 1535-9778/07/$08.00 0 doi:10.1128/ec.00026-07 Copyright 2007, American Society for Microbiology. All Rights Reserved. MINIREVIEW Mechanisms Regulating

More information

Honors Thesis: Supplementary Material. LCC 4700: Undergraduate Thesis Writing

Honors Thesis: Supplementary Material. LCC 4700: Undergraduate Thesis Writing Honors Thesis: Supplementary Material LCC 4700: Undergraduate Thesis Writing The Evolutionary Impact of Functional RNA Secondary Structures within Protein- Coding Regions in Yeast Charles Warden Advisor:

More information

A L A BA M A L A W R E V IE W

A L A BA M A L A W R E V IE W A L A BA M A L A W R E V IE W Volume 52 Fall 2000 Number 1 B E F O R E D I S A B I L I T Y C I V I L R I G HT S : C I V I L W A R P E N S I O N S A N D TH E P O L I T I C S O F D I S A B I L I T Y I N

More information

Prof. Fahd M. Nasr. Lebanese university Faculty of sciences I Department of Natural Sciences.

Prof. Fahd M. Nasr. Lebanese university Faculty of sciences I Department of Natural Sciences. Prof. Fahd M. Nasr Lebanese university Faculty of sciences I Department of Natural Sciences fnasr@ul.edu.lb B3206 Microbial Genetics Eukaryotic M. G. The yeast Saccharomyces cerevisiae as a genetic model

More information

Central postgenomic. Transcription regulation: a genomic network. Transcriptome: the set of all mrnas expressed in a cell at a specific time.

Central postgenomic. Transcription regulation: a genomic network. Transcriptome: the set of all mrnas expressed in a cell at a specific time. Transcription regulation: a genomic network Nicholas Luscombe Laboratory of Mark Gerstein Department of Molecular Biophysics and Biochemistry Yale University Understand Proteins, through analyzing populations

More information

PARAMETER UNCERTAINTY QUANTIFICATION USING SURROGATE MODELS APPLIED TO A SPATIAL MODEL OF YEAST MATING POLARIZATION

PARAMETER UNCERTAINTY QUANTIFICATION USING SURROGATE MODELS APPLIED TO A SPATIAL MODEL OF YEAST MATING POLARIZATION Ching-Shan Chou Department of Mathematics Ohio State University PARAMETER UNCERTAINTY QUANTIFICATION USING SURROGATE MODELS APPLIED TO A SPATIAL MODEL OF YEAST MATING POLARIZATION In systems biology, we

More information

Heterozygous BMN lines

Heterozygous BMN lines Optical density at 80 hours 0.8 0.6 0.4 0.2 0.8 0.6 0.4 0.2 0.8 0.6 0.4 0.2 0.8 0.6 0.4 0.2 a YPD b YPD + 1µM nystatin c YPD + 2µM nystatin d YPD + 4µM nystatin 1 3 5 6 9 13 16 20 21 22 23 25 28 29 30

More information

Basel Abu-Jamous 1, Rui Fa 1, David J Roberts 2,3 and Asoke K Nandi 1,4*

Basel Abu-Jamous 1, Rui Fa 1, David J Roberts 2,3 and Asoke K Nandi 1,4* Abu-Jamous et al. BMC Bioinformatics 2014, 15:322 RESEARCH ARTICLE Open Access Comprehensive analysis of forty yeast microarray datasets reveals a novel subset of genes (APha-RiB) consistently negatively

More information

Predicting Cellular Growth from Gene Expression Signatures

Predicting Cellular Growth from Gene Expression Signatures from Gene Expression Signatures Edoardo M. Airoldi 1,2., Curtis Huttenhower 1,2., David Gresham 1,3, Charles Lu 1,3, Amy A. Caudy 1, Maitreya J. Dunham 4, James R. Broach 3, David Botstein 1,3 *, Olga

More information

Pathway Association Analysis Trey Ideker UCSD

Pathway Association Analysis Trey Ideker UCSD Pathway Association Analysis Trey Ideker UCSD A working network map of the cell Network evolutionary comparison / cross-species alignment to identify conserved modules The Working Map Network-based classification

More information

Bacterial genome chimaerism and the origin of mitochondria

Bacterial genome chimaerism and the origin of mitochondria 49 Bacterial genome chimaerism and the origin of mitochondria Ankur Abhishek, Anish Bavishi, Ashay Bavishi, and Madhusudan Choudhary Abstract: Many studies have sought to determine the origin and evolution

More information

Compositional Correlation for Detecting Real Associations. Among Time Series

Compositional Correlation for Detecting Real Associations. Among Time Series Compositional Correlation for Detecting Real Associations Among Time Series Fatih DIKBAS Civil Engineering Department, Pamukkale University, Turkey Correlation remains to be one of the most widely used

More information

Using ADE Mutants to Study Respiration and Fermentation in Yeast Extensions to Basic Lab

Using ADE Mutants to Study Respiration and Fermentation in Yeast Extensions to Basic Lab Using ADE Mutants to Study Respiration and Fermentation in Yeast Extensions to Basic Lab Introduction: David Form, Nashoba Regional High School And Jessica Forton, Melrose High School Bioinformatics Component

More information

Stochastic simulations

Stochastic simulations Stochastic simulations Application to molecular networks Literature overview Noise in genetic networks Origins How to measure and distinguish between the two types of noise (intrinsic vs extrinsic)? What

More information

Evolving Antibiotic Resistance in Bacteria by Controlling Mutation Rate

Evolving Antibiotic Resistance in Bacteria by Controlling Mutation Rate Evolving Antibiotic Resistance in Bacteria by Controlling Mutation Rate Roman V. Belavkin 1 1 School of Science and Technology Middlesex University, London NW4 4BT, UK December 11, 2015, Tokyo University

More information

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)

Using phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression) Using phylogenetics to estimate species divergence times... More accurately... Basics and basic issues for Bayesian inference of divergence times (plus some digression) "A comparison of the structures

More information

ESTIMATION OF GENETIC NETWORKS AND FUNCTIONAL STRUCTURES BETWEEN GENES BY USING BAYESIAN NETWORKS AND NONPARAMETRIC REGRESSION

ESTIMATION OF GENETIC NETWORKS AND FUNCTIONAL STRUCTURES BETWEEN GENES BY USING BAYESIAN NETWORKS AND NONPARAMETRIC REGRESSION ESTIMATION OF GENETIC NETWORKS AND FUNCTIONAL STRUCTURES BETWEEN GENES BY USING BAYESIAN NETWORKS AND NONPARAMETRIC REGRESSION SEIYA IMOTO, TAKAO GOTO and SATORU MIYANO Human Genome Center, Institute of

More information

CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON

CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON PROKARYOTE GENES: E. COLI LAC OPERON CHAPTER 13 CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON Figure 1. Electron micrograph of growing E. coli. Some show the constriction at the location where daughter

More information

Yeast Genome-wide Screens to Ascertain the Genetic Landscape for Barth Syndrome. Christopher R. McMaster, PhD Dalhousie University

Yeast Genome-wide Screens to Ascertain the Genetic Landscape for Barth Syndrome. Christopher R. McMaster, PhD Dalhousie University Yeast Genome-wide Screens to Ascertain the Genetic Landscape for Barth Syndrome Christopher R. McMaster, PhD Dalhousie University Using Systematic Genetics to Identify Modifies Genes that Affect Fitness

More information

ACCEPTED. Title: Increased respiration in the sch9 mutant is required for increasing chronological life span but not replicative life span

ACCEPTED. Title: Increased respiration in the sch9 mutant is required for increasing chronological life span but not replicative life span EC Accepts, published online ahead of print on 9 May 2008 Eukaryotic Cell doi:10.1128/ec.00330-07 Copyright 2008, American Society for Microbiology and/or the Listed Authors/Institutions. All Rights Reserved.

More information

A Genome-Wide Screen with Nicotinamide to Identify Sirtuin-Dependent Pathways in Saccharomyces cerevisiae

A Genome-Wide Screen with Nicotinamide to Identify Sirtuin-Dependent Pathways in Saccharomyces cerevisiae MUTANT SCREEN REPORT A Genome-Wide Screen with Nicotinamide to Identify Sirtuin-Dependent Pathways in Saccharomyces cerevisiae John S. Choy,*,1 Bayan Qadri,* Leah Henry,* Kunal Shroff,* Olatomiwa Bifarin,*

More information

The Evolution of Sex Chromosomes through the. Baldwin Effect

The Evolution of Sex Chromosomes through the. Baldwin Effect The Evolution of Sex Chromosomes through the Baldwin Effect Larry Bull Computer Science Research Centre Department of Computer Science & Creative Technologies University of the West of England, Bristol

More information