Inferring Transcriptional Regulatory Networks from Gene Expression Data II

Size: px
Start display at page:

Download "Inferring Transcriptional Regulatory Networks from Gene Expression Data II"

Transcription

1 Inferring Transcriptional Regulatory Networks from Gene Expression Data II Lectures 9 Oct 26, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20 Johnson Hall (JHN) Review: Gene Regulation Expression level of Regulator controls the expression levels of Targets it binds to. Regulator s expression is predictive of Targets expression E Regulator Regulator E Targets Targets AGTCTTAACGTTTGACCGCTAATT Segal et al., Nature Genetics 2003; Lee et al., PNAS

2 Review: Inferring the regulatory networks Input: Gene expression data Output: Bayesian network representing how genes regulate each other at the transcriptional level, and how these interactions govern gene expression levels. Algorithm: Score-based structure learning of Bayesian networks Challenge: Too many possible structures Solutions: Control the complexity of the structure, Module networks Expression data measurement of mrna levels of all genes Arrays: conditions, individuals, etc A e 1 C B e 6 e Q Q 2x10 4 (for human) Transcriptional regulatory network Module 2-3 x x Review: The Module Networks Concept X3 X5 X2 Repressor X4 Activator X3 = X 1 Module 1 Module 4 X4 Module 3 X6 activator expression repressor expression target gene expression induced Activator X3 false Repressor X4 false Tree CPDs regulation program Module 3 genes Module3 Linear CPDs repressed 4 2

3 Outline Motivation Why are we interested in inferring the regulatory network? Algorithms for learning regulatory networks Tree-CPDs with Bayesian score Linear Gaussian CPDs with regularization Evaluation of the method Statistical evaluation Biological interpretation Advanced topics Many models can get similar scores. Which one would you choose? A gene can be involved in multiple modules. Possible to incorporate prior knowledge? 5 Motivations Gene A regulates gene B s expression Important basic biology questions Gene A regulates gene B in condition C Condition C: disease states (cancer/normal, subtypes), phenotypes, species (evolutionary processes), developmental stages, cell types, experimental conditions Example application (C: disease states) Understanding histologic transformation in lymphoma Lymphoma: the most common type of blood cancer in the US. Transformation of Follicular Lymphoma (FL) to Diffuse Large B-Cell Lymphoma (DLBCL) Occurs in 40-60% of patients; Dramatically worse prognosis Goal: Infer the mechanisms that drive transformation. 6 3

4 Predicting Cancer Transformation Network-based approach Different network features transformation mechanism? Early diagnosis of transformation; therapeutic implications? Share as many networks features as possible more robust than inferring two networks separately. Follicular Lymphoma e 1 e 5 e Q N1 patients with FL N2 patients with DLBCL Diffuse Large B-Cell Lymphoma regulatory networks gene expression data Outline Motivation Algorithms for learning regulatory networks Tree-CPDs with Bayesian score Linear Gaussian CPDs with regularization Evaluation of the method Statistical evaluation Biological interpretation Advanced topics Many models can get similar scores. Which one would you choose? A gene can be involved in multiple modules. Possible to incorporate prior knowledge? 8 4

5 false Goal false Heat CMK1 Shock? HAP4 Identify modules (member genes) Discover module regulation program Regulation program genes (X s) experiments Candidate regulators X 1 Module 1 (μ A,σ A ) false X3 X4 X3 X2 Module 2 X4 P(Level) 0 Level Context A P(Level) fals e 3 Level Context B P(Level) Level Context C X5 Module 3 X6 9 Learning Structure learning Find the structure that maximizes Bayesian score log P(S D) (or via regularization) Expectation Maximization (EM) algorithm M-step: Given a partition of the genes into modules, learn the best regulation program (tree CPD) for each module. E-step: Given the inferred regulatory programs, we reassign genes into modules such that the associated regulation program best predicts each gene s behavior. 10 5

6 Learning Regulatory Network Iterative procedure (EM) Cluster genes into modules (E-step) Learn a regulatory program for each module (tree model) (M-step) Maximum increase in the structural score (Bayesian) M1 Candidate regulators M22 KEM1 MKT1 MSK1 DHH1 ECM18 UTH1 GPA1 HAP4 TEC1 ASG7 MFA1 MEC3 HAP1 SGS1 SEC59 RIM15 SAS5 PHO5 PHO2 PHO3 PHM6 SPL2 PHO84 PHO4 GIT1 VTC3 11 From Expression to Regulation (M-step) Combinatorial search over the space of trees Arrays sorted in original order HAP4 PHO5 PHO2 PHO3 PHM6 PHO84 PHO4 HAP4 GIT1 VTC3 Arrays sorted according to expression of HAP4 Segal et al. Nat Genet

7 From Expression to Regulation (M-step) HAP4 SIP4 PHO5 PHO2 PHO3 PHM6 PHO84 PHO4 GIT1 VTC3 SIP4 HAP4 Segal et al. Nat Genet Learning Control Programs u Score: log P(M D) log P(D M,, )P(, ) d d 0 u Score of HAP4 split: log P(M D) HAP4 log P(D HAP4 M,, )P(, ) d d + log P(D HAP4 M,, )P(, ) d d Segal et al. Nat Genet

8 Learning Control Programs u Split as long as the score improves HAP4 HAP4 YGR043C u 0 0 u Score of HAP4 split: log P(M D) log P(D HAP4 M,, )P(, ) d d + log P(D HAP4 M,, )P(, ) d d Score of HAP4/YGR043C split: log P(M D) log P(D HAP4 M,, )P(, ) d d + log P(D HAP4 D YGR043C M,, )P(, ) d d 15 + log P(D HAP4 D YGR043C M,, )P(, ) d d Review Learning Regulatory Network Iterative procedure Cluster genes into modules (E-step) Learn a regulatory program for each module (M-step) KEM1 HAP4 MKT1 false MSK1 DHH1 Maximum increase in Bayesian score UTH1 ECM18 TEC1 GPA1 HAP4 M1 M22 ASG7 MFA1 Candidate regulators MEC3 HAP1 MSK1 false SGS1 SEC59 RIM15 SAS5 PHO5 PHO2 PHO3 PHM6 SPL2 PHO84 PHO4 GIT1 VTC3 16 8

9 Module Networks* Activator Gene1 Gene2 Gene3 Genes in module activator expression Tree regression repressor expression target gene expression induced repressed activator false repressor false Repressor regulation program module genes Learning quickly runs out of statistical power Poor regulator selection lower in the tree Many correct regulators not selected Arbitrary choice among correlated regulators Combinatorial search Multiple local optima * Segal et al., Nature Genetics 2003 Regulation as Linear Regression minimize w (w 1 x 1 + w N x N -E Module ) 2 x 1 x 2 w w 1 2 w N parameters E Module E Targets = w 1 x 1 ++w N x N +ε x N S1011 S321 HAP1 S120 S321 SGS1 SEC59 SAS5 S22 S1 ECM18 ASG7 MEC3 UTH1 GPA1 MFA1 TEC1 RIM15 PHO3 PHO5 PHO84 PHM6 PHO2 PHO4 SPL2 GIT1 VTC3 But we often have very large N and linear regression gives them all nonzero weight! Problem: This objective learns too many regulators 18 9

10 Lasso* (L 1 ) Regression L 1 term minimize w (w 1 x 1 + w N x N -E Module ) 2 + C w i x 1 x 2 x N L 2 L1 parameters w w 1 2 w N E Module Induces sparsity in the solution w (many w i s set to zero) Provably selects right features when many features are irrelevant Convex optimization problem No combinatorial search Unique global optimum Efficient optimization * Tibshirani, 1996 Learning Regulatory Network Cluster genes into modules Learn a regulatory program for each module S1 S22 S1011 S321 S120 S321 ECM18 UTH1 GPA1 TEC1 ASG7 MFA1 MEC3 HAP1 SGS1 SEC59 RIM15 SAS5 PHO3 PHO5 PHO84 L PHM6 1 regression minimize PHO2 PHO4 w (Σw i x SPL2 i -E Targets )2+ C w i GIT1 VTC3 Lee et al., PLoS Genet

11 Learning the regulatory network Multiple regression tasks S22 S1 S120 S1011 ECM18 ASG7 MEC3 S321 UTH1 GPA1 S321 MFA1 TEC1 HAP1 SGS1 RIM15 SEC59 SAS5 Module M PHO3 PHO5 PHO84 PHM6 PHO2 PHO4 SPL2 GIT1 VTC3 Module 1 minimize w1 (Σ w 1n x n E module1 ) 2 + C w 1n : minimize wn (Σ w Mn x n E modulem ) 2 + C w Mn Module 1 x 1 x 2 E module 1 x N w w =0 w 1N Module M x 1 x 2 : E module M x N w w M1 =0 M2 w MN =0 Outline Motivation Algorithms for learning regulatory networks Evaluation of the method Statistical evaluation Biological interpretation Advanced topics Many models can get similar scores. Which one would you choose? A gene can be involved in multiple modules. Possible to incorporate prior knowledge? 22 11

12 Statistical Evaluation Cross-validation test Divide the data (experiments) into training and test data Compute the likelihood function for the Test data experiments Test likelihood How well the network fits to the test data? Test data genes? Regulatory network 23 Module Evaluation Criteria Are the module genes functionally coherent? Do the regulators have regulatory roles in the predicted conditions C (see slide 6)? Are the genes in the module known targets of the predicted regulators? Are the regulators consistent with the cisregulatory motifs (TF binding sites) found in promoters of the module genes? 24 12

13 Functional Coherence genes m Modules k N n Module 1 Cholesterol synthesis How significant is the overlap? Known functional categories Gene ontology (GO) Predicted targets of regulators Sharing TF binding sites : Calculate P(# overlap k K, n, N; two groups are independent) based on the hypergeometric distribution 25 Module Functional Coherence Coherence (%) Modules >60% Coherent 41 Modules >40% Coherent u Metabolic: AA, respiration, glycolysis, galactose u Stress: Oxidative stress, osmotic stress u Cellular localization: Nucleas, ER u Cellular processes: Cell cycle, sporulation, mating u Molecular functions: Protein folding, RNA & DNA processing, trafficking 26 13

14 Respiration Module u HAP4 known to up regulate Oxid. Phos. 27 Respiration Module u HAP4 known to up regulate Oxid. Phos. u HAP4, MSN4, XBP1 known to be regulators under predicted conditions 28 14

15 Respiration Module u HAP4 known to up regulate Oxid. Phos. u HAP4, MSN4, XBP1 known to be regulators under predicted conditions u HAP4 Binding site found in 39/55 genes 29 Respiration Module u HAP4 known to up regulate Oxid. Phos. u HAP4, MSN4, XBP1 known to be regulators under predicted conditions u HAP4 Binding site found in 39/55 genes u MSN4 Binding site found in 28/55 genes 30 15

16 Outline Motivation Algorithms for learning regulatory networks Evaluation of the method Advanced topics Many models can get similar scores. Which one would you choose? A gene can be involved in multiple modules. Possible to incorporate prior knowledge? 31 Structural learning via boostrapping Many networks that achieve similar scores Which one would you choose? Estimate the robustness of each network or each edge. How?? Learn the networks from multiple datasets. Inferring sub-networks from perturbed expression profiles, Pe er et al. Bioinformatics

17 Bootstrapping Sampling with replacement genes experiments Original data Bootstrap data 1 data 2 data N Inferring sub-networks from perturbed expression profiles, Pe er et al. Bioinformatics Bootstrapping Sampling with replacement Estimated confidence of each edge i # networks that contain the edge total # networks (N) Bootstrap data 1 data 2 data N Inferring sub-networks from perturbed expression profiles, Pe er et al. Bioinformatics

18 Overlapping Processes The living cell is a complex system Example, the cell cycle Cell cycle: the series of events that take place in a cell leading to its division and duplication. Genes functionally relevant to cell cycle regulation in the specific cell cycle phase Genes participate in multiple processes Partial figure from Maaser and Borlak Oct Mutually exclusive clustering as a common approach to analyzing gene expression (+) genes likely to share a common function (-) group genes into mutually exclusive clusters (-) no info about genes relation to one another 35 Decomposition of Processes Model an expression level of a gene as a mixture of regulatory modules. Hard EM vs soft EM HAP4 false MSK1 false DHH1 w 1 false PUF3 X w 2 : w N HAP1 false HAP5 false false Probabilistic discovery of overlapping cellular processes and their regulation Battle et al. Journal of Computational Biology

Inferring Transcriptional Regulatory Networks from High-throughput Data

Inferring Transcriptional Regulatory Networks from High-throughput Data Inferring Transcriptional Regulatory Networks from High-throughput Data Lectures 9 Oct 26, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20

More information

Bayesian networks for reconstructing transcriptional regulatory networks

Bayesian networks for reconstructing transcriptional regulatory networks Bayesian networks for reconstructing transcriptional regulatory networks Prof Su-In Lee Computer Science & Genome Sciences University of Washington, Seattle GENOME 541 Introduction to Computational Molecular

More information

Inferring Protein-Signaling Networks II

Inferring Protein-Signaling Networks II Inferring Protein-Signaling Networks II Lectures 15 Nov 16, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20 Johnson Hall (JHN) 022

More information

Inferring Protein-Signaling Networks

Inferring Protein-Signaling Networks Inferring Protein-Signaling Networks Lectures 14 Nov 14, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20 Johnson Hall (JHN) 022 1

More information

Introduction to Bioinformatics

Introduction to Bioinformatics CSCI8980: Applied Machine Learning in Computational Biology Introduction to Bioinformatics Rui Kuang Department of Computer Science and Engineering University of Minnesota kuang@cs.umn.edu History of Bioinformatics

More information

Computational Genomics. Systems biology. Putting it together: Data integration using graphical models

Computational Genomics. Systems biology. Putting it together: Data integration using graphical models 02-710 Computational Genomics Systems biology Putting it together: Data integration using graphical models High throughput data So far in this class we discussed several different types of high throughput

More information

Learning gene regulatory networks Statistical methods for haplotype inference Part I

Learning gene regulatory networks Statistical methods for haplotype inference Part I Learning gene regulatory networks Statistical methods for haplotype inference Part I Input: Measurement of mrn levels of all genes from microarray or rna sequencing Samples (e.g. 200 patients with lung

More information

Rule learning for gene expression data

Rule learning for gene expression data Rule learning for gene expression data Stefan Enroth Original slides by Torgeir R. Hvidsten The Linnaeus Centre for Bioinformatics Predicting biological process from gene expression time profiles Papers:

More information

Computational Genomics. Reconstructing dynamic regulatory networks in multiple species

Computational Genomics. Reconstructing dynamic regulatory networks in multiple species 02-710 Computational Genomics Reconstructing dynamic regulatory networks in multiple species Methods for reconstructing networks in cells CRH1 SLT2 SLR3 YPS3 YPS1 Amit et al Science 2009 Pe er et al Recomb

More information

Predicting Protein Functions and Domain Interactions from Protein Interactions

Predicting Protein Functions and Domain Interactions from Protein Interactions Predicting Protein Functions and Domain Interactions from Protein Interactions Fengzhu Sun, PhD Center for Computational and Experimental Genomics University of Southern California Outline High-throughput

More information

Gene Regulatory Networks II Computa.onal Genomics Seyoung Kim

Gene Regulatory Networks II Computa.onal Genomics Seyoung Kim Gene Regulatory Networks II 02-710 Computa.onal Genomics Seyoung Kim Goal: Discover Structure and Func;on of Complex systems in the Cell Identify the different regulators and their target genes that are

More information

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008

6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008 MIT OpenCourseWare http://ocw.mit.edu 6.047 / 6.878 Computational Biology: Genomes, Networks, Evolution Fall 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Computational methods for predicting protein-protein interactions

Computational methods for predicting protein-protein interactions Computational methods for predicting protein-protein interactions Tomi Peltola T-61.6070 Special course in bioinformatics I 3.4.2008 Outline Biological background Protein-protein interactions Computational

More information

Sparse Linear Models (10/7/13)

Sparse Linear Models (10/7/13) STA56: Probabilistic machine learning Sparse Linear Models (0/7/) Lecturer: Barbara Engelhardt Scribes: Jiaji Huang, Xin Jiang, Albert Oh Sparsity Sparsity has been a hot topic in statistics and machine

More information

Chapter 15 Active Reading Guide Regulation of Gene Expression

Chapter 15 Active Reading Guide Regulation of Gene Expression Name: AP Biology Mr. Croft Chapter 15 Active Reading Guide Regulation of Gene Expression The overview for Chapter 15 introduces the idea that while all cells of an organism have all genes in the genome,

More information

REVIEW SESSION. Wednesday, September 15 5:30 PM SHANTZ 242 E

REVIEW SESSION. Wednesday, September 15 5:30 PM SHANTZ 242 E REVIEW SESSION Wednesday, September 15 5:30 PM SHANTZ 242 E Gene Regulation Gene Regulation Gene expression can be turned on, turned off, turned up or turned down! For example, as test time approaches,

More information

Discovering molecular pathways from protein interaction and ge

Discovering molecular pathways from protein interaction and ge Discovering molecular pathways from protein interaction and gene expression data 9-4-2008 Aim To have a mechanism for inferring pathways from gene expression and protein interaction data. Motivation Why

More information

Written Exam 15 December Course name: Introduction to Systems Biology Course no

Written Exam 15 December Course name: Introduction to Systems Biology Course no Technical University of Denmark Written Exam 15 December 2008 Course name: Introduction to Systems Biology Course no. 27041 Aids allowed: Open book exam Provide your answers and calculations on separate

More information

Self Similar (Scale Free, Power Law) Networks (I)

Self Similar (Scale Free, Power Law) Networks (I) Self Similar (Scale Free, Power Law) Networks (I) E6083: lecture 4 Prof. Predrag R. Jelenković Dept. of Electrical Engineering Columbia University, NY 10027, USA {predrag}@ee.columbia.edu February 7, 2007

More information

Latent Variable models for GWAs

Latent Variable models for GWAs Latent Variable models for GWAs Oliver Stegle Machine Learning and Computational Biology Research Group Max-Planck-Institutes Tübingen, Germany September 2011 O. Stegle Latent variable models for GWAs

More information

10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison

10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison 10-810: Advanced Algorithms and Models for Computational Biology microrna and Whole Genome Comparison Central Dogma: 90s Transcription factors DNA transcription mrna translation Proteins Central Dogma:

More information

GLOBEX Bioinformatics (Summer 2015) Genetic networks and gene expression data

GLOBEX Bioinformatics (Summer 2015) Genetic networks and gene expression data GLOBEX Bioinformatics (Summer 2015) Genetic networks and gene expression data 1 Gene Networks Definition: A gene network is a set of molecular components, such as genes and proteins, and interactions between

More information

Computational Network Biology Biostatistics & Medical Informatics 826 Fall 2018

Computational Network Biology Biostatistics & Medical Informatics 826 Fall 2018 Computational Network Biology Biostatistics & Medical Informatics 826 Fall 2018 Sushmita Roy sroy@biostat.wisc.edu https://compnetbiocourse.discovery.wisc.edu Sep 6 th 2018 Goals for today Administrivia

More information

Introduction to clustering methods for gene expression data analysis

Introduction to clustering methods for gene expression data analysis Introduction to clustering methods for gene expression data analysis Giorgio Valentini e-mail: valentini@dsi.unimi.it Outline Levels of analysis of DNA microarray data Clustering methods for functional

More information

GENE REGULATION AND PROBLEMS OF DEVELOPMENT

GENE REGULATION AND PROBLEMS OF DEVELOPMENT GENE REGULATION AND PROBLEMS OF DEVELOPMENT By Surinder Kaur DIET Ropar Surinder_1998@ yahoo.in Mob No 9988530775 GENE REGULATION Gene is a segment of DNA that codes for a unit of function (polypeptide,

More information

56:198:582 Biological Networks Lecture 10

56:198:582 Biological Networks Lecture 10 56:198:582 Biological Networks Lecture 10 Temporal Programs and the Global Structure The single-input module (SIM) network motif The network motifs we have studied so far all had a defined number of nodes.

More information

Lecture 8: Temporal programs and the global structure of transcription networks. Chap 5 of Alon. 5.1 Introduction

Lecture 8: Temporal programs and the global structure of transcription networks. Chap 5 of Alon. 5.1 Introduction Lecture 8: Temporal programs and the global structure of transcription networks Chap 5 of Alon 5. Introduction We will see in this chapter that sensory transcription networks are largely made of just four

More information

Computational Biology: Basics & Interesting Problems

Computational Biology: Basics & Interesting Problems Computational Biology: Basics & Interesting Problems Summary Sources of information Biological concepts: structure & terminology Sequencing Gene finding Protein structure prediction Sources of information

More information

BME 5742 Biosystems Modeling and Control

BME 5742 Biosystems Modeling and Control BME 5742 Biosystems Modeling and Control Lecture 24 Unregulated Gene Expression Model Dr. Zvi Roth (FAU) 1 The genetic material inside a cell, encoded in its DNA, governs the response of a cell to various

More information

Understanding Science Through the Lens of Computation. Richard M. Karp Nov. 3, 2007

Understanding Science Through the Lens of Computation. Richard M. Karp Nov. 3, 2007 Understanding Science Through the Lens of Computation Richard M. Karp Nov. 3, 2007 The Computational Lens Exposes the computational nature of natural processes and provides a language for their description.

More information

3.B.1 Gene Regulation. Gene regulation results in differential gene expression, leading to cell specialization.

3.B.1 Gene Regulation. Gene regulation results in differential gene expression, leading to cell specialization. 3.B.1 Gene Regulation Gene regulation results in differential gene expression, leading to cell specialization. We will focus on gene regulation in prokaryotes first. Gene regulation accounts for some of

More information

Mixture models for analysing transcriptome and ChIP-chip data

Mixture models for analysing transcriptome and ChIP-chip data Mixture models for analysing transcriptome and ChIP-chip data Marie-Laure Martin-Magniette French National Institute for agricultural research (INRA) Unit of Applied Mathematics and Informatics at AgroParisTech,

More information

Gene Ontology and Functional Enrichment. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein

Gene Ontology and Functional Enrichment. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein Gene Ontology and Functional Enrichment Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein The parsimony principle: A quick review Find the tree that requires the fewest

More information

CSEP 590A Summer Lecture 4 MLE, EM, RE, Expression

CSEP 590A Summer Lecture 4 MLE, EM, RE, Expression CSEP 590A Summer 2006 Lecture 4 MLE, EM, RE, Expression 1 FYI, re HW #2: Hemoglobin History Alberts et al., 3rd ed.,pg389 2 Tonight MLE: Maximum Likelihood Estimators EM: the Expectation Maximization Algorithm

More information

CSEP 590A Summer Tonight MLE. FYI, re HW #2: Hemoglobin History. Lecture 4 MLE, EM, RE, Expression. Maximum Likelihood Estimators

CSEP 590A Summer Tonight MLE. FYI, re HW #2: Hemoglobin History. Lecture 4 MLE, EM, RE, Expression. Maximum Likelihood Estimators CSEP 59A Summer 26 Lecture 4 MLE, EM, RE, Expression FYI, re HW #2: Hemoglobin History 1 Alberts et al., 3rd ed.,pg389 2 Tonight MLE: Maximum Likelihood Estimators EM: the Expectation Maximization Algorithm

More information

Computational Systems Biology

Computational Systems Biology Computational Systems Biology Vasant Honavar Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Graduate Program Center for Computational Intelligence, Learning, & Discovery

More information

Introduction to clustering methods for gene expression data analysis

Introduction to clustering methods for gene expression data analysis Introduction to clustering methods for gene expression data analysis Giorgio Valentini e-mail: valentini@dsi.unimi.it Outline Levels of analysis of DNA microarray data Clustering methods for functional

More information

Learning in Bayesian Networks

Learning in Bayesian Networks Learning in Bayesian Networks Florian Markowetz Max-Planck-Institute for Molecular Genetics Computational Molecular Biology Berlin Berlin: 20.06.2002 1 Overview 1. Bayesian Networks Stochastic Networks

More information

Evidence for dynamically organized modularity in the yeast protein-protein interaction network

Evidence for dynamically organized modularity in the yeast protein-protein interaction network Evidence for dynamically organized modularity in the yeast protein-protein interaction network Sari Bombino Helsinki 27.3.2007 UNIVERSITY OF HELSINKI Department of Computer Science Seminar on Computational

More information

Gene regulation II Biochemistry 302. February 27, 2006

Gene regulation II Biochemistry 302. February 27, 2006 Gene regulation II Biochemistry 302 February 27, 2006 Molecular basis of inhibition of RNAP by Lac repressor 35 promoter site 10 promoter site CRP/DNA complex 60 Lewis, M. et al. (1996) Science 271:1247

More information

Computational Biology Course Descriptions 12-14

Computational Biology Course Descriptions 12-14 Computational Biology Course Descriptions 12-14 Course Number and Title INTRODUCTORY COURSES BIO 311C: Introductory Biology I BIO 311D: Introductory Biology II BIO 325: Genetics CH 301: Principles of Chemistry

More information

Unit 3: Control and regulation Higher Biology

Unit 3: Control and regulation Higher Biology Unit 3: Control and regulation Higher Biology To study the roles that genes play in the control of growth and development of organisms To be able to Give some examples of features which are controlled

More information

Lecture 4: Transcription networks basic concepts

Lecture 4: Transcription networks basic concepts Lecture 4: Transcription networks basic concepts - Activators and repressors - Input functions; Logic input functions; Multidimensional input functions - Dynamics and response time 2.1 Introduction The

More information

Bioinformatics. Transcriptome

Bioinformatics. Transcriptome Bioinformatics Transcriptome Jacques.van.Helden@ulb.ac.be Université Libre de Bruxelles, Belgique Laboratoire de Bioinformatique des Génomes et des Réseaux (BiGRe) http://www.bigre.ulb.ac.be/ Bioinformatics

More information

Lecture 18 June 2 nd, Gene Expression Regulation Mutations

Lecture 18 June 2 nd, Gene Expression Regulation Mutations Lecture 18 June 2 nd, 2016 Gene Expression Regulation Mutations From Gene to Protein Central Dogma Replication DNA RNA PROTEIN Transcription Translation RNA Viruses: genome is RNA Reverse Transcriptase

More information

Complete all warm up questions Focus on operon functioning we will be creating operon models on Monday

Complete all warm up questions Focus on operon functioning we will be creating operon models on Monday Complete all warm up questions Focus on operon functioning we will be creating operon models on Monday 1. What is the Central Dogma? 2. How does prokaryotic DNA compare to eukaryotic DNA? 3. How is DNA

More information

Network Biology-part II

Network Biology-part II Network Biology-part II Jun Zhu, Ph. D. Professor of Genomics and Genetic Sciences Icahn Institute of Genomics and Multi-scale Biology The Tisch Cancer Institute Icahn Medical School at Mount Sinai New

More information

A New Method to Build Gene Regulation Network Based on Fuzzy Hierarchical Clustering Methods

A New Method to Build Gene Regulation Network Based on Fuzzy Hierarchical Clustering Methods International Academic Institute for Science and Technology International Academic Journal of Science and Engineering Vol. 3, No. 6, 2016, pp. 169-176. ISSN 2454-3896 International Academic Journal of

More information

STRUCTURAL BIOINFORMATICS I. Fall 2015

STRUCTURAL BIOINFORMATICS I. Fall 2015 STRUCTURAL BIOINFORMATICS I Fall 2015 Info Course Number - Classification: Biology 5411 Class Schedule: Monday 5:30-7:50 PM, SERC Room 456 (4 th floor) Instructors: Vincenzo Carnevale - SERC, Room 704C;

More information

Prokaryotic Gene Expression (Learning Objectives)

Prokaryotic Gene Expression (Learning Objectives) Prokaryotic Gene Expression (Learning Objectives) 1. Learn how bacteria respond to changes of metabolites in their environment: short-term and longer-term. 2. Compare and contrast transcriptional control

More information

Co-ordination occurs in multiple layers Intracellular regulation: self-regulation Intercellular regulation: coordinated cell signalling e.g.

Co-ordination occurs in multiple layers Intracellular regulation: self-regulation Intercellular regulation: coordinated cell signalling e.g. Gene Expression- Overview Differentiating cells Achieved through changes in gene expression All cells contain the same whole genome A typical differentiated cell only expresses ~50% of its total gene Overview

More information

Regulation of Gene Expression

Regulation of Gene Expression Chapter 18 Regulation of Gene Expression PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from

More information

Updated: 10/11/2018 Page 1 of 5

Updated: 10/11/2018 Page 1 of 5 A. Academic Division: Health Sciences B. Discipline: Biology C. Course Number and Title: BIOL1230 Biology I MASTER SYLLABUS 2018-2019 D. Course Coordinator: Justin Tickhill Assistant Dean: Melinda Roepke,

More information

Discovering modules in expression profiles using a network

Discovering modules in expression profiles using a network Discovering modules in expression profiles using a network Igor Ulitsky 1 2 Protein-protein interactions (PPIs) Low throughput measurements: accurate, scarce High throughput: more abundant, noisy Large,

More information

Comparative Network Analysis

Comparative Network Analysis Comparative Network Analysis BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2016 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under CC BY-NC 4.0 by

More information

Gene regulation I Biochemistry 302. Bob Kelm February 25, 2005

Gene regulation I Biochemistry 302. Bob Kelm February 25, 2005 Gene regulation I Biochemistry 302 Bob Kelm February 25, 2005 Principles of gene regulation (cellular versus molecular level) Extracellular signals Chemical (e.g. hormones, growth factors) Environmental

More information

Geert Geeven. April 14, 2010

Geert Geeven. April 14, 2010 iction of Gene Regulatory Interactions NDNS+ Workshop April 14, 2010 Today s talk - Outline Outline Biological Background Construction of Predictors The main aim of my project is to better understand the

More information

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis Probabilistic Latent Semantic Analysis Yuriy Sverchkov Intelligent Systems Program University of Pittsburgh October 6, 2011 Outline Latent Semantic Analysis (LSA) A quick review Probabilistic LSA (plsa)

More information

Network motifs in the transcriptional regulation network (of Escherichia coli):

Network motifs in the transcriptional regulation network (of Escherichia coli): Network motifs in the transcriptional regulation network (of Escherichia coli): Janne.Ravantti@Helsinki.Fi (disclaimer: IANASB) Contents: Transcription Networks (aka. The Very Boring Biology Part ) Network

More information

Transcription Regulation and Gene Expression in Eukaryotes FS08 Pharmacenter/Biocenter Auditorium 1 Wednesdays 16h15-18h00.

Transcription Regulation and Gene Expression in Eukaryotes FS08 Pharmacenter/Biocenter Auditorium 1 Wednesdays 16h15-18h00. Transcription Regulation and Gene Expression in Eukaryotes FS08 Pharmacenter/Biocenter Auditorium 1 Wednesdays 16h15-18h00. Promoters and Enhancers Systematic discovery of transcriptional regulatory motifs

More information

L3.1: Circuits: Introduction to Transcription Networks. Cellular Design Principles Prof. Jenna Rickus

L3.1: Circuits: Introduction to Transcription Networks. Cellular Design Principles Prof. Jenna Rickus L3.1: Circuits: Introduction to Transcription Networks Cellular Design Principles Prof. Jenna Rickus In this lecture Cognitive problem of the Cell Introduce transcription networks Key processing network

More information

A general co-expression network-based approach to gene expression analysis: comparison and applications

A general co-expression network-based approach to gene expression analysis: comparison and applications BMC Systems Biology This Provisional PDF corresponds to the article as it appeared upon acceptance. Fully formatted PDF and full text (HTML) versions will be made available soon. A general co-expression

More information

CS-E5880 Modeling biological networks Gene regulatory networks

CS-E5880 Modeling biological networks Gene regulatory networks CS-E5880 Modeling biological networks Gene regulatory networks Jukka Intosalmi (based on slides by Harri Lähdesmäki) Department of Computer Science Aalto University January 12, 2018 Outline Modeling gene

More information

BMD645. Integration of Omics

BMD645. Integration of Omics BMD645 Integration of Omics Shu-Jen Chen, Chang Gung University Dec. 11, 2009 1 Traditional Biology vs. Systems Biology Traditional biology : Single genes or proteins Systems biology: Simultaneously study

More information

Network Biology: Understanding the cell s functional organization. Albert-László Barabási Zoltán N. Oltvai

Network Biology: Understanding the cell s functional organization. Albert-László Barabási Zoltán N. Oltvai Network Biology: Understanding the cell s functional organization Albert-László Barabási Zoltán N. Oltvai Outline: Evolutionary origin of scale-free networks Motifs, modules and hierarchical networks Network

More information

scrna-seq Differential expression analysis methods Olga Dethlefsen NBIS, National Bioinformatics Infrastructure Sweden October 2017

scrna-seq Differential expression analysis methods Olga Dethlefsen NBIS, National Bioinformatics Infrastructure Sweden October 2017 scrna-seq Differential expression analysis methods Olga Dethlefsen NBIS, National Bioinformatics Infrastructure Sweden October 2017 Olga (NBIS) scrna-seq de October 2017 1 / 34 Outline Introduction: what

More information

Chapter 16 Lecture. Concepts Of Genetics. Tenth Edition. Regulation of Gene Expression in Prokaryotes

Chapter 16 Lecture. Concepts Of Genetics. Tenth Edition. Regulation of Gene Expression in Prokaryotes Chapter 16 Lecture Concepts Of Genetics Tenth Edition Regulation of Gene Expression in Prokaryotes Chapter Contents 16.1 Prokaryotes Regulate Gene Expression in Response to Environmental Conditions 16.2

More information

MODULE -4 BAYEIAN LEARNING

MODULE -4 BAYEIAN LEARNING MODULE -4 BAYEIAN LEARNING CONTENT Introduction Bayes theorem Bayes theorem and concept learning Maximum likelihood and Least Squared Error Hypothesis Maximum likelihood Hypotheses for predicting probabilities

More information

56:198:582 Biological Networks Lecture 8

56:198:582 Biological Networks Lecture 8 56:198:582 Biological Networks Lecture 8 Course organization Two complementary approaches to modeling and understanding biological networks Constraint-based modeling (Palsson) System-wide Metabolism Steady-state

More information

Sample Size Estimation for Studies of High-Dimensional Data

Sample Size Estimation for Studies of High-Dimensional Data Sample Size Estimation for Studies of High-Dimensional Data James J. Chen, Ph.D. National Center for Toxicological Research Food and Drug Administration June 3, 2009 China Medical University Taichung,

More information

Prokaryotic Gene Expression (Learning Objectives)

Prokaryotic Gene Expression (Learning Objectives) Prokaryotic Gene Expression (Learning Objectives) 1. Learn how bacteria respond to changes of metabolites in their environment: short-term and longer-term. 2. Compare and contrast transcriptional control

More information

UNIT 6 PART 3 *REGULATION USING OPERONS* Hillis Textbook, CH 11

UNIT 6 PART 3 *REGULATION USING OPERONS* Hillis Textbook, CH 11 UNIT 6 PART 3 *REGULATION USING OPERONS* Hillis Textbook, CH 11 REVIEW: Signals that Start and Stop Transcription and Translation BUT, HOW DO CELLS CONTROL WHICH GENES ARE EXPRESSED AND WHEN? First of

More information

What is the expectation maximization algorithm?

What is the expectation maximization algorithm? primer 2008 Nature Publishing Group http://www.nature.com/naturebiotechnology What is the expectation maximization algorithm? Chuong B Do & Serafim Batzoglou The expectation maximization algorithm arises

More information

networks in molecular biology Wolfgang Huber

networks in molecular biology Wolfgang Huber networks in molecular biology Wolfgang Huber networks in molecular biology Regulatory networks: components = gene products interactions = regulation of transcription, translation, phosphorylation... Metabolic

More information

Gibbs Sampling Methods for Multiple Sequence Alignment

Gibbs Sampling Methods for Multiple Sequence Alignment Gibbs Sampling Methods for Multiple Sequence Alignment Scott C. Schmidler 1 Jun S. Liu 2 1 Section on Medical Informatics and 2 Department of Statistics Stanford University 11/17/99 1 Outline Statistical

More information

Biology. Biology. Slide 1 of 26. End Show. Copyright Pearson Prentice Hall

Biology. Biology. Slide 1 of 26. End Show. Copyright Pearson Prentice Hall Biology Biology 1 of 26 Fruit fly chromosome 12-5 Gene Regulation Mouse chromosomes Fruit fly embryo Mouse embryo Adult fruit fly Adult mouse 2 of 26 Gene Regulation: An Example Gene Regulation: An Example

More information

Fuzzy Clustering of Gene Expression Data

Fuzzy Clustering of Gene Expression Data Fuzzy Clustering of Gene Data Matthias E. Futschik and Nikola K. Kasabov Department of Information Science, University of Otago P.O. Box 56, Dunedin, New Zealand email: mfutschik@infoscience.otago.ac.nz,

More information

Group lasso for genomic data

Group lasso for genomic data Group lasso for genomic data Jean-Philippe Vert Mines ParisTech / Curie Institute / Inserm Machine learning: Theory and Computation workshop, IMA, Minneapolis, March 26-3, 22 J.P Vert (ParisTech) Group

More information

A Case Study -- Chu et al. The Transcriptional Program of Sporulation in Budding Yeast. What is Sporulation? CSE 527, W.L. Ruzzo 1

A Case Study -- Chu et al. The Transcriptional Program of Sporulation in Budding Yeast. What is Sporulation? CSE 527, W.L. Ruzzo 1 A Case Study -- Chu et al. An interesting early microarray paper My goals Show arrays used in a real experiment Show where computation is important Start looking at analysis techniques The Transcriptional

More information

Variable Selection in Structured High-dimensional Covariate Spaces

Variable Selection in Structured High-dimensional Covariate Spaces Variable Selection in Structured High-dimensional Covariate Spaces Fan Li 1 Nancy Zhang 2 1 Department of Health Care Policy Harvard University 2 Department of Statistics Stanford University May 14 2007

More information

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016 Boolean models of gene regulatory networks Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016 Gene expression Gene expression is a process that takes gene info and creates

More information

Bayesian Networks Inference with Probabilistic Graphical Models

Bayesian Networks Inference with Probabilistic Graphical Models 4190.408 2016-Spring Bayesian Networks Inference with Probabilistic Graphical Models Byoung-Tak Zhang intelligence Lab Seoul National University 4190.408 Artificial (2016-Spring) 1 Machine Learning? Learning

More information

Evolutionary analysis of the well characterized endo16 promoter reveals substantial variation within functional sites

Evolutionary analysis of the well characterized endo16 promoter reveals substantial variation within functional sites Evolutionary analysis of the well characterized endo16 promoter reveals substantial variation within functional sites Paper by: James P. Balhoff and Gregory A. Wray Presentation by: Stephanie Lucas Reviewed

More information

Comparison of Shannon, Renyi and Tsallis Entropy used in Decision Trees

Comparison of Shannon, Renyi and Tsallis Entropy used in Decision Trees Comparison of Shannon, Renyi and Tsallis Entropy used in Decision Trees Tomasz Maszczyk and W lodzis law Duch Department of Informatics, Nicolaus Copernicus University Grudzi adzka 5, 87-100 Toruń, Poland

More information

Basic Synthetic Biology circuits

Basic Synthetic Biology circuits Basic Synthetic Biology circuits Note: these practices were obtained from the Computer Modelling Practicals lecture by Vincent Rouilly and Geoff Baldwin at Imperial College s course of Introduction to

More information

GWAS IV: Bayesian linear (variance component) models

GWAS IV: Bayesian linear (variance component) models GWAS IV: Bayesian linear (variance component) models Dr. Oliver Stegle Christoh Lippert Prof. Dr. Karsten Borgwardt Max-Planck-Institutes Tübingen, Germany Tübingen Summer 2011 Oliver Stegle GWAS IV: Bayesian

More information

CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON

CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON PROKARYOTE GENES: E. COLI LAC OPERON CHAPTER 13 CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON Figure 1. Electron micrograph of growing E. coli. Some show the constriction at the location where daughter

More information

GENES AND CHROMOSOMES III. Lecture 5. Biology Department Concordia University. Dr. S. Azam BIOL 266/

GENES AND CHROMOSOMES III. Lecture 5. Biology Department Concordia University. Dr. S. Azam BIOL 266/ GENES AND CHROMOSOMES III Lecture 5 BIOL 266/4 2014-15 Dr. S. Azam Biology Department Concordia University CELL NUCLEUS AND THE CONTROL OF GENE EXPRESSION OPERONS Introduction All cells in a multi-cellular

More information

Control of Gene Expression in Prokaryotes

Control of Gene Expression in Prokaryotes Why? Control of Expression in Prokaryotes How do prokaryotes use operons to control gene expression? Houses usually have a light source in every room, but it would be a waste of energy to leave every light

More information

Name: SBI 4U. Gene Expression Quiz. Overall Expectation:

Name: SBI 4U. Gene Expression Quiz. Overall Expectation: Gene Expression Quiz Overall Expectation: - Demonstrate an understanding of concepts related to molecular genetics, and how genetic modification is applied in industry and agriculture Specific Expectation(s):

More information

Regulatory Inferece from Gene Expression. CMSC858P Spring 2012 Hector Corrada Bravo

Regulatory Inferece from Gene Expression. CMSC858P Spring 2012 Hector Corrada Bravo Regulatory Inferece from Gene Expression CMSC858P Spring 2012 Hector Corrada Bravo 2 Graphical Model Let y be a vector- valued random variable Suppose some condi8onal independence proper8es hold for some

More information

Topic 4 - #14 The Lactose Operon

Topic 4 - #14 The Lactose Operon Topic 4 - #14 The Lactose Operon The Lactose Operon The lactose operon is an operon which is responsible for the transport and metabolism of the sugar lactose in E. coli. - Lactose is one of many organic

More information

Control of Gene Expression

Control of Gene Expression Control of Gene Expression Mechanisms of Gene Control Gene Control in Eukaryotes Master Genes Gene Control In Prokaryotes Epigenetics Gene Expression The overall process by which information flows from

More information

STAAR Biology Assessment

STAAR Biology Assessment STAAR Biology Assessment Reporting Category 1: Cell Structure and Function The student will demonstrate an understanding of biomolecules as building blocks of cells, and that cells are the basic unit of

More information

SCOTCAT Credits: 20 SCQF Level 7 Semester 1 Academic year: 2018/ am, Practical classes one per week pm Mon, Tue, or Wed

SCOTCAT Credits: 20 SCQF Level 7 Semester 1 Academic year: 2018/ am, Practical classes one per week pm Mon, Tue, or Wed Biology (BL) modules BL1101 Biology 1 SCOTCAT Credits: 20 SCQF Level 7 Semester 1 10.00 am; Practical classes one per week 2.00-5.00 pm Mon, Tue, or Wed This module is an introduction to molecular and

More information

Data Mining Techniques

Data Mining Techniques Data Mining Techniques CS 622 - Section 2 - Spring 27 Pre-final Review Jan-Willem van de Meent Feedback Feedback https://goo.gl/er7eo8 (also posted on Piazza) Also, please fill out your TRACE evaluations!

More information

Bi 1x Spring 2014: LacI Titration

Bi 1x Spring 2014: LacI Titration Bi 1x Spring 2014: LacI Titration 1 Overview In this experiment, you will measure the effect of various mutated LacI repressor ribosome binding sites in an E. coli cell by measuring the expression of a

More information

Identifying Signaling Pathways

Identifying Signaling Pathways These slides, excluding third-party material, are licensed under CC BY-NC 4.0 by Anthony Gitter, Mark Craven, Colin Dewey Identifying Signaling Pathways BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2018

More information

STA 414/2104: Lecture 8

STA 414/2104: Lecture 8 STA 414/2104: Lecture 8 6-7 March 2017: Continuous Latent Variable Models, Neural networks With thanks to Russ Salakhutdinov, Jimmy Ba and others Outline Continuous latent variable models Background PCA

More information

Bioinformatics: Network Analysis

Bioinformatics: Network Analysis Bioinformatics: Network Analysis Kinetics of Gene Regulation COMP 572 (BIOS 572 / BIOE 564) - Fall 2013 Luay Nakhleh, Rice University 1 The simplest model of gene expression involves only two steps: the

More information