CSCI1950 Z Computa3onal Methods for Biology Lecture 24. Ben Raphael April 29, hgp://cs.brown.edu/courses/csci1950 z/ Network Mo3fs

Similar documents
Networks. Can (John) Bruce Keck Founda7on Biotechnology Lab Bioinforma7cs Resource

CSCI1950 Z Computa3onal Methods for Biology* (*Working Title) Lecture 1. Ben Raphael January 21, Course Par3culars

Network motifs in the transcriptional regulation network (of Escherichia coli):

CSCI1950 Z Computa4onal Methods for Biology Lecture 5

Gene Regulatory Networks II Computa.onal Genomics Seyoung Kim

CSCI1950 Z Computa4onal Methods for Biology Lecture 4. Ben Raphael February 2, hhp://cs.brown.edu/courses/csci1950 z/ Algorithm Summary

Priors in Dependency network learning

Self Similar (Scale Free, Power Law) Networks (I)

Random Boolean Networks

Lecture 8: Temporal programs and the global structure of transcription networks. Chap 5 of Alon. 5.1 Introduction

Biological Networks Analysis

Biological Networks. Gavin Conant 163B ASRC

CSCI 360 Introduc/on to Ar/ficial Intelligence Week 2: Problem Solving and Op/miza/on

Gene Regula*on, ChIP- X and DNA Mo*fs. Statistics in Genomics Hongkai Ji

GLOBEX Bioinformatics (Summer 2015) Genetic networks and gene expression data

Comparative Network Analysis

Transcrip:on factor binding mo:fs

Graph structure learning for network inference

Graph Alignment and Biological Networks

Complex (Biological) Networks

Complex (Biological) Networks

Network models: random graphs

Latent Dirichlet Alloca/on

56:198:582 Biological Networks Lecture 10

Graphical Models. Lecture 10: Variable Elimina:on, con:nued. Andrew McCallum

BIOINF 4371 Drug Design 1 Oliver Kohlbacher & Jens Krüger

Lecture 6: The feed-forward loop (FFL) network motif

Bayesian networks Lecture 18. David Sontag New York University

Boolean Gossip Networks

CSCI 1010 Models of Computa3on. Lecture 11 Proving Languages NP-Complete

CISC 636 Computational Biology & Bioinformatics (Fall 2016)

Statistical analysis of biological networks.

ECS 253 / MAE 253, Lecture 15 May 17, I. Probability generating function recap

Networks in systems biology

Modularity and Graph Algorithms

networks in molecular biology Wolfgang Huber

The Trouble with Community Detection

Today s Lecture: HMMs

Sta$s$cal sequence recogni$on

CSCI 360 Introduc/on to Ar/ficial Intelligence Week 2: Problem Solving and Op/miza/on. Instructor: Wei-Min Shen

Introduction to Bioinformatics. Shifra Ben-Dor Irit Orr

Network Alignment 858L

Decision Trees Lecture 12

Cellular automata, entropy and box- coun4ng dimension

Basic modeling approaches for biological systems. Mahesh Bule

Chapter 8: The Topology of Biological Networks. Overview

Machine Learning & Data Mining CS/CNS/EE 155. Lecture 11: Hidden Markov Models

56:198:582 Biological Networks Lecture 8

Principles of Gene Expression

Pseudospectral Methods For Op2mal Control. Jus2n Ruths March 27, 2009

Design and characterization of chemical space networks

Prokaryo'c Operon Model Ac'vity

Founda'ons of Large- Scale Mul'media Informa'on Management and Retrieval. Lecture #4 Similarity. Edward Chang

CISC 889 Bioinformatics (Spring 2004) Hidden Markov Models (II)

Probability and Structure in Natural Language Processing

An Introduction to Exponential-Family Random Graph Models

letter Network motifs in the transcriptional regulation network of Escherichia coli ... Z n X 1 X 2 X 3... X n Z 1 Z 2 Z 3 Z 4...

Probabilistic models of biological sequence motifs

Parameter Es*ma*on: Cracking Incomplete Data

Phylogene)cs. IMBB 2016 BecA- ILRI Hub, Nairobi May 9 20, Joyce Nzioki

56:198:582 Biological Networks Lecture 9

Chapter 4 Dynamic Bayesian Networks Fall Jin Gu, Michael Zhang

Basics on bioinforma-cs Lecture 7. Nunzio D Agostino

Opinion Dynamics on Triad Scale Free Network

Network diffusion-based analysis of high-throughput data for the detection of differentially enriched modules

Mini course on Complex Networks

Graph Detection and Estimation Theory

Introduc)on to Ar)ficial Intelligence

Network alignment and querying

ECS 253 / MAE 253 April 26, Intro to Biological Networks, Motifs, and Model selection/validation

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016

Bioinformatics I. CPBS 7711 October 29, 2015 Protein interaction networks. Debra Goldberg

Algorithms, Lecture 3 on NP : Nondeterminis7c Polynomial Time

Overview. Overview. Social networks. What is a network? 10/29/14. Bioinformatics I. Networks are everywhere! Introduction to Networks

HOMEWORK #2 - MATH 3260

Protein Complex Identification by Supervised Graph Clustering

Structures and Hyperstructures in Metabolic Networks

Networks & pathways. Hedi Peterson MTAT Bioinformatics

Lecture 3: Markov chains.

Semi-Markov/Graph Cuts

SYSTEMS BIOLOGY 1: NETWORKS

Diclique clustering in a directed network

Evolutionary Tree Analysis. Overview

Systems biology and biological networks

Rela%ons and Their Proper%es. Slides by A. Bloomfield

SUPPLEMENTARY METHODS

Bayesian Networks. Why Joint Distributions are Important. CompSci 570 Ron Parr Department of Computer Science Duke University

Profiling Human Cell/Tissue Specific Gene Regulation Networks

ECS 289 F / MAE 298, Lecture 15 May 20, Diffusion, Cascades and Influence

COMPARATIVE PATHWAY ANNOTATION WITH PROTEIN-DNA INTERACTION AND OPERON INFORMATION VIA GRAPH TREE DECOMPOSITION

Chapter 15 Active Reading Guide Regulation of Gene Expression

Preface. Contributors

Machine Learning & Data Mining CS/CNS/EE 155. Lecture 8: Hidden Markov Models

Bellman s Curse of Dimensionality

Minimum Edit Distance. Defini'on of Minimum Edit Distance

Computer Vision. Pa0ern Recogni4on Concepts Part I. Luis F. Teixeira MAP- i 2012/13

Evolu&on of Cellular Interac&on Networks. Pedro Beltrao Krogan and Lim UCSF

CS281A/Stat241A Lecture 19

BIOINF 4120 Bioinforma2cs 2 - Structures and Systems -

Predicting causal effects in large-scale systems from observational data

Transcription:

CSCI1950 Z Computa3onal Methods for Biology Lecture 24 Ben Raphael April 29, 2009 hgp://cs.brown.edu/courses/csci1950 z/ Network Mo3fs Subnetworks with more occurrences than expected by chance. How to find? Exhaus3ve: Count all k node subgraphs. Heuris3c methods: sampling, greedy, etc. Approximate coun3ng via randomized algorithms. 1

Network Mo3fs Subnetworks with more occurrences than expected by chance. How to assess sta3s3cal significance? Compare number of occurrences to random network. Random Networks Occurrence of mo3fs depend strongly on network topology. What is an appropriate ensemble of random networks? (null model) 2

Random Networks One parameter governing occurrence of mo3fs is degree distribu3on. hgps://nwb.slis.indiana.edu/community/?n=customfillings.analysisofbiologicalnetworks Preserving Degree Distribu3on How to sample a graph with the same degree sequence? Method of Newman, Strogatz and Watts (2001) 1. Assign indegree i(v) and outdegree o(v) to vertex v according to degree sequence. 2. Randomly pair o(v) and i(w). 3

Network Mo3fs Transcrip3onal regulatory network of E. coli: 116 transcrip3on factors ~700 genes (operons) 577 interac3ons. Shen Orr et al. 2002 E. coli Network Mo3fs Enumerated all 3 and 4 node mo3fs. Looked for iden3cal rows in adjacency matrix (SIM) Used clustering algorithm to iden3fy DOR. Shen Orr et al. 2002 4

Coun3ng Subnetworks G = (V,E). V = n. E = m. Network centric approach Count/enumerate all subgraphs with k ver3ces. Imprac3cal for large n, m, k Query based approach Enumerate query graphs Q. For each Q, count occurrences. (Subgraph isomorphism) Q could be a non induced subgraph. Coun3ng non induced subgraphs Suppose want to count paths in G = (V,E). Idea: use color coding to count colorful paths Dynamic programming solu3on (Whiteboard) Can extend dynamic program to count trees and bounded treewidth graphs. 5

Rela3on between Forward and Viterbi VITERBI Ini0aliza0on: V 0 (0) = 1 V k (0) = 0, for all k > 0 Itera0on: FORWARD Ini0aliza0on: f 0 (0) = 1 f k (0) = 0, for all k > 0 Itera0on: V j (i) = e j (x i ) max k V k (i 1) a kj f l (i) = e l (x i ) Σ k f k (i 1) a kl Termina0on: Termina0on: P(x, π*) = max k V k (N) P(x) = Σ k f k (N) a k0 Importance of Network Mo3fs Building block of networks. Indicate modular structure of biological networks. Appearance of some mo3fs might be explained by par3cular dynamics (e.g. feedforward and feedback loops) Healthy skep3cism about all these claims, par3cularly because data is incomplete. 6

Network Integra3on Given: G = (V,E) interac3on network. V = genes E = protein DNA or proteinprotein interac3ons Normalized expression z score z ij for gene i in condi3on/sample j. Goal: Find ac3ve subnetworks. Subgraphs whose genes are are differen3ally expressed in many condi3ons. Ideker, et al. (2002); Chuang et al. (2007) (Whiteboard) Network Integra3on Given: G = (V,E) interac3on network. V = genes E = protein DNA or proteinprotein interac3ons M = [ z ij ] z scores of gene i in condi3on/sample j. Ideker, et al. (2002); Chuang et al. (2007) Goal: Find A* = argmax r A A: connected subgraph 7

Finding High scoring subnetwork Simulated Annealing: Global op3miza3on method. Iden3fy set of ac3ve nodes. G w = working subgraph induced by ac3ve nodes. Based on idea of random, local search similar to MCMC. Temperature func3on controls when moves to subop3mal neighbors are permitng. Temperature decreased during search, so that eventually segle in local op3mum. Results 8

Future: Knockout Experiments & Reverse Engineering Input: Signal Output: Gene/protein expression. Given input output rela3onship for normal ( wild type ) and mutant ( knockout ) cells, what can one infer about the network? Topology: hard or impossible de novo: too many combina3ons. New interac3ons or signs of exis3ng interac3ons. Future: Engineering Networks Engineer biological networks to perform new tasks. Change metabolic networks to create cells that produce new products. 9

Sources Shen Orr, S.S., Milo, R., Mangan, S., et al. 2002. Network mo3fs in the transcrip3onal regula3on network of Escherichia coli. Nature Gene;cs 31, 64 68. Newman, M.E.J., Strogatz, S.H., and WaGs, D.J. 2001. Random graphs with arbitrary degree distribu3ons and their applica3ons. Phys. Rev. E 64, 026118 026134. Ideker T, Ozier O, Schwikowski B, Siegel AF. Discovering regulatory and signalling circuits in molecular interac3on networks. Bioinforma;cs. 2002;18 Suppl 1:S233 40. Chuang HY, Lee E, Liu YT, Lee D, Ideker T. 2007. Network based classifica3on of breast cancer metastasis. Mol Syst Biol. 2007;3:140. 10