Biological Networks Analysis

Similar documents
Complex (Biological) Networks

Complex (Biological) Networks

Self Similar (Scale Free, Power Law) Networks (I)

Network motifs in the transcriptional regulation network (of Escherichia coli):

Biological Networks. Gavin Conant 163B ASRC

networks in molecular biology Wolfgang Huber

Erzsébet Ravasz Advisor: Albert-László Barabási

Networks. Can (John) Bruce Keck Founda7on Biotechnology Lab Bioinforma7cs Resource

BioControl - Week 6, Lecture 1

SYSTEMS BIOLOGY 1: NETWORKS

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it?

56:198:582 Biological Networks Lecture 9

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics

Random Boolean Networks

56:198:582 Biological Networks Lecture 10

Networks in systems biology

Lecture 8: Temporal programs and the global structure of transcription networks. Chap 5 of Alon. 5.1 Introduction

Written Exam 15 December Course name: Introduction to Systems Biology Course no

Network Biology: Understanding the cell s functional organization. Albert-László Barabási Zoltán N. Oltvai

Chapter 8: The Topology of Biological Networks. Overview

Lecture 6: The feed-forward loop (FFL) network motif

56:198:582 Biological Networks Lecture 8

Types of biological networks. I. Intra-cellurar networks

Cell biology traditionally identifies proteins based on their individual actions as catalysts, signaling

L3.1: Circuits: Introduction to Transcription Networks. Cellular Design Principles Prof. Jenna Rickus

7.32/7.81J/8.591J: Systems Biology. Fall Exam #1

FUNDAMENTALS of SYSTEMS BIOLOGY From Synthetic Circuits to Whole-cell Models

Bioinformatics I. CPBS 7711 October 29, 2015 Protein interaction networks. Debra Goldberg

Overview. Overview. Social networks. What is a network? 10/29/14. Bioinformatics I. Networks are everywhere! Introduction to Networks

Cross-fertilization between Proteomics and Computational Synthesis

Graph Alignment and Biological Networks

The architecture of complexity: the structure and dynamics of complex networks.

Introduction to Bioinformatics

CSCI1950 Z Computa3onal Methods for Biology Lecture 24. Ben Raphael April 29, hgp://cs.brown.edu/courses/csci1950 z/ Network Mo3fs

13.4 Gene Regulation and Expression

6.207/14.15: Networks Lecture 12: Generalized Random Graphs

Introduction to Systems Biology

arxiv: v1 [q-bio.mn] 30 Dec 2008

Seminar in Bioinformatics, Winter Network Motifs. An Introduction to Systems Biology Uri Alon Chapters 5-6. Meirav Zehavi

SUPPLEMENTARY METHODS

Bi 8 Lecture 11. Quantitative aspects of transcription factor binding and gene regulatory circuit design. Ellen Rothenberg 9 February 2016

V 6 Network analysis

Graph Theory Properties of Cellular Networks

Lecture 4: Transcription networks basic concepts

Graph Theory and Networks in Biology

Comparing transcription factor regulatory networks of human cell types. The Protein Network Workshop June 8 12, 2015

Graph Theory and Networks in Biology arxiv:q-bio/ v1 [q-bio.mn] 6 Apr 2006

Lecture 4: Yeast as a model organism for functional and evolutionary genomics. Part II

Analysis of Biological Networks: Network Robustness and Evolution

Chapter 15 Active Reading Guide Regulation of Gene Expression

Biological networks CS449 BIOINFORMATICS

Profiling Human Cell/Tissue Specific Gene Regulation Networks

Graph Theory Approaches to Protein Interaction Data Analysis

Divergence Pattern of Duplicate Genes in Protein-Protein Interactions Follows the Power Law

letter Network motifs in the transcriptional regulation network of Escherichia coli ... Z n X 1 X 2 X 3... X n Z 1 Z 2 Z 3 Z 4...

Comparative Network Analysis

Interaction Network Analysis

Systems biology and biological networks

Statistical analysis of biological networks.

Cellular Biophysics SS Prof. Manfred Radmacher

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016

Network models: random graphs

Simulation of Gene Regulatory Networks

arxiv:q-bio/ v1 [q-bio.mn] 13 Aug 2004

V 5 Robustness and Modularity

NETWORK BIOLOGY: UNDERSTANDING THE CELL S FUNCTIONAL ORGANIZATION

Gene Ontology and Functional Enrichment. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein

When one model is not enough: Combining epistemic tools in systems biology

Measuring TF-DNA interactions

Bioinformatics 2. Large scale organisation. Biological Networks. Non-biological networks

Department of Pathology, Northwestern University Medical School, Chicago, IL 60611

Clustering of Pathogenic Genes in Human Co-regulatory Network. Michael Colavita Mentor: Soheil Feizi Fifth Annual MIT PRIMES Conference May 17, 2015

Network Alignment 858L

Supplementary Note 1

CS224W: Analysis of Networks Jure Leskovec, Stanford University

Gene Autoregulation via Intronic micrornas and its Functions

Gene Regulation and Expression

Complex networks: an introduction

Exploiting network-based approaches for understanding gene regulation and function

GLOBEX Bioinformatics (Summer 2015) Genetic networks and gene expression data

Applications of Complex Networks

Frequent Pattern Finding in Integrated Biological Networks

Regulation of Gene Expression

Evolving Scale-Free Topologies using a Gene Regulatory Network Model

Clustering. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein. Some slides adapted from Jacques van Helden

Basic modeling approaches for biological systems. Mahesh Bule

Networks & pathways. Hedi Peterson MTAT Bioinformatics

BIOINFORMATICS CS4742 BIOLOGICAL NETWORKS

CS-E5880 Modeling biological networks Gene regulatory networks

GRAPH-THEORETICAL COMPARISON REVEALS STRUCTURAL DIVERGENCE OF HUMAN PROTEIN INTERACTION NETWORKS

Analysis of Complex Systems

Structure and Centrality of the Largest Fully Connected Cluster in Protein-Protein Interaction Networks

Correlation Networks

Overview of Network Theory

Bacterial Genetics & Operons

Analog Electronics Mimic Genetic Biochemical Reactions in Living Cells

Networks as a tool for Complex systems

Preface. Contributors

Temporal Networks aka time-varying networks, time-stamped graphs, dynamical networks...

Clustering. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein. Some slides adapted from Jacques van Helden

Transcription:

Biological Networks Analysis Degree Distribution and Network Motifs Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein

Networks: Networks vs. graphs A collection of nodesand links A quick review Directed/undirected; weighted/non-weighted, Networks as models vs. networks as tools Many types of biological networks The shortest path problem Dijkstra s algorithm 1. Initialize: Assign a distance value, D, to each node. Set D=0 for startnode and to infinity for all others. 2. For each unvisited neighbor of the current node: Calculate tentative distance, D t, through current node and if D t < D: D D t. Mark node as visited. 3. Continue with the unvisited node with the smallest distance

Comparing networks We want to find a way to compare networks. Similar (not identical) topology Common design principles We seek measures of network topology that are: Simple Capture global organization Potentially important (equivalent to, for example, GC content for genomes) Summary statistics

Node degree / rank Degree = Number of neighbors Node degree in PPI networks correlates with: Gene essentiality Conservation rate Likelihood to cause human disease

Degree distribution P(k): probability that a node has a degree of exactly k Common distributions: Poisson: Exponential: Power-law:

The power-law distribution Power-law distribution has a heavy tail! Characterized by a small number of highly connected nodes, known as hubs A.k.a. scale-free network Hubs are crucial: Affect errorand attacktolerance of complex networks (Albert et al. Nature, 2000)

Nodes 150,000 routers Edges physical links The Internet P(k) ~ k -2.3 Govindan and Tangmunarunkit, 2000

Movieactorcollaboration network Tropic Thunder(2008) Nodes 212,250 actors Edges co-appearance in a movie P(k) ~ k -2.3 Barabasi and Albert, Science, 1999

Protein proteininteraction networks Nodes Proteins Edges Interactions (yeast) P(k) ~ k -2.5 Yook et al, Proteomics, 2004

Nodes Metabolites Edges Reactions Metabolic networks P(k) ~ k -2.2±2 A.Fulgidus (archae) E. Coli (bacterium) Metabolic networks across all kingdoms of life are scale-free C.Elegans (eukaryote) Averaged (43 organisms) Jeong et al., Nature, 2000

Why do so many real-life networks exhibit a power-law degree distribution? Is it selected for? Is it expected by change? Does it have anything to do with the way networks evolve? Does it have functional implications??

Network motifs Going beyond degree distribution Generalization of sequence motifs Basic building blocks Evolutionary design principles?

What are network motifs? Recurring patterns of interaction (sub-graphs) that are significantly overrepresented(w.r.t. a background model) 13 possible 3-nodes sub-graphs (199 possible 4-node sub-graphs) R. Milo et al. Network motifs: simple building blocks of complex networks. Science, 2002

Finding motifs in the network 1a. Scan all n-node sub-graphs in the real network 1b. Record number of appearances of each sub-graph (consider isomorphic architectures) 2. Generate a large set of random networks 3a. Scan for all n-node sub-graphs in random networks 3b. Record number of appearances of each sub-graph 4. Compare each sub-graph s data and identify motifs

Finding motifs in the network

Network randomization How should the set of random networks be generated? Do we really want completely random networks? What constitutes a good null model? Preserve in- and out-degree

Generation of randomized networks Network randomization algorithm : Start with the real network and repeatedly swap randomly chosen pairs of connections (X1 Y1, X2 Y2 is replaced by X1 Y2, X2 Y1) X1 Y1 X1 Y1 X2 Y2 X2 Y2 (Switching is prohibited if the either of the X1 Y2 or X2 Y1 already exist) Repeat until the network is well randomized

Motifs in transcriptional regulatory networks E. Coli network 424 operons (116 TFs) 577 interactions Significant enrichment of motif # 5 X Master TF Y Specific TF Z Target (40 instances vs. 7±3) Feed-Forward Loop (FFL) S. Shen-Orr et al. Nature Genetics 2002

What s so cool about FFLs Boolean Kinetics dy / dt = F( X, T ) ay dz / dt = F( X, T ) F( Y, T ) az y y z A simple cascade has slower shutdown A coherent feed-forward loop can act as a circuit that rejects transient activation signals from the general transcription factor and responds only to persistent signals, while allowing for a rapid system shutdown.

Network motifs in biological networks Why do these networks have similar motifs? Why is this network so different?

Motif-based network super-families R. Milo et al. Superfamilies of evolved and designed networks. Science, 2004

Computational representation of networks A B C D List of edges: (ordered) pairs of nodes [ (A,C), (C,B), (D,B), (D,C) ] Connectivity Matrix A B C D A 0 0 1 0 B 0 0 0 0 C 0 1 0 0 D 0 1 1 0 Name:D ngr: p1 p2 Object Oriented Name:C ngr: p1 Name:B ngr: Name:A ngr: p1 Which is the most useful representation?

Generation of randomized networks Algorithm B (Generative): Record marginal weights of original network Start with an empty connectivity matrix M Choose a row n& a column m according to marginal weights If M nm = 0, set M nm = 1; Update marginal weights Repeat until all marginal weights are 0 If no solution is found, start from scratch A C B D A B C D A 0 0 1 0 1 B 0 0 0 0 0 C 0 1 0 0 2 D 0 1 1 0 2 0 2 2 0 A B C D A 0 0 0 0 1 B 0 0 0 0 0 C 0 0 0 0 2 D 0 0 0 0 2 0 2 2 0 A B C D A 0 0 0 0 1 B 0 0 0 0 0 C 0 0 0 0 2 D 0 0 0 0 2 0 2 2 0 A B C D A 0 0 0 0 1 B 0 0 0 0 0 C 0 1 0 0 1 D 0 0 0 0 2 0 1 2 0