Self Similar (Scale Free, Power Law) Networks (I)

Similar documents
networks in molecular biology Wolfgang Huber

Biological Networks Analysis

Chapter 8: The Topology of Biological Networks. Overview

Erzsébet Ravasz Advisor: Albert-László Barabási

BioControl - Week 6, Lecture 1

Heavy Tails: The Origins and Implications for Large Scale Biological & Information Systems

Network Biology: Understanding the cell s functional organization. Albert-László Barabási Zoltán N. Oltvai

Biological Networks. Gavin Conant 163B ASRC

The architecture of complexity: the structure and dynamics of complex networks.

Network models: random graphs

Complex (Biological) Networks

Complex (Biological) Networks

6.207/14.15: Networks Lecture 12: Generalized Random Graphs

1 Complex Networks - A Brief Overview

7.32/7.81J/8.591J: Systems Biology. Fall Exam #1

Written Exam 15 December Course name: Introduction to Systems Biology Course no

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics

Complex-Network Modelling and Inference

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it?

Graph Theory and Networks in Biology arxiv:q-bio/ v1 [q-bio.mn] 6 Apr 2006

SYSTEMS BIOLOGY 1: NETWORKS

56:198:582 Biological Networks Lecture 8

Networks as a tool for Complex systems

CS224W: Analysis of Networks Jure Leskovec, Stanford University

Social Networks- Stanley Milgram (1967)

3.B.1 Gene Regulation. Gene regulation results in differential gene expression, leading to cell specialization.

CS224W: Social and Information Network Analysis

Data Mining and Analysis: Fundamental Concepts and Algorithms

L3.1: Circuits: Introduction to Transcription Networks. Cellular Design Principles Prof. Jenna Rickus

Complex networks: an introduction

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016

Graph Theory and Networks in Biology

56:198:582 Biological Networks Lecture 10

Lecture 8: Temporal programs and the global structure of transcription networks. Chap 5 of Alon. 5.1 Introduction

Lecture 4: Transcription networks basic concepts

1 Mechanistic and generative models of network structure

Bioinformatics I. CPBS 7711 October 29, 2015 Protein interaction networks. Debra Goldberg

Overview. Overview. Social networks. What is a network? 10/29/14. Bioinformatics I. Networks are everywhere! Introduction to Networks

Networks. Can (John) Bruce Keck Founda7on Biotechnology Lab Bioinforma7cs Resource

Network models: dynamical growth and small world

Analysis of Biological Networks: Network Robustness and Evolution

Bi 1x Spring 2014: LacI Titration

Introduction to Bioinformatics

Control of Gene Expression in Prokaryotes

Graph Theory Properties of Cellular Networks

Name Period The Control of Gene Expression in Prokaryotes Notes

Regulation of Gene Expression

56:198:582 Biological Networks Lecture 9

NETWORK BIOLOGY: UNDERSTANDING THE CELL S FUNCTIONAL ORGANIZATION

Chapter 15 Active Reading Guide Regulation of Gene Expression

Computational Biology: Basics & Interesting Problems

Overview of Network Theory

Graph Theory Approaches to Protein Interaction Data Analysis

Erdős-Renyi random graphs basics

Inferring Transcriptional Regulatory Networks from Gene Expression Data II

Unit 3: Control and regulation Higher Biology

Systems biology and biological networks

Adventures in random graphs: Models, structures and algorithms

Network motifs in the transcriptional regulation network (of Escherichia coli):

Understanding Science Through the Lens of Computation. Richard M. Karp Nov. 3, 2007

Interaction Network Analysis

Lecture 4: Yeast as a model organism for functional and evolutionary genomics. Part II

Complex Graphs and Networks Lecture 3: Duplication models for biological networks

Analog Electronics Mimic Genetic Biochemical Reactions in Living Cells

Name: SBI 4U. Gene Expression Quiz. Overall Expectation:

Theoretical Physics Methods for Computational Biology. Third lecture

13.4 Gene Regulation and Expression

Decision Making and Social Networks

How Scale-free Type-based Networks Emerge from Instance-based Dynamics

Genetic transcription and regulation

Gene Regulation and Expression

Computational Genomics. Systems biology. Putting it together: Data integration using graphical models

CSCI1950 Z Computa3onal Methods for Biology Lecture 24. Ben Raphael April 29, hgp://cs.brown.edu/courses/csci1950 z/ Network Mo3fs

Course plan Academic Year Qualification MSc on Bioinformatics for Health Sciences. Subject name: Computational Systems Biology Code: 30180

Genetic transcription and regulation

UNIT 6 PART 3 *REGULATION USING OPERONS* Hillis Textbook, CH 11

Computational Cell Biology Lecture 4

BME 5742 Biosystems Modeling and Control

LINK ANALYSIS. Dr. Gjergji Kasneci Introduction to Information Retrieval WS

Shlomo Havlin } Anomalous Transport in Scale-free Networks, López, et al,prl (2005) Bar-Ilan University. Reuven Cohen Tomer Kalisky Shay Carmi

Graph Alignment and Biological Networks

Networks & pathways. Hedi Peterson MTAT Bioinformatics

CS-E5880 Modeling biological networks Gene regulatory networks

Bi 8 Lecture 11. Quantitative aspects of transcription factor binding and gene regulatory circuit design. Ellen Rothenberg 9 February 2016

Almost giant clusters for percolation on large trees

Random Lifts of Graphs

Biological networks CS449 BIOINFORMATICS

Complex Networks, Course 303A, Spring, Prof. Peter Dodds

Lecture 6: The feed-forward loop (FFL) network motif

Metanetworks of artificially evolved regulatory networks

Biology. Biology. Slide 1 of 26. End Show. Copyright Pearson Prentice Hall

Cell biology traditionally identifies proteins based on their individual actions as catalysts, signaling

Simulation of Gene Regulatory Networks

Types of biological networks. I. Intra-cellurar networks

Welcome to Class 21!

FUNDAMENTALS of SYSTEMS BIOLOGY From Synthetic Circuits to Whole-cell Models

Directed Scale-Free Graphs

Random Boolean Networks

V 5 Robustness and Modularity

Gene regulation II Biochemistry 302. February 27, 2006

Transcription:

Self Similar (Scale Free, Power Law) Networks (I) E6083: lecture 4 Prof. Predrag R. Jelenković Dept. of Electrical Engineering Columbia University, NY 10027, USA {predrag}@ee.columbia.edu February 7, 2007 Jelenković (Columbia University) Self Similar Networks February 7, 2007 1 / 31

Outline 1 Cell as a Regulatory Network Background Examples: Gene Regulatory Network 2 Scale free Network Erdös Rényi model Scale Free Network Jelenković (Columbia University) Self Similar Networks February 7, 2007 2 / 31

Outline 1 Cell as a Regulatory Network Background Examples: Gene Regulatory Network 2 Scale free Network Erdös Rényi model Scale Free Network Jelenković (Columbia University) Self Similar Networks February 7, 2007 3 / 31

Introduction Network symbols Nodes: biological objects (proteins, genes) Edges: interaction between nodes Examples Network node Edges Metabolic networks metabolites interaction Transcriptional interactions genes regulation Protein folding networks residue folding neighbors Jelenković (Columbia University) Self Similar Networks February 7, 2007 4 / 31

DNA & Genes Jelenković (Columbia University) Self Similar Networks February 7, 2007 5 / 31

Gene Transcription Regulation The transcription rate is controlled by the promoter. Transcription Factors (TF), including activators and repressors, binds the sites in promoter. TFs are regulated by other TFs, and form a network. TFs are encoded in genes. Jelenković (Columbia University) Self Similar Networks February 7, 2007 6 / 31

Gene Regulatory Network Cell s gene regulatory network refers to the coordinated on and off switching of genes by regulatory proteins that bind to non-coding DNA. How to discover edges? Most work in this area has focused on reconstructing the network from data/experiments, for example, find the correlation function ρ of the number of proteins, the hypothesis is that if two genes are positively/negatively regulated, then ρ is close to ±1, meaning, A appears with high probability if B is present, then... Also, some researchers use mutual information as a measure of gene closeness. Jelenković (Columbia University) Self Similar Networks February 7, 2007 7 / 31

Outline 1 Cell as a Regulatory Network Background Examples: Gene Regulatory Network 2 Scale free Network Erdös Rényi model Scale Free Network Jelenković (Columbia University) Self Similar Networks February 7, 2007 8 / 31

http://www.biochemj.org/bj/381/0001/bj3810001.htm Figure: Regulatory network of transcription factors (TFs) in E. coli. Jelenković (Columbia University) Self Similar Networks February 7, 2007 9 / 31

http://www.biomedcentral.com/1471-2105/5/199 Figure: Hierarchical structure and modules in the E. coli transcriptional regulatory network The original unorganized network vs. the hierarchical regulation structure. Nodes in the graph are operons. Links show the transcriptional regulatory relationships. The global regulators found in this work are shown in red. Jelenković (Columbia University) Self Similar Networks February 7, 2007 10 / 31

http://www.biomedcentral.com/1471-2105/5/199 Operons in different modules are shown in different colors. The ten global regulators form the core part of the network. The periphery modules are connected mainly through the global regulators. Depending on the connectivity between the modules and their connectivity to the global regulators, these modules can be further grouped to larger modules at a higher level. Figure: Functional modules in the transcriptional regulatory network of E. coli Jelenković (Columbia University) Self Similar Networks February 7, 2007 11 / 31

Characterizing metabolic networks of E. Coli Network biology (Barabasi & Oltvai, Nature, 2004) (d) The degree distribution, P(k) of the metabolic network illustrates its scale-free topology. (e) The scaling of the clustering coefficient C(k) (defined later) with the degree k illustrates the hierarchical architecture of metabolism. (f) The flux distribution in the central metabolism of E. Coli follows a power law, which indicates that most reactions have small metabolic flux, whereas a few reactions, with high fluxes, carry most of the metabolic activity. Jelenković (Columbia University) Self Similar Networks February 7, 2007 12 / 31

Questions What is the topology of this network? Are there basic structures (subgraphs/subnetworks, motifs)? How do we model the operations of regulatory networks? (analogy circuits: gates, logic?) How does evolution change regulatory networks? Impact of natural selection (fitness), motifs.. Resilience to attacks (targeted or random), disease, etc. We could have a whole course on gene regulatory networks (Spring 2008). Jelenković (Columbia University) Self Similar Networks February 7, 2007 13 / 31

Power Law Random Graph Scale Free Network The observations of power-law distributions in the connectivity of complex networks came as a surprise to researchers deeply rooted in the tradition of random networks. Traditional random graph - Erdos Renyi model VS Scale Free Network - Barabási model Figure: Concentrated Degree distribution: Poisson Figure: Power Law Degree distribution Jelenković (Columbia University) Self Similar Networks February 7, 2007 14 / 31

Outline 1 Cell as a Regulatory Network Background Examples: Gene Regulatory Network 2 Scale free Network Erdös Rényi model Scale Free Network Jelenković (Columbia University) Self Similar Networks February 7, 2007 15 / 31

Introduction to Erdös Rényi Model G(n, p) is a graph with n nodes where an edge has probability p to be selected. Average degree d = ED = p(n 1) pn; P[D = k] = ( ) m k p k (1 p) m k (d k /k!)exp( d). Sharply concentrated around its mean, i.e., Poisson-like. Percolation transition, threshold behavior at d = 1. If d < 1, then with high probability the network is forming mostly trees and no component is larger than log n. If d > 1, there is a unique giant component. Jelenković (Columbia University) Self Similar Networks February 7, 2007 16 / 31

The Clustering Coefficient of a Network Let N(u) denote the set of neighbors of u in a graph: N(u) = {v : (u, v) G}. The clustering coefficient of u: let ( k = N(u) (i.e., the number of neighbors of u); k ) 2 = max possible # of edges between vertices in N(u); c(u) = (actual # of edges between vertices in N(u))/ ( k 2). 0 c(u) 1; measure of cliquishness of u s neighborhood. Clustering coefficient of a graph: average of c(u) over all vertices. Real networks often have high clustering C real C rnd. Jelenković (Columbia University) Self Similar Networks February 7, 2007 17 / 31

Main Network Types Jelenković (Columbia University) Self Similar Networks February 7, 2007 18 / 31

Internet Topology (Michalis Faloutsos, Petros Faloutsos & Christos Faloutsos 1999) Figure: The structure of Internet at a) the router level and b) the inter-domain level. The hosts connect to routers in LANs. Figure: Log-log plot of the outdegree d, versus the rank in the sequence of decreasing outdegree. Data in Nov 97 and April 98. Jelenković (Columbia University) Self Similar Networks February 7, 2007 19 / 31

Internet Topology The outdegree, indegree distribution follow power laws. The total number of pairs of nodes within h hops follow power laws. The eigenvalues λ 1 λ 2 λ n follow power laws. (the eigenvalues of a graph are closely related to many basic topological properties such as the diameter, the number of edges, the number of spanning trees, the number of connected components...) Jelenković (Columbia University) Self Similar Networks February 7, 2007 20 / 31

Outline 1 Cell as a Regulatory Network Background Examples: Gene Regulatory Network 2 Scale free Network Erdös Rényi model Scale Free Network Jelenković (Columbia University) Self Similar Networks February 7, 2007 21 / 31

Many Natural Networks A heavy-tailed degree distribution: a small but distinctive number of high-degree vertices serve as hubs. Few connected components: often only 1 or a small number independent of network size Small diameter: often growing only logarithmically with network size A high degree of clustering Jelenković (Columbia University) Self Similar Networks February 7, 2007 22 / 31

Mechanisms resulting power law random graph Preferential Attachment: the rich get richer As new connections form, they attach to a node with a probability proportional to the existing number of connections (growth and preferential attachment). Copying Models The linear growth copying model was introduced by Kleinberg et al. in 1999. New mechanism: generalized random walk (GRW) The evolvement of large scale systems (e.g., self-assemble DNA network, Internet, social network) is attributed to rules lying into two categories: global information and local information. (The preceding copying model can be viewed as a special case of random walk attachments.) Jelenković (Columbia University) Self Similar Networks February 7, 2007 23 / 31

Preferential Attachment Informal derivation At time 0, one node is present, and at each step t + +, a new vertex is added, with one undirected edge preferentially ( k i ) attached to one existing node. Assume that vertex i was added to the system at time t i. Then, at time t, dk i dt = k ( ) i t 1/2 2t k i =. t i Jelenković (Columbia University) Self Similar Networks February 7, 2007 24 / 31

Preferential Attachment: Continued... Thus, the degree D distribution is obtained by which implies P[D = x] 1 x 3. P[D > x] = P[t i < t/x 2 ] = t x 2 (t + 1), A rigorous analysis of preferential attachment was first given by Bollobás et al. Let the number of vertices of G n with indegree equal to d be X n (d), and consider G n as one graph from the process {G t : 0 t n}. The martingale X t = E[X n (d) G t ] satisfies that X t+1 X t is bounded by two. Applying Azuma-Hoeffding inequality, we obtain that X n (d) is very concentrated around its mean, and thus only need to compute E[X n (d)]. Jelenković (Columbia University) Self Similar Networks February 7, 2007 25 / 31

Preferential Attachment: Continued... What if attaching proportional to k α i? If α > 1, eventually one person gets all the links. There is a finite time after which no one else gets anything! If α < 1, the degree distribution follows a stretched exponential. Limitation of preferential attachment Global information. Number of nodes increases linearly. Jelenković (Columbia University) Self Similar Networks February 7, 2007 26 / 31

The origin of the scale-free topology in biological networks The new protein has the same structure as the old one, so they both interact with the same proteins. Therefore proteins with a large number of interactions tend to gain links more often, as it is more likely that they interact with the protein that has been duplicated. Jelenković (Columbia University) Self Similar Networks February 7, 2007 27 / 31

Discovering motifs Motifs are those patterns which occur significantly more frequently in real than in equivalent randomized networks. Look for all possible two- or three-node configurations. Jelenković (Columbia University) Self Similar Networks February 7, 2007 28 / 31

Yeast Regulatory Network Motifs Lee et al, Science 2002 Jelenković (Columbia University) Self Similar Networks February 7, 2007 29 / 31

Motifs Of The Yeast Protein Network S. Wuchty, Z. Oltvai & A.-L. Barabasi, Nature Genetics, 2003 Jelenković (Columbia University) Self Similar Networks February 7, 2007 30 / 31

Scale free network caused by random walk Figure: Node=2000, Random Walk p = 0.6. Jelenković (Columbia University) Self Similar Networks February 7, 2007 31 / 31