Cell biology traditionally identifies proteins based on their individual actions as catalysts, signaling

Similar documents
Network Biology: Understanding the cell s functional organization. Albert-László Barabási Zoltán N. Oltvai

The architecture of complexity: the structure and dynamics of complex networks.

Divergence Pattern of Duplicate Genes in Protein-Protein Interactions Follows the Power Law

Department of Pathology, Northwestern University Medical School, Chicago, IL 60611

Analysis of Biological Networks: Network Robustness and Evolution

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics

Evidence for dynamically organized modularity in the yeast protein-protein interaction network

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it?

Deterministic scale-free networks

Types of biological networks. I. Intra-cellurar networks

Erzsébet Ravasz Advisor: Albert-László Barabási

Self-organized scale-free networks

BioControl - Week 6, Lecture 1

NETWORK BIOLOGY: UNDERSTANDING THE CELL S FUNCTIONAL ORGANIZATION

Supplementary Information

networks in molecular biology Wolfgang Huber

The geneticist s questions

PROTEOMICS. Is the intrinsic disorder of proteins the cause of the scale-free architecture of protein-protein interaction networks?

Cross-fertilization between Proteomics and Computational Synthesis

Interaction Network Topologies

Connectivity and expression in protein networks: Proteins in a complex are uniformly expressed

Biological Networks Analysis

Graph Alignment and Biological Networks

University of Notre Dame. Zoltán N. Oltvai Northwestern Univ., Medical School

Biological Networks. Gavin Conant 163B ASRC

How Scale-free Type-based Networks Emerge from Instance-based Dynamics

arxiv:q-bio/ v1 [q-bio.mn] 13 Aug 2004

Peeling the yeast protein network

Computational approaches for functional genomics

Graph Theory and Networks in Biology arxiv:q-bio/ v1 [q-bio.mn] 6 Apr 2006

arxiv:physics/ v1 [physics.bio-ph] 30 Sep 2002

Towards Detecting Protein Complexes from Protein Interaction Data

A MODEL OF LARGE-SCALE PROTEOME EVOLUTION

Cellular Biophysics SS Prof. Manfred Radmacher

Graph Theory and Networks in Biology

Self Similar (Scale Free, Power Law) Networks (I)

Application of random matrix theory to microarray data for discovering functional gene modules

Complex (Biological) Networks

arxiv:cond-mat/ v1 [cond-mat.dis-nn] 4 May 2000

Systems biology and biological networks

Fine-scale dissection of functional protein network. organization by dynamic neighborhood analysis

An introduction to SYSTEMS BIOLOGY

GRAPH-THEORETICAL COMPARISON REVEALS STRUCTURAL DIVERGENCE OF HUMAN PROTEIN INTERACTION NETWORKS

Genome-Scale Gene Function Prediction Using Multiple Sources of High-Throughput Data in Yeast Saccharomyces cerevisiae ABSTRACT

Understanding Science Through the Lens of Computation. Richard M. Karp Nov. 3, 2007

Networks as a tool for Complex systems

Node Degree Distribution in Amino Acid Interaction Networks

Network landscape from a Brownian particle s perspective

Social Networks- Stanley Milgram (1967)

Global organization of metabolic fluxes in the bacterium, Escherichia coli

Bioinformatics I. CPBS 7711 October 29, 2015 Protein interaction networks. Debra Goldberg

Overview. Overview. Social networks. What is a network? 10/29/14. Bioinformatics I. Networks are everywhere! Introduction to Networks

Weighted gene co-expression analysis. Yuehua Cui June 7, 2013

Comparison of Protein-Protein Interaction Confidence Assignment Schemes

Theoretical Physics Methods for Computational Biology. Third lecture

CHAPTER 1 SCALE-FREE NETWORKS IN BIOLOGY

Proteomics. 2 nd semester, Department of Biotechnology and Bioinformatics Laboratory of Nano-Biotechnology and Artificial Bioengineering

V14 Graph connectivity Metabolic networks

Research Article A Topological Description of Hubs in Amino Acid Interaction Networks

Complex (Biological) Networks

Topology of technology graphs: Small world patterns in electronic circuits

Scale- free networks in cell biology

Lecture 10: May 19, High-Throughput technologies for measuring proteinprotein

Computational Network Biology Biostatistics & Medical Informatics 826 Fall 2018

arxiv: v2 [q-bio.mn] 5 Sep 2016

Computational Systems Biology

Dr. Amira A. AL-Hosary

Introduction to Bioinformatics

The complex topology of chemical plants

Welcome to HST.508/Biophysics 170

The Generalized Topological Overlap Matrix For Detecting Modules in Gene Networks

Measuring the shape of degree distributions

Analysis of Complex Systems

Introduction to Bioinformatics Integrated Science, 11/9/05

Comparative Network Analysis

Preface. Contributors

Biological Knowledge Discovery Through Mining Multiple Sources of High-Throughput Data

ANAXOMICS METHODOLOGIES - UNDERSTANDING

arxiv: v1 [q-bio.mn] 30 Dec 2008

Overview of Network Theory

A General Framework for Weighted Gene Co-Expression Network Analysis. Steve Horvath Human Genetics and Biostatistics University of CA, LA

The Activity Reaction Core and Plasticity of Metabolic Networks

Finding important hubs in scale-free gene networks

biologically-inspired computing lecture 5 Informatics luis rocha 2015 biologically Inspired computing INDIANA UNIVERSITY

Written Exam 15 December Course name: Introduction to Systems Biology Course no

Bayesian Inference of Protein and Domain Interactions Using The Sum-Product Algorithm

Nature Genetics: doi: /ng Supplementary Figure 1. Icm/Dot secretion system region I in 41 Legionella species.

identifiers matched to homologous genes. Probeset annotation files for each array platform were used to

A network of protein protein interactions in yeast

Scale-free networks in cell biology

Tiffany Samaroo MB&B 452a December 8, Take Home Final. Topic 1

A General Model for Amino Acid Interaction Networks

V 5 Robustness and Modularity

Evolution of Complex Modular Biological Networks

Evolutionary Games and Computer Simulations

Proteomics Systems Biology

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Using Evolutionary Approaches To Study Biological Pathways. Pathways Have Evolved

METABOLIC PATHWAY PREDICTION/ALIGNMENT

Chapter 8: The Topology of Biological Networks. Overview

Transcription:

Lethality and centrality in protein networks Cell biology traditionally identifies proteins based on their individual actions as catalysts, signaling molecules, or building blocks of cells and microorganisms. Currently, we witness the emergence of a post-genomic view that expands the protein s role, regarding it as an element in a network of proteinprotein interactions as well, with a 'contextual' or 'cellular' function within functional modules 1, 2. Here we provide quantitative support for this paradigm shift by demonstrating that the phenotypic consequence of a single gene deletion in the yeast, S. cerevisiae, is affected, to a high degree, by the topologic position of its protein product in the complex, hierarchical web of molecular interactions. The S. cerevisiae protein-protein interaction network we investigate has 187 proteins as nodes, connected by 224 identified direct physical interactions, and is derived from combined, nonoverlapping data 3, 4 obtained mostly by systematic two-hybrid analyses 3. Due to its size, a complete map of the network (Fig. 1a), while informative, in itself offers little insight into its large-scale characteristics. Thus, our first goal was to identify the architecture of this network, determining if it is best described by an inherently uniform exponential topology with proteins on average possessing the same number of links, or by a highly heterogeneous scale-free topology with proteins having widely different connectivities 5. As we show in Fig. 1b, the probability that a given yeast protein interacts with k other yeast proteins follows a power-law 5 with an exponential cutoff 6 at k c 2, a topology that is also shared by the protein-protein interaction network of the bacterium, H. pylori 7. This indicates that the network of protein interactions in two separate organisms forms a highly inhomogeneous scale-free network in which a few highly connected proteins play a central role in mediating interactions among numerous, less connected proteins. An important known consequence of the inhomogeneous structure is the network s simultaneous tolerance against random errors coupled with fragility against the removal of the most connected nodes 8. Indeed, we find that random mutations in the genome of S. cerevisiae, -modeled by the removal of randomly selected yeast proteins-, do not affect the overall topology of the network. In contrast, when 1

the most connected proteins are computationally eliminated, the network diameter increases rapidly. This simulated tolerance against random mutation is in agreement with systematic mutagenesis studies, which identified a striking capacity of yeast to tolerate the deletion of a substantial number of individual proteins from its proteome 9, 1. Yet, if indeed this is due to a topological component to error tolerance, on average less connected proteins should prove less essential than highly connected ones. To assess this hypothesis, we rank ordered all interacting proteins based on the number of links they have and correlated this with the phenotypic effect of their individual removal from the yeast proteome. As shown in Fig. 1c, the likelihood that removal of a protein will prove lethal clearly correlates with the number of interactions the protein has. For example, while proteins with five or less links constitute ~93% of the total number of proteins we find that only ~21% of them are essential. In contrast, only ~.7% of the yeast proteins with known phenotypic profile have more than 15 links but single deletion of ~62% of these proves lethal. This implies that highly-connected proteins with a central role in the network s architecture are three times more likely to prove essential than proteins with low number of links to other proteins. The simultaneous emergence of an inhomogeneous structure in both metabolic- 5, 11 and protein interaction networks indicates the evolutionary selection of a common large scale structure of biological networks, and strongly suggests that future systematic protein-protein interaction studies in other organisms will uncover an essentially identical protein network topology. The correlation between the connectivity and indispensability of a given protein confirms that despite the importance of individual biochemical function and genetic redundancy, the robustness against mutations in yeast is also derived from the organization of interactions and topologic position of individual proteins 12. Thus, a better understanding of cell dynamics and robustness will be obtained from integrated approaches that simultaneously incorporate the individual and contextual properties of all constituents of complex cellular networks. 2

H. Jeong 1, S. P. Mason 2, A.-L. Barabási 1 and Z. N. Oltvai 2 1 Department of Physics, University of Notre Dame, Notre Dame, IN 46556 and 2 Department of Pathology, Northwestern University Medical School, Chicago, IL 6611 E-mail: alb@nd.edu, zno8@nwu.edu References 1. Hartwell, L.H., Hopfield, J.J., Leibler, S. & Murray, A.W. Nature 42, C47-52 (1999). 2. Eisenberg, D., Marcotte, E.M., Xenarios, I. & Yeates, T.O. Nature 45, 823-6 (2). 3. Uetz, P., et al. Nature 43, 623-7 (2). 4. Xenarios, I., et al. Nucleic Acids Res 28, 289-91 (2). 5. Jeong, H., Tombor, B., Albert, R., Oltvai, Z.N. & Barabási, A.-L. Nature 47, 651-654 (2). 6. Amaral, L.A., Scala, A., Barthelemy, M. & Stanley, H.E. Proc. Natl. Acad. Sci. 97, 11149-11152 (2). 7. Rain, J.-C., et al. Nature 49, 211-215 (21). 8. Albert, R., Jeong, H. & Barabási, A.-L. Nature 46, 378-382 (2). 9. Winzeler, E.A., et al. Science 285, 91-6 (1999). 1. Ross-Macdonald, P., et al. Nature 42, 413-8 (1999). 11. Fell, D.A. & Wagner, A. in Animating the cellular map (eds Hofmeyr, J.-H. S., Rohwer, J. M. & Snoep, J.L.) 79-85 (Stellenbosch University Press, Stellenbosch, 2). 12. Wagner, A. Nat Genet 24, 355-61 (2). 13. Costanzo, M.C., et al. Nucleic Acids Res 28, 73-6 (2). 3

Figure Legend Characteristics of the yeast proteome. (a) Map of protein-protein interactions. The largest cluster, which contains ~78% of all proteins is shown. The color of a node signifies the phenotypic effect of removing the corresponding protein (red=lethal, green=non-lethal, orange=slow growth, yellow=unknown) (b) Connectivity distribution P(k) of interacting yeast proteins giving the probability that a given protein interacts with k other proteins. The exponential cutoff 6 indicates that the number of proteins with more than 2 interactions is slightly less than expected for pure scale-free networks. In the absence of data on the link directions, all interactions have been considered bi-directional. The parameter controlling the short length scale correction has value k 1. (c) The fraction of essential proteins with exactly k links versus their connectivity k in the yeast proteome. The list of 1572 mutants with known phenotypic profile was obtained from the Proteome database 13. Detailed statistical analysis, including r=.75 value for Pearson s linear correlation coefficient, demonstrates a positive correlation between lethality and connectivity. For additional details see http://www.nd.edu/~networks/cell. 4

(a) Pajek c log(p(k))+k/k -2-4 -6-8 (b) % of essential proteins 8 6 4 2 1 1 1 k+k (c) 5 1 15 # of links 2 5