Representing metabolic networks

Similar documents
Stoichiometric vector and matrix

Stoichiometric network analysis

Metabolic modelling. Metabolic networks, reconstruction and analysis. Esa Pitkänen Computational Methods for Systems Biology 1 December 2009

Predicting rice (Oryza sativa) metabolism

Description of the algorithm for computing elementary flux modes

Biological networks CS449 BIOINFORMATICS

An introduction to biochemical reaction networks

Written Exam 15 December Course name: Introduction to Systems Biology Course no

Chapter 7: Metabolic Networks

BCH Graduate Survey of Biochemistry

Transformation of Energy! Energy is the ability to do work.! Thermodynamics is the study of the flow and transformation of energy in the universe.

Program for the rest of the course

Biochemical Pathways

Outline. Metabolism: Energy and Enzymes. Forms of Energy. Chapter 6

Self Similar (Scale Free, Power Law) Networks (I)

Data Mining and Analysis: Fundamental Concepts and Algorithms

1. Why do you have to breath in

V14 Graph connectivity Metabolic networks

Biochemistry 3300 Problems (and Solutions) Metabolism I

Electron Transport Chain (Respiratory Chain) - exercise - Vladimíra Kvasnicová

Photosynthesis. Nearly all of the usable energy on this planet came, at one time or another, from the sun by the process of photosynthesis

Expanding Metabolic Networks: Scopes of Compounds, Robustness and Evolution

Oxidative Phosphorylation versus. Photophosphorylation

Ch. 10 Photosynthesis: The Calvin Cycle Life from Air

Mechanism of CO 2 Fixation: Rubisco Step 1

2 4 Chemical Reactions and Enzymes

Pathways that Harvest and Store Chemical Energy

2) This is a learning problem. You will learn something. You will appreciate this information. If not soon, than someday. Ready?

37 Modelling and experimental evidence for two separate steady states in the photosynthetic Calvin cycle

Chapter 15 part 2. Biochemistry I Introduction to Metabolism Bioenergetics: Thermodynamics in Biochemistry. ATP 4- + H 2 O ADP 3- + P i + H +

Methodology article Inferring branching pathways in genome-scale metabolic networks Esa Pitkänen* 1, Paula Jouhten 2 and Juho Rousu 1

Photosynthesis: The Calvin Cycle

Chemistry 5.07SC Biological Chemistry I Fall Semester, 2013

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Photosynthesis and Cellular Respiration

Cellular Energy: Respiration. Goals: Anaerobic respiration

Q1. The diagram shows a summary of the light-independent reaction of photosynthesis.

Exam 3 Review (4/12/2011) Lecture note excerpt covering lectures (Exam 3 topics: Chapters 8, 12, 14 & 15)

2015 AP Biology PRETEST Unit 3: Cellular Energetics Week of October

Bio102 Problems Photosynthesis

Photosynthesis Review Packet

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it?

Cell Energy: The Big Picture. So, What Exactly is ATP. Adenosine Triphosphate. Your turn to Practice converting ATP to ADP:

Structural Analysis of Expanding Metabolic Networks

Lecture 7: Enzymes and Energetics

Photosynthesis. Chapter 10. Active Lecture Questions for use with Classroom Response Systems Biology, Seventh Edition Neil Campbell and Jane Reece

Systems Biology Across Scales: A Personal View XIV. Intra-cellular systems IV: Signal-transduction and networks. Sitabhra Sinha IMSc Chennai

Chapter 6: Energy and Metabolism

networks in molecular biology Wolfgang Huber

Bioinformatics: Network Analysis

Chapter 8: An Introduction to Metabolism

V15 Flux Balance Analysis Extreme Pathways

Photosynthesis and Cellular Respiration

Overview of Kinetics

CP Biology Unit 5 Cell Energy Study Guide. Electron Carriers Electron Transport Chain Fermentation Glycolysis Krebs cycle Light-Dependent Reactions

Modeling (bio-)chemical reaction networks

Be sure to understand:

AP Biology Cellular Respiration

Supplementary Information

Making Sugars. Carbon Dioxide. Properties of Carbon Dioxide

I. Flow of Energy in Living Things II. Laws of Thermodynamics & Free Energy III. Activation Energy IV. Enzymes V. Reaction Coupling VI.

Biological Pathways Representation by Petri Nets and extension

2015 Biology Unit #3 Quiz 1 Photosynthesis, Cellular Respiration and Fermentation Week of November

Integrated Knowledge-based Reverse Engineering of Metabolic Pathways

SABIO-RK Integration and Curation of Reaction Kinetics Data Ulrike Wittig

The Life of a Cell. The Chemistry of Life. A View of the Cell. Cellular Transport and the Cell Cycle. Energy in a Cell

Photosynthesis and Cellular Respiration

LIGHT DEPENDENT & INDEPENDENT REACTIONS

Using Atom Mapping Rules for an Improved Detection of Relevant Routes in Weighted Metabolic Networks. TORSTEN BLUM and OLIVER KOHLBACHER ABSTRACT

Chapter 8 Photosynthesis

f) Adding an enzyme does not change the Gibbs free energy. It only increases the rate of the reaction by lowering the activation energy.

BIOCHEMISTRY. František Vácha. JKU, Linz.

56:198:582 Biological Networks Lecture 9

Ch. 6 & 7 Photosynthesis & Cellular Respiration

Cellular Energetics Review

Harvesting energy: photosynthesis & cellular respiration part 1

BIOLOGICAL SCIENCE. Lecture Presentation by Cindy S. Malone, PhD, California State University Northridge. FIFTH EDITION Freeman Quillin Allison

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics

Cellular Energy. How Organisms Obtain Energy Section 2: Photosynthesis Section 3: Cellular Respiration. Click on a lesson name to select.

Assimilation of Carbon in Plants

V19 Metabolic Networks - Overview

Network Biology: Understanding the cell s functional organization. Albert-László Barabási Zoltán N. Oltvai

2 4 Chemical Reactions and Enzymes Chemical Reactions

Name Date Class. Photosynthesis and Respiration

Biology Chapter 8 Test: Cellular Energy

Energy and Cellular Metabolism

6.207/14.15: Networks Lecture 12: Generalized Random Graphs

MITOCW watch?v=ojvz7pvvz-o

Unit 5 Cellular Energy

PHOTOSYNTHESIS. Chapter 8

Chapter 8: Energy and Metabolism

Chemistry 5.07SC Biological Chemistry I Fall Semester, 2013

Chapter 6: Energy Flow in the Life of a Cell

Chapter 8: An Introduction to Metabolism

Chapter 8: An Introduction to Metabolism

8.2 Photosynthesis Draw and label a diagram showing the structure of a chloroplast as seen in electron micrographs

7.32/7.81J/8.591J: Systems Biology. Fall Exam #1

QUALITATIVE PATH ANALYSIS OF METABOLIC PATHWAYS USING PETRI NETS FOR GENERIC MODELLING

Study Guide: Fall Final Exam H O N O R S B I O L O G Y : U N I T S 1-5

Photosynthesis and Cellular Respiration Unit

Transcription:

Representing metabolic networks Given a reconstructed network, how should we represent it? For computational and statistical analyses, we need to be exact, much more so than when communicating between humans Everything relevant should be included What is relevant depends on the questions that we want to solve (picture:e.coli glycolysis, EMP database, www.empproject.com/) etabolic Modelling Spring 2007 Juho Rousu 1

Levels of abstraction Graph representations: Connectivity of reactions/metabolites, structure of the metabolic network Stoichiometric (reaction equation) representation: capabilities of the network, flow analysis, steady-state analyses Kinetic models: dynamic behaviour under changing conditions etabolic Modelling Spring 2007 Juho Rousu 2

Representing metabolic networks as graphs For structural analysis of metabolic networks, the most frequently encountered representations are: Enzyme interaction network Reaction graph Substrate graph We will also look briefly at Atom-level representations Feasible pathways etabolic Modelling Spring 2007 Juho Rousu 3

Example reaction List Let us consider the following example list of reactions, implementing a part of pentose-phosphate pathway of E. Coli: R 1 : β-d-glucose 6-phosphate (βg6p) + NADP + zwf 6-phosphoglucono-lactone (6PGL) + NADPH R 2 : 6-phosphoglucono δ-lactone + H 2 O pgl 6-phosphogluconate (6PG) R 3 : 6-phosphogluconate + 1 NADP + gnd ribulose 5-phosphate (R5P) + NADPH R 4 : ribulose 5-phosphate (R5P) rpe xylulose 5-phosphate (X5P) R 5 : α-d-glucose 6-phosphate(αG6P) gpi βg6p R 6 : α-d-glucose 6-phosphate(αG6P) gpi β-d-fructose-6-phosphate (βf6p) R 7 : β-d-glucose 6-phosphate(αG6P) gpi βf6p etabolic Modelling Spring 2007 Juho Rousu 4

Enzyme interaction network (1/2) Enzymes as nodes Link between two enzymes if they catalyze reactions that have common metabolites A special kind of protein-protein interaction network gpi pgl zwf rpe gnd etabolic Modelling Spring 2007 Juho Rousu 5

Enzyme interaction network (2/2) R 1 : βg6p + NADP + zwf 6PGL + NADPH R 2 : 6PGL + H 2 O pgl 6PG R 3 : 6PG + NADP + gnd R5P + NADPH R 4 : R5P rpe X5P R 5 : αg6p gpi βg6p R 6 : αg6p gpi βf6p R 7 : βg6p gpi βf6p gpi pgl Edge Supporting metabolites zwf (gpi,zwf) βg6p (zwf, pgl) 6PGL (zwf, gnd) NADP, NADPH (pgl,gnd) (gnd, rpe) 6PG R5P rpe gnd etabolic Modelling Spring 2007 Juho Rousu 6

Reaction clumping in Enzyme networks As each enzyme is represented once in the network reactions catalyzed by the same enzyme will be clumped together: For example, an alcohol dehydrogenase enzyme (ADH) catalyzes a large group of reactions of the template: an alcohol + NAD + <=> an aldehyde or ketone + NADH + H+ Mandelonitrile lyase catalyzes a single reaction: Mandelonitrile <=> Cyanide + Benzaldehyde The interaction between the two is very specific, only via benzaldehyde, but this is not deducible from the enzyme network alone etabolic Modelling Spring 2007 Juho Rousu 7

Reaction graph A reaction graph removes the reaction clumping property of enzyme networks. Nodes correspond to reactions A connecting edge between two reaction nodes R 1 and R 2 denotes that they share a metabolite Difference to enzyme networks Each reaction catalyzed by an enzyme as a separate node A reaction is represented once, even if it has multiple catalyzing enzymes etabolic Modelling Spring 2007 Juho Rousu 8

Reaction graph example R 1 : βg6p + NADP + zwf 6PGL + NADPH R 2 : 6PGL + H 2 O pgl 6PG R 3 : 6PG + NADP + gnd R5P + NADPH R 4 : R5P rpe X5P R 5 : αg6p gpi βg6p R 6 : αg6p gpi βf6p R 7 : βg6p gpi βf6p R6 R7 R1 R2 Edge Supporting metabolites (R 6,R 7 ): βf6p (R 6,R 5 ): αg6p (R 5,R 7 ) βg6p (R 7,R 1 ): βg6p (R 1,R 2 ): 6PGL (R 1, R 3 ): NADP, NADPH (R 2,R 3 ): 6PG (R 3, R 4 ): R5P R4 R3 R5 etabolic Modelling Spring 2007 Juho Rousu 9

Substrate graph A dual representation to a reaction graph is a substrate graph. Nodes correspond to metabolites Connecting edge between two metabolites A and B denotes that there is a reaction where both occur as substrates, both occur as products or one as product and the other as substrate A reaction A + B C + D is spread among a set of edges {(A, C), (A, B), (A, D), (B, C), (B, D), (C, D)} etabolic Modelling Spring 2007 Juho Rousu 10

Substrate graph example R 1 : βg6p + NADP + zwf 6PGL + NADPH R 2 : 6PGL + H 2 O pgl 6PG R 3 : 6PG + NADP + gnd R5P + NADPH R 4 : R5P rpe X5P R 5 : αg6p gpi βg6p R 6 : αg6p gpi βf6p R 7 : βg6p gpi βf6p Edge Common reactions (G6P,NADPH) R 1 (G6P,6PGL) R 1 (G6P.NADP + ) R 1 (NADP +,NADPH) R 1,R 3 (NADP +, 6PGL) R 1 G6P NADPH (NADP +, 6PG) R 3 αg6p (NADP +, R5P) R 3... βf6p NADP + H O 2 6PGL X5P R5P 6PG etabolic Modelling Spring 2007 Juho Rousu 11

Graph analyses of metabolism Enzyme interaction networks, reaction graphs and substrate graph can all be analysed in similar graph concepts and algorithms We can compute basic statistics of the graphs: Connectivity of nodes: degree k(v) of node v how many edges are attached to each node. Path length between pairs of nodes Clustering coeefficient: how tightly connected the graph is etabolic Modelling Spring 2007 Juho Rousu 12

Clustering coefficient (1/2) Clustering coefficient C(v) for node v measures to what extent v is within a tight cluster Let G = (V, E) be a graph with nodes V and edges E Let N(v) be the set of nodes adjacent to v The clustering coefficient is the relative number of edges between the nodes in N(v): C(v) = {(v, v ) E v, v N(v)} N max, where N max = max{ N(v) ( N(v) 1)/2, 1} Maximum C(v) = 1 occurs when N(v) is a fully connected graph Clustering coefficient of the whole graph is the node average: C(G) = 1/n v (C(v))) etabolic Modelling Spring 2007 Juho Rousu 13

Clustering coefficient (2/2) N(gpi) = {zwf}, C(gpi) = 0 gpi N(zwf) = {gpi, pgl, gnd}, C(zwf) = {(pgl,gnd)} 3 = 1/3 N(pgl) = {zwf, gnd}, C(pgl) = {(zwf,gnd)} 1 = 1/1 N(gnd) = {rpe, zwf, gnd}, C(gnd) = {(zwf,pgl)} 3 = 1/3 N(rpe) = {gnd}, C(rpe) = 0 C(G) = 1/3 zwf rpe pgl gnd etabolic Modelling Spring 2007 Juho Rousu 14

Comparison to graphs with known generating mechanism One way to analyse our graphs is to compare the above statistics to graphs that we have generated ourself and thus know the generating mechanism. We will use the following comparison: Erdös-Renyi (ER) random graph Small-world graphs with preferential attachment etabolic Modelling Spring 2007 Juho Rousu 15

Erdös-Renyi random graph Generation of ER graph: Start with a network with n nodes and no edges. Draw an edge between each pair of nodes is with probability p. etabolic Modelling Spring 2007 Juho Rousu 16

Preferential attachment Preferential attachment (PA) is a mechanism that is proposed to generate many networks occurring in nature. Start with a small number n 0 of nodes and no edges. Iterate the following: insert a new node v j, draw m n 0 edges from v i existing nodes v i with probability p P k i+1 j (k j+1) When drawing new edges, nodes with many edges already are preferred over nodes with few or no edges. etabolic Modelling Spring 2007 Juho Rousu 17

Degree distributions of ER and PA graphs P(k) - The probability of encountering a node with degree k: a fat tail: high variance and high number of nodes with high degree. Erdös-Renyi random graph: P(k) ( n k) p k (1 p) n k. Distribution tightly peaked around the average degree: low variance. Frequency of nodes with very high degree is low. Preferential attachment: P(k) k γ. Frequency distribution is scale-free: log P(k) and log k are linearly correlated. Distribution has etabolic Modelling Spring 2007 Juho Rousu 18

Degree distributions in metabolism (Wagner & Fell, 2000) Degree distributions of substrate and reaction graphs etabolic Modelling Spring 2007 Juho Rousu 19

Degree distributions in metabolism Substrate graph shows a fat-tailed distribution, consistent with a network generated via preferential attachment. etabolic Modelling Spring 2007 Juho Rousu 20

Small-world graphs Graphs fulfilling the following two criteria are called small-world graphs Small average shortest path length between two nodes, the same level as ER graphs, lower than many regular graphs: shortcuts accross the graphs go via hubs High clustering coefficient compared to ER graph: the neighbors of nodes are more often linked than in ER graphs. Graphs generated with preferential attachment are small-world graphs. However, small-world graphs can be generated with other mechanisms as well. etabolic Modelling Spring 2007 Juho Rousu 21

Metabolic graphs as small worlds Path lengths in reaction and substrate graphs are about the same as Erdös-Renyi random graph with the same average connectivity Clustering coefficients are much larger than in ER graphs Reaction and substrate graphs resemble small-world graphs etabolic Modelling Spring 2007 Juho Rousu 22

Pitfalls in substrate graph analysis: co-factors Short paths in substrate graphs may not be biologically relevant Shortest paths between metabolites in otherwise distant parts of metabolism tend to go through co-factor metabolites (NADP, NAPH, ATP, ADP). However, transfer of atoms occurs only between the co-factors Quick remedy used in most studies: remove co-factors from the graph but sometimes it is difficult to decide which ones should be removed and which ones to leave. etabolic Modelling Spring 2007 Juho Rousu 23

Atom-level representation Better solution is to trace the atoms accross pathways An acceptable path needs to involve transfer of atoms from source to target. This paradigm is used by Arita in his ARM software (www.metabolome.jp) Spurious pathways caused by the co-factor problem are filtered out etabolic Modelling Spring 2007 Juho Rousu 24

Pitfalls in substrate graph analysis: self-suffiency The shortest path may not correlate well with the effort that the cell needs to make the conversion The conversions require other metabolites to be produced than the ones along the direct path. Arguably a feasible pathway should be self-sufficiently capable of performing the conversion from sources to target metabolites etabolic Modelling Spring 2007 Juho Rousu 25

Feasible pathway vs. shortest simple path Feasible pathway contains the yellow reactions r2, r3, r6 and r7 Shortest simple path has length 2, corresponding to the simple path through r3 and r7 etabolic Modelling Spring 2007 Juho Rousu 26

Feasible pathway vs. shortest simple path Simple path length distribution shows the small-world property: most paths are short 6000 5000 Shortest-path length d s Upper bound for metabolic distance d m Production distance d p Feasible pathway size (in the figure: green) shows no small world property: many conversions between two metabolites that involve a large number of enzymes Frequency 4000 3000 2000 1000 0 0 10 20 30 40 50 60 70 80 Distance etabolic Modelling Spring 2007 Juho Rousu 27

Robustness & small world property It has been claimed that the small-world property gives metabolic networks robustness towards random mutations. 1 0.9 metabolites (as easily). Simple paths Feasible metabolisms As evidence the conservation of short pathways under random gene deletions has been offered However, the smallest feasible pathways are not as robust, showing that even random mutations can quickly damage the cells capability to make conversions between Ratio of conserved pathways 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 200 400 600 800 1000 Number of deleted reactions etabolic Modelling Spring 2007 Juho Rousu 28