Stoichiometric network analysis

Similar documents
Stoichiometric vector and matrix

Representing metabolic networks

Metabolic modelling. Metabolic networks, reconstruction and analysis. Esa Pitkänen Computational Methods for Systems Biology 1 December 2009

Description of the algorithm for computing elementary flux modes

Predicting rice (Oryza sativa) metabolism

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

Grundlagen der Systembiologie und der Modellierung epigenetischer Prozesse

BASIC NOTIONS. x + y = 1 3, 3x 5y + z = A + 3B,C + 2D, DC are not defined. A + C =

Chapter 7: Metabolic Networks

Jim Lambers MAT 610 Summer Session Lecture 1 Notes

V19 Metabolic Networks - Overview

MAT 242 CHAPTER 4: SUBSPACES OF R n

Systems II. Metabolic Cycles

Carbon labeling for Metabolic Flux Analysis. Nicholas Wayne Henderson Computational and Applied Mathematics. Abstract

Equivalence of metabolite fragments and flow analysis of isotopomer distributions for flux estimation

Chapter 2: Matrix Algebra

V15 Flux Balance Analysis Extreme Pathways

Review of Linear Algebra

Linear Algebra Review

Linear Algebra March 16, 2019

1300 Linear Algebra and Vector Geometry Week 2: Jan , Gauss-Jordan, homogeneous matrices, intro matrix arithmetic

EIGENVALUES AND EIGENVECTORS 3

Flux Balance Analysis

Knowledge Discovery and Data Mining 1 (VO) ( )

LINEAR ALGEBRA: THEORY. Version: August 12,

Hypergraphs, Metabolic Networks, Bioreaction Systems. G. Bastin

Lecture 7. Econ August 18

V14 extreme pathways

Linear Algebra. Session 8

Chap 3. Linear Algebra

ECON 186 Class Notes: Linear Algebra

Chapter 3. Loop and Cut-set Analysis

Introduction to Bioinformatics

Structural Analysis of Expanding Metabolic Networks

Foundations of Matrix Analysis

Chapter 5: Integer Compositions and Partitions and Set Partitions

Abstract Vector Spaces and Concrete Examples

Vector Spaces. 9.1 Opening Remarks. Week Solvable or not solvable, that s the question. View at edx. Consider the picture

Stoichiometric analysis of metabolic networks

LINEAR ALGEBRA BOOT CAMP WEEK 1: THE BASICS

Notes on the Matrix-Tree theorem and Cayley s tree enumerator

Chapter 1: Linear Equations

Engineering Model Reduction and Entropy-based Lyapunov Functions in Chemical Reaction Kinetics

Algebraic Methods in Combinatorics

Linear Algebra II. 2 Matrices. Notes 2 21st October Matrix algebra

3. Vector spaces 3.1 Linear dependence and independence 3.2 Basis and dimension. 5. Extreme points and basic feasible solutions

10. Smooth Varieties. 82 Andreas Gathmann

Notes taken by Graham Taylor. January 22, 2005

Math 3108: Linear Algebra

Solution to Homework 8, Math 2568

International Journal of Scientific & Engineering Research, Volume 6, Issue 2, February ISSN

4.6 Bases and Dimension

Codes for Partially Stuck-at Memory Cells

Matrices. In this chapter: matrices, determinants. inverse matrix

The set of all solutions to the homogeneous equation Ax = 0 is a subspace of R n if A is m n.

October 25, 2013 INNER PRODUCT SPACES

b 1 b 2.. b = b m A = [a 1,a 2,...,a n ] where a 1,j a 2,j a j = a m,j Let A R m n and x 1 x 2 x = x n

The Distribution of Optimal Strategies in Symmetric Zero-sum Games

Chapter 3 - From Gaussian Elimination to LU Factorization

FUNDAMENTALS of SYSTEMS BIOLOGY From Synthetic Circuits to Whole-cell Models

What is A + B? What is A B? What is AB? What is BA? What is A 2? and B = QUESTION 2. What is the reduced row echelon matrix of A =

1300 Linear Algebra and Vector Geometry

Metabolic Network Analysis

Introduction on metabolism & refresher in enzymology

Mobile Robotics 1. A Compact Course on Linear Algebra. Giorgio Grisetti

Math 54 HW 4 solutions

Biological Pathways Representation by Petri Nets and extension

Matrices and Vectors

Lecture 1: Review of linear algebra

BioControl - Week 6, Lecture 1

Selected exercises from Abstract Algebra by Dummit and Foote (3rd edition).

Chapter 3. Directions: For questions 1-11 mark each statement True or False. Justify each answer.

Cellular Metabolic Models

Algebraic methods for the study of biochemical reaction networks

Support Vector Machine Classification via Parameterless Robust Linear Programming

Chapter 5: Integer Compositions and Partitions and Set Partitions

22A-2 SUMMER 2014 LECTURE Agenda

EXERCISE SET 5.1. = (kx + kx + k, ky + ky + k ) = (kx + kx + 1, ky + ky + 1) = ((k + )x + 1, (k + )y + 1)

ECON0702: Mathematical Methods in Economics

On Two Class-Constrained Versions of the Multiple Knapsack Problem

All of my class notes can be found at

Math 322. Spring 2015 Review Problems for Midterm 2

ECON2285: Mathematical Economics

10. Linear Systems of ODEs, Matrix multiplication, superposition principle (parts of sections )

A guideline to model reduction by stoichiometric decomposition for biochemical network analysis

Linear Algebra. The analysis of many models in the social sciences reduces to the study of systems of equations.

Definition 2.3. We define addition and multiplication of matrices as follows.

(a). W contains the zero vector in R n. (b). W is closed under addition. (c). W is closed under scalar multiplication.

Chapter 1: Linear Equations

Dot Products, Transposes, and Orthogonal Projections

Linear Algebra Review

UNIT 1 DETERMINANTS 1.0 INTRODUCTION 1.1 OBJECTIVES. Structure

MATH 2030: ASSIGNMENT 4 SOLUTIONS

OPTIMALITY AND STABILITY OF SYMMETRIC EVOLUTIONARY GAMES WITH APPLICATIONS IN GENETIC SELECTION. (Communicated by Yang Kuang)

Math Models of OR: Extreme Points and Basic Feasible Solutions

AFFINE AND PROJECTIVE GEOMETRY, E. Rosado & S.L. Rueda 4. BASES AND DIMENSION

2. Every linear system with the same number of equations as unknowns has a unique solution.

Introduction to Mobile Robotics Compact Course on Linear Algebra. Wolfram Burgard, Cyrill Stachniss, Kai Arras, Maren Bennewitz

Flux balance analysis in metabolic networks

1 Matrices and Systems of Linear Equations

Transcription:

Stoichiometric network analysis In stoichiometric analysis of metabolic networks, one concerns the effect of the network structure on the behaviour and capabilities of metabolism. Questions that can be tackled include: Discovery of pathways that carry a distinct biological function (e.g. glycolysis) from the network, discovery of dead ends and futile cycles, dependent subsets of enzymes Identification of optimal and suboptimal operating conditions for an organism Analysis of network flexibility and robustness, e.g. under gene knockouts

Stoichiometric coefficients Soitchiometric coefficients denote the proportion of substrate and product molecules involved in a reaction. For example, for a reaction r : A + B 2C, the stoichiometric coefficients for A, B and C are 1, 1 and 2, respectively. Assignment of the coeefficients is not unique: we could as well choose 1/2, 1/2, 1 as the coefficients However, the relative sizes of the coeefficients remain in any valid choice. Note! We will denote both the name of a metabolite and its concentration by the same symbol.

Reaction rate and concentration vectors Let us assume that our metabolic network has the reactions R = {R 1, R 2,..., R r } Let the reaction R i operate with rate v i We collect the individual reaction rates to a rate vector v = (v 1,..., v r ) T Similarly, the concentration vector X (t) = (X 1 (t),..., X m (t)) T contains the concentration of each metabolite in the system (at time t)

Stoichiometric vector and matrix The stoichiometric coefficients of a reaction are collected to a vector s r In s r there is a one position for each metabolite in the metabolic system The stoichiometric co-efficient of the reaction are inserted to appropriate positions, e.g. for the reaction r : A + B 2C, s r = A B C 0 0 1 0 0 1 0 0 2

Stoichiometric matrix The stoichiometric vectors can be combined into the stoichiometric matrix S. In the matrix S, the is one row for each metabolite M 1, dots, M m and one column for each reaction R 1,..., R r. The coefficients s j along the j th column are the stoichiometric coeefficients of of the reaction j. s 11 s 1j s 1r......... S = s i1 s ij s ir......... s m1 s mj s mr

Systems equations In a network of m metabolites and r reactions, the dynamics of the system are characterized by the systems equations dx i dt = r s ij v j, for i = 1,..., m j=1 X i is the concentration of the ith metabolite v j is the rate of the jth reaction and s ij is the stoichiometric coefficient of ith metabolite in the jth reaction. Intuitively, each system equation states that the rate of change of concentration of a is the sum of metabolite flows to and from the metabolite.

Systems equations in matrix form The systems equation can be expressed in vector form as dx i dt = r s ij v j = Si T v, j=1 where S i contains the stoichiometric coefficients of a single metabolite, that is a row of the stoichiometric matrix All the systems equations of different equations together can then be expressed by a matrix equation Above, the vector dx dt = Sv, ( dx dt = dx1 dt,..., dx ) T n dt collects the rates of concentration changes of all metabolites

Steady state analysis Most applications of stoichiometric matrix assume that the system is in so called steady state In a steady state, the concentrations of metabolites remain constant over time, thus the derivative of the concentration is zero: dx i r = s ij v j = 0, for i = 1,..., n dt j=1 The requires the production to equal consumption of each metabolite, which forces the reaction rates to be invariant over time.

Steady state analysis and fluxes The steady-state reaction rates v j, j = 1,..., r are called the fluxes Note: Biologically, live cells do not exhibit true steady states (unless they are dead) In suitable conditions (e.g. continuous bioreactor cultivations) steady-state can be satisfied approximately. Pseudo-steady state or quasi-steady state are formally correct terms, but rarely used dx i dt = r s ij v j = 0, for i = 1,..., n j=1

Defining the system boundary When analysing a metabolic system we need to consider what to include in our system We have the following choices: 1. Metabolites and reactions internal to the cell (leftmost picture) 2. (1) + exchange reactions transporting matter accross the cell membrane (middle picture) 3. (1) + (2) + Metabolites outside the cell (rightmost picture) (Picture from Palsson: Systems Biology, 2006)

System boundary and the total stoichiometric matrix The placement of the system boundary reflects in the [ ] SII S stoichiometric matrix that will S = IE 0 S EE partition into four blocks: S II : contains the stoichiometric coefficients of internal metabolites w.r.t internal reactions S IE : coefficients of internal metabolites in exchange reactions i.e. reactions transporting metabolites accross the system boundary S EI (= 0) : coefficients of external metabolites w.r.t internal reactions; always identically zero S EE : coefficients of external metabolites w.r.t exchange reactions; this is a diagonal matrix.

Exchange stoichiometrix matrix In most applications handled on this course we will not consider external compounds The (exchange) stoichiometric matrix, containing the internal metabolites and both internal and exchange reactions, will be used Our metabolic system will be then open, containing exhange reactions of type A, and B S = [ S II S IE ]

System boundary and steady state analysis Exchange stoichiometric matrix is used for steady state analysis for a reason: it will not force the external metabolites to satisfy the steady state condition dx i dt = r s ij v j = 0, for i = 1,..., n j=1 Requiring steady state for external metabolites would drive the rates of exchange reactions to zero That is, in steady-state, no transport of substrates into the system or out of the system would be possible!

Internal stoichiometrix matrix The internal stoichiometric matrix, containing only the internal metabolites and internal reactions can be used for analysis of conserved pools in the metabolic system The system is closed with no exchange of material to and from the system S = [ S II ]

System boundary of our example system Our example system is a closed one: we do not have exchange reactions carrying to or from the system. We can change our system to an open one, e..g by introducing a exchange reaction R 8 : αg6p feeding αg6p into the system and another reaction R 9 : X 5P to push X 5P out of the system R 1 : βg6p + NADP + zwf 6PGL + NADPH R 2 : 6PGL + H 2 O pgl 6PG R 3 : 6PG + NADP + gnd R5P + NADPH R 4 : R5P rpe X5P R 5 : αg6p gpi βg6p R 6 : αg6p gpi βf6p R 7 : βg6p gpi βf6p

Example The stoichiometric matrix of our extended example contains two extra columns, corresponding to the exchange reactions R 8 : αg6p and R 9 : X 5P βg6p 1 0 0 0 1 0 1 0 0 αg6p 0 0 0 0 1 1 0 1 0 βf 6P 0 0 0 0 0 1 1 0 0 6PGL 1 1 0 0 0 0 0 0 0 6PG 0 1 1 0 0 0 0 0 0 R5P 0 0 1 1 0 0 0 0 0 X 5P 0 0 0 1 0 0 0 0 1 NADP + 1 0 1 0 0 0 0 0 0 NADPH 1 0 1 0 0 0 0 0 0 H 2 O 0 1 0 0 0 0 0 0 0

Steady state analysis, continued The requirements of non-changing concentrations dx i dt = r s ij v j = 0, for i = 1,..., n j=1 constitute a set of linear equations constraining to the reaction rates v j. We can write this set of linear constraints in matrix form with the help of the stoichiometric matrix S and the reaction rate vector v dx = Sv = 0, dt A reaction rate vector v satisfying the above is called the flux vector.

Null space of the stoichiometrix matrix Any flux vector v that the cell can maintain in a steady-state is a solution to the homogeneous system of equations By definition, the set contains all valid flux vectors Sv = 0 N (S) = {u Su = 0} In linear algebra N (A) is referred to as the null space of the matrix A Studying the null space of the stoichiometric matrix can give us important information about the cell s capabilities

Null space of the stoichiometric matrix The null space N (S) is a linear vector space, so all properties of linear vector spcaes follow, e.g: N (S) contains the zero vector, and closed under linear combination: v 1, v 2 N (S) = α 1 v 1 + αv 2 N (S) The null space has a basis {k 1,..., k q }, a set of q min(n, r) linearly independent vectors, where r is the number of reactions and n is the number of metabolites. The choice of basis is not unique, but the number q of vector it contains is determined by the rank of S.

Null space and feasible steady state rate vectors The kernel K = (k 1,..., k q ) of the stoichiometric matrix formed by the above basis vectors has a row corresponding to each reaction. (Note: the term kernel here has no relation to kernel methods and SVMs) K characterizes the feasible steady state reaction rate vectors: for each feasible flux vector v, there is a vector b R q such that Kb = v In other words, any steady state flux vector is a linear combination b 1 k 1 + + b q k q of the basis vectors of N (S).

Identifying dead ends in metabolism From the matrix K, one can identify reactions that can only have zero rate in a steady state. Such reactions may indicate a dead end: if the reaction is not properly connected the rest of the network, the reaction cannot operate in a steady state Such reactions necessarily have the corresponding row K j identically equal to zero, K j = 0

Proof outline This can be easily proven by contradiction using the the equation Kb = v: Assume reaction R j is constrained to have zero rate in steady state, but assume for some i, k ji 0. Then we can pick the i th basis vector of K as the feasible solution v = k i. Then v j = k ji 0 and the jth reaction has non-zero rate in a steady state.

Enzyme subsets An enzyme subset is a group of enzymes which, in a steady state, must always operate together so that their reaction rates have a fixed ratio. Consider a pair of reactions R 1 and R 2 in the metabolic network that form a linear sequence. 1 1 1 A r 1 B 2 r 2 C 1 1 D E

Enzyme subsets Let B be a metabolite that is an intermediate within the pathway produced by R 1 and consumed by R 2 for which the steady-state assumption holds. Due to the steady state assumption, it must hold true that 1 1 A r 1 B 2 1 D v 1 s i1 + v 2 s i2 = 0 r 2 1 E giving v 2 = v 1 s i1 /s i2. That is, the rates of the two reactions are linearly dependent. 1 C

Enzyme subsets Also other than linear pathways may be force to operate in lock-step. In the figure, R1 and R4 form an enzyme subset, but R2 and R3 are not in that subset. A R 1 B R2 R3 R 4 C D

Identifying enzyme subsets Enzyme subsets are easy to recognize from the matrix K: the rows corresponding to an enzyme subset are scalar multiples of each other. That is, there is a constant α that satisfies K j = αk j where K j denotes the j th row of the kernel matrix K This is again easy to see from the equation Kb = v.

Proof outline Assume that reactions along rows j, j in K correspond to an enzyme subset. Now assume contrary to the claim that the rows are not scalar multiples of each other. Then we can find a pair of columns i, i, where K ji = αk j i and K ji = βk j i and α β. Both columns i, i are feasible flux vectors. By the above, the rates of j and j differ by factor α in the flux vector given by the column i and by factor β in the flux vector given by the column i. Thus the ratio of reaction rates of j, j can vary and the reactions are not force to operate with a fixed ratio, which is a contradiction.

Independent components Finally, the matrix K can be used to discover subnetworks that can work independently from the rest of the metabolism, in a steady state. j 1 Such components are characterized by a block-diagonal K: K ji 0 for a subset of rows (j 1,..., j s ) and a subset of columns (i 1,..., i t ). K = j s 0 0 Given such a block we can change b i1,..., b it freely, and that will only affect v j1,..., v js

Example: Null space of PPP Consider again the set of reactions from the penthose-phospate pathway R 1 : βg6p + NADP + zwf 6PGL + NADPH R 2 : 6PGL + H 2 O pgl 6PG R 3 : 6PG + NADP + gnd R5P + NADPH R 4 : R5P rpe X5P R 5 : αg6p gpi βg6p R 6 : αg6p gpi βf6p R 7 : βg6p gpi βf6p R 8 : αg6p R 9 : X 5P S = βg6p 2 1 0 0 0 1 0 1 0 3 0 αg6p 0 0 0 0 1 1 0 1 0 βf 6P 0 0 0 0 0 1 1 0 0 6PGL 1 1 0 0 0 0 0 0 0 6PG 0 1 1 0 0 0 0 0 0 R5P 0 0 1 1 0 0 0 0 0 X 5P 0 0 0 1 0 0 0 0 1 NADP + 6 1 0 1 0 0 0 0 0 0 7 NADPH 4 1 0 1 0 0 0 0 0 0 5 H 2 O 0 1 0 0 0 0 0 0 0

Null space of PPP Null space of this system has only one vector K = (0, 0, 0, 0, 0.5774, 0.5774, 0.5774, 0, 0, 0) T Thus, in a steady state only reactions R 5, R 6 and R 7 can have non-zero fluxes. The reason for this is that there are no producers of NADP + or H 2 O and no consumers of NADPH. Thus our PPP is effectively now a dead end! R 1 : βg6p + NADP + zwf 6PGL + NADPH R 2 : 6PGL + H 2 O pgl 6PG R 3 : 6PG + NADP + gnd R5P + NADPH R 4 : R5P rpe X5P R 5 : αg6p gpi βg6p R 6 : αg6p gpi βf6p R 7 : βg6p gpi βf6p R 8 : αg6p R 9 : X 5P

Null space of PPP To give our PPP non-trivial (fluxes different from zero) steady states, we need to modify our system We add reaction R 10 : H 2 O as a water source We add reaction R 11 : NADPH NADP + to regenerate NADP + from NADPH. We could also have removed the metabolites in question to get the same effect R 1 : βg6p + NADP + zwf 6PGL + NADPH R 2 : 6PGL + H 2 O pgl 6PG R 3 : 6PG + NADP + gnd R5P + NADPH R 4 : R5P rpe X5P R 5 : αg6p gpi βg6p R 6 : αg6p gpi βf6p R 7 : βg6p gpi βf6p R 8 : αg6p R 9 : X 5P R 10 : H 2 O R 11 : NADPH NADP +

Enzyme subsets of PPP From the kernel, we can immediately identify enzyme subsets that operate with fixed flux ratios in any steady state: reactions {R 1 R 4, R 8 R 11 } are one subset: R 11 has double rate to all the others {R 6, R 7 } are another: R 6 has the opposite sign of R 7 R 5 does not belong to non-trivial enzyme subsets, so it is not forced to operate in lock-step with other reactions 0.2727 0.1066 0.2727 0.1066 0.2727 0.1066 0.2727 0.1066 0.3920 0.4667 K = 0.1193 0.5733 0.1193 0.5733 0.2727 0.1066 0.2727 0.1066 0.2727 0.1066 0.5454 0.2132