1 / 51 Modelos Formales en Bioinformática Javier Campos, Jorge Júlvez University of Zaragoza
2 / 51 Outline 1 Systems Biology 2 Population Dynamics Example 3 Formal Models 4 Stochasticity
3 / 51 Outline 1 Systems Biology 2 Population Dynamics Example 3 Formal Models 4 Stochasticity
4 / 51 Formal Models in Bioinformatics What is systems biology? Systems biology is the study of all the elements in a biological system (all genes, mrnas, proteins, etc) and their relationships one to another in response to perturbations. Systems approaches attempt to study the behaviour of all the elements in a system and relate these behaviours to the systems or emergent properties.
5 / 51 Formal Models in Bioinformatics Bioinformatics Basics 3$4$.54*6"7/813"8$/81
6 / 51 Formal Models in Bioinformatics Systems Biology: Interaction in Networks Systems Biology: Interaction in Networks
7 / 51 Formal Models in Bioinformatics What is systems biology?...systematic study of complex interactions in biological systems, thus using a new perspective (integration instead of reduction) to study them... one of the goals of systems biology is to discover new emergent properties (Wikipedia) Systems biology is the study of an organism, viewed as an integrated and interacting network of genes, proteins and biochemical reactions... systems biologists focus on all the components and the interactions among them, all as part of one system (Institute for Systems Biology, Washington) To understand complex biological systems requires the integration of experimental and computational research in other words a systems biology approach (Kitano, 2002)
Systems Biology Population Dynamics Example Formal Models Stochasticity 4)+56*071 4)+56*071 Formal Models in Bioinformatics! :)+"(6'$/1 Networks! 8).)1*)&-'"96.1 Gene regulation )1*)&-'"96.1 1*)&-'"96.1! <*6+)$.=>! <*6+)$.=>*6+)$.1$. 4)+56*071! ;$&."''$.&11 Metabolic Pathway! :)+"(6'$/1 "(6'$/1 (6'$/1!?) Signalling! <*6+)$.=>*6+)$.1$.+)*"/96.1 Protein-protein interaction!"#$!%&$'()*+,(*-.)'%"/%-01 A:B1C.+*61!?)#)'6>!?)#)'6>@).+"'1 8 / 51
9 / 51 Formal Models in Bioinformatics
Formal Models in Bioinformatics Metabolic Pathway 3)+"(4'$/15"+67"891!"#$!%&$'()*+,(*-.)'%"/%-01?3@1A.+*41 6:;<==/"%)>;"98%4*&=+44'9=;"+67"89= 21 10 / 51
11 / 51 ist perspective: Consider a spherical cow with negligible Systems Biology is a multidisciplinar field Which Even is the E. volume coli isof not a cow? that spherical!! A chemist perspective: Dissolve the cow in H 2 SO 4, weight the result and measure the volume. Sometimes it is important not to destroy the cow.
11 / 51 ist perspective: Consider a spherical cow with negligible Systems Biology is a multidisciplinar field Which Even is the E. volume coli isof not a cow? that spherical!! A chemist perspective: Dissolve the cow in H 2 SO 4, weight the result and measure the volume. Sometimes it is important not to destroy the cow. An engineering perspective: Immerse the cow in a tank of water and measure the volume. Some cows cannot swim. Water pressure might change volume.
12 / 51 Systems Biology is a multidisciplinar field physicist perspective: Consider a spherical cow with negligible ass Which is theeven volume E. coli of aiscow? not that spherical!! A mathematician perspective: Cut the cow into pieces and sum up the pieces. The total might not be equal to the sum of its parts emergent properties.
12 / 51 Systems Biology is a multidisciplinar field physicist perspective: Consider a spherical cow with negligible ass Which is theeven volume E. coli of aiscow? not that spherical!! A mathematician perspective: Cut the cow into pieces and sum up the pieces. The total might not be equal to the sum of its parts emergent properties. A physicist perspective: Consider a spherical cow with negligible mass... Even E. coli is not that spherical
13 / 51 Outline 1 Systems Biology 2 Population Dynamics Example 3 Formal Models 4 Stochasticity
14 / 51 Systems Biology Population Dynamics Example Formal Models Stochasticity tical immunology delling a population Deterministic birth process Formal Models in Bioinformatics thematics to model the time evolution of a population Mathematics to model the time evolution of a population
15 / 51 Systems Biology Population Dynamics Example Formal Models Stochasticity Theoretical immunology Modelling a population First attemt at combining mathematics and biology Formal Models in Bioinformatics From penguins to rabbits Mathematics to model the time evolution of a population From penguins to rabbits..
16 / 51 Modelling the time evolution of a population Early attempts to make use of mathematics in biology Leonardo de Pisa or Fibonacci (1202) At month 0 there is a pair of rabbits (one female and one male). Every pair of rabbits (one female and one male) can mate at the age of one month. The female rabbit always produces a new pair of rabbits (one female and one male) every month from the second month on. As there is no death, all rabbits survive. What is the number of pairs of rabbits in month n?
First attemt at combining mathematics and biology Systems Biology Population Dynamics Example Formal Models Stochasticity 17 / 51 om penguins to rabbits Modelling the time evolution of a population Rabbits population
Modelling the time evolution of a population Leonardo de Pisa or Fibonacci (1202) At month 0 there is a pair of rabbits (one female and one male). Every pair of rabbits (one female and one male) can mate at the age of one month. The female rabbit always produces a new pair of rabbits (one female and one male) every month from the second month on. As there is no death, all rabbits survive. The number of pairs in month n, R n, satisfies: R 0 = 1 R 1 = 1 R 2 = 1 + 1 = 2 R 3 = 2 + 1 = 3 R 4 = 3 + 2 = 5 R n+1 = R n + R n 1 R 5 = 5 + 3 = 8 18 / 51
19 / 51 Modelling the time evolution of a population Deterministic modelling Let N(t) be the population of those penguins at time t. N(t) is the number of individuals in the population at time t. The change in the number of penguins in a small time interval, from t to t + t, is given by: N(t + ) = N(t) + births deaths + migration This equation is a conservation equation for the number of individuals of the population. The form of the various terms on the right-hand-side requires essential feedback from biologists.
20 / 51 Modelling the time evolution of a population Deterministic birth process Let us assume that: There are no death events in the population. There are only birth events in the population. The birth rate (number of births per unit of time), b, is the same for all individuals of the population. We have: N(t + t) = N(t) + births
21 / 51 Modelling the time evolution of a population Deterministic birth process Change in the population is due to birth events. The births in the time interval [t, t + t] due to a single individual is b t.
21 / 51 Modelling the time evolution of a population Deterministic birth process Change in the population is due to birth events. The births in the time interval [t, t + t] due to a single individual is b t. The births in the time interval [t, t + t] due to all individuals is N(t)b t.
21 / 51 Modelling the time evolution of a population Deterministic birth process Change in the population is due to birth events. The births in the time interval [t, t + t] due to a single individual is b t. The births in the time interval [t, t + t] due to all individuals is N(t)b t. N(t + t) N(t) N(t + t) = N(t)+N(t)b t = = bn(t) t
21 / 51 Modelling the time evolution of a population Deterministic birth process Change in the population is due to birth events. The births in the time interval [t, t + t] due to a single individual is b t. The births in the time interval [t, t + t] due to all individuals is N(t)b t. N(t + t) N(t) N(t + t) = N(t)+N(t)b t = = bn(t) t For a very small time interval, t 0, N(t + t) N(t) lim = dn(t) = bn(t) t 0 t dt This equation can be easily solved by integration.
22 / 51 Modelling the time evolution of a population Deterministic birth process N(t): Number of individuals at time t dn(t) = bn(t) dt If the population at time t = t 0 is given by N 0, we have: N(t) = N 0 e b(t t 0) In a deterministic birth process the population size is predicted at time t with absolute certainty, once the initial size N 0 and birth rate b are given. The population size N(t) and time t are both continuous variables (both take real values) and not discrete (take integer values).
22 / 51 Modelling the time evolution of a population Deterministic birth process N(t): Number of individuals at time t dn(t) = bn(t) dt If the population at time t = t 0 is given by N 0, we have: N(t) = N 0 e b(t t 0) In a deterministic birth process the population size is predicted at time t with absolute certainty, once the initial size N 0 and birth rate b are given. The population size N(t) and time t are both continuous variables (both take real values) and not discrete (take integer values). Is this a good mathematical population growth model?
23 / 51 Modelling the time evolution of a population Stochastic birth process Let X t be the dicrete random variable that describes the number of individuals of the population at time t. The stochastic process that describes the population satisfies: X t {1, 2,...} and t [0, + ) Denote by p n (t) the probability that at time t the size of the population is n, i.e., the probability that at time t there are n individuals in the population: p n (t) = Prob(X t = n)
24 / 51 Modelling the time evolution of a population Stochastic birth process Consider a small time interval [t, t + t] How is X t+ t related to X We have the following rules: There are no death events in the population. There are birth events in the population: the probability that a birth takes place in t is b t. The probability of more than one birth in a time interval t is negligible (no twin births allowed).
a population Systems Biology Population Dynamics Example Formal Models Stochasticity tic birth process tic Modelling birth process the time II evolution of a population Stochastic birth process 1 2 3 n 1 n n +1 The probability that a population of size n 1 increases to The probability n in thethat timeainterval population (t, t + of t) sizeis n(n 1 1)b t. increases to n in the time interval (t, Thet + probability t) is b that t a population (n 1). of size n increases to The probability n + 1 inthat the time a population interval (t, oft size + t) n increases is nb t. to n + 1 in the time interval (t, If at t + time t) t the is b population t n. has n individuals, the probability that no birth event takes place in the time interval If at time t the population has n individuals, the probability that no birth (t, t + t) is 1 nb t. event takes place in the time interval (t, t + t) is 1 b t n. Evolution equation for p n (t): 25 / 51
a population Systems Biology Population Dynamics Example Formal Models Stochasticity tic birth process tic Modelling birth process the time II evolution of a population Stochastic birth process 1 2 3 n 1 n n +1 The probability that a population of size n 1 increases to The probability n in thethat timeainterval population (t, t + of t) sizeis n(n 1 1)b t. increases to n in the time interval (t, Thet + probability t) is b that t a population (n 1). of size n increases to The probability n + 1 inthat the time a population interval (t, oft size + t) n increases is nb t. to n + 1 in the time interval (t, If at t + time t) t the is b population t n. has n individuals, the probability that no birth event takes place in the time interval If at time t the population has n individuals, the probability that no birth (t, t + t) is 1 nb t. event takes Evolution place in equation the time for interval p (t, t + t) is 1 b t n. n (t): Evolution equation for p n (t): p n (t + t) = (n 1)b tp n 1 (t) + (1 nb t)p n (t) 25 / 51
26 / 51 Modelling the time evolution of a population Theoretical immunology Immunology T cell T cell
Theoretical immunology Immunology Modelling the time evolution of a population Cell division Cell division Birth Birth event event: At time t there are n cells. At time t there are n cells. During the time interval t there is a single birth event. During the time interval t there is a single birth event. At time t + t there are n + 1 cells. At time t + t there are n + 1 cells. 27 / 51
Theoretical immunology Immunology Modelling the time evolution of a population Cell death Cell death Death event: Death event At time t there are n cells. At time t there are n cells. During the time interval t there is a single death event. During the time interval t there is a single death event. At time t + t there are n 1 cells. At time t + t there are n 1 cells. 28 / 51
29 / 51 Theoretical immunology Immunology Modelling the time evolution of a population Cell-cell interactions lead to events Cell-cell interactions lead to events
30 / 51 Theoretical immunology Immunology Modelling the time evolution of a population T cell and tumour cell T cell and Tumour cell
31 / 51 Outline 1 Systems Biology 2 Population Dynamics Example 3 Formal Models 4 Stochasticity
32 / 51 Modelling in systems biology!"#$%&#'()*+*,"'' Modelling: Design and construction of models of existing biological! -*.%++)/,0'.%#),/'1/.'2*/#$3425*/'*6'&*.%+#'*6'%7)#5/,' systems, which explain observed properties and 8)*+*,)21+'#"#$%	':;)2;'%7<+1)/'*8#%3=%.'<3*<%35%#'1/.' predict the <3%.)2$'$;%'3%#<*/#%'$*'%7<%3)&%/$1+')/$%3=%/5*/#' response to experimental interventions. wetlab experiments natural biosystem wetlab experiments observed behaviour predicted behaviour formalizing understanding model (knowledge) model-based experiment design.1=).>,)+8%3$?834/%+>12>4@' (-C'D/$3*' AB'
<&3(,%$($./+0".&")&+&("#$%&"7$*&0($:&& 33 / 51 Modelling in systems biology Model:!"#$%& Formal representation of the real world. Simplified abstract view of the complex reality $,*$-$./+0".&")&/1$&*$+%&2"*%#& $#&+5-/*+6/&73$2&")&/1$&6"(,%$8&*$+%3/9:&
34 / 51 Modelling in systems biology Why model? A model can generate new insights A model can make testable predictions E.g., predict the effect of drugs on an organism E.g., predict the effect on an inhibitor on a pathway A model can test conditions that may be difficult to study in the laboratory A model can rule out particular explanations for an experimental observation A model can help you identify what s right and wrong with your hypotheses
35 / 51 Modelling in systems biology Textbook view of the cell and reality!"#$%&&'()*"+(&,($-"(."//(012(3"0/*$4 Three D EM image of a pancreatic Beta cell Campbell, Reece & Mitchell (1998) Biology, 5 th Edition
36 / 51 Modelling in systems biology In silico humans - spatial & temporal scales In silico humans-spatial & temporal scales 1 m person 1 mm electrical length scale of cardiac tissue 1 mm cardiac sarcomere spacing 1 nm pore diameter in a membrane protein Range = 10 9 10 9 s (70 yrs) human lifetime 10 6 s (10 days) protein turnover 10 3 s (1 hour) digest food 1 s heart beat 1 ms ion channel HH gating 1 ms Brownian motion Range = 10 15 Requires a hierarchy of inter-related models gene reg. networks pathway models stochastic models ODEs PDEs (continuum models)
37 / 51 Modelling in systems biology Levels of abstraction!"#"$%&'(&)*%+,-./'0&!"#$%&#' +%./0%#' ()*$#' +,-' Adrianantoandro et al. Mol Sys Bio 2006 3-#43564$*",+7*,80"$5-.589& :;<&=0+,'& 12&
38 / 51 Modelling in systems biology Models must be validated by experimental data: Simulations must be accurate representations of the real world.!"#$%&'!"#$%&',-).%-/%#' 01*%-#$21*314' ()*%+' 53)+)4362+' #"#$%&' 92+3*2/)1' 5%:293);-'.-%*36/)1' 71)81'.-).%-/%#' ;171)81'.-).%-/%#' ()*%+',-).%-/%#'
39 / 51 Formal Models in Bioinformatics A Framework for Modelling Define all the components of the system Systematically perturb and monitor components of the system Reconcile the experimentally observed responses with those predicted by the model Design and perform new perturbation experiments to distinguish between multiple or competing model hypotheses. (Ideker, Galitski & Hood, 2001)
40 / 51 Modelling in systems biology Models should be:!"#$%& Readable. Unambiguous. Analysable. Executable. 0".&")&/1$&*$+%&2"*%#& /&73$2&")&/1$&6"(,%$8&*$+%3/9:&
40 / 51 Modelling in systems biology Models should be:!"#$%& Readable. Unambiguous. Analysable. Executable. 0".&")&/1$&*$+%&2"*%#& Suggestions: Occam s razor: Don t overcomplicate things. /&73$2&")&/1$&6"(,%$8&*$+%3/9:& Einstein: Everything should be made as simple as possible, but not simpler.
41 / 51 Modelling in systems biology Modelling Regimes Randomness State Space Deterministic Stochastic Discrete Molecular Dynamics Stochastic Simulation Algorithms Continuous Ordinary Differential Equation Stochastic Differential Equation
42 / 51 Modelling in systems biology Modelling Regimes Discrete and stochastic: Small numbers of molecules. Exact description via Stochastic Simulation Algorithm (SSA) - Gillespie. Large computational time.
42 / 51 Modelling in systems biology Modelling Regimes Discrete and stochastic: Small numbers of molecules. Exact description via Stochastic Simulation Algorithm (SSA) - Gillespie. Large computational time. Continuous and stochastic: A bridge connecting discrete and continuous models. Described by SDEs Chemical Langevin Equation.
42 / 51 Modelling in systems biology Modelling Regimes Discrete and stochastic: Small numbers of molecules. Exact description via Stochastic Simulation Algorithm (SSA) - Gillespie. Large computational time. Continuous and stochastic: A bridge connecting discrete and continuous models. Described by SDEs Chemical Langevin Equation. Continuous and deterministic: Law of Mass Action. The Reaction Rate equations. Described by ordinary differential equations. Not valid if molecular populations of some critical reactant species are small.
43 / 51 Outline 1 Systems Biology 2 Population Dynamics Example 3 Formal Models 4 Stochasticity
44 / 51 Modelling in systems biology Biological evidence of noise Stochasticity is evident in all biological processes the proliferation of both noise and noise reduction systems is a hallmark of organismal evolution Federoff et al.(2002). Transcription in higher eukaryotes occurs with a relatively low frequency in biologic time and is regulated in a probabilistic manner Hume (2000). Gene regulation is a noisy business Mcadams et al. (1999). Initiation of gene transcription is a discrete process in which individual protein-coding genes in an off state can be stochastically switched on, resulting in sporadic pulses of mrna production Sano 2001. It is essential to study individual cells and to measure the cell to cell variations in biological response, rather than averaging over cell populations Zatorsky, Rosenfeld et al. 2006.
45 / 51 Modelling in systems biology Origin of Stochasticity Intrinsic noise due to small numbers of molecules (e.g. mrna, DNA loci, TFs). Uncertainty of knowing when a reaction occurs and which reaction it is. Relative statistical uncertainty is inversely proportional to the square root of the number of molecules. Applies equally well to studying channel behaviour via the concept of channel molecules. Extrinsic noise due to (external) environmental effects (extrinsic factors are: stage in cell cycle, number of RNAP or ribosomes, cellular environment).
Modelling formalisms Markov chains Based on the concept of state of the system Solution techniques: Enumerative Transient and steady-state analysis Exact and approximate analysis Drawbacks: Low abstraction level Model size equals number of states of the system Only in very particular cases aggregation techniques exist 46 / 51
47 / 51 Modelling formalisms Queueing networks High abstraction level The number of states characterizing the system grows exponentially on the model size. Solution techniques: Enumerative (based on Markov chains) Reduction/transformation-based Structurally based (product-form solution, exact) Transient and steady-state analysis Exact, approximate and bounds Drawbacks: Lack of synchronization primitive Extensions exist but destroying analysis possibilities
Modelling formalisms Stochastic Petri nets Abstraction level similar to queueing networks With synchronization primitive SPN =Petri nets+ timing interpretation=queueing networks+ synchronizations Wide range of qualitative (logical properties) analysis techniques: Enumerative (based on Markov chains) Reduction/transformation-based Structurally based Petri nets as a formal modelling paradigm a conceptual framework to obtain specific formalisms based on common concepts and principles at different life-cycle phases 48 / 51
49 / 51 Modelling formalisms Stochastic Petri nets (cont.) Analysis techniques: Exact: mainly enumerative (based on Markov chains) Bounding techniques (structurally based) Approximation techniques (reduction/transformation) Drawbacks: Lack of a product-form solution for efficient exact analysis in most cases
50 / 51 Contents of the Course 1 Discrete and Continuous Markov chains 2 Birth and Death Processes 3 Stochastic Simulation 4 Hidden Markov Chains 5 Stochastic Petri nets Course info: http://webdiis.unizar.es/asignaturas/spn/
Acknowledgments Much of the material in the course is based on the following courses: Deterministic models in mathematical biology, Magic 042 - Lecture 1 Carmen Molina-París, Department of Applied Mathematics, School of Mathematics, University of Leeds Una Introducción a la Biología de Sistemas Raúl Guantes (UAM), Juan F. Poyatos (CNB) A conceptual framework for BioModel Engineering (Systems Biology, Synthetic Biology) Rainer Breitling, Groningen, NL; David Gilbert, Brunel, UK; Monika Heiner, Cottbus, DE A Petri Net Perspective on Systems and Synthetic Biology Monika Heiner, Brandenburg University of Technology Cottbus, DE Dept. of CS Systems Biology: Stochastic models and Simulation Kevin Burrage, Institute for Molecular Bioscience, The University of Queensland, Australia. 51 / 51