SPA for quantitative analysis: Lecture 6 Modelling Biological Processes

Similar documents
Modelling Biochemical Pathways with Stochastic Process Algebra

Models and Languages for Computational Systems Biology Lecture 1

Some investigations concerning the CTMC and the ODE model derived from Bio-PEPA

Modeling biological systems with delays in Bio-PEPA

Modelling Biological Compartments in Bio-PEPA

Modelling biological compartments in Bio-PEPA

Biological Pathways Representation by Petri Nets and extension

Cells in silico: a Holistic Approach

UNIVERSITÀ DEGLI STUDI DI TRENTO

Modelling protein trafficking: progress and challenges

Varieties of Stochastic Calculi

86 Part 4 SUMMARY INTRODUCTION

Population models from PEPA descriptions

38050 Povo Trento (Italy), Via Sommarive 14 CAUSAL P-CALCULUS FOR BIOCHEMICAL MODELLING

A compartmental model of the camp/pka/mapk pathway in Bio-PEPA

Qualitative and Quantitative Analysis of a Bio-PEPA Model of the Gp130/JAK/STAT Signalling Pathway

Towards a framework for multi-level modelling in Computational Biology

Cellular Systems Biology or Biological Network Analysis

Formal Methods and Systems Biology: The Calculus of Looping Sequences

Regulation and signaling. Overview. Control of gene expression. Cells need to regulate the amounts of different proteins they express, depending on

Simulation of Chemical Reactions

This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and

Chapter 6- An Introduction to Metabolism*

Introduction Probabilistic Programming ProPPA Inference Results Conclusions. Embedding Machine Learning in Stochastic Process Algebra.

Stochastic Simulation of Biochemical Reactions

Where Membrane Meet Complexes

Course plan Academic Year Qualification MSc on Bioinformatics for Health Sciences. Subject name: Computational Systems Biology Code: 30180

Deducing Interactions in Partially Unspecified Biological Systems

Stochastic Process Algebra models of a Circadian Clock

Investigating modularity in the analysis of process algebra models of biochemical systems

Lectures on Medical Biophysics Department of Biophysics, Medical Faculty, Masaryk University in Brno. Biocybernetics

Chapter 15 Active Reading Guide Regulation of Gene Expression

Compare and contrast the cellular structures and degrees of complexity of prokaryotic and eukaryotic organisms.

From cell biology to Petri nets. Rainer Breitling, Groningen, NL David Gilbert, London, UK Monika Heiner, Cottbus, DE

Unravelling the biochemical reaction kinetics from time-series data

An Introduction to Metabolism

Introduction to Bioinformatics

FUNDAMENTALS of SYSTEMS BIOLOGY From Synthetic Circuits to Whole-cell Models

Program for the rest of the course

Written Exam 15 December Course name: Introduction to Systems Biology Course no

Prokaryotic Gene Expression (Learning Objectives)

Multiple Choice Review- Eukaryotic Gene Expression

Lecture 8: Temporal programs and the global structure of transcription networks. Chap 5 of Alon. 5.1 Introduction

An introduction to SYSTEMS BIOLOGY

Process Calculi for Biological Processes

Basic Synthetic Biology circuits

Unit 3: Control and regulation Higher Biology

Basic modeling approaches for biological systems. Mahesh Bule

Regulation of metabolism

V19 Metabolic Networks - Overview

L3.1: Circuits: Introduction to Transcription Networks. Cellular Design Principles Prof. Jenna Rickus

CHAPTER : Prokaryotic Genetics

Topic 4 - #14 The Lactose Operon

Name: SBI 4U. Gene Expression Quiz. Overall Expectation:

Grundlagen der Systembiologie und der Modellierung epigenetischer Prozesse

Lecture: Computational Systems Biology Universität des Saarlandes, SS Introduction. Dr. Jürgen Pahle

Petri net models. tokens placed on places define the state of the Petri net

FROM ODES TO LANGUAGE-BASED, EXECUTABLE MODELS OF BIOLOGICAL SYSTEMS

A Structured Approach Part IV. Model Checking in Systems and Synthetic Biology

Introduction to Bioinformatics

natural development from this collection of knowledge: it is more reliable to predict the property

RESEARCH PROPOSAL MODELING A CELLULAR TRANSMEMBRANE SIGNALING SYSTEM THROUGH INTERACTION WITH G-PROTEINS BY USING CONCURRENT

Semantics and Verification

Activation of a receptor. Assembly of the complex

V14 extreme pathways

Introduction. Gene expression is the combined process of :

Modeling Biological Systems in Stochastic Concurrent Constraint Programming

Modeling Biological Systems in Stochastic Concurrent Constraint Programming

Gene Network Science Diagrammatic Cell Language and Visual Cell

Hybrid Model of gene regulatory networks, the case of the lac-operon

Computational Modelling in Systems and Synthetic Biology

Simulation of Kohn s Molecular Interaction Maps Through Translation into Stochastic CLS+

Name Period The Control of Gene Expression in Prokaryotes Notes

Formal Modeling of Biological Systems with Delays

Plant Molecular and Cellular Biology Lecture 8: Mechanisms of Cell Cycle Control and DNA Synthesis Gary Peter

Map of AP-Aligned Bio-Rad Kits with Learning Objectives

From Petri Nets to Differential Equations An Integrative Approach for Biochemical Network Analysis

Big Idea 3: Living systems store, retrieve, transmit and respond to information essential to life processes. Tuesday, December 27, 16

BIOLOGY 10/11/2014. An Introduction to Metabolism. Outline. Overview: The Energy of Life

Biology Assessment. Eligible Texas Essential Knowledge and Skills

STAAR Biology Assessment

Sig2GRN: A Software Tool Linking Signaling Pathway with Gene Regulatory Network for Dynamic Simulation

REGULATION OF GENE EXPRESSION. Bacterial Genetics Lac and Trp Operon

COMPUTER SIMULATION OF DIFFERENTIAL KINETICS OF MAPK ACTIVATION UPON EGF RECEPTOR OVEREXPRESSION

Modelling in Systems Biology

Stochastic Simulation of Biological Systems with Dynamical Compartments

Deducing Interactions in Partially Unspecified Biological Systems

Lecture 4: Transcription networks basic concepts

Enzyme Enzymes are proteins that act as biological catalysts. Enzymes accelerate, or catalyze, chemical reactions. The molecules at the beginning of

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016

BioControl - Week 6, Lecture 1

2. The study of is the study of behavior (capture, storage, usage) of energy in living systems.

DISI - University of Trento Optimizing the Execution of Biological Models

biological networks Claudine Chaouiya SBML Extention L3F meeting August

9/25/2011. Outline. Overview: The Energy of Life. I. Forms of Energy II. Laws of Thermodynamics III. Energy and metabolism IV. ATP V.

Organization of Genes Differs in Prokaryotic and Eukaryotic DNA Chapter 10 p

Stochastic Simulation.

MITOCW enzyme_kinetics

The Calculus of Looping Sequences

Understanding Science Through the Lens of Computation. Richard M. Karp Nov. 3, 2007

Transcription:

1/ 223 SPA for quantitative analysis: Lecture 6 Modelling Biological Processes Jane Hillston LFCS, School of Informatics The University of Edinburgh Scotland 7th March 2013

Outline 2/ 223 1 Introduction Some Biological Background 2 Process Calculi for Systems Biology 3 Bio-PEPA: Syntax 4 Formal Semantics 5 Bio-PEPA Analysis 6 Formal Mappings 7 Example

Introduction 3/ 223 Outline 1 Introduction Some Biological Background 2 Process Calculi for Systems Biology 3 Bio-PEPA: Syntax 4 Formal Semantics 5 Bio-PEPA Analysis 6 Formal Mappings 7 Example

Introduction 4/ 223 Systems Biology Biological advances mean that much more is now known about the components of cells and the interactions between them. Systems biology aims to develop a better understanding of the processes involved. Formalisms from theoretical computer science have found a new role in developing models for systems biology, allowing biologists to test hypotheses and prioritise experiments.

Introduction 5/ 223 Systems Biology Biological advances mean that much more is now known about the components of cells and the interactions between them. Systems biology aims to develop a better understanding of the processes involved. Formalisms from theoretical computer science have found a new role in developing models for systems biology, allowing biologists to test hypotheses and prioritise experiments.

Introduction 6/ 223 Systems Biology Biological advances mean that much more is now known about the components of cells and the interactions between them. Systems biology aims to develop a better understanding of the processes involved. Formalisms from theoretical computer science have found a new role in developing models for systems biology, allowing biologists to test hypotheses and prioritise experiments.

Introduction 7/ 223 Systems Biology Methodology Natural System Measurement Observation Biological Phenomena Explanation Interpretation Induction Modelling Systems Analysis Deduction Inference Formal System

Introduction 8/ 223 Systems Biology Methodology Natural System Measurement Observation Biological Phenomena Explanation Interpretation Induction Modelling Systems Analysis Deduction Inference Formal System

Introduction 9/ 223 Systems Biology Methodology Natural System Measurement Observation Biological Phenomena Explanation Interpretation Induction Modelling Systems Analysis Deduction Inference Formal System

Introduction 10/ 223 Systems Biology Methodology Natural System Measurement Observation Biological Phenomena Explanation Interpretation Induction Modelling Systems Analysis Deduction Inference Formal System

Introduction 11/ 223 Systems Biology Methodology Natural System Measurement Observation Biological Phenomena Explanation Interpretation Induction Modelling Systems Analysis Deduction Inference Formal System

Introduction 12/ 223 Systems Biology Methodology Natural System Measurement Observation Biological Phenomena Explanation Interpretation Induction Modelling Systems Analysis Deduction Inference Formal System

Introduction 13/ 223 Systems Biology Methodology Natural System Measurement Observation Biological Phenomena Explanation Interpretation Induction Modelling Systems Analysis Deduction Inference Formal System

Introduction 14/ 223 Systems Biology Methodology Natural System Measurement Observation Biological Phenomena Explanation Interpretation Induction Modelling Systems Analysis Deduction Inference Formal System

Introduction 15/ 223 Systems Biology Methodology Natural System Measurement Observation Biological Phenomena Explanation Interpretation Induction Modelling Systems Analysis Deduction Inference Formal System

Introduction 16/ 223 Systems Biology Methodology Natural System Measurement Observation Biological Phenomena Explanation Interpretation Induction Modelling Systems Analysis Deduction Inference Formal System

Introduction 17/ 223 Systems Biology Methodology Natural System Measurement Observation Biological Phenomena Explanation Interpretation Induction Modelling Systems Analysis Deduction Inference Formal System

Introduction Some Biological Background 18/ 223 Networks in cells We can distinguish three distinct types of links or networks in cells Gene networks: Genes control the production of proteins but are themselves regulated by the same or different proteins. Signal transduction networks: External stimuli initiate messages that are carried through a cell via a cascade of biochemical reactions. Metabolic pathways: The survival of the cell depends on its ability to transform nutrients into energy.

Introduction Some Biological Background 19/ 223 Networks in cells We can distinguish three distinct types of links or networks in cells Gene networks: Genes control the production of proteins but are themselves regulated by the same or different proteins. Signal transduction networks: External stimuli initiate messages that are carried through a cell via a cascade of biochemical reactions. Metabolic pathways: The survival of the cell depends on its ability to transform nutrients into energy.

Introduction Some Biological Background 20/ 223 Networks in cells We can distinguish three distinct types of links or networks in cells Gene networks: Genes control the production of proteins but are themselves regulated by the same or different proteins. Signal transduction networks: External stimuli initiate messages that are carried through a cell via a cascade of biochemical reactions. Metabolic pathways: The survival of the cell depends on its ability to transform nutrients into energy.

Introduction Some Biological Background 21/ 223 Networks in cells We can distinguish three distinct types of links or networks in cells Gene networks: Genes control the production of proteins but are themselves regulated by the same or different proteins. Signal transduction networks: External stimuli initiate messages that are carried through a cell via a cascade of biochemical reactions. Metabolic pathways: The survival of the cell depends on its ability to transform nutrients into energy.

Introduction Some Biological Background 22/ 223 Extracellular signalling Extracellular signalling communication between cells. Signalling molecules released by one cell migrate to another; These molecules enter the cell and instigate a pathway, or series of reactions, which carries the information from the membrane to the nucleus; For example, the Ras/Raf-1/MEK/ERK pathway conveys differentiation signals to the nucleus of a cell. Signal Raf MEK ERK Activated ERK enters nucleus Specific gene activation cellular response to signal

Introduction Some Biological Background 23/ 223 Cell signalling All signalling is biochemical: Increasing protein concentration broadcasts the information about an event; for example, that a gene promoter is on. The message is received by a concentration dependent response at the protein signal s site of action. This stimulates a response at the signalling protein s site of action. Signals propagate through a series of protein accumulations.

Introduction Some Biological Background 24/ 223 Cell signalling All signalling is biochemical: Increasing protein concentration broadcasts the information about an event; for example, that a gene promoter is on. The message is received by a concentration dependent response at the protein signal s site of action. This stimulates a response at the signalling protein s site of action. Signals propagate through a series of protein accumulations.

Introduction Some Biological Background 25/ 223 Cell signalling All signalling is biochemical: Increasing protein concentration broadcasts the information about an event; for example, that a gene promoter is on. The message is received by a concentration dependent response at the protein signal s site of action. This stimulates a response at the signalling protein s site of action. Signals propagate through a series of protein accumulations.

Introduction Some Biological Background 26/ 223 Cell signalling All signalling is biochemical: Increasing protein concentration broadcasts the information about an event; for example, that a gene promoter is on. The message is received by a concentration dependent response at the protein signal s site of action. This stimulates a response at the signalling protein s site of action. Signals propagate through a series of protein accumulations.

Introduction Some Biological Background 27/ 223 Cell signalling All signalling is biochemical: Increasing protein concentration broadcasts the information about an event; for example, that a gene promoter is on. The message is received by a concentration dependent response at the protein signal s site of action. This stimulates a response at the signalling protein s site of action. Signals propagate through a series of protein accumulations.

Introduction Some Biological Background 28/ 223 Signal transduction pathways A series of biochemical reactions serve to pass a message from the cell membrane to the nucleus.

Introduction Some Biological Background 29/ 223 Gene expression pathways Genetic activity is controlled by molecular signals that determine when and how often a given gene is transcribed. The product encoded by one gene often regulates the expression of other genes. For appropriate combinations of input signals transcription is initiated and protein product accumulates when production exceeds degradation. Links are established between genes when the product of one regulates the expression of another. Thus networks of interaction can be deduced and these may be quite complex.

Introduction Some Biological Background 30/ 223 Gene expression pathways Genetic activity is controlled by molecular signals that determine when and how often a given gene is transcribed. The product encoded by one gene often regulates the expression of other genes. For appropriate combinations of input signals transcription is initiated and protein product accumulates when production exceeds degradation. Links are established between genes when the product of one regulates the expression of another. Thus networks of interaction can be deduced and these may be quite complex.

Introduction Some Biological Background 31/ 223 Gene expression pathways Genetic activity is controlled by molecular signals that determine when and how often a given gene is transcribed. The product encoded by one gene often regulates the expression of other genes. For appropriate combinations of input signals transcription is initiated and protein product accumulates when production exceeds degradation. Links are established between genes when the product of one regulates the expression of another. Thus networks of interaction can be deduced and these may be quite complex.

Introduction Some Biological Background 32/ 223 Gene expression pathways Genetic activity is controlled by molecular signals that determine when and how often a given gene is transcribed. The product encoded by one gene often regulates the expression of other genes. For appropriate combinations of input signals transcription is initiated and protein product accumulates when production exceeds degradation. Links are established between genes when the product of one regulates the expression of another. Thus networks of interaction can be deduced and these may be quite complex.

Introduction Some Biological Background 33/ 223 Gene expression pathways Genetic activity is controlled by molecular signals that determine when and how often a given gene is transcribed. The product encoded by one gene often regulates the expression of other genes. For appropriate combinations of input signals transcription is initiated and protein product accumulates when production exceeds degradation. Links are established between genes when the product of one regulates the expression of another. Thus networks of interaction can be deduced and these may be quite complex.

Introduction Some Biological Background 34/ 223 Dynamic issues In biochemical regulatory networks, the delay between events are determined by the delay while signal molecule concentrations accumulate or decline sufficiently. All reactions rely on the random process of molecules colliding with the cell. Thus the accumulation of protein is a stochastic process affected by several factors in the cell (temperature, ph, etc.). Thus the reaction time is a distribution rather than a deterministic time, and this can be shown to be an exponential distribution for basis molecular reactions.

Introduction Some Biological Background 35/ 223 Dynamic issues In biochemical regulatory networks, the delay between events are determined by the delay while signal molecule concentrations accumulate or decline sufficiently. All reactions rely on the random process of molecules colliding with the cell. Thus the accumulation of protein is a stochastic process affected by several factors in the cell (temperature, ph, etc.). Thus the reaction time is a distribution rather than a deterministic time, and this can be shown to be an exponential distribution for basis molecular reactions.

Introduction Some Biological Background 36/ 223 Dynamic issues In biochemical regulatory networks, the delay between events are determined by the delay while signal molecule concentrations accumulate or decline sufficiently. All reactions rely on the random process of molecules colliding with the cell. Thus the accumulation of protein is a stochastic process affected by several factors in the cell (temperature, ph, etc.). Thus the reaction time is a distribution rather than a deterministic time, and this can be shown to be an exponential distribution for basis molecular reactions.

Introduction Some Biological Background 37/ 223 Dynamic issues In biochemical regulatory networks, the delay between events are determined by the delay while signal molecule concentrations accumulate or decline sufficiently. All reactions rely on the random process of molecules colliding with the cell. Thus the accumulation of protein is a stochastic process affected by several factors in the cell (temperature, ph, etc.). Thus the reaction time is a distribution rather than a deterministic time, and this can be shown to be an exponential distribution for basis molecular reactions.

Introduction Some Biological Background 38/ 223 Stochastic behaviour The stochastic reaction rate of a chemical reaction is a function of only those molecular species involved as reactants or catalysts, and a stochastic rate constant c. The stochastic rate constant takes into account volume, temperature, ph and other environmental factors. The stoichiometry of the reaction how many molecules of each reactant species are required also has an impact. Commonly the law of mass action is used to determine the effective rate of a reaction: this is the product of the stochastic rate constant and the amount of each of the reactants.

Introduction Some Biological Background 39/ 223 Using Stochastic Process Algebras Process algebras have several attractive features which can be useful for modelling and understanding biological systems: Process algebraic formulations are compositional and make interactions/constraints explicit. Structure can also be apparent. Equivalence relations allow formal comparison of high-level descriptions. There are well-established techniques for reasoning about the behaviours and properties of models, supported by software. These include qualitative and quantitative analysis, and model checking.

Introduction Some Biological Background 40/ 223 Using Stochastic Process Algebras Process algebras have several attractive features which can be useful for modelling and understanding biological systems: Process algebraic formulations are compositional and make interactions/constraints explicit. Structure can also be apparent. Equivalence relations allow formal comparison of high-level descriptions. There are well-established techniques for reasoning about the behaviours and properties of models, supported by software. These include qualitative and quantitative analysis, and model checking.

Introduction Some Biological Background 41/ 223 Using Stochastic Process Algebras Process algebras have several attractive features which can be useful for modelling and understanding biological systems: Process algebraic formulations are compositional and make interactions/constraints explicit. Structure can also be apparent. Equivalence relations allow formal comparison of high-level descriptions. There are well-established techniques for reasoning about the behaviours and properties of models, supported by software. These include qualitative and quantitative analysis, and model checking.

Introduction Some Biological Background 42/ 223 Using Stochastic Process Algebras Process algebras have several attractive features which can be useful for modelling and understanding biological systems: Process algebraic formulations are compositional and make interactions/constraints explicit. Structure can also be apparent. Equivalence relations allow formal comparison of high-level descriptions. There are well-established techniques for reasoning about the behaviours and properties of models, supported by software. These include qualitative and quantitative analysis, and model checking.

Introduction Some Biological Background 43/ 223 Using Stochastic Process Algebras Process algebras have several attractive features which can be useful for modelling and understanding biological systems: Process algebraic formulations are compositional and make interactions/constraints explicit. Structure can also be apparent. Equivalence relations allow formal comparison of high-level descriptions. There are well-established techniques for reasoning about the behaviours and properties of models, supported by software. These include qualitative and quantitative analysis, and model checking.

Introduction Some Biological Background 44/ 223 Molecular processes as concurrent computations A correspondence between cellular processes and process alebras was first highlighted by Regev and her co-authors in the early 2000s. Concurrency Concurrent computational processes Molecular Biology Molecules Synchronous communication Molecular interaction Transition or mobility Biochemical modification or relocation Metabolism Enzymes and metabolites Binding and catalysis Metabolite synthesis Signal Transduction Interacting proteins Binding and catalysis Protein binding, modification or sequestration [Regev et al 2000]

Introduction Some Biological Background 45/ 223 Molecular processes as concurrent computations A correspondence between cellular processes and process alebras was first highlighted by Regev and her co-authors in the early 2000s. Concurrency Concurrent computational processes Molecular Biology Molecules Synchronous communication Molecular interaction Transition or mobility Biochemical modification or relocation Metabolism Enzymes and metabolites Binding and catalysis Metabolite synthesis Signal Transduction Interacting proteins Binding and catalysis Protein binding, modification or sequestration [Regev et al 2000]

Introduction Some Biological Background 46/ 223 Calculi for Systems Biology Several different process caluli have been developed or adapted for application in systems biology. Each of them has different properties able to render different aspects of biological phenomena. They may be divided into two main categories: Calculi defined originally in computer science and then applied in biology, such as the biochemical stochastic π-calculus, SCCP, CCS-R and PEPA; Calculi defined specifically by observing biological structures and phenomena, such as BioAmbients, Brane Calculi, Beta-binders, BlenX and Bio-PEPA.

Introduction Some Biological Background 47/ 223 Calculi for Systems Biology Several different process caluli have been developed or adapted for application in systems biology. Each of them has different properties able to render different aspects of biological phenomena. They may be divided into two main categories: Calculi defined originally in computer science and then applied in biology, such as the biochemical stochastic π-calculus, SCCP, CCS-R and PEPA; Calculi defined specifically by observing biological structures and phenomena, such as BioAmbients, Brane Calculi, Beta-binders, BlenX and Bio-PEPA.

Introduction Some Biological Background 48/ 223 Calculi for Systems Biology Several different process caluli have been developed or adapted for application in systems biology. Each of them has different properties able to render different aspects of biological phenomena. They may be divided into two main categories: Calculi defined originally in computer science and then applied in biology, such as the biochemical stochastic π-calculus, SCCP, CCS-R and PEPA; Calculi defined specifically by observing biological structures and phenomena, such as BioAmbients, Brane Calculi, Beta-binders, BlenX and Bio-PEPA.

Introduction Some Biological Background 49/ 223 Calculi for Systems Biology Several different process caluli have been developed or adapted for application in systems biology. Each of them has different properties able to render different aspects of biological phenomena. They may be divided into two main categories: Calculi defined originally in computer science and then applied in biology, such as the biochemical stochastic π-calculus, SCCP, CCS-R and PEPA; Calculi defined specifically by observing biological structures and phenomena, such as BioAmbients, Brane Calculi, Beta-binders, BlenX and Bio-PEPA.

Introduction Some Biological Background 50/ 223 Stochastic π-calculus The stochastic π-calculus has been used to model and analyse a wide variety of biological systems. Examples include metabolic pathways, gene transcription and signal transduction. Analysis is mainly based on stochastic simulation (Gillespie s algorithm). Two tools: BioSPI and SPIM which implement slightly different versions of the language. There has also been some work on a graphical notation associated with the SPIM tool.

Introduction Some Biological Background 51/ 223 Stochastic π-calculus The stochastic π-calculus has been used to model and analyse a wide variety of biological systems. Examples include metabolic pathways, gene transcription and signal transduction. Analysis is mainly based on stochastic simulation (Gillespie s algorithm). Two tools: BioSPI and SPIM which implement slightly different versions of the language. There has also been some work on a graphical notation associated with the SPIM tool.

Introduction Some Biological Background 52/ 223 Stochastic π-calculus The stochastic π-calculus has been used to model and analyse a wide variety of biological systems. Examples include metabolic pathways, gene transcription and signal transduction. Analysis is mainly based on stochastic simulation (Gillespie s algorithm). Two tools: BioSPI and SPIM which implement slightly different versions of the language. There has also been some work on a graphical notation associated with the SPIM tool.

Introduction Some Biological Background 53/ 223 Stochastic π-calculus The stochastic π-calculus has been used to model and analyse a wide variety of biological systems. Examples include metabolic pathways, gene transcription and signal transduction. Analysis is mainly based on stochastic simulation (Gillespie s algorithm). Two tools: BioSPI and SPIM which implement slightly different versions of the language. There has also been some work on a graphical notation associated with the SPIM tool.

Introduction Some Biological Background 54/ 223 Stochastic π-calculus The stochastic π-calculus has been used to model and analyse a wide variety of biological systems. Examples include metabolic pathways, gene transcription and signal transduction. Analysis is mainly based on stochastic simulation (Gillespie s algorithm). Two tools: BioSPI and SPIM which implement slightly different versions of the language. There has also been some work on a graphical notation associated with the SPIM tool.

Introduction Some Biological Background 55/ 223 Stochastic π-calculus Stochastic π-calculus [Priami, 1995] extends the π-calculus with exponentially-distributed rates.

Introduction Some Biological Background 56/ 223 Stochastic π-calculus Stochastic π-calculus [Priami, 1995] extends the π-calculus with exponentially-distributed rates. 0 Nil

Introduction Some Biological Background 57/ 223 Stochastic π-calculus Stochastic π-calculus [Priami, 1995] extends the π-calculus with exponentially-distributed rates. 0 Nil (π, r).p Prefix

Introduction Some Biological Background 58/ 223 Stochastic π-calculus Stochastic π-calculus [Priami, 1995] extends the π-calculus with exponentially-distributed rates. 0 Nil (π, r).p Prefix (νn)p New

Introduction Some Biological Background 59/ 223 Stochastic π-calculus Stochastic π-calculus [Priami, 1995] extends the π-calculus with exponentially-distributed rates. 0 Nil (π, r).p Prefix (νn)p New [x = y]p Matching

Introduction Some Biological Background 60/ 223 Stochastic π-calculus Stochastic π-calculus [Priami, 1995] extends the π-calculus with exponentially-distributed rates. 0 Nil (π, r).p Prefix (νn)p New [x = y]p Matching P 1 P 2 Parallel

Introduction Some Biological Background 61/ 223 Stochastic π-calculus Stochastic π-calculus [Priami, 1995] extends the π-calculus with exponentially-distributed rates. 0 Nil (π, r).p Prefix (νn)p New [x = y]p Matching P 1 P 2 Parallel P 1 + P 2 Choice

Introduction Some Biological Background 62/ 223 Stochastic π-calculus Stochastic π-calculus [Priami, 1995] extends the π-calculus with exponentially-distributed rates. 0 Nil (π, r).p Prefix (νn)p New [x = y]p Matching P 1 P 2 Parallel P 1 + P 2 Choice!P Replication

Introduction Some Biological Background 63/ 223 Stochastic π-calculus Stochastic π-calculus [Priami, 1995] extends the π-calculus with exponentially-distributed rates. 0 Nil (π, r).p Prefix (νn)p New [x = y]p Matching P 1 P 2 Parallel P 1 + P 2 Choice!P Replication where π is either x(y) (input), xy (output ) or τ (silent).

Introduction Some Biological Background 64/ 223 Example: The VICE project The aim was to construct a minimal cell in silico in order to track the dynamics of a complete metabolome. Thus a VIrtual CEll was defined as a stochastic π-calculus model, which seems to behave as a simplified prokaryote. Started from a published minimal gene set which eliminated duplicated genes and other redundancies from the smallest known bacterial genomes. This was further reduced to 180 different genes. Experimental results were in accordance with those available from in vivo experiments. Some extensions to the stochastic π-calculus were needed.

Introduction Some Biological Background 65/ 223 Example: The VICE project The aim was to construct a minimal cell in silico in order to track the dynamics of a complete metabolome. Thus a VIrtual CEll was defined as a stochastic π-calculus model, which seems to behave as a simplified prokaryote. Started from a published minimal gene set which eliminated duplicated genes and other redundancies from the smallest known bacterial genomes. This was further reduced to 180 different genes. Experimental results were in accordance with those available from in vivo experiments. Some extensions to the stochastic π-calculus were needed.

Introduction Some Biological Background 66/ 223 Example: The VICE project The aim was to construct a minimal cell in silico in order to track the dynamics of a complete metabolome. Thus a VIrtual CEll was defined as a stochastic π-calculus model, which seems to behave as a simplified prokaryote. Started from a published minimal gene set which eliminated duplicated genes and other redundancies from the smallest known bacterial genomes. This was further reduced to 180 different genes. Experimental results were in accordance with those available from in vivo experiments. Some extensions to the stochastic π-calculus were needed.

Introduction Some Biological Background 67/ 223 Example: The VICE project The aim was to construct a minimal cell in silico in order to track the dynamics of a complete metabolome. Thus a VIrtual CEll was defined as a stochastic π-calculus model, which seems to behave as a simplified prokaryote. Started from a published minimal gene set which eliminated duplicated genes and other redundancies from the smallest known bacterial genomes. This was further reduced to 180 different genes. Experimental results were in accordance with those available from in vivo experiments. Some extensions to the stochastic π-calculus were needed.

Introduction Some Biological Background 68/ 223 Example: The VICE project The aim was to construct a minimal cell in silico in order to track the dynamics of a complete metabolome. Thus a VIrtual CEll was defined as a stochastic π-calculus model, which seems to behave as a simplified prokaryote. Started from a published minimal gene set which eliminated duplicated genes and other redundancies from the smallest known bacterial genomes. This was further reduced to 180 different genes. Experimental results were in accordance with those available from in vivo experiments. Some extensions to the stochastic π-calculus were needed.

Process Calculi for Systems Biology 69/ 223 Outline 1 Introduction Some Biological Background 2 Process Calculi for Systems Biology 3 Bio-PEPA: Syntax 4 Formal Semantics 5 Bio-PEPA Analysis 6 Formal Mappings 7 Example

Process Calculi for Systems Biology 70/ 223 Process Algebras for Systems Biology The process algebra of choice for Regev and her co-workers was the stochastic π-calculus. This work was hugely influential and many people followed their lead in applying the stochastic π-calculus to model intracellular processes. However the style of modelling in the π-calculus is not always ideal for representing biochemical processes. Nevertheless a certain flavour of the π-calculus still predominately influences many of the process algebras for systems biology.

Process Calculi for Systems Biology 71/ 223 Process Algebras for Systems Biology The process algebra of choice for Regev and her co-workers was the stochastic π-calculus. This work was hugely influential and many people followed their lead in applying the stochastic π-calculus to model intracellular processes. However the style of modelling in the π-calculus is not always ideal for representing biochemical processes. Nevertheless a certain flavour of the π-calculus still predominately influences many of the process algebras for systems biology.

Process Calculi for Systems Biology 72/ 223 Process Algebras for Systems Biology The process algebra of choice for Regev and her co-workers was the stochastic π-calculus. This work was hugely influential and many people followed their lead in applying the stochastic π-calculus to model intracellular processes. However the style of modelling in the π-calculus is not always ideal for representing biochemical processes. Nevertheless a certain flavour of the π-calculus still predominately influences many of the process algebras for systems biology.

Process Calculi for Systems Biology 73/ 223 Process Algebras for Systems Biology The process algebra of choice for Regev and her co-workers was the stochastic π-calculus. This work was hugely influential and many people followed their lead in applying the stochastic π-calculus to model intracellular processes. However the style of modelling in the π-calculus is not always ideal for representing biochemical processes. Nevertheless a certain flavour of the π-calculus still predominately influences many of the process algebras for systems biology.

Process Calculi for Systems Biology 74/ 223 Issues with the π-calculus The restriction to always consider actions as occurring in conjugate pairs does not always match well with biochemical reactions where you might want more than two molecules to be involved. The molecule-as-processes abstraction can lead to problems pragmatically. By focussing on the individual molecules the calculus forces the modeller into an individuals-based interpretation of the model. This means that simulation is often the only feasible interpretation.

Process Calculi for Systems Biology 75/ 223 Issues with the π-calculus The restriction to always consider actions as occurring in conjugate pairs does not always match well with biochemical reactions where you might want more than two molecules to be involved. The molecule-as-processes abstraction can lead to problems pragmatically. By focussing on the individual molecules the calculus forces the modeller into an individuals-based interpretation of the model. This means that simulation is often the only feasible interpretation.

Process Calculi for Systems Biology 76/ 223 Bio-PEPA: motivations With Bio-PEPA we have been experimented with the more abstract mapping between elements of signalling pathways and process algebra constructs: species as processes. We also wanted to be able to capture more of the biological features expressed in the models such as those found in the BioModels database.

Process Calculi for Systems Biology 77/ 223 Bio-PEPA: motivations With Bio-PEPA we have been experimented with the more abstract mapping between elements of signalling pathways and process algebra constructs: species as processes. We also wanted to be able to capture more of the biological features expressed in the models such as those found in the BioModels database.

Process Calculi for Systems Biology 78/ 223 Motivations for Abstraction Our motivations for seeking more abstraction: Process algebra-based analyses such as comparing models (e.g. for equivalence or simulation) and model checking are only possible if the state space is not prohibitively large. The data that we have available to parameterise models is sometimes speculative rather than precise.

Process Calculi for Systems Biology 79/ 223 Motivations for Abstraction Our motivations for seeking more abstraction: Process algebra-based analyses such as comparing models (e.g. for equivalence or simulation) and model checking are only possible if the state space is not prohibitively large. The data that we have available to parameterise models is sometimes speculative rather than precise.

Process Calculi for Systems Biology 80/ 223 Motivations for Abstraction Our motivations for seeking more abstraction: Process algebra-based analyses such as comparing models (e.g. for equivalence or simulation) and model checking are only possible if the state space is not prohibitively large. The data that we have available to parameterise models is sometimes speculative rather than precise.

Process Calculi for Systems Biology 81/ 223 Motivations for Abstraction Our motivations for seeking more abstraction: Process algebra-based analyses such as comparing models (e.g. for equivalence or simulation) and model checking are only possible if the state space is not prohibitively large. The data that we have available to parameterise models is sometimes speculative rather than precise. This suggests that it can be useful to use semi-quantitative models rather than quantitative ones.

Process Calculi for Systems Biology 82/ 223 Alternative Representations ODEs Abstract SPA model Stochastic Simulation

Process Calculi for Systems Biology 83/ 223 Alternative Representations ODEs population view Abstract SPA model Stochastic Simulation individual view

Process Calculi for Systems Biology 84/ 223 Discretising the population view We can discretise the continuous range of possible concentration values into a number of distinct states. These form the possible states of the component representing the reagent.

Process Calculi for Systems Biology 85/ 223 Alternative Representations Abstract SPA model ODEs CTMC with M levels Stochastic Simulation

Process Calculi for Systems Biology 86/ 223 Alternative Representations Abstract SPA model ODEs population view CTMC with M levels abstract view Stochastic Simulation individual view

Process Calculi for Systems Biology 87/ 223 Alternative models The ODE model can be regarded as an approximation of a CTMC in which the number of molecules is large enough that the randomness averages out and the system is essentially deterministic. The full molecular view can be used for detailed study of the stochastic behaviour, usually via stochastic simulation In models with levels, each level of granularity gives rise to a CTMC, and the behaviour of this sequence of Markov processes converges to the behaviour of the system of ODEs. Some analyses which can be carried out via numerical solution of the CTMC are not readily available from ODEs or stochastic simulation.

Process Calculi for Systems Biology 88/ 223 Alternative models The ODE model can be regarded as an approximation of a CTMC in which the number of molecules is large enough that the randomness averages out and the system is essentially deterministic. The full molecular view can be used for detailed study of the stochastic behaviour, usually via stochastic simulation In models with levels, each level of granularity gives rise to a CTMC, and the behaviour of this sequence of Markov processes converges to the behaviour of the system of ODEs. Some analyses which can be carried out via numerical solution of the CTMC are not readily available from ODEs or stochastic simulation.

Process Calculi for Systems Biology 89/ 223 Alternative models The ODE model can be regarded as an approximation of a CTMC in which the number of molecules is large enough that the randomness averages out and the system is essentially deterministic. The full molecular view can be used for detailed study of the stochastic behaviour, usually via stochastic simulation In models with levels, each level of granularity gives rise to a CTMC, and the behaviour of this sequence of Markov processes converges to the behaviour of the system of ODEs. Some analyses which can be carried out via numerical solution of the CTMC are not readily available from ODEs or stochastic simulation.

Process Calculi for Systems Biology 90/ 223 Alternative models The ODE model can be regarded as an approximation of a CTMC in which the number of molecules is large enough that the randomness averages out and the system is essentially deterministic. The full molecular view can be used for detailed study of the stochastic behaviour, usually via stochastic simulation In models with levels, each level of granularity gives rise to a CTMC, and the behaviour of this sequence of Markov processes converges to the behaviour of the system of ODEs. Some analyses which can be carried out via numerical solution of the CTMC are not readily available from ODEs or stochastic simulation.

Process Calculi for Systems Biology 91/ 223 Modelling biological features There are some features of biochemical reaction systems which are not readily captured by π-calculus-based stochastic process algebras. Particular problems are encountered with: stoichiometry the multiplicity in which an entity participates in a reaction; general kinetic laws although mass action is widely used other kinetics are also commonly employed. multiway reactions although thermodynamic arguments can be made that there are never more than two reagents involved in a reaction, in practice it is often useful to model at a more abstract level.

Process Calculi for Systems Biology 92/ 223 Modelling biological features There are some features of biochemical reaction systems which are not readily captured by π-calculus-based stochastic process algebras. Particular problems are encountered with: stoichiometry the multiplicity in which an entity participates in a reaction; general kinetic laws although mass action is widely used other kinetics are also commonly employed. multiway reactions although thermodynamic arguments can be made that there are never more than two reagents involved in a reaction, in practice it is often useful to model at a more abstract level.

Process Calculi for Systems Biology 93/ 223 Modelling biological features There are some features of biochemical reaction systems which are not readily captured by π-calculus-based stochastic process algebras. Particular problems are encountered with: stoichiometry the multiplicity in which an entity participates in a reaction; general kinetic laws although mass action is widely used other kinetics are also commonly employed. multiway reactions although thermodynamic arguments can be made that there are never more than two reagents involved in a reaction, in practice it is often useful to model at a more abstract level.

Process Calculi for Systems Biology 94/ 223 Modelling biological features There are some features of biochemical reaction systems which are not readily captured by π-calculus-based stochastic process algebras. Particular problems are encountered with: stoichiometry the multiplicity in which an entity participates in a reaction; general kinetic laws although mass action is widely used other kinetics are also commonly employed. multiway reactions although thermodynamic arguments can be made that there are never more than two reagents involved in a reaction, in practice it is often useful to model at a more abstract level.

Process Calculi for Systems Biology 95/ 223 Modelling biological features There are some features of biochemical reaction systems which are not readily captured by π-calculus-based stochastic process algebras. Particular problems are encountered with: stoichiometry the multiplicity in which an entity participates in a reaction; general kinetic laws although mass action is widely used other kinetics are also commonly employed. multiway reactions although thermodynamic arguments can be made that there are never more than two reagents involved in a reaction, in practice it is often useful to model at a more abstract level.

Process Calculi for Systems Biology 96/ 223 Illustration Consider a conversion of a substrate S, with stoichiometry 2, to a product P, under the influence of an enzyme E, i.e. 2 S E P In the stochastic π-calculus (for example) this must modelled as a sequence of unary and binary reactions: S + S 2S 2S + E 2S :E 2S :E P :E P :E P + E

Process Calculi for Systems Biology 97/ 223 Illustration Consider a conversion of a substrate S, with stoichiometry 2, to a product P, under the influence of an enzyme E, i.e. 2 S E P In the stochastic π-calculus (for example) this must modelled as a sequence of unary and binary reactions: S + S 2S 2S + E 2S :E 2S :E P :E P :E P + E

Process Calculi for Systems Biology 98/ 223 Illustration cont. The problems with this are various: Rates must be found for all the intermediate steps. Alternate intermediate states are possible and it may not be known which is the appropriate one. The number of states of the system is significantly increased which has implications for computational efficiency/tractability. The use of multiway synchronisation, and the reagent-centric style of modelling, avoids these problems

Process Calculi for Systems Biology 99/ 223 Illustration cont. The problems with this are various: Rates must be found for all the intermediate steps. Alternate intermediate states are possible and it may not be known which is the appropriate one. The number of states of the system is significantly increased which has implications for computational efficiency/tractability. The use of multiway synchronisation, and the reagent-centric style of modelling, avoids these problems

Process Calculi for Systems Biology 100/ 223 Illustration cont. The problems with this are various: Rates must be found for all the intermediate steps. Alternate intermediate states are possible and it may not be known which is the appropriate one. The number of states of the system is significantly increased which has implications for computational efficiency/tractability. The use of multiway synchronisation, and the reagent-centric style of modelling, avoids these problems

Process Calculi for Systems Biology 101/ 223 Illustration cont. The problems with this are various: Rates must be found for all the intermediate steps. Alternate intermediate states are possible and it may not be known which is the appropriate one. The number of states of the system is significantly increased which has implications for computational efficiency/tractability. The use of multiway synchronisation, and the reagent-centric style of modelling, avoids these problems

Process Calculi for Systems Biology 102/ 223 Illustration cont. The problems with this are various: Rates must be found for all the intermediate steps. Alternate intermediate states are possible and it may not be known which is the appropriate one. The number of states of the system is significantly increased which has implications for computational efficiency/tractability. The use of multiway synchronisation, and the reagent-centric style of modelling, avoids these problems

Process Calculi for Systems Biology 103/ 223 Illustration cont. The reaction 2S E P represents the enzymatic reaction from the substrate S, with stoichiometry 2, to the product P with enzyme E. In Bio-PEPA this is described as: S E P def = (α, 2) S def = (α, 1) E def = (α, 1) P (S(l S0 ) {α} E(l E0)) {α} P(l P0) The dynamics is described by the law f α = v E S2 (K+S 2 ).

Process Calculi for Systems Biology 104/ 223 Illustration cont. The reaction 2S E P represents the enzymatic reaction from the substrate S, with stoichiometry 2, to the product P with enzyme E. In Bio-PEPA this is described as: S E P def = (α, 2) S def = (α, 1) E def = (α, 1) P (S(l S0 ) {α} E(l E0)) {α} P(l P0) The dynamics is described by the law f α = v E S2 (K+S 2 ).

Process Calculi for Systems Biology 105/ 223 Illustration cont. The reaction 2S E P represents the enzymatic reaction from the substrate S, with stoichiometry 2, to the product P with enzyme E. In Bio-PEPA this is described as: S E P def = (α, 2) S def = (α, 1) E def = (α, 1) P (S(l S0 ) {α} E(l E0)) {α} P(l P0) The dynamics is described by the law f α = v E S2 (K+S 2 ).

Process Calculi for Systems Biology 106/ 223 Bio-PEPA In Bio-PEPA: Unique rates are associated with each reaction (action) type, separately from the specification of the logical behaviour. These rates may be specified by functions. The representation of an action within a component (species) records the stoichiometry of that entity with respect to that reaction. The role of the entity is also distinguished. The local states of components are quantitative rather than functional, i.e. distinct states of the species are represented as distinct components, not derivatives of a single component.

Process Calculi for Systems Biology 107/ 223 Bio-PEPA In Bio-PEPA: Unique rates are associated with each reaction (action) type, separately from the specification of the logical behaviour. These rates may be specified by functions. The representation of an action within a component (species) records the stoichiometry of that entity with respect to that reaction. The role of the entity is also distinguished. The local states of components are quantitative rather than functional, i.e. distinct states of the species are represented as distinct components, not derivatives of a single component.

Process Calculi for Systems Biology 108/ 223 Bio-PEPA In Bio-PEPA: Unique rates are associated with each reaction (action) type, separately from the specification of the logical behaviour. These rates may be specified by functions. The representation of an action within a component (species) records the stoichiometry of that entity with respect to that reaction. The role of the entity is also distinguished. The local states of components are quantitative rather than functional, i.e. distinct states of the species are represented as distinct components, not derivatives of a single component.

Process Calculi for Systems Biology 109/ 223 Bio-PEPA In Bio-PEPA: Unique rates are associated with each reaction (action) type, separately from the specification of the logical behaviour. These rates may be specified by functions. The representation of an action within a component (species) records the stoichiometry of that entity with respect to that reaction. The role of the entity is also distinguished. The local states of components are quantitative rather than functional, i.e. distinct states of the species are represented as distinct components, not derivatives of a single component.

Process Calculi for Systems Biology 110/ 223 Bio-PEPA reagent-centric example A c_a b_a ab_c C B c_b A B C def = (ab c, 1) A + (b a, 1) A + (c a, 1) A def = (ab c, 1) B + (b a, 1) B + (c b, 1) B def = (c a, 1) C + (c b, 1) C + (ab c, 1) C ( ) A(l A0 ) B(l B0) {ab c,b a} {ab c,c a,c b} C(l C0)

Process Calculi for Systems Biology 111/ 223 Reagent-centric view Role Impact on reaction rate Impact on reagent Reactant positive impact, decreases level e.g. proportional to current concentration Product no impact, increases level except at saturation Enzyme positive impact, level unchanged e.g. proportional to current concentration Inhibitor negative impact, e.g. inversely proportional to current concentration level unchanged

Bio-PEPA: Syntax 112/ 223 Outline 1 Introduction Some Biological Background 2 Process Calculi for Systems Biology 3 Bio-PEPA: Syntax 4 Formal Semantics 5 Bio-PEPA Analysis 6 Formal Mappings 7 Example

Bio-PEPA: Syntax 113/ 223 The syntax Sequential component (species component) S ::= (α, κ) op S S + S C where op = Model component P ::= P P S(l) L The parameter l is abstract, recording quantitative information about the species. Depending on the interpretation, this quantity may be: number of molecules (SSA), concentration (ODE) or a level within a semi-quantitative model (CTMC).

Bio-PEPA: Syntax 114/ 223 The syntax Sequential component (species component) S ::= (α, κ) op S S + S C where op = Model component P ::= P P S(l) L The parameter l is abstract, recording quantitative information about the species. Depending on the interpretation, this quantity may be: number of molecules (SSA), concentration (ODE) or a level within a semi-quantitative model (CTMC).

Bio-PEPA: Syntax 115/ 223 The syntax Sequential component (species component) S ::= (α, κ) op S S + S C where op = Model component P ::= P P S(l) L The parameter l is abstract, recording quantitative information about the species. Depending on the interpretation, this quantity may be: number of molecules (SSA), concentration (ODE) or a level within a semi-quantitative model (CTMC).

Bio-PEPA: Syntax 116/ 223 The syntax Sequential component (species component) S ::= (α, κ) op S S + S C where op = Model component P ::= P P S(l) L The parameter l is abstract, recording quantitative information about the species. Depending on the interpretation, this quantity may be: number of molecules (SSA), concentration (ODE) or a level within a semi-quantitative model (CTMC).

Bio-PEPA: Syntax 117/ 223 The syntax Sequential component (species component) S ::= (α, κ) op S S + S C where op = Model component P ::= P P S(l) L The parameter l is abstract, recording quantitative information about the species. Depending on the interpretation, this quantity may be: number of molecules (SSA), concentration (ODE) or a level within a semi-quantitative model (CTMC).

Bio-PEPA: Syntax 118/ 223 The syntax Sequential component (species component) S ::= (α, κ) op S S + S C where op = Model component P ::= P P S(l) L The parameter l is abstract, recording quantitative information about the species. Depending on the interpretation, this quantity may be: number of molecules (SSA), concentration (ODE) or a level within a semi-quantitative model (CTMC).

Bio-PEPA: Syntax 119/ 223 The syntax Sequential component (species component) S ::= (α, κ) op S S + S C where op = Model component P ::= P P S(l) L The parameter l is abstract, recording quantitative information about the species. Depending on the interpretation, this quantity may be: number of molecules (SSA), concentration (ODE) or a level within a semi-quantitative model (CTMC).

Bio-PEPA: Syntax 120/ 223 The syntax Sequential component (species component) S ::= (α, κ) op S S + S C where op = Model component P ::= P P S(l) L The parameter l is abstract, recording quantitative information about the species. Depending on the interpretation, this quantity may be: number of molecules (SSA), concentration (ODE) or a level within a semi-quantitative model (CTMC).

Bio-PEPA: Syntax 121/ 223 The syntax Sequential component (species component) S ::= (α, κ) op S S + S C where op = Model component P ::= P P S(l) L The parameter l is abstract, recording quantitative information about the species. Depending on the interpretation, this quantity may be: number of molecules (SSA), concentration (ODE) or a level within a semi-quantitative model (CTMC).

Bio-PEPA: Syntax 122/ 223 The syntax Sequential component (species component) S ::= (α, κ) op S S + S C where op = Model component P ::= P P S(l) L The parameter l is abstract, recording quantitative information about the species. Depending on the interpretation, this quantity may be: number of molecules (SSA), concentration (ODE) or a level within a semi-quantitative model (CTMC).

Bio-PEPA: Syntax 123/ 223 The Bio-PEPA system A Bio-PEPA system P is a 6-tuple V, N, K, F R, Comp, P, where: V is the set of compartments; N is the set of quantities describing each species (step size, number of levels, location,...); K is the set of parameter definitions; F R is the set of functional rate definitions; Comp is the set of definitions of sequential components; P is the model component describing the system.

Formal Semantics 124/ 223 Outline 1 Introduction Some Biological Background 2 Process Calculi for Systems Biology 3 Bio-PEPA: Syntax 4 Formal Semantics 5 Bio-PEPA Analysis 6 Formal Mappings 7 Example

Formal Semantics 125/ 223 Semantics The semantics of Bio-PEPA is given as a small-step operational semantics, intended for deriving the CTMC with levels. We define two relations over the processes: 1 capability relation, that supports the derivation of quantitative information; 2 stochastic relation, that gives the rates associated with each action.

Formal Semantics 126/ 223 Semantics The semantics of Bio-PEPA is given as a small-step operational semantics, intended for deriving the CTMC with levels. We define two relations over the processes: 1 capability relation, that supports the derivation of quantitative information; 2 stochastic relation, that gives the rates associated with each action.

Formal Semantics 127/ 223 Semantics The semantics of Bio-PEPA is given as a small-step operational semantics, intended for deriving the CTMC with levels. We define two relations over the processes: 1 capability relation, that supports the derivation of quantitative information; 2 stochastic relation, that gives the rates associated with each action.

Formal Semantics 128/ 223 Semantics: prefix rules prefixreac ((α, κ) S)(l) (α,[s: (l,κ)]) c S(l κ) κ l N prefixprod prefixmod ((α, κ) S)(l) (α,[s: (l,κ)]) c S(l + κ) 0 l (N κ) ((α, κ) op S)(l) (α,[s:op(l,κ)]) c S(l) 0 < l N

Formal Semantics 129/ 223 Semantics: prefix rules prefixreac ((α, κ) S)(l) (α,[s: (l,κ)]) c S(l κ) κ l N prefixprod prefixmod ((α, κ) S)(l) (α,[s: (l,κ)]) c S(l + κ) 0 l (N κ) ((α, κ) op S)(l) (α,[s:op(l,κ)]) c S(l) 0 < l N

Formal Semantics 130/ 223 Semantics: prefix rules prefixreac ((α, κ) S)(l) (α,[s: (l,κ)]) c S(l κ) κ l N prefixprod prefixmod ((α, κ) S)(l) (α,[s: (l,κ)]) c S(l + κ) 0 l (N κ) ((α, κ) op S)(l) (α,[s:op(l,κ)]) c S(l) 0 < l N with op =,, or

Formal Semantics 131/ 223 Semantics: constant and choice rules Choice1 S 1 (l) (α,v) c S 1(l ) (S 1 + S 2 )(l) (α,v) c S 1(l ) Choice2 Constant S 2 (l) (α,v) c S 2(l ) (S 1 + S 2 )(l) (α,v) c S 2(l ) S(l) (α,s:[op(l,κ))] c S (l ) C(l) (α,c:[op(l,κ))] c S (l ) with C def = S

Formal Semantics 132/ 223 Semantics: constant and choice rules Choice1 S 1 (l) (α,v) c S 1(l ) (S 1 + S 2 )(l) (α,v) c S 1(l ) Choice2 Constant S 2 (l) (α,v) c S 2(l ) (S 1 + S 2 )(l) (α,v) c S 2(l ) S(l) (α,s:[op(l,κ))] c S (l ) C(l) (α,c:[op(l,κ))] c S (l ) with C def = S

Formal Semantics 133/ 223 Semantics: constant and choice rules Choice1 S 1 (l) (α,v) c S 1(l ) (S 1 + S 2 )(l) (α,v) c S 1(l ) Choice2 Constant S 2 (l) (α,v) c S 2(l ) (S 1 + S 2 )(l) (α,v) c S 2(l ) S(l) (α,s:[op(l,κ))] c S (l ) C(l) (α,c:[op(l,κ))] c S (l ) with C def = S

Formal Semantics 134/ 223 Semantics: cooperation rules coop1 coop2 P 1 P 1 P 1 (α,v) c P 1 P (α,v) 2 L c P 1 L P 2 P 2 (α,v) c P 2 P (α,v) 2 L c P 1 L P 2 with α / L with α / L coopfinal P 1 (α,v 1 ) c P 1 P 2 (α,v 2 ) c P 2 P 1 P (α,v 1 ::v 2 ) 2 L c P 1 L P 2 with α L

Formal Semantics 135/ 223 Semantics: cooperation rules coop1 coop2 P 1 P 1 P 1 (α,v) c P 1 P (α,v) 2 L c P 1 L P 2 P 2 (α,v) c P 2 P (α,v) 2 L c P 1 L P 2 with α / L with α / L coopfinal P 1 (α,v 1 ) c P 1 P 2 (α,v 2 ) c P 2 P 1 P (α,v 1 ::v 2 ) 2 L c P 1 L P 2 with α L

Formal Semantics 136/ 223 Semantics: cooperation rules coop1 coop2 P 1 P 1 P 1 (α,v) c P 1 P (α,v) 2 L c P 1 L P 2 P 2 (α,v) c P 2 P (α,v) 2 L c P 1 L P 2 with α / L with α / L coopfinal P 1 (α,v 1 ) c P 1 P 2 (α,v 2 ) c P 2 P 1 P (α,v 1 ::v 2 ) 2 L c P 1 L P 2 with α L

Formal Semantics 137/ 223 Semantics: rates and transition system In order to derive the rates we consider the stochastic relation s P Γ P, with γ Γ := (α, r) and r R +. The relation is defined in terms of the previous one: P (α j,v) c P V, N, K, F R, Comp, P (α j,r αj ) s V, N, K, F R, Comp, P r αj represents the parameter of an exponential distribution and the dynamic behaviour is determined by a race condition. The rate r αj is defined as f αj (V, N, K)/h.

Formal Semantics 138/ 223 Semantics: rates and transition system In order to derive the rates we consider the stochastic relation s P Γ P, with γ Γ := (α, r) and r R +. The relation is defined in terms of the previous one: P (α j,v) c P V, N, K, F R, Comp, P (α j,r αj ) s V, N, K, F R, Comp, P r αj represents the parameter of an exponential distribution and the dynamic behaviour is determined by a race condition. The rate r αj is defined as f αj (V, N, K)/h.

Formal Semantics 139/ 223 Semantics: rates and transition system In order to derive the rates we consider the stochastic relation s P Γ P, with γ Γ := (α, r) and r R +. The relation is defined in terms of the previous one: P (α j,v) c P V, N, K, F R, Comp, P (α j,r αj ) s V, N, K, F R, Comp, P r αj represents the parameter of an exponential distribution and the dynamic behaviour is determined by a race condition. The rate r αj is defined as f αj (V, N, K)/h.

Formal Semantics 140/ 223 Semantics: rates and transition system In order to derive the rates we consider the stochastic relation s P Γ P, with γ Γ := (α, r) and r R +. The relation is defined in terms of the previous one: P (α j,v) c P V, N, K, F R, Comp, P (α j,r αj ) s V, N, K, F R, Comp, P r αj represents the parameter of an exponential distribution and the dynamic behaviour is determined by a race condition. The rate r αj is defined as f αj (V, N, K)/h.

Formal Semantics 141/ 223 Semantics: rates and transition system In order to derive the rates we consider the stochastic relation s P Γ P, with γ Γ := (α, r) and r R +. The relation is defined in terms of the previous one: P (α j,v) c P V, N, K, F R, Comp, P (α j,r αj ) s V, N, K, F R, Comp, P r αj represents the parameter of an exponential distribution and the dynamic behaviour is determined by a race condition. The rate r αj is defined as f αj (V, N, K)/h.

Formal Semantics 142/ 223 Example: Michaelis-Menten The reaction S E P represents the enzymatic reaction from the substrate S to the product P with enzyme E. The dynamics is described by the law v E S (K+S). S E P def = (α, 1) S def = (α, 1) E def = (α, 1) P (S(l S0 ) {α} E(l E0 )) {α} P(l P0)

Formal Semantics 143/ 223 Example: Michaelis-Menten The reaction S E P represents the enzymatic reaction from the substrate S to the product P with enzyme E. The dynamics is described by the law v E S (K+S). S E P def = (α, 1) S def = (α, 1) E def = (α, 1) P (S(l S0 ) {α} E(l E0 )) {α} P(l P0)

Formal Semantics 144/ 223 Example: Michaelis-Menten The reaction S E P represents the enzymatic reaction from the substrate S to the product P with enzyme E. The dynamics is described by the law v E S (K+S). S E P def = (α, 1) S def = (α, 1) E def = (α, 1) P (S(l S0 ) {α} E(l E0 )) {α} P(l P0)

Bio-PEPA Analysis 145/ 223 Outline 1 Introduction Some Biological Background 2 Process Calculi for Systems Biology 3 Bio-PEPA: Syntax 4 Formal Semantics 5 Bio-PEPA Analysis 6 Formal Mappings 7 Example

Bio-PEPA Analysis 146/ 223 Analysis A Bio-PEPA system is a formal, intermediate and compositional representation of the system. From it we can obtain a CTMC (with levels) a ODE system for simulation and other kinds of analysis a Gillespie model for stochastic simulation a PRISM model for model checking Each of these kinds of analysis can be of help for studying different aspects of the biological model. Moreover they can be used in conjunction.

Bio-PEPA Analysis 147/ 223 Analysis A Bio-PEPA system is a formal, intermediate and compositional representation of the system. From it we can obtain a CTMC (with levels) a ODE system for simulation and other kinds of analysis a Gillespie model for stochastic simulation a PRISM model for model checking Each of these kinds of analysis can be of help for studying different aspects of the biological model. Moreover they can be used in conjunction.

Bio-PEPA Analysis 148/ 223 Analysis A Bio-PEPA system is a formal, intermediate and compositional representation of the system. From it we can obtain a CTMC (with levels) a ODE system for simulation and other kinds of analysis a Gillespie model for stochastic simulation a PRISM model for model checking Each of these kinds of analysis can be of help for studying different aspects of the biological model. Moreover they can be used in conjunction.

Bio-PEPA Analysis 149/ 223 Analysis A Bio-PEPA system is a formal, intermediate and compositional representation of the system. From it we can obtain a CTMC (with levels) a ODE system for simulation and other kinds of analysis a Gillespie model for stochastic simulation a PRISM model for model checking Each of these kinds of analysis can be of help for studying different aspects of the biological model. Moreover they can be used in conjunction.

Bio-PEPA Analysis 150/ 223 Analysis A Bio-PEPA system is a formal, intermediate and compositional representation of the system. From it we can obtain a CTMC (with levels) a ODE system for simulation and other kinds of analysis a Gillespie model for stochastic simulation a PRISM model for model checking Each of these kinds of analysis can be of help for studying different aspects of the biological model. Moreover they can be used in conjunction.

Bio-PEPA Analysis 151/ 223 Analysis A Bio-PEPA system is a formal, intermediate and compositional representation of the system. From it we can obtain a CTMC (with levels) a ODE system for simulation and other kinds of analysis a Gillespie model for stochastic simulation a PRISM model for model checking Each of these kinds of analysis can be of help for studying different aspects of the biological model. Moreover they can be used in conjunction.

Bio-PEPA Analysis 152/ 223 The abstraction When modelling in Bio-PEPA we use the following abstraction: Each species i is described by a Bio-PEPA component C i. Each reaction j is associated with an action type α j and its dynamics is described by a specific function f αj. Given a reaction j, all the species/components cooperate on the action type α j and consequently, reactants decrease their levels, while products increase them. The species components are then composed together to describe the behaviour of the system.

Bio-PEPA Analysis 153/ 223 The abstraction When modelling in Bio-PEPA we use the following abstraction: Each species i is described by a Bio-PEPA component C i. Each reaction j is associated with an action type α j and its dynamics is described by a specific function f αj. Given a reaction j, all the species/components cooperate on the action type α j and consequently, reactants decrease their levels, while products increase them. The species components are then composed together to describe the behaviour of the system.

Bio-PEPA Analysis 154/ 223 The abstraction When modelling in Bio-PEPA we use the following abstraction: Each species i is described by a Bio-PEPA component C i. Each reaction j is associated with an action type α j and its dynamics is described by a specific function f αj. Given a reaction j, all the species/components cooperate on the action type α j and consequently, reactants decrease their levels, while products increase them. The species components are then composed together to describe the behaviour of the system.

Bio-PEPA Analysis 155/ 223 The abstraction When modelling in Bio-PEPA we use the following abstraction: Each species i is described by a Bio-PEPA component C i. Each reaction j is associated with an action type α j and its dynamics is described by a specific function f αj. Given a reaction j, all the species/components cooperate on the action type α j and consequently, reactants decrease their levels, while products increase them. The species components are then composed together to describe the behaviour of the system.

Bio-PEPA Analysis 156/ 223 State representation The state of the system at any time consists of the local states of each of its sequential/species components. The local states of components are quantitative rather than functional, i.e. distinct states of the species are represented as distinct components, not derivatives of a single component. A component varying its state corresponds to it varying its quantity or level. This is captured by an integer parameter associated with the species and the effect of a reaction is to vary that parameter by a number of levels corresponding to the stoichiometry of this species in the reaction as we saw in the formal semantics.

Bio-PEPA Analysis 157/ 223 State representation The state of the system at any time consists of the local states of each of its sequential/species components. The local states of components are quantitative rather than functional, i.e. distinct states of the species are represented as distinct components, not derivatives of a single component. A component varying its state corresponds to it varying its quantity or level. This is captured by an integer parameter associated with the species and the effect of a reaction is to vary that parameter by a number of levels corresponding to the stoichiometry of this species in the reaction as we saw in the formal semantics.

Bio-PEPA Analysis 158/ 223 State representation The state of the system at any time consists of the local states of each of its sequential/species components. The local states of components are quantitative rather than functional, i.e. distinct states of the species are represented as distinct components, not derivatives of a single component. A component varying its state corresponds to it varying its quantity or level. This is captured by an integer parameter associated with the species and the effect of a reaction is to vary that parameter by a number of levels corresponding to the stoichiometry of this species in the reaction as we saw in the formal semantics.

Bio-PEPA Analysis 159/ 223 State representation The state of the system at any time consists of the local states of each of its sequential/species components. The local states of components are quantitative rather than functional, i.e. distinct states of the species are represented as distinct components, not derivatives of a single component. A component varying its state corresponds to it varying its quantity or level. This is captured by an integer parameter associated with the species and the effect of a reaction is to vary that parameter by a number of levels corresponding to the stoichiometry of this species in the reaction as we saw in the formal semantics.

Bio-PEPA Analysis 160/ 223 Analysis with Bio-PEPA Bio-PEPA SSA CTMCs ODEs StochKit Dizzy PRISM CVODES Matlab

Formal Mappings 161/ 223 Outline 1 Introduction Some Biological Background 2 Process Calculi for Systems Biology 3 Bio-PEPA: Syntax 4 Formal Semantics 5 Bio-PEPA Analysis 6 Formal Mappings 7 Example

Formal Mappings 162/ 223 CTMC Analysis of the Markov process can yield quite detailed information about the dynamic behaviour of the model. A steady state analysis provides statistics for average behaviour over a long run of the system. A transient analysis provides statistics relating to the evolution of the model over a fixed period. This will be dependent on the starting state. In the biological context transient analysis is appropriate much more frequently than steady state analysis. If molecule counts are kept in states then either analysis rapidly becomes infeasible. Bio-PEPA models can also be simulated using the Gillespie Stochastic Simulation Algorithm for CTMCs that we considered for PEPA.

Formal Mappings 163/ 223 CTMC Analysis of the Markov process can yield quite detailed information about the dynamic behaviour of the model. A steady state analysis provides statistics for average behaviour over a long run of the system. A transient analysis provides statistics relating to the evolution of the model over a fixed period. This will be dependent on the starting state. In the biological context transient analysis is appropriate much more frequently than steady state analysis. If molecule counts are kept in states then either analysis rapidly becomes infeasible. Bio-PEPA models can also be simulated using the Gillespie Stochastic Simulation Algorithm for CTMCs that we considered for PEPA.

Formal Mappings 164/ 223 CTMC Analysis of the Markov process can yield quite detailed information about the dynamic behaviour of the model. A steady state analysis provides statistics for average behaviour over a long run of the system. A transient analysis provides statistics relating to the evolution of the model over a fixed period. This will be dependent on the starting state. In the biological context transient analysis is appropriate much more frequently than steady state analysis. If molecule counts are kept in states then either analysis rapidly becomes infeasible. Bio-PEPA models can also be simulated using the Gillespie Stochastic Simulation Algorithm for CTMCs that we considered for PEPA.

Formal Mappings 165/ 223 CTMC Analysis of the Markov process can yield quite detailed information about the dynamic behaviour of the model. A steady state analysis provides statistics for average behaviour over a long run of the system. A transient analysis provides statistics relating to the evolution of the model over a fixed period. This will be dependent on the starting state. In the biological context transient analysis is appropriate much more frequently than steady state analysis. If molecule counts are kept in states then either analysis rapidly becomes infeasible. Bio-PEPA models can also be simulated using the Gillespie Stochastic Simulation Algorithm for CTMCs that we considered for PEPA.

Formal Mappings 166/ 223 CTMC Analysis of the Markov process can yield quite detailed information about the dynamic behaviour of the model. A steady state analysis provides statistics for average behaviour over a long run of the system. A transient analysis provides statistics relating to the evolution of the model over a fixed period. This will be dependent on the starting state. In the biological context transient analysis is appropriate much more frequently than steady state analysis. If molecule counts are kept in states then either analysis rapidly becomes infeasible. Bio-PEPA models can also be simulated using the Gillespie Stochastic Simulation Algorithm for CTMCs that we considered for PEPA.

Formal Mappings 167/ 223 CTMC Analysis of the Markov process can yield quite detailed information about the dynamic behaviour of the model. A steady state analysis provides statistics for average behaviour over a long run of the system. A transient analysis provides statistics relating to the evolution of the model over a fixed period. This will be dependent on the starting state. In the biological context transient analysis is appropriate much more frequently than steady state analysis. If molecule counts are kept in states then either analysis rapidly becomes infeasible. Bio-PEPA models can also be simulated using the Gillespie Stochastic Simulation Algorithm for CTMCs that we considered for PEPA.

Formal Mappings 168/ 223 CTMC with levels Models are based on discrete levels of concentration within a species. The granularity of the system is defined in terms of the step size h of the concentration intervals. We define the same step size h for all the species. This follows from the law of conservation of mass. If l i is the concentration level for the species i, the concentration is taken to be x i = l i h.

Formal Mappings 169/ 223 CTMC with levels Models are based on discrete levels of concentration within a species. The granularity of the system is defined in terms of the step size h of the concentration intervals. We define the same step size h for all the species. This follows from the law of conservation of mass. If l i is the concentration level for the species i, the concentration is taken to be x i = l i h.

Formal Mappings 170/ 223 CTMC with levels Models are based on discrete levels of concentration within a species. The granularity of the system is defined in terms of the step size h of the concentration intervals. We define the same step size h for all the species. This follows from the law of conservation of mass. If l i is the concentration level for the species i, the concentration is taken to be x i = l i h.

Formal Mappings 171/ 223 CTMC with levels Models are based on discrete levels of concentration within a species. The granularity of the system is defined in terms of the step size h of the concentration intervals. We define the same step size h for all the species. This follows from the law of conservation of mass. If l i is the concentration level for the species i, the concentration is taken to be x i = l i h.

Formal Mappings 172/ 223 CTMC with levels The rate of a transition is set to be consistent with the granularity. The granularity must be specified by the modeller as the expected range of concentration values and the number of levels considered. The structure of the CTMC derived from Bio-PEPA, which we term the CTMC with levels, will depend on the granularity of the model. As the granularity tends to zero the behaviour of this CTMC with levels tends to the behaviour of the ODEs [CDHC FBTC08].

Formal Mappings 173/ 223 CTMC with levels The rate of a transition is set to be consistent with the granularity. The granularity must be specified by the modeller as the expected range of concentration values and the number of levels considered. The structure of the CTMC derived from Bio-PEPA, which we term the CTMC with levels, will depend on the granularity of the model. As the granularity tends to zero the behaviour of this CTMC with levels tends to the behaviour of the ODEs [CDHC FBTC08].

Formal Mappings 174/ 223 CTMC with levels The rate of a transition is set to be consistent with the granularity. The granularity must be specified by the modeller as the expected range of concentration values and the number of levels considered. The structure of the CTMC derived from Bio-PEPA, which we term the CTMC with levels, will depend on the granularity of the model. As the granularity tends to zero the behaviour of this CTMC with levels tends to the behaviour of the ODEs [CDHC FBTC08].

Formal Mappings 175/ 223 CTMC with levels The rate of a transition is set to be consistent with the granularity. The granularity must be specified by the modeller as the expected range of concentration values and the number of levels considered. The structure of the CTMC derived from Bio-PEPA, which we term the CTMC with levels, will depend on the granularity of the model. As the granularity tends to zero the behaviour of this CTMC with levels tends to the behaviour of the ODEs [CDHC FBTC08].

Formal Mappings 176/ 223 ODE system The derivation of the ODEs from the Bio-PEPA is straightforward, based on the definitions of the species components. 1 definition of the (N M) stoichiometry matrix D, where N is the number of species and M is the number of reactions; 2 definition of the kinetic law vector v KL containing the kinetic law of each reaction; 3 association of the variable x i with each component C i and definition of the vector x. The ODE system is then obtained as dx dt = D v KL

Formal Mappings 177/ 223 ODE system The derivation of the ODEs from the Bio-PEPA is straightforward, based on the definitions of the species components. 1 definition of the (N M) stoichiometry matrix D, where N is the number of species and M is the number of reactions; 2 definition of the kinetic law vector v KL containing the kinetic law of each reaction; 3 association of the variable x i with each component C i and definition of the vector x. The ODE system is then obtained as dx dt = D v KL

Formal Mappings 178/ 223 ODE system The derivation of the ODEs from the Bio-PEPA is straightforward, based on the definitions of the species components. 1 definition of the (N M) stoichiometry matrix D, where N is the number of species and M is the number of reactions; 2 definition of the kinetic law vector v KL containing the kinetic law of each reaction; 3 association of the variable x i with each component C i and definition of the vector x. The ODE system is then obtained as dx dt = D v KL

Formal Mappings 179/ 223 ODE system The derivation of the ODEs from the Bio-PEPA is straightforward, based on the definitions of the species components. 1 definition of the (N M) stoichiometry matrix D, where N is the number of species and M is the number of reactions; 2 definition of the kinetic law vector v KL containing the kinetic law of each reaction; 3 association of the variable x i with each component C i and definition of the vector x. The ODE system is then obtained as dx dt = D v KL

Formal Mappings 180/ 223 ODE system There are advantages to be gained by using a process algebra model as an intermediary to the derivation of the ODEs. The ODEs can be automatically generated from the descriptive process algebra model, thus reducing human error. The process algebra model allow us to derive properties of the model, such as freedom from deadlock, before numerical analysis is carried out. The algebraic formulation of the model emphasises interactions between the biochemical entities.

Formal Mappings 181/ 223 ODE system There are advantages to be gained by using a process algebra model as an intermediary to the derivation of the ODEs. The ODEs can be automatically generated from the descriptive process algebra model, thus reducing human error. The process algebra model allow us to derive properties of the model, such as freedom from deadlock, before numerical analysis is carried out. The algebraic formulation of the model emphasises interactions between the biochemical entities.

Formal Mappings 182/ 223 ODE system There are advantages to be gained by using a process algebra model as an intermediary to the derivation of the ODEs. The ODEs can be automatically generated from the descriptive process algebra model, thus reducing human error. The process algebra model allow us to derive properties of the model, such as freedom from deadlock, before numerical analysis is carried out. The algebraic formulation of the model emphasises interactions between the biochemical entities.

Formal Mappings 183/ 223 ODE system There are advantages to be gained by using a process algebra model as an intermediary to the derivation of the ODEs. The ODEs can be automatically generated from the descriptive process algebra model, thus reducing human error. The process algebra model allow us to derive properties of the model, such as freedom from deadlock, before numerical analysis is carried out. The algebraic formulation of the model emphasises interactions between the biochemical entities.

Formal Mappings 184/ 223 PRISM model and model checking Analysing models of biological processes via stochastic model-checking has considerable appeal. As with stochastic simulation the answers which are returned from model-checking give a thorough stochastic treatment to the small-scale phenomena. However, in contrast to a simulation run which generates just one trajectory, probabilistic model-checking gives a definitive answer so it is not necessary to re-run the analysis repeatedly and compute ensemble averages of the results. Building a reward structure over the model it is possible to express complex analysis questions.

Formal Mappings 185/ 223 PRISM model and model checking Analysing models of biological processes via stochastic model-checking has considerable appeal. As with stochastic simulation the answers which are returned from model-checking give a thorough stochastic treatment to the small-scale phenomena. However, in contrast to a simulation run which generates just one trajectory, probabilistic model-checking gives a definitive answer so it is not necessary to re-run the analysis repeatedly and compute ensemble averages of the results. Building a reward structure over the model it is possible to express complex analysis questions.

Formal Mappings 186/ 223 PRISM model and model checking Analysing models of biological processes via stochastic model-checking has considerable appeal. As with stochastic simulation the answers which are returned from model-checking give a thorough stochastic treatment to the small-scale phenomena. However, in contrast to a simulation run which generates just one trajectory, probabilistic model-checking gives a definitive answer so it is not necessary to re-run the analysis repeatedly and compute ensemble averages of the results. Building a reward structure over the model it is possible to express complex analysis questions.

Formal Mappings 187/ 223 PRISM model and model checking Analysing models of biological processes via stochastic model-checking has considerable appeal. As with stochastic simulation the answers which are returned from model-checking give a thorough stochastic treatment to the small-scale phenomena. However, in contrast to a simulation run which generates just one trajectory, probabilistic model-checking gives a definitive answer so it is not necessary to re-run the analysis repeatedly and compute ensemble averages of the results. Building a reward structure over the model it is possible to express complex analysis questions.

Formal Mappings 188/ 223 PRISM model and model checking Stochastic model checking in PRISM is based on a CTMC and the logic CSL. Formally the mapping from Bio-PEPA is based on the structured operational semantics, generating the underlying CTMC in the usual way. In practice, it is more straightforward to directly map to the input language of the tool, the language of interacting, reactive modules. From a Bio-PEPA description one module is generated for each species component with an additional module to capture the functional rate information.

Formal Mappings 189/ 223 PRISM model and model checking Stochastic model checking in PRISM is based on a CTMC and the logic CSL. Formally the mapping from Bio-PEPA is based on the structured operational semantics, generating the underlying CTMC in the usual way. In practice, it is more straightforward to directly map to the input language of the tool, the language of interacting, reactive modules. From a Bio-PEPA description one module is generated for each species component with an additional module to capture the functional rate information.

Formal Mappings 190/ 223 PRISM model and model checking Stochastic model checking in PRISM is based on a CTMC and the logic CSL. Formally the mapping from Bio-PEPA is based on the structured operational semantics, generating the underlying CTMC in the usual way. In practice, it is more straightforward to directly map to the input language of the tool, the language of interacting, reactive modules. From a Bio-PEPA description one module is generated for each species component with an additional module to capture the functional rate information.

Formal Mappings 191/ 223 PRISM model and model checking Stochastic model checking in PRISM is based on a CTMC and the logic CSL. Formally the mapping from Bio-PEPA is based on the structured operational semantics, generating the underlying CTMC in the usual way. In practice, it is more straightforward to directly map to the input language of the tool, the language of interacting, reactive modules. From a Bio-PEPA description one module is generated for each species component with an additional module to capture the functional rate information.

Example 192/ 223 Outline 1 Introduction Some Biological Background 2 Process Calculi for Systems Biology 3 Bio-PEPA: Syntax 4 Formal Semantics 5 Bio-PEPA Analysis 6 Formal Mappings 7 Example

Example 193/ 223 Goldbeter s model [Goldbeter 91] Goldbeter s model describes the activity of the cyclin in the cell cycle. The cyclin promotes the activation of a cdk (cdc2) which in turn activates a cyclin protease. This protease promotes cyclin degradation. This leads to a negative feedback loop. In the model most of the kinetic laws are of kind Michaelis-Menten and this can be reflected in the Bio-PEPA model

Example 194/ 223 Goldbeter s model [Goldbeter 91] Goldbeter s model describes the activity of the cyclin in the cell cycle. The cyclin promotes the activation of a cdk (cdc2) which in turn activates a cyclin protease. This protease promotes cyclin degradation. This leads to a negative feedback loop. In the model most of the kinetic laws are of kind Michaelis-Menten and this can be reflected in the Bio-PEPA model

Example 195/ 223 Goldbeter s model [Goldbeter 91] Goldbeter s model describes the activity of the cyclin in the cell cycle. The cyclin promotes the activation of a cdk (cdc2) which in turn activates a cyclin protease. This protease promotes cyclin degradation. This leads to a negative feedback loop. In the model most of the kinetic laws are of kind Michaelis-Menten and this can be reflected in the Bio-PEPA model

Example 196/ 223 Goldbeter s model [Goldbeter 91] Goldbeter s model describes the activity of the cyclin in the cell cycle. The cyclin promotes the activation of a cdk (cdc2) which in turn activates a cyclin protease. This protease promotes cyclin degradation. This leads to a negative feedback loop. In the model most of the kinetic laws are of kind Michaelis-Menten and this can be reflected in the Bio-PEPA model

Example 197/ 223 Goldbeter s model [Goldbeter 91] Goldbeter s model describes the activity of the cyclin in the cell cycle. The cyclin promotes the activation of a cdk (cdc2) which in turn activates a cyclin protease. This protease promotes cyclin degradation. This leads to a negative feedback loop. In the model most of the kinetic laws are of kind Michaelis-Menten and this can be reflected in the Bio-PEPA model

Example 198/ 223 The biological model R1 CYCLIN (C) R2 cdc2 inactive (M ) R3 cdc2 active (M) R7 R4 Protease inactive (X ) R5 Protease active (X) R6

Example 199/ 223 The biological model (2) There are three different biological species involved: cyclin, the protein protagonist of the cycle, represented as C; cdc2 kinase, in both active and inactive form. The variables used to represent them are M and M, respectively; cyclin protease, in both active and inactive form. The variable are X and X.

Example 200/ 223 The biological model (2) There are three different biological species involved: cyclin, the protein protagonist of the cycle, represented as C; cdc2 kinase, in both active and inactive form. The variables used to represent them are M and M, respectively; cyclin protease, in both active and inactive form. The variable are X and X.

Example 201/ 223 The biological model (2) There are three different biological species involved: cyclin, the protein protagonist of the cycle, represented as C; cdc2 kinase, in both active and inactive form. The variables used to represent them are M and M, respectively; cyclin protease, in both active and inactive form. The variable are X and X.

Example 202/ 223 The biological model (2) There are three different biological species involved: cyclin, the protein protagonist of the cycle, represented as C; cdc2 kinase, in both active and inactive form. The variables used to represent them are M and M, respectively; cyclin protease, in both active and inactive form. The variable are X and X.

Example 203/ 223 Reactions id reaction react. prod. mod. kinetic laws α 1 creation of cyclin - C - v i α 2 degradation of cyclin C - - kd C activation of M M C C V M1 M α 3 α 4 α 5 α 6 α 7 cdc2 kinase deactivation of cdc2 kinase activation of cyclin protease deactivation of cyclin protease X triggered degradation of cyclin M M - X X M X X - C - X (K c +C) (K 1+M ) M V 2 (K 2+M) X M V M3 (K 3+X ) X V 4 K 4+X C v d X C+K d α 1, α 2 have mass-action kinetics; others are Michaelis-Menten.

Example 204/ 223 Translation into Bio-PEPA Definition of the set N : N = [C : h C, N c ; M : h M, N M ; M : h M, N M ; Definition of functional rates (F): X : h X, N X, ; X : h X, N X ] f α1 = fma(v i ); f α2 = fma(k d ); f α4 = fmm(v 2, K 2 ); f α5 = fmm(v 3, K 3 ); f α6 = fmm(v 4, K 4 ); f α7 = fmm(v d, K d ); f α3 = v 1 C M K c + C K1 + M

Example 205/ 223 Translation into Bio-PEPA Definition of the set N : N = [C : h C, N c ; M : h M, N M ; M : h M, N M ; Definition of functional rates (F): X : h X, N X, ; X : h X, N X ] f α1 = fma(v i ); f α2 = fma(k d ); f α4 = fmm(v 2, K 2 ); f α5 = fmm(v 3, K 3 ); f α6 = fmm(v 4, K 4 ); f α7 = fmm(v d, K d ); f α3 = v 1 C M K c + C K1 + M

Example 206/ 223 The Bio-PEPA model Definition of species components (Comp): C M M X X def = (α 1, 1) C + (α 2, 1) C + (α 7, 1) C + (α 3, 1) C def = (α 4, 1) M + (α 3, 1) M def = (α 3, 1) M + (α 4, 1) M + (α 5, 1) M def = (α 6, 1) X + (α 5, 1) X def = (α 5, 1) X + (α 6, 1) X + (α 7, 1) X Definition of the model component (P): C(l 0C ) {α 3 } M(l 0M) {α 3,α 4 } M (l 0M ) {α 5,α 7 } X (l 0X ) {α 5,α 6 } X (l 0X )

Example 207/ 223 The Bio-PEPA model Definition of species components (Comp): C M M X X def = (α 1, 1) C + (α 2, 1) C + (α 7, 1) C + (α 3, 1) C def = (α 4, 1) M + (α 3, 1) M def = (α 3, 1) M + (α 4, 1) M + (α 5, 1) M def = (α 6, 1) X + (α 5, 1) X def = (α 5, 1) X + (α 6, 1) X + (α 7, 1) X Definition of the model component (P): C(l 0C ) {α 3 } M(l 0M) {α 3,α 4 } M (l 0M ) {α 5,α 7 } X (l 0X ) {α 5,α 6 } X (l 0X )

Example 208/ 223 ODEs The stoichiometry matrix D: α 1 α 2 α 3 α 4 α 5 α 6 α 7 C +1 0 0 0 0 0 1 x C M 0 0 1 +1 0 0 0 x M M 0 0 +1 1 0 0 0 x M X 0 0 0 0 1 +1 0 x X X 0 0 0 0 +1 1 0 x X The vector that contains the kinetic laws is: w = ( v i 1, k d x C, V M1 x C K c + x C x M (K 1 + x M ), V 2 x M (K 2 + x M ), V M3 x M x X V 4 x X, (K 3 + x X ) (K 4 + x X ), v d x C x X (K d + x C ) )

Example 209/ 223 ODEs The stoichiometry matrix D: α 1 α 2 α 3 α 4 α 5 α 6 α 7 C +1 0 0 0 0 0 1 x C M 0 0 1 +1 0 0 0 x M M 0 0 +1 1 0 0 0 x M X 0 0 0 0 1 +1 0 x X X 0 0 0 0 +1 1 0 x X The vector that contains the kinetic laws is: w = ( v i 1, k d x C, V M1 x C K c + x C x M (K 1 + x M ), V 2 x M (K 2 + x M ), V M3 x M x X V 4 x X, (K 3 + x X ) (K 4 + x X ), v d x C x X (K d + x C ) )

Example 210/ 223 ODEs (2) The system of ODEs is obtained as d x dt = D w, where x T =: (x C, x M, x M, x X, x X ) is the vector of the species variables: dx C dt dx M dt dx M dt dx X dt dx X dt = v i 1 k d x C v d x C x X (K d + x C ) = V M1 x C K c + x C x M (K 1 + x M ) + V 2 x M (K 2 + x M ) = + V M1 x C K c + x C x M (K 1 + x M ) V 2 x M (K 2 + x M ) = V M3 x M x X (K 3 + x X ) = V M3 x M x X (K 3 + x X ) + V 4 x X (K 4 + x X ) V 4 x X (K 4 + x X )

Example 211/ 223 ODE results K 1 = K 2 = K 3 = K 4 = 0.02µM

Example 212/ 223 ODE results K 1 = K 2 = K 3 = K 4 = 40µM