Inferring Update Sequences in Boolean Gene Regulatory Networks

Similar documents
Simulation of Gene Regulatory Networks

Classification of Random Boolean Networks

Synchronous state transition graph

Mixed-Integer Nonlinear Programming

Logic-Based Modeling in Systems Biology

Contextual Random Boolean Networks

Branch-and-Bound for the Travelling Salesman Problem

Artificial Neural Networks Examination, March 2004

arxiv: v2 [cs.dm] 29 Mar 2013

5.3 METABOLIC NETWORKS 193. P (x i P a (x i )) (5.30) i=1

Classification of Random Boolean Networks

DESIGNING CNN GENES. Received January 23, 2003; Revised April 2, 2003

AC&ST AUTOMATIC CONTROL AND SYSTEM THEORY SYSTEMS AND MODELS. Claudio Melchiorri

Dynamical Systems in Biology

Average-Consensus of Multi-Agent Systems with Direct Topology Based on Event-Triggered Control

Lecture Network analysis for biological systems

Descriptional Complexity of Formal Systems (Draft) Deadline for submissions: April 25, 2010 Final versions: July 8, 2010

NONLINEAR THREE POINT BOUNDARY VALUE PROBLEM

Sensitivity to Synchronism of Boolean Automata Networks

Hopfield Neural Network and Associative Memory. Typical Myelinated Vertebrate Motoneuron (Wikipedia) Topic 3 Polymers and Neurons Lecture 5

Moment-based Availability Prediction for Bike-Sharing Systems

Asynchronous Boolean networks and hereditarily bijective maps

Existence of Positive Periodic Solutions of Mutualism Systems with Several Delays 1

An homotopy method for exact tracking of nonlinear nonminimum phase systems: the example of the spherical inverted pendulum

Artificial Neural Networks Examination, June 2004

Constraint Qualification Failure in Action

Asynchronous random Boolean network model based on elementary cellular automata

Fixed point theorems for Boolean networks expressed in terms of forbidden subnetworks

On decomposing graphs of large minimum degree into locally irregular subgraphs

Data Gathering and Personalized Broadcasting in Radio Grids with Interferences

I/O monotone dynamical systems. Germán A. Enciso University of California, Irvine Eduardo Sontag, Rutgers University May 25 rd, 2011

arxiv:math/ v2 [math.ds] 3 Apr 2006

EEE 241: Linear Systems

On the convergence of Boolean automata networks without negative cycles

Fuzzy Cognitive Maps Learning through Swarm Intelligence

Random Boolean Networks

Feasibility pump for aircraft deconfliction with speed regulation

The intersection and the union of the asynchronous systems

Output Regulation of Uncertain Nonlinear Systems with Nonlinear Exosystems

ACTA PHYSICA DEBRECINA XLVI, 47 (2012) MODELLING GENE REGULATION WITH BOOLEAN NETWORKS. Abstract

A NEW PROOF OF THE WIENER HOPF FACTORIZATION VIA BASU S THEOREM

1 The Algebraic Normal Form

Data Gathering and Personalized Broadcasting in Radio Grids with Interferences

Section 7.1 Relations and Their Properties. Definition: A binary relation R from a set A to a set B is a subset R A B.

Algorithms and methods of the BoolNet R package

arxiv: v2 [cs.dm] 18 Nov 2010

Organization. I MCMC discussion. I project talks. I Lecture.

EXISTENCE OF NEUTRAL STOCHASTIC FUNCTIONAL DIFFERENTIAL EQUATIONS WITH ABSTRACT VOLTERRA OPERATORS

The Power of Extra Analog Neuron. Institute of Computer Science Academy of Sciences of the Czech Republic

Nonlinear Control Lecture 1: Introduction

A Robust Controller for Scalar Autonomous Optimal Control Problems

Joint work with Nguyen Hoang (Univ. Concepción, Chile) Padova, Italy, May 2018

Lecture 5: Linear Systems. Transfer functions. Frequency Domain Analysis. Basic Control Design.

Robust multi-sensor scheduling for multi-site surveillance

Introduction to Random Boolean Networks

Renormalization: An Attack on Critical Exponents

Workshop on Discrete Harmonic Analysis Newton Institute, March 2011

arxiv: v1 [cs.sy] 25 Oct 2017

Q-Learning for Markov Decision Processes*

Synchronous Sequential Circuit

Complexity Theory VU , SS The Polynomial Hierarchy. Reinhard Pichler

Outline. Complexity Theory EXACT TSP. The Class DP. Definition. Problem EXACT TSP. Complexity of EXACT TSP. Proposition VU 181.

Haploid & diploid recombination and their evolutionary impact

CHALMERS, GÖTEBORGS UNIVERSITET. EXAM for ARTIFICIAL NEURAL NETWORKS. COURSE CODES: FFR 135, FIM 720 GU, PhD

Robust discrete time control

Distributed detection of topological changes in communication networks. Riccardo Lucchese, Damiano Varagnolo, Karl H. Johansson

On Asymptotic Synchronization of Interconnected Hybrid Systems with Applications

Lecture 6. Lorenz equations and Malkus' waterwheel Some properties of the Lorenz Eq.'s Lorenz Map Towards definitions of:

Event-Triggered Decentralized Dynamic Output Feedback Control for LTI Systems

Deterministic Nonlinear Modeling of Ant Algorithm with Logistic Multi-Agent System

STRONGLY CONNECTED SPACES

Reducing neuronal networks to discrete dynamics

Coupled Random Boolean Network Forming an Artificial Tissue

Modeling and Control Overview

Reinforcement Learning In Continuous Time and Space

The complexity of recursive constraint satisfaction problems.

Index. FOURTH PROOFS n98-book 2009/11/4 page 261

Robust computation with dynamical systems

A reactive framework for Ant Colony Optimization

A. The Hopfield Network. III. Recurrent Neural Networks. Typical Artificial Neuron. Typical Artificial Neuron. Hopfield Network.

Transfer Line Balancing Problem

Computing with Inter-spike Interval Codes in Networks of Integrate and Fire Neurons

CSE 473: Artificial Intelligence Spring 2014

An Integral-type Constraint Qualification for Optimal Control Problems with State Constraints

Final Exam December 12, 2017

3. Abstract Boolean Algebras

Existence and uniqueness of solutions for a continuous-time opinion dynamics model with state-dependent connectivity

Cartesian product of hypergraphs: properties and algorithms

QUANTIZED SYSTEMS AND CONTROL. Daniel Liberzon. DISC HS, June Dept. of Electrical & Computer Eng., Univ. of Illinois at Urbana-Champaign

Intuitionistic Fuzzy Estimation of the Ant Methodology

On the oriented chromatic index of oriented graphs

Biological networks CS449 BIOINFORMATICS

A. The Hopfield Network. III. Recurrent Neural Networks. Typical Artificial Neuron. Typical Artificial Neuron. Hopfield Network.

HOPFIELD neural networks (HNNs) are a class of nonlinear

Chapter 1 Fundamental Concepts

Entrainment and Chaos in the Hodgkin-Huxley Oscillator

Neural Nets in PR. Pattern Recognition XII. Michal Haindl. Outline. Neural Nets in PR 2

Learning and Memory in Neural Networks

A heuristic algorithm for the Aircraft Landing Problem

Time-delay feedback control in a delayed dynamical chaos system and its applications

Transcription:

Inferring Update Sequences in Boolean Gene Regulatory Networks Fabien Tarissan a Camilo La Rota b a Complex System Institute (ISC) & CNRS, Palaiseau, France b Complex System Institute (IXXI), Lyon, France Key words: Mathematical programming, Inverse problems, Gene regulatory networks reconstruction 1 Introduction This paper employs mathematical programming and mixed integer linear programming techniques for solving a problem arising in the study of genetic regulatory networks. More precisely, we solve the inverse problem consisting in the determination of the sequence of updates in the digraph representing the gene regulatory network (GRN) of Arabidopsis thaliana in such a way that the generated gene activity is as close as possible to the observed data. Differences among cells of different tissues depend on the specific set of genes that are active in each tissue. Therefore, one usually assumes that the different steady states of a GRN dynamics correspond to the different possible cell fates ([7]). This leads to explain the changes observed during the development of the organisms by the fact that perturbations on specific elements of the network make the system switch from one steady state to another. Some hypothesis can be made about these perturbations, which are then treated as initial conditions for the new tissue being formed. However, an important unknown is (are) the update sequence(s) of the gene activity that let the system evolve from a given set of initial conditions to the set of steady states. Indeed, different update sequences determine different sets of basins of attraction of the GRNs. However, the steady states remain the same under any sequence. Usually, a specific update sequence is assumed to rule the dynamics of the GRNs [1,3]. The present study differs from this approach in that we sought to infer the update sequence from the biological observations. It also differs from our previous paper as we focus here on asynchronous sequences whereas in [6] the updates were synchronous. Email addresses: tarissan@lix.polytechnique.fr (Fabien Tarissan), camilo.larota@ens-lyon.fr (Camilo La Rota). Preprint submitted to CTW 2009 16 March 2009

2 The problem Given a directed graph G = (V, E), a discrete set T of time instants (which we suppose to be an initial contiguous proper subset of N) and the following functions: a function α : E {+1, 1} called the arc sign function; a function ω : E R + called the arc weight function; a function χ : V T {0, 1} called the gene state function; a function ι : V {0, 1} called the initial configuration; a function θ : V R called the threshold function; a function γ : V T {0, 1} called the updating function. A gene regulatory network (GRN) is a 8-tuple (G, T, α, ω, θ, χ, ι, γ) such that: v V χ(v, 0)=ι(v) (1) H(v, t 1) if γ(v, t) = 1 v V, t T {0} χ(v, t) = (2) χ(v, t 1) otherwise where H is the Heaviside function defined for v V and t T by 1 if α(u, v)ω(u, v)χ(u, t) θ(v) H(v, t)= u δ (v) 0 otherwise, (3) with δ (v) = {u V (u, v) E} for all v V. Eqns. (1)-(2)-(3) together are called the evolution rules of the GRN. For any particular t T, χ(, t) : V {0, 1} is called a configuration. Since the evolution rules relate a configuration at time t with a configuration at time t 1, χ(, t) is called a fixed configuration (or fixed point) if it remains invariant under the application of one complete cycle of updates encoded by γ. Furthermore, as long as the evolution rules are purely deterministic (as is modelled above), a fixed point of a GRN is determined by its initial configuration. In this paper we deal with an inverse problem related to the estimation of update sequence in GRNs. More precisely, we address the following. Update Sequence Estimation in GRNs (USEGRN). Given a digraph G, a time instant set T, an arc sign function α, an arc weight function ω, a threshold function θ and a set I of initial configurations, find an update function γ with the property that for all ι I there exists a gene activation function χ such that (G, T, α, ω, θ, χ, ι, γ) are GRNs whose fixed points are at a minimum distance to observed data. In other words, we attempt to estimate the sequence of updates in a GRN from the knowledge of the digraph topology in such a way that (a) the GRN 2

evolution rules are consistent with respect to a certain set of initial configurations and (b) the fixed points induced by the estimated values are as close as possible to the observed ones. As the reader might notice, the problem strongly depends on the modelling of the update sequence encoded by γ. In [3], the authors proposed to describe such a sequence by means of periods and delays parameters for each gene. Assuming p v and d v to be such values for gene v, we can reformulate Equation 2 in the previous modellisation according to the following relation: t T γ(v, t) = 1 n N s.t. t = np v + d v 3 The mathematical programming formulation The methodology we shall follow is that of modelling the USEGRN by means of a mathematical programming formulation: min x f(x) subject to g(x) 0, where x R n are the decision variables and f : R n R is the objective function to be minimized subject to a set of constraints g : R n R m which may also include variable ranges or integrality constraints on the variables. The primary concern in solving the USEGRN is thus modellistic rather than algorithmic. One of the foremost difficulties is that of employing a static modelling paradigm such as mathematical programming in order to describe a problem whose very definition depends on time. Another important difficulty resides in describing the necessary and sufficient conditions for a configuration to be a fixed point in a mathematical form. We solve this difficulty by introducing two decision variables: a binary variable s stating that the network has been stable for at least two successive time steps; a binary variable y that will indicate the first time the network is stable. The last difficulty concerns the proper modelling of the update sequence as proposed in [3]. The solution relies on the use of two binary variables π and δ for each gene and indexed over the possible values for the periods and the delays. Then, π v (p) (resp. δ v (d)) is set to 1 if the period (resp. delay) of v is p (resp. d). We provide above such a formulation: Sets: V of genes in the network, E of edges in the network, T of time instants, P of periods values, D of delay values and R of regions. Parameters: ι : R V {0, 1} is the initial configuration of the network (vector of boolean values affected to the genes) for each region. α : A {+1, 1} is the sign of the arc weights; w : V R + is the arc weight function; θ : V R is the threshold function; 3

φ : V R {0, 1} is the targeted fixed configuration for region r. Variables: for all r R, v V, t T, x t r,v {0, 1} is the activation state of gene v at time t in region r; for all r R, v V, t T, h t r,v {0, 1} is the projection of state of gene v at time t in region r according to Heaviside function; s : R T {0, 1} is a decision variable indicating that the network is stable during at least two successive time steps in region r. y : R T {0, 1} is a decision variable that indicates the first time the network reaches a stable state in region r. for all v V, p P, π v,p {0, 1} is a decision variable that indicates that the periodicity of gene v is p. for all v V, d D δ v,d {0, 1} is a decision variable that indicates that the delay of gene v is d. Objective function: min r R t T \{1} ( (y t 1 r yr t ) x t r,v φ r,v ). Constraints: Heaviside function computation rule (for all t T \ {1}, v V, r R) : θ v h t r,v V (1 ht r,v ) u δ (v) α uv w uv x t 1 r,u (θ v 1)(1 h t r,v ) + V ht r,v state transition rules (for all r R, v V, p P, d D): x 0 r,v = ι r,v t T \ {1} s.t. t np + d π v,p δ v,d x t r,v = π v,p δ v,d x t 1 r,v t T \ {1} s.t. t = np + d π v,p δ v,d x t r,v = π v,p δ v,d h t 1 r,v π v,p δ v,d d p fixed point conditions (for all r R, t T \ {1}): x t r,v xt 1 r,v V st r x t r,v x t 1 r,v s t r yr tft r = 0 s u r = ft r u>t (1 y t r ) ft r ( P + D ) 2 yr τ τ T 4 Reformulations and solutions The above problem is a nonconvex Mixed-Integer Non-Linear Problem that can be reformulated exactly to a Mixed-Integer Linear Problem using the techniques proposed in [5]. After standard mathematical manipulations, all the nonlinearities reduce to product terms of binary and/or integer variables, which can be reformulated by adding new auxiliary variables and constraints 4

as follows: xy terms (x, y : binary) η 0 xz terms (x : binary, z : integer) ζ z L x η y ζ z + ( z L + z U )(1 x) η x ζ z U x η x + y 1 ζ z ( z L + z U )(1 x) where z L and z U stand for the boundaries of z and η and ζ are the new variables that replace the products in the equations. We solved to optimality a few real-life instances from the GRN of Arabidopsis thaliana using AMPL [2] to model the problem and CPLEX [4] to solve it. The size of the GRNs involved were such that CPLEX obtained the optimal solution in a matter of minutes. 5 Acknowledgements This work was supported by EU FP6 FET Project Morphex. We are deeply grateful to L. Liberti (LIX) for his feedback on the mathematical formulation. References [1] J. Demongeot, A. Elena, and S. Sené. Robustness in Regulatory Networks: a Multi-Disciplinary Approach. Acta Biotheoretica, 56(1-2):27 49, 2008. [2] R. Fourer and D. Gay. The AMPL Book. Duxbury Press, Pacific Grove, 2002. [3] Carlos Gershenson. Classification of random boolean networks. In Abbass Standish and Bedau, editors, Artificial Life VIII, Proceedings of the Eighth International Conference on Artificial Life, pages 1 8. MIT Press, 2002. [4] ILOG. ILOG CPLEX 10.1 User s Manual. ILOG S.A., Gentilly, France, 2006. [5] L. Liberti, S. Cafieri, and F. Tarissan. Reformulations in mathematical programming: A computational approach. In A. Abraham, A.-E. Hassanien, and P. Suarry, editors, Global Optimization: Theoretical Foundations and Application, Studies in Computational Intelligence. Springer, New York, to appear. [6] F. Tarissan, C. La Rota, and L. Liberti. Network reconstruction: a mathematical programming approach. In Proceedings of the European Conference of Complex Systems (ECCS 08), to appear. [7] R Thomas and M Kaufman. Multistationarity, the basis of cell differentiation and memory. i. structural conditions of multistationarity and other nontrivial behavior. Chaos, 11(1):170 179, Mar 2001. 5