ACTA PHYSICA DEBRECINA XLVI, 47 (2012) MODELLING GENE REGULATION WITH BOOLEAN NETWORKS. Abstract

Similar documents
Computational Biology: Basics & Interesting Problems

Simulation of Gene Regulatory Networks

BME 5742 Biosystems Modeling and Control

UNIT 5. Protein Synthesis 11/22/16

From Gene to Protein

7.32/7.81J/8.591J: Systems Biology. Fall Exam #1

Lecture 7: Simple genetic circuits I

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016

A Simple Protein Synthesis Model

GCD3033:Cell Biology. Transcription

RNA & PROTEIN SYNTHESIS. Making Proteins Using Directions From DNA

Reading Assignments. A. Genes and the Synthesis of Polypeptides. Lecture Series 7 From DNA to Protein: Genotype to Phenotype

Chapters 12&13 Notes: DNA, RNA & Protein Synthesis

Molecular Biology - Translation of RNA to make Protein *

Chapter 17. From Gene to Protein. Biology Kevin Dees

Control of Gene Expression in Prokaryotes

CS-E5880 Modeling biological networks Gene regulatory networks

Written Exam 15 December Course name: Introduction to Systems Biology Course no

More Protein Synthesis and a Model for Protein Transcription Error Rates

Protein Synthesis. Unit 6 Goal: Students will be able to describe the processes of transcription and translation.

(Lys), resulting in translation of a polypeptide without the Lys amino acid. resulting in translation of a polypeptide without the Lys amino acid.

1. In most cases, genes code for and it is that

Chapter 15 Active Reading Guide Regulation of Gene Expression

3.B.1 Gene Regulation. Gene regulation results in differential gene expression, leading to cell specialization.

10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison

Random Boolean Networks

Lecture 4: Transcription networks basic concepts

GENE ACTIVITY Gene structure Transcription Transcript processing mrna transport mrna stability Translation Posttranslational modifications

Flow of Genetic Information

PROTEIN SYNTHESIS INTRO

Graph Alignment and Biological Networks

arxiv:math/ v2 [math.ds] 3 Apr 2006

Translation and Operons

From gene to protein. Premedical biology

Name: SBI 4U. Gene Expression Quiz. Overall Expectation:

Problem Set 5. 1 Waiting times for chemical reactions (8 points)

Introduction to Bioinformatics

REVIEW SESSION. Wednesday, September 15 5:30 PM SHANTZ 242 E

13.4 Gene Regulation and Expression

Translation. Genetic code

15.2 Prokaryotic Transcription *

Lecture 18 June 2 nd, Gene Expression Regulation Mutations

Organic Chemistry Option II: Chemical Biology

Algorithms in Computational Biology (236522) spring 2008 Lecture #1

Topic 4 - #14 The Lactose Operon

Name Period The Control of Gene Expression in Prokaryotes Notes

Protein Synthesis. Unit 6 Goal: Students will be able to describe the processes of transcription and translation.

Introduction to molecular biology. Mitesh Shrestha

Chapter 16 Lecture. Concepts Of Genetics. Tenth Edition. Regulation of Gene Expression in Prokaryotes

Biology. Biology. Slide 1 of 26. End Show. Copyright Pearson Prentice Hall

Translation Part 2 of Protein Synthesis

Computational Cell Biology Lecture 4

Basic Synthetic Biology circuits

Controlling Gene Expression

UNIT 6 PART 3 *REGULATION USING OPERONS* Hillis Textbook, CH 11

Boolean networks for modeling and analysis of gene regulation

APGRU6L2. Control of Prokaryotic (Bacterial) Genes

Network motifs in the transcriptional regulation network (of Escherichia coli):

Prokaryotic Regulation

L3.1: Circuits: Introduction to Transcription Networks. Cellular Design Principles Prof. Jenna Rickus

Networks in systems biology

Videos. Bozeman, transcription and translation: Crashcourse: Transcription and Translation -

Introduction to Bioinformatics

Genetic transcription and regulation

CHAPTER 13 PROKARYOTE GENES: E. COLI LAC OPERON

What is the central dogma of biology?

Biology I Fall Semester Exam Review 2014

Number of questions TEK (Learning Target) Biomolecules & Enzymes

Types of RNA. 1. Messenger RNA(mRNA): 1. Represents only 5% of the total RNA in the cell.

V19 Metabolic Networks - Overview

Lesson Overview. Ribosomes and Protein Synthesis 13.2

Gene regulation: From biophysics to evolutionary genetics

Full file at CHAPTER 2 Genetics

56:198:582 Biological Networks Lecture 8

Controlling chaos in random Boolean networks

Biology 2018 Final Review. Miller and Levine

Honors Biology Reading Guide Chapter 11

Gene Autoregulation via Intronic micrornas and its Functions

Gene Regulation and Expression

Hybrid Model of gene regulatory networks, the case of the lac-operon

4. Why not make all enzymes all the time (even if not needed)? Enzyme synthesis uses a lot of energy.

Related Courses He who asks is a fool for five minutes, but he who does not ask remains a fool forever.

Unit 3: Control and regulation Higher Biology

GLOBEX Bioinformatics (Summer 2015) Genetic networks and gene expression data

Regulation of Gene Expression

MATHEMATICAL MODELS - Vol. III - Mathematical Modeling and the Human Genome - Hilary S. Booth MATHEMATICAL MODELING AND THE HUMAN GENOME

Lecture 5: September Time Complexity Analysis of Local Alignment

Cell Growth and Division

Bioinformatics 2 - Lecture 4

CHAPTER : Prokaryotic Genetics

Honors Biology Fall Final Exam Study Guide

Gene Switches Teacher Information

arxiv: v1 [q-bio.mn] 7 Nov 2018

Regulation of Gene Expression *

Problem Set 2. 1 Competitive and uncompetitive inhibition (12 points) Systems Biology (7.32/7.81J/8.591J)

V14 extreme pathways

Noisy Attractors and Ergodic Sets in Models. of Genetic Regulatory Networks

(Anti-)Stable Points and the Dynamics of Extended Systems

12-5 Gene Regulation

Prokaryotic Gene Expression (Learning Objectives)

Transcription:

ACTA PHYSICA DEBRECINA XLVI, 47 (2012) MODELLING GENE REGULATION WITH BOOLEAN NETWORKS E. Fenyvesi 1, G. Palla 2 1 University of Debrecen, Department of Experimental Physics, 4032 Debrecen, Egyetem 1, Hungary 2 Statistical and Biological Physics Research Group of HAS (Pázmány P. stny. 1/A, H-1117 Budapest, Hungary) Abstract Due to the changing environment, living cells must continously adapt their protein synthesis to react to the various inputs coming from their surrounding. A key element in the protein synthesis is the transcription of DNA to RNA. The so called regulatory proteins can catalyze or block the transcription of other proteins, thus help the cell in sensing the environment and regulating the rate of transcription of structural genes. A very simple model of the above process is provided by Boolean networks, where the state of each protein (gene) is either "on" or "o", and the interaction between proteins is represented by a directed graph. Here we overview the motivation and settings of this model and highlight possibilities for further research with a special focus on applying the latest results of control theory to Boolean networks. I. Introduction Proteins are essentially important ingredients of cell structure, catalising biochemical reactions, transporting dierent types of molecules, and taking part in a number of other vital processes. Twenty kinds of amino acids are 47

available for building up proteins. Each protein is consisting of hundreds of amino acids. All information that is needed to create a protein, is encoded in the DNA with the help of four bases (A, T, C, G). These bases form two complementer pairs (A-T, C-G), which follow each other along the double helix of DNA. Each amino acid is encoded with a basis triplet. (A doublet wouldn't be enough, since only 4 2 = 16 dierent pairs can be constituted from two bases.) All the triplets that encode a given amino acid constitute a gene. Transcription is the process of creating a complementary RNA copy of a sequence of DNA. During transcription, a DNA sequence is read by an RNA polymerase, which produces a complementary, antiparallel RNA strand. As opposed to DNA replication, transcription is resulting in an RNA complement that includes uracil (U) in all instances where thymine (T) would have occurred in a DNA complement. If the gene transcribed encodes a protein, the result of transcription is messenger RNA (mrna), which will then be used to create that protein via the process of translation [6]. Regulatory proteins help the cell sense the environment and regulate the rate of transcription of structural genes (in this case regulatory proteins are called transcription factors). The genes encoding the regulatory proteins are referred to as regulatory genes. There are two types of regulatory proteins, negative-acting and positive-acting proteins. Transcription factors bind to either enhancer or promoter regions of DNA adjacent to the genes that they regulate, but they also use a variety of other mechanisms for the regulation of gene expression [6]. The interaction between genes through transcription factors can be represented by a network, in which nodes correspond to genes, and a directed link between a pair of genes represents interaction [1]. In this paper we overwiev the simplest model for gene regulation proposed by Boolean networks. In Sect. II. we discuss the motivation for representing gene regulatory networks with Boolean networks, whereas in Sect. III we overwiev of general properties of Boolean networks. Finally in Sect. IV we depict further ideas for research in this direction. 48

II. Modelling gene regulatory networks There were many eorts to describe the topology and dynamics of gene regulatory networks. One of the possible methods is examining the cell in action through measurment. Then we can make statements of the gene regulatory network by using methods of statistical physics [1]. Another method is modelling the topology and dynamics of the network, testing the model by computer simulations, and then comparing the results of the simulations with available measurement data. We usually model gene regulatory networks with directed graphs. Nodes represent genes, and directed edges show how a given gene acts on another gene. Topology has a large eect on the behaviour of the network. The quantities used to describe the topology a network in general are the distribution of in- and out-degrees, in- and out-degree correlations, assortativity, occurence of loops and dierent kinds of small subgraphs, community structure [3, 4]. In order to study the eect of the topology on the dynamics we usually have to examine these features one by one. In order to take into account the time-dependent nature of the gene transcription, we assign a time-dependent state, x i (t) to each node. The value of this variable can correspond to, e.g., the transcription factor concentration [1]. In this case, the quantity changes continously in time. Linear dynamics is the simplest continous dynamics [2]. Most real systems are driven by nonlinear processes, but the controllability of most nonlinear systems is in many aspects structurally similar to that of nonlinear systems, prompting us to start our study using the canonical linear, timeinvariant dynamics [3]. To simplify the discussion further, we suppose that a gene can be in one of two states: allowed (turned on), when x i (t) = 1, or blocked (turned o), when x i (t) = 0. In this case we have a Boolean network [5]. A Boolean network can be generalized by introducing more than two discrete states (x i (t) 0, 1, 2,... n) [4]. We denote the state wector of the entire graph by x(t) = (x 1 (t), x 2 (t),..., x N (t)), where N is the number of nodes in the graph. In the next step we choose the dynamical rules governing the behaviour of the network in time. A cell interacts with its environment, so the behaviour 49

of the gene regulatory network can be controlled from outside of the cell. Our goal is to nd how to control a gene regulatory network. Moreover, small uctuations can occur in gene expression pattern and in chemical reactions inside the cell. We would also like to learn if these uctuations die out, or grow quickly. If these die out, the cell remains in its original state, but if these grow, the cell will turn into another state. We aim at understanding how stable is the behaviour of a gene regulatory network against small uctuations [4]. We can see that examining the dynamical stability and controlability of a model network is essentially important in order to have more knowledge of real gene regulatory networks. Additionally, this knowledge can be applied to other systems, which can be described with the same network model and are governed by similar dynamical rules. III. Boolean networks Deterministic Boolean networks are formally dened by a state vector x(t) = (x 1 (t), x 2 (t),... x N (t)) T, where x i {0, 1}, and a set of update functions f i such that x i (t) = f i (x ji;1 (t 1), x ji;2 (t 1),... ), where j i;k denote the indices of the Ki in nodes that input to node i [4]. The update function f i is dened for each node i by specifying a 2 Kin i -entry truth table in the following way: we assign 0 with probability p i or 1 with probability 1 p i for every possible input signal. We dene "sensitivity" q i as the probability that the output f i changes when given two dierent input strings [4]. If the Boolean functions are completely random, q i = 2p i (1 p i ). As a special case we can create a regular N K network whose nodes have uniformly Ki in = K entries [4]. If the graph structure and the update functions are xed, we can set up the system from a given initial state. The network evolves in descrete timesteps. Since there are nite possible states (2 N ) of the network, it will return to a previous state after at most 2 N timesteps, and then it will show periodical behaviour. To examine the stability of a Boolean network, consider two close initial states, x(t) and x(t). Their divergence can be quantied by the Hamming distance: h(t) = N i=1 x i(t) x i (t). If h(t) 0 as t, the network is 50

stable. If the network is instable, h(t) quickly increases to O(N), signifying chaotic behaviour. A critical network is at the border separating stability 1 and chaos [4]. There is a critical number of entries K c = 2p(1 p). If K < K c, the network is stable. If K > K c, it is unstable, and if K = K c, it is critical [4]. E.g., with K = 10 entries the network is unstable for q = 0.215, q = 0.3, q = 0.4 and q = 0.5. We can see this behaviour for various values of sensitivity on Fig.1., showing our simulation results. We generated a regular N K network with N = 10000 nodes and K = 10 connections, and started the simulations from 100 dierent initially close state-pairs (their initial Hamming distance was 100), and averaged their Hamming distances at each timesteps. At the beginning the average Hamming distance quickly groves showing chaotic behaviour. But after some timesteps it will saturate at a constant value, because the network will get to an attractor and remains there. So the Hamming distance of a state-pair will saturate at a constant value, and because of this, the average of all Haming distances will saturate at a constant value, too. A verage Hamming distance of an N-K Boolean network 8000 7000 q=0.215 q=0.3 6000. q=0.4 q=0.5 5000 h(t) 4000 3000 2000 1000 0 0 5 10 15 20 25 30 t Figure 1: Evolution of the average Hamming distance for a Boolean network of N = 10000 and K = 10, for various values of the sensitivity. We can create a directed graph that shows the transition between all 51

10 11 8 12 2 3 7 4 27 5 9 25 14 13 15 29 18 17 0 6 20 22 26 28 31 1 24 30 21 23 16 19 Figure 2: A graph of states of an N K Boolean network of N = 5, K = 2 and p = 0.5. Attractors are cycles, marked with red. possible states of a given Boolean network (with given update functions) if we assign a node index for every state in the following way: imagine the state vector of the network as a binary number, then convert this binary number to a decimal number. This decimal number will be a node index. Directed rings and loops correspond to attractors (cycles and xpoints). This method works only for small graphs, where N = 5 to N = 20. Our hope is that examinig these small graphs can reveal interesting eects, which may be generalised for larger graphs as well, where the order of the number of nodes is as large as in real gene regulatory networks. The networks on gure 2 and 3 shows the transiton between states of a reguar N K network, where N = 10, K = 2 and p i = p = 0.5. The most relevant dierence between them is that the graph on g. 3 is connected, while the other on g. 2 is not connected. On g. 2 two attractors can be seen, marked with red. On g. 3 a loop marked also with red shows the xpoint of the system. It is visible, that by the same N, K and p parameters we get topologically dierent networks of states. 52

1 3 19 17 13 8 14 10 24 28 31 26 27 22 6 21 12 30 29 9 25 18 16 15 20 11 4 0 5 7 2 23 Figure 3: Another graph of states of an N K Boolean network of N = 5, K = 2 and p = 0.5. The attractor is a xpoint. 53

IV. Discussion and outlook In summary we have shown that Boolean networks oer a simple and intuitive model for gene regulatory networks. The attractors of the Boolean network can be regarded as periodical cycles and steady states of the modelled gene regulatory network. A cell adapts to changes in its environment with the help of regulatory proteins and gene regulatory network. Therefore, if we assume that the environmental conditions are steady for a long time and the cell has adopted to these conditions, the regulatory network is in one of its attractors. However, if there is a change in the environment, after some time the network will get to another attractor driving the cell to its new steady state or periodical cycle. If the graph of the states of gene regulatory network is not connected, there is no transition between separate subgraphs. (On g. 2 we can see two separate subgraphs.) Each connected subgraph has exactly one attractor, so there has to be a controller outside of the regulatory network that can drive the system from one subgraph to another. These outside controllers could correspond to e.g., special regulatory proteins. These controller regulatory proteins are able to change the state of the whole network. Another natural assumption is that the optimal way of control while changing from one attractor to another happens with the help of as few controller nodes as possible. The aim of our further work is examining the dependence of the minimum number of controller nodes on dierent parameters of a given Boolean network. References 54 [1] Lezon, T. R., Banavar, J. R., Cieplak, M., Maritan A. and Fedoro, N. V., Proc. Natl. Acad. Sci. 103, 1903319038 (2006). [2] Kalman, R. E., J. Soc. Indus. Appl. Math. Ser. A 1, 152192 (1963). [3] Liu YY, Slotine JJ, Barabási AL., Nature. 473(7346), 16773 (2011) [4] Andrew Pomerance, Edward Ott, Michelle Girvan and Wolfgang Losert, PNAS 106, 20 (82098214). [5] Kauman S., J Theor Biol. 22, 437467 (1969).

[6] James E. Darnell, Harvey F. Lodish, David Baltimore, Molecular cell biology (Scientic American Books, 1986). 55