arxiv:physics/ v1 [physics.bio-ph] 30 Sep 2002


Zipf's Law in Gene Expression

Chikara Furusawa
Center for Developmental Biology, The Institute of Physical and Chemical Research (RIKEN), Kobe 650-0047, JAPAN

Kunihiko Kaneko
Department of Pure and Applied Sciences, Univ. of Tokyo, Komaba, Meguro-ku, Tokyo 153-8902, JAPAN

arXiv:physics/0209103v1 [physics.bio-ph] 30 Sep 2002

(Dated: February 2, 2008)

Abstract

Using data from gene expression databases on various organisms and tissues, including yeast, nematodes, human normal and cancer tissues, and embryonic stem cells, we found that the abundances of expressed genes exhibit a power-law distribution with an exponent close to -1, i.e., they obey Zipf's law. Furthermore, by simulations of a simple model with an intra-cellular reaction network, we found that Zipf's law of chemical abundance is a universal feature of cells where such a network optimizes the efficiency and faithfulness of self-reproduction. These findings provide novel insights into the nature of the organization of reaction dynamics in living cells.

PACS numbers: 87.17.Aa, 87.80.Vt, 89.75.Fb

In a cell, an enormous number of organized chemical reactions are required to maintain its living state. Although the enumeration of detailed cellular processes and the construction of complicated models are important for a complete description of cellular behavior, it is also necessary to search for universal laws with regard to the intra-cellular reactions common to all living systems, and then to unravel the logic of life leading to such universal features. For example, scale-free networks have recently been discussed as a universal property of some biochemical reaction networks within existing organisms [1, 2]. These studies, however, focused only on the properties of the network topologies, while the reaction dynamics of the networks were not discussed. Here, we report a universal property of the reaction dynamics that occurs within cells, namely a power-law distribution of the abundance of expressed genes with an exponent close to -1, i.e. a power-law distribution that obeys Zipf's law [3]. By using an abstract model of a cell with simple reaction dynamics, we show that this power-law behavior in the chemical abundances generally appears when the reaction dynamics leads to a faithful and efficient self-reproduction of a cell. These findings provide insights into the nature of the organization of complex reaction dynamics in living cells.

In order to investigate possible universal properties of the reaction dynamics, we examined the distributions of the abundances of expressed genes (which are approximately equal to the abundances of the corresponding proteins) in 6 organisms and more than 40 tissues, based on data publicly available from SAGE (Serial Analysis of Gene Expression) databases [4, 5, 6]. SAGE allows the number of copies of any given mRNA to be quantitatively evaluated by determining the abundances of the short sequence tags which uniquely identify it [7].
In Fig.1, we show the rank-ordered distributions of the expressed genes, where the ordinate indicates the frequency of the observed sequence tags (i.e. the population ratio of the corresponding mRNA to the total mRNA), and the abscissa shows the rank determined from this frequency. As shown, the distributions follow a power-law with an exponent close to -1 (Zipf's law). We observed this power-law distribution for all the available samples, including 18 human normal tissues, human cancer tissues, mouse (including embryonic stem cells), rat, nematode (C. elegans), and yeast (S. cerevisiae) cells. All of the more than 40 samples (except for two plant samples) show power-law distributions with exponents in the range from -1 to -0.86. Even though there are some factors which may bias the results of the SAGE experiments, such as sequencing errors and non-uniqueness of tag sequences, it seems rather unlikely that the distribution is an artifact of the experimental procedure.
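
As an illustration of the rank-ordered analysis described above, the following minimal sketch ranks tag counts by abundance and estimates the exponent with an ordinary least-squares fit on log-log axes. The function names and the synthetic counts are hypothetical, and the least-squares estimator is an assumption made for illustration; the text does not state which fitting procedure was used.

```python
import math

def rank_frequency(tag_counts):
    """Return (rank, frequency) pairs, most abundant tag first."""
    total = sum(tag_counts)
    ordered = sorted((c for c in tag_counts if c > 0), reverse=True)
    return [(rank, count / total) for rank, count in enumerate(ordered, start=1)]

def loglog_slope(pairs):
    """Least-squares slope of log(frequency) against log(rank)."""
    xs = [math.log(rank) for rank, _ in pairs]
    ys = [math.log(freq) for _, freq in pairs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx

# Synthetic (not measured) counts that follow count = 10000 / rank exactly.
synthetic_counts = [10000.0 / r for r in range(1, 501)]
pairs = rank_frequency(synthetic_counts)
alpha = loglog_slope(pairs)  # estimated exponent of the power law
```

With real SAGE tag counts the low-abundance tail is noisy, so a fit restricted to a chosen range of ranks may be more appropriate than a fit over all ranks.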

The abundance of each protein is the result of a complex network of chemical reactions that is influenced by a possibly large number of factors, including other proteins and genes. Why, then, is Zipf's law universally observed, and what class of reaction dynamics shows the observed power-law distribution? Because the power-law distribution applies to a wide range of existing organisms, it is expected to appear as a general feature of the reaction dynamics of cellular systems. In order to investigate the above questions, we adopt a simple model of cellular dynamics that captures only its basic features. It consists of intra-cellular catalytic reaction networks that transform nutrient chemicals into proteins. By studying a class of simple models with these features, we clarify the conditions under which the reaction dynamics leads to a power-law distribution of the chemical abundances. Of course, real intra-cellular processes are much more complicated, but if the mechanism is universal, the power-law should be valid regardless of how complicated the actual processes are. Hence it is relevant to study a model that is as simple as possible when trying to understand a universal law in real data.

Consider a cell consisting of a variety of chemicals. The internal state of the cell can be represented by a set of numbers $(n_1, n_2, \ldots, n_k)$, where $n_i$ is the number of molecules of the chemical species $i$, with $i$ ranging from 1 to $k$. For the internal chemical reaction dynamics, we chose a catalytic network among these $k$ chemical species, where each reaction from some chemical $i$ to some other chemical $j$ is assumed to be catalyzed by a third chemical $l$, i.e. $i + l \to j + l$. The rate of increase of $n_j$ (and decrease of $n_i$) through this reaction is given by $\epsilon n_i n_l / N^2$, where $\epsilon$ is the coefficient for the chemical reaction.
For simplicity all the reaction coefficients were chosen to be equal [8], and the connection paths of this catalytic network were chosen randomly such that the probability of any two chemicals $i$ and $j$ being connected is given by the connection rate $\rho$ [9].

Some resources (nutrients) are supplied from the environment by diffusion through the membrane (with a diffusion coefficient $D$), to ensure the growth of a cell. Through the catalytic reactions, these nutrients [10] are transformed into other chemicals. Some of these chemicals may penetrate [8] the membrane and diffuse out while others will not. With the synthesis of the impenetrable chemicals that do not diffuse out, the total number of molecules $N = \sum_i n_i$ in a cell can increase, and accordingly the cell volume will increase. We study how this cell growth is sustained by dividing a cell into two when the volume is larger than some threshold. For simplicity the division is assumed to occur when the total number of molecules $N = \sum_i n_i$ in a cell exceeds a given threshold $N_{max}$. The mother cell's molecules, chosen randomly, are split evenly between the two daughter cells.

In our numerical simulations, we randomly pick a pair of molecules in a cell, and transform them according to the reaction network. In the same way, diffusion through the membrane is computed by randomly choosing molecules inside the cell and nutrients in the environment. In the case $N \gg k$ (i.e. the continuous limit), the reaction dynamics is represented by the following rate equation:

$$\frac{dn_i}{dt} = \sum_{j,l} \mathrm{Con}(j,i,l)\, \epsilon\, n_j n_l / N^2 - \sum_{j',l'} \mathrm{Con}(i,j',l')\, \epsilon\, n_i n_{l'} / N^2 + D \sigma_i \left( \bar{n}_i / V - n_i / N \right),$$

where $\mathrm{Con}(i,j,l)$ is 1 if there is a reaction $i + l \to j + l$, and 0 otherwise, whereas $\sigma_i$ takes the value 1 if the chemical $i$ is penetrable, and 0 otherwise. The third term describes the transport of chemicals through the membrane, where $\bar{n}_i$ is a constant representing the number of molecules of the $i$-th chemical species in the environment and $V$ denotes the volume of the environment in units of the initial cell size. The number $\bar{n}_i$ is nonzero only for the nutrient chemicals.

If the total number of molecules $N_{max}$ is larger than the number of chemical species $k$, the population ratios $\{n_i/N\}$ are generally fixed, since the daughter cells inherit the chemical compositions of their mother cells. For $k > N_{max}$ [11], the population ratios do not settle down and can change from generation to generation. In both cases, depending on the membrane diffusion coefficient $D$, the intra-cellular reaction dynamics can be classified into three classes [12]. First, there is a critical value $D = D_c$ beyond which the cell cannot grow continuously. When $D > D_c$, the flow of nutrients from the environment is so fast that the internal reactions transforming them into chemicals sustaining metabolism cannot keep up.
In this case all the molecules in the cell will finally be substituted by the nutrient chemicals and the cell stops growing, since the nutrients alone cannot catalyze any reactions to generate impenetrable chemicals. Continuous cellular growth and successive divisions are possible only for $D \le D_c$. When the diffusion coefficient $D$ is sufficiently small, the internal reactions progress faster than the flow of nutrients from the environment, and all the existing chemical species have small numbers of approximately the same level. A stable reaction network organization is obtained only at intermediate diffusion coefficients below $D_c$, where some chemical species have much larger numbers of molecules than others.

The rank-ordered number distributions of chemical species in our model are plotted in Fig.2, where the ordinate indicates the number of molecules $n_i$ and the abscissa shows the rank determined by $n_i$. As shown in the figure, the slope of the rank-ordered number distribution becomes steeper as the diffusion coefficient $D$ increases. We found that at the critical point $D = D_c$, the distribution converges to a power-law with an exponent -1.

The power-law distribution at this critical point is maintained by a hierarchical organization of catalytic reactions, where the synthesis of higher ranking chemicals is catalyzed by lower ranking chemicals. For example, major chemical species (with e.g. $n_i > 1000$) are directly synthesized from nutrients and catalyzed by chemicals that are slightly less abundant (e.g. $n_i \sim 200$). The latter chemicals are mostly synthesized from nutrients (or other major chemicals), and catalyzed by chemicals that are much less abundant. In turn these chemicals are catalyzed by chemicals that are even less abundant, and this hierarchy of catalytic reactions continues until it reaches the minor chemical species (with e.g. $n_i < 5$) [13].

Based on this catalytic hierarchy, the observed exponent -1 can be explained using a mean field approximation. First, we replace the concentration $n_i/N$ of each chemical $i$, except the nutrient chemicals, by a single average concentration (mean field) $x$, while the concentration of nutrient chemicals is given by the average concentration $S = 1 - kx$, where $k$ here is the number of non-nutrient chemical species. From this mean field equation, we obtain $S = D S_0 / (D + \epsilon\rho)$ with $S_0 = \sum_j \bar{n}_j / V$. With linear stability analysis, the solution with $S \neq 1$ is stable if $D < \epsilon\rho / (S_0 - 1) \equiv D_c$.
Indeed, this critical value does not differ much from numerical observation.

Next, we study how the concentrations of non-nutrient chemicals differentiate. Suppose that chemicals $\{i_0\}$ are synthesized directly from nutrients through catalysis by chemicals $j$. As the next step of the mean-field approximation we assume that the concentrations of the chemicals $\{i_0\}$ are larger than the others. Now we represent the dynamics by two mean-field concentrations: the concentration of the $\{i_0\}$ chemicals, $x_0$, and the concentration of the others, $x_1$. The solution with $x_0 \neq x_1$ satisfies $x_0 \approx x_1/\rho$ at the critical point $D_c$. Since the fraction of the $\{i_0\}$ chemicals among the non-nutrient chemicals is $\rho$, the relative abundance of the chemicals $\{i_0\}$ is inversely proportional to this fraction. Similarly, one can compute the relative abundances of the chemicals of the next layer synthesized from $i_0$. As $D \to D_c$, this hierarchy of the catalytic network is continued. In general a given layer of the hierarchy is defined by the chemicals whose synthesis from the nutrients is catalyzed by the layer one step down in the hierarchy. The abundance of chemical species in a given layer is $1/\rho$ times larger than that of chemicals in the layer one step down. Thus, moving up one layer in this hierarchy, the chemical abundance increases by a factor of $1/\rho$ while the number of chemical species decreases by a factor of $\rho$. This is the reason for the emergence of a power-law with an exponent -1 in the rank-ordered distribution [14].

In general, as the flow of nutrients from the environment increases, a hierarchical catalytic network emerges from random reaction networks. This hierarchy continues until it covers all chemicals, as $D \to D_c - 0$. Hence, the emergence of a power-law distribution of chemical abundances near the critical point is quite general, and does not rely on the details of our model, such as the network configuration or the kinetic rules of the reactions. Instead it is a universal property of a cell that grows at the critical state by taking in nutrients through an intra-cellular reaction network, as has been confirmed by simulations of a variety of models.

There are two reasons to assume that such a critical state of the reaction dynamics is adopted in existing cellular systems. First, as shown in Fig.3, the growth speed of a cell is maximal at $D = D_c$. This suggests that a cell whose reaction dynamics are in the critical state should be favored by natural selection. Second, at the critical point, the similarity of chemical compositions between the mother and daughter cell is maximal, as shown in Fig.3.
Indeed, for $k > N$, the chemical compositions differ significantly from generation to generation when $D \ll D_c$. When $D \approx D_c$, several semi-stable states with distinct chemical compositions appear. Daughter cells in the semi-stable states inherit chemical compositions that are nearly identical to those of their mother cells over many generations, until fluctuations in molecule numbers induce a transition to another semi-stable state. This means that the most faithful transfer of the information determining a cell's intra-cellular state occurs at the critical state. (Inheritance of chemical compositions is also discussed in [16] in connection with the origin of reproducing cells.) In this state, cells of specific chemical compositions are reproduced and can also evolve into other states. For these reasons, it is natural to conclude that evolution favors a critical state [15] for the reaction dynamics.
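
The layer argument given above (each layer down has 1/ρ times more species, each with ρ times the abundance) can be checked numerically. The sketch below builds such an idealized hierarchy and fits the rank-ordered abundances on log-log axes. It is an illustration of the scaling argument only, not the model simulation; the function names, the choice ρ = 0.5, and the number of layers are assumptions made to keep the example small.

```python
import math

def layered_abundances(rho, n_layers, top_abundance=1.0):
    """Idealized hierarchy: layer m holds round(rho**-m) species,
    each with abundance top_abundance * rho**m."""
    abundances = []
    for m in range(n_layers):
        n_species = int(round(rho ** (-m)))
        abundances.extend([top_abundance * rho ** m] * n_species)
    return abundances  # already ordered from most to least abundant

def rank_slope(values):
    """Least-squares slope of log(value) against log(rank), rank starting at 1."""
    xs = [math.log(r) for r in range(1, len(values) + 1)]
    ys = [math.log(v) for v in values]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx

# Assumed toy values: rho = 0.5 and 12 layers give 4095 species in total.
hierarchy = layered_abundances(rho=0.5, n_layers=12)
slope = rank_slope(hierarchy)  # close to -1; the staircase shape adds a small offset
```

The fitted slope is not exactly -1 because abundances are constant within a layer; the envelope of the staircase, however, falls off as the inverse of the rank.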

Last, we investigated the relationship between the abundance of a chemical species and the number of reaction paths connected with it. By comparing the SAGE data and the protein-protein interaction data in yeast (S. cerevisiae) [17, 18], obtained by systematic two-hybrid analysis, we found that there is a significant negative correlation between the abundance of any given mRNA and the number of protein-protein interaction links that the corresponding protein takes part in (p < 0.01; determined by randomization test). In our model simulations, this negative correlation between the abundance of a chemical species and the number of possible catalytic paths of the chemical is also found. In this sense, chemicals that are minor in abundance can play a relatively important role in the control of the behavior of a cell [19]. In the future it will be important to study this kind of interplay in the context of evolution, since the evolution of reaction networks has so far been discussed only in the context of network topology [1, 2].

We would like to thank Tetsuya Yomo and Lars Martin Jakt for stimulating discussions and Frederick H. Willeboordse and Adam Ponzi for critical reading of the manuscript. This work was supported by Grants-in-Aid for Scientific Research from the Ministry of Education, Science and Culture of Japan (11CE2006).

[1] H. Jeong et al., Nature 407, 651 (2000).
[2] H. Jeong, S. P. Mason, A.-L. Barabási, Nature 411, 41 (2001).
[3] G. K. Zipf, Human Behavior and the Principle of Least Effort (Addison-Wesley, Cambridge, 1949).
[4] A. E. Lash et al., Genome Res. 10(7), 1051 (2000).
[5] V. E. Velculescu et al., Cell 88, 243 (1997): SAGE Data is available from http://www.sagenet.org/
[6] S. J. Jones et al., Genome Res. 11(8), 1346 (2001): SAGE Data is available from http://elegans.bcgsc.bc.ca/sage/
[7] V. E. Velculescu, L. Zhang, B. Vogelstein, K. W. Kinzler, Science 270, 484 (1995).
[8] Even if the reaction coefficient and diffusion coefficient of penetrating chemicals are not identical but distributed, the results reported here are obtained.
[9] K. Kaneko, T. Yomo, Bull. Math. Biol. 59, 139 (1997); C. Furusawa, K. Kaneko, Phys. Rev. Lett. 84, 6130 (2000); Jour. Theor. Biol. 209, 395 (2001); J. D. Farmer, S. A. Kauffman, and N. H. Packard, Physica D 22, 50 (1986). In contrast to these studies, auto-catalytic reactions are not relevant to the present study.
[10] The nutrient chemicals have no catalytic activity in order to prevent the occurrence of catalytic reactions in the environment.
[11] Note that, in the case $k > N_{max}$, the number $n_i$ of some chemical species is 0, while a subpopulation of chemical species sustains the intra-cellular dynamics.
[12] These three classes of intra-cellular dynamics also appear when changing the connection rate $\rho$. There is a critical value $\rho = \rho_c$, where in the case $\rho < \rho_c$ the cell stops growing. The power-law distribution of chemical abundances with an exponent -1 appears at $\rho = \rho_c$.
[13] In the case depicted in Fig.2, a hierarchical organization of catalytic reactions with 5 to 6 layers is observed at the critical point.
[14] Within a given layer, a further hierarchy exists, which again leads to the Zipf rank distribution. For details, see C. Furusawa and K. Kaneko, to be published.
[15] P. Bak and K. Sneppen, Phys. Rev. Lett. 71, 4083 (1993); P. Bak, How Nature Works (Springer, New York, 1996).
[16] D. Segré, D. Ben-Eli, D. Lancet, Proc. Natl. Acad. Sci. USA 97(8), 4112 (2000).
[17] P. Uetz et al., Nature 403, 623 (2000).
[18] I. Xenarios et al., Nucleic Acids Res. 28, 289 (2000).
[19] K. Kaneko, T. Yomo, Jour. Theor. Biol. 214, 563 (2002).

[Figure 1: six log-log panels, (a) to (f), with ranking on the abscissa; a guide line with α = -1.0 is drawn in each panel.]

FIG. 1: Rank-ordered distributions of expressed genes. (a) Human liver; (b) kidney; (c) human colorectal cancer (caco2); (d) mouse embryonic stem cells; (e) C. elegans; (f) yeast (Saccharomyces cerevisiae). The exponent α of the power law is in the range from -1 to -0.86 for all the samples inspected, except for two plant data sets (seedlings of Arabidopsis thaliana and the trunk of Pinus taeda), whose exponents are approximately -0.63.

[Figure 2: two log-log panels, (a) and (b), plotting number of molecules against rank. Panel (a) overlays curves for D = 0.1, 0.02, 0.01, and 0.001, with a guide line α = -1.]

FIG. 2: Rank-ordered number distributions of chemical species. (a) Distributions with different diffusion coefficients $D$ are overlaid. The parameters were set as $k = 5 \times 10^6$, $N_{max} = 5 \times 10^5$, and $\rho = 0.022$. 30% of the chemical species can penetrate the membrane, and the others cannot. Among the penetrable chemicals, 10 chemical species are continuously supplied to the environment as nutrients. In this figure, the numbers of nutrient chemicals in a cell are not plotted. With these parameters, $D_c$ is approximately 0.1. (b) Distributions at the critical points with different total numbers of chemical species $k$ are overlaid. The numbers of chemical species were set as $k = 5 \times 10^4$, $k = 5 \times 10^5$, and $k = 5 \times 10^6$, respectively. Other parameters were set the same as those in (a).
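
To make the model behind Fig. 2 concrete, the following minimal sketch builds a small random catalytic network and evaluates the right-hand side of the rate equation given in the main text. The toy parameters are far smaller than those in the caption. The helper names, the rule for choosing the catalyst of each reaction, and the placement of nutrients at the first indices are all assumptions made for illustration, not the authors' implementation.

```python
import random

def build_network(k, rho, n_nutrients, rng):
    """For each ordered pair (i, j), with probability rho add a reaction
    i + l -> j + l. The catalyst l is drawn uniformly from the non-nutrient
    species (assumption), so nutrients never catalyze anything."""
    reactions = []
    for i in range(k):
        for j in range(k):
            if i != j and rng.random() < rho:
                l = rng.randrange(n_nutrients, k)
                reactions.append((i, j, l))
    return reactions

def rate_rhs(n, reactions, eps, D, sigma, n_env, V):
    """dn_i/dt: catalytic conversion terms plus membrane transport."""
    N = sum(n)
    dn = [0.0] * len(n)
    for i, j, l in reactions:
        flux = eps * n[i] * n[l] / N ** 2  # i is converted to j, catalyzed by l
        dn[i] -= flux
        dn[j] += flux
    for i in range(len(n)):
        dn[i] += D * sigma[i] * (n_env[i] / V - n[i] / N)
    return dn

# Toy setup (hypothetical values): 20 species, the first 2 are penetrable nutrients.
rng = random.Random(0)
k, n_nutrients = 20, 2
reactions = build_network(k, rho=0.1, n_nutrients=n_nutrients, rng=rng)
n = [10.0] * k
sigma = [1] * n_nutrients + [0] * (k - n_nutrients)
n_env = [5.0] * n_nutrients + [0.0] * (k - n_nutrients)

dn_closed = rate_rhs(n, reactions, eps=1.0, D=0.0, sigma=sigma, n_env=n_env, V=1.0)
dn_open = rate_rhs(n, reactions, eps=1.0, D=0.05, sigma=sigma, n_env=n_env, V=1.0)
```

With D = 0 the reaction terms only convert one species into another, so the total number of molecules is conserved; with D > 0 the net growth comes entirely from the transport term. Integrating these equations in time and dividing the cell at the threshold would be a further step that is not shown here.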

[Figure 3: growth speed (a.u., left axis) and similarity H (right axis) plotted against the diffusion coefficient D on a logarithmic scale; the position D = D_c is marked.]

FIG. 3: The growth speed of a cell and the similarity between the chemical compositions of the mother and daughter cells, plotted as a function of the diffusion coefficient $D$. The growth speed is measured as the inverse of the time for a cell to divide. The degree of similarity between two different states $m$ (mother) and $d$ (daughter) is measured as the scalar product of $k$-dimensional vectors, $H(\mathbf{n}_m, \mathbf{n}_d) = (\mathbf{n}_m / |\mathbf{n}_m|) \cdot (\mathbf{n}_d / |\mathbf{n}_d|)$, where $\mathbf{n} = (n_1, n_2, \ldots, n_k)$ represents the chemical composition of a cell and $|\mathbf{n}|$ is the norm of $\mathbf{n}$ [16]. Both the growth speed and the similarity are averaged over 500 cell divisions. Note that the case $H = 1$ indicates an identical chemical composition between the mother and daughter cells.
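
The similarity measure H defined in the caption is the scalar product of the two normalized composition vectors. A minimal sketch follows; the function name and the example compositions are hypothetical.

```python
import math

def similarity(n_m, n_d):
    """H(n_m, n_d): scalar product of the two composition vectors,
    each divided by its Euclidean norm."""
    dot = sum(a * b for a, b in zip(n_m, n_d))
    norm_m = math.sqrt(sum(a * a for a in n_m))
    norm_d = math.sqrt(sum(b * b for b in n_d))
    return dot / (norm_m * norm_d)

# Hypothetical compositions (molecule numbers per chemical species).
mother = [400, 200, 100, 0]
daughter_same = [200, 100, 50, 0]   # same ratios, half the molecules
daughter_diff = [0, 0, 50, 300]     # mostly different species

h_same = similarity(mother, daughter_same)
h_diff = similarity(mother, daughter_diff)
```

Because both vectors are normalized, H depends only on the population ratios and not on the total number of molecules, so a daughter that inherits exactly half of every species still gives H = 1.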