Exploring the Evolution of Modularity in Gene Regulatory Networks

Size: px
Start display at page:

Download "Exploring the Evolution of Modularity in Gene Regulatory Networks"

Transcription

1 University of Vermont UVM UVM Honors College Senior Theses Undergraduate Theses 2015 Exploring the Evolution of Modularity in Gene Regulatory Networks Mariko L. Totten University of Vermont, mtotten@uvm.edu Follow this and additional works at: Recommended Citation Totten, Mariko L., "Exploring the Evolution of Modularity in Gene Regulatory Networks" (2015). UVM Honors College Senior Theses. Paper 69. This Honors College Thesis is brought to you for free and open access by the Undergraduate Theses at UVM. It has been accepted for inclusion in UVM Honors College Senior Theses by an authorized administrator of UVM. For more information, please contact donna.omalley@uvm.edu.

2 Exploring the Evolution of Modularity in Gene Regulatory Networks An Undergraduate Honors College Thesis Presented By: Mariko Totten Advised By: Joshua Bongard, Ph. D. Department of Computer Science University of Vermont, Burlington, April 23,

3 Contents Abstract Introduction Modeling the GRN Modularity in the GRN Espinosa-Soto and the Evolution of Modularity in GRNs Further Exploration of the Evolution of Modularity in GRNs Methods Network Model Simulating Specialization Network Evaluations and Fitness Modularity Evolutionary Simulations Statistical Analysis Results Robustness and Modularity in Specialized GRNs Biased Mutation in the Evolution of Modularity in GRNs Discussion Implications a Influence of Robustness on the Evolution of Modularity in Specialized GRNs b Influence of Biased Mutation on the Evolution of Modularity in Specialized GRNs Analysis Summary Limitations Future Research Conclusion References

4 Abstract Modularity has been observed in biological systems of all shapes and sizes. Intuitively, we recognize that this component-based structure can increase efficiency and robustness in a system. Gene regulatory networks are a perfect example of a biological system that exhibits modularity, however, just how this modularity evolved is still unresolved. The research of Espinosa-Soto and Wagner supports the hypothesis the modularity observed in biological gene regulatory networks evolved as a result of specialization in gene activity (Espinosa-Soto & Wagner, 2010). By manipulating the implementation of their research we may further explore the conditions that drive the evolution of modularity. Specifically, this study explores how the robustness of a network and use of biased mutation may influence how modularity is evolved in gene regulatory networks. 3

5 1. Introduction This study investigates the evolution of modularity in gene regulatory networks (GRNs). GRNs dictate cell behavior and organism development and can be modeled using Boolean networks. GRNs have a modular structuring, but how this modularity came to evolve in the first place is still a topic of debate. Computer modeling can provide critical insight into the behavior of GRNs and could formulate hypotheses to probe how modularity evolves. Research performed by Carlos Espinosa-Soto and Andreas Wagner in their 2010 study Specialization Can Drive the Evolution of Modularity (Espinosa-Soto & Wagner, 2010) concluded that, as the title suggests, specialization in gene activity may cause modularity to evolve in GRNs. This research examined how certain aspects of the model used by Espinosa-Soto and Wagner (2010) influenced their findings on the evolution of modularity. 1.1 Modeling the GRN While every cell in an organism contains the same set of genes the way individual cells behave varies extraordinarily. This is because a cell s behavior is defined by how the cell s genes are expressed. The process of changing a cell s gene expression through the activation and repression of genes is called gene regulation. Subsequently, the GRN, which is comprised generally of genes and the interactions between them, is the control system for organism development. Gene activity pattern (GAP) refers to the collective activity states of all of the genes in a cell at a given time. It is this pattern that defines a cell s behavior and subsequently an organism s phenotype. In this way, the GAP can be viewed as a cell s identifying signature. For example, the gene regulation that occurs during development may cause stem cells to fall into two unique GAPs that distinguish which cells are to become liver cells and which will become 4

6 heart cells. Gene regulation also occurs over the course of an organism s lifetime in response to environmental changes. In this case, external inputs may pressure a set of cells to adjust their current GAP, for example causing fur to grow thicker in colder weather. As one can imagine, when this network is composed of thousands of inter-regulating genes the complexity of interactions and resulting behavior becomes difficult to follow. Biologists can research the actions of specific genes but difficulty studying the network as a whole. In order to predict the behavior of a network it becomes critical to look at the entire set of interactions rather than focusing on individual genes (Fernández & Solé, 2003). Mathematical models can be used to simulate complex interactions in order to emulate a GRN. Mathematically, a GRN can be understood as a mixed graph, G(N,U,D) in which the nodes, N, represent the network s genes and their associated activity. The set of directed edges, D, represent the direct causal interactions between genes, and the undirected edges, U, indicate associations between genes as a result of hidden confounding variables (Das, Caragea, Welch, & Hsu, 2010). Specifically, researchers have used Boolean networks as a simplified model for observing and predicting the behavior of GRNs. The biological basis for the development of Boolean networks as models of genetic regulatory networks lies in the fact that during regulation of functional states, the cell exhibits switch-like behavior, (Shmulevich, Dougherty, Kim, & Zhang, 2002). A Boolean network is comprised of a set of nodes with associated Boolean states. Nodes can be either on or off, which emulate genes in either the activated or repressed state. The state of each node is determined at every time step through a Boolean function, which uses logical functions to specify output for a given set of inputs (Fernández & Solé, 2003). In other words, the expression of each gene is determined at these time steps as a Boolean function of the current GAP (see Figure 1). Despite its obvious simplifications, Boolean networks have been 5

7 Figure 1. The left image shows a graphical representation of a Boolean network with nodes a, b, and c, and their interactions indicated by edges. The truth table on the right describes the logical functions used to determine the output of each node (Speith, 2004). shown to accurately predict the segmentation of Drosophila melanogaster in the work of Reka Albert and Hans Othmer (Albert & Othmer, 2003). By applying computational evolution to these Boolean networks we can produce a model of how GRNs may evolve. Computational evolution refers to the concept of using computational algorithms to simulate the evolution of a population in order to produce a more fit population. In particular, the evolution of GRNs can be modeled using canonical genetic algorithms. These algorithms are characterized by two main components selection and variation. First, the population must be assigned a goal by which its members will be evaluated. How well a member of the population is able to achieve this goal is called the member s fitness. The level of fitness a member attains in relation to others in the population determines whether that member will be selected to seed the next generation. The best performing members at the end of the fitness evaluation then undergo variation through random mutation and/or sexual recombination in order to produce a new generation of population members. This process of selection and variation over many generations creates pressure to continually improve the average fitness of the population (Eiben, 2003). Evolutionary algorithms can be used to find non-intuitive solutions to a problem or, in the case of this research, examine topics related to the evolutionary process. Specifically this research investigates how modularity may be produced through evolution. 6

8 1.2 Modularity in the GRN Modularity is the partitioning of a structure into semi-autonomous sub-units, and is ubiquitous in biology (Callebaut & Rasskin-Gutman, 2005; Schlosser & Wagner, 2004)). Inherently we can understand the benefits of such a structuring, and modularity has been identified as one of the three critical components in the study of complex biological systems, along with robustness and emergence (Aderem, 2005). In order to understand a biological system it is critical to study not only how a system may be partitioned into modules, but what effect this has on the system and how this modular structuring came to evolve in the first place. There are two outstanding characteristics of modules: their integration concerning their internal relations (between their components) and their autonomy concerning their external relations (to elements of the context) (Schlosser & Wagner, 2004, p. 4-5). Thus the behavior of a module relies strongly on the functioning of other elements within that module, but little on the elements outside of it. As a result of the simple characteristic of modularity, there are two critical benefits of a modularly structured system. First, modularity improves the robustness of a system, or the ability to maintain behavior while experiencing external perturbations. When perturbations cause changes to a single module, any problems that arise are restricted to that module with little effect on the entire system (Aderem, 2005). Additionally, connectivity within a module may help to rapidly and effectively correct for the perturbation. Furthermore, modularity sets up that system for more efficient and effective evolution. Modularity improves evolvability through the dissociation of developmental processes (e.g. heterochrony), the duplication of subsequent divergence of developmental modules, and the cooption of features into new functions (Lorenz, Jeng, & Deem, 2011, p. 33). In simpler terms, modularity provides the ability for change to occur within one module without affecting the 7

9 greater system, such that each module develops to perform a specific independent function. These functions can then change over time to become more effective or be slightly adjusted to be utilized in a different circumstance. Modularity in a biological network refers to the idea that the network can be partitioned into densely interconnected groups of nodes with little intragroup connectivity (see Figure 2). Specifically in a GRN, this means that within a module the genes greatly influence the activity Figure 2. This graphical representation of a network demonstrates the partitioning of nodes into modules, indicated by the red, green and blue colored groups. Observing the edges between nodes shows dense connectivity within each module and sparse connectivity between modules (Amsen, 2013). state of one another, but do not greatly affect the state of any genes outside of the module. Modularity in the regulatory network is especially noteworthy because modularity in the phenotype is an immediate consequence of a developmental modularity, (Lorenz et al., 2011, p. 33). Besides imposing modularity on the phenotype, modularity in the GRN also has been observed to foster robustness and evolvability. GRNs maintain their intrinsic behavior even when they are perturbed externally, (Espinosa-Soto & Wagner, 2010, 2) and make the evolution of new complex circuits and resulting phenotypes easier, (Lorenz et al., 2011, p.23). While modularity and its effects have been observed in GRNs, there is still no definitive understanding of how and to what strength this modularity evolves. Previous research suggested three prominent theories regarding why modularity evolves: 1) Modularity arises from a pressure 8

10 to reduce the connection costs of interactions between nodes (Clune, Mouret, & Lipson, 2013); 2) Modularity occurs as a result of simultaneous directional and stabilizing selection (Wagner, 1996); 3) Modularly-varying evolutionary goals due to changes in organisms environment drives the evolution of modularity (Kashtan & Alon, 2005). 1.3 Espinosa-Soto and the Evolution of Modularity in GRNs A study by Espinosa-Soto and Wagner provided an alternative hypothesis for the evolution of modularity in GRNs (Espinosa-Soto & Wagner, 2010). Their work used evolutionary computation on a Boolean model of GRNs to support their hypothesis that specialization in gene activity drives the regulatory network toward a modular organization. Specialization is the creation of a new GAP that may occur during a lifetime, such as alteration of a specific body function due to certain environmental conditions. For example, consider the gene regulation that occurs during organism development. The environment surrounding a stem cell signals whether that cell should become a liver cell rather than a heart cell. Subsequently, some portion of the GAP is defined to exhibit the unique characteristics of a liver cell. Alternatively, other traits are consistent throughout many different types of cells in the organism despite different environmental conditions. For example, in all cells the regulatory network should produce the portion of the GAP that builds cell walls. The favoring of a ubiquitous trait such as this is referred to as generalization. Consequentially, selection must simultaneously favor maintaining the activity states of some set of genes while changing the desired activity states of others. Espinosa-Soto and Wagner (2010) suggest that modularity arises within the network so that both of these patterns can be accommodated simultaneously. Espinosa-Soto and Wagner s evolutionary model (2010) supported their hypothesis that modularity develops in GRN s as a result of specialization (see Figure 3.) 9

11 Figure 3. These figures show the results of the research performed by Espinosa-Soto and Wagner (2010). (A) shows the GAPs used their evolutionary simulations. Genes 0-4 in either GAP share the same activation states, to represent the ubiquitous expression of a trait. The states of genes 5-9 differ between GAPs to indicate specialization in gene activity unique to certain regions of the body or environmental conditions. (B) and (C) show the level of modularity, as measured by a normalized Q value (see equation 3), after selection for GAP I versus GAP I and II. Specifically (B) shows the modularity of the highest performing network in each population and (C) shows the average modularity of a population. In both graphs we see that modularity tends to be higher after selection for both GAPs. (D) and (E) are examples of high fitness networks, with modules indicated by different colors. (D) is a network after selecting for GAP I and (E) is after selection for I and II. Clearly, (E) is far more modular than (D). 1.4 Further Exploration of the Evolution of Modularity in GRNs The research performed by Espinosa-Soto and Wagner (2010) provides a framework to continue exploring the evolution of modularity in GRNs. By recreating the basis of their research we can then manipulate aspects of their implementation to examine how these conditions may affect how strongly modularity evolves. This research seeks to explore two critical aspects of their implementation. 10

12 First, the equation for random mutation used by Espinosa-Soto and Wagner (2010) pressured networks to be sparsely connected, averaging two to three connections per gene. The researchers explained that this bias was used because it is an accurate reflection of the low connectivity observed in natural GRNs, among other biological networks. While this is true, the logic behind these specific numbers lies in the property of criticality in GRNs (Torres-Sosa, Huang, & Aldana, 2012). This property is a result of how the average number of incoming connections into each node of a network influences the average cycle length of the attractor a network falls into. Criticality refers to the distinct number of regulators that keeps this cycle length relatively small, around 1, but still allows the network to experience variation. Research has shown that in the right circumstances this level of criticality tends to be reached by two to three regulators per gene (Espinosa-Soto & Wagner, 2010). Additionally, by pressuring networks to remain sparse, this model essentially applies connection costs to the GRN. Recall, one theory for the evolution of modularity suggested that connection costs of interactions pressures networks to become modular (Clune et al., 2013). It is expected that removing this biased mutation will remove connection costs and allow more regulators per gene, thus pushing the network out of criticality, and subsequently will decrease the level of modularity in a specialized GRN. Additionally, Espinosa-Soto and Wagner s evolutionary model (2010) favored very robust networks. Networks were evaluated 500 times in the face of different perturbations, so that only the most robust networks received high fitness. However, as noted before, there is a significant relationship between modularity and robustness (Aderem, 2005). Studies suggest that modularity creates robustness in a network by isolating problems within a module and strengthening the ability of a module to correct itself. In turn it is conceivable that robustness 11

13 may reinforce modularity, because a network that does not face many perturbations does not need to count on modules to confine problems. Therefore, it is informative to examine different levels of perturbations, which will in turn create selection pressure for varyingly robust networks, to observe how this may influence the evolution of modularity in specialized GRNs. 12

14 2. Methods The research was conducted so as to replicate the 2010 research of Espinosa-Soto and Wagner s investigation of the evolution of modularity through specialization. Their methods were re-implemented as closely as possible given computational power and time restraints. Several specific variables were then modified to examine their role in the evolution of modularity. 2.1 Network Model For this research, the GRN was modeled using a Boolean network. Though this model is a very simplified representation of a complex system, it can be argued that a Boolean network effectively captures the computational nature of gene regulation (Fernández & Solé, 2003). The network is comprised of ten genes each of which can be in either in an active (on) or repressed (off) state. The composite state of the genes at a given time defines a GAP, which is represented by a vector s t = [s t, s t,, s t ]. An element s i t of this vector corresponds to the expression of gene i at time t, and can hold a value of either -1 (repressed) or 1 (active) (see Figure 4). Figure 4. In this example of a GAP in the 10 gene network each block represents the activation state of its associated gene. White blocks indicate an activated gene and black blocks indicate a repressed gene. The GAP at time t is determined by the interactions between the network s genes (see Figure 5A). These interactions are represented by a 10x10 adjacency matrix, A = [a ij ] (See Figure 5B). An element a ij of this matrix signifies the influence gene j is exerting on gene i in the form of the activation (a ij = 1), repression (a ij = -1), or no interaction (a ij = 0). Using these 13

15 values, the state of a gene at the next time step can be determined by multiplying the current state of the gene by its regulators, as described in the formula: s j t+ j=1 a ij s j t ] (1) where (x) equals 1 if x > 0 or -1 otherwise. Figure 5. (A) shows a network of genes and their associated interactions. Thicker edged indicate that an interaction exists both from gene j to gene i and from gene i to gene j. Grey edges indicate a repressive influence and black edges indicate an activating influence. Bolded nodes indicate the existence of a self-loop, that is that gene i influences itself. (B) is the associated A ij adjacency matrix for this network, where a ij indicates the influence of gene j on gene i. The GAP of a network dictates the expression of phenotypic traits. Specifically, in this model we consider the resulting phenotypic trait to be defined by the point attractors that the GAP falls into after a number of time steps. This model updates a network using the equation defined above until the network falls into one of two forms of attractors: fixed point, in which the same GAP is sustained indefinitely after a certain time step, or a cyclic attractor which repeatedly produces the same sequence of GAPs. 2.2 Simulating Specialization Recall, specialization in GAPs refers to the idea that over a lifetime new GAPs may evolve in a specific body part or under certain environmental conditions (Espinosa-Soto & 14

16 Wagner, 2010). In order to demonstrate how specialization increases modularity, networks were first evolved without specialization. These networks were evaluated on their ability to attain GAP I. After these networks were successfully evolved, specialization was simulated by introducing a second pattern, GAP II. Networks were then evaluated on their ability to attain both pattern I and II. These patterns were specifically chosen so that patterns I and II shared the activity state of the first 5 genes (genes 0-4), and differed in activity state of the last 5 genes (genes 5-9) (see Figure 6). Figure 6. Networks are evolved to demonstrate either just GAP I or both GAP I and II. Black and white boxes represent inactive (s i = -1) and active (s i = 1) genes, respectively. The first five genes, indicated by the blue region, share the same activation in both GAPs. The latter 5 genes, indicated by the red region, differ in activation states Espinosa-Soto and Wagner (2010) suggested that this specialization increases modularity because a network was most successful if it could maintain the state of the first five genes but change the states of the last five when facing different initial conditions. Thus a network would benefit by breaking interactions between these two sets of genes, creating two independently functioning modules. 2.3 Network Evaluations and Fitness The fitness of each network is based on its ability to produce a fixed-point attractor for a specified GAP. Each network was evaluated using the following steps. First, the network s gene activity state is initialized to a perturbation of the desired GAP. These perturbations were created by altering the state of each gene in the desired GAP using a probability of p = Then, beginning with this initial perturbation, the GAP was calculated at every time step using 15

17 equation (1) until the network fell into either a point or cyclic attractor. Cyclic attractors were given a fitness of 0, because the goal of the network is to stably attain a GAP to simulate the stable expression of a phenotypic trait. Those networks that fell into point attractors were given a fitness value based on the Hamming distance, D, between the desired GAP and the final GAP of the system once it had fallen into a point attractor. Specifically, for each perturbation, a network was assigned a value of (1-D/D max ) 5, with accordance to Espinosa-Soto and Wagner (2010). Thus, best performing networks were those that were able to be initialized with a perturbation of a desired GAP, then remedy this perturbation so that the network fell into a point attractor that expressed the desired trait or GAP (see Figure 7). Figure 7. This figure demonstrates the process of a successful network for GAP I. At t = 0 the network s genes are initialized with the perturbation of GAP I (genes 1, 6, and 7 have been switched). Equation (1) is then used to produce the new activation states at each time step. At t = 4 and t = 5 the exact same GAP is expressed, indicating that the network has fallen into a point attractor. The final activation state of each gene matches that of GAP I, so the Hamming distance is 0 and this network, at least with this perturbation, has performed well. In order for a network to be robust it needed to be able to produce the desired GAPs in the face of many perturbations. The level of robustness in a network directly correlates to the number of perturbations the network experiences during evaluation. Espinosa-Soto and Wagner (2010) sought to examine only very robust networks, so all networks were evaluated against 500 perturbations. For this research, networks were evaluated against = 20, 30, 40, 50, 75, 16

18 or 100 perturbations in order to examine how the level of robustness influenced the strength of modularity that evolved. Thus, this process of initializing a network with a perturbation and then updating until an attractor was reached was repeated for the assigned number of perturbations,. Then the mean of the values for each of the perturbations was calculated, g = ( i=1 Finally, the network fitness was found using the equation = f(g) = 1 e -3g (2) This fitness function was used in order to replicate the work of Espinosa-Soto and Wagner (2010). 2.4 Modularity Modularity was measured using by calculating Q values to identify the strength of intermodule versus intra-module connectivity. For our model, we assumed that modularity would favor the partitioning of the network into two modules. These modules would be separated into genes that shared activity states in both GAPs (genes 0-4) and those that had opposing gene activity states between GAPs (genes 5-9). Specifically, Q is measured using the equation: Q = i=0 ( l i /L (d i /2L) 2 ) (3) In this equation, i represents the 2 modules that were identified above. L indicates the total number of edges in the network, l i stands for the number of edges in module i, and d i represents the total number of edges that leave, and enter, module i. This equation for modularity is a nonnormalized substitute for the Q measurement used by Espinosa-Soto and Wagner (2010). 17

19 2.5 Evolutionary Simulations Evolution was performed on populations of 100 networks. The initial population was created by randomly initializing a network with 20 interactions and then creating 99 mutations of this network. The mutation process was performed one of two ways. First, to evaluate the varying levels of perturbations, mutation was performed as per the research of Espinosa-Soto and Wagner (2010). In this case, networks were mutated using a biased mutation operator that favored networks with low connectivity, about 2-3 regulators per gene. For each gene, the probability of a mutation occurring ( ) was Given that a gene was to undergo mutation, the probability of losing an interaction was calculated p(u) = ( 4r u ) / ( 4r u + ( N r u ) ) (4) In this equation, r u represents the number of regulators on gene u and N is the number of genes in a network i.e. the maximum possible number of regulators on a gene. Conversely, the probability of acquiring an interaction was calculated using the equation q(u) = 1 p(u). A gene that was selected to acquire a new interaction had equal probability of gaining an activating or repressing regulator. A second goal of this research was to investigate the influence of biased mutation on the evolution of modularity. In order to create control data to compare to that of biased mutation, evolution was performed using the simplest form of mutation. This completely unbiased mutation chose a single interaction at random from a network and then reassigned it a new random interaction. The new interaction had equal probability of flipping the state of the initial interaction value, removing the interaction, or remaining the same. This unbiased mutation was applied to populations using 75 perturbations, the minimal level of robustness found in this research to successfully recreate the results of Espinosa-Soto and Wagner (2010). 18

20 Once the initial population was created, networks were evaluated as described to attain fitness values, and the next generation was created using fitness proportional selection. That is, each network in the current generation was assigned a probability of selection based on the ratio of its fitness performance to the cumulative performance of the entire population, P(i) = i / ( j j ). These probability ratios were then used to choose which networks would contribute to the next generation, such that those networks that performed better were more likely to be selected. Every network that was selected to contribute to the next generation then went through the appropriate mutation process as described above. Populations were evolved for 500 generations, at which point networks could stably attain GAP I. Populations were then evolved for another 1500 generations and were evaluated for their ability to generate both GAP I and II. Q values were measured at the 500 th and 2000 th generation. A total of 20 evolutionary runs were performed for each level of perturbations. 20 additional runs were performed with the unbiased mutator for 75 perturbations. 2.6 Statistical Analysis For each evolutionary run we examined the Q values for the networks with the highest fitness at 500 generations (after evolving for GAP I) and 2000 generations (after evolving for GAP I and II). The highest fitness in a population can often be achieved by multiple networks in that population. In some instances this is because the same network was selected from the previous generation and was not changed during mutation. For populations with multiple mostfit networks, duplicate networks were thrown out and then the mean Q value of the remaining best networks was calculated as a representative of the Q value of the most-fit network. Thus for a given level of perturbation, each of the 20 evolutionary runs was assigned two Q values, one for the most-fit network after 500 generations and the other for the most fit 19

21 network after 2000 generations. A one-tailed Student s T test was performed with an of 0.05 to evaluate if the mean Q value after 2000 generations was significantly higher than the mean Q after 500 generations. The null hypothesis (h x 0 ) was that II <= I and the alternative hypothesis (h x 1 ) was that II > I where I was the mean Q of the most fit network after evolving for only GAP I (500 generations), II was the mean Q of the most fit network after evolving for both GAP I and II (2000 generations) and x was the number of perturbations used in the network evaluations. 20

22 3. Results Examination of the mean modularity of high-performing networks after evolving populations for just GAP I and then both GAP I and II provides insight into the extent that modularity increases due to specialization. 3.1 Robustness and Modularity in Specialized GRNs This research investigated how the level of robustness, as measured by the number of perturbations used to evaluate a network, influenced the evolution of modularity in a specialized GRN. For more robust networks evaluated using 75 and 100 perturbations we are 95% confident that we can reject the null hypothesis that the mean level of modularity, as measured by Q, of the highest performing networks after evaluating for GAP I and GAP II was equal to or less than the mean modularity after evaluation for GAP I alone. Alternatively, for an alpha of 0.05 the analysis did not suggest that we can reject the null hypothesis for less robust networks with any perturbation level lower than 75 ( = 20, 30, 40, 50 ) (see Table 1). For populations that were evaluated with 75 perturbations the mean modularity increased by 17.2% between evolution for GAP I and evolution for GAP I and II, an increase of from to This produced a t statistic of and a corresponding p-value of Populations evaluated with 100 perturbations increased 17.3% from to 0.417, with a t- statistic of and p-value of Since in both these instances p < 0.05, this data suggests that the mean Q value of the fittest networks after introducing specialization in GAPs is significantly higher than without specialization. The mean Q value also increased in population evaluated with 20, 30, 40, and 50 perturbations, but not enough to be considered statistically significant. The p-values for these means, in order of increasing number of perturbations, were 0.398, 0.061, 0.093, and

23 Number of Perturbations ( ) Mean Q of Fittest Networks - GAP I Mean Q of Fittest Networks - GAP I & II T-Statistic P-Value (One-Sided Student's T Test) Table 1.Statistical analysis of mean modularity of the fittest networks in a population before and after the introduction of specialization in gene activity for varying levels of robustness. 3.2 Biased Mutation in the Evolution of Modularity in GRNs The second part of this research investigated the effect of biased mutation on the evolution of GRNs. = 75 perturbations were used since this level of robustness demonstrated a significant increase in modularity after specialization. When all bias was removed from mutation there was no statistical evidence that the level of modularity increased after evolving for GAP I & II. In fact, the mean Q value of the fittest networks decreased by 1%, producing a t statistic of and a one-sided p-value of (see Figure 9.) Type of Mutation ( = 75 ) Biased Mutation Simple Mutation Mean Q of Fittest Networks - GAP I Mean Q of Fittest Networks - GAP I & II T-Statistic (38 dof) P-Value (One-Sided Student's T Test) Table 2. Statistical analysis of mean modularity of the fittest networks in a population before and after the introduction of specialization in gene activity with and without mutation bias. 22

24 4. Discussion 4.1 Implications This research examined how the level of modularity evolved in GRNs, the dependent variable, was influenced by two different independent variables network robustness and criticality. 4.1a Influence of Robustness on the Evolution of Modularity in Specialized GRNs. The examination of varying levels of perturbation on evolving modularity found statistical evidence that modularity was evolved in networks evaluated with at least 75 to 100 perturbations but not in networks with 20, 30, 40, or 50 perturbations. These results suggest that increasing the robustness of a network helps drive the evolution of modularity. However in order to examine the relationship between robustness and modularity it is critical to first examine the underlying relationship between robustness and fitness. Recall that fitness is calculated by averaging performance over perturbations of the desired GAP, where the probability that a gene state will be altered is Consider first the scenario where a network is evaluated only once against an unperturbed copy of the desired GAP (we will call this = 0). In this case it is not difficult for a network to find a solution since it does not need to be at all robust to change; a network that is able to obtain the desired GAP in one generation will be just as successful in the next generation (see Figure 8A). Now consider = 10; some networks may get lucky and encounter no perturbations at one generation, receive a high fitness, and subsequently get copied to the next generation. However in the next generation it encounters many perturbations, a totally new experience for this network, so its fitness drops. In this case, it may be common for some networks in a population to perform very well after only a few generations, however, the spread of fitness across a population is very large because a 23

25 network s fitness may vary a lot between generations depending on the perturbations it encounters (see Figure 8B). Finally, consider the case where = 100. Now, achieving a high fitness is not so trivial because a network has to be able to accommodate a variety of different conditions. However, this network, if not mutated, is less likely to experience a large change in fitness from one generation to the next because it has already seen a variety of perturbations. So, as we see in Figure 8C, it takes longer to increase fitness but the spread of population fitness decreases. A B C Figure 8. Each of these graphs demonstrate the fitness (y-axis) of each of 100 networks in a population over the first 10 generations of evolution (x-axis) using the number of perturbations indicated in the upper left corner. This scenario displays the correlation between the robustness and fitness of our model GRNs. Increasing the number of perturbations that a network is evaluated against increases the evolutionary pressure for the population to produce robust networks. Subsequently, the more robust a population is, the more difficult it is to produce a network with very high fitness but also the more consistently networks behave from generation to generation. It is important to note the differences in the fitness landscape as the number of perturbations increase because we need to ensure that the differences in statistical significance of modularity noted at different levels of perturbations are in fact a result of changes in robustness, not in the underlying effects on fitness. Specifically, in order to infer any significance it is necessary to evolve networks to a reasonably high level of fitness before analyzing Q values. Since it takes longer for more robust networks to achieve a high fitness it is critical that at all 24

26 levels of perturbation evolution is carried out for enough generations for the population to produce networks of a reasonably high fitness, for evaluation of both GAP I and GAP I and II. Espinosa-Soto and Wagner (2010) designed their research to evolve highly robust networks, using = 500 perturbations, and were able to obtain significant results by evaluating for GAP I for 500 generations and GAP I and II for an additional 1500 generations. Since this was a far greater level of robustness than performed in any of this research, it is assumed that using these same constraints provided enough time for the full evolution of all levels of perturbations. Additionally, Figure 9 demonstrates that there is no obvious difference in the fitness landscape of the fittest networks between varying levels of perturbations. While the effects of changing levels of robustness was obvious when examining populations for the first few generations, these differences were essentially smoothed out over many generations with the chosen numbers of perturbations. A B Introduce GAP II Introduce GAP II Figure 9. These graphs show the fitness of the best performing networks over 2000 generations. (A) gives the mean fitness of all 20 runs of the best performing network at each generation. (B) shows the fitness of the best performing network out of all 20 runs at each generation. 25

27 Given that there are no obvious differences in the fitness landscapes over 2000 generations between different levels of perturbations we can assume that it is in fact the variations in robustness that cause modularity to evolve differently. The fact that modularity only increased significantly when networks were evolved to be robust, = 75 and = 100, this suggests that robustness plays an important role in driving the evolution of modularity (see Figure 10). Recall that modularity is known to increase robustness in two ways. First, when a perturbation occurs its undesirable effects are isolated within a module due to sparse connectivity between modules. Secondly, within the module, dense connectivity is able to correct for the perturbation in the next time steps. By applying selection pressure for robustness in evolutionary simulations pressure is put on the evolving networks to discover modularity in order to solve problems created by perturbations. Specifically, this research identified the threshold of robustness that is necessary in order for the expected evolution of modularity to occur. A statistically significant increase in modularity occurred only at 75 perturbations or above. Networks evolved with lower levels of perturbations found solutions that were far less modular (see Figure 11). 26

28 Figure 10. Each of the above graphs shows the mean Q value of the best performing networks over 20 generations after evolution for GAP I and GAP I & II at each level of perturbation. Q was found to be statistically higher after evolving with specialization at 75 and 100 perturbations. 27

29 Figure 8. This figure shows two networks that were produced by 2000 generations of evolution and achieved the highest fitness in their population. The modules used in the calculation of Q are indicated by blue and green nodes. Grey edges are used for inter-module interactions and black edges are drawn for intra-module interactions. Bolded edges indicate that gene i influences gene j and gene j also influences gene i. Bolded nodes indicate that gene i has an interaction with itself (self-loop). (A) shows a network that was evaluated with 20 perturbations (Q = 0.192) and (B) shows a network evaluated with 100 perturbations (Q = 0.735). While both networks were able to relatively successfully produce specialized GAPs, network (B) was far more modular than network (A). 4.1b Influence of Biased Mutation on the Evolution of Modularity in Specialized GRNs. This research also began to investigate how the use of a biased mutation operator drove the evolution of modularity. This biased mutation forced the networks to average 2 3 regulators per gene, thus pressuring the networks toward criticality. Initial analysis of GRNs without the biased mutation showed no significant evolution of modularity (see Figure Figure 12. (A) and (B) show the mean Q value of the fittest networks after evolution with and without specialization with 75 perturbations. (A) shows that there was a statistically significant increase in modularity when evolution was performed with a biased mutator. (B) shows the same evolution without a biased mutation and shows no indication of an increase in modularity. (C) shows the fitness of mutation with and without mutation bias with 75 perturbations. 28

30 12). This result is likely due to the association between connection cost modularity (Clune et al., 2013). Furthermore, removing the pressure for two to three regulators per gene allows networks to evolve more connections, which causes these networks to behave less consistently. It should also be noted that simple mutation achieved relatively high Q values giving the impression of high modularity, though not increasing as a result of specialization. This is likely a misrepresentation due to a non-normalized measurement of Q calculated on networks with overall higher numbers of connections. 4.2 Analysis Summary This study found that 1) modularity evolved in specialized GRNs only when networks are evaluated with 75 or more perturbations 2) without bias mutation modularity does not evolve in specialized GRNs. While there were some limitations to this study, these findings provide opportunity for future research. 4.3 Limitations While this research provided new insight into the evolution of modularity in GRNs there were several aspects that limited the significance of the results. Namely, time constraints limited the amount and quality of the data that was collected. Given the computational power of the available machine, performing a single run at each level of perturbations required about 20 hours, and a single run with unbiased mutation at 75 perturbations required about 7 hours. For this reason, a total of only 20 runs were used for the statistical analysis. While this sample size is large enough to perform the Student s t test with 38 degrees of freedom, more runs would have provided a better idea of the distribution of Q values, a more definite idea of the true mean Q values, and stronger idea of statistical significance. 29

31 Furthermore, there are likely better measurements of modularity than the Q value used in this research. For one, Q was computed based on an assumed partitioning of the communities into two modules: genes 0-4 and genes 5-9. While these were the hypothesized regions of interest, forcing this assumption onto our measurement may have clouded the true modularity of a network. It would have been more appropriate to use a community detection algorithm, such as the Louvain method, to maximize the measurement of modularity (De Meo, Ferrara, Fiumara, & Provetti, 2011). Additionally, rather than using a raw Q value it would have been more indicative if this Q value had been normalized using the equation Q normal = ( Q Q ran ) / ( Q max Q ran ). Here Q is the maximal Q value of the observed network, Q ran is average Q value of random networks with the same degree distribution of the observed network, and Q max is the highest Q value attained by the group of random networks. Thus Q normal tells how modular a network is relative to random networks with the same attributes. This method of using community maximization and normalizing Q is especially important in measuring modularity of the networks evolved without the biased mutator since these networks tended to be more densely connected. 4.4 Future Research In addition to addressing the aforementioned limitations it would be interesting to research the effect of sexual recombination and crossover hotspots on the evolution of modularity. Along with random mutation, sexual recombination plays a crucial role in creating variation in the GRN between generations. Recombination refers to the process of one doublestranded DNA molecule joining with another; specifically in the context of meiosis, the process of two homologous chromosomes exchanging large portions of their DNA (this is called 30

32 crossing over ), (Hey, 2004, p. 0730). When a new organism is sexually reproduced, its genome is a composite of splicing together sections of the parents DNA. Within the mass of interconnected genes in the GRN lay recombination hotspots (Hey, 2004). These hotspots along the genome indicate locations where rates of breaking and recombining DNA are much higher. The locations of hotspots provide insight into the organization of the new GRN during sexual recombination. Currently, the relationship between cross-over hotspots and modularity has not been studied. However, due to their importance in sexual recombination, it is desirable to include them when modeling the evolution of the GRN. Given that modularity is favored by evolution in specialized GRNs, it is expected that evolution will utilize cross-over hotspots to facilitate and maintain modularity in the GRN. Like Espinosa-Soto and Wagner (2010), the model used in this research relied entirely on random mutation to create evolutionary variation between generations. Alternatively, during recombination a child network could inherit its values from recombining the values of two successful parent networks in order to produce the next generation. Additionally, the incorporation of crossover hotspots could be approached in several ways. First, sexual recombination could be implemented by using fitness proportional selection to choose two parent networks and then recombining these networks in a random way to produce a child network. This random form of recombination will provide control data to see if the addition of crossover hotspots facilitates the evolution of modularity. Next, crossover hotspots could be modeled by assigning each population an array, c i = (c 0,, c N-2 ), in which an element, c i, indicates the probability of a crossover occurring between gene i and gene i+1. Elements with a high probability represent crossover hotspots, and the highest probability in the array will dictate where recombination will occur. This array would be evolved along with the networks in a 31

33 population to determine the most effective position of recombination. Examination of how evolution discovers modularity and where evolution decides to recombine could give significant insight into the relationship between modularity and crossover hotspots. 5. Conclusion Modeling using mathematics and computer science can provide a unique contribution to biological science. Biology examines a problem with empirical evidence that can be collected and observed from the world. Subsequently, biology is restricted by the tools and technology available to observe and understand the inner workings of organic complex systems. This is especially restrictive in observing the behavior of large networks such as GRNs. Consequently, modeling with computer science and mathematics can be a cost effective and realistic way of studying the GRN. More explicitly, modeling using evolutionary computation has the ability to imitate the natural process of evolution in a semi-controlled and measurable environment. This ties perfectly into modularity because, while modularity has been extensively studied in the biological world, its evolutionary origin is still highly debated. By modeling the evolution of modular biological systems we can gain insight into how modularity might evolve. The results created by these models can then provide new hypotheses that may be tested later with biological experimentation. As discussed before, modularity in gene regulatory networks is crucial to the development and evolution of life but there is no definite answer for how this modularity arises during evolution. This model of GRNs with evolutionary computation provided new insight into the importance of robustness and criticality in evolving modularity in the GRN. Specifically, we have discovered that networks need to express a significant amount of robustness in order to 32

Dr. Amira A. AL-Hosary

Dr. Amira A. AL-Hosary Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological

More information

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut

Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic analysis Phylogenetic Basics: Biological

More information

What does it take to evolve behaviorally complex organisms?

What does it take to evolve behaviorally complex organisms? 1 Neural Systems and Artificial Life Group Institute of Psychology National Research Council, Rome What does it take to evolve behaviorally complex organisms? Raffaele Calabretta 1, Andrea Di Ferdinando

More information

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics.

Major questions of evolutionary genetics. Experimental tools of evolutionary genetics. Theoretical population genetics. Evolutionary Genetics (for Encyclopedia of Biodiversity) Sergey Gavrilets Departments of Ecology and Evolutionary Biology and Mathematics, University of Tennessee, Knoxville, TN 37996-6 USA Evolutionary

More information

Enduring understanding 1.A: Change in the genetic makeup of a population over time is evolution.

Enduring understanding 1.A: Change in the genetic makeup of a population over time is evolution. The AP Biology course is designed to enable you to develop advanced inquiry and reasoning skills, such as designing a plan for collecting data, analyzing data, applying mathematical routines, and connecting

More information

4. Identify one bird that would most likely compete for food with the large tree finch. Support your answer. [1]

4. Identify one bird that would most likely compete for food with the large tree finch. Support your answer. [1] Name: Topic 5B 1. A hawk has a genetic trait that gives it much better eyesight than other hawks of the same species in the same area. Explain how this could lead to evolutionary change within this species

More information

Specialization Can Drive the Evolution of Modularity

Specialization Can Drive the Evolution of Modularity Specialization Can Drive the Evolution of Modularity Carlos Espinosa-Soto 1,2 *, Andreas Wagner 1,2,3 1 Department of Biochemistry, University of Zurich, Zurich, Switzerland, 2 The Swiss Institute of Bioinformatics,

More information

The Evolution of Gene Dominance through the. Baldwin Effect

The Evolution of Gene Dominance through the. Baldwin Effect The Evolution of Gene Dominance through the Baldwin Effect Larry Bull Computer Science Research Centre Department of Computer Science & Creative Technologies University of the West of England, Bristol

More information

Non-Adaptive Evolvability. Jeff Clune

Non-Adaptive Evolvability. Jeff Clune Non-Adaptive Evolvability! Jeff Clune Assistant Professor Computer Science Evolving Artificial Intelligence Laboratory Evolution Fails to Optimize Mutation Rates (though it would improve evolvability)

More information

Molecular evolution - Part 1. Pawan Dhar BII

Molecular evolution - Part 1. Pawan Dhar BII Molecular evolution - Part 1 Pawan Dhar BII Theodosius Dobzhansky Nothing in biology makes sense except in the light of evolution Age of life on earth: 3.85 billion years Formation of planet: 4.5 billion

More information

Big Idea 1: The process of evolution drives the diversity and unity of life.

Big Idea 1: The process of evolution drives the diversity and unity of life. Big Idea 1: The process of evolution drives the diversity and unity of life. understanding 1.A: Change in the genetic makeup of a population over time is evolution. 1.A.1: Natural selection is a major

More information

Exercise 3 Exploring Fitness and Population Change under Selection

Exercise 3 Exploring Fitness and Population Change under Selection Exercise 3 Exploring Fitness and Population Change under Selection Avidians descended from ancestors with different adaptations are competing in a selective environment. Can we predict how natural selection

More information

Introduction to Digital Evolution Handout Answers

Introduction to Digital Evolution Handout Answers Introduction to Digital Evolution Handout Answers Note to teacher: The questions in this handout and the suggested answers (in red, below) are meant to guide discussion, not be an assessment. It is recommended

More information

AP Curriculum Framework with Learning Objectives

AP Curriculum Framework with Learning Objectives Big Ideas Big Idea 1: The process of evolution drives the diversity and unity of life. AP Curriculum Framework with Learning Objectives Understanding 1.A: Change in the genetic makeup of a population over

More information

Computational approaches for functional genomics

Computational approaches for functional genomics Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding

More information

A A A A B B1

A A A A B B1 LEARNING OBJECTIVES FOR EACH BIG IDEA WITH ASSOCIATED SCIENCE PRACTICES AND ESSENTIAL KNOWLEDGE Learning Objectives will be the target for AP Biology exam questions Learning Objectives Sci Prac Es Knowl

More information

AP Biology Essential Knowledge Cards BIG IDEA 1

AP Biology Essential Knowledge Cards BIG IDEA 1 AP Biology Essential Knowledge Cards BIG IDEA 1 Essential knowledge 1.A.1: Natural selection is a major mechanism of evolution. Essential knowledge 1.A.4: Biological evolution is supported by scientific

More information

Written Exam 15 December Course name: Introduction to Systems Biology Course no

Written Exam 15 December Course name: Introduction to Systems Biology Course no Technical University of Denmark Written Exam 15 December 2008 Course name: Introduction to Systems Biology Course no. 27041 Aids allowed: Open book exam Provide your answers and calculations on separate

More information

Evolutionary dynamics of populations with genotype-phenotype map

Evolutionary dynamics of populations with genotype-phenotype map Evolutionary dynamics of populations with genotype-phenotype map Esther Ibáñez Marcelo, Tomas Alarcon Cor Biomat 2013: Mathematics of Planet Earth CENTRE DE RECERCA MATEMÀTICA 17-21 June 2013 Esther Ibáñez

More information

Modes of Macroevolution

Modes of Macroevolution Modes of Macroevolution Macroevolution is used to refer to any evolutionary change at or above the level of species. Darwin illustrated the combined action of descent with modification, the principle of

More information

Chapter 8: Introduction to Evolutionary Computation

Chapter 8: Introduction to Evolutionary Computation Computational Intelligence: Second Edition Contents Some Theories about Evolution Evolution is an optimization process: the aim is to improve the ability of an organism to survive in dynamically changing

More information

arxiv: v1 [cs.ne] 11 Jul 2018

arxiv: v1 [cs.ne] 11 Jul 2018 Why Don t the Modules Dominate? Investigating the Structure of a Well-Known Modularity-Inducing Problem Domain Zhenyue Qin Australian National University Canberra, ACT, Australia zhenyue.qin@anu.edu.au

More information

Embodied Embeddings for Hyperneat

Embodied Embeddings for Hyperneat Embodied Embeddings for Hyperneat Collin Cappelle, Josh Bongard The University of Vermont, Burlington, VT 05405 collin.cappelle@uvm.edu Abstract A long time goal of evolutionary roboticists is to create

More information

Meiosis, Sexual Reproduction, & Genetic Variability

Meiosis, Sexual Reproduction, & Genetic Variability Meiosis, Sexual Reproduction, & Genetic Variability Teachers Guide NARRATION FOR MEIOSIS, SEXUAL REPRODUCTION, AND GENETIC VARIABILITY Since the members of no species, even California redwoods or giant

More information

SI Appendix. 1. A detailed description of the five model systems

SI Appendix. 1. A detailed description of the five model systems SI Appendix The supporting information is organized as follows: 1. Detailed description of all five models. 1.1 Combinatorial logic circuits composed of NAND gates (model 1). 1.2 Feed-forward combinatorial

More information

The Relationship Between Evolvability and Robustness in the Evolution of Boolean Networks

The Relationship Between Evolvability and Robustness in the Evolution of Boolean Networks The Relationship Between Evolvability and Robustness in the Evolution of Boolean Networks David Shorten 1 and Geoff Nitschke 1 1 Department of Computer Science, University of Cape Town dshorten@cs.uct.ac.za,

More information

Mutation Selection on the Metabolic Pathway and the Effects on Protein Co-evolution and the Rate Limiting Steps on the Tree of Life

Mutation Selection on the Metabolic Pathway and the Effects on Protein Co-evolution and the Rate Limiting Steps on the Tree of Life Ursinus College Digital Commons @ Ursinus College Mathematics Summer Fellows Student Research 7-21-2016 Mutation Selection on the Metabolic Pathway and the Effects on Protein Co-evolution and the Rate

More information

Grade 11 Biology SBI3U 12

Grade 11 Biology SBI3U 12 Grade 11 Biology SBI3U 12 } We ve looked at Darwin, selection, and evidence for evolution } We can t consider evolution without looking at another branch of biology: } Genetics } Around the same time Darwin

More information

NOTES Ch 17: Genes and. Variation

NOTES Ch 17: Genes and. Variation NOTES Ch 17: Genes and Vocabulary Fitness Genetic Drift Punctuated Equilibrium Gene flow Adaptive radiation Divergent evolution Convergent evolution Gradualism Variation 17.1 Genes & Variation Darwin developed

More information

Evolution of Populations. Chapter 17

Evolution of Populations. Chapter 17 Evolution of Populations Chapter 17 17.1 Genes and Variation i. Introduction: Remember from previous units. Genes- Units of Heredity Variation- Genetic differences among individuals in a population. New

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

Network motifs in the transcriptional regulation network (of Escherichia coli):

Network motifs in the transcriptional regulation network (of Escherichia coli): Network motifs in the transcriptional regulation network (of Escherichia coli): Janne.Ravantti@Helsinki.Fi (disclaimer: IANASB) Contents: Transcription Networks (aka. The Very Boring Biology Part ) Network

More information

Add Up and Cross Over Sordaria Genetics Simulation

Add Up and Cross Over Sordaria Genetics Simulation Introduction Add Up and Cross Over Sordaria Genetics Simulation Publication No. Crossing over occurs during metaphase I of meiosis. During crossing over, homologous pairs of chromosomes exchange sections

More information

Development. biologically-inspired computing. lecture 16. Informatics luis rocha x x x. Syntactic Operations. biologically Inspired computing

Development. biologically-inspired computing. lecture 16. Informatics luis rocha x x x. Syntactic Operations. biologically Inspired computing lecture 16 -inspired S S2 n p!!! 1 S Syntactic Operations al Code:N Development x x x 1 2 n p S Sections I485/H400 course outlook Assignments: 35% Students will complete 4/5 assignments based on algorithms

More information

Introduction to Genetics

Introduction to Genetics Introduction to Genetics The Work of Gregor Mendel B.1.21, B.1.22, B.1.29 Genetic Inheritance Heredity: the transmission of characteristics from parent to offspring The study of heredity in biology is

More information

Structure learning in human causal induction

Structure learning in human causal induction Structure learning in human causal induction Joshua B. Tenenbaum & Thomas L. Griffiths Department of Psychology Stanford University, Stanford, CA 94305 jbt,gruffydd @psych.stanford.edu Abstract We use

More information

A Genetic Algorithm Approach for Doing Misuse Detection in Audit Trail Files

A Genetic Algorithm Approach for Doing Misuse Detection in Audit Trail Files A Genetic Algorithm Approach for Doing Misuse Detection in Audit Trail Files Pedro A. Diaz-Gomez and Dean F. Hougen Robotics, Evolution, Adaptation, and Learning Laboratory (REAL Lab) School of Computer

More information

Essential knowledge 1.A.2: Natural selection

Essential knowledge 1.A.2: Natural selection Appendix C AP Biology Concepts at a Glance Big Idea 1: The process of evolution drives the diversity and unity of life. Enduring understanding 1.A: Change in the genetic makeup of a population over time

More information

Biological networks CS449 BIOINFORMATICS

Biological networks CS449 BIOINFORMATICS CS449 BIOINFORMATICS Biological networks Programming today is a race between software engineers striving to build bigger and better idiot-proof programs, and the Universe trying to produce bigger and better

More information

Computational Genomics. Systems biology. Putting it together: Data integration using graphical models

Computational Genomics. Systems biology. Putting it together: Data integration using graphical models 02-710 Computational Genomics Systems biology Putting it together: Data integration using graphical models High throughput data So far in this class we discussed several different types of high throughput

More information

Map of AP-Aligned Bio-Rad Kits with Learning Objectives

Map of AP-Aligned Bio-Rad Kits with Learning Objectives Map of AP-Aligned Bio-Rad Kits with Learning Objectives Cover more than one AP Biology Big Idea with these AP-aligned Bio-Rad kits. Big Idea 1 Big Idea 2 Big Idea 3 Big Idea 4 ThINQ! pglo Transformation

More information

Correlations to Next Generation Science Standards. Life Sciences Disciplinary Core Ideas. LS-1 From Molecules to Organisms: Structures and Processes

Correlations to Next Generation Science Standards. Life Sciences Disciplinary Core Ideas. LS-1 From Molecules to Organisms: Structures and Processes Correlations to Next Generation Science Standards Life Sciences Disciplinary Core Ideas LS-1 From Molecules to Organisms: Structures and Processes LS1.A Structure and Function Systems of specialized cells

More information

A GENETIC ALGORITHM FOR FINITE STATE AUTOMATA

A GENETIC ALGORITHM FOR FINITE STATE AUTOMATA A GENETIC ALGORITHM FOR FINITE STATE AUTOMATA Aviral Takkar Computer Engineering Department, Delhi Technological University( Formerly Delhi College of Engineering), Shahbad Daulatpur, Main Bawana Road,

More information

Full file at CHAPTER 2 Genetics

Full file at   CHAPTER 2 Genetics CHAPTER 2 Genetics MULTIPLE CHOICE 1. Chromosomes are a. small linear bodies. b. contained in cells. c. replicated during cell division. 2. A cross between true-breeding plants bearing yellow seeds produces

More information

BIOLOGY 111. CHAPTER 1: An Introduction to the Science of Life

BIOLOGY 111. CHAPTER 1: An Introduction to the Science of Life BIOLOGY 111 CHAPTER 1: An Introduction to the Science of Life An Introduction to the Science of Life: Chapter Learning Outcomes 1.1) Describe the properties of life common to all living things. (Module

More information

Introduction. Spatial Multi-Agent Systems. The Need for a Theory

Introduction. Spatial Multi-Agent Systems. The Need for a Theory Introduction Spatial Multi-Agent Systems A spatial multi-agent system is a decentralized system composed of numerous identically programmed agents that either form or are embedded in a geometric space.

More information

Abstract. Three Methods and Their Limitations. N-1 Experiments Suffice to Determine the Causal Relations Among N Variables

Abstract. Three Methods and Their Limitations. N-1 Experiments Suffice to Determine the Causal Relations Among N Variables N-1 Experiments Suffice to Determine the Causal Relations Among N Variables Frederick Eberhardt Clark Glymour 1 Richard Scheines Carnegie Mellon University Abstract By combining experimental interventions

More information

U.S. - Canadian Border Traffic Prediction

U.S. - Canadian Border Traffic Prediction Western Washington University Western CEDAR WWU Honors Program Senior Projects WWU Graduate and Undergraduate Scholarship 12-14-2017 U.S. - Canadian Border Traffic Prediction Colin Middleton Western Washington

More information

- mutations can occur at different levels from single nucleotide positions in DNA to entire genomes.

- mutations can occur at different levels from single nucleotide positions in DNA to entire genomes. February 8, 2005 Bio 107/207 Winter 2005 Lecture 11 Mutation and transposable elements - the term mutation has an interesting history. - as far back as the 17th century, it was used to describe any drastic

More information

Self-healing tile sets: rates of regrowth in damaged assemblies

Self-healing tile sets: rates of regrowth in damaged assemblies University of Arkansas, Fayetteville ScholarWorks@UARK Mathematical Sciences Undergraduate Honors Theses Mathematical Sciences 5-2015 Self-healing tile sets: rates of regrowth in damaged assemblies Kevin

More information

Roadmap. Sexual Selection. Evolution of Multi-Gene Families Gene Duplication Divergence Concerted Evolution Survey of Gene Families

Roadmap. Sexual Selection. Evolution of Multi-Gene Families Gene Duplication Divergence Concerted Evolution Survey of Gene Families 1 Roadmap Sexual Selection Evolution of Multi-Gene Families Gene Duplication Divergence Concerted Evolution Survey of Gene Families 2 One minute responses Q: How do aphids start producing males in the

More information

APES C4L2 HOW DOES THE EARTH S LIFE CHANGE OVER TIME? Textbook pages 85 88

APES C4L2 HOW DOES THE EARTH S LIFE CHANGE OVER TIME? Textbook pages 85 88 APES C4L2 HOW DOES THE EARTH S LIFE CHANGE OVER TIME? Textbook pages 85 88 Big Ideas Concept 4-2A The scientific theory of evolution explains how life on Earth changes over time through changes in the

More information

5.3 METABOLIC NETWORKS 193. P (x i P a (x i )) (5.30) i=1

5.3 METABOLIC NETWORKS 193. P (x i P a (x i )) (5.30) i=1 5.3 METABOLIC NETWORKS 193 5.3 Metabolic Networks 5.4 Bayesian Networks Let G = (V, E) be a directed acyclic graph. We assume that the vertices i V (1 i n) represent for example genes and correspond to

More information

Reproduction- passing genetic information to the next generation

Reproduction- passing genetic information to the next generation 166 166 Essential Question: How has biological evolution led to the diversity of life? B-5 Natural Selection Traits that make an organism more or less likely to survive in an environment and reproduce

More information

NOTES CH 17 Evolution of. Populations

NOTES CH 17 Evolution of. Populations NOTES CH 17 Evolution of Vocabulary Fitness Genetic Drift Punctuated Equilibrium Gene flow Adaptive radiation Divergent evolution Convergent evolution Gradualism Populations 17.1 Genes & Variation Darwin

More information

Genetic Algorithms. Donald Richards Penn State University

Genetic Algorithms. Donald Richards Penn State University Genetic Algorithms Donald Richards Penn State University Easy problem: Find the point which maximizes f(x, y) = [16 x(1 x)y(1 y)] 2, x, y [0,1] z (16*x*y*(1-x)*(1-y))**2 0.829 0.663 0.497 0.331 0.166 1

More information

AP Biology Curriculum Framework

AP Biology Curriculum Framework AP Biology Curriculum Framework This chart correlates the College Board s Advanced Placement Biology Curriculum Framework to the corresponding chapters and Key Concept numbers in Campbell BIOLOGY IN FOCUS,

More information

O 3 O 4 O 5. q 3. q 4. Transition

O 3 O 4 O 5. q 3. q 4. Transition Hidden Markov Models Hidden Markov models (HMM) were developed in the early part of the 1970 s and at that time mostly applied in the area of computerized speech recognition. They are first described in

More information

Evolutionary Dynamics and Extensive Form Games by Ross Cressman. Reviewed by William H. Sandholm *

Evolutionary Dynamics and Extensive Form Games by Ross Cressman. Reviewed by William H. Sandholm * Evolutionary Dynamics and Extensive Form Games by Ross Cressman Reviewed by William H. Sandholm * Noncooperative game theory is one of a handful of fundamental frameworks used for economic modeling. It

More information

Synchronous state transition graph

Synchronous state transition graph Heike Siebert, FU Berlin, Molecular Networks WS10/11 2-1 Synchronous state transition graph (0, 2) (1, 2) vertex set X (state space) edges (x,f(x)) every state has only one successor attractors are fixed

More information

The Evolution of Sex Chromosomes through the. Baldwin Effect

The Evolution of Sex Chromosomes through the. Baldwin Effect The Evolution of Sex Chromosomes through the Baldwin Effect Larry Bull Computer Science Research Centre Department of Computer Science & Creative Technologies University of the West of England, Bristol

More information

Tigard-Tualatin School District Science Grade Level Priority Standards

Tigard-Tualatin School District Science Grade Level Priority Standards Sixth Grade Science Physical Science 6.1 Structure and Function: Living and non-living systems are organized groups of related parts that function together and have characteristics and properties. 6.1P.1

More information

Mathematical models in population genetics II

Mathematical models in population genetics II Mathematical models in population genetics II Anand Bhaskar Evolutionary Biology and Theory of Computing Bootcamp January 1, 014 Quick recap Large discrete-time randomly mating Wright-Fisher population

More information

Campbell Biology AP Edition 11 th Edition, 2018

Campbell Biology AP Edition 11 th Edition, 2018 A Correlation and Narrative Summary of Campbell Biology AP Edition 11 th Edition, 2018 To the AP Biology Curriculum Framework AP is a trademark registered and/or owned by the College Board, which was not

More information

PRINCIPLES OF MENDELIAN GENETICS APPLICABLE IN FORESTRY. by Erich Steiner 1/

PRINCIPLES OF MENDELIAN GENETICS APPLICABLE IN FORESTRY. by Erich Steiner 1/ PRINCIPLES OF MENDELIAN GENETICS APPLICABLE IN FORESTRY by Erich Steiner 1/ It is well known that the variation exhibited by living things has two components, one hereditary, the other environmental. One

More information

Binary fission occurs in prokaryotes. parent cell. DNA duplicates. cell begins to divide. daughter cells

Binary fission occurs in prokaryotes. parent cell. DNA duplicates. cell begins to divide. daughter cells Chapter 11 Chapter 11 Some eukaryotes reproduce through mitosis. Binary fission is similar in function to mitosis. Asexual reproduction is the creation of offspring from a single parent. Binary fission

More information

A Simple Haploid-Diploid Evolutionary Algorithm

A Simple Haploid-Diploid Evolutionary Algorithm A Simple Haploid-Diploid Evolutionary Algorithm Larry Bull Computer Science Research Centre University of the West of England, Bristol, UK larry.bull@uwe.ac.uk Abstract It has recently been suggested that

More information

Introduction to Meiosis Many organisms pass their genes to their offspring through.

Introduction to Meiosis   Many organisms pass their genes to their offspring through. MEIOSIS NAME DATE 1 Introduction to Meiosis http://vcell.ndsu.nodak.edu/animations/meiosis/movie-flash.htm Many organisms pass their genes to their offspring through. This begins when two gametes unite

More information

Formalizing the gene centered view of evolution

Formalizing the gene centered view of evolution Chapter 1 Formalizing the gene centered view of evolution Yaneer Bar-Yam and Hiroki Sayama New England Complex Systems Institute 24 Mt. Auburn St., Cambridge, MA 02138, USA yaneer@necsi.org / sayama@necsi.org

More information

Answer Key. Cell Growth and Division

Answer Key. Cell Growth and Division Cell Growth and Division Answer Key SECTION 1. THE CELL CYCLE Cell Cycle: (1) Gap1 (G 1): cells grow, carry out normal functions, and copy their organelles. (2) Synthesis (S): cells replicate DNA. (3)

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

11.1 The Process of Meiosis

11.1 The Process of Meiosis OpenStax-CNX module: m52675 1 11.1 The Process of Meiosis Shannon McDermott Based on The Process of Meiosis by OpenStax College This work is produced by OpenStax-CNX and licensed under the Creative Commons

More information

Teachers Guide. Overview

Teachers Guide. Overview Teachers Guide Overview BioLogica is multilevel courseware for genetics. All the levels are linked so that changes in one level are reflected in all the other levels. The BioLogica activities guide learners

More information

11-4 Meiosis Meiosis. Slide 1 of 35. Copyright Pearson Prentice Hall

11-4 Meiosis Meiosis. Slide 1 of 35. Copyright Pearson Prentice Hall 11-4 Meiosis 1 of 35 Each organism must inherit a single copy of every gene from each of its parents. Gametes are formed by a process that separates the two sets of genes so that each gamete ends up with

More information

10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison

10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison 10-810: Advanced Algorithms and Models for Computational Biology microrna and Whole Genome Comparison Central Dogma: 90s Transcription factors DNA transcription mrna translation Proteins Central Dogma:

More information

HS-LS2-3. Construct and revise an explanation based on evidence for the cycling of matter and flow of energy in aerobic and anaerobic conditions.

HS-LS2-3. Construct and revise an explanation based on evidence for the cycling of matter and flow of energy in aerobic and anaerobic conditions. Boone County Biology Curriculum Map Unit 1, Matter and Energy How do organisms obtain and use the energy they need to live and grow? How do matter and energy move through ecosystems? Construct an explanation

More information

www.lessonplansinc.com Topic: Dinosaur Evolution Project Summary: Students pretend to evolve two dinosaurs using genetics and watch how the dinosaurs adapt to an environmental change. This is a very comprehensive

More information

Confidence Interval Estimation

Confidence Interval Estimation Department of Psychology and Human Development Vanderbilt University 1 Introduction 2 3 4 5 Relationship to the 2-Tailed Hypothesis Test Relationship to the 1-Tailed Hypothesis Test 6 7 Introduction In

More information

Parents can produce many types of offspring. Families will have resemblances, but no two are exactly alike. Why is that?

Parents can produce many types of offspring. Families will have resemblances, but no two are exactly alike. Why is that? Parents can produce many types of offspring Families will have resemblances, but no two are exactly alike. Why is that? Meiosis and Genetic Linkage Objectives Recognize the significance of meiosis to sexual

More information

Haploid & diploid recombination and their evolutionary impact

Haploid & diploid recombination and their evolutionary impact Haploid & diploid recombination and their evolutionary impact W. Garrett Mitchener College of Charleston Mathematics Department MitchenerG@cofc.edu http://mitchenerg.people.cofc.edu Introduction The basis

More information

Topic 7: Evolution. 1. The graph below represents the populations of two different species in an ecosystem over a period of several years.

Topic 7: Evolution. 1. The graph below represents the populations of two different species in an ecosystem over a period of several years. 1. The graph below represents the populations of two different species in an ecosystem over a period of several years. Which statement is a possible explanation for the changes shown? (1) Species A is

More information

Sexual Reproduction and Genetics

Sexual Reproduction and Genetics Chapter Test A CHAPTER 10 Sexual Reproduction and Genetics Part A: Multiple Choice In the space at the left, write the letter of the term, number, or phrase that best answers each question. 1. How many

More information

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1 Math 66/566 - Midterm Solutions NOTE: These solutions are for both the 66 and 566 exam. The problems are the same until questions and 5. 1. The moment generating function of a random variable X is M(t)

More information

V. Evolutionary Computing. Read Flake, ch. 20. Genetic Algorithms. Part 5A: Genetic Algorithms 4/10/17. A. Genetic Algorithms

V. Evolutionary Computing. Read Flake, ch. 20. Genetic Algorithms. Part 5A: Genetic Algorithms 4/10/17. A. Genetic Algorithms V. Evolutionary Computing A. Genetic Algorithms 4/10/17 1 Read Flake, ch. 20 4/10/17 2 Genetic Algorithms Developed by John Holland in 60s Did not become popular until late 80s A simplified model of genetics

More information

CHEM 181: Chemical Biology

CHEM 181: Chemical Biology Instructor Prof. Jane M. Liu (SN-216) jane.liu@pomona.edu CHEM 181: Chemical Biology Office Hours Anytime my office door is open or by appointment COURSE OVERVIEW Class TR 8:10-9:25 am Prerequisite: CHEM115

More information

Reconciling Explanations for the Evolution of Evolvability

Reconciling Explanations for the Evolution of Evolvability Reconciling Explanations for the Evolution of Evolvability Bryan Wilder and Kenneth O. Stanley To Appear In: Adaptive Behavior journal. London: SAGE, 2015. Abstract Evolution s ability to find innovative

More information

THE WORK OF GREGOR MENDEL

THE WORK OF GREGOR MENDEL GENETICS NOTES THE WORK OF GREGOR MENDEL Genetics-. - Austrian monk- the father of genetics- carried out his work on. Pea flowers are naturally, which means that sperm cells fertilize the egg cells in

More information

NGSS Example Bundles. Page 1 of 23

NGSS Example Bundles. Page 1 of 23 High School Conceptual Progressions Model III Bundle 2 Evolution of Life This is the second bundle of the High School Conceptual Progressions Model Course III. Each bundle has connections to the other

More information

The concept of the adaptive landscape

The concept of the adaptive landscape 1 The concept of the adaptive landscape The idea of a fitness landscape was introduced by Sewall Wright (1932) and it has become a standard imagination prosthesis for evolutionary theorists. It has proven

More information

arxiv:physics/ v1 [physics.bio-ph] 8 Feb 2000 Formalizing the gene centered view of evolution

arxiv:physics/ v1 [physics.bio-ph] 8 Feb 2000 Formalizing the gene centered view of evolution arxiv:physics/0002016v1 [physics.bio-ph] 8 Feb 2000 Formalizing the gene centered view of evolution Y. Bar-Yam New England Complex Systems Institute 24 Mt. Auburn St., Cambridge MA Department of Molecular

More information

A Simple Model of Unbounded Evolutionary Versatility as a Largest-Scale Trend in Organismal Evolution

A Simple Model of Unbounded Evolutionary Versatility as a Largest-Scale Trend in Organismal Evolution Submitted to Artificial Life March 8, 2000 A Simple Model of Unbounded Evolutionary Versatility as a Largest-Scale Trend in Organismal Evolution Peter D. Turney Institute for Information Technology National

More information

Biological Anthropology

Biological Anthropology Biological Anthropology Sample Exam 1 Multiple-Choice Questions For each of the following questions, circle the answer that is most correct. Each question is worth two (2) points. 1. Which of the following

More information

Ohio Tutorials are designed specifically for the Ohio Learning Standards to prepare students for the Ohio State Tests and end-ofcourse

Ohio Tutorials are designed specifically for the Ohio Learning Standards to prepare students for the Ohio State Tests and end-ofcourse Tutorial Outline Ohio Tutorials are designed specifically for the Ohio Learning Standards to prepare students for the Ohio State Tests and end-ofcourse exams. Biology Tutorials offer targeted instruction,

More information

Investigation 7: Cell Division Part B: Meiosis and Crossing Over

Investigation 7: Cell Division Part B: Meiosis and Crossing Over Background Investigation 7: Cell Division Part B: Meiosis and Crossing Over Ascomycota are a diverse group of fungi including the familiar single-celled baker s yeast, the complex morel mushroom, and the

More information

Row and Column Distributions of Letter Matrices

Row and Column Distributions of Letter Matrices College of William and Mary W&M ScholarWorks Undergraduate Honors Theses Theses, Dissertations, & Master Projects 5-2016 Row and Column Distributions of Letter Matrices Xiaonan Hu College of William and

More information

Computational Biology: Basics & Interesting Problems

Computational Biology: Basics & Interesting Problems Computational Biology: Basics & Interesting Problems Summary Sources of information Biological concepts: structure & terminology Sequencing Gene finding Protein structure prediction Sources of information

More information

9-4 Meiosis Meiosis. Slide 1 of 35

9-4 Meiosis Meiosis. Slide 1 of 35 9-4 Meiosis 11-4 Meiosis 1 of 35 11-4 Meiosis Each organism must inherit a single copy of every gene from each of its parents. Gametes are formed by a process that separates the two sets of genes so that

More information

Orleans Southwest and Lamoille North Science Curriculum (Grade Cluster 9-12)

Orleans Southwest and Lamoille North Science Curriculum (Grade Cluster 9-12) Orleans Southwest and Lamoille North Science Curriculum (Grade Cluster 9-12) Big Idea: Life Science 1. Structure and Functions 2. Changes in Living Systems 3. Ecosystems and Energy Big Idea: Scientific

More information

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148

UNIT 8 BIOLOGY: Meiosis and Heredity Page 148 UNIT 8 BIOLOGY: Meiosis and Heredity Page 148 CP: CHAPTER 6, Sections 1-6; CHAPTER 7, Sections 1-4; HN: CHAPTER 11, Section 1-5 Standard B-4: The student will demonstrate an understanding of the molecular

More information

4/6/2014. Chromosome Number

4/6/2014. Chromosome Number Meiosis 1of 35 Each organism must inherit a single copy of every gene from each of its parents. Gametes are formed by a process that separates the two sets of genes so that each gamete ends up with just

More information