Pathway Databases: A Case Study in Computational Symbolic Theories

Size: px
Start display at page:

Download "Pathway Databases: A Case Study in Computational Symbolic Theories"

Transcription

1 sponsors development of the information technology infrastructure necessary for a National Virtual Observatory (NVO). There is a close cooperation with the particle physics community through the Grid Physics Network (GriPhyN). NASA supports astronomy mission archives and discipline data centers while developing a roadmap for their federation. Impressively, these projects are all cooperating, and are working toward a future Global Virtual Observatory to benefit the international astronomical community and the public alike. There are similar efforts under way in other areas of science as well. The Virtual Observatory has had and will have significant interactions with other science communities, both learning from some and providing a model for others. References 1. The Hubble Space Telescope, 2. Chandra X-Ray Observatory Center, harvard.edu. 3. The Sloan Digital Sky Survey website, 4. The Two Micron All Sky Survey, caltech.edu/2mass. 5. Digitized Palomar Observatory Sky Survey, www. astro.caltech.edu/ george/dposs. 6. SIMBAD Astronomical Database, 7. NASA/IPAC Extragalactic Database, ipac.caltech.edu. 8. Virtual Observatories of the Future, American Society for Physics Conference Series, vol. 25, R. J. Brunner, S. G. Djorgovski, A. S. Szalay, Eds., (The Astronomical Society of the Pacific, San Francisco, 2001). 9. X. Fan et al., e-print available at abs/astro-ph/ Vizier Service, VizieR/ 11. The Semantic Web, Web Services, I. Foster, C. Kesselman, Eds., The Grid: Blueprint for a New Computing Infrastructure (Kaufmann, San Francisco, 1998). 13. Public access Web site to the SDSS data, skyserver.sdss.org/ VIEWPOINT Pathway Databases: A Case Study in Computational Symbolic Theories Peter D. Karp A pathway database (DB) is a DB that describes biochemical pathways, reactions, and enzymes. The EcoCyc pathway DB (see describes the metabolic, transport, and genetic-regulatory networks of Escherichia coli. EcoCyc is an example of a computational symbolic theory, which is a DB that structures a scientific theory within a formal ontology so that it is available for computational analysis. It is argued that by encoding scientific theories in symbolic form, we open new realms of analysis and understanding for theories that would otherwise be too large and complex for scientists to reason with effectively. What happens when a scientific theory is too large to be grasped by a single mind? Decades of experimentation by molecular biologists to characterize the molecular components of single cells, combined with recent advances in genomics, have thrust biology into the position where the theoretical understanding of a system such as the biochemical network of E. coli is too large for a single scientist to grasp. This situation has a number of disturbing consequences: It becomes extremely difficult to determine whether such theories are internally consistent or are consistent with external data, to refine theories that are inconsistent, or to understand all of the implications of such large theories. As more details of such a complex system are elucidated experimentally, it is not so clear that our understanding of the system as a whole increases if the new understanding cannot be integrated with the larger theory it pertains to in a coherent fashion. In this article I argue that as scientific theories reach a certain complexity, it becomes essential to encode those theories in a symbolic form within a computer database (DB). I describe pathway DBs as a case study in encoding Bioinformatics Research Group, SRI International, EK223, 333 Ravenswood Avenue, Menlo Park, CA 94025, USA. pkarp@ai.sri.com scientific theories in computers. Although the scientific community clearly accepts the need to encode the ever-expanding quantity of scientific data within DBs, DBs of scientific theories, such as a theory describing the transcriptional regulation of E. coli genes, are much rarer. By data I mean measurements made from individual experiments; by theory I mean relationships inferred from the interpretation and synthesis of many experimental results. The biological sciences are particularly well suited to the DB approach because many theories in biology have a qualitative nature; they describe semantic relationships between systems with many different molecular components, and the causal relationships between these components have been measured in a qualitative rather than a quantitative fashion. The DB approach is probably less appropriate for quantitative theories that are best described by systems of differential equations, or other types of mathematical models in analytical form. The theory of the E. coli metabolic network is an example of a theory whose size and complexity are too large for a mind to grasp. The metabolic network is essentially a chemical processing factory within each E. coli cell that enables the organism to convert small molecule chemicals that it finds in its environment into the building blocks of its own structures, and to extract energy from those chemicals. The E. coli metabolic network, illustrated in Fig. 1, involves 791 chemical compounds organized into 744 enzyme-catalyzed biochemical reactions (1). On average, each compound is involved in 2.1 reactions. I posit that the majority of scientists cannot grasp every intricate detail of this complex network. Omission of even a single step from the network can be fatal for the cell. One might argue that the biomedical literature is one embodiment of the theory of the E. coli metabolic network, and that as the biomedical literature enters electronic form, we need not be concerned with the size and complexity of biology theories. Although efforts to bring the biomedical literature online are tremendously useful, there are serious limitations to what they will achieve: We cannot compute effectively with theories within the biomedical literature. Natural-language texts still remain largely opaque to computers, despite many advances in natural-language processing. For example, one relatively simple question we might wish to ask of the E. coli metabolic network is how many of its reaction steps are catalyzed by multiple enzymes, meaning they have backup systems, and therefore would targeting a drug toward one of the enzymes catalyzing those steps be ineffective? Answering this question by using a pathway DB such as the EcoCyc pathway DB is trivial, but answering this question by processing the biomedical literature with a computer program would earn the programmer a Ph.D. in computer science. Pathway Databases A pathway is a linked set of biochemical reactions linked in the sense that the product of one reaction is a reactant of, or an enzyme that catalyzes, a subsequent reaction. A pathway DB is a bioinformatics DB that describes biochemical pathways and their component reactions, 2040

2 enzymes, and substrates. Most pathway DBs created to date describe metabolic pathways, but pathway DBs containing signaling and geneticregulatory pathways are now beginning to appear. Most pathway DBs contain computable descriptions of pathways structured by using a formal ontology, as opposed to textual or semistructured descriptions of pathways. A pathway/ genome DB (PGDB) integrates pathway information with information about the complete genome of an organism. A PGDB is one type of model-organism DB (MOD), although most MODs do not contain pathway information. Historically, pathway DBs arose at the intersection of genomics, biochemistry, DBs, and artificial intelligence. Genome DBs aim to catalog the molecular parts of an organism whose genome has been sequenced. They generally focus more on describing the genome map and sequence of the organism than they do on describing the functions of each gene in a structured computable fashion (functions are described using short English phrases). Biochemists have had a long-standing effort to catalog the catalytic activities of known enzymes, in printed form. The EcoCyc project (2, 3) began in 1992 with the goals of integrating within a single DB the then incomplete genome map of E. coli, with detailed descriptions of the enzymes and pathways of E. coli metabolism. Thus, EcoCyc is a PGDB. By 1996, EcoCyc contained more than 100 pathways and 520 enzymes, entered by Riley s group at the Marine Biological Laboratory. Enzymes were connected to their genes, and to the genome sequence, when known. The DB contained detailed descriptions of the reaction catalyzed by each enzyme, the range of substrates the enzyme would accept, the chemicals known to activate or inhibit the enzyme, and its subunit structure. The DB also described each small molecule enzyme substrate. EcoCyc pathways include those for biosynthesis of cellular building blocks such as amino acids and cell wall components, those for catabolism of the different carbon sources that E. coli can utilize, and those for extracting energy from chemical compounds. In 1998 we integrated the complete genome sequence of E. coli determined by the Blattner laboratory into EcoCyc (4), and we expanded the EcoCyc collaboration to include additional biologists curating information on other aspects of the E. coli biochemical machinery: the transporters that move small molecules from the outside of the cell to the inside, and the mechanisms by which E. coli gene expression is controlled at the transcriptional level. Version 5.6 of EcoCyc released in June 2001 describes 162 E. coli transporters and 629 transcription Fig. 1. An overview of the full known metabolic map of E. coli. Each blue and yellow line in this diagram represents a single enzyme-catalyzed reaction; each node represents a single metabolite. Glycolysis is in the middle of the diagram, biosynthetic pathways are to its left, catabolic pathways are to its right, and reactions that have not been assigned to a pathway are grouped along the far right-hand side. The shape of each metabolite encodes its chemical class: for example, amino acids are shown as triangles, and shaded nodes indicate phosphorylated compounds. This diagram was produced by combining automated graph layout algorithms with some manual positioning of regions of this graph. The reactions highlighted in yellow show the results of a species comparison between E. coli and the metabolic network predicted for the yeast S. cerevisiae from its genome by the PathoLogic program. Reaction lines in blue indicate reactions found in E. coli only; reaction lines in yellow indicate reactions found in both E. coli and S. cerevisiae. The species comparison does not generally include the transport reactions shown in the cellular membrane that surrounds the diagram. SCIENCE VOL SEPTEMBER

3 units within E. coli, and a total of 165 E. coli metabolic pathways. Other pathway DBs include KEGG (5), WIT (6), MetaCyc (2), the Connections Map DB (see and UM-BBD (7); for a review see (8). In parallel with the development of EcoCyc, SRI developed a software environment called the Pathway Tools (9) that supports query, analysis, and visualization operations for PGDBs. The Pathway Tools allows users to query Eco- Cyc DB by a variety of criteria including name matching and classification hierarchies (such as a taxonomy of metabolic pathways). Query results are displayed by using a library of visualization tools that automatically generate drawings of metabolic pathways, reactions, chemical compounds, chromosomes, and transcription units. A visualization tool called the Overview diagram depicts the full metabolic map of E. coli and its transporters. This tool is a powerful device for understanding global properties of the E. coli metabolic network. For example, the user can ask the software to highlight all occurrences of a given metabolite to understand all the pathways that can operate on it. The user also can highlight all metabolic reactions that are activated or inhibited by a specific metabolite at the substrate level, or whose transcription is controlled by a given transcription factor, to understand the regulation of the metabolic network. We have also developed a technique for painting gene expression data sets onto the Overview, thus providing an organizing framework to aid the interpretation of these complex data sets in a pathway context (see expression.html). More recently, the Pathway Tools has been generalized so that it can manage PGDBs for multiple organisms simultaneously. In the course of 1 week, a user of the PathoLogic component of the Pathway Tools (see below) can create a new PGDB for a sequenced microorganism. Once created, users can refine the PGDB by using a suite of interactive editors within the Pathway Tools, and can publish the PGDB on the Web. We have thus created a reusable generic model-organism DB toolkit that is now being used to create PGDBs as resources for the scientific communities that research many microorganisms. The SRI Web site at contains PGDBs for eight microorganisms that were created by using the Pathway Tools. Computing with Biological Theories in Pathway Databases The artificial intelligence subfield of knowledge representation is concerned with devising symbolic encodings of complex collections of information in a manner that supports inference (reasoning) processes across that information. Key strategies in knowledge representation include (i) devising an ontology (DB schema) that captures important semantic distinctions in an accurate fashion, and that precisely defines the meaning of different DB fields, (ii) rigorously following the definitions in that ontology to encode a theory within the DB, and (iii) extending the ontology when new domain concepts are found to fall outside the scope of the ontology. The EcoCyc ontology (10) contains about 1000 classes that encode key concepts in biochemistry and molecular biology, and more than 200 slots that define properties of and relationships among those classes. The EcoCyc DB is structured according to this ontology and consists of an interconnected web of frames (objects) stored in a frame knowledge representation system (similar to an objectoriented DB). Each frame represents a distinct biological object (such as a gene or a protein), and the labeled connections between those frames represent distinct semantic relationships among the objects, such as the relationship of a gene to its protein product, or the relationship of a protein to a reaction that it catalyzes. Reasoning across such a DB is accomplished through computations that traverse this network, and represents a distinct nonnumerical style of computing that, in our experience, most scientists are not familiar with. It is symbolic computing that allows us to exploit the power of computational qualitative theories. The following sections explore global computations that can be applied to pathway DBs. Computing global properties of a pathway DB. By integrating the fragments of biological theories that are scattered through the biomedical literature, we can discover global properties of those theories that were heretofore elusive. Ouzounis and Karp wrote a set of programs that computed statistics on relationships in the DB that showed how very simplistic is the classical notion of one gene, one enzyme, and one reaction (1). The program found that E. coli contains 100 enzymes that catalyze more than one biochemical reaction, 68 cases where the same reaction is catalyzed by more than one enzyme, and 99 cases where one reaction is used in multiple E. coli pathways. More recently, Karp and Collado studied the global properties of the E. coli genetic network stored within EcoCyc, which is the most detailed model of the genetic network of any organism. The model describes the control of 630 E. coli transcription units (containing 27% of all E. coli genes) by 97 transcription-factor proteins. Figure 2 shows a visualization of the E. coli genetic network defined by EcoCyc. A number of interesting properties are present in this network. A significant portion of the network is unconnected, that is, 50 of the 85 transcription factors do not regulate other transcription factors they regulate other genes that do not encode transcription factors, and, in some cases, they regulate themselves. The network is not very deep it has a maximum depth of three nodes. Only two transcription factors (CRP and Fnr) directly control more than two other transcription factors. Aside from autoregulation (when a transcription factor directly controls its own expression), there are only two feedback loops in this graph (between MarR and MarA, and between GutR and GutM). Negative autoregulation is the dominant form of feedback. Jeong et al. studied the topology of the protein interaction network for the yeast Saccharomyces cerevisiae, and found that the network topology is heterogeneous and scale-free, meaning that there are relatively few highly connected nodes, and that the probability of finding a network node ( protein) with many connections (interactions) follows a power law (11). They also found that deletion of proteins with high numbers of interactions was more likely to be lethal to the organism. In earlier work, they found that metabolic networks are also scalefree (12). Pathway Prediction from Sequenced Genomes Another form of inference with a Pathway DB occurred in 1995 when Karp, Ouzounis, and Paley demonstrated that the EcoCyc DB could be used to predict the metabolic network of an organism from its genome (13). The PathoLogic program that they developed takes two inputs: an annotated genome sequence that includes the locations and predicted functions of genes within the genome, and a reference pathway DB. The output produced by the program is a new PGDB that includes a set of pathways predicted to be present in the organism by PathoLogic. PathoLogic uses SRI s MetaCyc pathway DB (2) as the reference DB; MetaCyc contains 450 pathways from many different organisms. PathoLogic matches enzymes in the annotated genomes against enzymes in the MetaCyc DB and computes a score for the presence of different pathways on the basis of the number of matching enzymes, and their positions within the pathway. Prediction of Pathway Flux Rates Schilling and Palsson have devised a numerical method to predict the reaction flux rates for the entire metabolic network of an organism (14). Given experimental measurements of the mass composition of metabolites within the cell, and a list of all metabolic reactions known to occur in the cell (such as those provided by EcoCyc), an optimization procedure is used to calculate the equilibrium rates at which substrates are processed by each metabolic reaction. The flux rates predicted by their technique have been verified experimentally. An extension of this computational technique was used to predict the lethality of E. coli deletion mutants; it correctly predicted the growth potential of mutant strains in 86% of the genes examined (15). 2042

4 Consistency of Metabolic Network with Cellular Growth-Media Requirements The growth of E. coli can be supported by a number of alternative chemically characterized growth media, such as a combination of glucose, ammonia, and minerals. The E. coli metabolic network is able to synthesize all of the compounds essential for its growth (such as the amino acids and nucleoside triphosphates) from these simple precursors. Therefore, it should be possible to verify the completeness and correctness of the theory of the E. coli metabolic network embodied in the EcoCyc DB by computationally propagating the known chemical nutrients of E. coli through the EcoCyc network, and determining whether the resulting chemical products included all of the compounds known to be essential for growth. A qualitative simulation of the E. coli metabolic network is obtained qualitative in the sense that the approach is not attempting to predict the quantities of chemical products that E. coli metabolism produces over time, but to predict if certain chemical products can be produced from a given set of precursors. This qualitative simulation was obtained by using a computational device called a production system, which is the basis of many expert systems. A production system consists of a set of rules (corresponding to reactions) and a set of propositions currently listed as true in the working memory of the production system (which correspond to metabolites). Each rule is of the form A B 3 C D, meaning that if A and B are present in working memory, then add the presence of C and D to working memory. A production-rule inference engine repeatedly searches for a rule for which all of the propositions on its left side are present in working memory, and fires the rule by adding the propositions on its right side to working memory. Every metabolic reaction of the form A B C was computationally translated into a production rule of the form A B 3 C. And the working memory of the production system was initialized to contain a proposition for each known growth nutrient of E. coli. The initial outcome of the resulting qualitative simulation (16) was that known E. coli growth media could not produce the compounds essential for E. coli growth. Examination of these results revealed (i) the existence of bugs in the EcoCyc DB (since corrected); (ii) the existence of metabolic intermediates required for the production of some essential compounds, but for which the metabolic synthesis route is unknown; and (iii) that we had neglected to include certain important precursors in our simulation, such as some proteins that are metabolic substrates (such as thioredoxin and acyl carrier protein). Once these discrepancies were resolved (16), it could be verified that all 41 essential compounds could be produced from the M63 minimal growth medium. Discussion Pathway DBs have several purposes. They are encyclopedic references for pathway information that can be queried by scientists who want to search out specific facts, or search for patterns. They can also be queried by computer programs that perform global analyses and pattern searches. The symbolic nature of pathway DBs means that many types of programs and inference procedures can be written to compute with this information. Although one might argue that computational theories have existed for many years in the form of Fortran programs that model physical systems, the procedural nature of a Fortran program means that it can be used in only one way, namely, to be executed. The knowledge representation community has long recognized the flexibility that results from symbolic representations (17). Pathway DBs are a mechanism for easing the cognitive overload produced by genome data: An analysis of the pathway content of a microbial genome reduces the complexity of that genome by allowing the scientist to think in terms of hundreds of pathways rather than in terms of thousands of gene products. Similarly, Fig. 2. A visualization of the known network of transcription factors in EcoCyc. Each circle represents a single transcription factor. A blue arrow from protein A to protein B indicates that A activates the transcription of B. Pink arrows indicate repression of transcription, and yellow arrows represent both positive and negative regulation of transcription. Circles with a blue outline represent positive self-regulation. Pink circles represent negative self-regulation. Yellow circles with a thick outline represent both positive and negative self-regulation. Circles with a thin yellow outline do not regulate their own transcription. Thus, for example, IHF represses transcription of its own genes and those of OmpR, but activates transcription of the genes for TdcA. The majority of transcription factors depicted here regulate the transcription of many other genes; this diagram shows only regulation of other transcription factors. SCIENCE VOL SEPTEMBER

5 pathway DBs can impose an organizing framework on complex gene expression (or proteomics) data sets that facilitates their interpretation. Future challenges for pathway DBs include modeling of large signaling networks in eukaryotic organisms; performing automated layout similar to that shown in Fig. 1 of the much larger pathway networks that exist in eukaryotic organisms, and supporting methods for user navigation through such a larger pathway network; defining standard ontologies for exchange of pathway data among different DBs and application programs; and creating new analysis algorithms for extracting new insights from pathway networks, such as to aid drug design by analyzing diseased human pathway networks, or predicting optimal drug targets for antimicrobial drug design. One lesson for computer scientists provided by pathway DBs (and by other bioinformatics applications) concerns the importance of DB content to solving computational problems. Most computer scientists focus their attention on algorithms, thinking that the best way to solve a hard computational problem is through a better algorithm. However, for problems such as predicting the pathway complement of an organism from its genome, or predicting metabolic products that an organism can produce from a given growth medium, I know of no algorithms that can solve these problems without being coupled with an accurate and well-designed pathway DB. By encoding scientific theories in a symbolic DB, scientists can more easily check those theories for internal consistency and for consistency with external data, can more easily refine theories that are found to violate external data, and can more easily assess the global properties of the system that such a theory describes. The genome revolution is increasing the need for pathway DBs in the biological sciences, and similar developments will occur in other sciences. However, effective implementation of this paradigm is hampered because most biologists (and most other scientists) receive essentially no education in DBs or knowledge representation. Although many scientists learn a computer programming language as part of their undergraduate education, introductory programming courses completely omit DB and knowledge representation concepts such as data models, ontologies, DB query languages, logical inference, DB design, and formal grammars which explains why many biological DBs do not have a regular syntactic structure, much less a consistent or precisely defined semantics. As science enters the information age, it is crucial that the computerscience education that scientists receive covers symbolic computing as well as numerical computing. References and Notes 1. C. Ouzounis, P. Karp, Genome Res. 10, 568 (2000). 2. P. Karp et al., Nucleic Acids Res. 28, 56 (2000). 3. P. Karp, in Nucleic Acid and Protein Databases and How To Use Them (Academic Press, London, 1999), pp F. Blattner et al., Science 277, 1453 (1997). 5. M. Kanehisa, S. Goto, Nucleic Acids Res. 28, 27 (2000). 6. R. Overbeek et al., Nucleic Acids Res. 28, 123 (2000). 7. L. Ellis, C. Hershberger, L. Wackett, Nucleic Acids Res. 28, 377 (2000). 8. P. D. Karp, Trends Biochem. Sci. 23, 114 (1998). 9. P. Karp, M. Krummenacker, S. Paley, J. Wagg, Trends Biotechnol. 17, 275 (1999). 10. P. D. Karp, Bioinformatics 16, 269 (2000). 11. H. Jeong, S. P. Mason, A.-L. Barabasi, Z. N. Oltvai, Nature 411, 41 (2001). 12. H. Jeong, B. Tombor, R. Albert, Z. N. Oltvai, A.-L. Barabasi, Nature 407, 651 (2000). 13. P. Karp, C. Ouzounis, S. Paley, in Proceedings of the Fourth International Conference on Intelligent Systems for Molecular Biology, D. States, P. Agarwal, T. Gaasterland, L. Hunter, R. Smith, Eds. (American Association for Artificial Intelligence, Menlo Park, CA, 1996). 14. C. Schilling, B. Palsson, Proc. Natl. Acad. Sci. U.S.A. 95, 4193 (1988). 15. J. Edwards, B. Palsson, Proc. Natl. Acad. Sci. U.S.A. 97, 5528 (2000). 16. P. Romero, P. Karp, in Proceedings of the Pacific Symposium on Biocomputing, R. Altman, T. Klein, Eds. (World Scientific, Singapore, 2001), pp T. Winograd, in Representation and Understanding (Academic Press, New York, 1975), pp J. Collado-Vides, I. Paulsen, M. Riley, and M. Saier are collaborators on the EcoCyc project. J. Collado-Vides assisted in the analysis of the E. coli genetic network. S. Paley produced Fig. 2. C. Ouzounis, T. Garvey, A. Rzhetsky, and I. Sim provided valuable comments on this manuscript. The development of EcoCyc, MetaCyc, and the Pathway Tools has been funded by grant 1-R01- RR from the Comparative Medicine Program of the NIH National Center for Research Resources. VIEWPOINT Limits on Silicon Nanoelectronics for Terascale Integration James D. Meindl,* Qiang Chen, Jeffrey A. Davis Throughout the past four decades, silicon semiconductor technology has advanced at exponential rates in both performance and productivity. Concerns have been raised, however, that the limits of silicon technology may soon be reached. Analysis of fundamental, material, device, circuit, and system limits reveals that silicon technology has an enormous remaining potential to achieve terascale integration (TSI) of more than 1 trillion transistors per chip. Such massive-scale integration is feasible assuming the development and economical mass production of doublegate metal-oxide-semiconductor field effect transistors with gate oxide thickness of about 1 nanometer, silicon channel thickness of about 3 nanometers, and channel length of about 10 nanometers. The development of interconnecting wires for these transistors presents a major challenge to the achievement of nanoelectronics for TSI. Silicon technology has advanced at exponential rates in both performance and productivity throughout the past four decades. From School of Electrical and Computer Engineering, Microelectronics Research Center, Georgia Institute of Technology, Atlanta, GA , USA. *To whom correspondence should be addressed. E- mail: james.meindl@mirc.gatech.edu 1960 to 2000, the energy transfer associated with a binary switching transition the canonical digital computing operation decreased by about five orders of magnitude and the number of transistors per chip increased by about nine orders of magnitude. Such exponential advances must eventually come to a halt imposed by a hierarchy of physical limits. The five levels of this hierarchy are defined as fundamental, material, device, circuit, and system (1). A coherent analysis of the key limits at each of these levels reveals that silicon technology has an enormous remaining potential to achieve TSI of more than 1 trillion transistors per chip, with critical device dimensions or channel lengths in the 10-nm range. This potential represents more than a three-decade increase in the number of transistors per chip and more than a one-decade reduction in minimum transistor feature size compared with the state of the art in Fundamental physical limits that are independent of the characteristics of any particular material, device structure, circuit configuration, or system architecture are virtually impenetrable barriers to future advances of TSI. Binary switching transitions implemented with transistors are indispensable to performing computation in a digital system. The energy transfer per binary transition is a revealing metric for comparing the performance of 2044

The EcoCyc Database. January 25, de Nitrógeno, UNAM,Cuernavaca, A.P. 565-A, Morelos, 62100, Mexico;

The EcoCyc Database. January 25, de Nitrógeno, UNAM,Cuernavaca, A.P. 565-A, Morelos, 62100, Mexico; The EcoCyc Database Peter D. Karp, Monica Riley, Milton Saier,IanT.Paulsen +, Julio Collado-Vides + Suzanne M. Paley, Alida Pellegrini-Toole,César Bonavides ++, and Socorro Gama-Castro ++ January 25, 2002

More information

V19 Metabolic Networks - Overview

V19 Metabolic Networks - Overview V19 Metabolic Networks - Overview There exist different levels of computational methods for describing metabolic networks: - stoichiometry/kinetics of classical biochemical pathways (glycolysis, TCA cycle,...

More information

V14 extreme pathways

V14 extreme pathways V14 extreme pathways A torch is directed at an open door and shines into a dark room... What area is lighted? Instead of marking all lighted points individually, it would be sufficient to characterize

More information

Biological Pathways Representation by Petri Nets and extension

Biological Pathways Representation by Petri Nets and extension Biological Pathways Representation by and extensions December 6, 2006 Biological Pathways Representation by and extension 1 The cell Pathways 2 Definitions 3 4 Biological Pathways Representation by and

More information

Written Exam 15 December Course name: Introduction to Systems Biology Course no

Written Exam 15 December Course name: Introduction to Systems Biology Course no Technical University of Denmark Written Exam 15 December 2008 Course name: Introduction to Systems Biology Course no. 27041 Aids allowed: Open book exam Provide your answers and calculations on separate

More information

AN EVIDENCE ONTOLOGY FOR USE IN PATHWAY/GENOME DATABASES

AN EVIDENCE ONTOLOGY FOR USE IN PATHWAY/GENOME DATABASES AN EVIDENCE ONTOLOGY FOR USE IN PATHWAY/GENOME DATABASES Appears in Pacific Symposium on Biocomputing 2004 Pages 190-201, World Scientific Publishers, Singapore P.D. KARP, S. PALEY, C.J. KRIEGER SRI International,

More information

Types of biological networks. I. Intra-cellurar networks

Types of biological networks. I. Intra-cellurar networks Types of biological networks I. Intra-cellurar networks 1 Some intra-cellular networks: 1. Metabolic networks 2. Transcriptional regulation networks 3. Cell signalling networks 4. Protein-protein interaction

More information

86 Part 4 SUMMARY INTRODUCTION

86 Part 4 SUMMARY INTRODUCTION 86 Part 4 Chapter # AN INTEGRATION OF THE DESCRIPTIONS OF GENE NETWORKS AND THEIR MODELS PRESENTED IN SIGMOID (CELLERATOR) AND GENENET Podkolodny N.L. *1, 2, Podkolodnaya N.N. 1, Miginsky D.S. 1, Poplavsky

More information

Cell biology traditionally identifies proteins based on their individual actions as catalysts, signaling

Cell biology traditionally identifies proteins based on their individual actions as catalysts, signaling Lethality and centrality in protein networks Cell biology traditionally identifies proteins based on their individual actions as catalysts, signaling molecules, or building blocks of cells and microorganisms.

More information

Course plan Academic Year Qualification MSc on Bioinformatics for Health Sciences. Subject name: Computational Systems Biology Code: 30180

Course plan Academic Year Qualification MSc on Bioinformatics for Health Sciences. Subject name: Computational Systems Biology Code: 30180 Course plan 201-201 Academic Year Qualification MSc on Bioinformatics for Health Sciences 1. Description of the subject Subject name: Code: 30180 Total credits: 5 Workload: 125 hours Year: 1st Term: 3

More information

V14 Graph connectivity Metabolic networks

V14 Graph connectivity Metabolic networks V14 Graph connectivity Metabolic networks In the first half of this lecture section, we use the theory of network flows to give constructive proofs of Menger s theorem. These proofs lead directly to algorithms

More information

Bioinformatics. Dept. of Computational Biology & Bioinformatics

Bioinformatics. Dept. of Computational Biology & Bioinformatics Bioinformatics Dept. of Computational Biology & Bioinformatics 3 Bioinformatics - play with sequences & structures Dept. of Computational Biology & Bioinformatics 4 ORGANIZATION OF LIFE ROLE OF BIOINFORMATICS

More information

Introduction to Bioinformatics

Introduction to Bioinformatics Systems biology Introduction to Bioinformatics Systems biology: modeling biological p Study of whole biological systems p Wholeness : Organization of dynamic interactions Different behaviour of the individual

More information

Networks & pathways. Hedi Peterson MTAT Bioinformatics

Networks & pathways. Hedi Peterson MTAT Bioinformatics Networks & pathways Hedi Peterson (peterson@quretec.com) MTAT.03.239 Bioinformatics 03.11.2010 Networks are graphs Nodes Edges Edges Directed, undirected, weighted Nodes Genes Proteins Metabolites Enzymes

More information

METABOLIC PATHWAY PREDICTION/ALIGNMENT

METABOLIC PATHWAY PREDICTION/ALIGNMENT COMPUTATIONAL SYSTEMIC BIOLOGY METABOLIC PATHWAY PREDICTION/ALIGNMENT Hofestaedt R*, Chen M Bioinformatics / Medical Informatics, Technische Fakultaet, Universitaet Bielefeld Postfach 10 01 31, D-33501

More information

Chapter 6- An Introduction to Metabolism*

Chapter 6- An Introduction to Metabolism* Chapter 6- An Introduction to Metabolism* *Lecture notes are to be used as a study guide only and do not represent the comprehensive information you will need to know for the exams. The Energy of Life

More information

Pathway Bioinformatics: Inference, Visualization, and Analysis. Peter D. Karp, Ph.D.

Pathway Bioinformatics: Inference, Visualization, and Analysis. Peter D. Karp, Ph.D. Pathway Bioinformatics: Inference, Visualization, and Analysis Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International pkarp@ai.sri.com BioCyc.org EcoCyc.org, MetaCyc.org, HumanCyc.org 1 SRI

More information

2 GENE FUNCTIONAL SIMILARITY. 2.1 Semantic values of GO terms

2 GENE FUNCTIONAL SIMILARITY. 2.1 Semantic values of GO terms Bioinformatics Advance Access published March 7, 2007 The Author (2007). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

More information

BMD645. Integration of Omics

BMD645. Integration of Omics BMD645 Integration of Omics Shu-Jen Chen, Chang Gung University Dec. 11, 2009 1 Traditional Biology vs. Systems Biology Traditional biology : Single genes or proteins Systems biology: Simultaneously study

More information

Integration of functional genomics data

Integration of functional genomics data Integration of functional genomics data Laboratoire Bordelais de Recherche en Informatique (UMR) Centre de Bioinformatique de Bordeaux (Plateforme) Rennes Oct. 2006 1 Observations and motivations Genomics

More information

GOSAP: Gene Ontology Based Semantic Alignment of Biological Pathways

GOSAP: Gene Ontology Based Semantic Alignment of Biological Pathways GOSAP: Gene Ontology Based Semantic Alignment of Biological Pathways Jonas Gamalielsson and Björn Olsson Systems Biology Group, Skövde University, Box 407, Skövde, 54128, Sweden, [jonas.gamalielsson][bjorn.olsson]@his.se,

More information

BIOLOGY 10/11/2014. An Introduction to Metabolism. Outline. Overview: The Energy of Life

BIOLOGY 10/11/2014. An Introduction to Metabolism. Outline. Overview: The Energy of Life 8 An Introduction to Metabolism CAMPBELL BIOLOGY TENTH EDITION Reece Urry Cain Wasserman Minorsky Jackson Outline I. Forms of Energy II. Laws of Thermodynamics III. Energy and metabolism IV. ATP V. Enzymes

More information

Biological Networks: Comparison, Conservation, and Evolution via Relative Description Length By: Tamir Tuller & Benny Chor

Biological Networks: Comparison, Conservation, and Evolution via Relative Description Length By: Tamir Tuller & Benny Chor Biological Networks:,, and via Relative Description Length By: Tamir Tuller & Benny Chor Presented by: Noga Grebla Content of the presentation Presenting the goals of the research Reviewing basic terms

More information

An intelligent client application for on-line astronomical information

An intelligent client application for on-line astronomical information An intelligent client application for on-line astronomical information Chenzhou CUI Chinese Virtual Observatory Project National Astronomical Observatory of China VO concept Virtual Observatory (VO) is

More information

Erzsébet Ravasz Advisor: Albert-László Barabási

Erzsébet Ravasz Advisor: Albert-László Barabási Hierarchical Networks Erzsébet Ravasz Advisor: Albert-László Barabási Introduction to networks How to model complex networks? Clustering and hierarchy Hierarchical organization of cellular metabolism The

More information

Gene Network Science Diagrammatic Cell Language and Visual Cell

Gene Network Science Diagrammatic Cell Language and Visual Cell Gene Network Science Diagrammatic Cell Language and Visual Cell Mr. Tan Chee Meng Scientific Programmer, System Biology Group, Bioinformatics Institute Overview Introduction Why? Challenges Diagrammatic

More information

Department of Astrophysical Sciences Peyton Hall Princeton, New Jersey Telephone: (609)

Department of Astrophysical Sciences Peyton Hall Princeton, New Jersey Telephone: (609) Princeton University Department of Astrophysical Sciences Peyton Hall Princeton, New Jersey 08544-1001 Telephone: (609) 258-3808 Email: strauss@astro.princeton.edu January 18, 2007 Dr. George Helou, IPAC

More information

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it?

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it? Proteomics What is it? Reveal protein interactions Protein profiling in a sample Yeast two hybrid screening High throughput 2D PAGE Automatic analysis of 2D Page Yeast two hybrid Use two mating strains

More information

9/25/2011. Outline. Overview: The Energy of Life. I. Forms of Energy II. Laws of Thermodynamics III. Energy and metabolism IV. ATP V.

9/25/2011. Outline. Overview: The Energy of Life. I. Forms of Energy II. Laws of Thermodynamics III. Energy and metabolism IV. ATP V. Chapter 8 Introduction to Metabolism Outline I. Forms of Energy II. Laws of Thermodynamics III. Energy and metabolism IV. ATP V. Enzymes Overview: The Energy of Life Figure 8.1 The living cell is a miniature

More information

Transitioning BioCyc to a Subscription Model

Transitioning BioCyc to a Subscription Model Transitioning BioCyc to a Subscription Model Peter D. Karp SRI International ecocyc.org biocyc.org metacyc.org BioCyc.org Collection of 9,300 Pathway/Genome Databases Pathway/Genome Database (PGDB) combines

More information

V15 Flux Balance Analysis Extreme Pathways

V15 Flux Balance Analysis Extreme Pathways V15 Flux Balance Analysis Extreme Pathways Stoichiometric matrix S: m n matrix with stochiometries of the n reactions as columns and participations of m metabolites as rows. The stochiometric matrix is

More information

L3.1: Circuits: Introduction to Transcription Networks. Cellular Design Principles Prof. Jenna Rickus

L3.1: Circuits: Introduction to Transcription Networks. Cellular Design Principles Prof. Jenna Rickus L3.1: Circuits: Introduction to Transcription Networks Cellular Design Principles Prof. Jenna Rickus In this lecture Cognitive problem of the Cell Introduce transcription networks Key processing network

More information

BioControl - Week 6, Lecture 1

BioControl - Week 6, Lecture 1 BioControl - Week 6, Lecture 1 Goals of this lecture Large metabolic networks organization Design principles for small genetic modules - Rules based on gene demand - Rules based on error minimization Suggested

More information

Optimality, Robustness, and Noise in Metabolic Network Control

Optimality, Robustness, and Noise in Metabolic Network Control Optimality, Robustness, and Noise in Metabolic Network Control Muxing Chen Gal Chechik Daphne Koller Department of Computer Science Stanford University May 18, 2007 Abstract The existence of noise, or

More information

BIOINFORMATICS: An Introduction

BIOINFORMATICS: An Introduction BIOINFORMATICS: An Introduction What is Bioinformatics? The term was first coined in 1988 by Dr. Hwa Lim The original definition was : a collective term for data compilation, organisation, analysis and

More information

Computational approaches for functional genomics

Computational approaches for functional genomics Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding

More information

Understanding Science Through the Lens of Computation. Richard M. Karp Nov. 3, 2007

Understanding Science Through the Lens of Computation. Richard M. Karp Nov. 3, 2007 Understanding Science Through the Lens of Computation Richard M. Karp Nov. 3, 2007 The Computational Lens Exposes the computational nature of natural processes and provides a language for their description.

More information

An Introduction to Metabolism

An Introduction to Metabolism LECTURE PRESENTATIONS For CAMPBELL BIOLOGY, NINTH EDITION Jane B. Reece, Lisa A. Urry, Michael L. Cain, Steven A. Wasserman, Peter V. Minorsky, Robert B. Jackson Chapter 8 An Introduction to Metabolism

More information

ATLAS of Biochemistry

ATLAS of Biochemistry ATLAS of Biochemistry USER GUIDE http://lcsb-databases.epfl.ch/atlas/ CONTENT 1 2 3 GET STARTED Create your user account NAVIGATE Curated KEGG reactions ATLAS reactions Pathways Maps USE IT! Fill a gap

More information

Chapter 15 Active Reading Guide Regulation of Gene Expression

Chapter 15 Active Reading Guide Regulation of Gene Expression Name: AP Biology Mr. Croft Chapter 15 Active Reading Guide Regulation of Gene Expression The overview for Chapter 15 introduces the idea that while all cells of an organism have all genes in the genome,

More information

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics GENOME Bioinformatics 2 Proteomics protein-gene PROTEOME protein-protein METABOLISM Slide from http://www.nd.edu/~networks/ Citrate Cycle Bio-chemical reactions What is it? Proteomics Reveal protein Protein

More information

Biological Networks. Gavin Conant 163B ASRC

Biological Networks. Gavin Conant 163B ASRC Biological Networks Gavin Conant 163B ASRC conantg@missouri.edu 882-2931 Types of Network Regulatory Protein-interaction Metabolic Signaling Co-expressing General principle Relationship between genes Gene/protein/enzyme

More information

Chapter 8: An Introduction to Metabolism

Chapter 8: An Introduction to Metabolism Chapter 8: An Introduction to Metabolism Key Concepts 8.1 An organism s metabolism transforms matter and energy, subject to the laws of thermodynamics 8.2 The free-energy change of a reaction tells us

More information

Chapter 7: Metabolic Networks

Chapter 7: Metabolic Networks Chapter 7: Metabolic Networks 7.1 Introduction Prof. Yechiam Yemini (YY) Computer Science epartment Columbia University Introduction Metabolic flux analysis Applications Overview 2 1 Introduction 3 Metabolism:

More information

New Results on Energy Balance Analysis of Metabolic Networks

New Results on Energy Balance Analysis of Metabolic Networks New Results on Energy Balance Analysis of Metabolic Networks Qinghua Zhou 1, Simon C.K. Shiu 2,SankarK.Pal 3,andYanLi 1 1 College of Mathematics and Computer Science, Hebei University, Baoding City 071002,

More information

Content Descriptions Based on the Georgia Performance Standards. Biology

Content Descriptions Based on the Georgia Performance Standards. Biology Content Descriptions Based on the Georgia Performance Standards Biology Introduction The State Board of Education is required by Georgia law (A+ Educational Reform Act of 2000, O.C.G.A. 20-2-281) to adopt

More information

CS 229 Project: A Machine Learning Framework for Biochemical Reaction Matching

CS 229 Project: A Machine Learning Framework for Biochemical Reaction Matching CS 229 Project: A Machine Learning Framework for Biochemical Reaction Matching Tomer Altman 1,2, Eric Burkhart 1, Irene M. Kaplow 1, and Ryan Thompson 1 1 Stanford University, 2 SRI International Biochemical

More information

SPA for quantitative analysis: Lecture 6 Modelling Biological Processes

SPA for quantitative analysis: Lecture 6 Modelling Biological Processes 1/ 223 SPA for quantitative analysis: Lecture 6 Modelling Biological Processes Jane Hillston LFCS, School of Informatics The University of Edinburgh Scotland 7th March 2013 Outline 2/ 223 1 Introduction

More information

Pathways that Harvest and Store Chemical Energy

Pathways that Harvest and Store Chemical Energy 6 Pathways that Harvest and Store Chemical Energy Energy is stored in chemical bonds and can be released and transformed by metabolic pathways. Chemical energy available to do work is termed free energy

More information

Lectures on Medical Biophysics Department of Biophysics, Medical Faculty, Masaryk University in Brno. Biocybernetics

Lectures on Medical Biophysics Department of Biophysics, Medical Faculty, Masaryk University in Brno. Biocybernetics Lectures on Medical Biophysics Department of Biophysics, Medical Faculty, Masaryk University in Brno Norbert Wiener 26.11.1894-18.03.1964 Biocybernetics Lecture outline Cybernetics Cybernetic systems Feedback

More information

An Introduction to Metabolism

An Introduction to Metabolism Chapter 8 An Introduction to Metabolism Edited by Shawn Lester PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley

More information

CHEM 121: Chemical Biology

CHEM 121: Chemical Biology Instructors Prof. Jane M. Liu (HS-212) jliu3@drew.edu x3303 Office Hours Anytime my office door is open CHEM 121: Chemical Biology Class MF 2:30-3:45 pm PRE-REQUISITES: CHEM 117 COURSE OVERVIEW This upper-level

More information

Computational Biology Course Descriptions 12-14

Computational Biology Course Descriptions 12-14 Computational Biology Course Descriptions 12-14 Course Number and Title INTRODUCTORY COURSES BIO 311C: Introductory Biology I BIO 311D: Introductory Biology II BIO 325: Genetics CH 301: Principles of Chemistry

More information

SYSTEMS BIOLOGY 1: NETWORKS

SYSTEMS BIOLOGY 1: NETWORKS SYSTEMS BIOLOGY 1: NETWORKS SYSTEMS BIOLOGY Starting around 2000 a number of biologists started adopting the term systems biology for an approach to biology that emphasized the systems-character of biology:

More information

Rick Ebert & Joseph Mazzarella For the NED Team. Big Data Task Force NASA, Ames Research Center 2016 September 28-30

Rick Ebert & Joseph Mazzarella For the NED Team. Big Data Task Force NASA, Ames Research Center 2016 September 28-30 NED Mission: Provide a comprehensive, reliable and easy-to-use synthesis of multi-wavelength data from NASA missions, published catalogs, and the refereed literature, to enhance and enable astrophysical

More information

An Introduction to Metabolism

An Introduction to Metabolism Chapter 8 An Introduction to Metabolism PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from

More information

Genome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting.

Genome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting. Genome Annotation Bioinformatics and Computational Biology Genome Annotation Frank Oliver Glöckner 1 Genome Analysis Roadmap Genome sequencing Assembly Gene prediction Protein targeting trna prediction

More information

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016 Boolean models of gene regulatory networks Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016 Gene expression Gene expression is a process that takes gene info and creates

More information

An Introduction to Metabolism

An Introduction to Metabolism Chapter 8 An Introduction to Metabolism Dr. Wendy Sera Houston Community College Biology 1406 Key Concepts in Chapter 8 1. An organism s metabolism transforms matter and energy, subject to the laws of

More information

An Introduction to Metabolism

An Introduction to Metabolism CAMPBELL BIOLOGY IN FOCUS Urry Cain Wasserman Minorsky Jackson Reece 6 An Introduction to Metabolism Lecture Presentations by Kathleen Fitzpatrick and Nicole Tunbridge Overview: The Energy of Life The

More information

Biochemistry: A Review and Introduction

Biochemistry: A Review and Introduction Biochemistry: A Review and Introduction CHAPTER 1 Chem 40/ Chem 35/ Fundamentals of 1 Outline: I. Essence of Biochemistry II. Essential Elements for Living Systems III. Classes of Organic Compounds IV.

More information

FCModeler: Dynamic Graph Display and Fuzzy Modeling of Regulatory and Metabolic Maps

FCModeler: Dynamic Graph Display and Fuzzy Modeling of Regulatory and Metabolic Maps FCModeler: Dynamic Graph Display and Fuzzy Modeling of Regulatory and Metabolic Maps Julie Dickerson 1, Zach Cox 1 and Andy Fulmer 2 1 Iowa State University and 2 Proctor & Gamble. FCModeler Goals Capture

More information

Network Biology: Understanding the cell s functional organization. Albert-László Barabási Zoltán N. Oltvai

Network Biology: Understanding the cell s functional organization. Albert-László Barabási Zoltán N. Oltvai Network Biology: Understanding the cell s functional organization Albert-László Barabási Zoltán N. Oltvai Outline: Evolutionary origin of scale-free networks Motifs, modules and hierarchical networks Network

More information

ADVANCED PLACEMENT BIOLOGY

ADVANCED PLACEMENT BIOLOGY ADVANCED PLACEMENT BIOLOGY Description Advanced Placement Biology is designed to be the equivalent of a two-semester college introductory course for Biology majors. The course meets seven periods per week

More information

An Introduction to Metabolism

An Introduction to Metabolism An Introduction to Metabolism I. All of an organism=s chemical reactions taken together is called metabolism. A. Metabolic pathways begin with a specific molecule, which is then altered in a series of

More information

Outline. Terminologies and Ontologies. Communication and Computation. Communication. Outline. Terminologies and Vocabularies.

Outline. Terminologies and Ontologies. Communication and Computation. Communication. Outline. Terminologies and Vocabularies. Page 1 Outline 1. Why do we need terminologies and ontologies? Terminologies and Ontologies Iwei Yeh yeh@smi.stanford.edu 04/16/2002 2. Controlled Terminologies Enzyme Classification Gene Ontology 3. Ontologies

More information

General Biology. The Energy of Life The living cell is a miniature factory where thousands of reactions occur; it converts energy in many ways

General Biology. The Energy of Life The living cell is a miniature factory where thousands of reactions occur; it converts energy in many ways Course No: BNG2003 Credits: 3.00 General Biology 5. An Introduction into Cell Metabolism The Energy of Life The living cell is a miniature factory where thousands of reactions occur; it converts energy

More information

The architecture of complexity: the structure and dynamics of complex networks.

The architecture of complexity: the structure and dynamics of complex networks. SMR.1656-36 School and Workshop on Structure and Function of Complex Networks 16-28 May 2005 ------------------------------------------------------------------------------------------------------------------------

More information

Gene Ontology and overrepresentation analysis

Gene Ontology and overrepresentation analysis Gene Ontology and overrepresentation analysis Kjell Petersen J Express Microarray analysis course Oslo December 2009 Presentation adapted from Endre Anderssen and Vidar Beisvåg NMC Trondheim Overview How

More information

Models and Languages for Computational Systems Biology Lecture 1

Models and Languages for Computational Systems Biology Lecture 1 Models and Languages for Computational Systems Biology Lecture 1 Jane Hillston. LFCS and CSBE, University of Edinburgh 13th January 2011 Outline Introduction Motivation Measurement, Observation and Induction

More information

CHEM 181: Chemical Biology

CHEM 181: Chemical Biology Instructor Prof. Jane M. Liu (SN-216) jane.liu@pomona.edu CHEM 181: Chemical Biology Office Hours Anytime my office door is open or by appointment COURSE OVERVIEW Class TR 8:10-9:25 am Prerequisite: CHEM115

More information

Lab 2 Worksheet. Problems. Problem 1: Geometry and Linear Equations

Lab 2 Worksheet. Problems. Problem 1: Geometry and Linear Equations Lab 2 Worksheet Problems Problem : Geometry and Linear Equations Linear algebra is, first and foremost, the study of systems of linear equations. You are going to encounter linear systems frequently in

More information

Name Period The Control of Gene Expression in Prokaryotes Notes

Name Period The Control of Gene Expression in Prokaryotes Notes Bacterial DNA contains genes that encode for many different proteins (enzymes) so that many processes have the ability to occur -not all processes are carried out at any one time -what allows expression

More information

Analysis and Simulation of Biological Systems

Analysis and Simulation of Biological Systems Analysis and Simulation of Biological Systems Dr. Carlo Cosentino School of Computer and Biomedical Engineering Department of Experimental and Clinical Medicine Università degli Studi Magna Graecia Catanzaro,

More information

CSE370: Introduction to Digital Design

CSE370: Introduction to Digital Design CSE370: Introduction to Digital Design Course staff Gaetano Borriello, Brian DeRenzi, Firat Kiyak Course web www.cs.washington.edu/370/ Make sure to subscribe to class mailing list (cse370@cs) Course text

More information

Integrated Knowledge-based Reverse Engineering of Metabolic Pathways

Integrated Knowledge-based Reverse Engineering of Metabolic Pathways Integrated Knowledge-based Reverse Engineering of Metabolic Pathways Shuo-Huan Hsu, Priyan R. Patkar, Santhoi Katare, John A. Morgan and Venkat Venkatasubramanian School of Chemical Engineering, Purdue

More information

An Introduction to Metabolism

An Introduction to Metabolism Chapter 8 An Introduction to Metabolism PowerPoint Lecture Presentations for Biology Eighth Edition Neil Campbell and Jane Reece Lectures by Chris Romero, updated by Erin Barley with contributions from

More information

STAAR Biology Assessment

STAAR Biology Assessment STAAR Biology Assessment Reporting Category 1: Cell Structure and Function The student will demonstrate an understanding of biomolecules as building blocks of cells, and that cells are the basic unit of

More information

An Introduction to Metabolism

An Introduction to Metabolism An Introduction to Metabolism PREFACE The living cell is a chemical factory with thousands of reactions taking place, many of them simultaneously This chapter is about matter and energy flow during life

More information

Biological networks CS449 BIOINFORMATICS

Biological networks CS449 BIOINFORMATICS CS449 BIOINFORMATICS Biological networks Programming today is a race between software engineers striving to build bigger and better idiot-proof programs, and the Universe trying to produce bigger and better

More information

An Introduction to Metabolism

An Introduction to Metabolism An Introduction to Metabolism PREFACE The living cell is a chemical factory with thousands of reactions taking place, many of them simultaneously This chapter is about matter and energy flow during life

More information

A Multiobjective GO based Approach to Protein Complex Detection

A Multiobjective GO based Approach to Protein Complex Detection Available online at www.sciencedirect.com Procedia Technology 4 (2012 ) 555 560 C3IT-2012 A Multiobjective GO based Approach to Protein Complex Detection Sumanta Ray a, Moumita De b, Anirban Mukhopadhyay

More information

networks in molecular biology Wolfgang Huber

networks in molecular biology Wolfgang Huber networks in molecular biology Wolfgang Huber networks in molecular biology Regulatory networks: components = gene products interactions = regulation of transcription, translation, phosphorylation... Metabolic

More information

Introduction to Bioinformatics

Introduction to Bioinformatics CSCI8980: Applied Machine Learning in Computational Biology Introduction to Bioinformatics Rui Kuang Department of Computer Science and Engineering University of Minnesota kuang@cs.umn.edu History of Bioinformatics

More information

Metabolic modelling. Metabolic networks, reconstruction and analysis. Esa Pitkänen Computational Methods for Systems Biology 1 December 2009

Metabolic modelling. Metabolic networks, reconstruction and analysis. Esa Pitkänen Computational Methods for Systems Biology 1 December 2009 Metabolic modelling Metabolic networks, reconstruction and analysis Esa Pitkänen Computational Methods for Systems Biology 1 December 2009 Department of Computer Science, University of Helsinki Metabolic

More information

Compare and contrast the cellular structures and degrees of complexity of prokaryotic and eukaryotic organisms.

Compare and contrast the cellular structures and degrees of complexity of prokaryotic and eukaryotic organisms. Subject Area - 3: Science and Technology and Engineering Education Standard Area - 3.1: Biological Sciences Organizing Category - 3.1.A: Organisms and Cells Course - 3.1.B.A: BIOLOGY Standard - 3.1.B.A1:

More information

Dynamic optimisation identifies optimal programs for pathway regulation in prokaryotes. - Supplementary Information -

Dynamic optimisation identifies optimal programs for pathway regulation in prokaryotes. - Supplementary Information - Dynamic optimisation identifies optimal programs for pathway regulation in prokaryotes - Supplementary Information - Martin Bartl a, Martin Kötzing a,b, Stefan Schuster c, Pu Li a, Christoph Kaleta b a

More information

BA, BSc, and MSc Degree Examinations

BA, BSc, and MSc Degree Examinations Examination Candidate Number: Desk Number: BA, BSc, and MSc Degree Examinations 2017-8 Department : BIOLOGY Title of Exam: Molecular Biology and Biochemistry Part I Time Allowed: 1 hour and 30 minutes

More information

Slide 1 / Describe the setup of Stanley Miller s experiment and the results. What was the significance of his results?

Slide 1 / Describe the setup of Stanley Miller s experiment and the results. What was the significance of his results? Slide 1 / 57 1 Describe the setup of Stanley Miller s experiment and the results. What was the significance of his results? Slide 2 / 57 2 Explain how dehydration synthesis and hydrolysis are related.

More information

C. Schedule Description: An introduction to biological principles, emphasizing molecular and cellular bases for the functions of the human body.

C. Schedule Description: An introduction to biological principles, emphasizing molecular and cellular bases for the functions of the human body. I. CATALOG DESCRIPTION: A. Division: Science Department: Biology Course ID: BIOL 102 Course Title: Human Biology Units: 4 Lecture: 3 hours Laboratory: 3 hours Prerequisite: None B. Course Description:

More information

AP Biology Review Chapters 6-8 Review Questions Chapter 6: Metabolism: Energy and Enzymes Chapter 7: Photosynthesis Chapter 8: Cellular Respiration

AP Biology Review Chapters 6-8 Review Questions Chapter 6: Metabolism: Energy and Enzymes Chapter 7: Photosynthesis Chapter 8: Cellular Respiration AP Biology Review Chapters 6-8 Review Questions Chapter 6: Metabolism: Energy and Enzymes 1. Understand and know the first and second laws of thermodynamics. What is entropy? What happens when entropy

More information

Teacher Notes for How do biological organisms use energy? 1

Teacher Notes for How do biological organisms use energy? 1 Teacher Notes for How do biological organisms use energy? 1 This analysis and discussion activity introduces students to the basic principles of how biological organisms use energy. The focus is on understanding

More information

Grade Level: AP Biology may be taken in grades 11 or 12.

Grade Level: AP Biology may be taken in grades 11 or 12. ADVANCEMENT PLACEMENT BIOLOGY COURSE SYLLABUS MRS. ANGELA FARRONATO Grade Level: AP Biology may be taken in grades 11 or 12. Course Overview: This course is designed to cover all of the material included

More information

An Introduction to Metabolism

An Introduction to Metabolism LECTURE PRESENTATIONS For CAMPBELL BIOLOGY, NINTH EDITION Jane B. Reece, Lisa A. Urry, Michael L. Cain, Steven A. Wasserman, Peter V. Minorsky, Robert B. Jackson Chapter 8 An Introduction to Metabolism

More information

Supplementary Information

Supplementary Information Supplementary Information For the article"comparable system-level organization of Archaea and ukaryotes" by J. Podani, Z. N. Oltvai, H. Jeong, B. Tombor, A.-L. Barabási, and. Szathmáry (reference numbers

More information

Keystone Exams: Biology Assessment Anchors and Eligible Content. Pennsylvania Department of Education

Keystone Exams: Biology Assessment Anchors and Eligible Content. Pennsylvania Department of Education Assessment Anchors and Pennsylvania Department of Education www.education.state.pa.us 2010 PENNSYLVANIA DEPARTMENT OF EDUCATION General Introduction to the Keystone Exam Assessment Anchors Introduction

More information

Goal 1: Develop knowledge and understanding of core content in biology

Goal 1: Develop knowledge and understanding of core content in biology Indiana State University» College of Arts & Sciences» Biology BA/BS in Biology Standing Requirements s Library Goal 1: Develop knowledge and understanding of core content in biology 1: Illustrate and examine

More information

International Journal of Scientific & Engineering Research, Volume 6, Issue 2, February ISSN

International Journal of Scientific & Engineering Research, Volume 6, Issue 2, February ISSN International Journal of Scientific & Engineering Research, Volume 6, Issue 2, February-2015 273 Analogizing And Investigating Some Applications of Metabolic Pathway Analysis Methods Gourav Mukherjee 1,

More information

Chapter 8 Notes. An Introduction to Metabolism

Chapter 8 Notes. An Introduction to Metabolism Chapter 8 Notes An Introduction to Metabolism Describe how allosteric regulators may inhibit or stimulate the activity of an enzyme. Objectives Distinguish between the following pairs of terms: catabolic

More information

Computational Systems Biology

Computational Systems Biology Computational Systems Biology Vasant Honavar Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Graduate Program Center for Computational Intelligence, Learning, & Discovery

More information