DECOMPOSITION OF GENE REGULATORY NETWORKS INTO FUNCTIONAL PATHS AND THEIR MATCHING WITH MICROARRAY GENE EXPRESSION PROFILES
|
|
- Mabel Short
- 5 years ago
- Views:
Transcription
1 DECOMPOSITION OF GENE REGULATORY NETWORKS INTO FUNCTIONAL PATHS AND THEIR MATCHING WITH MICROARRAY GENE EXPRESSION PROFILES A. Kanterakis*, D. Kafetzopoulos**, V. Moustakis*** and G. Potamias* * FORTH/Institute of Computer Science (ICS), Heraklion, Greece * FORTH/Institute of Molecular Biology & Biotechnology, Heraklion (IMBB), Greece *** TUC/Department of Production Engineering and Management, Chania, Greece {kantale,moustaki,potamias}@ics.forth.gr, kafetzo@imbb.forth.gr Abstract: Gene Regulatory Networks (GRNs) and DNA Microarrays (MAs) present two of the most prominent and heavily researched concepts in contemporary molecular biology and bioinformatics. GRNs model the interfering relations among gene products during the regulation of the cell function. MAs measure the simultaneous expression profile of thousands of genes. In an effort to combine these two sources of biological information, concurrent studies try either to 'build from scratch' or, focus on the expression profile of genes regulated in a specific GRN. Our work aims to discover the functional parts of GRNs by identifying consistencies and inconsistencies with respective MA gene-expression data. Specifically, GRNs are decomposed into all possible functional paths. Functional GRN paths from the repository are then matched against microarray gene-expression profiles of samples (with different clinical phenotype). In this way the underlying mechanism the highly matched and phenotype-differentiating functional paths that guide to specific gene-expression profiles are uncovered. The discovery guides the finer re-classification of samples with profound diagnostic and prognostic potential, and/or triggers further molecular biology research. Preliminary results on applying our methodology on a real-world microarray study and targeting the Apoptosis gene-regulatory network are encouraging and demonstrate the suitability, efficiency and reliability of the approach. 1. Introduction Current post-genomics bioinformatics research seeks for methods that not only combine the information from dispersed and heterogeneous data sources but distil the knowledge and provide a systematic, genome-scale view of biology [1]. The advantage of this approach is that it can identify emergent properties of the underlying molecular system as a whole an endeavour of limited success if targeted genes, reactions or even molecular pathways are studied in isolation [2]. Individuals show different phenotypes for the same disease they respond differently to drugs and sometimes the effects are unpredictable. Many of the genes examined in early clincio-genomic studies were linked to single-gene traits, but future advances engage the elucidation of multi-gene determinants of drug response. Differences in the individuals background DNA code but mainly, differences in the underlying gene regulation mechanisms alter the expression or function of proteins being targeted by drugs, and contribute significantly to variation in the responses of individuals. The challenge is to accelerate our understanding of the molecular mechanisms of these variations and to produce targeted individualized therapies. Two of the most promising and advanced technologies and notions of contemporary bioinformatics research are Microarrays (MA) and Gene Regulatory Networks (GRNs). Combining knowledge from both knowledge sources will accelerate our understanding of the molecular mechanisms of genome variations and will guide the discovery of targeted individualized therapies. Faced with such a challenge we devised and present an integrated methodology that amalgamates knowledge and data from both GRNs and MA geneexpression sources. The methodology aims to uncover potential gene-regulatory fingerprints and mechanisms that govern the genomic profiles of diseases. A preliminary implementation of our methodology is made in a system called MinePath. 2. MAs, GRNs and their Combination Microarrays. With the recent advances in MA technology [3, 4], the potential for molecular diagnostic and prognostic tools seem to come in reality. The last years, microarray-chips have been devised and manufactured in order to measure the expression profile of thousands of genes. The aim is to add molecular characteristics to the classification of diseases so that diagnostic procedures are enhanced and prognostic predictions are improved. These studies demonstrate that gene-expression profiling has great potential in identifying and predicting various targets and prognostic factors of diseases. By measuring transcription levels of genes in an organism under various conditions, in different tissue samples, we can build up gene expression profiles, which characterize the dynamic 1
2 functioning of each gene in the genome. The microarray data are represented in a matrix with rows representing genes, columns representing samples (e.g. various tissues, developmental stages and treatments), and each cell containing a number characterizing the expression level of the particular gene in the particular sample - gene expression matrix. Gene Regulatory Networks. Gene regulatory networks (GRNs) are network structures that depict the interaction of DNA segments during the transcription of the genes into mrna. The prominent and vital role of GRNs in the study of various biology processes is a major sector in contemporary biology research, where numerous thorough studies have been made [5, 6]. Each network has inputs, usually genes/proteins or transcription factors that initiate the network function, and outputs that usually is a certain gene. The network by itself acts as a mechanism that determines cellular behaviour where the nodes are genes and edges are functions that represent the molecular reactions between the nodes. These functions can be perceived as Boolean functions, where nodes have only two possible states ( on and off ), and the whole network represented as a simple directed graph [7]. It is indicative that most of the relations in known and established GRNs have been derived from laborious and extensive laboratory experiments and careful study of the existing biochemical literature. Thus GRNs are far from complete. Current efforts focus on the reconstruction of GRNs by exploring gene-expression data. Specifically, Babu in [8] identified that network topologies extracted from gene co-expression events could discover motifs and regulatory hubs that can characterize the entire cellular states and guide further the pharmaceutical research. Combining MAs and GRNs: The Needs. MA experiments involve more variables (genes) than samples (patients). This fact, leads to results with poor biological significance. To remedy this there is an open debate whether we should concentrate on gathering more data or on building new algorithms. Simon et al., in [9] published a very strict criticism on common pitfalls on microarray data mining while in [10] commented about the bias in the gene selection processes. Another significant aspect is the noisy content of the experiment. Appropriate statistical analysis of noisy data is very important in order to obtain meaningful biological information [11, 12]. Evidence on this is given by the fact that different methods produce gene lists (i.e., gene-markers and molecular signatures) that are strikingly different. Very few methods of gene regulatory inference are considered superior to the others mainly because of the intrinsically noisy property of the data, the curse of dimensionality, and the unknown true underlying networks. The study of the function, structure and evolution of GRNs in combination with microarray gene-expression profiles and data is essential for contemporary biology research. Cell, tissues, organs, organisms or any other biological systems defined by evolution are essentially complex physicochemical systems. They consist of numerous dynamic networks of biochemical reactions and signalling interactions between active cellular components. This cellular complexity has made it difficult to build a complete understanding of cellular machinery to achieve a specific purpose [13]. To circumvent this complexity, microarrays, biology knowledge and biology networks can be combined in order to document and support the detected and predicted interactions [14]. The advances and tools that each discipline carries can be integrated in a holistic and generic perspective so that the chaotic complexity of biology networks can be traced down. 3. Decomposition of GRNs Existing GRNs databases provide us with widely utilized networks of proved molecular validity. The most known are network that describe important cellular processes such as cell-cycle, apoptosis, signaling, and regulation of important growth factors. Online public repositories contain a variety of information that includes not only the network per se but links to respective nodes (genes) and edge (regulation) metadata and annotation. Currently MinePath utilizes the KEGG pathways repository (Kyoto Encyclopedia of Genes and Genomes; KEGG provides a standardized format representation operationalised by its own markup description language, the KGML (KEGG Markup Language; genome.jp/kegg/xml/). Figure 1. Path decomposition. Top: A target part of the KEGG cell-cycle GRN; Bottom: The five decomposed paths for the tergated path part - all possible functional routes taking place during network regulaion machinery. Our methodology relies on a novel approach for GRN processing that takes into account all possible interpretations of the network. The different GRN interpretations correspond to the different functional paths that can be followed during the regulation of a target gene. Different GRNs are downloaded from the KEGG repository. With an XML parser we obtain all the internal network semantics (see next sub-section). In a subsequent step, all possible and functional network 2
3 paths are extracted as exemplified in Figure 1 above. Each functional path is annotated with the possible valid values according to Kauffman s principles that follow a binary setting: each gene in a functional path can be either ON or OFF. According to Kauffman [7], the following functional gene-regulatory semantics apply: (a) the network is a directed graph with genes (inputs and outputs) being the graph nodes and the edges between them representing the casual (regulatory) links between them; (b) each node can be in one of the two states: on or off ; (c) for a gene, on corresponds to the gene being expressed (i.e., the respective substance being present); and (d) time is viewed as proceeding in discrete steps - at each step, the new state of a node is a Boolean function of the prior states of the nodes with arrows pointing towards it. KEGG encompasses and models a variety of regulation links (edges). Figure 2 shows these links accompanied with their underlying notation and semantics. Figure 2. Different functional relationships between genes (or group of genes) Since the regulation edge connecting two genes defines explicitly the possible values of each gene, we can set all possible state-values that a gene may take in a path. Thus, each extracted path contains not only the relevant sub-graph but the state-values ( ON or OFF ) of each gene as well. The only requirement concerns the assumption that for a path being functional, the path should be active during the GRN regulation process. In other words we assume that all genes in a path are functionally active. For example assume the functional path A B ( is an activation/expression regulatory relation). If gene A is on an OFF state then, gene B is not allowed to be on an ON state - B could become ON only and only if it is activated/expressed by another gene in a different functional path, e.g., C B). If we had allowed non-functional genes to have arbitrary values then the significant paths would be more likely to be noisy rather than of biological importance. The extracted and annotated paths are stored in a database that acts as a repository for future reference. Through this repository we can query paths being parts of targeted GRN, contain specific genes or, involve a specific regulatory relation. Moreover, the stored paths can be combined and analyzed in the view of specific microarray experiments and respective gene-expression sample profiles. Furthermore, as the database repository contain and retrieves functional paths from a variety of different GRNs (e.g., cell-cycle, apoptosis etc), we may combine knowledge (i.e., the functional paths) from different molecular pathways and networks a major need for molecular biology and a big challenge for contemporary bioinformatics research. 3. Matching GRN Functional-Paths with MA Gene-Expression Profiles The next step is to locate microarray experiments and respective gene-expression data where we expect the targeted GRN to play an important role. For example the cell-cycle and apoptosis GRNs play an important role in cancer studies and respective experiments dealing with tumor progression. With a geneexpression/functional-path matching operation, the valid and most prominent GRN functional paths are identified. In order to combine and match GRN paths with microarray data the respective gene-expression values should be transformed into two (binary) states - on and off. Microarray gene-expression discretization is a popular method to indicate vigorously the expressed (up-regulated/ on ) and non-expressed (downregulated/ off ) genes. Currently MinePath encompasses an information-theoretic process for the binary transformation of gene-expression values. The process pre-supposes the categorization of the input microarray samples into two classes and is based on an information-theoretic setting. A detailed presentation of the discretization process may be found in [15]. GRN and MA gene-expression data matching aims to differentiate GRN functional paths and identify the most prominent of them with respect to the gene-expression profiles of the input samples. In other words, the quest is for those functional paths that exhibit high matching scores for one of class and low matching scores for the other. This is a paradigm shift from mining for genes with differential expression, to mining for subparts of GRN with differential function. The procedure for differential path identification is inherently simple. Assume five input samples {S 1, S 2, S 3 } and {S 4, S 5 } assigned to (phenotypic) class POS and NEG, respectively, and engages five genes {g 1, g 2, g 3, g4, g 5 }. Furthermore assume that we are about to explore (target) an artificial GRN being decomposed into four functional-paths {P 1, P 2, P 3, P 4 } (see Figure 3). For matching a sample with a specific path we consider just the genes engaged by this path. Our notion of matching is realized as a consistency relation between functional GRN paths and gene-expression profiles. In other words, we care just for those GRN parts which represent causal putative molecular mechanisms that govern the expression status of genes for specific samples. For example, sample S 1 matches perfectly path P 1 even if this path engages just two of the total five genes. In contrast, S 5 does not match perfectly path P 4 even if both share the same ( on ) state for genes g 1 and g 4, they differ in the state of gene g 5 ( on in path P 4 and off in sample S 5 ). 3
4 Figure 3. Matching GRN functional paths with samples geneexpression profiles; red and light-green coloring represents on and off states of genes, respectively, and represent activation/expression and inhibition relations between genes, respectively.). In general, for each path we compute the number of samples that is consistent for each class. Suppose that there are S 1 and S 2 samples belonging to the first and second class, respectively. Assume that path P i is consistent with S i;1 and S i;2 samples form the first and second class, respectively. The formula below, computes the differential power of the specific path with respect to the two classes. dp(p i ) = ( Si; 1 / S 1) ( Si;2 / S 2) Ranking of paths according to the above formula, provides the most differentiating and prominent GRN functional paths for the respective phenotypic classes. These paths uncover putative molecular mechanisms that govern specific phenotypes (disease, disease states, drug responses etc). For the example of figure 3 the following matching path/samples matching and differentiating power matrix is computed. Examining the matrix we may identify a path, path P 2 (shaded), that differentiates perfectly between the two classes it is a path that matches just class POS samples and none from class NEG. So, a direct conclusion is that a putative molecular mechanism governing class 'POS' is represented by functional path P 2. The introduced dp formula can be enriched so that longer consistent paths acquire stronger power. It can also be relaxed so that consistent presents a continuous, rather than a Boolean indicator. Finally we may introduce unknown values for missing and erroneous gene expression values. 4. Preliminary Experiments and Results We applied the presented methodology on a real-world, and widely utilized, cancer-related microarray study. The study concerns the gene-expression profiling of breast-cancer (BRCA) patients. The targeted phenotypic classes concern the good and bad prognosis of the patients; good and bad modelled as >5 - and 5 - years free-of-relapse, respectively. The study concerns 34 and 44 gene-expression profiles for RELAPSE and NON-RELAPSE patients, respectively. For a detailed description of this clinicogenomic study please refer to [Veer]. In the aforementioned study publication, the researchers were able to identify a molecular signature of 70 differentially expressed genes the expression profiles of which was capable of adequately distinguish between the two phenotypic classes. But there was no evidential reporting on the underlying gene-regulatory mechanisms governing the expression status of the identified molecular signature. In an effort to identify such mechanisms we applied the introduced methodology on the respective BRCA microarray dataset, targeting the Apoptosis regulatory network ( hsa04210.html). Our conjecture is: for RELAPSE patients the gene-regulating molecular mechanisms guiding to apoptosis should be blocked, with the inverse for the NON-RELAPSE patients. A total of 406 consistent matching paths were identified, each one matching with a different degree the different samples. Moreover, each of these paths exhibit different differential power with respect to the two classes. Based on the computed (by the introduced dp formula) differential power of paths, we were able to identify: (i) six paths that match only NON-RELAPSE samples and none of the RELAPSE ones; and (ii) just one path for the inverse case; see Figure 5. In the figure, the on and off genes are represented with red and light-green colour, respectively. The parts of the Apoptosis GRN where the six identified paths are active for the NON- RELAPSE, and the one being active for the RELAPSE cases are enclosed in the blue and red rectangles, respectively. RELAPSE path: CASP87 (caspase7, apoptosisrelated cysteine peptidase) expresses/activates DFFA (DNA fragmentation factor) which indirectly guides to Degradation effects, and which also dissociates (represented with ) DFFB (DNA fragmentation factor) making it inactive. As DFFB guides to Apoptosis we may conclude that apoptosis is blocked. So, the specific path presents an anti-apoptotic molecular behavior which is consistent with cancerrelapse cases. NON-RELAPSE path: This part of the Apoptosis GRN permits apoptotic events. In the upper-part of the blue rectangle notice the on (red) genes being active (their regulation is indicated by the blue arrows). A putative mechanism guiding to apoptosis is identified by the bold-shaded arrows that engage the Cleavage of Caspase Substrate cascade which indirectly guides to apoptosis. Following the down part of the 4
5 rectangle, and because of the activation of MAP3K14 (mitogen-activated protein kinase kinase kinase 14) which in turn phosphorylates IKK (CHUK - conserved helix-loop-helix ubiquitous kinase) a whole GRN sub-network should be activated (shaded elliptic area). This sub-network guides to degradation and anti-survival events. From this, we may conclude that the MAP3K phosphorylation - --> IKK is probably not active. This finding may present a promising hint for further molecular lab experimentation. paths, and the matching of these paths with samples gene-expression profiles. An initial implementation of the whole methodology is made in a system called MinePath. MinePath was applied in a real-world and widely utilised gene-expression breast-cancer with encouraging results that demonstrate the suitability and reliability of the whole approach. Among others, our on-going and immediate research on the field include: (a) further experimentation with various real-world microarray studies and different GRN targets (accompanied with the evaluation of results form molecular biology experts); (b) extension of pathdecomposition to multiple GRNs; (c) elaboration on more sophisticated path/gene-expression sample matching formulas and operations; (d) incorporation of different gene nomenclatures in order to cope with microarray experiments from different platforms and encodings; and (e) porting of the whole methodology in a Web-Services and workflow environment. References Figure 5. The Apoptosis -related RELAPSE vs. NON- RELAPSE putative molecular mechanisms. From the above findings it should be evident that there are two different putative molecular regulatory mechanisms that differentiate between RELAPSE and NON-RELAPSE behaviours in BRCA cases. The discovered and identified differentiated paths may be of high value for deciding treatment plans and potential therapeutic targets in drug design processes. Of course, a more complete picture could be achieved when more regulatory networks take part in the analysis and more differentiating paths are identified, e.g., cell-cycle, p53 (tumour suppressor gene) signalling pathways etc. Moreover, all the identified functional paths share a number of common genes that exhibit the same ( on or off ) for both classes. So, with a standard gene-selection approach these genes could not be highly ranked and selected as potential gene-markers. It is the power of their regulation and not the state of genes themselves that makes the difference! This is a realization of the already mentioned paradigm shift: from mining for genes with differential expression, to mining for subparts of GRN with differential function. 4. Conclusions We have presented an integrated methodology for the combined mining of both GRNs and MA geneexpression profiles. In the heart of the methodology is the decomposition of GRNs into all possible functional [1] T. Ideker, T. Galitski and L. Hood, A new approach to decoding life: systems biology, Annu Rev Genomics Hum Genet, 2, (2001). [2] F.S. Collins, E.D. Green, A. E. Guttmacher and M. S. Guyer, A Vision for the Future of Genomics Research, Nature, 422(6934), (2003). [3] H.F. Friend, How DNA microarrays and expression profiling will affect clinical practice, Br Med J., 319, 1-2 (1999). [4] D.E. Bassett, M.B. Eisen, and M.S. Boguski, Gene expression informatics: it s all in your mine, Nature Genetics, 21(Supplement 1), (1999). [5] J. M. Bower and H.Bolouri, Computational Modeling of Genetic and Biochemical Networks, Computational Molecular Biology Series, MIT Press, [6] B. Alberts, A. Johnson, J. Lewis, M. Raff, K. Roberts, and P. Walter, Molecular Biology of the Cell, Garland Science, New York, [7] S. A. Kauffman, The Origins of Order: Self- Organization and Selection in Evolution, Oxford Univ. Press, New York, [8] N.M. Babu, N.M. Luscombe, L. Aravind, M. Gerstein and S.A. Teichmann, Structure and evolution of transcriptional regulatory networks, Curr. Opin. Struct. Biol., 14, (2004). [9] R. Simon, M. D. Radmacher, K. Dobbin and L. M. McShane, Pitfalls in the Use of DNA Microarray Data for Diagnostic Classification, Journal of the National Cancer Institute, 95(1), 14-18, (2003). [10] Ambroise and G. J. McLachlan, Selection bias in gene extraction on the basis of microarray gene-expression data, PNAS, 99(10), , (2002). [11] D.K. Slonim, From pattern to pathways: gene expression data analysis comes of age, Nature Genetics, 32, (2002). [12] J. Quackenbush, Computational Analysis of Microarray Data, Nature Reviews Genetics, 2, (2001). [13] H. Kitano, Systems biology: a brief overview, Science, 295(5560), (2002). [14] K. Kwoh and P. Y. Ng, Network analysis approach for biology, Cell. Mol. Life Sci., 64, (2007). [15] G. Potamias, L. Koumakis and V. Moustakis, Gene Selection via Discretized Gene-Expression Profiles and Greedy Feature-Elimination, LNAI, 3025, (2004). 5
Introduction to Bioinformatics
Systems biology Introduction to Bioinformatics Systems biology: modeling biological p Study of whole biological systems p Wholeness : Organization of dynamic interactions Different behaviour of the individual
More informationComputational Systems Biology
Computational Systems Biology Vasant Honavar Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Graduate Program Center for Computational Intelligence, Learning, & Discovery
More informationIntroduction to Bioinformatics
CSCI8980: Applied Machine Learning in Computational Biology Introduction to Bioinformatics Rui Kuang Department of Computer Science and Engineering University of Minnesota kuang@cs.umn.edu History of Bioinformatics
More informationTypes of biological networks. I. Intra-cellurar networks
Types of biological networks I. Intra-cellurar networks 1 Some intra-cellular networks: 1. Metabolic networks 2. Transcriptional regulation networks 3. Cell signalling networks 4. Protein-protein interaction
More informationUNIVERSITY OF CALIFORNIA SANTA BARBARA DEPARTMENT OF CHEMICAL ENGINEERING. CHE 154: Engineering Approaches to Systems Biology Spring Quarter 2004
UNIVERSITY OF CALIFORNIA SANTA BARBARA DEPARTMENT OF CHEMICAL ENGINEERING CHE 154: Engineering Approaches to Systems Biology Spring Quarter 2004 Lecture: Tue/Thu 9:30-10:45am Engr-II Room 3301 Instructor
More informationInternational Journal of Scientific & Engineering Research, Volume 6, Issue 2, February ISSN
International Journal of Scientific & Engineering Research, Volume 6, Issue 2, February-2015 273 Analogizing And Investigating Some Applications of Metabolic Pathway Analysis Methods Gourav Mukherjee 1,
More informationWritten Exam 15 December Course name: Introduction to Systems Biology Course no
Technical University of Denmark Written Exam 15 December 2008 Course name: Introduction to Systems Biology Course no. 27041 Aids allowed: Open book exam Provide your answers and calculations on separate
More informationBioinformatics. Dept. of Computational Biology & Bioinformatics
Bioinformatics Dept. of Computational Biology & Bioinformatics 3 Bioinformatics - play with sequences & structures Dept. of Computational Biology & Bioinformatics 4 ORGANIZATION OF LIFE ROLE OF BIOINFORMATICS
More informationBiological Pathways Representation by Petri Nets and extension
Biological Pathways Representation by and extensions December 6, 2006 Biological Pathways Representation by and extension 1 The cell Pathways 2 Definitions 3 4 Biological Pathways Representation by and
More informationLecture Notes for Fall Network Modeling. Ernest Fraenkel
Lecture Notes for 20.320 Fall 2012 Network Modeling Ernest Fraenkel In this lecture we will explore ways in which network models can help us to understand better biological data. We will explore how networks
More informationProteomics Systems Biology
Dr. Sanjeeva Srivastava IIT Bombay Proteomics Systems Biology IIT Bombay 2 1 DNA Genomics RNA Transcriptomics Global Cellular Protein Proteomics Global Cellular Metabolite Metabolomics Global Cellular
More informationContext dependent visualization of protein function
Article III Context dependent visualization of protein function In: Juho Rousu, Samuel Kaski and Esko Ukkonen (eds.). Probabilistic Modeling and Machine Learning in Structural and Systems Biology. 2006,
More informationIntroduction Biology before Systems Biology: Reductionism Reduce the study from the whole organism to inner most details like protein or the DNA.
Systems Biology-Models and Approaches Introduction Biology before Systems Biology: Reductionism Reduce the study from the whole organism to inner most details like protein or the DNA. Taxonomy Study external
More information86 Part 4 SUMMARY INTRODUCTION
86 Part 4 Chapter # AN INTEGRATION OF THE DESCRIPTIONS OF GENE NETWORKS AND THEIR MODELS PRESENTED IN SIGMOID (CELLERATOR) AND GENENET Podkolodny N.L. *1, 2, Podkolodnaya N.N. 1, Miginsky D.S. 1, Poplavsky
More informationNetworks & pathways. Hedi Peterson MTAT Bioinformatics
Networks & pathways Hedi Peterson (peterson@quretec.com) MTAT.03.239 Bioinformatics 03.11.2010 Networks are graphs Nodes Edges Edges Directed, undirected, weighted Nodes Genes Proteins Metabolites Enzymes
More informationCSCE555 Bioinformatics. Protein Function Annotation
CSCE555 Bioinformatics Protein Function Annotation Why we need to do function annotation? Fig from: Network-based prediction of protein function. Molecular Systems Biology 3:88. 2007 What s function? The
More informationBiological Concepts and Information Technology (Systems Biology)
Biological Concepts and Information Technology (Systems Biology) Janaina de Andréa Dernowsek Postdoctoral at Center for Information Technology Renato Archer Janaina.dernowsek@cti.gov.br Division of 3D
More informationIntroduction to Systems Biology. James Gomes KSBS, IITD
Introduction to Systems Biology James Gomes KSBS, IITD Inner Life of a Cell Cells Exhibit a High Degree of Complexity What are the different types of cells? Is Life an emergent property? What is Systems
More informationAn introduction to SYSTEMS BIOLOGY
An introduction to SYSTEMS BIOLOGY Paolo Tieri CNR Consiglio Nazionale delle Ricerche, Rome, Italy 10 February 2015 Universidade Federal de Minas Gerais, Belo Horizonte, Brasil Course outline Day 1: intro
More informationThe Role of Network Science in Biology and Medicine. Tiffany J. Callahan Computational Bioscience Program Hunter/Kahn Labs
The Role of Network Science in Biology and Medicine Tiffany J. Callahan Computational Bioscience Program Hunter/Kahn Labs Network Analysis Working Group 09.28.2017 Network-Enabled Wisdom (NEW) empirically
More informationDifferential Modeling for Cancer Microarray Data
Differential Modeling for Cancer Microarray Data Omar Odibat Department of Computer Science Feb, 01, 2011 1 Outline Introduction Cancer Microarray data Problem Definition Differential analysis Existing
More informationReductionist Paradox Are the laws of chemistry and physics sufficient for the discovery of new drugs?
Quantitative Biology Colloquium, University of Arizona, March 1, 2011 Reductionist Paradox Are the laws of chemistry and physics sufficient for the discovery of new drugs? Gerry Maggiora, Ph.D. Adjunct
More informationANAXOMICS METHODOLOGIES - UNDERSTANDING
ANAXOMICS METHODOLOGIES - UNDERSTANDING THE COMPLEXITY OF BIOLOGICAL PROCESSES Raquel Valls, Albert Pujol ǂ, Judith Farrés, Laura Artigas and José Manuel Mas Anaxomics Biotech, c/balmes 89, 08008 Barcelona,
More informationCourse plan Academic Year Qualification MSc on Bioinformatics for Health Sciences. Subject name: Computational Systems Biology Code: 30180
Course plan 201-201 Academic Year Qualification MSc on Bioinformatics for Health Sciences 1. Description of the subject Subject name: Code: 30180 Total credits: 5 Workload: 125 hours Year: 1st Term: 3
More informationPredicting Protein Functions and Domain Interactions from Protein Interactions
Predicting Protein Functions and Domain Interactions from Protein Interactions Fengzhu Sun, PhD Center for Computational and Experimental Genomics University of Southern California Outline High-throughput
More informationClustering and Network
Clustering and Network Jing-Dong Jackie Han jdhan@picb.ac.cn http://www.picb.ac.cn/~jdhan Copy Right: Jing-Dong Jackie Han What is clustering? A way of grouping together data samples that are similar in
More informationBMD645. Integration of Omics
BMD645 Integration of Omics Shu-Jen Chen, Chang Gung University Dec. 11, 2009 1 Traditional Biology vs. Systems Biology Traditional biology : Single genes or proteins Systems biology: Simultaneously study
More informationBEFORE TAKING THIS MODULE YOU MUST ( TAKE BIO-4013Y OR TAKE BIO-
2018/9 - BIO-4001A BIODIVERSITY Autumn Semester, Level 4 module (Maximum 150 Students) Organiser: Dr Harriet Jones Timetable Slot:DD This module explores life on Earth. You will be introduced to the major
More informationGene Network Science Diagrammatic Cell Language and Visual Cell
Gene Network Science Diagrammatic Cell Language and Visual Cell Mr. Tan Chee Meng Scientific Programmer, System Biology Group, Bioinformatics Institute Overview Introduction Why? Challenges Diagrammatic
More informationFull versus incomplete cross-validation: measuring the impact of imperfect separation between training and test sets in prediction error estimation
cross-validation: measuring the impact of imperfect separation between training and test sets in prediction error estimation IIM Joint work with Christoph Bernau, Caroline Truntzer, Thomas Stadler and
More informationBiological Networks: Comparison, Conservation, and Evolution via Relative Description Length By: Tamir Tuller & Benny Chor
Biological Networks:,, and via Relative Description Length By: Tamir Tuller & Benny Chor Presented by: Noga Grebla Content of the presentation Presenting the goals of the research Reviewing basic terms
More informationGLOBEX Bioinformatics (Summer 2015) Genetic networks and gene expression data
GLOBEX Bioinformatics (Summer 2015) Genetic networks and gene expression data 1 Gene Networks Definition: A gene network is a set of molecular components, such as genes and proteins, and interactions between
More informationInferring Transcriptional Regulatory Networks from High-throughput Data
Inferring Transcriptional Regulatory Networks from High-throughput Data Lectures 9 Oct 26, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20
More informationGene Ontology. Shifra Ben-Dor. Weizmann Institute of Science
Gene Ontology Shifra Ben-Dor Weizmann Institute of Science Outline of Session What is GO (Gene Ontology)? What tools do we use to work with it? Combination of GO with other analyses What is Ontology? 1700s
More informationKEGGgraph: Application Examples
KEGGgraph: Application Examples Jitao David Zhang April 30, 2018 Abstract In this vignette, we demonstrate the application of KEGGgraph as flexible module in analysis pipelines targeting heterogenous biological
More informationSample Size Estimation for Studies of High-Dimensional Data
Sample Size Estimation for Studies of High-Dimensional Data James J. Chen, Ph.D. National Center for Toxicological Research Food and Drug Administration June 3, 2009 China Medical University Taichung,
More informationUSING BLAST TO IDENTIFY PROTEINS THAT ARE EVOLUTIONARILY RELATED ACROSS SPECIES
USING BLAST TO IDENTIFY PROTEINS THAT ARE EVOLUTIONARILY RELATED ACROSS SPECIES HOW CAN BIOINFORMATICS BE USED AS A TOOL TO DETERMINE EVOLUTIONARY RELATIONSHPS AND TO BETTER UNDERSTAND PROTEIN HERITAGE?
More informationGene Ontology and overrepresentation analysis
Gene Ontology and overrepresentation analysis Kjell Petersen J Express Microarray analysis course Oslo December 2009 Presentation adapted from Endre Anderssen and Vidar Beisvåg NMC Trondheim Overview How
More informationComparative Network Analysis
Comparative Network Analysis BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2016 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under CC BY-NC 4.0 by
More informationMeasuring TF-DNA interactions
Measuring TF-DNA interactions How is Biological Complexity Achieved? Mediated by Transcription Factors (TFs) 2 Regulation of Gene Expression by Transcription Factors TF trans-acting factors TF TF TF TF
More informationPathway Association Analysis Trey Ideker UCSD
Pathway Association Analysis Trey Ideker UCSD A working network map of the cell Network evolutionary comparison / cross-species alignment to identify conserved modules The Working Map Network-based classification
More informationRegulation and signaling. Overview. Control of gene expression. Cells need to regulate the amounts of different proteins they express, depending on
Regulation and signaling Overview Cells need to regulate the amounts of different proteins they express, depending on cell development (skin vs liver cell) cell stage environmental conditions (food, temperature,
More informationNew Computational Methods for Systems Biology
New Computational Methods for Systems Biology François Fages, Sylvain Soliman The French National Institute for Research in Computer Science and Control INRIA Paris-Rocquencourt Constraint Programming
More informationSupplementary Tables and Figures
Supplementary Tables Supplementary Tables and Figures Supplementary Table 1: Tumor types and samples analyzed. Supplementary Table 2: Genes analyzed here. Supplementary Table 3: Statistically significant
More informationMSc Drug Design. Module Structure: (15 credits each) Lectures and Tutorials Assessment: 50% coursework, 50% unseen examination.
Module Structure: (15 credits each) Lectures and Assessment: 50% coursework, 50% unseen examination. Module Title Module 1: Bioinformatics and structural biology as applied to drug design MEDC0075 In the
More informationComputational approaches for functional genomics
Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding
More informationGOSAP: Gene Ontology Based Semantic Alignment of Biological Pathways
GOSAP: Gene Ontology Based Semantic Alignment of Biological Pathways Jonas Gamalielsson and Björn Olsson Systems Biology Group, Skövde University, Box 407, Skövde, 54128, Sweden, [jonas.gamalielsson][bjorn.olsson]@his.se,
More informationUSING THE COMPARATIVE TOXICOGENOMICS DATABASE TO UNDERSTAND THE IMPACT OF HIGH-RISK TOXICANTS ON BIOLOGICAL PATHWAYS. Samantha E.
USING THE COMPARATIVE TOXICOGENOMICS DATABASE TO UNDERSTAND THE IMPACT OF HIGH-RISK TOXICANTS ON BIOLOGICAL PATHWAYS Samantha E. Tulenko A technical report submitted to the faculty of the University of
More informationIntroduction to clustering methods for gene expression data analysis
Introduction to clustering methods for gene expression data analysis Giorgio Valentini e-mail: valentini@dsi.unimi.it Outline Levels of analysis of DNA microarray data Clustering methods for functional
More informationBIOINFORMATICS LAB AP BIOLOGY
BIOINFORMATICS LAB AP BIOLOGY Bioinformatics is the science of collecting and analyzing complex biological data. Bioinformatics combines computer science, statistics and biology to allow scientists to
More informationComputational Genomics. Systems biology. Putting it together: Data integration using graphical models
02-710 Computational Genomics Systems biology Putting it together: Data integration using graphical models High throughput data So far in this class we discussed several different types of high throughput
More informationCorrelation Networks
QuickTime decompressor and a are needed to see this picture. Correlation Networks Analysis of Biological Networks April 24, 2010 Correlation Networks - Analysis of Biological Networks 1 Review We have
More information2 GENE FUNCTIONAL SIMILARITY. 2.1 Semantic values of GO terms
Bioinformatics Advance Access published March 7, 2007 The Author (2007). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org
More informationSTATE UNIVERSITY OF NEW YORK COLLEGE OF TECHNOLOGY CANTON, NEW YORK. COURSE OUTLINE BIOL 310 The Human Genome. Prepared By: Ron Tavernier
STATE UNIVERSITY OF NEW YORK COLLEGE OF TECHNOLOGY CANTON, NEW YORK COURSE OUTLINE BIOL 310 The Human Genome Prepared By: Ron Tavernier SCHOOL OF SCIENCE, HEALTH & CRIMINAL JUSTICE SCIENCE DEPARTMENT May
More informationClustering of Pathogenic Genes in Human Co-regulatory Network. Michael Colavita Mentor: Soheil Feizi Fifth Annual MIT PRIMES Conference May 17, 2015
Clustering of Pathogenic Genes in Human Co-regulatory Network Michael Colavita Mentor: Soheil Feizi Fifth Annual MIT PRIMES Conference May 17, 2015 Topics Background Genetic Background Regulatory Networks
More informationMTopGO: a tool for module identification in PPI Networks
MTopGO: a tool for module identification in PPI Networks Danila Vella 1,2, Simone Marini 3,4, Francesca Vitali 5,6,7, Riccardo Bellazzi 1,4 1 Clinical Scientific Institute Maugeri, Pavia, Italy, 2 Department
More informationSimplifying Drug Discovery with JMP
Simplifying Drug Discovery with JMP John A. Wass, Ph.D. Quantum Cat Consultants, Lake Forest, IL Cele Abad-Zapatero, Ph.D. Adjunct Professor, Center for Pharmaceutical Biotechnology, University of Illinois
More informationUnravelling the biochemical reaction kinetics from time-series data
Unravelling the biochemical reaction kinetics from time-series data Santiago Schnell Indiana University School of Informatics and Biocomplexity Institute Email: schnell@indiana.edu WWW: http://www.informatics.indiana.edu/schnell
More informationFrom Petri Nets to Differential Equations An Integrative Approach for Biochemical Network Analysis
From Petri Nets to Differential Equations An Integrative Approach for Biochemical Network Analysis David Gilbert drg@brc.dcs.gla.ac.uk Bioinformatics Research Centre, University of Glasgow and Monika Heiner
More informationA Database of human biological pathways
A Database of human biological pathways Steve Jupe - sjupe@ebi.ac.uk 1 Rationale Journal information Nature 407(6805):770-6.The Biochemistry of Apoptosis. Caspase-8 is the key initiator caspase in the
More informationInferring Protein-Signaling Networks
Inferring Protein-Signaling Networks Lectures 14 Nov 14, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday 12:00-1:20 Johnson Hall (JHN) 022 1
More informationWhat is Systems Biology
What is Systems Biology 2 CBS, Department of Systems Biology 3 CBS, Department of Systems Biology Data integration In the Big Data era Combine different types of data, describing different things or the
More informationOptimal State Estimation for Boolean Dynamical Systems using a Boolean Kalman Smoother
Optimal State Estimation for Boolean Dynamical Systems using a Boolean Kalman Smoother Mahdi Imani and Ulisses Braga-Neto Department of Electrical and Computer Engineering Texas A&M University College
More informationCurriculum Map. Biology, Quarter 1 Big Ideas: From Molecules to Organisms: Structures and Processes (BIO1.LS1)
1 Biology, Quarter 1 Big Ideas: From Molecules to Organisms: Structures and Processes (BIO1.LS1) Focus Standards BIO1.LS1.2 Evaluate comparative models of various cell types with a focus on organic molecules
More informationBiological networks CS449 BIOINFORMATICS
CS449 BIOINFORMATICS Biological networks Programming today is a race between software engineers striving to build bigger and better idiot-proof programs, and the Universe trying to produce bigger and better
More informationMolecular Biology Of The Cell 6th Edition Alberts
We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer, you have convenient answers with molecular biology of
More informationBiological Networks. Gavin Conant 163B ASRC
Biological Networks Gavin Conant 163B ASRC conantg@missouri.edu 882-2931 Types of Network Regulatory Protein-interaction Metabolic Signaling Co-expressing General principle Relationship between genes Gene/protein/enzyme
More informationIntroduction to Bioinformatics. Shifra Ben-Dor Irit Orr
Introduction to Bioinformatics Shifra Ben-Dor Irit Orr Lecture Outline: Technical Course Items Introduction to Bioinformatics Introduction to Databases This week and next week What is bioinformatics? A
More informationModule Based Neural Networks for Modeling Gene Regulatory Networks
Module Based Neural Networks for Modeling Gene Regulatory Networks Paresh Chandra Barman, Std 1 ID: 20044523 Term Project: BiS732 Bio-Network Department of BioSystems, Korea Advanced Institute of Science
More informationSynthesis of Biological Models from Mutation Experiments
Synthesis of Biological Models from Mutation Experiments Ali Sinan Köksal, Saurabh Srivastava, Rastislav Bodík, UC Berkeley Evan Pu, MIT Jasmin Fisher, Microsoft Research Cambridge Nir Piterman, University
More informationUpdated: 10/11/2018 Page 1 of 5
A. Academic Division: Health Sciences B. Discipline: Biology C. Course Number and Title: BIOL1230 Biology I MASTER SYLLABUS 2018-2019 D. Course Coordinator: Justin Tickhill Assistant Dean: Melinda Roepke,
More informationDESIGN OF EXPERIMENTS AND BIOCHEMICAL NETWORK INFERENCE
DESIGN OF EXPERIMENTS AND BIOCHEMICAL NETWORK INFERENCE REINHARD LAUBENBACHER AND BRANDILYN STIGLER Abstract. Design of experiments is a branch of statistics that aims to identify efficient procedures
More informationModeling Biological Networks
Modeling Biological Networks Dr. Carlo Cosentino School of Computer and Biomedical Engineering Department of Experimental and Clinical Medicine Università degli Studi Magna Graecia Catanzaro, Italy carlo.cosentino@unicz.it
More informationModel Accuracy Measures
Model Accuracy Measures Master in Bioinformatics UPF 2017-2018 Eduardo Eyras Computational Genomics Pompeu Fabra University - ICREA Barcelona, Spain Variables What we can measure (attributes) Hypotheses
More informationMolecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems.
Molecular descriptors and chemometrics: a powerful combined tool for pharmaceutical, toxicological and environmental problems. Roberto Todeschini Milano Chemometrics and QSAR Research Group - Dept. of
More informationNetwork Motifs of Pathogenic Genes in Human Regulatory Network
Network Motifs of Pathogenic Genes in Human Regulatory Network Michael Colavita Mentor: Soheil Feizi Fourth Annual MIT PRIMES Conference May 18, 2014 Topics Background Genetics Regulatory Networks The
More informationUnderstanding Science Through the Lens of Computation. Richard M. Karp Nov. 3, 2007
Understanding Science Through the Lens of Computation Richard M. Karp Nov. 3, 2007 The Computational Lens Exposes the computational nature of natural processes and provides a language for their description.
More informationATLAS of Biochemistry
ATLAS of Biochemistry USER GUIDE http://lcsb-databases.epfl.ch/atlas/ CONTENT 1 2 3 GET STARTED Create your user account NAVIGATE Curated KEGG reactions ATLAS reactions Pathways Maps USE IT! Fill a gap
More informationHow much non-coding DNA do eukaryotes require?
How much non-coding DNA do eukaryotes require? Andrei Zinovyev UMR U900 Computational Systems Biology of Cancer Institute Curie/INSERM/Ecole de Mine Paritech Dr. Sebastian Ahnert Dr. Thomas Fink Bioinformatics
More informationBiological Mass Spectrometry
Biochemistry 412 Biological Mass Spectrometry February 13 th, 2007 Proteomics The study of the complete complement of proteins found in an organism Degrees of Freedom for Protein Variability Covalent Modifications
More informationMetabolomics in an Identity Crisis? Am I a Feature or a Compound? The world leader in serving science
Metabolomics in an Identity Crisis? Am I a Feature or a Compound? The world leader in serving science Agenda 1 2 3 Unknown Analysis in Metabolomics Determining Features and Compounds for Data Reduction
More informationIntegration of functional genomics data
Integration of functional genomics data Laboratoire Bordelais de Recherche en Informatique (UMR) Centre de Bioinformatique de Bordeaux (Plateforme) Rennes Oct. 2006 1 Observations and motivations Genomics
More information6.047 / Computational Biology: Genomes, Networks, Evolution Fall 2008
MIT OpenCourseWare http://ocw.mit.edu 6.047 / 6.878 Computational Biology: Genomes, Networks, Evolution Fall 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
More informationMOLECULAR BIOLOGY BIOL 021 SEMESTER 2 (2015) COURSE OUTLINE
COURSE OUTLINE 1 COURSE GENERAL INFORMATION 1 Course Title & Course Code Molecular Biology: 2 Credit (Contact hour) 3 (2+1+0) 3 Title(s) of program(s) within which the subject is taught. Preparatory Program
More informationGene Control Mechanisms at Transcription and Translation Levels
Gene Control Mechanisms at Transcription and Translation Levels Dr. M. Vijayalakshmi School of Chemical and Biotechnology SASTRA University Joint Initiative of IITs and IISc Funded by MHRD Page 1 of 9
More informationBioControl - Week 6, Lecture 1
BioControl - Week 6, Lecture 1 Goals of this lecture Large metabolic networks organization Design principles for small genetic modules - Rules based on gene demand - Rules based on error minimization Suggested
More information10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison
10-810: Advanced Algorithms and Models for Computational Biology microrna and Whole Genome Comparison Central Dogma: 90s Transcription factors DNA transcription mrna translation Proteins Central Dogma:
More informationIdentifying Signaling Pathways
These slides, excluding third-party material, are licensed under CC BY-NC 4.0 by Anthony Gitter, Mark Craven, Colin Dewey Identifying Signaling Pathways BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2018
More informationInferring Transcriptional Regulatory Networks from Gene Expression Data II
Inferring Transcriptional Regulatory Networks from Gene Expression Data II Lectures 9 Oct 26, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday
More informationCoupled Random Boolean Network Forming an Artificial Tissue
Coupled Random Boolean Network Forming an Artificial Tissue M. Villani, R. Serra, P.Ingrami, and S.A. Kauffman 2 DSSC, University of Modena and Reggio Emilia, via Allegri 9, I-4200 Reggio Emilia villani.marco@unimore.it,
More informationThe geneticist s questions
The geneticist s questions a) What is consequence of reduced gene function? 1) gene knockout (deletion, RNAi) b) What is the consequence of increased gene function? 2) gene overexpression c) What does
More informationProblem Set # 3
20.320 Problem Set # 3 October 1 st, 2010 Due on October 8 th, 2010 at 11:59am. No extensions, no electronic submissions. General Instructions: 1. You are expected to state all your assumptions and provide
More informationGene Expression Data Classification with Revised Kernel Partial Least Squares Algorithm
Gene Expression Data Classification with Revised Kernel Partial Least Squares Algorithm Zhenqiu Liu, Dechang Chen 2 Department of Computer Science Wayne State University, Market Street, Frederick, MD 273,
More informationComputational Network Biology Biostatistics & Medical Informatics 826 Fall 2018
Computational Network Biology Biostatistics & Medical Informatics 826 Fall 2018 Sushmita Roy sroy@biostat.wisc.edu https://compnetbiocourse.discovery.wisc.edu Sep 6 th 2018 Goals for today Administrivia
More informationPhysical network models and multi-source data integration
Physical network models and multi-source data integration Chen-Hsiang Yeang MIT AI Lab Cambridge, MA 02139 chyeang@ai.mit.edu Tommi Jaakkola MIT AI Lab Cambridge, MA 02139 tommi@ai.mit.edu September 30,
More informationNetwork Biology: Understanding the cell s functional organization. Albert-László Barabási Zoltán N. Oltvai
Network Biology: Understanding the cell s functional organization Albert-László Barabási Zoltán N. Oltvai Outline: Evolutionary origin of scale-free networks Motifs, modules and hierarchical networks Network
More informationDiscovering molecular pathways from protein interaction and ge
Discovering molecular pathways from protein interaction and gene expression data 9-4-2008 Aim To have a mechanism for inferring pathways from gene expression and protein interaction data. Motivation Why
More informationbiologically-inspired computing lecture 5 Informatics luis rocha 2015 biologically Inspired computing INDIANA UNIVERSITY
lecture 5 -inspired Sections I485/H400 course outlook Assignments: 35% Students will complete 4/5 assignments based on algorithms presented in class Lab meets in I1 (West) 109 on Lab Wednesdays Lab 0 :
More informationIntroduction to clustering methods for gene expression data analysis
Introduction to clustering methods for gene expression data analysis Giorgio Valentini e-mail: valentini@dsi.unimi.it Outline Levels of analysis of DNA microarray data Clustering methods for functional
More informationMEDLINE Clinical Laboratory Sciences Journals
Source Type Publication Name ISSN Peer-Reviewed Academic Journal Acta Biochimica et Biophysica Sinica 1672-9145 Y Academic Journal Acta Physiologica 1748-1708 Y Academic Journal Aging Cell 1474-9718 Y
More information