Networks & pathways. Hedi Peterson MTAT Bioinformatics

Similar documents
PNmerger: a Cytoscape plugin to merge biological pathways and protein interaction networks

Computational Challenges in Systems Biology

86 Part 4 SUMMARY INTRODUCTION

ATLAS of Biochemistry

Cytoscape An open-source software platform for the exploration of molecular interaction networks

Integration of functional genomics data

BMD645. Integration of Omics

Context dependent visualization of protein function

networks in molecular biology Wolfgang Huber

A taxonomy of visualization tasks for the analysis of biological pathway data

CSCE555 Bioinformatics. Protein Function Annotation

Course plan Academic Year Qualification MSc on Bioinformatics for Health Sciences. Subject name: Computational Systems Biology Code: 30180

Chemical Data Retrieval and Management

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it?

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics

Chapter 15 Active Reading Guide Regulation of Gene Expression

Cross Discipline Analysis made possible with Data Pipelining. J.R. Tozer SciTegic

Introduction to Bioinformatics

International Journal of Scientific & Engineering Research, Volume 6, Issue 2, February ISSN

Bioinformatics. Dept. of Computational Biology & Bioinformatics

Written Exam 15 December Course name: Introduction to Systems Biology Course no

FUNDAMENTALS of SYSTEMS BIOLOGY From Synthetic Circuits to Whole-cell Models

Predicting Protein Functions and Domain Interactions from Protein Interactions

Systems biology and biological networks

Proteomics & Bioinformatics Part II. David Wishart 3-41 Athabasca Hall

Types of biological networks. I. Intra-cellurar networks

Biological Networks. Gavin Conant 163B ASRC

New Computational Methods for Systems Biology

Structure and Centrality of the Largest Fully Connected Cluster in Protein-Protein Interaction Networks

Prokaryotic Gene Expression (Learning Objectives)

L3.1: Circuits: Introduction to Transcription Networks. Cellular Design Principles Prof. Jenna Rickus

CS612 - Algorithms in Bioinformatics

Network motifs in the transcriptional regulation network (of Escherichia coli):

Self Similar (Scale Free, Power Law) Networks (I)

56:198:582 Biological Networks Lecture 9

Comparative Network Analysis

A Protein Ontology from Large-scale Textmining?

Computational Systems Biology

10-810: Advanced Algorithms and Models for Computational Biology. microrna and Whole Genome Comparison

Evidence for dynamically organized modularity in the yeast protein-protein interaction network

Proteomics Systems Biology

SABIO-RK Integration and Curation of Reaction Kinetics Data Ulrike Wittig

Graph Alignment and Biological Networks

Genome-wide multilevel spatial interactome model of rice

Bioinformatics and Computerscience

FCModeler: Dynamic Graph Display and Fuzzy Modeling of Regulatory and Metabolic Maps

Dynamic modeling and analysis of cancer cellular network motifs

Differential Modeling for Cancer Microarray Data

Gene Ontology and overrepresentation analysis

Biological Pathways Representation by Petri Nets and extension

SUPPLEMENTAL DATA - 1. This file contains: Supplemental methods. Supplemental results. Supplemental tables S1 and S2. Supplemental figures S1 to S4

DATA ACQUISITION FROM BIO-DATABASES AND BLAST. Natapol Pornputtapong 18 January 2018

Genome Annotation. Bioinformatics and Computational Biology. Genome sequencing Assembly. Gene prediction. Protein targeting.

Gene Network Science Diagrammatic Cell Language and Visual Cell

Browsing Genomic Information with Ensembl Plants

How much non-coding DNA do eukaryotes require?

A Database of human biological pathways

BIOINFORMATICS LAB AP BIOLOGY

Research Article HomoKinase: A Curated Database of Human Protein Kinases

Regulation of Gene Expression

Biological Concepts and Information Technology (Systems Biology)

Lecture 7: Simple genetic circuits I

Analysis and Visualization of Biological Networks with Cytoscape

STRUCTURAL BIOINFORMATICS I. Fall 2015

Introduction to Bioinformatics

Bioinformatics: Network Analysis

Nature Structural and Molecular Biology: doi: /nsmb Supplementary Figure 1

Preface. Contributors

Grundlagen der Systembiologie und der Modellierung epigenetischer Prozesse

part III Systems Biology, 25 October 2013

Lecture 6: The feed-forward loop (FFL) network motif

Biological Networks: Comparison, Conservation, and Evolution via Relative Description Length By: Tamir Tuller & Benny Chor

GRAPH-THEORETICAL COMPARISON REVEALS STRUCTURAL DIVERGENCE OF HUMAN PROTEIN INTERACTION NETWORKS

Identifying Signaling Pathways

Boolean models of gene regulatory networks. Matthew Macauley Math 4500: Mathematical Modeling Clemson University Spring 2016

Homology and Information Gathering and Domain Annotation for Proteins

Supplementary online material

COMPARATIVE PATHWAY ANNOTATION WITH PROTEIN-DNA INTERACTION AND OPERON INFORMATION VIA GRAPH TREE DECOMPOSITION

Control of Gene Expression in Prokaryotes

Prokaryotic Gene Expression (Learning Objectives)

3.B.1 Gene Regulation. Gene regulation results in differential gene expression, leading to cell specialization.

Regulation and signaling. Overview. Control of gene expression. Cells need to regulate the amounts of different proteins they express, depending on

CISC 636 Computational Biology & Bioinformatics (Fall 2016)

SYSTEMS BIOLOGY 1: NETWORKS

AP Bio Module 16: Bacterial Genetics and Operons, Student Learning Guide

Computational Genomics. Systems biology. Putting it together: Data integration using graphical models

Lecture 2. The Blast2GO annotation framework

A Max-Flow Based Approach to the. Identification of Protein Complexes Using Protein Interaction and Microarray Data

The BRENDA Enzyme Information System. Module B4. Ligand Search Substructure Search

Networks in systems biology

RULE-BASED REASONING FOR SYSTEM DYNAMICS IN CELL SYSTEMS

Pathway Association Analysis Trey Ideker UCSD

Functional Characterization and Topological Modularity of Molecular Interaction Networks

APPLICATION OF GRAPH BASED DATA MINING TO BIOLOGICAL NETWORKS CHANG HUN YOU. Presented to the Faculty of the Graduate School of

The Drosophila Interactions Database: Integrating The Interactome And Transcriptome

We have: We will: Assembled six genomes Made predictions of most likely gene locations. Add a layers of biological meaning to the sequences

Metabolic modelling. Metabolic networks, reconstruction and analysis. Esa Pitkänen Computational Methods for Systems Biology 1 December 2009

Name Period The Control of Gene Expression in Prokaryotes Notes

Agilent MassHunter Profinder: Solving the Challenge of Isotopologue Extraction for Qualitative Flux Analysis

Detecting temporal protein complexes from dynamic protein-protein interaction networks

Transcription:

Networks & pathways Hedi Peterson (peterson@quretec.com) MTAT.03.239 Bioinformatics 03.11.2010

Networks are graphs Nodes Edges

Edges Directed, undirected, weighted

Nodes Genes Proteins Metabolites Enzymes Organisms

Why do we build networks? Motivation: most biological processes are not performed by a single, independent macromolecule

Slide from W. Huber

Slide from W. Huber

Pathways Do not exist in the cell! A human abstraction to help organizeour understanding of biology. A description of chronological ordering of proteins/dna/small molecule interactions. Unlike proteins and genes, pathwaysrepresent processes that may not be clearly defined. Slide from Doron Betel, MSKCC

From experiments to networks Source: http://www.nature.com/ng/journal/v37/n6s/full/ng1561.html Slide from https://extras.csc.fi/biosciences/courses/cytoscape

Example networks signal transduction pathway Source: http://www.sigmaaldrich.com/img/assets/6460/egf_r.gif Slide from https://extras.csc.fi/biosciences/courses/cytoscape

Transcription Source: http://www.medscape.com/pi/editorial/conferences/2001/873/art-amc.fig2.jpg Slide from https://extras.csc.fi/biosciences/courses/cytoscape

Complexes Source: Science. 2005 Feb 4;307(5710) Slide from https://extras.csc.fi/biosciences/courses/cytoscape

Public data repositories Protein-protein interaction data BIND, DIP, MINT, MIPS, InACT, Protein-DNA interaction data BIND, Transfac, Metabolic pathway data BioCyc, KEGG, WIT, Text-mining, coexpression Pre-BIND, Tmm, Slide from https://extras.csc.fi/biosciences/courses/cytoscape

Slide from Doron Betel, MSKCC

Slide from Doron Betel, MSKCC

Slide from Doron Betel, MSKCC

Slide from Doron Betel, MSKCC

Slide from Doron Betel, MSKCC

Slide from Doron Betel, MSKCC

Slide from Doron Betel, MSKCC

Slide from Doron Betel, MSKCC

Slide from W. Huber

Slide from W. Huber

Slide from W. Huber

Slide from W. Huber

Slide from W. Huber

Slide from W. Huber

autoregulation approximately 10% of yeast genes encoding regulators are autoregulated autoregulation is thought to provide several selective growth advantages -- response to environmental stimuli -- decreased biosynthetic cost of regulation --increased stability of gene expression

Multi-component loop - consists of a regulatory circuit whose closure involves two or more factors - provides the capacity for feedback control and offers the potential to produce bistable systems that can switch between two alternative states

feedforward loop contains a regulator that controls a second regulator and have the additional feature that both regulators bind a common target gene -- the most popular among yeast gene regulatory motifs(about 10% of genes that are bound in the genomewide location data set) very sensitive

single input motif single regulator that binds a set if genes under a specific condition mostly useful in metabolic pathways

Multiinput motif set of regulators that bind together to a set of genes two different inputs would allow coordinate expression of a set of genes under these two different condition

regulatory chain consists of three or more regulators in witch one regulator binds the promoter for a second regulator, the second binds the promoter for a third regulator, and so forth best example is cell cycle Slide from W. Huber

http://biit.cs.ut.ee/graphweb

Motivation Graph-based methods for mining functional and regulatory modules from heterogeneous data Biological knowledge represented as network 1+1 > 2 combining different datasources brings out most important connections

Main idea of GraphWeb Input is a graph (gene pairs, connected by an edge) Gene/probe/protein names are converted to common ground - Ensembl Extract modules by applying graph algorithms Each module is described Biologically significant annotations are used to score modules

What to search for? Hubs in regulatory network these are usually transcription factors Cliques highly connected genes, most probably protein complex members Connected components genes participating in similar functions MCL find genes that might share a function Network neighbourhood what genes are having connections to my favourite ones?

Reactome What is Reactome? An open, on-line, human-curated knowledgebase of biological pathways and reactions in humans Pathways and reactions in other species are predicated based on protein orthologies Data in the Reactome is interconnected with other databases: pubmed, Gene Ontology, NCBI, Ensembl, UniProt, OMIM, etc. Authored and created by experts Slide by L.Stein

Reactome http://www.reactome.org Menu bar Search Box Reaction Map Topic Table Slide by L.Stein

Kyoto Encyclopedia of Genes and Genomes Integrates: current knowledge of molecular interaction networks information about genes and proteins information about chemical compounds and reactions

Core Features Customize network data display using visual styles Powerful graph layout tools Easily organize multiple networks Easily navigate large networks Filter the network Plugin API Input/Output Protein protein interactions from BIND, TRANSFAC databases Gene functional annotations from Gene Ontology (GO) and KEGG databases Biological models from Systems Biology Markup Language (SBML) cpath: Cancer Pathway database Proteomics Standards Initiative Molecular Interaction (PSI-MI) or Biopathway Exchange Language (BioPAX) formats Oracle Spatial Network data model Cytoscape.org Cytoscape is a freely-available (open-source, java-based) bioinformatics software platform for visualizing biological networks (e.g. molecular interaction networks) and analyzing networks with gene expression profiles and other state data. Additional features are available as plugins. jactivemodules: identify significant active subnetworks Expression Correlation Network: cluster expression data Agilent Literature Search: build networks by extracting interactions from scientific literature. MCODE: finds clusters of highly interconnected regions in networks cpath: query, retrieve and visualize interactions from the MSKCC Cancer Pathway database BiNGO: determine which Gene Ontology (GO) categories are statistically over-represented in a set of genes Motif Finder: runs a Gibbs sampling motif detector on sequences for nodes in a Cytoscape network. CytoTalk: Interact with Cytoscape from Perl, Python, R, shell scripts or C or C++ programs.

www.hedipeterson.com/cytoscape.pdf Graphweb Exercises Use Rual network (the same fro Cytoscape) together with public protein-protein interaction (PPI) data (From a file in our server) and find gene pairs being present in both networks by applying Remove edges with less than N labels Filter PPI networks with Oct4, Sox2, Nanog, TP53 genes of interest by using Network neighbourhood, keep the distance 1. Look for modules (hubs, cliques, strongly connected components tc) in the data using different network algorithms and filter out different module sizes. Check the functional annotations of resulting modules.