identifiers matched to homologous genes. Probeset annotation files for each array platform were used to
|
|
- Daniella Dawson
- 5 years ago
- Views:
Transcription
1 SUPPLEMENTARY METHODS Data combination and normalization Prior to data analysis we first had to appropriately combine all 1617 arrays such that probeset identifiers matched to homologous genes. Probeset annotation files for each array platform were used to identify the gene symbol associated with each probeset and this gene symbol was mapped to a unique identifier for homologous gene groups provided by the homologene database ( Data were then combined from all experiments by matching homologene identifiers. To allow for comparison of absolute gene expression data between phenotypes across different experimental data sets, normalization of raw expression data was performed prior to all analyses. Median absolute deviation normalization of log transformed expression values was chosen for its performance advantages in sparse datasets 1. Via this method, intensity-dependent differences in expression level were minimized for between-sample comparisons, as evidenced by M-A plot (Supplementary Figure 1). Of note, the Pearson correlation is invariant under data transformation of location and scale and thus this normalization step does not affect gene expression network analysis. Species specificity We next analyzed species specificity of gene coexpression to assess for primacy of species over phenotype. Experiments were clustered according to shared expression profiles using agglomerative hierarchical clustering. As shown in Supplementary Figure 2, array experiments were tightly coordinated by species, indicating a significant species dependency for gene coexpression. As such, datasets from species with the most complete data set with regard to phenotype, mouse, were chosen for further analysis. Forming networks To perform network modeling we used weighted gene coexpression analysis as described 2. The vertices of a weighted gene co-expression graph represent genes and the connection strengths (adjacency) between them comprise edges that connect vertices. The connectivity k i of the i-th vertex (gene) is defined as the sum of its adjacencies with the other nodes in the network. The frequency distribution of the connectivity often (but not always) follows a power law, which is referred to as scale free topology. Many (but not all) biologic networks have been found to satisfy scale free topology 3, 4. The connection strength or adjacency was determined by the Pearson correlation of the corresponding expression profiles. Page 1 of 19
2 Specifically, the connection strength (adjacency) between expression profiles i and j is a power of the absolute value of their Pearson correlation. Forming the power of a correlation amounts to a soft-thresholding approach that emphasizes large correlations at the expense of low correlations and obviates the need for arbitrary thresholds for adjacency, thus preserving the continuous nature of the coexpression information. The extent to which a network satisfies scale free topology can be measured using a model fitting index R 2 2. Supplementary Figures 3 and 4 show how the scale free topology model fitting index R 2 (y-axis) depends on the soft threshold b (x-axis). We chose b according to the scale free topology criterion, which amounts to choosing the smallest b that leads to approximate scale free topology 2. To facilitate comparisons, we chose b=9 for all networks. A major advantage of weighted co-expression networks is that they are statistically highly robust with regard to the chosen threshold b. The final adjacency was further defined using the topological overlap 5. This network adjacency describes the commonality t ij of network neighbors for genes i and j and is given by: I ij + a ij t ij = min k i,k j where I ij = { } +1 a ij if i j 1 if i = j a iu a uj, k i = a iu, and k j = a uj for a iu and a uj given by soft-thresholded Pearson u i, j u i u j correlation as described above. For all t ij a distance d ij =1 t ij was defined, giving a symmetrical distance matrix of rows and columns. This distance matrix was used as the input for agglomerative hierarchical clustering in the training set of 43 fetal samples and subsequently in a validation set of 21 fetal samples and in other phenotypes. Separate distance matrices were constructed for each phenotype and for training and validation fetal datasets. The dynamic tree cut algorithm was used to define gene modules from the resultant hierarchical clustering. For this algorithm, minimum cluster size was set at 25, the hybrid tree cut algorithm was chosen, deep split was set to one, and no minimum or maximum absolute core scatter were defined; sensitivity analysis for the parameters chosen for this function is described in Supplementary Figure 5. The 90 resulting networks were subsequently internally validated using an iterative approach as described in the main manuscript in the validation set (Supplementary (1) Page 2 of 19
3 Figure 6). The degree to which topology of these validated modules was shared between phenotypes was then evaluated using the same iterative approach. To illustrate topology of key marker genes previously identified to be associated with cardiac development and disease, gene coepxression topology of networks containing these genes is displayed in Supplementary Figure 7. Transcription factor target identification Transcription factor target identification was performed to assess for common transcription factor modulation of gene coexpression modules. We applied our previously described, phylogenetic footprinting approach to determine putative gene targets for a large set of vertebrate transcription factors in the mouse genome 6. We first clustered all eukaryotic transcription factor motifs from TRANSFAC v10.2 into 235 representative motifs as described 7, 8. For each of the 235 motifs provided in the TRANSFAC database we scanned the mouse 1 kb promoter regions to first identify match with a p-value <= We further filter these matches to only include the binding sites that are either 80% conserved between human and mouse genomes based on the genome alignment provided in UCSC database or the match p-value <= In a cross validation with experimentally verified binding sites, these criteria yielded a false positive rate of 1 in 50 kb of genomic data searched, or a false discovery rate of 1 transcription factor for every 50 genes searched in our algorithm. The details of the method are as described 6, 8. Page 3 of 19
4 SUPPLEMENTARY FIGURE 1 Supplementary Figure 1. Normalization of gene expression data using mean absolute deviation method reduces intensity-dependent differences for experiments from different experimental conditions. Panel A depicts the relationship between mean and difference between experimental arrays for all experiments with the same experimental set (top panel, i.e., similar laboratory conditions) and for experiments not within the same experimental set (top panel, i.e. different laboratory conditions) before performing normalization. Horizontal scatter envelopes centered at 0 indicate minimal intensity-dependent differences in expression level. Panel B depicts the same relationships after performing normalization and indicates minimization of intensity-dependent differences in expression among disparate experimental conditions. Page 4 of 19
5 SUPPLEMENTARY FIGURE 2 Supplementary Figure 2. Gene expression profiles from different experimental conditions cluster by species. Hierarchical clustering of experimental conditions was performed to evaluate for speciesspecific patterns of gene coexpression. The top half of each figure depicts the hierarchical clustering of experiments within each experimental phenotype and the bottom half contains a color-keyed representation of species identifiers for each experiment. In normal adult (A), fetal (B), failing (C), and hypertrophied (D) cardiac tissue experiments clustered tightly by species, indicating strong species dependency of gene coexpression. Abbreviations: H = human, M = mouse, R = rat, D = dog. Page 5 of 19
6 SUPPLEMENTARY FIGURE 3 Supplementary Figure 3. Gene coexpression network adjacency soft threshold for A) normal adult and B) fetal cardiac tissue was chosen to approximate scale-free topology. The left-hand portion of each panel depicts the relationship between soft-thresholding coefficient (x axis, numbered in red) and the r value for the linear regression (y axis, slope constrained to -1) between the logarithm of the connectivity and the logarithm of the proportion of genes with that connectivity. The right-hand portion of each panel depicts the relationship between the soft-thresholding coefficient and mean connectivity. Page 6 of 19
7 SUPPLEMENTARY FIGURE 4 Supplementary Figure 4. Gene coexpression network adjacency soft threshold for A) failing and B) hypertrophied cardiac tissue was chosen to approximate scale-free topology. Panels are as described above in Supplementary Figure 3. Page 7 of 19
8 SUPPLEMENTARY FIGURE 5 Supplementary Figure 5. Sensitivity analysis for parameters used in the dynamic tree cut algorithm. The z-axis describes module reproducibility average over all derived modules, which is in turn given by the Pearson correlation between vectors describing intra-modular connectivity for each gene. The chosen parameters for the dynamic tree cut algorithm (deepsplit = 1, labelunlabeled = TRUE) gave the highest average reproducibility of modules in the fetal dataset. Page 8 of 19
9 SUPPLEMENTARY FIGURE 6 Page 9 of 19
10 Supplementary Figure 6.. Identification and validation of fetal gene modules. Genes were clustered first by topological overlap in the training dataset (n = 43 samples) and then evaluated for significant reproducibility in the validation dataset (n = 21 samples). A.) Cluster dendrogram and barplot describing module membership for each of the 12,620 genes. B.) Cluster dendrogram and barplot describing module membership for each of the 12,620 genes (as defined in the training set) in the validation dataset. C.) Significance (-log p value) of module reproducibility in the validation set. Colors below the bargraph correspond to the module membership as identified in the barplots in the above panels. Red line indicates cutoff for statistical significance (p < 5.5 x 10-4 or log(p value) > 3.26). Seventy-two modules met this criterion. Page 10 of 19
11 Dewey, et al SUPPLEMENTARY FIGURE 7 Supplementary Figure 7. Topology of developmental gene coexpression modules containing key marker genes. The hub genes for modules containing A) NPPA and MYH7 and B) SLC2A4 are depicted in Page 11 of 19
12 red, while marker genes are depicted in yellow and all nodes are depicted in green. Only edges with adjacency weight > 0.7 are shown for clarity. The proximity of each gene to the center of the figure indicates its connectivity, or sum of connection weights. Page 12 of 19
13 SUPPLEMENTARY TABLE 1. Term Count % P value Benjamini corrected P value GO: ~proteasomal protein E catabolic process GO: ~proteasomal ubiquitindependent E protein catabolic process GO: ~negative regulation of protein ubiquitination E GO: ~negative regulation of protein modification process E GO: ~anaphase-promoting complex-dependent proteasomal ubiquitindependent protein catabolic process GO: ~negative regulation of ubiquitin-protein ligase activity during mitotic cell cycle GO: ~negative regulation of ligase activity GO: ~negative regulation of ubiquitin-protein ligase activity GO: ~positive regulation of ubiquitin-protein ligase activity during mitotic cell cycle GO: ~regulation of protein ubiquitination GO: ~positive regulation of ubiquitin-protein ligase activity GO: ~regulation of ubiquitinprotein ligase activity during mitotic cell cycle GO: ~positive regulation of ligase activity Page 13 of 19
14 GO: ~regulation of ubiquitinprotein ligase activity GO: ~regulation of ligase activity GO: ~positive regulation of macromolecule metabolic process GO: ~ubiquitin-dependent protein catabolic process GO: ~positive regulation of protein ubiquitination GO: ~negative regulation of cellular protein metabolic process GO: ~regulation of cellular protein metabolic process GO: ~negative regulation of protein metabolic process GO: ~positive regulation of protein modification process GO: ~positive regulation of cellular protein metabolic process GO: ~translation GO: ~positive regulation of protein metabolic process GO: ~ER to Golgi vesicle-mediated transport GO: ~proteoglycan metabolic process GO: ~negative regulation of programmed cell death GO: ~negative regulation of cell death GO: ~mitotic cell cycle GO: ~Golgi vesicle transport GO: ~negative regulation of catalytic activity GO: ~glycosaminoglycan Page 14 of 19
15 biosynthetic process GO: ~ncRNA metabolic process GO: ~aminoglycan biosynthetic process GO: ~positive regulation of cellular biosynthetic process GO: ~axon regeneration in the peripheral nervous system GO: ~ER-associated protein catabolic process GO: ~modification-dependent macromolecule catabolic process GO: ~modification-dependent protein catabolic process GO: ~positive regulation of biosynthetic process GO: ~regulation of protein modification process GO: ~translational elongation GO: ~negative regulation of apoptosis GO: ~proteoglycan biosynthetic process GO: ~response to toxin GO: ~synapse organization GO: ~neuromuscular process GO: ~cell cycle GO: ~positive regulation of macromolecule biosynthetic process GO: ~microtubule-based process GO: ~glycoprotein metabolic process GO: ~proteolysis involved in cellular protein catabolic process Page 15 of 19
16 GO: ~cellular protein catabolic process GO: ~anti-apoptosis GO: ~regeneration GO: ~maternal process involved in female pregnancy GO: ~extracellular structure organization GO: ~ossification GO: ~sulfur metabolic process GO: ~synaptogenesis GO: ~palate development GO: ~protein catabolic process GO: ~cell cycle process GO: ~protein ubiquitination GO: ~negative regulation of molecular function GO: ~protein transport GO: ~bone development GO: ~leukocyte homeostasis GO: ~positive regulation of gene expression GO: ~establishment of protein localization GO: ~positive regulation of catalytic activity GO: ~positive regulation of nitrogen compound metabolic process GO: ~response to wounding GO: ~positive regulation of T cell proliferation GO: ~intracellular transport GO: ~protein modification by small protein conjugation Page 16 of 19
17 GO: ~cellular macromolecule catabolic process GO: ~cell-cell signaling GO: ~Wnt receptor signaling pathway GO: ~synaptic transmission GO: ~proteolysis hypertrophy. Supplementary Table 1. Over-represented biological processes according to gene ontology (GO) in the fetal module recapitulated in heart failure and Page 17 of 19
18 SUPPLEMENTARY TABLE 2. Entrez ID Gene Name Der1-like domain family, member F-box protein UBX domain protein anaphase promoting complex subunit 11 cell division cycle 26 homolog (S. cerevisiae); cell division cycle 26 homolog (S. cerevisiae) pseudogene 5705 proteasome (prosome, macropain) 26S subunit, ATPase, proteasome (prosome, macropain) subunit, alpha type, ubiquitin B 7324 ubiquitin-conjugating enzyme E2E 1 hypertrophy. Supplementary Table 2. Summary of genes involved in protein catabolism biological process from fetal module recapitulated in heart failure and Page 18 of 19
19 REFERENCES 1. Barbacioru CC, Wang Y, Canales RD, Sun YA, Keys DN, Chan F, Poulter KA, Samaha RR. Effect of various normalization methods on Applied Biosystems expression array system data. BMC Bioinformatics. 2006;7: Zhang B, Horvath S. A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol. 2005;4:Article Barabasi AL, Albert R. Emergence of scaling in random networks. Science. 1999;286: Jeong H, Tombor B, Albert R, Oltvai ZN, Barabasi AL. The large-scale organization of metabolic networks. Nature. 2000;407: Yip AM, Horvath S. Gene network interconnectedness and the generalized topological overlap measure. BMC Bioinformatics. 2007;8: Levy S, Hannenhalli S. Identification of transcription factor binding sites in the human genome sequence. Mamm Genome. 2002;13: Wingender E, Dietze P, Karas H, Knuppel R. TRANSFAC: a database on transcription factors and their DNA binding sites. Nucleic Acids Res. 1996;24: Hannenhalli S, Putt ME, Gilmore JM, Wang J, Parmacek MS, Epstein JA, Morrisey EE, Margulies KB, Cappola TP. Transcriptional genomics associates FOX transcription factors with human heart failure. Circulation. 2006;114: Page 19 of 19
Weighted gene co-expression analysis. Yuehua Cui June 7, 2013
Weighted gene co-expression analysis Yuehua Cui June 7, 2013 Weighted gene co-expression network (WGCNA) A type of scale-free network: A scale-free network is a network whose degree distribution follows
More informationBioControl - Week 6, Lecture 1
BioControl - Week 6, Lecture 1 Goals of this lecture Large metabolic networks organization Design principles for small genetic modules - Rules based on gene demand - Rules based on error minimization Suggested
More information1 GO: regulation of cell size E-04 2 GO: negative regulation of cell growth GO:
Table S2: The biological modulated by mir-5701 Sr. No Term Id 1 Term Name 2 Hit Gene Number 3 P-Value 4 1 GO:0008361 regulation of cell size 9 4.37E-04 2 GO:0030308 negative regulation of cell growth 8
More informationInteraction Network Analysis
CSI/BIF 5330 Interaction etwork Analsis Young-Rae Cho Associate Professor Department of Computer Science Balor Universit Biological etworks Definition Maps of biochemical reactions, interactions, regulations
More informationnetworks in molecular biology Wolfgang Huber
networks in molecular biology Wolfgang Huber networks in molecular biology Regulatory networks: components = gene products interactions = regulation of transcription, translation, phosphorylation... Metabolic
More informationNetwork Biology: Understanding the cell s functional organization. Albert-László Barabási Zoltán N. Oltvai
Network Biology: Understanding the cell s functional organization Albert-László Barabási Zoltán N. Oltvai Outline: Evolutionary origin of scale-free networks Motifs, modules and hierarchical networks Network
More informationS1 Gene ontology (GO) analysis of the network alignment results
1 Supplementary Material for Effective comparative analysis of protein-protein interaction networks by measuring the steady-state network flow using a Markov model Hyundoo Jeong 1, Xiaoning Qian 1 and
More informationTypes of biological networks. I. Intra-cellurar networks
Types of biological networks I. Intra-cellurar networks 1 Some intra-cellular networks: 1. Metabolic networks 2. Transcriptional regulation networks 3. Cell signalling networks 4. Protein-protein interaction
More informationSupplementary Information
Supplementary Information For the article"comparable system-level organization of Archaea and ukaryotes" by J. Podani, Z. N. Oltvai, H. Jeong, B. Tombor, A.-L. Barabási, and. Szathmáry (reference numbers
More informationBioinformatics 2. Yeast two hybrid. Proteomics. Proteomics
GENOME Bioinformatics 2 Proteomics protein-gene PROTEOME protein-protein METABOLISM Slide from http://www.nd.edu/~networks/ Citrate Cycle Bio-chemical reactions What is it? Proteomics Reveal protein Protein
More informationCell biology traditionally identifies proteins based on their individual actions as catalysts, signaling
Lethality and centrality in protein networks Cell biology traditionally identifies proteins based on their individual actions as catalysts, signaling molecules, or building blocks of cells and microorganisms.
More informationAndrogen-independent prostate cancer
The following tutorial walks through the identification of biological themes in a microarray dataset examining androgen-independent. Visit the GeneSifter Data Center (www.genesifter.net/web/datacenter.html)
More informationProteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it?
Proteomics What is it? Reveal protein interactions Protein profiling in a sample Yeast two hybrid screening High throughput 2D PAGE Automatic analysis of 2D Page Yeast two hybrid Use two mating strains
More informationA Geometric Interpretation of Gene Co-Expression Network Analysis. Steve Horvath, Jun Dong
A Geometric Interpretation of Gene Co-Expression Network Analysis Steve Horvath, Jun Dong Outline Network and network concepts Approximately factorizable networks Gene Co-expression Network Eigengene Factorizability,
More informationBiological Process Term Enrichment
Biological Process Term Enrichment cellular protein localization cellular macromolecule localization intracellular protein transport intracellular transport generation of precursor metabolites and energy
More informationDynamic modular architecture of protein-protein interaction networks beyond the dichotomy of date and party hubs
Dynamic modular architecture of protein-protein interaction networks beyond the dichotomy of date and party hubs Xiao Chang 1,#, Tao Xu 2,#, Yun Li 3, Kai Wang 1,4,5,* 1 Zilkha Neurogenetic Institute,
More informationClustering and Network
Clustering and Network Jing-Dong Jackie Han jdhan@picb.ac.cn http://www.picb.ac.cn/~jdhan Copy Right: Jing-Dong Jackie Han What is clustering? A way of grouping together data samples that are similar in
More informationBMD645. Integration of Omics
BMD645 Integration of Omics Shu-Jen Chen, Chang Gung University Dec. 11, 2009 1 Traditional Biology vs. Systems Biology Traditional biology : Single genes or proteins Systems biology: Simultaneously study
More informationNetworks & pathways. Hedi Peterson MTAT Bioinformatics
Networks & pathways Hedi Peterson (peterson@quretec.com) MTAT.03.239 Bioinformatics 03.11.2010 Networks are graphs Nodes Edges Edges Directed, undirected, weighted Nodes Genes Proteins Metabolites Enzymes
More informationEvidence for dynamically organized modularity in the yeast protein-protein interaction network
Evidence for dynamically organized modularity in the yeast protein-protein interaction network Sari Bombino Helsinki 27.3.2007 UNIVERSITY OF HELSINKI Department of Computer Science Seminar on Computational
More informationRule learning for gene expression data
Rule learning for gene expression data Stefan Enroth Original slides by Torgeir R. Hvidsten The Linnaeus Centre for Bioinformatics Predicting biological process from gene expression time profiles Papers:
More informationInferring Transcriptional Regulatory Networks from Gene Expression Data II
Inferring Transcriptional Regulatory Networks from Gene Expression Data II Lectures 9 Oct 26, 2011 CSE 527 Computational Biology, Fall 2011 Instructor: Su-In Lee TA: Christopher Miles Monday & Wednesday
More informationA General Framework for Weighted Gene Co-Expression Network Analysis. Steve Horvath Human Genetics and Biostatistics University of CA, LA
A General Framework for Weighted Gene Co-Expression Network Analysis Steve Horvath Human Genetics and Biostatistics University of CA, LA Content Novel statistical approach for analyzing microarray data:
More informationSelf Similar (Scale Free, Power Law) Networks (I)
Self Similar (Scale Free, Power Law) Networks (I) E6083: lecture 4 Prof. Predrag R. Jelenković Dept. of Electrical Engineering Columbia University, NY 10027, USA {predrag}@ee.columbia.edu February 7, 2007
More informationComputational methods for predicting protein-protein interactions
Computational methods for predicting protein-protein interactions Tomi Peltola T-61.6070 Special course in bioinformatics I 3.4.2008 Outline Biological background Protein-protein interactions Computational
More informationFunctional Characterization and Topological Modularity of Molecular Interaction Networks
Functional Characterization and Topological Modularity of Molecular Interaction Networks Jayesh Pandey 1 Mehmet Koyutürk 2 Ananth Grama 1 1 Department of Computer Science Purdue University 2 Department
More informationBiological Networks. Gavin Conant 163B ASRC
Biological Networks Gavin Conant 163B ASRC conantg@missouri.edu 882-2931 Types of Network Regulatory Protein-interaction Metabolic Signaling Co-expressing General principle Relationship between genes Gene/protein/enzyme
More informationGene Ontology and Functional Enrichment. Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein
Gene Ontology and Functional Enrichment Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein The parsimony principle: A quick review Find the tree that requires the fewest
More informationPredicting Protein Functions and Domain Interactions from Protein Interactions
Predicting Protein Functions and Domain Interactions from Protein Interactions Fengzhu Sun, PhD Center for Computational and Experimental Genomics University of Southern California Outline High-throughput
More informationProtein-protein interaction networks Prof. Peter Csermely
Protein-Protein Interaction Networks 1 Department of Medical Chemistry Semmelweis University, Budapest, Hungary www.linkgroup.hu csermely@eok.sote.hu Advantages of multi-disciplinarity Networks have general
More informationIntroduction to Bioinformatics
CSCI8980: Applied Machine Learning in Computational Biology Introduction to Bioinformatics Rui Kuang Department of Computer Science and Engineering University of Minnesota kuang@cs.umn.edu History of Bioinformatics
More informationSupplementary Information
Supplementary Information Supplementary Figure 1. Schematic pipeline for single-cell genome assembly, cleaning and annotation. a. The assembly process was optimized to account for multiple cells putatively
More informationComputational Systems Biology
Computational Systems Biology Vasant Honavar Artificial Intelligence Research Laboratory Bioinformatics and Computational Biology Graduate Program Center for Computational Intelligence, Learning, & Discovery
More informationHow much non-coding DNA do eukaryotes require?
How much non-coding DNA do eukaryotes require? Andrei Zinovyev UMR U900 Computational Systems Biology of Cancer Institute Curie/INSERM/Ecole de Mine Paritech Dr. Sebastian Ahnert Dr. Thomas Fink Bioinformatics
More informationCISC 636 Computational Biology & Bioinformatics (Fall 2016)
CISC 636 Computational Biology & Bioinformatics (Fall 2016) Predicting Protein-Protein Interactions CISC636, F16, Lec22, Liao 1 Background Proteins do not function as isolated entities. Protein-Protein
More informationContext dependent visualization of protein function
Article III Context dependent visualization of protein function In: Juho Rousu, Samuel Kaski and Esko Ukkonen (eds.). Probabilistic Modeling and Machine Learning in Structural and Systems Biology. 2006,
More informationLecture 10: Cyclins, cyclin kinases and cell division
Chem*3560 Lecture 10: Cyclins, cyclin kinases and cell division The eukaryotic cell cycle Actively growing mammalian cells divide roughly every 24 hours, and follow a precise sequence of events know as
More informationAnalysis of Biological Networks: Network Robustness and Evolution
Analysis of Biological Networks: Network Robustness and Evolution Lecturer: Roded Sharan Scribers: Sasha Medvedovsky and Eitan Hirsh Lecture 14, February 2, 2006 1 Introduction The chapter is divided into
More informationCourse Descriptions Biology
Course Descriptions Biology BIOL 1010 (F/S) Human Anatomy and Physiology I. An introductory study of the structure and function of the human organ systems including the nervous, sensory, muscular, skeletal,
More informationWGCNA User Manual. (for version 1.0.x)
(for version 1.0.x) WGCNA User Manual A systems biologic microarray analysis software for finding important genes and pathways. The WGCNA (weighted gene co-expression network analysis) software implements
More informationTranscription Regulation and Gene Expression in Eukaryotes FS08 Pharmacenter/Biocenter Auditorium 1 Wednesdays 16h15-18h00.
Transcription Regulation and Gene Expression in Eukaryotes FS08 Pharmacenter/Biocenter Auditorium 1 Wednesdays 16h15-18h00. Promoters and Enhancers Systematic discovery of transcriptional regulatory motifs
More informationDr. Amira A. AL-Hosary
Phylogenetic analysis Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic Basics: Biological
More informationComparative RNA-seq analysis of transcriptome dynamics during petal development in Rosa chinensis
Title Comparative RNA-seq analysis of transcriptome dynamics during petal development in Rosa chinensis Author list Yu Han 1, Huihua Wan 1, Tangren Cheng 1, Jia Wang 1, Weiru Yang 1, Huitang Pan 1* & Qixiang
More informationUnderstanding Science Through the Lens of Computation. Richard M. Karp Nov. 3, 2007
Understanding Science Through the Lens of Computation Richard M. Karp Nov. 3, 2007 The Computational Lens Exposes the computational nature of natural processes and provides a language for their description.
More informationGRAPH-THEORETICAL COMPARISON REVEALS STRUCTURAL DIVERGENCE OF HUMAN PROTEIN INTERACTION NETWORKS
141 GRAPH-THEORETICAL COMPARISON REVEALS STRUCTURAL DIVERGENCE OF HUMAN PROTEIN INTERACTION NETWORKS MATTHIAS E. FUTSCHIK 1 ANNA TSCHAUT 2 m.futschik@staff.hu-berlin.de tschaut@zedat.fu-berlin.de GAUTAM
More informationCluster Analysis of Gene Expression Microarray Data. BIOL 495S/ CS 490B/ MATH 490B/ STAT 490B Introduction to Bioinformatics April 8, 2002
Cluster Analysis of Gene Expression Microarray Data BIOL 495S/ CS 490B/ MATH 490B/ STAT 490B Introduction to Bioinformatics April 8, 2002 1 Data representations Data are relative measurements log 2 ( red
More informationDesign and characterization of chemical space networks
Design and characterization of chemical space networks Martin Vogt B-IT Life Science Informatics Rheinische Friedrich-Wilhelms-University Bonn 16 August 2015 Network representations of chemical spaces
More informationGene Ontology. Shifra Ben-Dor. Weizmann Institute of Science
Gene Ontology Shifra Ben-Dor Weizmann Institute of Science Outline of Session What is GO (Gene Ontology)? What tools do we use to work with it? Combination of GO with other analyses What is Ontology? 1700s
More informationSUPPLEMENTARY INFORMATION
Supplementary information S1 (box). Supplementary Methods description. Prokaryotic Genome Database Archaeal and bacterial genome sequences were downloaded from the NCBI FTP site (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/)
More informationWritten Exam 15 December Course name: Introduction to Systems Biology Course no
Technical University of Denmark Written Exam 15 December 2008 Course name: Introduction to Systems Biology Course no. 27041 Aids allowed: Open book exam Provide your answers and calculations on separate
More informationZhongyi Xiao. Correlation. In probability theory and statistics, correlation indicates the
Character Correlation Zhongyi Xiao Correlation In probability theory and statistics, correlation indicates the strength and direction of a linear relationship between two random variables. In general statistical
More informationErzsébet Ravasz Advisor: Albert-László Barabási
Hierarchical Networks Erzsébet Ravasz Advisor: Albert-László Barabási Introduction to networks How to model complex networks? Clustering and hierarchy Hierarchical organization of cellular metabolism The
More informationDifferential Modeling for Cancer Microarray Data
Differential Modeling for Cancer Microarray Data Omar Odibat Department of Computer Science Feb, 01, 2011 1 Outline Introduction Cancer Microarray data Problem Definition Differential analysis Existing
More informationDeciphering regulatory networks by promoter sequence analysis
Bioinformatics Workshop 2009 Interpreting Gene Lists from -omics Studies Deciphering regulatory networks by promoter sequence analysis Elodie Portales-Casamar University of British Columbia www.cisreg.ca
More informationA New Method to Build Gene Regulation Network Based on Fuzzy Hierarchical Clustering Methods
International Academic Institute for Science and Technology International Academic Journal of Science and Engineering Vol. 3, No. 6, 2016, pp. 169-176. ISSN 2454-3896 International Academic Journal of
More informationMotif Prediction in Amino Acid Interaction Networks
Motif Prediction in Amino Acid Interaction Networks Omar GACI and Stefan BALEV Abstract In this paper we represent a protein as a graph where the vertices are amino acids and the edges are interactions
More informationDivergence Pattern of Duplicate Genes in Protein-Protein Interactions Follows the Power Law
Divergence Pattern of Duplicate Genes in Protein-Protein Interactions Follows the Power Law Ze Zhang,* Z. W. Luo,* Hirohisa Kishino,à and Mike J. Kearsey *School of Biosciences, University of Birmingham,
More informationThe Generalized Topological Overlap Matrix For Detecting Modules in Gene Networks
The Generalized Topological Overlap Matrix For Detecting Modules in Gene Networks Andy M. Yip Steve Horvath Abstract Systems biologic studies of gene and protein interaction networks have found that these
More informationPreface. Contributors
CONTENTS Foreword Preface Contributors PART I INTRODUCTION 1 1 Networks in Biology 3 Björn H. Junker 1.1 Introduction 3 1.2 Biology 101 4 1.2.1 Biochemistry and Molecular Biology 4 1.2.2 Cell Biology 6
More informationFine-scale dissection of functional protein network. organization by dynamic neighborhood analysis
Fine-scale dissection of functional protein network organization by dynamic neighborhood analysis Kakajan Komurov 1, Mehmet H. Gunes 2, Michael A. White 1 1 Department of Cell Biology, University of Texas
More informationDynamic optimisation identifies optimal programs for pathway regulation in prokaryotes. - Supplementary Information -
Dynamic optimisation identifies optimal programs for pathway regulation in prokaryotes - Supplementary Information - Martin Bartl a, Martin Kötzing a,b, Stefan Schuster c, Pu Li a, Christoph Kaleta b a
More informationComputational approaches for functional genomics
Computational approaches for functional genomics Kalin Vetsigian October 31, 2001 The rapidly increasing number of completely sequenced genomes have stimulated the development of new methods for finding
More informationIntegration of functional genomics data
Integration of functional genomics data Laboratoire Bordelais de Recherche en Informatique (UMR) Centre de Bioinformatique de Bordeaux (Plateforme) Rennes Oct. 2006 1 Observations and motivations Genomics
More informationThe architecture of complexity: the structure and dynamics of complex networks.
SMR.1656-36 School and Workshop on Structure and Function of Complex Networks 16-28 May 2005 ------------------------------------------------------------------------------------------------------------------------
More informationIn order to compare the proteins of the phylogenomic matrix, we needed a similarity
Similarity Matrix Generation In order to compare the proteins of the phylogenomic matrix, we needed a similarity measure. Hamming distances between phylogenetic profiles require the use of thresholds for
More informationIdentifying Signaling Pathways
These slides, excluding third-party material, are licensed under CC BY-NC 4.0 by Anthony Gitter, Mark Craven, Colin Dewey Identifying Signaling Pathways BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2018
More informationProteomics Systems Biology
Dr. Sanjeeva Srivastava IIT Bombay Proteomics Systems Biology IIT Bombay 2 1 DNA Genomics RNA Transcriptomics Global Cellular Protein Proteomics Global Cellular Metabolite Metabolomics Global Cellular
More informationAmira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut
Amira A. AL-Hosary PhD of infectious diseases Department of Animal Medicine (Infectious Diseases) Faculty of Veterinary Medicine Assiut University-Egypt Phylogenetic analysis Phylogenetic Basics: Biological
More informationBiology 112 Practice Midterm Questions
Biology 112 Practice Midterm Questions 1. Identify which statement is true or false I. Bacterial cell walls prevent osmotic lysis II. All bacterial cell walls contain an LPS layer III. In a Gram stain,
More informationV 5 Robustness and Modularity
Bioinformatics 3 V 5 Robustness and Modularity Mon, Oct 29, 2012 Network Robustness Network = set of connections Failure events: loss of edges loss of nodes (together with their edges) loss of connectivity
More informationProtein function prediction via analysis of interactomes
Protein function prediction via analysis of interactomes Elena Nabieva Mona Singh Department of Computer Science & Lewis-Sigler Institute for Integrative Genomics January 22, 2008 1 Introduction Genome
More informationAP Biology Unit 6 Practice Test 1. A group of cells is assayed for DNA content immediately following mitosis and is found to have an average of 8
AP Biology Unit 6 Practice Test Name: 1. A group of cells is assayed for DNA content immediately following mitosis and is found to have an average of 8 picograms of DNA per nucleus. How many picograms
More informationComparing transcription factor regulatory networks of human cell types. The Protein Network Workshop June 8 12, 2015
Comparing transcription factor regulatory networks of human cell types The Protein Network Workshop June 8 12, 2015 KWOK-PUI CHOI Dept of Statistics & Applied Probability, Dept of Mathematics, NUS OUTLINE
More informationComparative Network Analysis
Comparative Network Analysis BMI/CS 776 www.biostat.wisc.edu/bmi776/ Spring 2016 Anthony Gitter gitter@biostat.wisc.edu These slides, excluding third-party material, are licensed under CC BY-NC 4.0 by
More informationComputational Network Biology Biostatistics & Medical Informatics 826 Fall 2018
Computational Network Biology Biostatistics & Medical Informatics 826 Fall 2018 Sushmita Roy sroy@biostat.wisc.edu https://compnetbiocourse.discovery.wisc.edu Sep 6 th 2018 Goals for today Administrivia
More informationBiological Networks: Comparison, Conservation, and Evolution via Relative Description Length By: Tamir Tuller & Benny Chor
Biological Networks:,, and via Relative Description Length By: Tamir Tuller & Benny Chor Presented by: Noga Grebla Content of the presentation Presenting the goals of the research Reviewing basic terms
More informationBIOH111. o Cell Biology Module o Tissue Module o Integumentary system o Skeletal system o Muscle system o Nervous system o Endocrine system
BIOH111 o Cell Biology Module o Tissue Module o Integumentary system o Skeletal system o Muscle system o Nervous system o Endocrine system Endeavour College of Natural Health endeavour.edu.au 1 Textbook
More informationDe novo assembly of the pepper transcriptome (Capsicum annuum): a benchmark for in silico discovery of SNPs, SSRs and candidate genes
Additional_file Ashrafi_et_al_0_Pepper_Annotation_Supp_0070.docx A Microsoft-Word 007 file with figures comparing the results of BlastGO for GeneChip (Sanger-EST) and transcriptome assemlies of pepper
More information16 The Cell Cycle. Chapter Outline The Eukaryotic Cell Cycle Regulators of Cell Cycle Progression The Events of M Phase Meiosis and Fertilization
The Cell Cycle 16 The Cell Cycle Chapter Outline The Eukaryotic Cell Cycle Regulators of Cell Cycle Progression The Events of M Phase Meiosis and Fertilization Introduction Self-reproduction is perhaps
More informationCSCE555 Bioinformatics. Protein Function Annotation
CSCE555 Bioinformatics Protein Function Annotation Why we need to do function annotation? Fig from: Network-based prediction of protein function. Molecular Systems Biology 3:88. 2007 What s function? The
More informationRegulation of gene expression. Premedical - Biology
Regulation of gene expression Premedical - Biology Regulation of gene expression in prokaryotic cell Operon units system of negative feedback positive and negative regulation in eukaryotic cell - at any
More informationComputational Genomics. Systems biology. Putting it together: Data integration using graphical models
02-710 Computational Genomics Systems biology Putting it together: Data integration using graphical models High throughput data So far in this class we discussed several different types of high throughput
More informationNetwork Biology-part II
Network Biology-part II Jun Zhu, Ph. D. Professor of Genomics and Genetic Sciences Icahn Institute of Genomics and Multi-scale Biology The Tisch Cancer Institute Icahn Medical School at Mount Sinai New
More informationHands-On Nine The PAX6 Gene and Protein
Hands-On Nine The PAX6 Gene and Protein Main Purpose of Hands-On Activity: Using bioinformatics tools to examine the sequences, homology, and disease relevance of the Pax6: a master gene of eye formation.
More informationGO ID GO term Number of members GO: translation 225 GO: nucleosome 50 GO: calcium ion binding 76 GO: structural
GO ID GO term Number of members GO:0006412 translation 225 GO:0000786 nucleosome 50 GO:0005509 calcium ion binding 76 GO:0003735 structural constituent of ribosome 170 GO:0019861 flagellum 23 GO:0005840
More informationarxiv:q-bio/ v1 [q-bio.mn] 13 Aug 2004
Network properties of protein structures Ganesh Bagler and Somdatta Sinha entre for ellular and Molecular Biology, Uppal Road, Hyderabad 57, India (Dated: June 8, 218) arxiv:q-bio/89v1 [q-bio.mn] 13 Aug
More informationGENE ONTOLOGY (GO) Wilver Martínez Martínez Giovanny Silva Rincón
GENE ONTOLOGY (GO) Wilver Martínez Martínez Giovanny Silva Rincón What is GO? The Gene Ontology (GO) project is a collaborative effort to address the need for consistent descriptions of gene products in
More informationThe Role of Network Science in Biology and Medicine. Tiffany J. Callahan Computational Bioscience Program Hunter/Kahn Labs
The Role of Network Science in Biology and Medicine Tiffany J. Callahan Computational Bioscience Program Hunter/Kahn Labs Network Analysis Working Group 09.28.2017 Network-Enabled Wisdom (NEW) empirically
More informationLecture 8: Temporal programs and the global structure of transcription networks. Chap 5 of Alon. 5.1 Introduction
Lecture 8: Temporal programs and the global structure of transcription networks Chap 5 of Alon 5. Introduction We will see in this chapter that sensory transcription networks are largely made of just four
More informationChapter 8: The Topology of Biological Networks. Overview
Chapter 8: The Topology of Biological Networks 8.1 Introduction & survey of network topology Prof. Yechiam Yemini (YY) Computer Science Department Columbia University A gallery of networks Small-world
More informationLife Sciences 1a: Section 3B. The cell division cycle Objectives Understand the challenges to producing genetically identical daughter cells
Life Sciences 1a: Section 3B. The cell division cycle Objectives Understand the challenges to producing genetically identical daughter cells Understand how a simple biochemical oscillator can drive the
More informationIntro Gene regulation Synteny The End. Today. Gene regulation Synteny Good bye!
Today Gene regulation Synteny Good bye! Gene regulation What governs gene transcription? Genes active under different circumstances. Gene regulation What governs gene transcription? Genes active under
More informationProfiling Human Cell/Tissue Specific Gene Regulation Networks
Profiling Human Cell/Tissue Specific Gene Regulation Networks Louxin Zhang Department of Mathematics National University of Singapore matzlx@nus.edu.sg Network Biology < y 1 t, y 2 t,, y k t > u 1 u 2
More informationChapter 15 Active Reading Guide Regulation of Gene Expression
Name: AP Biology Mr. Croft Chapter 15 Active Reading Guide Regulation of Gene Expression The overview for Chapter 15 introduces the idea that while all cells of an organism have all genes in the genome,
More informationDiscovering modules in expression profiles using a network
Discovering modules in expression profiles using a network Igor Ulitsky 1 2 Protein-protein interactions (PPIs) Low throughput measurements: accurate, scarce High throughput: more abundant, noisy Large,
More informationWelcome to HST.508/Biophysics 170
Harvard-MIT Division of Health Sciences and Technology HST.58: Quantitative Genomics, Fall 5 Instructors: Leonid Mirny, Robert Berwick, Alvin Kho, Isaac Kohane Welcome to HST.58/Biophysics Our emphasis
More informationSupplemental Material
Supplemental Material Protocol S1 The PromPredict algorithm considers the average free energy over a 100nt window and compares it with the the average free energy over a downstream 100nt window separated
More informationGeert Geeven. April 14, 2010
iction of Gene Regulatory Interactions NDNS+ Workshop April 14, 2010 Today s talk - Outline Outline Biological Background Construction of Predictors The main aim of my project is to better understand the
More informationTaxonomy. Content. How to determine & classify a species. Phylogeny and evolution
Taxonomy Content Why Taxonomy? How to determine & classify a species Domains versus Kingdoms Phylogeny and evolution Why Taxonomy? Classification Arrangement in groups or taxa (taxon = group) Nomenclature
More informationTiffany Samaroo MB&B 452a December 8, Take Home Final. Topic 1
Tiffany Samaroo MB&B 452a December 8, 2003 Take Home Final Topic 1 Prior to 1970, protein and DNA sequence alignment was limited to visual comparison. This was a very tedious process; even proteins with
More information