Supplementary Figure 1 Histogram of the marginal probabilities of the ancestral sequence reconstruction without gaps (insertions and deletions).
|
|
- Bernard Oliver
- 6 years ago
- Views:
Transcription
1 Supplementary Figure 1 Histogram of the marginal probabilities of the ancestral sequence reconstruction without gaps (insertions and deletions).
2 Supplementary Figure 2 Marginal probabilities of the ancestral sequence reconstruction (via FastML) (Ashkenazy et al., 2012) with respect to amino acid position and secondary structure prediction (JPred3) (Cole et al., 2006).
3 Supplementary Figure 3 Structural assessment via SWISS-MODEL (Arnold et al. 2006) for the best model built for the ancestral sequence between vertebrate thymidine kinase 2 and the arthropod multisubstrate deoxyribonucleoside kinases.
4 Supplementary Figure 4 Structural assessment via SWISS-MODEL (Arnold et al. 2006) for the best model built for human thymidine kinase 2
5 Supplementary Figure 5 Partial alignment of the TK2/dNK sequences (a) arranged by the topology of the corresponding tree (b). Notice the insertion unique to Monodelphis domestica and Sarcophilus harrissii (top middle, likely the cause of topological discrepancy between the gene and species trees). (a)
6 (b)
7 Supplementary Figure 6 DSSP (Kabsch and Sander, 1983) inference of secondary structure for D. melanogaster dnk, both truncated (1J90 ; Johansson et al. 2001) and non-trancated (2VP0; Mikkelsen et al. 2008), H. sapiens dck (2A2Z; Godsey et al. 2006), and H. sapiens dgk (2OCP; Johansson et al. 2001), and for the ROSETTA and DMD modeled structures of human TK2 and the ancestral sequence.
8 Supplementary Figure 7 Tree reconstruction of a subset of sequences from the original dataset. While the grouping differs from that in figure 2, the tree still implies that crustacean and some insect species have multiple dnks grouping paraphyletically.
9 Supplementary Figure 8: Gu99 results (Gu and Vander Velden, 2002) are shown for posterior probability cutoff values of 0.5 on the alignment (a) and the structures of Homo sapiens TK2 (as modeled by Rosetta) (b) and Drosophila melanogaster dnk (c). Coloring of the alignment corresponds to similarity in amino acid properties. Posterior probability profiles are given in Supplementary Figure 11. (a) (b) Homo sapiens TK2 Rosetta Model (c) 1J90 (Drosophila melanogaster dnk)
10 Supplementary Figure 9: Type 2 divergence (Gu and Vander Velden, 2002) results are shown for posterior probability cutoff values of 0.5 on the alignment (a) and the structures of Homo sapiens TK2 (as modeled by Rosetta) (b) and Drosophila melanogaster dnk (c). Coloring of the alignment corresponds to similarity in amino acid properties. Posterior probability profiles are given in Supplementary Figure 11. (a) (b) Homo sapiens TK2 Rosetta Model (c) 1J90 (Drosophila melanogaster dnk)
11 Supplementary Figure 10: Site specific profiles for a three-cluster analysis for type 1 divergence (a) (Gu99 algorithm), type 2 divergence (b), and type 1 divergence (c) (2013 implementation) (Gu and Vander Velden, 2002; Gu et al., 2013). The three clusters contain the TK2, the crustacean and arachnid dnks, and the insect dnks. (a) The Gu99 analysis only identified residue 237 to be of significance between the crustacean/arachnid and the insect dnks (yellow). No residues were identified to be significant in type 1 divergence between TK2 and the insect dnks (orange), while a number were found to have potential type 2 divergence patterns between TK2 and crustacean/arachnid dnk (blue). The residues of interest above posterior probabilities of 0.5 are highlighted in suppl. Fig. 9. (b) Posterior probabilities for type 2 divergence indicate that multiple residues may have contributed to functional divergence between TK2 and the crustacean/arachnid dnks (blue), as well as between the crustacean/arachnid and the insect dnks (yellow). These residues of interest above posterior probabilities of 0.5 are highlighted in suppl. Fig. 10. (c) Type 1 functional divergence analysis (2013 algorithm) implies with high posterior probability that none of the three clusters experienced type 1 functional divergence (red bars). The xaxis refers to residue position, while the y-axis indicates posterior probabilities. Blue bars represent the probability that cluster 1 only experienced type 1 divergence. Cluster 2 and 3 had posterior probabilities of zero for type 1 divergence (not shown). (a) (b) (c)
12 Supplementary Figure 11 Gu99 results (Gu and Vander Velden, 2002) are shown for posterior probability cutoff values of 0.5 on the alignment (a) and the structures of Homo sapiens dck (b) and Homo sapiens dgk (c). Coloring of the alignment corresponds to similarity in amino acid properties. Posterior probability profiles are given in Supplementary Figure 14. (a) (b) Homo sapiens dck (c) Homo sapiens dgk
13 Supplementary Figure 12 Type 2 divergence (Gu and Vander Velden, 2002) results are shown for posterior probability cutoff values of 0.5 on the alignment (a) and the structures of Homo sapiens dck (b) and Homo sapiens dgk (c). Coloring of the alignment corresponds to similarity in amino acid properties. Posterior probability profiles are given in Supplementary Figure 14. (a) (b) Homo sapiens dck (c) Homo sapiens dgk
14 Supplementary Figure 13 Site specific profiles for a three-cluster analysis for type 1 divergence (a) (Gu99 algorithm), type 2 divergence (b), and type 1 divergence (c) (2013 implementation) (Gu and Vander Velden, 2002; Gu et al., 2013). The three clusters contain the dgksand the dcks/dck2s. (a) The Gu99 did not identify any type 1 residues between dck and dck2, however, a cluster of residues seem to have experienced type 1 divergence between dgk and dck2 (blue) and possibly between dgk and dck (orange). The residues of interest above posterior probabilities of 0.5 are highlighted in suppl. fig. 12. (b) Posterior probabilities for type 2 divergence indicate that multiple residues may have contributed to functional divergence between dgk and dck2 (blue), as well as between the dck and dck2 (yellow). These residues of interest above posterior probabilities of 0.5 are highlighted in suppl. fig. 13. (c) Type 1 functional divergence analysis (2013 algorithm): the x-axis refers to residue position, while the y-axis indicates posterior probabilities. Red bars represent the posterior probability that none of the three clusters experienced type 1 functional divergence. Blue bars represent the probability that cluster 1 only experienced type 1 divergence. Purple bars represent type 1 functional divergence in cluster 2. Cluster 3 had posterior probabilities of zero for type 1 divergence (not shown). (a) (b) (c)
15 Supplementary Figure 14 Site specific profiles for the Gu99 two-cluster analyses (Gu and Vander Velden, 2002) of dnk and TK2 (a) and of dck and dgk (b). (a) (b)
16 References: Arnold K, Bordoli L, Kopp J, Schwede T (2006) The SWISS-MODEL Workspace: A web-based environment for protein structure homology modelling. Bioinformatics 22: Ashkenazy H, Penn O, Doron-Faigenboim A, Cohen O, Cannarozzi G, Zomer O, Pupko T (2012) FastML: a web server for probabilistic reconstruction of ancestralsequences. Nucl Acids Res 40(Web Server issue):w580-w584. Cole C, Barber JD, Barton GJ (2008) The Jpred 3 secondary structure prediction server. Nucl Acids Res 36(suppl 2):W197-W201. Gu X, Vander Velden K. (2002). DIVERGE: phylogeny-based analysis for functional-structural divergence of a protein family. Bioinformatics 18(3): Gu X, Zou Y, Su Z, Huang W, Zhou Z, Arendsee Z, Zeng Y. (2013). An update of DIVERGE software for functional divergence analysis of protein family. Mol Biol Evol 30(7): Godsey MH, Ort S, Sabini E, Konrad M, Lavie A (2006) Structural basis for the preference of UTP over ATP in human deoxycytidine kinase: illuminating the role of main-chain reorganization. Biochemistry 45: Johansson K, Ramaswamy S, Ljungcrantz C, Knecht W, Piškur J, Munch-Petersen B, Eriksson S, Eklund H (2001) Structural basis for substrate specificities of cellular deoxyribonucleoside kinases. Nat Struct Biol 8(7): Kabsch W, Sander C. (1983). Dictionary of protein secondary structure: pattern recognition of hydrogenbonded and geometrical features. Biopolymers 22: Mikkelsen NE, Munch-Petersen B, Eklund H (2008) Structural studies of nucleoside analog and feedback inhibitor binding to Drosophila melanogaster multisubstrate deoxyribonucleoside kinase. FEBS J 275(9):
Finding Motifs in Protein Sequences and Marking Their Positions in Protein Structures
MotifMarker Demo This document contains a demo of MotifMarker used to study the structures of the CDK2,CDC2, CDK4 and CDK6 highlighting the 3 important domains/motifs, namely, the PSTAIRE region, ATP binding
More informationPhylogenetics: Bayesian Phylogenetic Analysis. COMP Spring 2015 Luay Nakhleh, Rice University
Phylogenetics: Bayesian Phylogenetic Analysis COMP 571 - Spring 2015 Luay Nakhleh, Rice University Bayes Rule P(X = x Y = y) = P(X = x, Y = y) P(Y = y) = P(X = x)p(y = y X = x) P x P(X = x 0 )P(Y = y X
More informationNumber sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence
Number sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence Naoto Morikawa (nmorika@genocript.com) October 7, 2006. Abstract A protein is a sequence
More informationGenomics and bioinformatics summary. Finding genes -- computer searches
Genomics and bioinformatics summary 1. Gene finding: computer searches, cdnas, ESTs, 2. Microarrays 3. Use BLAST to find homologous sequences 4. Multiple sequence alignments (MSAs) 5. Trees quantify sequence
More informationMolecular modeling. A fragment sequence of 24 residues encompassing the region of interest of WT-
SUPPLEMENTARY DATA Molecular dynamics Molecular modeling. A fragment sequence of 24 residues encompassing the region of interest of WT- KISS1R, i.e. the last intracellular domain (Figure S1a), has been
More informationSome Problems from Enzyme Families
Some Problems from Enzyme Families Greg Butler Department of Computer Science Concordia University, Montreal www.cs.concordia.ca/~faculty/gregb gregb@cs.concordia.ca Abstract I will discuss some problems
More informationBioinformatics tools for phylogeny and visualization. Yanbin Yin
Bioinformatics tools for phylogeny and visualization Yanbin Yin 1 Homework assignment 5 1. Take the MAFFT alignment http://cys.bios.niu.edu/yyin/teach/pbb/purdue.cellwall.list.lignin.f a.aln as input and
More informationMegAlign Pro Pairwise Alignment Tutorials
MegAlign Pro Pairwise Alignment Tutorials All demo data for the following tutorials can be found in the MegAlignProAlignments.zip archive here. Tutorial 1: Multiple versus pairwise alignments 1. Extract
More informationProtein Structures: Experiments and Modeling. Patrice Koehl
Protein Structures: Experiments and Modeling Patrice Koehl Structural Bioinformatics: Proteins Proteins: Sources of Structure Information Proteins: Homology Modeling Proteins: Ab initio prediction Proteins:
More informationCISC 636 Computational Biology & Bioinformatics (Fall 2016)
CISC 636 Computational Biology & Bioinformatics (Fall 2016) Predicting Protein-Protein Interactions CISC636, F16, Lec22, Liao 1 Background Proteins do not function as isolated entities. Protein-Protein
More informationDetection of Protein Binding Sites II
Detection of Protein Binding Sites II Goal: Given a protein structure, predict where a ligand might bind Thomas Funkhouser Princeton University CS597A, Fall 2007 1hld Geometric, chemical, evolutionary
More informationActa Crystallographica Section D
Supporting information Acta Crystallographica Section D Volume 70 (2014) Supporting information for article: Structural characterization of the virulence factor Nuclease A from Streptococcus agalactiae
More informationPage 1. References. Hidden Markov models and multiple sequence alignment. Markov chains. Probability review. Example. Markovian sequence
Page Hidden Markov models and multiple sequence alignment Russ B Altman BMI 4 CS 74 Some slides borrowed from Scott C Schmidler (BMI graduate student) References Bioinformatics Classic: Krogh et al (994)
More informationIntroduction to Evolutionary Concepts
Introduction to Evolutionary Concepts and VMD/MultiSeq - Part I Zaida (Zan) Luthey-Schulten Dept. Chemistry, Beckman Institute, Biophysics, Institute of Genomics Biology, & Physics NIH Workshop 2009 VMD/MultiSeq
More informationIT og Sundhed 2010/11
IT og Sundhed 2010/11 Sequence based predictors. Secondary structure and surface accessibility Bent Petersen 13 January 2011 1 NetSurfP Real Value Solvent Accessibility predictions with amino acid associated
More informationNature Genetics: doi: /ng Supplementary Figure 1. Icm/Dot secretion system region I in 41 Legionella species.
Supplementary Figure 1 Icm/Dot secretion system region I in 41 Legionella species. Homologs of the effector-coding gene lega15 (orange) were found within Icm/Dot region I in 13 Legionella species. In four
More informationElements of Bioinformatics 14F01 TP5 -Phylogenetic analysis
Elements of Bioinformatics 14F01 TP5 -Phylogenetic analysis 10 December 2012 - Corrections - Exercise 1 Non-vertebrate chordates generally possess 2 homologs, vertebrates 3 or more gene copies; a Drosophila
More informationStructure of the SPRY domain of human DDX1 helicase, a putative interaction platform within a DEAD-box protein
Supporting information Volume 71 (2015) Supporting information for article: Structure of the SPRY domain of human DDX1 helicase, a putative interaction platform within a DEAD-box protein Julian Kellner
More informationGrouping of amino acids and recognition of protein structurally conserved regions by reduced alphabets of amino acids
Science in China Series C: Life Sciences 2007 Science in China Press Springer-Verlag Grouping of amino acids and recognition of protein structurally conserved regions by reduced alphabets of amino acids
More informationCryo-EM data collection, refinement and validation statistics
1 Table S1 Cryo-EM data collection, refinement and validation statistics Data collection and processing CPSF-160 WDR33 (EMDB-7114) (PDB 6BM0) CPSF-160 WDR33 (EMDB-7113) (PDB 6BLY) CPSF-160 WDR33 CPSF-30
More informationAlgorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment
Algorithms in Bioinformatics FOUR Sami Khuri Department of Computer Science San José State University Pairwise Sequence Alignment Homology Similarity Global string alignment Local string alignment Dot
More informationSupplementary Information
Supplementary Information Supplementary Figure 1. Schematic pipeline for single-cell genome assembly, cleaning and annotation. a. The assembly process was optimized to account for multiple cells putatively
More informationPhylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata.
Supplementary Note S2 Phylogenetic relationship among S. castellii, S. cerevisiae and C. glabrata. Phylogenetic trees reconstructed by a variety of methods from either single-copy orthologous loci (Class
More informationSecondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure
Bioch/BIMS 503 Lecture 2 Structure and Function of Proteins August 28, 2008 Robert Nakamoto rkn3c@virginia.edu 2-0279 Secondary Structure Φ Ψ angles determine protein structure Φ Ψ angles are restricted
More informationStatistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences
Statistical Machine Learning Methods for Bioinformatics II. Hidden Markov Model for Biological Sequences Jianlin Cheng, PhD Department of Computer Science University of Missouri 2008 Free for Academic
More informationNature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1
Supplementary Figure 1 Identification of the ScDcp2 minimal region interacting with both ScDcp1 and the ScEdc3 LSm domain. Pull-down experiment of untagged ScEdc3 LSm with various ScDcp1-Dcp2-His 6 fragments.
More informationComparison and Analysis of Heat Shock Proteins in Organisms of the Kingdom Viridiplantae. Emily Germain, Rensselaer Polytechnic Institute
Comparison and Analysis of Heat Shock Proteins in Organisms of the Kingdom Viridiplantae Emily Germain, Rensselaer Polytechnic Institute Mentor: Dr. Hugh Nicholas, Biomedical Initiative, Pittsburgh Supercomputing
More informationCMPS 3110: Bioinformatics. Tertiary Structure Prediction
CMPS 3110: Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the laws of physics! Conformation space is finite
More informationResearch Proposal. Title: Multiple Sequence Alignment used to investigate the co-evolving positions in OxyR Protein family.
Research Proposal Title: Multiple Sequence Alignment used to investigate the co-evolving positions in OxyR Protein family. Name: Minjal Pancholi Howard University Washington, DC. June 19, 2009 Research
More informationBIOINFORMATICS LAB AP BIOLOGY
BIOINFORMATICS LAB AP BIOLOGY Bioinformatics is the science of collecting and analyzing complex biological data. Bioinformatics combines computer science, statistics and biology to allow scientists to
More informationHomology Modeling. Roberto Lins EPFL - summer semester 2005
Homology Modeling Roberto Lins EPFL - summer semester 2005 Disclaimer: course material is mainly taken from: P.E. Bourne & H Weissig, Structural Bioinformatics; C.A. Orengo, D.T. Jones & J.M. Thornton,
More informationCMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction
CMPS 6630: Introduction to Computational Biology and Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the
More informationBIOINFORMATICS: An Introduction
BIOINFORMATICS: An Introduction What is Bioinformatics? The term was first coined in 1988 by Dr. Hwa Lim The original definition was : a collective term for data compilation, organisation, analysis and
More informationComputational methods for predicting protein-protein interactions
Computational methods for predicting protein-protein interactions Tomi Peltola T-61.6070 Special course in bioinformatics I 3.4.2008 Outline Biological background Protein-protein interactions Computational
More informationLecture 7 Sequence analysis. Hidden Markov Models
Lecture 7 Sequence analysis. Hidden Markov Models Nicolas Lartillot may 2012 Nicolas Lartillot (Universite de Montréal) BIN6009 may 2012 1 / 60 1 Motivation 2 Examples of Hidden Markov models 3 Hidden
More informationCHAPTERS 24-25: Evidence for Evolution and Phylogeny
CHAPTERS 24-25: Evidence for Evolution and Phylogeny 1. For each of the following, indicate how it is used as evidence of evolution by natural selection or shown as an evolutionary trend: a. Paleontology
More informationA New Similarity Measure among Protein Sequences
A New Similarity Measure among Protein Sequences Kuen-Pin Wu, Hsin-Nan Lin, Ting-Yi Sung and Wen-Lian Hsu * Institute of Information Science Academia Sinica, Taipei 115, Taiwan Abstract Protein sequence
More informationModelling of Possible Binding Modes of Caffeic Acid Derivatives to JAK3 Kinase
John von Neumann Institute for Computing Modelling of Possible Binding Modes of Caffeic Acid Derivatives to JAK3 Kinase J. Kuska, P. Setny, B. Lesyng published in From Computational Biophysics to Systems
More informationChapter 26 Phylogeny and the Tree of Life
Chapter 26 Phylogeny and the Tree of Life Chapter focus Shifting from the process of how evolution works to the pattern evolution produces over time. Phylogeny Phylon = tribe, geny = genesis or origin
More informationStatistical Machine Learning Methods for Biomedical Informatics II. Hidden Markov Model for Biological Sequences
Statistical Machine Learning Methods for Biomedical Informatics II. Hidden Markov Model for Biological Sequences Jianlin Cheng, PhD William and Nancy Thompson Missouri Distinguished Professor Department
More informationHomology models of the tetramerization domain of six eukaryotic voltage-gated potassium channels Kv1.1-Kv1.6
Homology models of the tetramerization domain of six eukaryotic voltage-gated potassium channels Kv1.1-Kv1.6 Hsuan-Liang Liu* and Chin-Wen Chen Department of Chemical Engineering and Graduate Institute
More information, Work in progress
2012-2013, Work in progress Samuel Blanquart, CR2 INRIA, EPI Bonsai 10 juin 2013 2012 2013 : Researches, animation and teaching Teaching : Algorithm and Application for 20 master 1 students (24h). Teaching
More informationProcheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics.
Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics Iosif Vaisman Email: ivaisman@gmu.edu ----------------------------------------------------------------- Bond
More informationAnalysis on sliding helices and strands in protein structural comparisons: A case study with protein kinases
Sliding helices and strands in structural comparisons 921 Analysis on sliding helices and strands in protein structural comparisons: A case study with protein kinases V S GOWRI, K ANAMIKA, S GORE 1 and
More informationSUPPLEMENTARY INFORMATION
SUPPLMTARY IFORMATIO a doi:10.108/nature10402 b 100 nm 100 nm c SAXS Model d ulers assigned to reference- Back-projected free class averages class averages Refinement against single particles Reconstructed
More informationEffects of Gap Open and Gap Extension Penalties
Brigham Young University BYU ScholarsArchive All Faculty Publications 200-10-01 Effects of Gap Open and Gap Extension Penalties Hyrum Carroll hyrumcarroll@gmail.com Mark J. Clement clement@cs.byu.edu See
More informationErasing Errors Due to Alignment Ambiguity When Estimating Positive Selection
Article (Methods) Erasing Errors Due to Alignment Ambiguity When Estimating Positive Selection Authors Benjamin Redelings 1,2 1 Biology Department, Duke University 2 National Evolutionary Synthesis Center
More informationCMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison
CMPS 6630: Introduction to Computational Biology and Bioinformatics Structure Comparison Protein Structure Comparison Motivation Understand sequence and structure variability Understand Domain architecture
More informationProtein Bioinformatics Computer lab #1 Friday, April 11, 2008 Sean Prigge and Ingo Ruczinski
Protein Bioinformatics 260.655 Computer lab #1 Friday, April 11, 2008 Sean Prigge and Ingo Ruczinski Goals: Approx. Time [1] Use the Protein Data Bank PDB website. 10 minutes [2] Use the WebMol Viewer.
More informationAlgorithms in Bioinformatics
Algorithms in Bioinformatics Sami Khuri Department of Computer Science San José State University San José, California, USA khuri@cs.sjsu.edu www.cs.sjsu.edu/faculty/khuri Distance Methods Character Methods
More informationThe PRALINE online server: optimising progressive multiple alignment on the web
Computational Biology and Chemistry 27 (2003) 511 519 Software Note The PRALINE online server: optimising progressive multiple alignment on the web V.A. Simossis a,b, J. Heringa a, a Bioinformatics Unit,
More informationSupplementary Information. Recognition of the pre-mirna structure by Drosophila Dicer-1
Supplementary Information Recognition of the pre-mirna structure by Drosophila Dicer-1 Akihisa Tsutsumi 1,2, Tomoko Kawamata 1, Natsuko Izumi 1 Hervé Seitz 3,4 and Yukihide Tomari 1,2 Nature Structural
More informationComparative Genomics II
Comparative Genomics II Advances in Bioinformatics and Genomics GEN 240B Jason Stajich May 19 Comparative Genomics II Slide 1/31 Outline Introduction Gene Families Pairwise Methods Phylogenetic Methods
More informationMultiple Sequence Alignment. Sequences
Multiple Sequence Alignment Sequences > YOR020c mstllksaksivplmdrvlvqrikaqaktasglylpe knveklnqaevvavgpgftdangnkvvpqvkvgdqvl ipqfggstiklgnddevilfrdaeilakiakd > crassa mattvrsvksliplldrvlvqrvkaeaktasgiflpe
More informationHands-On Nine The PAX6 Gene and Protein
Hands-On Nine The PAX6 Gene and Protein Main Purpose of Hands-On Activity: Using bioinformatics tools to examine the sequences, homology, and disease relevance of the Pax6: a master gene of eye formation.
More information2 Dean C. Adams and Gavin J. P. Naylor the best three-dimensional ordination of the structure space is found through an eigen-decomposition (correspon
A Comparison of Methods for Assessing the Structural Similarity of Proteins Dean C. Adams and Gavin J. P. Naylor? Dept. Zoology and Genetics, Iowa State University, Ames, IA 50011, U.S.A. 1 Introduction
More informationPhylogenetic Analysis. Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center
Phylogenetic Analysis Han Liang, Ph.D. Assistant Professor of Bioinformatics and Computational Biology UT MD Anderson Cancer Center Outline Basic Concepts Tree Construction Methods Distance-based methods
More informationTHEORY. Based on sequence Length According to the length of sequence being compared it is of following two types
Exp 11- THEORY Sequence Alignment is a process of aligning two sequences to achieve maximum levels of identity between them. This help to derive functional, structural and evolutionary relationships between
More informationUsing phylogenetics to estimate species divergence times... Basics and basic issues for Bayesian inference of divergence times (plus some digression)
Using phylogenetics to estimate species divergence times... More accurately... Basics and basic issues for Bayesian inference of divergence times (plus some digression) "A comparison of the structures
More informationPROTEIN FUNCTION PREDICTION WITH AMINO ACID SEQUENCE AND SECONDARY STRUCTURE ALIGNMENT SCORES
PROTEIN FUNCTION PREDICTION WITH AMINO ACID SEQUENCE AND SECONDARY STRUCTURE ALIGNMENT SCORES Eser Aygün 1, Caner Kömürlü 2, Zafer Aydin 3 and Zehra Çataltepe 1 1 Computer Engineering Department and 2
More informationSCOP. all-β class. all-α class, 3 different folds. T4 endonuclease V. 4-helical cytokines. Globin-like
SCOP all-β class 4-helical cytokines T4 endonuclease V all-α class, 3 different folds Globin-like TIM-barrel fold α/β class Profilin-like fold α+β class http://scop.mrc-lmb.cam.ac.uk/scop CATH Class, Architecture,
More informationSUPPLEMENTARY INFORMATION
Dph2 SeMet (iron-free) # Dph2 (iron-free) Dph2-[4Fe-4S] Data collection Space group P2 1 2 1 2 1 P2 1 2 1 2 1 P2 1 2 1 2 1 Cell dimensions a, b, c (Å) 58.26, 82.08, 160.42 58.74, 81.87, 160.01 55.70, 80.53,
More information2MHR. Protein structure classification is important because it organizes the protein structure universe that is independent of sequence similarity.
Protein structure classification is important because it organizes the protein structure universe that is independent of sequence similarity. A global picture of the protein universe will help us to understand
More informationCluster Analysis of Gene Expression Microarray Data. BIOL 495S/ CS 490B/ MATH 490B/ STAT 490B Introduction to Bioinformatics April 8, 2002
Cluster Analysis of Gene Expression Microarray Data BIOL 495S/ CS 490B/ MATH 490B/ STAT 490B Introduction to Bioinformatics April 8, 2002 1 Data representations Data are relative measurements log 2 ( red
More informationPHYLOGENY & THE TREE OF LIFE
PHYLOGENY & THE TREE OF LIFE PREFACE In this powerpoint we learn how biologists distinguish and categorize the millions of species on earth. Early we looked at the process of evolution here we look at
More informationSequence Bioinformatics. Multiple Sequence Alignment Waqas Nasir
Sequence Bioinformatics Multiple Sequence Alignment Waqas Nasir 2010-11-12 Multiple Sequence Alignment One amino acid plays coy; a pair of homologous sequences whisper; many aligned sequences shout out
More informationInDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9
Lecture 5 Alignment I. Introduction. For sequence data, the process of generating an alignment establishes positional homologies; that is, alignment provides the identification of homologous phylogenetic
More informationSome Statistical Inferences For Two Frequency Distributions Arising In Bioinformatics
Applied Mathematics E-Notes, 14(2014), 151-160 c ISSN 1607-2510 Available free at mirror sites of http://www.math.nthu.edu.tw/ amen/ Some Statistical Inferences For Two Frequency Distributions Arising
More informationSCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION. Using Anatomy, Embryology, Biochemistry, and Paleontology
SCIENTIFIC EVIDENCE TO SUPPORT THE THEORY OF EVOLUTION Using Anatomy, Embryology, Biochemistry, and Paleontology Scientific Fields Different fields of science have contributed evidence for the theory of
More informationSupplementary Figure 1 Crystal packing of ClR and electron density maps. Crystal packing of type A crystal (a) and type B crystal (b).
Supplementary Figure 1 Crystal packing of ClR and electron density maps. Crystal packing of type A crystal (a) and type B crystal (b). Crystal contacts at B-C loop are magnified and stereo view of A-weighted
More informationProtein Secondary Structure Prediction using Feed-Forward Neural Network
COPYRIGHT 2010 JCIT, ISSN 2078-5828 (PRINT), ISSN 2218-5224 (ONLINE), VOLUME 01, ISSUE 01, MANUSCRIPT CODE: 100713 Protein Secondary Structure Prediction using Feed-Forward Neural Network M. A. Mottalib,
More informationIntroduction to protein alignments
Introduction to protein alignments Comparative Analysis of Proteins Experimental evidence from one or more proteins can be used to infer function of related protein(s). Gene A Gene X Protein A compare
More informationSUPPLEMENTARY INFORMATION
Supplementary information S1 (box). Supplementary Methods description. Prokaryotic Genome Database Archaeal and bacterial genome sequences were downloaded from the NCBI FTP site (ftp://ftp.ncbi.nlm.nih.gov/genomes/all/)
More informationVariable-Length Protein Sequence Motif Extraction Using Hierarchically-Clustered Hidden Markov Models
2013 12th International Conference on Machine Learning and Applications Variable-Length Protein Sequence Motif Extraction Using Hierarchically-Clustered Hidden Markov Models Cody Hudson, Bernard Chen Department
More information1-D Predictions. Prediction of local features: Secondary structure & surface exposure
1-D Predictions Prediction of local features: Secondary structure & surface exposure 1 Learning Objectives After today s session you should be able to: Explain the meaning and usage of the following local
More informationSupporting Online Material for
www.sciencemag.org/cgi/content/full/309/5742/1868/dc1 Supporting Online Material for Toward High-Resolution de Novo Structure Prediction for Small Proteins Philip Bradley, Kira M. S. Misura, David Baker*
More informationDiphthamide biosynthesis requires a radical iron-sulfur enzyme. Pennsylvania State University, University Park, Pennsylvania 16802, USA
Diphthamide biosynthesis requires a radical iron-sulfur enzyme Yang Zhang, 1,4 Xuling Zhu, 1,4 Andrew T. Torelli, 1 Michael Lee, 2 Boris Dzikovski, 1 Rachel Koralewski, 1 Eileen Wang, 1 Jack Freed, 1 Carsten
More informationSoftware GASP: Gapped Ancestral Sequence Prediction for proteins Richard J Edwards* and Denis C Shields
BMC Bioinformatics BioMed Central Software GASP: Gapped Ancestral Sequence Prediction for proteins Richard J Edwards* and Denis C Shields Open Access Address: Bioinformatics Core, Clinical Pharmacology,
More informationComparing Genomes! Homologies and Families! Sequence Alignments!
Comparing Genomes! Homologies and Families! Sequence Alignments! Allows us to achieve a greater understanding of vertebrate evolution! Tells us what is common and what is unique between different species
More informationBIOINFORMATICS. NIFAS: visual analysis of domain evolution in proteins. Christian E. V. Storm and Erik L. L. Sonnhammer INTRODUCTION
BIOINFORMATICS Vol. 17 no. 4 2001 Pages 343 348 NIFAS: visual analysis of domain evolution in proteins Christian E. V. Storm and Erik L. L. Sonnhammer Center for Genomics Research, Karolinska Institutet,
More information1. Protein Data Bank (PDB) 1. Protein Data Bank (PDB)
Protein structure databases; visualization; and classifications 1. Introduction to Protein Data Bank (PDB) 2. Free graphic software for 3D structure visualization 3. Hierarchical classification of protein
More informationProtein structure alignments
Protein structure alignments Proteins that fold in the same way, i.e. have the same fold are often homologs. Structure evolves slower than sequence Sequence is less conserved than structure If BLAST gives
More informationMotifs, Profiles and Domains. Michael Tress Protein Design Group Centro Nacional de Biotecnología, CSIC
Motifs, Profiles and Domains Michael Tress Protein Design Group Centro Nacional de Biotecnología, CSIC Comparing Two Proteins Sequence Alignment Determining the pattern of evolution and identifying conserved
More informationRELATING PHYSICOCHEMMICAL PROPERTIES OF AMINO ACIDS TO VARIABLE NUCLEOTIDE SUBSTITUTION PATTERNS AMONG SITES ZIHENG YANG
RELATING PHYSICOCHEMMICAL PROPERTIES OF AMINO ACIDS TO VARIABLE NUCLEOTIDE SUBSTITUTION PATTERNS AMONG SITES ZIHENG YANG Department of Biology (Galton Laboratory), University College London, 4 Stephenson
More informationMolecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007
Molecular Modeling Prediction of Protein 3D Structure from Sequence Vimalkumar Velayudhan Jain Institute of Vocational and Advanced Studies May 21, 2007 Vimalkumar Velayudhan Molecular Modeling 1/23 Outline
More informationOptimization of the Sliding Window Size for Protein Structure Prediction
Optimization of the Sliding Window Size for Protein Structure Prediction Ke Chen* 1, Lukasz Kurgan 1 and Jishou Ruan 2 1 University of Alberta, Department of Electrical and Computer Engineering, Edmonton,
More informationModule: Sequence Alignment Theory and Applications Session: Introduction to Searching and Sequence Alignment
Module: Sequence Alignment Theory and Applications Session: Introduction to Searching and Sequence Alignment Introduction to Bioinformatics online course : IBT Jonathan Kayondo Learning Objectives Understand
More informationSUPPLEMENTARY INFORMATION
SUPPLEMENTARY INFORMATION doi:10.1038/nature11510 Supplementary Table 1. Indel Index Removal Gene Number of Starting Sequences Number of Final Sequences Percentage of Sequences Removed based on the Indel
More informationSupplementary Figure 1 Preparation of PDA nanoparticles derived from self-assembly of PCDA. (a)
Supplementary Figure 1 Preparation of PDA nanoparticles derived from self-assembly of PCDA. (a) Computer simulation on the self-assembly of PCDAs. Two kinds of totally different initial conformations were
More informationMotivating the need for optimal sequence alignments...
1 Motivating the need for optimal sequence alignments... 2 3 Note that this actually combines two objectives of optimal sequence alignments: (i) use the score of the alignment o infer homology; (ii) use
More informationBiological networks CS449 BIOINFORMATICS
CS449 BIOINFORMATICS Biological networks Programming today is a race between software engineers striving to build bigger and better idiot-proof programs, and the Universe trying to produce bigger and better
More informationNJMerge: A generic technique for scaling phylogeny estimation methods and its application to species trees
NJMerge: A generic technique for scaling phylogeny estimation methods and its application to species trees Erin Molloy and Tandy Warnow {emolloy2, warnow}@illinois.edu University of Illinois at Urbana
More informationCSCE555 Bioinformatics. Protein Function Annotation
CSCE555 Bioinformatics Protein Function Annotation Why we need to do function annotation? Fig from: Network-based prediction of protein function. Molecular Systems Biology 3:88. 2007 What s function? The
More informationA profile-based protein sequence alignment algorithm for a domain clustering database
A profile-based protein sequence alignment algorithm for a domain clustering database Lin Xu,2 Fa Zhang and Zhiyong Liu 3, Key Laboratory of Computer System and architecture, the Institute of Computing
More informationPage 1. Evolutionary Trees. Why build evolutionary tree? Outline
Page Evolutionary Trees Russ. ltman MI S 7 Outline. Why build evolutionary trees?. istance-based vs. character-based methods. istance-based: Ultrametric Trees dditive Trees. haracter-based: Perfect phylogeny
More informationBiochemistry 324 Bioinformatics. Pairwise sequence alignment
Biochemistry 324 Bioinformatics Pairwise sequence alignment How do we compare genes/proteins? When we have sequenced a genome, we try and identify the function of unknown genes by finding a similar gene
More informationExploring Evolution & Bioinformatics
Chapter 6 Exploring Evolution & Bioinformatics Jane Goodall The human sequence (red) differs from the chimpanzee sequence (blue) in only one amino acid in a protein chain of 153 residues for myoglobin
More informationCan protein model accuracy be. identified? NO! CBS, BioCentrum, Morten Nielsen, DTU
Can protein model accuracy be identified? Morten Nielsen, CBS, BioCentrum, DTU NO! Identification of Protein-model accuracy Why is it important? What is accuracy RMSD, fraction correct, Protein model correctness/quality
More informationToday s Lecture: HMMs
Today s Lecture: HMMs Definitions Examples Probability calculations WDAG Dynamic programming algorithms: Forward Viterbi Parameter estimation Viterbi training 1 Hidden Markov Models Probability models
More informationSupplementary Figure 1
Supplementary Figure 1 The correlation of n-score cutoff and FDR in both CID-only and CID-ETD fragmentation strategies. A bar diagram of different n-score thresholds applied in the search, plotted against
More information