Protein Prediction I for Computer Scientists
1 Protein Prediction I for Computer Scientists: Resources
June 8th/20th, Summer Term 2017
Burkhard Rost & Lothar Richter
2 Primary Databases
- GenBank/EBI/DDBJ: nucleic acid sequences
- UniProtKB: protein sequences
- PDB: 3D structures
3 Growth of GenBank and WGS
- doubling approx. every 18 months; diagram for release 207, Apr. 2015
- current version: release 213, Apr. 2016: bases in GenBank, bases in WGS
- taken from http://
4 Growth of GenBank and WGS
- current release 213: sequences in GenBank, sequences in WGS
- taken from http://, release 207, Apr. 2015
5 UniProt Components
- UniProtKB: UniProtKB/Swiss-Prot and UniProtKB/TrEMBL
- UniParc: pure sequence archive, no annotations
- UniRef: consists of three databases of clustered sets of protein sequences (UniRef100, UniRef90, UniRef50), built using the CD-HIT algorithm
- UniMES: data from metagenomic and environmental samples, not in UniProtKB
6 Growth over three years: 2018_4 vs. 2017_5 vs. 2016_5 vs. 2015_5
(values in parentheses: releases 2017_5, 2016_5, 2015_5, in that order)
Protein existence (PE)           Entries                                  %
1. Evidence at protein level     98,566 (95,143) (92,536) (85,419)        17.7 (17.2) (16.8) (15.6)
2. Evidence at transcript level  57,060 (57,649) (57,757) (61,814)        10.2 (10.4) (10.5) (11.3)
3. Inferred from homology        386,164 (386,111) (387,589) (387,733)    69.3 (69.6) (70.3) (70.7)
4. Predicted                     13,621 (13,751) (11,358) (11,526)        2.4 (2.5) (2.1) (2.1)
5. Uncertain                     1,864 (1,861) (1,953) (1,962)            0.3 (0.3) (0.4) (0.4)
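The percentage column can be checked directly from the entry counts. The following is a minimal sketch that takes the five 2018_4 counts from the slide and recomputes each share of the total; the variable names are illustrative and not part of the lecture material.

```python
# Entry counts per protein-existence (PE) level for release 2018_4,
# copied from the slide. Names below are illustrative only.
pe_counts = {
    "Evidence at protein level": 98_566,
    "Evidence at transcript level": 57_060,
    "Inferred from homology": 386_164,
    "Predicted": 13_621,
    "Uncertain": 1_864,
}

# Total number of entries across all five PE levels.
total = sum(pe_counts.values())

# Share of each level in percent, rounded to one decimal like the slide.
pe_percent = {level: round(100 * n / total, 1) for level, n in pe_counts.items()}

for level, pct in pe_percent.items():
    print(f"{level:30s} {pe_counts[level]:>8,d} {pct:5.1f}%")
```

The recomputed shares agree with the slide's first percentage column once the decimal comma in "17,7" is read as 17.7.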
7 Development
taken from http://web.expasy.org/docs/relnotes/relstat1.png for release 2017_5
8 More Numbers (rel. 2017_5)
- Represented species: ( in 2015_5)
- Top 20 species: 119,149 sequences, i.e. 21.5% of the total number of entries
- Table "Entries / No. of Species" (two column pairs side by side; 2015_5 values in parentheses): 1 5,570 (5,495) (228) 2 1,921 (1,899) (214) 3 1,054 (1,023) (122) (657) (711) (487) (426) (399) (213) (289) > (1.046)
9 Species Representation (rel. 2017_5 / 2015_5)
Top  Frequency        Species
1    20,201 (+3)      Homo sapiens (Human)
2    16,877 (+166)    Mus musculus (Mouse)
3    ,333 (+1,445)    Arabidopsis thaliana (Mouse-ear cress)
4    7,989 (+68)      Rattus norvegicus (Rat)
5    6,721 (+3)       Saccharomyces cerevisiae (Baker's yeast)
6    5,999 (+6)       Bos taurus (Bovine)
7    5,141 (+38)      Schizosaccharomyces pombe (Fission yeast)
8    4,435 (+2)       Escherichia coli K12
9    4,185 (+0)       Bacillus subtilis
10   4,134 (+3)       Dictyostelium discoideum (Slime mold)
10 Length Distribution (rel. 2017_5)
11 Annotation Phases
1. Sequence curation
2. Sequence analysis
3. Literature curation
4. Family-based curation
5. Evidence attribution
6. Quality assurance, integration and update
12 taken from Figure 1. UniProtKB sequence analysis results displayed in graphical interface
13 Slides by Dr. Schafferhans
X-ray Crystallography
- Workflow: crystallize, diffract, build model
- Measurement shows electron density
- Not visible: H (hydrogen atoms), flexible parts
- Ambiguity (e.g. NH2 / OH)
Images: Thomas Splettstoesser, own work, rendered with PyMOL based on PDB ID 1MBO, CC BY-SA 3.0; Public Domain; Hydrargyrum, own work, CC BY-SA 3.0; Jeff Dahl, own work, CC BY-SA 3.0; Bassophile, self-made using own data and PyMOL, CC BY 2.5
14 NMR (Nuclear Magnetic Resonance)
- Spinning nucleus absorbs radiation
- Exact frequency depends on the environment: chemical shift
- J-coupling: split
- Proteins are too crowded for normal NMR!
Image: T.vanschaik, own work, CC BY-SA 3.0
15 Protein NMR
- Sample preparation (<1 ml, <3 mmol/l); 15N- and 13C-labeled samples help assign residues
- Data collection (2D spectra)
- Resonance assignment
- Restraint generation: distance (peak intensity), angle (coupling magnitude)
- Model building: ensemble of solutions
- No standard quality measures
Images: User:Nonoelmo, English Wikipedia, CC BY-SA 3.0
16 Electron Microscopy
- Sample preparation (mostly cryo)
- Measurement: electron diffraction, electron tomography
- Model building: reconstructing electron density, fitting atoms (e.g. from X-ray)
- No standard quality measures
Images: Claudionico~commonswiki, own work, CC BY-SA 4.0
17 Images: D Wells, own work, CC BY-SA 4.0
18 Protein Data Bank History
- 1968: Brookhaven RAster Display (BRAD)
- 1969: Edgar Meyer (Brookhaven National Laboratory): file format for atomic coordinates
- 1971: Symposium "Structure and Function of Proteins at the Three-Dimensional Level"
- 1971: remote access with SEARCH program (Meyer)
- 1998: transfer to RCSB (Research Collaboratory for Structural Bioinformatics)
- 2003: formation of wwPDB (PDBe, RCSB, PDBj, BMRB)
19 Brookhaven RAster Display screenshot
20 Current composition (2017/05)
Table of PDB entries by experimental method (X-ray diffraction, NMR, electron microscopy, hybrid, other, total) and molecule type (proteins, nucleic acids, protein / nucleic acid complexes, other, total); the counts are missing from this transcript.
Unique chains: 95,987
21 File formats
- Traditional: PDB
- Latest (last) PDB file format version: Contents Guide Version 3.30 (Nov. 21, 2012), 190 pages!
- Problem: large structures
- New standard (2014): PDBx/mmCIF, see mmcif.wwpdb.org
22 PDB file overview
- Header: protein information, citation, details of structure resolution
- Coordinates / connectivity
23 PDB file format (general)
- allowed characters: abcdefghijklmnopqrstuvwxyz ABCDEFGHIJKLMNOPQRSTUVWXYZ, digits, space and ASCII punctuation such as : " < > ?
- each line is 80 characters wide including EOL
- lines are self-identifying: col 1-6: record name, col 7: blank
- records appear in a defined order
- some records are mandatory
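The layout rules above translate into a very small amount of code. This is a minimal sketch, assuming that "80 characters" means 80 columns before the line terminator and that "allowed characters" means printable ASCII; both function names and the sample record are hypothetical.

```python
def record_name(line: str) -> str:
    """Return the record name stored in columns 1-6 of a PDB line."""
    return line[:6].strip()


def is_well_formed(line: str) -> bool:
    """Check two layout rules: 80 columns and printable ASCII only.

    Assumption: the line terminator is not counted among the 80 columns.
    """
    body = line.rstrip("\r\n")
    return len(body) == 80 and all(32 <= ord(ch) < 127 for ch in body)


# Hypothetical record, padded with spaces to 80 columns for illustration.
example = "HEADER    OXYGEN STORAGE".ljust(80)
print(record_name(example), is_well_formed(example))
```

Because every line names its own record type in the first six columns, a reader can dispatch on `record_name` without tracking any state from earlier lines.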
24 Record Format (ATOM)
The atom and residue nomenclature is described in the Chemical Component Dictionary (ftp://ftp.wwpdb.org/pub/pdb/data/monomers).
COLUMNS   DATA TYPE      FIELD        DEFINITION
 1 -  6   Record name    "ATOM  "
 7 - 11   Integer        serial       Atom serial number.
13 - 16   Atom           name         Atom name.
17        Character      altLoc       Alternate location indicator.
18 - 20   Residue name   resName      Residue name.
22        Character      chainID      Chain identifier.
23 - 26   Integer        resSeq       Residue sequence number.
27        AChar          iCode        Code for insertion of residues.
31 - 38   Real(8.3)      x            Orthogonal coordinates for X in Angstroms.
39 - 46   Real(8.3)      y            Orthogonal coordinates for Y in Angstroms.
47 - 54   Real(8.3)      z            Orthogonal coordinates for Z in Angstroms.
55 - 60   Real(6.2)      occupancy    Occupancy.
61 - 66   Real(6.2)      tempFactor   Temperature factor.
77 - 78   LString(2)     element      Element symbol, right-justified.
79 - 80   LString(2)     charge       Charge on the atom.
PP1CS Details SoSe 18
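Because the ATOM record is defined by fixed column ranges, it is read by slicing rather than by splitting on whitespace. The sketch below follows the column layout of the PDB format guide (1-based columns become 0-based Python slices); the `Atom` class, the `parse_atom` function and all values in the sample line are hypothetical illustrations, not data from the lecture.

```python
from dataclasses import dataclass


@dataclass
class Atom:
    serial: int
    name: str
    alt_loc: str
    res_name: str
    chain_id: str
    res_seq: int
    i_code: str
    x: float
    y: float
    z: float
    occupancy: float
    temp_factor: float
    element: str
    charge: str


def parse_atom(line: str) -> Atom:
    """Parse one ATOM/HETATM record by its fixed column ranges."""
    if line[0:6] not in ("ATOM  ", "HETATM"):
        raise ValueError("not an ATOM/HETATM record")
    return Atom(
        serial=int(line[6:11]),          # columns 7-11
        name=line[12:16].strip(),        # columns 13-16
        alt_loc=line[16].strip(),        # column 17
        res_name=line[17:20].strip(),    # columns 18-20
        chain_id=line[21].strip(),       # column 22
        res_seq=int(line[22:26]),        # columns 23-26
        i_code=line[26].strip(),         # column 27
        x=float(line[30:38]),            # columns 31-38
        y=float(line[38:46]),            # columns 39-46
        z=float(line[46:54]),            # columns 47-54
        occupancy=float(line[54:60]),    # columns 55-60
        temp_factor=float(line[60:66]),  # columns 61-66
        element=line[76:78].strip(),     # columns 77-78
        charge=line[78:80].strip(),      # columns 79-80
    )


# Hypothetical sample record, assembled field by field so that every
# value lands in the right columns (all numbers are made up).
sample = (
    "ATOM  "                                        # 1-6
    + f"{601:>5d}" + " "                            # 7-11, 12 blank
    + f"{' N':<4s}" + " "                           # 13-16, 17 altLoc
    + "LEU" + " " + "A"                             # 18-20, 21 blank, 22
    + f"{75:>4d}" + " " + "   "                     # 23-26, 27, 28-30 blank
    + f"{11.281:8.3f}{86.699:8.3f}{94.383:8.3f}"    # 31-54
    + f"{1.00:6.2f}{35.88:6.2f}"                    # 55-66
    + " " * 10                                      # 67-76 blank
    + f"{'N':>2s}" + "  "                           # 77-78, 79-80
)

atom = parse_atom(sample)
print(atom)
```

Slicing is preferred over `str.split` here because adjacent fields may touch without a separating space, for example a four-character atom name followed by an alternate location indicator, or large coordinates that fill their eight columns.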
25 Example for Coordinates (most residue numbers and all coordinate columns are missing from this transcript)
ATOM    601  N    LEU A  N
ATOM    602  CA   LEU A  C
ATOM    603  C    LEU A  C
ATOM    604  O    LEU A  O
ATOM    605  CB   LEU A  C
TER     606       LEU A 75
ATOM   1185  O    LEU B  O
ATOM   1186  CB   LEU B  C
TER    1187       LEU B 75
HETATM 1188  H2   SRT A  H
HETATM 1189  HA   SRT A  H
HETATM 1190  H3   SRT A  H
HETATM 1191  HB   SRT A  H
HETATM 1192  O1   SRT A  O
ATOM    295  HB2  ALA A  H
ATOM    296  HB3  ALA A  H
TER     297       ALA A 18
ENDMDL
26 PDBx/mmCIF overview
- Standard format for PDB since 2014
- Originated in the crystallographic community
- Reference: Sydney R. Hall, "The STAR file: a new format for electronic data transfer and archiving", Journal of Chemical Information and Computer Sciences (2)
- Advantages of the new format: extensible; flexible with respect to order; few syntax rules; facilitates automatic validation (with the mmCIF dictionary)
27 STAR (PDBx/mmCIF) syntax
- Data name: any text string starting with an underscore (unique), e.g. _chemical_formula
- Data item: any text string not starting with an underscore, preceded by a data name, e.g. C23 H36 O7
- Data loop: list of data names, preceded by loop_, followed by repeated data items:
    loop_
    _citation_author.citation_id
    _citation_author.ordinal
    _citation_author.name
    primary 1 'Fitzgerald, P.M.D.'
    primary 2 'McKeever, B.M.'
    primary 3 'Van Middlesworth, J.F.'
- Data block: collection of data, preceded by data_xxx
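The loop construct maps naturally onto a list of rows keyed by data name. The following is a deliberately simplified sketch, not a full STAR or mmCIF parser: it handles a single loop_ block, uses Python's `shlex.split` as a stand-in for STAR quoting, and ignores multi-line text fields and comments. The function name `parse_loop` is hypothetical.

```python
import shlex

STAR_TEXT = """\
loop_
_citation_author.citation_id
_citation_author.ordinal
_citation_author.name
primary 1 'Fitzgerald, P.M.D.'
primary 2 'McKeever, B.M.'
primary 3 'Van Middlesworth, J.F.'
"""


def parse_loop(text: str) -> list[dict[str, str]]:
    """Parse one loop_ block into a list of {data name: data item} rows."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if not lines or lines[0] != "loop_":
        raise ValueError("expected a loop_ block")

    names: list[str] = []
    values: list[str] = []
    for ln in lines[1:]:
        if ln.startswith("_") and not values:
            names.append(ln)                # header: one data name per line
        else:
            values.extend(shlex.split(ln))  # body: whitespace-separated items

    if not names or len(values) % len(names) != 0:
        raise ValueError("item count is not a multiple of the name count")

    width = len(names)
    return [
        dict(zip(names, values[i:i + width]))
        for i in range(0, len(values), width)
    ]


rows = parse_loop(STAR_TEXT)
for row in rows:
    print(row["_citation_author.ordinal"], row["_citation_author.name"])
```

Rows are formed purely by counting items against the number of data names, which is why the format is insensitive to line breaks inside the loop body. For real files, an established mmCIF reader is the safer choice.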
28 Concepts in mmCIF
- Entity: polymer / non-polymer / water
- Chemical component: blocks that build entities (e.g. non-standard residue)
- Structural component: structural features, e.g. helix
- Asymmetric unit component: (chain); two components can refer to the same entity
- Biological component: sub- and super-components of the structure
29 Chirality (~Handedness)
30 Amino Acids from wiki/amino_acid Schema of an α-amino acid
31 Amino Acids
32 Amino Acids
33 Amino Acids Essential for humans: phenylalanine, valine, threonine, tryptophan, methionine, leucine, isoleucine, lysine, and histidine
34 Protein Sequence / Primary Structure
- linear sequence of amino acids
- oriented from N- to C-terminus
- typically starts with methionine
IMPORTANT: consider the different meanings
Coding/Representation                                Protein Aspects
1D-information: sequence of amino acids as string    Primary structure: amino acid sequence
2D-information: 2D-array, contact map                Secondary structure: secondary structure elements like helices or sheets, ...
3D-information: coordinates or atom couplings        Tertiary structure: spatial arrangement of secondary structure elements (incl. amino acids, atoms, ...)
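The step from 3D-information to 2D-information can be made concrete with a contact map: a square 0/1 array that marks which residue pairs lie close together in space. This is a minimal sketch, assuming one representative point per residue and a distance cutoff of 8 Å; the cutoff, the function name and the toy coordinates are assumptions, not values from the lecture.

```python
import math


def contact_map(coords, cutoff=8.0):
    """Return an n x n matrix with 1 where two points are within `cutoff`."""
    n = len(coords)
    return [
        [1 if math.dist(coords[i], coords[j]) <= cutoff else 0 for j in range(n)]
        for i in range(n)
    ]


# Toy coordinates: four points on a straight line, 3.8 apart.
toy_coords = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]

cmap = contact_map(toy_coords)
for row in cmap:
    print(row)
```

The matrix is symmetric with ones on the diagonal. Unlike raw coordinates, it does not change when the structure is rotated or translated, which is one reason such 2D encodings are convenient as prediction targets.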
35 Secondary Structure local structural elements structural building blocks for higher order structures α-helix, β-sheet, loops stabilized by hydrogen bonds amino acids have preferences for certain secondary structure elements
36 Tertiary Structure
- spatial arrangement of all secondary structure elements of a protein
- alternative arrangements can exist (conformation changes upon substrate binding, or induced fit)
- can be used to organize known protein structures hierarchically
37 Quaternary Structure
- formation of multi-protein complexes
- many cellular processes are carried out by multi-protein complexes, especially highly coordinated/regulated actions like replication, transcription and translation
- difficult to determine precisely, sometimes already visible in EM
38 Alpha Helix
- 3.6 amino acids per turn, forming a spiral
- 4-40 residues (mostly about 10)
- stabilized by hydrogen bonds between backbone atoms (C=O of residue i and N-H of residue i+4)
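The figure of 3.6 residues per turn fixes the rotation per residue at 360 / 3.6 = 100 degrees. The sketch below places idealized C-alpha positions on such a helix. Only the 3.6 comes from the slide; the rise of 1.5 Å per residue and the radius of 2.3 Å are commonly quoted textbook approximations and are assumptions here, as is the function name.

```python
import math

RESIDUES_PER_TURN = 3.6   # from the slide
RISE_PER_RESIDUE = 1.5    # Angstrom along the axis (assumed textbook value)
RADIUS = 2.3              # Angstrom, approximate C-alpha radius (assumed)


def ideal_helix_ca(n_residues):
    """Return idealized C-alpha coordinates for a helix along the z axis."""
    step = 2 * math.pi / RESIDUES_PER_TURN  # rotation per residue in radians
    return [
        (RADIUS * math.cos(i * step), RADIUS * math.sin(i * step), i * RISE_PER_RESIDUE)
        for i in range(n_residues)
    ]


angle_per_residue = 360 / RESIDUES_PER_TURN     # degrees
pitch = RESIDUES_PER_TURN * RISE_PER_RESIDUE    # Angstrom per full turn
helix = ideal_helix_ca(10)
neighbour_distance = math.dist(helix[0], helix[1])

print(angle_per_residue, pitch, round(neighbour_distance, 2))
```

Under these assumptions the pitch comes out at 5.4 Å per turn, and consecutive C-alpha positions sit roughly 3.8 Å apart.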
39 Beta Strand/Sheet
- long-range (in terms of involved residues) hydrogen bonds
- parallel or antiparallel
- flat
40 Loop/Turn/Coil
- generally: change of direction: alpha (4), beta (3), gamma (2), delta (1), pi (5)
- omega-loop: catch-all term, includes longer stretches, no hydrogen bonding involved
- connector between better-defined secondary structure elements, or at the end of a polypeptide chain
- not distinguished in the 3-state model
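In a 3-state model every residue is labeled helix (H), strand (E) or other (C), so loops, turns and coil all collapse into one class. The sketch below shows one common reduction from the eight DSSP-style state letters to three states. The slide does not say which reduction the lecture uses, so the mapping, the function name and the example string are assumptions.

```python
# One common 8-to-3 state reduction (assumption; other conventions exist):
# helices H, G, I -> H; strand E and bridge B -> E; everything else -> C.
EIGHT_TO_THREE = {"H": "H", "G": "H", "I": "H", "E": "E", "B": "E"}


def to_three_state(states: str) -> str:
    """Map a string of 8-state letters to the 3-state alphabet H/E/C."""
    return "".join(EIGHT_TO_THREE.get(s, "C") for s in states)


# Hypothetical 8-state assignment: T = turn, S = bend, - = no assignment.
eight_state = "-HHHHTT-EEEES-GGG"
three_state = to_three_state(eight_state)
print(three_state)
```

Turn (T), bend (S) and unassigned residues all end up as C, which is exactly the loss of distinction the last bullet refers to.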
41 Random Coil / Disordered Region
- no clear secondary structure elements identifiable
- behaves like a statistical distribution of shapes
- biologically: can be used as an adapter to different target shapes, i.e. one conformation is stabilized upon interaction with a partner
42 Protein Features
- surface area
- hydrophobicity
- size
- isoelectric point
- amino acid composition
- various functional or structural motifs
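Several of these features can be computed from the sequence alone. The following sketch derives size (length), amino acid composition and an average hydrophobicity using the Kyte-Doolittle hydropathy scale. The choice of that scale is an assumption, since the slide does not name one, and the function names and example sequence are hypothetical.

```python
from collections import Counter

# Kyte-Doolittle hydropathy values per residue (chosen here as an example scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def composition(seq: str) -> dict[str, float]:
    """Return the fraction of each amino acid that occurs in the sequence."""
    counts = Counter(seq)
    return {aa: n / len(seq) for aa, n in counts.items()}


def mean_hydropathy(seq: str) -> float:
    """Return the average Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Hypothetical ten-residue sequence for illustration.
seq = "MKTAYIAKQR"
features = {
    "size": len(seq),
    "composition": composition(seq),
    "hydropathy": mean_hydropathy(seq),
}
print(features)
```

A negative mean hydropathy indicates a sequence that is hydrophilic on average. Features such as surface area require a 3D structure and cannot be obtained this way.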
PDBe TUTORIAL PDBePISA (Protein Interfaces, Surfaces and Assemblies) http://pdbe.org/pisa/ This tutorial introduces the PDBePISA (PISA for short) service, which is a webbased interactive tool offered by
More informationAutomated Assignment of Backbone NMR Data using Artificial Intelligence
Automated Assignment of Backbone NMR Data using Artificial Intelligence John Emmons στ, Steven Johnson τ, Timothy Urness*, and Adina Kilpatrick* Department of Computer Science and Mathematics Department
More information1. What is an ångstrom unit, and why is it used to describe molecular structures?
1. What is an ångstrom unit, and why is it used to describe molecular structures? The ångstrom unit is a unit of distance suitable for measuring atomic scale objects. 1 ångstrom (Å) = 1 10-10 m. The diameter
More informationCHEM 463: Advanced Inorganic Chemistry Modeling Metalloproteins for Structural Analysis
CHEM 463: Advanced Inorganic Chemistry Modeling Metalloproteins for Structural Analysis Purpose: The purpose of this laboratory is to introduce some of the basic visualization and modeling tools for viewing
More informationLecture 14 - Cells. Astronomy Winter Lecture 14 Cells: The Building Blocks of Life
Lecture 14 Cells: The Building Blocks of Life Astronomy 141 Winter 2012 This lecture describes Cells, the basic structural units of all life on Earth. Basic components of cells: carbohydrates, lipids,
More informationProtein structure (and biomolecular structure more generally) CS/CME/BioE/Biophys/BMI 279 Sept. 28 and Oct. 3, 2017 Ron Dror
Protein structure (and biomolecular structure more generally) CS/CME/BioE/Biophys/BMI 279 Sept. 28 and Oct. 3, 2017 Ron Dror Please interrupt if you have questions, and especially if you re confused! Assignment
More informationModeling Biological Systems Opportunities for Computer Scientists
Modeling Biological Systems Opportunities for Computer Scientists Filip Jagodzinski RBO Tutorial Series 25 June 2007 Computer Science Robotics & Biology Laboratory Protein: πρώτα, "prota, of Primary Importance
More informationSyllabus of BIOINF 528 (2017 Fall, Bioinformatics Program)
Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program) Course Name: Structural Bioinformatics Course Description: Instructor: This course introduces fundamental concepts and methods for structural
More informationFull wwpdb X-ray Structure Validation Report i
Full wwpdb X-ray Structure Validation Report i Feb 17, 2018 01:16 am GMT PDB ID : 1IFT Title : RICIN A-CHAIN (RECOMBINANT) Authors : Weston, S.A.; Tucker, A.D.; Thatcher, D.R.; Derbyshire, D.J.; Pauptit,
More informationSequence comparison: Score matrices. Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas
Sequence comparison: Score matrices Genome 559: Introduction to Statistical and omputational Genomics Prof James H Thomas Informal inductive proof of best alignment path onsider the last step in the best
More informationAlpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University
Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University Department of Chemical Engineering Program of Applied and
More informationLet s continue our discussion on the interaction between Fe(III) and 6,7-dihydroxynaphthalene-2- sulfonate.
Chemistry 5995(133)-8990(013) Bioinorganic Chemistry: The Good, the Bad, and the Potential of Metals Assignment 2- Aqueous Speciation, Magnetism, Redox, UV-Vis Spectroscopy, and Pymol Let s continue our
More informationTimescales of Protein Dynamics
Timescales of Protein Dynamics From Henzler-Wildman and Kern, Nature 2007 Summary of 1D Experiment time domain data Fourier Transform (FT) frequency domain data or Transverse Relaxation Ensemble of Nuclear
More informationProcedure to Create NCBI KOGS
Procedure to Create NCBI KOGS full details in: Tatusov et al (2003) BMC Bioinformatics 4:41. 1. Detect and mask typical repetitive domains Reason: masking prevents spurious lumping of non-orthologs based
More information8 Protein secondary structure
Grundlagen der Bioinformatik, SoSe 11, D. Huson, June 6, 211 13 8 Protein secondary structure Sources for this chapter, which are all recommended reading: Introduction to Protein Structure, Branden & Tooze,
More informationLecture 26: Polymers: DNA Packing and Protein folding 26.1 Problem Set 4 due today. Reading for Lectures 22 24: PKT Chapter 8 [ ].
Lecture 26: Polymers: DA Packing and Protein folding 26.1 Problem Set 4 due today. eading for Lectures 22 24: PKT hapter 8 DA Packing for Eukaryotes: The packing problem for the larger eukaryotic genomes
More informationRNA and Protein Structure Prediction
RNA and Protein Structure Prediction Bioinformatics: Issues and Algorithms CSE 308-408 Spring 2007 Lecture 18-1- Outline Multi-Dimensional Nature of Life RNA Secondary Structure Prediction Protein Structure
More informationSummary of Experimental Protein Structure Determination. Key Elements
Programme 8.00-8.20 Summary of last week s lecture and quiz 8.20-9.00 Structure validation 9.00-9.15 Break 9.15-11.00 Exercise: Structure validation tutorial 11.00-11.10 Break 11.10-11.40 Summary & discussion
More informationTimescales of Protein Dynamics
Timescales of Protein Dynamics From Henzler-Wildman and Kern, Nature 2007 Dynamics from NMR Show spies Amide Nitrogen Spies Report On Conformational Dynamics Amide Hydrogen Transverse Relaxation Ensemble
More information