Bioinformatics Practical for Biochemists

Similar documents
Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

Packing of Secondary Structures

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

1-D Predictions. Prediction of local features: Secondary structure & surface exposure

Major Types of Association of Proteins with Cell Membranes. From Alberts et al

Any protein that can be labelled by both procedures must be a transmembrane protein.

Bahnson Biochemistry Cume, April 8, 2006 The Structural Biology of Signal Transduction

Model Mélange. Physical Models of Peptides and Proteins

Physiochemical Properties of Residues

Properties of amino acids in proteins

Peptides And Proteins

Translation. A ribosome, mrna, and trna.

Protein Structures: Experiments and Modeling. Patrice Koehl

Chapter 12: Intracellular sorting

What makes a good graphene-binding peptide? Adsorption of amino acids and peptides at aqueous graphene interfaces: Electronic Supplementary

Exam I Answer Key: Summer 2006, Semester C

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods

Intro Secondary structure Transmembrane proteins Function End. Last time. Domains Hidden Markov Models

BIRKBECK COLLEGE (University of London)

Today. Last time. Secondary structure Transmembrane proteins. Domains Hidden Markov Models. Structure prediction. Secondary structure

PROTEIN ORIGAMI - A PROGRAM FOR THE CREATION OF 3D PAPER MODELS OF FOLDED PEPTIDES

Proteins: Characteristics and Properties of Amino Acids

Other Methods for Generating Ions 1. MALDI matrix assisted laser desorption ionization MS 2. Spray ionization techniques 3. Fast atom bombardment 4.

Review. Membrane proteins. Membrane transport

Energy and Cellular Metabolism

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

Cellular Transport. 1. Transport to and across the membrane 1a. Transport of small molecules and ions 1b. Transport of proteins

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version Document Published by the wwpdb

Introduction to the Ribosome Overview of protein synthesis on the ribosome Prof. Anders Liljas

Read more about Pauling and more scientists at: Profiles in Science, The National Library of Medicine, profiles.nlm.nih.gov

7.012 Problem Set 1 Solutions

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1

Details of Protein Structure

Protein Structure Bioinformatics Introduction

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination

Chapter 4: Amino Acids

7.012 Problem Set 1. i) What are two main differences between prokaryotic cells and eukaryotic cells?

PROTEIN SECONDARY STRUCTURE PREDICTION: AN APPLICATION OF CHOU-FASMAN ALGORITHM IN A HYPOTHETICAL PROTEIN OF SARS VIRUS

Advanced Topics in RNA and DNA. DNA Microarrays Aptamers

Supplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution

Basic Principles of Protein Structures

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

B O C 4 H 2 O O. NOTE: The reaction proceeds with a carbonium ion stabilized on the C 1 of sugar A.

UNIT TWELVE. a, I _,o "' I I I. I I.P. l'o. H-c-c. I ~o I ~ I / H HI oh H...- I II I II 'oh. HO\HO~ I "-oh

Protein Structure. Role of (bio)informatics in drug discovery. Bioinformatics

Transport between cytosol and nucleus

SCOP. all-β class. all-α class, 3 different folds. T4 endonuclease V. 4-helical cytokines. Globin-like

Objective: Students will be able identify peptide bonds in proteins and describe the overall reaction between amino acids that create peptide bonds.

GCD3033:Cell Biology. Transcription

Amino Acid Side Chain Induced Selectivity in the Hydrolysis of Peptides Catalyzed by a Zr(IV)-Substituted Wells-Dawson Type Polyoxometalate

Supplementary information

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

Amino Acids and Peptides

BIOINF 4120 Bioinformatics 2 - Structures and Systems - Oliver Kohlbacher Summer Protein Structure Prediction I

Protein structure alignments

Chemistry Chapter 22

The Structure and Functions of Proteins

Keywords: intrinsic disorder, IDP, protein, protein structure, PTM

Steps in protein modelling. Structure prediction, fold recognition and homology modelling. Basic principles of protein structure

Desorption/Ionization Efficiency of Common Amino Acids in. Surface-assisted Laser Desorption/ionization Mass Spectrometry

Clustering and Model Integration under the Wasserstein Metric. Jia Li Department of Statistics Penn State University

Biological Macromolecules

1. What is an ångstrom unit, and why is it used to describe molecular structures?

Protein Structure. Hierarchy of Protein Structure. Tertiary structure. independently stable structural unit. includes disulfide bonds

Computer simulations of protein folding with a small number of distance restraints

Ramachandran Plot. 4ysz Phi (degrees) Plot statistics

Protein Structure Basics

Problem Set 1

Resonance assignments in proteins. Christina Redfield

Oxygen Binding in Hemocyanin

Solutions In each case, the chirality center has the R configuration

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Central Dogma. modifications genome transcriptome proteome

Heteropolymer. Mostly in regular secondary structure

SUPPLEMENTARY MATERIALS

Nuclear targeting by Nuclear Localization Signals (NLS) Richardson and Laskey (1988)

HSQC spectra for three proteins

Membrane Protein Channels

From Gene to Protein

Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur. Lecture - 06 Protein Structure IV

Protein Fragment Search Program ver Overview: Contents:

12/6/12. Dr. Sanjeeva Srivastava IIT Bombay. Primary Structure. Secondary Structure. Tertiary Structure. Quaternary Structure.

4 Proteins: Structure, Function, Folding W. H. Freeman and Company

Biochemistry Quiz Review 1I. 1. Of the 20 standard amino acids, only is not optically active. The reason is that its side chain.

Advanced Certificate in Principles in Protein Structure. You will be given a start time with your exam instructions

Peptide Syntheses. Illustrative Protection: BOC/ t Bu. A. Introduction. do not acid

Yeast ORFan Gene Project: Module 5 Guide

C H E M I S T R Y N A T I O N A L Q U A L I F Y I N G E X A M I N A T I O N SOLUTIONS GUIDE

Amino Acids and Proteins at ZnO-water Interfaces in Molecular Dynamics Simulations: Electronic Supplementary Information

Final Chem 4511/6501 Spring 2011 May 5, 2011 b Name

Protein Secondary Structure Prediction using Feed-Forward Neural Network

Table 1. Crystallographic data collection, phasing and refinement statistics. Native Hg soaked Mn soaked 1 Mn soaked 2

Supplementary Information Intrinsic Localized Modes in Proteins

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell

THE UNIVERSITY OF MANITOBA. PAPER NO: _1_ LOCATION: 173 Robert Schultz Theatre PAGE NO: 1 of 5 DEPARTMENT & COURSE NO: CHEM / MBIO 2770 TIME: 1 HOUR

Structure of the SPRY domain of human DDX1 helicase, a putative interaction platform within a DEAD-box protein

Can protein model accuracy be. identified? NO! CBS, BioCentrum, Morten Nielsen, DTU

Exam III. Please read through each question carefully, and make sure you provide all of the requested information.

Supersecondary Structures (structural motifs)

Transcription:

Bioinformatics Practical for Biochemists Andrei Lupas, Birte Höcker, Steffen Schmidt WS 2013/14 03. Sequence Features

Targeting proteins signal peptide targets proteins to the secretory pathway N-terminal sequence recognized while peptide is still synthesized on the ribosome Günter Blobel, 1999, nobelprize.org

Signal Peptide Prediction Sequence Logo of eukaryotic signal peptides Nielsen et al. (2007)

Signal Peptide Prediction - SignalP http://www.cbs.dtu.dk/services/signalp/ http://www.cbs.dtu.dk/cgi-bin/webface2.fcgi?jobid=53329bb900004a2504297a29&wait=20

Transmembrane Helices unusually long stretch of hydrophobic residues >18 hydrophobic amino acids hydrophobic interaction with lipids in membrane orientation of helix / topology of the protein looking at the loops : R & K mainly found on cytoplasmic side positive inside rule TGRPEWIWLALGTALMGLGTLYFLVKGMGVSDPDAKKFYAITTLVPAIAFTMYLSMLLGYGL N C PDB-id: 1c3w

Transmembrane Helices TMHMM http://www.cbs.dtu.dk/services/tmhmm/ Accuracy of predicting TM helices high > 90% Accuracy of predicting the topology prediction > 75% Sonnhammer et al. (1998)

Transmembrane Helices TMHMM http://www.cbs.dtu.dk/services/tmhmm/ http://www.cbs.dtu.dk/cgi-bin/webface2.fcgi?jobid=5332a5e70000681f58ebac3b&wait=20

Secondary Structure amino acid preferences -helix" β-strand" β-turn " Glu" 1.59" 0.52" 1.01" Ala" 1.41" 0.72" 0.92" Leu" 1.34" 1.22" 0.57" Met" 1.3" 1.14" 0.52" Gln" 1.27" 0.98" 0.84" Lys" 1.23" 0.69" 1.07" Arg" 1.21" 0.84" 0.9" His" 1.05" 0.8" 0.81" Val" 0.9" 1.87" 0.41" Ile" 1.09" 1.67" 0.47" Tyr" 0.74" 1.45" 0.76" Cys" 0.66" 1.4" 0.54" Trp" 1.02" 1.35" 0.65" Phe" 1.16" 1.33" 0.59" Thr" 0.76" 1.17" 0.9" Gly" 0.43" 0.58" 1.77" Asn" 0.76" 0.48" 1.34" Pro" 0.34" 0.31" 1.32" Ser" 0.57" 0.96" 1.22" Asp" 0.99" 0.39" 1.24" William (1987) Biochim Biophys Acta

Secondary Structure buried ß-sheet PDB: 1kgs, Buckler et al. (2002), Structure

Secondary Structure amphiphilic partially buried -helix PDB: 1kgs, Buckler et al. (2002), Structure

Secondary Structure amphiphilic ß-strand PDB: 1jat, VanDenmark et al. (2001), Cell

Secondary Structure collagen

Secondary Structure Prediction Toolkit Quick2D

Secondary Structure Prediction Toolkit Ali2D

Disordered Regions Today, programs predict that about 40% of all human proteins contain at least one intrinsically disordered segment of 30 amino acids or more, and that some 25% are likely to be disordered from beginning to end. lack of hydrophobic residues often with overrepresentation of a few amino acids http://www.nature.com/news/2011/110309/full/471151a/box/1.html

Secondary Structure Prediction Toolkit - Ali2D

Disorder Prediction IUPRED - http://iupred.enzim.hu/

Short Linear Motifs Eukaryotic Linear Motifs (ELM) / Short Linear Motifs (SLiM) Hunt T (1990) These motifs are linear, in the sense that three- dimensional organization is not required to bring distant segments of the molecule together to make the recognizable unit. The conservation of these motifs varies: some are highly conserved while others allow substitutions that retain only a certain pattern of charge across the motif.

Short Linear Motifs Characteristics 3-11 amino acids long poorly conserved / evolve fast 1-3 amino acids in the motifs are hot spots ~ 80% in disordered regions relatively low affinity to interacting partner (1-150µM) interaction via induced fit

Short Linear Motifs Function protein-protein interactions post-translational modifications e.g. Phosphorylation proteolytic cleavage/processing sites KEN / D box in cell cycle - degradation signals subcellular targeting sites NES - nuclear export signal modulation of interactions - fine tuning

Short Linear Motifs Nuclear Localization Signal (NLS) Impor'n- beta (1qgk; blue) recognizes nuclear pores and moves through them. It wraps around the end of importin-alpha (1ee5; green), an adaptor molecule that connects importin-beta with the cargo, here nucleoplasmin(1k5j; yellow), a chaperone important in nucleosome assembly. All interactions are mediated by linear motifs in unstructured segments (bipartite nuclear localization signals). David Goodsell, http://www.rcsb.org/pdb/101/motm.do?momid=85

ELM Resources elm.eu.org NUPL_XENLA

NLS in nucleoplasmin Quick 2D secondary structure prediction for nucleoplasmin, showing the unstructured C-terminal tail and the bipartite nuclear localization motif 50 100!! MASTVSNTSKLEKPVSLIWGCELNEQNKTFEFKVEDDEEKCEHQLALRTVCLGDKAKDEFHIVEIVTQEEGAEKSVPIATLKPSILPMATMVGIELTPPVTFRLKAGSG! SS PSIPRED EEEEEEEE EEEE EEEEEEEEE EEEEEE EEEEEEEE EEE EE EEEEEE! SS JNET EEEEEEE EEE HHHHHHHHHHHH EEEEEEEE EEEEEE EEEEEE! DO DISOPRED2 DDDDDDDDDDDDDDDDD! DO IUPRED DDD D DDDD DDD D D DDD DDDDDDD DDDD D DDDD DDDD! SO Prof (Rost) B B B BBBBB B B B B B BBB BBB BBBB BB B BB B BB BB BBBB B B B B! SO JNET B BBBBBBBB B B B B B BBBBBBBBBBB B BBBBBBB B BBBBBB B B BBBB B B B BBB B B!! 150!! PLYISGQHVAMEEDYSWAEEEDEGEAEGEEEEEEEEDQESPPKAVKRPAATKKAGQAKKKKLDKEDESSEEDSPTKKGKGAGRGRKPAAKK! SS PSIPRED EEEEEEEEEE HH! SS JNET EEE E! DO DISOPRED2 DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD! DO IUPRED DDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDDD! SO Prof (Rost) BBB BBB B B B B B! SO JNET BBBBBBBBBB!! SS = Alpha-Helix Beta-Sheet Secondary Structure! DO = Disorder! SO = Solvent accessibility (A burried residue has at most 25% of its surface exposed to the solvent.)!

DDX6 & Scd6 / EDC3 Interaction EDC3 LSM FDF FDK YjeF DDX6/Me31B DEAD-box helicase Scd6/Tral LSM FDF

DDX6 & Scd6 / EDC3 Interaction

DDX6 & Scd6 / EDC3 Interaction PDB: 2wax, Tritschler et al. (2009), Mol Cell