This is an author produced version of Privateer: : software for the conformational validation of carbohydrate structures.

Similar documents
Manipulating Ligands Using Coot. Paul Emsley May 2013

Modelling Macromolecules with Coot

Garib N Murshudov MRC-LMB, Cambridge

1.b What are current best practices for selecting an initial target ligand atomic model(s) for structure refinement from X-ray diffraction data?!

Electronic Supplementary Information (ESI) for Chem. Commun. Unveiling the three- dimensional structure of the green pigment of nitrite- cured meat

MR model selection, preparation and assessing the solution

Pipelining Ligands in PHENIX: elbow and REEL

Full wwpdb X-ray Structure Validation Report i

The structure of Aquifex aeolicus FtsH in the ADP-bound state reveals a C2-symmetric hexamer

CSD. Unlock value from crystal structure information in the CSD

Full wwpdb X-ray Structure Validation Report i

wwpdb X-ray Structure Validation Summary Report

Catalytic Mechanism of the Glycyl Radical Enzyme 4-Hydroxyphenylacetate Decarboxylase from Continuum Electrostatic and QC/MM Calculations

Acta Crystallographica Section F

Dictionary of ligands

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Rietveld Structure Refinement of Protein Powder Diffraction Data using GSAS

Direct Method. Very few protein diffraction data meet the 2nd condition

Full wwpdb NMR Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Exploring symmetry related bias in conformational data from the Cambridge Structural Database: A rare phenomenon?

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

CCP4 Diamond 2014 SHELXC/D/E. Andrea Thorn

Determination of the structure and dynamics of proteins using NMR chemical shifts (CS) and CS enhanced protein data bank (CS-PDB)

X-ray Crystallography I. James Fraser Macromolecluar Interactions BP204

Tools for Cryo-EM Map Fitting. Paul Emsley MRC Laboratory of Molecular Biology

electronic reprint To B or not to B: a question of resolution? Ethan A. Merritt Crystallography Journals Online is available from journals.iucr.

Experimental phasing in Crank2

Coot, pyrogen & CCP4 SRS. Paul Emsley MRC Laboratory of Molecular Biology, Cambridge, UK June 2015

Supporting Information. Synthesis of Aspartame by Thermolysin : An X-ray Structural Study

TR&D3: Brownian Mover DBP4: Biosensors

Validation of Experimental Crystal Structures

G L. Achieving high quality protein-ligand X-ray structures for drug design. Oliver Smart Global Phasing Ltd

molecules ISSN

Removing bias from solvent atoms in electron density maps

Alicyclic Hydrocarbons can be classified into: Cycloalkanes Cycloalkenes Cycloalkynes

electronic reprint (2,4,6-Trinitrophenyl)guanidine Graham Smith, Urs D. Wermuth and Jonathan M. White Editors: W. Clegg and D. G.

Received: February 2, 2015 Published: May 13, 2015

PDBe TUTORIAL. PDBePISA (Protein Interfaces, Surfaces and Assemblies)

Structurale, Université Grenoble Alpes, CNRS, CEA, Grenoble, France

Preparing a PDB File

CRYSTALLOGRAPHY AND STORYTELLING WITH DATA. President, Association of Women in Science, Bethesda Chapter STEM Consultant

CSD. CSD-Enterprise. Access the CSD and ALL CCDC application software

Full wwpdb/emdatabank EM Map/Model Validation Report i

Re-refinement from deposited X-ray data can deliver improved models for most PDB entries

Recent activities around the crystallography open databases COD, PCOD and P2D2

Automated Protein Model Building with ARP/wARP

electronic reprint 3,5-Di-p-toluoyl-1,2-dideoxy-fi-1-(imidazol-1-yl)-D-ribofuranose Nicole Düpre, Wei-Zheng Shen, Pablo J. Sanz Miguel and Jens Müller

organic papers 2-Iodo-4-nitro-N-(trifluoroacetyl)aniline: sheets built from iodo nitro and nitro nitro interactions

Anisotropy in macromolecular crystal structures. Andrea Thorn July 19 th, 2012

Version 1.2 October 2017 CSD v5.39

Automated ligand fitting by core-fragment fitting and extension into density

Molecular-scale ligand effects in small gold-thiolate nanoclusters

organic papers Acetone (2,6-dichlorobenzoyl)hydrazone: chains of p-stacked hydrogen-bonded dimers Comment Experimental

International Journal of Innovative Research in Science, Engineering and Technology. (An ISO 3297: 2007 Certified Organization)

shelxl: Refinement of Macromolecular Structures from Neutron Data

Atomic weight = Number of protons + neutrons

Full wwpdb X-ray Structure Validation Report i

Model and data. An X-ray structure solution requires a model.

Web-based Auto-Rickshaw for validation of the X-ray experiment at the synchrotron beamline

SUPPLEMENTARY INFORMATION. doi: /nature07461

Ultra-high resolution structures in validation

Direct Methods and Many Site Se-Met MAD Problems using BnP. W. Furey

DOCKING TUTORIAL. A. The docking Workflow

Kd = koff/kon = [R][L]/[RL]

Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar.

Molecular Modelling. Computational Chemistry Demystified. RSC Publishing. Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK

IgE binds asymmetrically to its B cell receptor CD23

CS612 - Algorithms in Bioinformatics

addenda and errata [N,N 0 -Bis(4-bromobenzylidene)-2,2-dimethylpropane-j Corrigendum Reza Kia, a Hoong-Kun Fun a * and Hadi Kargar b

PAN-modular Structure of Parasite Sarcocystis muris Microneme Protein SML-2 at 1.95 Å Resolution and the Complex with 1-Thio-β-D-Galactose

The Cambridge Structural Database (CSD) a Vital Resource for Structural Chemistry and Biology Stephen Maginn, CCDC, Cambridge, UK

The Force Field Toolkit (fftk)

TLS and all that. Ethan A Merritt. CCP4 Summer School 2011 (Argonne, IL) Abstract

The use of Refmac crystallographic refinement program for the detection of alternative conformations in biological macromolecules

SUPPLEMENTARY FIGURES

Continuous symmetry and shape measures, or how to measure the distance between polyhedra representing molecules

Coot Updates. Paul Emsley Sept 2016

1. Protein Data Bank (PDB) 1. Protein Data Bank (PDB)

Macromolecular Crystallography Part II

π-hole interactions involving nitro compounds: directionality of nitrate esters Antonio Báuza, a Antonio Frontera*,a and Tiddo J.

Structural insights into Aspergillus fumigatus lectin specificity - AFL binding sites are functionally non-equivalent

4. Constraints and Hydrogen Atoms

Protein Structure Determination Using NMR Restraints BCMB/CHEM 8190

Procheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics.

SUPPLEMENTARY INFORMATION

OpenDiscovery: Automated Docking of Ligands to Proteins and Molecular Simulation

Experimental Phasing with SHELX C/D/E

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version 3.0, December 1, 2006 Updated to Version 3.

User Guide for LeDock

Experimental phasing in Crank2

Supplementary Information for Evaluating the. energetic driving force for co-crystal formation

Macromolecular Crystallography Part II

AN AB INITIO STUDY OF INTERMOLECULAR INTERACTIONS OF GLYCINE, ALANINE AND VALINE DIPEPTIDE-FORMALDEHYDE DIMERS

Rational Design of Thermodynamic and Kinetic Binding Profiles by. Optimizing Surface Water Networks Coating Protein Bound Ligands

Transcription:

This is an author produced version of Privateer: : software for the conformational validation of carbohydrate structures. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/95794/ Article: Agirre, Jon orcid.org/0000-0002-1086-0253, Iglesias-Fernández, Javier, Rovira, Carme et al. (3 more authors) (2015) Privateer: : software for the conformational validation of carbohydrate structures. Nature structural & molecular biology. pp. 833-834. ISSN 1545-9985 https://doi.org/10.1038/nsmb.3115 promoting access to White Rose research papers eprints@whiterose.ac.uk http://eprints.whiterose.ac.uk/

NATURE STRUCTURAL & MOLECULAR BIOLOGY CORRESPONDENCE Privateer: software for the conformational validation of carbohydrate structures Jon Agirre 1, Javier Iglesias Fernández 2, Carme Rovira 2,3, Gideon J Davies 1, Keith S Wilson 1 & Kevin D Cowtan 1 1 York Structural Biology Laboratory, Department of Chemistry, The University of York, England. 2 Departament de Química Orgànica and Institut de Química Teòrica i Computacional (IQTCUB), Universitat de Barcelona, Barcelona, Spain. 3 Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain. e mail: kevin.cowtan@york.ac.uk or jon.agirre@york.ac.uk Nature Structural & Molecular Biology 22, 833 834 (2015) doi:10.1038/nsmb.3115 Published online 04 November 2015 Carbohydrates, including O and N glycans attached to protein and lipid structures, are increasingly important in cellular biology. Crystallographic refinement of sugars is, however, very poorly performed with thousands of incorrect structures polluting the PDB 1,2. While nomenclature validation became possible in the past decade with the introduction of tools such as pdb care 3, inappropriate refinement protocols at resolutions lower than 1.6 Å can still force the correct sugar into a most improbable ring conformation, let alone distort one with chemical errors 1. Higher energy conformations are very infrequent in nature even out of the question in N glycans and should always be backed by clear electron density, and otherwise treated as outliers. We have developed the Privateer software package for the detection and prevention of conformational anomalies in cyclic carbohydrate structures. The software identifies incorrect regio and stereo chemistry and unlikely conformations, produces a visual checklist for rapid correction of the errors in real space, and ensures that conformational preferences are accounted for during subsequent rebuilding and refinement, achieved as follows. Privateer holds a manually curated database of supported monosaccharides based on the PDB Chemical Component Dictionary, whose entries contain coordinates for an energy minimized conformer that the PDB calculates upon new depositions using Corina (Molecular Networks) or Omega (OpenEye). For each of these supported sugars, the database holds the conformation as determined by the Cremer Pople algorithm 4, as well as stereochemical and geometric information, to be used as reference upon validation with the software. Additionally, puckering amplitudes 4 of pyranoses are compared to those registered

during ab initio metadynamics simulations of the conformational free energy landscape of cyclohexane 5. A real space correlation coefficient against omit mfo DFc electron density is reported as a quality of fit indicator. Bias minimized map coefficients are exported automatically and can be subsequently used to assess identification of the sugars. Privateer is able to feed results into Coot 6 through its Python or Scheme scripting interface, loading model and maps automatically and flagging issues up visually. Most of the conformational outliers detected at higher resolutions (< 1.6Å) are typically wrongly modelled from the start and may require rebuilding (Fig. 1, top panel). At lower resolutions, higher energy conformations may appear as a consequence of under parameterized refinement, in spite of starting from a correct input model. These can be corrected during both real and reciprocal space refinement (Fig. 1, bottom panel) provided that enough restraints are introduced to balance the parameter to observation ratio. A common way of introducing conformational preferences is by using harmonic torsion restraints, but these are equally satisfied by multiple conformations, e.g. a chair and a boat. To enforce a single conformation, Privateer produces CIF library files that contain monoperiodic torsion restraints on the ring bonds, computed directly from the minimal energy conformer. These files, generated using the CCP4 Storage, Retrieval and Search framework for small molecule data (CCP4SRS), are compatible with all major refinement packages. In addition, torsion restraints can be turned on for the distorted sugars in Refmac5 7 using the keyword file generated by Privateer, making its integration into re refinement pipelines such as PDB_REDO 8 straightforward. It is just such examples that emphasize the problems of modelling sugars where the density is poor. In both cases, the published structures have high energy conformations which are not justified by the density, while is clearly more appropriate to use the expected minimal energy conformation. Privateer is distributed by CCP4 9. A graphical user interface will be available in the upcoming CCP4 7.0 release, fully automating the validation, correction and re refinement process, and providing access to interactive HTML5/Javascript graphical reports. This publication should be considered the primary citation for Privateer. Acknowledgments This work was supported by the Biotechnology and Biological Sciences Research Council grant BB/K008153/1. The authors thank Eugene Krissinel (CCP4, UK) for valuable assistance in using CCP4SRS, and to the Barcelona Supercomputing Center (BSC). Competing financial interests The authors declare no competing financial interests.

References 1 Agirre, J., et al. Nat Chem Biol 11(5), 303 (2015). 2 Lütteke, T. et al. Carbohydr. Res. 339, 1015 1020 (2004). 3 Lütteke, T. & von der Lieth, C.W. BMC bioinformatics. 5, 69 (2004). 4 Cremer, D. & Pople, J.A. J. Am. Chem. Soc. 97, 1354 1358 (1975). 5 Iglesias Fernandez, J., et al. Chemical Science 6(2), 1167 1177 (2015). 6 Emsley, P. et al. Acta Cryst. D66 (4), 486 501 (2010). 7 Murshudov, G.N. et al. Acta Cryst. D67(4), 355 367 (2011). 8 Joosten, R.P. et al. Bioinformatics 27, 3392 3398 (2011). 9 Winn, M.D. et al. Acta Cryst. D67(4), 235 242 (2011). 10 Brunger, A.T. Nature Protocols 2, 2728 2733 (2007).

Figure 1 Figure 1. N glycan D pyranosides in higher energy conformations as deposited in the PDB (left) and their corrected counterparts (right). Grey mesh: omit mfo DFc density contoured at 4. Top panel: MAN 616 (chain A) from PDB 3QVP. 3d7d, 1.1 Å resolution data. This sugar was distorted to a 1 S 5 conformation by the refinement software (CNS 10 ) after being modeled with a linkage instead of the expected one, and therefore required rebuilding before it could be refined to the minimal energy conformation ( 4 C 1 ). Bottom panel: NAG 1757 (chain A) from PDB 3D7D, refined with Refmac5 to the 2,5 B conformation in partial density. Since this conformation minimizes deviations from ideal bond lengths, angles and harmonic torsions, the model could not be corrected by restraining those parameters. Application of the monoperiodic torsion set in Refmac5 produced by Privateer produced the expected minimal energy conformation ( 4 C 1 ).