G L. Achieving high quality protein-ligand X-ray structures for drug design. Oliver Smart Global Phasing Ltd

Similar documents
1.b What are current best practices for selecting an initial target ligand atomic model(s) for structure refinement from X-ray diffraction data?!

Garib N Murshudov MRC-LMB, Cambridge

Analyzing Molecular Conformations Using the Cambridge Structural Database. Jason Cole Cambridge Crystallographic Data Centre

Generating Small Molecule Conformations from Structural Data

Full wwpdb X-ray Structure Validation Report i

Coot Updates. Paul Emsley Sept 2016

Q1 current best prac2ce

wwpdb X-ray Structure Validation Summary Report

Full wwpdb X-ray Structure Validation Report i

CSD. Unlock value from crystal structure information in the CSD

Full wwpdb X-ray Structure Validation Report i

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

X-ray Crystallography I. James Fraser Macromolecluar Interactions BP204

Pipelining Ligands in PHENIX: elbow and REEL

Dictionary of ligands

Web-based Auto-Rickshaw for validation of the X-ray experiment at the synchrotron beamline

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

4. Constraints and Hydrogen Atoms

This is an author produced version of Privateer: : software for the conformational validation of carbohydrate structures.

Full wwpdb X-ray Structure Validation Report i

Full wwpdb/emdatabank EM Map/Model Validation Report i

Version 1.2 October 2017 CSD v5.39

shelxl: Refinement of Macromolecular Structures from Neutron Data

Coot, pyrogen & CCP4 SRS. Paul Emsley MRC Laboratory of Molecular Biology, Cambridge, UK June 2015

Acta Crystallographica Section F

Full wwpdb X-ray Structure Validation Report i

Electronic Supplementary Information (ESI) for Chem. Commun. Unveiling the three- dimensional structure of the green pigment of nitrite- cured meat

Scientific Integrity: A crystallographic perspective

Preparing a PDB File

Manipulating Ligands Using Coot. Paul Emsley May 2013

Direct Method. Very few protein diffraction data meet the 2nd condition

Build_model v User Guide

Validation of Experimental Crystal Structures

User Guide for LeDock

Full wwpdb NMR Structure Validation Report i

Macromolecular Crystallography Part II

DOCKING TUTORIAL. A. The docking Workflow

Modelling Macromolecules with Coot

CCP4 Diamond 2014 SHELXC/D/E. Andrea Thorn

The Cambridge Structural Database (CSD) a Vital Resource for Structural Chemistry and Biology Stephen Maginn, CCDC, Cambridge, UK

Summary of Experimental Protein Structure Determination. Key Elements

APPENDIX E. Crystallographic Data for TBA Eu(DO2A)(DPA) Temperature Dependence

A Journey from Data to Knowledge

Macromolecular Crystallography Part II

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

LigandScout. Automated Structure-Based Pharmacophore Model Generation. Gerhard Wolber* and Thierry Langer

Flexibility and Constraints in GOLD

Supporting Information

Supporting Information. Synthesis of Aspartame by Thermolysin : An X-ray Structural Study

Data Quality Issues That Can Impact Drug Discovery

CALIFORNIA INSTITUTE OF TECHNOLOGY BECKMAN INSTITUTE X-RAY CRYSTALLOGRAPHY LABORATORY

Conformational Searching using MacroModel and ConfGen. John Shelley Schrödinger Fellow

Protein crystallography. Garry Taylor

Experimental Phasing with SHELX C/D/E

Dock Ligands from a 2D Molecule Sketch

IgE binds asymmetrically to its B cell receptor CD23

Physical Chemistry Analyzing a Crystal Structure and the Diffraction Pattern Virginia B. Pett The College of Wooster

Protein Crystallography Part II

Molecular Biology Course 2006 Protein Crystallography Part II

CSD Conformer Generator User Guide

Ranking of HIV-protease inhibitors using AutoDock

The structure of Aquifex aeolicus FtsH in the ADP-bound state reveals a C2-symmetric hexamer

Molecular Modeling lecture 2

Organometallics & InChI. August 2017

HOMOLOGY MODELING. The sequence alignment and template structure are then used to produce a structural model of the target.

CSD Conformer Generator User Guide

Crystallographic education and research in the developing world: Experiences in DR Congo

Phaser: Experimental phasing

Structure Validation in Chemical Crystallography with CheckCIF/PLATON

The PhilOEsophy. There are only two fundamental molecular descriptors

Molecular Graphics. Molecular Graphics Expt. 1 1

Dr. Sander B. Nabuurs. Computational Drug Discovery group Center for Molecular and Biomolecular Informatics Radboud University Medical Centre

Identifying Interaction Hot Spots with SuperStar

Schrodinger ebootcamp #3, Summer EXPLORING METHODS FOR CONFORMER SEARCHING Jas Bhachoo, Senior Applications Scientist

Exploring symmetry related bias in conformational data from the Cambridge Structural Database: A rare phenomenon?

CAMBRIDGE STRUCTURAL DATABASE SYSTEM 2010 RELEASE WORKED EXAMPLES

Kd = koff/kon = [R][L]/[RL]

5.1. Hardwares, Softwares and Web server used in Molecular modeling

How to interpret the BUSTER reciprocal space correlation coefficients plot. Gérard Bricogne

ICM-Chemist-Pro How-To Guide. Version 3.6-1h Last Updated 12/29/2009

Autodock tutorial VINA with UCSF Chimera

Chem 341 Organic Chemistry I Lecture Summary 10 September 14, 2007

GC and CELPP: Workflows and Insights

CHEMISTRY 332 SUMMER 08 EXAM I June 26-28, 2008

Manganese-Calcium Clusters Supported by Calixarenes

Molecular Modelling. Computational Chemistry Demystified. RSC Publishing. Interprobe Chemical Services, Lenzie, Kirkintilloch, Glasgow, UK

Why Crystal Structure Validation?

Anisotropy in macromolecular crystal structures. Andrea Thorn July 19 th, 2012

Sigma Bond Metathesis with Pentamethylcyclopentadienyl Ligands in Sterically. Thomas J. Mueller, Joseph W. Ziller, and William J.

7.91 Amy Keating. Solving structures using X-ray crystallography & NMR spectroscopy

Likelihood and SAD phasing in Phaser. R J Read, Department of Haematology Cambridge Institute for Medical Research

ENERGY MINIMIZATION AND CONFORMATION SEARCH ANALYSIS OF TYPE-2 ANTI-DIABETES DRUGS

Redetermination of Crystal Structure of Bis(2,4-pentanedionato)copper(II)

Synthesis and structural characterization of homophthalic acid and 4,4-bipyridine

Fondamenti di Chimica Farmaceutica. Computer Chemistry in Drug Research: Introduction

X- ray crystallography. CS/CME/Biophys/BMI 279 Nov. 12, 2015 Ron Dror

Transcription:

Achieving high quality protein-ligand X-ray structures for drug design Oliver Smart Global Phasing Ltd G L Structural Basis of Pharmacology: Deeper Understanding of Drug Discovery through Crystallography Erice 4 th June 2014

Structure-guided drug discovery o Relies on X-ray structure of protein ligand complex (XSPLC) o Obtaining such a structure involves many stages. o Mistakes made in any of the stages will adversely affect results. o PDB is a data bank of complete structures o Despite this, the PDB has entries showing mistakes for each stage. o Users of XSPLCs should have critical awareness Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 2

Steps involved in solving a XSPLC See page 135 of the lecture notes for list of steps: a) Experimental X-ray data collection from a protein crystal with the ligand soaked or co-crystallized using synchrotron or an in-house diffractometer. b) Image integration and data processing to give space group, unit cell and structure factor amplitudes (SF). c) Molecular replacement to reposition the protein model for the cell and SF from (b). d) Initial refinement of model from (c) without a ligand. e) Assessment of whether the difference electron density (ED) for the model from (d) supports placing a ligand. f) Production of a molecular model and a restraint dictionary for the ligand. g) Fitting the model of ligand (f) into difference density and protein model from (d). h) Refinement of combined protein and ligand model. i) Assessment of refined protein-ligand complex (h). j) If assessment shows issues then rebuild/refit protein, ligand and/or solvent and back to step (h). k) Deposition of the structure model, SF, maps and validation data to an in-house database (or the PDB). We will look at each one in turn Slide are marked step (a), step (b) Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 3

before step (a) Crystallization/soaking Often ligands are dissolved in nasty solvents like DMSO. Using this to soak protein crystals can cause: o Crystal cracking o Unit-cell changes o Reduction in the resolution limits of measurable diffraction data Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 4

X-ray data resolution The further out spots go, the more information. Measure this in Å data resolution: In practice diffraction is often more limited: High resolution data but with ice rings (Rupp figure 8-25) http://commons.wikimedia.org/wiki/file:lysozym_diffraction.png Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 5

Data resolution limit affects ligand electron density detail Well placed/refined sucrose ligand at different data resolutions: 1ylt 1.2Å resolution atomic resolution 2Fo-Fc ED shows atoms as Individual blobs. Need higher resolution for hydrogen atoms 2pwe 2.0Å resolution Typical medium resolution for ligand studies. Can see ring pucker 2qqv 3.0Å resolution Low resolution. Ligand placement unambiguous but fine detail cannot be seen Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 6

Electron density (ED) o ED maps are as important as the model: o 2Fo-Fc map indicates where electrons are (according to SF and model). Normally color blue or grey. o Fo-Fc difference map: green for positive difference: where the current model fails to place sufficient electrons Red for negative difference: where the current model places too many electrons Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 7

Ligand electron density 2h7p.pdb: ED around ligand, as visualized in buster-report 2h7p.pdb: ED visualized in coot Notice difference density around ligand Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 8

step (i) showing problems in step (a) Data collection o Mistakes here are serious (uncorrectable) o For example 1t0o Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 9

step (b) Data processing and integration o G L tool autoproc provides a robust, intelligent tool to go from images to intensities o Uses XDS, ccp4 tools, G L tools o Important to exclude ice rings Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 10

step (d) showing problems in step (b) Data processing problems revealed in BUSTER RecipSCC plot o Shows up ice rings o Shows up problems with beam stop o Shows up anisotropic diffraction o We've tried to persuade people to look at it for years o Now buster-report makes it hard to ignore! Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 11

step (c) Phasing or Molecular Replacement o For simple ligand soaks this is usually not a problem o But sometimes unit-cell changes can be large o The G L Pipedream tool automates limited molecular replacement using ccp4 Phaser Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 12

Initial Refinement of protein before ligand placement o Model is adjusted to better fit density o This is crucial because ED maps improve as model is improved, so poor initial ligand ED becomes interpretable o Maximum-likelihood refinement is now universally used. o BUSTER maps are particularly good. step (d) Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 13

step (d) BUSTER maps are bias-resistant Electron Density Server BUSTER maps 2wfw.pdb BUSTER rebuilt & re-refined model Example 2wfw 1.6Å resolution Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 14

step (d) 2wfw example 1.6Å resolution As deposited After default refinement After remediation Rwork 0.210 0.228 0.193 Rfree 0.246 0.263 0.215 Rotamer outliers % 4.70 3.69 1.78 Ramach. outliers % 3.32 3.88 0.29 Ramach. favoured % 94.5 93.35 98.3 Molprobity score 2.55 2.27 1.28 Molprobity percentile 12 26 97 Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 15

step (d) Target restraints o Use similarity restraints LSSR to an external fixed PDB structure. o Useful for low resolution ligand complex with higher resolution apo/other ligand o 3urp, 3.2Å resolution poor data, Acta Cryst D (2012) 68:368-380 - - - Normal R free Normal R work - - - Rigid body R free Rigid body R work - - - Target R free Target R work Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 16

After initial refinement, need to assess ED difference density: is there any ligand bound? step (e) Figure 4. Pozharsk et al. [2013] classify the diclofenac ligand in PDB entry 3IB0 (1.4 Å resolution) as absent. BUSTER-REPORT supports this classification: the ligand has high B- factors and a CC (2Fo-Fc) of 0.57 and as shown in panel (a) there is only a small amount of disconnected ED around it. Panel (b) shows COOT image of the result of BUSTER rerefinement of the protein after the diclofenac (purple ghost ) has been removed. The re-refinement included automated water placement and this shows the ED can be well modeled by three water molecules (red crosses) that form good hydrogen bonds. 17

step (e) and (f) Fitting a protein-bound ligand o The task is to fit ligand into F o -F c and then refine complex o Prior knowledge of ligand chemistry is needed to interpret density o Use this prior knowledge o in ligand fitting step (g) to assess accessible low-strain conformations that the ligand can adopt o in refinement step (h) to keep ligand conformation realistic Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 18

step (f) Prior knowledge is essential! 2bal kinase 2.1Å resolution. Restraints: 2Fo-Fc real space CC 0.943 No restraints:cc 0.957 Oliver Smart G L 19

step (f) 2h7p.pdb shows how the use of a poor dictionary leads to poor fit o 2h7p.pdb 1.86Å resolution shows nice density o But ligand has poor geometry 5-membered saturated lactam ring is flat while it should be puckered Bent amide group Unusual cyclohexyl group conformation Difference density shows X-ray data are unhappy with the modeled ligand 20

grade: ligand dictionaries based on CSD information where possible step (f) o Use CCDC Mogul program to survey the CSD small-molecule structure database o G L grade uses mogul in batch mode! o Mogul is used as a source of information for restraints not for validation o Uses CSD dihedral information for restraints on clear sp2-sp2 or sp3-sp3 torsions Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 21

step (f) What if CSD has nothing to say? o Use quantum chemistry as pioneered by elbow normally use RM1 semi-empirical method as implemented in dynamo (Martin Field, IBS Grenoble) o Only used for particular restraints where Mogul does not provide CSD data. Angles to H atoms (bond lengths taken from neutron data) Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 22

grade tool step (f) o Needs CSDS installation o Input: mol2 coordinates cif dictionary from other dictionary generators smi smiles string uses libcheck o grade_pdb_ligand tool for existing PDB ligands fetches info from pdbechem or ligand_expo produces grade dictionary o Produces cif restraint dictionary for use in BUSTER and rhofit coot refmac... o In current BUSTER academic & consortium releases Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 23

Let s look at one ligand-like example from CSD: EVIDUI S(=O)(=O)(N1CCN(CC1)C)c1ccc(NC(=O)C)cc1 grade-xxx.cif step (f) o Like ½ viagra with a amide attached o CSD structure: o amide planar but not coplanar with phenyl ring o Both nitrogen atoms pyramidal o Create dictionaries from smiles o Score dictionary against CSD structure Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 24

step (f) EVIDUI 3 rd party dictionary o dictionary from X, ideal coords o Test the dictionary by scoring the CSD structure o piperazine nitrogens exactly planar but should be ~tetrahedral o amide plane missed. o rms bond deviation 0.048Å rms angle 5.0 Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 25

step (f) EVIDUI grade with mogul grade S(=O)(=O)(N1CCN(CC1)C)c1ccc(NC(=O)C)cc1 o Piperazine nitrogen atoms correct pyramidal o Amide group: torsions restraints hold trans. o rms bond 0.006Å rms angle 0.6 Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 26

step (f) Grade Web Server http://grade.globalphasing.org o For good results grade requires mogul and so CSDS licence o Grade Web Server By kind permission of CCDC Launched in May 2012 Provides easy access to Grade for non-confidential compounds Provides GΦL with useful test examples for development >3500 cif dictionaries produced to date (May 2014) Dictionaries can be used in BUSTER, refmac, phenix.refine Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design Grade Web Server 27

2h7p.pdb buster-report steps (f), (i), (j) o 2h7p.pdb 1.86Å resolution shows nice density o But ligand has poor geometry 5-membered saturated lactam ring is flat while it should be puckered Bent amide group Unusual cyclohexyl group conformation Difference density shows X-ray knows better! Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 28

steps (j), (h) and (i) 2h7p BUSTER re-refinement with grade dictionary fixes all problems o Grouped occupancy refinement sorts negative density for chlorine atom o Very nice fit to density Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 29

Importance of providing correct chemistry to the dictionary generator. o 1byk.pdb E. coli trehalose-repressor in complex with trehalose-6-phosphate. o From Kay Diederichs University of Konstanz o 2.5Å resolution structure solved in 1998. They used β-glucopyranose as part of trehalose-6- phosphate rather than the correct α anomer o PDB chemical components T6P propagates mistake. step (f) Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 30

1byk BUSTER re-refinement with incorrect T6P grade dictionary steps (h) and (i) ED and mogul show problem Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 31

steps (j), (h) and (i) 1byk BUSTER re-refinement with correct-chemistry T6P grade dictionary Now working with Kay to deposit correction to PDB step (k) Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 32

step (g) Ligand fitting by hand o Erice Workshop o Will use as examples: 2h7p (Bob Stroud) 1pmq (Giovanna Scapin) Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 33

Rhofit ligand fitter step (g) o G L Rhofit is our flexible ligand Fo-Fc fitter o Uses cif restraint dictionary to describe ligand conformational properties. o Works well with macrocycles o Can automatically sample chiralities if these are unknown. Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 34

Rhofit can handle a wide diversity of ligands step (g) Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 35

Pipedream: an intelligent ligand screening pipeline steps (b), (c), (d), (e), (g), (h) and (i) o Industrial crystallographers 50+ structures per month. o G L Pipedream: automated data processing, structure refinement, ligand fitting and post-refinement report o Seamlessly links autoproc, BUSTER, rhofit & buster-report o Grade not included as it should be run before you go get images o Automation is useful but human checking and intervention are essential! Cc1c(Cl)cccc1NC(=O)[C@@H]1CN(C2CCCCC2)C(=O)C1 Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 36

step (i) buster-report Oliver Smart G L pdb entry 3qgd has bad ice ring data contamination MolProbity is a great tool for assessing protein geometry issues. o Produce rich but concise report of a BUSTER run o html, pdf, xml output o Including useful analysis like MolProbity and Mogul 37

Current work: use QM or force field in BUSTER refinement to assess ligand strain energy Use a weighted QM or FF energy as part of the geometry function X-ray BUSTER ML for everything! conventional TNT+ geometry function: Protein & water & ions step (i) Weighted QM or force field energy for ligand 38

39 J. Chem. Inf. Model. 2012, 52, 739 756

BUSTER Refinement with QM or Force Field for ligands PDB ligand structures reported by Sitzmann et al. to have high local strain energy using DFT Select 19 examples Refine in BUSTER with DFT or MMFF94 Determine local strain energy BUSTER + DFT results Low strain energy (<5 kcal/mol) BUSTER + MMFF94 results strain compares to DFT CPU requirements: DFT days, MMFF94 negligible G L Global Phasing Limited M. Sitzmann et al., JCIM, 52, 739 (2012).

Use QM or force field in BUSTER refinement to assess ligand strain energy o Conformational strain energy provides an additional method of ligand validation o Well fitted/refined structures have strain energies < 5kcal/mol o FF suitable for general use. o QM suitable for difficult cases. o Links refinement to computational modelling o Indications of strain agree with Mogul o Joint work with John Liebeschuetz (CCDC) and Greg Warren OpenEye step (i) Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 41

Identification of mis-fitted ligand using ligand strain & difference density step (i) 4bki DDR1 Receptor Tyrosine Kinase Inhibitor PDB entry fitted as per design, but the ligand geometry is strained 42

4bki re-refinement with BUSTER reduces strain and reveals mis-fit step (i) indolin-2-one ring flip required Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 43

step (j) BUSTER re-refinement after ring flip indolin-2-one ring happy improve C=O H-bond Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 44

step (k) Corrected PDB entry 4CKR Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 45

step (k) Deposition to in-house database or PDB o Deposition minimum: Model (PDB, mmcif) Structure factors (mtz) o Ideally should include: Diffraction images! For 1byk Kay Diederichs kept these. Completeness 1998: 67.5% 2014: 99.4% Ligand chemistry and restraint dictionaries Final map that the crystallographer saw Validation information (buster-report pdf output) Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 46

Acknowledgments Ideas, Feedback, Global Phasing Colleagues Gérard Bricogne Claus Flensburg Andrew Sharff Tom Womack Clemens Vonrhein Global Phasing Consortium Users BUSTER users CCDC John Liebeschuetz Colin Groom OpenEye Greg Warren Brian Kelley Support $ Global Phasing Consortium EU project: SILVER FP7-HEALTH-F3-2010-260644 PDB & depositors Thanks to all depositors of structures used. We do not learn from experience... we learn from reflecting on experience. John Dewey Oliver Smart G L Achieving high quality protein-ligand X-ray structures for drug design 47