Octa-Acid Binding Affinities with QM Methods

Similar documents
Free-energy perturbation and quantum mechanical study of SAMPL4 octa-acid hostguest binding energies.

Binding free energies in the SAMPL5 octa-acid host guest challenge calculated with DFT-D3 and CCSD(T)

Supporting Information: Predicting the Ionic Product of Water

Quantum Chemistry for Supramolecular Complexes

Advanced in silico drug design

Non-covalent force fields computed ab initio

High-level Quantum Chemistry Methods and Benchmark Datasets for Molecules

Interaction energy of DNA base pairs and amino acid pairs: DFT, DFTB and WFT calculations

Comparison of the Efficiency of the LIE and MM/GBSA Methods to Calculate LigandBinding Energies

Prediction of spectroscopic parameters for bio-organic and bio-inorganic intermediates in complex systems

Jack Simons, Henry Eyring Scientist and Professor Chemistry Department University of Utah

Introduction to Electronic Structure Theory

Calculation of Ligand Dissociation Energies in Large Transition-Metal Complexes arxiv: v2 [physics.chem-ph] 20 Mar 2018

Macrocycle Conformational Sampling by DFT-

Binding affinities by alchemical perturbation using QM/MM with a large QM system and polarizable MM model.

Large Density-Functional and Basis-Set Effects for the DMSO Reductase Catalyzed Oxo-Transfer Reaction

Tailoring the Properties of Quadruplex Nucleobases for Biological and Nanomaterial Applications

How are hydrogen bonds modified by metal binding?

QUANTUM CHEMISTRY PROJECT 3: PARTS B AND C

T. Helgaker, Department of Chemistry, University of Oslo, Norway. T. Ruden, University of Oslo, Norway. W. Klopper, University of Karlsruhe, Germany

platinum and iridium complexes

Effect of geometry optimisations on QM-cluster and QM/MM studies of reaction energies in proteins

This is a simple input file for the calculation of NMR chemical shieldings for a given molecule using the B3LYP functional and def2-tzvpp basis set:

Alanine: Then There Was Water

Force Fields for Classical Molecular Dynamics simulations of Biomolecules. Emad Tajkhorshid

3rd Advanced in silico Drug Design KFC/ADD Molecular mechanics intro Karel Berka, Ph.D. Martin Lepšík, Ph.D. Pavel Polishchuk, Ph.D.

Introduction to DFTB. Marcus Elstner. July 28, 2006

Computational Studies of the Photoreceptor Rhodopsin. Scott E. Feller Wabash College

Noncovalent interactions in biochemistry

Multi-scale approaches in description and design of enzymes

List of Figures Page Figure No. Figure Caption No. Figure 1.1.

Electronic quantum effect on hydrogen bond geometry in. water dimer

4 Post-Hartree Fock Methods: MPn and Configuration Interaction

Fragmentation methods

Solution of the Electronic Schrödinger Equation. Using Basis Sets to Solve the Electronic Schrödinger Equation with Electron Correlation

Towards gas-phase accuracy for condensed phase problems

Basis Set for Molecular Orbital Theory

Electronic excitations in conjugated. Many-Body Green's Functions. Behnaz Bagheri Varnousfaderani. Behnaz Bagheri Varnousfaderani

Force Fields for MD simulations

Electronic structure theory: Fundamentals to frontiers. VI. Analysis and more.

Computational Chemistry I

Importing ab-initio theory into DFT: Some applications of the Lieb variation principle

Structures and infrared spectra of fluoride hydrogen sulfide clusters from ab initio calculations: F -(H 2 S) n, n = 1 5w

Specific Ion Solvtion in Ethylene Carbonate and Propylene Carbonate

Supporting Information

Accurate van der Waals interactions from ground state electron density

Supplementary information

Jack Simons, Henry Eyring Scientist and Professor Chemistry Department University of Utah

CE 530 Molecular Simulation

Lec20 Fri 3mar17

Tools for QM studies of large systems

MM-PBSA Validation Study. Trent E. Balius Department of Applied Mathematics and Statistics AMS

Computational Methods. Chem 561

Molecular dynamics simulation of Aquaporin-1. 4 nm

Q-Chem 4.0: Expanding the Frontiers. Jing Kong Q-Chem Inc. Pittsburgh, PA

Towards a force field based on density fitting

Supporting Information Computational Part

Competing, Coverage-Dependent Decomposition Pathways for C 2 H y Species on Nickel (111)

D3RGC2: free energy scoring by alchemical free energy implementation in SOMD

Lecture 11: Potential Energy Functions

Supporting Information Computational Part

BUDE. A General Purpose Molecular Docking Program Using OpenCL. Richard B Sessions

CHEM6085: Density Functional Theory Lecture 10

General Physical Chemistry II

Practical Advice for Quantum Chemistry Computations. C. David Sherrill School of Chemistry and Biochemistry Georgia Institute of Technology

Hyeyoung Shin a, Tod A. Pascal ab, William A. Goddard III abc*, and Hyungjun Kim a* Korea

Lecture: P1_Wk1_L1 IntraMolecular Interactions. Ron Reifenberger Birck Nanotechnology Center Purdue University 2012

Uptake of OH radical to aqueous aerosol: a computational study

Force Fields for Classical Molecular Dynamics simulations of Biomolecules. Emad Tajkhorshid

On the selection of domains and orbital pairs in local correlation treatments

Ab initio calculations for potential energy surfaces. D. Talbi GRAAL- Montpellier

ONETEP PB/SA: Application to G-Quadruplex DNA Stability. Danny Cole

Lecture 5: More about one- Final words about the Hartree-Fock theory. First step above it by the Møller-Plesset perturbation theory.

Charge-Transfer and Dispersion Energies in Water Clusters

This semester. Books

Gilles Frison, Gilles Ohanessian. To cite this version: HAL Id: hal

The Molecular Dynamics Method

Characteristics of the interaction in azulene (H 2 X) n=1,2 (X=O,S) clusters.

The Potential Energy Surface

Ligand affinities estimated by quantum chemical calculations

An introduction to Molecular Dynamics. EMBO, June 2016

PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS

PH 548 Atomistic Simulation Techniques

Example questions for Molecular modelling (Level 4) Dr. Adrian Mulholland

The Potential Energy Surface (PES) And the Basic Force Field Chem 4021/8021 Video II.iii

Electron Correlation Methods

COSMO-RS Theory. The Basics

Mechanism of Cu/Pd-Catalyzed Decarboxylative Cross-Couplings: A DFT Investigation

Electric properties of molecules

Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015,

Accounting for Solvation in Quantum Chemistry. Comp chem spring school 2014 CSC, Finland

Structure and Dynamics of Chitin Nanofibrils in Aqueous Environment Revealed by Molecular Dynamics Simulations

Comparison of frozen-density embedding and discrete reaction field solvent models for molecular propertiesw

What Makes for a Good Catalytic Cycle? A Theoretical Study of the. SPhos Ligand in the Suzuki-Miyaura Reaction.

AN INTRODUCTION TO QUANTUM CHEMISTRY. Mark S. Gordon Iowa State University

METHODS FOR TREATING SOLVENT EFFECTS AND INTERMOLECULAR FORCES. Mark S. Gordon Iowa State University Ames Laboratory

Exercise 1: Structure and dipole moment of a small molecule

4 th Advanced in silico Drug Design KFC/ADD Molecular Modelling Intro. Karel Berka, Ph.D.

Exploring the Sequence Dependent Structure and Dynamics of DNA with Molecular Dynamics Simulation

A. MP2 - Inclusion of counterpoise in the optimisation step

Transcription:

Octa-Acid Binding Affinities with QM Methods Ulf Ryde Theoretical Chemistry Lund University Sweden 8 H B 7 Distance (Å) 6 5 4 3 2 1 0 0 1 2 3 4 5 6 Time (ns) 7 8 9 10 1. Intro 2. DFT-D3 3. CCSD(T) 4. MD snaps

1. Introduction 2. DFT Optimisation 3. DLPNO-CCSD(T) 4. MD snapshots

Results from SAMPL4 FEP with MM RESP or AM1-BCC charges QM/MM-FEP with NBB or ssea charge 1 DFT on optimised structures LCCSD(T0)-PMISP on DFT structures Varying results MAD RESP 4 AM1-BCC 4 QM-FEP 26 DFT opt 6 LCCSD(T0) 24 R2 0.8 0.8 0.4 0.8 0.4 tr 1.0 1.0-0.3 0.7 0.3

SAMPL5 Gusts more dissimilar Problems for FEP Net charge 1 or +1 not for DFT continuum solvation MAD RESP 4 AM1-BCC 4 QM-FEP 26 DFT opt 6 LCCSD(T0) 24

Philosophy Ligands not proper for FEP Dissimilar ligands Varying net charge In SAMPL4 DFT-opt gave MAD 6 9 kj/mol Might be good enough if FEP does not work Improve the DFT approach with gained experience

Improvements Minimise effect of flexibility Reduce effects of negative charge Try MD sampling Improve CCSD(T) with DLPNO-CCSD(T) (domain-based local pair natural orbital) no fractionation needed

DFT-opt in SAMPL4 Three approaches Opt in vacuum Vac Cos Opt in COSMO continuum COSMO + 4 water Vac Cos Wat Cons MAD 7 9 9 6 MADtr 5 8 9 6 R2 0.8 0.7 0.6 0.8 t 0.7 0.7 0.7 0.7 Wat

Flexibility of host from MD Extensive fluctuation 8Å Breathing motion Rapid dynamics No difference between guests in average 8 Distances HB CA 7 Distance (Å) 6 5 4 3 2 1 0 0 1 2 3 4 5 Time (ns) 6 7 8 9 10 HB Bz 11.7 MeBz 11.7 EtBz 11.8 pclbz 11.6 mclbz 11.6 Hx 11.7 MeHx 11.7 Pen 11.7 Hep 11.8 CC 18.2 18.4 18.5 18.4 18.2 18.5 18.4 18.6 18.4

Problem for optimised structures

Movement of Propionate Groups Extensive dynamics with 3 minima All 8 torsions change on a time-scale of 0.1 1.4 ns 1 D11 All 3 minima visited, but not for all propionates D21 D31 D41 D12 D22 D23 D24 Dihedral angle (degree) 360 300 240 180 120 60 0 0 1 2 3 4 5 Time (ns) 6 7 8 9 10

Problem also for optimised structures Less for vacuum optimisations

Attempt 1 Start from C4 symmetric host Keep all structures as symmetric and similar as possible Use vacuum optimisation Vac Cos Wat Cons MAD 7 9 9 6 MADtr 5 8 9 6 R2 0.8 0.7 0.6 0.8 t 0.7 0.7 0.7 0.7

Partly Successful G3 G5

Partly Successful G3 G5 G3 and G5 (with NMe3+) groups still distorted

Attempt 2 Remove all 4 propionate and 4 bezoate groups Reduces solvation energy from 6600 to 300 kj/mol Reduced flexibility Coulombic (ε = 80) correction for +/ series (~23 kj/mol)

Minimal effect at MM level The differences come from propionate groups MAD R2 tr 2 kj/mol difference for 3 perturbations Nearly same performance compared to Exp. OA NOA 3.6 ±0.2 4.1 ±0.1 0.84 ±0.04 0.79 ±0.03 1.00 ±0.00 1.00 ±0.00 HOA 3.9±0.2 0.81±0.04 1.00 ±0.00 20 OA NOA HOA 18 16 14 Calc. (kj/mol) Identical results for 5 perturbations 12 10 8 6 4 2 0 0 2 4 6 8 Exp. (kj/mol) 10 12 14

1. Introduction 2. DFT Optimisation 3. DLPNO-CCSD(T) 4. MD snapshots

Method Approach suggested by Grimme Chem Eur J 18(12)9955 Gbind Optimise structures with TPSS-D3/def2-SV(P) = EDFT Single-point TPSS/def2-QZVP' No counter-poise correction + Edisp + Gsolv + Gfreq DFT-D3 Becke Johnson damping and 3rd-order terms COSMO-RS solvation energy from BP/TZVP in vacuum and in COSMO(ε= ) ZPE, entropy & thermal corrections from HF-3c optimisation & frequency calculation Rigid-rotor harmonic-oscillator ideal-gas approximation low-lying modes with free-rotor approximation

Rigid Interaction Energies Results strongly stabilised when using rigid interaction energies i.e. with the geometry of the complex Ebind = E (HG@HG) E(H@HG) E(G@HG) Relaxation energy of guests 0 8 kj/mol for G2, G4 G6 G1 & G3 13 37 kj/mol Deteriorates the results Reduces the effect of host flexibility

Four sets Fully charged hosts (199 228 atoms) Neutralised hosts DLPNO-CCSD(T) calculations on DFT structures neutralised hosts MD snapshots for neutralised host

DFT Results Gbind = EDFT Charged host Large variation in DFT binding energy 1062 to +973 kj/mol Compensated by solvation energy 906 to +1156 + Edisp + Gsolv + Gfreq Neutral host Less variation in DFT energy 72 to +30 kj/mol Also solvation energy 62 197 kj/mol Sum shows small correlation with Exp. (0 0.4)

DFT Results Gbind = EDFT Dispersion energy 165 to 94 kj/mol Somewhat more variation with charged host + Some correlation with Exp. R = 0.3 0.4 Edisp + Gsolv + Gfreq Entropy and thermal corrections 68 89 kj/mol No correlation with Exp.

Total Energies Gbind Poor results = EDFT All binding energies 41 88 kj/mol too positive Calc. (kj/mol) 80 R2 = 0.1 0.6 120 100 MADtr = 18 27 kj/mol OAH NOAH OAM NOAM G1 + Edisp + Gsolv + Gfreq 60 40 20 G2 G5 0-20 -40 G6-35 -30-25 -20 Exp. (kj/mol) -15-10 -5 OAH OAMe NOAH NOAMe MADtr 23 21 18 27 R2 0.07 0.60 0.34 0.37

Sometimes Strange Structures G1 G3 G5 G6 G1 carboxylate inside host G3 & G5 NMe3 partly inside G6 also too deep

Structure Problems G4 More distorted than charged hosts All distorted, except G4

OAMe structures G1 G4 does not bind G1 carboxylate still inside host

OAMe structures G1 G2 G6 G1 & G6 less distorted G2 more distorted

OAH vs. OAMe Differences G2 upright in OAMe G3 & G5 NMe3 more inside in OAH G6 more inside in OAH

1. Introduction 2. DFT structures 3. DLPNO-CCSD(T) 4. MD snapshots

DLPNO-CCSD(T) Calculations Gbind = EDFT + Tried to improve the DFT-D3 results with DLPNO-CCSD(T) calculations 1 Neese et al. J Chem Theory Comput 2011, 7 (2011) 33 Tight PNO thresholds Full complex (159 188 atoms) Edisp CBS extrapolation with def2-svp and def2-tzvp TZ QZ on way + Gsolv + Gfreq Counter-poise corrections Br in G4 treated by ZORA CCSD(T) replaced the TPSS/def2-QZVP'+DFT-D3 energies Gbind = ECCSD(T) + Gsolv + Gfreq

DLPNO-CCSD(T) Results Quite similar to DFT results 7 to +11 kj/mol difference MAD = 5 kj/mol Much less than in SAMPL4 LCCSD(T)+PMISP 24 60 kj/mol Charged host? 140 120 Calc. (kj/mol) 100 NOAH NOAM CCH CCMe 80 60 40 20 0-45 -40-35 -30-25 Exp. (kj/mol) -20-15 -10-5

DLPNO-CCSD(T) Results Reproduce Exp equally poorly MADtr = 17 32 kj/mol R2 = 0.4 0.5 Same DFT structures NOAH NOAMe CCH CCMe MADtr 18 27 17 32 R2 0.34 0.37 0.51 0.44 140 120 Calc. (kj/mol) 100 NOAH NOAM CCH CCMe 80 60 40 20 0-45 -40-35 -30-25 Exp. (kj/mol) -20-15 -10-5

1. Introduction 2. DFT structures 3. DLPNO-CCSD(T) 4. MD snapshots

Structures from MD Gbind = EDFT Run 10 ns MD simulation of each complex with MM + Edisp + Gsolv + Gfreq Took 10 snapshots Optimised with HF-3c TPSS/def2-QZVP' DFT-D3 BJ 3rd-order dispersion COSMO-RS solvation energy (BP/TZVP) HF-3c ZPE, entropy & thermal corrections (same as DFT-opt)

Variation of binding free energies Restricted variation for most guests; 3 15 kj/mol Standard Error = 0.3 1.4 kj/mol Sometimes a one outlier G6 large variation (28 52 kj/mol; SE 3 5 kj/mol) G4 bimodal in OAH 80 70 60 Binding Energy (kj/mol) 50 40 30 20 10 0-10 -20 G1 G2 G3 G4 G5 G6 OAM G1 G2 G3 G4 G5 G6

Structures different (NOAH) G3 much more outside G5 much more outside G6 sometimes other orientation

Structures different (NOAMe) G1 less inside G3 & G5 rather similar G4 forced inside in MD and stayed there Distorted host

Results R2 much worse 0.0 to 0.1 70 60 OAH OAMe partly owing to forced binding of G4 (host distortion E missing) Calc. (kj/mol) 50 40 30 20 10 0-10 -45 MAD better 14 17 kj/mol G3 & G5 too strong G1 too weak -40-35 -30-25 -20-15 -10-5 Exp. (kj/mol) NOAH NOAMe MADtr 18 27 R2 0.34 0.37 MDH MDMe 17 14-0.02-0.08

Conclusions MD Promising approach More black-box Gives a precision of 1 ( 5) kj/mol Trying better methods for optimisation

Improved Optimisation Methods MD snapshot Vacuum opt. COSMO opt. COSMO + 4 Water

Conclusions Poor results Partly owing to the use of vacuum structures Possibly additional un-recovered problems (unexperienced PhD) Flexibility a major problem Optimisation time-consuming (~1 month) Need much nursing MD approach may solve some problems

Worth Investigating Alternative QM/MM FEP with MM reference potential Latest results with SAMPL4 data QM systems of 158 224 atoms NOA and PM6-DH2X 720 000 QM calculations are needed for convergence to 1 kj/mol DFT optimisation ~200 QM calculations 1/4000!

Acknowledgements Octav Caldararu Martin Olsson Dr. Christoph Riplinger Prof. Frank Neese Swedish Research Council Lunarc HPC2N

Submitted Data We submitted three sets of data 1. Charged host with DFT 2. Neutral host with DFT 3. Neutral host with DLPNO-CCSD(T)//DFT OAH, not rigid energies (missing at submission time) With guest relaxation energies (suboptimal) Error in isolated G2 energy MD not finished in time No Coulombic correction

MM-MD structures G1 G1 carboxylate actually inside host

MM-MD structures OAMe G1 G2 G1 carboxylate more exposed G2 more up-right

LCCSD(T) PMISP Tried to improve the DFT-D3 results with Local CCSD(T) calculations Hampel & Werner, J Chem Phys 104 (1996) 6286 Employing the PMISP approach Polarised Multipolar Interactions with Supermolecular Pairs Söderhjelm & Ryde, J Phys Chem A 113 (2009) 617 Based on the Cos structures solvation energies, and thermal corrections Replaced the TPSS/def2-QZVP'/TZVP+DFT-D3 energies with LCCSD(T) energies

LCCSD(T) calculations Together with Prof. Ricardo Mata, Göttingen cc-pvtz basis set Extrapolated to basis-set limit at conventional MP2 level with aug-cc-pvtz and aug-cc-pvqz basis sets and n 3 scheme Full counter-poise corrections Density fitting Pipek Mezey localisation

PMISP Etot = Eele + Eind + Eother < Eother = Σ cj(eqm(bai) Eele(BAi) Eind(BAi) MM (NEMO) electrostatics and induction QM calculations on Guest (B) and Host fragments (Aj) (to get multipole expansion and polarisabilities) and all BAi pairs (to get dispersion, repulsion, penetration, CT,... = other )

PMISP details Multipoles up to octupoles and anisotropic polarisabilities in atomic centres and all bond midpoints Calculated at the MP2/cc-pVTZ level Interaction energies between guest and host calculated for guest+fragment at the LCCSD(T)/CBS level Many-body effects treated at the MM (NEMO) level with the multipoles and polarisabilities

Fractionation Host fractionated according to the MFCC approach Molecular Fractionation with Conjugate Caps Adapted to the complicated host 24 fragments 32 conjugate fragments 4 doubly-conjugate fragments

Results CCSD(T) correction rather large and negative 27 (Hx) to 54 (mclbz) kj/mol Bz MeBz EtBz pclbz mclbz Hx MeHx Pen Hep All quality measures significantly worse Constant shift + exaggerate differences Submitted absolute affinities with one outlier Cos -17-36 -38-39 -35-17 -44-3 -31 CC -64-79 -83-85 -90-44 -79-35 -60 corr -47-42 -45-46 -54-27 -35-32 -29 Cos MAD 9 MADtr 8 R2 0.7 t 0.7 t90 0.8 PI 0.9 RMSD 10 MSD -5 slope 2 intercept 21 CC 48 24 0.4 0.3 0.3 0.5 50-37 4 41 0-10 -20 Calculated (kj/mol) -30-40 -50-60 -70-80 -90-100 -30-25 -20 Experimental (kj/mol) -15

Conclusions MM FEP acceptable results Not much better than for proteins Best MAD = 6.0 kj/mol for 91 transformations in 10 proteins No difference between RESP & BCC charges DFT-FEP has severe convergence problems Poor Optimised DFT intermediate performance Intermediate Severe problems with flexibility Hard to improve LCCSD(T) possible for ligand binding with PMISP Worst But poor results why?

Structures different (NOAH) G1 deeper G2 similar G3 much more outside G4 similar G5 much more outside G6 other orientation

Structures different (NOAMe) G1 less inside G2 similar G3 rather similar G4 forced inside G5 similar G6 similar