Ensemble refinement of protein crystal structures in PHENIX. Tom Burnley Piet Gros

Similar documents
Summary of Experimental Protein Structure Determination. Key Elements

X-ray Crystallography I. James Fraser Macromolecluar Interactions BP204

The structure of Aquifex aeolicus FtsH in the ADP-bound state reveals a C2-symmetric hexamer

Q1 current best prac2ce

RNA protects a nucleoprotein complex against radiation damage

Table 1. Crystallographic data collection, phasing and refinement statistics. Native Hg soaked Mn soaked 1 Mn soaked 2

Full wwpdb X-ray Structure Validation Report i

Electronic Supplementary Information (ESI) for Chem. Commun. Unveiling the three- dimensional structure of the green pigment of nitrite- cured meat

Automated identification of functional dynamic contact networks from X-ray crystallography

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Anisotropy in macromolecular crystal structures. Andrea Thorn July 19 th, 2012

Full wwpdb X-ray Structure Validation Report i

Computational engineering of cellulase Cel9A-68 functional motions through mutations in its linker region. WT 1TF4 (crystal) -90 ERRAT PROVE VERIFY3D

X-ray Crystallography

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Molecular Biology Course 2006 Protein Crystallography Part II

Direct Method. Very few protein diffraction data meet the 2nd condition

Macromolecular Crystallography Part II

Model and data. An X-ray structure solution requires a model.

Protein Crystallography Part II

Crystal lattice Real Space. Reflections Reciprocal Space. I. Solving Phases II. Model Building for CHEM 645. Purified Protein. Build model.

Plasmid Relevant features Source. W18N_D20N and TrXE-W18N_D20N-anti

Molecular Replacement. Airlie McCoy

Modelling against small angle scattering data. Al Kikhney EMBL Hamburg, Germany

Data quality indicators. Kay Diederichs

PHENIX Wizards and Tools

Likelihood and SAD phasing in Phaser. R J Read, Department of Haematology Cambridge Institute for Medical Research

Full wwpdb X-ray Structure Validation Report i

SUPPLEMENTARY INFORMATION

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Full wwpdb X-ray Structure Validation Report i

Structure Investigation of Fam20C, a Golgi Casein Kinase

Supplementary Figures

HOMOLOGY MODELING. The sequence alignment and template structure are then used to produce a structural model of the target.

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB

Macromolecular X-ray Crystallography

TLS and all that. Ethan A Merritt. CCP4 Summer School 2011 (Argonne, IL) Abstract

Supporting Information

8. Strategies for Macromolecular Refinement

Linking data and model quality in macromolecular crystallography. Kay Diederichs

SUPPLEMENTARY INFORMATION. doi: /nature07461

Understanding protein motions by computational modeling and statistical approaches

7.91 Amy Keating. Solving structures using X-ray crystallography & NMR spectroscopy

Overview & Applications. T. Lezon Hands-on Workshop in Computational Biophysics Pittsburgh Supercomputing Center 04 June, 2015

Molecular Dynamics. Molecules in motion

Macromolecular Crystallography Part II

Ensemble of Climate Models

Full wwpdb X-ray Structure Validation Report i

Pipelining Ligands in PHENIX: elbow and REEL

Proteins. Central Dogma : DNA RNA protein Amino acid polymers - defined composition & order. Perform nearly all cellular functions Drug Targets

Details of Protein Structure

Protein crystallography. Garry Taylor

Phase problem: Determining an initial phase angle α hkl for each recorded reflection. 1 ρ(x,y,z) = F hkl cos 2π (hx+ky+ lz - α hkl ) V h k l

Full wwpdb X-ray Structure Validation Report i

wwpdb X-ray Structure Validation Summary Report

BIOINF 4371 Drug Design 1 Oliver Kohlbacher & Jens Krüger

NMR, X-ray Diffraction, Protein Structure, and RasMol

*Corresponding Author *K. F.: *T. H.:

Refinement of Disorder with SHELXL

Molecular replacement. New structures from old

Introduction to" Protein Structure

research papers Iterative-build OMIT maps: map improvement by iterative model building and refinement without model bias 1.

Native State of Complement Protein C3d Analyzed via Hydrogen Exchange and Conformational Sampling

GC376 (compound 28). Compound 23 (GC373) (0.50 g, 1.24 mmol), sodium bisulfite (0.119 g,

Structural and mechanistic insight into the substrate. binding from the conformational dynamics in apo. and substrate-bound DapE enzyme

Protein Structure Determination Using NMR Restraints BCMB/CHEM 8190

Structure factors again

SUPPLEMENTARY INFORMATION

MD Simulation in Pose Refinement and Scoring Using AMBER Workflows

APPENDIX E. Crystallographic Data for TBA Eu(DO2A)(DPA) Temperature Dependence

SUPPLEMENTARY INFORMATION

SOCS3 binds specific receptor JAK complexes to control cytokine signaling by direct kinase inhibition SUPPLEMENTARY INFORMATION

Acta Cryst. (2017). D73, doi: /s

SUPPLEMENTARY INFORMATION

ABSTRACT. Chong Zhang

Improving Protein Function Prediction with Molecular Dynamics Simulations. Dariya Glazer Russ Altman

Stabilizing the CH2 domain of an Antibody by Engineering in an Enhanced Aromatic Sequon

Supporting Information

X-ray Diffraction. Diffraction. X-ray Generation. X-ray Generation. X-ray Generation. X-ray Spectrum from Tube

Structural basis for catalytically restrictive dynamics of a high-energy enzyme state

SUPPLEMENTARY INFORMATION

Biophysical Journal Volume 96 June

ENZYME MECHANISMS, PROTEASES, STRUCTURAL BIOLOGY

Determining Protein Structure BIBC 100

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Physiochemical Properties of Residues

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

Determination of the structure and dynamics of proteins using NMR chemical shifts (CS) and CS enhanced protein data bank (CS-PDB)

SHELXC/D/E. Andrea Thorn

Experimental Techniques in Protein Structure Determination

Experimental and Computational Mutagenesis to Investigate the. Positioning of a General Base within an Enzyme Active Site

Imaging and monitoring with industrial seismic noise.

Supporting Information

Supplementary Figure 1. Proposed mechanism for AusE, PrhA, and these mutants. (a) 5 is desaturated

Unfolding CspB by means of biased molecular dynamics

IgE binds asymmetrically to its B cell receptor CD23

Protein Simulations in Confined Environments

SUPPLEMENTARY INFORMATION

Transcription:

Ensemble refinement of protein crystal structures in PHENIX Tom Burnley Piet Gros

Incomplete modelling of disorder contributes to R factor gap Only ~5% of residues in the PDB are modelled with more than one conforma=on (x- ray structures) Mul=ple discrete models restricted due to increase in number of model parameters Vitkup et al., (2002) Lang et al., (2010)

Incomplete modelling of disorder contributes to R factor gap Only ~5% of residues in the PDB are modelled with more than one conforma=on (x- ray structures) Mul=ple discrete models restricted due to increase in number of model parameters Molecular dynamics simula=ons produce a Boltzmann- weighted popula=on of inter- related structures MD simula=ons can be restrained with x- ray data

Simulated Annealing/MD refinement Geometric restraints: E bond E angle E dih X- ray restraint: E = X ray whkl ( Fobs( hkl) k Fcalc( hkl) ) hkl 2 Simula,on temperature >2000K Trajectory resolves local minima Final model = end structure Brunger et al., 1987

Time- averaged MD refinement Geometric restraints: E bond E angle E dih X- ray restraint: E X ray = w hkl ( F obs (hkl) k F calc (hkl) ) 2 hkl Sampling F calc t = (1 e Δt/τ x )F t calc + e Δt/τ x F calc t Δt Gros et al., 1990

Time- averaged MD refinement Geometric restraints: E bond E angle E dih X- ray restraint: E X ray = w hkl ( F obs (hkl) k F calc (hkl) ) 2 hkl Running average Simula,on temperature = 300K Final model = trajectory ensemble F calc t = (1 e Δt/τ x )F t calc + e Δt/τ x F calc t Δt

Ensemble refinement with TA restraints (1) (2) (1) Start: Tradi.onal structure (2) Fit TLS / remove alt. conf.

Ensemble refinement with TA restraints (1) (2) (3) (1) Start: Tradi.onal structure (2) Fit TLS / remove alt. conf. (3) TA restrained MD simula.on Collect structure / 0.04ps X- ray restraints accelerate sampling

Ensemble refinement with TA restraints (1) (2) (3) (1) Start: Tradi.onal structure (2) Fit TLS / remove alt. conf. (3) TA restrained MD simula.on (4) Final ensemble (4)

Molecular disorder / la@ce disorder Protein with local disorder Perfect crystal laoce

Molecular disorder / la@ce disorder Protein with local disorder Perfect crystal laoce 'Real' crystal laoce Averaged data

Molecular disorder / la@ce disorder Protein with local disorder Perfect crystal laoce 'Real' crystal laoce Averaged data Deconvolute: molecular disorder from laoce disorder

Rigid body disorder modelled with TLS LaOce distor.ons, domain mo.ons... Model rigid body mo=ons with TLS model

Rigid body disorder modelled with TLS Extract core rigid body mo=on by excluding atoms with large local fluctua=ons (defined as devia=ons from B TLS ).

Local disorder sampled within restrained MD LaOce distor.ons, domain mo.ons... Model rigid body mo=ons with TLS model 20 parameters per group 1 group / domain Fit TLS to star=ng structure B- factors Side- chains, loops... Local disorder sampled in MD MD simula=on restrained with X- ray data

Composite B- factor sum of TLS and atomic fluctuajons TLS Core TLS model RMSF Atomic fluctua=ons in ensemble TLS+RMSF TLS RMSF TLS+RMSF Start Total disorder in ensemble Start Input single structure ADPs

Dual explicit- bulk solvent model - Explicit solvent - Model with explicit atoms - Water picked every 250 steps standard rules: > 3 σ in difference map - < 3 Å distances B- factor from nearest TLS group

Dual explicit- bulk solvent model - Explicit solvent - Model with explicit atoms - Water picked every 250 steps standard rules: > 3 σ in difference map - < 3 Å distances B- factor from nearest TLS group - Bulk solvent - Model with density mask F mask t = (1 e Δt/τ x )F t mask + e Δt/τ x F mask t Δt

Development of ensemble refinement PHENIX Time- averaged x- ray restrained MD TLS fiong Explicit- and bulk- solvent model Maximum- likelihood target func.on

Development of ensemble refinement Tested with 20 datasets Resolu.on: 1-3 Å ASU size: 50-1000 residues CPU.me: 7-100 hours 50 500 models / ensemble

Ensemble refinement reduces Rfree Rfree: ensemble vs phenix.refine Rfree reduced in all cases - 4.9% (max) - 0.3% (min) - 1.8% (mean) Rf/Rw ra=o (mean): = 1.23 phenix.refine = 1.25 ensemble

Ensemble refinement reduces Rfree Rfree: ensemble vs phenix.refine Rfree reduced in all cases - 4.9% (max) - 0.3% (min) - 1.8% (mean) Rf/Rw ra=o (mean): = 1.23 phenix.refine = 1.25 ensemble Rfree: ensemble vs PDB_REDO - 5.1% (max) + 0.6% (min) - 2.1% (mean) Joosten et. al (2011)

B factor 5-20Ų 1uoy.pdb 188 ensemble 40ps acquisi.on.me 1 tls group

B factor 5-20Ų 1uoy.pdb 188 ensemble 40ps acquisi.on.me 1 tls group

1uoy.pdb phenix.refine 1 tls group mfo- DFc ±0.49 e/å³ (3.00 σ)

1uoy.pdb 188 ensemble 1 tls group mfo- DFc ±0.49 e/å³ (4.27 σ)

Improved real- space correla=on Burling et al. (1996): Excellent experimentally phased data for MBP: 1YTT (1.8- Å res.) Ensemble Phenix.refine B- factor

Disordered side chain in MBP (1YTT) Gln167 (A) Glu117 (A) Phenix.refine Diff. vector map (3σ) Ensemble Diff. vector map (3σ)

Anisotropic side chain in MBP (1YTT) Burling et al. (1996) Phe121 (A) Experimental map (1.4σ) Mul.- conformer (Rfree 20.3%) Ensemble (Rfree 17.4%)

Diffuse solvent in MBP (1YTT) Burling et al. (1996) Published: Experimental map (1.4σ and 0.7σ)

Diffuse solvent in MBP (1YTT) Burling et al. (1996) Published: Experimental map (1.4σ and 0.7σ) Ensemble: Experimental map (1.4 σ) (0.7 σ)

Explicit waters in MBP (1YTT)

How similar are NCS copies? 1IEP (2.1- Å res.)

NCS copies show similar distribu=ons 1IEP (2.1- Å res.) - Global TLS model accounts for differences in packing of copies - Local fluctua=ons are similar between the two copies

1M52 (2.6- Å res.)

2XFA (2.1- Å res.)

Isomorphous crystals at 100 and 288 K 3K0N & 3K0M: Proline isomerase (Cyclophilin A) Fraser et al. (2009)

Mul=- conformers in ac=ve site 3K0M (1.3- Šres.) 100 K Leu98 Ser99 3K0N (1.4- Šres.) 288 K Phe113 Leu98 Ensemble refinement: 1.3% gain Rfree A:B = 2:1 Ser99 Phe113 Fraser et al. (2009), Eisenmesser et al. (2005) B- factor 5-25Ų 2.6% gain Rfree

Occupancies agree with NMR data NMR relaxa=on dispersion (283 K) L98, S99, F113 Minor popula=on ~10% S99 L98 3K0N & 3K0M: Proline isomerase Fraser et al. (2009), Eisenmesser et al. (2005) F113 Ensemble (288 K) B- factor 5-25Ų

Ima=nib- ABL Tyrosine Kinase (1IEP) Nagar et al. (2002)

Ima=nib- ABL Tyrosine Kinase (1IEP)

Ima=nib- ABL Tyrosine Kinase (1IEP) Nagar et al (2002), Johnson (2009)

Inhibitor binding to HIV protease apo- protease (2PC0) complex with JE- 2147 (1KZK) Compara=ve analysis of atomic flexibility: - Isomorphous crystals eliminate differences due to Xtal contacts - Non- isomorphous crystals allow evalua=on of Xtal contact effects

Conclusions Global disorder modeled by TLS and local disorder by MD Ensemble refinement improves Rfree and electron density maps Suitable for a broad resolu=on range (1Å 3Å) NCS copies show very similar fluctua=ons Clear representa=on of local disorder / uncertainty Distribu=on of atom posi=ons allows further structural analysis Resolve the finer details of protein structure(s)

Acknowledgements Piet Gros Gros Laboratory Pavel Afonine, Paul Adams PHENIX Developers Funding: Utrecht University / NWO

Number structures in ensemble vs resolujon

Number of structures in the ensemble Number of structures Averaging =me τ x Structures taken equidistant in =me to reproduce the Rwork within 0.1%

R- factor by resolu=on shell 1UOY

Par=al ligand/ion binding B- factor of Cd 2+ - ion more than twice its surrounding Simula=on run at Q=1 for Cd 2+ - ion (atoms coloured by kine=c energy) - High B- factor ligand/ion may indicate par=al occupancy

Par=al ligand/ion binding Cd 2+ : Q=1 Cd 2+ : Q=0.7 - Heat map (kine=c energy) is valida=on tool for Ensemble Refinement

Ensemble Ensemble Single- structure

Geometric valida=on Bonds Angles Dih. angles Single structure 0.010 Å 1.23 15.1 Ens: (x ideal - x model ) 2-0.002-0.28-6.5 Ens: (x ideal - x model ) 2 +0.002 +0.30 +4.0 (sta=s=cs for all 20 cases)

Ramachandran analysis 1UOY ( 1.5- Å res.) Outliers Allowed Favoured

Outliers occur more in flexible regions X- ray ensemble 1BV1 (2.0- Å res.) NMR ensemble

Ramachandran devia=ons by resolu=on Single structure Ensemble modal Ensemble mean Outliers Allowed - More outliers are observed at lower resolu=on - Geometric quality correlates with Rfree - Best run selected by Rfree

Ramachandran devia=ons by resolu=on Single structure Ensemble modal Ensemble mean Outliers Allowed - More outliers are observed at lower resolu=on - Geometric quality correlates with Rfree - Best run selected by Rfree