Structure Refinement with Cryo-EM Density Maps

Similar documents
Overview & Applications. T. Lezon Hands-on Workshop in Computational Biophysics Pittsburgh Supercomputing Center 04 June, 2015

A New Approach to EM of Helical Polymers Yields New Insights

Single-particle cryo-electron microscopy is a powerful

Molecular Dynamics Flexible Fitting

Direct Method. Very few protein diffraction data meet the 2nd condition

Deformable elastic network refinement for low-resolution macromolecular crystallography

Image Assessment San Diego, November 2005

ParM filament images were extracted and from the electron micrographs and

Supplementary Information Energy barriers and driving forces of trna translocation through the ribosome

Summary of Experimental Protein Structure Determination. Key Elements

Introduction to Three Dimensional Structure Determination of Macromolecules by Cryo-Electron Microscopy

X-ray Crystallography I. James Fraser Macromolecluar Interactions BP204

There and back again A short trip to Fourier Space. Janet Vonck 23 April 2014

Opportunity and Challenge: the new era of cryo-em. Ning Gao School of Life Sciences Tsinghua University

Crystal lattice Real Space. Reflections Reciprocal Space. I. Solving Phases II. Model Building for CHEM 645. Purified Protein. Build model.

Supporting Online Material for

RNA protects a nucleoprotein complex against radiation damage

Sample Preparation. Holger Stark MPI for biophysical Chemistry Göttingen Germany

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

type GroEL-GroES complex. Crystals were grown in buffer D (100 mm HEPES, ph 7.5,

Protein Simulations in Confined Environments

Molecular Modeling lecture 2

7.91 Amy Keating. Solving structures using X-ray crystallography & NMR spectroscopy

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Protein Structure Determination Using NMR Restraints BCMB/CHEM 8190

Lecture CIMST Winter School. 1. What can you see by TEM?

Electron microscopy in molecular cell biology II

Understanding protein motions by computational modeling and statistical approaches

History of 3D Electron Microscopy and Helical Reconstruction

HADDOCK: High Ambiguity

PROTEIN'STRUCTURE'DETERMINATION'

Ensemble refinement of protein crystal structures in PHENIX. Tom Burnley Piet Gros

Introduction to" Protein Structure

Experimental Techniques in Protein Structure Determination

Small-Angle Scattering Atomic Structure Based Modeling

Biological Small Angle X-ray Scattering (SAXS) Dec 2, 2013

X- ray crystallography. CS/CME/Biophys/BMI 279 Nov. 12, 2015 Ron Dror

Model building and validation for cryo-em maps at low resolution

Improving cryo-em maps: focused 2D & 3D classifications and focused refinements

Application examples of single particle 3D reconstruction. Ning Gao Tsinghua University

Supplementary Information: Long range allosteric regulation of the human 26S proteasome by 20S proteasome-targeting cancer drugs

Part 1 X-ray Crystallography

wwpdb X-ray Structure Validation Summary Report

CS273: Algorithms for Structure Handout # 13 and Motion in Biology Stanford University Tuesday, 11 May 2003

1) NMR is a method of chemical analysis. (Who uses NMR in this way?) 2) NMR is used as a method for medical imaging. (called MRI )

SUPPLEMENTARY INFORMATION

NMR, X-ray Diffraction, Protein Structure, and RasMol

Protein Structure Determination Using NMR Restraints BCMB/CHEM 8190

Interpreting and evaluating biological NMR in the literature. Worksheet 1

Multi-Scale Hierarchical Structure Prediction of Helical Transmembrane Proteins

X-ray Crystallography. Kalyan Das

Modelling against small angle scattering data. Al Kikhney EMBL Hamburg, Germany

Principles of Physical Biochemistry

Theory and Applications of Residual Dipolar Couplings in Biomolecular NMR

I690/B680 Structural Bioinformatics Spring Protein Structure Determination by NMR Spectroscopy

Processing Heterogeneity

Conformational Geometry of Peptides and Proteins:

SUPPLEMENTARY INFORMATION

Crystallographic Conformers of Actin in a Biologically Active Bundle of Filaments

Large Macromolecular Assemblies (LMAs) Large Macromolecular Assemblies (LMAs) Large Macromolecular Assemblies (LMAs)

Tools for Cryo-EM Map Fitting. Paul Emsley MRC Laboratory of Molecular Biology

EXTENDING THE BOUNDARIES OF STRUCTURAL MODELING EPIGENETIC EFFECTS ON NUCLEOSOME POSITIONING

Modeling Biological Systems Opportunities for Computer Scientists

Computational Studies of the Photoreceptor Rhodopsin. Scott E. Feller Wabash College

PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS

SUPPLEMENTARY INFORMATION. doi: /nature07461

Prediction and refinement of NMR structures from sparse experimental data

Full wwpdb X-ray Structure Validation Report i

Measurement of Dynamic trna Movement and Visualizing Ribosome Biogenesis Applications of Time resolved Electron Cryo microscopy

Full wwpdb/emdatabank EM Map/Model Validation Report i

Protein crystallography. Garry Taylor

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

Studying conformational dynamics and molecular recognition using integrated structural biology in solution Michael Sattler

Secondary and sidechain structures

Procheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics.

Large Macromolecular Assemblies (LMAs)

The living cell is a society of molecules: molecules assembling and cooperating bring about life!

Computational Modeling of Protein Dynamics. Yinghao Wu Department of Systems and Computational Biology Albert Einstein College of Medicine Fall 2014

Template Free Protein Structure Modeling Jianlin Cheng, PhD

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

Full wwpdb X-ray Structure Validation Report i

Computational Modeling of Protein Kinase A and Comparison with Nuclear Magnetic Resonance Data

Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University

A conserved P-loop anchor limits the structural dynamics that mediate. nucleotide dissociation in EF-Tu.

Chemical properties that affect binding of enzyme-inhibiting drugs to enzymes

SUPPLEMENTARY INFORMATION

Electronic Supplementary Information (ESI) for Chem. Commun. Unveiling the three- dimensional structure of the green pigment of nitrite- cured meat

Three-Dimensional Electron Microscopy of Macromolecular Assemblies

SUPPLEMENTARY FIGURES. Structure of the cholera toxin secretion channel in its. closed state

Molecular mechanism of selective transport across the Nuclear Pore Complex

Normal Mode Analysis. Jun Wan and Willy Wriggers School of Health Information Sciences & Institute of Molecular Medicine University of Texas Houston

fitting of atomic structures Interpretation of 3D EM maps, opf Maya T ! & ' +*. +*2 +*+, +*+0 *$+ + / /* +** % % % % - &- '

Crystal Structure Prediction using CRYSTALG program

Full wwpdb X-ray Structure Validation Report i

Proteins are not rigid structures: Protein dynamics, conformational variability, and thermodynamic stability

Protein Structures: Experiments and Modeling. Patrice Koehl

Protein Structure Prediction

Protein structure analysis. Risto Laakso 10th January 2005

Relative binding enthalpies from molecular dynamics simulations. using a direct method

Magnetic Resonance Lectures for Chem 341 James Aramini, PhD. CABM 014A

Transcription:

Structure Refinement with Cryo-EM Density Maps Gunnar F. Schröder Computational Structural Biology Group Institute of Complex Systems - Structural Biochemistry (ICS-6) Forschungszentrum Jülich and Heinrich-Heine Universität Düsseldorf

Structure Refinement at Low-resolution Assume: We have a starting structure (crystal structure in different conformation or good homology model) Standard refinement yields a bad structure How to make use of prior structural information during the refinement? 10 Å

The required X-ray resolution (determinacy point) depends on the number of degrees of freedom and the solvent fraction! Degrees of Freedom & N/N res S (Solvent Volume Fraction) 0.5 0.6 0.7 All Atoms with H atoms 48 2.3 Å 2.5 Å 2.8 Å All Atoms no H atoms 24 2.9 Å 3.2 Å 3.5 Å All (!,",#) Torsions 4 5.3 Å 5.8 Å 6.3 Å All (!,") Torsions 2 6.7 Å 7.3 Å 8.0 Å All ($) Torsions 1 8.5 Å 9.13 Å 10.1 Å

Deformable Elastic Network (DEN) Refine only those degrees of freedom that need to be refined to fit the data, but not more. Find only the relevant degrees of freedom for which the data actually provide information Schröder, Brunger & Levitt, Structure (2007) 15:1630 Schröder, Levitt & Brunger, Nature (2010) 464:1218-1222

Deformable Elastic Network randomly chosen distance restraints deformable distance restraints nothing to do with normal modes The weight wden and the γ-parameter control the flexibility of the restraints ETarget = EXray + EChem + wdeneden

Effect of the γ-parameter RMSD Reference model increasing deformability Experimental restraints

Application of DEN to Reciprocal-space Structure Refinement using X-ray Diffraction Data DEN method implemented in CNS (v1.3) and Phenix (v.1.8)

Real-space Refinement Efficient geometry-based conformational sampling Cross-correlation coefficient between model and target density map is optimized Cross-validation DEN restraints - no normal modes!! Symmetry restraints Distance restraints - no coarse-graining (although possible) Positions restraints Accurate modeling of electron scattering Bulk solvent model Overall B-factor optimization http://simtk.org/home/direx/ http://www.schroderlab.org/software/direx/

DireX Example: Elongation Factor 2 (EF-2) initial final Cα-RMSD 13.6 Å 0.8 Å 6300 atoms, 14 steps/min (3.5 hrs for 3000 steps) synthetic 8Å density map

DireX Example: Elongation Factor 2 (EF-2) initial final Cα-RMSD 13.6 Å 0.8 Å 6300 atoms, 14 steps/min (3.5 hrs for 3000 steps) synthetic 8Å density map

There are 3 main forces on atoms in DireX: 1) Concoord restraints: maintain correct stereochemistry (bond lengths, planarity, etc. ) and prevent atom overlaps 2) DEN restraints: control the deviation from a reference model 3) Density restraints: fit model into density

DireX: Geometry-based conformational sampling based on CONCOORD B.L. de Groot, et al. Proteins 29: 240-251 (1997) 1. Initial model

DireX: Geometry-based conformational sampling based on CONCOORD B.L. de Groot, et al. Proteins 29: 240-251 (1997) bonds 1. Initial model 2. Generate list of distance restraints (intervals) angles vdw

DireX: Geometry-based conformational sampling based on CONCOORD B.L. de Groot, et al. Proteins 29: 240-251 (1997) 1. Initial model 2. Generate list of distance restraints (intervals) 3. Perturb coordinates

DireX: Geometry-based conformational sampling based on CONCOORD B.L. de Groot, et al. Proteins 29: 240-251 (1997) 1. Initial model 2. Generate list of distance restraints (intervals) 3. Perturb coordinates 4. use CONCOORD algorithm to obtain a new structure which also obeys all distance restraints CONCOORD: correct distances iteratively in a random order

DireX: Geometry-based conformational sampling based on CONCOORD B.L. de Groot, et al. Proteins 29: 240-251 (1997) Iteration CONCOORD Random walk through conformational space while maintaining correct stereochemistry and avoiding atom clashes

DireX: Geometry-based conformational sampling based on CONCOORD B.L. de Groot, et al. Proteins 29: 240-251 (1997) Iteration CONCOORD Random walk through conformational space while maintaining correct stereochemistry and avoiding atom clashes

DireX: Forces derived from a density map ρ difference (x) = ρ target (x) ρ model (x) Stochastic gradient to move atoms into high difference-density regions Target density Model density Difference Density map Atom Search radius For each atom: average over 10 randomly chosen vectors weighted by density difference

Difference between MDFF and DireX Low-resolution data are (obviously) missing high-resolution information. The difference is where this missing information comes from: MDFF uses MD force field, i.e. predicts missing information DireX takes missing information from crystal structure (or other reference model)

Ribosome trna translocation 2 million single-particle images sorted into 50 conformational substates Resolution 8-15 Å In collaboration with Holger Starkʼs lab (MPI Biophysical Chemistry, Göttingen) Fischer, Konevega, Wintermeyer, Rodnina & Stark (2010) Nature 466: 329-333

Ribosome trna translocation Each state was refined separately using DireX Large subunit Small subunit trna trna 24 states shown Bock, et al., Nat. Struct. Mol. Biol. (2013)

MD Simulation of Conformational Transitions......... Image classification can yield conformational states that are sufficiently similar to describe the transitions between these states by molecular dynamics simulations In collaboration with Holger Stark and Helmut Grubmüller (MPI Biophysical Chemistry, Göttingen) Bock, et al., Nat. Struct. Mol. Biol. (2013)

Ion Channel Gating The cyclic-nucleotide activated Mlok1 K + channel The channel opens upon binding of camp to the CNBD outside CNBD inside cyclic-nucleotide binding domain Cryo-EM structure at 7Å (12 Å vertical) in collaboration with H.Stahlberg (Biozentrum Basel) and C.Nimigean (Cornell) Kowal et al., Nat. Comm. (2013), accepted

Ion Channel Gating Open and closed conformation of the CNBD determined by NMR in the group of Dieter Willbold -camp +camp Schünke, et al. PNAS (2011) in collaboration with H.Stahlberg (Biozentrum Basel) and C.Nimigean (Cornell) Kowal et al., Nat. Comm. (2013), accepted

Ion Channel Gating The cyclic-nucleotide activated Mlok1 K + channel channel closed outside inside -camp in collaboration with H.Stahlberg (Biozentrum Basel) and C.Nimigean (Cornell) Kowal et al., Nat. Comm. (2013), accepted

Ion Channel Gating The cyclic-nucleotide activated Mlok1 K + channel channel open outside inside +camp in collaboration with H.Stahlberg (Biozentrum Basel) and C.Nimigean (Cornell) Kowal et al., Nat. Comm. (2013), accepted

Ion Channel Gating Transition between open and closed channel conformation membrane membrane Voltage sensor domain CNBD

Cross-validation At low resolution overfitting becomes a serious problem. Standard procedure in X-ray refinement (Brunger, 1992): Split data set (randomly) into two sets: ʻworkʼ set and ʻtestʼ set 95% 5% Refinement is done only with work set. Test set data are only used for computing Rfree R = " h If difference between Rfree and Rwork gets too large, model is overfitted. measured amplitudes F obs (h)! F calc (h) " h F obs (h)! amplitudes calculated from model

Cross-validation Exclude part of the data that is not used for refinement, but only for validation ( test set ). Neighboring Fourier components are correlated in EM densities, therefore define high-resolution shell as free set 1 8Å 6Å Cfree: density cross-correlation between model and target free maps Fourier shell correlation 0.8 0.6 0.4 0.2 work free Falkner and Schröder, PNAS (2013) 0 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 Spatial frequency (1/Å)

Cross-validation EM density Model density Density cross-correlation: Work interval (here 7-200 Å) -> Cwork Free interval (here 5-7 Å) -> Cfree

Cross-validation Starting from correct structure (1ikn) 0.98 0.979 0.978 Cwork 1 0.8 0.6 0.4 0.2 Synthetic density map with added noise FSC (0.5) ~11 Å Map Correlation 0.977 0.976 0.975 0.974 0.62 0.6 0.5 1 1.5 2 2.5 3 Cfree 0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16 0.58 Free resolution range: 10-13 Å 0.56 0.54 1 RMSD (Å) 2 3 0.5 1 1.5 2 2.5 3

1ake at ~11 Å from Benchmark set by Topf et al. (Structure 16: 295, 2008) initial RMSD 3.6 Å refined 1.5 Å free resolution range 10-13 Å RMSD (Å) Cwork (%) Cfree (%) 0.98 0.60 3 0.97 0.59 2.5 2 no-den DEN 0.96 0.95 0.58 0.57 0.56 1.5 0 200 400 600 800 1000 0.94 0.55 0 200 400 600 800 1000 0 200 400 600 800 1000 steps steps steps

Backbone Tracing at Low Resolution (4 6 Å) Treat tracing as a global optimization problem Place Beads Randomly into the Density Map Refine Bead Positions Solve the Traveling Salesman Problem: find shortest path such that each bead is visited exactly one time Lin-Kernighan algorithm + secondary structure restraints + statistical residue pair potential Refine Backbone Trace (with DireX + Secondary Structure Restraints)

Additional Restraints on the Traveling Salesman Problem The Miyazawa-Jernigan potential is a statistical potential which favors contacts of amino acids that are frequently observed to be in contact in the PDB: E MJ = i<j M(a i,a j )D ij Dij is the contact matrix between amino acids of types ai and aj. M is the weight according the observed frequency of the ai,aj pair. Secondary structure prediction yields restraints on the distances between amino acids that are within the same secondary structure element: E SSE = (d ij d seq ij )2 The tracing algorithm then optimizes the ETotal E Total = E Lin-Kernighan + E MJ + E SSE

Calmodulin Backbone Trace at Different Resolutions 4 Å resolution 5 Å resolution 6 Å resolution 7 Å resolution Test with synthetic (perfect) density maps at different resolutions 10 traces were generated for each resolution For all resolution the correct topology was found Map Resolution RMSD 4 Å 6.4 Å 5 Å 7.5 Å 6 Å 4.7 Å 7 Å 8.1 Å

Sampling the backbone conformations with DireX DireX does not require a complete input model Ca-trace can be extensively sampled (simulated annealing) Distance restraints can impose secondary structure information

Iho670 adhesion filaments from the Ignicoccus hospitalis EM reconstruction at 4 5 Å ~75% of trace complete Sequence assignment in progress In collaboration with Ed Egelman (U Virginia), Reinhard Rachel and Reinhard Wirth (U Regensburg)

... Heterogeneity and Flexibility in single-particle Cryo-EM... - Heterogeneity severely limits the resolution - Advantage of Cryo-EM: all information is in the particle images (but difficult to extract due to noise). - Goal: determine conformational variance AND improve resolution

Extracting Conformational Dynamics Equilibrium Fluctuations Single-particle images (18,168)... Bootstrapping... 100 bootstrapped maps Penczek, Yang, Frank & Spahn (2006) J. Struct. Biol. 154:168-183

GroEL at 8 Å (GroEL Aacpn10 ADP) In collaboration with Wah Chiu (Baylor College) and Hays Rye (Texas A&M) Chen, Luke, Zhang, et al. JMB (2008)

GroEL - Principal Component Analysis of the ensemble fitted models: 1. Eigenvector - lock-in of GroES - upward motion of cis-ring apical domains - rotations of trans-ring apical domains side view

GroEL - Principal Component Analysis of the ensemble fitted models: 1. Eigenvector - lock-in of GroES - upward motion of cis-ring apical domains - rotations of trans-ring apical domains top view on GroES

GroEL - Principal Component Analysis of the ensemble fitted models: 1. Eigenvector - lock-in of GroES - upward motion of cis-ring apical domains - rotations of trans-ring apical domains bottom view on trans-apical domain

Comparing GroEL/ES in two nucleotide states R3ATP - R4ADP GroES GroEL-cis-apical GroEL-trans-apical Chen, Madan, Weaver, Lin, Schröder, Chiu, Rye, Cell (2013) Ranson, et al. Nat. Struct. Mol. Biol. (2006)

Image Sorting is key to achieving high resolution (not necessarily a large number of particles) Images used for reconstruction need to show the molecule in the same conformation! Image Sorting reveals different conformational states

Standard image classification sorts images according to density similarity. But: Conformational Variance is not the same as Density Variance Ongoing Work - Use principal conformational motions to sort images into classes - Iteratively determine residual conformational dynamics in subclasses of images for further

Refinement of generic bead models DireX can refine any generic geometric model you do not need a crystal structure to determine principal motions - Use program beadgen to generate a bead model from a density map - Refine bead model to different density maps Models refined to 100 bootstrapped maps

Acknowledgements Group Members Benjamin Falkner André Wildberg Zhe Wang Kumaran Baskaran Dennis Della Corte Amudha Duraisamy Michaela Spiegel Lena Möhlenkamp Robin Pauli Baylor College of Medicine Junjie Zhang Dong-Hua Chen Wah Chiu Forschungszentrum Jülich Dieter Willbold (ICS-6) Karl-Erich Jaeger (IMET) Jan Marienhagen (IBG-1) Biozentrum Basel Henning Stahlberg University of Virginia Ed Egelman Vitold Galkin Stanford University Michael Levitt Axel Brunger Max-Planck-Institute Göttingen Holger Stark Niels Fischer Jülich Supercomputing Centre (JSC) 5.9 Petaflops