Experimental phasing in Crank2

Similar documents
Experimental phasing in Crank2

research papers Reduction of density-modification bias by b correction 1. Introduction Pavol Skubák* and Navraj S. Pannu

Recent developments in Crank. Leiden University, The Netherlands

Likelihood and SAD phasing in Phaser. R J Read, Department of Haematology Cambridge Institute for Medical Research

SHELXC/D/E. Andrea Thorn

Web-based Auto-Rickshaw for validation of the X-ray experiment at the synchrotron beamline

Maximum Likelihood. Maximum Likelihood in X-ray Crystallography. Kevin Cowtan Kevin Cowtan,

ACORN - a flexible and efficient ab initio procedure to solve a protein structure when atomic resolution data is available

PAN-modular Structure of Parasite Sarcocystis muris Microneme Protein SML-2 at 1.95 Å Resolution and the Complex with 1-Thio-β-D-Galactose

Experimental Phasing with SHELX C/D/E

Practical aspects of SAD/MAD. Judit É Debreczeni

Phaser: Experimental phasing

Determination of the Substructure

CCP4 Diamond 2014 SHELXC/D/E. Andrea Thorn

Macromolecular Phasing with shelxc/d/e

SOLVE and RESOLVE: automated structure solution, density modification and model building

Structure solution from weak anomalous data

ACORN in CCP4 and its applications

The SHELX approach to the experimental phasing of macromolecules. George M. Sheldrick, Göttingen University

Biology III: Crystallographic phases

The application of multivariate statistical techniques improves single-wavelength anomalous diffraction phasing

Direct Methods and Many Site Se-Met MAD Problems using BnP. W. Furey

MR model selection, preparation and assessing the solution

Direct-method SAD phasing with partial-structure iteration: towards automation

Anomalous dispersion

Experimental phasing, Pattersons and SHELX Andrea Thorn

Scattering by two Electrons

Molecular Replacement (Alexei Vagin s lecture)

Protein Crystallography

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

ANODE: ANOmalous and heavy atom DEnsity

research papers HKL-3000: the integration of data reduction and structure solution from diffraction images to an initial model in minutes

The Crystallographic Process

short communications Novel approach to phasing proteins: derivatization by short cryo-soaking with halides

The Crystallographic Process

Overview - Macromolecular Crystallography

Direct Method. Very few protein diffraction data meet the 2nd condition

Phase Improvement by Multi-Start Simulated Annealing Re nement and Structure-Factor Averaging

ANODE: ANOmalous and heavy atom DEnsity

Twinning. Andrea Thorn

research papers Liking likelihood 1. Introduction Airlie J. McCoy

Computational aspects of high-throughput crystallographic macromolecular structure determination

Kevin Cowtan, York The Patterson Function. Kevin Cowtan

Cover Page. The handle holds various files of this Leiden University dissertation.

Automated Protein Model Building with ARP/wARP

Crystal lattice Real Space. Reflections Reciprocal Space. I. Solving Phases II. Model Building for CHEM 645. Purified Protein. Build model.

Patterson Methods

X-ray Crystallography. Kalyan Das

Protein Crystallography Part II

Phase problem: Determining an initial phase angle α hkl for each recorded reflection. 1 ρ(x,y,z) = F hkl cos 2π (hx+ky+ lz - α hkl ) V h k l

Non-merohedral Twinning in Protein Crystallography

Charles Ballard (original GáborBunkóczi) CCP4 Workshop 7 December 2011

Anisotropy in macromolecular crystal structures. Andrea Thorn July 19 th, 2012

Protein crystallography. Garry Taylor

Twinning in PDB and REFMAC. Garib N Murshudov Chemistry Department, University of York, UK

Removing bias from solvent atoms in electron density maps

research papers Protein crystal structure solution by fast incorporation of negatively and positively charged anomalous scatterers

Molecular Biology Course 2006 Protein Crystallography Part II

Fast, Intuitive Structure Determination IV: Space Group Determination and Structure Solution

Author's personal copy

Acta Crystallographica Section F

PHENIX Wizards and Tools

Supporting Information. Synthesis of Aspartame by Thermolysin : An X-ray Structural Study

Image definition evaluation functions for X-ray crystallography: A new perspective on the phase. problem. Hui LI*, Meng HE* and Ze ZHANG

This is an author produced version of Privateer: : software for the conformational validation of carbohydrate structures.

PSD '17 -- Xray Lecture 5, 6. Patterson Space, Molecular Replacement and Heavy Atom Isomorphous Replacement

Molecular replacement. New structures from old

Macromolecular Crystallography Part II

radiation damage Identification of the point of diminishing returns in high-multiplicity data collection for sulfur SAD phasing 1.

Supplemental Data. Structure of the Rb C-Terminal Domain. Bound to E2F1-DP1: A Mechanism. for Phosphorylation-Induced E2F Release

Schematic representation of relation between disorder and scattering

Statistical density modification with non-crystallographic symmetry

research papers 1. Introduction Thomas C. Terwilliger a * and Joel Berendzen b

MRSAD: using anomalous dispersion from S atoms collected at CuKffwavelength in molecular-replacement structure determination

Principles of Protein X-Ray Crystallography

Fan, Hai-fu Institute of Physics, Chinese Academy of Sciences, Beijing , China

Electronic Supplementary Information (ESI) for Chem. Commun. Unveiling the three- dimensional structure of the green pigment of nitrite- cured meat

Application of the complex multivariate normal distribution to crystallographic methods with insights into multiple isomorphous replacement phasing

Garib N Murshudov MRC-LMB, Cambridge

Proteins. Central Dogma : DNA RNA protein Amino acid polymers - defined composition & order. Perform nearly all cellular functions Drug Targets

Electron Density at various resolutions, and fitting a model as accurately as possible.

S-SAD and Fe-SAD Phasing using X8 PROTEUM

for Automated Macromolecular Crystal Structure Solution

Using Solvent Halide Ions in Protein Structure Determination


Structural Basis for Methyl Transfer by a Radical SAM Enzyme

The Phase Problem of X-ray Crystallography

Structure factors again

Supporting Information

SUPPLEMENTARY INFORMATION

University of Groningen

Institute of Physics, Prague 6, Cukrovarnická street

Shake-and-Bake: Applications and Advances. Russ Miller & Charles M. Weeks

A tutorial for learning and teaching macromolecular crystallography

Introduction to single crystal X-ray analysis

Protein Crystallography. Mitchell Guss University of Sydney Australia

Chapter 2 Kinematical theory of diffraction

Molecular Biology Course 2006 Protein Crystallography Part I

Linking data and model quality in macromolecular crystallography. Kay Diederichs

Transcription:

Experimental phasing in Crank2 Pavol Skubak and Navraj Pannu Biophysical Structural Chemistry, Leiden University, The Netherlands http://www.bfsc.leidenuniv.nl/software/crank/

Crank2 for experimental phasing Crank2 is suitable for SAD, MAD and SIRAS. Crank2 has been shown to be particularly powerful weak anomalous signal to low resolution SAD datasets (4.5 Angstroms)

Simultaneously combining experimental phasing steps to improve structure solution Traditionally structure solution is divided into distinct steps: Substructure detection Obtain initial phases (Density) modify the initial experimental map Build and refine the model By combining these steps, we can improve the process.

Traditional structure solution Traditional approach Step-wise Information is propagated via phase probabilities Experimental data, anomalous substructure Experimental phasing Phases, phase probabilities Density modification, phase combination Phases, phase probabilities Model building and refinement Model

Phase probabilities P ( α) = exp( Acos( α) + Bsin( α) + C cos(2α ) + Dsin(2α )) The phase distribution can be approximated via 4 Hendrickson-Lattmann coefficients, A, B, C, D. We rely on programs to estimate these coefficients. Even better than the real thing?

Kevin Cowtan, cowtan@ysbl.york.ac.uk Density modification 2. Phase weighting: F, φ FFT ρ(x) φ=f(φ exp,φ mod ) Modify ρ F mod, φ mod FFT -1 ρ mod (x) Reciprocal space Real space

Hendrickson-Lattmann (HL) propagation and the independence assumption Use the experimental data and anomalous substructure directly! Do not need to assume independence or rely on HL coefficients. Need multivariate distributions at each step that take into account correlations between the model and data.

Traditional approach Step-wise multivariate structure solution Still step-wise Information is propagated via the data and model(s). Experimental data, anomalous substructure Experimental phasing Phases Density modification, phase combination Phases Model building and refinement Model

What are step-wise multivariate functions? The multivariate functions consider all the correlations between the data and the model together. Earlier methods neglected correlations and took into account, for example, DANO (DeltaF) instead of F+ and F-. By taking into account of F+ and F-, we can explicitly consider the measurement error we determine.

Combined structure solu9on Simultaneously use informa9on from experimental phasing, density modifica9on and model refinement Experimental data, anomalous substructure Combined experimental phasing, phase combina9on and model refinement Phases Electron density Model Density modifica9on Model building

The combined multivariate function The combined multivariate function consider all the correlations between the data and the substructure, (partially-built) model and density modified phases.

Tests of > 140 real SAD data sets Resolution range of data sets is 0.94 to 3.88 Angstroms Types of anomalous scatterers: selenium, sulfur, chloride, iodide, bromide, calcium, zinc (and others). We compare with the step wise multivariate approach (current CRANK) versus the combined approach.

Model building results on over 140 real SAD data sets (using parrot and buccaneer)

Summary of large scale test The average fraction of the model built increased from 60% to 74% with the new approach. If we exclude data sets built to 85% by the current approach or where the substructure was not found, 45 data sets remain and the average fraction of the model built increased from 28 to 77%.

3.88 Angstrom RNA polymerase II 3.88 Angstrom SAD data with signal from zinc. Authors could not solve the structure with SAD data alone, but with a partial model, multi-crystal MAD and manual building. > 80% can be built with SAD data alone with the new algorithm automatically to an R-free of 37.6%

Related RNA polymerases complex 3.3 Angstrom data with signal from zinc. Could not solve the structure with anomalous data alone. With the new method, a majority can be built automatically in minutes.

Crank1 vs Crank2 Crank1 and Crank2 are suitable for S/MAD and S/MIRAS experiments and implements multivariate functions. Crank1 and Crank2 are available in ccp4 6.5 and Crank2 is available in ccp4 7.0 with new gui.

Important parameters in substructure detection The number of cycles run. The number of atoms to search for. Should be within 10-20% of actual number A first guess uses a probabilistic Matthew s coefficient The resolution cut-off: For MAD, look at signed anomalous difference correlation. For SAD, a first guess is 0.5 + high resolution limit.

Is my map good enough? Statistics from substructure phasing: Look at FOM. Refined occupancies. Statistics from density modification: Compare the contrast from hand and enantiomorph (output of solomon or shelxe). Does it look like a protein? (model visualization) For Crank2, look to see if R-comb < 40%.

Bias reduction in density modification Density modified map is obtained from experimental map leading to artificially high correlations between the observed and modified amplitudes. β correction is applied to the Luzzati error parameter to reduce bias of modified data.

FOM and phase error after DM with/ without bias reduction

When do you use MR-SAD? If you have a molecular replacement solution, but you can not obtain a complete model. If you have an anomalous signal (look at CCano), you can use this to improve your model.

What do you need to consider? Your partial model may not contain all the anomalous scatterers and you will need to find the rest using anomalous maps. Your partial MR model may be biased and that can affect your ability to complete your model. Should you truncate your model to only what you believe is correctly modelled??

SAD and SIRAS functions in model refinement Previous functions in REFMAC: No prior phase information (Rice function) (Murshudov et al.,1997), (Bricogne and Irwin, 1996), (Pannu and Read, 1996) Prior phase information used indirectly in the form of Hendrickson-Lattman coefficients (MLHL) (Pannu et al., 1998)

Tests of SAD and SIRAS functions in refinement The functions were tested on many real data sets (various phasing signals and resolution ranges) in ARP/wARP + REFMAC. Skubak et al. (2004,2005,2009) Acta D.

Results from SAD function RICE MLHL SAD Green circle: 100 70% built Yellow circle: 70 50% built Red circle: 50 20% built Black circle: 20 0% built

MR-SAD tests 4.5 Angstrom ATPase SecA-SecY complex 4.5 Angstrom data set with weak anomalous signal from Se-Met SecY. Authors could not solve the structure with SAD data alone, but used a partial MR structure, (2-fold NCS averaging), cross-crystal averaging, and manual model building > 60% can be built automatically starting just from selenium positions (obtained from partial MR solution) R-free obtained was under 40%.

4.6 Angstrom SKI2-3-8 complex 4.6 Angstrom data set Se-Met data set. > 50% can be built automatically starting just from selenium positions. R-free obtained was under 40%.

Conclusions from MR-SAD At the moment, if a (partial) MR solution is available, it is best to run two crank2 jobs and manually combine runs: Input the whole MR solution Input just the heavy atoms

References l Crank l Ness et al (2004) Structure 12, 1753-1761. l Pannu et al (2011) Acta Cryst D67, 331-337. l l Combined approach and Crank2 l Skubak and Pannu (2013) Nature Communications 4: 2777. Using data directly in refinement l Skubak et al (2004) Acta Cryst D60, 2196-2201. l Skubak et al (2009) Acta Cryst D65, 1051-1061. l Multivariate phase combination l Waterreus et al (2010) Acta Cryst D66, 783-788.

Acknowledgements All dataset contributors Garib Murshudov, Kevin Cowtan, George Sheldrick, Victor Lamzin http://www.bfsc.leidenuniv.nl/software/crank/ Cyttron