Experimental phasing in Crank2

Similar documents
Experimental phasing in Crank2

research papers Reduction of density-modification bias by b correction 1. Introduction Pavol Skubák* and Navraj S. Pannu

Recent developments in Crank. Leiden University, The Netherlands

Likelihood and SAD phasing in Phaser. R J Read, Department of Haematology Cambridge Institute for Medical Research

SHELXC/D/E. Andrea Thorn

Determination of the Substructure

Web-based Auto-Rickshaw for validation of the X-ray experiment at the synchrotron beamline

Anomalous dispersion

Scattering by two Electrons

CCP4 Diamond 2014 SHELXC/D/E. Andrea Thorn

Experimental Phasing with SHELX C/D/E

Protein Crystallography

Phaser: Experimental phasing

Practical aspects of SAD/MAD. Judit É Debreczeni

ACORN - a flexible and efficient ab initio procedure to solve a protein structure when atomic resolution data is available

Macromolecular Phasing with shelxc/d/e

Experimental phasing, Pattersons and SHELX Andrea Thorn

Biology III: Crystallographic phases

The SHELX approach to the experimental phasing of macromolecules. George M. Sheldrick, Göttingen University

ANODE: ANOmalous and heavy atom DEnsity

PAN-modular Structure of Parasite Sarcocystis muris Microneme Protein SML-2 at 1.95 Å Resolution and the Complex with 1-Thio-β-D-Galactose

Protein crystallography. Garry Taylor

ANODE: ANOmalous and heavy atom DEnsity

X-ray Crystallography. Kalyan Das

Direct Methods and Many Site Se-Met MAD Problems using BnP. W. Furey

Structure solution from weak anomalous data

Overview - Macromolecular Crystallography

The application of multivariate statistical techniques improves single-wavelength anomalous diffraction phasing

Direct Method. Very few protein diffraction data meet the 2nd condition

Maximum Likelihood. Maximum Likelihood in X-ray Crystallography. Kevin Cowtan Kevin Cowtan,

Crystal lattice Real Space. Reflections Reciprocal Space. I. Solving Phases II. Model Building for CHEM 645. Purified Protein. Build model.

MR model selection, preparation and assessing the solution

Charles Ballard (original GáborBunkóczi) CCP4 Workshop 7 December 2011

Direct-method SAD phasing with partial-structure iteration: towards automation

Phase problem: Determining an initial phase angle α hkl for each recorded reflection. 1 ρ(x,y,z) = F hkl cos 2π (hx+ky+ lz - α hkl ) V h k l

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

SOLVE and RESOLVE: automated structure solution, density modification and model building

Patterson Methods

ACORN in CCP4 and its applications

research papers 1. Introduction Thomas C. Terwilliger a * and Joel Berendzen b

The Crystallographic Process

PSD '17 -- Xray Lecture 5, 6. Patterson Space, Molecular Replacement and Heavy Atom Isomorphous Replacement

Materials 286C/UCSB: Class VI Structure factors (continued), the phase problem, Patterson techniques and direct methods

Twinning. Andrea Thorn

BCM Protein crystallography - II Isomorphous Replacement Anomalous Scattering and Molecular Replacement Model Building and Refinement

Protein Crystallography Part II

Molecular replacement. New structures from old

research papers HKL-3000: the integration of data reduction and structure solution from diffraction images to an initial model in minutes

Molecular Biology Course 2006 Protein Crystallography Part II

Kevin Cowtan, York The Patterson Function. Kevin Cowtan

Cover Page. The handle holds various files of this Leiden University dissertation.

Electron Density at various resolutions, and fitting a model as accurately as possible.

Anisotropy in macromolecular crystal structures. Andrea Thorn July 19 th, 2012

Molecular Biology Course 2006 Protein Crystallography Part I

short communications Novel approach to phasing proteins: derivatization by short cryo-soaking with halides

Fan, Hai-fu Institute of Physics, Chinese Academy of Sciences, Beijing , China

Proteins. Central Dogma : DNA RNA protein Amino acid polymers - defined composition & order. Perform nearly all cellular functions Drug Targets

Principles of Protein X-Ray Crystallography

X-ray Crystallography

What is the Phase Problem? Overview of the Phase Problem. Phases. 201 Phases. Diffraction vector for a Bragg spot. In General for Any Atom (x, y, z)

Acta Crystallographica Section F

S-SAD and Fe-SAD Phasing using X8 PROTEUM

Structure factors again

Macromolecular Crystallography Part II

Theory of X-ray diffraction

The Crystallographic Process

Molecular Replacement (Alexei Vagin s lecture)

research papers Liking likelihood 1. Introduction Airlie J. McCoy

This is an author produced version of Privateer: : software for the conformational validation of carbohydrate structures.

Protein Crystallography. Mitchell Guss University of Sydney Australia

Garib N Murshudov MRC-LMB, Cambridge

research papers Protein crystal structure solution by fast incorporation of negatively and positively charged anomalous scatterers

Supporting Information. Synthesis of Aspartame by Thermolysin : An X-ray Structural Study

PHENIX Wizards and Tools

Author's personal copy

Summary: Crystallography in a nutshell. Lecture no. 4. (Crystallography without tears, part 2)

Protein Structure Determination 9/25/2007

Application of the complex multivariate normal distribution to crystallographic methods with insights into multiple isomorphous replacement phasing

Phase Improvement by Multi-Start Simulated Annealing Re nement and Structure-Factor Averaging

Electronic Supplementary Information (ESI) for Chem. Commun. Unveiling the three- dimensional structure of the green pigment of nitrite- cured meat

Image definition evaluation functions for X-ray crystallography: A new perspective on the phase. problem. Hui LI*, Meng HE* and Ze ZHANG

Diffraction Geometry

Crystals, X-rays and Proteins

SUPPLEMENTARY INFORMATION

Fast, Intuitive Structure Determination IV: Space Group Determination and Structure Solution

Chapter 2 Kinematical theory of diffraction

Computational aspects of high-throughput crystallographic macromolecular structure determination

SUPPLEMENTARY INFORMATION

The Phase Problem of X-ray Crystallography

Macromolecular X-ray Crystallography

Schematic representation of relation between disorder and scattering

General theory of diffraction

Supplementary Material (ESI) for CrystEngComm This journal is The Royal Society of Chemistry 2010

Removing bias from solvent atoms in electron density maps

Experimental Phasing of SFX Data. Thomas Barends MPI for Medical Research, Heidelberg. Conclusions

shelxl: Refinement of Macromolecular Structures from Neutron Data

Automated Protein Model Building with ARP/wARP

Supplemental Data. Structure of the Rb C-Terminal Domain. Bound to E2F1-DP1: A Mechanism. for Phosphorylation-Induced E2F Release

Protein Structure Determination. Part 1 -- X-ray Crystallography

Transcription:

Experimental phasing in Crank2 Pavol Skubak and Navraj Pannu Biophysical Structural Chemistry, Leiden University, The Netherlands http://www.bfsc.leidenuniv.nl/software/crank/

X-ray structure solution pipeline Data collection Data processing Experimental phasing Molecular Replacement Density Modification Model building Refinement Rebuilding Validation

Crank2 for experimental phasing Crank2 is suitable for SAD, MAD and SIRAS. Crank2 has been shown to be particularly powerful weak anomalous signal to low resolution SAD datasets (4.5 Angstroms)

Single isomorphous replacement A native data set from a crystal that just contains your macromolecule. A derivative data set A macromolecular crystal that has been soaked with a heavy atom.

Phase problem Acentric Centric F p (h) F p (h) F p (h) Argand diagram F p (h) is the observed structure factor amplitude.

What is a centric reflection? For certain reflections and some space groups, you can get phase restrictions. For example, suppose if you have space group operators (x,y,z) and (-x,-y,-z): F(h) = Ncell f j exp(2πi h. x j ) F(h) = Nasu f j exp(2πi h. x j ) + f j exp(-2πi h. x j ) F(h) = 2 Nasu f j cos(2π h. x j )

Phase information from (errorfree) isomorphous replacement Acentric Two solutions Centric Single solution F P (h) F PH (h) F H (h) F PH (h) F PH (h) F H (h) F P (h) F P (h) F PH (h) : derivative observed amplitude F H (h): heavy atom structure factor F PH (h) = F H (h) + F P (h) (vector addition)

What are some of the errors in isomorphous replacement? Sometimes heavy atoms induce changes in the unit cell or changes in the macromolecule (non-isomorphism). The scale between native and derivative data sets. Errors in the heavy atom positions. Errors in the measurement of the amplitudes. But, heavy atoms could be easier than growing Se-met protein!

Effect of changing the wavelength: Anomalous scattering F + (h) = (f j + f j + i f jʺ) exp(2πi h. x j ) F - (h) = (f j + f j - i f jʺ) exp(2πi h. x j )

Single-wavelength Anomalous Diffraction Solving structures using Friedel pairs collected at one wavelength from a crystal that contains an anomalous substructure.

Solving phases: using the anomalous signal Anomalous scattering arises due to a phaseshift of X-rays diffracted by tightly bound electrons The strength of the anomalous effect depends on the wavelength of the X-rays Anomalous scattering causes very small changes in intensity between F(+) and F(-) F + (h) = (f j + f jʹ + i f jʺ) exp(2πi h. x j ) F - (h) = (f j +f j ' - i f jʺ) exp(2πi h. x j )

Simultaneously combining experimental phasing steps to improve structure solution Traditionally structure solution is divided into distinct steps: Substructure detection Obtain initial phases (Density) modify the initial experimental map Build and refine the model By combining these steps, we can improve the process.

Traditional structure solution Traditional approach Step-wise Information is propagated via phase probabilities Experimental data, anomalous substructure Experimental phasing Phases, phase probabilities Density modification, phase combination Phases, phase probabilities Model building and refinement Model

Phase probabilities P ( α) = exp( Acos( α) + Bsin( α) + C cos(2α ) + Dsin(2α )) The phase distribution can be approximated via 4 Hendrickson-Lattmann coefficients, A, B, C, D. We rely on programs to estimate these coefficients. Even better than the real thing?

Density modification 2. Phase weighting: F, φ FFT ρ(x) φ=f(φ exp,φ mod ) Modify ρ F mod, φ mod FFT -1 ρ mod (x) Reciprocal space Real space Kevin Cowtan, cowtan@ysbl.york.ac.uk

HL propagation and the independence assumption Use the experimental data and anomalous substructure directly! Do not need to assume independence or rely on HL coefficients. Need multivariate distributions at each step that take into account correlations between the model and data.

Traditional approach Step-wise multivariate structure solution Still step-wise Information is propagated via the data and model(s). Experimental data, anomalous substructure Experimental phasing Phases Density modification, phase combination Phases Model building and refinement Model

What are step-wise multivariate functions? The multivariate functions consider all the correlations between the data and the model together. Earlier methods neglected correlations and took into account, for example, DANO (DeltaF) instead of F+ and F-. By taking into account of F+ and F-, we can explicitly consider the measurement error we determine.

Combined structure solution Simultaneously use information from experimental phasing, density modification and model refinement Experimental data, anomalous substructure Combined experimental phasing, phase combination and model refinement Phases Electron density Model Density modification Model building

The combined multivariate function The combined multivariate function consider all the correlations between the data and the substructure, (partially-built) model and density modified phases.

Tests of > 140 real SAD data sets Resolution range of data sets is 0.94 to 3.88 Angstroms Types of anomalous scatterers: selenium, sulfur, chloride, iodide, bromide, calcium, zinc (and others). We compare with the step wise multivariate approach (current CRANK) versus the combined approach.

Model building results on over 140 real SAD data sets (using parrot and buccaneer)

Summary of large scale test The average fraction of the model built increased from 60% to 74% with the new approach. If we exclude data sets built to 85% by the current approach or where the substructure was not found, 45 data sets remain and the average fraction of the model built increased from 28 to 77%.

3.88 Angstrom RNA polymerase II 3.88 Angstrom SAD data with signal from zinc. Authors could not solve the structure with SAD data alone, but with a partial model, multi-crystal MAD and manual building. > 80% can be built with SAD data alone with the new algorithm automatically to an R-free of 37.6%

Related RNA polymerases complex from Cramer et al. 3.3 Angstrom data with signal from zinc. Could not solve the structure with anomalous data alone. With the new method, a majority can be built automatically in minutes.

Crank1 vs Crank2 Crank1 and Crank2 are suitable for S/MAD and S/MIRAS experiments and implements multivariate functions. Crank1 and Crank2 are available in ccp4 6.5 and Crank2 is available in ccp4 7.0 with gui ccp4i, ccp4i2 and ccp4online

Important parameters in substructure detection The number of cycles run. The number of atoms to search for. Should be within 10-20% of actual number A first guess uses a probabilistic Matthew s coefficient The resolution cut-off: For MAD, look at signed anomalous difference correlation. For SAD, a first guess is 0.5 + high resolution limit.

Is my map good enough? Statistics from substructure phasing: Look at FOM. Refined occupancies. Statistics from density modification: Compare the contrast from hand and enantiomorph (output of solomon or shelxe). Does it look like a protein? (model visualization) For Crank2, look to see if R-comb < 40%.

Bias reduction in density modification Density modified map is obtained from experimental map leading to artificially high correlations between the observed and modified amplitudes. β correction is applied to the Luzzati error parameter to reduce bias of modified data.

FOM and phase error after DM with/ without bias reduction

Combining molecular replacement and anomalous scattering (MR-SAD) When do you use MR-SAD? If you have a partial model (usually obtained by molecular replacement) and anomalous diffraction. Two approaches implemented in Crank2 Use only the substructure model only Use the substructure and partial model

The two pipelines

MR-SAD data sets Dataset 2 NA 3.2 Se 378 60.8 12.9 1.7 Final PDB code Resolution (Å) Anomalous Scatterer Number of residues Correct MR Residues (%) Incorrect MR Residues (%) RMSD correct residues (Å) Dataset 1 NA 3.6 Se 800 42.5 23.5 1.6 GPCR ECR-Mb AAA- ATPase F 1 - ATPase SecYEG- SecA 5kvm 3.0 I 459 49.7 11.7 1.5 4d80 3.6 Se 1776 75.2 22.1 1.7 2w6f 3.0 S, P 3587 46.7 2.3 0.9 3din 4.5 Se 2886 47.9 40.7 1.7

MR-SAD results: R-free values Molecular replacement solution Substructureonly pipeline Rebuild pipeline Dataset 1 49.8 32.6 29.8 Dataset 2 53.7 28.9 32.3 GPCR ECR-Mb 48.6 39.1 38.4 AAA-ATPase 47.5 39.0 40.9 F 1 -ATPase 46.5 34.8 33.8 SecYEG-SecA 51.8 39.9 39.6

References l Crank l Ness et al (2004) Structure 12, 1753-1761. l Pannu et al (2011) Acta Cryst D67, 331-337. l l Combined approach and Crank2 l Skubak and Pannu (2013) Nature Communications 4: 2777. l Skubak et al (2018) IUCrJ (available online) Using data directly in refinement l Skubak et al (2004) Acta Cryst D60, 2196-2201. l Skubak et al (2009) Acta Cryst D65, 1051-1061. l Multivariate phase combination l Waterreus et al (2010) Acta Cryst D66, 783-788.

Acknowledgements All dataset contributors Garib Murshudov, Kevin Cowtan, George Sheldrick, Victor Lamzin http://www.bfsc.leidenuniv.nl/software/crank/ Cyttron