Determination of the Substructure

Similar documents
Macromolecular Phasing with shelxc/d/e

SHELXC/D/E. Andrea Thorn

Protein Crystallography

Scattering by two Electrons

Anomalous dispersion

Experimental Phasing with SHELX C/D/E

Practical aspects of SAD/MAD. Judit É Debreczeni

Experimental phasing, Pattersons and SHELX Andrea Thorn

CCP4 Diamond 2014 SHELXC/D/E. Andrea Thorn

Experimental phasing in Crank2

Fast, Intuitive Structure Determination IV: Space Group Determination and Structure Solution

Molecular Biology Course 2006 Protein Crystallography Part I

The SHELX approach to the experimental phasing of macromolecules. George M. Sheldrick, Göttingen University

Patterson Methods

Biology III: Crystallographic phases

What is the Phase Problem? Overview of the Phase Problem. Phases. 201 Phases. Diffraction vector for a Bragg spot. In General for Any Atom (x, y, z)

Direct Method. Very few protein diffraction data meet the 2nd condition

Likelihood and SAD phasing in Phaser. R J Read, Department of Haematology Cambridge Institute for Medical Research

BC530 Class notes on X-ray Crystallography

Non-merohedral Twinning in Protein Crystallography

The Phase Problem of X-ray Crystallography

X-ray Crystallography. Kalyan Das

Direct Methods and Many Site Se-Met MAD Problems using BnP. W. Furey

Structure factors again

Structure solution from weak anomalous data

BCM Protein crystallography - II Isomorphous Replacement Anomalous Scattering and Molecular Replacement Model Building and Refinement

Crystal lattice Real Space. Reflections Reciprocal Space. I. Solving Phases II. Model Building for CHEM 645. Purified Protein. Build model.

Crystals, X-rays and Proteins

Molecular Biology Course 2006 Protein Crystallography Part II

Resolution: maximum limit of diffraction (asymmetric)

PAN-modular Structure of Parasite Sarcocystis muris Microneme Protein SML-2 at 1.95 Å Resolution and the Complex with 1-Thio-β-D-Galactose

ANODE: ANOmalous and heavy atom DEnsity

Experimental phasing in Crank2

General theory of diffraction

Protein crystallography. Garry Taylor

ANODE: ANOmalous and heavy atom DEnsity

Macromolecular X-ray Crystallography

Protein Structure Determination 9/25/2007

Handout 7 Reciprocal Space

Protein Crystallography Part II

X-ray Crystallography

Electron Density at various resolutions, and fitting a model as accurately as possible.

Basic Crystallography Part 1. Theory and Practice of X-ray Crystal Structure Determination

Fan, Hai-fu Institute of Physics, Chinese Academy of Sciences, Beijing , China

X-ray Diffraction. Diffraction. X-ray Generation. X-ray Generation. X-ray Generation. X-ray Spectrum from Tube

Phaser: Experimental phasing

Materials 286C/UCSB: Class VI Structure factors (continued), the phase problem, Patterson techniques and direct methods

Supporting Information

Principles of Protein X-Ray Crystallography

PSD '17 -- Xray Lecture 5, 6. Patterson Space, Molecular Replacement and Heavy Atom Isomorphous Replacement

Space Group & Structure Solution

Twinning. Andrea Thorn

Phase problem: Determining an initial phase angle α hkl for each recorded reflection. 1 ρ(x,y,z) = F hkl cos 2π (hx+ky+ lz - α hkl ) V h k l

SOLVE and RESOLVE: automated structure solution, density modification and model building

Shake-and-Bake: Applications and Advances. Russ Miller & Charles M. Weeks

ACORN - a flexible and efficient ab initio procedure to solve a protein structure when atomic resolution data is available

Data processing and reduction

Web-based Auto-Rickshaw for validation of the X-ray experiment at the synchrotron beamline

CS273: Algorithms for Structure Handout # 13 and Motion in Biology Stanford University Tuesday, 11 May 2003

Scattering Lecture. February 24, 2014

Protein Crystallography. Mitchell Guss University of Sydney Australia

Tutorial on how to solve a Se-substructure using

3.012 Fund of Mat Sci: Structure Lecture 18

Cover Page. The handle holds various files of this Leiden University dissertation.

Roger Johnson Structure and Dynamics: X-ray Diffraction Lecture 6

Analytical Methods for Materials

Anisotropy in macromolecular crystal structures. Andrea Thorn July 19 th, 2012

SOLID STATE 9. Determination of Crystal Structures

SI Text S1 Solution Scattering Data Collection and Analysis. SI references

Linking data and model quality in macromolecular crystallography. Kay Diederichs

Two Lectures in X-ray Crystallography

SOLID STATE 18. Reciprocal Space

Macromolecular Crystallography Part II

The application of multivariate statistical techniques improves single-wavelength anomalous diffraction phasing

Crystal planes. Neutrons: magnetic moment - interacts with magnetic materials or nuclei of non-magnetic materials. (in Å)

Diffraction Geometry

Keble College - Hilary 2012 Section VI: Condensed matter physics Tutorial 2 - Lattices and scattering

6. X-ray Crystallography and Fourier Series

Overview - Macromolecular Crystallography

Theory of X-ray diffraction

Direct-method SAD phasing with partial-structure iteration: towards automation

X-ray analysis. 1. Basic crystallography 2. Basic diffraction physics 3. Experimental methods

Applications of X-ray and Neutron Scattering in Biological Sciences: Symmetry in direct and reciprocal space 2012

Image definition evaluation functions for X-ray crystallography: A new perspective on the phase. problem. Hui LI*, Meng HE* and Ze ZHANG

Practical applications of synchrotron radiation in the determination of bio-macromolecule three-dimensional structures. M. Nardini and M.

Resolution and data formats. Andrea Thorn

GBS765 Electron microscopy

ACORN in CCP4 and its applications

PROBING CRYSTAL STRUCTURE

Synthetic, Structural, and Mechanistic Aspects of an Amine Activation Process Mediated at a Zwitterionic Pd(II) Center

research papers Reduction of density-modification bias by b correction 1. Introduction Pavol Skubák* and Navraj S. Pannu

research papers 1. Introduction Thomas C. Terwilliger a * and Joel Berendzen b

shelxl: Refinement of Macromolecular Structures from Neutron Data

Supplementary materials. Crystal structure of the carboxyltransferase domain. of acetyl coenzyme A carboxylase. Department of Biological Sciences

Molecular replacement. New structures from old

There and back again A short trip to Fourier Space. Janet Vonck 23 April 2014

Professor G N Ramachandran's Contributions to X-ray Crystallography

The Reciprocal Lattice

Scattering and Diffraction

Transcription:

Monday, June 15 th, 2009 Determination of the Substructure EMBO / MAX-INF2 Practical Course http://shelx.uni-ac.gwdg.de

Overview Substructure Definition and Motivation Extracting Substructure Data from measured Data Substructure Solution Experimental Considerations Overview 1/25

Substructure Definition and Motivation Motivation 2/25

Substructure: What is it? The Substructure of a (crystal-) structure are the coordinates of a small subset of atoms within the same unit cell. A substructure is not a physically existing crystal structure, but a theoretical concept used in phasing of the macromolecule. Motivation 3/25

Substructure: Why is it so important? If we know the coordinates of the substructure, we can solve the phase problem for the whole structure. Motivation 4/25

Procedure (Overview) xprep, shelxc,... extract substructure data Karle, Hendrickson Smith, Sheriff equation shelxd calculate substructure coordinates Direct Methods shelxe calculate substructure phases Inverse Fourier shelxe calculate and refine phases for remaining atoms Harker Diagram Motivation 5/25

Motivation: Benefit of Substructure for Macromolecules (1/2) A major problem in macromolecular crystallography is the phase problem: A single diffraction experiment delivers the amplitude F, but not the phase φ of the structure factors for each measured reflection (hkl). Therefore we cannot directly calculate the electron density from the measured data. ρ(x, y, z) = 1 V cell h,k,l F (h, k, l) e iφ(h,k,l) e 2πi(hx+ky+lz) (1) Motivation 6/25

Motivation: Benefit of Substructure for Macromolecules (2/2) This is different in the small molecule world. A structure with not too many atoms (< 1000 non-hydrogen atoms) can be solved from a single data set with so-called ab initio methods. The term direct methods encompasses all ab initio methods that derive the reflection phases from the amplitudes using probabilistic phase relations usually the tangent formula. Once we know the substructure, the phases for the reflections of the real data can be determined at least sufficiently accurately to calculate an interpretable electron density map. Motivation 7/25

Extracting Substructure Data from measured Data Substructure Data 8/25

Going backwards: Amplitudes from Coordinates Inversely to the calculation of the electron density ρ(x, y, z) from the structure factors F (h, k, l) by the Fourier transformation (Eq. 1), we can calculate F (h, k, l) from knowing the positions (x, y, z) and types of all atoms within the unit cell: F (h, k, l) f j e 2πi(hx+ky+lz) (2) f j is the atomic scattering factor specific to each atom type (C, N, P, etc.). In the presence of anomalous scattering, f j splits into a normal part, only dependent on the scattering angle θ and two anomalous parts, only dependent on the wavelength λ: fj A = f j (θ) + f j (λ) + if j (λ) Substructure Data 9/25

The Phase Diagram for SAD Since Eq. 2 is a simple sum, one can group it into sub-sums. The graphical representation of this grouping is the phase diagram on the right. In the case of SAD the following grouping has turned out to be useful: F (h) = f µ e 2πihr µ } non-anomalous {{} F P + f ν e 2πihr ν } substructure {{} F A + (f τ + if τ )e 2πihr τ substructure Im(F ) F + P F + F + A α F + T Re(F ) F F T α F A F P The factor if causes the breakdown of Friedel s law, i.e. the splitting into F + = F (+h) and F = F ( h). The experiment provides us (only) with their amplitudes F + and F. Substructure Data 10/25

Getting Closer The reason for this rather non-intuitive splitting is a formula derived by Karle (1980) and Hendrickson, Smith, Sheriff (1985). It puts F ±, F A, and the angle α = φ T φ A into context : F ± 2 = F T 2 + a F A 2 + b F A F T ± c F A F T sin α (3) a = f 2 +f 2, b = 2f f 2 0 f 0, c = 2f f 0 Substructure Data 11/25

Finally - the Substructure Amplitude If we substract above equations for F + 2 and F 2 from each other and use the approximation F + + F 2 F T, the result is F + F c F A sin(α) (4) Remember: 1. Our goal is to know F A, the structure factor amplitude for the substructure atoms 2. We know F + and F directly from the experiment 3. c is just a constant Substructure Data 12/25

The Angle α Up to the factor sin(α) we have arrived at an expression that allows us to calculate F A from the difference of the Bijvoet pairs F + and F. In the case of MAD, we can further eliminate this angle because there is one of the above equations for each wavelength. In the case of SAD, the program SHELXD approximates F A F A sin(α). Why is this justified? The method used to calculate the substructure emphasises strong reflections. In our case it means large differences between the Bijvoet pairs. This difference is large if the angle is close to 90 or 270, i.e. if sin(α) ±1, in which case it is a fairly good approximation - good enough to solve the structure, as plenty of solved structures confirm again and again. Substructure Data 13/25

Substructure Solution with Direct Methods Substructure Solution 14/25

Direct Methods Having figured out the values F A from our measured data we are actually pretending having collected a data set from a crystal with exactly the same (large) unit cell as our actual macromolecule but with only very few atoms inside. We artificially created a small molecule dataset. Substructure Solution 15/25

Direct Methods Direct methods have been applied to solve structures with more than 2000 independent non-hydrogen atoms (1gyo). These, however, require atomic resolution at 1.2 Å and better. For substructure solution, anomalous data to 2.5 3 Å are usually sufficient and even 5 Å may work. This is because the distance between atoms of the (hypothetical) substructure crystal is generally quite large, much larger than the data set resolution. Substructure Solution 16/25

Starting Point: The Sayre Equation In 1952, Sayre published what now has become known as the Sayre-Equation F (h) = q(sin(θ)/λ) h F h F h h (5) This equation is exact for an equal-atom-structure (like the substructure generally is). It requires, however, complete data including F (000), which is hidden by the beamstop. Substructure Solution 17/25

Normalised Structure Factors Experience has shown that direct methods produce better results if, instead of the normal structure factor F (hkl), the normalised structure factor is used. The normalised structure factor is calculated as E(hkl) 2 = F (hkl)2 /ε F (hkl) 2 /ε (6) It is calculated per resolution shell ( 20 shells over the whole resolution range). ε is a statistical constant used for the proper treatment of centric and acentric reflections. The denominator F (hkl) 2 /ε as is averaged per resolution shell. The normalised structure factor E is independent of the thermal motion (B-factor) and electron distribution around the atom. Substructure Solution 18/25

Tangent Formula (Karle & Hauptman, 1956) While the Sayre-equation directly is not very useful (because it requires complete data), it serves to derive the tangent formula h tan(φ h ) E h E h h sin(φ h + φ h h ) h E h E h h cos(φ h + φ h h ) (7) Given a set of test phases (one for each reflection), the tangent formula will usually not be fulfilled. By altering the test phases so that they better match the desired ideal values from Eq. 7, one can now refine these phases. Substructure Solution 19/25

Nothing comes from Nothing The tangent formula does not provide us with an initial set of phases for the substructure. It only allows to refine phases. shelxd (and e.g. the program SnB) solves this problem by a statistical approach which is called dual-space recycling. Substructure Solution 20/25

shelxd flow-chart random atoms or (better) atoms consistent with Patterson selection criterion: Correlation Coefficient between E obs and E calc Real space: select atoms best CC? Yes phases(map) from atoms dual space FFT and peak search recycling reciprocal space refine phases No repeat a few 100 times keep solution Substructure Solution 21/25

shelxd improvements shelxd provide two alternative methods to improve the quality of the search: PATS - Patterson Seeding the initial atoms are not chosen completely arbitrarily, but such that they obey the Patterson map of the data. This adds some chemical or geometrical information to the process and therefore helps to improve the result. WEED - Random omit maps after real space refinement, 30% of the peak positions in the map are left out from phase calculation and subsequent phase refinement. This is similar to omit maps in model building and refinement and improves the results. The idea behind this is to reduce model bias or over-emphasis from strong contributors. PATS is the default for macromolecules. There are mainly two cases for which WEED is recommended instead, both of which lead to an uninterpretable Patterson map with too dense peaks to distinguish: 1. too many atoms in the substructure 2. the Laue group is one of the higher cubic ones Substructure Solution 22/25

Density Modification Knowing the positions of the substructure atoms we can now calculate their structure factor amplitudes F A and their phases φ A. If we knew α we would know φ T = φ A + α and hence could calculate a (starting) electron density map together with the amplitudes F T. For MAD, SIRAS, MIR we can calculate α. For SAD and SIR, however, approximations are necessary - again.... From the phase diagram (slide 10), one can figure out, that if F + F, then α 90 and α 270 if F F +. In the SIR case the angles are 0 and 180 respectively. These approximations are poor, usually too poor for model building. They are, however, good enough for density modification which improves these estimates for φ T in several cycles of imposing chemical or statistical information on the reflection. This is yet another story, though.... Substructure Solution 23/25

Experimental Considerations data multiplicity and independence E.g. when collecting more than 360 with 1 framewidth, shift the goniometer by 0.5 degree backwards after the first round to get fresh, i.e. statistically independent, data. proper resolution limit during integration If the resolution limit during data integration is set too far into the noise region, also the data quality at lower resolution will suffer from it. Try to estimate how far your crystal diffracts and only integrate up to that resolution. proper resolution limit for shelxd shelxd is sensitive to the resolution cut-off. Check (e.g. with xprep) to what resolution an anomalous signal could be detected and try several resolution limits around that value. Experimental Considerations 24/25

References Drenth, J. (2007), Principles of Protein X-Ray Crystallography, Springer G.M. Sheldrick, H.A. Hauptman, C.M. Weeks, R. Miller and I. Usón (2001), Intern. Tables for Crystallography F, Section 16.1: Ab initio phasing Thank you for your attention! References 25/25