Direct Method. Very few protein diffraction data meet the 2nd condition

Similar documents
Crystal lattice Real Space. Reflections Reciprocal Space. I. Solving Phases II. Model Building for CHEM 645. Purified Protein. Build model.

Phase problem: Determining an initial phase angle α hkl for each recorded reflection. 1 ρ(x,y,z) = F hkl cos 2π (hx+ky+ lz - α hkl ) V h k l

Protein crystallography. Garry Taylor

X-ray Crystallography. Kalyan Das

Structure factors again

PSD '17 -- Xray Lecture 5, 6. Patterson Space, Molecular Replacement and Heavy Atom Isomorphous Replacement

X-ray Crystallography

Molecular Biology Course 2006 Protein Crystallography Part II

Anomalous dispersion

Molecular Replacement (Alexei Vagin s lecture)

Scattering by two Electrons

Phaser: Experimental phasing

From x-ray crystallography to electron microscopy and back -- how best to exploit the continuum of structure-determination methods now available

Likelihood and SAD phasing in Phaser. R J Read, Department of Haematology Cambridge Institute for Medical Research

Protein Crystallography Part II

X-ray Crystallography I. James Fraser Macromolecluar Interactions BP204

Overview - Macromolecular Crystallography

Protein Structure Determination. Part 1 -- X-ray Crystallography

Fourier Syntheses, Analyses, and Transforms

Determination of the Substructure

Macromolecular Crystallography Part II


Molecular replacement. New structures from old

Electron Density at various resolutions, and fitting a model as accurately as possible.

SUPPLEMENTARY INFORMATION

Macromolecular X-ray Crystallography

Protein Crystallography

Summary of Experimental Protein Structure Determination. Key Elements

Biology III: Crystallographic phases

Crystals, X-rays and Proteins

CCP4 Diamond 2014 SHELXC/D/E. Andrea Thorn

SOLID STATE 9. Determination of Crystal Structures

Why do We Trust X-ray Crystallography?

Materials 286C/UCSB: Class VI Structure factors (continued), the phase problem, Patterson techniques and direct methods

SHELXC/D/E. Andrea Thorn

Direct Methods and Many Site Se-Met MAD Problems using BnP. W. Furey

Ultra-high resolution structures in validation

What is the Phase Problem? Overview of the Phase Problem. Phases. 201 Phases. Diffraction vector for a Bragg spot. In General for Any Atom (x, y, z)

Handout 12 Structure refinement. Completing the structure and evaluating how good your data and model agree

Fast, Intuitive Structure Determination IV: Space Group Determination and Structure Solution

Supporting Information. Synthesis of Aspartame by Thermolysin : An X-ray Structural Study

Web-based Auto-Rickshaw for validation of the X-ray experiment at the synchrotron beamline

Resolution: maximum limit of diffraction (asymmetric)

BC530 Class notes on X-ray Crystallography

SUPPLEMENTARY INFORMATION

Experimental Phasing with SHELX C/D/E

Macromolecular Phasing with shelxc/d/e

Image definition evaluation functions for X-ray crystallography: A new perspective on the phase. problem. Hui LI*, Meng HE* and Ze ZHANG

Exploring symmetry related bias in conformational data from the Cambridge Structural Database: A rare phenomenon?

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

Practical aspects of SAD/MAD. Judit É Debreczeni

Proteins. Central Dogma : DNA RNA protein Amino acid polymers - defined composition & order. Perform nearly all cellular functions Drug Targets

APPENDIX E. Crystallographic Data for TBA Eu(DO2A)(DPA) Temperature Dependence

Practical applications of synchrotron radiation in the determination of bio-macromolecule three-dimensional structures. M. Nardini and M.

SUPPLEMENTARY INFORMATION

Experimental phasing in Crank2

Part 1 X-ray Crystallography

11/6/2013. Refinement. Fourier Methods. Fourier Methods. Difference Map. Difference Map Find H s. Difference Map No C 1

ACORN - a flexible and efficient ab initio procedure to solve a protein structure when atomic resolution data is available

Garib N Murshudov MRC-LMB, Cambridge

Jack D. Dunitz. X-Ray Analysis and the Structure of Organic Molecules VCHP. (2nd Corrected Reprint) $ Verlag Helvetica Chimica Acta, Basel

CALIFORNIA INSTITUTE OF TECHNOLOGY BECKMAN INSTITUTE X-RAY CRYSTALLOGRAPHY LABORATORY

Electronic Supplementary Information (ESI) for Chem. Commun. Unveiling the three- dimensional structure of the green pigment of nitrite- cured meat

Protein Structure Determination 9/25/2007

Acta Crystallographica Section F

Supporting Information

General theory of diffraction

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

Structure solution from weak anomalous data

Summary: Crystallography in a nutshell. Lecture no. 4. (Crystallography without tears, part 2)

HOMOLOGY MODELING. The sequence alignment and template structure are then used to produce a structural model of the target.

7.91 Amy Keating. Solving structures using X-ray crystallography & NMR spectroscopy

Table 1. Crystallographic data collection, phasing and refinement statistics. Native Hg soaked Mn soaked 1 Mn soaked 2

4. Constraints and Hydrogen Atoms

organic papers Malonamide: an orthorhombic polymorph Comment

Molecular Modeling lecture 2

1.b What are current best practices for selecting an initial target ligand atomic model(s) for structure refinement from X-ray diffraction data?!

The SHELX approach to the experimental phasing of macromolecules. George M. Sheldrick, Göttingen University

SUPPLEMENTARY INFORMATION

Charles Ballard (original GáborBunkóczi) CCP4 Workshop 7 December 2011

Patterson Methods

Full wwpdb X-ray Structure Validation Report i

The Crystal and Molecular Structures of Hydrazine Adducts with Isomeric Pyrazine Dicarboxylic Acids

Full wwpdb X-ray Structure Validation Report i

Supporting Information

Full wwpdb X-ray Structure Validation Report i

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

Principles of Physical Biochemistry

6. X-ray Crystallography and Fourier Series

Rietveld Structure Refinement of Protein Powder Diffraction Data using GSAS

Supporting Information

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

This is an author produced version of Privateer: : software for the conformational validation of carbohydrate structures.

The Phase Problem of X-ray Crystallography

Protein structure analysis. Risto Laakso 10th January 2005

TLS and all that. Ethan A Merritt. CCP4 Summer School 2011 (Argonne, IL) Abstract

Crystal structure of DL-Tryptophan at 173K

A Primer in X-ray Crystallography for Redox Biologists. Mark Wilson Karolinska Institute June 3 rd, 2014

Full wwpdb X-ray Structure Validation Report i

Transcription:

Direct Method Two conditions: -atoms in the structure are equal-weighted -resolution of data are higher than the distance between the atoms in the structure Very few protein diffraction data meet the 2nd condition Heavy atoms in protein => sub-structure of heavy atoms in a derivative F H = F PH - F P

Direct Method 1. Determine the substructure of heavy atoms 2. Determine overall protein structure at 1.2 Å 3. Programs: - Shake and bake (SnB) - SHELXD 4. Phases refined by Sharp

Large substructure solved mainly by direct method http://www.hwi.buffalo.edu/substructure/viewdb.htm The number of heavy atoms (anomalous scatters) > 20 Mainly apply to protein crystals contain many Se-Met and halide soaking

Molecular Replacement 1. A homologous model (structure) available 2. A structure determined by putting the model in proper orientation and precise position in the target unit cell. - rotation search (rotation matrix) - translation search (translation vector) 3. New structure X B X B = [C] X A + d 4. Amore, Molrep, CNS, Phaser 5. Over 50% protein structures solved by MR

Solvent Content Mathew coefficient (V M ) V O V M = Z x n x MW V M = 1.7~ 3.5 V O Unit cell volume Z number of au in a unit cell M W molecular weight of protein n number of protein molecules in the au Protein content: V protein = 1.23/V M, 1.23 = density of protein crystals Solvent content: V sol = 1-1.23/V M When V M = 2.5 Å 3 /Da, V sol = 0.51 =>Most common Vm and Vsol -Solvent fractions of 0.3-0.7 are common. -Crystals with high solvent contents general diffract poorly and are fragile. -But high solvent content is a great advantage in phase improvement by density modification.

Fourier transforms Electron density ρ (x,y,z) = 1/V F hkl exp (-2πi(hX+kY+lZ)) h k l F hkl = F hkl exp(i 2π(hxj+kyj+lzj) ) Structure factor ρ (x) FT - FT F(h)

Density modification We use current phases (model) to calculate a density map. Then we modify that map to make it conform better to some idea about what an electron density map should look like. The new and better map is then back-transformed to the calculate structure factors, which should have more accurate phases than original map. By iterating, we are getting a map that does not change anymore and should be closer to the true map

Solvent flattening/flipping Density in protein and solvent regions: Solvent flattening: -replacing all the density values within the solvent region with the average value throughout the solvent region. ρ out (x) = (ρ in (x) - ρ sol ) * µ (x) + ρ sol Solvent flipping: - modified solvent flattening to remove biases from original map

NCS averaging NCS copies in AU The NCS copies in AU should have same electron density The difference in a noisy density map between NCS copies caused by random errors. Non-crystallographic symmetry: symmetry relations among identical copies in AU A new map made by averaging the copies of density related by non-crystallographic symmetry should be more accurate, since the noise is averaged out.

Model Building Putting blocks of protein structures into electron density Mainly use interactive computer graphic programs: - O, XtalView, Coot Automatic model-building programs: - ARP/wARP, RESOLVE, MAID

Map and Model 6.0 Å map

Map and Model 1.0 Å map

Fourier Method F F(h) = F exp (iα c ) -F is the true structure factor but only F measured -Fc is from an initial model or a molecular replacement search model -F(h) is model-phased structured factor closer to the true F than Fc was -So the map will the map will have the features of the true structure -Used to solve ligand- soaked structures

Map from Fourier Method o Model-phased map

Difference Map F(h) = ( F - F C ) exp (iα c ) F o - F c map The difference map highlights the difference of true structure and the model -positive density (blue)indicates atoms should be added -negative density (red) indicates the atoms should be moved elsewhere (or removed)

Refinement of Model Now we have 1) X-ray diffraction intensities (h, k, l) from data 2) x, y, z positions of atoms in unit cell from models 3) known scattering factors of atoms Let s adjust the model to find a closer agreement between the calculated and observed structure factors - minimize the difference below: Σ ( F cal - F obs ) 2 hkl - correct the errors in the initial atomic model

Thermal Motion - atoms undergo motion in crystal - not exactly fixed at x j, y j, z j - B factor B j = 8π 2 u j 2 B j is a measure of motion u j is degree of vibration B j = 80 Å 2, u j = 1.00Å B j = 20 Å 2, u j = 0.5Å atoms F (h,k,l) = Σ f (j) exp [2π * i(hx (j) + ky (j) +lz (j) ]* exp [-B j * (sinθ/λ) 2 ] j=1

Resolution and Refinement Resolution Observations/parameters 3.5 Å 0.5 3.0 Å 0.8 2.5 Å 1.4 2.0 Å 2.8 1.5 Å 6.2 -For a protein crystal with a typical packing density, and 4 parameters (x, y, z, and B) per atom (non-h). -At resolutions < 1.0, the ratio of observations to parameters is low and the refinement is poor over-determined.

Resolution and Structure 1.0 Å 2.5 Å 3.0 Å 4.0 Å Cambridge course http://www-structmed.cimr.cam.ac.uk/course/fitting/fittingtalk.html

Restraints and Constraints Additional observations are incorporated in the refinement -Stereochemical data from small molecular structures e.g. bond lengths and angles, etc Constraints The stereo data taken as rigid and only dihedral angles varied in models -effectively reduce the # of parameters Restraints The stereo data allowed to vary around a standard value and controlled by an energy term E = E chem + w E xray E xay = Σ( F cal (h) - k F obs (h) ) 2 h -define the difference of models to x-ray data E chem = Σ (M ideal M model ) M bond lengths and angles, torsion angles and van der Waals contacts, etc

R Factor Difference between F obs and F calc R factor = Σ F obs - F calc hkl Σ F obs hkl Quality of Model R factor = 0.00 perfect fit 0.20 good fit 0.60 random fit

Subjectivity and Overfitting Subjectivity: misinterpret density map Overfitting: lower R-factors without removing errors in the model Protein crystals usually could not diffract to atomic resolution, which provide a room for above two error-inducing phenomena Overfit the diffraction data by introduction of too many adjustable parameters. e.g., too many water molecules are fitted to the diffraction data, which compensates for errors in the model or the data. Certain subtle errors introduced by overfitting can produce a low R factor.

Validation and Model Evaluation Ramachandran plot Bond lengths/angles Homolog structure comparison Independent structural solutions

Ramachandran Plot

R free and Cross-validation R free = hkl T Σ F obs - F calc hkl T Σ F obs Difference between F obs and F calc for the test data hkl T: All the reflections belong to test set, random selection of ~5% of the observed reflections which never used in refinement. Every observation contains information from all the atoms in a structure.

Structure Quality What to look for 1) R free near 0.25 2) Resolution (better than 3.5 Å) 3) Completeness ~ 95% 4) Ramachandran plot 5) RMSD of bond lengths and angles from ideal values (<0.02 Å, <2.0 o )

Refine and Rebuild Model - Check the correctness of the model - 2Fo-Fc and Fo-Fc maps (re-building) - database in O, coot, XtalView - Refine the updated model against diffraction data - use R free as monitor - Evaluate the structure - torsion angles by Ramachandran Plot - geometry, bond lengths and angles - Iterate the three steps until satisfaction