Handout 12 Structure refinement. Completing the structure and evaluating how good your data and model agree

Similar documents
11/6/2013. Refinement. Fourier Methods. Fourier Methods. Difference Map. Difference Map Find H s. Difference Map No C 1

Handout 7 Reciprocal Space

Structure Factors. How to get more than unit cell sizes from your diffraction data.

APPENDIX E. Crystallographic Data for TBA Eu(DO2A)(DPA) Temperature Dependence

X-ray Crystallography

Direct Method. Very few protein diffraction data meet the 2nd condition

4. Constraints and Hydrogen Atoms

Fourier Syntheses, Analyses, and Transforms

Protein crystallography. Garry Taylor

Scattering by two Electrons

CALIFORNIA INSTITUTE OF TECHNOLOGY BECKMAN INSTITUTE X-RAY CRYSTALLOGRAPHY LABORATORY

Why do We Trust X-ray Crystallography?

Seth B. Harkins and Jonas C. Peters

Crystals, X-rays and Proteins

X-ray Crystallography I. James Fraser Macromolecluar Interactions BP204

Protein Structure Determination 9/25/2007

What is the Phase Problem? Overview of the Phase Problem. Phases. 201 Phases. Diffraction vector for a Bragg spot. In General for Any Atom (x, y, z)

Refinement of Disorder with SHELXL

Orthorhombic, Pbca a = (3) Å b = (15) Å c = (4) Å V = (9) Å 3. Data collection. Refinement

Pseudo translation and Twinning

Ultra-high resolution structures in validation

Electron Density at various resolutions, and fitting a model as accurately as possible.

Bob Brown Math 251 Calculus 1 Chapter 4, Section 4 1 CCBC Dundalk

QUESTIONNAIRE FOR STRUCTURE DETERMINATION BY POWDER DIFFRACTOMETRY ROUND ROBIN

Protein Crystallography Part II

Synthetic, Structural, and Mechanistic Aspects of an Amine Activation Process Mediated at a Zwitterionic Pd(II) Center

Molecular Biology Course 2006 Protein Crystallography Part II

Ethylene Trimerization Catalysts Based on Chromium Complexes with a. Nitrogen-Bridged Diphosphine Ligand Having ortho-methoxyaryl or

Basic Crystallography Part 1. Theory and Practice of X-ray Crystal Structure Determination

Macromolecular Crystallography Part II

Data processing and reduction

Anomalous dispersion

11/18/2013 SHELX. History SHELXS. .ins File for SHELXS TITL. .ins File Organization

CHAPTER 2 CRYSTALLOGRAPHIC ANALYSIS

Remote Asymmetric Induction in an Intramolecular Ionic Diels-Alder Reaction: Application to the Total Synthesis of (+)-Dihydrocompactin

Supporting Information

TLS and all that. Ethan A Merritt. CCP4 Summer School 2011 (Argonne, IL) Abstract

organic papers Acetone (2,6-dichlorobenzoyl)hydrazone: chains of p-stacked hydrogen-bonded dimers Comment Experimental

Anisotropy in macromolecular crystal structures. Andrea Thorn July 19 th, 2012

The CIF file, refinement details and validation of the structure

Matthias W. Büttner, Jennifer B. Nätscher, Christian Burschka, and Reinhold Tacke *

organic papers Malonamide: an orthorhombic polymorph Comment

Summary of Experimental Protein Structure Determination. Key Elements

metal-organic compounds

CIF access. Redetermination of biphenylene at 130K. R. Boese, D. Bläser and R. Latz

Handout 13 Interpreting your results. What to make of your atomic coordinates, bond distances and angles

research papers Detecting outliers in non-redundant diffraction data 1. Introduction Randy J. Read

Cages on a plane: a structural matrix for molecular 'sheets'

STRUCTURES OF MERCURY MERCAPTIDES

Supporting Information for A Janus-type Bis(maloNHC) and its Zwitterionic Gold and Silver Metal Complexes

High-Throughput in Chemical Crystallography from an industrial point of view

Synthesis and structural characterization of homophthalic acid and 4,4-bipyridine

X-ray Crystallography. Kalyan Das

Reaction Landscape of a Pentadentate N5-Ligated Mn II Complex with O 2

Supplementary Material (ESI) for CrystEngComm This journal is The Royal Society of Chemistry 2010

metal-organic compounds

Eur. J. Inorg. Chem WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim, 2013 ISSN SUPPORTING INFORMATION

Redetermination of Crystal Structure of Bis(2,4-pentanedionato)copper(II)

metal-organic compounds

Decomposition of Ruthenium Olefin Metathesis. Catalysts

Supporting information

Data Collection. Overview. Methods. Counter Methods. Crystal Quality with -Scans

organic papers 2-Iodo-4-nitro-N-(trifluoroacetyl)aniline: sheets built from iodo nitro and nitro nitro interactions

Materials 286C/UCSB: Class VI Structure factors (continued), the phase problem, Patterson techniques and direct methods

From x-ray crystallography to electron microscopy and back -- how best to exploit the continuum of structure-determination methods now available

Data collection. Refinement. R[F 2 >2(F 2 )] = wr(f 2 ) = S = reflections 92 parameters

The structure of Aquifex aeolicus FtsH in the ADP-bound state reveals a C2-symmetric hexamer

Jack D. Dunitz. X-Ray Analysis and the Structure of Organic Molecules VCHP. (2nd Corrected Reprint) $ Verlag Helvetica Chimica Acta, Basel

Manganese-Calcium Clusters Supported by Calixarenes

V = (14) Å 3 Z =8 Cu K radiation. Data collection. Refinement

Electronic Supplementary Information for: Gram-scale Synthesis of a Bench-Stable 5,5 -Unsubstituted Terpyrrole

Supporting Information for First thia-diels Alder reactions of thiochalcones with 1,4- quinones

SOLID STATE 9. Determination of Crystal Structures

1 Introduction. Abstract

Determination of the Substructure

5.80 Small-Molecule Spectroscopy and Dynamics

SUPPLEMENTARY INFORMATION

Rigid body Rigid body approach

Sodium 3,5-dinitrobenzoate

Stephen F. Nelsen, Asgeir E. Konradsson, Rustem F. Ismagilov, Ilia A. Guzei N N

Lesson 6-1: Relations and Functions

The Crystal Structure of Iron Pentacarbonyl: Space Group and Refinement of the Structure

Kevin Cowtan, York The Patterson Function. Kevin Cowtan

Reactivity of Diiron Imido Complexes

oligomerization to polymerization of 1-hexene catalyzed by an NHC-zirconium complex

Crystal lattice Real Space. Reflections Reciprocal Space. I. Solving Phases II. Model Building for CHEM 645. Purified Protein. Build model.

organic compounds 2,4,6-Trimethylphenyl isocyanide

CHAPTER 6 CRYSTAL STRUCTURE OF A DEHYDROACETIC ACID SUBSTITUTED SCHIFF BASE DERIVATIVE

Name: Partners: PreCalculus. Review 5 Version A

Small Molecule Crystallography Lab Department of Chemistry and Biochemistry University of Oklahoma 101 Stephenson Parkway Norman, OK

Chemical Crystallography

structures sur poudres : contraintes Progrès dans l affinement de rigides et molles Laboratoire Léon Brillouin (CEA-CNRS), CEA/Saclay

The Potential Energy Surface

CONTRIBUTED PAPERS DETERMINATION OF A DIFFICULT STRUCTURE: A CASE OF STRONG CORRELATIONS BETWEEN PARAMETERS R. KURODA

Combined Analysis of 1,3-Benzodioxoles by Crystalline Sponge X-ray Crystallography and Laser Desorption Ionization Mass Spectrometry

7.91 Amy Keating. Solving structures using X-ray crystallography & NMR spectroscopy

organic compounds 2-(3-Nitrophenylaminocarbonyl)- benzoic acid: hydrogen-bonded sheets of R 4 4 (22) rings Comment

SUPPLEMENTARY INFORMATION

Phase problem: Determining an initial phase angle α hkl for each recorded reflection. 1 ρ(x,y,z) = F hkl cos 2π (hx+ky+ lz - α hkl ) V h k l

Supporting Information Strong Luminescent Copper(I)-halide Coordination Polymers and Dinuclear Complexes with Thioacetamide and N,N-donor ligands

Transcription:

Handout 1 Structure refinement Completing the structure and evaluating how good your data and model agree

Why you should refine a structure We have considered how atoms are located by Patterson, direct methods or specialized methods, as well as from Fourier difference maps The atomic positions extracted from these methods are close to the correct values, but very rarely in exactly the right place During a process called refinement, the starting atomic positions are optimized - the goal is to get an as good fit between your model and your data as possible

Possible refinement methods You can modify your model in order to obtain better agreement between observed and calculated Fourier maps Alternatively, you can modify your model to get better agreement between observed and calculated structure factor magnitudes ( F o and F c ) For most small molecule structure solutions, the latter process is used

Letting the computer do the work The refinement process can be described as a minimization process - minimization of the difference between F o and F c - easily automated The best model is obtained by minimizing the following expression: R' = w ( F o k F c ) - called a least squares minimization - w khl are weighing factors related to the quality/reliability of the data - k is a scale factor (inverse of scale factor that would need to be applied to F o

Least squares minimization Imagine a function F that is linearly dependent on a set of parameters, x j - F(x 1,x, x n ) = p 1 x 1 + p x + p n x n Suppose we make m independent measurements of F for different values of x j - we want to get the parameters p j - if m = n, we just have to solve a set of simultaneous equations - if m > n, the system is overdetermined

Crystallography and least squares The crystallographic function F() (with the variables: atomic coordinates and thermal parameters of each atom) is not a linear function of the model parameters F() N = j= 1 f j exp[πi( hx j + ky j + lz j ) φ( )] The equation cannot be solved for the correct parameters in one go Use an iterative process instead

Our goal is to minimize We can use an iterative least squares approach to give parameter shifts that will lead to an improved agreement between F o and F c The process is repeated until the suggested shifts are insignificantly small Refinement iterations R' = w - usually considered to be achieved when parameter shift << standard deviation ( F o k F c ) - at this minimum, the derivative of R with respect to each parameter (x j, y j, z j B j ) should be zero

Convergence A refinement process is usually considered as finished when convergence is reached - when all parameter shifts << standard deviation However, the process only works well if the starting model is sufficiently good - many local minima in which the refinement could get stuck Other methods offer better chances to find the absolute minimum from a bad starting model - simulated annealing, random walk

False minima

Errors The errors of all observations are included in the refinement process via the weight factors - weak or uncertain reflections will have less weight than strong reflections The least squares output will provide esds (estimated standard deviations) for all refined parameters Watch out for high correlation coefficients - correlation coefficients tell you whether two model parameters really are independent

Statistical values are used to judge the goodness of a refinement The level of agreement between observed and calculated structure factors is often indicated by R factors and Goodness of Fit (GooF) values R F wr Judging the refinement = ( F ( ) k F ( ) ) / F ( ) o - for n observations and p parameters c = { [ w( Fo ( ) Fc ( ) ) ]/ [ w Fo ( F GooF = S = o { [ w( Fo ( ) Fc ( ) ) ]/( n p )} ) 1/ ]} 1/

R factors For a good small molecule refinement, the final R F values are expected to be ~0.0-0.08 Placing random atoms in a unit cell is expected to give R factors of 0.83 and 0.59 for centric and acentric space groups, respectively Obtaining R F < 0.0 usually means that the structural model has no major errors in it

Refinement strategy Unit cell constants need to be refined - original indexing just gives approximate values Atomic positions obtained by Patterson searches, direct methods or other approaches should be refined - look at interatomic distances to judge whether refined positions make sense - atoms in special positions cannot move in all directions Atomic displacement parameters should be varied - can indicate wrong atomic weight: Z model > (<) Z real leads to large (small) ADPs - isotropic overall temperature factors can be obtained from Wilson plots as a first approximation

Estimating the overall temperature factor We know that observed structure factor magnitudes are smaller than real values because of thermal motion and scaling issues K F o ( ) F( ) = f e Bsin θ / λ ln[ F o ( ) / f ] = ln K Bsin θ / λ - for random atom placement in the unit cell Both K and B can be obtained from a Wilson plot

Wilson plots Crystal Structure Analysis for Chemists and Biologists, Glusker, Lewis and Rossi, VCH, 1994.

Restraints and constraints It is possible to impose certain restrictions on the refinement due to some knowledge that is not inherent in the diffraction data - restraints and constraints Chemical knowledge, such as bond distances and angles, can be used as a restraint in a minimization procedure - a restraint will make certain moves unfavorable, but will not prohibit them Knowledge about molecular connectivity can be used - e.g., a rigid aromatic ring can be described by three positional and three rotational parameters instead of 4n parameters (n = number of atoms) - this would be a rigid body constraint

When to use restraints and constraints Restraints and constraints can improve your data to parameter ratio by reducing the number of parameters or adding observations - very useful if your data are limited or of low quality They improve the convergence properties of your refinement and may allow a refinement to converge to the correct answer even if the starting model is poor - they make potentially disastrous parameter changes (e.g., one carbon atom moving so far that the aromatic ring will no longer be connected) unfavorable or prohibit them altogether