Summary of Experimental Protein Structure Determination. Key Elements

Similar documents
X-ray Crystallography

Introduction to" Protein Structure

X-ray Crystallography I. James Fraser Macromolecluar Interactions BP204

7.91 Amy Keating. Solving structures using X-ray crystallography & NMR spectroscopy

Ultra-high resolution structures in validation

Details of Protein Structure

Direct Method. Very few protein diffraction data meet the 2nd condition

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues

Ensemble refinement of protein crystal structures in PHENIX. Tom Burnley Piet Gros

Macromolecular X-ray Crystallography

Molecular Biology Course 2006 Protein Crystallography Part II

SOLID STATE 9. Determination of Crystal Structures

Can protein model accuracy be. identified? NO! CBS, BioCentrum, Morten Nielsen, DTU

Macromolecular Crystallography Part II

Molecular Modeling lecture 2

Protein Crystallography Part II

SUPPLEMENTARY INFORMATION

Electronic Supplementary Information (ESI) for Chem. Commun. Unveiling the three- dimensional structure of the green pigment of nitrite- cured meat

HOMOLOGY MODELING. The sequence alignment and template structure are then used to produce a structural model of the target.

Structure factors again

Acta Crystallographica Section F

X-ray Crystallography. Kalyan Das

Scattering by two Electrons

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB

Protein Structure Determination. Part 1 -- X-ray Crystallography

NMR in Structural Biology

TLS and all that. Ethan A Merritt. CCP4 Summer School 2011 (Argonne, IL) Abstract

The structure of Aquifex aeolicus FtsH in the ADP-bound state reveals a C2-symmetric hexamer

RNA protects a nucleoprotein complex against radiation damage

Macromolecular Crystallography Part II

X-ray analysis. 1. Basic crystallography 2. Basic diffraction physics 3. Experimental methods

Basics of protein structure

NMR, X-ray Diffraction, Protein Structure, and RasMol

Rietveld Structure Refinement of Protein Powder Diffraction Data using GSAS

11/6/2013. Refinement. Fourier Methods. Fourier Methods. Difference Map. Difference Map Find H s. Difference Map No C 1

NMR in Medicine and Biology

Protein Structure Determination Using NMR Restraints BCMB/CHEM 8190

Protein crystallography. Garry Taylor

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Full wwpdb X-ray Structure Validation Report i

HTCondor and macromolecular structure validation

Drug targets, Protein Structures and Crystallography

Protein structures and comparisons ndrew Torda Bioinformatik, Mai 2008

wwpdb X-ray Structure Validation Summary Report

Why do We Trust X-ray Crystallography?

PROTEIN'STRUCTURE'DETERMINATION'

IgE binds asymmetrically to its B cell receptor CD23

Full wwpdb X-ray Structure Validation Report i

Crystal lattice Real Space. Reflections Reciprocal Space. I. Solving Phases II. Model Building for CHEM 645. Purified Protein. Build model.

Full wwpdb X-ray Structure Validation Report i

Bioinformatics. Macromolecular structure

Plasmid Relevant features Source. W18N_D20N and TrXE-W18N_D20N-anti

Linking data and model quality in macromolecular crystallography. Kay Diederichs

Anisotropy in macromolecular crystal structures. Andrea Thorn July 19 th, 2012

Protein Structure Determination

Protein NMR Structures Refined with Rosetta Have Higher Accuracy Relative to Corresponding X ray Crystal Structures

Full wwpdb X-ray Structure Validation Report i

Protein Structures. 11/19/2002 Lecture 24 1

Full wwpdb X-ray Structure Validation Report i

Electron Density at various resolutions, and fitting a model as accurately as possible.

Determining Protein Structure BIBC 100

Resolution and data formats. Andrea Thorn

Full wwpdb X-ray Structure Validation Report i

Homology modeling of Ferredoxin-nitrite reductase from Arabidopsis thaliana

Modeling for 3D structure prediction

Proteins. Central Dogma : DNA RNA protein Amino acid polymers - defined composition & order. Perform nearly all cellular functions Drug Targets

1) NMR is a method of chemical analysis. (Who uses NMR in this way?) 2) NMR is used as a method for medical imaging. (called MRI )

Phase problem: Determining an initial phase angle α hkl for each recorded reflection. 1 ρ(x,y,z) = F hkl cos 2π (hx+ky+ lz - α hkl ) V h k l

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Principles of NMR Protein Spectroscopy. 2) Assignment of chemical shifts in a protein ( 1 H, 13 C, 15 N) 3) Three dimensional structure determination

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Solid-state NMR and proteins : basic concepts (a pictorial introduction) Barth van Rossum,

Roger Johnson Structure and Dynamics: X-ray Diffraction Lecture 6

Supporting Information

Handout 12 Structure refinement. Completing the structure and evaluating how good your data and model agree

SHELXC/D/E. Andrea Thorn

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

A Primer in X-ray Crystallography for Redox Biologists. Mark Wilson Karolinska Institute June 3 rd, 2014

Crystals, X-rays and Proteins

Protein Structures: Experiments and Modeling. Patrice Koehl

Protein Structure Determination from Pseudocontact Shifts Using ROSETTA

Nature Structural & Molecular Biology: doi: /nsmb.3194

5.067 Crystal Structure Refinement Fall 2007

Full wwpdb X-ray Structure Validation Report i

Protein Structure Determination 9/25/2007

Molecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007

Fourier Syntheses, Analyses, and Transforms

Table S1. Primers used for the constructions of recombinant GAL1 and λ5 mutants. GAL1-E74A ccgagcagcgggcggctgtctttcc ggaaagacagccgcccgctgctcgg

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche

Protein Structure Analysis and Verification. Course S Basics for Biosystems of the Cell exercise work. Maija Nevala, BIO, 67485U 16.1.

CCP4 Diamond 2014 SHELXC/D/E. Andrea Thorn

Visualization of Macromolecular Structures

Experimental Techniques in Protein Structure Determination

Basic Crystallography Part 1. Theory and Practice of X-ray Crystal Structure Determination

Automated identification of functional dynamic contact networks from X-ray crystallography

Solid State Spectroscopy Problem Set 7

Acta Cryst. (2017). D73, doi: /s

HADDOCK: High Ambiguity

Supporting Information

Transcription:

Programme 8.00-8.20 Summary of last week s lecture and quiz 8.20-9.00 Structure validation 9.00-9.15 Break 9.15-11.00 Exercise: Structure validation tutorial 11.00-11.10 Break 11.10-11.40 Summary & discussion 11.40-12.00 Quiz

Feedback Persons 2

Summary of Experimental Protein Structure Determination Key Elements

Learning Objectives After last week you should be able to: Give an outline of the most important steps (and obstacles) in protein structure determination by X-ray crystallography and NMR spectroscopy. Identify relevant parameters for evaluating the quality of protein structures determined by X-ray crystallography and NMR spectroscopy.

X-Ray Crystallography in Brief General method for structure determination Very accurate/well-defined structures Only little information on dynamics (Bfactors) For quality check: Remember the three Rs Resolution R/R free Ramachandran plot

The Experiment(s)

X-Ray Diffraction and Resolution Bragg s equation Diffraction image nλ = 2d sinθ " θ = sin 1 nλ % $ ' # 2d & 7

Other Parameters Non-crystallographic symmetry (NCS) Subunit symmetry not coinciding with crystal symmetry. Assume (=force) subunits to have identical conformation (strict/nonstrict). Reduces number of free parameters in refinement. Used when amount of data is insufficient (low resolution). B-factors Not refined Overall (from data) Grouped (aa. or bb/sc) Individual (atomic) Anisotropic (Res. < 1.5Å) Geometry Side chain rotamers Bond lengths Angles G-factor (overall geometrical quality)

Bottlenecks Getting the protein in sufficient quantity and purity Crystallisation (trial and error) Diffraction/resolution limit (both high and low)

NMR Basics NMR is nuclear magnetic resonance NMR spectroscopy is done on proteins IN SOLUTION Only atoms 1 H, 13 C, 15 N (and 31 P) can be detected in NMR experiments Proteins up to 30 kda Proteins stable at high concentration (0.5-1mM), preferably at room temperature

Distance Restraints Distance restraints are derived from spectra

Deposited NMR structure Finally, an ensemble of the 10-25 structures with lowest total energy and lowest number of violations are deposited in the Protein Data Bank The restraints are deposited in the BioMagResBank (BMRB) The ensemble shows possible structures within the space defined by the constraints The ensemble of 20 NMR structures of the U-box domain from Atpub14 deposited in PDB (1T1H) Andersen et al., JBC, 2004

Evaluation of NMR Structures Homo-/hetero- nuclear NMR? RMSD (be careful)? Types of constraints? Numbers of constraints? Violations? Specific regions?

Related Courses @ DTU 26325 Protein Crystallography Spring course, 13 weeks Contact: Pernille Harris, build. 206, room 212, (+45) 4525 2024, ph@kemi.dtu.dk

Structure Validation Originally by Anne Mølgaard, University of Copenhagen

Structure Validation A homology model will never be better than the template it is based on So make sure the template is good!

Learning objective After today you will be able to tell a good crystal structure from a bad one! 1hgf 1hgf Jmol PyMOL

The Protein Data Bank February, 2012

X-ray Crystallography vs. NMR X-ray crystallography Proteins of any size Proteins in crystal Complete data/total map of structure Many details one model Resolution, R-values, Ramachandran plot NMR spectroscopy Proteins below 50 kda Proteins in solution Incomplete data Fewer details many models Restraint violations, RMSD, Ramachandran plot

Ramachandran plot

Ramachandran Plot I Procheck (Laskowski) 4 categories: Most favored Additionally allowed Generously allowed disallowed More than 90% structures in most favored regions and none or only a few in disallowed regions PDBSum: http://www.ebi.ac.uk/ thornton-srv/databases/ pdbsum/

Ramachandran Plot II Uppsala Ramachandran server (Kleywegt & Jones): Newer analysis of PDB Two categories: Allowed Disallowed More than 95% residues in allowed regions http://eds.bmc.uu.se/ ramachan.html

Ramachandran Plot III MolProbity Current PDB standard. Three categories Favoured (>98%) Allowed (>99.8%) Disallowed Also outputs plots for Glycine Proline Pre-proline http:// molprobity.biochem.duke. edu/index.php

Ramachandran plot Good structures should have the majority of the residues in the most favored regions: Procheck: >90% Uppsala: >95% MolProbity: >98% and none or only a few in the disallowed regions. Residues in disallowed regions are either wrong or potentially very interesting.

Strange But Correct Mølgaard & Larsen (2002) Acta Cryst. D58, 111-119.

R-factor, R free Structure factor ( ) = f j exp 2πi hx j + ky j + lz j F hkl N j=1 [ ( )] Individual reflections I hkl F obs (hkl) 2 Take out 5-10% of the data and use to calculate R free

R-factor, R free R-factors vary from < 10% (high resolution structures) to 59% (theoretical random structure). Rule of thumb for good quality structures: R Resolution/10 R free should be < 30% and not much more than 5% (points) higher than R: R + 0.05 > R free > R

Key Parameters Resolution R values Agreement between data and model. Usually between 0.15 and 0.25, should not exceed 0.30. R ~ Resolution / 10 R + 0.05 > R free > R. Ramachandran plot The majority of residues in most favoured regions. B factors Contributions from static and dynamic disorder Well determined ~10-20 Å 2, intermediate ~20-30 Å 2, flexible 30-50 Å 2, invisible >60 Å 2.

1p7q R, Rfree: 21.8%, 30.9% Resolution: 3.4 Å Ramachandran plot B.E. Willcox et al. Nat Immunol. 2003 Sep;4(9):913-9.

Other parameters B-factors Contributions from static and dynamic disorder Well determined ~10-20 Å 2 Intermediate ~20-30 Å 2 Flexible 30-50 Å 2 Invisible >60 Å 2. Should be (mostly) > 5 Å 2 and < 50 Å 2 Occupancies Between 0 and 1 (0-100%) Rarely below 0.4 (40%)

B-factors 1xr9, 1.79Å atom # chain ID resi # x y z occ B-factor

B-factors 1p7q, 3.4Å

1hhh, 3.0Å Madden et al. Cell. 1993 Nov 19;75(4):693-708.

1au4, 2.3Å, R=26.6, R free =37.0 Yamashita et al. J.Am.Chem.Soc. v119 pp.11351, 1997

1a0h, 3.2Å Martin et al. Structure. 1997 Dec 15;5(12):1681-93

Occupancies, 1ea3, 2.3Å

The Electron Density Server

Something s Fishy in 2hr0 No correlation between B-factors and solvent accessibility. Large layers of solvent in the c-direction (30-40Å thick) that spans entire unit cell (i.e. nothing to hold the crystal together). Absence of bulk solvent in diffraction data. 80% solvent resolution 2.3Å. Perfect electron density around impossible geometries. Impossible merging statistics for data from four crystals (Rmerge = 0.11 in the last resolution shell, with I/sig(I) = 1.32).

Diffraction Precision Index DPI (with Rfree): DPI (no Rfree): Scaled by B-factor Cruickshank 1999, Acta Cryst D

DPI vs. Resolution 2BSU

2bsu, 1.60Å, R=31.9, R free =33.6 Has now been replaced by 2v2w

Take Home Message It is easy to find bad structures in the PDB so validate!! Resolution Ramachandran R free RMSD (NMR) Restraint violations (NMR) High impact journal high quality structure!

Programme 8.00-8.20 Summary of last week s lecture and quiz 8.20-9.00 Structure validation 9.00-9.15 Break 9.15-11.00 Exercise: Structure validation tutorial 11.00-11.10 Break 11.10-11.40 Summary & discussion 11.40-12.00 Quiz

Break!

Programme 8.00-8.20 Summary of last week s lecture and quiz 8.20-9.00 Structure validation 9.00-9.15 Break 9.15-11.00 Exercise: Structure validation tutorial 11.00-11.10 Break 11.10-11.40 Summary & discussion 11.40-12.00 Quiz