Full wwpdb X-ray Structure Validation Report i

Similar documents
Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

Full wwpdb X-ray Structure Validation Report i

wwpdb X-ray Structure Validation Summary Report

Full wwpdb X-ray Structure Validation Report i

Full wwpdb/emdatabank EM Map/Model Validation Report i

Full wwpdb NMR Structure Validation Report i

Full wwpdb/emdatabank EM Map/Model Validation Report io

Full wwpdb/emdatabank EM Map/Model Validation Report i

Full wwpdb X-ray Structure Validation Report i

1.b What are current best practices for selecting an initial target ligand atomic model(s) for structure refinement from X-ray diffraction data?!

Electronic Supplementary Information (ESI) for Chem. Commun. Unveiling the three- dimensional structure of the green pigment of nitrite- cured meat

Full wwpdb X-ray Structure Validation Report i

Nitrogenase MoFe protein from Clostridium pasteurianum at 1.08 Å resolution: comparison with the Azotobacter vinelandii MoFe protein

Full wwpdb/emdatabank EM Map/Model Validation Report io

The structure of Aquifex aeolicus FtsH in the ADP-bound state reveals a C2-symmetric hexamer

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

Multivariate Analyses of Quality Metrics for Crystal Structures in the PDB Archive

Ramachandran Plot. 4ysz Phi (degrees) Plot statistics

A new generation of crystallographic validation tools for the Protein Data Bank

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version 3.0, December 1, 2006 Updated to Version 3.

FW 1 CDR 1 FW 2 CDR 2

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version Document Published by the wwpdb

A New Generation of Crystallographic Validation Tools for the Protein Data Bank

DOCKING TUTORIAL. A. The docking Workflow

Table 1. Crystallographic data collection, phasing and refinement statistics. Native Hg soaked Mn soaked 1 Mn soaked 2

Chapter 24. Stereochemistry and Validation of Macromolecular Structures. Alexander Wlodawer. Abstract. 1 Introduction

Table S1. Overview of used PDZK1 constructs and their binding affinities to peptides. Related to figure 1.

IgE binds asymmetrically to its B cell receptor CD23

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Physiochemical Properties of Residues


Report of protein analysis

X-ray Crystallography I. James Fraser Macromolecluar Interactions BP204

Manipulating Ligands Using Coot. Paul Emsley May 2013

SUPPLEMENTARY INFORMATION

Automated identification of functional dynamic contact networks from X-ray crystallography

*Corresponding Author *K. F.: *T. H.:

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination

Protein structure analysis. Risto Laakso 10th January 2005

SUPPLEMENTARY INFORMATION. doi: /nature07461

Garib N Murshudov MRC-LMB, Cambridge

Conformational Geometry of Peptides and Proteins:

HOMOLOGY MODELING. The sequence alignment and template structure are then used to produce a structural model of the target.

Report of protein analysis

Ensemble refinement of protein crystal structures in PHENIX. Tom Burnley Piet Gros

Macromolecular Crystallography Part II

Protein Structures: Experiments and Modeling. Patrice Koehl

Joana Pereira Lamzin Group EMBL Hamburg, Germany. Small molecules How to identify and build them (with ARP/wARP)

Data File Formats. There are dozens of file formats for chemical data.

HTCondor and macromolecular structure validation

Q1 current best prac2ce

This is an author produced version of Privateer: : software for the conformational validation of carbohydrate structures.

Model Mélange. Physical Models of Peptides and Proteins

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

SUPPLEMENTARY INFORMATION

Rietveld Structure Refinement of Protein Powder Diffraction Data using GSAS

Electron Density at various resolutions, and fitting a model as accurately as possible.

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods

8. Strategies for Macromolecular Refinement

Enhancing Specificity in the Janus Kinases: A Study on the Thienopyridine. JAK2 Selective Mechanism Combined Molecular Dynamics Simulation

Rotamers in the CHARMM19 Force Field

NMR study of complexes between low molecular mass inhibitors and the West Nile virus NS2B-NS3 protease

BSc and MSc Degree Examinations

Protein sidechain conformer prediction: a test of the energy function Robert J Petrella 1, Themis Lazaridis 1 and Martin Karplus 1,2

SUPPLEMENTARY INFORMATION

Stabilizing the CH2 domain of an Antibody by Engineering in an Enhanced Aromatic Sequon

PDBe TUTORIAL. PDBePISA (Protein Interfaces, Surfaces and Assemblies)

Twinning in PDB and REFMAC. Garib N Murshudov Chemistry Department, University of York, UK

April, The energy functions include:

Sensitive NMR Approach for Determining the Binding Mode of Tightly Binding Ligand Molecules to Protein Targets

Supporting Information. UV-induced ligand exchange in MHC class I protein crystals

Modelling Macromolecules with Coot

Full wwpdb/emdatabank EM Map/Model Validation Report i

Drug targets, Protein Structures and Crystallography

Supplementary Information

Visualization of Macromolecular Structures

NMR, X-ray Diffraction, Protein Structure, and RasMol

Supplementary Materials for

Summary of Experimental Protein Structure Determination. Key Elements

Cryo-EM data collection, refinement and validation statistics

SUPPLEMENTARY INFORMATION

Coot, pyrogen & CCP4 SRS. Paul Emsley MRC Laboratory of Molecular Biology, Cambridge, UK June 2015

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1

Supplementary Figures

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell

Nature Structural and Molecular Biology: doi: /nsmb Supplementary Figure 1. Experimental approach for enhancement of unbiased Fo Fc maps.

SUPPLEMENTARY INFORMATION

Central Dogma. modifications genome transcriptome proteome

PAN-modular Structure of Parasite Sarcocystis muris Microneme Protein SML-2 at 1.95 Å Resolution and the Complex with 1-Thio-β-D-Galactose

Acta Cryst. (2017). D73, doi: /s

Direct Method. Very few protein diffraction data meet the 2nd condition

Acta Crystallographica Section F

Transcription:

Full wwpdb X-ray Structure Validation Report i Mar 8, 2018 10:24 pm GMT PDB ID : 1A30 Title : HIV-1 PROTEASE COMPLEXED WITH A TRIPEPTIDE INHIBITOR Authors : Louis, J.M.; Dyda, F.; Nashed, N.T.; Kimmel, A.R.; Davies, D.R. Deposited on : 1998-01-27 Resolution : 2.00 Å(reported) This is a Full wwpdb X-ray Structure Validation Report for a publicly released PDB entry. We welcome your comments at validation@mail.wwpdb.org A user guide is available at https://www.wwpdb.org/validation/2017/xrayvalidationreporthelp with specific help available everywhere you see the i symbol. The following versions of software and data (see references i ) were used in the production of this report: MolProbity : 4.02b-467 Xtriage (Phenix) : 1.13 EDS : trunk30967 Percentile statistics : 20171227.v01 (using entries in the PDB archive December 27th 2017) Refmac : 5.8.0158 CCP4 : 7.0 (Gargrove) Ideal geometry (proteins) : Engh & Huber (2001) Ideal geometry (DNA, RNA) : Parkinson et al. (1996) Validation Pipeline (wwpdb-vp) : trunk30967

Page 2 Full wwpdb X-ray Structure Validation Report 1A30 1 Overall quality at a glance i The following experimental techniques were used to determine the structure: X-RAY DIFFRACTION The reported resolution of this entry is 2.00 Å. Percentile scores (ranging between 0-100) for global validation metrics of the entry are shown in the following graphic. The table shows the number of entries on which the scores are based. Metric Whole archive Similar resolution (#Entries) (#Entries, resolution range(å)) Clashscore 122126 8267 (2.00-2.00) Ramachandran outliers 120053 8166 (2.00-2.00) Sidechain outliers 120020 8165 (2.00-2.00) RSRZ outliers 108989 7011 (2.00-2.00) The table below summarises the geometric issues observed across the polymeric chains and their fit to the electron density. The red, orange, yellow and green segments on the lower bar indicate the fraction of residues that contain outliers for >=3, 2, 1 and 0 types of geometric quality criteria. A grey segment represents the fraction of residues that are not modelled. The numeric value for each fraction is indicated below the corresponding segment, with a dot representing fractions <=5% The upper red bar (where present) indicates the fraction of residues that have poor fit to the electron density. The numeric value is given above the bar. Mol Chain Length Quality of chain 1 A 99 1 B 99 2 C 3

Page 3 Full wwpdb X-ray Structure Validation Report 1A30 2 Entry composition i There are 3 unique types of molecules in this entry. The entry contains 1752 atoms, of which 0 are hydrogens and 0 are deuteriums. In the tables below, the ZeroOcc column contains the number of atoms modelled with zero occupancy, the AltConf column contains the number of residues with at least one atom in alternate conformation and the Trace column contains the number of residues modelled with at most 2 atoms. Molecule 1 is a protein called HIV-1 PROTEASE. Mol Chain Residues Atoms ZeroOcc AltConf Trace 1 A 99 Total C N O S 752 487 127 134 4 0 0 0 1 B 99 Total C N O S 758 490 130 134 4 0 0 0 There are 6 discrepancies between the modelled and reference sequences: Chain Residue Modelled Actual Comment Reference A 7 LYS GLN ENGINEERED UNP P04585 A 33 ILE LEU ENGINEERED UNP P04585 A 63 ILE LEU ENGINEERED UNP P04585 B 7 LYS GLN ENGINEERED UNP P04585 B 33 ILE LEU ENGINEERED UNP P04585 B 63 ILE LEU ENGINEERED UNP P04585 Molecule 2 is a protein called TRIPEPTIDE GLU-ASP-LEU. Mol Chain Residues Atoms ZeroOcc AltConf Trace 2 C 3 Total C N O 26 15 3 8 0 0 0 Molecule 3 is water. Mol Chain Residues Atoms ZeroOcc AltConf Total O 3 A 106 0 0 106 106 Total O 3 B 107 0 0 107 107 Total O 3 C 3 0 0 3 3

Page 4 Full wwpdb X-ray Structure Validation Report 1A30 3 Residue-property plots i These plots are drawn for all protein, RNA and DNA chains in the entry. The first graphic for a chain summarises the proportions of the various outlier classes displayed in the second graphic. The second graphic shows the sequence view annotated by issues in geometryand electron density. Residues are color-coded according to the number of geometric quality criteria for which they contain at least one outlier: green = 0, yellow = 1, orange = 2 and red = 3 or more. A red dot above a residue indicates a poor fit to the electron density (RSRZ > 2). Stretches of 2 or more consecutive residues without any outlier are shown as a green connector. Residues present in the sample, but not in the model, are shown in grey. Molecule 1: HIV-1 PROTEASE Chain A: P1 L10 I15 L23 D30 I50 I64 I85 G86 L90 N98 F99 Molecule 1: HIV-1 PROTEASE Chain B: P1 K14 Q18 L19 K20 G27 W42 G49 K55 V56 R57 I63 I64 E65 I66 C67 K70 V82 C95 N98 F99 Molecule 2: TRIPEPTIDE GLU-ASP-LEU Chain C: E506 D507 L508

Page 5 Full wwpdb X-ray Structure Validation Report 1A30 4 Data and refinement statistics i Property Value Source Space group P 21 21 2 Depositor Cell constants 58.47Å 86.44Å 45.81Å a, b, c, α, β, γ 90.00 90.00 90.00 Depositor Resolution (Å) 15.00 2.00 Depositor 33.28 2.00 EDS % Data completeness (in resolution range) 97.3 (15.00-2.00) 88.7 (33.28-2.00) Depositor EDS R merge 0.05 Depositor R sym 0.05 Depositor < I/σ(I) > 1 4.24 (at 2.00Å) Xtriage Refinement program X-PLOR 3.1 Depositor 0.189, 0.227 Depositor R, R free 0.204, (Not available) DCC R free test set No test flags present. wwpdb-vp Wilson B-factor (Å 2 ) 24.0 Xtriage Anisotropy 0.324 Xtriage Bulk solvent k sol (e/å 3 ), B sol (Å 2 ) 0.34, 54.4 EDS L-test for twinning 2 < L > = 0.48, < L 2 > = 0.31 Xtriage Estimated twinning fraction No twinning to report. Xtriage F o,f c correlation 0.95 EDS Total number of atoms 1752 wwpdb-vp Average B, all atoms (Å 2 ) 27.0 wwpdb-vp Xtriage s analysis on translational NCS is as follows: The largest off-origin peak in the Patterson function is 5.71% of the height of the origin peak. No significant pseudotranslation is detected. 1 Intensities estimated from amplitudes. 2 Theoretical values of < L >, < L 2 > for acentric reflections are 0.5, 0.333 respectively for untwinned datasets, and 0.375, 0.2 for perfectly twinned datasets.

Page 6 Full wwpdb X-ray Structure Validation Report 1A30 5 Model quality i 5.1 Standard geometry i The Z score for a bond length (or angle) is the number of standard deviations the observed value is removed from the expected value. A bond length (or angle) with Z > 5 is considered an outlier worth inspection. RMSZ is the root-mean-square of all Z scores of the bond lengths (or angles). Mol Chain Bond lengths Bond angles RMSZ # Z >5 RMSZ # Z >5 1 A 0.36 0/765 0.68 0/1035 1 B 0.36 0/771 0.65 0/1042 2 C 0.47 0/25 0.71 0/31 All All 0.36 0/1561 0.66 0/2108 There are no bond length outliers. There are no bond angle outliers. There are no chirality outliers. There are no planarity outliers. 5.2 Too-close contacts i In the following table, the Non-H and H(model) columns list the number of non-hydrogen atoms and hydrogen atoms in the chain respectively. The H(added) column lists the number of hydrogen atoms added and optimized by MolProbity. The Clashes column lists the number of clashes within the asymmetric unit, whereas Symm-Clashes lists symmetry related clashes. Mol Chain Non-H H(model) H(added) Clashes Symm-Clashes 1 A 752 0 800 9 0 1 B 758 0 811 13 0 2 C 26 0 20 2 0 3 A 106 0 0 1 0 3 B 107 0 0 1 0 3 C 3 0 0 0 0 All All 1752 0 1631 18 0 The all-atom clashscore is defined as the number of clashes found per 1000 atoms (including hydrogen atoms). The all-atom clashscore for this structure is 6. All (18) close contacts within the same asymmetric unit are listed below, sorted by their clash magnitude.

Page 7 Full wwpdb X-ray Structure Validation Report 1A30 Atom-1 Atom-2 Interatomic Clash distance (Å) overlap (Å) 1:B:63:ILE:HG23 1:B:70:LYS:HD2 1.76 0.67 1:A:85:ILE:HG22 1:A:90:LEU:HD13 1.86 0.57 1:B:82:VAL:HG22 2:C:508:LEU:HD13 1.91 0.52 1:B:63:ILE:CG2 1:B:70:LYS:HD2 2.39 0.52 1:B:42:TRP:CE2 1:B:57:ARG:HD2 2.49 0.48 1:B:14:LYS:HB3 1:B:65:GLU:HB3 1.97 0.46 1:A:99:PHE:HB3 1:B:95:CYS:HA 1.98 0.46 1:A:15:ILE:HD12 1:A:64:ILE:HD11 1.98 0.45 1:A:23:LEU:HD21 1:B:27:GLY:HA2 2.01 0.42 1:A:98:ASN:HD22 1:B:98:ASN:HD22 1.68 0.42 1:B:82:VAL:CG2 2:C:508:LEU:HD13 2.49 0.42 1:B:18:GLN:HE21 1:B:20:LYS:NZ 2.17 0.42 1:A:50:ILE:HB 1:B:49:GLY:HA3 2.00 0.42 1:B:55:LYS:HE3 3:B:1111:HOH:O 2.20 0.41 1:A:90:LEU:HA 1:A:90:LEU:HD12 1.95 0.41 1:A:30:ASP:HB3 3:A:1137:HOH:O 2.20 0.41 1:A:30:ASP:O 1:A:86:GLY:HA3 2.20 0.41 1:B:18:GLN:HG3 1:B:20:LYS:HZ2 1.85 0.40 There are no symmetry-related clashes. 5.3 Torsion angles i 5.3.1 Protein backbone i In the following table, the Percentiles column shows the percent Ramachandran outliers of the chain as a percentile score with respect to all X-ray entries followed by that with respect to entries of similar resolution. The Analysed column shows the number of residues for which the backbone conformation was analysed, and the total number of residues. Mol Chain Analysed Favoured Allowed Outliers Percentiles 1 A 97/99 (98%) 95 (98%) 2 (2%) 0 100 100 1 B 97/99 (98%) 97 (100%) 0 0 100 100 2 C 1/3 (33%) 1 (100%) 0 0 100 100 All All 195/201 (97%) 193 (99%) 2 (1%) 0 100 100 There are no Ramachandran outliers to report.

Page 8 Full wwpdb X-ray Structure Validation Report 1A30 5.3.2 Protein sidechains i In the following table, the Percentiles column shows the percent sidechain outliers of the chain as a percentile score with respect to all X-ray entries followed by that with respect to entries of similar resolution. The Analysed column shows the number of residues for which the sidechain conformation was analysed, and the total number of residues. Mol Chain Analysed Rotameric Outliers Percentiles 1 A 82/83 (99%) 80 (98%) 2 (2%) 52 54 1 B 83/83 (100%) 81 (98%) 2 (2%) 52 54 2 C 3/3 (100%) 3 (100%) 0 100 100 All All 168/169 (99%) 164 (98%) 4 (2%) 52 54 All (4) residues with a non-rotameric sidechain are listed below: Mol Chain Res Type 1 A 10 LEU 1 A 90 LEU 1 B 65 GLU 1 B 67 CYS Some sidechains can be flipped to improve hydrogen bonding and reduce clashes. All (4) such sidechains are listed below: Mol Chain Res Type 1 A 2 GLN 1 A 98 ASN 1 B 18 GLN 1 B 98 ASN 5.3.3 RNA i There are no RNA molecules in this entry. 5.4 Non-standard residues in protein, DNA, RNA chains i There are no non-standard protein/dna/rna residues in this entry.

Page 9 Full wwpdb X-ray Structure Validation Report 1A30 5.5 Carbohydrates i There are no carbohydrates in this entry. 5.6 Ligand geometry i There are no ligands in this entry. 5.7 Other polymers i There are no such residues in this entry. 5.8 Polymer linkage issues i There are no chain breaks in this entry.

Page 10 Full wwpdb X-ray Structure Validation Report 1A30 6 Fit of model and data i 6.1 Protein, DNA and RNA chains i In the following table, the column labelled #RSRZ> 2 contains the number (and percentage) of RSRZ outliers, followed by percent RSRZ outliers for the chain as percentile scores relative to all X-ray entries and entries of similar resolution. The OWAB column contains the minimum, median, 95 th percentile and maximum values of the occupancy-weighted average B-factor per residue. The column labelled Q< 0.9 lists the number of (and percentage) of residues with an average occupancy less than 0.9. Mol Chain Analysed <RSRZ> #RSRZ>2 OWAB(Å 2 ) Q<0.9 1 A 99/99 (100%) -0.36 0 100 100 15, 25, 42, 48 0 1 B 99/99 (100%) -0.49 0 100 100 13, 23, 39, 49 0 2 C 3/3 (100%) 1.54 1 (33%) 0 0 33, 33, 34, 52 1 (33%) All All 201/201 (100%) -0.40 1 (0%) 90 90 13, 24, 41, 52 1 (0%) All (1) RSRZ outliers are listed below: Mol Chain Res Type RSRZ 2 C 508 LEU 3.0 6.2 Non-standard residues in protein, DNA, RNA chains i There are no non-standard protein/dna/rna residues in this entry. 6.3 Carbohydrates i There are no carbohydrates in this entry. 6.4 Ligands i There are no ligands in this entry. 6.5 Other polymers i There are no such residues in this entry.