Protein Structure and Visualisation. Introduction to PDB and PyMOL

Similar documents
Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version Document Published by the wwpdb

X-ray crystallography NMR Cryoelectron microscopy

Appendices. Appendix I: PDB file format

Working with protein structures. Benjamin Jack

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Molecular Graphics with PyMOL

Pymol Practial Guide

April, The energy functions include:

QuartPAC: Identifying mutational clusters while utilizing protein quaternary structural data

Data File Formats. There are dozens of file formats for chemical data.

ATP GTP Problem 2 mm.py

Nitrogenase MoFe protein from Clostridium pasteurianum at 1.08 Å resolution: comparison with the Azotobacter vinelandii MoFe protein

Chapter 2 Structures. 2.1 Introduction Storing Protein Structures The PDB File Format

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues

Drug targets, Protein Structures and Crystallography

Computational Structural Biology and Molecular Simulation. Introduction to VMD Molecular Visualization and Analysis

Introduction to Structure Preparation and Visualization

VMD Tutorial Ho Chi Minh City, 12/01/2012 Emiliano Ippoliti:

Molecular Visualization. Introduction

3D Visualization of Drugs-Protein Complexes

Details of Protein Structure

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Let s continue our discussion on the interaction between Fe(III) and 6,7-dihydroxynaphthalene-2- sulfonate.

NMR Assignments using NMRView II: Sequential Assignments

MULTIDOCK V1.0. Richard M. Jackson Biomolecular Modelling Laboratory Imperial Cancer Research Fund 44 Lincoln s Inn Fields London WC2A 3PX

Proteins. Central Dogma : DNA RNA protein Amino acid polymers - defined composition & order. Perform nearly all cellular functions Drug Targets

Model Mélange. Physical Models of Peptides and Proteins

Preparing a PDB File

Supplementary Information Intrinsic Localized Modes in Proteins

Full wwpdb X-ray Structure Validation Report i

----- Ver October 24, 2014 Bug about reading MOPAC2012 Ver.14 calculations of 1 atom and 2 atoms molecule was fixed.

Section III - Designing Models for 3D Printing

Build_model v User Guide

Protein Structure Prediction and Display

Properties of amino acids in proteins

Packing of Secondary Structures

Central Dogma. modifications genome transcriptome proteome

Advanced Certificate in Principles in Protein Structure. You will be given a start time with your exam instructions

Protein Fragment Search Program ver Overview: Contents:

PROTEIN ORIGAMI - A PROGRAM FOR THE CREATION OF 3D PAPER MODELS OF FOLDED PEPTIDES

Molecular modeling with InsightII

Protein Structure Determination. Why Bother With Structure? Protein Sequences Far Outnumber Structures. Growth of Structural Data

Tutorial on molecular visualisation

Biological Macromolecules

Physiochemical Properties of Residues

Any protein that can be labelled by both procedures must be a transmembrane protein.

BIOINF527: STRUCTURAL BIOINFORMATICS LAB SESSION

Major Types of Association of Proteins with Cell Membranes. From Alberts et al

Other Methods for Generating Ions 1. MALDI matrix assisted laser desorption ionization MS 2. Spray ionization techniques 3. Fast atom bombardment 4.

X-Ray structure analysis

Hands-on pdb4dna.

Proteins: Characteristics and Properties of Amino Acids

Full wwpdb X-ray Structure Validation Report i

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

Flexibility and Constraints in GOLD

Ramachandran Plot. 4ysz Phi (degrees) Plot statistics

Protein Structure Determination. Why Bother With Structure? Protein Sequences Far Outnumber Structures

BIRKBECK COLLEGE (University of London)

Bacterial protease uses distinct thermodynamic signatures for substrate recognition

Protein Structure Determination. How are these structures determined?

Identifying Interaction Hot Spots with SuperStar

Exam I Answer Key: Summer 2006, Semester C

SUPPLEMENTARY INFORMATION

Tutorial. Getting started. Sample to Insight. March 31, 2016

Protein Bioinformatics Computer lab #1 Friday, April 11, 2008 Sean Prigge and Ingo Ruczinski

Hands-on Course in Computational Structural Biology and Molecular Simulation BIOP590C/MCB590C. Course Details

Translation. A ribosome, mrna, and trna.

Full wwpdb X-ray Structure Validation Report i

Supplementary Information. Broad Spectrum Anti-Influenza Agents by Inhibiting Self- Association of Matrix Protein 1

APBS electrostatics in VMD - Software. APBS! >!Examples! >!Visualization! >! Contents

Molecular Modeling Lecture 7. Homology modeling insertions/deletions manual realignment

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

Full wwpdb X-ray Structure Validation Report i

7.012 Problem Set 1. i) What are two main differences between prokaryotic cells and eukaryotic cells?

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

Protein Structure: Data Bases and Classification Ingo Ruczinski

Right click on the link and save the file on the disk (Save link target as...). Then execute this in the command window:

Tutorial: Structural Analysis of a Protein-Protein Complex

Full wwpdb X-ray Structure Validation Report i

Biochemistry Quiz Review 1I. 1. Of the 20 standard amino acids, only is not optically active. The reason is that its side chain.

Computational Molecular Modeling

Assignment A02: Geometry Definition: File Formats, Redundant Coordinates, PES Scans

Report of protein analysis

Esser et al. Crystal Structures of R. sphaeroides bc 1

Supplementary Figure 1. Aligned sequences of yeast IDH1 (top) and IDH2 (bottom) with isocitrate

Project Manual Bio3055. Apoptosis: Caspase-1

Structural Computational Biology: Introduction and Background *

HIV protease inhibitor. Certain level of function can be found without structure. But a structure is a key to understand the detailed mechanism.

Section II Understanding the Protein Data Bank

Silica surface - Materials Studio tutorial. CREATING SiO 2 SURFACE

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version 3.0, December 1, 2006 Updated to Version 3.

SUPPLEMENTARY INFORMATION

1. What is an ångstrom unit, and why is it used to describe molecular structures?

What makes a good graphene-binding peptide? Adsorption of amino acids and peptides at aqueous graphene interfaces: Electronic Supplementary

Introduction to the Ribosome Overview of protein synthesis on the ribosome Prof. Anders Liljas

Transcription:

Protein Structure and Visualisation Introduction to PDB and PyMOL 1

Feedback Persons http://www.bio-evaluering.dk/ 2

Program 8.00-8.15 Quiz results 8.15-8.50 Introduction to PDB & PyMOL 8.50-9.00 Break 9.00-11.00 PyMOL tutorial 11.00-11.15 Break 11.15-11.30 Summary 11.30-12.00 Quiz 3

Program 8.00-8.15 Quiz results 8.15-8.50 Introduction to PDB & PyMOL 8.50-9.00 Break 9.00-11.00 PyMOL tutorial 11.00-11.15 Break 11.15-11.30 Summary 11.30-12.00 Quiz 4

PDB Growth 1971-2011 71138 February 2011 5

Protein Data Bank http://www.rcsb.org/ Contents File structure Types of structures Structure reports & summaries Quality check Searching Molecule of the Month 6

PDB File Header HEADER IMMUNE SYSTEM 16-AUG-04 1X7Q TITLE CRYSTAL STRUCTURE OF HLA-A*1101 WITH SARS NUCLEOCAPSID TITLE 2 PEPTIDE COMPND MOL_ID: 1; COMPND 2 MOLECULE: HLA CLASS I HISTOCOMPATIBILITY ANTIGEN, A-11 REMARK 3 DATA USED IN REFINEMENT. REMARK 3 RESOLUTION RANGE HIGH (ANGSTROMS) : 1.45 REMARK 3 RESOLUTION RANGE LOW (ANGSTROMS) : 30.00 SEQRES 1 C 9 LYS THR PHE PRO PRO THR GLU PRO LYS HET SO4 B4001 5 HETNAM SO4 SULFATE ION HELIX 1 1 ALA A 49 GLU A 53 5 5 CRYST1 58.465 80.623 56.570 90.00 116.79 90.00 P 1 21 1 2 7

PDB File Atom Coordinates ATOM 1 N THR A 1 25.200 26.068 37.670 1.00 25.43 N ATOM 2 CA THR A 1 26.443 26.547 37.135 1.00 16.70 C ATOM 3 C THR A 1 27.568 25.589 37.431 1.00 13.12 C ATOM 4 O THR A 1 27.577 25.073 38.554 1.00 15.92 O ATOM 5 CB THR A 1 26.745 27.891 37.843 1.00 20.41 C ATOM 6 OG1 THR A 1 25.564 28.674 37.550 1.00 26.40 O ATOM 7 CG2 THR A 1 27.995 28.594 37.359 1.00 22.25 C HETATM 3284 O3 GOL B5007 23.356 27.459 10.492 1.00 29.71 O HETATM 3285 O HOH A5007 19.520 54.389 40.841 1.00 10.97 O ATOM 1 N GLY A 1 9.360 23.828 42.512 1.00 45.34 N ANISOU 1 N GLY A 1 6582 3670 6975 2086 1854 1681 N ATOM 2 CA GLY A 1 8.104 24.285 41.984 1.00 33.31 C ANISOU 2 CA GLY A 1 5450 1898 5309-137 1446 394 C ATOM 3 C GLY A 1 7.740 25.713 42.380 1.00 28.54 C ANISOU 3 C GLY A 1 4079 1849 4915-6 728 561 C

PDB File Fields COLUMNS DATA TYPE FIELD DEFINITION ----------------------------------------------------------- 1 6 Record name "ATOM" 7-11 Integer serial Atom serial number. 13 16 Atom name Atom name. 17 Character altloc Alternate location indicator. 18 20 Residue name resname Residue name. 22 Character chainid Chain identifier. 23 26 Integer resseq Residue sequence number. 27 AChar icode Code for insertion of residues. 31 38 Real(8.3) x Orthogonal coordinates for X in Angstroms 39 46 Real(8.3) y Orthogonal coordinates for Y in Angstroms 47 54 Real(8.3) z Orthogonal coordinates for Z in Angstroms 55 60 Real(6.2) occupancy Occupancy. 61 66 Real(6.2) tempfactor Temperature factor. 77 78 LString(2) element Element symbol, right-justified. 79 80 LString(2) charge Charge on the atom.

Worldwide Structural Genomics Fold space coverage " Complete genomes Disease-causing organisms Model organisms" Membrane proteins" Protein-ligand interactions Hou et al., PNAS 2003, 100: 2386-2390 "

Protein Folds in PDB 0 200 400 600 800 1000 1200 1400 1600 1972 1973 1974 1975 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 New folds Total folds

Protein Structure Visualisation PyMOL 12

What is PyMOL? Open-source molecular viewing program http://www.pymol.org

Benefits It s free! For academia not for industry. Version 0.99rc6 Users can always compile the latest version. But you should contribute! Pay to get support, manual, latest version etc.

Potential Weaknesses Few! Not a fully integrated modelling environment. Not fully developed for experimental structure determination/fitting. Mostly for qualitative analyses. No undo function

PyMOL Representations Lines, sticks, ribbon, spheres, cartoon(s) Surfaces Transparency, quality Ray-tracing (rendering) Modes 16

Selections & Objects Every molecule (pdb file) is an object. Selections refer to objects Make smaller or composite objects Changes in representation can affect objects or selections.

Links PDB (protein structure database) www.pdb.org/ PyMOL home: http://pymol.sourceforge.net/ PyMOL manual: http://pymol.sourceforge.net/newman/user/toc.html PyMOL Wiki: http://www.pymolwiki.org/index.php/main_page PyMOL settings (documented): http://cluster.earlham.edu/detail/bazaar/software/pymol/ modules/pymol/setting.py

Installation Windows Mac OS X Special behaviour Rename to get plugin menu! MacPyMOL MacPyMOLX11Hybrid Linux

PyMOL scripting

Scripts PyMOL is based on the programming language Python. PyMOL will read python commands. PyMOL also has a set of native commands. See PyMOLWiki and elsewhere for examples. http://www.rubor.de/bioinf/tips_python.html http://pldserver1.biochem.queensu.ca/~rlc/work/ pymol/

Automating Tasks Scripts are useful for automating tasks Repeated analyses Reading in large numbers of structures Making illustrations! get_view set_view viewport Saves space! Relative to PyMOL session files (.pse)

A Useful Example Using PyMOL commands: list=[]! iterate (name ca),list.append((resi,resn))! print list!! [('ASP', '1'), ('CYS', '2'), ('ALA', '3'), ('TRP', '4'), ('HIS', '5'), ('LEU',! '6'), ('GLY', '7'), ('GLU', '8'), ('LEU', '9'), ('VAL', '10'), ('TRP', '11'),! ('CYS', '12'), ('THR', '13')]!! or using a Python script (in PyMOL):!! from pymol import cmd,stored! stored.list=[]! cmd.iterate("(name ca)","stored.list.append((resi,resn))")! print stored.list!! [('1', 'ASP'), ('2', 'CYS'), ('3', 'ALA'), ('4', 'TRP'), ('5', 'HIS'), ('6', '! LEU'), ('7', 'GLY'), ('8', 'GLU'), ('9', 'LEU'), ('10', 'VAL'), ('11', 'TRP'),! ('12', 'CYS'), ('13', 'THR')]

color_b.py To colour proteins according to the values of the B-factor column (can be replaced with any desired value). http://www.mcgnmr.ca/protskin/ Syntax: (see script header) color_b (c;a or c;b), mode=ramp, gradient=bwr, nbins=30, sat=0.5, value=1 to color chains A and B with the Blue-White-Red gradient in 30 colors of equal numbers of atoms in each color.

PyMOL Log Function Records all actions as commands in a PyMOL session GUI clicks Written commands Needs to be opened at start of session Is only written when closed (File Close log)

Example 1 rrna Large subunit: Catalytic mechanism Protein mrna Small subunit: Specificity 26

Example 2 27 René Magritte

Aesthetics of Molecular Images Personal taste General guidelines: Focus on relevant parts Strip away unnecessary information Find good viewing angle! Choice of representation No need for excessive graphics Good figures are (almost) self-explanatory. 28

Molecular Images Molecular graphics is an example of selective reduction of complexity. 29

Molecular Images Molecular graphics is an example of selective reduction of complexity. 30

Molecular Images Molecular graphics is an example of selective reduction of complexity. 31

Molecular Images Molecular graphics is an example of selective reduction of complexity. 32

Molecular Images Molecular graphics is an example of selective reduction of complexity. 33

Molecular Images Molecular graphics is an example of selective reduction of complexity. 34

Molecular Images Molecular graphics is an example of selective reduction of complexity. 35

Molecular Images Molecular graphics is an example of selective reduction of complexity. 36

Molecular Images Molecular graphics is an example of selective reduction of complexity. 37

Figures in PyMOL 38

Representations ray_trace_mode 0 39

Representations ray_trace_mode 1 40

Representations ray_trace_mode 2 41

Representations ray_trace_mode 3 42

Representations 43

Representations 44

Representations 45

Representations Standard stickrepresentation Sphere_scale = 0.3 With hydrogen atoms With shadows 46

Representations Standard stickrepresentation Sphere_scale = 0.3 With hydrogen atoms No shadows 47

Representations Standard stickrepresentation Sphere_scale = 0.3 With hydrogen atoms No shadows With depth cue 48

Representations Standard stickrepresentation Sphere_scale = 0.3 With hydrogen atoms No shadows With depth cue Larger image 49

Representations Standard stickrepresentation Sphere_scale = 0.3 With hydrogen atoms No shadows With depth cue Even larger image 50

Focus on the Relevant Parts Homologous peptides in different proteins Intuitive colouring 51

Residue Conservation http://www.mcgnmr.ca/protskin/ 52

T-Cell Receptor Interactions I II III 53

Cross-Sections Proteasome 54

Electrostatic Surface Potentials 55

Composite Images MHC-I binding groove Images combined in Adobe Illustrator. C F B A E D 56

Simplified Composite Images C N P 2 57

Cross-Sections 58

Comparing Active Sites 59

Compiled Graphics Objects 60

PyMOL Links PyMOL home: http://www.pymol.org/ PyMOL manual: http://pymol.sourceforge.net/newman/user/ toc.html PyMOL Wiki: http://www.pymolwiki.org/index.php/main_page PyMOL settings (documented): http://www.pymolwiki.org/index.php/settings 61

Program 8.00-8.15 Quiz results 8.15-8.50 Introduction to PDB & PyMOL 8.50-9.00 Break 9.00-11.00 PyMOL tutorial 11.00-11.15 Break 11.15-11.30 Summary 11.30-12.00 Quiz 62

Program 8.00-8.15 Quiz results 8.15-8.50 Introduction to PDB & PyMOL 8.50-9.00 Break 9.00-11.00 PyMOL tutorial 11.00-11.15 Break 11.15-11.30 Summary 11.30-12.00 Quiz 63

Program 8.00-8.15 Quiz results 8.15-8.50 Introduction to PDB & PyMOL 8.50-9.00 Break 9.00-11.00 PyMOL tutorial 11.00-11.15 Break 11.15-11.30 Summary 11.30-12.00 Quiz 64