ATP GTP Problem 2 mm.py

Size: px
Start display at page:

Download "ATP GTP Problem 2 mm.py"

Transcription

1 Problem 1 This problem will give you some experience with the Protein Data Bank (PDB), structure analysis, viewing and assessment and will bring up such issues as evolutionary conservation of function, structure and sequence. For a large portion of this problem you will need to use the molecular visualization program PyMOL. PyMOL is freely available for a number of different platforms (see A neat thing about PyMOL is that it has a python interpreter embedded into it, so you can write molecular visualization scripts using python! It also makes attractive images, which you may find useful in your future scientific career (or for class presentations). PyMOL is well documented on the web and you will find that answers to many questions about how to use it can be easily found with google. The official PyMOL User s Manual and Reference Manual can be found at There is also a PyMOL wiki that is a good source of commands and other PyMOL information: By using these resources, and just playing around with the program, you will quickly pick up the small number of commands required to do the following assignment. PyMOL is available on Athena. To use it type add pymol followed by pymol at the Athena prompt. Additionally, you can install it on your home machine. If you have problems using PyMOL, do not hesitate to contact your TAs. Premise for problem 1 You have set out to computationally re-design a novel signal-regulated kinase protein to serve as a convenient read-out for a particular signaling pathway (an ambitious task!). Computational protein design usually begins with the known structure of an existing protein, as you will see later in this course. Among native kinases, you have limited your search for an appropriate template to two candidates: - Human mitogen-activated protein kinase 2 (PDB ID 1S9I) - Rat mitogen-activated protein kinase ERK2 (PDB ID 1ERK) Assessment of Structure Quality a) Go to the Protein Data Bank website ( and navigate to the entries for the two proteins listed above. Several parameters that can be used to assess structure quality have been discussed in class. Some of these are: the resolution, the R value, R free and the average B-factor. For each of these parameters, explain what it reflects, give the relevant value for each structure, and indicate which of the two structures is better in terms of each parameter. Hint: you may find useful information in different places: e.g. on the front page of the PDB entry, in the PDB header itself, or in summary files generated by the PDB under the Structure Analysis:Geometry section on the left panel. b) Using the Procheck tool under Structure Analysis:Geometry, analyze the backbone conformation of each structure using the Ramachandran plot. Which structure has a better backbone geometry? (Give quantitative support for your answer.) Why are Pro and Gly treated specially in this analysis?

2 Structure-based Sequence Alignment c) Go to the Pairwise Comparison of Protein Structures portion of the Dali server ( Note, 1S9I is crystallized as a dimer (chains A and B), so you should try two alignments 1ERK with 1S9I chain A and 1ERK with 1S9I chain B. Enter the PDB codes of the proteins above and specify either chain A or B for 1S9I (in two separate runs) to align the structures. Pick the alignment with the highest Z-score. How many residues were successfully aligned? What is the RMSD in the aligned region? What fraction of amino acids in the alignment are identical between the two proteins? Do not discard the results page of this alignment, you will need it later. d) Download the files mol1_original.pdb and mol2_1.pdb generated by DALI for the alignment of 1ERK with 1S9I chain B. mol2_1.pdb is the second molecule you entered, rotated so that it is aligned with the first. Rename these two objects in PyMOL appropriately as 1S9I and 1ERK by clicking A next to the object and selecting rename (make sure you label them correctly!). Play around with hiding and showing different features (side chains, ribbons, cartoon, etc) as well as with coloring (under C to the right of object names) of the two molecules to assess the quality of the alignment. As you can deduce from its PDB entry, 1S9I has not yet been classified by SCOP. In your opinion, what is the lowest level in SCOP hierarchy that should be shared by 1ERK and 1S9I? e) Go to the Pairwise Alignment Algorithms section of the EMBOSS website ( and use the water tool (Smith-Waterman algorithm) to generate the best local alignment between chain B of 1S9I and 1ERK (get the FASTA sequences from the PDB). Use a BLOSSUM40 matrix, a gap penalty of 20 and leave all other parameters at their default values (empirically, this set of parameters produces good alignments for these proteins). Find the two largest gaps predicted by the Smith-Waterman approach and the two largest gaps assigned by Dali. Locate these gaps in the structures (use the same PyMOL session as in part (d) to simultaneously view the two structures aligned and use the sequence viewer to select different regions). Comment on the differences and similarities of placement for these gaps between Dali and Smith- Waterman. Comment on the appropriateness of the gaps given the structural alignment. Which of the two sequence alignments is more useful? Attach a printout of both alignments with the two gaps in question circled. f) As you may have noticed, 1S9I is co-crystallized with ATP. In the same PyMOL session as in part (d), type select LIG, resn ATP into the PyMOL command line to create a selection called LIG that consists of all atoms belonging to residues named ATP (resn stands for residue name ). How many atoms are selected (see output in command window)? What is the chemical formula for ATP? How many atoms are in ATP in total? Why is this count different from the PyMOL count? g) Type select AS, LIG around 4 in 1S9I into PyMOL command line to create a selection called AS that contains all atoms in chain B of 1S9I within 4 Å of

3 selection LIG. From the menu bar select Display:Sequence. Two rows of sequences (corresponding to the two molecules read) are shown. Residues with selected atoms are highlighted in the sequence for 1S9I. Report the selected residues and their indices (note, sequence indices correspond to the amino acid in the same column as the first digit of the index). h) Go back to the Dali results page and click on the link under the Structural Alignment column in the results table. Using this alignment, calculate the percent identity of residues in your selection AS and report this. Explain the sizeable discrepancy between the number you obtained here and the overall sequence identity of the alignment. Redesigning binding specificity: i) You realize that as part of your reengineering of a native kinase, you may want to change its binding preference from ATP to GTP (see structures below). Use PyMOL to analyze the ATP molecule bound to 1S9I. Out of the residues in selection AS, which 4 would you consider most likely candidates for mutations that change the binding specificity of your kinase. ATP GTP Problem 2 In this problem you will get practice parsing through a PDB file, identifying specific atoms and calculating several molecular mechanics energy terms. a) Write a python script called mm.py that accepts the name of a PDB file as the first command line argument, calculates C -C bond stretching energies for all residues (except glycine, which of course does not have a C atom) and van der Waals energies for all interactions between backbone amide N atoms and backbone carbonyl O atoms occurring between residues separated by at least 1 other residue. Your script should only print those bond stretching and van der Waals energies that are above the cutoff energy provided as the second command line argument. A suggested algorithm for this program is provided below. You can, if you want to, choose to use your own algorithm. Your program is going to

4 be tested based on its output. As before, please make sure you format your output exactly as shown in the sample run below. b) Download the structure of entry 1BBG from the PDB. Run your program on this structure with an energy cutoff of 0.1 kcal/mol. What can you say about the relative magnitudes of van der Waals versus bond-stretching energies? Does this make sense in terms of the equations below? Note: if you can t get your code to work, use the sample run below to answer this question. c) For 1BBG, do you see any pattern of residue index pairs for which van der Waals interactions exceed the cutoff? Is this pattern consistent with any particular structure type? Open 1BBG in PyMOL and examine the interactions that you detected to have van der Waals interactions above 0.1. Why would these atom pairs be closer to one another than the distance at which they have optimal van der Waals interactions (equilibrium distance)? Atom types, formulae and parameters for MM terms As you have seen in class, bond-stretching energies are calculated according to the formula: 1 U ( ) 2 bond = kb b bo, 2 where k b and b o are the force constant and equilibrium length for the given bond and b is the actual bond length. For C -C bonds that you have to consider in this assignment, k b is kcal/å 2 and b o is 1.53 Å. The formula you will use for calculating van der Waals interactions in this assignment is as follows: 12 6 Rmin Rmin U vdw = ε 2, rij rij where and R min are the energy well-depth and equilibrium distance for the given atom pair and r ij is the actual distance (in Angstroms) between the two atoms. For amide N and carbonyl O interactions that you have to consider in this problem, is kcal/mol and R min is 3.2 Å. In PDB files C, C, backbone N and backbone O atoms have atom names of CA, CB, N and O (see PDB file format specifications below). You can rely on this fact in your script (as does the algorithm below). Additionally, you can rely on the fact that if there is a CB atom in a residue, it always comes after a CA atom in that same residue. PDB file format In this problem we will only be concerned with parsing ATOM records of a PDB file (i.e. only lines starting with ATOM ). Each ATOM record is one line and contains information about one atom its name, its index in the structure, the name and the index of the residue it belongs to, its x, y, and z coordinates and its temperature factor, to name a few. Each of these values resides in a specific range of characters within the ATOM record string. For example, atom name is always found between characters 13 and 16

5 (inclusively) and residue name between characters 18 and 20 (inclusively). For the full specification of the ATOM record format, see Keep in mind that the ranges given assume that the first character has an index of 1, whereas in python indexing starts at 0, so in your program you have to adjust your character ranges accordingly. Below is an example ATOM record (in fixed-width font) along with a meter for your convenience. ATOM 14 2HB ASP H Suggested algorithm Algorithms are often communicated in terms of pseudo-code something half way between a plain human language and a computer language. Pseudo-code uses basic constructs that are common to most computer languages (such as variables, assignments, loops and functions) while at the same time abstracting away from any particular language thus allowing for the flexibility to easily explain most important aspects of an algorithm. Below is a suggested algorithm for solving this problem written in pseudocode. Comments are in plain text while code is in bold. process command-line arguments bind pdbf to the input PDB file name bind ecut to the specified energy cutoff Arrays Xn, Yn, and Zn will hold x, y, and z coordinates of backbone amide nitrogen (N) atoms in the order encountered in the structure. Arrays rnn and rin will hold residue names and indices of the corresponding atoms. Thus, Xn[i], Yn[i], Zn[i] will hold the coordinates of the i-th N atom in the structure and rnn[i] and rin[i] will hold the residue name and number of the residue to which this atom belongs. Arrays Xo, Yo, Zo, rno, and rio will hold the same information for backbone carbonyl oxygen (O) atoms. We will need this information for calculating distances and for producing understandable output. initialize empty arrays Xn, Yn, Zn, rnn, and rin initialize empty arrays Xo, Yo, Zo, rno, and rio for each line in file pdbf: if first 4 characters of the line are not ATOM : skip this line Given the specification of the PDB format above, accessing specific values on each line is a simple matter of looking at the right sub-string. However, make sure to use the strip() function (i.e. str = str.strip() ) to remove any trailing or leading white space characters from each value. bind atom name to variable an we are only interested in CA, CB, N and O atoms if atom name is neither of CA, CB, N or O : skip this line bind atom x-coordinate to variable x bind atom y-coordinate to variable y bind atom z-coordinate to variable z bind residue name to variable rn bind residue index to variable ri if we just found a CA atom, we need to remember this atom and update f to indicate that a CA has been seen in the current residue

6 if atom name is CA: remember the information about the 1 st atom in our CA-CB bond x1 = x y1 = y z1 = z rn1 = rn ri1 = ri f = 1 if atom name is CB, that means a previous CA had to have been encountered (because of their order in a PDB file). if atom name is CB: calculate the distance between CA-CB. bind b to the distance between [x1, y1, z1] [x, y, z] bind en to the CA-CB bond-stretching energy at distance b if en is grater than ecut: print Bond %s %3d CA - %s %3d CB %.2e % (rn1, ri1, rn, ri, en) if atom name is either N or O we need to store information about this atom for later calculation of N-O van der Waals energies if atom name is N : append arrays Xn, Yn, and Zn with x, y, and z respectively append arrays rnn and rin with rn ri respectively if atom name is O : append arrays Xo, Yo, and Zo with x, y, and z respectively append arrays rno and rio with rn ri respectively Now we are ready to iterate over all pairs of O-N atoms and calculate each interaction energy for each i-th backbone atom O and j-th backbone atom N: we only care about interactions non-local in sequence not those within the same residue or between neighboring residues use residue index to determine this if the two atoms are in the same residue or adjacent residues: skip this iteration use values in arrays Xn, Yn, Zn, Xo, Yo, Zo to calculate distance d = distance between the two atoms en = van der Waals energy of N-O interaction at distance d if en is grater than ecut: print vdw %s %2d O - %s %2d N %6.2f % (rno[i], rio[i], rnn[j], rin[j], en) Python Hints - In python, sub-strings (or slices of arbitrary arrays) can be easily accessed by specifying subscript ranges. Consider the following code snippet: str = "some long string of characters" print str print str[0:3] print str[12:18] print str[:18] print str[7:] print str[:]

7 This code produces: some long string of characters som ring o some long string o ng string of characters some long string of characters Notice several things: Just like with the range() function, the end index is treated exclusively. That is, if the ending index is 5, the character with index 5 is the first NOT to be included in the substring. Both the beginning and the ending indices are optional the beginning index defaults to 0 and ending index to the length of the string. - If you have read in a value as a string and would like to convert it into a float, use python s built-in function float(). The same works for integers and function int(). Consider the following code: s1 = " " s2 = "11" f = float(s1) i = int(s2) print f, i, f+1, i*i which produces: To calculate the distance between two points in 3D located at [x i, y i, z i ] and [x j, y j, z j ] we can apply the Pythagorean theorem twice to arrive at the formula: d = ( x x ) + ( y y ) 2 + ( z z ) 2 i 2 j i j i j - In order to calculate exponents in python, use the built-in function pow(). pow(x, y) calculates x y. Sample run The run below is done on the PDB entry with ID 1BBG. It is also posted on the class website as sample.run.txt. [gevorg@keating9 ps3]$ python mm.py 1BBG.pdb 0.1 Bond LYS 13 CA - LYS 13 CB 1.46e-01 Bond TYR 29 CA - TYR 29 CB 1.21e-01 Bond ILE 34 CA - ILE 34 CB 3.13e-01 Bond ASN 38 CA - ASN 38 CB 2.14e-01 vdw SER 31 O - ALA 33 N 3.60 vdw SER 31 O - ILE 34 N 27.63

8 vdw LYS 32 O - CYS 35 N 1.13 vdw ALA 33 O - CYS 35 N 0.23 vdw ALA 33 O - LYS 37 N 0.94 vdw ILE 34 O - CYS 39 N 0.26 vdw CYS 35 O - CYS 39 N 0.70 vdw ASN 38 O - THR 40 N 0.10

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Viewing and Analyzing Proteins, Ligands and their Complexes 2 2 Viewing and Analyzing Proteins, Ligands and their Complexes 2 Overview Viewing the accessible surface Analyzing the properties of proteins containing thousands of atoms is best accomplished by representing

More information

Protein Structure and Visualisation. Introduction to PDB and PyMOL

Protein Structure and Visualisation. Introduction to PDB and PyMOL Protein Structure and Visualisation Introduction to PDB and PyMOL 1 Feedback Persons http://www.bio-evaluering.dk/ 2 Program 8.00-8.15 Quiz results 8.15-8.50 Introduction to PDB & PyMOL 8.50-9.00 Break

More information

Protein Bioinformatics Computer lab #1 Friday, April 11, 2008 Sean Prigge and Ingo Ruczinski

Protein Bioinformatics Computer lab #1 Friday, April 11, 2008 Sean Prigge and Ingo Ruczinski Protein Bioinformatics 260.655 Computer lab #1 Friday, April 11, 2008 Sean Prigge and Ingo Ruczinski Goals: Approx. Time [1] Use the Protein Data Bank PDB website. 10 minutes [2] Use the WebMol Viewer.

More information

Let s continue our discussion on the interaction between Fe(III) and 6,7-dihydroxynaphthalene-2- sulfonate.

Let s continue our discussion on the interaction between Fe(III) and 6,7-dihydroxynaphthalene-2- sulfonate. Chemistry 5995(133)-8990(013) Bioinorganic Chemistry: The Good, the Bad, and the Potential of Metals Assignment 2- Aqueous Speciation, Magnetism, Redox, UV-Vis Spectroscopy, and Pymol Let s continue our

More information

NMR Assignments using NMRView II: Sequential Assignments

NMR Assignments using NMRView II: Sequential Assignments NMR Assignments using NMRView II: Sequential Assignments DO THE FOLLOWING, IF YOU HAVE NOT ALREADY DONE SO: For Mac OS X, you should have a subdirectory nmrview. At UGA this is /Users/bcmb8190/nmrview.

More information

CHEM 498Q / CHEM 630Q: Molecular Modeling of Proteins TUTORIAL #3a: Empirical force fields

CHEM 498Q / CHEM 630Q: Molecular Modeling of Proteins TUTORIAL #3a: Empirical force fields Department of Chemistry and Biochemistry, Concordia University! page 1 of 6 CHEM 498Q / CHEM 630Q: Molecular Modeling of Proteins TUTORIAL #3a: Empirical force fields INTRODUCTION The goal of this tutorial

More information

April, The energy functions include:

April, The energy functions include: REDUX A collection of Python scripts for torsion angle Monte Carlo protein molecular simulations and analysis The program is based on unified residue peptide model and is designed for more efficient exploration

More information

CHEM 498Q / CHEM 630Q: Molecular Modeling of Proteins TUTORIAL #3a: Empirical force fields

CHEM 498Q / CHEM 630Q: Molecular Modeling of Proteins TUTORIAL #3a: Empirical force fields Department of Chemistry and Biochemistry, Concordia University page 1 of 6 CHEM 498Q / CHEM 630Q: Molecular Modeling of Proteins TUTORIAL #3a: Empirical force fields INTRODUCTION The goal of this tutorial

More information

User Guide for LeDock

User Guide for LeDock User Guide for LeDock Hongtao Zhao, PhD Email: htzhao@lephar.com Website: www.lephar.com Copyright 2017 Hongtao Zhao. All rights reserved. Introduction LeDock is flexible small-molecule docking software,

More information

Comparing whole genomes

Comparing whole genomes BioNumerics Tutorial: Comparing whole genomes 1 Aim The Chromosome Comparison window in BioNumerics has been designed for large-scale comparison of sequences of unlimited length. In this tutorial you will

More information

Ligand Scout Tutorials

Ligand Scout Tutorials Ligand Scout Tutorials Step : Creating a pharmacophore from a protein-ligand complex. Type ke6 in the upper right area of the screen and press the button Download *+. The protein will be downloaded and

More information

Working with protein structures. Benjamin Jack

Working with protein structures. Benjamin Jack Working with protein structures Benjamin Jack Structure of Triosephosphate Isomerase PDB ID: 1HTI loop beta sheet alpha helix Different perspectives of the same structure Structure of Truncated Hemoglobin

More information

Introduction to Structure Preparation and Visualization

Introduction to Structure Preparation and Visualization Introduction to Structure Preparation and Visualization Created with: Release 2018-4 Prerequisites: Release 2018-2 or higher Access to the internet Categories: Molecular Visualization, Structure-Based

More information

Molecular Modeling Lecture 7. Homology modeling insertions/deletions manual realignment

Molecular Modeling Lecture 7. Homology modeling insertions/deletions manual realignment Molecular Modeling 2018-- Lecture 7 Homology modeling insertions/deletions manual realignment Homology modeling also called comparative modeling Sequences that have similar sequence have similar structure.

More information

Section III - Designing Models for 3D Printing

Section III - Designing Models for 3D Printing Section III - Designing Models for 3D Printing In this section of the Jmol Training Guide, you will become familiar with the commands needed to design a model that will be built on a 3D Printer. As you

More information

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Introduction to Comparative Protein Modeling. Chapter 4 Part I Introduction to Comparative Protein Modeling Chapter 4 Part I 1 Information on Proteins Each modeling study depends on the quality of the known experimental data. Basis of the model Search in the literature

More information

Packing of Secondary Structures

Packing of Secondary Structures 7.88 Lecture Notes - 4 7.24/7.88J/5.48J The Protein Folding and Human Disease Professor Gossard Retrieving, Viewing Protein Structures from the Protein Data Base Helix helix packing Packing of Secondary

More information

Build_model v User Guide

Build_model v User Guide Build_model v.2.0.1 User Guide MolTech Build_model User Guide 2008-2011 Molecular Technologies Ltd. www.moltech.ru Please send your comments and suggestions to contact@moltech.ru. Table of Contents Input

More information

LS1a Fall 2014 Problem Set #2 Due Monday 10/6 at 6 pm in the drop boxes on the Science Center 2 nd Floor

LS1a Fall 2014 Problem Set #2 Due Monday 10/6 at 6 pm in the drop boxes on the Science Center 2 nd Floor LS1a Fall 2014 Problem Set #2 Due Monday 10/6 at 6 pm in the drop boxes on the Science Center 2 nd Floor Note: Adequate space is given for each answer. Questions that require a brief explanation should

More information

Preparing a PDB File

Preparing a PDB File Figure 1: Schematic view of the ligand-binding domain from the vitamin D receptor (PDB file 1IE9). The crystallographic waters are shown as small spheres and the bound ligand is shown as a CPK model. HO

More information

Assignment 2 Atomic-Level Molecular Modeling

Assignment 2 Atomic-Level Molecular Modeling Assignment 2 Atomic-Level Molecular Modeling CS/BIOE/CME/BIOPHYS/BIOMEDIN 279 Due: November 3, 2016 at 3:00 PM The goal of this assignment is to understand the biological and computational aspects of macromolecular

More information

Molecular Visualization. Introduction

Molecular Visualization. Introduction Molecular Visualization Jeffry D. Madura Department of Chemistry & Biochemistry Center for Computational Sciences Duquesne University Introduction Assessments of change, dynamics, and cause and effect

More information

Moving into the information age: From records to Google Earth

Moving into the information age: From records to Google Earth Moving into the information age: From records to Google Earth David R. R. Smith Psychology, School of Life Sciences, University of Hull e-mail: davidsmith.butterflies@gmail.com Introduction Many of us

More information

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Programme 8.00-8.20 Last week s quiz results + Summary 8.20-9.00 Fold recognition 9.00-9.15 Break 9.15-11.20 Exercise: Modelling remote homologues 11.20-11.40 Summary & discussion 11.40-12.00 Quiz 1 Feedback

More information

Building small molecules

Building small molecules Building small molecules Use the Builder (right panel) to build up molecules. Start building clicking a fragment/atom in the builder and it will appear to the workspace. Continue modifying the molecule

More information

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version Document Published by the wwpdb

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version Document Published by the wwpdb Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description Version 3.30 Document Published by the wwpdb This format complies with the PDB Exchange Dictionary (PDBx) http://mmcif.pdb.org/dictionaries/mmcif_pdbx.dic/index/index.html.

More information

CHEM 463: Advanced Inorganic Chemistry Modeling Metalloproteins for Structural Analysis

CHEM 463: Advanced Inorganic Chemistry Modeling Metalloproteins for Structural Analysis CHEM 463: Advanced Inorganic Chemistry Modeling Metalloproteins for Structural Analysis Purpose: The purpose of this laboratory is to introduce some of the basic visualization and modeling tools for viewing

More information

Assignment A02: Geometry Definition: File Formats, Redundant Coordinates, PES Scans

Assignment A02: Geometry Definition: File Formats, Redundant Coordinates, PES Scans Assignment A02: Geometry Definition: File Formats, Redundant Coordinates, PES Scans In Assignments A00 and A01, you familiarized yourself with GaussView and G09W, you learned the basics about input (GJF)

More information

Introduction to Hartree-Fock calculations in Spartan

Introduction to Hartree-Fock calculations in Spartan EE5 in 2008 Hannes Jónsson Introduction to Hartree-Fock calculations in Spartan In this exercise, you will get to use state of the art software for carrying out calculations of wavefunctions for molecues,

More information

Creating a Pharmacophore Query from a Reference Molecule & Scaffold Hopping in CSD-CrossMiner

Creating a Pharmacophore Query from a Reference Molecule & Scaffold Hopping in CSD-CrossMiner Table of Contents Creating a Pharmacophore Query from a Reference Molecule & Scaffold Hopping in CSD-CrossMiner Introduction... 2 CSD-CrossMiner Terminology... 2 Overview of CSD-CrossMiner... 3 Features

More information

7.012 Problem Set 1. i) What are two main differences between prokaryotic cells and eukaryotic cells?

7.012 Problem Set 1. i) What are two main differences between prokaryotic cells and eukaryotic cells? ame 7.01 Problem Set 1 Section Question 1 a) What are the four major types of biological molecules discussed in lecture? Give one important function of each type of biological molecule in the cell? b)

More information

Dock Ligands from a 2D Molecule Sketch

Dock Ligands from a 2D Molecule Sketch Dock Ligands from a 2D Molecule Sketch March 31, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com

More information

APBS electrostatics in VMD - Software. APBS! >!Examples! >!Visualization! >! Contents

APBS electrostatics in VMD - Software. APBS! >!Examples! >!Visualization! >! Contents Software Search this site Home Announcements An update on mailing lists APBS 1.2.0 released APBS 1.2.1 released APBS 1.3 released New APBS 1.3 Windows Installer PDB2PQR 1.7.1 released PDB2PQR 1.8 released

More information

Tutorial. Getting started. Sample to Insight. March 31, 2016

Tutorial. Getting started. Sample to Insight. March 31, 2016 Getting started March 31, 2016 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com Getting started

More information

Introduction to Spark

Introduction to Spark 1 As you become familiar or continue to explore the Cresset technology and software applications, we encourage you to look through the user manual. This is accessible from the Help menu. However, don t

More information

Chemistry Problem Set #9 Due on Thursday 11/15/18 in class.

Chemistry Problem Set #9 Due on Thursday 11/15/18 in class. Chemistry 391 - Problem Set #9 Due on Thursday 11/15/18 in class. Name 1. There is a real enzyme called cocaine esterase that is produced in bacteria that live at the base of the coca plant. The enzyme

More information

Protein Fragment Search Program ver Overview: Contents:

Protein Fragment Search Program ver Overview: Contents: Protein Fragment Search Program ver 1.1.1 Developed by: BioPhysics Laboratory, Faculty of Life and Environmental Science, Shimane University 1060 Nishikawatsu-cho, Matsue-shi, Shimane, 690-8504, Japan

More information

Using Microsoft Excel

Using Microsoft Excel Using Microsoft Excel Objective: Students will gain familiarity with using Excel to record data, display data properly, use built-in formulae to do calculations, and plot and fit data with linear functions.

More information

Pymol Practial Guide

Pymol Practial Guide Pymol Practial Guide Pymol is a powerful visualizor very convenient to work with protein molecules. Its interface may seem complex at first, but you will see that with a little practice is simple and powerful

More information

Protein Structure Prediction

Protein Structure Prediction Page 1 Protein Structure Prediction Russ B. Altman BMI 214 CS 274 Protein Folding is different from structure prediction --Folding is concerned with the process of taking the 3D shape, usually based on

More information

Supporting Information

Supporting Information Supporting Information Micelle-Triggered b-hairpin to a-helix Transition in a 14-Residue Peptide from a Choline-Binding Repeat of the Pneumococcal Autolysin LytA HØctor Zamora-Carreras, [a] Beatriz Maestro,

More information

Create Satellite Image, Draw Maps

Create Satellite Image, Draw Maps Create Satellite Image, Draw Maps 1. The goal Using Google Earth, we want to create and import a background file into our Adviser program. From there, we will be creating paddock boundaries. The accuracy

More information

Lab 1: Empirical Energy Methods Due: 2/14/18

Lab 1: Empirical Energy Methods Due: 2/14/18 Lab 1: Empirical Energy Methods Due: 2/14/18 General remarks on scientific scripting Scientific scripting for managing the input and output data is an important component of modern materials computations,

More information

The Geodatabase Working with Spatial Analyst. Calculating Elevation and Slope Values for Forested Roads, Streams, and Stands.

The Geodatabase Working with Spatial Analyst. Calculating Elevation and Slope Values for Forested Roads, Streams, and Stands. GIS LAB 7 The Geodatabase Working with Spatial Analyst. Calculating Elevation and Slope Values for Forested Roads, Streams, and Stands. This lab will ask you to work with the Spatial Analyst extension.

More information

Flexibility and Constraints in GOLD

Flexibility and Constraints in GOLD Flexibility and Constraints in GOLD Version 2.1 August 2018 GOLD v5.6.3 Table of Contents Purpose of Docking... 3 GOLD s Evolutionary Algorithm... 4 GOLD and Hermes... 4 Handling Flexibility and Constraints

More information

Structural Alignment of Proteins

Structural Alignment of Proteins Goal Align protein structures Structural Alignment of Proteins 1 2 3 4 5 6 7 8 9 10 11 12 13 14 PHE ASP ILE CYS ARG LEU PRO GLY SER ALA GLU ALA VAL CYS PHE ASN VAL CYS ARG THR PRO --- --- --- GLU ALA ILE

More information

Computational Structural Biology and Molecular Simulation. Introduction to VMD Molecular Visualization and Analysis

Computational Structural Biology and Molecular Simulation. Introduction to VMD Molecular Visualization and Analysis Computational Structural Biology and Molecular Simulation Introduction to VMD Molecular Visualization and Analysis Emad Tajkhorshid Department of Biochemistry, Beckman Institute, Center for Computational

More information

Identifying Interaction Hot Spots with SuperStar

Identifying Interaction Hot Spots with SuperStar Identifying Interaction Hot Spots with SuperStar Version 1.0 November 2017 Table of Contents Identifying Interaction Hot Spots with SuperStar... 2 Case Study... 3 Introduction... 3 Generate SuperStar Maps

More information

Chapter 2 Structures. 2.1 Introduction Storing Protein Structures The PDB File Format

Chapter 2 Structures. 2.1 Introduction Storing Protein Structures The PDB File Format Chapter 2 Structures 2.1 Introduction The three-dimensional (3D) structure of a protein contains a lot of information on its function, and can be used for devising ways of modifying it (propose mutants,

More information

1 Introduction. command intended for command prompt

1 Introduction. command intended for command prompt Guest Lecture, Smith College, CS 334, BioInformatics 21 October 2008 GROMACS, Position Restrained MD, Protein Catalytic Activity Filip Jagodzinski 1 Introduction GROMACS (GROningen MAchine for Chemistry

More information

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE Examples of Protein Modeling Protein Modeling Visualization Examination of an experimental structure to gain insight about a research question Dynamics To examine the dynamics of protein structures To

More information

Nitrogenase MoFe protein from Clostridium pasteurianum at 1.08 Å resolution: comparison with the Azotobacter vinelandii MoFe protein

Nitrogenase MoFe protein from Clostridium pasteurianum at 1.08 Å resolution: comparison with the Azotobacter vinelandii MoFe protein Acta Cryst. (2015). D71, 274-282, doi:10.1107/s1399004714025243 Supporting information Volume 71 (2015) Supporting information for article: Nitrogenase MoFe protein from Clostridium pasteurianum at 1.08

More information

Tutorial: Structural Analysis of a Protein-Protein Complex

Tutorial: Structural Analysis of a Protein-Protein Complex Molecular Modeling Section (MMS) Department of Pharmaceutical and Pharmacological Sciences University of Padova Via Marzolo 5-35131 Padova (IT) @contact: stefano.moro@unipd.it Tutorial: Structural Analysis

More information

DISCRETE RANDOM VARIABLES EXCEL LAB #3

DISCRETE RANDOM VARIABLES EXCEL LAB #3 DISCRETE RANDOM VARIABLES EXCEL LAB #3 ECON/BUSN 180: Quantitative Methods for Economics and Business Department of Economics and Business Lake Forest College Lake Forest, IL 60045 Copyright, 2011 Overview

More information

Figure 1. Molecules geometries of 5021 and Each neutral group in CHARMM topology was grouped in dash circle.

Figure 1. Molecules geometries of 5021 and Each neutral group in CHARMM topology was grouped in dash circle. Project I Chemistry 8021, Spring 2005/2/23 This document was turned in by a student as a homework paper. 1. Methods First, the cartesian coordinates of 5021 and 8021 molecules (Fig. 1) are generated, in

More information

114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009

114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009 114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009 9 Protein tertiary structure Sources for this chapter, which are all recommended reading: D.W. Mount. Bioinformatics: Sequences and Genome

More information

Online Protein Structure Analysis with the Bio3D WebApp

Online Protein Structure Analysis with the Bio3D WebApp Online Protein Structure Analysis with the Bio3D WebApp Lars Skjærven, Shashank Jariwala & Barry J. Grant August 13, 2015 (updated November 17, 2016) Bio3D1 is an established R package for structural bioinformatics

More information

LAB 2 - ONE DIMENSIONAL MOTION

LAB 2 - ONE DIMENSIONAL MOTION Name Date Partners L02-1 LAB 2 - ONE DIMENSIONAL MOTION OBJECTIVES Slow and steady wins the race. Aesop s fable: The Hare and the Tortoise To learn how to use a motion detector and gain more familiarity

More information

NMR-Structure determination with the program CNS

NMR-Structure determination with the program CNS NMR-Structure determination with the program CNS Blockkurs 2013 Exercise 11.10.2013, room Mango? 1 NMR-Structure determination - Overview Amino acid sequence Topology file nef_seq.mtf loop cns_mtf_atom.id

More information

Tutorial 8 Raster Data Analysis

Tutorial 8 Raster Data Analysis Objectives Tutorial 8 Raster Data Analysis This tutorial is designed to introduce you to a basic set of raster-based analyses including: 1. Displaying Digital Elevation Model (DEM) 2. Slope calculations

More information

Performing a Pharmacophore Search using CSD-CrossMiner

Performing a Pharmacophore Search using CSD-CrossMiner Table of Contents Introduction... 2 CSD-CrossMiner Terminology... 2 Overview of CSD-CrossMiner... 3 Searching with a Pharmacophore... 4 Performing a Pharmacophore Search using CSD-CrossMiner Version 2.0

More information

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27 Acta Cryst. (2014). D70, doi:10.1107/s1399004714021695 Supporting information Volume 70 (2014) Supporting information for article: Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase

More information

Conformational Analysis of n-butane

Conformational Analysis of n-butane Conformational Analysis of n-butane In this exercise you will calculate the Molecular Mechanics (MM) single point energy of butane in various conformations with respect to internal rotation around the

More information

Ramachandran Plot. 4ysz Phi (degrees) Plot statistics

Ramachandran Plot. 4ysz Phi (degrees) Plot statistics B Ramachandran Plot ~b b 135 b ~b ~l l Psi (degrees) 5-5 a A ~a L - -135 SER HIS (F) 59 (G) SER (B) ~b b LYS ASP ASP 315 13 13 (A) (F) (B) LYS ALA ALA 315 173 (E) 173 (E)(A) ~p p ~b - -135 - -5 5 135 (degrees)

More information

3D - Structure Graphics Capabilities with PDF-4 Database Products

3D - Structure Graphics Capabilities with PDF-4 Database Products 3D - Structure Graphics Capabilities with PDF-4 Database Products Atomic Structure Information in the PDF-4 Databases ICDD s PDF-4 databases contain atomic structure information for a significant number

More information

Lab 1: Handout GULP: an Empirical energy code

Lab 1: Handout GULP: an Empirical energy code Lab 1: Handout GULP: an Empirical energy code We will be using the GULP code as our energy code. GULP is a program for performing a variety of types of simulations on 3D periodic solids, gas phase clusters,

More information

CityGML XFM Application Template Documentation. Bentley Map V8i (SELECTseries 2)

CityGML XFM Application Template Documentation. Bentley Map V8i (SELECTseries 2) CityGML XFM Application Template Documentation Bentley Map V8i (SELECTseries 2) Table of Contents Introduction to CityGML 1 CityGML XFM Application Template 2 Requirements 2 Finding Documentation 2 To

More information

DATE A DAtabase of TIM Barrel Enzymes

DATE A DAtabase of TIM Barrel Enzymes DATE A DAtabase of TIM Barrel Enzymes 2 2.1 Introduction.. 2.2 Objective and salient features of the database 2.2.1 Choice of the dataset.. 2.3 Statistical information on the database.. 2.4 Features....

More information

Course Notes: Topics in Computational. Structural Biology.

Course Notes: Topics in Computational. Structural Biology. Course Notes: Topics in Computational Structural Biology. Bruce R. Donald June, 2010 Copyright c 2012 Contents 11 Computational Protein Design 1 11.1 Introduction.........................................

More information

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1 Supporting information to: Time-resolved observation of protein allosteric communication Sebastian Buchenberg, Florian Sittel and Gerhard Stock Biomolecular Dynamics, Institute of Physics, Albert Ludwigs

More information

Supplementary Information Intrinsic Localized Modes in Proteins

Supplementary Information Intrinsic Localized Modes in Proteins Supplementary Information Intrinsic Localized Modes in Proteins Adrien Nicolaï 1,, Patrice Delarue and Patrick Senet, 1 Department of Physics, Applied Physics and Astronomy, Rensselaer Polytechnic Institute,

More information

ISIS/Draw "Quick Start"

ISIS/Draw Quick Start ISIS/Draw "Quick Start" Click to print, or click Drawing Molecules * Basic Strategy 5.1 * Drawing Structures with Template tools and template pages 5.2 * Drawing bonds and chains 5.3 * Drawing atoms 5.4

More information

PDBe TUTORIAL. PDBePISA (Protein Interfaces, Surfaces and Assemblies)

PDBe TUTORIAL. PDBePISA (Protein Interfaces, Surfaces and Assemblies) PDBe TUTORIAL PDBePISA (Protein Interfaces, Surfaces and Assemblies) http://pdbe.org/pisa/ This tutorial introduces the PDBePISA (PISA for short) service, which is a webbased interactive tool offered by

More information

Hands-on Course in Computational Structural Biology and Molecular Simulation BIOP590C/MCB590C. Course Details

Hands-on Course in Computational Structural Biology and Molecular Simulation BIOP590C/MCB590C. Course Details Hands-on Course in Computational Structural Biology and Molecular Simulation BIOP590C/MCB590C Emad Tajkhorshid Center for Computational Biology and Biophysics Email: emad@life.uiuc.edu or tajkhors@uiuc.edu

More information

IBiSA_tools Documentation

IBiSA_tools Documentation IBiSA_tools Documentation Release 1.00 Kota Kasahara Oct 24, 2016 CONTENTS 1 About 3 1.1 Authors.................................................. 3 1.2 Citation..................................................

More information

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013 Hydration of protein-rna recognition sites Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India 1 st November, 2013 Central Dogma of life DNA

More information

Electric Fields and Equipotentials

Electric Fields and Equipotentials OBJECTIVE Electric Fields and Equipotentials To study and describe the two-dimensional electric field. To map the location of the equipotential surfaces around charged electrodes. To study the relationship

More information

Lab 2 Worksheet. Problems. Problem 1: Geometry and Linear Equations

Lab 2 Worksheet. Problems. Problem 1: Geometry and Linear Equations Lab 2 Worksheet Problems Problem : Geometry and Linear Equations Linear algebra is, first and foremost, the study of systems of linear equations. You are going to encounter linear systems frequently in

More information

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two Supplementary Figure 1. Biopanningg and clone enrichment of Alphabody binders against human IL 23. Positive clones in i phage ELISA with optical density (OD) 3 times higher than background are shown for

More information

FlexPepDock In a nutshell

FlexPepDock In a nutshell FlexPepDock In a nutshell All Tutorial files are located in http://bit.ly/mxtakv FlexPepdock refinement Step 1 Step 3 - Refinement Step 4 - Selection of models Measure of fit FlexPepdock Ab-initio Step

More information

Pipelining Ligands in PHENIX: elbow and REEL

Pipelining Ligands in PHENIX: elbow and REEL Pipelining Ligands in PHENIX: elbow and REEL Nigel W. Moriarty Lawrence Berkeley National Laboratory Physical Biosciences Division Ligands in Crystallography Drug design Biological function studies Generate

More information

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine Supplementary figure 1. Comparison of unbound and as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine GM-CSF (slate) with bound GM-CSF in the GIF:GM-CSF complex (GIF: green,

More information

Molecular Mechanics. I. Quantum mechanical treatment of molecular systems

Molecular Mechanics. I. Quantum mechanical treatment of molecular systems Molecular Mechanics I. Quantum mechanical treatment of molecular systems The first principle approach for describing the properties of molecules, including proteins, involves quantum mechanics. For example,

More information

Bonds and Structural Supports

Bonds and Structural Supports Bonds and Structural Supports Part of the Jmol Training Guide from the MSOE Center for BioMolecular Modeling Interactive version available at http://cbm.msoe.edu/teachingresources/jmol/jmoltraining/struts.html

More information

Docking with Water in the Binding Site using GOLD

Docking with Water in the Binding Site using GOLD Docking with Water in the Binding Site using GOLD Version 2.0 November 2017 GOLD v5.6 Table of Contents Docking with Water in the Binding Site... 2 Case Study... 3 Introduction... 3 Provided Input Files...

More information

DOCKING TUTORIAL. A. The docking Workflow

DOCKING TUTORIAL. A. The docking Workflow 2 nd Strasbourg Summer School on Chemoinformatics VVF Obernai, France, 20-24 June 2010 E. Kellenberger DOCKING TUTORIAL A. The docking Workflow 1. Ligand preparation It consists in the standardization

More information

Lysozyme pka example - Software. APBS! >!Examples! >!pka calculations! >! Lysozyme pka example. Background

Lysozyme pka example - Software. APBS! >!Examples! >!pka calculations! >! Lysozyme pka example. Background Software Search this site Home Announcements An update on mailing lists APBS 1.2.0 released APBS 1.2.1 released APBS 1.3 released New APBS 1.3 Windows Installer PDB2PQR 1.7.1 released PDB2PQR 1.8 released

More information

Chemistry 14CL. Worksheet for the Molecular Modeling Workshop. (Revised FULL Version 2012 J.W. Pang) (Modified A. A. Russell)

Chemistry 14CL. Worksheet for the Molecular Modeling Workshop. (Revised FULL Version 2012 J.W. Pang) (Modified A. A. Russell) Chemistry 14CL Worksheet for the Molecular Modeling Workshop (Revised FULL Version 2012 J.W. Pang) (Modified A. A. Russell) Structure of the Molecular Modeling Assignment The molecular modeling assignment

More information

X-ray crystallography NMR Cryoelectron microscopy

X-ray crystallography NMR Cryoelectron microscopy Molecular Graphics with PyMOL Overview of: Protein Data Bank Coordinates Jean-Yves Sgro PyMOL interface Hands-on! Experimental Methods 3 Main: X-ray crystallography NMR Cryoelectron microscopy X-ray source

More information

SeeSAR 7.1 Beginners Guide. June 2017

SeeSAR 7.1 Beginners Guide. June 2017 SeeSAR 7.1 Beginners Guide June 2017 Part 1: Basics 1 Type a pdb code and press return or Load your own protein or already existing project, or Just load molecules To begin, let s type 2zff and download

More information

CycloBranch. Tutorials

CycloBranch. Tutorials CycloBranch Tutorials Outline Tutorial 1: Does a peptide have a cycle? Tutorial 2: How to determine a tag? Tutorial 3: How to determine a complete sequence? Tutorial 4: How to determine a branched sequence?

More information

Protein Structure Prediction and Display

Protein Structure Prediction and Display Protein Structure Prediction and Display Goal Take primary structure (sequence) and, using rules derived from known structures, predict the secondary structure that is most likely to be adopted by each

More information

FW 1 CDR 1 FW 2 CDR 2

FW 1 CDR 1 FW 2 CDR 2 Supplementary Figure 1 Supplementary Figure 1: Interface of the E9:Fas structure. The two interfaces formed by V H and V L of E9 with Fas are shown in stereo. The Fas receptor is represented as a surface

More information

Introduction Molecular Structure Script Console External resources Advanced topics. JMol tutorial. Giovanni Morelli.

Introduction Molecular Structure Script Console External resources Advanced topics. JMol tutorial. Giovanni Morelli. Gen 19th, 2017 1 2 Create and edit Display and view Mesurament and labelling Surface and Orbitals 3 4 from Database Protein Enzyme Crystal Structure and Unit Cell 5 Symmetry Animation General information

More information

Problem Set 3 September 25, 2009

Problem Set 3 September 25, 2009 September 25, 2009 General Instructions: 1. You are expected to state all your assumptions and provide step-by-step solutions to the numerical problems. Unless indicated otherwise, the computational problems

More information

Computational Protein Design

Computational Protein Design 11 Computational Protein Design This chapter introduces the automated protein design and experimental validation of a novel designed sequence, as described in Dahiyat and Mayo [1]. 11.1 Introduction Given

More information

Jaguar DFT Optimizations and Transition State Searches

Jaguar DFT Optimizations and Transition State Searches Jaguar DFT Optimizations and Transition State Searches Density Functional Theory (DFT) is a quantum mechanical (QM) method that gives results superior to Hartree Fock (HF) in less computational time. A

More information

Homework 9: Protein Folding & Simulated Annealing : Programming for Scientists Due: Thursday, April 14, 2016 at 11:59 PM

Homework 9: Protein Folding & Simulated Annealing : Programming for Scientists Due: Thursday, April 14, 2016 at 11:59 PM Homework 9: Protein Folding & Simulated Annealing 02-201: Programming for Scientists Due: Thursday, April 14, 2016 at 11:59 PM 1. Set up We re back to Go for this assignment. 1. Inside of your src directory,

More information

ECEN 651: Microprogrammed Control of Digital Systems Department of Electrical and Computer Engineering Texas A&M University

ECEN 651: Microprogrammed Control of Digital Systems Department of Electrical and Computer Engineering Texas A&M University ECEN 651: Microprogrammed Control of Digital Systems Department of Electrical and Computer Engineering Texas A&M University Prof. Mi Lu TA: Ehsan Rohani Laboratory Exercise #4 MIPS Assembly and Simulation

More information

BCB410 Protein-Ligand Docking Exercise Set Shirin Shahsavand December 11, 2011

BCB410 Protein-Ligand Docking Exercise Set Shirin Shahsavand December 11, 2011 BCB410 Protein-Ligand Docking Exercise Set Shirin Shahsavand December 11, 2011 1. Describe the search algorithm(s) AutoDock uses for solving protein-ligand docking problems. AutoDock uses 3 different approaches

More information