Molecular Modeling Lecture 11 side chain modeling rotamers rotamer explorer buried cavities.

Similar documents
Molecular Modeling -- Lecture 15 Surfaces and electrostatics

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Knowledge-based structure prediction of MHC class I bound peptides: a study of 23 complexes Ora Schueler-Furman 1,2, Ron Elber 2 and Hanah Margalit 1

Course Notes: Topics in Computational. Structural Biology.

HOMOLOGY MODELING. The sequence alignment and template structure are then used to produce a structural model of the target.

= (-22) = +2kJ /mol

Computational Protein Design

Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Secondary and sidechain structures

Supplementary Information

Problem Set 1

PROTEIN STRUCTURE AMINO ACIDS H R. Zwitterion (dipolar ion) CO 2 H. PEPTIDES Formal reactions showing formation of peptide bond by dehydration:

Abstract. Introduction

Molecular Modeling Lecture 7. Homology modeling insertions/deletions manual realignment

Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015,

Chem 728 Spring 2012 April 14, 2012

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Supporting Online Material for

Flexibility and Constraints in GOLD

Proteins polymer molecules, folded in complex structures. Konstantin Popov Department of Biochemistry and Biophysics

BIOC : Homework 1 Due 10/10

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

Docking. GBCB 5874: Problem Solving in GBCB

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Destruction of Amyloid Fibrils by Graphene through Penetration and Extraction of Peptides

Protein Structure Refinement Using 13 C α Chemical. Shift Tensors. Benjamin J. Wylie, Charles D. Schwieters, Eric Oldfield and Chad M.

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Structural Bioinformatics (C3210) Molecular Docking

Molecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007

Visualization of Macromolecular Structures

FlexPepDock In a nutshell

09/06/25. Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Non-uniform distribution of folds. Scheme of protein structure predicition

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Accurate prediction for atomic-level protein design and its application in diversifying the near-optimal sequence space

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues

Pymol Practial Guide

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Tutorial: Structural Analysis of a Protein-Protein Complex

Computational protein design

Lecture 34 Protein Unfolding Thermodynamics

Context of the project...3. What is protein design?...3. I The algorithms...3 A Dead-end elimination procedure...4. B Monte-Carlo simulation...

Packing of Secondary Structures

Protein sidechain conformer prediction: a test of the energy function Robert J Petrella 1, Themis Lazaridis 1 and Martin Karplus 1,2

Texas A&M University

Molecular Modeling lecture 2

Protein Bioinformatics Computer lab #1 Friday, April 11, 2008 Sean Prigge and Ingo Ruczinski

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

Structural study of fluorescence-quenching tryptophan residues in γ-crystallin. Technical Report, April 2007

Lecture 15: Enzymes & Kinetics. Mechanisms ROLE OF THE TRANSITION STATE. H-O-H + Cl - H-O δ- H Cl δ- HO - + H-Cl. Margaret A. Daugherty.

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction

CMPS 3110: Bioinformatics. Tertiary Structure Prediction

The protein folding problem consists of two parts:

Importance of chirality and reduced flexibility of protein side chains: A study with square and tetrahedral lattice models

Chem. 27 Section 1 Conformational Analysis Week of Feb. 6, TF: Walter E. Kowtoniuk Mallinckrodt 303 Liu Laboratory

Lattice protein models

Lecture 5: Electrostatic Interactions & Screening

Chapter 4. Glutamic Acid in Solution - Correlations

Life Sciences 1a Lecture Slides Set 10 Fall Prof. David R. Liu. Lecture Readings. Required: Lecture Notes McMurray p , O NH

Protein Synthesis. Unit 6 Goal: Students will be able to describe the processes of transcription and translation.

CAP 5510 Lecture 3 Protein Structures

Biomolecules: lecture 9

Tools for Cryo-EM Map Fitting. Paul Emsley MRC Laboratory of Molecular Biology

BCH 4053 Spring 2003 Chapter 6 Lecture Notes

Molecular Modeling 2018

Template Free Protein Structure Modeling Jianlin Cheng, PhD

Table S1. Overview of used PDZK1 constructs and their binding affinities to peptides. Related to figure 1.

DOCKING TUTORIAL. A. The docking Workflow

wwpdb X-ray Structure Validation Summary Report

Template Free Protein Structure Modeling Jianlin Cheng, PhD

Protein Synthesis. Unit 6 Goal: Students will be able to describe the processes of transcription and translation.

Physiochemical Properties of Residues

High Specificity and Reversibility

Lecture 1. Conformational Analysis in Acyclic Systems

De Novo Protein Structure Prediction

Physical Chemistry Final Take Home Fall 2003

Housekeeping. Housekeeping. Molecules of Life: Biopolymers

Supplementary Information

CS 273 Prof. Serafim Batzoglou Prof. Jean-Claude Latombe Spring Lecture 12 : Energy maintenance (1) Lecturer: Prof. J.C.

Introduction to Chemoinformatics and Drug Discovery

Step 1: Solute particles must separate from each other. Since energy must be absorbed to overcome the forces of attraction between solute particles,

Full wwpdb X-ray Structure Validation Report i

Alchemical free energy calculations in OpenMM

NMR, X-ray Diffraction, Protein Structure, and RasMol

SAM Teacher s Guide Protein Partnering and Function

Building small molecules

Full wwpdb X-ray Structure Validation Report i

Outline. Levels of Protein Structure. Primary (1 ) Structure. Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins

PDBe TUTORIAL. PDBePISA (Protein Interfaces, Surfaces and Assemblies)

Protein Structure Prediction

Solutions and Non-Covalent Binding Forces

Outline. Levels of Protein Structure. Primary (1 ) Structure. Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins

Computational Molecular Modeling

Electronic Supplementary Information (ESI) for Chem. Commun. Unveiling the three- dimensional structure of the green pigment of nitrite- cured meat

Full wwpdb X-ray Structure Validation Report i

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

Computational structural biology and bioinformatics

Full wwpdb X-ray Structure Validation Report i

LS1a Midterm Exam 1 Review Session Problems

Full wwpdb X-ray Structure Validation Report i

Proteins are not rigid structures: Protein dynamics, conformational variability, and thermodynamic stability

Transcription:

Molecular Modeling 218 Lecture 11 side chain modeling rotamers rotamer explorer buried cavities.

Sidechain Rotamers Discrete approximation of the continuous space of backbone angles. Sidechain conformations fall into descrete classes called rotational isomers, or rotamers. A random sampling of Phenylalanine sidechains, w/ backbone superimposed

Sidechain rotamers, naming 1-4 interactions differ greatly in energy depending on the moieties involved. N N N H CA CB CG H CA CB H CG CA CB H O=C H H O=C H O=C H H CG "m" "t" "p" -6 gauche 18 anti/trans +6 gauche

Rotamer stability is dependent on the backbone φψ angles W sidechain is shown here lying over Thr backbone Rotamers of W*: φ ψ P φ=-14,ψ=16 P φ=-6,ψ=-4 p-9 +6-9.372.79 p9 +6 +9.238.5 t-15 18-15.33.251 t9 18 9.21.268 m -65 5.38.124 m95-65 95.183.23

Rotamer Libraries Rotamer libraries have been compiled by clustering the sidechains of each amino acid over the whole database. Each cluster is a representative conformation (or rotamer), and is represented in the library by the best sidechain angles (chi angles), the "centroid" angles, for that cluster. Two commonly used rotamer libraries: *Jane & David Richardson: http://kinemage.biochem.duke.edu/databases/ rotamer.php Roland Dunbrack: http://dunbrack.fccc.edu/bbdep/index.php *rotamers of W on the previous page are from the Richardson library.

sidechain prediction Given the sequence and only the backbone atom coordinates, accurately model the positions of the sidechains. fine lines = true structure thick lines = sidechain predictions using the method of Desmet et al. Desmet et al, Nature v.356, pp339-342 (1992)

Dead end elimination theorem Each residue is numbered (i or j) and each residue has a set of rotamers (r, s or t). So, the notation i r means "choose rotamer r for position i". The total energy is the sum of the three components: fixed-fixed fixed-movable movable-movable E global = E template + Σ i E(i r ) + Σ i Σ j E(i r,j s ) where r and s are any choice of rotamers. NOTE: E global E GMEC for any choice of rotamers.

Dead end elimination theorem If i g is in the GMEC and i t is not, then we can separate the terms that contain i g or i t and re-write the inequality. E GMEC = E template + E(i g ) + Σ j E(i g,j g ) + Σ j E(j g ) + Σ j Σ k E(j g,k g )...is less than... E notgmec = E template + E(i t ) + Σ j E(i t,j g ) + Σ j E(j g ) + Σ j Σ k E(j g,k g ) Canceling all terms in black, we get: E(i r ) + Σ j E(i r j s ) > E(i g ) + Σ j E(i g,j s ) So, if we find two rotamers i r and i t, and: E(i r ) + Σ j min s E(i r j s ) > E(i t ) + Σ j max s E(i t,j s ) Then i r cannot possibly be in the GMEC.

Dead end elimination theorem E(i r ) + Σ j min s E(i r j s ) > E(i t ) + Σ j max s E(i t,j s ) DEE theorem can be translated into plain English as follows: If the "worst case scenario" for t is better than the "best case scenario" for r, then you always choose t.

DEE algorithm E global = E template + Σ i E(i r ) + Σ i Σ j E(i r,j s ) r 1 1 2 3 a b c a b c a b c 1-1 1 1 3 5 1 5 5-1 a -1 3 5 r 2 2 b 1 5 5 E(r 1,r 2 ) c 1 1-1 3 a b c a b c -2 2 5 5-1 12 4 5 3 1-2 2 5 5-1 1 12 5 4 3 5 12 E(r 2 ) a b c 1 x x x x x x 2 3 5 12 E(r 1 ) Find two columns (rotamers) within the same residue, where one is always better than the other. Eliminate the rotamer that can always be beat. (repeat until only 1 rotamer per residue)

r 1 1 2 3 a b c a b c a b c 1 a b c -1 1 1 3 5 1 5 5-1 -2 2 5 5-1 5 E(r 2 ) a -1 3 5 1 r 2 2 b c 1 5 5 1 1-1 E(r 1,r 2 ) 12 5 4 3 a -2 12 4 3 b c 2 5 5-1 5 3 1 12 5 12 E(r 1 )

DEE algorithm E global = E template + Σ i E(i r ) + Σ i Σ j E(i r,j s ) r 1 1 2 3 a b c a b c a b c 1-1 1 1 3 5 1 5 5-1 a -1 3 5 r 2 2 b 1 5 5 E(r 1,r 2 ) c 1 1-1 3 a b c a b c -2 2 5 5-1 12 4 5 3 1-2 2 5 5-1 1 12 5 4 3 5 12 E(r 2 ) a b c 1 2 3 5 12 E(r 1 ) Find two columns (rotamers) within the same residue, where one is always better than the other. Eliminate the rotamer that can always be beat. (repeat until only 1 rotamer per residue)

Rotamer explorer Pulls side chain rotamers from a rotamer library. Superposes on the backbone. Note difference in chi angles (side chain torsion angles) Only displays rotamers that have a good energy. Energy is calculated relative to the best rotamer. Not shown in Rotamer Explorer: Rotamers have intrinsic energies. Some are intrinsically better than others. 13

Try Rotamer explorer Study your structure. Find a side chain that is not "happy". Select it. Protein Rotamer explorer Get from MOE Explore click through results inspect. Mutate. Repeat. 14

Nature abhors a vacuum no, not this kind... There is only one way to make space empty, but many ways to fill it. S = p log p, where p is the number of states. Higher entropy means more probable. S= zero particles, 4 possible locations, one state S=2.44 S=4.66 two particles, 4 possible locations, 6 states S=2.44 S= 4 particles, 4 possible locations, one state

Solvent excluded surface complementarity. Complementary surfaces leave relatively little unfilled (void) space. White space within colored outlines are solvent excluded spaces in the monomer. White space in grey outline is solvent excluded in the dimer. No addition solvent excluded space means tight binding.

Better fit, less wasted space, lower energy. If we compare two surfacesurface interactions, all else being equal, the one with more void spaces has a higher free energy, and is therefore less favorable. The one that fills space has a lower energy. higher E lower E

The Hydrophobic Effect Solvent accessible surface (dashed line) around non-polar atoms contains "high energy waters" because those waters lose H-bonds. Non-polar atoms come together because it decreases the number of high energy waters. (Even at the cost of creating void space (brown).

Solvation A molecular surface is the border or interface between a molecule and its environment: where interesting things happen! Water that sits on the surface behaves differently than water that is surrounded by water. Therefore, surfaces can be used to model solvation. Surfaces properties: size/area electrostatic properties shape properties (convex, concave) bulk water protein hydration layer

Exercise 11.1 Finding voids Select an atom near the center of the model. Compute Surfaces & Maps Molecular surface Set transparency to allow you to see the atoms through the front surface Buried void Deep pocket

Exercise 11.2 Filling void by Rotamer Explorer Find a side chain that could potentially rotate into the buried cavity. Protein Rotamer explorer Get from MOE Explore. Click on each rotamer and look at where it goes. Mutate to the rotamer that best fills the void. Select the side chain and minimize around it. Select Extend, Edit potential unfix, Select invert, Edit potential fix Minimize Recalculate the surface. Name it something different. Toggle between before and after surfaces. Did the void disappear? Did another one appear? 21

Review questions What variables define side chain rotamers? What makes some rotamers intrinsically lower energy than others? What kinds of positions tend to have many allowed rotameric states? What kind of positions tend to have few allowed rotameric states? Which amino acids have only 1 rotamer? Which amino acids have the most rotamers? Why does nature abhor a vacuum? What is the solvent excluded surface? What is the energetic significance of the solvent interface? Where do you tend to find ordered waters? How can we fill voids? 22