Optimisation & Landscape Analysis for Studying Protein Folding. Roy L. Johnston School of Chemistry University of Birmingham

Size: px
Start display at page:

Download "Optimisation & Landscape Analysis for Studying Protein Folding. Roy L. Johnston School of Chemistry University of Birmingham"

Transcription

1 Optimisation & Landscape Analysis for Studying Protein Folding Roy L. Johnston School of Chemistry University of Birmingham Bridging the Gap Workshop: Dynamic Optimisation University of Birmingham, 24 February 2011

2 Overview Introduction The Protein Folding Problem Protein Models Genetic Algorithms HP Lattice Bead Model Dynamical Lattice Model Energy Landscapes for Protein Folding BLN Model Principal Component Analysis Landscape Complexity Conclusions

3 The Protein Folding Problem To predict the 3D local spatial arrangement (secondary structure) and folded conformation (tertiary structure) of a protein from knowledge of its primary structure the 1D sequence of amino acid residues Why Study Protein Folding? To rationalise and predict the relationship between sequence, 3D structure and function. To understand the effect of mutations on protein structure and function. To understand protein folding dynamics e.g. in order to understand protein misfolding diseases (Alzheimers, CJD etc).

4 Search Methods in Protein Folding Even for the minimalist HP lattice bead model, global optimization is NP-hard. Search methods adopted include: Monte Carlo Simulated Annealing Chain Growth Algorithms Genetic Algorithms (Unger & Moult) Ant Colony Optimization (Hoos) Immune Algorithms (Cutello, Timmis et al.)

5 Protein Models Bead Models minimalist models each amino acid is represented by a bead, usually based on their hydrophobic or hydrophilic nature e.g. HP and BLN models beads may be constrained to a lattice or may be off-lattice. United Atom Models with backbone and side chain beads e.g. Dynamical Lattice Model. All-Atom Models full atomistic treatment of protein e.g. CHarMM, AMBER.

6 The HP Lattice Bead Model Amino acids are classed as either Hydrophobic (H) or Polar (P). Each amino acid is represented as a hard sphere ( bead ) on a lattice (e.g. 2-D square and 3-D diamond lattices). Interactions occur between beads which are adjacent on the lattice (topological neighbours) but are not directly bonded (sequence neighbours). P H Standard HP Model HH = 1 HP = PP = 0 E i j ε ij ij ij = 1 i and j are topological neighbours (but not sequence neighbours) ij = 0 otherwise.

7 Protein Folding GA: HP Model Local coordinate scheme: Conformation vector c = { } Sequence vector s = {HPPHHP } Initial valid conformations generated using Recoil Growth Algorithm. Invalid Structure (superimposed beads) Dead End Structure (no further growth possible) Fitness simply related to energy of the conformation: F i = E i Roulette wheel & Brood selection. 1-point crossover. Variety of mutation operators. Monte Carlo local search. Diversity checking no duplicate structures allowed.

8 Crossover 1-point crossover leads to higher GA success rates (and fewer structure evaluations) than 2-pt. 1-pt crossover is better at maintaining schemata (good regions of local structure).

9 Mutation Corrector operator introduced to repair invalid structures generated by mutation. (Sequential 1-bit changes.)

10 Benchmark Sequences Name E (GM) Sequence % Success HP-20 9 HPHP 2 H 2 PHP 2 HPH 2 P 2 HPH 99.5 HP-24 9 H 2 P 2 (HP 2 ) 6 H HP-25 8 P 2 HP 2 (H 2 P 4 ) 3 H HP P 3 H 2 P 2 H 2 P 5 H 7 P 2 H 2 P 4 H 2 P 2 HP HP P 2 H(P 2 H 2 ) 2 P 5 H 10 P 6 (H 2 P 2 ) 2 HP 2 H 5 HP H 2 (PH) 3 PH 4 P(HP 3 ) 3 P(HP 3 ) 2 HPH 4 (PH) 4 H 200 GA runs. Parameters: X-over = 1.0, mutation = 0.5, elitism = 30%. Structures sampled capped at 60,000.

11 Local Search Modified GA Introduce long range Monte Carlo move operator to allow local searching around each offspring and mutant. Conformation c 1 (energy = E 1 ) undergoes random fold mutation (changing one bit in conformation vector): c 1 (E 1 ) c 2 (E 2 ) E 2 < E 1 accept move. E 2 > E 1 accept move with probability p = E 2 /15E 1 30 attempted MC steps = 1 local search. Brood Selection More than 2 offspring generated from a selected pair of parents. The best 2 offspring replace the parents. Allows wider exploration of crossover space around the two parents. Optimum brood size = 5.

12 Comparison with Previous GA This Work Unger & Moult Sequence E(GM) D(GM) %Success N eval E (GM) N eval HP , ,492 HP , ,491 HP , ,400 HP , ,339 HP , ,547 HP , , GA runs. Parameters: X-over = 1.0, mutation = 0.1, elitism = 30%, DPL = 1, local search, brood size = 5. Maximum generations = 100. G.A. Cox, T. V. Mortimer-Jones, R. P. Taylor, RLJ, Theor. Chem. Acc. 112, (2004).

13 Example Global Minima for Benchmark Sequences HP-20 HP-24 HP-25 HP-36 HP-48 HP-50

14 Dynamical Lattice Model* Amino acid residues have preferred conformations determined by backbone angles and. Dynamical lattice = discrete but non-regular grid. From cluster analysis of Ramachandran plots, certain allowed (, ) pairs are defined for each residue. * F. Dressel, S. Kobe, Chem. Phys. Lett. 424, (2006).

15 Dynamical Lattice Model Amino acid residues treated explicitly. Beads for all backbone atoms (N, C, C). R Hard sphere beads for side chains (R). Energy obtained by summing interactions between C beads. E ij e E n T E ij i, j tanh 8. 0 r ij e e i, j e S,i e S, j Graham Cox; S. Kobe, F. Dressel (Dresden)

16 Parameters Dynamical Lattice Model * * From cluster analysis

17 Dynamical Lattice Model Gene coding (e.g. cysteine) Code (, ) GA Parameters Population 200 Structure limit 500,000 X-over 1.0 Mutation 0.1 Elitism 30% Local search 20% No duplicates allowed Repair invalid structures (hard sphere overlap)

18 n Results (400 GA runs) * = GM from Branch & Bound Search * * * * * * * 1AL1 (right-handed -helix, n = 13) XELLKKLLEELKG PDB GA 1A1P (Compstatin, n =14) ICVVQDWGHHRCTX PDB GA GM(B+B) =

19 Energy Energy Landscapes The EL determines kinetics and thermodynamics of e.g. clusters, liquids, glasses and biomolecules. Determines ability of system to find the global minimum energy and of search methods to find the GM. Examples: Potential Energy Surfaces Free Energy Surfaces (as function of T) Multidimensional surfaces are difficult to visualise. Consider connected network of minima and transition states: Eigenvector following, successive confinement,parallel tempering, and nudged elastic band methods.

20 Representing Energy Landscapes Disconnectivity Graph (DG) approach (Hoffman, Sibani, Schoen, Becker & Karplus, Berry et al., Wales et al.) allows visualisation of the connectivity of high-dimensional PES (e.g. for proteins, clusters, spin glasses). E E BUT the x-coordinate has no meaning. Can more physically meaningful coordinates be obtained? Metric Disconnectivity Graphs (MDGs): Reproducible placement of superbasins. Separation of superbasins reflects structural difference. Thickness of line represents size of superbasin.

21 Principal Component Analysis (PCA) Perform a linear transformation of coordinates of energy minima and transition states (1 st rank saddles). z D2 D1 Identify the principal components coordinates that maximise the variance of the system. y x PCA finds orthogonal lines of best fit through a data set. These lines of best fit are used as coordinates to re-plot the data. D2 This analysis can be used to show and visualise trends in multi-dimensional data. D1

22 PCA for Proteins There are several ways to represent the structure of a protein: (,, ) dihedral angles (x,y,z) Cartesian co-ordinates of atoms We have (mostly) used (x,y,z) co-ordinates, with translations and rotations removed.

23 Energy Energy Energy PCA-based Disconnectivity Graphs PCA can be combined with DGs to produce MDGs in which the x and y axes are used to display structural information. Structures are grouped into superbasins that are mutually accessible without passing through a transition state with energy > E max. The MDG is produced using the average coordinate of all members of a superbasin to place the node. The number of superbasins and the connectivity is assessed at intervals E sep. The thickness of the lines can be used to represent the number of structures or the structural diversity within the superbasin. Structural diversity = number of dimensions needed to reproduce (say) 99% (SD 0.99 ) of the information (variance) within a superbasin. V X 1 Q 1

24 The Off-Lattice BLN Bead Model 3 Types of bead: HydrophoBic (B), HydrophiLic (L) and Neutral (N). Off-lattice model has (bond r) stretching, (angle ) bending, torsional ( ) and through space (Lennard- Jones) components. Gauche Anti Gauche

25 46-Bead BLN Model Global minimum is a 4-strand -barrel. Frustrated PES. Inefficient folding. Gō Model All non-native attractive contacts removed. Single-funnel PES with same GM as BLN. Efficient folding.

26 PCA for the 46-bead BLN Model 1 st PC (Q 1 ) contains approx. 30% of total variance. 1 st + 2 nd PCs (Q 1,Q 2 ) contain approx. 45% of total variance. 2D and 3D disconnectivity graphs can be plotted against Q 1 and Q 2 Line thickness related to structural diversity within a superbasin. T. Komatsuzaki, K. Hoshino, Y. Matsunaga, G.J. Rylance, RLJ, D.J. Wales, J. Chem. Phys. 122, (2005).

27 BLN Go 2D 3D

28 3D Disconnectivity Graphs: Go vs. BLN Go BLN

29 Dihedral Angles 46 Bead Go model Disconnectivity graph based on 43 dihedrals A B A B

30 Landscape Complexity How can we quantify the complexity of an energy landscape? Residential probability (p r ): probability of being located in a given superbasin at a certain energy. Branching probability (p b ): probability of taking a particular path to a given superbasin compared to all possible paths, leading from parent node. Landscape complexity (C L ): Shannon entropy of residual probabilities: Path complexity (C P, ): Shannon entropy of branching probabilities: C C L V P, i V i p r p log b p log r p b

31 Complexity: Go vs. BLN Go Landscape Complexity BLN GO BLN C L = C L =

32 Conclusions Nature-inspired Computation has proven to be successful in searching for low energy folds for simple model proteins. Disconnectivity Graphs give information about the connectivity and important energy barriers on energy (and other) landscapes. The Dynamical Lattice Model shows promise as an intermediate between simple bead models and all-atom models. PCA-based visualisation and complexity analysis of protein folding landscapes allows us to explain difficulties encountered by global optimisation algorithms in certain cases and may aid the design of more robust search algorithms.

33 Acknowledgments Birmingham Dr Gareth Rylance Dr Graham Cox Dr Ben Curley Dr Lesley Lloyd Dr Andrew Bennett Dr Jun He (now Aberystwyth) Eleanor Turpin Funding EPSRC The Royal Society Wellcome VIP Scheme JSPS Leverhulme Trust University of Birmingham BlueBEAR External Prof. Sigismund Kobe (Dresden) Prof. Tamiki Komatsuzaki (Kobe) Prof. David Wales (Cambridge) Prof. Martin Karplus & Dr Paul Maragakis (Harvard) Prof. Said Salhi (Kent)

34 Cambridge Daan Frenkel, David Wales & Mark Miller Oxford Jon Doye Birmingham Roy Johnston, Mark Oakley Methods New coarse-grained potentials. Analysis of potential energy, free energy and other landscapes. New hybrid search algorithms. Dynamical and thermodynamic simulations. Investigation of hierarchical selfassembly. Example Systems Proteins; DNA, RNA; Liquid crystals Simulation of Self-Assembly (Programme Grant EP/I001352)

Topographical complexity of multidimensional energy landscapes

Topographical complexity of multidimensional energy landscapes Topographical complexity of multidimensional energy landscapes Gareth J. Rylance, Roy L. Johnston, Yasuhiro Matsunaga, Chun-Biu Li, Akinori Baba, and Tamiki Komatsuzaki PNAS published online Nov 28, 2006;

More information

Energy Landscape and Global Optimization for a Frustrated Model Protein

Energy Landscape and Global Optimization for a Frustrated Model Protein pubs.acs.org/jpcb Energy Landscape and Global Optimization for a Frustrated Model Protein Mark T. Oakley, David J. Wales, and Roy L. Johnston*, School of Chemistry, University of Birmingham, Edgbaston,

More information

Analyzing energy landscapes for folding model proteins

Analyzing energy landscapes for folding model proteins THE JOURNAL OF CHEMICAL PHYSICS 124, 204714 2006 Analyzing energy landscapes for folding model proteins Graham A. Cox and Roy L. Johnston a School of Chemistry, University of Birmingham, Edgbaston, Birmingham

More information

Optimization and Frustration:

Optimization and Frustration: Optimization and Frustration: The Dynamical Lattice Model of Proteins Sigismund Kobe and Frank Dressel Institut für Theoretische Physik, Technische Universität Dresden, D 01062 Dresden, Germany (http://www.physik.tu-dresden.de/~fdressel/dlm-main.html)

More information

The protein folding problem consists of two parts:

The protein folding problem consists of two parts: Energetics and kinetics of protein folding The protein folding problem consists of two parts: 1)Creating a stable, well-defined structure that is significantly more stable than all other possible structures.

More information

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC The precise definition of a dihedral or torsion angle can be found in spatial geometry Angle between to planes Dihedral

More information

arxiv: v1 [cond-mat.soft] 22 Oct 2007

arxiv: v1 [cond-mat.soft] 22 Oct 2007 Conformational Transitions of Heteropolymers arxiv:0710.4095v1 [cond-mat.soft] 22 Oct 2007 Michael Bachmann and Wolfhard Janke Institut für Theoretische Physik, Universität Leipzig, Augustusplatz 10/11,

More information

CMPS 3110: Bioinformatics. Tertiary Structure Prediction

CMPS 3110: Bioinformatics. Tertiary Structure Prediction CMPS 3110: Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the laws of physics! Conformation space is finite

More information

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction CMPS 6630: Introduction to Computational Biology and Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the

More information

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding?

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding? The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation By Jun Shimada and Eugine Shaknovich Bill Hawse Dr. Bahar Elisa Sandvik and Mehrdad Safavian Outline Background on protein

More information

Frustration and optimization:

Frustration and optimization: Frustration and optimization: What magnetic materials and proteins have in common Sigismund Kobe Institut für Theoretische Physik, Technische Universität Dresden, D 01062 Dresden, Germany Outline» Ising

More information

Protein Structures. 11/19/2002 Lecture 24 1

Protein Structures. 11/19/2002 Lecture 24 1 Protein Structures 11/19/2002 Lecture 24 1 All 3 figures are cartoons of an amino acid residue. 11/19/2002 Lecture 24 2 Peptide bonds in chains of residues 11/19/2002 Lecture 24 3 Angles φ and ψ in the

More information

Introduction to" Protein Structure

Introduction to Protein Structure Introduction to" Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Learning Objectives Outline the basic levels of protein structure.

More information

Molecular Modelling. part of Bioinformatik von RNA- und Proteinstrukturen. Sonja Prohaska. Leipzig, SS Computational EvoDevo University Leipzig

Molecular Modelling. part of Bioinformatik von RNA- und Proteinstrukturen. Sonja Prohaska. Leipzig, SS Computational EvoDevo University Leipzig part of Bioinformatik von RNA- und Proteinstrukturen Computational EvoDevo University Leipzig Leipzig, SS 2011 Protein Structure levels or organization Primary structure: sequence of amino acids (from

More information

Ab-initio protein structure prediction

Ab-initio protein structure prediction Ab-initio protein structure prediction Jaroslaw Pillardy Computational Biology Service Unit Cornell Theory Center, Cornell University Ithaca, NY USA Methods for predicting protein structure 1. Homology

More information

Global Optimisation of Hydrated Sulfate Clusters

Global Optimisation of Hydrated Sulfate Clusters Global Optimisation of Hydrated Sulfate Clusters Lewis Smeeton University of Birmingham School of Chemistry 15 th December, 2014 1 / 31 1 Introduction The Hofmeister Series Hydrated Sulfate Clusters 2

More information

Analysis of Crossover Operators for Cluster Geometry Optimization

Analysis of Crossover Operators for Cluster Geometry Optimization Analysis of Crossover Operators for Cluster Geometry Optimization Francisco B. Pereira Instituto Superior de Engenharia de Coimbra Portugal Abstract We study the effectiveness of different crossover operators

More information

Computer Simulation of Peptide Adsorption

Computer Simulation of Peptide Adsorption Computer Simulation of Peptide Adsorption M P Allen Department of Physics University of Warwick Leipzig, 28 November 2013 1 Peptides Leipzig, 28 November 2013 Outline Lattice Peptide Monte Carlo 1 Lattice

More information

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed.

Many proteins spontaneously refold into native form in vitro with high fidelity and high speed. Macromolecular Processes 20. Protein Folding Composed of 50 500 amino acids linked in 1D sequence by the polypeptide backbone The amino acid physical and chemical properties of the 20 amino acids dictate

More information

Protein structure optimization with a "Lamarckian" ant colony algorithm Oakley, Mark; Richardson, E Grace; Carr, Harriet; Johnston, Roy

Protein structure optimization with a Lamarckian ant colony algorithm Oakley, Mark; Richardson, E Grace; Carr, Harriet; Johnston, Roy Protein structure optimization with a "Lamarckian" ant colony algorithm Oakley, Mark; Richardson, E Grace; Carr, Harriet; Johnston, Roy DOI: 10.1109/TCBB.2013.125 License: Other (please specify with Rights

More information

Exploring the energy landscape

Exploring the energy landscape Exploring the energy landscape ChE210D Today's lecture: what are general features of the potential energy surface and how can we locate and characterize minima on it Derivatives of the potential energy

More information

Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University

Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University Department of Chemical Engineering Program of Applied and

More information

Simulation of mutation: Influence of a side group on global minimum structure and dynamics of a protein model

Simulation of mutation: Influence of a side group on global minimum structure and dynamics of a protein model JOURNAL OF CHEMICAL PHYSICS VOLUME 111, NUMBER 8 22 AUGUST 1999 Simulation of mutation: Influence of a side group on global minimum structure and dynamics of a protein model Benjamin Vekhter and R. Stephen

More information

Polypeptide Folding Using Monte Carlo Sampling, Concerted Rotation, and Continuum Solvation

Polypeptide Folding Using Monte Carlo Sampling, Concerted Rotation, and Continuum Solvation Polypeptide Folding Using Monte Carlo Sampling, Concerted Rotation, and Continuum Solvation Jakob P. Ulmschneider and William L. Jorgensen J.A.C.S. 2004, 126, 1849-1857 Presented by Laura L. Thomas and

More information

Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover

Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Tomoyuki Hiroyasu, Mitsunori Miki, Shinya Ogura, Keiko Aoi, Takeshi Yoshida, Yuko Okamoto Jack Dongarra

More information

Protein Folding. I. Characteristics of proteins. C α

Protein Folding. I. Characteristics of proteins. C α I. Characteristics of proteins Protein Folding 1. Proteins are one of the most important molecules of life. They perform numerous functions, from storing oxygen in tissues or transporting it in a blood

More information

Energy landscapes of model polyalanines

Energy landscapes of model polyalanines JOURNAL OF CHEMICAL PHYSICS VOLUME 117, NUMBER 3 15 JULY 2002 Energy landscapes of model polyalanines Paul N. Mortenson, David A. Evans, and David J. Wales University Chemical Laboratories, Lensfield Road,

More information

Finding Ground States of SK Spin Glasses with hboa and GAs

Finding Ground States of SK Spin Glasses with hboa and GAs Finding Ground States of Sherrington-Kirkpatrick Spin Glasses with hboa and GAs Martin Pelikan, Helmut G. Katzgraber, & Sigismund Kobe Missouri Estimation of Distribution Algorithms Laboratory (MEDAL)

More information

Introduction to Computational Structural Biology

Introduction to Computational Structural Biology Introduction to Computational Structural Biology Part I 1. Introduction The disciplinary character of Computational Structural Biology The mathematical background required and the topics covered Bibliography

More information

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Introduction to Comparative Protein Modeling. Chapter 4 Part I Introduction to Comparative Protein Modeling Chapter 4 Part I 1 Information on Proteins Each modeling study depends on the quality of the known experimental data. Basis of the model Search in the literature

More information

Protein structure (and biomolecular structure more generally) CS/CME/BioE/Biophys/BMI 279 Sept. 28 and Oct. 3, 2017 Ron Dror

Protein structure (and biomolecular structure more generally) CS/CME/BioE/Biophys/BMI 279 Sept. 28 and Oct. 3, 2017 Ron Dror Protein structure (and biomolecular structure more generally) CS/CME/BioE/Biophys/BMI 279 Sept. 28 and Oct. 3, 2017 Ron Dror Please interrupt if you have questions, and especially if you re confused! Assignment

More information

3D HP Protein Folding Problem using Ant Algorithm

3D HP Protein Folding Problem using Ant Algorithm 3D HP Protein Folding Problem using Ant Algorithm Fidanova S. Institute of Parallel Processing BAS 25A Acad. G. Bonchev Str., 1113 Sofia, Bulgaria Phone: +359 2 979 66 42 E-mail: stefka@parallel.bas.bg

More information

Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar.

Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar. Why Proteins Fold? (Parts of this presentation are based on work of Ashok Kolaskar) CS490B: Introduction to Bioinformatics Mar. 25, 2002 Molecular Dynamics: Introduction At physiological conditions, the

More information

Protein Folding by Robotics

Protein Folding by Robotics Protein Folding by Robotics 1 TBI Winterseminar 2006 February 21, 2006 Protein Folding by Robotics 1 TBI Winterseminar 2006 February 21, 2006 Protein Folding by Robotics Probabilistic Roadmap Planning

More information

arxiv:cond-mat/ v1 [cond-mat.soft] 23 Mar 2007

arxiv:cond-mat/ v1 [cond-mat.soft] 23 Mar 2007 The structure of the free energy surface of coarse-grained off-lattice protein models arxiv:cond-mat/0703606v1 [cond-mat.soft] 23 Mar 2007 Ethem Aktürk and Handan Arkin Hacettepe University, Department

More information

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror Protein structure prediction CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror 1 Outline Why predict protein structure? Can we use (pure) physics-based methods? Knowledge-based methods Two major

More information

StructuralBiology. October 18, 2018

StructuralBiology. October 18, 2018 StructuralBiology October 18, 2018 1 Lecture 18: Computational Structural Biology CBIO (CSCI) 4835/6835: Introduction to Computational Biology 1.1 Overview and Objectives Even though we re officially moving

More information

Folding of small proteins using a single continuous potential

Folding of small proteins using a single continuous potential JOURNAL OF CHEMICAL PHYSICS VOLUME 120, NUMBER 17 1 MAY 2004 Folding of small proteins using a single continuous potential Seung-Yeon Kim School of Computational Sciences, Korea Institute for Advanced

More information

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy Design of a Novel Globular Protein Fold with Atomic-Level Accuracy Brian Kuhlman, Gautam Dantas, Gregory C. Ireton, Gabriele Varani, Barry L. Stoddard, David Baker Presented by Kate Stafford 4 May 05 Protein

More information

Supporting Online Material for

Supporting Online Material for www.sciencemag.org/cgi/content/full/309/5742/1868/dc1 Supporting Online Material for Toward High-Resolution de Novo Structure Prediction for Small Proteins Philip Bradley, Kira M. S. Misura, David Baker*

More information

PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS

PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS TASKQUARTERLYvol.20,No4,2016,pp.353 360 PROTEIN-PROTEIN DOCKING REFINEMENT USING RESTRAINT MOLECULAR DYNAMICS SIMULATIONS MARTIN ZACHARIAS Physics Department T38, Technical University of Munich James-Franck-Str.

More information

Orientational degeneracy in the presence of one alignment tensor.

Orientational degeneracy in the presence of one alignment tensor. Orientational degeneracy in the presence of one alignment tensor. Rotation about the x, y and z axes can be performed in the aligned mode of the program to examine the four degenerate orientations of two

More information

From Amino Acids to Proteins - in 4 Easy Steps

From Amino Acids to Proteins - in 4 Easy Steps From Amino Acids to Proteins - in 4 Easy Steps Although protein structure appears to be overwhelmingly complex, you can provide your students with a basic understanding of how proteins fold by focusing

More information

Papers listed: Cell2. This weeks papers. Chapt 4. Protein structure and function. The importance of proteins

Papers listed: Cell2. This weeks papers. Chapt 4. Protein structure and function. The importance of proteins 1 Papers listed: Cell2 During the semester I will speak of information from several papers. For many of them you will not be required to read these papers, however, you can do so for the fun of it (and

More information

Biophysics II. Key points to be covered. Molecule and chemical bonding. Life: based on materials. Molecule and chemical bonding

Biophysics II. Key points to be covered. Molecule and chemical bonding. Life: based on materials. Molecule and chemical bonding Biophysics II Life: based on materials By A/Prof. Xiang Yang Liu Biophysics & Micro/nanostructures Lab Department of Physics, NUS Organelle, formed by a variety of molecules: protein, DNA, RNA, polysaccharides

More information

Computer simulations of protein folding with a small number of distance restraints

Computer simulations of protein folding with a small number of distance restraints Vol. 49 No. 3/2002 683 692 QUARTERLY Computer simulations of protein folding with a small number of distance restraints Andrzej Sikorski 1, Andrzej Kolinski 1,2 and Jeffrey Skolnick 2 1 Department of Chemistry,

More information

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror Protein structure prediction CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror 1 Outline Why predict protein structure? Can we use (pure) physics-based methods? Knowledge-based methods Two major

More information

Free Radical-Initiated Unfolding of Peptide Secondary Structure Elements

Free Radical-Initiated Unfolding of Peptide Secondary Structure Elements Free Radical-Initiated Unfolding of Peptide Secondary Structure Elements Thesis of the Ph.D. Dissertation by Michael C. Owen, M.Sc. Department of Chemical Informatics Faculty of Education University of

More information

The Molecular Dynamics Method

The Molecular Dynamics Method The Molecular Dynamics Method Thermal motion of a lipid bilayer Water permeation through channels Selective sugar transport Potential Energy (hyper)surface What is Force? Energy U(x) F = d dx U(x) Conformation

More information

Aggregation of the Amyloid-β Protein: Monte Carlo Optimization Study

Aggregation of the Amyloid-β Protein: Monte Carlo Optimization Study John von Neumann Institute for Computing Aggregation of the Amyloid-β Protein: Monte Carlo Optimization Study S. M. Gopal, K. V. Klenin, W. Wenzel published in From Computational Biophysics to Systems

More information

Short Announcements. 1 st Quiz today: 15 minutes. Homework 3: Due next Wednesday.

Short Announcements. 1 st Quiz today: 15 minutes. Homework 3: Due next Wednesday. Short Announcements 1 st Quiz today: 15 minutes Homework 3: Due next Wednesday. Next Lecture, on Visualizing Molecular Dynamics (VMD) by Klaus Schulten Today s Lecture: Protein Folding, Misfolding, Aggregation

More information

Multi-Scale Hierarchical Structure Prediction of Helical Transmembrane Proteins

Multi-Scale Hierarchical Structure Prediction of Helical Transmembrane Proteins Multi-Scale Hierarchical Structure Prediction of Helical Transmembrane Proteins Zhong Chen Dept. of Biochemistry and Molecular Biology University of Georgia, Athens, GA 30602 Email: zc@csbl.bmb.uga.edu

More information

STRUCTURAL BIOINFORMATICS. Barry Grant University of Michigan

STRUCTURAL BIOINFORMATICS. Barry Grant University of Michigan STRUCTURAL BIOINFORMATICS Barry Grant University of Michigan www.thegrantlab.org bjgrant@umich.edu Bergen, Norway 28-Sep-2015 Objective: Provide an introduction to the practice of structural bioinformatics,

More information

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche The molecular structure of a protein can be broken down hierarchically. The primary structure of a protein is simply its

More information

ONETEP PB/SA: Application to G-Quadruplex DNA Stability. Danny Cole

ONETEP PB/SA: Application to G-Quadruplex DNA Stability. Danny Cole ONETEP PB/SA: Application to G-Quadruplex DNA Stability Danny Cole Introduction Historical overview of structure and free energy calculation of complex molecules using molecular mechanics and continuum

More information

Protein Structure. Hierarchy of Protein Structure. Tertiary structure. independently stable structural unit. includes disulfide bonds

Protein Structure. Hierarchy of Protein Structure. Tertiary structure. independently stable structural unit. includes disulfide bonds Protein Structure Hierarchy of Protein Structure 2 3 Structural element Primary structure Secondary structure Super-secondary structure Domain Tertiary structure Quaternary structure Description amino

More information

Simulating Folding of Helical Proteins with Coarse Grained Models

Simulating Folding of Helical Proteins with Coarse Grained Models 366 Progress of Theoretical Physics Supplement No. 138, 2000 Simulating Folding of Helical Proteins with Coarse Grained Models Shoji Takada Department of Chemistry, Kobe University, Kobe 657-8501, Japan

More information

Structural biomathematics: an overview of molecular simulations and protein structure prediction

Structural biomathematics: an overview of molecular simulations and protein structure prediction : an overview of molecular simulations and protein structure prediction Figure: Parc de Recerca Biomèdica de Barcelona (PRBB). Contents 1 A Glance at Structural Biology 2 3 1 A Glance at Structural Biology

More information

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University COMP 598 Advanced Computational Biology Methods & Research Introduction Jérôme Waldispühl School of Computer Science McGill University General informations (1) Office hours: by appointment Office: TR3018

More information

Biomolecules: lecture 10

Biomolecules: lecture 10 Biomolecules: lecture 10 - understanding in detail how protein 3D structures form - realize that protein molecules are not static wire models but instead dynamic, where in principle every atom moves (yet

More information

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron. Protein Dynamics The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron. Below is myoglobin hydrated with 350 water molecules. Only a small

More information

Analysis of the simulation

Analysis of the simulation Analysis of the simulation Marcus Elstner and Tomáš Kubař January 7, 2014 Thermodynamic properties time averages of thermodynamic quantites correspond to ensemble averages (ergodic theorem) some quantities

More information

Molecular Mechanics. Yohann Moreau. November 26, 2015

Molecular Mechanics. Yohann Moreau. November 26, 2015 Molecular Mechanics Yohann Moreau yohann.moreau@ujf-grenoble.fr November 26, 2015 Yohann Moreau (UJF) Molecular Mechanics, Label RFCT 2015 November 26, 2015 1 / 29 Introduction A so-called Force-Field

More information

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING:

DISCRETE TUTORIAL. Agustí Emperador. Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING: DISCRETE TUTORIAL Agustí Emperador Institute for Research in Biomedicine, Barcelona APPLICATION OF DISCRETE TO FLEXIBLE PROTEIN-PROTEIN DOCKING: STRUCTURAL REFINEMENT OF DOCKING CONFORMATIONS Emperador

More information

Molecular Mechanics, Dynamics & Docking

Molecular Mechanics, Dynamics & Docking Molecular Mechanics, Dynamics & Docking Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine Larry.Hunter@uchsc.edu http://compbio.uchsc.edu/hunter

More information

Molecular Interactions F14NMI. Lecture 4: worked answers to practice questions

Molecular Interactions F14NMI. Lecture 4: worked answers to practice questions Molecular Interactions F14NMI Lecture 4: worked answers to practice questions http://comp.chem.nottingham.ac.uk/teaching/f14nmi jonathan.hirst@nottingham.ac.uk (1) (a) Describe the Monte Carlo algorithm

More information

Research Paper 1. Correspondence: Devarajan Thirumalai

Research Paper 1. Correspondence: Devarajan Thirumalai Research Paper Protein folding kinetics: timescales, pathways and energy landscapes in terms of sequence-dependent properties Thomas Veitshans, Dmitri Klimov and Devarajan Thirumalai Background: Recent

More information

arxiv:cond-mat/ v1 2 Feb 94

arxiv:cond-mat/ v1 2 Feb 94 cond-mat/9402010 Properties and Origins of Protein Secondary Structure Nicholas D. Socci (1), William S. Bialek (2), and José Nelson Onuchic (1) (1) Department of Physics, University of California at San

More information

Motif Prediction in Amino Acid Interaction Networks

Motif Prediction in Amino Acid Interaction Networks Motif Prediction in Amino Acid Interaction Networks Omar GACI and Stefan BALEV Abstract In this paper we represent a protein as a graph where the vertices are amino acids and the edges are interactions

More information

Template Free Protein Structure Modeling Jianlin Cheng, PhD

Template Free Protein Structure Modeling Jianlin Cheng, PhD Template Free Protein Structure Modeling Jianlin Cheng, PhD Associate Professor Computer Science Department Informatics Institute University of Missouri, Columbia 2013 Protein Energy Landscape & Free Sampling

More information

A Minimal Model for the Hydrophobic and Hydrogen Bonding Effects on Secondary and Tertiary Structure Formation in Proteins

A Minimal Model for the Hydrophobic and Hydrogen Bonding Effects on Secondary and Tertiary Structure Formation in Proteins A Minimal Model for the Hydrophobic and Hydrogen Bonding Effects on Secondary and Tertiary Structure Formation in Proteins by Kyle Denison A thesis presented to the University of Waterloo in fulfillment

More information

Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11

Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11 Distance Constraint Model; Donald J. Jacobs, University of North Carolina at Charlotte Page 1 of 11 Taking the advice of Lord Kelvin, the Father of Thermodynamics, I describe the protein molecule and other

More information

Physiochemical Properties of Residues

Physiochemical Properties of Residues Physiochemical Properties of Residues Various Sources C N Cα R Slide 1 Conformational Propensities Conformational Propensity is the frequency in which a residue adopts a given conformation (in a polypeptide)

More information

Novel Monte Carlo Methods for Protein Structure Modeling. Jinfeng Zhang Department of Statistics Harvard University

Novel Monte Carlo Methods for Protein Structure Modeling. Jinfeng Zhang Department of Statistics Harvard University Novel Monte Carlo Methods for Protein Structure Modeling Jinfeng Zhang Department of Statistics Harvard University Introduction Machines of life Proteins play crucial roles in virtually all biological

More information

Computational Biology & Computational Medicine

Computational Biology & Computational Medicine Computational Biology & Computational Medicine Homayoun Valafar Outline Why proteins? What are proteins? How do we compute them? How do we use computational approaches? Why Proteins? Molecular basis of

More information

Dana Alsulaibi. Jaleel G.Sweis. Mamoon Ahram

Dana Alsulaibi. Jaleel G.Sweis. Mamoon Ahram 15 Dana Alsulaibi Jaleel G.Sweis Mamoon Ahram Revision of last lectures: Proteins have four levels of structures. Primary,secondary, tertiary and quaternary. Primary structure is the order of amino acids

More information

Kd = koff/kon = [R][L]/[RL]

Kd = koff/kon = [R][L]/[RL] Taller de docking y cribado virtual: Uso de herramientas computacionales en el diseño de fármacos Docking program GLIDE El programa de docking GLIDE Sonsoles Martín-Santamaría Shrödinger is a scientific

More information

Importance of chirality and reduced flexibility of protein side chains: A study with square and tetrahedral lattice models

Importance of chirality and reduced flexibility of protein side chains: A study with square and tetrahedral lattice models JOURNAL OF CHEMICAL PHYSICS VOLUME 121, NUMBER 1 1 JULY 2004 Importance of chirality and reduced flexibility of protein side chains: A study with square and tetrahedral lattice models Jinfeng Zhang Department

More information

Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur. Lecture - 06 Protein Structure IV

Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur. Lecture - 06 Protein Structure IV Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur Lecture - 06 Protein Structure IV We complete our discussion on Protein Structures today. And just to recap

More information

Equilibrium Molecular Thermodynamics from Kirkwood Sampling

Equilibrium Molecular Thermodynamics from Kirkwood Sampling This is an open access article published under a Creative Commons Attribution (CC-BY) License, which permits unrestricted use, distribution and reproduction in any medium, provided the author and source

More information

Energetics and Thermodynamics

Energetics and Thermodynamics DNA/Protein structure function analysis and prediction Protein Folding and energetics: Introduction to folding Folding and flexibility (Ch. 6) Energetics and Thermodynamics 1 Active protein conformation

More information

Bioengineering 215. An Introduction to Molecular Dynamics for Biomolecules

Bioengineering 215. An Introduction to Molecular Dynamics for Biomolecules Bioengineering 215 An Introduction to Molecular Dynamics for Biomolecules David Parker May 18, 2007 ntroduction A principal tool to study biological molecules is molecular dynamics simulations (MD). MD

More information

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell Mathematics and Biochemistry University of Wisconsin - Madison 0 There Are Many Kinds Of Proteins The word protein comes

More information

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION AND CALIBRATION Calculation of turn and beta intrinsic propensities. A statistical analysis of a protein structure

More information

Algorithm for protein folding problem in 3D lattice HP model

Algorithm for protein folding problem in 3D lattice HP model Algorithm for protein folding problem in 3D lattice HP model METODI TRAYKOV Department of Electrical Engineering, Electronics and Automatics University Center for Advanced Bioinformatics Research South-West

More information

Equilibrium Molecular Thermodynamics from Kirkwood Sampling

Equilibrium Molecular Thermodynamics from Kirkwood Sampling This is an open access article published under a Creative Commons Attribution (CC-BY) License, which permits unrestricted use, distribution and reproduction in any medium, provided the author and source

More information

Why Proteins Fold. How Proteins Fold? e - ΔG/kT. Protein Folding, Nonbonding Forces, and Free Energy

Why Proteins Fold. How Proteins Fold? e - ΔG/kT. Protein Folding, Nonbonding Forces, and Free Energy Why Proteins Fold Proteins are the action superheroes of the body. As enzymes, they make reactions go a million times faster. As versatile transport vehicles, they carry oxygen and antibodies to fight

More information

Protein Structure Basics

Protein Structure Basics Protein Structure Basics Presented by Alison Fraser, Christine Lee, Pradhuman Jhala, Corban Rivera Importance of Proteins Muscle structure depends on protein-protein interactions Transport across membranes

More information

Multicanonical jump walk annealing: An efficient method for geometric optimization

Multicanonical jump walk annealing: An efficient method for geometric optimization JOURNAL OF CHEMICAL PHYSICS VOLUME 112, NUMBER 6 8 FEBRUARY 2000 Multicanonical jump wal annealing: An efficient method for geometric optimization Huafeng Xu and B. J. Berne a) Department of Chemistry

More information

The Dispersion Metric and the CMA Evolution Strategy

The Dispersion Metric and the CMA Evolution Strategy The Dispersion Metric and the CMA Evolution Strategy Monte Lunacek Department of Computer Science Colorado State University Fort Collins, CO 80523 lunacek@cs.colostate.edu Darrell Whitley Department of

More information

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison CMPS 6630: Introduction to Computational Biology and Bioinformatics Structure Comparison Protein Structure Comparison Motivation Understand sequence and structure variability Understand Domain architecture

More information

Presenter: She Zhang

Presenter: She Zhang Presenter: She Zhang Introduction Dr. David Baker Introduction Why design proteins de novo? It is not clear how non-covalent interactions favor one specific native structure over many other non-native

More information

Essential dynamics sampling of proteins. Tuorial 6 Neva Bešker

Essential dynamics sampling of proteins. Tuorial 6 Neva Bešker Essential dynamics sampling of proteins Tuorial 6 Neva Bešker Relevant time scale Why we need enhanced sampling? Interconvertion between basins is infrequent at the roomtemperature: kinetics and thermodynamics

More information

Frontiers in Physics 27-29, Sept Self Avoiding Growth Walks and Protein Folding

Frontiers in Physics 27-29, Sept Self Avoiding Growth Walks and Protein Folding Frontiers in Physics 27-29, Sept. 2012 Self Avoiding Growth Walks and Protein Folding K P N Murthy, K Manasa, and K V K Srinath School of Physics, School of Life Sciences University of Hyderabad September

More information

arxiv:cond-mat/ v1 [cond-mat.soft] 19 Mar 2001

arxiv:cond-mat/ v1 [cond-mat.soft] 19 Mar 2001 Modeling two-state cooperativity in protein folding Ke Fan, Jun Wang, and Wei Wang arxiv:cond-mat/0103385v1 [cond-mat.soft] 19 Mar 2001 National Laboratory of Solid State Microstructure and Department

More information

arxiv:cond-mat/ v1 [cond-mat.stat-mech] 8 Nov 1996

arxiv:cond-mat/ v1 [cond-mat.stat-mech] 8 Nov 1996 Protein Folding Kinetics: Time Scales, Pathways, and Energy arxiv:cond-mat/9611065v1 [cond-mat.stat-mech] 8 Nov 1996 Landscapes in Terms of Sequence Dependent Properties T. Veitshans 1, D. K. Klimov 2,

More information

Lecture 11: Potential Energy Functions

Lecture 11: Potential Energy Functions Lecture 11: Potential Energy Functions Dr. Ronald M. Levy ronlevy@temple.edu Originally contributed by Lauren Wickstrom (2011) Microscopic/Macroscopic Connection The connection between microscopic interactions

More information

Pressure Dependent Study of the Solid-Solid Phase Change in 38-Atom Lennard-Jones Cluster

Pressure Dependent Study of the Solid-Solid Phase Change in 38-Atom Lennard-Jones Cluster University of Rhode Island DigitalCommons@URI Chemistry Faculty Publications Chemistry 2005 Pressure Dependent Study of the Solid-Solid Phase Change in 38-Atom Lennard-Jones Cluster Dubravko Sabo University

More information

CAP 5510 Lecture 3 Protein Structures

CAP 5510 Lecture 3 Protein Structures CAP 5510 Lecture 3 Protein Structures Su-Shing Chen Bioinformatics CISE 8/19/2005 Su-Shing Chen, CISE 1 Protein Conformation 8/19/2005 Su-Shing Chen, CISE 2 Protein Conformational Structures Hydrophobicity

More information

Tools for QM studies of large systems

Tools for QM studies of large systems Tools for QM studies of large systems Automated, hessian-free saddle point search & characterization QM/MM implementation for zeolites Shaama Mallikarjun Sharada Advisors: Prof. Alexis T Bell, Prof. Martin

More information