Protein Structure Prediction, Engineering & Design CHEM 430

Similar documents
Template Free Protein Structure Modeling Jianlin Cheng, PhD

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

CMPS 3110: Bioinformatics. Tertiary Structure Prediction

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction

Template Free Protein Structure Modeling Jianlin Cheng, PhD

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Supporting Online Material for

Protein Structure Prediction

Protein Structure Determination

Molecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche

Introduction to Comparative Protein Modeling. Chapter 4 Part I

FlexPepDock In a nutshell

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Building 3D models of proteins

CS612 - Algorithms in Bioinformatics

Contact map guided ab initio structure prediction

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues

Protein Structure Prediction

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748

Homology Modeling. Roberto Lins EPFL - summer semester 2005

114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009

proteins Ab initio protein structure assembly using continuous structure fragments and optimized knowledge-based force field

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB

Week 10: Homology Modelling (II) - HHpred

Ab-initio protein structure prediction

Sequence analysis and comparison

09/06/25. Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Non-uniform distribution of folds. Scheme of protein structure predicition

Orientational degeneracy in the presence of one alignment tensor.

Prediction and refinement of NMR structures from sparse experimental data

Bioinformatics. Macromolecular structure

Protein Structure Prediction

ALL LECTURES IN SB Introduction

Can protein model accuracy be. identified? NO! CBS, BioCentrum, Morten Nielsen, DTU

Steps in protein modelling. Structure prediction, fold recognition and homology modelling. Basic principles of protein structure

Chemical Shift Restraints Tools and Methods. Andrea Cavalli

Template-Based Modeling of Protein Structure

Protein Structure Determination from Pseudocontact Shifts Using ROSETTA

Neural Networks for Protein Structure Prediction Brown, JMB CS 466 Saurabh Sinha

Conformational Sampling in Template-Free Protein Loop Structure Modeling: An Overview

Protein Threading. BMI/CS 776 Colin Dewey Spring 2015

Protein Structures. 11/19/2002 Lecture 24 1

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment

Lecture 8: Protein structure analysis

Assignment 2 Atomic-Level Molecular Modeling

Modeling for 3D structure prediction

Bioinformatics: Secondary Structure Prediction

Protein Structure Analysis with Sequential Monte Carlo Method. Jinfeng Zhang Computational Biology Lab Department of Statistics Harvard University

Protein Structures: Experiments and Modeling. Patrice Koehl

Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

HOMOLOGY MODELING. The sequence alignment and template structure are then used to produce a structural model of the target.

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Tools for Cryo-EM Map Fitting. Paul Emsley MRC Laboratory of Molecular Biology

Bio nformatics. Lecture 23. Saad Mneimneh

CAP 5510 Lecture 3 Protein Structures

Molecular Modeling Lecture 7. Homology modeling insertions/deletions manual realignment

Protein folding. α-helix. Lecture 21. An α-helix is a simple helix having on average 10 residues (3 turns of the helix)

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University

Protein Secondary Structure Prediction

Improving De novo Protein Structure Prediction using Contact Maps Information

3DRobot: automated generation of diverse and well-packed protein structure decoys

07/30/2012 Rose=a Conference. Principle for designing! ideal protein structures! Nobuyasu & Rie Koga University of Washington

Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program)

Protein Structure Prediction and Protein-Ligand Docking

Structural Bioinformatics

Improving the Physical Realism and Structural Accuracy of Protein Models by a Two-Step Atomic-Level Energy Minimization

Presenter: She Zhang

proteins The dual role of fragments in fragmentassembly methods for de novo protein structure prediction

Protein Structure Prediction

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

Structural Bioinformatics (C3210) Molecular Docking

Genomics and bioinformatics summary. Finding genes -- computer searches

SUPPLEMENTARY MATERIALS

proteins Prediction Methods and Reports

CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Giri Narasimhan

Computational Molecular Biology. Protein Structure and Homology Modeling

Why Do Protein Structures Recur?

Protein Structure Prediction 11/11/05

AbInitioProteinStructurePredictionviaaCombinationof Threading,LatticeFolding,Clustering,andStructure Refinement

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison

Solvent accessible surface area approximations for rapid and accurate protein structure prediction

Homology Modeling I. Growth of the Protein Data Bank PDB. Basel, September 30, EMBnet course: Introduction to Protein Structure Bioinformatics

Protein Modeling. Generating, Evaluating and Refining Protein Homology Models

The protein folding problem consists of two parts:

From Amino Acids to Proteins - in 4 Easy Steps

Basics of protein structure

08/21/2017 BLAST. Multiple Sequence Alignments: Clustal Omega

7.91 Amy Keating. Solving structures using X-ray crystallography & NMR spectroscopy

AB initio protein structure prediction or template-free

Protein quality assessment

Statistical Machine Learning Methods for Bioinformatics IV. Neural Network & Deep Learning Applications in Bioinformatics

) P = 1 if exp # " s. + 0 otherwise

Protein folding. Today s Outline

Polypeptide Folding Using Monte Carlo Sampling, Concerted Rotation, and Continuum Solvation

Sequence Bioinformatics. Multiple Sequence Alignment Waqas Nasir

BIOINFORMATICS TOOLS & ANALYSIS OF PROTEIN STRUCTURE AND FUNCTION FEI JI. (Under the Direction of Ying Xu) ABSTRACT

3D Structure. Prediction & Assessment Pt. 2. David Wishart 3-41 Athabasca Hall

PDBe TUTORIAL. PDBePISA (Protein Interfaces, Surfaces and Assemblies)

A Method for the Improvement of Threading-Based Protein Models

Transcription:

Protein Structure Prediction, Engineering & Design CHEM 430

Eero Saarinen

The free energy surface of a protein

Protein Structure Prediction & Design Full Protein Structure from Sequence - High Alignment (comparative modeling) - Low Alignment (fold recognition) - No Alignment (ab Initio) Protein Engineering: Scaffold Design Protein Engineering: Function Design

Comparative modeling AVGIFRAAVCTRGVAKAVDFVP AVGIFRAAVCTRGVAKAVDFVP AIGIWRSATCTKGVAKA--FVA +

Comparative Homology Modeling Proteins evolve gradually 3D structures and functions are often strongly conserved during this process. Strong sequence similarity often indicates strong structure similarity Finding family members or similar sequences AGCTTCAG...CTGA... TGCTACAG... Data query sequence Target database Scoring Matrix Global -or- Local Alignment

From Alignment to Structure Copy aligned backbone from template Retain conserved side chains Predict new side chains and loops No current comparative modeling method can recover from an incorrect alignment Use multiple sequence alignments as initial guide. Once a suitable template is found, it is a good idea to do a literature search (PubMed) on the relevant fold to determine what biological role(s) it plays Hydrophobic residues exposed/buried polar without charges balance Very large RMSD among the templates Different models give very different answers

Fold Recognition Alignment Method (also called "Threading") Applied when we have < 20% alignment Used with a limited # of fold types Score the folds Pair-wise potential function Fold alignment using structural scoring matrix

Figure 4. Comparison of the negative logarithms of equation (5) and the residue pair speciæc second term in equation (8) for sequence separations greater than ten. Residues with greater than 16 neighbors were considered buried. Continuous lines, equation (5); dotted lines, equation (8) both residues buried; broken line, equation (8) both residues exposed.

Rosetta Method for ab initio Modeling Creative ways to memorize sequence: structure correlations in short segments from the PDB, and use these to model new structures. ROSETTA Method. 1. Break target into fragments of 9 (25) and 3 (200) amino acids Using MSA profile and predicted secondary structure 2. (from totally extent) Insert 9 mers from database. Metropolis Monte Carlo for 2000 steps (steric clashes criteria) 3. 2000 more steps with residue-residue scores : hydrophobic burial and specific pair interactions and secondary structure packing scores. 4. 10 iterations 2000 steps during which the local strandpairing score is cycled on and off to promote formation of nonlocal -strand pairing over local strand kinetic traps, whereas the local atom density is pushed toward that of native protein structures 5. 4000 3-mer fragment insertions; a term linear in the radius of gyration is added to help condense the model and a higher resolution model of strand pairing is used The final decoy is stored only if it passes several filters. Between 10,000 and 400,000 independent simulations starting from different random number seeds

Fragment assembly known structures fragment library protein sequence predicted structure

Rosetta/Robetta Decoys are assembled from fragments Lowest energy model from a set of generated decoys is selected as the prediction Monte Carlo simulated annealing Physical energy function with elements of a statistical potential Fragment library http://robetta.bakerlab.org/