MSc in Bioinformatics for Health Sciences SBI. Structural Bioinformatics

Similar documents
1. Presentation of the course

STRUCTURAL BIOINFORMATICS I. Fall 2015

STRUCTURAL BIOINFORMATICS II. Spring 2018

Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program)

Chemistry 565 / 665 BIOPHYSICAL CHEMISTRY. - Spring

Introduction to Computational Structural Biology

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748

Bioinformatics. Macromolecular structure

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University

CAP 5510 Lecture 3 Protein Structures

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche

Syllabus BINF Computational Biology Core Course

ALL LECTURES IN SB Introduction

MOLECULAR MODELING IN BIOLOGY (BIO 3356) SYLLABUS

Motif Prediction in Amino Acid Interaction Networks

D Dobbs ISU - BCB 444/544X 1

Molecular Modelling. part of Bioinformatik von RNA- und Proteinstrukturen. Sonja Prohaska. Leipzig, SS Computational EvoDevo University Leipzig

Molecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007

Introduction to" Protein Structure

CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Giri Narasimhan

Statistical Machine Learning Methods for Bioinformatics IV. Neural Network & Deep Learning Applications in Bioinformatics

Advanced Certificate in Principles in Protein Structure. You will be given a start time with your exam instructions

Introduction to Comparative Protein Modeling. Chapter 4 Part I

BCH 4053 Spring 2003 Chapter 6 Lecture Notes

Advanced Structural Analysis. Code: ECTS Credits: 6. Degree Type Year Semester

Sequence and Structure Alignment Z. Luthey-Schulten, UIUC Pittsburgh, 2006 VMD 1.8.5

Biochemistry 530: Introduction to Structural Biology. Autumn Quarter 2014 BIOC 530

Computational Molecular Biology. Protein Structure and Homology Modeling

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction

Getting To Know Your Protein

Protein structure alignments

Course plan Academic Year Qualification MSc on Bioinformatics for Health Sciences. Subject name: Computational Systems Biology Code: 30180

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

CHEM 565 and CHEM/BIOCHEM 665 BIOPHYSICAL CHEMISTRY University of Wisconsin-Madison - Spring

CMPS 3110: Bioinformatics. Tertiary Structure Prediction

Chemistry. Faculty Kent Davis, chair; Pablo Hilario, Denise Lee-Haye, Marie Pak, Robert Wilson Departmental Office: 355 Chan Shun Hall; (707)

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

Lecture 1: Introduction. Keywords: Mathematics as a language, Need of learning mathematics, Applications of mathematics in BIology

Protein Structure Prediction Using Multiple Artificial Neural Network Classifier *

Analysis and Prediction of Protein Structure (I)

Basics of protein structure

Procheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics.

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB

Week 10: Homology Modelling (II) - HHpred

Protein Bioinformatics. Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet sandberg.cmb.ki.

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

Principles of Physical Biochemistry

BBS501 Section 1 9:00 am 10:00 am Monday thru Friday LRC 105 A & B

BSc and MSc Degree Examinations

Contents. xiii. Preface v

BIRKBECK COLLEGE (University of London)

Protein Structure. Hierarchy of Protein Structure. Tertiary structure. independently stable structural unit. includes disulfide bonds

SCOP. all-β class. all-α class, 3 different folds. T4 endonuclease V. 4-helical cytokines. Globin-like

BIOC 530 Fall, 2011 BIOC 530

Today. Last time. Secondary structure Transmembrane proteins. Domains Hidden Markov Models. Structure prediction. Secondary structure

Title (of the subject): QUÍMICA BIOORGÁNICA Code: Degree/Master: GRADO DE BIOQUÍMICA Name of the module to which it belongs:

Protein Structure: Data Bases and Classification Ingo Ruczinski

Protein Structure Determination

Lecture 2-3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Homology modeling. Dinesh Gupta ICGEB, New Delhi 1/27/2010 5:59 PM

Intro Secondary structure Transmembrane proteins Function End. Last time. Domains Hidden Markov Models

Protein Structure Analysis and Verification. Course S Basics for Biosystems of the Cell exercise work. Maija Nevala, BIO, 67485U 16.1.

Neural Networks for Protein Structure Prediction Brown, JMB CS 466 Saurabh Sinha

Protein Structure. Role of (bio)informatics in drug discovery. Bioinformatics

Structure to Function. Molecular Bioinformatics, X3, 2006

CHEMISTRY (CHEM) CHEM 208. Introduction to Chemical Analysis II - SL

BIOCHEMISTRY Course Outline (Fall, 2011)

From Amino Acids to Proteins - in 4 Easy Steps

Supporting Online Material for

Bioinformatics. Dept. of Computational Biology & Bioinformatics

Protein Structure Prediction

Protein Structure Prediction, Engineering & Design CHEM 430

A profile-based protein sequence alignment algorithm for a domain clustering database

Number sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence

Lecture 10 (10/4/17) Lecture 10 (10/4/17)

Computational Biology: Basics & Interesting Problems

Computational Biology 1

S H/T 0 ph = log([h + ]) E = mc 2 S = klnw G = H T S ph = pk a + log([a ]/[HA]) K a = [H + ][A ]/[HA] G = RTlnK eq e iπ + 1 = 0

STRUCTURAL BIOINFORMATICS. Barry Grant University of Michigan

REQUIREMENTS FOR THE BIOCHEMISTRY MAJOR

BIOCHEMISTRY Unit 2 Part 4 ACTIVITY #6 (Chapter 5) PROTEINS

CSCI EETF Assessment Year End Report, June, 2015

Introduction to Structural Bioinformatics

Protein folding. Today s Outline

Protein Structures: Experiments and Modeling. Patrice Koehl

Biochemistry,530:,, Introduc5on,to,Structural,Biology, Autumn,Quarter,2015,

Algorithms in Computational Biology (236522) spring 2008 Lecture #1

Computational Analysis of Small Proteins using the WebAmber Interface. By Eddie Gorr Gustavus Adolphus College

BME Engineering Molecular Cell Biology. Structure and Dynamics of Cellular Molecules. Basics of Cell Biology Literature Reading

Building 3D models of proteins

BIBC 100. Structural Biochemistry

Physiochemical Properties of Residues

Biochemistry for Complementary Therapists

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison

MCB65. Physical Biochemistry: Understanding Macromolecular Machines. Rachelle Gaudet Martin Samuels MCB 65 1/25/16 1

BCMP 201 Protein biochemistry

Supersecondary Structures (structural motifs)

Amino Acid Structures from Klug & Cummings. 10/7/2003 CAP/CGS 5991: Lecture 7 1

HIV protease inhibitor. Certain level of function can be found without structure. But a structure is a key to understand the detailed mechanism.

Transcription:

MSc in Bioinformatics for Health Sciences SBI. Structural Bioinformatics Syllabus Information Academic Course: 2018/19 Academic Center: 804 - Official Postgraduate Programme in Biomedicine Study: 8045 Bioinformatics for Health Sciences - MSc Subject: 30169 SBI. Structural Bioinformatics Credits: 5.0 Course: 1st Teaching languages: English Teachers: Baldomero Oliva Teaching Period: 2 nd term Presentation The main goal of the course is to learn the basic concepts on the structure of macromolecules, more specifically of proteins. Also the principles of crystallography are given in order to gain a deeper understanding on the 3-dimensional data. The main features relating sequence and protein fold will be used on the prediction of secondary and tertiary structure of proteins and on its evaluation. Finally, the relationship between sequence/structure and function of proteins will be analyzed. Associated skills Instrumental: 1. Competence to unveil the relationship between the three-dimensional structure of bio-molecules and their biological activity 2. Proficiency on the main programs that handle macromolecular data of sequence and three-dimensional coordinates Interpersonal: 1. Capacity to be integrated on a working multi-cultural team and split in milestones the analysis of biomolecules Systemic: 1. Leading capacity on a working team and motivation to finish on time the work 2. Creativity and motivation to increase the quality.

Learning outcomes Contents 1: Principles of Protein Structure Concepts of protein structure: primary, secondary and tertiary Visualization of protein structures in the computer Using DSSP to calculate secondary structures Organization of data Classification of folds: all alpha, all beta, alpha/beta and alpha+beta Identification of energetic terms involved in fold and folding pathway Visual abstraction Contents 2: Sequence comparison Criticism to modify computer-based alignments Differences between local and global alignments Use of the package HMMER and the PFAM database Methods of alignment: Needelman-Wunsch and Hidden Markov Models Use of BLAST and PSI- BLAST Criticism to modify computer-based alignments Definition of sequence domains by PFAM Multiple alignment of sequences Use of ClustalW

Contents 3: Structure comparison Structure superposition: pair-wise and multiple structures Calculation of backbone RMSD (root mean square deviation) Pair-wise and multiple structural alignment with STAMP Visual abstraction on three-dimensional space Structure is more conserved than sequence Contents 4: Comparative Modeling Distance restraints based on evolution Application of energies to optimize protein structure Use of MODELLER Intuition to superpose structures and identify sequence conservation Protein flexibility on loops Contents 5: Threading and fold recognition The Boltzman Law and relationship between energy and probability Pseudo-energies and statistical potentials Use of PROSA, THREADER, LOOPP, FUGUE, PHYRE and ModLink+ Criticism to accept the solutions from fold recognition programs Correlated mutations, mutual information and distance correlations theory Rossetta approach

Contents 5: Refinement and evaluation Detection of modeling errors combining PROSA and PSIPRED: missalignments, wrong target, secondary-structure optimization. Use of secondary structure prediction programs: PSIPRED Helix capping Structural classification of loops Use of the structural classification of capping: Arch-caps Use of the structural classification of loops: ArchDB Criticism to modify sequence alignments and to refine a model Relationship between secondary structure conservation and model refinement Loop prediction: ArchPred and ModLoop Prerequisites 1) Knowledge in Bioinformatics: alignment algorithms (Needleman-Wunsch and Smith- Watermann) and substitution matrices (PAM and BLOSUM); Hidden Markov Models and Neural Networks; Linux and windows operative systems; Perl programming. 2) Knowledge in Mathematics: Derivatives and Integrals in n-dimensions; Taylor polynomial; Linear algebra and trigonometry; main concepts of statistics and probabilities: combinatorial analysis, permutations, Bayes theorem, Gaussian and Poisson distributions; numerical analysis, optimization and fitting of functions. 3) Knowledge in Physics: Newton classical mechanics; Thermodynamics, Gibbs energy, entropy, enthalpy and basic statistical mechanics. 4) Knowledge in Chemistry: Chemical structure, conformation, configuration and stereoisomers; Reactivity, transition state, electrophilic and nucleophilic attacks; Electronic density, molecular orbitals, quantum numbers, Schrödinger function and wave function; Steady state and equilibrium. 5) Knowledge in Biochemistry: 20 aminoacids, peptide bond, hydrogen bond and hydrophobic/polar properties of residues; Michaelis-Menten equation and enzyme kinetics and reactivity. 6) Knowledge in Molecular Biology: exonic structure and splicing; DNA duplication, transcription and translation; transcription factors; site directed mutagenesis; signalling pathways.

Contents Theory Lectures Agenda PART I: INTRODUCTION Lesson 1. Introduction to the course. Design of the course. Distribution practices and requirements. Description of the work of structural biology. Evaluation and degrees. PART II: PRINCIPLES OF STRUCTURAL BIOLOGY Lesson 2. Proteins: polypeptide chain, secondary structure, tertiary and quaternary. Sequence and coding information. Conserved patterns by blocs. Sequence homology. Entropy and solvation. Energy principles: concepts of force and work. Entropic effect and solvent environment of clatrates. Thermodynamic principles. General principles of globular protein folding: hydrophobic core and secondary structure elements. Definition of secondary structure. The phi-psi space. The alpha helix and beta sheet. Supersecondary structure and connections of secondary structures (loops). Packing of alpha-helices. All alpha domains. Domains α / β: the Rossmann fold and TIM barrel. α + β domains. All beta domains: super barrel and β-barrel meanders, the sandwich greek-key and jelly roll. PART III: CONFORMATIONAL SPACE Lesson 3. Conformational space and molecular dynamics Molecular force fields. Using the potential energy function to find stable structures of three-dimensional models of proteins through minimization processes. The phase space, the conformational space and the partition function. Entropy (S), enthalpy (H) and free energy (G). Flexible systems and use of molecular dynamics to explore flexibility. Understand and apply computational methods such as the Mean-Force Potential and the free energy perturbation theory. Finite difference algorithms (Verlet). Simulated annealing. Solvent effect in simulations of proteins. Boundary conditions and treatment of the electrostatic field potential. PART IV: STRUCTURE DETERMINATION Lesson 4. COMPARATIVE MODELING The classification of proteins and evolutionary relationships. Definition of homology. Methods to superpose tertiary structures. Characterization of active sites and functional domains. Hidden Markov Models. PFAM and SMART databases. Alignment of sequences. Selection of the template. Detection of problems in the alignment. Variable regions and conserved regions. Classification of loops. Model building of the scaffold of a protein. Optimization of loops and side chains. Lesson 5. Fold prediction The theorem of inverse folding. Statistical potentials. Neural networks and prediction of secondary structure (Threader and PSIPRED). Prediction methods of folding and threading. Inference of function (PHYRE and Modlink+). Alignment of secondary structure (TOPITS). Ab initio and mini-threading (Rosetta). Lesson 6. Structures in Systems Biology Partition of protein domains. Interactions between chains and between domains. Predicting physical interactions based on domains. Transitive and permanent complexes. Other predictions of relationships between genes and proteins. Communication systems and signalling networks (phosphorylation). Study of interaction networks: Interactome.

Teaching methods Week Time Extra time Type Subject Program, Database 1 2h 1h Theory Protein Structure 1 2h 2h Practice Protein Structure SCOP, DSSP, RASMOL 2 2h 1h Theory Homology Modelling 2 2h 2h Practice Sequence Alignment PSI-BLAST 2 2h 2h Practice Sequence Alignment HMMER, PFAM 3 2h 2h Theory Threading 3 2h 3h Practice Structure Superposition STAMP 4 2h 4h Practice Model Building MODELLER 4 2h 4h Practice Model evaluation PROSA 4 2h 4h Practice Threading Threader, 3DPSSM, Fugue, LooPP 5 2h 2h Practice Model refinement MODELLER 5 2h 2h Practice Modelling examples MODELLER, PROSA 6 2h 2h Theory Protein interactions 6 2h 3h Practice Docking PatchDock, FTDock, ZDOCK, HEX 7 2h 2h Practice Model proteinprotein interactions MODELLER 7 2h 2h Practice Integrated MODELLER, ZDOCK 8 2h 2h Practice examples Integrated examples evaluation 9-12 48h Project Programming Phyton MODELLER, DOPE, ZRANK, TM_Score Evaluation The evaluation is performed in three parts: theoretical exam (3/10), practical exam (3/10) and a work project (usually programmed in python) (4/10). The work project implies groups of 2 students to generate a program of structural analysis. The aim of the program is to prove the relationships of structure, sequence and function, unveil the conservation of the most important residues and rationalize its structural role on the protein activity. The program can be standalone or in a web service and it is evaluated in association with the python course. It is mandatory that students not enrolled in the Introduction to python course form the group with one of the enrolled students (i.e. we ensure that the programs of all groups are also evaluated according to the lecturer of python). The mark of the project will consider the following criteria (weigths in parenthesis):

Quality of the program: assessed by the lecturer of python (3) Executability: 0/1 if the program works (1) Installation: Easy to install (1) Quality of the tutorial: Explanation of the installation and execution with some examples (2) Theory : Theoretical background and methods developed (4) Applicability: 0/1 either the program can be applied to other problems or not (i.e. a database would always have 0) (1) Quality of the analysis: Explanations on examples and furtjher analysis of the quality of the results produced with the program (3) The practical exam is aimed to prove the ability of the student to solve problems on molecular modeling and to characterize the protein structure The theoretical exam is aimed to validate the basic knowledge of the student on protein folds. Bibliography and Information Resources Textbooks. BRAND, Carl; TOOZE, John. Introduction to Protein Structure. 2a. ed. Garland Publishing, 1999. Cantores; Schimmel. Biophysical Chemistry. WH Freeman & Co., 1980. DAUNE, M. Molecular Biophysics. Oxford: University Press, 1999. DRENTH. Principles of Protein X-ray Crystallography. New York: Springer- Verlag Inc., 1998. Leach, A. Molecular Modelling: Principles and Applications. 2a. ed. Harlow: Pearson Education, 2001. Finkelstein A, Ptitsyn O. Protein physics: a course of lectures. 2nd ed Academic Press. 2016. PHILIP E. BOURNE Structural Bioinformatics / edited by Philip E. Bourne, Helge Weissig Hoboken, N.J. : Wiley-Liss, C2003 TAYLOR, W. R. (Willie R.). Protein Geometry, Classification, symmetry and topology: a computational analysis of structure / William R. Taylor and András Aszódi Bristol: Institute of Physics Pub., Once. 2005 Further reading Ulloa LANGEL ET AL. Introduction to Peptides and Proteins / Langel by Ulloa, Benjamin F. Cravatt, Astrid Graslund, N.G.H. von Heijne, Matjaz Zork (2010) A. PANCHENKO, T. Protein-protein interactions PRZYTYCKA and networks: computing methods for identification, analysis and prediction. London: Springer, 2008 PETER ALAN & Tompa FERSHT Structure and function of intrinsically disordered proteins / by Peter Tompa, Fersht Alan Publications. 2010

K. NAJARIAN Systems biology and Bioinformatics. CRC Press, 2009 Jake Y. CHEN & STEFANO LONARDI Biological data mining. 2009 Sheldon J. PARK Protein engineering and design. CRC Press, 2009 Gentleman, ROBERT, R Programming for Bioinformatics. Boca Raton, Fla. : CRC Press, c2009 Valiente Feruglio, Gabriel. Combinatorial pattern matching algorithms in Computational Biology using Perl and R. Chapman & Hall / CRC, 2009 BURKOWSKI, F. J. Structural Bioinformatics: an algorithmic approach. London: Chapman & Hall / CRC, c2009 Creighton, Thomas E. The biophysical chemistry of nucleic acids & proteins. Helvetian Press, 2010 IRENA ROTERMAN Structure-function relation in proteins / no. Standard 978-81-7895-409-7 b14339729 Teaching resources http://sbi.imim.es/courses.php