Introduction to" Protein Structure

Similar documents
Details of Protein Structure

Basics of protein structure

Outline. Levels of Protein Structure. Primary (1 ) Structure. Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins

Outline. Levels of Protein Structure. Primary (1 ) Structure. Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins

Biomolecules: lecture 10

Supersecondary Structures (structural motifs)

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748

Protein structure (and biomolecular structure more generally) CS/CME/BioE/Biophys/BMI 279 Sept. 28 and Oct. 3, 2017 Ron Dror

Announcements. Primary (1 ) Structure. Lecture 7 & 8: PROTEIN ARCHITECTURE IV: Tertiary and Quaternary Structure

From Amino Acids to Proteins - in 4 Easy Steps

THE UNIVERSITY OF MANITOBA. PAPER NO: 409 LOCATION: Fr. Kennedy Gold Gym PAGE NO: 1 of 6 DEPARTMENT & COURSE NO: CHEM 4630 TIME: 3 HOURS

Analysis and Prediction of Protein Structure (I)

Principles of Physical Biochemistry

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Protein Structure. Hierarchy of Protein Structure. Tertiary structure. independently stable structural unit. includes disulfide bonds

1-D Predictions. Prediction of local features: Secondary structure & surface exposure

CAP 5510 Lecture 3 Protein Structures

Protein Structure Basics

Physiochemical Properties of Residues

D Dobbs ISU - BCB 444/544X 1

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

BSc and MSc Degree Examinations

Summary of Experimental Protein Structure Determination. Key Elements

Biochemistry: Concepts and Connections

Protein Structure & Motifs

Dana Alsulaibi. Jaleel G.Sweis. Mamoon Ahram

F. Piazza Center for Molecular Biophysics and University of Orléans, France. Selected topic in Physical Biology. Lecture 1

Protein Structure. Role of (bio)informatics in drug discovery. Bioinformatics

Biochemistry Quiz Review 1I. 1. Of the 20 standard amino acids, only is not optically active. The reason is that its side chain.

Ch 3: Chemistry of Life. Chemistry Water Macromolecules Enzymes

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

Copyright Mark Brandt, Ph.D A third method, cryogenic electron microscopy has seen increasing use over the past few years.

Model Mélange. Physical Models of Peptides and Proteins

The Structure and Functions of Proteins

CHAPTER 29 HW: AMINO ACIDS + PROTEINS

4 Proteins: Structure, Function, Folding W. H. Freeman and Company

1. (5) Draw a diagram of an isomeric molecule to demonstrate a structural, geometric, and an enantiomer organization.

Proteins are not rigid structures: Protein dynamics, conformational variability, and thermodynamic stability

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

Nanobiotechnology. Place: IOP 1 st Meeting Room Time: 9:30-12:00. Reference: Review Papers. Grade: 40% midterm, 60% final report (oral + written)

Protein Structures. Sequences of amino acid residues 20 different amino acids. Quaternary. Primary. Tertiary. Secondary. 10/8/2002 Lecture 12 1

Contents. xiii. Preface v

Protein Structure Marianne Øksnes Dalheim, PhD candidate Biopolymers, TBT4135, Autumn 2013

The protein folding problem consists of two parts:

Lecture 10 (10/4/17) Lecture 10 (10/4/17)

BBS501 Section 1 9:00 am 10:00 am Monday thru Friday LRC 105 A & B

Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur. Lecture - 06 Protein Structure IV

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Introduction to Computational Structural Biology

NMR in Medicine and Biology

A) at equilibrium B) endergonic C) endothermic D) exergonic E) exothermic.

Part II => PROTEINS and ENZYMES. 2.3 PROTEIN STRUCTURE 2.3a Secondary Structure 2.3b Tertiary Structure 2.3c Quaternary Structure

Protein Structures: Experiments and Modeling. Patrice Koehl

Enzyme Catalysis & Biotechnology

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a

STRUCTURAL BIOINFORMATICS. Barry Grant University of Michigan

AP Biology. Proteins. AP Biology. Proteins. Multipurpose molecules

BCH 4053 Spring 2003 Chapter 6 Lecture Notes

1) NMR is a method of chemical analysis. (Who uses NMR in this way?) 2) NMR is used as a method for medical imaging. (called MRI )

Charged amino acids (side-chains)

Protein Structure and Function. Protein Architecture:

Statistical Machine Learning Methods for Bioinformatics IV. Neural Network & Deep Learning Applications in Bioinformatics

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

PROTEIN STRUCTURE AMINO ACIDS H R. Zwitterion (dipolar ion) CO 2 H. PEPTIDES Formal reactions showing formation of peptide bond by dehydration:

1/23/2012. Atoms. Atoms Atoms - Electron Shells. Chapter 2 Outline. Planetary Models of Elements Chemical Bonds

Protein Structures. 11/19/2002 Lecture 24 1

HOMOLOGY MODELING. The sequence alignment and template structure are then used to produce a structural model of the target.

I690/B680 Structural Bioinformatics Spring Protein Structure Determination by NMR Spectroscopy

Macromolecular X-ray Crystallography

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769

Amino Acid Structures from Klug & Cummings. 10/7/2003 CAP/CGS 5991: Lecture 7 1

BIRKBECK COLLEGE (University of London)

BIBC 100. Structural Biochemistry

Introduction solution NMR

Biological Macromolecules

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche

Protein Structure Bioinformatics Introduction

MULTIPLE CHOICE. Circle the one alternative that best completes the statement or answers the question.

RNA and Protein Structure Prediction

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

Major Types of Association of Proteins with Cell Membranes. From Alberts et al

Protein Structure Determination

HIV protease inhibitor. Certain level of function can be found without structure. But a structure is a key to understand the detailed mechanism.

Basic structures of proteins

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods

Biochemistry - I SPRING Mondays and Wednesdays 9:30-10:45 AM (MR-1307) Lectures 3-4. Based on Profs. Kevin Gardner & Reza Khayat

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University

Lesson Overview The Structure of DNA

Proteins. Division Ave. High School Ms. Foglia AP Biology. Proteins. Proteins. Multipurpose molecules

Lecture 11: Protein Folding & Stability

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding

Properties of amino acids in proteins

Dental Biochemistry EXAM I

Motif Prediction in Amino Acid Interaction Networks

Introduction to Protein Folding

Molecular Modelling. part of Bioinformatik von RNA- und Proteinstrukturen. Sonja Prohaska. Leipzig, SS Computational EvoDevo University Leipzig

The biomolecules of terrestrial life

Secondary and sidechain structures

Transcription:

Introduction to" Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis

Learning Objectives Outline the basic levels of protein structure. Outline key differences between X-ray crystallography and NMR spectroscopy. Identify relevant parameters for evaluating the quality of protein structures determined by X-ray crystallography and NMR spectroscopy.

Outline Protein structure evolution and function Inferring function from structure. Experimental techniques X-ray crystallography NMR spectroscopy Structure validation

Watson, Crick and DNA, 1952

DNA Conclusions "We wish to suggest a structure for the salt of deoxyribose nucleic acid (D.N.A.). This structure has novel features which are of considerable biological interest. It has not escaped our notice that the specific pairing we have postulated immediately suggests a possible copying mechanism for the genetic material. " J.D. Watson & F.H.C. Crick (1953) Nature, 171, 737.

Once Upon a Time Could the search for ultimate truth really have revealed so hideous and visceral-looking an object? Max Perutz, 1964, on protein structure John Kendrew, 1959, with myoglobin model

Why are Protein Structures so Interesting? They provide a detailed picture of interesting biological features, such as active site, substrate specificity, allosteric regulation etc. They aid in rational drug design and protein engineering. They can elucidate evolutionary relationships undetectable by sequence comparisons.

Structure & Evolution In evolution structure is conserved longer than both function and sequence. Structure > Function > Sequence

Structure & Evolution Rhamnogalacturonan acetylesterase (A. aculeatus) (1K7C) Platelet activating factor acetylhydrolase (B. Taurus) (1WAB) Serine esterase (S. scabies) (1ESC)

Structure to Function Inferring biological features from the structure NH 2 1DEO Asp His Ser Topological switchpoint COOH

Structure & Evolution Rhamnogalacturonan acetylesterase Serine esterase Platelet activating factor acetylhydrolase Mølgaard, Kauppinen & Larsen (2000) Structure, 8, 373-383.

Why Fold? Hydrophobic collapse Hydrophobic residues cluster to escape interactions with water. Indirect effect of attraction between water molecules. Polar backbone groups form secondary structure to satisfy hydrogen bonding donors and acceptors. E U ΔG T F Initially formed structure is in molten globule state (ensemble). Molten globule condenses to native fold via transition state

Hydrophobic Effect and Folding Oil and water Clathrate structures Entropy Indirect consequence of attraction between water molecules

Hydrophobic Core Hydrophobic side chains go into the core of the molecule but the main chain is highly polar. The polar groups (C=O and NH) are neutralized through formation of H-bonds. Myoglobin Surface Interior

Hydrophobic vs. Hydrophilic Globular protein (in solution) Membrane protein (in membrane) Myoglobin Aquaporin

Hydrophobic vs. Hydrophilic Globular protein (in solution) Membrane protein (in membrane) Cross-section Myoglobin Cross-section Aquaporin

Characteristics of Helices Aligned peptide units Dipolar moment Ion/ligand binding Secondary and quaternary structure packing Capping residues The α helix (i i +4) Other helix types! (3 10, π) C N

Residue Patterns Helices Helix capping Amphiphilic residue patterns C N Sheets Amphiphilic residue patterns Residue preferences at edges vs. middle Special residues Proline Helix breaker Glycine In turns/loops/bends

β-sheets Multiple strands sheet Parallel vs. antiparallel Twist Thioredoxin Flexibility Vs. helices Folding Structure propagation (amyloids) Other

β-sheets Multiple strands sheet Parallel vs. antiparallel Twist Flexibility Vs. helices Folding Structure propagation (amyloids) Other

β-sheets Multiple strands sheet Parallel vs. antiparallel Twist Flexibility Vs. helices Folding Structure propagation (amyloids) Other

β-sheets Multiple strands sheet Parallel vs. antiparallel Twist Flexibility Vs. helices Folding Structure propagation (amyloids) Other

β-sheets Multiple strands sheet Parallel vs. antiparallel Twist Antiparallel Parallel Strand interactions are non-local Flexibility Vs. helices Folding

Turns, Loops & Bends Between helices and sheets On protein surface Intrinsically unstructured proteins

Structure Levels Primary structure = Sequence Secondary Structure = Helix, sheets/strands, loops & turns Structural Motif = Small, recurrent arrangement of secondary structure, e.g. Helix-loop-helix Beta hairpins EF hand (calcium binding motif) Etc. Tertiary structure = Arrangement of Secondary structure elements MSSVLLGHIKKLEMGHS

Quaternary Structure Assembly of monomers/subunits into protein complex Backbone-backbone, backbone-side-chain & side-chain-side-chain interactions: Intramolecular vs. intermolecular contacts. For ligand binding side chains may or may not contribute. For the latter, mutations have little effect. Myoglobin α Hemoglobin α β β α

Amino Acids Proteins are built from amino acids Amino group and acid group C ε S δ C γ Side chain at Cα C β Chiral, only one enantiomer found in proteins (L-amino acids) N C α C 20 natural amino acids O Methionine

The Amino Acids http://www.ch.cam.ac.uk/magnus/molecules/amino/

Grouping Amino Acids http://www.dreamingintechnicolor.com/infoandideas/aminoacids.gif

The Evolution Way Based on Blosum62 matrix Measure of evolutionary substitution probability

Proteins Are Polypeptides The peptide bond A polypeptide chain

Ramachandran Plot Allowed backbone torsion angles in proteins N H

Torsion Angles

Ramachandran Plots

Experimental Methods Crystallography & NMR spectroscopy

Methods for Structure Determination X-ray crystallography Nuclear Magnetic Resonance (NMR) Modelling techniques More exotic techniques Cryo electron microscopy (Cryo EM) Small angle X-ray scattering (SAXS) Neutron scattering

X-ray Crystallography No size limitation. Protein molecules are stuck in a crystal lattice. Some proteins seem to be uncrystallizable. Slow. Especially suited for studying structural details.

X-rays Fourier transform

The Importance of Resolution 4 Å 3 Å low Molecules/domains Secondary structure elements Residue side chains 2 Å Multiple conformations 1 Å high Atoms

Evaluation of X-ray Structures Key parameters: Resolution R values Agreement between data and model. Usually between 0.15 and 0.25, should not exceed 0.30. Ramachandran plot B factors Contributions from static and dynamic disorder Well determined ~10-20 Å 2, intermediate ~20-30 Å 2, flexible 30-50 Å 2, invisible >60 Å 2.

NMR Spectroscopy Upper limit for structure determination currently ~50 kda. Protein molecules are in solution. Dynamics, protein folding. Slow. Especially suited for studies of protein dynamics of small to medium size proteins.

NMR Basics NMR is nuclear magnetic resonance NMR spectroscopy is done on proteins IN SOLUTION Only atoms 1 H, 13 C, 15 N (and 31 P) can be detected in NMR experiments Proteins up to 30 kda Proteins stable at high concentration (0.5-1mM), preferably at room temperature

NMR Spectroscopy

Evaluation of NMR Structures What regions in the structure are most well-defined? Look at the pdb ensembles to see which regions are well-defined 1RJH Nielbo et al, Biochemistry, 2003

Evaluation of NMR Structures Atomic backbone RMSD: Well-defined structures RMSDs < 0.6 Å RMSD = n 1 ( ' x i x ) 2 i n Less well-defined structures RMSDs > 0.6 Å 1T1H, Andersen et al. JBC, 2004 3GF1, Cooke et al. Biochemistry, 1991

NMR versus X-ray Crystallography Hydrogen atoms are observed! Only 13 C, 15 N and 1 H are observed Study of proteins in solution Only proteins up to 30-40 kda No total map of the structure Information used is incomplete and used as restraints An ensemble of structures is submitted to PDB The solved structure can be used for further dynamics characterization with NMR

PDB Holdings of the Protein Data Bank (PDB): Sep. 2001 Oct. 2007 May 2008 X-ray 13116 39706 43378 NMR 2451 6862 7306 Other 338 250 277 Total 15905 46818 50961 The PDB also contains nucleotide and nucleotide analogue structures. http://www.pdb.org

Summary In evolution structure is conserved longer than both function and sequence. X-ray crystallography Proteins in crystal lattice Many details one model Resolution, R-values, Ramchandran plot NMR spectroscopy Proteins in solution Fewer details many models Violations, RMSD, Ramachandran plot