Protein Struktur (optional, flexible)

Similar documents
Protein Struktur. Biologen und Chemiker dürfen mit Handys spielen (leise) go home, go to sleep. wake up at slide 39

Proteins: Characteristics and Properties of Amino Acids

Properties of amino acids in proteins

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell

Chemistry Chapter 22

Read more about Pauling and more scientists at: Profiles in Science, The National Library of Medicine, profiles.nlm.nih.gov

Solutions In each case, the chirality center has the R configuration

Amino Acids and Peptides

Section Week 3. Junaid Malek, M.D.

UNIT TWELVE. a, I _,o "' I I I. I I.P. l'o. H-c-c. I ~o I ~ I / H HI oh H...- I II I II 'oh. HO\HO~ I "-oh

Protein Structure Bioinformatics Introduction

Exam III. Please read through each question carefully, and make sure you provide all of the requested information.

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods

Translation. A ribosome, mrna, and trna.

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Lecture 15: Realities of Genome Assembly Protein Sequencing

PROTEIN STRUCTURE AMINO ACIDS H R. Zwitterion (dipolar ion) CO 2 H. PEPTIDES Formal reactions showing formation of peptide bond by dehydration:

Protein Structure Marianne Øksnes Dalheim, PhD candidate Biopolymers, TBT4135, Autumn 2013

1. Amino Acids and Peptides Structures and Properties

Problem Set 1

PROTEIN SECONDARY STRUCTURE PREDICTION: AN APPLICATION OF CHOU-FASMAN ALGORITHM IN A HYPOTHETICAL PROTEIN OF SARS VIRUS

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Chemical Properties of Amino Acids

Exam I Answer Key: Summer 2006, Semester C

Protein Structure. Role of (bio)informatics in drug discovery. Bioinformatics

Basic Principles of Protein Structures

CHAPTER 29 HW: AMINO ACIDS + PROTEINS

NH 2. Biochemistry I, Fall Term Sept 9, Lecture 5: Amino Acids & Peptides Assigned reading in Campbell: Chapter

LS1a Fall 2014 Problem Set #2 Due Monday 10/6 at 6 pm in the drop boxes on the Science Center 2 nd Floor

Enzyme Catalysis & Biotechnology

CHEM J-9 June 2014

Biochemistry Quiz Review 1I. 1. Of the 20 standard amino acids, only is not optically active. The reason is that its side chain.

12/6/12. Dr. Sanjeeva Srivastava IIT Bombay. Primary Structure. Secondary Structure. Tertiary Structure. Quaternary Structure.

The Structure of Enzymes!

The Structure of Enzymes!

BCH 4053 Exam I Review Spring 2017

Lecture'18:'April'2,'2013

Physiochemical Properties of Residues

Peptides And Proteins

Central Dogma. modifications genome transcriptome proteome

Principles of Biochemistry

The Structure and Functions of Proteins

Secondary and sidechain structures

Dental Biochemistry Exam The total number of unique tripeptides that can be produced using all of the common 20 amino acids is

Packing of Secondary Structures

titin, has 35,213 amino acid residues (the human version of titin is smaller, with only 34,350 residues in the full length protein).

Overview. The peptide bond. Page 1

Practice Midterm Exam 200 points total 75 minutes Multiple Choice (3 pts each 30 pts total) Mark your answers in the space to the left:

B O C 4 H 2 O O. NOTE: The reaction proceeds with a carbonium ion stabilized on the C 1 of sugar A.

Potentiometric Titration of an Amino Acid. Introduction

A. Two of the common amino acids are analyzed. Amino acid X and amino acid Y both have an isoionic point in the range of

CHEMISTRY ATAR COURSE DATA BOOKLET

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

Biomolecules: lecture 9

Resonance assignments in proteins. Christina Redfield

Protein structures and comparisons ndrew Torda Bioinformatik, Mai 2008

Studies Leading to the Development of a Highly Selective. Colorimetric and Fluorescent Chemosensor for Lysine

Chapter 4: Amino Acids

Housekeeping. Housekeeping. Molecules of Life: Biopolymers

BIS Office Hours

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

Major Types of Association of Proteins with Cell Membranes. From Alberts et al

From Amino Acids to Proteins - in 4 Easy Steps

SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA

Protein Secondary Structure Prediction

Tamer Barakat. Razi Kittaneh. Mohammed Bio. Diala Abu-Hassan

DATA MINING OF ELECTROSTATIC INTERACTIONS BETWEEN AMINO ACIDS IN COILED-COIL PROTEINS USING THE STABLE COIL ALGORITHM ANKUR S.

Biomolecules: lecture 10

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

BIRKBECK COLLEGE (University of London)

Model Mélange. Physical Models of Peptides and Proteins

On the Structure Differences of Short Fragments and Amino Acids in Proteins with and without Disulfide Bonds

7.05 Spring 2004 February 27, Recitation #2

Collision Cross Section: Ideal elastic hard sphere collision:

Review of General & Organic Chemistry

Conformational Analysis

26.7 Laboratory Synthesis of Peptides

EXAM 1 Fall 2009 BCHS3304, SECTION # 21734, GENERAL BIOCHEMISTRY I Dr. Glen B Legge

The MOLECULES of LIFE

INTRODUCTION. Amino acids occurring in nature have the general structure shown below:

THE UNIVERSITY OF MANITOBA. PAPER NO: _1_ LOCATION: 173 Robert Schultz Theatre PAGE NO: 1 of 5 DEPARTMENT & COURSE NO: CHEM / MBIO 2770 TIME: 1 HOUR

Biochemistry Prof. S. DasGupta Department of Chemistry Indian Institute of Technology Kharagpur. Lecture - 06 Protein Structure IV

CHMI 2227 EL. Biochemistry I. Test January Prof : Eric R. Gauthier, Ph.D.

1. What is an ångstrom unit, and why is it used to describe molecular structures?

Medicinal Chemistry: An Overview

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

Advanced Certificate in Principles in Protein Structure. You will be given a start time with your exam instructions

Dental Biochemistry EXAM I

Chapter 3 - Amino Acids

Periodic Table. 8/3/2006 MEDC 501 Fall

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version Document Published by the wwpdb

Useful background reading

Biological Macromolecules

5) An amino acid that doesn't exist in proteins: - Tyrosine - Tryptophan - Cysteine - ornithine* 6) How many tripeptides can be formed from using one

Announcements. Primary (1 ) Structure. Lecture 7 & 8: PROTEIN ARCHITECTURE IV: Tertiary and Quaternary Structure

Proton Acidity. (b) For the following reaction, draw the arrowhead properly to indicate the position of the equilibrium: HA + K + B -

Lecture 26: Polymers: DNA Packing and Protein folding 26.1 Problem Set 4 due today. Reading for Lectures 22 24: PKT Chapter 8 [ ].

Chem. 27 Section 1 Conformational Analysis Week of Feb. 6, TF: Walter E. Kowtoniuk Mallinckrodt 303 Liu Laboratory

Discussion Section (Day, Time):

Rotamers in the CHARMM19 Force Field

Transcription:

Protein Struktur (optional, flexible) 22/10/2009 [ 1 ] Andrew Torda, Wintersemester 2009 / 2010, AST nur für Informatiker, Mathematiker,.. 26 kt, 3 ov 2009

Proteins - who cares? 22/10/2009 [ 2 ] Most important molecules in life? Ask the DA / RA people structural (keratin / hair) enzymes (catalysts) messengers (hormones) regulation (bind to other proteins, DA,..) industrial biosensors to washing powder receptors transporters ( 2, sugars, fats) anti-freeze

Proteins are easy 22/10/2009 [ 3 ] data (protein data bank, www.rcsb.org) 61 000 structures literature on function, interactions, structure software viewers, molecular dynamics simulators, docking,.. nomenclature and rules Proteins are not friendly one cannot take a sequence and predict structure /function data formats are full of surprises, mostly old formats data contains error and mistakes

Protein Rules 22/10/2009 [ 4 ] Physics /chemistry versus rules / dogma / beliefs / folklore Physics / Chemistry protein + water = set of interacting atoms can be calculated (not really) Rules (not quantified) proteins unfold if you heat them (exceptions?) if they contain lots of charged amino acids, they are soluble if they are more than 300 residues, they have more than one domain, proteins fold to a unique structure (could you prove this?) lowest free energy structure

Protein chemistry 22/10/2009 [ 5 ] Chemists / biochemists may sleep (quietly) Short version proteins are sets of building blocks (amino acids, residues, Reste) 20 types of residue chains of length few to 10 3 ( 100 or 200 typical) small ones (< 50) are peptides Longer version

myoglobin picture 2w6w Sizes 22/10/2009 [ 6 ] 1 Å = 10-10 m or 0.1 nm structure size bond CH 1 Å CC 1.5 Å protein radius α-helix spacing C α i to C α i+1 10-10 2 Å 5 ½ Å 3.8 Å

proteins are polymers Is it obvious what R is? 22/10/2009 [ 7 ] simple polymers A X B many times gives A X X X X X X X B example what kind of polymer would this give?

Why are proteins interesting polymers? 22/10/2009 [ 8 ] boring polymer gives uninteresting structures K for plastic bags, haushaltsfolie. ot nice regular structures.. What can we do to make things more protein like?

Giving proteins character 1 22/10/2009 [ 9 ] more complicated backbone with H-bond donor acceptor R R R basis of standard regular structures in proteins (secondary structure) R R repeating polymer unit: if this was all there was all proteins would be the same R

protein chemistry 22/10/2009 [ 10 ] amino acids (monomers) all look like: H 2 H C C H maybe H 3 + H C R C }- R sidechain α carbon or C α how can we construct specific structures? different kinds of "R" groups

Putting monomers together 22/10/2009 [ 11 ] H 2 H C R 1 H C H H H + 2 C C + R 2 H 2 H C R 3 C H H 2 H C C H H C C H H C H C R 1 R 2 R 3 protein synthesis story (biochemistry lectures) peptides and proteins < 30 or 40 residues = peptide > 30 or 40 residues = protein

side chain possibilities 22/10/2009 [ 12 ] big / small charged +, charged -, polar hydrophobic (not water soluble), polar interactions between sites A C C R W S T G B

Backbone and consequences 22/10/2009 [ 13 ] peptide bond is planar partial double bond character (resonance forms) shorter than other C- nearly always trans two bonds can rotate H 2 H C R 1 C H H C R 2 C H H C R 3 H C H 2 H C R 1 C H H C R 2 C H H C R 3 H C phi φ psi ψ

ramachandran plot can we rotate freely? no steric hindrance Ramachandran plot ψ psi 180 β 120 60 α 0-180 -120-60 0 60 120 180-60 -120-180 φ phi diagram from http://www.cgl.ucsf.edu/home/glasfeld/tutorial/aaa/aaa.html 22/10/2009 [ 14 ]

Backbone H bonds 22/10/2009 [ 15 ] oxygen is slightly negative H bond is polar δ + δ - δ + δ - C H H C H-bonds can be near or far in sequence fairly stable at room temperature

Secondary structure 22/10/2009 [ 16 ] regular structures using information so far rotate phi, psi angles so as to form H-bonds where possible do not force side chains to hit each other (steric clash) two common structures α-helix β-strand / sheet

diagram from Voet, D.J. and Voet, J.G, Biochemistry, Wiley, 2004 α helix 22/10/2009 [ 17 ] each C of residue i H-bonded to of i+4 3.6 residues per turn 2 H-bonds per residue side chains well separated

β-sheet β-strand stretch out backbone and make H and C groups point out β-sheet join these strands together with H-bonds (2 H-bonds/residue) anti-parallel or parallel diagram from Voet, D.J. and Voet, J.G, Biochemistry, Wiley, 2004 22/10/2009 [ 18 ]

After α-helix and β-sheet 22/10/2009 [ 19 ] do helices and sheets explain everything? no there is flexibility in the angles (look at plot) geometry is not perfectly defined there are local deviations and exceptions other common structures tighter helices some turns other structure coil, random, not named ψ psi 180 120 60 0-60 -180-120 -60 0 60 120 180-120 -180 φ phi

What determines secondary structure? 22/10/2009 [ 20 ] So far secondary structure pattern of H-bonding Almost all residues have H-bond acceptor and donor all could form α-helix or β-sheet? o Difference? sequence of side-chains overall folding Why else are sidechains important chemistry of proteins (interactions, catalysis) Fundamental dogma the sequence of sidechains determines the protein shape

Side chain properties 22/10/2009 [ 21 ] properties big / small neutral / polar / charged special ( ) example phenylalanine side chain looks like benzene (benzin) very insoluble benzene would rather interact with benzene than water what if you have phe-phe-phe poly-phe? does not happen in nature (can be made) would be insoluble not like a real peptide phe is a constituent of real proteins has a role

Taylor, W.R. (1986) J. Theor. Biol., 215-218 Properties are not clear cut 22/10/2009 [ 22 ] You can be big / small, hydrophic / polar combinations are possible Do not memorise this figure

Sidechain interactions 22/10/2009 [ 23 ] ionic (if the sidechains have charge) hydrophobic (insoluble sidechains) H-bonds (some donors and acceptors) repulsive

Summary of amino acids (first dozen) 22/10/2009 [ 24 ]

summary of amino acids (second lot) 22/10/2009 [ 25 ]

Amino Acids by property 22/10/2009 [ 26 ] aromatic tryptophan phenylalanine tyrosine

rather hydrophobic 22/10/2009 [ 27 ] leucine isoleucine cysteine S methionine S alanine proline glycine valine

Polar 22/10/2009 [ 28 ] threonine serine glutamine asparagine

charged 22/10/2009 [ 29 ] histidine arginine lysine aspartate glutamate

Hydrophobicity how serious? 22/10/2009 [ 30 ] very serious, but simplified the lists above are ph dependent difficult to measure experimentally (some aspects) is hydrophobicity really defined? ther properties - size big small gly trp ala

ther properties chemistry / geometry 22/10/2009 [ 31 ] proline only one rotatable angle! peptide bond sometimes cis pro ramachandran plot

picture from Stryer, L, Biochemistry, WH Freeman, 1981 gly and cys 22/10/2009 [ 32 ] glycine no side chain can visit forbidden parts of phi-psi map (4 000 points here) cysteine forms covalent links with other cys

Summary so far 22/10/2009 [ 33 ] proteins are heteropolymers backbone forms α-helices and β-strands (and more) not sequence specific side-chains determine the pattern of secondary structure overall protein shape special amino acids cys (forms disulfide bridges) gly (can visit "forbidden" regions of ramachandran plot) pro (no H-bond donor) how many sequences can one have? nres 20

22/10/2009 [ 34 ] some rules are unavoidable omenclature Alanine Ala A Cysteine Cys C Aspartic acid Asp D Glutamic acid Glu E Phenylalanine Phe F Glycine Gly G Histidine His H Isoleucine Ile I Lysine Lys K Leucine Leu L Methionine Met M Asparagine Asn Proline Pro P Glutamine Gln Q Arginine Arg R Serine Ser S Threonine Thr T Valine Val V Tryptophan Trp W Tyrosine Tyr Y always write from to C terminal important convention

Definitions, primary, secondary 22/10/2009 [ 35 ] More definitions primary structure sequence of amino acids ACDF (ala cys asp phe ) secondary structure α-helix, β-sheet (+ few more) structure defined by local backbone tertiary structure how these units fold together coordinates of a protein

Protein structure general comments 22/10/2009 [ 36 ] primary, secondary, tertiary structure how real? primary/secondary well defined edges can blur supersecondary struct / tertiary

Representation 22/10/2009 [ 37 ] Ultimately, our representation of a structure ATM 1 ARG 1 31.758 13.358-13.673 1.00 18.79 1BPI 137 ATM 2 CA ARG 1 31.718 13.292-12.188 1.00 14.26 1BPI 138 ATM 3 C ARG 1 33.154 13.224-11.664 1.00 18.25 1BPI 139 ATM 4 ARG 1 33.996 12.441-12.225 1.00 20.10 1BPI 140 ATM 5 CB ARG 1 30.886 12.103-11.724 1.00 16.74 1BPI 141 ATM 6 CG ARG 1 29.594 11.968-12.534 1.00 15.96 1BPI 142 ATM 7 CD ARG 1 28.700 13.182-12.299 1.00 15.45 1BPI 143 ATM 8 E ARG 1 27.267 12.895-12.546 1.00 12.82 1BPI 144 ATM 9 CZ ARG 1 26.661 13.087-13.727 1.00 17.38 1BPI 145 ATM 10 H1 ARG 1 27.370 13.558-14.735 1.00 18.38 1BPI 146 ATM 11 H2 ARG 1 25.367 12.797-13.838 1.00 25.73 1BPI 147 ATM 12 PR 2 33.800 13.936-10.586 1.00 17.07 1BPI 148 ATM 13 CA PR 2 34.976 13.367-9.840 1.00 14.99 1BPI 149 ATM 14 C PR 2 34.960 11.922-9.660 1.00 13.11 1BPI 150 ATM 15 PR 2 33.962 11.306-9.391 1.00 10.57 1BPI 151 ATM 16 CB PR 2 34.922 14.145-8.523 1.00 15.81 1BPI 152 ATM 17 CG PR 2 34.058 x, y, z coordinates 15.391-8.737 1.00 18.91 1BPI 153 ATM 18 CD PR 2 33.371 15.273-10.096 1.00 19.41 1BPI 154 ATM 19 ASP 3 36.192 11.317-9.707 1.00 8.73 1BPI 155 drawing the structure?

Representations 22/10/2009 [ 38 ] where are atoms? therapeutic binding which residues could be involved in interactions?

Representations 22/10/2009 [ 39 ] what is the surface? where could molecules fit?

Representations 22/10/2009 [ 40 ] colour surface by hydrophobicity

Representations 22/10/2009 [ 41 ] highlight / emphasise regular structures

Different levels of abstraction pictures from "Structural Bioinformatics", ed Bourne, PE and Weissig, H., Wiley ew York (2003) 22/10/2009 [ 42 ]

Atomistic For details where does a ligand bind? which interactions is a residue involved in? Ribbons verview shape number secondary struct elements symmetry strands helices pictures from "Structural Bioinformatics", ed Bourne, PE and Weissig, H., Wiley ew York (2003) 22/10/2009 [ 43 ]

More abstract no idea of real shape very quickly classify a protein example lots of serine proteases lots of different sequences all very similar at this level of abstraction pictures from "Structural Bioinformatics", ed Bourne, PE and Weissig, H., Wiley ew York (2003) 22/10/2009 [ 44 ]

Why does structure matter? 22/10/2009 [ 45 ] what residues can I change and preserve function? what is the reaction mechanism of an enzyme? what small molecules would bind and block the enzyme? is this protein the same shape as some other of known function? Where do structures come from? X-ray crystallography MR + a bit of small angle X-ray scattering, electron diffraction,

Atomic coordinates - warnings 22/10/2009 [ 46 ] remember the coordinate file? lots of problems atoms and residues missing numbering can be peculiar history suits fortran 66 (think columns) non-standard amino acids nucleotides, ligands accuracy ATM 1 ARG 1 31.758 13.358-13.673 1.00 18.79 1BPI 137 ATM 2 CA ARG 1 31.718 13.292-12.188 1.00 14.26 1BPI 138 ATM 3 C ARG 1 33.154 13.224-11.664 1.00 18.25 1BPI 139 ATM 4 ARG 1 33.996 12.441-12.225 1.00 20.10 1BPI 140 ATM 5 CB ARG 1 30.886 12.103-11.724 1.00 16.74 1BPI 141 ATM 6 CG ARG 1 29.594 11.968-12.534 1.00 15.96 1BPI 142 ATM 7 CD ARG 1 28.700 13.182-12.299 1.00 15.45 1BPI 143 ATM 8 E ARG 1 27.267 12.895-12.546 1.00 12.82 1BPI 144 ATM 9 CZ ARG 1 26.661 13.087-13.727 1.00 17.38 1BPI 145 ATM 10 H1 ARG 1 27.370 13.558-14.735 1.00 18.38 1BPI 146 ATM 11 H2 ARG 1 25.367 12.797-13.838 1.00 25.73 1BPI 147 ATM 12 PR 2 33.800 13.936-10.586 1.00 17.07 1BPI 148 ATM 13 CA PR 2 34.976 13.367-9.840 1.00 14.99 1BPI 149 ATM 14 C PR 2 34.960 11.922-9.660 1.00 13.11 1BPI 150 ATM 15 PR 2 33.962 11.306-9.391 1.00 10.57 1BPI 151 ATM 16 CB PR 2 34.922 14.145-8.523 1.00 15.81 1BPI 152 ATM 17 CG PR 2 34.058 15.391-8.737 1.00 18.91 1BPI 153 ATM 18 CD PR 2 33.371 15.273-10.096 1.00 19.41 1BPI 154 ATM 19 ASP 3 36.192 11.317-9.707 1.00 8.73 1BPI 155

resolution, precision, accuracy 22/10/2009 [ 47 ] coordinates 1.1 1.0 8.5 what do they mean? random errors non-systematic / noise / uncertainty should be scattered around correct point from any measurement there are errors ± x x-ray crystallography has model for data uncertainty (probability) resolution (experimental) < 1 Å (good) > 5 Å (bad, but excusable monster structures)

22/10/2009 [ 48 ] X-ray crystallography non-systematic errors small problems: ( and look the same) few huge problems newer structures are better proteins are not static overall motion local motion C α C β C α C β C C

MR structures different philosophy to X-ray lots of little internal distances do not quite define structure generate 50 or 10 2 solutions look at scatter of solutions as with X-ray some parts are well defined some not structure 1sm7 22/10/2009 [ 49 ]

Summarise and stop 22/10/2009 [ 50 ] roles of proteins heteropolymers 20 types of amino acid / residue geometry avoiding atomic clashes, forming H bonds leads to regular secondary structure chemistry of amino acids very different to another unique structure for a sequence reflects these differences representations of structures structures in PDB are experimental have errors