BIOINF 4120 Bioinformatics 2 - Structures and Systems - Oliver Kohlbacher Summer Protein Structure Prediction I

Size: px
Start display at page:

Download "BIOINF 4120 Bioinformatics 2 - Structures and Systems - Oliver Kohlbacher Summer Protein Structure Prediction I"

Transcription

1 BIOINF 4120 Bioinformatics 2 - Structures and Systems - Oliver Kohlbacher Summer Protein Structure Prediction I

2 Structure Prediction Overview Overview of problem variants Secondary structure prediction Automatic extraction from 3D structures Prediction algorithms Chou-Fasman PHD Consensus Server Benchmarks: CASP 2

3 Basic Problem KVYGRCELAAAMKRLGLDNYR GYSLGNWVCAAKFESNFNTHA TNRNTDGSTDYGILQINSRWW CNDGRTPGSKNLCNIPCSALL SSDITASVNCAKKIASGGNGM NAWVAWRNRCKGTDVHAWIRG CRL Tertiary Structure Secondary Structure 3

4 Protein Structure Prediction Basic Problem: Given a sequence, predict its structure Choice of method depends on Availability of homologous structures Availability of additional experimental data Quality/accuracy of the desired model Predict backbone positions only We will model side-chains independently Techniques for this will be discussed later 4

5 Methods Sec. Struct. Prediction Sequence Search Sequence DB Secondary Structure Sequence Homologs Mult. Alignment + Profiles Alignment/ Profiles Ab initio Prediction Fold Recognition Threading Model Modeling/ Refinement Refined Model After: Zimmer, Lengauer: Bioinformatics From Genomes to Drugs, Wiley VCH,

6 Ab Initio Prediction Prediction based on physical models only (ab initio = first principles ) Does not require information from homologous structures Prediction of new folds possible Potential Sequence Ab initio Prediction Applicable for small proteins only (<100 aa) Model 6

7 Threading Threading Model a target sequence onto the structures of several homologs (templates) Choose the template structure that best matches the target sequence Build a full model of the sequence based on the template Restricted to the modeling of known fold classes Fold Recognition Simplified version of the threading problem Identify fold class of the target sequence only 7

8 Secondary Structure Prediction Given: sequence Find: KVYGRCELAAAMKRLGLDNYRGYSLGNWVC AAKFESNFNTHATNRNTDGSTDYGILQINS RWWCNDGRTPGSKNLCNIPCSALLSSDITA SVNCAKKIASGGNGMNAWVAWRNRCKGTDV HAWIRGCRL Secondary structure assignment for three classes E (extended, strand), H (helix), C/ (coil) for every aa. KVYGRCELAAAMKRLGLDNYRGYSLGNWVCAAKFESNFNTHATNRNTD -----HHHHHHHHH EEEEE GSTDYGILQINSRWWCNDGRTPGSKNLCNIPCSALLSSDITASVNCAK ----EEEEEE HHHHHH KIASGGNGMNAWVAWRNRCKGTDVHAWIRGCRL HHH EEE

9 Test Data To assess the quality of predictions, we need to have some gold standard This is usually done by extracting the secondary structure from high-quality crystal structures from the PDB Problem: how to extract the secondary structure from a 3D structure? DSSP and STRIDE are two well-known algorithms for automatic secondary structure assignment from a 3D structure They consider backbone torsion angles, H-bond patterns, and other parameters that are characteristic for certain secondary structures The algorithm assigns one of three/eight secondary structure classes to each aa of the structure 9

10 DSSP Core of DSSP is a function for the detection of H-bonds formed by the protein backbone Decision whether an H-bond exists is made based on the electrostatic energy for each acceptor/donor pair: Assumption: C=O and H-N are polarized and bear partial charges q + and q - : C=O r OH q + q - r ON q + q- q - = e 0 q + = e 0 Kabsch, Sander, Biopolymers (1983), 22,

11 DSSP Hydrogen positions for backbone NH are constructed from standard bond length/angles (not contained in XRD data!) DSSP assumes there is an H-bond between two amino acids (i,j) if E ij is lesser than the threshold t = 2.4 kj/mol If H-bonds are present for (i,i+3), (i,i+4) or(i,i+5), this is interpreted as 3-, 4-, or 5-turn Multiple adjacent turns of the same type correspond to , α- and π-helices A β-bridge is assumed if there exist H-bonds for (i-1,j) and (j,i+1) [parallel] (i,j) and (j,i) [anti-parallel] Multiple adjacent β-bridges of the same type indicate the presence of β-sheets 11

12 STRIDE STRIDE is an improved version of DSSP Improved energy function for H-bonds Includes dependence on H-bond angle Different thresholds for helices/sheets Also considers backbone torsion angles Often recognizes amino acids at the end of a secondary structure element which DSSP would miss ( slightly longer helices/strands) STRIDE yields slightly better results than DSSP (95% correct for helices, 93% for strands; relative to manually annotated X-ray structures) Frishman, Argos, Proteins (1995), 23,

13 STRIDE Empirical potential for H-bond energy contains a distance-dependent contribution (E r ) and directional contribution (E t, E p ) Distance dependence is modeled by a 8-6-potential where r is the distance between donor- and acceptor atoms (N, O) and C, D are constants derived from average H-bond donor-acceptor distances The angle-dependent terms describe the deviation from the ideal bond geometry, however, they are rather complex and thus left out here (for details see Frishman & Argos, 1995) Frishman, Argos, Proteins (1995), 23,

14 DSSPcont Secondary structure assignment not unambiguous Structures are flexible Parts of the structure might fluctuate between different secondary structures Example: H-bonds at the end of a helix are often very close to the threshold of DSSP DSSPcont: instead of a fixed assignment, estimate probabilities for each secondary structure Andersen et al., Structure (2002), 10,

15 DSSPcont Apply DSSP, but compute secondary structures assignment for various thresholds t T= {-1.0, -0.9, -0.1 kcal/mol} Every aa i of the sequence will be assigned a secondary structure class c = DSSP(i, t) with c C = {G, H, I, T, E, B, S, L} for each threshold t We now define a binary function DSSP it (c) as For each sequence position i DSSP it (c) defines a 8x10-matrix with the DSSP assignments for all thresholds Andersen et al., Structure (2002), 10,

16 DSSPcont From this matrix DSSPcont determines the probabilities DSSPcont i (c) for each position i and class c by a scaling with empirically determined weights w it : This assigns a vector of the probabilities for each of the eight secondary structure classes to each position 16

17 DSSPcont Secondary structure variability (in particular at the ends of the helices) in the 23 NMR models of 1CY3 are correctly captured by DSSP This allows the identification of areas of unstable secondary structure Andersen et al., Structure (2002), 10,

18 Quality Measures Three-state classification (C/H/E Coil/Helix/Extended) Q 3 score: percentage of correctly assigned amino acids according to three-state classification In particular the ends of secondary structure elements are often not unambiguously classifiable (c.f. thresholding in DSSP!) Predictions with 80+% accuracy are thus excellent predicted observed 18

19 Quality Measures Occasionally eight-state classifications are used (H/E/G/I/T/B/S/L) 3 10 helix (G) α-helix (H) π-helix (I) helix turn (T) strand (E) β-bridge(b) bend (S) other/loop (L) Q 8 score: fraction of correctly assigned amino acids Eight classes can be mapped back to three: HELIX = helix + α-helix + π-helix EXTENDED = strand + β-bridge LOOP = loop + bend + helix turn Q 8 score generally smaller than Q 3 score 19

20 Segment OVerlap SOV Measure for the overlap between prediction and observed secondary structure, but based on the comparison of pairs of segments Compare observed (s b ) and predicted (s v ) segments of same type (type: H, C or E) 100% for entirely correct assignment minov(s b, s v ): length of the intersection of s b and s v maxov(s b, s v ): length of the union of s b and s v s b minov(s b, s v ) s v maxov(s b, s v ) predicted observed 20

21 Segment OVerlap SOV δ(s b,s v maxov(sb,sv)-minov(sb,s ) = min sb sv minov(sb,sv); ; 2 2 s length of segment s t {H, C, E} secondary structure type N = s b total length of all segments S(t): set of all pairs (s v, s b ) of overlapping segments of type t {H, C, E} in predicted and observed structure v ); 21

22 Secondary Structure Prediction Several generations of algorithms 1st Generation Consider properties of individual aa only (Q %) 2nd Generation Include local environment (Q 3 65%) 3rd Generation Include information from homologs (Q 3 > 70%) 4th Generation Consensus methods combining results from several other (subprediction) methods (Q %) 22

23 Chou-Fasman Algorithm Idea: amino acids differ in their affinity towards specific secondary structures Analysis of structural databases: how often is each aa found in each secondary structure type Let n j the number of occurrences of aa j in all proteins of the database Probability p j of aa j occurring in a protein is then p j = n j / j n j Similarly, define the probability to find aa j in secondary structure type k (with k {C, H, E}) as p j,k = n j,k / j n j,k Chou, Fasman, Biochemistry (1974), 13,

24 Chou-Fasman Algorithm Similarly defined relative probability f j,k for finding aa j in secondary structure type k: f j,k = n j,k / n j Average probability for any of the 20 aa to be found in secondary structure k can thus be written as <f k > = j f j,k / 20 = j n j,k / j n j Relative probability that aa j occurs in secondary structure k is thus: P j,k = f j,k / <f k > These relative probabilities define the preference of the individual amino acids for a certain secondary structure type Chou, Fasman, Biochemistry (1974), 13,

25 Chou-Fasman Algorithm Divide the 20 aa into several classes according to their P αi : Strong helix builder H α (Glu, Ala, Leu) Helix builders h α (His, Met, Gln, Trp, Val, Phe) Weak helix builders I α (Lys, Ile) Indifferent i α (Asp, Thr, Ser, Arg, Cys) Weak helix breakers b α (Asn, Tyr) Strong helix breakers B α (Pro, Gly) Similarly for β-strands: H β, h β, i β, b β, B β Chou, Fasman, Biochemistry (1974), 13,

26 Chou-Fasman Parameters AA P α Class AA P β Class AA P α Class AA P β Class Glu 1.53 Met 1.67 Ala 1.45 H α Val 1.65 H β Ile 1.00 I α Ala 0.93 I β Asp 0.98 Arg 0.90 Leu 1.34 Ile 1.60 Thr 0.82 Gly 0.81 i β His 1.24 Cys 1.30 Ser 0.79 Asp 0.80 i α Met 1.20 Tyr 1.29 Arg 0.79 Lys 0.74 Gln 1.17 Phe 1.28 Cys 0.77 Ser 0.72 h α Trp 1.14 Gln 1.23 Val 1.14 Leu 1.22 h β Asn 0.73 His 0.71 b α Tyr 0.61 Asn 0.65 b β Phe 1.12 Thr 1.20 Lys 1.07 I α Trp 1.19 Pro 0.59 Pro 0.62 B α Gly 0.53 Glu 0.26 B β Chou, Fasman, Biochemistry (1974), 13,

27 Chou-Fasman Algorithm II Example:.. T S P T A E L M R S T G.. i α i α B α i α H α H α h α H α i α i α i α B α

28 Chou-Fasman Algorithm II Example:.. T S P T A E L M R S T G.. i α i α B α i α H α H α h α H α i α i α i α B α = 5 Helix start 28

29 Chou-Fasman Algorithm II Example:.. T S P T A E L M R S T G / 4 > 1.0 Expand to the left with window of 4 aa (based on P α values!) 29

30 Chou-Fasman Algorithm II Example:.. T S P T A E L M R S T G / 4 < 1.0 Expand to the left with window of 4 aa (based on P α values!) 30

31 Chou-Fasman Algorithm II Example:.. T S P T A E L M R S T G / 4 > 1.0 Expand to the right with window of 4 aa (based on P α values!) 31

32 Chou-Fasman Algorithm II Example:.. T S P T A E L M R S T G / 4 > 1.0 Expand to the right with window of 4 aa (based on P α values!) 32

33 Chou-Fasman Algorithm II Example:.. T S P T A E L M R S T G / 4 < 1.0 Expand to the right with window of 4 aa (based on P α values!) 33

34 Chou-Fasman Algorithm II Example:.. T S P T A E L M R S T G Similar procedure is the applied for strands 34

35 Chou-Fasman Algorithm III Algorithm (simplified!) Assign α/β classes to each aa of sequences S = s 1 s 2...s k A: HELICES Assign a weight w i to every aa i with w(h α ) = w(h α ) = 1, w(i α ) = 0.5, w(b α ) = w(b α ) = 1 Find helix cores Find first window of length 6 aa with w i 4 Expand cores to the left and to the right Windows of length 4 Shift to the left and right until P α s i < 4 Compatible aa of the first window no longer matching are considered part of the helix (special rule for compatibility) Chou, Fasman, Biochemistry (1974), 13,

36 Chou-Fasman Algorithm II Algorithm (simplified!) B: STRANDS Assign weights w i with w(h β ) = w(h β ) = 1, w(i α ) = 0.5, w(b α ) = w(b α ) = 1 Find strand cores Windows of length five with Three or more H β or h β At most one B β or b β Expand cores to the left and right Windows of four aa Shift left/right until P β s i < 4 Chou, Fasman, Biochemistry (1974), 13,

37 Chou-Fasman Algorithm III Algorithm (simplified!) C: CONFLICT RESOLUTION For segments marked as α and β: Calculate average P avg α and P avg β Helix, if P avg α > P avg β Strand, if P avg α < P avg β Complete algorithm contains further rules for assignments on the ends of segments and conflict resolution Chou, Fasman, Biochemistry (1974), 13,

38 Chou-Fasman Algorithm Online prediction: Prediction accuracy rather low (50-60%) There is a whole range of improved methods: Including the prediction of turns Improved statistics (Chou, Fasman: 15 proteins!) Key problem: neighboring residues should have a strong influence and need to be considered beyond an averaging 38

39 Non-Locality Same sequence produces different secondary structures: Val-Asn-Thr-Phe-Val in 1ECN (80-84) and 9RSA (43-47) 1ECN 9RSA 39

40 Non-Locality Strands show stronger non-locality than helices: interactions between very distant sequence regions necessary for stabilization Helices: interactions only between adjacent turns of the helix (at most 5 aa removed!) 40

41 2nd Generation Methods Include neighboring residues Drastically improves prediction for helices Strands still difficult Wide range of methods employing of all sorts of techniques from statistical learning Artificial neural networks LDFs (Linear Discriminant Functions) Nearest-neighbor classifiers Support Vector Machines Hidden Markov Models 41

42 GOR Method Garnier-Osguthorpe-Robson method Several variants (GOR I GOR IV) Here: GOR IV as an example of a 2nd generation method Includes neighboring residues in a wider window Window length: GOR IV: 17 aa Common lengths of secondary structure elements: Helices ca aa Strands ca aa 42

43 GOR IV Instead of P ij there are now three matrices (PSSMs, positionspecific scoring matrices) One for each of the classes H, C, E Matrix entry corresponds to a probability to find a certain residue in this environment in a given secondary structure type Val Tyr Cys Ala YGRCELAAAMKRLGLDNYRGYSLGNWVCAAKFES 43

44 GOR IV Matrix entries S α ij are determined as Score for position i is then obtained by summation over the whole window Tyr Gly Met YGRCELAAAMKRLGLDNYRGYSLGNWVCAAKFES 44

45 GOR IV Requires a large data basis to determine all matrix elements with sufficient accuracy Still leads to ambiguities, in particular at the ends of secondary structure elements Prediction quality: Q 3 64% Available online at There exist further, slightly improved versions (e.g. GOR V) 45

46 Third-Generation Methods Only about 65% of the information required is local 1 st /2 nd generation methods cannot get much better Observation About 67% of the residues of a sequence can be exchanged without breaking the secondary structures Evolution has tried many of these neutral mutations Evolutionarily related (homologous) sequences contain this information If there are helix breakers in homologous sequences at the same position, it is unlikely that there is a helix This type of information is easily integrated through sequences profiles 46

47 PHD PHD uses Artificial neural networks (ANNs) for classification Profiles of homologous sequences Three-layered ANN 1st + 2nd layer: mapping of the sequence/profile onto secondary structure classes 3rd layer: majority vote on the results of the previous layers Rost, Sander, JMB (1993), 252, 584) 47

48 Recap: ANNs Graph defines topology Arranged in layers Weighted edges Weighted summation of input signals (nonlinear) activation function f Popular choice: f = logistic function I 1 I 2 I 3 w 1 w 2 w 3 /f 48

49 PHD Topology of the ANN Query.. K E L N D L E K K Y N A H I G.. Alin.... Seq.... K-HK EDAE FFFF SAAS QKKQ LLLL EEEE KEKK KQEK FFYF DDND AAAA RKKR LLLL GGGG st Layer seq.-to-struct.... 2nd Layer Struct-to-struct.. 3rd Layer Jury Decision 2.46 Helix! After: Rost, Sander, J. Mol. Biol. (1993), 232,

50 PHD Post processing step then removes secondary structure elements with a length below three aa ANN is trained on DSSP-annotated X-ray structures Results: Use of profiles instead of single sequences improves Q 3 by about 6%, use of majority votes adds another 2% Improved version PHD3 improves Q 3 to about 75% 50

51 PSIPRED I Three-step algorithm Construction of a profile Prediction with a two-layer ANN Filtering of predictions Profile generation PSI-BLAST run (three iterations) of the sequence against a large, non-redundant protein sequences database PSI-BLAST profile (scoring matrix) serves as input to the first layer of the ANN Jones, J. Mol. Biol. (1999), 292,

52 PSIPRED II A window of 15 rows of the profile is used for the first layer 15 x 3 outputs of the first layer are connected to the second layer, which recognizes neighboring residues of similar secondary structure (segment filtering) 2nd layer produces final classification A C D E F G H I K L M N P Q R S T V W Y - Profile 15x21 inputs 75 hidden nodes 3 outputs 60 inputs 60 hidden nodes 3 outputs Jones, JMB (1999), 292,

53 PSIPRED III Training of the ANN through back propagation 2nd layer removes very short secondary structure elements Results: PSIPRED is one of the best prediction algorithms currently available Online server: Q 3 ~ 77% Improved versions: Q 3 ~ 81% Jones, J. Mol. Biol. (1999), 292,

54 sspro Uses bidirectional recurrent neural (BRNN) Windows size of 41 AA Evolutionary information from multiple alignment Q 3 ~ 76% Baldi, Brunak, Frasconi, Soda, Pollastri, Bioinformatics (1999), 15,

55 Consensus Methods JPRED Meta Server: uses six independent methods in parallel NNSSP (a variant of SSP) PHD MULPRED (multiple predictions including GOR, Chou & Fasman) ZPRED PREDATOR DSC Majority vote for each amino acid If no clear winner: use result of PHD! Accuracy: 73% (1% better than PHD) 55

56 CASP5 Results CASP Critical Assessment of Structure Prediction a blind prediction competition Meta servers come out on top TOP 10 achieves SOV of about 80% (CASP4, 2000: 76%) Successful meta servers are based on sspro, PSIPRED and/or SAM-T02 (HMM approach) Helix predictions still about 10% better than those for strands Aloy et al., Proteins: Structure, Function, Genetics (2003), 53,

57 CASP5 Secondary Structure Aloy et al., Proteins: Structure, Function, Genetics (2003), 53,

58 Summary Secondary structure prediction is a first step in tertiary structure prediction Successful methods consider large sequence stretches and evolutionary information alike Meta-servers yield slightly superior results Prediction accuracies (Q 3 ) of 75-80% are possible 58

59 References Burkhard Rost: Prediction in 1D, In: Structural Bioinformatics (Hrsg.: P. E. Bourne, H. Weissig), Wiley, 2003 Ralf Zimmer, Thomas Lengauer: Structure Prediction, Chapter 5 in T. Lengauer (Hrsg.): Bioinformatics: From Genomes to Drugs, Wiley,

Protein Structures: Experiments and Modeling. Patrice Koehl

Protein Structures: Experiments and Modeling. Patrice Koehl Protein Structures: Experiments and Modeling Patrice Koehl Structural Bioinformatics: Proteins Proteins: Sources of Structure Information Proteins: Homology Modeling Proteins: Ab initio prediction Proteins:

More information

Physiochemical Properties of Residues

Physiochemical Properties of Residues Physiochemical Properties of Residues Various Sources C N Cα R Slide 1 Conformational Propensities Conformational Propensity is the frequency in which a residue adopts a given conformation (in a polypeptide)

More information

Protein Secondary Structure Prediction using Feed-Forward Neural Network

Protein Secondary Structure Prediction using Feed-Forward Neural Network COPYRIGHT 2010 JCIT, ISSN 2078-5828 (PRINT), ISSN 2218-5224 (ONLINE), VOLUME 01, ISSUE 01, MANUSCRIPT CODE: 100713 Protein Secondary Structure Prediction using Feed-Forward Neural Network M. A. Mottalib,

More information

Bioinformatics III Structural Bioinformatics and Genome Analysis Part Protein Secondary Structure Prediction. Sepp Hochreiter

Bioinformatics III Structural Bioinformatics and Genome Analysis Part Protein Secondary Structure Prediction. Sepp Hochreiter Bioinformatics III Structural Bioinformatics and Genome Analysis Part Protein Secondary Structure Prediction Institute of Bioinformatics Johannes Kepler University, Linz, Austria Chapter 4 Protein Secondary

More information

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Introduction to Comparative Protein Modeling. Chapter 4 Part I Introduction to Comparative Protein Modeling Chapter 4 Part I 1 Information on Proteins Each modeling study depends on the quality of the known experimental data. Basis of the model Search in the literature

More information

Statistical Machine Learning Methods for Bioinformatics IV. Neural Network & Deep Learning Applications in Bioinformatics

Statistical Machine Learning Methods for Bioinformatics IV. Neural Network & Deep Learning Applications in Bioinformatics Statistical Machine Learning Methods for Bioinformatics IV. Neural Network & Deep Learning Applications in Bioinformatics Jianlin Cheng, PhD Department of Computer Science University of Missouri, Columbia

More information

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods

Protein structure. Protein structure. Amino acid residue. Cell communication channel. Bioinformatics Methods Cell communication channel Bioinformatics Methods Iosif Vaisman Email: ivaisman@gmu.edu SEQUENCE STRUCTURE DNA Sequence Protein Sequence Protein Structure Protein structure ATGAAATTTGGAAACTTCCTTCTCACTTATCAGCCACCT...

More information

Protein Secondary Structure Prediction

Protein Secondary Structure Prediction Protein Secondary Structure Prediction Doug Brutlag & Scott C. Schmidler Overview Goals and problem definition Existing approaches Classic methods Recent successful approaches Evaluating prediction algorithms

More information

PROTEIN SECONDARY STRUCTURE PREDICTION: AN APPLICATION OF CHOU-FASMAN ALGORITHM IN A HYPOTHETICAL PROTEIN OF SARS VIRUS

PROTEIN SECONDARY STRUCTURE PREDICTION: AN APPLICATION OF CHOU-FASMAN ALGORITHM IN A HYPOTHETICAL PROTEIN OF SARS VIRUS Int. J. LifeSc. Bt & Pharm. Res. 2012 Kaladhar, 2012 Research Paper ISSN 2250-3137 www.ijlbpr.com Vol.1, Issue. 1, January 2012 2012 IJLBPR. All Rights Reserved PROTEIN SECONDARY STRUCTURE PREDICTION:

More information

Proteins: Structure & Function. Ulf Leser

Proteins: Structure & Function. Ulf Leser Proteins: Structure & Function Ulf Leser This Lecture Proteins Structure Function Databases Predicting Protein Secondary Structure Many figures from Zvelebil, M. and Baum, J. O. (2008). "Understanding

More information

Protein Structure Prediction and Display

Protein Structure Prediction and Display Protein Structure Prediction and Display Goal Take primary structure (sequence) and, using rules derived from known structures, predict the secondary structure that is most likely to be adopted by each

More information

Bioinformatics: Secondary Structure Prediction

Bioinformatics: Secondary Structure Prediction Bioinformatics: Secondary Structure Prediction Prof. David Jones d.jones@cs.ucl.ac.uk LMLSTQNPALLKRNIIYWNNVALLWEAGSD The greatest unsolved problem in molecular biology:the Protein Folding Problem? Entries

More information

Protein Structure Prediction

Protein Structure Prediction Protein Structure Prediction Michael Feig MMTSB/CTBP 2006 Summer Workshop From Sequence to Structure SEALGDTIVKNA Ab initio Structure Prediction Protocol Amino Acid Sequence Conformational Sampling to

More information

Basics of protein structure

Basics of protein structure Today: 1. Projects a. Requirements: i. Critical review of one paper ii. At least one computational result b. Noon, Dec. 3 rd written report and oral presentation are due; submit via email to bphys101@fas.harvard.edu

More information

1-D Predictions. Prediction of local features: Secondary structure & surface exposure

1-D Predictions. Prediction of local features: Secondary structure & surface exposure 1-D Predictions Prediction of local features: Secondary structure & surface exposure 1 Learning Objectives After today s session you should be able to: Explain the meaning and usage of the following local

More information

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure Bioch/BIMS 503 Lecture 2 Structure and Function of Proteins August 28, 2008 Robert Nakamoto rkn3c@virginia.edu 2-0279 Secondary Structure Φ Ψ angles determine protein structure Φ Ψ angles are restricted

More information

Improved Protein Secondary Structure Prediction

Improved Protein Secondary Structure Prediction Improved Protein Secondary Structure Prediction Secondary Structure Prediction! Given a protein sequence a 1 a 2 a N, secondary structure prediction aims at defining the state of each amino acid ai as

More information

Protein Secondary Structure Assignment and Prediction

Protein Secondary Structure Assignment and Prediction 1 Protein Secondary Structure Assignment and Prediction Defining SS features - Dihedral angles, alpha helix, beta stand (Hydrogen bonds) Assigned manually by crystallographers or Automatic DSSP (Kabsch

More information

Packing of Secondary Structures

Packing of Secondary Structures 7.88 Lecture Notes - 4 7.24/7.88J/5.48J The Protein Folding and Human Disease Professor Gossard Retrieving, Viewing Protein Structures from the Protein Data Base Helix helix packing Packing of Secondary

More information

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Programme 8.00-8.20 Last week s quiz results + Summary 8.20-9.00 Fold recognition 9.00-9.15 Break 9.15-11.20 Exercise: Modelling remote homologues 11.20-11.40 Summary & discussion 11.40-12.00 Quiz 1 Feedback

More information

Neural Networks for Protein Structure Prediction Brown, JMB CS 466 Saurabh Sinha

Neural Networks for Protein Structure Prediction Brown, JMB CS 466 Saurabh Sinha Neural Networks for Protein Structure Prediction Brown, JMB 1999 CS 466 Saurabh Sinha Outline Goal is to predict secondary structure of a protein from its sequence Artificial Neural Network used for this

More information

114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009

114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009 114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009 9 Protein tertiary structure Sources for this chapter, which are all recommended reading: D.W. Mount. Bioinformatics: Sequences and Genome

More information

Bayesian Network Multi-classifiers for Protein Secondary Structure Prediction

Bayesian Network Multi-classifiers for Protein Secondary Structure Prediction Bayesian Network Multi-classifiers for Protein Secondary Structure Prediction Víctor Robles a, Pedro Larrañaga b,josém.peña a, Ernestina Menasalvas a,maría S. Pérez a, Vanessa Herves a and Anita Wasilewska

More information

Bioinformatics: Secondary Structure Prediction

Bioinformatics: Secondary Structure Prediction Bioinformatics: Secondary Structure Prediction Prof. David Jones d.t.jones@ucl.ac.uk Possibly the greatest unsolved problem in molecular biology: The Protein Folding Problem MWMPPRPEEVARK LRRLGFVERMAKG

More information

CAP 5510 Lecture 3 Protein Structures

CAP 5510 Lecture 3 Protein Structures CAP 5510 Lecture 3 Protein Structures Su-Shing Chen Bioinformatics CISE 8/19/2005 Su-Shing Chen, CISE 1 Protein Conformation 8/19/2005 Su-Shing Chen, CISE 2 Protein Conformational Structures Hydrophobicity

More information

Bayesian Network Multi-classifiers for Protein Secondary Structure Prediction

Bayesian Network Multi-classifiers for Protein Secondary Structure Prediction Bayesian Network Multi-classifiers for Protein Secondary Structure Prediction Víctor Robles a, Pedro Larrañaga b,josém.peña a, Ernestina Menasalvas a,maría S. Pérez a, Vanessa Herves a and Anita Wasilewska

More information

Lecture 7. Protein Secondary Structure Prediction. Secondary Structure DSSP. Master Course DNA/Protein Structurefunction.

Lecture 7. Protein Secondary Structure Prediction. Secondary Structure DSSP. Master Course DNA/Protein Structurefunction. C N T R F O R N T G R A T V B O N F O R M A T C S V U Master Course DNA/Protein Structurefunction Analysis and Prediction Lecture 7 Protein Secondary Structure Prediction Protein primary structure 20 amino

More information

SUPPLEMENTARY MATERIALS

SUPPLEMENTARY MATERIALS SUPPLEMENTARY MATERIALS Enhanced Recognition of Transmembrane Protein Domains with Prediction-based Structural Profiles Baoqiang Cao, Aleksey Porollo, Rafal Adamczak, Mark Jarrell and Jaroslaw Meller Contact:

More information

Predicting Secondary Structures of Proteins

Predicting Secondary Structures of Proteins CHALLENGES IN PROTEOMICS BACKGROUND PHOTODISC, FOREGROUND IMAGE: U.S. DEPARTMENT OF ENERGY GENOMICS: GTL PROGRAM, HTTP://WWW.ORNL.GOV.HGMIS BY JACEK BLAŻEWICZ, PETER L. HAMMER, AND PIOTR LUKASIAK Predicting

More information

Supplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution

Supplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution Supplemental Materials for Structural Diversity of Protein Segments Follows a Power-law Distribution Yoshito SAWADA and Shinya HONDA* National Institute of Advanced Industrial Science and Technology (AIST),

More information

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two Supplementary Figure 1. Biopanningg and clone enrichment of Alphabody binders against human IL 23. Positive clones in i phage ELISA with optical density (OD) 3 times higher than background are shown for

More information

Protein Secondary Structure Prediction

Protein Secondary Structure Prediction C E N T R F O R I N T E G R A T I V E B I O I N F O R M A T I C S V U E Master Course DNA/Protein Structurefunction Analysis and Prediction Lecture 7 Protein Secondary Structure Prediction Protein primary

More information

Week 10: Homology Modelling (II) - HHpred

Week 10: Homology Modelling (II) - HHpred Week 10: Homology Modelling (II) - HHpred Course: Tools for Structural Biology Fabian Glaser BKU - Technion 1 2 Identify and align related structures by sequence methods is not an easy task All comparative

More information

CMPS 3110: Bioinformatics. Tertiary Structure Prediction

CMPS 3110: Bioinformatics. Tertiary Structure Prediction CMPS 3110: Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the laws of physics! Conformation space is finite

More information

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction CMPS 6630: Introduction to Computational Biology and Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the

More information

Protein Structure Prediction using String Kernels. Technical Report

Protein Structure Prediction using String Kernels. Technical Report Protein Structure Prediction using String Kernels Technical Report Department of Computer Science and Engineering University of Minnesota 4-192 EECS Building 200 Union Street SE Minneapolis, MN 55455-0159

More information

Two-Stage Multi-Class Support Vector Machines to Protein Secondary Structure Prediction. M.N. Nguyen and J.C. Rajapakse

Two-Stage Multi-Class Support Vector Machines to Protein Secondary Structure Prediction. M.N. Nguyen and J.C. Rajapakse Two-Stage Multi-Class Support Vector Machines to Protein Secondary Structure Prediction M.N. Nguyen and J.C. Rajapakse Pacific Symposium on Biocomputing 10:346-357(2005) TWO-STAGE MULTI-CLASS SUPPORT VECTOR

More information

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche The molecular structure of a protein can be broken down hierarchically. The primary structure of a protein is simply its

More information

Getting To Know Your Protein

Getting To Know Your Protein Getting To Know Your Protein Comparative Protein Analysis: Part III. Protein Structure Prediction and Comparison Robert Latek, PhD Sr. Bioinformatics Scientist Whitehead Institute for Biomedical Research

More information

Protein Structure Prediction

Protein Structure Prediction Page 1 Protein Structure Prediction Russ B. Altman BMI 214 CS 274 Protein Folding is different from structure prediction --Folding is concerned with the process of taking the 3D shape, usually based on

More information

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013 Hydration of protein-rna recognition sites Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India 1 st November, 2013 Central Dogma of life DNA

More information

Procheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics.

Procheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics. Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics Iosif Vaisman Email: ivaisman@gmu.edu ----------------------------------------------------------------- Bond

More information

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination Lecture 9 M230 Feigon Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination Reading resources v Roberts NMR of Macromolecules, Chap 4 by Christina Redfield

More information

Molecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007

Molecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007 Molecular Modeling Prediction of Protein 3D Structure from Sequence Vimalkumar Velayudhan Jain Institute of Vocational and Advanced Studies May 21, 2007 Vimalkumar Velayudhan Molecular Modeling 1/23 Outline

More information

PROTEIN SECONDARY STRUCTURE PREDICTION USING NEURAL NETWORKS AND SUPPORT VECTOR MACHINES

PROTEIN SECONDARY STRUCTURE PREDICTION USING NEURAL NETWORKS AND SUPPORT VECTOR MACHINES PROTEIN SECONDARY STRUCTURE PREDICTION USING NEURAL NETWORKS AND SUPPORT VECTOR MACHINES by Lipontseng Cecilia Tsilo A thesis submitted to Rhodes University in partial fulfillment of the requirements for

More information

Protein Secondary Structure Prediction

Protein Secondary Structure Prediction part of Bioinformatik von RNA- und Proteinstrukturen Computational EvoDevo University Leipzig Leipzig, SS 2011 the goal is the prediction of the secondary structure conformation which is local each amino

More information

Details of Protein Structure

Details of Protein Structure Details of Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Anne Mølgaard, Kemisk Institut, Københavns Universitet Learning Objectives

More information

Sequence Alignments. Dynamic programming approaches, scoring, and significance. Lucy Skrabanek ICB, WMC January 31, 2013

Sequence Alignments. Dynamic programming approaches, scoring, and significance. Lucy Skrabanek ICB, WMC January 31, 2013 Sequence Alignments Dynamic programming approaches, scoring, and significance Lucy Skrabanek ICB, WMC January 31, 213 Sequence alignment Compare two (or more) sequences to: Find regions of conservation

More information

Major Types of Association of Proteins with Cell Membranes. From Alberts et al

Major Types of Association of Proteins with Cell Membranes. From Alberts et al Major Types of Association of Proteins with Cell Membranes From Alberts et al Proteins Are Polymers of Amino Acids Peptide Bond Formation Amino Acid central carbon atom to which are attached amino group

More information

Steps in protein modelling. Structure prediction, fold recognition and homology modelling. Basic principles of protein structure

Steps in protein modelling. Structure prediction, fold recognition and homology modelling. Basic principles of protein structure Structure prediction, fold recognition and homology modelling Marjolein Thunnissen Lund September 2012 Steps in protein modelling 3-D structure known Comparative Modelling Sequence of interest Similarity

More information

Conditional Graphical Models

Conditional Graphical Models PhD Thesis Proposal Conditional Graphical Models for Protein Structure Prediction Yan Liu Language Technologies Institute University Thesis Committee Jaime Carbonell (Chair) John Lafferty Eric P. Xing

More information

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27 Acta Cryst. (2014). D70, doi:10.1107/s1399004714021695 Supporting information Volume 70 (2014) Supporting information for article: Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase

More information

Structural Alignment of Proteins

Structural Alignment of Proteins Goal Align protein structures Structural Alignment of Proteins 1 2 3 4 5 6 7 8 9 10 11 12 13 14 PHE ASP ILE CYS ARG LEU PRO GLY SER ALA GLU ALA VAL CYS PHE ASN VAL CYS ARG THR PRO --- --- --- GLU ALA ILE

More information

Sequence analysis and comparison

Sequence analysis and comparison The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species

More information

Optimization of the Sliding Window Size for Protein Structure Prediction

Optimization of the Sliding Window Size for Protein Structure Prediction Optimization of the Sliding Window Size for Protein Structure Prediction Ke Chen* 1, Lukasz Kurgan 1 and Jishou Ruan 2 1 University of Alberta, Department of Electrical and Computer Engineering, Edmonton,

More information

Protein structure alignments

Protein structure alignments Protein structure alignments Proteins that fold in the same way, i.e. have the same fold are often homologs. Structure evolves slower than sequence Sequence is less conserved than structure If BLAST gives

More information

What makes a good graphene-binding peptide? Adsorption of amino acids and peptides at aqueous graphene interfaces: Electronic Supplementary

What makes a good graphene-binding peptide? Adsorption of amino acids and peptides at aqueous graphene interfaces: Electronic Supplementary Electronic Supplementary Material (ESI) for Journal of Materials Chemistry B. This journal is The Royal Society of Chemistry 21 What makes a good graphene-binding peptide? Adsorption of amino acids and

More information

Presentation Outline. Prediction of Protein Secondary Structure using Neural Networks at Better than 70% Accuracy

Presentation Outline. Prediction of Protein Secondary Structure using Neural Networks at Better than 70% Accuracy Prediction of Protein Secondary Structure using Neural Networks at Better than 70% Accuracy Burkhard Rost and Chris Sander By Kalyan C. Gopavarapu 1 Presentation Outline Major Terminology Problem Method

More information

IT og Sundhed 2010/11

IT og Sundhed 2010/11 IT og Sundhed 2010/11 Sequence based predictors. Secondary structure and surface accessibility Bent Petersen 13 January 2011 1 NetSurfP Real Value Solvent Accessibility predictions with amino acid associated

More information

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine Supplementary figure 1. Comparison of unbound and as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine GM-CSF (slate) with bound GM-CSF in the GIF:GM-CSF complex (GIF: green,

More information

Research Article Extracting Physicochemical Features to Predict Protein Secondary Structure

Research Article Extracting Physicochemical Features to Predict Protein Secondary Structure The Scientific World Journal Volume 2013, Article ID 347106, 8 pages http://dx.doi.org/10.1155/2013/347106 Research Article Extracting Physicochemical Features to Predict Protein Secondary Structure Yin-Fu

More information

Resonance assignments in proteins. Christina Redfield

Resonance assignments in proteins. Christina Redfield Resonance assignments in proteins Christina Redfield 1. Introduction The assignment of resonances in the complex NMR spectrum of a protein is the first step in any study of protein structure, function

More information

7 Protein secondary structure

7 Protein secondary structure 78 Grundlagen der Bioinformatik, SS 1, D. Huson, June 17, 21 7 Protein secondary structure Sources for this chapter, which are all recommended reading: Introduction to Protein Structure, Branden & Tooze,

More information

ALL LECTURES IN SB Introduction

ALL LECTURES IN SB Introduction 1. Introduction 2. Molecular Architecture I 3. Molecular Architecture II 4. Molecular Simulation I 5. Molecular Simulation II 6. Bioinformatics I 7. Bioinformatics II 8. Prediction I 9. Prediction II ALL

More information

Properties of amino acids in proteins

Properties of amino acids in proteins Properties of amino acids in proteins one of the primary roles of DNA (but not the only one!) is to code for proteins A typical bacterium builds thousands types of proteins, all from ~20 amino acids repeated

More information

HIV protease inhibitor. Certain level of function can be found without structure. But a structure is a key to understand the detailed mechanism.

HIV protease inhibitor. Certain level of function can be found without structure. But a structure is a key to understand the detailed mechanism. Proteins are linear polypeptide chains (one or more) Building blocks: 20 types of amino acids. Range from a few 10s-1000s They fold into varying three-dimensional shapes structure medicine Certain level

More information

8 Protein secondary structure

8 Protein secondary structure Grundlagen der Bioinformatik, SoSe 11, D. Huson, June 6, 211 13 8 Protein secondary structure Sources for this chapter, which are all recommended reading: Introduction to Protein Structure, Branden & Tooze,

More information

Central Dogma. modifications genome transcriptome proteome

Central Dogma. modifications genome transcriptome proteome entral Dogma DA ma protein post-translational modifications genome transcriptome proteome 83 ierarchy of Protein Structure 20 Amino Acids There are 20 n possible sequences for a protein of n residues!

More information

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version Document Published by the wwpdb

Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description. Version Document Published by the wwpdb Protein Data Bank Contents Guide: Atomic Coordinate Entry Format Description Version 3.30 Document Published by the wwpdb This format complies with the PDB Exchange Dictionary (PDBx) http://mmcif.pdb.org/dictionaries/mmcif_pdbx.dic/index/index.html.

More information

Bioinformatics Practical for Biochemists

Bioinformatics Practical for Biochemists Bioinformatics Practical for Biochemists Andrei Lupas, Birte Höcker, Steffen Schmidt WS 2013/14 03. Sequence Features Targeting proteins signal peptide targets proteins to the secretory pathway N-terminal

More information

Protein 8-class Secondary Structure Prediction Using Conditional Neural Fields

Protein 8-class Secondary Structure Prediction Using Conditional Neural Fields 2010 IEEE International Conference on Bioinformatics and Biomedicine Protein 8-class Secondary Structure Prediction Using Conditional Neural Fields Zhiyong Wang, Feng Zhao, Jian Peng, Jinbo Xu* Toyota

More information

Correlations of Amino Acids with Secondary Structure Types: Connection with Amino Acid Structure

Correlations of Amino Acids with Secondary Structure Types: Connection with Amino Acid Structure Correlations of Amino Acids with Secondary Structure Types: Connection with Amino Acid Structure Saša Malkov, Miodrag V. Živković, Miloš V. Beljanski, Snežana D. Zarić * Department of Mathematic University

More information

Profiles and Majority Voting-Based Ensemble Method for Protein Secondary Structure Prediction

Profiles and Majority Voting-Based Ensemble Method for Protein Secondary Structure Prediction Evolutionary Bioinformatics Original Research Open Access Full open access to this and thousands of other papers at http://www.la-press.com. Profiles and Majority Voting-Based Ensemble Method for Protein

More information

Computer simulations of protein folding with a small number of distance restraints

Computer simulations of protein folding with a small number of distance restraints Vol. 49 No. 3/2002 683 692 QUARTERLY Computer simulations of protein folding with a small number of distance restraints Andrzej Sikorski 1, Andrzej Kolinski 1,2 and Jeffrey Skolnick 2 1 Department of Chemistry,

More information

Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program)

Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program) Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program) Course Name: Structural Bioinformatics Course Description: Instructor: This course introduces fundamental concepts and methods for structural

More information

Viewing and Analyzing Proteins, Ligands and their Complexes 2

Viewing and Analyzing Proteins, Ligands and their Complexes 2 2 Viewing and Analyzing Proteins, Ligands and their Complexes 2 Overview Viewing the accessible surface Analyzing the properties of proteins containing thousands of atoms is best accomplished by representing

More information

Building 3D models of proteins

Building 3D models of proteins Building 3D models of proteins Why make a structural model for your protein? The structure can provide clues to the function through structural similarity with other proteins With a structure it is easier

More information

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1 Supporting information to: Time-resolved observation of protein allosteric communication Sebastian Buchenberg, Florian Sittel and Gerhard Stock Biomolecular Dynamics, Institute of Physics, Albert Ludwigs

More information

HMM applications. Applications of HMMs. Gene finding with HMMs. Using the gene finder

HMM applications. Applications of HMMs. Gene finding with HMMs. Using the gene finder HMM applications Applications of HMMs Gene finding Pairwise alignment (pair HMMs) Characterizing protein families (profile HMMs) Predicting membrane proteins, and membrane protein topology Gene finding

More information

Improving Protein Secondary-Structure Prediction by Predicting Ends of Secondary-Structure Segments

Improving Protein Secondary-Structure Prediction by Predicting Ends of Secondary-Structure Segments Improving Protein Secondary-Structure Prediction by Predicting Ends of Secondary-Structure Segments Uros Midic 1 A. Keith Dunker 2 Zoran Obradovic 1* 1 Center for Information Science and Technology Temple

More information

Template-Based 3D Structure Prediction

Template-Based 3D Structure Prediction Template-Based 3D Structure Prediction Sequence and Structure-based Template Detection and Alignment Issues The rate of new sequences is growing exponentially relative to the rate of protein structures

More information

12 Protein secondary structure

12 Protein secondary structure Grundlagen der Bioinformatik, SoSe 14, D. Huson, July 2, 214 147 12 Protein secondary structure Sources for this chapter, which are all recommended reading: Introduction to Protein Structure, Branden &

More information

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748 CAP 5510: Introduction to Bioinformatics Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinfs07.html 2/15/07 CAP5510 1 EM Algorithm Goal: Find θ, Z that maximize Pr

More information

Protein Structure Bioinformatics Introduction

Protein Structure Bioinformatics Introduction 1 Swiss Institute of Bioinformatics Protein Structure Bioinformatics Introduction Basel, 27. September 2004 Torsten Schwede Biozentrum - Universität Basel Swiss Institute of Bioinformatics Klingelbergstr

More information

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison CMPS 6630: Introduction to Computational Biology and Bioinformatics Structure Comparison Protein Structure Comparison Motivation Understand sequence and structure variability Understand Domain architecture

More information

CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Giri Narasimhan

CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Giri Narasimhan CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinff18.html Proteins and Protein Structure

More information

Proteins: Characteristics and Properties of Amino Acids

Proteins: Characteristics and Properties of Amino Acids SBI4U:Biochemistry Macromolecules Eachaminoacidhasatleastoneamineandoneacidfunctionalgroupasthe nameimplies.thedifferentpropertiesresultfromvariationsinthestructuresof differentrgroups.thergroupisoftenreferredtoastheaminoacidsidechain.

More information

Supplementary Information. Broad Spectrum Anti-Influenza Agents by Inhibiting Self- Association of Matrix Protein 1

Supplementary Information. Broad Spectrum Anti-Influenza Agents by Inhibiting Self- Association of Matrix Protein 1 Supplementary Information Broad Spectrum Anti-Influenza Agents by Inhibiting Self- Association of Matrix Protein 1 Philip D. Mosier 1, Meng-Jung Chiang 2, Zhengshi Lin 2, Yamei Gao 2, Bashayer Althufairi

More information

Identification of Representative Protein Sequence and Secondary Structure Prediction Using SVM Approach

Identification of Representative Protein Sequence and Secondary Structure Prediction Using SVM Approach Identification of Representative Protein Sequence and Secondary Structure Prediction Using SVM Approach Prof. Dr. M. A. Mottalib, Md. Rahat Hossain Department of Computer Science and Information Technology

More information

Peptides And Proteins

Peptides And Proteins Kevin Burgess, May 3, 2017 1 Peptides And Proteins from chapter(s) in the recommended text A. Introduction B. omenclature And Conventions by amide bonds. on the left, right. 2 -terminal C-terminal triglycine

More information

SCOP. all-β class. all-α class, 3 different folds. T4 endonuclease V. 4-helical cytokines. Globin-like

SCOP. all-β class. all-α class, 3 different folds. T4 endonuclease V. 4-helical cytokines. Globin-like SCOP all-β class 4-helical cytokines T4 endonuclease V all-α class, 3 different folds Globin-like TIM-barrel fold α/β class Profilin-like fold α+β class http://scop.mrc-lmb.cam.ac.uk/scop CATH Class, Architecture,

More information

Protein Struktur (optional, flexible)

Protein Struktur (optional, flexible) Protein Struktur (optional, flexible) 22/10/2009 [ 1 ] Andrew Torda, Wintersemester 2009 / 2010, AST nur für Informatiker, Mathematiker,.. 26 kt, 3 ov 2009 Proteins - who cares? 22/10/2009 [ 2 ] Most important

More information

Model Mélange. Physical Models of Peptides and Proteins

Model Mélange. Physical Models of Peptides and Proteins Model Mélange Physical Models of Peptides and Proteins In the Model Mélange activity, you will visit four different stations each featuring a variety of different physical models of peptides or proteins.

More information

Introduction. System and methods ORIGINAL PAPER

Introduction. System and methods ORIGINAL PAPER J Mol Model (2001) 7:360 369 DOI 10.1007/s008940100038 ORIGINAL PAPER Jens Meiler Michael Müller Anita Zeidler Felix Schmäschke Generation and evaluation of dimension-reduced amino acid parameter representations

More information

Protein Structure Prediction

Protein Structure Prediction Protein Structure Prediction Michael Feig MMTSB/CTBP 2009 Summer Workshop From Sequence to Structure SEALGDTIVKNA Folding with All-Atom Models AAQAAAAQAAAAQAA All-atom MD in general not succesful for real

More information

Protein Secondary Structure Prediction using Pattern Recognition Neural Network

Protein Secondary Structure Prediction using Pattern Recognition Neural Network Protein Secondary Structure Prediction using Pattern Recognition Neural Network P.V. Nageswara Rao 1 (nagesh@gitam.edu), T. Uma Devi 1, DSVGK Kaladhar 1, G.R. Sridhar 2, Allam Appa Rao 3 1 GITAM University,

More information

3D Structure. Prediction & Assessment Pt. 2. David Wishart 3-41 Athabasca Hall

3D Structure. Prediction & Assessment Pt. 2. David Wishart 3-41 Athabasca Hall 3D Structure Prediction & Assessment Pt. 2 David Wishart 3-41 Athabasca Hall david.wishart@ualberta.ca Objectives Become familiar with methods and algorithms for secondary Structure Prediction Become familiar

More information

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell

Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell Using Higher Calculus to Study Biologically Important Molecules Julie C. Mitchell Mathematics and Biochemistry University of Wisconsin - Madison 0 There Are Many Kinds Of Proteins The word protein comes

More information

Figure 1. Molecules geometries of 5021 and Each neutral group in CHARMM topology was grouped in dash circle.

Figure 1. Molecules geometries of 5021 and Each neutral group in CHARMM topology was grouped in dash circle. Project I Chemistry 8021, Spring 2005/2/23 This document was turned in by a student as a homework paper. 1. Methods First, the cartesian coordinates of 5021 and 8021 molecules (Fig. 1) are generated, in

More information

Similarity or Identity? When are molecules similar?

Similarity or Identity? When are molecules similar? Similarity or Identity? When are molecules similar? Mapping Identity A -> A T -> T G -> G C -> C or Leu -> Leu Pro -> Pro Arg -> Arg Phe -> Phe etc If we map similarity using identity, how similar are

More information