Computational Molecular Biology. Protein Structure and Homology Modeling

Size: px
Start display at page:

Download "Computational Molecular Biology. Protein Structure and Homology Modeling"

Transcription

1 Computational Molecular Biology Protein Structure and Homology Modeling Prof. Alejandro Giorge1 Dr. Francesco Musiani

2 Sequence, function and structure relationships v Life is the ability to metabolize nutrients, respond to external stimuli, grow, reproduce and evolve v From a chemical point of view, proteins are linear hetero-polymers formed by amino acids (aa)

3 Sequence, function and structure relationships v Life is the ability to metabolize nutrients, respond to external stimuli, grow, reproduce and evolve v From a chemical point of view, proteins are linear hetero-polymers formed by amino acids (aa) v Proteins assume a 3D shape which is usually responsible for function v The consequence of the tight link between structure, function and evolutionary pressure distinguish proteins from ordinary polymers

4 Protein structure v v v v v The sequence of amino acids is called the primary structure Secondary structure refers to local folding Tertiary structure is the arrangement of secondary elements in 3D Quaternary structure describes the arrangement of a protein subunits The peptide bond is planar and the dihedral angle it defines is almost always 18

5 Protein structure v What is a dihedral angle? Is the angle between two planes. In practice, if you have four connected atoms and you want measure the dihedral angle around the central bond, you orient the system in such a way that the two central atoms are superimposed and measure the resulting angle between the first and last atom.

6 Protein structure v What is a dihedral angle? Is the angle between two planes. In practice, if you have four connected atoms and you want measure the dihedral angle around the central bond, you orient the system in such a way that the two central atoms are superimposed and measure the resulting angle between the first and last atom.

7 Protein structure v What is a dihedral angle? Is the angle between two planes. In practice, if you have four connected atoms and you want measure the dihedral angle around the central bond, you orient the system in such a way that the two central atoms are superimposed and measure the resulting angle between the first and last atom.

8 Protein structure v The simplest arrangements of aa is the alpha-helix, a right handed spiral conformation. v The structure repeats itself every 5.4 Å along the helix axis. v There are 3.6 aa per turn. O (n) - NH (n+4) H- bond

9 Protein structure v The beta sheet. v The R groups of neighboring residues in strand point in opposite directions. v There are parallel or anti-parallel beta sheets.

10 Protein structure Ramchandan plot: pairs of angles that do not cause the atoms of a dipeptide to collide.

11 Protein structure Ramchandan plot: pairs of angles that do not cause the atoms of a dipeptide to collide.

12 Protein structure

13 Protein structure Right- handed α- helix

14 Protein structure Parallel β- sheet Right- handed α- helix

15 Protein structure An<- parallel β- sheet Parallel β- sheet Right- handed α- helix

16 Protein structure An<- parallel β- sheet Le?- handed α- helix Parallel β- sheet Right- handed α- helix

17 Protein structure An<- parallel β- sheet Parallel β- sheet Collagen triple helix Le?- handed α- helix Right- handed α- helix

18 Protein structure Loops: regions without repetitive structure that connects secondary structure elements.

19 Protein structure Supersecondary elements (motifs): arrangements of two or three consecutive secondary structure that are present in many different protein structures, even with completely different sequences.

20 Protein structure Domains: portion of the polypeptide chain that folds into a compact semi-independent unit. v Class (C) Derived from secondary structure content is assigned automa<cally v Architecture (A) Describes the gross orienta<on of secondary structures, independent of connec<vity. v Topology (T) Clusters structures according to their topological connec<ons and numbers of secondary structures v Homologous superfamily (H)

21 Ala: transient interac<ons Thr, Ser: phosphoryla<on target: protein kinases anack phosphate group to the side- chain. Gly: unusual ramachandran, o?en found in turns Thr: Beta- branched more o?en found in beta- sheets. Cys: Very reac<ve, coordinate metals.

22 The problem of protein folding What is protein fold: v Compact, globular folding arrangement of the polypeptide chain v Chain folds to optimize packing of the hydrophobic residues in the interior core of the protein Thermodynamics: ΔG = ΔH TΔS

23 The problem of protein folding What is protein fold: v Compact, globular folding arrangement of the polypeptide chain v Chain folds to optimize packing of the hydrophobic residues in the interior core of the protein Thermodynamics: ΔG = ΔH TΔS (i.e. stability of a given conformation)

24 The problem of protein folding What is protein fold: v Compact, globular folding arrangement of the polypeptide chain v Chain folds to optimize packing of the hydrophobic residues in the interior core of the protein Thermodynamics: ΔG = ΔH TΔS (i.e. stability of a given conformation) Enthalpy: electrostatics, dispersion, van der Waals, H-bonds. Entropy: water molecules form ordered cages around hydrophobic amino acids. The protein folding process breaks this order. The free energy of folding of a protein is of the order of few kcal/mol

25 The problem of protein folding v Anfinsen s dogma: (at least for small globular proteins) the native structure is determined only by the protein's amino acid sequence

26 The problem of protein folding v Anfinsen s dogma: (at least for small globular proteins) the native structure is determined only by the protein's amino acid sequence v Levinthal paradox: because of the very large number of degrees of freedom in an unfolded polypeptide chain, the molecule has an astronomical number of possible conformations

27 The problem of protein folding v Anfinsen s dogma: (at least for small globular proteins) the native structure is determined only by the protein's amino acid sequence v Levinthal paradox: because of the very large number of degrees of freedom in an unfolded polypeptide chain, the molecule has an astronomical number of possible conformations v Funnel theory: every protein has a specific folding pathway

28 The problem of protein folding - + ΔH Internal interactions TΔS Conformational entropy TΔS Hydrophobic effects Result: ΔG Folding

29 The problem of protein folding - + ΔH Internal interactions TΔS Conformational entropy TΔS Hydrophobic effects Result: ΔG Folding

30 The problem of protein folding - + ΔH Internal interactions TΔS Conformational entropy TΔS Hydrophobic effects Result: ΔG Folding

31 Evolution of protein structure v What if a base-substitution event occurs in a protein-coding DNA region? A. The fine balance between the gain and loss of free energy of folding is compromised: no single energy minimun NOT FOLD

32 Evolution of protein structure v What if a base-substitution event occurs in a protein-coding DNA region? A. The fine balance between the gain and loss of free energy of folding is compromised: no single energy minimun NOT FOLD B. The energy landscape of the protein change, but there is a global minimum of energy same or similar function (i.e. local perturbations without affecting the general shape or topology) FOLD

33 The comparative modeling principle

34 Evolutionary-based methods for protein structure prediction v Proteins evolved from a common ancestor maintain similar core 3D structures We can use protein of known structure (templates) to model protein of unknown 3D structure (targets) by starting from the sequence This can be done if the templates and the target are evolutionarily correlated

35 Evolutionary-based methods for protein structure prediction v Proteins evolved from a common ancestor maintain similar core 3D structures We can use protein of known structure (templates) to model protein of unknown 3D structure (targets) by starting from the sequence This can be done if the templates and the target are evolutionarily correlated

36 Evolutionary-based methods for protein structure prediction v Proteins evolved from a common ancestor maintain similar core 3D structures We can use protein of known structure (templates) to model protein of unknown 3D structure (targets) by starting from the sequence This can be done if the templates and the target are evolutionarily correlated

37 Why Protein Structure Prediction? We have an experimentally determined atomic structure for only ~1% of the known protein sequences

38 Why Protein Structure Prediction? Growth in the number of unique folds per year in the PDB based on the SCOP data base from 1986 to 27

39 Why? v We can use homology modeling to predict the structure of proteins of unknown structure but also

40 Why? v We can use homology modeling to predict the structure of proteins of unknown structure but also To reconstruct some missing part in an incomplete protein structure (common in low resolution structures or for large mobile loops)

41 Why? v We can use homology modeling to predict the structure of proteins of unknown structure but also To reconstruct some missing part in an incomplete protein structure (common in low resolution structures or for large mobile loops) To calculate a mutant of a known protein structure To calculate the mean structure of an NMR ensamble

42 Homology modeling flowchart Query sequence

43 Homology modeling flowchart Query sequence Search for suitable template(s) Sequence databases

44 Homology modeling flowchart Query sequence Search for suitable template(s) Sequence databases Align sequence with template(s)

45 Homology modeling flowchart Query sequence Search for suitable template(s) Sequence databases Align sequence with template(s) Template PDB structure(s)

46 Homology modeling flowchart Query sequence Search for suitable template(s) Sequence databases Align sequence with template(s) Template PDB structure(s) Calculate model(s)

47 Homology modeling flowchart Query sequence Search for suitable template(s) Sequence databases Align sequence with template(s) Template PDB structure(s) Calculate model(s) Assess results Refinement (loops)

48 Homology modeling flowchart Query sequence Search for suitable template(s) Sequence databases Align sequence with template(s) Template PDB structure(s) Calculate model(s) Model(s) Assess results Refinement (loops)

49 Homology modeling flowchart Query sequence Search for suitable template(s) Sequence databases Possible errors Align sequence with template(s) Template PDB structure(s) Calculate model(s) Model(s) Assess results Refinement (loops)

50 Homology modeling flowchart hnp://salilab.org/modeller/

51 How does it works?

52 How does it works?

53 1. Align sequence with structures v First, must determine the template structures Simplistically, try to align the target sequence against every known structure s sequence. In practice, this is too slow, so heuristics are used (e.g. BLAST) Profile or HMM searches are generally more sensitive in difficult cases (Modeller s profile.build method, PSI-BLAST or HHpred) Could also use threading or other web servers v Remember to look at:

54 1. Align sequence with structures v First, must determine the template structures Simplistically, try to align the target sequence against every known structure s sequence. In practice, this is too slow, so heuristics are used (e.g. BLAST) Profile or HMM searches are generally more sensitive in difficult cases (Modeller s profile.build method, PSI-BLAST or HHpred) Could also use threading or other web servers v Remember to look at: Sequence identity/similarity between the putative template(s) and the target Experimental method, resolution and completeness of the template(s) Other compounds bound to the template(s) Oligomerization state

55 1. Align sequence with structures v Alignment to templates Sequence-sequence: relies purely on a matrix of observed residue-residue mutation probabilities ( align ) Sequence-structure: gap insertion is penalized within secondary structure (helices etc.) ( align2d ) Other features, profile-profile, and/or user-defined ( salign ) or use an external program v Remember: An error in the alignment is always a fatal error for the whole modeling procedure! One amino acid sequence plays coy; a pair of homologous sequences whisper; many aligned sequences shout out loud (A.M. Lesk, Introduction to Bioinformatics, 22)

56 1. Align sequence with structures v Evaluation of sequence alignment quality E. Krieger, S.B. Nabuurs, G. Vriend: Homology modeling. In Structural Bioinforma<cs. P.E. Bourne and H. Weissig Eds. (23).

57 2. Extract spatial restraints v Spatial restraints incorporate homology information, statistical preferences, and physical knowledge Template Cα- Cα internal distances Backbone dihedrals (φ/ψ) Sidechain dihedrals given residue type of both target and template Force field stereochemistry (bond, angle, dihedral) Statistical potentials Other experimental constraints Etc.

58 3. Satisfy spatial restraints v Satisfaction of spatial restraints Represent system at appropriate level(s) of resolution (e.g. atoms, residues, domains, proteins) Convert each data source into spatial restraints (e.g. harmonic distance simulates using spring ) Sum all restraints into a scoring function Generate models that are consistent with all restraints by optimizing the scoring function (e.g. conjugate gradients, molecular dynamics, Monte Carlo)

59 3. Satisfy spatial restraints v All information is combined into a single objective function Force field (CHARMM 22) simply added in Function is optimized by conjugate gradients and simulated annealing molecular dynamics, starting from the target sequence threaded onto template structure(s) Multiple models are generally recommended best model or cluster or models chosen by simply taking the lowest objective function score, or using a model assessment method such as Modeller s own DOPE or GA341, or external programs such as PROSA or DFIRE

60 4. Assess results

61 4. Assess results v How do we know if the model is a good one? Check log file for restraint violations and Modeller score (molpdf) (not reliable since the scoring function is not perfect!) Use another assessment score on the final model Ø Statistical Potential: GA341, DOPE, QMEAN Ø Other programs (e.g. Prosa, Verify3D..) Use structure assessment programs (e.g. ProCheck) Fit the model to some other experimental data not used in the modeling procedure

62 Typical assessments DOPE profile

63 Typical assessments DOPE profile Ramachandran plot (ProCheck)

64 Typical assessments DOPE profile Ramachandran plot (ProCheck) PROSA profile

65 Structural alignment Root-mean square deviation (RMSD) 2 Structural alignment of thioredoxins from humans (red) and the fly Drosophila melanogaster (yellow) Where x i and x j are the coordinate vectors of the structure i and j, respectively, and N is the number of atoms of the two strucures

66 Typical errors in comparative models

67 Model Accuracy as a Function of Target-Template Sequence Identity

68 Model accuracy

69 Applications of protein structure models Topology recogni<on Famili assignment Overall fold Mutagenesis design Func<onal rela<onship Drug design Virtual screening Docking Binding site detec<on

70 Model refining v Loop optimization Often, there are parts of the sequence which have no detectable templates Mini folding problem these loops must be sampled to get improved conformations Database searches only complete for 4-6 residue loops Modeller uses conformational search with a custom energy function optimized for loop modeling (statistical potential derived from PDB) Ø Fiser/Melo protocol ( loopmodel ) Ø Newer DOPE + GB/SA protocol ( dope_loopmodel )

71 Model refining v Accuracy of loop models as a function of amount of optimization

72 Model refining v Fraction of loops modeled with medium accuracy (<2Å)

73 Advanced topics v Modeller can also Perform more sensitive searches for templates (sequence-profile, profile-profile, similar to PSI-BLAST) Incorporate ligands, RNA/DNA and water molecules into built models Build structures of multi-chain proteins (homo or hetero) Add extra restraints to the modeling process (such as known distances, e.g. from FRET) Use multiple templates to build a model v Remember:

74 Advanced topics v Modeller can also Perform more sensitive searches for templates (sequence-profile, profile-profile, similar to PSI-BLAST) Incorporate ligands, RNA/DNA and water molecules into built models Build structures of multi-chain proteins (homo or hetero) Add extra restraints to the modeling process (such as known distances, e.g. from FRET) Use multiple templates to build a model v Remember: You don t have to use Modeller for template search, alignment, assessment or refinement. If you know your template (e.g. from BLAST) just format the alignment for Modeller and skip straight to the model building step!

75 Hidden Markov Models A dishonest croupier could use a dice that has a higher probability of landing on a 6, (e.g., 5%). To avoid being caught, the croupier can switch from a fair die to a loaded die with a certain frequency. For example, he can change the die from fair to loaded after 2 rolls and from loaded to fair after 1 rolls. v Likelihood evaluation Given a series of emissions X1, X2, X3 Which is the probability that our model had emitted the observed sequence? v Alignment. Given the sequence of observed emissions: which is the sequence of hidden states that generated it? v Training: How can we optimize the statistical parameters in order to maximize probabilities 1 and 2? 42

76 Hidden Markov Models: Protein Structural Bioinformatics In structure prediction, models can best be thought of as sequence generators (e.g., Hidden Markov Models) or sequence classifiers (e.g., Neural Networks) v Likelihood evaluation Performed using dynamic programming algorithms (similar to the ones used in sequence alignments) v Alignment Thus, given a model and a sequence we want to determine the probability of any specific (query) sequence having been generated by the model in any of each possible paths. v Training The model is trained by aligning protein families. 43

77 Hidden Markov Models v Described by ü A set of possible states: match, insert, deletion. ü A set of possible observations: frequencies of aa in each position. ü A transition probability matrix ü An emission probability matrix (frequencies of aa occurring in a particular state). ü Initial state probabilities. 44

78 Sequence profiles are a condensed representation of alignments master sequence HBA_human W G K V G A - - H A G E HBB_human W G K V N V D E MYG_phyca W G K V E A - - D V A G LGB2_luplu W K D F N A - - N I P K GLB1_glydi W E E I A G A D N G A G Each column of the profile p j (a) contains the amino acid frequencies in the multiple sequence alignment A C D E F G H I K L M N P Q R S T V W Y

79 HMM include position specific gap penalties Deletions Insertions Probabilities for Insert Open Insert Extend Delete Open Delete Extend M/D M/D M/D I I M/D M/D M/D M/D M/D HBA_human V G A.. H A G E Y HBB_human V N V D E V MYG_phyca V E A.. D V A G H LGB2_luplu F N A.. N I P K H GLB1_glydi I A G a d N G A G V A C D E F G H I K L M N P W Y M I I I M D D D Match or Delete

80 Profile HMM can be represented as states connected by transitions Probability that a sequence is emitted by an HMM rather than by a random model? M/D M/D M/D I I M/D M/D M/D M/D M/D HBA_human V G A.. H A G E Y HBB_human V N V D E V MYG_phyca V E A.. D V A G H LGB2_luplu F N A.. N I P K H GLB1_glydi I A G a d N G - G V I I I I I I I I HMM p M M M M M M M M D D D D D D D D The probability for emitting the sequence x1,.., xl along the path through an HMM is: P(x1,..., x1 emission on path). This probability is a product of the amino acid emission probabilities for each state on the path and the transition probabilities between states.

81 Profile HMM can be represented as states connected by transitions M/D M/D M/D I I M/D M/D M/D M/D M/D HBA_human V G A.. H A G E Y HBB_human V N V D E V MYG_phyca V E A.. D V A G H LGB2_luplu F N A.. N I P K H GLB1_glydi I A G a d N G - G V HMM p I M I M I M I I I I I M M M M M Matrix: p i (a) p i (X Y) A C W Y M I I I M D D D D D D D D D D D

82 Profile HMM can be represented as states connected by transitions M/D M/D M/D I I M/D M/D M/D M/D M/D HBA_human V G A.. H A G E Y HBB_human V N V D E V MYG_phyca V E A.. D V A G H LGB2_luplu F N A.. N I P K H GLB1_glydi I A G a d N G - G V HMM p I M I M I M I I I I I M M M M M Matrix: p i (a) p i (X Y) A C W Y M I I I M D D D D D D D D D D D

83 Profile HMM can be represented as states connected by transitions M/D M/D M/D I I M/D M/D M/D M/D M/D HBA_human V G A.. H A G E Y HBB_human V N V D E V MYG_phyca V E A.. D V A G H LGB2_luplu F N A.. N I P K H GLB1_glydi I A G a d N G - G V HMM p I M I M I M I I I I I M M M M M Matrix: p i (a) p i (X Y) A C W Y M I I I M D D D D D D D D D D D

84 Profile HMM can be represented as states connected by transitions M/D M/D M/D I I M/D M/D M/D M/D M/D HBA_human V G A.. H A G E Y HBB_human V N V D E V MYG_phyca V E A.. D V A G H LGB2_luplu F N A.. N I P K H GLB1_glydi I A G a d N G - G V HMM p I M I M I M I I I I I M M M M M Matrix: p i (a) p i (X Y) A C W Y M I I I M D D D D D D D D D D D

85 State q State p M D I M D I M D I M D I M D I M D I M D I HMM q M M M M M I M M M M D M M M D I M D I M D I M D I M D I HMM p x 1 x 2 x 3 x 4 x 5 x 6 Söding, J. (25) Bioinformatics 21, Include Null model maximize log-sum-of-odds score Co-emitted sequence Find path through two HMM that maximizes co-emission probability

86 Excercise v Target: human thioesterase 8 : interacts with HIV-1 Nef protein. v Procedure: Search for templates using HHpred Prepare Modeller input files Build the models Evaluate the model structure v Materials and Methods: UniProt Modeller ( Modeller manual ProCheck web server ( databases/pdbsum/generate.html) Prosa web server ( prosa.php)

87 Fold recognition Principle: Find a compatible fold >Target Sequence XY MSTLYEKLGGTTAVDLAVAAVA GAPAHKRDVLNQ Profile method For each aa we can calculate the frequency in Secondary elements Surface of the protein Hydrophobic environment Each aa is substituted by a letter (property) From the structure we can analyze positions in terms of: Build model of target protein based on each template structure Rank models according to SCORE or ENERGY - Presence in secondary structure element - Percentage of solvent exposition - Hydrophobic or polar environment? Thus each structure is converted into property sequencesnot aa PDB becomes a property sequence DB. Thus we have to just align property sequences

88 Fold recognition v Threading methods Ø Statistical Potentials Ø Programs: Threader, mgenthreader. M A T E A F T Q S G Several approximations: Frozen approximation used for accelerate calculations In the past used for remote homology assessment Now used in automatic projects for the structural prediction of the entire human genome.

89 Fold recognition v Threading methods Ø Statistical Potentials Ø Programs: Threader, mgenthreader. M A T E A F T Q S G Several approximations: Frozen approximation used for accelerate calculations In the past used for remote homology assessment Now used in automatic projects for the structural prediction of the entire human genome.

90 New folds

91 New folds v Ab inito modeling or de novo prediction Ø Folding by statistical approaches: very coarse-grained Ø Force Fields Ø Fragment Assemblies. Ø Ø Structure with common structural motifs or supersecondary structures The relationship between local sequence and local structure is highly degenerated Ø Programs: Fragfold and Rosetta Ø These approaches were a real breakthrough in the field Ø New folds, difficult crustal structures, difficult modeling, protein design: see articles by David Baker.

92 Fragment Assembly MSSPQAPEDGQGCGDRGDPPGDLRSVLVTTV ROSETTA 9 aa fragments Choose the 25 closest sequences FRAGFOLD Supersecondary structure elements tri, tetra and penta peptides Each fragment is energetically evaluated (Statistical potential) ROSETTA Simulated Annealing of dihedral angles Optimization and Assembly (statistical potential) FRAGFOLD Random combination of fragments Simulated annealing

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Introduction to Comparative Protein Modeling. Chapter 4 Part I Introduction to Comparative Protein Modeling Chapter 4 Part I 1 Information on Proteins Each modeling study depends on the quality of the known experimental data. Basis of the model Search in the literature

More information

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror Protein structure prediction CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror 1 Outline Why predict protein structure? Can we use (pure) physics-based methods? Knowledge-based methods Two major

More information

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB Homology Modeling (Comparative Structure Modeling) Aims of Structural Genomics High-throughput 3D structure determination and analysis To determine or predict the 3D structures of all the proteins encoded

More information

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748 CAP 5510: Introduction to Bioinformatics Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinfs07.html 2/15/07 CAP5510 1 EM Algorithm Goal: Find θ, Z that maximize Pr

More information

ALL LECTURES IN SB Introduction

ALL LECTURES IN SB Introduction 1. Introduction 2. Molecular Architecture I 3. Molecular Architecture II 4. Molecular Simulation I 5. Molecular Simulation II 6. Bioinformatics I 7. Bioinformatics II 8. Prediction I 9. Prediction II ALL

More information

Week 10: Homology Modelling (II) - HHpred

Week 10: Homology Modelling (II) - HHpred Week 10: Homology Modelling (II) - HHpred Course: Tools for Structural Biology Fabian Glaser BKU - Technion 1 2 Identify and align related structures by sequence methods is not an easy task All comparative

More information

Homology Modeling. Roberto Lins EPFL - summer semester 2005

Homology Modeling. Roberto Lins EPFL - summer semester 2005 Homology Modeling Roberto Lins EPFL - summer semester 2005 Disclaimer: course material is mainly taken from: P.E. Bourne & H Weissig, Structural Bioinformatics; C.A. Orengo, D.T. Jones & J.M. Thornton,

More information

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Programme 8.00-8.20 Last week s quiz results + Summary 8.20-9.00 Fold recognition 9.00-9.15 Break 9.15-11.20 Exercise: Modelling remote homologues 11.20-11.40 Summary & discussion 11.40-12.00 Quiz 1 Feedback

More information

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction CMPS 6630: Introduction to Computational Biology and Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the

More information

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror Protein structure prediction CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror 1 Outline Why predict protein structure? Can we use (pure) physics-based methods? Knowledge-based methods Two major

More information

CMPS 3110: Bioinformatics. Tertiary Structure Prediction

CMPS 3110: Bioinformatics. Tertiary Structure Prediction CMPS 3110: Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the laws of physics! Conformation space is finite

More information

CAP 5510 Lecture 3 Protein Structures

CAP 5510 Lecture 3 Protein Structures CAP 5510 Lecture 3 Protein Structures Su-Shing Chen Bioinformatics CISE 8/19/2005 Su-Shing Chen, CISE 1 Protein Conformation 8/19/2005 Su-Shing Chen, CISE 2 Protein Conformational Structures Hydrophobicity

More information

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy Design of a Novel Globular Protein Fold with Atomic-Level Accuracy Brian Kuhlman, Gautam Dantas, Gregory C. Ireton, Gabriele Varani, Barry L. Stoddard, David Baker Presented by Kate Stafford 4 May 05 Protein

More information

Molecular Modelling. part of Bioinformatik von RNA- und Proteinstrukturen. Sonja Prohaska. Leipzig, SS Computational EvoDevo University Leipzig

Molecular Modelling. part of Bioinformatik von RNA- und Proteinstrukturen. Sonja Prohaska. Leipzig, SS Computational EvoDevo University Leipzig part of Bioinformatik von RNA- und Proteinstrukturen Computational EvoDevo University Leipzig Leipzig, SS 2011 Protein Structure levels or organization Primary structure: sequence of amino acids (from

More information

Basics of protein structure

Basics of protein structure Today: 1. Projects a. Requirements: i. Critical review of one paper ii. At least one computational result b. Noon, Dec. 3 rd written report and oral presentation are due; submit via email to bphys101@fas.harvard.edu

More information

Structure to Function. Molecular Bioinformatics, X3, 2006

Structure to Function. Molecular Bioinformatics, X3, 2006 Structure to Function Molecular Bioinformatics, X3, 2006 Structural GeNOMICS Structural Genomics project aims at determination of 3D structures of all proteins: - organize known proteins into families

More information

Introduction to" Protein Structure

Introduction to Protein Structure Introduction to" Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Learning Objectives Outline the basic levels of protein structure.

More information

Supporting Online Material for

Supporting Online Material for www.sciencemag.org/cgi/content/full/309/5742/1868/dc1 Supporting Online Material for Toward High-Resolution de Novo Structure Prediction for Small Proteins Philip Bradley, Kira M. S. Misura, David Baker*

More information

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron. Protein Dynamics The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron. Below is myoglobin hydrated with 350 water molecules. Only a small

More information

HMM applications. Applications of HMMs. Gene finding with HMMs. Using the gene finder

HMM applications. Applications of HMMs. Gene finding with HMMs. Using the gene finder HMM applications Applications of HMMs Gene finding Pairwise alignment (pair HMMs) Characterizing protein families (profile HMMs) Predicting membrane proteins, and membrane protein topology Gene finding

More information

Protein Structure Prediction

Protein Structure Prediction Page 1 Protein Structure Prediction Russ B. Altman BMI 214 CS 274 Protein Folding is different from structure prediction --Folding is concerned with the process of taking the 3D shape, usually based on

More information

Protein structure alignments

Protein structure alignments Protein structure alignments Proteins that fold in the same way, i.e. have the same fold are often homologs. Structure evolves slower than sequence Sequence is less conserved than structure If BLAST gives

More information

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE Examples of Protein Modeling Protein Modeling Visualization Examination of an experimental structure to gain insight about a research question Dynamics To examine the dynamics of protein structures To

More information

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a Lecture 11: Protein Folding & Stability Margaret A. Daugherty Fall 2004 How do we go from an unfolded polypeptide chain to a compact folded protein? (Folding of thioredoxin, F. Richards) Structure - Function

More information

Introduction to Computational Structural Biology

Introduction to Computational Structural Biology Introduction to Computational Structural Biology Part I 1. Introduction The disciplinary character of Computational Structural Biology The mathematical background required and the topics covered Bibliography

More information

Procheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics.

Procheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics. Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics Iosif Vaisman Email: ivaisman@gmu.edu ----------------------------------------------------------------- Bond

More information

Can protein model accuracy be. identified? NO! CBS, BioCentrum, Morten Nielsen, DTU

Can protein model accuracy be. identified? NO! CBS, BioCentrum, Morten Nielsen, DTU Can protein model accuracy be identified? Morten Nielsen, CBS, BioCentrum, DTU NO! Identification of Protein-model accuracy Why is it important? What is accuracy RMSD, fraction correct, Protein model correctness/quality

More information

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche The molecular structure of a protein can be broken down hierarchically. The primary structure of a protein is simply its

More information

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC The precise definition of a dihedral or torsion angle can be found in spatial geometry Angle between to planes Dihedral

More information

CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Giri Narasimhan

CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Giri Narasimhan CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinff18.html Proteins and Protein Structure

More information

Protein Structure Prediction, Engineering & Design CHEM 430

Protein Structure Prediction, Engineering & Design CHEM 430 Protein Structure Prediction, Engineering & Design CHEM 430 Eero Saarinen The free energy surface of a protein Protein Structure Prediction & Design Full Protein Structure from Sequence - High Alignment

More information

D Dobbs ISU - BCB 444/544X 1

D Dobbs ISU - BCB 444/544X 1 11/7/05 Protein Structure: Classification, Databases, Visualization Announcements BCB 544 Projects - Important Dates: Nov 2 Wed noon - Project proposals due to David/Drena Nov 4 Fri PM - Approvals/responses

More information

Protein Structure Basics

Protein Structure Basics Protein Structure Basics Presented by Alison Fraser, Christine Lee, Pradhuman Jhala, Corban Rivera Importance of Proteins Muscle structure depends on protein-protein interactions Transport across membranes

More information

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES Protein Structure W. M. Grogan, Ph.D. OBJECTIVES 1. Describe the structure and characteristic properties of typical proteins. 2. List and describe the four levels of structure found in proteins. 3. Relate

More information

SCOP. all-β class. all-α class, 3 different folds. T4 endonuclease V. 4-helical cytokines. Globin-like

SCOP. all-β class. all-α class, 3 different folds. T4 endonuclease V. 4-helical cytokines. Globin-like SCOP all-β class 4-helical cytokines T4 endonuclease V all-α class, 3 different folds Globin-like TIM-barrel fold α/β class Profilin-like fold α+β class http://scop.mrc-lmb.cam.ac.uk/scop CATH Class, Architecture,

More information

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University COMP 598 Advanced Computational Biology Methods & Research Introduction Jérôme Waldispühl School of Computer Science McGill University General informations (1) Office hours: by appointment Office: TR3018

More information

Molecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007

Molecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007 Molecular Modeling Prediction of Protein 3D Structure from Sequence Vimalkumar Velayudhan Jain Institute of Vocational and Advanced Studies May 21, 2007 Vimalkumar Velayudhan Molecular Modeling 1/23 Outline

More information

Announcements. Primary (1 ) Structure. Lecture 7 & 8: PROTEIN ARCHITECTURE IV: Tertiary and Quaternary Structure

Announcements. Primary (1 ) Structure. Lecture 7 & 8: PROTEIN ARCHITECTURE IV: Tertiary and Quaternary Structure Announcements TA Office Hours: Brian Eckenroth Monday 3-4 pm Thursday 11 am-12 pm Lecture 7 & 8: PROTEIN ARCHITECTURE IV: Tertiary and Quaternary Structure Margaret Daugherty Fall 2003 Homework II posted

More information

Bioinformatics. Macromolecular structure

Bioinformatics. Macromolecular structure Bioinformatics Macromolecular structure Contents Determination of protein structure Structure databases Secondary structure elements (SSE) Tertiary structure Structure analysis Structure alignment Domain

More information

Major Types of Association of Proteins with Cell Membranes. From Alberts et al

Major Types of Association of Proteins with Cell Membranes. From Alberts et al Major Types of Association of Proteins with Cell Membranes From Alberts et al Proteins Are Polymers of Amino Acids Peptide Bond Formation Amino Acid central carbon atom to which are attached amino group

More information

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION AND CALIBRATION Calculation of turn and beta intrinsic propensities. A statistical analysis of a protein structure

More information

Building 3D models of proteins

Building 3D models of proteins Building 3D models of proteins Why make a structural model for your protein? The structure can provide clues to the function through structural similarity with other proteins With a structure it is easier

More information

The protein folding problem consists of two parts:

The protein folding problem consists of two parts: Energetics and kinetics of protein folding The protein folding problem consists of two parts: 1)Creating a stable, well-defined structure that is significantly more stable than all other possible structures.

More information

CS612 - Algorithms in Bioinformatics

CS612 - Algorithms in Bioinformatics Fall 2017 Protein Structure Detection Methods October 30, 2017 Comparative Modeling Comparative modeling is modeling of the unknown based on comparison to what is known In the context of modeling or computing

More information

Analysis and Prediction of Protein Structure (I)

Analysis and Prediction of Protein Structure (I) Analysis and Prediction of Protein Structure (I) Jianlin Cheng, PhD School of Electrical Engineering and Computer Science University of Central Florida 2006 Free for academic use. Copyright @ Jianlin Cheng

More information

BCH 4053 Spring 2003 Chapter 6 Lecture Notes

BCH 4053 Spring 2003 Chapter 6 Lecture Notes BCH 4053 Spring 2003 Chapter 6 Lecture Notes 1 CHAPTER 6 Proteins: Secondary, Tertiary, and Quaternary Structure 2 Levels of Protein Structure Primary (sequence) Secondary (ordered structure along peptide

More information

Physiochemical Properties of Residues

Physiochemical Properties of Residues Physiochemical Properties of Residues Various Sources C N Cα R Slide 1 Conformational Propensities Conformational Propensity is the frequency in which a residue adopts a given conformation (in a polypeptide)

More information

Presenter: She Zhang

Presenter: She Zhang Presenter: She Zhang Introduction Dr. David Baker Introduction Why design proteins de novo? It is not clear how non-covalent interactions favor one specific native structure over many other non-native

More information

Protein Structure Prediction

Protein Structure Prediction Protein Structure Prediction Michael Feig MMTSB/CTBP 2006 Summer Workshop From Sequence to Structure SEALGDTIVKNA Ab initio Structure Prediction Protocol Amino Acid Sequence Conformational Sampling to

More information

Outline. Levels of Protein Structure. Primary (1 ) Structure. Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins

Outline. Levels of Protein Structure. Primary (1 ) Structure. Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins Margaret Daugherty Fall 2004 Outline Four levels of structure are used to describe proteins; Alpha helices and beta sheets

More information

Outline. Levels of Protein Structure. Primary (1 ) Structure. Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins

Outline. Levels of Protein Structure. Primary (1 ) Structure. Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins Lecture 6:Protein Architecture II: Secondary Structure or From peptides to proteins Margaret Daugherty Fall 2003 Outline Four levels of structure are used to describe proteins; Alpha helices and beta sheets

More information

Getting To Know Your Protein

Getting To Know Your Protein Getting To Know Your Protein Comparative Protein Analysis: Part III. Protein Structure Prediction and Comparison Robert Latek, PhD Sr. Bioinformatics Scientist Whitehead Institute for Biomedical Research

More information

Modeling for 3D structure prediction

Modeling for 3D structure prediction Modeling for 3D structure prediction What is a predicted structure? A structure that is constructed using as the sole source of information data obtained from computer based data-mining. However, mixing

More information

Biomolecules: lecture 10

Biomolecules: lecture 10 Biomolecules: lecture 10 - understanding in detail how protein 3D structures form - realize that protein molecules are not static wire models but instead dynamic, where in principle every atom moves (yet

More information

Protein Structures: Experiments and Modeling. Patrice Koehl

Protein Structures: Experiments and Modeling. Patrice Koehl Protein Structures: Experiments and Modeling Patrice Koehl Structural Bioinformatics: Proteins Proteins: Sources of Structure Information Proteins: Homology Modeling Proteins: Ab initio prediction Proteins:

More information

Motif Prediction in Amino Acid Interaction Networks

Motif Prediction in Amino Acid Interaction Networks Motif Prediction in Amino Acid Interaction Networks Omar GACI and Stefan BALEV Abstract In this paper we represent a protein as a graph where the vertices are amino acids and the edges are interactions

More information

09/06/25. Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Non-uniform distribution of folds. Scheme of protein structure predicition

09/06/25. Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Non-uniform distribution of folds. Scheme of protein structure predicition Sequence identity Structural similarity Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Fold recognition Sommersemester 2009 Peter Güntert Structural similarity X Sequence identity Non-uniform

More information

Supersecondary Structures (structural motifs)

Supersecondary Structures (structural motifs) Supersecondary Structures (structural motifs) Various Sources Slide 1 Supersecondary Structures (Motifs) Supersecondary Structures (Motifs): : Combinations of secondary structures in specific geometric

More information

FlexPepDock In a nutshell

FlexPepDock In a nutshell FlexPepDock In a nutshell All Tutorial files are located in http://bit.ly/mxtakv FlexPepdock refinement Step 1 Step 3 - Refinement Step 4 - Selection of models Measure of fit FlexPepdock Ab-initio Step

More information

Biology Chemistry & Physics of Biomolecules. Examination #1. Proteins Module. September 29, Answer Key

Biology Chemistry & Physics of Biomolecules. Examination #1. Proteins Module. September 29, Answer Key Biology 5357 Chemistry & Physics of Biomolecules Examination #1 Proteins Module September 29, 2017 Answer Key Question 1 (A) (5 points) Structure (b) is more common, as it contains the shorter connection

More information

Ab-initio protein structure prediction

Ab-initio protein structure prediction Ab-initio protein structure prediction Jaroslaw Pillardy Computational Biology Service Unit Cornell Theory Center, Cornell University Ithaca, NY USA Methods for predicting protein structure 1. Homology

More information

Details of Protein Structure

Details of Protein Structure Details of Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Anne Mølgaard, Kemisk Institut, Københavns Universitet Learning Objectives

More information

Protein Structure Determination

Protein Structure Determination Protein Structure Determination Given a protein sequence, determine its 3D structure 1 MIKLGIVMDP IANINIKKDS SFAMLLEAQR RGYELHYMEM GDLYLINGEA 51 RAHTRTLNVK QNYEEWFSFV GEQDLPLADL DVILMRKDPP FDTEFIYATY 101

More information

Protein Modeling. Generating, Evaluating and Refining Protein Homology Models

Protein Modeling. Generating, Evaluating and Refining Protein Homology Models Protein Modeling Generating, Evaluating and Refining Protein Homology Models Troy Wymore and Kristen Messinger Biomedical Initiatives Group Pittsburgh Supercomputing Center Homology Modeling of Proteins

More information

Template Free Protein Structure Modeling Jianlin Cheng, PhD

Template Free Protein Structure Modeling Jianlin Cheng, PhD Template Free Protein Structure Modeling Jianlin Cheng, PhD Professor Department of EECS Informatics Institute University of Missouri, Columbia 2018 Protein Energy Landscape & Free Sampling http://pubs.acs.org/subscribe/archive/mdd/v03/i09/html/willis.html

More information

Bioinformatics: Secondary Structure Prediction

Bioinformatics: Secondary Structure Prediction Bioinformatics: Secondary Structure Prediction Prof. David Jones d.t.jones@ucl.ac.uk Possibly the greatest unsolved problem in molecular biology: The Protein Folding Problem MWMPPRPEEVARK LRRLGFVERMAKG

More information

114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009

114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009 114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009 9 Protein tertiary structure Sources for this chapter, which are all recommended reading: D.W. Mount. Bioinformatics: Sequences and Genome

More information

Homology modeling. Dinesh Gupta ICGEB, New Delhi 1/27/2010 5:59 PM

Homology modeling. Dinesh Gupta ICGEB, New Delhi 1/27/2010 5:59 PM Homology modeling Dinesh Gupta ICGEB, New Delhi Protein structure prediction Methods: Homology (comparative) modelling Threading Ab-initio Protein Homology modeling Homology modeling is an extrapolation

More information

From Amino Acids to Proteins - in 4 Easy Steps

From Amino Acids to Proteins - in 4 Easy Steps From Amino Acids to Proteins - in 4 Easy Steps Although protein structure appears to be overwhelmingly complex, you can provide your students with a basic understanding of how proteins fold by focusing

More information

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison CMPS 6630: Introduction to Computational Biology and Bioinformatics Structure Comparison Protein Structure Comparison Motivation Understand sequence and structure variability Understand Domain architecture

More information

Protein Structure Bioinformatics Introduction

Protein Structure Bioinformatics Introduction 1 Swiss Institute of Bioinformatics Protein Structure Bioinformatics Introduction Basel, 27. September 2004 Torsten Schwede Biozentrum - Universität Basel Swiss Institute of Bioinformatics Klingelbergstr

More information

Sequence analysis and comparison

Sequence analysis and comparison The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species

More information

Lecture 11: Protein Folding & Stability

Lecture 11: Protein Folding & Stability Structure - Function Protein Folding: What we know Lecture 11: Protein Folding & Stability 1). Amino acid sequence dictates structure. 2). The native structure represents the lowest energy state for a

More information

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding Lecture 11: Protein Folding & Stability Margaret A. Daugherty Fall 2003 Structure - Function Protein Folding: What we know 1). Amino acid sequence dictates structure. 2). The native structure represents

More information

Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program)

Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program) Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program) Course Name: Structural Bioinformatics Course Description: Instructor: This course introduces fundamental concepts and methods for structural

More information

Central Dogma. modifications genome transcriptome proteome

Central Dogma. modifications genome transcriptome proteome entral Dogma DA ma protein post-translational modifications genome transcriptome proteome 83 ierarchy of Protein Structure 20 Amino Acids There are 20 n possible sequences for a protein of n residues!

More information

Copyright Mark Brandt, Ph.D A third method, cryogenic electron microscopy has seen increasing use over the past few years.

Copyright Mark Brandt, Ph.D A third method, cryogenic electron microscopy has seen increasing use over the past few years. Structure Determination and Sequence Analysis The vast majority of the experimentally determined three-dimensional protein structures have been solved by one of two methods: X-ray diffraction and Nuclear

More information

2MHR. Protein structure classification is important because it organizes the protein structure universe that is independent of sequence similarity.

2MHR. Protein structure classification is important because it organizes the protein structure universe that is independent of sequence similarity. Protein structure classification is important because it organizes the protein structure universe that is independent of sequence similarity. A global picture of the protein universe will help us to understand

More information

4 Proteins: Structure, Function, Folding W. H. Freeman and Company

4 Proteins: Structure, Function, Folding W. H. Freeman and Company 4 Proteins: Structure, Function, Folding 2013 W. H. Freeman and Company CHAPTER 4 Proteins: Structure, Function, Folding Learning goals: Structure and properties of the peptide bond Structural hierarchy

More information

Molecular Modeling Lecture 7. Homology modeling insertions/deletions manual realignment

Molecular Modeling Lecture 7. Homology modeling insertions/deletions manual realignment Molecular Modeling 2018-- Lecture 7 Homology modeling insertions/deletions manual realignment Homology modeling also called comparative modeling Sequences that have similar sequence have similar structure.

More information

Section Week 3. Junaid Malek, M.D.

Section Week 3. Junaid Malek, M.D. Section Week 3 Junaid Malek, M.D. Biological Polymers DA 4 monomers (building blocks), limited structure (double-helix) RA 4 monomers, greater flexibility, multiple structures Proteins 20 Amino Acids,

More information

HOMOLOGY MODELING. The sequence alignment and template structure are then used to produce a structural model of the target.

HOMOLOGY MODELING. The sequence alignment and template structure are then used to produce a structural model of the target. HOMOLOGY MODELING Homology modeling, also known as comparative modeling of protein refers to constructing an atomic-resolution model of the "target" protein from its amino acid sequence and an experimental

More information

Template Free Protein Structure Modeling Jianlin Cheng, PhD

Template Free Protein Structure Modeling Jianlin Cheng, PhD Template Free Protein Structure Modeling Jianlin Cheng, PhD Associate Professor Computer Science Department Informatics Institute University of Missouri, Columbia 2013 Protein Energy Landscape & Free Sampling

More information

Template-Based Modeling of Protein Structure

Template-Based Modeling of Protein Structure Template-Based Modeling of Protein Structure David Constant Biochemistry 218 December 11, 2011 Introduction. Much can be learned about the biology of a protein from its structure. Simply put, structure

More information

Protein Structure Prediction

Protein Structure Prediction Protein Structure Prediction Michael Feig MMTSB/CTBP 2009 Summer Workshop From Sequence to Structure SEALGDTIVKNA Folding with All-Atom Models AAQAAAAQAAAAQAA All-atom MD in general not succesful for real

More information

Protein Structures. 11/19/2002 Lecture 24 1

Protein Structures. 11/19/2002 Lecture 24 1 Protein Structures 11/19/2002 Lecture 24 1 All 3 figures are cartoons of an amino acid residue. 11/19/2002 Lecture 24 2 Peptide bonds in chains of residues 11/19/2002 Lecture 24 3 Angles φ and ψ in the

More information

Assignment 2 Atomic-Level Molecular Modeling

Assignment 2 Atomic-Level Molecular Modeling Assignment 2 Atomic-Level Molecular Modeling CS/BIOE/CME/BIOPHYS/BIOMEDIN 279 Due: November 3, 2016 at 3:00 PM The goal of this assignment is to understand the biological and computational aspects of macromolecular

More information

Homology Modeling I. Growth of the Protein Data Bank PDB. Basel, September 30, EMBnet course: Introduction to Protein Structure Bioinformatics

Homology Modeling I. Growth of the Protein Data Bank PDB. Basel, September 30, EMBnet course: Introduction to Protein Structure Bioinformatics Swiss Institute of Bioinformatics EMBnet course: Introduction to Protein Structure Bioinformatics Homology Modeling I Basel, September 30, 2004 Torsten Schwede Biozentrum - Universität Basel Swiss Institute

More information

STRUCTURAL BIOINFORMATICS I. Fall 2015

STRUCTURAL BIOINFORMATICS I. Fall 2015 STRUCTURAL BIOINFORMATICS I Fall 2015 Info Course Number - Classification: Biology 5411 Class Schedule: Monday 5:30-7:50 PM, SERC Room 456 (4 th floor) Instructors: Vincenzo Carnevale - SERC, Room 704C;

More information

Conditional Graphical Models

Conditional Graphical Models PhD Thesis Proposal Conditional Graphical Models for Protein Structure Prediction Yan Liu Language Technologies Institute University Thesis Committee Jaime Carbonell (Chair) John Lafferty Eric P. Xing

More information

Protein Structure: Data Bases and Classification Ingo Ruczinski

Protein Structure: Data Bases and Classification Ingo Ruczinski Protein Structure: Data Bases and Classification Ingo Ruczinski Department of Biostatistics, Johns Hopkins University Reference Bourne and Weissig Structural Bioinformatics Wiley, 2003 More References

More information

Prediction and refinement of NMR structures from sparse experimental data

Prediction and refinement of NMR structures from sparse experimental data Prediction and refinement of NMR structures from sparse experimental data Jeff Skolnick Director Center for the Study of Systems Biology School of Biology Georgia Institute of Technology Overview of talk

More information

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Part I. Review of forces Covalent bonds Non-covalent Interactions: Van der Waals Interactions

More information

STRUCTURAL BIOINFORMATICS. Barry Grant University of Michigan

STRUCTURAL BIOINFORMATICS. Barry Grant University of Michigan STRUCTURAL BIOINFORMATICS Barry Grant University of Michigan www.thegrantlab.org bjgrant@umich.edu Bergen, Norway 28-Sep-2015 Objective: Provide an introduction to the practice of structural bioinformatics,

More information

Protein Folding experiments and theory

Protein Folding experiments and theory Protein Folding experiments and theory 1, 2,and 3 Protein Structure Fig. 3-16 from Lehninger Biochemistry, 4 th ed. The 3D structure is not encoded at the single aa level Hydrogen Bonding Shared H atom

More information

Secondary and sidechain structures

Secondary and sidechain structures Lecture 2 Secondary and sidechain structures James Chou BCMP201 Spring 2008 Images from Petsko & Ringe, Protein Structure and Function. Branden & Tooze, Introduction to Protein Structure. Richardson, J.

More information

Why Proteins Fold. How Proteins Fold? e - ΔG/kT. Protein Folding, Nonbonding Forces, and Free Energy

Why Proteins Fold. How Proteins Fold? e - ΔG/kT. Protein Folding, Nonbonding Forces, and Free Energy Why Proteins Fold Proteins are the action superheroes of the body. As enzymes, they make reactions go a million times faster. As versatile transport vehicles, they carry oxygen and antibodies to fight

More information

Bioinformatics III Structural Bioinformatics and Genome Analysis Part Protein Secondary Structure Prediction. Sepp Hochreiter

Bioinformatics III Structural Bioinformatics and Genome Analysis Part Protein Secondary Structure Prediction. Sepp Hochreiter Bioinformatics III Structural Bioinformatics and Genome Analysis Part Protein Secondary Structure Prediction Institute of Bioinformatics Johannes Kepler University, Linz, Austria Chapter 4 Protein Secondary

More information

Computer simulations of protein folding with a small number of distance restraints

Computer simulations of protein folding with a small number of distance restraints Vol. 49 No. 3/2002 683 692 QUARTERLY Computer simulations of protein folding with a small number of distance restraints Andrzej Sikorski 1, Andrzej Kolinski 1,2 and Jeffrey Skolnick 2 1 Department of Chemistry,

More information

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure Bioch/BIMS 503 Lecture 2 Structure and Function of Proteins August 28, 2008 Robert Nakamoto rkn3c@virginia.edu 2-0279 Secondary Structure Φ Ψ angles determine protein structure Φ Ψ angles are restricted

More information