BCB 444/544 Fall 07 Dobbs 1

Size: px
Start display at page:

Download "BCB 444/544 Fall 07 Dobbs 1"

Transcription

1 BCB 444/544 Required Reading (before lecture) Lecture 23 Mon Oct 5 - Lecture 23 Protein Tertiary Structure Chp 5 - pp Protein Tertiary Structure Wed Oct 7 & Thurs Oct 8 - Lecture 24 & Lab 8 RNA Structure/Function & RNA Structure Chp 6 - pp (Terribilini) #23_Oct5 Fri Oct 8 - Lecture 25 Gene Chp 8 - pp New Reading & Homework Assignment Seminars Last Week ALL: HomeWork #4 ( ed & posted online Sat AM) Due: Mon Oct 22 by 5 PM (not Fri Oct 9) Read: Ginalski et al.(2005) Practical Lessons from Protein Structure, Nucleic Acids Res. 33: (PDF posted on website) Although somewhat dated, this paper provides a nice overview of protein structure prediction methods and evaluation of predicted structures. Your assignment is to write a summary of this paper - for details see HW#4 posted online & sent by on Sat Oct 3 Dr. Klaus Schulten (Univ of Illinois) - Baker Center Seminar The Computational Microscope 2:0 PM in E64 Lagomarcino n_seminar.pdf Check out links on Schulten's website (videos, etc) Great seminar - amazing simulations of dynamics in proteins and large macromolecular assemblies Very computationally intensive - very impressive demonstration of power of computation to produce insights not attainable using only experimental approaches 3 4 Seminars this Week Protein Sequence & Structure: Analysis BCB List of URLs for Seminars related to Bioinformatics: Oct 8 Thur - BBMB Seminar 4:0 in 44 MBB Sachdeve Sidhu (Genentech) Phage peptide and antibody libraries in protein engineering and ligand selection Oct 9 Fri - BCB Faculty Seminar 2:0 in 02 ScI Lyric Bartholomay (Ent, ISU) TBA Diamond STING Millennium - Many useful structure analysis tools, including Protein Dossier SwissProt (UniProt) Protein knowledgebase InterPro Sequence analysis tools BCB 444/544 Fall 07 Dobbs

2 Chp 4 - Secondary Structure Where Find "Actual" Secondary Structure? In the PDB SECTION V STRUCTURAL BIOINFORMATICS Xiong: Chp 4 Protein Secondary Structure Secondary Structure for Globular Proteins Secondary Structure for Transmembrane Proteins Coiled-Coil 7 8 How Does Predicted Secondary Structure Compare with Actual? (An example) Chp 5 - Tertiary Structure Actual - Calculated from PDB coordinates by DSSP or author: SECTION V STRUCTURAL BIOINFORMATICS DSSP Author Xiong: Chp 5 Protein Tertiary Structure Query MAATAAEAVASGSGEPREEAGALGPAWDESQLRSYSFPTRPIPRLSQSDPRAEELIENEE GOR V CCCCHHHHHHHHCCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCC FDM CCCCCCCCCCCCCCCCCEECCCCCCCCCHHHCCCCCCEECCCCCCCCCCHHHHHHHHCCC CDM CCCCHHHHHHCCCCCCCEECCCCCCCCCHHHCCCCCCEECCCCCCCCCCHHHHHHHHCCC Predicted - Using 3 methods (from CMD server, Jernigan Group, ISU) Methods Homology Modeling Threading and Fold Recognition Ab Initio Protein Structural CASP 9 0 Structural Genomics - Status & Goal Structural Genomics Project ~ 20,000 "traditional" genes in human genome (recall, this is fewer than earlier estimate of 30,000) ~ 2,000 proteins in a typical cell > 4.9 million sequences in UniProt (Oct 2007) > 46,000 protein structures in the PDB (Oct 2007) TargetDB: Database of Structural Genomics Targets Experimental determination of protein structure lags far behind sequence determination! Goal: Determine structures of "all" protein folds in nature, using combination of experimental structure determination methods (X-ray crystallography, NMR, mass spectrometry) & structure prediction 2 BCB 444/544 Fall 07 Dobbs 2

3 Database of Theoretical Structures? PMDB: Protein Model Database also, via NAR's Molecular Biology Database Collection Protein Structure or Protein Folding Problem "Major unsolved problem in molecular biology" In cells: spontaneous assisted by enzymes assisted by chaperones Theoretical structural models (predicted) are no longer accepted by the PDB (since 0/5/06); but, it is possible to search for models deposited earlier: In vitro: many proteins can fold to their "native" states spontaneously & without assistance but, many do not! 3 4 Deciphering the Protein Folding Code Protein Structure Protein Structure or Protein Folding Problem Given the amino acid sequence of a protein, predict its 3-dimensional structure (fold) Inverse Folding Problem Given a protein fold, identify every amino acid sequence that can adopt its 3-dimensional structure Structure is largely determined by sequence BUT: Similar sequences can assume different structures Dissimilar sequences can assume similar structures Many proteins are multi-functional 2 Major Protein Folding Problems: - Determine folding pathway 2- Predict tertiary structure from sequence Both still largely unsolved problems 5 6 Steps in Protein Folding Protein Dynamics - "Collapse"- driving force is burial of hydrophobic aa s (fast - msecs) 2- Molten globule - helices & sheets form, but "loose" (slow - secs) 3- "Final" native folded state - compaction & rearrangement of 2' structures Protein in native state is NOT static Function of many proteins requires conformational changes, sometimes large, sometimes small Globular proteins are inherently "unstable" (NOT evolved for maximum stability) Energy difference between native and denatured state is very small (5-5 kcal/mol) (this is equivalent to ~ 2 H-bonds!) Native state? - assumed to be lowest free energy - may be an ensemble of structures Folding involves changes in both entropy & enthalpy 7 8 BCB 444/544 Fall 07 Dobbs 3

4 Difficulty of Tertiary Structure Folding or tertiary structure prediction problem can be formulated as a search for minimum energy conformation Search space is defined by psi/phi angles of backbone and side-chain rotamers Search space is enormous even for small proteins! Number of local minima increases exponentially with number of residues Tertiary Structure Methods 2 (or 3) Major Methods:. Comparative Modeling: Homology Modeling (easiest!) Threading and Fold Recognition (harder) 2. Ab Initio Protein Structural (really hard) Computationally it is an exceedingly difficult problem! 9 20 Comparative Modeling? Ab Initio Comparative modeling - term is sometimes used interchangeably with homology modeling, but also sometimes used to mean both: homology modeling threading/fold recognition. Develop energy function bond energy bond angle energy dihedral angle energy van der Waals energy electrostatic energy 2. Calculate structure by minimizing energy function usually Molecular Dynamics (MD) or Monte Carlo (MC) Ab initio prediction - impractical for most real (long) proteins Computationally? very expensive Accuracy? Usually poor for all except short peptides (but much improvement recently!) Provides both folding pathway & folded structure 2 22 Comparative Modeling Homology Modeling Two types: ) Homology modeling 2) Threading (fold recognition) Both rely on availability of experimentally determined structures that are "homologous" or at least structurally very similar to target Provide folded structure only. Identify homologous protein sequences (ψ-blast) 2. Among available structures (in PDB), choose one with closest sequence to target as template (can combine steps & 2 by using PDB-BLAST) 3. Build model by placing target sequence residues in corresponding positions on homologous structure & refine by "tweaking" modeled structure (energy minimization) Homology modeling - works "well" Computationally? "relatively" inexpensive Accuracy? higher sequence identity better model Requires ~30% sequence identity with sequence for which structure is known BCB 444/544 Fall 07 Dobbs 4

5 Threading - Fold Recognition Identify best fit between target sequence & template structure. Develop energy function 2. Develop template library 3. Align target sequence with each template in library & score 4. Identify top scoring template (D to 3D alignment) 5. Refine structure as in homology modeling Threading - works "sometimes" Computationally? Can be expensive or cheap, depends on energy function & whether "all atom" or "backbone only" threading is used Accuracy? in theory, should not depend on sequence identity (should depend on quality of template library & "luck") Usually, higher sequence identity to protein of known structure better model Threading: the Motivation Basic premise: The number of unique structural folds in nature is fairly small (probably ) Statistics from Protein Data Bank (>46,000 structures) Prior to Structural Genomics Project, 90% of "new" structures submitted to PDB were similar to existing folds in PDB - suggesting that almost all folds in nature have been identified Thus, chances for a protein to have a native-like structural fold in PDB are quite good Note: Proteins with similar structural folds could be either homologs or analogs Steps in Threading Threading Goal - & Issues Target Sequence ALKKGF HFDTSE Find correct sequence-structure alignment of a target sequence with its native-like fold in template library (usually derived from PDB) Structure Templates Structure database - must be "complete" Can't build a good model if there is no good template in library! Sequence-structure alignment algorithm: Bad alignment Bad score! Energy function or Scoring Scheme:. Align target sequence with template structures in fold library (usually from the PDB) 2. Calculate energy score to evaluate "goodness of fit" between target sequence & template structure 3. Rank models based on energy scores Must distinguish correct sequence-fold alignment from incorrect sequence-fold alignments Must distinguish correct fold from close decoys reliability assessment - How determine whether predicted structure is correct? (or even close?) Threading: Template database Threading: Energy function Build a database of structural templates e.g., ASTRAL domain library derived from the PDB Two main methods (& combinations of these) Structural profile (environmental) physicochemical properties of amino acids Sometimes, supplement with additional decoys e.g., generated using ab initio approach such as Rosetta (Baker) Contact potential (statistical) based on contact statistics from PDB famous one: Miyazawa & Jernigan (ISU) BCB 444/544 Fall 07 Dobbs 5

6 Protein Threading: Typical energy function A Local Example: Rapid Threading Approach for Protein Structure What is "probability" that two specific residues are in contact? Total energy: E p + E s + E g How well does a specific residue fit structural environment? Alignment gap penalty? Goal: Find a sequence-structure alignment that minimizes energy function Kai-Ming Ho, Physics Haibo Cao Yungok Ihm Zhong Gao James Morris Cai-zhuang Wang Drena Dobbs, GDCB Jae-Hyung Lee Michael Terribilini Jeff Sander Cao H, Ihm Y, Wang, CZ, Morris, JR, Su, M, Dobbs, D, Ho, KM (2004) Three-dimensional threading approach to protein structure recognition Polymer 45: Motivations for & Assumptions of Ho Threading Algorithm Simplify: Template structure representation Goal: Develop a threading algorithm that: Is simple & rapid enough to be used in high throughput applications Is relatively "insensitive" to sequence similarity between target protein sequence & sequence of template structure (to enhance detection of remote homologs & structures that are similar due to convergent evolution) Can be used to answer questions such as: What are predicted structures of all "unassigned" ORFs in Arabidopsis? Does Arabidopsis have a protein with structure similar to mammalian Tumor Necrosis Factor (TNF)? Assumptions: Native state of a protein is lowest free energy state Hydrophobic interactions drive protein folding 33 Template structure ( N! N contact matrix) Cij =, if rij! 6. 5 Cij = 0, Yungok Ihm Å C (contact) Otherwise A neighbor in sequence (non-contact) i j N 34 Simplify: Target Sequence Representation Simplify: Energy Function Miyazawa-Jernigan (MJ) model: inter-residue contact energy M(i,j) is a quasi-chemical approximation based on pairwise contact statistics extracted from known protein structures in the PDB: 20 X 20 matrix = 20 values ("letters") Li-Tang-Wingreen (LTW): factorize the MJ interaction matrix to reduce the number of parameters associated with amino acids from 20 to 20 q values Hydrophobic-Polar (HP): represent amino acids as either H (hydrophobic) or polar (P); Dill et al demonstrated the utility of this simple binary alphabet representation: 2 values Compare results with 20 vs 20 vs 2 letter representations How low can we go? Interaction counts only if two hydrophobic amino acid residues are in contact At residue level, pair-wise hydrophobic interaction is dominant: E = Σi,j Cij Uij Cij : contact matrix Uij = U(residue I, residue J) MJ: U = Uij LTW: U = Qi*Qj HP: U = {,0} 35 Yungok IhmBCB 444/544 F07 ISU Dobbs #23 - Protein Tertiary Structure 36 BCB 444/544 Fall 07 Dobbs 6

7 Energy calculation: Contact energy Summary of Ho Threading Procedure Miyazawa-Jernigan (MJ) matrix: Statistical potential 20 parameters M = C M F I L V W C M F I L Template Structure i j N Contact Matrix Cij =, Cij = 0, if rij < 6 5 Å otherwise (a neighbor in sequence) Li-Tang-Wingreen (LTW): 20 parameters ~ M ij = j Contact Energy: Ec = ( QiCijQj +! Cij) ij= Yungok Ihm N " C2{( qi + ")( q + ") +!} qi ~ solubility Qi ~ hydrophobicity C contact matrix Qi =! qi!# with # =! , " =! Sequence AVFMRIHNDIVYNDIANTTQ Scoring Function Yungok Ihm Sequence Vector S = ( QA, QV, QF,..., QE) = (0.7997, ,.97, ) Contact Energy N Ec = " QiCijQj +! ij= Can complexity be further reduced? Consider simplifying structure representation, too Examine eigenvectors of contact matrix ALKKGF HFDTSE Hydrophobic Contacts ~ # TCT = T ~ ( = N $ i= ~! ivit 2 N $ i= ~! ivivi) T ~ "! V T 2 ri = N ~ 2 " i( ViT ) ~ " i( ViT )! i= 2 Sequence Structure Sequence Contact Matrix Sequence D Profile Haibo Cao (D 3D problem) (D 2D problem) (D D problem) C :contact matrix!i :i-th eigenvalue of C V i :i-th eigenvector V :eigenvector with largest eigenvalue T :protein sequence of the template structure i :fraction of hydrophobic contacts from i-th eigenvector r Haibo Cao Represent contact matrix by its dominant eigenvector (D profile) Threading Alignment Step - now fast! Align target sequence vector (D) with eigenvector profile of template structure (D) D Profile P = V First eigenvector (with highest eigenvalue) dominates the overlap between sequence and structure Higher ranking (rank > 4) eigenvectors are sequence blind Maximize the overlap between the Sequence (S) and the profile (P) S P allowing gaps Calculate contact energy using the alignment: E c New profile P = CP Cao et al Haibo Cao Polymer 45 (2004) BCB /544 F07 ISU Dobbs #23 - Protein Tertiary Structure BCB 444/544 Fall 07 Dobbs 7

8 Parameters for alignment? How incorporate secondary structure? Gap penalty: Insertion/deletion in helices or strands is strongly penalized; smaller penalties for in/dels in loops Gap penalties apply to alignment score only, not to energy calculation Size penalty: If a target residue and aligned template residue differ in radius by > 0.5Å and if residue is involved in > 2 contacts, alignment is penalized Size penalties apply to alignment score only, not to energy calculation ALKKGFG HFDTSE Loop Helix Predict secondary structure of target sequence (PSIPRED, PROF, JPRED, SAM, GOR V) N + = total number of matches between predicted & actual secondary structure of template N - = total number of mismatches N s = total number of residues selected in alignment Global fitness : f = + (N + - N - ) / N s Emod = f * Ethreading Yungok IhmBCB Yungok Ihm BCB /544 F07 ISU Dobbs #23 - Protein Tertiary Structure /544 F07 ISU Dobbs #23 - Protein Tertiary Structure How much better is this fit than random? Eshuffle : Shuffled Sequence vs Structure Erelative = Emod Eshuffled Performance Evaluation? "Blind Test" CASP5 Competition (CASP7 is most recent) (Critical Assessment of Protein Structure ) Given: Amino acid sequence Goal: Predict 3-D structure (before experimental results published) E score modifed to reflect fit with predicted 2' structure Avg E score for same sequence shuffled (randomized) many times Yungok Ihm 46 Typical Results: (well, actually, our BEST Results): HO = #-Ranked CASP5 for this Target Overall Performance in CASP5 Contest ~8th out of 80 (M. Levitt, Stanford) Target 74 PDB ID = MG7 T74_ T74_2 Predicted Structure Actual Structure FR Fold Recognition (targets manually assessed by Nick Grishin) Rank Z-Score Ngood Npred NgNW NpNW Group-name Ginalski Skolnick Kolinski Baker BIOINFO.PL Shortle BAKER-ROBETTA Brooks Ho-Kai-Ming Jones-NewFold FR NgNW - number of good predictions without weighting for multiple models FR NpNW - number of total predictions without weighting for multiple models Cao, Ihm, Wang, BCB 444/544 Dobbs, F07 ISU Ho Dobbs #23 - Protein Tertiary Structure BCB 444/544 Fall 07 Dobbs 8

9 CASP - Check it out! Critical Assessment of Protein Structure CASP7 contest : Provides assessment of automated servers for protein structure prediction (LiveBench, CAFASP, EVA) & URLs for them Another Convenient List of Links for Protein Servers diction_software Related contests & resources: Protein Function (part of CASP) CAPRI = Critical Assessment of Predicted Interactions New: CASPM = CASP for M = Mutant proteins Predict effects of small (point) mutations, e.g., SNPs SECTION V Chp 3 - Protein Structure Visualization, Comparison & Classification Xiong: Chp 3 STRUCTURAL BIOINFORMATICS Protein Structure Visualization, Comparison & Classification Protein Structural Visualization Protein Structure Comparison Protein Structure Classification Protein Structure Comparison Methods 3 Basic Approaches for Aligning Structures (see Xiong textbook for details). Intermolecular 2. Intramolecular 3. Combined But, very active research area - many recent new methods 3 Popular Methods: DALI = Distance Matrix Alignment of Structures (Holm) FSSP Database SSAP = Sequential Structure Alignment Program (Orengo) CATH Database CE = Combinatorial Extension (Bourne) VAST at NCBI URLS: Another local example: Combining Structure, Machine Learning & "Real" (wet-lab) Experiments to Investigate the Lentiviral Rev Protein: A Step Toward New HIV Therapies Susan Carpenter (Washington State Univ) Wendy Sparks Yvonne Wannemuehler Drena Dobbs, GDCB Jae-Hyung Lee Michael Terribilini Kai-Ming Ho, Physics Yungok Ihm Haibo Cao Cai-zhuang Wang Gloria Culver, BBMB Laura Dutca Macromolecular interactions mediated by Rev protein in lentiviruses (HIV & EIAV) Cytoplasm Tat Rev Nucleus pre-mrna Spliceosome AAAA Provirus NUCLEAR IMPORT Early: Regulatory Proteins Rev AAAA (protein-protein) RNA BINDING (protein-rna) Rev Rev AAAA MULTIMERIZATION Rev Rev AAAA (protein-protein) NUCLEAR EXPORT (protein-protein) Late: Structural Proteins Progeny RNA 53 Susan Carpenter 54 BCB 444/544 Fall 07 Dobbs 9

10 Rev is essential for lentiviral replication Rev is a small nucleoplasmic shuttling protein (HIV Rev 5 aa; EIAV Rev 65 aa) Recognizes a specific binding site on viral RNA: Rev Responsive Element (RRE) Interacts with CRM to export incompletely spliced viral RNAs from nucleus to the cytoplasm Specific domains of Rev mediate nuclear localization, RNA binding, and nuclear export Critical role of Rev in lentiviral replication makes it an attractive target for antiviral (AIDs) therapy Problem: no high resolution Rev structure! not even for HIV Rev, despite intense effort ($$) Why?? Rev aggregates at concentrations needed for NMR or X- ray crystallography What about insights from sequence comparisons? "undetectable" sequence similarity among Revs from different lentiviruses (eg, EIAV vs HIV <0%) But: We know that lentiviral Rev proteins are functionally "homologous" - even in highly diverse lentiviruses Approach: Hypothesis: Rev proteins from diverse lentiviruses share structural features critical for function Computationally model structures of lentiviral Rev proteins - using structural threading algorithm (with Ho et al) Predict critical residues for RNA-binding, protein interaction - using machine learning algorithms (with Honavar et al ) Test model and predictions - using genetic/biochemical approaches (with Carpenter & Culver) - using biophysical approaches (with Andreotti & Yu groups) Initially: focus on EIAV Rev & RRE EIAV Rev HIV- Rev Functional domains: EIAV vs HIV Rev exon exon RBM Folding? RRDRW ERLE KRRRK 6 NLS/RBM RQARRNRRRRWR NLS - Nuclear Export Signal NLS - Nuclear Localization Signal RBM - putative RNA Binding Motif Predicted EIAV Rev Structure Comparison of Predicted Rev Structures EIAV FIV HIV SIV Dimer HIV Dimer Yungok IhmBCB Yungok IhmBCB /544 F07 ISU Dobbs #23 - Protein Tertiary Structure /544 F07 ISU Dobbs #23 - Protein Tertiary Structure BCB 444/544 Fall 07 Dobbs 0

11 Predicted vs Experimental Structure of N-terminal region of HIV Rev Location of functional residues EIAV Rev? A B C Leu36,45,49: On surface, consistent with role in nuclear export Leu95 & Leu09: Buried in core, critical hydrophic contacts for fold? Predicted Structure HIV Rev N-terminus NMR Structure Overlay HIV Rev N-terminal Alignment of Predicted Peptide & NMR Structures (Battiste & Williamson) Putative RBM Yungok IhmBCB Yungok IhmBCB /544 F07 ISU Dobbs #23 - Protein Tertiary Structure 6 444/544 F07 ISU Dobbs #23 - Protein Tertiary Structure Mutate hydrophobic residues predicted to be critical for helical packing in core Functional Analysis of Rev Structural Mutants in vivo (CAT assay) L65 vs L95 & L09 Single mutants: Leu to Ala Leu to Asp Double mutants: Leu to Ala Single Ala Mutation L A Single Asp Mutation L D Insert charged aa in hydrophobic core Negligible effect on Rev activity Dramatic change in Rev activity? L65 L95 L09 Activity of Rev Structural Mutants Double Ala Mutation L L A A Reduction in Rev activity? Sham pcdna3 RI A D L65 A D L95 A D L09 L65A L95A L65A L09A L95A L09A Controls Single Mutations Double Mutations Yungok Ihm 63 Wendy Sparks 64 EIAV Rev Functional domains: EIAV vs HIV Rev RBM Folding? Red - RNA interaction Green - Protein interaction - Nuclear Export Signal NLS - Nuclear Localization Signal RBM - putative RNA Binding Motif Putative RNA-binding Motifs & Predicted RNA-binding Residues Mapped onto Predicted EIAV Rev Structure ERLE KRRRK NLS Yungok Ihm RRDRW HIV- Rev RRDRW ERLE KRRRK ARRHLGPGPT QHTPSRRDRW IREQILQAEV LQERLEWRIR NLS/RBM RQARRNRRRRWR HFREDQRGDF SAWGDYQQAQ ERRWGEQSSP RVLRPGDSKRRRKHL Michael Terribilini 66 BCB 444/544 Fall 07 Dobbs

12 Express & purify -ERev deletion mutants -ERev binds specifically to RRE in vitro RBM Folding? -ERev NE S NLS UV crosslinking Competition BSA sense antisense BSA No protein No cold RRE Cold RRE Marker Undigested 32 P-RRE Jae-Hyung BCB Lee444/544 F07 ISU Dobbs #23 - Protein Tertiary Structure 67 Jae-Hyung BCB Lee444/544 F07 ISU Dobbs #23 - Protein Tertiary Structure 68 EIAV Rev: Binding s vs Experiments Structure PREDICTED: Protein binding residues RNA binding residues + + KRRRK RRDRW GPLESDQWCRVLRQSLPEEKISSQTCIARRHLGPGPTQHTPSRRDRWIREQILQAEVLQERLEWRI VALIDATED: QRGDFSAWGDYQQAQERRWGEQSSPRVLRPGDSKRRRKHL Protein binding residues RNA binding residues WT RBM FOLD? NLS/RBM RRDRW ERLE KRRRK Roles of Putative RNA Binding Motifs? RBD RBD NLS RRDRW ERLE KRRRK AADAA AALA KAAAK ERDE Jae-Hyung Lee Lee et al (2006) J Virol 80:3844 Terribilini et al (2006) PSB :45 Jae-Hyung Lee BCB /544 F07 ISU Dobbs #23 - Protein Tertiary Structure Rev RNA Binding Motifs: Predicted vs Experiment Structure PREDICTED: Protein binding residues RNA binding residues + + KRRRK RRDRW GPLESDQWCRVLRQSLPEEKISSQTCIARRHLGPGPTQHTPSRRDRWIREQILQAEVLQERLEWRI VALIDATED: QRGDFSAWGDYQQAQERRWGEQSSPRVLRPGDSKRRRKHL Protein binding residues RNA binding residues Jae-Hyung Lee WT AADAA AALA ERDE KAAAK RBM FOLD? RRDRW ERLE KRRRK AADAA AALA KAAAK ERDE NLS/RBM Summary: s vs Experiments KRRRK RRDRW GPLESDQWCRVLRQSLPEEKISSQTCIARRHLGPGPTQHTPSRRDRWIREQILQAEVLQERLEWRI Lee et al (2006) J Virol 80: ERLE Terribilini et al (2006) PSB : RBM FOLD NLS/RBM RRDRW ERLE KRRRK QRGDFSAWGDYQQAQERRWGEQSSPRVLRPGDSKRRRKHL BCB 444/544 Fall 07 Dobbs 2

13 Conclusions & Future Directions Combination of computational & wet lab approaches revealed that: EIAV Rev has a bipartite RNA binding domain Two Arg-rich RBMs are critical RRDRW in central region (but not ERLE) KRRRK at C-terminus, overlapping the NLS Based on computational modeling, the RBMs are in close proximity within the 3-D structure of protein Lentiviral Rev proteins & their cognate RRE binding sites may be more similar in structure than has been appreciated Future: Computational: Use Rev-RRE model system to discover "predictive rules" for protein-rna recognition Experimental? Experimentally determine the structure of Rev-RRE complex!!! Lee et al (2006) Terribilini et al (2006) J Virol 80:3844 BCB 444/544 PSB :45 F07 ISU Dobbs #23 - Protein Tertiary Structure Building Designer Zinc Finger DNA-binding Proteins J Sander, P Zaback, F Fu, J Townsend, R Winfrey D Wright, K Joung, L Miller, D Dobbs, D Voytas Chp 6 - RNA Structure SECTION V STRUCTURAL BIOINFORMATICS Xiong: Chp 6 RNA Structure (Terribilini) Introduction Types of RNA Structures RNA Secondary Structure Methods Ab Initio Approach Comparative Approach Performance Evaluation Wright et al (2006) Sander et al (2007) Nature Protocols Nucleic Acids Res 76 BCB 444/544 Fall 07 Dobbs 3

Protein Structure Prediction 11/11/05

Protein Structure Prediction 11/11/05 11/11/05 Protein Structure Prediction & Modeling Bioinformatics Seminars Nov 11 Fri 12:10 BCB Seminar in E164 Lago Building Supertrees Using Distances Steve Willson, Dept of Mathematics http://www.bcb.iastate.edu/courses/bcb691-f2005.html

More information

BCB 444/544 Fall 07 Dobbs 1

BCB 444/544 Fall 07 Dobbs 1 BCB 444/544 Required Reading (before lecture) Lecture 25 Mon Oct 15 - Lecture 23 Protein Tertiary Structure Prediction Chp 15 - pp 214-230 More RNA Structure Wed Oct 17 & Thurs Oct 18 - Lecture 24 & Lab

More information

BCB 444/544 Fall 07 Dobbs 1

BCB 444/544 Fall 07 Dobbs 1 BCB 444/544 Lecture 21 Protein Structure Visualization, Classification & Comparison Secondary Structure #21_Oct10 Required Reading (before lecture) Mon Oct 8 - Lecture 20 Protein Secondary Structure Chp

More information

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche

Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche The molecular structure of a protein can be broken down hierarchically. The primary structure of a protein is simply its

More information

CMPS 3110: Bioinformatics. Tertiary Structure Prediction

CMPS 3110: Bioinformatics. Tertiary Structure Prediction CMPS 3110: Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the laws of physics! Conformation space is finite

More information

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction CMPS 6630: Introduction to Computational Biology and Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the

More information

Basics of protein structure

Basics of protein structure Today: 1. Projects a. Requirements: i. Critical review of one paper ii. At least one computational result b. Noon, Dec. 3 rd written report and oral presentation are due; submit via email to bphys101@fas.harvard.edu

More information

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Introduction to Comparative Protein Modeling. Chapter 4 Part I Introduction to Comparative Protein Modeling Chapter 4 Part I 1 Information on Proteins Each modeling study depends on the quality of the known experimental data. Basis of the model Search in the literature

More information

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues

Programme Last week s quiz results + Summary Fold recognition Break Exercise: Modelling remote homologues Programme 8.00-8.20 Last week s quiz results + Summary 8.20-9.00 Fold recognition 9.00-9.15 Break 9.15-11.20 Exercise: Modelling remote homologues 11.20-11.40 Summary & discussion 11.40-12.00 Quiz 1 Feedback

More information

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748 CAP 5510: Introduction to Bioinformatics Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinfs07.html 2/15/07 CAP5510 1 EM Algorithm Goal: Find θ, Z that maximize Pr

More information

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.

Protein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron. Protein Dynamics The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron. Below is myoglobin hydrated with 350 water molecules. Only a small

More information

Protein Structures. 11/19/2002 Lecture 24 1

Protein Structures. 11/19/2002 Lecture 24 1 Protein Structures 11/19/2002 Lecture 24 1 All 3 figures are cartoons of an amino acid residue. 11/19/2002 Lecture 24 2 Peptide bonds in chains of residues 11/19/2002 Lecture 24 3 Angles φ and ψ in the

More information

CAP 5510 Lecture 3 Protein Structures

CAP 5510 Lecture 3 Protein Structures CAP 5510 Lecture 3 Protein Structures Su-Shing Chen Bioinformatics CISE 8/19/2005 Su-Shing Chen, CISE 1 Protein Conformation 8/19/2005 Su-Shing Chen, CISE 2 Protein Conformational Structures Hydrophobicity

More information

Protein Structures: Experiments and Modeling. Patrice Koehl

Protein Structures: Experiments and Modeling. Patrice Koehl Protein Structures: Experiments and Modeling Patrice Koehl Structural Bioinformatics: Proteins Proteins: Sources of Structure Information Proteins: Homology Modeling Proteins: Ab initio prediction Proteins:

More information

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror Protein structure prediction CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror 1 Outline Why predict protein structure? Can we use (pure) physics-based methods? Knowledge-based methods Two major

More information

114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009

114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009 114 Grundlagen der Bioinformatik, SS 09, D. Huson, July 6, 2009 9 Protein tertiary structure Sources for this chapter, which are all recommended reading: D.W. Mount. Bioinformatics: Sequences and Genome

More information

D Dobbs ISU - BCB 444/544X 1

D Dobbs ISU - BCB 444/544X 1 11/7/05 Protein Structure: Classification, Databases, Visualization Announcements BCB 544 Projects - Important Dates: Nov 2 Wed noon - Project proposals due to David/Drena Nov 4 Fri PM - Approvals/responses

More information

ALL LECTURES IN SB Introduction

ALL LECTURES IN SB Introduction 1. Introduction 2. Molecular Architecture I 3. Molecular Architecture II 4. Molecular Simulation I 5. Molecular Simulation II 6. Bioinformatics I 7. Bioinformatics II 8. Prediction I 9. Prediction II ALL

More information

Homology Modeling. Roberto Lins EPFL - summer semester 2005

Homology Modeling. Roberto Lins EPFL - summer semester 2005 Homology Modeling Roberto Lins EPFL - summer semester 2005 Disclaimer: course material is mainly taken from: P.E. Bourne & H Weissig, Structural Bioinformatics; C.A. Orengo, D.T. Jones & J.M. Thornton,

More information

Protein Structure Prediction, Engineering & Design CHEM 430

Protein Structure Prediction, Engineering & Design CHEM 430 Protein Structure Prediction, Engineering & Design CHEM 430 Eero Saarinen The free energy surface of a protein Protein Structure Prediction & Design Full Protein Structure from Sequence - High Alignment

More information

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB Homology Modeling (Comparative Structure Modeling) Aims of Structural Genomics High-throughput 3D structure determination and analysis To determine or predict the 3D structures of all the proteins encoded

More information

#33 - Genomics 11/09/07

#33 - Genomics 11/09/07 BCB 444/544 Required Reading (before lecture) Lecture 33 Mon Nov 5 - Lecture 31 Phylogenetics Parsimony and ML Chp 11 - pp 142 169 Genomics Wed Nov 7 - Lecture 32 Machine Learning Fri Nov 9 - Lecture 33

More information

Molecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007

Molecular Modeling. Prediction of Protein 3D Structure from Sequence. Vimalkumar Velayudhan. May 21, 2007 Molecular Modeling Prediction of Protein 3D Structure from Sequence Vimalkumar Velayudhan Jain Institute of Vocational and Advanced Studies May 21, 2007 Vimalkumar Velayudhan Molecular Modeling 1/23 Outline

More information

Introduction to Computational Structural Biology

Introduction to Computational Structural Biology Introduction to Computational Structural Biology Part I 1. Introduction The disciplinary character of Computational Structural Biology The mathematical background required and the topics covered Bibliography

More information

Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program)

Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program) Syllabus of BIOINF 528 (2017 Fall, Bioinformatics Program) Course Name: Structural Bioinformatics Course Description: Instructor: This course introduces fundamental concepts and methods for structural

More information

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror

Protein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror Protein structure prediction CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror 1 Outline Why predict protein structure? Can we use (pure) physics-based methods? Knowledge-based methods Two major

More information

Protein Structure Prediction

Protein Structure Prediction Page 1 Protein Structure Prediction Russ B. Altman BMI 214 CS 274 Protein Folding is different from structure prediction --Folding is concerned with the process of taking the 3D shape, usually based on

More information

Bioinformatics. Macromolecular structure

Bioinformatics. Macromolecular structure Bioinformatics Macromolecular structure Contents Determination of protein structure Structure databases Secondary structure elements (SSE) Tertiary structure Structure analysis Structure alignment Domain

More information

A threading approach to protein structure prediction: studies on TNF-like molecules, Rev proteins, and protein kinases

A threading approach to protein structure prediction: studies on TNF-like molecules, Rev proteins, and protein kinases Retrospective Theses and Dissertations 2004 A threading approach to protein structure prediction: studies on TNF-like molecules, Rev proteins, and protein kinases Yungok Ihm Iowa State University Follow

More information

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION

THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION THE TANGO ALGORITHM: SECONDARY STRUCTURE PROPENSITIES, STATISTICAL MECHANICS APPROXIMATION AND CALIBRATION Calculation of turn and beta intrinsic propensities. A statistical analysis of a protein structure

More information

CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Giri Narasimhan

CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Giri Narasimhan CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinff18.html Proteins and Protein Structure

More information

Introduction to" Protein Structure

Introduction to Protein Structure Introduction to" Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Learning Objectives Outline the basic levels of protein structure.

More information

Syllabus BINF Computational Biology Core Course

Syllabus BINF Computational Biology Core Course Course Description Syllabus BINF 701-702 Computational Biology Core Course BINF 701/702 is the Computational Biology core course developed at the KU Center for Computational Biology. The course is designed

More information

Protein structure alignments

Protein structure alignments Protein structure alignments Proteins that fold in the same way, i.e. have the same fold are often homologs. Structure evolves slower than sequence Sequence is less conserved than structure If BLAST gives

More information

Protein Structure Prediction

Protein Structure Prediction Protein Structure Prediction Michael Feig MMTSB/CTBP 2006 Summer Workshop From Sequence to Structure SEALGDTIVKNA Ab initio Structure Prediction Protocol Amino Acid Sequence Conformational Sampling to

More information

SCOP. all-β class. all-α class, 3 different folds. T4 endonuclease V. 4-helical cytokines. Globin-like

SCOP. all-β class. all-α class, 3 different folds. T4 endonuclease V. 4-helical cytokines. Globin-like SCOP all-β class 4-helical cytokines T4 endonuclease V all-α class, 3 different folds Globin-like TIM-barrel fold α/β class Profilin-like fold α+β class http://scop.mrc-lmb.cam.ac.uk/scop CATH Class, Architecture,

More information

Template-Based Modeling of Protein Structure

Template-Based Modeling of Protein Structure Template-Based Modeling of Protein Structure David Constant Biochemistry 218 December 11, 2011 Introduction. Much can be learned about the biology of a protein from its structure. Simply put, structure

More information

Week 10: Homology Modelling (II) - HHpred

Week 10: Homology Modelling (II) - HHpred Week 10: Homology Modelling (II) - HHpred Course: Tools for Structural Biology Fabian Glaser BKU - Technion 1 2 Identify and align related structures by sequence methods is not an easy task All comparative

More information

Building 3D models of proteins

Building 3D models of proteins Building 3D models of proteins Why make a structural model for your protein? The structure can provide clues to the function through structural similarity with other proteins With a structure it is easier

More information

The protein folding problem consists of two parts:

The protein folding problem consists of two parts: Energetics and kinetics of protein folding The protein folding problem consists of two parts: 1)Creating a stable, well-defined structure that is significantly more stable than all other possible structures.

More information

Statistical Machine Learning Methods for Bioinformatics IV. Neural Network & Deep Learning Applications in Bioinformatics

Statistical Machine Learning Methods for Bioinformatics IV. Neural Network & Deep Learning Applications in Bioinformatics Statistical Machine Learning Methods for Bioinformatics IV. Neural Network & Deep Learning Applications in Bioinformatics Jianlin Cheng, PhD Department of Computer Science University of Missouri, Columbia

More information

Analysis and Prediction of Protein Structure (I)

Analysis and Prediction of Protein Structure (I) Analysis and Prediction of Protein Structure (I) Jianlin Cheng, PhD School of Electrical Engineering and Computer Science University of Central Florida 2006 Free for academic use. Copyright @ Jianlin Cheng

More information

Template Free Protein Structure Modeling Jianlin Cheng, PhD

Template Free Protein Structure Modeling Jianlin Cheng, PhD Template Free Protein Structure Modeling Jianlin Cheng, PhD Associate Professor Computer Science Department Informatics Institute University of Missouri, Columbia 2013 Protein Energy Landscape & Free Sampling

More information

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University COMP 598 Advanced Computational Biology Methods & Research Introduction Jérôme Waldispühl School of Computer Science McGill University General informations (1) Office hours: by appointment Office: TR3018

More information

Supporting Online Material for

Supporting Online Material for www.sciencemag.org/cgi/content/full/309/5742/1868/dc1 Supporting Online Material for Toward High-Resolution de Novo Structure Prediction for Small Proteins Philip Bradley, Kira M. S. Misura, David Baker*

More information

Template Free Protein Structure Modeling Jianlin Cheng, PhD

Template Free Protein Structure Modeling Jianlin Cheng, PhD Template Free Protein Structure Modeling Jianlin Cheng, PhD Professor Department of EECS Informatics Institute University of Missouri, Columbia 2018 Protein Energy Landscape & Free Sampling http://pubs.acs.org/subscribe/archive/mdd/v03/i09/html/willis.html

More information

Amino Acid Structures from Klug & Cummings. 10/7/2003 CAP/CGS 5991: Lecture 7 1

Amino Acid Structures from Klug & Cummings. 10/7/2003 CAP/CGS 5991: Lecture 7 1 Amino Acid Structures from Klug & Cummings 10/7/2003 CAP/CGS 5991: Lecture 7 1 Amino Acid Structures from Klug & Cummings 10/7/2003 CAP/CGS 5991: Lecture 7 2 Amino Acid Structures from Klug & Cummings

More information

Protein Structure Determination

Protein Structure Determination Protein Structure Determination Given a protein sequence, determine its 3D structure 1 MIKLGIVMDP IANINIKKDS SFAMLLEAQR RGYELHYMEM GDLYLINGEA 51 RAHTRTLNVK QNYEEWFSFV GEQDLPLADL DVILMRKDPP FDTEFIYATY 101

More information

Copyright Mark Brandt, Ph.D A third method, cryogenic electron microscopy has seen increasing use over the past few years.

Copyright Mark Brandt, Ph.D A third method, cryogenic electron microscopy has seen increasing use over the past few years. Structure Determination and Sequence Analysis The vast majority of the experimentally determined three-dimensional protein structures have been solved by one of two methods: X-ray diffraction and Nuclear

More information

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES Protein Structure W. M. Grogan, Ph.D. OBJECTIVES 1. Describe the structure and characteristic properties of typical proteins. 2. List and describe the four levels of structure found in proteins. 3. Relate

More information

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding?

Outline. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Unfolded Folded. What is protein folding? The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation By Jun Shimada and Eugine Shaknovich Bill Hawse Dr. Bahar Elisa Sandvik and Mehrdad Safavian Outline Background on protein

More information

Sequence analysis and comparison

Sequence analysis and comparison The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species

More information

Course Notes: Topics in Computational. Structural Biology.

Course Notes: Topics in Computational. Structural Biology. Course Notes: Topics in Computational Structural Biology. Bruce R. Donald June, 2010 Copyright c 2012 Contents 11 Computational Protein Design 1 11.1 Introduction.........................................

More information

1-D Predictions. Prediction of local features: Secondary structure & surface exposure

1-D Predictions. Prediction of local features: Secondary structure & surface exposure 1-D Predictions Prediction of local features: Secondary structure & surface exposure 1 Learning Objectives After today s session you should be able to: Explain the meaning and usage of the following local

More information

Protein Structure Prediction

Protein Structure Prediction Protein Structure Prediction Michael Feig MMTSB/CTBP 2009 Summer Workshop From Sequence to Structure SEALGDTIVKNA Folding with All-Atom Models AAQAAAAQAAAAQAA All-atom MD in general not succesful for real

More information

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall How do we go from an unfolded polypeptide chain to a Lecture 11: Protein Folding & Stability Margaret A. Daugherty Fall 2004 How do we go from an unfolded polypeptide chain to a compact folded protein? (Folding of thioredoxin, F. Richards) Structure - Function

More information

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE

Examples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE Examples of Protein Modeling Protein Modeling Visualization Examination of an experimental structure to gain insight about a research question Dynamics To examine the dynamics of protein structures To

More information

HMM applications. Applications of HMMs. Gene finding with HMMs. Using the gene finder

HMM applications. Applications of HMMs. Gene finding with HMMs. Using the gene finder HMM applications Applications of HMMs Gene finding Pairwise alignment (pair HMMs) Characterizing protein families (profile HMMs) Predicting membrane proteins, and membrane protein topology Gene finding

More information

FlexPepDock In a nutshell

FlexPepDock In a nutshell FlexPepDock In a nutshell All Tutorial files are located in http://bit.ly/mxtakv FlexPepdock refinement Step 1 Step 3 - Refinement Step 4 - Selection of models Measure of fit FlexPepdock Ab-initio Step

More information

3D Structure. Prediction & Assessment Pt. 2. David Wishart 3-41 Athabasca Hall

3D Structure. Prediction & Assessment Pt. 2. David Wishart 3-41 Athabasca Hall 3D Structure Prediction & Assessment Pt. 2 David Wishart 3-41 Athabasca Hall david.wishart@ualberta.ca Objectives Become familiar with methods and algorithms for secondary Structure Prediction Become familiar

More information

Details of Protein Structure

Details of Protein Structure Details of Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Anne Mølgaard, Kemisk Institut, Københavns Universitet Learning Objectives

More information

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Part I. Review of forces Covalent bonds Non-covalent Interactions: Van der Waals Interactions

More information

Computational Molecular Biology (

Computational Molecular Biology ( Computational Molecular Biology (http://cmgm cmgm.stanford.edu/biochem218/) Biochemistry 218/Medical Information Sciences 231 Douglas L. Brutlag, Lee Kozar Jimmy Huang, Josh Silverman Lecture Syllabus

More information

RNA and Protein Structure Prediction

RNA and Protein Structure Prediction RNA and Protein Structure Prediction Bioinformatics: Issues and Algorithms CSE 308-408 Spring 2007 Lecture 18-1- Outline Multi-Dimensional Nature of Life RNA Secondary Structure Prediction Protein Structure

More information

Bioinformatics: Secondary Structure Prediction

Bioinformatics: Secondary Structure Prediction Bioinformatics: Secondary Structure Prediction Prof. David Jones d.jones@cs.ucl.ac.uk LMLSTQNPALLKRNIIYWNNVALLWEAGSD The greatest unsolved problem in molecular biology:the Protein Folding Problem? Entries

More information

Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University

Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University Alpha-helical Topology and Tertiary Structure Prediction of Globular Proteins Scott R. McAllister Christodoulos A. Floudas Princeton University Department of Chemical Engineering Program of Applied and

More information

Grundlagen der Bioinformatik Summer semester Lecturer: Prof. Daniel Huson

Grundlagen der Bioinformatik Summer semester Lecturer: Prof. Daniel Huson Grundlagen der Bioinformatik, SS 10, D. Huson, April 12, 2010 1 1 Introduction Grundlagen der Bioinformatik Summer semester 2010 Lecturer: Prof. Daniel Huson Office hours: Thursdays 17-18h (Sand 14, C310a)

More information

Getting To Know Your Protein

Getting To Know Your Protein Getting To Know Your Protein Comparative Protein Analysis: Part III. Protein Structure Prediction and Comparison Robert Latek, PhD Sr. Bioinformatics Scientist Whitehead Institute for Biomedical Research

More information

09/06/25. Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Non-uniform distribution of folds. Scheme of protein structure predicition

09/06/25. Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Non-uniform distribution of folds. Scheme of protein structure predicition Sequence identity Structural similarity Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Fold recognition Sommersemester 2009 Peter Güntert Structural similarity X Sequence identity Non-uniform

More information

F. Piazza Center for Molecular Biophysics and University of Orléans, France. Selected topic in Physical Biology. Lecture 1

F. Piazza Center for Molecular Biophysics and University of Orléans, France. Selected topic in Physical Biology. Lecture 1 Zhou Pei-Yuan Centre for Applied Mathematics, Tsinghua University November 2013 F. Piazza Center for Molecular Biophysics and University of Orléans, France Selected topic in Physical Biology Lecture 1

More information

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment Algorithms in Bioinformatics FOUR Sami Khuri Department of Computer Science San José State University Pairwise Sequence Alignment Homology Similarity Global string alignment Local string alignment Dot

More information

Protein Structure Basics

Protein Structure Basics Protein Structure Basics Presented by Alison Fraser, Christine Lee, Pradhuman Jhala, Corban Rivera Importance of Proteins Muscle structure depends on protein-protein interactions Transport across membranes

More information

GCD3033:Cell Biology. Transcription

GCD3033:Cell Biology. Transcription Transcription Transcription: DNA to RNA A) production of complementary strand of DNA B) RNA types C) transcription start/stop signals D) Initiation of eukaryotic gene expression E) transcription factors

More information

Modeling for 3D structure prediction

Modeling for 3D structure prediction Modeling for 3D structure prediction What is a predicted structure? A structure that is constructed using as the sole source of information data obtained from computer based data-mining. However, mixing

More information

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy Design of a Novel Globular Protein Fold with Atomic-Level Accuracy Brian Kuhlman, Gautam Dantas, Gregory C. Ireton, Gabriele Varani, Barry L. Stoddard, David Baker Presented by Kate Stafford 4 May 05 Protein

More information

Assignment 2 Atomic-Level Molecular Modeling

Assignment 2 Atomic-Level Molecular Modeling Assignment 2 Atomic-Level Molecular Modeling CS/BIOE/CME/BIOPHYS/BIOMEDIN 279 Due: November 3, 2016 at 3:00 PM The goal of this assignment is to understand the biological and computational aspects of macromolecular

More information

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769

Dihedral Angles. Homayoun Valafar. Department of Computer Science and Engineering, USC 02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC The precise definition of a dihedral or torsion angle can be found in spatial geometry Angle between to planes Dihedral

More information

Sequence and Structure Alignment Z. Luthey-Schulten, UIUC Pittsburgh, 2006 VMD 1.8.5

Sequence and Structure Alignment Z. Luthey-Schulten, UIUC Pittsburgh, 2006 VMD 1.8.5 Sequence and Structure Alignment Z. Luthey-Schulten, UIUC Pittsburgh, 2006 VMD 1.8.5 Why Look at More Than One Sequence? 1. Multiple Sequence Alignment shows patterns of conservation 2. What and how many

More information

Lecture 11: Protein Folding & Stability

Lecture 11: Protein Folding & Stability Structure - Function Protein Folding: What we know Lecture 11: Protein Folding & Stability 1). Amino acid sequence dictates structure. 2). The native structure represents the lowest energy state for a

More information

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding

Protein Folding & Stability. Lecture 11: Margaret A. Daugherty. Fall Protein Folding: What we know. Protein Folding Lecture 11: Protein Folding & Stability Margaret A. Daugherty Fall 2003 Structure - Function Protein Folding: What we know 1). Amino acid sequence dictates structure. 2). The native structure represents

More information

1. Protein Data Bank (PDB) 1. Protein Data Bank (PDB)

1. Protein Data Bank (PDB) 1. Protein Data Bank (PDB) Protein structure databases; visualization; and classifications 1. Introduction to Protein Data Bank (PDB) 2. Free graphic software for 3D structure visualization 3. Hierarchical classification of protein

More information

Short Announcements. 1 st Quiz today: 15 minutes. Homework 3: Due next Wednesday.

Short Announcements. 1 st Quiz today: 15 minutes. Homework 3: Due next Wednesday. Short Announcements 1 st Quiz today: 15 minutes Homework 3: Due next Wednesday. Next Lecture, on Visualizing Molecular Dynamics (VMD) by Klaus Schulten Today s Lecture: Protein Folding, Misfolding, Aggregation

More information

InDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9

InDel 3-5. InDel 8-9. InDel 3-5. InDel 8-9. InDel InDel 8-9 Lecture 5 Alignment I. Introduction. For sequence data, the process of generating an alignment establishes positional homologies; that is, alignment provides the identification of homologous phylogenetic

More information

Bioinformatics. Dept. of Computational Biology & Bioinformatics

Bioinformatics. Dept. of Computational Biology & Bioinformatics Bioinformatics Dept. of Computational Biology & Bioinformatics 3 Bioinformatics - play with sequences & structures Dept. of Computational Biology & Bioinformatics 4 ORGANIZATION OF LIFE ROLE OF BIOINFORMATICS

More information

Steps in protein modelling. Structure prediction, fold recognition and homology modelling. Basic principles of protein structure

Steps in protein modelling. Structure prediction, fold recognition and homology modelling. Basic principles of protein structure Structure prediction, fold recognition and homology modelling Marjolein Thunnissen Lund September 2012 Steps in protein modelling 3-D structure known Comparative Modelling Sequence of interest Similarity

More information

Can protein model accuracy be. identified? NO! CBS, BioCentrum, Morten Nielsen, DTU

Can protein model accuracy be. identified? NO! CBS, BioCentrum, Morten Nielsen, DTU Can protein model accuracy be identified? Morten Nielsen, CBS, BioCentrum, DTU NO! Identification of Protein-model accuracy Why is it important? What is accuracy RMSD, fraction correct, Protein model correctness/quality

More information

Bioinformatics: Secondary Structure Prediction

Bioinformatics: Secondary Structure Prediction Bioinformatics: Secondary Structure Prediction Prof. David Jones d.t.jones@ucl.ac.uk Possibly the greatest unsolved problem in molecular biology: The Protein Folding Problem MWMPPRPEEVARK LRRLGFVERMAKG

More information

Protein Structure: Data Bases and Classification Ingo Ruczinski

Protein Structure: Data Bases and Classification Ingo Ruczinski Protein Structure: Data Bases and Classification Ingo Ruczinski Department of Biostatistics, Johns Hopkins University Reference Bourne and Weissig Structural Bioinformatics Wiley, 2003 More References

More information

Protein Folding Prof. Eugene Shakhnovich

Protein Folding Prof. Eugene Shakhnovich Protein Folding Eugene Shakhnovich Department of Chemistry and Chemical Biology Harvard University 1 Proteins are folded on various scales As of now we know hundreds of thousands of sequences (Swissprot)

More information

Prediction and refinement of NMR structures from sparse experimental data

Prediction and refinement of NMR structures from sparse experimental data Prediction and refinement of NMR structures from sparse experimental data Jeff Skolnick Director Center for the Study of Systems Biology School of Biology Georgia Institute of Technology Overview of talk

More information

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison CMPS 6630: Introduction to Computational Biology and Bioinformatics Structure Comparison Protein Structure Comparison Motivation Understand sequence and structure variability Understand Domain architecture

More information

Ab-initio protein structure prediction

Ab-initio protein structure prediction Ab-initio protein structure prediction Jaroslaw Pillardy Computational Biology Service Unit Cornell Theory Center, Cornell University Ithaca, NY USA Methods for predicting protein structure 1. Homology

More information

HIV protease inhibitor. Certain level of function can be found without structure. But a structure is a key to understand the detailed mechanism.

HIV protease inhibitor. Certain level of function can be found without structure. But a structure is a key to understand the detailed mechanism. Proteins are linear polypeptide chains (one or more) Building blocks: 20 types of amino acids. Range from a few 10s-1000s They fold into varying three-dimensional shapes structure medicine Certain level

More information

Protein quality assessment

Protein quality assessment Protein quality assessment Speaker: Renzhi Cao Advisor: Dr. Jianlin Cheng Major: Computer Science May 17 th, 2013 1 Outline Introduction Paper1 Paper2 Paper3 Discussion and research plan Acknowledgement

More information

Motivating the need for optimal sequence alignments...

Motivating the need for optimal sequence alignments... 1 Motivating the need for optimal sequence alignments... 2 3 Note that this actually combines two objectives of optimal sequence alignments: (i) use the score of the alignment o infer homology; (ii) use

More information

BCMP 201 Protein biochemistry

BCMP 201 Protein biochemistry BCMP 201 Protein biochemistry BCMP 201 Protein biochemistry with emphasis on the interrelated roles of protein structure, catalytic activity, and macromolecular interactions in biological processes. The

More information

Biomolecules: lecture 10

Biomolecules: lecture 10 Biomolecules: lecture 10 - understanding in detail how protein 3D structures form - realize that protein molecules are not static wire models but instead dynamic, where in principle every atom moves (yet

More information

GC and CELPP: Workflows and Insights

GC and CELPP: Workflows and Insights GC and CELPP: Workflows and Insights Xianjin Xu, Zhiwei Ma, Rui Duan, Xiaoqin Zou Dalton Cardiovascular Research Center, Department of Physics and Astronomy, Department of Biochemistry, & Informatics Institute

More information

Homology models of the tetramerization domain of six eukaryotic voltage-gated potassium channels Kv1.1-Kv1.6

Homology models of the tetramerization domain of six eukaryotic voltage-gated potassium channels Kv1.1-Kv1.6 Homology models of the tetramerization domain of six eukaryotic voltage-gated potassium channels Kv1.1-Kv1.6 Hsuan-Liang Liu* and Chin-Wen Chen Department of Chemical Engineering and Graduate Institute

More information

Structural Alignment of Proteins

Structural Alignment of Proteins Goal Align protein structures Structural Alignment of Proteins 1 2 3 4 5 6 7 8 9 10 11 12 13 14 PHE ASP ILE CYS ARG LEU PRO GLY SER ALA GLU ALA VAL CYS PHE ASN VAL CYS ARG THR PRO --- --- --- GLU ALA ILE

More information