remembering Secondary Structures Does everyone know what the backbone and residue/side chains are? Clear about 1, 2 3 structures?
|
|
- Emery Vincent Holmes
- 5 years ago
- Views:
Transcription
1 remembering Secondary Structures add blast Does everyone know what the backbone residue/side chains are? Clear about 1, 2 3 structures? Heteropolymer - + Mostly in regular secondary structure + - Secondary structure can be defined by phi psi angles or by hydrogen bond patterns. A connected sequence amino must have the right phi-psi angles to make the secondary structure. Gennis-protein structure
2 .*1..*+/'-+3+//+-'27+'-T$PX<$^l#$Th$R+5)+"8(/+G1,34+"15 =*+4'G<[h4(5: 6(742)(5.1'5%[/+-'27+-$ ;*+-'2+)*1'5-(8.*+ =*+4'G-.')F(7."15:4+2.(61/2-.*+1,'5(=.+/,'57-(8.*+ *+4'GCX.>1(')*5)JD$H8.*+*+4'G'-B'+6+28/(,.*+_=.+/,'57-2(65.*+*+4'G1G'-".*+ -'2+)*1'5-)159+(9-+/B+21522'-30+2'5.*+8(/,(81*+4')*++4$S5+G1,34+(8.*'-'--*(65'5X.>1(')*5)J$M'5)+'..1F+-<$^/+-'27+-.(,1F+1)(, /5"+1)* /+-'27+'-/(.1.+2#[[(6'.*/+-3+)..('.-5+1/+-.5+':*9(/-'5.*+*+4'G$H8.*+/+-'27+- *1B+3(/)*1/1).+/(5(5+-'2+1525(5=3(/)*1/1).+/(5.*+(33(-'.+-'2+"1-'5.*+ +G1,34+-*(65'5X.>1(')*5)J".*+*+4'G'--1'2.(9+$A%#.%$&#.2$ move around not quite 1 C >N C >N C2 >N6 8.&#&#')P('4.01'44#/8-$=/7'5?#.4'Q$A%9'4#/$-$A%#.%$&#.2#'9.QK8.&# %/9$($-0-/-@%/9$(('4.01'4/-/%%/4.&'4.0'45?#'#'9.2$98#''9.4&$H'-3(/A$ _$7$,%%9'&8(.&&'-=:U08$(0`5IaY'.9$-0"#$(9'4F5](.4#$ACb-.7'(4.&:/3 c.(>.-.$.-"#$(9/&&'47.99'kc.(>.-.$e+ #&&%+VV2&.5.&25c.(>.-.$5UDbVd2A>VD'A/V8#''9V8#''9,%%5#&A95 C1 >N5 What s the pattern? Ci>Ni+? Gennis %T notice up-down-up-down Each side can have different properties the amino are on the outside 1f3c the boxes show amino
3 Motifs : Mruzin et at JMB 1995 (Cyrus Chothia) Cath: Orengo et al Structure 1997 (Janet Thornton) Each starts with domains Homolog - Phylogenetically related - derived from a common ancestor gene. (PFAM-genes; e/cathstructures). ifications are descriptive; Structural diversity comes from evolution - we use classification to deduce evolutionary relationships. Ortholog - retain the same function Paralog - function has diverged Family: related with similar function Superfamily: related with different functions Gennis: structure Gennis: structure Generally with very similar sequences you have a very similar fold. There are RARE cases where this is not true (even with different folds under different conditions) Proteins with similar fold can have different sequence. Fold is more conserved than sequence & convergent evolution can lead to similar folds. Proteins are made domains. A domain is a structural an evolutionary unit. They have 50-0 residues. Compact folded unit, quasi-independent structurally functionally Domains that are or come from a common ancestor. similar sequence - family diverged sequence but similar fold function - superfamily Chothia Gough (Biochem J (09) 9, 15-28
4 SCIENCE VOL JUNE 03 Fig. 2. An example a supradomain. The P-loop containing NTP hydrolase domain the Translation Proteins domain (5) occur in prokaryotic eukaryotic translation factors that hydrolyze guanosine triphosphate (GTP). GTP hydrolysis in the P-loop domain drives the conformational change in the Translation Proteins domain, which is then transmitted onto the ribosome. The supradomain occurs in 35 different domain architectures, 6 these are given here. The nset at left shows a protein known structure, which contains the supradomain. IF, initiation actor; EF, elongation factor; RF, factor; trna, transfer RNA. Figure 3. Glycosyl Hydrolases (A C) (A) (1/3)-b-glucanase (Varghese et al., 1994) represents the basic (Trans) glycosidases superfamily (c.1.8). Homologous catalytic domains are found in (B) b-glucuronidase (C) b-galactosidase. (B) In b-glucuronidase (Jain et al., 1996), the catalytic domain is 3 (in red) is joined by two other domains: 1 restricts the binding site, 2 links 1 to 3. (C) b-galactosidase.the first three domains have the same structure as b-glucuronidase (Jacobson et al., 1994). Domain 4 links domain 3 to 5, which contributes to the active site. Bashton Chothia: structure 15: (07) Dominant mechanisms that produce new are Duplication the genes old divergence these sequences to produce modified functions combination genes to further modify properties Many are multi-domain. Chothia Gough (Biochem J (09) 9, CATH : α,β,αβ Architecture: gross arrangement 2 structure independent connectivity Topology: Fold family linking 2 structure Fold=Topology 3 Homologous superfamily structure similar function similar 2386 Sequence family >35% identity domains : α,β,αβ,α+β Fold same 2 structure elements same topology not related Superfamily Common evolutionary origin low seq identity 1962 Family >30% identical or >15% with same function 92 ification: based on structure sequence (C-level): secondary structure composition contacts. The first, most general level the classification, class, describes the relative content α helices β sheets in a similar way to that described by Levitt Chothia [29], except that we only define three major classes mainly α, mainly β α β. Although the latter class can be sub- divided into alternating α/β α+β, in CATH, this information is considered at a lower level describing topology. Architecture (A-level): description the gross arrangement secondary structures, independent connectivity This level distinguishes structures in the same class with different architectures, but does not distinguish between different topologies (connectivities). The architectural groupings can sometimes be rather broad as they describe general features protein-fold shape, for example, the number layers in an α-β swich. A given architecture will contain structures with diverse connectivities (see Figure 2) which will be distinguished at the next level down (topology). For example, in the α-β class (C = 3), there are two common architectures each containing a large number different fold. One is the barrel- like architecture (A = ) adopted, (egtim-barrel folds). These have an inner β barrel an outer layer α helices (Figure 2). Alternatively, the three-layer α-β swich architecture (A = 40) consists a central β sheet which is covered by a layer α helices on both sides the sheet (Figure 2). Topology (T-level): fold Structures which are grouped at the T-level have the same overall fold, which means that they have a similar number arrangement secondary structures that the connectivity linking their secondary structure elements is the same. In this paper, the words fold topology have the same meaning. Proteins with the same CAT numbers have the same class, architecture topology but do not necessarily belong to the same homologous superfamily.within a given topology level, the structures are similar, but may have diverse functions. Homologous superfamily (H-level): highly similar structures functional similarity At the H-level, structures are grouped by their high structural similarity similar functions, which suggest that they may have evolved from a common ancestor, particularly, where there are resemblances in core packing or putative active sites. Using the example the mainly α.non-bundle. globin-like folds the erythrocruorins, colicins, phycocya- nins domain 1 diptheria toxin all have the same CAT number (1..340), but are differentiated by their H numbers,, 30 40, respectively (see Figure 3). Sequence family (S-level): significant sequence similarity thus a high probability having similar structure/function Members which are clustered at this level (having the same CATHS number) have sequence identities >35% as such are presumed to have extremely similar structures functions they may be slightly different examples the same protein from different species belonging to the same sequence superfamily. Some have many protein domains found (9 take up % the human genome) others have few. There are in animals; bacteria Many are found in all kingdoms life. Chothia Gough (Biochem J (09) 9, 15-28
5 Structural ification Proteins Structural ification Proteins : :Structural Structuralification ification PDB PDB (1 (1Aug Aug1998). 1998) Domains. Domains ification ification Orengo :93 et at: Structure folds folds Structural ification Proteins Structural ification Proteins 25 : :Structural Structural ification ification : Structural ification PDB PDB (1(1Aug 1998) Domains. Domains ). 70Aug PDB (23 Feb 09) Domains Reference : Structural ification PDB (23 Feb 09) Domains. 1 Reference 113 folds folds : Structural ification folds PDB (23 Feb ification 09) Reference Domains. 6 : Structural surface 12 folds PDB (23Feb 09) Domains. 1 Reference ification ification CATH version 3.3 (class, architecture, topology, homology) contains domains, 2386 homologous 3 fold groups folds surface 12 folds surface surface surface surface : :Structural Structuralification ification Nucleic Acids Research 01 Alison L Cuff, Ian Sillitoe, Tony Lewis, Andrew B Clegg, Robert Rentzsch, Nicholas Furnham, Marialuisa Pellegrini-Calace, David Jones, Janet Thornton, Christine A Orengo ification ification ification ification PDB PDB ( (Oct Oct1997). 1997) Domains. Domains.1 1 ification : :Structural Structuralification ification PDB PDB ( (Oct Oct1997). 1997) Domains. Domains.1 1 folds ification folds ification ification : Structural ification folds folds PDB (26 Sep 07). Reference Domains : Structural ification : Structural 1.73 ification PDB PDB (26 Sep 07) Reference Domains (26 Sep 07). 971 Domains Reference : Structural ification PDB (26 Sep 07). 11 Reference Domains folds surface folds folds 459 surface folds Page 1 9 Copyright The scop / /scop@mrc-lmb.cam.ac.uk Copyright The scopauthors authors scop@mrc-lmb.cam.ac.uk Page 1 9 Copyright The Copyright Thescop scopauthors authors/ /scop@mrc-lmb.cam.ac.uk scop@mrc-lmb.cam.ac.uk June June09 09 June June09 09 PagePage PagePage PagePage Figure 2 Schematic representation the class (C), architecture (A) topology (T) level in the C ATH database. Helices are drawn in blue strs are drawn as magenta arrows. The barrel, three-layer swich roll architectures (A-level) are shown for the α β class. Two representatives from fold in the three-layer swich architecture are shown. Cuff... Orengo Structure 17, 51 62, August 12, 09 The structural universe as revealed by CATH Research Article CATH: classification protein structures Orengo et al. 97 Figure 3 C ATH numbering scheme for representative structures from the globin-like fold family in the mainly α class. Four the seven levels within the C ATH database are shown, associated with C lass, Architecture, Topology, Homology. Each level is associated with a unique number. The (A), (T) (H) levels are numbered in bins ten to allow expansion the database. Architecture 1 Mainly α 2 Mainly β 3 4 Non-bundle α β Bundle Few SS 30 Few SS Topology Homology Variant surface glycoprotein Glucoamylase, domain 2 4 Globin-like 500 Although 5993 protein chains (80 domains) were selected for CATH from the September 1996 the PDB, sequence comparisons showed that approximately three quarters these were nearly identical (see Figure 1, Tables 1 2). CATH grouped the 80 domain structures into 21 non-identical (N-level). By assuming that with more than 35% sequence similarity adopt the same fold share a common evolutionary ancestor, this number can be reduced to 68 sequence (S-level). More distant relatives were added to these by searching for significantly high structural similarity related biological functions, thereby reducing the number homologous to 645 (H-level). If a lower degree structural similarity is allowed, these further group to give a total 505 1hlm Casein kinase δ.. 1cpc chain A col chain A 40 1ddt domain through overlap a large motif containing five β strs. This is not currently done in C ATH, as both are commonly referred to as separate folds in the literature. Mainly α.non-bundle.globin-like.1cpc chain A somewhat broad category, the fold within this barrel architecture all share a common structural feature comprising a single β sheet. Similarly, the three-layer α β swich architecture also represents a large grouping folds varying sizes (containing β sheets having from 4 to 17 β-strs). Again, all the folds within this architecture can be simply usefully described as having a central β sheet with layers α helices on each its sides (see Figure 2). Wherever possible, we have used architectural descriptions commonly cited in the literature. For a majority the folds (>80%) this was a simple straightforward process the architectural categories assigned agreed well with those given in other publicly available databases (e.g. [17]). For more complex shapes, no architecture was assigned, these folds were all placed in a single complex bin until alternative assignment methods are developed. Such methods will probably describe shape according to the diverse motifs contained within the fold the ways in which these motifs are combined in 3D space. The variety architectures that can easily be assigned by visual inspection is shown, for each class, in Figure 4. For mainly α only the familiar four-helix bundle architecture is easily distinguishable. Other helix arrangements Overview architectures The CATH architecture level is a subjective grouping folds having similar shape, regardless differences in scale or numbers secondary structures. For example, both the 5-stred barwin-like β-barrel folds the 17-stred porin-like β-barrel folds are assigned to the same general β-barrel architecture. Although this is a Illustration motif overlaps in the mainly β swich architecture. Each structure shown can be related to the central tenascin structure by a motif containing at least four β strs (although these are not sequential in the transthyretin structure) up to seven β strs in plastocyanin the -76 immunoglobulin variable domain structures. It can be seen that this results in the possible merging the immunoglobulin fold family (2rhe) the jelly-roll fold family (1tnfA) Figure 9 βlactamase, domain 2 5 fold (T-level), within which similarity may be a result divergent or convergent evolution. These fold are further grouped within CATH, automatically according to class manually according to their architecture giving a total 3 major classes 28 different architectures (see Table 1). Figure 1 shows the annual increase in structures structural for each level in the CATH hierarchy. 1TTF.pdb SHEET 1 SHEET 2 SHEET 3 SHEET 1 SHEET 2 SHEET 3 SHEET GLU A 9 THR A SER A 17 ASP A 23-1 O SER A 21 N GLU A THR A 56 SER A 60-1 N ALA A 57 O ILE A 2 4 GLN A 46 PRO A TYR A 31 GLU A 38-1 N TYR A 36 O GLN A VAL A THR A 76-1 N VAL A 75 O TYR A ILE A 88 THR A 94-1 N ILE A 88 O VAL A 72
6 14 Figure 9 Illustration motif overlaps in the mainly β swich architecture. Each structure shown can be related to the central tenascin structure by a motif containing at least four β strs (although these are not sequential in the transthyretin structure) up to seven β strs in plastocyanin the immunoglobulin variable domain structures. It can be seen that this results in the possible merging the immunoglobulin fold family (2rhe) the jelly-roll fold family (1tnfA) through overlap a large motif containing five β strs. This is not currently done in C ATH, as both are commonly referred to as separate folds in the literature. 10 Bron Tooze Figure 4 prediction (from sequence) methods. It would therefore seem more useful to subclassify these regions fold space according to more sophisticated criteria for structural similarity, in order to generate smaller containing closer relatives. These criteria would be based on recognising similarities within the cores the protein structures belonging to a particular family setting tolerances on the number allowed secondary structure indels. With this aim, a suite programs (CORA, COnsensus Residues Attributes) for analysing structural has been developed (CAO, unpublished data) will be applied to all the superfold within the CATH database. It is planned that future s CATH will assign structures to fold within the superarchitectures according to the diagnostics generated by CORA. Interestingly, an analysis the distribution domain structures in non-homologous multidomain showed that only 8.1% these domains occur also as single domain only 5% recur in other multidomain. Identification multidomains recurrence folds within multidomain Until we improve our understing structural constraints on secondary structure packing, the ideal a completely automatic approach generating self-consistent reproducible hierarchies at all levels is not feasible. Some approaches avoid this problem by clustering on the basis overlapping helices, strs or small common motifs, regardless 3D arrangement. Such motif-based classifications, however, are generally less useful for revealing global structural relationships between evolutionary related. We chose, therefore, to use an By applying the consensus domain boundary assignment procedure to all N-representatives (7 ) in the September 1996 version CATH, 74% the total number single domains (515) could be assigned automatically 21% the total number multidomain (272). Of those remaining unassigned, at least one the methods gave acceptable boundary assignments, needing no or only minor adjustments. Research Article CATH: classification pro Future developments: automatic architecture assignment The CATH architectural groupings are currently broad, general, categories that represent a preliminary classification which should significantly aid a future, more detailed analysis common architectural features. Although, these groups are assigned manually, other publicly available classifications have adopted a similar pragmatic approach, using a combination automatic manual approaches where appropriate (, DIAL [17,27]). Research Article CATH: classification protein structures Orengo et al. Research Article CATH: classification protein structures Orengo et al. 11 Research Article CATH: classification protein structures Orengo et al architectures identified for the mainly α, mainly β α β classes Research Article CATH: classification protein structures O rengo et al. 11 M O LS C RIPT [33] representations the architectures identified for the mainly α, mainly β α β classes (PD B codes are given in parentheses). architectures identified for the mainly α, mainly β α β classes M O LS C RIPT [33] representations the architectures identified for the mainly α, mainly β α β classes (PD B codes are given in parentheses). more pround disturbances to the architecture. This is structures in these regions fold space. This means because, in addition to the ability to exp by adding on similar motifs (which may arise from gene duplica for these architectures it is perhaps more more pround disturbances appropriate to the architecture. This isa continuum structures these regions fold space. This means tion), layer-based architectures can further accommodate to consider inprotein folds. This to the ability to exp by adding evolutionary changes by allowing thebecause, layers in to addition slide relais particularly apparent in the layer-based swich archion similar motifs (which may arise from gene duplica- for these architectures it is perhaps more M O LS C RIPT [33] representations the architectures identified for the mainly α, mainly β α β classes more pround disturbances to the architecture. This is because, in addition to the ability to exp by adding on similar motifs (which may arise from gene duplication), layer-based architectures can further accommodate evolutionary changes by allowing the layers to slide relative to each other. This mechanism would not be available to complex architectures possessing a more diffuse hydrophobic core. structures in these reg that it becomes harder to for these archite appropriate to consider a c is particularly apparent in tectures the mainly β within the α β three-laye is possible to generate a using the simple criteria M O LS C RIPT [33] representations the architectures identified for the mainly α, mainly β α β classes (PD B codes are given in parentheses). M O LS C RIPT [33] representations the architectures identified fold for the mainly α, β doll α βeffect classes (PD B codes are given in parentheses). Overlap between : themainly Russian (SSAP score > = 70) Each new structure added The recurrence common motifs within many the last by a simple extension superfolds major architectures gives rise to an overlap more pround disturbances to the architecture. This is structures in these regions fold space. This means more pround disturbances to the architecture. This is structures in these regions fold space. This means because, in addition to the ability to exp by adding because, in addition to the ability to exp by adding on similar motifs (which may arise from gene duplica for these architectures it is perhaps more on similar motifs (which may arise from gene duplica for these architectures it is perhaps more tion), layer-based architectures can further accommodate appropriate to consider a continuum protein folds. This tion), layer-based architectures can further accommodate appropriate to consider a continuum protein folds. This evolutionary changes by allowing the layers to slide relais particularly apparent in the layer-based swich archievolutionary changes by allowing the layers to slide relais particularly apparent in the layer-based swich architive to each other. This mechanism would not be availtectures the mainly β α β classes. For example, tive to each other. This mechanism would not be availtectures the mainly β α β classes. For example, able to complex architectures possessing a more diffuse within the α β three-layer doubly wound architectures, it able to complex architectures possessing a more diffuse within the α β three-layer doubly wound architectures, it hydrophobic core. is possible to generate a very large family structures hydrophobic core. is possible to generate a very large family structures using the simple criteria a good structural comparison using the simple criteria a good structural comparison Overlap between fold : the Russian doll effect (SSAP score > = 70) reasonable overlap (> = 60%). Overlap between fold : the Russian doll effect (SSAP score > = 70) reasonable overlap (> = 60%). Each new structure added to a family will be related to the The recurrence common motifs within many the Each new structure added to a family will be related to the The recurrence common motifs within many the last by a simple extension one or more βαβ motifs superfolds major architectures gives rise to an overlap last by a simple extension one or more βαβ motifs superfolds major architectures gives rise to an overlap
7 Atpase 1BMF-Bovine ATPase F1 Chain A all Left hed superhelix P-loop containing nucleoside trip hydrolase Vogal et al: Current Opinion in Structural Biology 04: 14: 8-216
Heteropolymer. Mostly in regular secondary structure
Heteropolymer - + + - Mostly in regular secondary structure 1 2 3 4 C >N trace how you go around the helix C >N C2 >N6 C1 >N5 What s the pattern? Ci>Ni+? 5 6 move around not quite 120 "#$%&'!()*(+2!3/'!4#5'!1/,#64!#6!,6!
More informationThe CATH Database provides insights into protein structure/function relationships
1999 Oxford University Press Nucleic Acids Research, 1999, Vol. 27, No. 1 275 279 The CATH Database provides insights into protein structure/function relationships C. A. Orengo, F. M. G. Pearl, J. E. Bray,
More information2MHR. Protein structure classification is important because it organizes the protein structure universe that is independent of sequence similarity.
Protein structure classification is important because it organizes the protein structure universe that is independent of sequence similarity. A global picture of the protein universe will help us to understand
More informationCMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison
CMPS 6630: Introduction to Computational Biology and Bioinformatics Structure Comparison Protein Structure Comparison Motivation Understand sequence and structure variability Understand Domain architecture
More informationProtein Structure: Data Bases and Classification Ingo Ruczinski
Protein Structure: Data Bases and Classification Ingo Ruczinski Department of Biostatistics, Johns Hopkins University Reference Bourne and Weissig Structural Bioinformatics Wiley, 2003 More References
More informationProcheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics.
Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics Iosif Vaisman Email: ivaisman@gmu.edu ----------------------------------------------------------------- Bond
More informationPhysiochemical Properties of Residues
Physiochemical Properties of Residues Various Sources C N Cα R Slide 1 Conformational Propensities Conformational Propensity is the frequency in which a residue adopts a given conformation (in a polypeptide)
More informationIntroduction to Comparative Protein Modeling. Chapter 4 Part I
Introduction to Comparative Protein Modeling Chapter 4 Part I 1 Information on Proteins Each modeling study depends on the quality of the known experimental data. Basis of the model Search in the literature
More informationSupersecondary Structures (structural motifs)
Supersecondary Structures (structural motifs) Various Sources Slide 1 Supersecondary Structures (Motifs) Supersecondary Structures (Motifs): : Combinations of secondary structures in specific geometric
More informationProtein structure alignments
Protein structure alignments Proteins that fold in the same way, i.e. have the same fold are often homologs. Structure evolves slower than sequence Sequence is less conserved than structure If BLAST gives
More informationPacking of Secondary Structures
7.88 Lecture Notes - 4 7.24/7.88J/5.48J The Protein Folding and Human Disease Professor Gossard Retrieving, Viewing Protein Structures from the Protein Data Base Helix helix packing Packing of Secondary
More informationHMM applications. Applications of HMMs. Gene finding with HMMs. Using the gene finder
HMM applications Applications of HMMs Gene finding Pairwise alignment (pair HMMs) Characterizing protein families (profile HMMs) Predicting membrane proteins, and membrane protein topology Gene finding
More informationSCOP. all-β class. all-α class, 3 different folds. T4 endonuclease V. 4-helical cytokines. Globin-like
SCOP all-β class 4-helical cytokines T4 endonuclease V all-α class, 3 different folds Globin-like TIM-barrel fold α/β class Profilin-like fold α+β class http://scop.mrc-lmb.cam.ac.uk/scop CATH Class, Architecture,
More information1. Protein Data Bank (PDB) 1. Protein Data Bank (PDB)
Protein structure databases; visualization; and classifications 1. Introduction to Protein Data Bank (PDB) 2. Free graphic software for 3D structure visualization 3. Hierarchical classification of protein
More information09/06/25. Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Non-uniform distribution of folds. Scheme of protein structure predicition
Sequence identity Structural similarity Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Fold recognition Sommersemester 2009 Peter Güntert Structural similarity X Sequence identity Non-uniform
More informationCS612 - Algorithms in Bioinformatics
Fall 2017 Databases and Protein Structure Representation October 2, 2017 Molecular Biology as Information Science > 12, 000 genomes sequenced, mostly bacterial (2013) > 5x10 6 unique sequences available
More informationGiri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748
CAP 5510: Introduction to Bioinformatics Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinfs07.html 2/15/07 CAP5510 1 EM Algorithm Goal: Find θ, Z that maximize Pr
More informationUnderstanding Sequence, Structure and Function Relationships and the Resulting Redundancy
Understanding Sequence, Structure and Function Relationships and the Resulting Redundancy many slides by Philip E. Bourne Department of Pharmacology, UCSD Agenda Understand the relationship between sequence,
More informationSecondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure
Bioch/BIMS 503 Lecture 2 Structure and Function of Proteins August 28, 2008 Robert Nakamoto rkn3c@virginia.edu 2-0279 Secondary Structure Φ Ψ angles determine protein structure Φ Ψ angles are restricted
More informationHomology and Information Gathering and Domain Annotation for Proteins
Homology and Information Gathering and Domain Annotation for Proteins Outline Homology Information Gathering for Proteins Domain Annotation for Proteins Examples and exercises The concept of homology The
More informationNumber sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence
Number sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence Naoto Morikawa (nmorika@genocript.com) October 7, 2006. Abstract A protein is a sequence
More informationDesign of a Novel Globular Protein Fold with Atomic-Level Accuracy
Design of a Novel Globular Protein Fold with Atomic-Level Accuracy Brian Kuhlman, Gautam Dantas, Gregory C. Ireton, Gabriele Varani, Barry L. Stoddard, David Baker Presented by Kate Stafford 4 May 05 Protein
More informationHomology. and. Information Gathering and Domain Annotation for Proteins
Homology and Information Gathering and Domain Annotation for Proteins Outline WHAT IS HOMOLOGY? HOW TO GATHER KNOWN PROTEIN INFORMATION? HOW TO ANNOTATE PROTEIN DOMAINS? EXAMPLES AND EXERCISES Homology
More informationProtein Structure Prediction
Page 1 Protein Structure Prediction Russ B. Altman BMI 214 CS 274 Protein Folding is different from structure prediction --Folding is concerned with the process of taking the 3D shape, usually based on
More informationSequence analysis and comparison
The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species
More informationSupporting Online Material for
www.sciencemag.org/cgi/content/full/309/5742/1868/dc1 Supporting Online Material for Toward High-Resolution de Novo Structure Prediction for Small Proteins Philip Bradley, Kira M. S. Misura, David Baker*
More informationProtein Structure & Motifs
& Motifs Biochemistry 201 Molecular Biology January 12, 2000 Doug Brutlag Introduction Proteins are more flexible than nucleic acids in structure because of both the larger number of types of residues
More informationAnalysis and Prediction of Protein Structure (I)
Analysis and Prediction of Protein Structure (I) Jianlin Cheng, PhD School of Electrical Engineering and Computer Science University of Central Florida 2006 Free for academic use. Copyright @ Jianlin Cheng
More informationProtein Structure. Role of (bio)informatics in drug discovery. Bioinformatics
Bioinformatics Protein Structure Principles & Architecture Marjolein Thunnissen Dep. of Biochemistry & Structural Biology Lund University September 2011 Homology, pattern and 3D structure searches need
More informationD Dobbs ISU - BCB 444/544X 1
11/7/05 Protein Structure: Classification, Databases, Visualization Announcements BCB 544 Projects - Important Dates: Nov 2 Wed noon - Project proposals due to David/Drena Nov 4 Fri PM - Approvals/responses
More informationReview. Membrane proteins. Membrane transport
Quiz 1 For problem set 11 Q1, you need the equation for the average lateral distance transversed (s) of a molecule in the membrane with respect to the diffusion constant (D) and time (t). s = (4 D t) 1/2
More informationBasics of protein structure
Today: 1. Projects a. Requirements: i. Critical review of one paper ii. At least one computational result b. Noon, Dec. 3 rd written report and oral presentation are due; submit via email to bphys101@fas.harvard.edu
More informationBasic structures of proteins
Basic structures of proteins Structural Hierarchy of Protein Primary structure Functional elements : α-helix, strands, β-sheet, loops.. - Structure, affinity, activity, specificity, stability etc. Secondary
More informationMajor Types of Association of Proteins with Cell Membranes. From Alberts et al
Major Types of Association of Proteins with Cell Membranes From Alberts et al Proteins Are Polymers of Amino Acids Peptide Bond Formation Amino Acid central carbon atom to which are attached amino group
More informationAdvanced Certificate in Principles in Protein Structure. You will be given a start time with your exam instructions
BIRKBECK COLLEGE (University of London) Advanced Certificate in Principles in Protein Structure MSc Structural Molecular Biology Date: Thursday, 1st September 2011 Time: 3 hours You will be given a start
More informationGenome Databases The CATH database
Genome Databases The CATH database Michael Knudsen 1 and Carsten Wiuf 1,2* 1 Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark 2 Centre for Membrane Pumps in Cells and Disease
More informationCAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Giri Narasimhan
CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinff18.html Proteins and Protein Structure
More informationA General Model for Amino Acid Interaction Networks
Author manuscript, published in "N/P" A General Model for Amino Acid Interaction Networks Omar GACI and Stefan BALEV hal-43269, version - Nov 29 Abstract In this paper we introduce the notion of protein
More informationProtein Structure. W. M. Grogan, Ph.D. OBJECTIVES
Protein Structure W. M. Grogan, Ph.D. OBJECTIVES 1. Describe the structure and characteristic properties of typical proteins. 2. List and describe the four levels of structure found in proteins. 3. Relate
More informationBIRKBECK COLLEGE (University of London)
BIRKBECK COLLEGE (University of London) SCHOOL OF BIOLOGICAL SCIENCES M.Sc. EXAMINATION FOR INTERNAL STUDENTS ON: Postgraduate Certificate in Principles of Protein Structure MSc Structural Molecular Biology
More informationAmino Acid Structures from Klug & Cummings. 10/7/2003 CAP/CGS 5991: Lecture 7 1
Amino Acid Structures from Klug & Cummings 10/7/2003 CAP/CGS 5991: Lecture 7 1 Amino Acid Structures from Klug & Cummings 10/7/2003 CAP/CGS 5991: Lecture 7 2 Amino Acid Structures from Klug & Cummings
More informationProtein Bioinformatics. Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet sandberg.cmb.ki.
Protein Bioinformatics Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet rickard.sandberg@ki.se sandberg.cmb.ki.se Outline Protein features motifs patterns profiles signals 2 Protein
More informationBioinformatics. Macromolecular structure
Bioinformatics Macromolecular structure Contents Determination of protein structure Structure databases Secondary structure elements (SSE) Tertiary structure Structure analysis Structure alignment Domain
More informationChapter 5. Proteomics and the analysis of protein sequence Ⅱ
Proteomics Chapter 5. Proteomics and the analysis of protein sequence Ⅱ 1 Pairwise similarity searching (1) Figure 5.5: manual alignment One of the amino acids in the top sequence has no equivalent and
More informationStudy of Mining Protein Structural Properties and its Application
Study of Mining Protein Structural Properties and its Application A Dissertation Proposal Presented to the Department of Computer Science and Information Engineering College of Electrical Engineering and
More informationSUPPLEMENTARY INFORMATION
Supplementary Results DNA binding property of the SRA domain was examined by an electrophoresis mobility shift assay (EMSA) using synthesized 12-bp oligonucleotide duplexes containing unmodified, hemi-methylated,
More informationCSCE555 Bioinformatics. Protein Function Annotation
CSCE555 Bioinformatics Protein Function Annotation Why we need to do function annotation? Fig from: Network-based prediction of protein function. Molecular Systems Biology 3:88. 2007 What s function? The
More informationBio 1B Lecture Outline (please print and bring along) Fall, 2007
Bio 1B Lecture Outline (please print and bring along) Fall, 2007 B.D. Mishler, Dept. of Integrative Biology 2-6810, bmishler@berkeley.edu Evolution lecture #5 -- Molecular genetics and molecular evolution
More informationComparing Protein Structures. Why?
7.91 Amy Keating Comparing Protein Structures Why? detect evolutionary relationships identify recurring motifs detect structure/function relationships predict function assess predicted structures classify
More informationSupplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution
Supplemental Materials for Structural Diversity of Protein Segments Follows a Power-law Distribution Yoshito SAWADA and Shinya HONDA* National Institute of Advanced Industrial Science and Technology (AIST),
More informationProtein Structure. Hierarchy of Protein Structure. Tertiary structure. independently stable structural unit. includes disulfide bonds
Protein Structure Hierarchy of Protein Structure 2 3 Structural element Primary structure Secondary structure Super-secondary structure Domain Tertiary structure Quaternary structure Description amino
More informationCMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction
CMPS 6630: Introduction to Computational Biology and Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the
More informationB O C 4 H 2 O O. NOTE: The reaction proceeds with a carbonium ion stabilized on the C 1 of sugar A.
hbcse 33 rd International Page 101 hemistry lympiad Preparatory 05/02/01 Problems d. In the hydrolysis of the glycosidic bond, the glycosidic bridge oxygen goes with 4 of the sugar B. n cleavage, 18 from
More informationRanjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013
Hydration of protein-rna recognition sites Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India 1 st November, 2013 Central Dogma of life DNA
More informationCOMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University
COMP 598 Advanced Computational Biology Methods & Research Introduction Jérôme Waldispühl School of Computer Science McGill University General informations (1) Office hours: by appointment Office: TR3018
More informationOrientational degeneracy in the presence of one alignment tensor.
Orientational degeneracy in the presence of one alignment tensor. Rotation about the x, y and z axes can be performed in the aligned mode of the program to examine the four degenerate orientations of two
More informationIntroduction to Evolutionary Concepts
Introduction to Evolutionary Concepts and VMD/MultiSeq - Part I Zaida (Zan) Luthey-Schulten Dept. Chemistry, Beckman Institute, Biophysics, Institute of Genomics Biology, & Physics NIH Workshop 2009 VMD/MultiSeq
More informationTranslation. A ribosome, mrna, and trna.
Translation The basic processes of translation are conserved among prokaryotes and eukaryotes. Prokaryotic Translation A ribosome, mrna, and trna. In the initiation of translation in prokaryotes, the Shine-Dalgarno
More informationProtein Science (1997), 6: Cambridge University Press. Printed in the USA. Copyright 1997 The Protein Society
1 of 5 1/30/00 8:08 PM Protein Science (1997), 6: 246-248. Cambridge University Press. Printed in the USA. Copyright 1997 The Protein Society FOR THE RECORD LPFC: An Internet library of protein family
More informationDATE A DAtabase of TIM Barrel Enzymes
DATE A DAtabase of TIM Barrel Enzymes 2 2.1 Introduction.. 2.2 Objective and salient features of the database 2.2.1 Choice of the dataset.. 2.3 Statistical information on the database.. 2.4 Features....
More informationBioinformatics. Proteins II. - Pattern, Profile, & Structure Database Searching. Robert Latek, Ph.D. Bioinformatics, Biocomputing
Bioinformatics Proteins II. - Pattern, Profile, & Structure Database Searching Robert Latek, Ph.D. Bioinformatics, Biocomputing WIBR Bioinformatics Course, Whitehead Institute, 2002 1 Proteins I.-III.
More informationGenomic and structural aspects of protein evolution
www.biochemj.org Biochem. J. (2009) 419, 15 28 (Printed in Great Britain) doi:10.1042/bj20090122 15 REVIEW ARTICLE Genomic and structural aspects of protein evolution Cyrus CHOTHIA* and Julian GOUGH *MRC
More informationLecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability
Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Part I. Review of forces Covalent bonds Non-covalent Interactions: Van der Waals Interactions
More informationProtein structure analysis. Risto Laakso 10th January 2005
Protein structure analysis Risto Laakso risto.laakso@hut.fi 10th January 2005 1 1 Summary Various methods of protein structure analysis were examined. Two proteins, 1HLB (Sea cucumber hemoglobin) and 1HLM
More informationCMPS 3110: Bioinformatics. Tertiary Structure Prediction
CMPS 3110: Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the laws of physics! Conformation space is finite
More informationAdvanced Topics in RNA and DNA. DNA Microarrays Aptamers
Quiz 1 Advanced Topics in RNA and DNA DNA Microarrays Aptamers 2 Quantifying mrna levels to asses protein expression 3 The DNA Microarray Experiment 4 Application of DNA Microarrays 5 Some applications
More informationCAP 5510 Lecture 3 Protein Structures
CAP 5510 Lecture 3 Protein Structures Su-Shing Chen Bioinformatics CISE 8/19/2005 Su-Shing Chen, CISE 1 Protein Conformation 8/19/2005 Su-Shing Chen, CISE 2 Protein Conformational Structures Hydrophobicity
More informationPeptides And Proteins
Kevin Burgess, May 3, 2017 1 Peptides And Proteins from chapter(s) in the recommended text A. Introduction B. omenclature And Conventions by amide bonds. on the left, right. 2 -terminal C-terminal triglycine
More informationResearch Proposal. Title: Multiple Sequence Alignment used to investigate the co-evolving positions in OxyR Protein family.
Research Proposal Title: Multiple Sequence Alignment used to investigate the co-evolving positions in OxyR Protein family. Name: Minjal Pancholi Howard University Washington, DC. June 19, 2009 Research
More informationALL LECTURES IN SB Introduction
1. Introduction 2. Molecular Architecture I 3. Molecular Architecture II 4. Molecular Simulation I 5. Molecular Simulation II 6. Bioinformatics I 7. Bioinformatics II 8. Prediction I 9. Prediction II ALL
More informationChapter
Chapter 17 17.4-17.6 Molecular Components of Translation A cell interprets a genetic message and builds a polypeptide The message is a series of codons on mrna The interpreter is called transfer (trna)
More informationSupplementary Figure 1. Aligned sequences of yeast IDH1 (top) and IDH2 (bottom) with isocitrate
SUPPLEMENTARY FIGURE LEGENDS Supplementary Figure 1. Aligned sequences of yeast IDH1 (top) and IDH2 (bottom) with isocitrate dehydrogenase from Escherichia coli [ICD, pdb 1PB1, Mesecar, A. D., and Koshland,
More informationSupporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1
Supporting information to: Time-resolved observation of protein allosteric communication Sebastian Buchenberg, Florian Sittel and Gerhard Stock Biomolecular Dynamics, Institute of Physics, Albert Ludwigs
More informationCentral Dogma. modifications genome transcriptome proteome
entral Dogma DA ma protein post-translational modifications genome transcriptome proteome 83 ierarchy of Protein Structure 20 Amino Acids There are 20 n possible sequences for a protein of n residues!
More informationGENETICS - CLUTCH CH.11 TRANSLATION.
!! www.clutchprep.com CONCEPT: GENETIC CODE Nucleotides and amino acids are translated in a 1 to 1 method The triplet code states that three nucleotides codes for one amino acid - A codon is a term for
More informationLarge-Scale Genomic Surveys
Bioinformatics Subtopics Fold Recognition Secondary Structure Prediction Docking & Drug Design Protein Geometry Structural Informatics Homology Modeling Sequence Alignment Structure Classification Gene
More informationWeek 10: Homology Modelling (II) - HHpred
Week 10: Homology Modelling (II) - HHpred Course: Tools for Structural Biology Fabian Glaser BKU - Technion 1 2 Identify and align related structures by sequence methods is not an easy task All comparative
More informationProtein Structure Prediction and Display
Protein Structure Prediction and Display Goal Take primary structure (sequence) and, using rules derived from known structures, predict the secondary structure that is most likely to be adopted by each
More informationProperties of amino acids in proteins
Properties of amino acids in proteins one of the primary roles of DNA (but not the only one!) is to code for proteins A typical bacterium builds thousands types of proteins, all from ~20 amino acids repeated
More informationModel Mélange. Physical Models of Peptides and Proteins
Model Mélange Physical Models of Peptides and Proteins In the Model Mélange activity, you will visit four different stations each featuring a variety of different physical models of peptides or proteins.
More informationSequences, Structures, and Gene Regulatory Networks
Sequences, Structures, and Gene Regulatory Networks Learning Outcomes After this class, you will Understand gene expression and protein structure in more detail Appreciate why biologists like to align
More informationEBI web resources II: Ensembl and InterPro
EBI web resources II: Ensembl and InterPro Yanbin Yin http://www.ebi.ac.uk/training/online/course/ 1 Homework 3 Go to http://www.ebi.ac.uk/interpro/training.htmland finish the second online training course
More informationDetails of Protein Structure
Details of Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Anne Mølgaard, Kemisk Institut, Københavns Universitet Learning Objectives
More informationHomology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB
Homology Modeling (Comparative Structure Modeling) Aims of Structural Genomics High-throughput 3D structure determination and analysis To determine or predict the 3D structures of all the proteins encoded
More informationFrom gene to protein. Premedical biology
From gene to protein Premedical biology Central dogma of Biology, Molecular Biology, Genetics transcription replication reverse transcription translation DNA RNA Protein RNA chemically similar to DNA,
More informationStructure to Function. Molecular Bioinformatics, X3, 2006
Structure to Function Molecular Bioinformatics, X3, 2006 Structural GeNOMICS Structural Genomics project aims at determination of 3D structures of all proteins: - organize known proteins into families
More informationStructural Alignment of Proteins
Goal Align protein structures Structural Alignment of Proteins 1 2 3 4 5 6 7 8 9 10 11 12 13 14 PHE ASP ILE CYS ARG LEU PRO GLY SER ALA GLU ALA VAL CYS PHE ASN VAL CYS ARG THR PRO --- --- --- GLU ALA ILE
More informationMotifs, Profiles and Domains. Michael Tress Protein Design Group Centro Nacional de Biotecnología, CSIC
Motifs, Profiles and Domains Michael Tress Protein Design Group Centro Nacional de Biotecnología, CSIC Comparing Two Proteins Sequence Alignment Determining the pattern of evolution and identifying conserved
More informationB. β Structure. All contents of this document, unless otherwise noted, are David C. & Jane S. Richardson. All Rights Reserved.
B. β Structure The other major structural element found in globular proteins is the β sheet. Historically, it was first observed as the β, or extended, form of keratin fibers. An approximate understanding
More informationCONCEPT OF SEQUENCE COMPARISON. Natapol Pornputtapong 18 January 2018
CONCEPT OF SEQUENCE COMPARISON Natapol Pornputtapong 18 January 2018 SEQUENCE ANALYSIS - A ROSETTA STONE OF LIFE Sequence analysis is the process of subjecting a DNA, RNA or peptide sequence to any of
More informationAlgorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment
Algorithms in Bioinformatics FOUR Sami Khuri Department of Computer Science San José State University Pairwise Sequence Alignment Homology Similarity Global string alignment Local string alignment Dot
More informationIntroduction to" Protein Structure
Introduction to" Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Learning Objectives Outline the basic levels of protein structure.
More informationChapter 2 Structures. 2.1 Introduction Storing Protein Structures The PDB File Format
Chapter 2 Structures 2.1 Introduction The three-dimensional (3D) structure of a protein contains a lot of information on its function, and can be used for devising ways of modifying it (propose mutants,
More informationHomology Modeling. Roberto Lins EPFL - summer semester 2005
Homology Modeling Roberto Lins EPFL - summer semester 2005 Disclaimer: course material is mainly taken from: P.E. Bourne & H Weissig, Structural Bioinformatics; C.A. Orengo, D.T. Jones & J.M. Thornton,
More informationBahnson Biochemistry Cume, April 8, 2006 The Structural Biology of Signal Transduction
Name page 1 of 6 Bahnson Biochemistry Cume, April 8, 2006 The Structural Biology of Signal Transduction Part I. The ion Ca 2+ can function as a 2 nd messenger. Pick a specific signal transduction pathway
More informationProtein Folds, Functions and Evolution
Article No. jmbi.1999.3054 available online at http://www.idealibrary.com on J. Mol. Biol. (1999) 293, 333±342 Protein Folds, Functions and Evolution Janet M. Thornton 1,2 *, Christine A. Orengo 1, Annabel
More informationGene regulation I Biochemistry 302. Bob Kelm February 25, 2005
Gene regulation I Biochemistry 302 Bob Kelm February 25, 2005 Principles of gene regulation (cellular versus molecular level) Extracellular signals Chemical (e.g. hormones, growth factors) Environmental
More informationPHYLOGENY AND SYSTEMATICS
AP BIOLOGY EVOLUTION/HEREDITY UNIT Unit 1 Part 11 Chapter 26 Activity #15 NAME DATE PERIOD PHYLOGENY AND SYSTEMATICS PHYLOGENY Evolutionary history of species or group of related species SYSTEMATICS Study
More informationObjective: Students will be able identify peptide bonds in proteins and describe the overall reaction between amino acids that create peptide bonds.
Scott Seiple AP Biology Lesson Plan Lesson: Primary and Secondary Structure of Proteins Purpose:. To understand how amino acids can react to form peptides through peptide bonds.. Students will be able
More informationPrediction and refinement of NMR structures from sparse experimental data
Prediction and refinement of NMR structures from sparse experimental data Jeff Skolnick Director Center for the Study of Systems Biology School of Biology Georgia Institute of Technology Overview of talk
More information