remembering Secondary Structures Does everyone know what the backbone and residue/side chains are? Clear about 1, 2 3 structures?

Size: px
Start display at page:

Download "remembering Secondary Structures Does everyone know what the backbone and residue/side chains are? Clear about 1, 2 3 structures?"

Transcription

1 remembering Secondary Structures add blast Does everyone know what the backbone residue/side chains are? Clear about 1, 2 3 structures? Heteropolymer - + Mostly in regular secondary structure + - Secondary structure can be defined by phi psi angles or by hydrogen bond patterns. A connected sequence amino must have the right phi-psi angles to make the secondary structure. Gennis-protein structure

2 .*1..*+/'-+3+//+-'27+'-T$PX<$^l#$Th$R+5)+"8(/+G1,34+"15 =*+4'G<[h4(5: 6(742)(5.1'5%[/+-'27+-$ ;*+-'2+)*1'5-(8.*+ =*+4'G-.')F(7."15:4+2.(61/2-.*+1,'5(=.+/,'57-(8.*+ *+4'GCX.>1(')*5)JD$H8.*+*+4'G'-B'+6+28/(,.*+_=.+/,'57-2(65.*+*+4'G1G'-".*+ -'2+)*1'5-)159+(9-+/B+21522'-30+2'5.*+8(/,(81*+4')*++4$S5+G1,34+(8.*'-'--*(65'5X.>1(')*5)J$M'5)+'..1F+-<$^/+-'27+-.(,1F+1)(, /5"+1)* /+-'27+'-/(.1.+2#[[(6'.*/+-3+)..('.-5+1/+-.5+':*9(/-'5.*+*+4'G$H8.*+/+-'27+- *1B+3(/)*1/1).+/(5(5+-'2+1525(5=3(/)*1/1).+/(5.*+(33(-'.+-'2+"1-'5.*+ +G1,34+-*(65'5X.>1(')*5)J".*+*+4'G'--1'2.(9+$A%#.%$&#.2$ move around not quite 1 C >N C >N C2 >N6 8.&#&#')P('4.01'44#/8-$=/7'5?#.4'Q$A%9'4#/$-$A%#.%$&#.2#'9.QK8.&# %/9$($-0-/-@%/9$(('4.01'4/-/%%/4.&'4.0'45?#'#'9.2$98#''9.4&$H'-3(/A$ _$7$,%%9'&8(.&&'-=:U08$(0`5IaY'.9$-0"#$(9'4F5](.4#$ACb-.7'(4.&:/3 c.(>.-.$.-"#$(9/&&'47.99'kc.(>.-.$e+ #&&%+VV2&.5.&25c.(>.-.$5UDbVd2A>VD'A/V8#''9V8#''9,%%5#&A95 C1 >N5 What s the pattern? Ci>Ni+? Gennis %T notice up-down-up-down Each side can have different properties the amino are on the outside 1f3c the boxes show amino

3 Motifs : Mruzin et at JMB 1995 (Cyrus Chothia) Cath: Orengo et al Structure 1997 (Janet Thornton) Each starts with domains Homolog - Phylogenetically related - derived from a common ancestor gene. (PFAM-genes; e/cathstructures). ifications are descriptive; Structural diversity comes from evolution - we use classification to deduce evolutionary relationships. Ortholog - retain the same function Paralog - function has diverged Family: related with similar function Superfamily: related with different functions Gennis: structure Gennis: structure Generally with very similar sequences you have a very similar fold. There are RARE cases where this is not true (even with different folds under different conditions) Proteins with similar fold can have different sequence. Fold is more conserved than sequence & convergent evolution can lead to similar folds. Proteins are made domains. A domain is a structural an evolutionary unit. They have 50-0 residues. Compact folded unit, quasi-independent structurally functionally Domains that are or come from a common ancestor. similar sequence - family diverged sequence but similar fold function - superfamily Chothia Gough (Biochem J (09) 9, 15-28

4 SCIENCE VOL JUNE 03 Fig. 2. An example a supradomain. The P-loop containing NTP hydrolase domain the Translation Proteins domain (5) occur in prokaryotic eukaryotic translation factors that hydrolyze guanosine triphosphate (GTP). GTP hydrolysis in the P-loop domain drives the conformational change in the Translation Proteins domain, which is then transmitted onto the ribosome. The supradomain occurs in 35 different domain architectures, 6 these are given here. The nset at left shows a protein known structure, which contains the supradomain. IF, initiation actor; EF, elongation factor; RF, factor; trna, transfer RNA. Figure 3. Glycosyl Hydrolases (A C) (A) (1/3)-b-glucanase (Varghese et al., 1994) represents the basic (Trans) glycosidases superfamily (c.1.8). Homologous catalytic domains are found in (B) b-glucuronidase (C) b-galactosidase. (B) In b-glucuronidase (Jain et al., 1996), the catalytic domain is 3 (in red) is joined by two other domains: 1 restricts the binding site, 2 links 1 to 3. (C) b-galactosidase.the first three domains have the same structure as b-glucuronidase (Jacobson et al., 1994). Domain 4 links domain 3 to 5, which contributes to the active site. Bashton Chothia: structure 15: (07) Dominant mechanisms that produce new are Duplication the genes old divergence these sequences to produce modified functions combination genes to further modify properties Many are multi-domain. Chothia Gough (Biochem J (09) 9, CATH : α,β,αβ Architecture: gross arrangement 2 structure independent connectivity Topology: Fold family linking 2 structure Fold=Topology 3 Homologous superfamily structure similar function similar 2386 Sequence family >35% identity domains : α,β,αβ,α+β Fold same 2 structure elements same topology not related Superfamily Common evolutionary origin low seq identity 1962 Family >30% identical or >15% with same function 92 ification: based on structure sequence (C-level): secondary structure composition contacts. The first, most general level the classification, class, describes the relative content α helices β sheets in a similar way to that described by Levitt Chothia [29], except that we only define three major classes mainly α, mainly β α β. Although the latter class can be sub- divided into alternating α/β α+β, in CATH, this information is considered at a lower level describing topology. Architecture (A-level): description the gross arrangement secondary structures, independent connectivity This level distinguishes structures in the same class with different architectures, but does not distinguish between different topologies (connectivities). The architectural groupings can sometimes be rather broad as they describe general features protein-fold shape, for example, the number layers in an α-β swich. A given architecture will contain structures with diverse connectivities (see Figure 2) which will be distinguished at the next level down (topology). For example, in the α-β class (C = 3), there are two common architectures each containing a large number different fold. One is the barrel- like architecture (A = ) adopted, (egtim-barrel folds). These have an inner β barrel an outer layer α helices (Figure 2). Alternatively, the three-layer α-β swich architecture (A = 40) consists a central β sheet which is covered by a layer α helices on both sides the sheet (Figure 2). Topology (T-level): fold Structures which are grouped at the T-level have the same overall fold, which means that they have a similar number arrangement secondary structures that the connectivity linking their secondary structure elements is the same. In this paper, the words fold topology have the same meaning. Proteins with the same CAT numbers have the same class, architecture topology but do not necessarily belong to the same homologous superfamily.within a given topology level, the structures are similar, but may have diverse functions. Homologous superfamily (H-level): highly similar structures functional similarity At the H-level, structures are grouped by their high structural similarity similar functions, which suggest that they may have evolved from a common ancestor, particularly, where there are resemblances in core packing or putative active sites. Using the example the mainly α.non-bundle. globin-like folds the erythrocruorins, colicins, phycocya- nins domain 1 diptheria toxin all have the same CAT number (1..340), but are differentiated by their H numbers,, 30 40, respectively (see Figure 3). Sequence family (S-level): significant sequence similarity thus a high probability having similar structure/function Members which are clustered at this level (having the same CATHS number) have sequence identities >35% as such are presumed to have extremely similar structures functions they may be slightly different examples the same protein from different species belonging to the same sequence superfamily. Some have many protein domains found (9 take up % the human genome) others have few. There are in animals; bacteria Many are found in all kingdoms life. Chothia Gough (Biochem J (09) 9, 15-28

5 Structural ification Proteins Structural ification Proteins : :Structural Structuralification ification PDB PDB (1 (1Aug Aug1998). 1998) Domains. Domains ification ification Orengo :93 et at: Structure folds folds Structural ification Proteins Structural ification Proteins 25 : :Structural Structural ification ification : Structural ification PDB PDB (1(1Aug 1998) Domains. Domains ). 70Aug PDB (23 Feb 09) Domains Reference : Structural ification PDB (23 Feb 09) Domains. 1 Reference 113 folds folds : Structural ification folds PDB (23 Feb ification 09) Reference Domains. 6 : Structural surface 12 folds PDB (23Feb 09) Domains. 1 Reference ification ification CATH version 3.3 (class, architecture, topology, homology) contains domains, 2386 homologous 3 fold groups folds surface 12 folds surface surface surface surface : :Structural Structuralification ification Nucleic Acids Research 01 Alison L Cuff, Ian Sillitoe, Tony Lewis, Andrew B Clegg, Robert Rentzsch, Nicholas Furnham, Marialuisa Pellegrini-Calace, David Jones, Janet Thornton, Christine A Orengo ification ification ification ification PDB PDB ( (Oct Oct1997). 1997) Domains. Domains.1 1 ification : :Structural Structuralification ification PDB PDB ( (Oct Oct1997). 1997) Domains. Domains.1 1 folds ification folds ification ification : Structural ification folds folds PDB (26 Sep 07). Reference Domains : Structural ification : Structural 1.73 ification PDB PDB (26 Sep 07) Reference Domains (26 Sep 07). 971 Domains Reference : Structural ification PDB (26 Sep 07). 11 Reference Domains folds surface folds folds 459 surface folds Page 1 9 Copyright The scop / /scop@mrc-lmb.cam.ac.uk Copyright The scopauthors authors scop@mrc-lmb.cam.ac.uk Page 1 9 Copyright The Copyright Thescop scopauthors authors/ /scop@mrc-lmb.cam.ac.uk scop@mrc-lmb.cam.ac.uk June June09 09 June June09 09 PagePage PagePage PagePage Figure 2 Schematic representation the class (C), architecture (A) topology (T) level in the C ATH database. Helices are drawn in blue strs are drawn as magenta arrows. The barrel, three-layer swich roll architectures (A-level) are shown for the α β class. Two representatives from fold in the three-layer swich architecture are shown. Cuff... Orengo Structure 17, 51 62, August 12, 09 The structural universe as revealed by CATH Research Article CATH: classification protein structures Orengo et al. 97 Figure 3 C ATH numbering scheme for representative structures from the globin-like fold family in the mainly α class. Four the seven levels within the C ATH database are shown, associated with C lass, Architecture, Topology, Homology. Each level is associated with a unique number. The (A), (T) (H) levels are numbered in bins ten to allow expansion the database. Architecture 1 Mainly α 2 Mainly β 3 4 Non-bundle α β Bundle Few SS 30 Few SS Topology Homology Variant surface glycoprotein Glucoamylase, domain 2 4 Globin-like 500 Although 5993 protein chains (80 domains) were selected for CATH from the September 1996 the PDB, sequence comparisons showed that approximately three quarters these were nearly identical (see Figure 1, Tables 1 2). CATH grouped the 80 domain structures into 21 non-identical (N-level). By assuming that with more than 35% sequence similarity adopt the same fold share a common evolutionary ancestor, this number can be reduced to 68 sequence (S-level). More distant relatives were added to these by searching for significantly high structural similarity related biological functions, thereby reducing the number homologous to 645 (H-level). If a lower degree structural similarity is allowed, these further group to give a total 505 1hlm Casein kinase δ.. 1cpc chain A col chain A 40 1ddt domain through overlap a large motif containing five β strs. This is not currently done in C ATH, as both are commonly referred to as separate folds in the literature. Mainly α.non-bundle.globin-like.1cpc chain A somewhat broad category, the fold within this barrel architecture all share a common structural feature comprising a single β sheet. Similarly, the three-layer α β swich architecture also represents a large grouping folds varying sizes (containing β sheets having from 4 to 17 β-strs). Again, all the folds within this architecture can be simply usefully described as having a central β sheet with layers α helices on each its sides (see Figure 2). Wherever possible, we have used architectural descriptions commonly cited in the literature. For a majority the folds (>80%) this was a simple straightforward process the architectural categories assigned agreed well with those given in other publicly available databases (e.g. [17]). For more complex shapes, no architecture was assigned, these folds were all placed in a single complex bin until alternative assignment methods are developed. Such methods will probably describe shape according to the diverse motifs contained within the fold the ways in which these motifs are combined in 3D space. The variety architectures that can easily be assigned by visual inspection is shown, for each class, in Figure 4. For mainly α only the familiar four-helix bundle architecture is easily distinguishable. Other helix arrangements Overview architectures The CATH architecture level is a subjective grouping folds having similar shape, regardless differences in scale or numbers secondary structures. For example, both the 5-stred barwin-like β-barrel folds the 17-stred porin-like β-barrel folds are assigned to the same general β-barrel architecture. Although this is a Illustration motif overlaps in the mainly β swich architecture. Each structure shown can be related to the central tenascin structure by a motif containing at least four β strs (although these are not sequential in the transthyretin structure) up to seven β strs in plastocyanin the -76 immunoglobulin variable domain structures. It can be seen that this results in the possible merging the immunoglobulin fold family (2rhe) the jelly-roll fold family (1tnfA) Figure 9 βlactamase, domain 2 5 fold (T-level), within which similarity may be a result divergent or convergent evolution. These fold are further grouped within CATH, automatically according to class manually according to their architecture giving a total 3 major classes 28 different architectures (see Table 1). Figure 1 shows the annual increase in structures structural for each level in the CATH hierarchy. 1TTF.pdb SHEET 1 SHEET 2 SHEET 3 SHEET 1 SHEET 2 SHEET 3 SHEET GLU A 9 THR A SER A 17 ASP A 23-1 O SER A 21 N GLU A THR A 56 SER A 60-1 N ALA A 57 O ILE A 2 4 GLN A 46 PRO A TYR A 31 GLU A 38-1 N TYR A 36 O GLN A VAL A THR A 76-1 N VAL A 75 O TYR A ILE A 88 THR A 94-1 N ILE A 88 O VAL A 72

6 14 Figure 9 Illustration motif overlaps in the mainly β swich architecture. Each structure shown can be related to the central tenascin structure by a motif containing at least four β strs (although these are not sequential in the transthyretin structure) up to seven β strs in plastocyanin the immunoglobulin variable domain structures. It can be seen that this results in the possible merging the immunoglobulin fold family (2rhe) the jelly-roll fold family (1tnfA) through overlap a large motif containing five β strs. This is not currently done in C ATH, as both are commonly referred to as separate folds in the literature. 10 Bron Tooze Figure 4 prediction (from sequence) methods. It would therefore seem more useful to subclassify these regions fold space according to more sophisticated criteria for structural similarity, in order to generate smaller containing closer relatives. These criteria would be based on recognising similarities within the cores the protein structures belonging to a particular family setting tolerances on the number allowed secondary structure indels. With this aim, a suite programs (CORA, COnsensus Residues Attributes) for analysing structural has been developed (CAO, unpublished data) will be applied to all the superfold within the CATH database. It is planned that future s CATH will assign structures to fold within the superarchitectures according to the diagnostics generated by CORA. Interestingly, an analysis the distribution domain structures in non-homologous multidomain showed that only 8.1% these domains occur also as single domain only 5% recur in other multidomain. Identification multidomains recurrence folds within multidomain Until we improve our understing structural constraints on secondary structure packing, the ideal a completely automatic approach generating self-consistent reproducible hierarchies at all levels is not feasible. Some approaches avoid this problem by clustering on the basis overlapping helices, strs or small common motifs, regardless 3D arrangement. Such motif-based classifications, however, are generally less useful for revealing global structural relationships between evolutionary related. We chose, therefore, to use an By applying the consensus domain boundary assignment procedure to all N-representatives (7 ) in the September 1996 version CATH, 74% the total number single domains (515) could be assigned automatically 21% the total number multidomain (272). Of those remaining unassigned, at least one the methods gave acceptable boundary assignments, needing no or only minor adjustments. Research Article CATH: classification pro Future developments: automatic architecture assignment The CATH architectural groupings are currently broad, general, categories that represent a preliminary classification which should significantly aid a future, more detailed analysis common architectural features. Although, these groups are assigned manually, other publicly available classifications have adopted a similar pragmatic approach, using a combination automatic manual approaches where appropriate (, DIAL [17,27]). Research Article CATH: classification protein structures Orengo et al. Research Article CATH: classification protein structures Orengo et al. 11 Research Article CATH: classification protein structures Orengo et al architectures identified for the mainly α, mainly β α β classes Research Article CATH: classification protein structures O rengo et al. 11 M O LS C RIPT [33] representations the architectures identified for the mainly α, mainly β α β classes (PD B codes are given in parentheses). architectures identified for the mainly α, mainly β α β classes M O LS C RIPT [33] representations the architectures identified for the mainly α, mainly β α β classes (PD B codes are given in parentheses). more pround disturbances to the architecture. This is structures in these regions fold space. This means because, in addition to the ability to exp by adding on similar motifs (which may arise from gene duplica for these architectures it is perhaps more more pround disturbances appropriate to the architecture. This isa continuum structures these regions fold space. This means tion), layer-based architectures can further accommodate to consider inprotein folds. This to the ability to exp by adding evolutionary changes by allowing thebecause, layers in to addition slide relais particularly apparent in the layer-based swich archion similar motifs (which may arise from gene duplica- for these architectures it is perhaps more M O LS C RIPT [33] representations the architectures identified for the mainly α, mainly β α β classes more pround disturbances to the architecture. This is because, in addition to the ability to exp by adding on similar motifs (which may arise from gene duplication), layer-based architectures can further accommodate evolutionary changes by allowing the layers to slide relative to each other. This mechanism would not be available to complex architectures possessing a more diffuse hydrophobic core. structures in these reg that it becomes harder to for these archite appropriate to consider a c is particularly apparent in tectures the mainly β within the α β three-laye is possible to generate a using the simple criteria M O LS C RIPT [33] representations the architectures identified for the mainly α, mainly β α β classes (PD B codes are given in parentheses). M O LS C RIPT [33] representations the architectures identified fold for the mainly α, β doll α βeffect classes (PD B codes are given in parentheses). Overlap between : themainly Russian (SSAP score > = 70) Each new structure added The recurrence common motifs within many the last by a simple extension superfolds major architectures gives rise to an overlap more pround disturbances to the architecture. This is structures in these regions fold space. This means more pround disturbances to the architecture. This is structures in these regions fold space. This means because, in addition to the ability to exp by adding because, in addition to the ability to exp by adding on similar motifs (which may arise from gene duplica for these architectures it is perhaps more on similar motifs (which may arise from gene duplica for these architectures it is perhaps more tion), layer-based architectures can further accommodate appropriate to consider a continuum protein folds. This tion), layer-based architectures can further accommodate appropriate to consider a continuum protein folds. This evolutionary changes by allowing the layers to slide relais particularly apparent in the layer-based swich archievolutionary changes by allowing the layers to slide relais particularly apparent in the layer-based swich architive to each other. This mechanism would not be availtectures the mainly β α β classes. For example, tive to each other. This mechanism would not be availtectures the mainly β α β classes. For example, able to complex architectures possessing a more diffuse within the α β three-layer doubly wound architectures, it able to complex architectures possessing a more diffuse within the α β three-layer doubly wound architectures, it hydrophobic core. is possible to generate a very large family structures hydrophobic core. is possible to generate a very large family structures using the simple criteria a good structural comparison using the simple criteria a good structural comparison Overlap between fold : the Russian doll effect (SSAP score > = 70) reasonable overlap (> = 60%). Overlap between fold : the Russian doll effect (SSAP score > = 70) reasonable overlap (> = 60%). Each new structure added to a family will be related to the The recurrence common motifs within many the Each new structure added to a family will be related to the The recurrence common motifs within many the last by a simple extension one or more βαβ motifs superfolds major architectures gives rise to an overlap last by a simple extension one or more βαβ motifs superfolds major architectures gives rise to an overlap

7 Atpase 1BMF-Bovine ATPase F1 Chain A all Left hed superhelix P-loop containing nucleoside trip hydrolase Vogal et al: Current Opinion in Structural Biology 04: 14: 8-216

Heteropolymer. Mostly in regular secondary structure

Heteropolymer. Mostly in regular secondary structure Heteropolymer - + + - Mostly in regular secondary structure 1 2 3 4 C >N trace how you go around the helix C >N C2 >N6 C1 >N5 What s the pattern? Ci>Ni+? 5 6 move around not quite 120 "#$%&'!()*(+2!3/'!4#5'!1/,#64!#6!,6!

More information

The CATH Database provides insights into protein structure/function relationships

The CATH Database provides insights into protein structure/function relationships 1999 Oxford University Press Nucleic Acids Research, 1999, Vol. 27, No. 1 275 279 The CATH Database provides insights into protein structure/function relationships C. A. Orengo, F. M. G. Pearl, J. E. Bray,

More information

2MHR. Protein structure classification is important because it organizes the protein structure universe that is independent of sequence similarity.

2MHR. Protein structure classification is important because it organizes the protein structure universe that is independent of sequence similarity. Protein structure classification is important because it organizes the protein structure universe that is independent of sequence similarity. A global picture of the protein universe will help us to understand

More information

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison CMPS 6630: Introduction to Computational Biology and Bioinformatics Structure Comparison Protein Structure Comparison Motivation Understand sequence and structure variability Understand Domain architecture

More information

Protein Structure: Data Bases and Classification Ingo Ruczinski

Protein Structure: Data Bases and Classification Ingo Ruczinski Protein Structure: Data Bases and Classification Ingo Ruczinski Department of Biostatistics, Johns Hopkins University Reference Bourne and Weissig Structural Bioinformatics Wiley, 2003 More References

More information

Procheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics.

Procheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics. Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics Iosif Vaisman Email: ivaisman@gmu.edu ----------------------------------------------------------------- Bond

More information

Physiochemical Properties of Residues

Physiochemical Properties of Residues Physiochemical Properties of Residues Various Sources C N Cα R Slide 1 Conformational Propensities Conformational Propensity is the frequency in which a residue adopts a given conformation (in a polypeptide)

More information

Introduction to Comparative Protein Modeling. Chapter 4 Part I

Introduction to Comparative Protein Modeling. Chapter 4 Part I Introduction to Comparative Protein Modeling Chapter 4 Part I 1 Information on Proteins Each modeling study depends on the quality of the known experimental data. Basis of the model Search in the literature

More information

Supersecondary Structures (structural motifs)

Supersecondary Structures (structural motifs) Supersecondary Structures (structural motifs) Various Sources Slide 1 Supersecondary Structures (Motifs) Supersecondary Structures (Motifs): : Combinations of secondary structures in specific geometric

More information

Protein structure alignments

Protein structure alignments Protein structure alignments Proteins that fold in the same way, i.e. have the same fold are often homologs. Structure evolves slower than sequence Sequence is less conserved than structure If BLAST gives

More information

Packing of Secondary Structures

Packing of Secondary Structures 7.88 Lecture Notes - 4 7.24/7.88J/5.48J The Protein Folding and Human Disease Professor Gossard Retrieving, Viewing Protein Structures from the Protein Data Base Helix helix packing Packing of Secondary

More information

HMM applications. Applications of HMMs. Gene finding with HMMs. Using the gene finder

HMM applications. Applications of HMMs. Gene finding with HMMs. Using the gene finder HMM applications Applications of HMMs Gene finding Pairwise alignment (pair HMMs) Characterizing protein families (profile HMMs) Predicting membrane proteins, and membrane protein topology Gene finding

More information

SCOP. all-β class. all-α class, 3 different folds. T4 endonuclease V. 4-helical cytokines. Globin-like

SCOP. all-β class. all-α class, 3 different folds. T4 endonuclease V. 4-helical cytokines. Globin-like SCOP all-β class 4-helical cytokines T4 endonuclease V all-α class, 3 different folds Globin-like TIM-barrel fold α/β class Profilin-like fold α+β class http://scop.mrc-lmb.cam.ac.uk/scop CATH Class, Architecture,

More information

1. Protein Data Bank (PDB) 1. Protein Data Bank (PDB)

1. Protein Data Bank (PDB) 1. Protein Data Bank (PDB) Protein structure databases; visualization; and classifications 1. Introduction to Protein Data Bank (PDB) 2. Free graphic software for 3D structure visualization 3. Hierarchical classification of protein

More information

09/06/25. Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Non-uniform distribution of folds. Scheme of protein structure predicition

09/06/25. Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Non-uniform distribution of folds. Scheme of protein structure predicition Sequence identity Structural similarity Computergestützte Strukturbiologie (Strukturelle Bioinformatik) Fold recognition Sommersemester 2009 Peter Güntert Structural similarity X Sequence identity Non-uniform

More information

CS612 - Algorithms in Bioinformatics

CS612 - Algorithms in Bioinformatics Fall 2017 Databases and Protein Structure Representation October 2, 2017 Molecular Biology as Information Science > 12, 000 genomes sequenced, mostly bacterial (2013) > 5x10 6 unique sequences available

More information

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748

Giri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748 CAP 5510: Introduction to Bioinformatics Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinfs07.html 2/15/07 CAP5510 1 EM Algorithm Goal: Find θ, Z that maximize Pr

More information

Understanding Sequence, Structure and Function Relationships and the Resulting Redundancy

Understanding Sequence, Structure and Function Relationships and the Resulting Redundancy Understanding Sequence, Structure and Function Relationships and the Resulting Redundancy many slides by Philip E. Bourne Department of Pharmacology, UCSD Agenda Understand the relationship between sequence,

More information

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure

Secondary Structure. Bioch/BIMS 503 Lecture 2. Structure and Function of Proteins. Further Reading. Φ, Ψ angles alone determine protein structure Bioch/BIMS 503 Lecture 2 Structure and Function of Proteins August 28, 2008 Robert Nakamoto rkn3c@virginia.edu 2-0279 Secondary Structure Φ Ψ angles determine protein structure Φ Ψ angles are restricted

More information

Homology and Information Gathering and Domain Annotation for Proteins

Homology and Information Gathering and Domain Annotation for Proteins Homology and Information Gathering and Domain Annotation for Proteins Outline Homology Information Gathering for Proteins Domain Annotation for Proteins Examples and exercises The concept of homology The

More information

Number sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence

Number sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence Number sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence Naoto Morikawa (nmorika@genocript.com) October 7, 2006. Abstract A protein is a sequence

More information

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy

Design of a Novel Globular Protein Fold with Atomic-Level Accuracy Design of a Novel Globular Protein Fold with Atomic-Level Accuracy Brian Kuhlman, Gautam Dantas, Gregory C. Ireton, Gabriele Varani, Barry L. Stoddard, David Baker Presented by Kate Stafford 4 May 05 Protein

More information

Homology. and. Information Gathering and Domain Annotation for Proteins

Homology. and. Information Gathering and Domain Annotation for Proteins Homology and Information Gathering and Domain Annotation for Proteins Outline WHAT IS HOMOLOGY? HOW TO GATHER KNOWN PROTEIN INFORMATION? HOW TO ANNOTATE PROTEIN DOMAINS? EXAMPLES AND EXERCISES Homology

More information

Protein Structure Prediction

Protein Structure Prediction Page 1 Protein Structure Prediction Russ B. Altman BMI 214 CS 274 Protein Folding is different from structure prediction --Folding is concerned with the process of taking the 3D shape, usually based on

More information

Sequence analysis and comparison

Sequence analysis and comparison The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species

More information

Supporting Online Material for

Supporting Online Material for www.sciencemag.org/cgi/content/full/309/5742/1868/dc1 Supporting Online Material for Toward High-Resolution de Novo Structure Prediction for Small Proteins Philip Bradley, Kira M. S. Misura, David Baker*

More information

Protein Structure & Motifs

Protein Structure & Motifs & Motifs Biochemistry 201 Molecular Biology January 12, 2000 Doug Brutlag Introduction Proteins are more flexible than nucleic acids in structure because of both the larger number of types of residues

More information

Analysis and Prediction of Protein Structure (I)

Analysis and Prediction of Protein Structure (I) Analysis and Prediction of Protein Structure (I) Jianlin Cheng, PhD School of Electrical Engineering and Computer Science University of Central Florida 2006 Free for academic use. Copyright @ Jianlin Cheng

More information

Protein Structure. Role of (bio)informatics in drug discovery. Bioinformatics

Protein Structure. Role of (bio)informatics in drug discovery. Bioinformatics Bioinformatics Protein Structure Principles & Architecture Marjolein Thunnissen Dep. of Biochemistry & Structural Biology Lund University September 2011 Homology, pattern and 3D structure searches need

More information

D Dobbs ISU - BCB 444/544X 1

D Dobbs ISU - BCB 444/544X 1 11/7/05 Protein Structure: Classification, Databases, Visualization Announcements BCB 544 Projects - Important Dates: Nov 2 Wed noon - Project proposals due to David/Drena Nov 4 Fri PM - Approvals/responses

More information

Review. Membrane proteins. Membrane transport

Review. Membrane proteins. Membrane transport Quiz 1 For problem set 11 Q1, you need the equation for the average lateral distance transversed (s) of a molecule in the membrane with respect to the diffusion constant (D) and time (t). s = (4 D t) 1/2

More information

Basics of protein structure

Basics of protein structure Today: 1. Projects a. Requirements: i. Critical review of one paper ii. At least one computational result b. Noon, Dec. 3 rd written report and oral presentation are due; submit via email to bphys101@fas.harvard.edu

More information

Basic structures of proteins

Basic structures of proteins Basic structures of proteins Structural Hierarchy of Protein Primary structure Functional elements : α-helix, strands, β-sheet, loops.. - Structure, affinity, activity, specificity, stability etc. Secondary

More information

Major Types of Association of Proteins with Cell Membranes. From Alberts et al

Major Types of Association of Proteins with Cell Membranes. From Alberts et al Major Types of Association of Proteins with Cell Membranes From Alberts et al Proteins Are Polymers of Amino Acids Peptide Bond Formation Amino Acid central carbon atom to which are attached amino group

More information

Advanced Certificate in Principles in Protein Structure. You will be given a start time with your exam instructions

Advanced Certificate in Principles in Protein Structure. You will be given a start time with your exam instructions BIRKBECK COLLEGE (University of London) Advanced Certificate in Principles in Protein Structure MSc Structural Molecular Biology Date: Thursday, 1st September 2011 Time: 3 hours You will be given a start

More information

Genome Databases The CATH database

Genome Databases The CATH database Genome Databases The CATH database Michael Knudsen 1 and Carsten Wiuf 1,2* 1 Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark 2 Centre for Membrane Pumps in Cells and Disease

More information

CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Giri Narasimhan

CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools. Giri Narasimhan CAP 5510: Introduction to Bioinformatics CGS 5166: Bioinformatics Tools Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinff18.html Proteins and Protein Structure

More information

A General Model for Amino Acid Interaction Networks

A General Model for Amino Acid Interaction Networks Author manuscript, published in "N/P" A General Model for Amino Acid Interaction Networks Omar GACI and Stefan BALEV hal-43269, version - Nov 29 Abstract In this paper we introduce the notion of protein

More information

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES

Protein Structure. W. M. Grogan, Ph.D. OBJECTIVES Protein Structure W. M. Grogan, Ph.D. OBJECTIVES 1. Describe the structure and characteristic properties of typical proteins. 2. List and describe the four levels of structure found in proteins. 3. Relate

More information

BIRKBECK COLLEGE (University of London)

BIRKBECK COLLEGE (University of London) BIRKBECK COLLEGE (University of London) SCHOOL OF BIOLOGICAL SCIENCES M.Sc. EXAMINATION FOR INTERNAL STUDENTS ON: Postgraduate Certificate in Principles of Protein Structure MSc Structural Molecular Biology

More information

Amino Acid Structures from Klug & Cummings. 10/7/2003 CAP/CGS 5991: Lecture 7 1

Amino Acid Structures from Klug & Cummings. 10/7/2003 CAP/CGS 5991: Lecture 7 1 Amino Acid Structures from Klug & Cummings 10/7/2003 CAP/CGS 5991: Lecture 7 1 Amino Acid Structures from Klug & Cummings 10/7/2003 CAP/CGS 5991: Lecture 7 2 Amino Acid Structures from Klug & Cummings

More information

Protein Bioinformatics. Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet sandberg.cmb.ki.

Protein Bioinformatics. Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet sandberg.cmb.ki. Protein Bioinformatics Rickard Sandberg Dept. of Cell and Molecular Biology Karolinska Institutet rickard.sandberg@ki.se sandberg.cmb.ki.se Outline Protein features motifs patterns profiles signals 2 Protein

More information

Bioinformatics. Macromolecular structure

Bioinformatics. Macromolecular structure Bioinformatics Macromolecular structure Contents Determination of protein structure Structure databases Secondary structure elements (SSE) Tertiary structure Structure analysis Structure alignment Domain

More information

Chapter 5. Proteomics and the analysis of protein sequence Ⅱ

Chapter 5. Proteomics and the analysis of protein sequence Ⅱ Proteomics Chapter 5. Proteomics and the analysis of protein sequence Ⅱ 1 Pairwise similarity searching (1) Figure 5.5: manual alignment One of the amino acids in the top sequence has no equivalent and

More information

Study of Mining Protein Structural Properties and its Application

Study of Mining Protein Structural Properties and its Application Study of Mining Protein Structural Properties and its Application A Dissertation Proposal Presented to the Department of Computer Science and Information Engineering College of Electrical Engineering and

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION Supplementary Results DNA binding property of the SRA domain was examined by an electrophoresis mobility shift assay (EMSA) using synthesized 12-bp oligonucleotide duplexes containing unmodified, hemi-methylated,

More information

CSCE555 Bioinformatics. Protein Function Annotation

CSCE555 Bioinformatics. Protein Function Annotation CSCE555 Bioinformatics Protein Function Annotation Why we need to do function annotation? Fig from: Network-based prediction of protein function. Molecular Systems Biology 3:88. 2007 What s function? The

More information

Bio 1B Lecture Outline (please print and bring along) Fall, 2007

Bio 1B Lecture Outline (please print and bring along) Fall, 2007 Bio 1B Lecture Outline (please print and bring along) Fall, 2007 B.D. Mishler, Dept. of Integrative Biology 2-6810, bmishler@berkeley.edu Evolution lecture #5 -- Molecular genetics and molecular evolution

More information

Comparing Protein Structures. Why?

Comparing Protein Structures. Why? 7.91 Amy Keating Comparing Protein Structures Why? detect evolutionary relationships identify recurring motifs detect structure/function relationships predict function assess predicted structures classify

More information

Supplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution

Supplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution Supplemental Materials for Structural Diversity of Protein Segments Follows a Power-law Distribution Yoshito SAWADA and Shinya HONDA* National Institute of Advanced Industrial Science and Technology (AIST),

More information

Protein Structure. Hierarchy of Protein Structure. Tertiary structure. independently stable structural unit. includes disulfide bonds

Protein Structure. Hierarchy of Protein Structure. Tertiary structure. independently stable structural unit. includes disulfide bonds Protein Structure Hierarchy of Protein Structure 2 3 Structural element Primary structure Secondary structure Super-secondary structure Domain Tertiary structure Quaternary structure Description amino

More information

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction

CMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction CMPS 6630: Introduction to Computational Biology and Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the

More information

B O C 4 H 2 O O. NOTE: The reaction proceeds with a carbonium ion stabilized on the C 1 of sugar A.

B O C 4 H 2 O O. NOTE: The reaction proceeds with a carbonium ion stabilized on the C 1 of sugar A. hbcse 33 rd International Page 101 hemistry lympiad Preparatory 05/02/01 Problems d. In the hydrolysis of the glycosidic bond, the glycosidic bridge oxygen goes with 4 of the sugar B. n cleavage, 18 from

More information

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013

Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India. 1 st November, 2013 Hydration of protein-rna recognition sites Ranjit P. Bahadur Assistant Professor Department of Biotechnology Indian Institute of Technology Kharagpur, India 1 st November, 2013 Central Dogma of life DNA

More information

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University

COMP 598 Advanced Computational Biology Methods & Research. Introduction. Jérôme Waldispühl School of Computer Science McGill University COMP 598 Advanced Computational Biology Methods & Research Introduction Jérôme Waldispühl School of Computer Science McGill University General informations (1) Office hours: by appointment Office: TR3018

More information

Orientational degeneracy in the presence of one alignment tensor.

Orientational degeneracy in the presence of one alignment tensor. Orientational degeneracy in the presence of one alignment tensor. Rotation about the x, y and z axes can be performed in the aligned mode of the program to examine the four degenerate orientations of two

More information

Introduction to Evolutionary Concepts

Introduction to Evolutionary Concepts Introduction to Evolutionary Concepts and VMD/MultiSeq - Part I Zaida (Zan) Luthey-Schulten Dept. Chemistry, Beckman Institute, Biophysics, Institute of Genomics Biology, & Physics NIH Workshop 2009 VMD/MultiSeq

More information

Translation. A ribosome, mrna, and trna.

Translation. A ribosome, mrna, and trna. Translation The basic processes of translation are conserved among prokaryotes and eukaryotes. Prokaryotic Translation A ribosome, mrna, and trna. In the initiation of translation in prokaryotes, the Shine-Dalgarno

More information

Protein Science (1997), 6: Cambridge University Press. Printed in the USA. Copyright 1997 The Protein Society

Protein Science (1997), 6: Cambridge University Press. Printed in the USA. Copyright 1997 The Protein Society 1 of 5 1/30/00 8:08 PM Protein Science (1997), 6: 246-248. Cambridge University Press. Printed in the USA. Copyright 1997 The Protein Society FOR THE RECORD LPFC: An Internet library of protein family

More information

DATE A DAtabase of TIM Barrel Enzymes

DATE A DAtabase of TIM Barrel Enzymes DATE A DAtabase of TIM Barrel Enzymes 2 2.1 Introduction.. 2.2 Objective and salient features of the database 2.2.1 Choice of the dataset.. 2.3 Statistical information on the database.. 2.4 Features....

More information

Bioinformatics. Proteins II. - Pattern, Profile, & Structure Database Searching. Robert Latek, Ph.D. Bioinformatics, Biocomputing

Bioinformatics. Proteins II. - Pattern, Profile, & Structure Database Searching. Robert Latek, Ph.D. Bioinformatics, Biocomputing Bioinformatics Proteins II. - Pattern, Profile, & Structure Database Searching Robert Latek, Ph.D. Bioinformatics, Biocomputing WIBR Bioinformatics Course, Whitehead Institute, 2002 1 Proteins I.-III.

More information

Genomic and structural aspects of protein evolution

Genomic and structural aspects of protein evolution www.biochemj.org Biochem. J. (2009) 419, 15 28 (Printed in Great Britain) doi:10.1042/bj20090122 15 REVIEW ARTICLE Genomic and structural aspects of protein evolution Cyrus CHOTHIA* and Julian GOUGH *MRC

More information

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability

Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Lecture 2 and 3: Review of forces (ctd.) and elementary statistical mechanics. Contributions to protein stability Part I. Review of forces Covalent bonds Non-covalent Interactions: Van der Waals Interactions

More information

Protein structure analysis. Risto Laakso 10th January 2005

Protein structure analysis. Risto Laakso 10th January 2005 Protein structure analysis Risto Laakso risto.laakso@hut.fi 10th January 2005 1 1 Summary Various methods of protein structure analysis were examined. Two proteins, 1HLB (Sea cucumber hemoglobin) and 1HLM

More information

CMPS 3110: Bioinformatics. Tertiary Structure Prediction

CMPS 3110: Bioinformatics. Tertiary Structure Prediction CMPS 3110: Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the laws of physics! Conformation space is finite

More information

Advanced Topics in RNA and DNA. DNA Microarrays Aptamers

Advanced Topics in RNA and DNA. DNA Microarrays Aptamers Quiz 1 Advanced Topics in RNA and DNA DNA Microarrays Aptamers 2 Quantifying mrna levels to asses protein expression 3 The DNA Microarray Experiment 4 Application of DNA Microarrays 5 Some applications

More information

CAP 5510 Lecture 3 Protein Structures

CAP 5510 Lecture 3 Protein Structures CAP 5510 Lecture 3 Protein Structures Su-Shing Chen Bioinformatics CISE 8/19/2005 Su-Shing Chen, CISE 1 Protein Conformation 8/19/2005 Su-Shing Chen, CISE 2 Protein Conformational Structures Hydrophobicity

More information

Peptides And Proteins

Peptides And Proteins Kevin Burgess, May 3, 2017 1 Peptides And Proteins from chapter(s) in the recommended text A. Introduction B. omenclature And Conventions by amide bonds. on the left, right. 2 -terminal C-terminal triglycine

More information

Research Proposal. Title: Multiple Sequence Alignment used to investigate the co-evolving positions in OxyR Protein family.

Research Proposal. Title: Multiple Sequence Alignment used to investigate the co-evolving positions in OxyR Protein family. Research Proposal Title: Multiple Sequence Alignment used to investigate the co-evolving positions in OxyR Protein family. Name: Minjal Pancholi Howard University Washington, DC. June 19, 2009 Research

More information

ALL LECTURES IN SB Introduction

ALL LECTURES IN SB Introduction 1. Introduction 2. Molecular Architecture I 3. Molecular Architecture II 4. Molecular Simulation I 5. Molecular Simulation II 6. Bioinformatics I 7. Bioinformatics II 8. Prediction I 9. Prediction II ALL

More information

Chapter

Chapter Chapter 17 17.4-17.6 Molecular Components of Translation A cell interprets a genetic message and builds a polypeptide The message is a series of codons on mrna The interpreter is called transfer (trna)

More information

Supplementary Figure 1. Aligned sequences of yeast IDH1 (top) and IDH2 (bottom) with isocitrate

Supplementary Figure 1. Aligned sequences of yeast IDH1 (top) and IDH2 (bottom) with isocitrate SUPPLEMENTARY FIGURE LEGENDS Supplementary Figure 1. Aligned sequences of yeast IDH1 (top) and IDH2 (bottom) with isocitrate dehydrogenase from Escherichia coli [ICD, pdb 1PB1, Mesecar, A. D., and Koshland,

More information

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1

Supporting information to: Time-resolved observation of protein allosteric communication. Sebastian Buchenberg, Florian Sittel and Gerhard Stock 1 Supporting information to: Time-resolved observation of protein allosteric communication Sebastian Buchenberg, Florian Sittel and Gerhard Stock Biomolecular Dynamics, Institute of Physics, Albert Ludwigs

More information

Central Dogma. modifications genome transcriptome proteome

Central Dogma. modifications genome transcriptome proteome entral Dogma DA ma protein post-translational modifications genome transcriptome proteome 83 ierarchy of Protein Structure 20 Amino Acids There are 20 n possible sequences for a protein of n residues!

More information

GENETICS - CLUTCH CH.11 TRANSLATION.

GENETICS - CLUTCH CH.11 TRANSLATION. !! www.clutchprep.com CONCEPT: GENETIC CODE Nucleotides and amino acids are translated in a 1 to 1 method The triplet code states that three nucleotides codes for one amino acid - A codon is a term for

More information

Large-Scale Genomic Surveys

Large-Scale Genomic Surveys Bioinformatics Subtopics Fold Recognition Secondary Structure Prediction Docking & Drug Design Protein Geometry Structural Informatics Homology Modeling Sequence Alignment Structure Classification Gene

More information

Week 10: Homology Modelling (II) - HHpred

Week 10: Homology Modelling (II) - HHpred Week 10: Homology Modelling (II) - HHpred Course: Tools for Structural Biology Fabian Glaser BKU - Technion 1 2 Identify and align related structures by sequence methods is not an easy task All comparative

More information

Protein Structure Prediction and Display

Protein Structure Prediction and Display Protein Structure Prediction and Display Goal Take primary structure (sequence) and, using rules derived from known structures, predict the secondary structure that is most likely to be adopted by each

More information

Properties of amino acids in proteins

Properties of amino acids in proteins Properties of amino acids in proteins one of the primary roles of DNA (but not the only one!) is to code for proteins A typical bacterium builds thousands types of proteins, all from ~20 amino acids repeated

More information

Model Mélange. Physical Models of Peptides and Proteins

Model Mélange. Physical Models of Peptides and Proteins Model Mélange Physical Models of Peptides and Proteins In the Model Mélange activity, you will visit four different stations each featuring a variety of different physical models of peptides or proteins.

More information

Sequences, Structures, and Gene Regulatory Networks

Sequences, Structures, and Gene Regulatory Networks Sequences, Structures, and Gene Regulatory Networks Learning Outcomes After this class, you will Understand gene expression and protein structure in more detail Appreciate why biologists like to align

More information

EBI web resources II: Ensembl and InterPro

EBI web resources II: Ensembl and InterPro EBI web resources II: Ensembl and InterPro Yanbin Yin http://www.ebi.ac.uk/training/online/course/ 1 Homework 3 Go to http://www.ebi.ac.uk/interpro/training.htmland finish the second online training course

More information

Details of Protein Structure

Details of Protein Structure Details of Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Anne Mølgaard, Kemisk Institut, Københavns Universitet Learning Objectives

More information

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB

Homology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB Homology Modeling (Comparative Structure Modeling) Aims of Structural Genomics High-throughput 3D structure determination and analysis To determine or predict the 3D structures of all the proteins encoded

More information

From gene to protein. Premedical biology

From gene to protein. Premedical biology From gene to protein Premedical biology Central dogma of Biology, Molecular Biology, Genetics transcription replication reverse transcription translation DNA RNA Protein RNA chemically similar to DNA,

More information

Structure to Function. Molecular Bioinformatics, X3, 2006

Structure to Function. Molecular Bioinformatics, X3, 2006 Structure to Function Molecular Bioinformatics, X3, 2006 Structural GeNOMICS Structural Genomics project aims at determination of 3D structures of all proteins: - organize known proteins into families

More information

Structural Alignment of Proteins

Structural Alignment of Proteins Goal Align protein structures Structural Alignment of Proteins 1 2 3 4 5 6 7 8 9 10 11 12 13 14 PHE ASP ILE CYS ARG LEU PRO GLY SER ALA GLU ALA VAL CYS PHE ASN VAL CYS ARG THR PRO --- --- --- GLU ALA ILE

More information

Motifs, Profiles and Domains. Michael Tress Protein Design Group Centro Nacional de Biotecnología, CSIC

Motifs, Profiles and Domains. Michael Tress Protein Design Group Centro Nacional de Biotecnología, CSIC Motifs, Profiles and Domains Michael Tress Protein Design Group Centro Nacional de Biotecnología, CSIC Comparing Two Proteins Sequence Alignment Determining the pattern of evolution and identifying conserved

More information

B. β Structure. All contents of this document, unless otherwise noted, are David C. & Jane S. Richardson. All Rights Reserved.

B. β Structure. All contents of this document, unless otherwise noted, are David C. & Jane S. Richardson. All Rights Reserved. B. β Structure The other major structural element found in globular proteins is the β sheet. Historically, it was first observed as the β, or extended, form of keratin fibers. An approximate understanding

More information

CONCEPT OF SEQUENCE COMPARISON. Natapol Pornputtapong 18 January 2018

CONCEPT OF SEQUENCE COMPARISON. Natapol Pornputtapong 18 January 2018 CONCEPT OF SEQUENCE COMPARISON Natapol Pornputtapong 18 January 2018 SEQUENCE ANALYSIS - A ROSETTA STONE OF LIFE Sequence analysis is the process of subjecting a DNA, RNA or peptide sequence to any of

More information

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment

Algorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment Algorithms in Bioinformatics FOUR Sami Khuri Department of Computer Science San José State University Pairwise Sequence Alignment Homology Similarity Global string alignment Local string alignment Dot

More information

Introduction to" Protein Structure

Introduction to Protein Structure Introduction to" Protein Structure Function, evolution & experimental methods Thomas Blicher, Center for Biological Sequence Analysis Learning Objectives Outline the basic levels of protein structure.

More information

Chapter 2 Structures. 2.1 Introduction Storing Protein Structures The PDB File Format

Chapter 2 Structures. 2.1 Introduction Storing Protein Structures The PDB File Format Chapter 2 Structures 2.1 Introduction The three-dimensional (3D) structure of a protein contains a lot of information on its function, and can be used for devising ways of modifying it (propose mutants,

More information

Homology Modeling. Roberto Lins EPFL - summer semester 2005

Homology Modeling. Roberto Lins EPFL - summer semester 2005 Homology Modeling Roberto Lins EPFL - summer semester 2005 Disclaimer: course material is mainly taken from: P.E. Bourne & H Weissig, Structural Bioinformatics; C.A. Orengo, D.T. Jones & J.M. Thornton,

More information

Bahnson Biochemistry Cume, April 8, 2006 The Structural Biology of Signal Transduction

Bahnson Biochemistry Cume, April 8, 2006 The Structural Biology of Signal Transduction Name page 1 of 6 Bahnson Biochemistry Cume, April 8, 2006 The Structural Biology of Signal Transduction Part I. The ion Ca 2+ can function as a 2 nd messenger. Pick a specific signal transduction pathway

More information

Protein Folds, Functions and Evolution

Protein Folds, Functions and Evolution Article No. jmbi.1999.3054 available online at http://www.idealibrary.com on J. Mol. Biol. (1999) 293, 333±342 Protein Folds, Functions and Evolution Janet M. Thornton 1,2 *, Christine A. Orengo 1, Annabel

More information

Gene regulation I Biochemistry 302. Bob Kelm February 25, 2005

Gene regulation I Biochemistry 302. Bob Kelm February 25, 2005 Gene regulation I Biochemistry 302 Bob Kelm February 25, 2005 Principles of gene regulation (cellular versus molecular level) Extracellular signals Chemical (e.g. hormones, growth factors) Environmental

More information

PHYLOGENY AND SYSTEMATICS

PHYLOGENY AND SYSTEMATICS AP BIOLOGY EVOLUTION/HEREDITY UNIT Unit 1 Part 11 Chapter 26 Activity #15 NAME DATE PERIOD PHYLOGENY AND SYSTEMATICS PHYLOGENY Evolutionary history of species or group of related species SYSTEMATICS Study

More information

Objective: Students will be able identify peptide bonds in proteins and describe the overall reaction between amino acids that create peptide bonds.

Objective: Students will be able identify peptide bonds in proteins and describe the overall reaction between amino acids that create peptide bonds. Scott Seiple AP Biology Lesson Plan Lesson: Primary and Secondary Structure of Proteins Purpose:. To understand how amino acids can react to form peptides through peptide bonds.. Students will be able

More information

Prediction and refinement of NMR structures from sparse experimental data

Prediction and refinement of NMR structures from sparse experimental data Prediction and refinement of NMR structures from sparse experimental data Jeff Skolnick Director Center for the Study of Systems Biology School of Biology Georgia Institute of Technology Overview of talk

More information