Molecular Modeling lecture 4. Rotation Least-squares Superposition Structure-based alignment algorithms
|
|
- Baldwin Johnston
- 5 years ago
- Views:
Transcription
1 Molecular Modeling lecture 4 Rotation Least-squares Superposition Structure-based alignment algorithms
2 Rotation is addition in polar coordinates 2
3 What happens when you move the mouse to rotate a molecule? Mouse sends mouse coordinates (Δx,Δy) to the running program Rotation angles are calculated: θ x = Δx*scale, θ y = Δy*scale Rotation matrices are calculated: y x R x = [ ] cos θ x -sin θ x 0 sin θ x cos θ x
4 What happens when you move the mouse (cont'd): 4. New atom coordinates are calculated v = RyRxv 5. The scene is rendered using the new coordinates.
5 Rotation is angular addition y atom starts at v=(x= rcosα, y= rsinα) axis of rotation = Cartesian origin r β (x,y ) α (x,y) x rotates around origin to v =(x =rcos(α+β), y'=rsin(α+β)) Angles are measured counter-clockwise (right-handed).
6 Sum of angles formuli cos (α+β) = cos α cos β sin α sin β sin (α+β) = sin α cos β + sin β cos α
7 A rotation matrix x = rcos α y (x,y ) y = rsin α β (x,y) r α x x' = r cos (α+β) = r (cos α cos β sin α sin β) = (r cos α) cos β (r sin α)sin β = x cos β y sin β y' = r sin (α+β) = r (sin α cos β + sin β cos α) = (r sin α) cos β + (r cos α) sin β = y cos β + x sin β! # " x' $! & = cosβ y' # % " sinβ sin β $! & rcosα $! # & = cos β cos β %" rsinα # % " sin β sin β $! cosβ % & # x$ " y & % rotation matrix is the same for any r, any α.
8 rotations around principal axes The Z coordinate stays the same. X and Y change. cos β -sin β 0 Rz = The Y coordinate stays the same. X and Z change. cos γ 0 sin γ Ry = sin β cos β sin γ 0 cos γ The X coordinate stays the same. Y and Z change R x = 0 cos α -sin α 0 sin α cos α
9 A series of rotations is the product of rotation matrices. cos β -sin β 0 cos γ 0 sin γ cos β cos γ -sin β cos β sin β cos β = sin β cos γ cos β -sin β sin γ sin γ 0 cos γ sin γ 0 cos γ Rotation around z Rotation around y 3D rotation This is a Y rotation followed by a Z rotation
10 multiplication order is right to left. X-rotation α degrees, followed by Y-rotation γ degrees cos γ 0 sin γ sin γ 0 cos γ cos α -sin α 0 sin α cos α R y R x = = cos γ sin α sin γ cos α sin γ 0 cos α -sin α -sin γ sin α cos γ cos α cos γ Y-rotation γ degrees, followed by X-rotation α degrees cos α -sin α 0 sin α cos α cos γ 0 sin γ sin γ 0 cos γ xr y = = cos γ 0 sin γ sin α sin γ cos α -cos γ sin α -cos α sin γ sin α cos α cos γ opposite order gives a different result.
11 Transposing the matrix reverses the rotation For the opposite rotation, flip the matrix. This is the transpose x y = A C B D cos β sin β -sin β cos β The inverse matrix = The transposed matrix, because = A B C D T x y cos β sin β -sin β cos β cos β -sin β sin β cos β =
12 Right-handed 90 principle axis rotations: 90 rotation around X 90 rotation around Y 90 rotation around Z Helpful hint: For a R-handed rotation, the -sine is up and to the right of the +sine. z y x y x z z x y
13 Exercise 3.1: rotate a point (x,y,z) = (1., 4., 7.) Rotate this point by 90 around the Z-axis Then... Rotate the new point by 90 around the Y-axis. What are the new coordinates?
14 supplementary slide: Special properties of rotation matrices They are square, 2x2 or 3x3(higher dimensions in principle) The product of any two rotation matrices is a rotation matrix. The inverse equals the transpose, R -1 = R T Every row/column is a unit vector. Any two rows/columns are orthogonal vectors. The cross-product of any two rows equals the third. x = Rx, where R is a rotation matrix. Read more about rotation matrices at: mathworld.wolfram.com/rotationmatrix.html
15 Least Squares Superposition minimizes the root mean squared deviation in aligned atoms 15
16 Exercise 3.2: Superimpose by hand Do these pairs: 1N9U vs 1N9V 1WFA (chain A vs chain B) File Open: RCSB PDB: code: xxxx in-class Delete parts not needed (e.g. waters). Select Chain label in SEQ. R-mouse/Delete. Hide All atoms Ribbon Style: oval, Color: chain or terminus Select synchronize (always check synchronize) SEQ (opens the sequence window) Double-click on chain label to select one molecule. In MOE window: Rotate selected : meta-middlemouse-drag. Translate selected : shift-meta-middlemouse-drag Rotate all: middlemouse-drag Translate all: shift-middlemouse-drag Until molecules are superposed. have the instructor check you off.
17 RMSD Root Mean Square Deviation in superimposed coordinates is the standard measure of structural difference. Σ(v 1 i - v 2 i) 2 i=1,n N Where v 1 i and v 2 i are the equivalent* coordinates from molecules 1 and 2, respectively. *Equivalent as defined by an alignment.
18 Least squares superposition Problem: find the rotation matrix, M, and a vector, v, that minimize the following quantity: M x i + v y i 2 i Where x i and y i are the equivalent coordinates from molecules 1 and 2, respectively.
19 Alignment defines structurally equivalent positions. Only aligned positions are included in the RMSD calculation. 4DFR:A ISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLDKPVIMGRHTWESIG-RPLPGRKNI 1DFR:_ TAFLWAQNRNGLIGKDGHLPWHLPDDLHYFRAQTVGKIMVVGRRTYESFPKRPLPERTNV 4DFR:A ILSSQ-PGTDDRVTWVKSVDEAIAAC--GDVPEIMVIGGGRVYEQFLPKAQKLYLTHIDA 1DFR:_ VLTHQEDYQAQGAVVVHDVAAVFAYAKQHLDQELVIAGGAQIFTAFKDDVDTLLVTRLAG 4DFR:A EVEGDTHFPDYEPDDWESVFSEFHDADAQNS--HSYCFKILERR 1DFR:_ SFEGDTKMIPLNWDDFTKVSSRTVEDT---NPALTHTYEVWQKK Unaligned positions are not.
20 Finding structural equivalences by linear Least squares (1) At the position of best superposition, we have an approximate equality: (2) We can eliminate v by translating the center of mass of both molecules to the origin. The equation simplifies to: We have one equation (i) for each atom, M has 9 unknowns. Mx i + v y i Mx' i y' i If there are more equations than unknowns, there is a unique solution.
21 Least squares Least squares solves a set of a linear equations : x 11 a 1 +x 12 a 2 + x 1N a N = y 1 x 21 a 1 +x 22 a 2 + x 2N a N = y 2 x M1 a 1 +x M2 a 2 + x MN a N = y M This is 'shorthand' notation for the linear equations. knowns unknowns a x = y knowns matrix vector vector
22 x a Least squares, continued = y Green elements are known. Orange are unknown. Fat Rectangles are matrices. Thin rectangles are vectors. x T a = x x T y Multiply both sides by transpose of x. "Squaring" x T x a = x T y "Squared" matrix can be inverted. (We can use the "LU decomposition".) (x T x) -1 x T x a = (xt x) -1 x T y Multiplying both sides by the inverse of "squared" matrix solves for a. a = (x T x) -1 x T y Summary: a = (x T x) -1 x T y
23 least-squares superimposed molecules
24 Structure-based alignment algorithms use structure to define the sequence alignment 24
25 Chicken/Egg Least squares superposition defines the alignment. The alignment defines the least squares superposition. 25
26 Structural alignment algorithm types Alignment algorithms create a one-to-one mapping of subset(s) of one sequence to subset(s) of another sequence. Structure-based alignment types: Geometric--intermolecular Algorithms may be do this by minimizing the intermolecular distances or rootmean-square deviation (rmsd) in superimposed alpha-carbon positions. Geometric--intramolecular Algorithms minimize the difference between aligned contact maps or distance matrices. Intramolecular distances are used Non-Geometric Algorithms align structural properties, such as %buried, or secondary structure type, usually using dynamics programming (DP)
27 Structure-based alignment tools DALI Programs VAST CE FATCAT MAMMOTH PRiSM *SCALI *SARF *FlexSNAP Databases FSSP HOMSTRAD COMPASS PALI SSAP SuperPose *SARF, SCALI, FlexSNAP do non-sequential alignment.
28 Real (meaningful) structural alignment Aligned residues have same secondary (local) structure. Pairs of aligned residues have the same contacts (either both in contact or both not in contact) Erroneous structural alignment Aligned residues are simply superimposed in space. Are not the same secondary structure. Do not have the same contacts.
29 Structure-based alignment can identify remote homologs In general, the number of remote homologs exceeds the number of close homologs. But sequence similarity is useless below 25% identity likelihood the "twilight zone" percent identity for structural homologs
30 Example of structural homologs (analogs) 4DFR: Dihydrofolate reductase 1YAC: Octameric Hydrolase Of Unknown Specificity 5.9% sequence identity (best alignment) 1YAC structure solved without knowing function. Alignment to 4DFR and others implies it is a hydrolase of some sort, probably uses NAD cofactors.
31 Viewing structural homologs (analogs) Sometimes you can see the structural similarity by viewing secondary structure layers. DHFR in yellow and orange. YAC in green and purple sheets only helices only
32 DALI: a intramolecular geometric structural alignment algorithm DALI: (Distance matrix-based ALIgnment) Liisa Holm & Chris Sander (1) Generate a distance matrix for each protein The distance matrix contains all pairwise distances.(symmetrical) j i D ij = distance between alpha carbon i and alpha carbon j Geometric--intramolecular
33 Aligning two distance matrices Cut-andpaste alignment of distance matrices Resulting sequence alignment
34 DALI algorithm S = L i =1 L j =1 DALI maximizes S. φ R ( i, j) where, φ R ( i, j) = θ R d A dij A - d B kl ij θ R is just a constant. dij A is the distance from residue i to residue j, and dkl B is the distance from residue k to residue l. i is aligned to k and j is aligned to l. S is maximal when d ij A = d kl B everywhere. The alignment is the independent variable, i.e. the search space.
35 DALI algorithm: Get a square from structure A Search alignment space using distance matrices. Randomly align with structure B Keep the best S scores vs send high scores to pairs list Each pair of 6x6 s corresponds to a gapped alignment
36 DALI algorithm DALI algorithm: select a square from the pairs list 1st pair 4dfr Axes = sequence position shade=distance, shorter distances are lighter shade. 1yac
37 DALI algorithm add a square, evaluate cross blocks 2nd pair 1st pair 4dfr cross blocks Calculate S. Better: Keep. Worse: Reject. 1yac
38 DALI alignment output
39 Uses of structural alignment in modeling A structure-based alignment is the Gold Standard for a sequence alignment. Aligned structures tell you where structurally conserved regions are, versus where insertions/deletions are allowed. Structural analogs provide a source of plausible loop structures. Multiple aligned structures show evolutionary plasticity.
40 Exercise 3.3: MOE Superpose Do these pairs: 2ptl.pdb 2gb1.pdb 3sdh.pdb 1h97.pdb 3sdh.pdb 1phn.pdb File Open: RCSB PDB: code: xxxx Delete extra copies if there are multiple chains. Hide All atoms Ribbon Style: oval, Color: terminus Select synchronize (check) Protein Superpose Try different settings. Check RMSD. Take-home part!
41 Review questions How do you multiply a matrix by a vector? Rotate the vector (1,2,3) by 90 around Z. How is the Z axis related to the X and Y axes? Can I apply rotations in any order and get the same result? What does RMSD stand for? Why do we need an alignment to calculate RMSD? What is minimized in least-squares superposition? Is least squares optimal or heuristic? What is the output of a structure-based alignment? Is structure-based alignment optimal or heuristic? What kind of structure-based alignment algorithm is DALI? What two properties should be conserved between structurally aligned positions? What are protein analogs? What does DALI maximize? What is the search space of DALI? 41
42 Supplementary slides 42
43 supplementary slides: Principal axes unit vectors are matrix columns. ( a b c d e f g h i )x = a d g rotated x axis = b e h rotated y axis = c f i rotated z axis You can create a rotation matrix by defining three mutually orthogonal unit vectors, then lining them up side-by-side in a 3x3 matrix. 43
44 supplementary slides: two 3D rotation conventions: Euler angles, α β γ axis of z x z rotation: # cosγ sinγ 0& # &# cosα sinα 0& % (% (% ( % sinγ cosγ 0( % 0 cos β sinβ( % sinα cosα 0( % (% (% ( Order of $ 0 0 1' $ 0 sinβ cos β ' $ 0 0 1' rotations: Each rotation is around a principle axis. z Polar angles, φψκ y z -y -z # cosφ sinφ 0& # cosϕ 0 sinϕ& # cosκ sinκ 0& # cosϕ 0 sinϕ &# cosφ sinφ 0& % (% (% (% (% ( % sinφ cosφ 0( % (% sinκ cosκ 0( % (% sinφ cosφ 0( % (% (% (% (% ( $ 0 0 1' $ sinϕ 0 cosϕ ' $ 0 0 1' $ sinϕ 0 cosϕ' $ 0 0 1' Net rotation = κ, around an axis axis defined by φ and ψ
45 supplementary slides: Polar angle convention: # cosφ sinφ 0& # cosϕ 0 sinϕ& # cosκ sinκ 0& # cosϕ 0 sinϕ &# cosφ sinφ 0& % (% (% (% (% ( % sinφ cosφ 0( % (% sinκ cosκ 0( % (% sinφ cosφ 0( % (% (% (% (% ( $ 0 0 1' $ sinϕ 0 cosϕ ' $ 0 0 1' $ sinϕ 0 cosϕ' $ 0 0 1' z = north pole ψ κ y φ x = prime equator Rotation of κ degrees around an axis axis located at φ degrees longitude and ψ degrees latitude. Rotation order is Z(-φ), Y(-ψ), Z(κ), Y(ψ), Z(φ) Notice the nested rotations.
46 SSAP alignment A View is the set of all vectors from one residue. Each residue has its own "View", which is a set of vectors to nearest neighbor residues. Algorithm type: Geometric--intramolecular
47 SSAP alignment: views i and j must have similar backbone angles, otherwise the score is zero. View for Template residue i View for Target residue j j The difference between the two views is a measure of how similar the structures are, when viewed from i and j. i residue level score matrix
48 SSAP algorithm: Double Dynamic Programming residue level scoring matrix DP1 summary matrix DP2 For each ij pair, we find the best DP alignment that includes ij. Keep the DP score at position (i,j) in teh summary matrix. The summary matrix is subjected to a second round of DP, to give the optimal alignment.
49 HOW to use CE to find and align structural homologs Go to Find structural alignments by selecting from ALL or REPRESENTATIVES from the PDB. Submit your protein and chain. Or use 4dfr:A Select 2 structures. Then hit: Get alignment (or use 4DFR and 1YAC) Download as PDB file. Save it. For use in MOE, divide the file into 2, one for each protein.
50 CE alignment
51 Alignment of15 analogs helix sheet
52 Two other servers for structurebased alignment FATCAT VAST: Algorithm type: Geometric--intermolecular
53 Non-sequential alignment! 1alk vpt 4 3 SCALI non-sequential alignment Yuan & Bystroff, 2005
54 SCALI Non-sequential structure-based alignments can be used to identify similar motifs in the packing geometry of SSEs. For example, it can find two proteins that have 2/4/2 α/β/α 3-layer sandwich architecture, regardless of how the SSEs are connected. (1) Exhaustive gapless alignments (2) FASTA-style assembly of alignment from fragment pairs. (3) Alignment score based on sequence similarity, local structure similarity and contact map similarity. Algorithm type: Geometric--intramolecular
55 Exhaustive alignment, no gaps. protein structure B Each dot represents two positions that have the same local structure. Darker means similar in sequence. The SCALI alignment is constructed from fragment pairs (diagonal rows). protein structure A
56 FASTA-style search in alignment space alignments generation... SCALI uses a near-greedy algorithm. Each candidate parent alignment generates up to 50 children alignments. Children are selected based on similarity (sequence, local structure, contacts). Selected children become parents in the next generation. The program is done when there is nothing left to align.
57 Non-sequential alignment space alignment space vs β α β β β α Expressing a non-sequential alignment like a standard sequence alignment loses information. Using an alignment matrix is better, as in this example of two different ββα units.
Molecular Modeling lecture 17, Tue, Mar. 19. Rotation Least-squares Superposition Structure-based alignment algorithms
Molecular Modeling 2019 -- lecture 17, Tue, Mar. 19 Rotation Least-squares Superposition Structure-based alignment algorithms Matrices and vectors Matrix algebra allows you to express multiple equations
More informationMolecular Modeling Lecture 7. Homology modeling insertions/deletions manual realignment
Molecular Modeling 2018-- Lecture 7 Homology modeling insertions/deletions manual realignment Homology modeling also called comparative modeling Sequences that have similar sequence have similar structure.
More informationOrientational degeneracy in the presence of one alignment tensor.
Orientational degeneracy in the presence of one alignment tensor. Rotation about the x, y and z axes can be performed in the aligned mode of the program to examine the four degenerate orientations of two
More informationAlgorithms in Bioinformatics FOUR Pairwise Sequence Alignment. Pairwise Sequence Alignment. Convention: DNA Sequences 5. Sequence Alignment
Algorithms in Bioinformatics FOUR Sami Khuri Department of Computer Science San José State University Pairwise Sequence Alignment Homology Similarity Global string alignment Local string alignment Dot
More informationPymol Practial Guide
Pymol Practial Guide Pymol is a powerful visualizor very convenient to work with protein molecules. Its interface may seem complex at first, but you will see that with a little practice is simple and powerful
More informationProtein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche
Protein Structure Prediction II Lecturer: Serafim Batzoglou Scribe: Samy Hamdouche The molecular structure of a protein can be broken down hierarchically. The primary structure of a protein is simply its
More informationProtein structure analysis. Risto Laakso 10th January 2005
Protein structure analysis Risto Laakso risto.laakso@hut.fi 10th January 2005 1 1 Summary Various methods of protein structure analysis were examined. Two proteins, 1HLB (Sea cucumber hemoglobin) and 1HLM
More informationStructural Alignment of Proteins
Goal Align protein structures Structural Alignment of Proteins 1 2 3 4 5 6 7 8 9 10 11 12 13 14 PHE ASP ILE CYS ARG LEU PRO GLY SER ALA GLU ALA VAL CYS PHE ASN VAL CYS ARG THR PRO --- --- --- GLU ALA ILE
More informationGiri Narasimhan. CAP 5510: Introduction to Bioinformatics. ECS 254; Phone: x3748
CAP 5510: Introduction to Bioinformatics Giri Narasimhan ECS 254; Phone: x3748 giri@cis.fiu.edu www.cis.fiu.edu/~giri/teach/bioinfs07.html 2/15/07 CAP5510 1 EM Algorithm Goal: Find θ, Z that maximize Pr
More informationMolecular Modeling lecture 2
Molecular Modeling 2018 -- lecture 2 Topics 1. Secondary structure 3. Sequence similarity and homology 2. Secondary structure prediction 4. Where do protein structures come from? X-ray crystallography
More informationCMPS 6630: Introduction to Computational Biology and Bioinformatics. Structure Comparison
CMPS 6630: Introduction to Computational Biology and Bioinformatics Structure Comparison Protein Structure Comparison Motivation Understand sequence and structure variability Understand Domain architecture
More informationLinear Algebra & Geometry why is linear algebra useful in computer vision?
Linear Algebra & Geometry why is linear algebra useful in computer vision? References: -Any book on linear algebra! -[HZ] chapters 2, 4 Some of the slides in this lecture are courtesy to Prof. Octavia
More informationSequence Alignment: A General Overview. COMP Fall 2010 Luay Nakhleh, Rice University
Sequence Alignment: A General Overview COMP 571 - Fall 2010 Luay Nakhleh, Rice University Life through Evolution All living organisms are related to each other through evolution This means: any pair of
More informationLinear Algebra & Geometry why is linear algebra useful in computer vision?
Linear Algebra & Geometry why is linear algebra useful in computer vision? References: -Any book on linear algebra! -[HZ] chapters 2, 4 Some of the slides in this lecture are courtesy to Prof. Octavia
More information1. Protein Data Bank (PDB) 1. Protein Data Bank (PDB)
Protein structure databases; visualization; and classifications 1. Introduction to Protein Data Bank (PDB) 2. Free graphic software for 3D structure visualization 3. Hierarchical classification of protein
More informationWeek 10: Homology Modelling (II) - HHpred
Week 10: Homology Modelling (II) - HHpred Course: Tools for Structural Biology Fabian Glaser BKU - Technion 1 2 Identify and align related structures by sequence methods is not an easy task All comparative
More informationProtein Structure Overlap
Protein Structure Overlap Maximizing Protein Structural Alignment in 3D Space Protein Structure Overlap Motivation () As mentioned several times, we want to know more about protein function by assessing
More informationExamples of Protein Modeling. Protein Modeling. Primary Structure. Protein Structure Description. Protein Sequence Sources. Importing Sequences to MOE
Examples of Protein Modeling Protein Modeling Visualization Examination of an experimental structure to gain insight about a research question Dynamics To examine the dynamics of protein structures To
More informationBuilding small molecules
Building small molecules Use the Builder (right panel) to build up molecules. Start building clicking a fragment/atom in the builder and it will appear to the workspace. Continue modifying the molecule
More informationBasics of protein structure
Today: 1. Projects a. Requirements: i. Critical review of one paper ii. At least one computational result b. Noon, Dec. 3 rd written report and oral presentation are due; submit via email to bphys101@fas.harvard.edu
More informationDistance Between Ellipses in 2D
Distance Between Ellipses in 2D David Eberly, Geometric Tools, Redmond WA 98052 https://www.geometrictools.com/ This work is licensed under the Creative Commons Attribution 4.0 International License. To
More informationSequence analysis and comparison
The aim with sequence identification: Sequence analysis and comparison Marjolein Thunnissen Lund September 2012 Is there any known protein sequence that is homologous to mine? Are there any other species
More informationIntroduction to Comparative Protein Modeling. Chapter 4 Part I
Introduction to Comparative Protein Modeling Chapter 4 Part I 1 Information on Proteins Each modeling study depends on the quality of the known experimental data. Basis of the model Search in the literature
More informationHomologous proteins have similar structures and structural superposition means to rotate and translate the structures so that corresponding atoms are
1 Homologous proteins have similar structures and structural superposition means to rotate and translate the structures so that corresponding atoms are as close to each other as possible. Structural similarity
More informationAlgorithms in Bioinformatics
Algorithms in Bioinformatics Sami Khuri Department of omputer Science San José State University San José, alifornia, USA khuri@cs.sjsu.edu www.cs.sjsu.edu/faculty/khuri Pairwise Sequence Alignment Homology
More informationCMPS 3110: Bioinformatics. Tertiary Structure Prediction
CMPS 3110: Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the laws of physics! Conformation space is finite
More informationCMPS 6630: Introduction to Computational Biology and Bioinformatics. Tertiary Structure Prediction
CMPS 6630: Introduction to Computational Biology and Bioinformatics Tertiary Structure Prediction Tertiary Structure Prediction Why Should Tertiary Structure Prediction Be Possible? Molecules obey the
More informationCAP 5510 Lecture 3 Protein Structures
CAP 5510 Lecture 3 Protein Structures Su-Shing Chen Bioinformatics CISE 8/19/2005 Su-Shing Chen, CISE 1 Protein Conformation 8/19/2005 Su-Shing Chen, CISE 2 Protein Conformational Structures Hydrophobicity
More informationClassical Mechanics. Luis Anchordoqui
1 Rigid Body Motion Inertia Tensor Rotational Kinetic Energy Principal Axes of Rotation Steiner s Theorem Euler s Equations for a Rigid Body Eulerian Angles Review of Fundamental Equations 2 Rigid body
More informationMATH Topics in Applied Mathematics Lecture 12: Evaluation of determinants. Cross product.
MATH 311-504 Topics in Applied Mathematics Lecture 12: Evaluation of determinants. Cross product. Determinant is a scalar assigned to each square matrix. Notation. The determinant of a matrix A = (a ij
More informationSeeSAR 7.1 Beginners Guide. June 2017
SeeSAR 7.1 Beginners Guide June 2017 Part 1: Basics 1 Type a pdb code and press return or Load your own protein or already existing project, or Just load molecules To begin, let s type 2zff and download
More informationPrediction and refinement of NMR structures from sparse experimental data
Prediction and refinement of NMR structures from sparse experimental data Jeff Skolnick Director Center for the Study of Systems Biology School of Biology Georgia Institute of Technology Overview of talk
More information3. SEQUENCE ANALYSIS BIOINFORMATICS COURSE MTAT
3. SEQUENCE ANALYSIS BIOINFORMATICS COURSE MTAT.03.239 25.09.2012 SEQUENCE ANALYSIS IS IMPORTANT FOR... Prediction of function Gene finding the process of identifying the regions of genomic DNA that encode
More informationIntroduction and Vectors Lecture 1
1 Introduction Introduction and Vectors Lecture 1 This is a course on classical Electromagnetism. It is the foundation for more advanced courses in modern physics. All physics of the modern era, from quantum
More informationHongbing Zhang October 2018
Exact and Intuitive Geometry Explanation: Why Does a Half-angle-rotation in Spin or Quaternion Space Correspond to the Whole-angle-rotation in Normal 3D Space? Hongbing Zhang October 2018 Abstract Why
More informationSequence Analysis '17- lecture 8. Multiple sequence alignment
Sequence Analysis '17- lecture 8 Multiple sequence alignment Ex5 explanation How many random database search scores have e-values 10? (Answer: 10!) Why? e-value of x = m*p(s x), where m is the database
More informationThe Select Command and Boolean Operators
The Select Command and Boolean Operators Part of the Jmol Training Guide from the MSOE Center for BioMolecular Modeling Interactive version available at http://cbm.msoe.edu/teachingresources/jmol/jmoltraining/boolean.html
More informationGetting To Know Your Protein
Getting To Know Your Protein Comparative Protein Analysis: Part III. Protein Structure Prediction and Comparison Robert Latek, PhD Sr. Bioinformatics Scientist Whitehead Institute for Biomedical Research
More informationLinear Algebra March 16, 2019
Linear Algebra March 16, 2019 2 Contents 0.1 Notation................................ 4 1 Systems of linear equations, and matrices 5 1.1 Systems of linear equations..................... 5 1.2 Augmented
More informationProtein Bioinformatics Computer lab #1 Friday, April 11, 2008 Sean Prigge and Ingo Ruczinski
Protein Bioinformatics 260.655 Computer lab #1 Friday, April 11, 2008 Sean Prigge and Ingo Ruczinski Goals: Approx. Time [1] Use the Protein Data Bank PDB website. 10 minutes [2] Use the WebMol Viewer.
More informationCan protein model accuracy be. identified? NO! CBS, BioCentrum, Morten Nielsen, DTU
Can protein model accuracy be identified? Morten Nielsen, CBS, BioCentrum, DTU NO! Identification of Protein-model accuracy Why is it important? What is accuracy RMSD, fraction correct, Protein model correctness/quality
More informationSUPPLEMENTARY MATERIALS
SUPPLEMENTARY MATERIALS Enhanced Recognition of Transmembrane Protein Domains with Prediction-based Structural Profiles Baoqiang Cao, Aleksey Porollo, Rafal Adamczak, Mark Jarrell and Jaroslaw Meller Contact:
More informationENGI 9420 Lecture Notes 2 - Matrix Algebra Page Matrix operations can render the solution of a linear system much more efficient.
ENGI 940 Lecture Notes - Matrix Algebra Page.0. Matrix Algebra A linear system of m equations in n unknowns, a x + a x + + a x b (where the a ij and i n n a x + a x + + a x b n n a x + a x + + a x b m
More informationLinear Algebra Review. Fei-Fei Li
Linear Algebra Review Fei-Fei Li 1 / 51 Vectors Vectors and matrices are just collections of ordered numbers that represent something: movements in space, scaling factors, pixel brightnesses, etc. A vector
More informationSupplementary Figure 1. Aligned sequences of yeast IDH1 (top) and IDH2 (bottom) with isocitrate
SUPPLEMENTARY FIGURE LEGENDS Supplementary Figure 1. Aligned sequences of yeast IDH1 (top) and IDH2 (bottom) with isocitrate dehydrogenase from Escherichia coli [ICD, pdb 1PB1, Mesecar, A. D., and Koshland,
More informationOLLSCOIL NA heireann MA NUAD THE NATIONAL UNIVERSITY OF IRELAND MAYNOOTH MATHEMATICAL PHYSICS EE112. Engineering Mathematics II
OLLSCOIL N heirenn M NUD THE NTIONL UNIVERSITY OF IRELND MYNOOTH MTHEMTICL PHYSICS EE112 Engineering Mathematics II Prof. D. M. Heffernan and Mr. S. Pouryahya 1 5 Scalars and Vectors 5.1 The Scalar Quantities
More informationNature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1
Supplementary Figure 1 Cryo-EM structure and model of the C. thermophilum 90S preribosome. a, Gold standard FSC curve showing the average resolution of the 90S preribosome masked and unmasked (left). FSC
More informationPresentation Outline. Prediction of Protein Secondary Structure using Neural Networks at Better than 70% Accuracy
Prediction of Protein Secondary Structure using Neural Networks at Better than 70% Accuracy Burkhard Rost and Chris Sander By Kalyan C. Gopavarapu 1 Presentation Outline Major Terminology Problem Method
More information2 Dean C. Adams and Gavin J. P. Naylor the best three-dimensional ordination of the structure space is found through an eigen-decomposition (correspon
A Comparison of Methods for Assessing the Structural Similarity of Proteins Dean C. Adams and Gavin J. P. Naylor? Dept. Zoology and Genetics, Iowa State University, Ames, IA 50011, U.S.A. 1 Introduction
More informationComparing whole genomes
BioNumerics Tutorial: Comparing whole genomes 1 Aim The Chromosome Comparison window in BioNumerics has been designed for large-scale comparison of sequences of unlimited length. In this tutorial you will
More informationTransformations. Lars Vidar Magnusson. August 24,
Transformations Lars Vidar Magnusson August 24, 2012 http://www.it.hiof.no/~larsvmag/iti43309/index.html 2D Translation To translate an object is to move it in two-dimensinal space. If we have a point
More informationproteins Refinement by shifting secondary structure elements improves sequence alignments
proteins STRUCTURE O FUNCTION O BIOINFORMATICS Refinement by shifting secondary structure elements improves sequence alignments Jing Tong, 1,2 Jimin Pei, 3 Zbyszek Otwinowski, 1,2 and Nick V. Grishin 1,2,3
More informationThe Coefficient of Friction
The Coefficient of Friction OBJECTIVE To determine the coefficient of static friction between two pieces of wood. To determine the coefficient of kinetic friction between two pieces of wood. To investigate
More informationProtein Structure Prediction, Engineering & Design CHEM 430
Protein Structure Prediction, Engineering & Design CHEM 430 Eero Saarinen The free energy surface of a protein Protein Structure Prediction & Design Full Protein Structure from Sequence - High Alignment
More informationProtein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror
Protein structure prediction CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror 1 Outline Why predict protein structure? Can we use (pure) physics-based methods? Knowledge-based methods Two major
More informationPhysics 201 Lab 2 Air Drag Simulation
Physics 201 Lab 2 Air Drag Simulation Jan 28, 2013 Equipment Initial Set Up Type the data from Table 1 into the appropriate cells. By preceding the content of the cell with an equal sign (as in cell A6)
More informationPairwise & Multiple sequence alignments
Pairwise & Multiple sequence alignments Urmila Kulkarni-Kale Bioinformatics Centre 411 007 urmila@bioinfo.ernet.in Basis for Sequence comparison Theory of evolution: gene sequences have evolved/derived
More informationProtein Structure Comparison Methods
Protein Structure Comparison Methods D. Petrova Key Words: Protein structure comparison; models; comparison algorithms; similarity measure Abstract. Existing methods for protein structure comparison are
More informationProtein structure prediction. CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror
Protein structure prediction CS/CME/BioE/Biophys/BMI 279 Oct. 10 and 12, 2017 Ron Dror 1 Outline Why predict protein structure? Can we use (pure) physics-based methods? Knowledge-based methods Two major
More informationElementary Linear Algebra
Matrices J MUSCAT Elementary Linear Algebra Matrices Definition Dr J Muscat 2002 A matrix is a rectangular array of numbers, arranged in rows and columns a a 2 a 3 a n a 2 a 22 a 23 a 2n A = a m a mn We
More informationPart 4 The Select Command and Boolean Operators
Part 4 The Select Command and Boolean Operators http://cbm.msoe.edu/newwebsite/learntomodel Introduction By default, every command you enter into the Console affects the entire molecular structure. However,
More informationBio nformatics. Lecture 23. Saad Mneimneh
Bio nformatics Lecture 23 Protein folding The goal is to determine the three-dimensional structure of a protein based on its amino acid sequence Assumption: amino acid sequence completely and uniquely
More informationIntroduction to Vectors
Introduction to Vectors K. Behrend January 31, 008 Abstract An introduction to vectors in R and R 3. Lines and planes in R 3. Linear dependence. 1 Contents Introduction 3 1 Vectors 4 1.1 Plane vectors...............................
More informationSupporting Text 1. Comparison of GRoSS sequence alignment to HMM-HMM and GPCRDB
Structure-Based Sequence Alignment of the Transmembrane Domains of All Human GPCRs: Phylogenetic, Structural and Functional Implications, Cvicek et al. Supporting Text 1 Here we compare the GRoSS alignment
More informationSequence Analysis '17 -- lecture 7
Sequence Analysis '17 -- lecture 7 Significance E-values How significant is that? Please give me a number for......how likely the data would not have been the result of chance,......as opposed to......a
More informationA profile-based protein sequence alignment algorithm for a domain clustering database
A profile-based protein sequence alignment algorithm for a domain clustering database Lin Xu,2 Fa Zhang and Zhiyong Liu 3, Key Laboratory of Computer System and architecture, the Institute of Computing
More informationProtein folding. α-helix. Lecture 21. An α-helix is a simple helix having on average 10 residues (3 turns of the helix)
Computat onal Biology Lecture 21 Protein folding The goal is to determine the three-dimensional structure of a protein based on its amino acid sequence Assumption: amino acid sequence completely and uniquely
More informationDesign of a Novel Globular Protein Fold with Atomic-Level Accuracy
Design of a Novel Globular Protein Fold with Atomic-Level Accuracy Brian Kuhlman, Gautam Dantas, Gregory C. Ireton, Gabriele Varani, Barry L. Stoddard, David Baker Presented by Kate Stafford 4 May 05 Protein
More informationProtein Structure Prediction and Display
Protein Structure Prediction and Display Goal Take primary structure (sequence) and, using rules derived from known structures, predict the secondary structure that is most likely to be adopted by each
More informationMultiple Mapping Method: A Novel Approach to the Sequence-to-Structure Alignment Problem in Comparative Protein Structure Modeling
63:644 661 (2006) Multiple Mapping Method: A Novel Approach to the Sequence-to-Structure Alignment Problem in Comparative Protein Structure Modeling Brajesh K. Rai and András Fiser* Department of Biochemistry
More informationHomology Modeling. Roberto Lins EPFL - summer semester 2005
Homology Modeling Roberto Lins EPFL - summer semester 2005 Disclaimer: course material is mainly taken from: P.E. Bourne & H Weissig, Structural Bioinformatics; C.A. Orengo, D.T. Jones & J.M. Thornton,
More informationHomology Modeling (Comparative Structure Modeling) GBCB 5874: Problem Solving in GBCB
Homology Modeling (Comparative Structure Modeling) Aims of Structural Genomics High-throughput 3D structure determination and analysis To determine or predict the 3D structures of all the proteins encoded
More informationMultiple structure alignment with mstali
Multiple structure alignment with mstali Shealy and Valafar Shealy and Valafar BMC Bioinformatics 2012, 13:105 Shealy and Valafar BMC Bioinformatics 2012, 13:105 SOFTWARE Open Access Multiple structure
More informationProcheck output. Bond angles (Procheck) Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics.
Structure verification and validation Bond lengths (Procheck) Introduction to Bioinformatics Iosif Vaisman Email: ivaisman@gmu.edu ----------------------------------------------------------------- Bond
More informationLinear Algebra Section 2.6 : LU Decomposition Section 2.7 : Permutations and transposes Wednesday, February 13th Math 301 Week #4
Linear Algebra Section. : LU Decomposition Section. : Permutations and transposes Wednesday, February 1th Math 01 Week # 1 The LU Decomposition We learned last time that we can factor a invertible matrix
More informationMOL410/510 Problem Set 1 - Linear Algebra - Due Friday Sept. 30
MOL40/50 Problem Set - Linear Algebra - Due Friday Sept. 30 Use lab notes to help solve these problems. Problems marked MUST DO are required for full credit. For the remainder of the problems, do as many
More informationDetermining a Triangle
Determining a Triangle 1 Constraints What data do we need to determine a triangle? There are two basic facts that constrain the data: 1. The triangle inequality: The sum of the length of two sides is greater
More informationNMR, X-ray Diffraction, Protein Structure, and RasMol
NMR, X-ray Diffraction, Protein Structure, and RasMol Introduction So far we have been mostly concerned with the proteins themselves. The techniques (NMR or X-ray diffraction) used to determine a structure
More informationProtein Dynamics. The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron.
Protein Dynamics The space-filling structures of myoglobin and hemoglobin show that there are no pathways for O 2 to reach the heme iron. Below is myoglobin hydrated with 350 water molecules. Only a small
More informationEE263 homework 3 solutions
EE263 Prof. S. Boyd EE263 homework 3 solutions 2.17 Gradient of some common functions. Recall that the gradient of a differentiable function f : R n R, at a point x R n, is defined as the vector f(x) =
More informationREVIEW - Vectors. Vectors. Vector Algebra. Multiplication by a scalar
J. Peraire Dynamics 16.07 Fall 2004 Version 1.1 REVIEW - Vectors By using vectors and defining appropriate operations between them, physical laws can often be written in a simple form. Since we will making
More informationSupporting Information. The structure of dynamic, taxol-stabilized, and. GMPPCP-stabilized microtubules
Supporting Information The structure of dynamic, taxol-stabilized, and MPPCP-stabilized microtubules Avi insburg,,,, Asaf Shemesh,,, Abigail Millgram,, Raviv Dharan,, Yael Levi-Kalisman,, Israel Ringel,,
More information18.06 Problem Set 1 Solutions Due Thursday, 11 February 2010 at 4 pm in Total: 100 points
18.06 Problem Set 1 Solutions Due Thursday, 11 February 2010 at 4 pm in 2-106. Total: 100 points Section 1.2. Problem 23: The figure shows that cos(α) = v 1 / v and sin(α) = v 2 / v. Similarly cos(β) is
More informationRigid Geometric Transformations
Rigid Geometric Transformations Carlo Tomasi This note is a quick refresher of the geometry of rigid transformations in three-dimensional space, expressed in Cartesian coordinates. 1 Cartesian Coordinates
More informationQuantifying sequence similarity
Quantifying sequence similarity Bas E. Dutilh Systems Biology: Bioinformatic Data Analysis Utrecht University, February 16 th 2016 After this lecture, you can define homology, similarity, and identity
More informationElementary maths for GMT
Elementary maths for GMT Linear Algebra Part 2: Matrices, Elimination and Determinant m n matrices The system of m linear equations in n variables x 1, x 2,, x n a 11 x 1 + a 12 x 2 + + a 1n x n = b 1
More informationBioinformatics 2 -- lecture 6
Bioinformatics 2 -- lecture 6 Loop modeling Energy minimization Steps in homology modeling Identify a sequence of interest. Search database for homologs of known structure. Align homologs with each other
More informationBonds and Structural Supports
Bonds and Structural Supports Part of the Jmol Training Guide from the MSOE Center for BioMolecular Modeling Interactive version available at http://cbm.msoe.edu/teachingresources/jmol/jmoltraining/struts.html
More informationCh. 7.3, 7.4: Vectors and Complex Numbers
Ch. 7.3, 7.4: Vectors and Complex Numbers Johns Hopkins University Fall 2014 (Johns Hopkins University) Ch. 7.3, 7.4: Vectors and Complex Numbers Fall 2014 1 / 38 Vectors(1) Definition (Vector) A vector
More informationLab 2 Worksheet. Problems. Problem 1: Geometry and Linear Equations
Lab 2 Worksheet Problems Problem : Geometry and Linear Equations Linear algebra is, first and foremost, the study of systems of linear equations. You are going to encounter linear systems frequently in
More informationPart 7 Bonds and Structural Supports
Part 7 Bonds and Structural Supports http://cbm.msoe.edu/newwebsite/learntomodel Introduction In addition to covalent bonds between atoms in a molecule, Jmol has the ability to render Hydrogen Bonds and
More informationProtein Structure Determination. Part 1 -- X-ray Crystallography
Protein Structure Determination Part 1 -- X-ray Crystallography Topics covering in this 1/2 course Crystal growth Diffraction theory Symmetry Solving phases using heavy atoms Solving phases using a model
More informationExercise 5. Sequence Profiles & BLAST
Exercise 5 Sequence Profiles & BLAST 1 Substitution Matrix (BLOSUM62) Likelihood to substitute one amino acid with another Figure taken from https://en.wikipedia.org/wiki/blosum 2 Substitution Matrix (BLOSUM62)
More informationTHEORY. Based on sequence Length According to the length of sequence being compared it is of following two types
Exp 11- THEORY Sequence Alignment is a process of aligning two sequences to achieve maximum levels of identity between them. This help to derive functional, structural and evolutionary relationships between
More informationAPPLICATIONS The eigenvalues are λ = 5, 5. An orthonormal basis of eigenvectors consists of
CHAPTER III APPLICATIONS The eigenvalues are λ =, An orthonormal basis of eigenvectors consists of, The eigenvalues are λ =, A basis of eigenvectors consists of, 4 which are not perpendicular However,
More informationBioinformatics (GLOBEX, Summer 2015) Pairwise sequence alignment
Bioinformatics (GLOBEX, Summer 2015) Pairwise sequence alignment Substitution score matrices, PAM, BLOSUM Needleman-Wunsch algorithm (Global) Smith-Waterman algorithm (Local) BLAST (local, heuristic) E-value
More information1. Matrix multiplication and Pauli Matrices: Pauli matrices are the 2 2 matrices. 1 0 i 0. 0 i
Problems in basic linear algebra Science Academies Lecture Workshop at PSGRK College Coimbatore, June 22-24, 2016 Govind S. Krishnaswami, Chennai Mathematical Institute http://www.cmi.ac.in/~govind/teaching,
More informationProtein Structure: Data Bases and Classification Ingo Ruczinski
Protein Structure: Data Bases and Classification Ingo Ruczinski Department of Biostatistics, Johns Hopkins University Reference Bourne and Weissig Structural Bioinformatics Wiley, 2003 More References
More informationSection III - Designing Models for 3D Printing
Section III - Designing Models for 3D Printing In this section of the Jmol Training Guide, you will become familiar with the commands needed to design a model that will be built on a 3D Printer. As you
More informationOnline Protein Structure Analysis with the Bio3D WebApp
Online Protein Structure Analysis with the Bio3D WebApp Lars Skjærven, Shashank Jariwala & Barry J. Grant August 13, 2015 (updated November 17, 2016) Bio3D1 is an established R package for structural bioinformatics
More information