
Protein Fragment Search Program ver 1.1.1

Developed by:
BioPhysics Laboratory, Faculty of Life and Environmental Science, Shimane University
1060 Nishikawatsu-cho, Matsue-shi, Shimane, 690-8504, Japan
E-mail: aoyagi@life.shimane-u.ac.jp
Web: http://bioinfoenv.shimane-u.ac.jp/aoyagi/english.htm

July 2010

Copyright (C) 2010 BioPhysics Laboratory, Faculty of Life and Environmental Science, Shimane University. All rights reserved.
Please let us know when you modify this program and when you redistribute the original program or a modified one.

Overview:
This program searches for combinations of amino acid residues in an amino acid sequence that could produce a secondary ion with a given m/z value. It is written in Ruby, an open source programming language (http://www.ruby-lang.org/en/).

When you open the "protein analysis.zip" file, a new folder "protein_analysis" is created. The folder "protein_analysis" contains the "config" folder, the "programs" folder, "protein_analysis.bat" and "USER_GUIDE.pdf". USER_GUIDE.pdf is written in Japanese; all of the information in it is also given in this document.

The ZIP archive "protein analysis.zip" contains the following files:
  protein analysis.bat: The program boot file
  config: Information on atomic weights and structures of amino acid residues
  programs: Program files

Contents:
  Uninstalling
  How to use
  Results
  Example
  Advanced usages
  The basic concept of the protein fragment search method
  References

Uninstalling:
Please delete the "protein_analysis" folder.

How to use:
When you double-click the "protein analysis.bat" file, the program window opens.
1. Enter the m/z value in the "m/z" cell, to two decimal places. ex) 70.06
2. Push the "Search" button to start the search for combinations of amino acid residues.
3. The "Results" and "Details" windows show the results. "Results" shows the combinations of amino acid residues. "Details" shows the results for every possible principal chain (backbone) structure.
4. Push "Save as" to save the results as a txt file.

5. Push the "Clear" button to delete the previous results in the "Results" and "Details" windows.
6. To finish the program, push the "Exit" button.

* This program does not consider some of the fragments that arise from a single amino acid. Table 1 shows fragment ions related to each amino acid reported in the following references.
  D. S. Mantus, B. D. Ratner, B. A. Carlson, and J. F. Moulder, Anal. Chem. 65, 1431 (1993)
  J.-B. Lhoest, E. Detrait, P. van den Bosch de Aguilar, and P. Bertrand, J. Biomed. Mater. Res. 41, 95 (1998)

Table 1: Reported fragment ions related to the amino acids

Results:
The results are written using the following abbreviated expressions.
  [Y, W]
    A secondary ion can be generated from neighboring Y and W residues.
  [(Y, W, T)]
    A secondary ion can be generated from a single Y, W or T residue.
  [G, (A, E)]
    A secondary ion can be generated from neighboring "G and A" or "G and E".
  [G, (A, E), (S, T)]
    A secondary ion can be generated from neighboring "G, A and S", "G, A and T", "G, E and S" or "G, E and T".
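The bracket notation above can be expanded mechanically. The following is a minimal Ruby sketch, not part of the distributed program: the method name expand_result is hypothetical, and it assumes each slot is either a single one-letter residue code or a parenthesized, comma-separated group.

```ruby
# Expand a result expression such as "[G, (A, E), (S, T)]" into the
# explicit residue combinations it stands for.
def expand_result(expr)
  inner = expr.strip.sub(/\A\[/, "").sub(/\]\z/, "")
  # Each slot is either a parenthesized group "(A, E)" or a single letter "G".
  slots = inner.scan(/\(([^)]*)\)|([A-Z])/).map do |group, single|
    group ? group.split(",").map(&:strip) : [single]
  end
  return [] if slots.empty?

  # Cartesian product: one residue is picked from every slot.
  first, *rest = slots
  first.product(*rest)
end

expand_result("[G, (A, E), (S, T)]").each { |combo| puts combo.join(", ") }
```

Each returned array is one set of neighboring residues, so [G, (A, E), (S, T)] yields the four combinations listed above.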

Example: m/z 126.06

The comment on [Results]:

C6H8NO2 (126.055)
  => The possible formula of the secondary ion at m/z 126.06 is C6H8NO2.

1 residue
  => When the secondary ion is generated from 1 residue:

[(V,L,I,P,K,R)]
  => The secondary ion can be generated from a V, L, I, P, K or R residue.

2 residues
  => When the secondary ion is generated from 2 residues:

[G,(V,L,I,M,P,T,Q,K,R,E)]
  => The secondary ion can be generated from two neighboring residues: G and V, G and L, G and I, G and M, G and P, G and T, G and Q, G and K, G and R, or G and E.

[(A,V,L,I,M,P,F,W,S,T,N,Q,Y,C,K,R,H,D,E),(A,V,L,I,M,P,F,W,S,T,N,Q,Y,C,K,R,H,D,E)]
  => The secondary ion can be generated from two neighboring residues, each of which is one of the A, V, L, I, M, P, F, W, S, T, N, Q, Y, C, K, R, H, D and E residues.

The comment on [Details]:

Chemical formula: C6H8NO2 (126.055)

Formula of backbone: C[3]N[1]O[2]
  => The part of the secondary ion related to the backbone of the polypeptide is composed of C3HxNO2.

Residue and fragment from side-chain: [(V,L,I,P,K,R)],C3:
  => The part of the secondary ion related to the side chain of the polypeptide is composed of C3Hx, which can be generated from the side chain of a V, L, I, P, K or R residue.
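The value 126.055 shown for C6H8NO2 can be checked by hand. The sketch below is an illustration only, not the program's own code: the weights for C and H are the ones used as examples in this guide, while the values for N, O and S are standard monoisotopic masses assumed here, and the method names are hypothetical. The electron mass is ignored.

```ruby
# Atomic weights: C and H as in this guide's example; N, O, S are assumed
# standard monoisotopic values, rounded to four decimals.
ATOMIC_WEIGHT = {
  "C" => 12.0, "H" => 1.0078, "N" => 14.0031, "O" => 15.9949, "S" => 31.9721
}.freeze

# Turn "C6H8NO2" into {"C"=>6, "H"=>8, "N"=>1, "O"=>2}.
def parse_formula(formula)
  counts = Hash.new(0)
  formula.scan(/([A-Z][a-z]?)(\d*)/) do |symbol, n|
    counts[symbol] += n.empty? ? 1 : n.to_i
  end
  counts
end

# Sum of atomic weights over all atoms in the formula.
def formula_mass(formula)
  parse_formula(formula).sum { |symbol, n| ATOMIC_WEIGHT.fetch(symbol) * n }
end

puts format("%.3f", formula_mass("C6H8NO2")) # prints 126.055
```

Written out, the sum is 6 x 12 + 8 x 1.0078 + 14.0031 + 2 x 15.9949 = 126.0553, which agrees with the entered m/z of 126.06 at two decimal places.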

Advanced usages:

Addition of elements and modification of atomic weight:
The elements used by the program are defined in the "atom weight.yml" file in the "config" folder. Each row defines one element in the following form:
  <Symbol of element><colon><space><atomic weight>
ex)
C: 12
H: 1.0078
Please do not add a space at the beginning of a row. Please back up the original file before you modify this file.

Addition or modification of principal chain structures of amino acid residues:
The possible structures of principal chains of amino acid residues, grouped by the number of residues, are described in the following files in the "config" folder:
  principle chains.yml
  principle chains with proline.yml
"principle chains.yml" contains the information for all amino acids. "principle chains with proline.yml" contains the structures of principal chains of amino acids when more than one proline residue is included. Both files are read when the program is started. If they are edited incorrectly, the program will not run properly.

Each file lists the possible fragment formulas from principal chains. Up to five neighboring amino acid residues are considered in this program. Fragments from principal chains (backbone) and those from side chains are considered separately. For the number of principal chains composing a secondary ion, every fragment of each side chain is checked, based on its mass value, to see whether it can be part of the secondary ion. The possible fragment formulas are listed in order of the number of residues, starting from the smallest number of amino acid residues that the secondary ion can contain.

The "principle chains.yml" file contains the following list. The first group (C3N2O2, C3NO2, C2N2O, C2NO, C2O, CN, C) contains the possible fragments from the principal chain of one amino acid residue, and the second group (C5N3O3, C5N2O3, C4N3O2, C4N2O2, C4NO2, C3N2O, C3NO) contains the possible fragments from the principal chains of two neighboring amino acid residues.

- - C3N2O2
  - C3NO2
  - C2N2O
  - C2NO
  - C2O
  - CN
  - C
- - C5N3O3
  - C5N2O3
  - C4N3O2
  - C4N2O2
  - C4NO2
  - C3N2O
  - C3NO
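To show how the two kinds of config entries described above are structured, here is a minimal Ruby sketch using the standard YAML library. It is an illustration only and is not taken from the program: the YAML text is embedded as strings instead of being read from the "config" folder, and the constant names are hypothetical.

```ruby
require "yaml"

# One "<symbol>: <atomic weight>" mapping per row, as in the example above.
ATOM_WEIGHT_YAML = <<~YML
  C: 12
  H: 1.0078
YML

# A list of lists: the outer index is (number of residues - 1).
PRINCIPAL_CHAINS_YAML = <<~YML
  - - C3N2O2
    - C3NO2
    - C2N2O
    - C2NO
    - C2O
    - CN
    - C
  - - C5N3O3
    - C5N2O3
    - C4N3O2
    - C4N2O2
    - C4NO2
    - C3N2O
    - C3NO
YML

WEIGHTS = YAML.safe_load(ATOM_WEIGHT_YAML)      # Hash: symbol => weight
CHAINS  = YAML.safe_load(PRINCIPAL_CHAINS_YAML) # Array of Arrays of Strings

CHAINS.each_with_index do |fragments, index|
  puts "#{index + 1} residue(s): #{fragments.join(', ')}"
end
```

Parsing an edited file this way before starting the program is one possible way to catch indentation mistakes, since a malformed list will either fail to parse or produce a differently shaped structure.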

The basic concept of the protein fragment search method:

Fragment ion peaks from neighboring amino acids are identified by searching every combination of amino acids under the following assumptions:
1) The numbers of carbon, oxygen, nitrogen and sulfur atoms are considered.
2) The number of hydrogen atoms is adjusted (hydrogen addition and hydrogen desorption are considered flexibly).
3) Double bonds do not cleave.
4) Recombination and rearrangement of ions are neglected.

Possible fragments from every part of a protein molecule are derived from the structures of the 20 amino acid residues composing a protein. Table 2 shows the fragments from the side chains of the residues, and Table 3 shows the fragments from the principal chain or principal chains (backbone). If a fragment ion is assumed to be generated from a part containing two neighboring residues, its formula is a combination of one side-chain fragment from each of the two residues in Table 2 and a fragment of the two principal chains in Table 3. In addition, when the part that generates the fragment ion contains a proline residue, the possible fragments in Table 4 should be considered.

Table 2 Possible fragments generated from side chains of amino acid residues

Amino acid   Formula of side chain   Possible fragments from side chain (except for hydrogen)
Gly  G       H
Ala  A       CH3                     C
Val  V       C3H7                    C3   C2   C
Leu  L       C4H9                    C4   C3   C2   C
Ile  I       C4H9                    C4   C3   C2   C
Met  M       C3H7S                   C3S  C2S  C2   C
Pro  P       C3H6                    C3   C2   C
Phe  F       C7H8                    C7   C
Trp  W       C9H7N                   C9N  C
Ser  S       CH3O                    CO   C
Thr  T       C2H5O                   C2O  C2   CO   C
Asn  N       C2H4NO                  C2NO C2O  C
Gln  Q       C3H6NO                  C3NO C3O  C2   C
Tyr  Y       C7H8O                   C7O  C7   C
Cys  C       CH3S                    CS   C
Lys  K       C4H11N                  C4N  C4   C3   C2   C
Arg  R       C4H11N3                 C4N3 C4N2 C3N  C3   C2   C
His  H       C4H5N2                  C4N2 C
Asp  D       C2H2O2                  C2O2 C2O  C
Glu  E       C3H4O2                  C3O2 C3O  C2   C

Table 3 Possible fragments generated from principal chains (backbone) of amino acid residues

Number of residues   Formula of backbone   Possible fragments from backbone (except for hydrogen)
1                    C2H2NO                C3N2O2   C3NO2    C2N2O    C2NO     C2O      CN      C
2                    C4H4N2O2              C5N3O3   C5N2O3   C4N3O2   C4N2O2   C4NO2    C3N2O   C3NO
3                    C6H6N3O3              C7N4O4   C7N3O4   C6N4O3   C6N3O3   C6N2O3   C5N3O2  C5N2O2
4                    C8H8N4O4              C9N5O5   C9N4O5   C8N5O4   C8N4O4   C8N3O4   C7N4O3  C7N3O3
5                    C10H10N5O5            C11N6O6  C11N5O6  C10N6O5  C10N5O5  C10N4O5  C9N5O4  C9N4O4

Table 4 Possible fragments generated from principal chains containing proline

Number of residues   Possible fragments from principal chain (except for hydrogen)
1                    N
2                    C2N2O    C2NO
3                    C4N3O2   C4N2O2   C5N3O3
4                    C6N4O3   C6N3O3   C7N4O4
5                    C8N5O4   C8N4O4   C9N5O5
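The way the assumptions and Tables 2 and 3 combine can be sketched for the one-residue case. This is a hedged Ruby illustration, not the program's actual implementation: the method names, the tolerance of 0.01, the atomic weights for N, O and S, and the rule of choosing the hydrogen count by rounding are all assumptions made for this example. No check of chemical plausibility is made on the hydrogen count.

```ruby
# C and H weights follow this guide's example; N, O, S are assumed values.
ATOMIC_WEIGHT = { "C" => 12.0, "H" => 1.0078, "N" => 14.0031,
                  "O" => 15.9949, "S" => 31.9721 }.freeze

# Table 3, one residue: backbone fragments without hydrogen.
BACKBONE_1 = %w[C3N2O2 C3NO2 C2N2O C2NO C2O CN C].freeze

# Table 2: side-chain fragments without hydrogen ("" is the empty fragment of Gly).
SIDE_CHAIN = {
  "G" => [""], "A" => %w[C], "V" => %w[C3 C2 C], "L" => %w[C4 C3 C2 C],
  "I" => %w[C4 C3 C2 C], "M" => %w[C3S C2S C2 C], "P" => %w[C3 C2 C],
  "F" => %w[C7 C], "W" => %w[C9N C], "S" => %w[CO C], "T" => %w[C2O C2 CO C],
  "N" => %w[C2NO C2O C], "Q" => %w[C3NO C3O C2 C], "Y" => %w[C7O C7 C],
  "C" => %w[CS C], "K" => %w[C4N C4 C3 C2 C], "R" => %w[C4N3 C4N2 C3N C3 C2 C],
  "H" => %w[C4N2 C], "D" => %w[C2O2 C2O C], "E" => %w[C3O2 C3O C2 C]
}.freeze

# Count atoms; repeated symbols accumulate, so "C3NO2C3" gives C => 6.
def parse_formula(formula)
  counts = Hash.new(0)
  formula.scan(/([A-Z][a-z]?)(\d*)/) { |sym, n| counts[sym] += n.empty? ? 1 : n.to_i }
  counts
end

# Write counts in the order C, H, N, O, S, omitting a count of 1.
def format_formula(counts, hydrogens)
  all = counts.merge("H" => hydrogens)
  %w[C H N O S].map { |s| all[s].to_i.zero? ? "" : "#{s}#{all[s] == 1 ? '' : all[s]}" }.join
end

# Returns { [formula, backbone fragment, side-chain fragment] => [residues] }.
def search_one_residue(mz, tolerance = 0.01)
  hits = Hash.new { |hash, key| hash[key] = [] }
  BACKBONE_1.each do |backbone|
    SIDE_CHAIN.each do |residue, fragments|
      fragments.each do |side|
        counts = parse_formula(backbone + side)
        heavy = counts.sum { |sym, n| ATOMIC_WEIGHT.fetch(sym) * n }
        # Assumption 2: fill the remaining mass with hydrogen atoms.
        hydrogens = ((mz - heavy) / ATOMIC_WEIGHT["H"]).round
        next if hydrogens.negative?
        next unless (heavy + hydrogens * ATOMIC_WEIGHT["H"] - mz).abs <= tolerance
        hits[[format_formula(counts, hydrogens), backbone, side]] << residue
      end
    end
  end
  hits
end

search_one_residue(126.06).each do |(formula, backbone, side), residues|
  puts "#{formula}: backbone #{backbone}, side chain #{side}, residues #{residues.join(',')}"
end
```

For m/z 126.06 this sketch recovers the assignment from the Example section: C6H8NO2, built from the backbone fragment C3NO2 and the side-chain fragment C3 of V, L, I, P, K or R. Extending it to two or more residues would mean using the matching row of Table 3 and one side-chain fragment per residue.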

Fragments and possible side chains which can generate each fragment:

Fragment   Side chains
Zero       G
C          A V L I M F W S T N Q Y C K R H D E
C2         V L I M T N Q K R E
C3         V L I (P) K R
C4         L I K
C7         F Y
CO         S T
C2O        T N D
C2O2       D
C2NO       N
C3O        Q E
C3O2       E
C3NO       Q
C3N        R
CS         C
C2S        M
C3S        M
C4N        K
C4N2       R H
C4N3       R
C7O        Y
C9N        W

References:
More information on the Protein Fragment Search Program is available on the following web site.
http://bioinfoenv.shimane-u.ac.jp/aoyagi/english.htm

Related articles:
  S. Aoyagi, Surface and Interface Analysis, 41(2), 136-142 (2009)
  S. Aoyagi, M. Higuchi, N. Kato, and M. Kudo, e-J. Surf. Sci. Nanotech., 7, 715-720 (2009)

If you have any questions on this program, please contact us.
e-mail: aoyagi@life.shimane-u.ac.jp