CycloBranch. Tutorials

Similar documents
DISCLAIMER. Some General Comments on this Workbook. How to Use This Workbook. Peptide-MS-Calc.xls 1

Other Methods for Generating Ions 1. MALDI matrix assisted laser desorption ionization MS 2. Spray ionization techniques 3. Fast atom bombardment 4.

MassHunter TOF/QTOF Users Meeting

MassHunter Software Overview

NMR Assignments using NMRView II: Sequential Assignments

Proteins: Characteristics and Properties of Amino Acids

Amino Acid Side Chain Induced Selectivity in the Hydrolysis of Peptides Catalyzed by a Zr(IV)-Substituted Wells-Dawson Type Polyoxometalate

Lecture 15: Realities of Genome Assembly Protein Sequencing

Collision Cross Section: Ideal elastic hard sphere collision:

Viewing and Analyzing Proteins, Ligands and their Complexes 2

A Plausible Model Correlates Prebiotic Peptide Synthesis with. Primordial Genetic Code

Tutorial 1: Setting up your Skyline document

Agilent MassHunter Quantitative Data Analysis

Supplemental Materials for. Structural Diversity of Protein Segments Follows a Power-law Distribution

Tutorial 2: Analysis of DIA data in Skyline

Nature Methods: doi: /nmeth Supplementary Figure 1. Fragment indexing allows efficient spectra similarity comparisons.

C H E M I S T R Y N A T I O N A L Q U A L I F Y I N G E X A M I N A T I O N SOLUTIONS GUIDE

Physiochemical Properties of Residues

via Tandem Mass Spectrometry and Propositional Satisfiability De Novo Peptide Sequencing Renato Bruni University of Perugia

Making Sense of Differences in LCMS Data: Integrated Tools

Molecular Modeling Lecture 7. Homology modeling insertions/deletions manual realignment

Computational Methods For Identification Of Cyclic Peptides Using Mass Spectrometry. Julio Ng Bioinformatics Program, UCSD March, 26 th 2010

SRM assay generation and data analysis in Skyline

profileanalysis Innovation with Integrity Quickly pinpointing and identifying potential biomarkers in Proteomics and Metabolomics research

Information Dependent Acquisition (IDA) 1

OECD QSAR Toolbox v.3.0

Clustering and Model Integration under the Wasserstein Metric. Jia Li Department of Statistics Penn State University

Protein Fragment Search Program ver Overview: Contents:

TUTORIAL EXERCISES WITH ANSWERS

Welcome! Course 7: Concepts for LC-MS

Supplementary Information. Broad Spectrum Anti-Influenza Agents by Inhibiting Self- Association of Matrix Protein 1

Last updated: Copyright

A. Two of the common amino acids are analyzed. Amino acid X and amino acid Y both have an isoionic point in the range of

Peptide Syntheses. Illustrative Protection: BOC/ t Bu. A. Introduction. do not acid

Amino acids: Optimization in underivatized formula and fragment patterns in LC/ESI-MS analysis. Yoshinori Takano (JAMSTEC)

Oxygen Binding in Hemocyanin

MASS SPECTROMETRY. Topics

OECD QSAR Toolbox v.4.1. Tutorial illustrating new options for grouping with metabolism

OECD QSAR Toolbox v.3.3. Step-by-step example of how to categorize an inventory by mechanistic behaviour of the chemicals which it consists

Die Nadel im Heuhaufen

Amino Acids and Peptides

CHEMISTRY ATAR COURSE DATA BOOKLET

Analyzing Compounds of Environmental Interest Using an LC/Q-TOF Part 1: Dyes and Pigments. Application. Introduction. Authors. Abstract.

TOMAHAQ Method Construction

SEQUENCE ALIGNMENT BACKGROUND: BIOINFORMATICS. Prokaryotes and Eukaryotes. DNA and RNA

Chemistry 224 Bioorganic Chemistry Friday, Sept. 29, This Exam is closed book and closed notes. Please show all your work!

Desorption/Ionization Efficiency of Common Amino Acids in. Surface-assisted Laser Desorption/ionization Mass Spectrometry

Conformational Analysis

Cerno Application Note Extending the Limits of Mass Spectrometry

Peptides And Proteins

Ionization Methods in Mass Spectrometry at the SCS Mass Spectrometry Laboratory

Chemistry Chapter 22

Sensitive NMR Approach for Determining the Binding Mode of Tightly Binding Ligand Molecules to Protein Targets

CHEM J-9 June 2014

All Ions MS/MS: Targeted Screening and Quantitation Using Agilent TOF and Q-TOF LC/MS Systems

Supporting information

Agilent MassHunter Quantitative Data Analysis

Electronic Supplementary Information

Packing of Secondary Structures

Efficient Perception of Proteins and Nucleic Acids from Atomic Connectivity. Roger Sayle, Ph.D. NextMove Software, Cambridge, UK

X!TandemPipeline (Myosine Anabolisée) validating, filtering and grouping MSMS identifications

NMR study of complexes between low molecular mass inhibitors and the West Nile virus NS2B-NS3 protease

Solutions In each case, the chirality center has the R configuration

MS/MS of Peptides Manual Sequencing of Protonated Peptides

Protein Identification Using Tandem Mass Spectrometry. Nathan Edwards Informatics Research Applied Biosystems

A Logic-Based Approach to Polymer Sequence Analysis

Creating a Pharmacophore Query from a Reference Molecule & Scaffold Hopping in CSD-CrossMiner

Diastereomeric resolution directed towards chirality. determination focussing on gas-phase energetics of coordinated. sodium dissociation

The Pitfalls of Peaklist Generation Software Performance on Database Searches

----- Ver October 24, 2014 Bug about reading MOPAC2012 Ver.14 calculations of 1 atom and 2 atoms molecule was fixed.

Agilent TOF Screening & Impurity Profiling Julie Cichelli, PhD LC/MS Small Molecule Workshop Dec 6, 2012

Ligand Scout Tutorials

Structure and evolution of the spliceosomal peptidyl-prolyl cistrans isomerase Cwc27

Agilent All Ions MS/MS

Chapter 4: Amino Acids

Any protein that can be labelled by both procedures must be a transmembrane protein.

SPECTRA LIBRARY ASSISTED DE NOVO PEPTIDE SEQUENCING FOR HCD AND ETD SPECTRA PAIRS

Skyline Small Molecule Targets

Sequential resonance assignments in (small) proteins: homonuclear method 2º structure determination

OECD QSAR Toolbox v.4.1. Implementation AOP workflow in Toolbox: Skin Sensitization

Similarity or Identity? When are molecules similar?

Ramachandran Plot. 4ysz Phi (degrees) Plot statistics

Comparing whole genomes

Exam III. Please read through each question carefully, and make sure you provide all of the requested information.

What makes a good graphene-binding peptide? Adsorption of amino acids and peptides at aqueous graphene interfaces: Electronic Supplementary

Resonance assignments in proteins. Christina Redfield

7.012 Problem Set 1. i) What are two main differences between prokaryotic cells and eukaryotic cells?

April, The energy functions include:

Translation. A ribosome, mrna, and trna.

Supplementary Figure 3 a. Structural comparison between the two determined structures for the IL 23:MA12 complex. The overall RMSD between the two

Become a Microprobe Power User Part 2: Qualitative & Quantitative Analysis

Supplementary Information Intrinsic Localized Modes in Proteins

X-ray crystallography NMR Cryoelectron microscopy

Model Mélange. Physical Models of Peptides and Proteins

Improved 6- Plex TMT Quantification Throughput Using a Linear Ion Trap HCD MS 3 Scan Jane M. Liu, 1,2 * Michael J. Sweredoski, 2 Sonja Hess 2 *

CHMI 2227 EL. Biochemistry I. Test January Prof : Eric R. Gauthier, Ph.D.

Supporting Information

Exam I Answer Key: Summer 2006, Semester C

Supplementary figure 1. Comparison of unbound ogm-csf and ogm-csf as captured in the GIF:GM-CSF complex. Alignment of two copies of unbound ovine

Properties of amino acids in proteins

Transcription:

CycloBranch Tutorials

Outline Tutorial 1: Does a peptide have a cycle? Tutorial 2: How to determine a tag? Tutorial 3: How to determine a complete sequence? Tutorial 4: How to determine a branched sequence? Tutorial 5: How to determine terminal modifications? Tutorial 6: How to compare a theoretical spectrum with a peaklist? Tutorial 7: How to search in a database of MS/MS spectra? Tutorial 8: How to search MS spectrum against a database of compounds? Tutorial 9: Draw Peptide Tool Tutorial 10: Identification of Branch-Cyclic Peptide with a Long-sized Branch Tutorial 11: Identification of Peptide Family using Similarity Search Tutorial 12: Identification of Siderophores in MS/MS data Tutorial 13: Identification of Siderophores in MALDI-MS and LC-MS data Tutorial 14: Identification of Siderophores in Imaging Mass Spectra

Tutorial 1: Does a peptide have a cycle? Are y-ions observed in linear mode? No. Are multiple overlapping series of b-ions observed in cyclic mode? Yes. Are scrambled ions observed in cyclic mode? Yes. The peptide likely contains a cycle.

Example 1a

Example 1a

Example 1b

Example 1b

Example 1b

Example 1b

Tutorial 2: How to determine a tag? Detection of sequence tags must be enabled. Proper scoring function must be chosen. (e.g., number of matched peaks, sum of relative intensities of matched peaks, number of b-ions or number of b-ions + dehydrated b-ions)

Tutorial 2: How to determine a tag? A threshold of minimum relative intensity of peaks must be set high. 3 5 % Maximum number of combined building blocks must be small. 1/1/1

Example 2

Example 2 Correct tag of Cyclosporin A [NMe-Leu]-[Ala]-[Ala]- [NMe-Leu]-[NMe-Leu]-[NMe-Val] was reported as a top hit.

Tutorial 3: How to determine a complete sequence? Mass-to-charge error tolerances must be small. 1 2 ppm Detection of sequence tags should be disabled.

Tutorial 3: How to determine a complete sequence? Threshold of minimum relative intensity should be small. Maximum numbers of combined building blocks must be increased. proper values must be determined empirically from the gaps between peaks

Tutorial 3: How to determine a complete sequence? Sequence tag(s) should be used. sequences without the tag are skipped sequences without the tag are processed and sequences with the tag are yellow

Tutorial 3: How to determine a complete sequence? Examples of sequence tags: [Val] one-block tag [Val]-[Leu] two-block tag, etc. ([Val]-[Leu]-*){2} [Val]-[Leu]-[Val]-[Leu] ([Val] [Leu]) [Val] or [Leu] [Pro]-[Ile]-[Ile]\([Orn]-[N-Ac-Ile]\)[Phe] branching at [Orn], branch is formed from [N-Ac-Ile] ECMAScript syntax is supported (except [ and ] ) http://www.cplusplus.com/reference/regex/ecmascript/

Example 3

Example 3 Correct sequence of Cyclosporin A [NMe-Bmt]- [Abu]-[NMe-Gly]-[NMe-Leu]-[Val]-[NMe-Leu]-[Ala]- [Ala]-[NMe-Leu]-[NMe-Leu]-[NMe-Val] was reported as a top hit.

Tutorial 4: How to determine a branched sequence? Look for sequence tags in linear mode. If possible, look for a building block which may cause branching (lysine, ornithine, etc.).

Tutorial 4: How to determine a branched sequence? Switch into the branched mode. Use a sequence tag: [Asp]-[Gln] sequence of blocks [Ser].*[Asp]-[Gln].* stands for unknown sequence \([Lys] branching at [Lys] \([Lys].*\)[Asp]-[Gln] partially known branched sequence See also Tutorials 2 and 3.

Example 4a

Example 4a Correct tag [Asp]-[Gln] was reported as a top hit. Correct tag [Ser] was also reported.

Example 4b

Example 4b

Example 4b

Example 4b Correct sequence Acetyl [Glu]-[Ser]-[Leu]([Lys]- [Asn]-[Phe]-[Ile])[Asp]-[Gln]-[Tyr]-[Gly] Amidated was reported among 24 candidates with equal score (23 matched peaks). All 24 candidates have the same mask Acetyl [Glu]- [xxx]-[xxx]([lys]-[xxx]-[xxx]-[xxx])[asp]-[gln]-[tyr]- [Gly] Amidated branching at [Lys] It is not easy to predict the numbers of combined blocks 2/4/1. Acetyl, Amidated must be listed in a list of modifications.

Tutorial 5: How to determine terminal modifications? Open a file with modifications in Tools -> Modifications Editor. Remove unlikely modifications. Configure settings and click Search -> Run in the main menu. Detected modifications are shown in an output report.

Example 5

Example 5

Example 5

Tutorial 6: How to compare a theoretical spectrum with a peaklist? Select mode Compare Peaklist with Spectrum of Searched Sequence. Select a peptide type, peaklist file and database of building blocks.

Tutorial 6: How to compare a theoretical spectrum with a peaklist? Define N-/C-terminal Modifications File. Define peptide sequence. Click Search -> Run in the main menu.

Example 6

Example 6

Tutorial 7: How to search in a database of MS/MS spectra? Select mode Compare Peaklist with Database MS/MS data. Select a peptide type, peaklist file and database of building blocks.

Tutorial 7: How to search in a database of MS/MS spectra? Define N-/C-terminal Modifications File. Define database of peptide sequences. Click Search -> Run in the main menu.

Example 7

Example 7 Database of sequences.

Output. Example 7

Tutorial 8: How to search MS spectrum against a database of compounds? Select mode Compare Peaklist with Database MS data. Select a peaklist file and database of compounds.

Tutorial 8: How to search MS spectrum against a database of compounds? Set a maximum charge of ions. Select types of ions to be detected. Click Search -> Run in the main menu.

Example 8

Example 8 Database of compounds/sequences.

Example 8

Tutorial 9: Draw Peptide Tool

Tutorial 9: Draw Peptide Tool

Tutorial 9: Draw Peptide Tool

Tutorial 9: Draw Peptide Tool

Tutorial 10: Identification of Branch-Cyclic Peptide with a Long-sized Branch 3 ways of identification may be used for branch-cyclic peptides short-sized branch (1-3 building blocks) cyclic mode long-sized branch ( 5 building blocks) linear mode medium-sized branch branch-cyclic mode

Tutorial 10: Identification of Branch-Cyclic Peptide with a Long-sized Branch Tolaasin B linear mode simple & fast identification of the branch branch-cyclic mode more peaks are matched The spectrum of Tolaasin B is available at http://gnps.ucsd.edu/.

Example 10a Comparison of experimental spectrum of Tolaasin B with theoretical spectrum in linear mode Select peptide type linear Enable cyclic C-terminus i.e., mass of H 2 O is subtracted when C-terminal fragment ions are generated mass of b-ion is equal to mass of (y-h 2 O)-ion

Example 10a Enter peptide sequence in linear format linear format the branch is red [C8:0-OH(3)]-[dhAbu]-[D-Pro]-[D-Ser]-[D-Leu]-[D-Val]- [D-Ser]-[D-Leu]-[D-Val]-[Val]-[D-Gln]-[Leu]-[D-Val]- [dhabu]-[d-athr]-[lys]-[d-dab]-[d-hse]-[val] branch-cyclic format [Val]-[D-Hse]-[D-Dab]-[Lys]\([D-aThr]-[dhAbu]-[D-Val]- [Leu]-[D-Gln]-[Val]-[D-Val]-[D-Leu]-[D-Ser]-[D-Val]-[D- Leu]-[D-Ser]-[D-Pro]-[dhAbu]-[C8:0-OH(3)]\)

Example 10a

Example 10a 9 b-ions found + 11 (y-h 2 O)-ions found

Example 10b Comparison of experimental spectrum of Tolaasin B with theoretical spectrum in branch-cyclic mode Select peptide type branch-cyclic Enter peptide sequence in branch-cyclic format

Example 10b

Example 10a + 10b

Example 10a + 10b Comparison in linear mode 9 b-ions found 11 (y-h 2 O)-ions found 11 b-ions found 20 b-ions found in total Comparison in branch-cyclic mode 21 b-ions found 1 y-ion found false hit (branch is N-terminated, y-ions cannot be found)

Example 10c Detection of peptide sequence tag from the branch in linear mode

Example 10c Sequence tag was found [D-Pro]-[D-Ser]-[D- Leu]-[D-Val]-[D-Ser]-[D-Leu]-[D-Val]-[Val]

Tutorial 11: Identification of Peptide Family using Similarity Search Suitable when a searched peptide is not included in a sequence database but its analogue is included (i.e., one or more building blocks are different)

Tutorial 11: Identification of Peptide Family using Similarity Search Define peptide type and peaklist file Select a database of building blocks Select N-/C-terminal Modifications File Enable Similarity Search this option disables filtering of peptide sequence candidates by precursor mass

Tutorial 11: Identification of Peptide Family using Similarity Search Select mode Compare Peaklist with Database MS/MS data Select a database of sequences

Tutorial 11: Identification of Peptide Family using Similarity Search

Tutorial 11: Identification of Peptide Family using Similarity Search Gramicidin C and 2 other analogs were found

Tutorial 12: Identification of Siderophores in MS/MS data CycloBranch can identify siderophores in MS/MS data linear, cyclic, branched, and branch-cyclic NRP siderophores linear and cyclic polyketide siderophores MALDI-MS and LC-MS data imaging mass spectra (MSI) various biometals are supported - Li, Na, Mg, Al, K, Ca, Cr, Mn, Fe, Co, Ni, Cu, Zn, Ga)

Example 12a: Identification of Fe-TAFC ferri-form of triacetylfusarinine C (Fe-TAFC) precursor ion adduct must be defined [M+Fe-2H] + FeH-2 other examples: [M+H] + empty or H [M+Fe-3H+Na] + FeH-3Na [M+Al-2H] + AlH-2 [M+Zn-H] + ZnH-1

Example 12a: Identification of Fe-TAFC

Example 12a: Identification of Fe-TAFC Fe-TAFC was reported among candidates 1-8 belonging to 4 groups

Example 12a: Identification of Fe-TAFC a group corresponds to a path in the de novo graph, i.e., the 8 sequence candidates were generated from 4 paths by permutations of building blocks

Example 12b: Identification of desferri-ferrioxamine B linear polyketide must be selected database of ketide blocks must be defined ketide terminal modifications must be defined

Example 12b: Identification of desferri-ferrioxamine B

Example 12b: Identification of desferri-ferrioxamine B desferri-ferrioxamine B was reported as the top hit (6 peaks were matched)

Example 12b: Identification of desferri-ferrioxamine B

Example 12c: Identification of ferri-ferrioxamine B precursor ion adduct must be defined

Example 12c: Identification of ferri-ferrioxamine B

Example 12c: Identification of ferri-ferrioxamine B

Tutorial 13: Identification of Siderophores in MALDI-MS and LC-MS data select MS mode select a database of compounds select ion types to be detected

Tutorial 13a: Identification of Siderophores in MALDI-MS data

Tutorial 13a: Identification of Siderophores in MALDI-MS data

Tutorial 13a: Identification of Siderophores in MALDI-MS data

Tutorial 13b: Identification of Siderophores in LC-MS data select a file with LC-MS data

Tutorial 13b: Identification of Siderophores in LC-MS data

Tutorial 13b: Identification of Siderophores in LC-MS data each row corresponds to a scan spectrum ID = scan ID

Tutorial 13b: Identification of Siderophores in LC-MS data Summary Table of Matched Peaks matched peaks from all scans are listed (ID = scan ID)

Tutorial 13b: Identification of Siderophores chromatogram in LC-MS data

Tutorial 13: Identification of Siderophores in MALDI-MS and LC-MS data List of currently supported ions: [M+H]+, [M+Na]+, [M+K]+, [M-H]-, [M+Fe-2H]+, [M+Fe- 3H+Na]+, [2M+Fe-2H]+, [2M+Fe-3H+Na]+, [3M+Fe-2H]+, [3M+Fe-3H+Na]+, [3M+2Fe-5H]+, [3M+2Fe-6H+Na]+, [M+Fe-4H]-, [2M+Fe-4H]-, [3M+Fe-4H]-, [3M+2Fe-7H]-, [M+Li]+, [M+Mg-H]+, [M+Mg-2H+Na]+, [M+Mg-3H]-, [M+Al-2H]+, [M+Al-3H+Na]+, [M+Al-4H]-, [M+Ca-H]+, [M+Ca-2H+Na]+, [M+Ca-3H]-, [M+Cr-2H]+, [M+Cr- 3H+Na]+, [M+Cr-4H]-, [M+Mn-H]+, [M+Mn-2H+Na]+, [M+Mn-3H]-, [M+Co-H]+, [M+Co-2H+Na]+, [M+Co-3H]-, [M+Ni-H]+, [M+Ni-2H+Na]+, [M+Ni-3H]-, [M+Cu-H]+, [M+Cu-2H+Na]+, [M+Cu-3H]-, [M+Zn-H]+, [M+Zn-2H+Na]+, [M+Zn-3H]-, [M+Ga-2H]+, [M+Ga-3H+Na]+, [M+Ga-4H]-

Tutorial 14: Identification of Siderophores in Imaging Mass Spectra standard imzml file format is supported processed or continuous data format m/z values and intensities must be stored as 32/64-bit floats no compression profile or centroided spectra profile spectra are automatically converted to centroided spectra using OpenMS (www.openms.de) see also www.imzml.org

Tutorial 14: Identification of Siderophores in Imaging Mass Spectra the single spectra are batch-processed X and Y coordinates are reported for each single spectrum Summary Table of Matched Peaks matched peaks from all single spectra are listed Image Window visualization of matched compounds

Tutorial 14: Identification of Siderophores select imzml file in Imaging Mass Spectra set up the minimum threshold of rel. int. set up FWHM (for profile spectra only)

Tutorial 14: Identification of Siderophores select MSI mode in Imaging Mass Spectra select a database of compounds select ion types to be detected

Tutorial 14: Identification of Siderophores in Imaging Mass Spectra

Tutorial 14: Identification of Siderophores in Imaging Mass Spectra

Tutorial 14: Identification of Siderophores in Imaging Mass Spectra

Tutorial 14: Identification of Siderophores in Imaging Mass Spectra

Tutorial 14: Identification of Siderophores Image Window open image in Imaging Mass Spectra set up Max X and Max Y coordinates of the image bottom-right corner the values must be determined manually in FlexImaging (Bruker) to correctly map the spectra into the image R00X451Y321 means Max X = 451 Max Y = 321

Tutorial 14: Identification of Siderophores Image Window in Imaging Mass Spectra point corresponds to single mass spectrum standard HSL (hue, saturation, lightness) color model the most intensive point in a selected region is red zero intensity points are violet

Tutorial 14: Identification of Siderophores Image Window color represents in Imaging Mass Spectra sum of intensities of matched peaks in single mass spectrum intensity of a specific compound if it was filtered by its name in Summary Table of Matched Peaks

Tutorial 14: Identification of Siderophores in Imaging Mass Spectra