Quantitative Proteomics

Similar documents
PC235: 2008 Lecture 5: Quantitation. Arnold Falick

Mass Spectrometry and Proteomics - Lecture 5 - Matthias Trost Newcastle University

NPTEL VIDEO COURSE PROTEOMICS PROF. SANJEEVA SRIVASTAVA

Chapter 4. strategies for protein quantitation Ⅱ

Protein Quantitation II: Multiple Reaction Monitoring. Kelly Ruggles New York University

Workshop: SILAC and Alternative Labeling Strategies in Quantitative Proteomics

Protein Quantitation II: Multiple Reaction Monitoring. Kelly Ruggles New York University

Quantitative Proteomics

Computational Methods for Mass Spectrometry Proteomics

Proteome-wide label-free quantification with MaxQuant. Jürgen Cox Max Planck Institute of Biochemistry July 2011

Key questions of proteomics. Bioinformatics 2. Proteomics. Foundation of proteomics. What proteins are there? Protein digestion

SILAC and TMT. IDeA National Resource for Proteomics Workshop for Graduate Students and Post-docs Renny Lan 5/18/2017

Comprehensive support for quantitation

Genome wide analysis of protein and mrna half lives reveals dynamic properties of mammalian gene expression

Quantitation of a target protein in crude samples using targeted peptide quantification by Mass Spectrometry

Aplicació de la proteòmica a la cerca de Biomarcadors proteics Barcelona, 08 de Juny 2010

PRG2006 Research Study

Workflow concept. Data goes through the workflow. A Node contains an operation An edge represents data flow The results are brought together in tables

Modeling Mass Spectrometry-Based Protein Analysis

Chemical Labeling Strategy for Generation of Internal Standards for Targeted Quantitative Proteomics

Protein Identification Using Tandem Mass Spectrometry. Nathan Edwards Informatics Research Applied Biosystems

Improved 6- Plex TMT Quantification Throughput Using a Linear Ion Trap HCD MS 3 Scan Jane M. Liu, 1,2 * Michael J. Sweredoski, 2 Sonja Hess 2 *

TUTORIAL EXERCISES WITH ANSWERS

Amine specific Labeling Reagents for Multiplexed Relative and Absolute Protein Quantitation

Designed for Accuracy. Innovation with Integrity. High resolution quantitative proteomics LC-MS

We are IntechOpen, the first native scientific publisher of Open Access books. International authors and editors. Our authors are among the TOP 1%

Analysis of Labeled and Non-Labeled Proteomic Data Using Progenesis QI for Proteomics

Spectrum-to-Spectrum Searching Using a. Proteome-wide Spectral Library

Quantitative analysis of the proteome

DIA-Umpire: comprehensive computational framework for data independent acquisition proteomics

Reagents. Affinity Tag (Biotin) Acid Cleavage Site. Figure 1. Cleavable ICAT Reagent Structure.

Isotopic-Labeling and Mass Spectrometry-Based Quantitative Proteomics

Targeted protein quantification

Proteomics. November 13, 2007

Tutorial 1: Setting up your Skyline document

Identification of Human Hemoglobin Protein Variants Using Electrospray Ionization-Electron Transfer Dissociation Mass Spectrometry

Developing Algorithms for the Determination of Relative Abundances of Peptides from LC/MS Data

Quantitative analysis of the proteome. Proteomics Data Standards

Chem 250 Unit 1 Proteomics by Mass Spectrometry

SRM assay generation and data analysis in Skyline

The Agilent 6495 Triple Quadrupole LC/MS: Peptide Quantitation Performance

6 x 5 Ways to Ensure Your LC-MS/MS is Healthy

LECTURE-13. Peptide Mass Fingerprinting HANDOUT. Mass spectrometry is an indispensable tool for qualitative and quantitative analysis of

MS-based proteomics to investigate proteins and their modifications

Proteolytic 18O-Labeling Strategies for Quantitative Proteomics

Rapid and Sensitive Fluorescent Peptide Quantification Using LavaPep

Statistical analysis of isobaric-labeled mass spectrometry data

Atomic masses. Atomic masses of elements. Atomic masses of isotopes. Nominal and exact atomic masses. Example: CO, N 2 ja C 2 H 4

Rapid Distinction of Leucine and Isoleucine in Monoclonal Antibodies Using Nanoflow. LCMS n. Discovery Attribute Sciences

Proteomics: the first decade and beyond. (2003) Patterson and Aebersold Nat Genet 33 Suppl: from

Overview - MS Proteomics in One Slide. MS masses of peptides. MS/MS fragments of a peptide. Results! Match to sequence database

High-Throughput Protein Quantitation Using Multiple Reaction Monitoring

De novo Protein Sequencing by Combining Top-Down and Bottom-Up Tandem Mass Spectra. Xiaowen Liu

Quantitation of TMT-Labeled Peptides Using Higher-Energy Collisional Dissociation on the Velos Pro Ion Trap Mass Spectrometer

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it?

Introduction to Proteomics & Bottom-up Proteomics

Protocol. Product Use & Liability. Contact us: InfoLine: Order per fax: www:

UCD Conway Institute of Biomolecular & Biomedical Research Graduate Education 2009/2010

Towards the Prediction of Protein Abundance from Tandem Mass Spectrometry Data

High-Field Orbitrap Creating new possibilities

Protein analysis using mass spectrometry

Site-specific Identification of Lysine Acetylation Stoichiometries in Mammalian Cells

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics

Applications of Mass Spectrometry for Biotherapeutic Characterization

Chapter 2 What are the Common Mass Spectrometry-Based Analyses Used in Biology?

Tandem mass spectra were extracted from the Xcalibur data system format. (.RAW) and charge state assignment was performed using in house software

Protein Sequencing Research Group ABRF 2015 annual meeting

HOWTO, example workflow and data files. (Version )

Qualitative Proteomics (how to obtain high-confidence high-throughput protein identification!)

Mass spectrometry-based proteomics in the life sciences: a review

Increasing the Multiplexing of Protein Quantitation from 6- to 10-Plex with Reporter Ion Isotopologues

Comparative and Quantitative Global Proteomics Approaches: An Overview

Analyst Software. Peptide and Protein Quantitation Tutorial

PROTEOMICS IN VASCULAR BIOLOGY

Calibration in Proteomics. Proteomics 202 :: Practical Proteomics Using the Skyline Software Ecosystem Lindsay K. Pino Monday, Jan 22

PeptideProphet: Validation of Peptide Assignments to MS/MS Spectra. Andrew Keller

MS Based Proteomics: Recent Case Studies Using Advanced Instrumentation

Introduction into Selected Reaction Monitoring (SRM) Christina Ludwig

Isobaric Labeling-Based Relative Quantification in Shotgun Proteomics

SERVA ICPL Kit (Cat.-No )

A Statistical Model of Proteolytic Digestion

The Power of LC MALDI: Identification of Proteins by LC MALDI MS/MS Using the Applied Biosystems 4700 Proteomics Analyzer with TOF/TOF Optics

Introduction to Proteomics: Fragmentation of protonated peptides and manual sequencing

Peter A. DiMaggio, Jr., Nicolas L. Young, Richard C. Baliban, Benjamin A. Garcia, and Christodoulos A. Floudas. Research

Relative Quantitation of TMT-Labeled Proteomes Focus on Sensitivity and Precision

Targeted Proteomics Environment

Q Exactive TM : A True Qual-Quan HR/AM Mass Spectrometer for Routine Proteomics Applications. Yi Zhang, Ph.D. ThermoFisher Scientific

Supplementary Figure 1

Advanced Fragmentation Techniques for BioPharma Characterization

Mass spectrometry has been used a lot in biology since the late 1950 s. However it really came into play in the late 1980 s once methods were

Protocol. Product Use & Liability. Contact us: InfoLine: Order per fax: www:

iprophet: Multi-level integrative analysis of shotgun proteomic data improves peptide and protein identification rates and error estimates

Properties of Average Score Distributions of SEQUEST

Statistical mass spectrometry-based proteomics

Peptide Targeted Quantification By High Resolution Mass Spectrometry A Paradigm Shift? Zhiqi Hao Thermo Fisher Scientific San Jose, CA

Biochemistry 3100 Sample Problems Chemical and Physical Methods

A label-free quantification method by MS/MS TIC compared to SILAC and spectral counting in a proteomics screen

TOMAHAQ Method Construction

BIOINF 4120 Bioinformatics 2 - Structures and Systems - Oliver Kohlbacher Summer Systems Biology Exp. Methods

MALDI-HDMS E : A Novel Data Independent Acquisition Method for the Enhanced Analysis of 2D-Gel Tryptic Peptide Digests

Transcription:

BSPR workshop 16 th July 2010 Quantitative Proteomics Kathryn Lilley Cambridge Centre for Proteomics Department of Biochemistry University of Cambridge k.s.lilley@bioc.cam.ac.uk www.bio.cam.ac.uk/proteomics/

Outline Quantitation in proteomics Relative Quantitation Absolute Quantitation Importance of Experimental Design Importance of Suitable Data Analysis

Quantitative Proteomics Relative Absolute fold change absolute amount

Quantitative Proteomics Relative Absolute -Comparative levels of proteins between two or more samples - 2D gel/ DIGE - Isobaric labelling itraq/tmt - Metabolic labelling/ SILAC - Label Free - Rank order of protein abundance - Assessment of stoichiometry - Facilitates targeted analysis - Transferable data sets - Internal standards (usually peptide surrogates)

Outline Quantitation in proteomics Relative Quantitation Absolute Quantitation Importance of Experimental Design Importance of Suitable Data Analysis

Quantitative proteomics methodologies Gel based Stable isotope labelling Label free

2D PAGE Visualize many proteins at once Relatively quick acidic Isoelectric focussing basic High mw Great way of storing samples SDS PAGE Detect isoforms if pi shift Relatively inexpensive Can use with functional stains Poor gel to gel reproducibility 1 st dimension = pi Low mw Many stains not linear along dynamic range 2 nd dimension = MW No good for membrane proteins

Difference Gel Electrophoresis DIGE First described by Jon Minden (Carnegie Mellon University. Pittsburg, USA Ünlü M. et al (1997). Electrophoresis,18, 2071-2077

Sample 1 Sample 2 Sample 3 label with cy2 in dark 30mins @ 4 O C label with cy3 in dark 30mins @ 4 O C label with cy5 in dark 30mins @ 4 O C quench un-reacted dye by adding 1mM lysine in dark 10mins @ 4 O C Difference Gel Electrophoresis Ünlü M. et al (1997). Electrophoresis,18, 2071-2077 2D gel electrophoresis

Cy5 Cy3 no difference presence / absence up / down-regulation Cy3 +Cy5

Quantification using stable isotope labelling In vivo In vitro Elemental Amino acid MS MS/MS

Stable Isotope Labelling in vivo Sample 1 incorporates natural isotope Sample 2 incorporates heavier isotope Digest with protease MS/MS to identify Mixture of light/heavy peptides LC separation usually multi dimensional light heavy Quantitation in MS

Stable Isotope Labelling in vivo 1. Elemental Samples grown in medium where there is replacement of an element with a stable isotope? Typically 15 N instead of 14 N, or 13 C instead of 12 C 13 C not often used as more carbon in proteins than nitrogen and therefore big mass shifts Do not known mass difference between light and heavy pairs unless sequence is deduced (retention times) Types of samples suitable? Bacterial / Cell culture

Examples E coli grown on 15 N sole nitrogen source and then fed to C.elegans Light mutant Heavy WT Light WT Heavy mutant Krijgsveld et al (2003) Nat. Biotech.21:927

Stable Isotope Labelling in vivo 1. Amino acid - SILAC (Stable Isotope Labeling with Amino acids in Cell culture) Samples grown in medium where there is replacement of an amino acid with heavier stable isotopic form of the amino acid Typically 13 C instead of 12 C labelled lysine, arginine or leucine Know the mass difference between light and heavy pairs Need to check for extent of incorporation as need also to buy depleted medium Types of samples suitable? Bacterial / Cell culture

Stable Isotope Labelling in vivo SILAC Mouse Krüger et al (2008) Cell 134(2):353-64 SILAC Drosophila Sury et al (2010) Mol. Cell Prot. On-line Problem is the conversion of Arg to Pro many in higher organisms only use labeled lysine and digestion with LysC, this gives rise to longer peptides for analysis

Stable Isotope Labelling in vitro 1. Analysis at MS stage 16 O 18 O Many variants including Isotopes introduced during proteolysis 18 O labelled water, C-termini Guanidation of lysine using isotopes of O-methyl isourea lysine residues Dimethyl labelling lysine residues Mostly the above lead to small mass differences Back exchange can be a problem with trypsin

Stable Isotope Labelling in vitro 2. Analysis at MS/MS stage itraq reagents (Amine Modifying Labeling Reagents for Multiplexed Relative and Absolute Protein Quantitation) 4 x isobaric tags - all 145 Da React with primary amines Label at peptide level Fragment during MSMS to produce characteristic reporter ion for each tag Ross et al (2004) Mol. Cell. Prot. 3:1154

Stable Isotope Labelling itraq Isotopic Variation MS MSMS

Quantitation using a label free approach Peak measurements Spectral counting

Label Free Proteomics Peaks Ion intensity measurements Compare peak intensities of the same ion in consecutive LCMS runs Need to match retention times with m/z values Can be targeted approach collecting MSMS information in a separate run only fragmenting ions showing change in abundance Essential to have good mass accuracy and reproducible retention times M/z time

Label Free Proteomics - Spectral counting Spectral counts Number of non-redundant spectra matching the same proteins The number of redundant peptides observed correlates with abundance PAI = protein abudance index number of observed peptdies /number of observable peptides empai = 10 PAI -1 Must take length of protein into account empai software available for analysis (Exponentially modified protein abundance index) See: Ishihama Y, et al Mol Cell Proteomics. (2005) 4(9):1265-72

Summary 2D PAGE/DIGE Good method of protein separation Limited protein coverage as no membrane proteins Co-migration In vivo labelling Data from all peptides potentially Only appropriate where growth conditions can be specified itraq/tmt Data from all peptides potentially 4,6,or 8 way multiplexing Co-selection problem leads to unestimation of large fold changes Label free Cheap Complex data analysis Greatest variance? 114 115 116 117

Outline Quantitation in proteomics Relative Quantitation Absolute Quantitation Importance of Experimental Design Importance of Suitable Data Analysis

Absolute Quantitation Assay proteins of interest MS based absolute quantitation works by measuring peptide surrogate simultaneously against quantified internal standard. Surrogate = peptides The ions that are used for measurement are generally MS/MS fragment ions which are discriminatory for the peptide of choice

Multiple Reaction Monitoring Q CID Q Precursor ion selected Collision Induced Fragmentation Diagnostic fragment ions selected = transitions Natural peptide Synthetic version of peptide containing stable isotope m/z

How to create good peptide internal standard? AQUA Gerber et al (2003) PNAS 100(12):6940-5 QconCAT Beynon et al (2005) Nat. Methods 2(8):587-9. Labelled proteins mass Western Lehmann et al (2008) The Plant Journal 55:1039 1046 Good Example Full dynamic range proteome analysis of S. cerevisiae by targeted proteomics. Picotti P, Bodenmiller B, Mueller LN, Domon B, Aebersold R. Cell. 2009 138(4):795-806

AQUA Stable isotope tagged synthetic peptide protein of interest Assumption: Stoichiometric release of peptide surrogate. Internal standard not generated by tryptic cleavage Tryptic digestion

QconCAT Stable isotope labelled synthetic protein Constructed from concatenated peptides(qprotein) Protein of interest Assumption: Stoichiometric release of peptide surrogate. Internal standard not generated by identical tryptic cleavage Tryptic digestion

Recombinant labelled protein Assumption: Stable isotope labelled recombinant protein Mass Western Protein of interest Identical tryptic cleavage for internal standard and surrogate. Complete set of internal standards Tryptic digestion

LC-MS E Multiplexed data acquisition Add known amount of enolase and calibrate absolute abundance based on the performance of its top three scoring peptides Collision Energy in gas cell alternated between Low energy (5eV) Elevated energy (linear 15 ev - 42 ev) Silva et. al., Anal Chem. (2005) Liu et. al. Proteomics (2009)

MS E Absolute and estimated Quantitation

Outline Quantitation in proteomics Relative Quantitation Absolute Quantitation Importance of Experimental Design Importance of Suitable Data Analysis

Quantitative data Knowledge of these facts influences Systematic error Systematic variance 1. Design of experiment Sensitivity Precise, but actually measures the incorrect amount Imprecise, measures the correct value only some of the time 2. Number of replicates utilised Ability to measure small differences 3. Application of normalisation methods

ABRF Proteomics Research Group Study 2006 www.abrf.org 8 proteins Same total amount of protein in each sample 52 responses Turck et al (2007) Mol. Cell. Prot. 6(8):1291-8

Do they give the same results? 2D gel DIGE Label free (P) Stable isotope tags Spectral count

Why you don t get the same answer? Variability in starting material Biological variation Variability in experimental protocol (influences technical variance) Point at which you combine samples to be compared Inappropriate experimental design Not enough replicates Inaccuracy of measurement The wrong answer all the time The wrong answer some of the time Inappropriate statistical testing Using a test that does not fit the data

Biological Variance Try to control as much of variance as possible 0.2 0.15 0.1 0.05 0-0.05 Aco2 Alb Aldh-M1 Aldh2 (1) Aldh2 (2) Aldh2 (3) Cat (1) Cat (2) Cpsase1 (3) Eif3s4d Khk Sdh1-0.1 Standardised collection protocols -0.15 Appropriate samples (matched controls) Time of harvest Relative protein expression 0.1 0.05 0-0.05-0.1-0.15 Acaa2 (1) Acaa2 (2) Aldh1a1 Aldo2 Ass1 Bhmt Cpsase1 (1) Cpsase1 (2) Cpsase1 (4) Cpsase1 (5) Eno1 Grp58 Hnrpa2b1 LOC236937 Prdx6 (2) Scp2 Serpina1d Upb1 0.15 0.1 0.05 0-0.1-0.15-0.2 0610038K03Rik Akr7a5 Arg1 Atp5b Cpsase1 (6) Eef1d Hao1 Hspa9a Prdx6 (1) Rgn -0.25-0.3 0 4 8 12 16 20 Circadian time -0.05 Reddy et al (2006) Current Biology 16(11):1107-15.

Differential variance in a protocol Extract proteins Points of variance Extraction of proteins 1D gel In gel digestion LC MS Russell and Lilley in preparation

Types of Replicates Abundance Abundance A B A B Biological Replicates Technical Replicates Technical replicates give an illusion of more power (sensitivity) Karp et al, J.Prot Res, 2005, 4, 1867-71.

Power comparison The power of a is the probability that the test will reject a false null hypothesis (i.e. that it will not make a Type II error). As power increases, the chances of a Type II error decrease. The probability of a Type II error is referred to as the false negative rate (β). Therefore power is equal to 1 β. Depends on noise of system (variance), effect size (i.e. 2 fold), significance demanded by researcher (error you re prepared to live with), number of replicates. Calculated in detecting a 2 fold change with a noise measure that encompasses 75% of the species studied for a confidence of 0.01. Karp et al, Mol.Cell Prot, 2007, 6, 1354-64.

Is the sample representative? What threshold should you use? Abundance Density Data point Log(ratio) Karp et al, Proteomics, 2004, 4, 1421-32.

Randomisation in design Cy3 Cy5 Cy5 control treated Internal standard treated control Internal standard control treated Internal standard treated control Internal standard Cy3 Cy5 Cy5 control treated Internal standard control treated Internal standard control treated Internal standard control treated Internal standard Principle component 2 batch effects seen in same-same study. Principle component 1 Karp et al, Proteomics, 2005, 5, 81-90.

Are you using the correct statistical test? Assumptions: Normality Homogeneity of variance Y Variable Density Independent sampling Variable X Variable Karp et al, MCP, 2007, 6, 1354-64. Karp & Lilley, Proteomics, 2005, 5, 3105-15.

False Discovery Rate TP Significance threshold FDR = false calls of significance calls of significance Frequency Distribution for tests with differences Uniform distribution for tests with no differences FP p-value

Importance of communication and design

Thank you for listening Kathryn Lilley Cambridge Centre for Proteomics k.s.lilley@bioc.cam.ac.uk www.bio.cam.ac.uk/proteomics