Yifei Bao. Beatrix. Manor Askenazi

Similar documents
Modeling Mass Spectrometry-Based Protein Analysis

Improved 6- Plex TMT Quantification Throughput Using a Linear Ion Trap HCD MS 3 Scan Jane M. Liu, 1,2 * Michael J. Sweredoski, 2 Sonja Hess 2 *

Protein Quantitation II: Multiple Reaction Monitoring. Kelly Ruggles New York University

Protein Quantitation II: Multiple Reaction Monitoring. Kelly Ruggles New York University

Overview - MS Proteomics in One Slide. MS masses of peptides. MS/MS fragments of a peptide. Results! Match to sequence database

Biological Mass Spectrometry

UCD Conway Institute of Biomolecular & Biomedical Research Graduate Education 2009/2010

De novo Protein Sequencing by Combining Top-Down and Bottom-Up Tandem Mass Spectra. Xiaowen Liu

Atomic masses. Atomic masses of elements. Atomic masses of isotopes. Nominal and exact atomic masses. Example: CO, N 2 ja C 2 H 4

Key questions of proteomics. Bioinformatics 2. Proteomics. Foundation of proteomics. What proteins are there? Protein digestion

PROTEIN SEQUENCING AND IDENTIFICATION USING TANDEM MASS SPECTROMETRY

HR/AM Targeted Peptide Quantification on a Q Exactive MS: A Unique Combination of High Selectivity, High Sensitivity, and High Throughput

MS-based proteomics to investigate proteins and their modifications

Tutorial 1: Setting up your Skyline document

DIA-Umpire: comprehensive computational framework for data independent acquisition proteomics

Quantitation of a target protein in crude samples using targeted peptide quantification by Mass Spectrometry

Aplicació de la proteòmica a la cerca de Biomarcadors proteics Barcelona, 08 de Juny 2010

Performing Peptide Bioanalysis Using High Resolution Mass Spectrometry with Target Enhancement MRM Acquisition

Proteomics. Yeast two hybrid. Proteomics - PAGE techniques. Data obtained. What is it?

Background: Imagine it is time for your lunch break, you take your sandwich outside and you sit down to enjoy your lunch with a beautiful view of

Computational Methods for Mass Spectrometry Proteomics

Background: Comment [1]: Comment [2]: Comment [3]: Comment [4]: mass spectrometry

Developing Algorithms for the Determination of Relative Abundances of Peptides from LC/MS Data

HOWTO, example workflow and data files. (Version )

Towards the Prediction of Protein Abundance from Tandem Mass Spectrometry Data

Powerful Scan Modes of QTRAP System Technology

Proteome-wide label-free quantification with MaxQuant. Jürgen Cox Max Planck Institute of Biochemistry July 2011

for the Novice Mass Spectrometry (^>, John Greaves and John Roboz yc**' CRC Press J Taylor & Francis Group Boca Raton London New York

Bioinformatics 2. Yeast two hybrid. Proteomics. Proteomics

Designed for Accuracy. Innovation with Integrity. High resolution quantitative proteomics LC-MS

Guide to Peptide Quantitation. Agilent clinical research

BIOINF 4120 Bioinformatics 2 - Structures and Systems - Oliver Kohlbacher Summer Systems Biology Exp. Methods

Workshop: SILAC and Alternative Labeling Strategies in Quantitative Proteomics

SRM assay generation and data analysis in Skyline

Effective Strategies for Improving Peptide Identification with Tandem Mass Spectrometry

Nature Methods: doi: /nmeth Supplementary Figure 1. Fragment indexing allows efficient spectra similarity comparisons.

Isotopic-Labeling and Mass Spectrometry-Based Quantitative Proteomics

Chemical Labeling Strategy for Generation of Internal Standards for Targeted Quantitative Proteomics

NPTEL VIDEO COURSE PROTEOMICS PROF. SANJEEVA SRIVASTAVA

Proteomics. Areas of Interest

MS Based Proteomics: Recent Case Studies Using Advanced Instrumentation

Isotope Dilution Mass Spectrometry

All Ions MS/MS: Targeted Screening and Quantitation Using Agilent TOF and Q-TOF LC/MS Systems

Targeted Proteomics Environment

L3.1: Circuits: Introduction to Transcription Networks. Cellular Design Principles Prof. Jenna Rickus

Engineering of Repressilator

High-Field Orbitrap Creating new possibilities

ALIGNMENT OF LC-MS DATA USING PEPTIDE FEATURES. A Thesis XINCHENG TANG

PC235: 2008 Lecture 5: Quantitation. Arnold Falick

Data pre-processing in liquid chromatography mass spectrometry-based proteomics

LECTURE-11. Hybrid MS Configurations HANDOUT. As discussed in our previous lecture, mass spectrometry is by far the most versatile

Electron Transfer Dissociation of N-linked Glycopeptides from a Recombinant mab Using SYNAPT G2-S HDMS

Mass spectrometry has been used a lot in biology since the late 1950 s. However it really came into play in the late 1980 s once methods were

Statistical mass spectrometry-based proteomics

Improved Throughput and Reproducibility for Targeted Protein Quantification Using a New High-Performance Triple Quadrupole Mass Spectrometer

Proteomics. 2 nd semester, Department of Biotechnology and Bioinformatics Laboratory of Nano-Biotechnology and Artificial Bioengineering

Reagents. Affinity Tag (Biotin) Acid Cleavage Site. Figure 1. Cleavable ICAT Reagent Structure.

Electrospray ionization mass spectrometry (ESI-

Mass Spectrometry Based De Novo Peptide Sequencing Error Correction

Targeted protein quantification

Peptide Targeted Quantification By High Resolution Mass Spectrometry A Paradigm Shift? Zhiqi Hao Thermo Fisher Scientific San Jose, CA

Computational Analysis of Mass Spectrometric Data for Whole Organism Proteomic Studies

Relative quantification using TMT11plex on a modified Q Exactive HF mass spectrometer

Biology Scope & Sequence

Mass spectrometry in proteomics

Self-assembling covalent organic frameworks functionalized. magnetic graphene hydrophilic biocomposite as an ultrasensitive

LECTURE-13. Peptide Mass Fingerprinting HANDOUT. Mass spectrometry is an indispensable tool for qualitative and quantitative analysis of

TOMAHAQ Method Construction

A New Hybrid De Novo Sequencing Method For Protein Identification

Analysis of Peptide MS/MS Spectra from Large-Scale Proteomics Experiments Using Spectrum Libraries

Rapid and Accurate Forensics Analysis using High Resolution All Ions MS/MS

Chem 250 Unit 1 Proteomics by Mass Spectrometry

CSE182-L8. Mass Spectrometry

A Software Suite for the Generation and Comparison of Peptide Arrays from Sets. of Data Collected by Liquid Chromatography-Mass Spectrometry

What is Systems Biology

A Quadrupole-Orbitrap Hybrid Mass Spectrometer Offers Highest Benchtop Performance for In-Depth Analysis of Complex Proteomes

Accelerating Biomolecular Nuclear Magnetic Resonance Assignment with A*

Supporting Information

Last updated: Copyright

Application Note LCMS-112 A Fully Automated Two-Step Procedure for Quality Control of Synthetic Peptides

Peter A. DiMaggio, Jr., Nicolas L. Young, Richard C. Baliban, Benjamin A. Garcia, and Christodoulos A. Floudas. Research

Cryodetectors in Life Science Mass Spectrometry. Damian Twerenbold

Mass Spectrometry and Proteomics - Lecture 5 - Matthias Trost Newcastle University

Optimization and Use of Peptide Mass Measurement Accuracy in Shotgun Proteomics* S

A Dynamic Programming Approach to De Novo Peptide Sequencing via Tandem Mass Spectrometry

A Statistical Model of Proteolytic Digestion

Increasing Speed of UHPLC-MS Analysis Using Single-stage Orbitrap Mass Spectrometer

Key Words Q Exactive, Accela, MetQuest, Mass Frontier, Drug Discovery

CONJOINT 541. Translating a Transcriptome at Specific Times and Places. David Morris. Department of Biochemistry

Proteomics. November 13, 2007

Towards Detecting Protein Complexes from Protein Interaction Data

Protein Identification Using Tandem Mass Spectrometry. Nathan Edwards Informatics Research Applied Biosystems

Unravelling the biochemical reaction kinetics from time-series data

An extended siderophore suite from Synechococcus sp. PCC 7002 revealed by LC- ICPMS-ESIMS

Quantitative Proteomics

networks in molecular biology Wolfgang Huber

TUTORIAL EXERCISES WITH ANSWERS

Q Exactive TM : A True Qual-Quan HR/AM Mass Spectrometer for Routine Proteomics Applications. Yi Zhang, Ph.D. ThermoFisher Scientific

Curriculum Links. AQA GCE Biology. AS level

Exhaustive search. CS 466 Saurabh Sinha

Transcription:

Detection and Correction of Interference in MS1 Quantitation of Peptides Using their Isotope Distributions Yifei Bao Department of Computer Science Stevens Institute of Technology Beatrix Ueberheide Department of Biochemis try and Molecular Pharmacology and Proteomics Resource Lab New York University Langonee Medical Center Manor Askenazi The Ionomix Initiative, Arlington, MA Joseph S. Glavy Department of Chemistry, Chemical Biology and Biomedical Engineering Stevens Institute of Technology Jessica Chapman and David Fenyö Center for Health Informatics and Bioinformatics New York University Langonee Medical Center Department of Computer Science Technical Report CS-2013-3

1 Abstract Each peptide is observed as a group of peaks in mass spectrometry because of the presence of small amounts of stable heavy isotopes in nature. The natural isotopic distribution of a peptide is dependent on the number of atoms that make up the peptide and can be theoretically determined. Therefore, the isotopic distribution of a peptide is independent of peptide concentration, and any interference in MS1 quantitation of peptides can be detected by looking for deviations from a peptide s theoretical isotopic distributions. In this study, computer simulations were used to build a library of di erent types and magnitude of interference, i.e. hypothetical interferences were added to theoretical isotope distribution for peptides. Interference correction was done through searching this library. The approach was demonstrated using experimental data from E. coli and S. cerevisiae tryptic digestion mixtures. 1 Introduction The research and development of the Human Genome Project has provided a wealth of genetic information available to biologists. The major challenges now are to study the structure and function of the encoded proteins and to approach biological questions from the protein perspective to provide both qualitative and quantitative information about gene expression [1]. The structure and function of proteins are too complex to be be predicted from analysis of nucleic acids alone. From transcription to translation, extensive changes can be introduced. Post-translational modifications (PTM) such as phosphorylation and dephosphorylation, glycosylation, and translocation within the cell can also dramatically change the protein products from a single gene. Proteomics is an emerging field to explore protein research from the overall level of cellular proteins [2 4]. A multitude of proteomic technologies have been developed to address the challenges from qualitative protein identification and quantitative analysis of changes in proteomes [5]. Two dimensional gel electrophoresis [6, 7] is the major tool to separate proteins on the basis of charge in the first dimension and molecular mass in the second. After separation, proteins can be cut into peptides by enzymes, which can be identified and quantified using mass spectrometry to measure their mass-charge ratio (m/z), and abundance in the sample. Mass spectrometry was originally developed almost 100 years ago to measure elemental atomic weights and the natural abundance of specific isotopes [8]. Over the pass decades, mass spectrometry (MS) has been the indispensable technology in proteomics for accurate identification and quantitation of proteins because of its sensitivity which allows for the quantitative measurement of low level peptides in complex mixtures [9]. In mass spectrometry based peptide quantitation, the measured signal is considered to have a linear dependence on the amount of material in the sample for the entire range of amounts being studied. Recent studies have been done to address the problems of signal processing and peak finding [10, 11]. The quantity of peptides is measured by calculating the height or the area of the corresponding peaks in the ion chromatograms. A challenging problem for peptide quantitation is that peptides with the same m/z will interfere with each other. The frequent occurrence of interferences is a significant problem that can result in inaccurate quantitation of peptides. However there are only a few recent studies that address interference detection and correction issues for MS2 quantitation, and there is very little focus on this problem for MS1 quantitation. Manual inspection is still typically used to identify interferences. To address the time-intensive and error-prone nature of manual inspection, we propose computational approaches here to detect and correct the interference in both MS1 quantitation of peptides and MRM quantitation of peptides.

2 2 Materials and methods 2.1 Theoretical isotope distribution calculation The amount of stable heavy isotopes in nature is known, e.g., 0.015% 2 H, 1.11% 13 C and 0.366% 15 N, 0.038% 17 O, 0.200% 18 O, 0.75% 33 S, 4.21% 34 S, and 0.02% 36 S. The following equation (Equation 1) is usually used to calculate the intensity of peak m in a mass spectrum caused by one specific heavy isotope. n T m = p m (1 p) (n m) (1) m For instance, if the heavy isotope is 13 C, n is the number of carbon atoms in the peptide; p is the probability of 13 C in nature, which is 1.11%; m is the number of 13 C; and T m represents the probability of m 13 C atoms appear in the peptide. The intensity of the monoisotopic peak is considered to be 1, so that T m also represents the relative intensity of peak m caused by the only heavy isotope 13 Cinthe distribution. The intensities of the theoretical isotope distribution of peptides are calculated by including all possible isotopes. We used Monte Carlo simulations to simulate the possible appearance of heavy isotopes in a peptides for each atom. The algorithm was repeated 10000000 times. The intensities is calculated by the normalizations between the count of peptides with heavy isotopes and the count of monoisotopic peptides. For each peptide, three peaks starting from the monoisotopic peak were used to represent the intensities of isotope distribution. The intensities of the three peaks of peptide GLPADVVPGDILLLDDGR are 1, 1.0147529010, and 0.5896354272. 2.2 Data collection Group Mixture 1 1µg E. coli tryptic digestion 2 1µg E. coli and 0.05µg S. cerevisiae tryptic digestions mixture 3 1µg E. coli and 0.1µg S. cerevisiae tryptic digestions mixture Table 1. Samples for experimental data generation In this study, three groups of samples as shown in Table 1 with di erent E. coli and S. cerevisiae tryptic digestion mixture ratios were prepared to generate experimental data. Those samples were analyzed using a Q Exactive (Thermo Scientific, Inc.) a hybrid quadrupole orbitrap mass spectrometer for peptide identification and quantitation. Group 1 has only E. coli tryptic digestion, which is a control group and supposed to have no interference from S. cerevisiae peptides. For the other two groups, some E. coli peptides would su er interference from S. cerevisiae peptides. Our approach was applied to the experimental data for interference detection and correction. From the raw files, we extracted both the MS1 and MS2 datasets. We searched the data with X!Tandem (http://www.thegpm.org/tandem/ index.html) using the MS2 dataset to identify a list of E. coli peptides and S. cerevisiae peptides. A Perl program was implemented to find the E. coli peptides which are interfered by the S. cerevisiae peptides though searching the peptides which may possibly have the same m/z value. GLPADVVPGDILLLDDGR is a candidate E. coli peptide on the list, and it is used as an example here to interpret our approach. The experimental data was uploaded to Slice (http://fenyolab.ionomix.com/), whichisaweb- based application for mass spectrometry data searching and visualization. The MS1 measured intensities of the peptides that we are going to analyze from the three di erent groups were obtained from Slice. Figure 1 shows the MS1 measurement around the highest intensity of monoisotopic peak of 2+ charged GLPADVVPGDILLLDDGRE. Each graph from the top to the bottom shows the data from pure E. coli

3 sample (group 1), E. coli plus lower concentration of S. cerevisiae sample (group 2), and E. coli plus higher concentration of S. cerevisiae sample (group 3) respectively. On the right side of each panel, the intensities of theoretical isotope distribution are shown for this peptide. 2.3 Interference detection Since the isotopic distribution of a peptide is independent of peptide concentration, the measured intensities in one specific peptides mass spectrometry without interference must be consistent with its theoretical isotope distribution. If there is interference, the intensities of one or more peaks will be increased and the changes can be detected by comparing the measured intensities with the theoretical isotope distribution. With obvious interference, manual inspection can be easily conducted by observing the plots as in Figure 1. However, manual inspection is error-prone and does not work for many cases. From Figure 1, observation with human eyes cannot help to draw the conclusion of whether there is interference for 2+ charged GLPADVVPGDILLLDDGR in the samples, because the plots of measured intensities seem very similar to the theoretical isotope distribution. The measured intensities of 2+ charged GLPADVVPGDILLLD- DGR in the three di erence samples can be found in Table 2 below. The 1st peak column represents the intensities of the monoisotopic peak. Compared with group 1, the measured intensities in group 2 and group 3 increase as the concentration of S. cerevisiae tryptic digestion increases, which indicates group 2 and group 3 su er from interference based on the condition that group 1 has no interference from S. cerevisiae peptides. We present here a computational approach to detect interference by checking whether measured isotopic distribution of a peptide deviates from the theoretical isotope distribution. We used the relative ratios between the intensities of di erent isotopic peaks as the signature features to define the isotope distribution. Equation 2 below calculates the di erence between two isotope distribution. i is the number of peaks starting from monoisotopic peak, whose number is 1. I 1,i is the intensity of peak i from the measured isotope distribution and I 2,i is the intensity of peak i from the theoretical isotope distribution. dif i represents the di erence between two isotope distributions by comparing the relative ratios calculated by the first two peaks. We only focused on the first three peaks for each measurement. Hence, i can be 1 or 2. We set the threshold for the di erence between two isotope distributions to be 0.1, which means if either dif 1 or dif 2 is greater than 0.1, we consider that the measurement su ers from interference. dif i = I1,i+1 abs( I 1,i I 2,i+1 I 2,i ) I 2,i+1 I 2,i (if I 1,i 6=0,I 2,i 6=0,andI 2,i+1 6= 0) (2) As mentioned in Section 2.1, the intensity ratios of the three peaks starting from monoisotopic peak for peptide GLPADVVPGDILLLDDGR is theoretically to be 1: 1.0147529010: 0.5896354272. Table 2 below shows the calculation of dif 1 and dif 2 for the measurements of 2+ charged GLPADVVPGDILLLDDGR. Group 2 and 3 are detected to have interference based on the calculation, and the result is consistent with the experimental data. Group 1st Peak 2nd Peak 3rd Peak dif 1 dif 2 1 4101069 3871992 2037097 0.06958512 0.090449 2 6136956 5574542 3019380 0.104850 0.0636069 3 7720741 7526431 3893232 0.0393407 0.105725 Table 2. Interference detection based on the experimental data for 2+ GLPADVVPGDILLLDDGR. The threshold was set to 0.1, and both group 2 and group 3 were detected to have interference, while group 1 had no interference detected.

Figure 1. MS1 intensity distribution for 2+ charged GMPADVVPGDILLLDDGR. Black represents the first peak which is the monoisotopic peak, red represents the second peak and yellow represents the third peak. The intensities of the theoretical isotope distribution are presented on the right side of each panel. X axis is the scan time, and y axis is the intensity. 4

5 2.4 Interference library construction and interference correction Compared with interference detection, it is a much more challenging task to correct the interference, because it is necessary to know the intensities of which peaks are increased by interference and how much is increased. We propose an approach to simulate the di erent types and magnitude of interference that one peptide can have which constructs a library of mass spectrum patterns. For each pattern in the library, both the theoretical isotope distribution of the peptide and what kind of interferences was added to the theoretical isotope distribution are known information. The interference correction can be done by searching matching patterns in the library for the measurements that are detected to have interference. A suite of Perl programs were written to simulate the e ect of di erent types of interference on the isotope distribution of three E. coli peptides in the 800-3000 Da mass range, which are recognized as base peptides for the simulation. For each base peptide, 4 charges (1+, 2+, 3+, and 4+) were considered and the intensities based on the natural isotopic abundance of ions were calculated. For each mass and charge combination, the di erent types of peptide interferences were enumerated. The interference could be in any charge state, have a mass o set (+1, +2, +3, 0, -1, -2, and -3), and di erent relative intensities (0.1, 0.33, 1.0, 3.0 and 1.0). Figure 2 shows examples of the interference simulation based on the 2+ charged peptide TKPHVN- VGTIGHVDHGKTT which has 1997 Da mass. In those examples, the charge state of the peptides which causes interference is 4+ and the mass o set is 1.0. The first column shows the intensities of theoretical isotope distribution for TKPHVNVGTIGHVDHGKTT and the intensities of the monoisotopic peaks are all set to 1.0. The second column shows the average intensities of theoretical isotope distribution for S. cerevisiae peptides with mass as 3994 Da, which have same m/z value as 2+ charged TKPHVN- VGTIGHVDHGKTT when they are 4+ charged. From the top to the bottom in the second column, the intensities of the monoisotopic peak in each graph are 0.1, 0.33, 1.0, 3.0, and 10.0 respectively, which covers a range of relative intensities of the interference. The first column and second column overlapped to generate the third column, in which the blue is the correct intensity based on theoretical isotope distribution and the red on the top of the blue is the increased intensity caused by interference. The fourth column shows the increase of intensities for the first three peaks of the theoretical isotope distribution. When a measurement is detected to have interference, it is searched in the library for matching patterns. Each graph as those in the third column of Figure 2 can be candidate patterns for searching. In order to narrow the searching range, we only focus on the base peptides whose charges are the same as the measured peptide and m/z values within the range of 200 of the measured peptides m/z value. We used Equation 2 to check the similarities between the measured intensities of a peptide and a pattern in the library. To find a matching pattern in the library, the following two conditions must be satisfied: 1. The measured intensities should be similar to the pattern in the library, which means dif 1 and dif 2 are both smaller than the threshold 0.1. 2. Without adding interference, the theoretical isotope distribution of the base peptide should be similar to the theoretical isotope distribution of the measured peptide and the threshold is set to 0.1 as well. 3 Results and Discussions The measurements for 2+ charged GLPADVVPGDILLLDDGR in E. coli plus lower concentration of S. cerevisiae sample (group 2), and E. coli plus higher concentration of S. cerevisiae sample (group 3) were detected to have interference as mentioned in Section 2.2. After searching in the library, two patterns were found to match with the measurements from group 2 and one pattern was found to match with the measurements from group 3. Figure 3 shows a matching pattern from the library for group 2

Figure 2. Interference simulation examples for 2+ charged TKPHVNVGTIGHVDHGKTT. The charge of the peptides which causes interference is 4. For the first three columns, the x axis is the m/z and y axis is the intensity. For the last column, the x axis is the m/z and y axis is the increase of intensity for the first three peaks of theoretical isotope distribution. 6

7 Figure 3. Pattern searching for 2+ charged GLPADVVPGDILLLDDGR measurement from group 2. (A) The first three peaks of the isotope distribution of GLPADVVPGDILLLDDGR. (B) The measured isotope distribution of GLPADVVPGDILLLDDGR from group 2. (C) The isotope distribution of the base peptide. (D) The isotope distribution of the peptides that cause interference. (E) The pattern that matches with (B). (F) The increase of intensity from interference for the first three peaks. measurements. In this pattern the mass of the interference is 1997 Da; the mass o set is -3 and the charge is 2+. Figure 4(A) shows the correction based on the pattern in Figure 3. The corrected intensity is very close to the measured intensity for group 1. Figure 4(B) shows the interference correction for group 2 measurements based on the other pattern, and Figure 4(C) shows the correction for the measurements from group 3. The interference corrections presented in Figure 4 all reduce the impact of interference for the measurements of 2+ charged E. coli peptide GLPADVVPGDILLLDDGR. For the two matching patterns of group 2 measurements, the correction showed in Figure 4(B) is not as good as the correction showed in Figure 4(A). However, the pattern matching can be more specific by adjusting the parameters. When we compare the di erence between two isotope distributions, how the threshold is set will have influence on the interference detection and correction. If the threshold is increased, less interference will be detected and more patterns will be selected from the library for interference detection. If the threshold is decreased, more interference will be detected and fewer patterns will be selected from the library for interference detection. For the two patterns selected for goup2 measurements mentioned above, if we set the threshold for pattern searching to be 0.9 instead of 1.0, the pattern that was used to generate the correction in Figure 4(B) would not be selected and Figure 4(A) would be the only correction result for the group 2 measurements.

8 7000000 6000000 5000000 4000000 3000000 2000000 1000000 0 7000000 6000000 5000000 4000000 3000000 2000000 1000000 0 9000000 8000000 7000000 6000000 5000000 4000000 3000000 2000000 1000000 0 (A) (B) (C) Figure 4. Interference correction result for 2+ charged GLPADVVPGDILLLDDGR measurement. (A) and (B) are the corrections for group 2, and (C) is the correction for group 3. X axis shows the three groups of data: uncorrected measurement, corrected measurement and measurement from group 1. Y axis is the intensity. References 1. R. E. Banks, M. J. Dunn, D. F. Hochstrasser, J. C. Sanchez, W. Blackstock, D. J. Pappin, and P. J. Selby, Proteomics: new perspectives, new biomedical opportunities, Lancet, vol. 356, pp. 1749 1756, Nov. 2000. PMID: 11095271. 2. W. P. Blackstock and M. P. Weir, Proteomics: quantitative and physical mapping of cellular proteins, Trends in biotechnology, vol. 17, pp. 121 127, Mar. 1999. PMID: 10189717. 3. N. L. Anderson and N. G. Anderson, Proteome and proteomics: new technologies, new concepts, and new words, Electrophoresis, vol. 19, pp. 1853 1861, Aug. 1998. PMID: 9740045. 4. P. James, Protein identification in the post-genome era: the rapid rise of proteomics, Quarterly reviews of biophysics, vol. 30, pp. 279 331, Nov. 1997. PMID: 9634650. 5. C. May, F. Brosseron, P. Chartowski, C. Schumbrutzki, B. Schoenebeck, and K. Marcus, Instruments and methods in proteomics, Methods in molecular biology (Clifton, N.J.), vol. 696, pp. 3 26, 2011. PMID: 21063938. 6. P. H. O Farrell, High resolution two-dimensional electrophoresis of proteins, The Journal of biological chemistry, vol. 250, pp. 4007 4021, May 1975. PMID: 236308. 7. A. Gorg, C. Obermaier, G. Boguth, A. Harder, B. Scheibe, R. Wildgruber, and W. Weiss, The current state of two-dimensional electrophoresis with immobilized ph gradients, Electrophoresis, vol. 21, pp. 1037 1053, Apr. 2000. PMID: 10786879. 8. H. H. Willard, L. L. M. Jr, and J. A. Dean, Instrumental Methods of Analysis. Wadsworth Publishing Company, 7 sub ed., Jan. 1988.

9 9. R. Aebersold and M. Mann, Mass spectrometry-based proteomics, Nature, vol. 422, pp. 198 207, Mar. 2003. 10. J. Zhang, E. Gonzalez, T. Hestilow, W. Haskins, and Y. Huang, Review of peak detection algorithms in liquid-chromatography-mass spectrometry, Current Genomics, vol. 10, pp. 388 401, Sept. 2009. PMID: 20190954 PMCID: PMC2766790. 11. C. Yang, Z. He, and W. Yu, Comparison of public peak detection algorithms for MALDI mass spectrometry data analysis, BMC Bioinformatics, vol. 10, p. 4, Jan. 2009.