Features or compounds? A data reduction strategy for untargeted metabolomics to generate meaningful data

Similar documents
Metabolomics in an Identity Crisis? Am I a Feature or a Compound? The world leader in serving science

Automated and accurate component detection using reference mass spectra

Increasing Speed of UHPLC-MS Analysis Using Single-stage Orbitrap Mass Spectrometer

A Strategy for an Unknown Screening Approach on Environmental Samples using HRAM Mass Spectrometry

Fast and Reliable Method for the Analysis of Methylmalonic Acid from Human Plasma

Tomorrow s quantitation with the TSQ Fortis mass spectrometer: quantitation of phenylephrine hydrochloride for QA/QC laboratories

A Strategy for an Unknown Screening Approach on Environmental Samples Using HRAM Mass Spectrometry

De Novo Metabolite Chemical Structure Determination. Paul R. West Ph.D. Stemina Biomarker Discovery, Inc.

LC-MS Based Metabolomics

Achieve confident synthesis control with the Thermo Scientific ISQ EC single quadrupole mass spectrometer

Key Words Q Exactive, Accela, MetQuest, Mass Frontier, Drug Discovery

Quantitative analysis of small molecules in biological samples. Jeevan Prasain, Ph.D. Department of Pharmacology & Toxicology, UAB.

LC-MS/MS Method for the Determination of Diclofenac in Human Plasma

HR/AM Targeted Peptide Quantification on a Q Exactive MS: A Unique Combination of High Selectivity, High Sensitivity, and High Throughput

Relative quantification using TMT11plex on a modified Q Exactive HF mass spectrometer

Choosing the metabolomics platform

Simultaneous, Fast Analysis of Melamine and Analogues in Pharmaceutical Components Using Q Exactive - Benchtop Orbitrap LC-MS/MS

Expanded capabilities in ammonium and amine detection

MetWorks Metabolite Identification Software

SEAMLESS INTEGRATION OF MASS DETECTION INTO THE UV CHROMATOGRAPHIC WORKFLOW

TR ACE ORGANIC. lgcstandards.com/foodandenvironment

Fundamentals of Mass Spectrometry. Fundamentals of Mass Spectrometry. Learning Objective. Proteomics

Chemistry Instrumental Analysis Lecture 37. Chem 4631

Overcoming Interferences with the Thermo Scientific icap 7000 Plus Series ICP-OES

Hydrophilic Interaction Liquid Chromatography: Some Aspects of Solvent and Column Selectivity

All Ions MS/MS: Targeted Screening and Quantitation Using Agilent TOF and Q-TOF LC/MS Systems

Plasma-free Metanephrines Quantitation with Automated Online Sample Preparation and a Liquid Chromatography-Tandem Mass Spectrometry Method

Multi-residue analysis of pesticides by GC-HRMS

Determination of urea in ultrapure water by IC-MS/MS

Rapid Quan/Qual Metabolic Stability Analysis with Online Oxidative Metabolism Synthesis

High-Throughput LC-MS/MS Quantification of Estrone (E1) and Estradiol (E2) in Human Blood Plasma/Serum for Clinical Research Purposes

Accurate measurement of elemental impurities in metals and metal alloys using the Thermo Scientific icap TQ ICP-MS

Evaluation of a New HPLC, a New Tandem MS and a New Data Processing Software for General Clinical Use

High-Field Orbitrap Creating new possibilities

Multiple Fragmentation Methods for Small Molecule Characterization on a Dual Pressure Linear Ion Trap Orbitrap Hybrid Mass Spectrometer

Quan/Qual Analyses. Unmatched Confidence for. Thermo Scientific Q Exactive Orbitrap LC-MS/MS System. Identify Quantify Confirm

Making Sense of Differences in LCMS Data: Integrated Tools

A Platform to Identify Endogenous Metabolites Using a Novel High Performance Orbitrap MS and the mzcloud Library

Determination of trace anions in concentrated hydrofluoric acid

Utility of H-SRM to Reduce Matrix Interference in Food Residue Analysis of Pesticides by LC-MS/MS Using the TSQ Quantum Discovery

Mixed-Mode, Weak Anion-Exchange, Solid-Phase Extraction Method for the Extraction of Niflumic Acid from Human Plasma

PesticideScreener. Innovation with Integrity. Comprehensive Pesticide Screening and Quantitation UHR-TOF MS

A Study of Stability, Robustness and Time Efficiency of a New HPLC and a New Tandem MS

Identifying Disinfection Byproducts in Treated Water

Screening of pesticides residues by the time of flight analyzer (ToF) : Myth or reality?

M M e M M H M M H. Ion Sources

OVERVIEW INTRODUCTION. Michael O Leary, Jennifer Gough, Tanya Tollifson Waters Corporation, Milford, MA USA

Accelerated unknown compound annotation with confidence: from spectra to structure in untargeted metabolomics experiments

Analytical Technologies and Compound Identification. Daniel L. Norwood, MSPH, PhD SCĪO Analytical Consulting, LLC.

WADA Technical Document TD2003IDCR

Exploring the Benefits of Automated Unattended Sample Derivatization Prior to Gas Chromatography Analysis

Yun W. Alelyunas, Mark D. Wrona, Russell J. Mortishire-Smith, Nick Tomczyk, and Paul D. Rainville Waters Corporation, Milford, MA, USA INTRODUCTION

Live Webinar : How to be more Successful with your ACQUITY QDa Detector?

A novel high resolution accurate mass Orbitrap-based GC-MS platform for routine analysis of Short Chained Chlorinated Paraffins

Analysis of Polar Metabolites using Mass Spectrometry

Using a Reagent-Free ion chromatography system to monitor trace anion contamination in the extracts of electronic components

Compounding insights Thermo Scientific Compound Discoverer Software

sample was a solution that was evaporated in the spectrometer (such as with ESI-MS) ions such as H +, Na +, K +, or NH 4

Automated, intelligent sample preparation: Integration of the ESI prepfast Auto-dilution System with the Thermo Scientific icap 7400 ICP-OES

ProPac WCX-10 Columns

Thermo Scientific LTQ Orbitrap Velos Hybrid FT Mass Spectrometer

Targeted protein quantification

Elemental Analysis: CHNS/O characterization of polymers and plastics

MassHunter TOF/QTOF Users Meeting

The Theory of HPLC. Quantitative and Qualitative HPLC

MS Interpretation I. Identification of the Molecular Ion

Total Elemental Analysis of Food Samples for Routine and Research Laboratories, using the Thermo Scientific icap RQ ICP-MS

Direct Analysis using Paper-Spray Mass Spectrometry: Method Development for the Rapid Screening of Drugs of Abuse for Forensic Toxicology

Application Note LCMS-116 What are we eating? MetaboScape Software; Enabling the De-replication and Identification of Unknowns in Food Metabolomics

A Rapid Approach to the Confirmation of Drug Metabolites in Preclinical and Clinical Bioanalysis Studies

SmartNotes. What is meant by the term interference?

Sensitive HILIC UHPLC-UV determination of steviol glycoside natural sweeteners

WADA Technical Document TD2015IDCR

Mass Spectrometry. Ionizer Mass Analyzer Detector

Fitting of ETD Rate Constants for Doubly and Triply Charged Ions in a Linear Ion Trap

Elemental Analysis: NCS characterization of paper

High-Pressure Electrolytic Carbonate Eluent Generation Devices and Their Applications in Ion Chromatography Systems

Dionex IonPac AS28-Fast-4µm column

Analysis of a Verapamil Microsomal Incubation using Metabolite ID and Mass Frontier TM

Improved Screening for 250 Pesticides in Matrix using a LC-Triple Quadrupole Mass Spectrometer

The Power of LC MALDI: Identification of Proteins by LC MALDI MS/MS Using the Applied Biosystems 4700 Proteomics Analyzer with TOF/TOF Optics

Overview. Introduction. André Schreiber AB SCIEX Concord, Ontario (Canada)

Thermo Scientific Pesticide Explorer Collection. Start-to-finish. workflows for pesticide analysis

Improved Throughput and Reproducibility for Targeted Protein Quantification Using a New High-Performance Triple Quadrupole Mass Spectrometer

The key to your laboratory s success

Webinar: How to be More Successful with your ACQUITY QDa Detector Eric van Beelen

Speciation of Bromine Compounds in Ozonated Drinking Water using Ion Chromatography and Inductively Coupled Plasma Mass Spectrometry

Mass Spectral Studies of Polypropylene Chromatographic Well Plates

HILIC Method Development in a Few Simple Steps

Quantitation and Characterization of Copper Plating Bath Additives by Liquid Chromatography with Charged Aerosol Detection

Determination of Polyacrylic Acid in Boiler Water Using Size-Exclusion Chromatography with Charged Aerosol Detection

Improved 6- Plex TMT Quantification Throughput Using a Linear Ion Trap HCD MS 3 Scan Jane M. Liu, 1,2 * Michael J. Sweredoski, 2 Sonja Hess 2 *

An Effective Workflow for Impurity Analysis Incorporating High Quality HRAM LCMS & MSMS with Intelligent Automated Data Mining

Mass Spectrometry. General Principles

Defining quality standards for the analysis of solid samples

CHAPTER 2 LIFE'S CHEMICAL BASIS MULTIPLE CHOICE. ANS: C PTS: 1 DIF: Moderate OBJ: Bloom's Taxonomy: Knowledge. Mercury Rising

Identification and Characterization of an Isolated Impurity Fraction: Analysis of an Unknown Degradant Found in Quetiapine Fumarate

Analysis of high matrix samples using argon gas dilution with the Thermo Scientific icap RQ ICP-MS

How to screen and identify unexpected and unwanted compounds in food

Accelerate Unknown Detection in Emerging Drug Testing Using Thermo Scientific Compound Discoverer and mzcloud

Transcription:

TECHNICAL NOTE 65204 Features or compounds? A data reduction strategy for untargeted metabolomics to generate meaningful data Authors Amanda Souza and Ralf Tautenhahn Thermo Fisher Scientific Keywords Metabolomics, untargeted metabolomics, features, compounds, data reduction, Compound Discoverer software Introduction Untargeted metabolomics describes an unbiased approach that globally analyzes the metabolome in any given sample. Comprehensive metabolome coverage is obtained using analytical techniques such as liquid chromatography mass spectrometry (LC-MS) by collecting all measurements indiscriminately. While a defined mass range is normally implemented with this approach, there is a fundamental notion to collect everything and leave nothing behind. At the same time, unbiased data collection results in an exhaustive list of spectral features or signals, making data analysis laborious and cumbersome. Through a data reduction process these features can be converted to a list of meaningful compounds by accounting for artifacts such as naturally occurring isotopes and chemical interactions such as adduct formations with metal ions. Likewise, background features unrelated to the experimental samples such as chemical noise, contaminants, or analytes resulting from sample handling can be eliminated from the working list. Furthermore, neglecting artifacts may lead to over interpretation of data, thus drawing incorrect conclusions and wasting time. Here we present how to differentiate a compound from a molecular feature using Thermo Scientific Compound Discoverer software to reduce redundancies and accelerate data analysis.

Unbiased peak detection with high resolution accurate mass (HRAM) LC-MS Untargeted metabolomics aims to capture comprehensive coverage of the metabolome utilizing an all-encompassing approach. The application can be applied to a variety of samples including biofluids, animal or plant tissue, and intracellular components. These samples of biological origin are commonly regarded as complex matrices due to the presence of numerous and diverse endogenous components such as biogenic amines, amino acids, organic acids, and lipid species. These complex mixtures benefit from separation techniques such as liquid chromatography to resolve components in the time dimension based on compound polarity. Indeed the data generated is highly dense and complex in nature. Mass spectrometers measure the mass-to-charge ratio (m/z) of chemical species. Combined with signal intensity, a three dimensional array of components representing molecules from the extracted complex matrix is generated using unbiased detection, thus producing an untargeted output (Figure 1). HRAM mass spectrometers operated in full scan mode detect all ions in a complex matrix where spectral resolution pertains to the spectrometer s ability to differentiate adjacent mass peaks in the spectrum. High resolution mass spectrometers such as those using Thermo Scientific Orbitrap technology offer ultrahigh resolving power, up to 500,000 FWHM, and recently available is the option for one million FWHM at m/z 200. In most cases, data acquisition of untargeted metabolomics applications using Orbitrap mass spectrometers generally start with full scan mode analysis using resolving power from 60,000 to 140,000 FWHM at m/z 200 depending on the instrument. A wide mass range, for instance 70 800 m/z, is interrogated and allows for quantification of precursor (parent) ions in either positive or negative ion polarity. One analyte can give rise to multiple adducts, fragments, and even cluster ions in addition to their respective isotopic peaks. As a result, detection of thousands of features, also termed signals or peaks, occurs when analyzing a wide mass range in full scan mode. This list of thousands of features in untargeted analysis can be misleading because not all features independently represent a singular biomolecular, endogenous metabolite. Accounting for naturally occurring isotopes The small molecules of interest in metabolomics studies are predominantly composed of chemical elements like carbon, hydrogen, nitrogen, sulfur, and oxygen. For example, the endogenous small molecule creatine Intensity [counts] (10 9 ) m/z Time (min) Figure 1. Three dimensional array of spectral features using LC-MS for untargeted metabolomics analysis. The first dimension, X axis, represents chromatography separation by time. The second dimension, Z axis, represents mass detection based on a mass-to-charge ratio (m/z). The final dimension is signal intensity or the response of the measurement based on ion current. 2

has an elemental composition of C 4 H 9 N 3 O 2. Further, these chemical elements contain naturally occurring stable isotopes. For instance, 14 N and 15 N are naturally abundant isotopes of nitrogen. So for compounds observed in a mass spectrum, there are multiple isotopic peaks representing different naturally occurring isotopes. For instance, carbon has one isotope, 13 C, of detectable abundance with a mass difference of 1.0034 Daltons from the monoisotopic peak, 12 C. Undoubtedly, these two spectral peaks for any given compound are baseline resolved with high resolution mass spectrometers. Yet, other elements have isotopes of one mass unit from the monoisotopic peak, thus creating an A1 cluster of detectable ions. 1 Consider the amino acid methionine with an elemental composition of C 5 H 11 NO 2 S. The theoretical m/z of the monoisotopic peak in positive ion polarity is 150.05833 m/z. The calculated m/z for the 13 C and 15 N isotope peaks of methionine is 151.06168 and 151.05536, respectively. Thus for small molecules containing nitrogen along with carbon, the 15 N isotope peak, with a mass difference of 0.997 Daltons from the monoisotopic peak, can be resolved from the 13 C isotope peak, a mere spectral distance of 0.0063 Daltons. Higher resolution settings can sufficiently resolve these closely related ions within an isotope cluster as shown with methionine in a metabolomics extract from the human plasma reference material, NIST 1950 (Figure 2). Resolution can further be extended to the A2 cluster or elements possessing an isotope differing by two mass units from the monoisotopic peak. Ultimately, higher resolving power reveals a fine isotope structure of detected peaks where several spectral peaks represent the same molecule. Confidence in spectral peak assignments is generated by high mass accuracy. Orbitrap mass spectrometers are capable of obtaining less than three parts per million (ppm) mass accuracy with an external calibration and less than one ppm mass accuracy with internal calibration. Exact mass measurements are then compared to theoretical or expected masses based on elemental composition as described above. With high mass accuracy, low mass tolerance criteria can be applied when determining spectral peaks. Lower tolerances, in turn, allow for higher confidence in mass assignments. Correct mass assignments are critical in unknown metabolomics analysis given the exhaustive list of unknown spectral peaks present. L-Methionine C 5 H 11 NO 2 S Positive Mode Figure 2. Example of a fine isotopic distribution for the A1 ion cluster of methionine. Data collection was on an organic extraction of human plasma reference material, NIST SRM 1950, using a Thermo Scientific Q Exactive HF MS with a resolution setting of 120,000 FWHM @ 200 m/z. Elemental composition for the amino acid is C 5 H 12 N 2 O 2. Multiple isotope peaks are resolved in the A1 cluster. 3

Multiple ion species for one molecule in electrospray ionization Electrospray ionization (ESI) is widely implemented in untargeted metabolomics LC-MS analysis. While the exact mechanism of how this ionization technique works have yet to be fully determined, the fundamental principle states that molecules in liquid phase are transferred to gas phase by applying an electric potential to the analyte solution to produce a fine mist of charged droplets subjected to solvent evaporation and subsequent ion ejection. 2 ESI is considered a soft ionization process because little to no fragmentation of the charged molecule occurs. Positively and negatively charged ions form in solution by the presence of acidic or basic functional groups on organic molecules and charge separation of inorganic species. 3 Ionization can be impacted by the choice of solvents, additives, and salt buffers for chromatographic mobile phases. Further, analytes with adduct formation to cationic (sodium, potassium, ammonium) and anionic species (chloride) may be observed. Given the possibilities for chemical interactions, one molecule may be represented by several ionic forms. 4 When analyzing mass spectral data, the protonated or deprotonated form of the molecule is typically searched. However, it is possible and likely that other forms of the molecule could be present like those arising from adduct formation, dimerization, or interaction with molecules of the mobile phase. For example, the sodium adduct [M+Na] should be searched for data acquired in the positive ion polarity. In this simple case of considering sodium, two features reflect the same molecule. To add to this, these two monoisotopic features have related naturally occurring isotopes attributing to a greater number of features. Taken together, the potential that multiple features exist representing one molecule should not be underestimated. Figure 3 shows multiple features detected representing tryptophan in a metabolomics extract of rat plasma. Failure to recognize and account for these chemical interactions is time consuming and can lead to misinterpretation of results. Analyzing data at the features level is a formidable and inefficient task given the labeled A m/z B Figure 3. MS1 mass spectrum in positive ion polarity and related table from Compound Discoverer software for tryptophan. A) The mass spectrum shows the most intense peak highlighted in lavender while peaks marked with green are associated ions for tryptophan. B) Table displaying ion assembly list for tryptophan. 4

redundancies from isotopes and adducts already noted. Besides this, the risk of carrying out analyses on false positives is much greater. Consequently, with the recognition that untargeted metabolomics is typically processed for statistical significance by differential analysis, the statistics can simply be inflated leading researchers to incorrect conclusions. These pitfalls can be avoided when handling the data appropriately. Background ions contribute to molecular features Data output from an untargeted metabolomics LC-MS analysis is comprised of not only biologically relevant small molecules but also components unrelated to the experiment which are derived from external sources. The experimental sample, whether biofluid or tissue, goes through a series of processing steps before reaching the detector of the mass spectrometer. The sample preparation process often entails quenching and extracting target compounds with organic solvents and possibly reconstitution with chromatographic mobile phases all the while being subjected to consumables like microcentrifuge tubes and pipette tips. This process creates a favorable environment to attract unsuspecting molecules such as plasticizers like phthalates and polymers like polysiloxanes. 5 Additional sources of unrelated compounds include solvents not only for sample extraction but also chromatographic mobile phases. Further, molecules from within the laboratory environment may trickle in. Fortunately, these unrelated components can be controlled for during data processing with proper experimental design. Prior to the start of the experimental analysis, a blank sample should be analyzed. In the context of untargeted metabolomics analysis, a blank sample can be either a solvent blank or an experimental blank. The solvent blank often mirrors the sample injection medium, which tends to be the starting chromatographic conditions. For instance, using reversed-phase chromatography, this will likely be a high percentage of water. With hydrophilic interaction chromatography, the blank will consist of mostly organic solvent compositions. As mentioned above, a sample is exposed to external factors like consumables. Same as the sample, the selected solvent can be prepared with the same procedure to mimic the sample extraction. This blank in turn is referred to as the experimental blank. Either way, the components originating from these samples can then be referenced to the true experimental samples to determine unrelated background molecules, also termed background noise (Figure 4). Incorporating a solvent blank or experimental blank enables determining spectral features originating from external sources. Figure 4. Extracted ion chromatogram (XIC) overlay from Compound Discoverer software showing an unknown background compound detected in all samples. This unknown compound was consistently detected in the experimental samples as well as the solvent blank injection. The XICs for all samples are overlaid showing the same peak shape and intensity. Compounds of this nature are marked as such and can easily be hidden from the results list or recalled up when necessary. The red vertical line represents the apex for post chromatographic alignment. 5

Meaningful compounds extracted from molecular features using Compound Discoverer software Distinguishing a feature from a compound using Compound Discoverer software provides perspective into the data reduction strategy in unknown analysis toward generating a working list of meaningful results. The discussion thus far focuses on molecular features and the origin of these features in order to appreciate how compound lists using Compound Discoverer software are generated. Specifically, a feature is an m/z at a given retention time. The term feature is synonymous with chromatographic peak or signal. As described above, multiple features can represent a single molecule via naturally occurring isotopes or chemical interactions. HRAM MS resolves closely related masses and even fine isotopic structure to efficiently detect these constituents. In contrast, a compound within Compound Discoverer software reflects a group of associated features. To this end, the feature assembly process in Compound Discoverer software was applied to a rat plasma extract analyzed in full scan at 70,000 resolving power, FWHM at 200 m/z, using a Thermo Scientific Q Exactive mass spectrometer. Using typical detection parameter settings for untargeted metabolomics studies, a total number of 54,000 features was reduced to 20,000 unique isotope groups, and further converted to 2,467 compounds. Beyond detecting experimental artifacts like naturally occurring isotopes and ion formations by chemical interactions, Compound Discoverer software can mark background compounds from blank samples and further refine the list of compounds (Figure 5). When analyzed using the same LC-MS conditions, the solvent or experimental blank contains ions unrelated to the experimental samples. Blank sample injections can be placed at the beginning of the injection sequence or at various points within throughout the analysis run. Multiple blank injections can be submitted to Compound Discoverer software for consideration. Based on a user-defined threshold of blank samples compared to experimental samples, background compounds are marked and can be hidden from the working results table. Accounting for unrelated background ions can eliminate unrelated molecules from the data output while substantially reducing the number of compounds for analysis. Figure 5. XIC overlay of tryptophan from Compound Discoverer software showing associated ion species. Several ions detected in positive polarity at the same retention time are assembled to reflect one compound within one sample injection of rat plasma extract. Associated ions include [M+H] +, [M+Na] +, [M+K] +, [M+H-NH 3 ] +, [2M+H] +, [2M+Na] +. 6

Summary HRAM LC-MS is a powerful tool for untargeted metabolomics analysis of complex biological matrices. High resolution full scan MS detects all ionic species present in analyzed samples generating an exhaustive results list of molecular features. This list of features is comprised of monoisotopic species, corresponding natural isotopes, associated adducts, and unsuspecting background ions due to the sample handling process and chromatographic mobile phase. Taking all of these experimental artifacts into consideration during data processing enables a proactive approach to eliminating redundancies and unrelated ions. Moving from features to compounds using Compound Discoverer software can provide a list of meaningful results for accelerated analysis. References 1. McLafferty, F.W. (1980). Interpretation of mass spectra, third edition. University science books, Mill Valley, California. 2. Konermann, L., Ahadi, E., Rodriguez, A.D. and Vahidi, S. (2013). Unraveling the mechanisms of electrospray ionization. Analytical Chemistry, 85, 2 9. 3. Cech, N.B., and Enke, C.G. (2001). Practical implications of some recent studies in electrospray ionization fundamentals. Mass Spectrometry Reviews, 20, 362 387. 4. Mahieu, N.G. and Patti, G.J. (2017). Systems-level annotation of a metabolomics data set reduces 25,000 features to fewer than 1000 unique metabolites. Analytical Chemistry, 89, 10397 10406. 5. Keller, B.O., Sui, J., Young, A.B., and Whittal, R.M. (2008). Interferences and contaminants encountered in modern mass spectrometry. Analytica Chimica Acta, 672, 1 81. Find out more at thermofisher.com/metabolomics 2018 Thermo Fisher Scientific Inc. All rights reserved. All trademarks are the property of Thermo Fisher Scientific Inc. and its subsidiaries. This information is presented as an example of the capabilities of Thermo Fisher Scientific Inc. products. It is not intended to encourage use of these products in any manners that might infringe the intellectual property rights of others. Specifications, terms and pricing are subject to change. Not all products are available in all countries. Please consult your local sales representative for details. TN65204-EN 0418S