Method Transfer across Multiple MicroNIR Spectrometers for Raw Material Identification

Similar documents
Verification of Pharmaceutical Raw Materials Using the Spectrum Two N FT-NIR Spectrometer

Lan Sun, Chang Hsiung, Christopher G. Pederson, Peng Zou, Valton Smith, Marc von Gunten, and Nada A. O Brien. Introduction.

Pros and cons of using correlation versus multivariate algorithms for material identification via handheld spectroscopy

Putting Near-Infrared Spectroscopy (NIR) in the spotlight. 13. May 2006

Raw material identification using dual laser handheld Raman spectroscopy Sulaf Assi Associate Lecturer, Forensic Sciences, Bournemouth University

NIR Spectroscopy Identification of Persimmon Varieties Based on PCA-SVM

Snowy Range Instruments

Snowy Range Instruments

Positive and Nondestructive Identification of Acrylic-Based Coatings

Spectroscopy in Transmission

INTERNATIONAL JOURNAL OF PHARMACEUTICAL RESEARCH AND BIO-SCIENCE

Application of Raman Spectroscopy for Detection of Aflatoxins and Fumonisins in Ground Maize Samples

A Machine Learning Application for Classification of Chemical Spectra

PTG-NIR Powder Characterisation System (not yet released for selling to end users)

Application of Raman Spectroscopy for Noninvasive Detection of Target Compounds. Kyung-Min Lee

Mixed Hierarchical Models for the Process Environment

IDRaman reader and IDRaman mini: Raster Orbital Scanning (ROS)

A comparison of near infrared method development approaches using a drug product on different spectrophotometers and chemometric software algorithms

NEAR INFRARED SPECTROSCOPY IN DRUG DISCOVERY AND DEVELOPMENT PROCESSES

Monitoring the Aggregation of Particles through Raman Spectroscopy

Handheld FTIR spectroscopy applications using the Agilent ExoScan 4100 FTIR with diffuse sample interface

MULTIVARIATE PATTERN RECOGNITION FOR CHEMOMETRICS. Richard Brereton

Inspection of raw materials and chemicals by using Raman Spectroscopy October, 2013

TECHNICAL DESCRIPTION SPECTRAFLOW ON LINE ANALYZER for BELT CONVEYOR APPLICATION

Swiss Medic Training Sampling

Advancing from unsupervised, single variable-based to supervised, multivariate-based methods: A challenge for qualitative analysis

The Role of a Center of Excellence in Developing PAT Applications

Mira Metrohm Instant Raman Analyzer. Always ready to measure

Analysis of Polymers and Plastics. Innovation with Integrity. Quality Control & Failure Analysis FTIR

UV3000. Accurate, precise, and portable ambient gas point analyzer

transpec Instant Polymer Analyser Webinar starting soon

Detection and Quantification of Adulteration in Honey through Near Infrared Spectroscopy

Natural Products. Innovation with Integrity. High Performance NMR Solutions for Analysis NMR

Chemometrics: Classification of spectra

Process Analytical Technology Diagnosis, Optimization and Monitoring of Chemical Processes

Optimizing Model Development and Validation Procedures of Partial Least Squares for Spectral Based Prediction of Soil Properties

Plan. Lecture: What is Chemoinformatics and Drug Design? Description of Support Vector Machine (SVM) and its used in Chemoinformatics.

Plum-tasting using near infra-red (NIR) technology**

Measurement & Analytics Measurement made easy. MB3600-CH70 FT-NIR polyol analyzer Pre-calibrated for OH value determination

Overview and comparison of portable spectroscopy techniques: FTIR, NIR and Raman

THE INTRODUCTION OF PROCESS ANALYTICAL TECHNOLOGY, USING NEAR INFRARED ANALYSIS, TO A PHARMACEUTICAL BLENDING PROCESS

Raman Spectrometers. Cora Family

The Plus for your lab!

Research Article Near-Infrared Spectroscopy for Anticounterfeiting Innovative Fibers

Robert P. Cogdill, Carl A. Anderson, and James K. Drennen, III

Automated Stability Testing Delivering Rapid Product Development

Introduction to Support Vector Machines

Chemometrics. 1. Find an important subset of the original variables.

Nouvelles approches de mise en œuvre de la spectroscopie NIR et Raman pour le suivi de la qualité des produits industriels en ligne ou at-line

FTIR characterization of Mexican honey and its adulteration with sugar syrups by using chemometric methods

Introducing the Morphologi G3 ID The future of particle characterization

PREDICTION OF SOIL ORGANIC CARBON CONTENT BY SPECTROSCOPY AT EUROPEAN SCALE USING A LOCAL PARTIAL LEAST SQUARE REGRESSION APPROACH

Evolved gas analysis by simultaneous thermogravimetric differential thermal analysis-fourier transformation infrared spectroscopy (TG-DTA-FTIR)

Monitoring Emulsion Polymerization by Raman Spectroscopy

ALPHA. Innovation with Integrity. The very compact and smart FTIR spectrometer FTIR

3.0 APPARATUS : Capsule Machine, Analytical balance, Spatula, Beaker, Pestle and mortar, Powder paper.

Discrimination of Dyed Cotton Fibers Based on UVvisible Microspectrophotometry and Multivariate Statistical Analysis

Analysis of Polymers and Plastics. Innovation with Integrity. Quality Control & Failure Analysis FT-IR

Analysis of Cocoa Butter Using the SpectraStar 2400 NIR Spectrometer

Corresponding Author: Jason Wight,

The prediction of house price

Advanced Spectroscopy Laboratory

THERAPEUTIC GOODS SAMPLING Application to Tablet Manufacture and Ingredients. David Edmonds CMC Regulatory

Probabilistic Approach for Material Verification and Identification in Pharmaceutical Applications

MULTIVARIATE STATISTICAL ANALYSIS OF SPECTROSCOPIC DATA. Haisheng Lin, Ognjen Marjanovic, Barry Lennox

Independence and Dependence in Calibration: A Discussion FDA and EMA Guidelines

Characterization of Jet Charge at the LHC

Influence of different pre-processing methods in predicting sugarcane quality from near-infrared (NIR) spectral data

Whole Tablet Measurements Using the Frontier Tablet Autosampler System

Pungency Level Assessment With E-Tongue Application to Tabasco Sauce

The integration of NeSSI with Continuous Flow Reactors and PAT for Process Optimization

Agilent Oil Analyzer: customizing analysis methods

Detection of trace contamination on metal surfaces using the handheld Agilent 4100 ExoScan FTIR

Near Infrared reflectance spectroscopy (NIRS) Dr A T Adesogan Department of Animal Sciences University of Florida

Raw Material Identification with FT-NIR

VALIDATION OF A UPLC METHOD FOR A BENZOCAINE, BUTAMBEN, AND TETRACAINE HYDROCHLORIDE TOPICAL SOLUTION

Discriminating between polymorphs of acetaminophen using Morphologically-Directed Raman Spectroscopy (MDRS)

Application Note 12: Fully Automated Compound Screening and Verification Using Spinsolve and MestReNova

Agilent Cary 630 FTIR sample interfaces combining high performance and versatility

Chromatographic Methods of Analysis Section 2: Planar Chromatography. Prof. Tarek A. Fayed

Inferential Analysis with NIR and Chemometrics

Online Reaction Monitoring of In-Process Manufacturing Samples by UPLC

Introduction to Fourier Transform Infrared Spectroscopy

Analysis of NIR spectroscopy data for blend uniformity using semi-nonnegative Matrix Factorization

ABASTRACT. Determination of nitrate content and ascorbic acid in intact pineapple by Vis-NIR spectroscopy Student. Thesis

CSE 417T: Introduction to Machine Learning. Final Review. Henry Chai 12/4/18

STANDARD OPERATING PROCEDURES

Development of Near Infrared Spectroscopy for Rapid Quality Assessment of Red Ginseng

ActiPix SDI300 Surface Dissolution Imaging System

In Silico Spectra Lab. Slide 1

NIR Chemical Imaging as a Process Analytical Tool

Optical In-line Control of Web Coating Processes with UV-VIS-NIR Spectroscopy. AIMCAL Europe Web Coating Conference Peter Lamparter

Agilent Cary 630 FTIR Spectrometer Supporting Organic Synthesis in Academic Teaching Labs

Advanced analysis and modelling tools for spatial environmental data. Case study: indoor radon data in Switzerland

Whole Tablet Measurements Using the Spectrum One NTS Tablet Autosampler System

DAMAGE ASSESSMENT OF REINFORCED CONCRETE USING ULTRASONIC WAVE PROPAGATION AND PATTERN RECOGNITION

BIOLIGHT STUDIO IN ROUTINE UV/VIS SPECTROSCOPY

Application Note. The Optimization of Injection Molding Processes Using Design of Experiments

Plop Plop, Fizz Fizz, Oh What A Relief It Is (Which Pain Reliever Works Fastest)

UV Spectroscopy Determination of Aqueous Lead and Copper Ions in Water

Transcription:

Method Transfer across Multiple MicroNIR Spectrometers for Raw Material Identification Raw material identification or verification (of the packaging label) is a common quality-control practice. In the pharmaceutical industry, the increasing global footprint of the supply chain and public health concerns 1 resulting from contaminated materials (or mislabeling) have driven many regulatory bodies to require inspection of every barrel in every shipment of materials used in the manufacture of pharmaceutical drugs. Handheld analytical instruments are becoming more common for rapid and non-destructive testing of incoming raw materials directly in warehouses, reducing the cost and time needed for taking samples to the laboratory and quarantining incoming shipments until lab testing is complete. The MicroNIR spectrometer is a near-infrared (NIR) handheld spectrometer tethered to a smart tablet and equipped with predictive modeling software 2 that identifies pharmaceutical raw materials. Advantages of the MicroNIR spectrometer relative to Raman and Fourier-transform infrared (FTIR) spectroscopy include: Figure 1. MicroNIR spectrometer tethered to a rugged 7 Windows 8.1 tablet; the MicroNIR is equipped with a vial holder in this image NIR spectroscopy does not suffer from the fluorescence problem encountered in Raman spectroscopy Measurement time with MicroNIR is very fast (<1 second) MicroNIR measures through plastic or glass containers Light sources are eye safe This application note reports on a study conducted to demonstrate how the MicroNIR 1700 spectrometer 2-3 correctly classifies pharmaceutical raw materials and enables transferring the method from one or more master instruments to multiple target instruments. The study compared different classification algorithms to assess the number of master instruments needed to achieve 100% accurate prediction. In addition to adopting the preferred classification algorithm, successfully transferring methods to target instruments depends highly on instruments that are consistent in performance spectrally, optically, and physically 3. MicroNIR spectrometer technology, when coupled with common chemometric analysis techniques, is highly suitable for classification applications in high-volume applications. /go/micronir Application Note

Methodology and Experimental Procedure The study was conducted in two phases. Phase I consisted of the scanning of 19 of the most commonly used pharmaceutical excipients and active ingredients to build a classification library. Phase II aimed at providing a robust model that could be transferred to a new, untrained spectrometer. Spectra of all 19 compounds were collected from six instruments with cross-instrument validations using soft independent modeling of class analogies (SIMCA), partial least squares discriminant analysis (PLS-DA), and support vector machine (SVM) classification algorithms. Additionally, data pretreatment algorithms to find out the minimal number of spectrometers needed and the best combinations (and sequences) of pretreatment schemes for achieving optimal prediction performance were also studied. The spectra of 19 pharmaceutical materials listed in Table 1 were collected in laboratory ambient conditions. All samples were presented to the spectrometer housed in 4 mm borosilicate glass vials with measurements performed through the bottom of the vials. A vial holder that slips over the end of the spectrometer was used by the operator to easily introduce and remove samples and also maintain an optimal 3 mm distance between the material and Figure 2. Experimental setup the face of the spectrometer. Figure 2 shows a picture of the vial holder in use. The vial was rotated approximately 10 to 15 degrees after every scan. Each scan had an integration time of 10 ms with spectrum averaged over 50 collections. Each material was scanned a minimum of five times. Table 1. 19 pharmaceutical compounds used in the study Material ID CAS Number Short Names Acetaminophen 103-90-2 Acetaminophen Ascorbic acid, L-ascorbic acid 50-81-7 Ascorbic acid Aspirin 50-78-2 Aspirin Benzocaine (ethyl-4-aminobenzoate) 94-09-7 Benzocaine Caffeine 58-08-2 Caffeine Cellulose 9004-34-6 Cellulose Corn Starch 9005-25-8 Corn starch Fructose, D-( )-fructose 57-48-7 Fructose HPC, hydroxypropyl cellulose 9004-64-2 HPC (Hydroxypropyl) methyl cellulose (HPMC) 9004-65-3 HPMC Ibuprofen 15687-27-1 Ibuprofen Lactose 63-42-3 Lactose Magnesium stearate 557-04-0 Mg-stearate Poly (ethylene oxide) (PEO) powder 25322-68-3 PEO PVP, polyvidone, polyvinylpyrrolidone, povidone 9003-39-8 PVP Polysorbate 80 9005-65-6 Polysorbate80 SSG, sodium starch glycolate, Explotab 9063-38-1 SSG Talc 14807-96-6 Talc Titanium dioxide, titanium(iv) oxide 13463-67-7 TiO2 2

All measurements were collected using MicroNIR Pro spectrometer software version 2.0 (Figure 3). A reference measurement was performed on the MicroNIR approximately 15 minutes after the lamps were turned on and every hour thereafter while performing scans. A 99% diffuse reflectance panel was used for the 100% reference value and the 0% reference value was taken by leaving the tungsten lamps on with an empty vial holder. This scenario was used to account for any scattered light from the sample vial holder. Figure 3. The MicroNIR Pro software home page Most of the 19 pharmaceutical compounds were ordered from 2-3 different vendors to increase the coverage of samples manufacture sources variations. Each compound from each manufacture source was scanned at three different dates and ambient temperatures. To study the models transfer performance among different spectrometers, six spectrometers collected data. After spectra collection, data was imported into an embedded version of Unscrambler X software version 10.3 in the MicroNIR Pro software for spectral analysis and calibration model development. All spectra were pretreated using a Savitzky-Golay first derivative followed by standard normal variate (SNV), also performed within the MicroNIR Pro software. Figure 4 is a plot of the first derivate of the spectra of the 19 materials tested. Figure 4. Savitzky-Golay first derivative and SNV pre-treated spectral data across 950 1650 nm 3

Chemometric Library Development Chemometric libraries are predictive and data-driven statistical models that use chemometric and machine-learning principles to extract key features and underlying relationships between spectra and their respective known identities (training sets). The libraries are used later to predict the same identities of unknown samples (test sets). In this study, we investigated the building of such models for pharmaceutical raw materials identification applications with spectra data from the MicroNIR 1700 instruments. We used three types of classification algorithms characterized by specific principles: SIMCA 4 a principal component model represents each class in the data set PLS-DA 5 X- and Y-scores are chosen so that it will seek directions in the factor space that are associated with high variations in responses but biasing them toward directions that are accurately predicted SVM 6 Class boundaries based on maximizing the separation margin between classes: samples at the borders of each class play a major role Table 2. Pros and cons for each classifier Classifier Pros Cons SIMCA Easy to understand, implementation is simple Difficult to optimize and maintain parameters for each class Cannot generalize well for heterogeneous structured datasets PLS-DA Can calculate each variable s contribution Can run into overfit issues for complicated datasets SVM Robust and good generalization capabilities Implementation is more complicated but commercial software packages such as CAMO s The Unscrambler offers this capability Cannot calculate each variable s contribution Results and Discussion Figure 5 is the principle component analysis (PCA) score plot showing PC-1 plotted against PC-2. The PCA scores plot provides an initial indication of spectral differentiation in the calibration set. Chemically-similar materials tend to cluster closer to each other versus chemically different materials. Figure 5. PCA scores plot, PC-1 vs. PC-2 4

Figure 6 shows the partitions of classes in SVM models by the PCA-SVM plot, where the x-axis and y-axis represent PC-1 and PC-2 scores and the class decision boundaries were generated by SVM calculations. This gives an overall picture of the relative positions of each class and their class boundaries. Figure 6. PCA-SVM plot for 19 pharmaceutical compounds based on SVM models Model Building Times Different classification algorithms take different amounts of time to build training models. Model building time in offline learning cases may not be crucial but if it takes too long to build, it will be difficult to study and optimize modeling processes. One key factor in deciding model building time is the size of the training set (number of samples in the training set). Figure 7 compares a model s building time vs. the size of the training set for the three types of classification algorithm we studied here. Both PLS-DA and SIMCA require optimization of their model parameters (number of PLS factors for PLS-DA and number of principal components for each class in SIMCA) through a training set cross-validation process while for SVM there is no need for cross validation. As shown in the figure, it will take about 10,000 times longer to train PLS-DA vs. SVM and roughly 100 times longer to train SIMCA vs. SVM. It was also estimated that PLS-DA model building times increased with a power of about 1.9 vs. sample size and SIMCA increased with a power of about 1.5. With SVM, model building times increased almost linearly with training set sample sizes. Model building time (sec) Number of samples in training set Figure 7. Model building times for three kinds of classification algorithms vs. sizes of training sets In this study, and for reference, with two instruments used for the training set, the model building time was about 1 hour, 100 sec, and 1 sec for PLS-DA, SIMCA, and SVM respectively. 5

Building Robust Models With point-of-use NIR applications, it is possible that hundreds of spectrometers will be deployed in the field. It is almost impossible to train each spectrometer with the specific dataset of materials it will analyze. The practical approach is to collect a few complete sets of spectra covering all variances from chemicals, environmental factors (temperatures, humidity), and user variations (path length, for example) from a couple of master spectrometers. These models are then transferred to all other target or production spectrometers. In this study, we collected six libraries of spectral data from six different MicroNIR spectrometers and determined how many spectrometers would be needed for building a robust model that will be successful when applied to spectrometers that were not part of the training set. When constructing these kinds of consolidated libraries (based on spectra collected from more than one spectrometer), a sample selection scheme can drastically reduce the size of these training sets. This means less storage space and quicker model building times. The study used the Kennard-Stone 7 sample selection scheme to select the most representative samples. Table 3 shows results where ntux (x=1 to 6) denotes how many spectrometers were used to build training models and the remaining spectrometers data was used as test set. For example, ntu2 shows average prediction results when any two spectrometers were used to build models and the remaining four spectrometers data served as a test set. In the case of ntu6, since there are only six spectrometers, the training set consisted of samples chosen by Kennard-Stone algorithm and the remaining samples served as a test set. Also shown in the table is the number of spectrometers needed to achieve 100% classification prediction. Based on this analysis, for SVM it would only need one spectrometer s data to achieve perfect prediction while for PLS-DA and SIMCA, it would take all six spectrometers data to build models to reach 100%. However, even with PLS-DA and SIMCA, accurate prediction levels of >98.8% are achieved with only two instruments for training. Table 3. Number of spectrometers needed to build robust models Classification ntu1 ntu2 ntu3 ntu4 ntu5 ntu6 # Units for 100% SIMCA 97.395 98.851 99.496 99.808 99.961 100 6 PLS-DA 99.664 99.68 99.904 99.961 99.99 100 6 SVM 100 100 100 100 100 100 1 SVM is the preferred model for classifying raw materials. The model has been tested on over 20 different MicroNIR spectrometers manufactured over different lots. 100% successful prediction has been achieved every time. Conclusion In this study, we showed that the MicroNIR spectrometer is effective for use in pharmaceutical raw materials identification. Three machine learning pattern classification model-building algorithms were tested: SIMCA, PLS-DA, and SVM. All delivered nearperfect identification success rates, with increasingly better performance from SIMCA to PLS-DA to SVM and correspondingly lower model building times. Model building and transfer with MicroNIR Pro software requires minimal time and performs with high reliability. Furthermore, model transfer from a master instrument to a number of other instruments is easily achievable due to the high reproducibility of the MicroNIR instruments in production. 6

References [1] http://www.pewtrusts.org/en/research-and-analysis/issue-briefs/2012/05/16/heparin-a-wakeup-call-on-risks-to-the-usdrug-supply [2] O Brien, N., Hulse, C., Friedrich, D., Van Milligen, F., von Gunten, M., Pfeifer, F., Siesler, H., Miniature Near-Infrared (NIR) Spectrometer Engine For Handheld Applications. Proc. SPIE, Ed. M. Druy, and R. Crocombe, 8374, p 837404-1-8 (2012). [3] Friedrich, D., Hulse, C., von Gunten, M., Williamson, E., Pederson, C., O Brien, N., Miniature near-infrared spectrometer for pointof-use chemical analysis. Proc. SPIE, Ed. Y. Soskind and G. Olson, 8992 (2014). [4] Wold, Svante, and Sjostrom, Michael, 1977, SIMCA: A method for analyzing chemical data in terms of similarity and analogy, in Kowalski, B.R., ed., Chemometrics Theory and Application, American Chemical Society Symposium Series 52, Wash., D.C., American Chemical Society, p. 243-282. [5] Wold, S; Sjöström, M.; Eriksson, L. (2001). PLS-regression: a basic tool of chemometrics. Chemometrics and Intelligent Laboratory Systems 58 (2): 109 130. [6] Cortes, C.; Vapnik, V. (1995). Support-vector networks. Machine Learning 20 (3): 273. [7] R. W. Kennard and L. A. Stone (1969): Computer Aided Design of Experiments, Technometrics, 11:1, 137-148. 7

Americas Toll Free: +1 800 254-3684 Europe Tel: +33 1 30 81 50 41 Asia Pacific Tel: +86 512 6956 7895 E-mail ospcustomerservice@jdsu.com 2014 JDS Uniphase Corporation Product specifications and descriptions in this document are subject to change without notice. 30176041 000 1214 MICRONIRRAWMATERIALS.AN.CO.AE December 2014