Los Alamos NATIONAL LABORATORY ' S. SI-Combiner: Making Sense of Results from Multiple Trained Programs

Similar documents
Bobi K. Den Hartog, ESA-EPE John W. Elling, BioReason, Inc Roger M. Kieckhafer, University of Nebraska

A Few-Group Delayed Neutron Model Based on a Consistent Set of Decay Constants. Joann M. Campbell Gregory D. Spriggs

CQNl_" RESPONSE TO 100% INTERNAL QUANTUM EFFICIENCY SILICON PHOTODIODES TO LOW ENERGY ELECTRONS AND IONS

Los Alamos IMPROVED INTRA-SPECIES COLLISION MODELS FOR PIC SIMULATIONS. Michael E. Jones, XPA Don S. Lemons, XPA & Bethel College Dan Winske, XPA

SPHERICAL COMPRESSION OF A MAGNETIC FIELD. C. M. Fowler DISCLAIMER

Walter P. Lysenko John D. Gilpatrick Martin E. Schulze LINAC '98 XIX INTERNATIONAL CONFERENCE CHICAGO, IL AUGUST 23-28,1998

0STI. E. Hammerberg, XNH MD SIMULATIONS OF DUSTY PLASMA CRYSTAL FORMATION: PRELIMINARY RESULTS M. J. S. Murillo, XPA

TOWARDS EVEN AND ODD SQUEEZED NUMBER STATES NIETO, MICHAEL M

Alamos National Laboratory is operated by the University of California for the United States Department of Energy under contract W-7405-ENG-36

LA-UR- Title: Author(s): Submitted to: Approved for public release; distribution is unlimited.

Hyperon Particle Physics at JHF. R. E. Mischke

Further Investigation of Spectral Temperature Feedbacks. Drew E. Kornreich TSA-7, M S F609

ION EXCHANGE SEPARATION OF PLUTONIUM AND GALLIUM (1) Resource and Inventory Requirements, (2) Waste, Emissions, and Effluent, and (3) Facility Size

Turbulent Scaling in Fluids. Robert Ecke, MST-10 Ning Li, MST-10 Shi-Yi Li, T-13 Yuanming Liu, MST-10

Ray M. Stringfield Robert J. Kares J. R. Thomas, Jr.

Los Alamos. Nova 2. Solutions of the Noh Problem for Various Equations of State Using Lie Groups LA-13497

Data Comparisons Y-12 West Tower Data

Surrogate Guderley Test Problem Definition

GA A26057 DEMONSTRATION OF ITER OPERATIONAL SCENARIOS ON DIII-D

~ _- IDOCLRXKT DMSP SATELLITE DETECTIONS OF GAMMA-RAY BURSTS. J. Terrell, NIS-2 P. Lee, NIS-2 R W.

Stability of Nuclear Forces Versus Weapons of Mass Destruction. Gregory H. Canavan

Three-Dimensional Silicon Photonic Crystals

LA-UR Approved for public release; distribution is unlimited.

MULTIGROUP BOLTZMANN FOKKER PLANCK ELECTRON-PHOTON TRANSPORT CAPABILITY IN M C N P ~ ~ DISCLAIMER

Los Alamos. OF nm LA-"'- Title: HYBRID KED/XRF MEASUREMENT OF MINOR ACTINIDES IN REPROCESSING PLANTS. S. T. Hsue and M. L.

DE '! N0V ?

(4) How do you develop an optimal signal detection technique from the knowledge of

Gordon G. Parker Mechanical Engineering Dept., Michigan Institute of Technology Houghton, MI

Plasma Response Control Using Advanced Feedback Techniques

Multi-Scale Chemical Process Modeling with Bayesian Nonparametric Regression

David C. Moir Charles M. Snell Michael Kang

Author(s). Richard C. Elphic, NIS-1

DETECTING AND QUANTIFYING LEWISITE DEGRADATION PRODUCTS I N ENVIRONMENTAL SAMPLES USING ARSENIC SPECIATION*

Si, X. X. Xi, and Q. X. JIA

Predicting On-Orbit SEU Rates

Summary Report: Working Group 5 on Electron Beam-Driven Plasma and Structure Based Acceleration Concepts

RUDOLPH J. HENNINGER, XHM PAUL J. MAUDLIN, T-3 ERIC N. HARSTAD, T-3

Characterizing Size Dependence of Ceramic- Fiber Strength Using Modified Weibull Distribution

N. Mattor T. J. Williams D. W. Hewett A. M. Dimits. IMACS 97 August 24-29, 1997 Berlin, Germany

Los Alamos OSTI MAY Title: On Lamb Wave Propagation from Small Surface Explosions in the Atmospheric Boundary Layer

Comments on the Geophysics paper Multiparameter 11norm. waveform fitting: Interpretation of Gulf of Mexico reflection

Roger J. Bartlett, P-22 Dane V. Morgan, P-22 M. Sagurton, EG&G J. A. Samson, University of Nebraska

Use of AVHRR-Derived Spectral Reflectances to Estimate Surface Albedo across the Southern Great Plains Cloud and Radiation Testbed (CART) Site

PROJECT PROGRESS REPORT (03/lfi?lfibr-~/15/1998):

Los Alamos NATIONAL LABORATORY LA-UR Double Shock Initiation of the HMX Based Explosive EDC-37

Development of a High Intensity EBIT for Basic and Applied Science

M. S. Krick D. G. Langner J. E. Stewart.

IMAGING OF HETEROGENEOUS MATERIALS BY. P. Staples, T. H. Prettyman, and J. Lestone

Overview of Strangeness Nuclear Physics. Benjamin F. Gibson, T-5

Microscale Chemistry Technology Exchange at Argonne National Laboratory - East

c*7 U336! On Satellite Constellation Selection LA MS ^«fc ^Xiyy PLEASE RETURN TO:

ust/ aphysics Division, Argonne National Laboratory, Argonne, Illinois 60439, USA

Neutronic Simulations for Source-Instrument Matching at the Lujan. The neutronics design of pulsed spallation neutron sources can be difficult when

Application of Spectral Summing to Suspect Low Level Debris Drums at Los Alamos National Laboratory

Manifestations of the axial anomaly in finite temperature QCD

Start-up Noise in 3-D Self-AmpMed

Solid Phase Microextraction Analysis of B83 SLTs and Core B Compatibility Test Units

National Accelerator Laboratory

Simulation of Double-Null Divertor Plasmas with the UEDGE Code

August 3,1999. Stiffness and Strength Properties for Basic Sandwich Material Core Types UCRL-JC B. Kim, R.M. Christensen.

K.M. Hoffmann and D. C. Seely

Analysis of Shane Telescope Aberration and After Collimation

A Framework for Modeling and Optimizing Dynamic Systems Under Uncertainty

The photoneutron yield predictions by PICA and comparison with the measurements

: Y/LB-16,056 OAK RIDGE Y-12 PLANT

Barry M. Lesht. Environmental Research Division Argonne National Laboratory Argonne, Illinois 60439

Variational Nodal PerturbationCalculations Using Simplified Spherical HarmonicsO

8STATISTICAL ANALYSIS OF HIGH EXPLOSIVE DETONATION DATA. Beckman, Fernandez, Ramsay, and Wendelberger DRAFT 5/10/98 1.

RESONANCE INTERFERENCE AND ABSOLUTE CROSS SECTIONS IN NEAR-THRESHOLD ELECTRON-IMPACT EXCITATION OF MULTICHARGED IONS

Los Alamos, New Mexico 87545

SENSOR FUSION AND NONLINEAR P R E D I C T I O N FOR ANOMALOUS EVENT DETECTION. Jose V. Hernandez, NIS-1 Kurt R. Moore, NIS-1 Richard C.

Results of the LSND Search for. Fred J. Federspid. Electroweak Interactions and Unified Theories Conference. Les Arcs, France March 16-23,1996

12/16/95-3/15/96 PERIOD MULTI-PARAMETER ON-LINE COAL BULK ANALYSIS. 2, 1. Thermal Neutron Flux in Coal: New Coal Container Geometry

Synthesis of Methyl Methacrylate from Coal-derived Syngas

Lawrence Berkeley National Laboratory Lawrence Berkeley National Laboratory

Causes of Color in Minerals and Gemstones

James Theiler, NIS-2 Jeff Bloch, NIS-2

A Distributed Radiator, Heavy Ion Driven Inertial Confinement Fusion Target with Realistic, Multibeam Illumination Geometry

DML and Foil Measurements of ETA Beam Radius

Scaling between K+ and proton production in nucleus-nucleus collisions *

Undulator Interruption in

Measurement of Low Levels of Alpha in 99M0Product Solutions

APPLICATION SINGLE ION ACTIVITY COEFFICIENTS TO DETERMINE SOLVENT EXTRACTION MECHANISM FOR COMPONENTS OF NUCLEAR WASTE

LLNL DATA COLLECTION DURING NOAAETL COPE EXPERIMENT

m w n? r i OF,THISDOCUMENT IS UNLIMITED

GA A27235 EULERIAN SIMULATIONS OF NEOCLASSICAL FLOWS AND TRANSPORT IN THE TOKAMAK PLASMA EDGE AND OUTER CORE

Nearest Neighbor Rules PAC-Approximate Feedforward Networks t

Contractor Name and Address: Oxy USA, Inc. (Oxy), Midland, Texas OBJECTIVES

Paper presented at the 7th International Conference on Ion Source Catania, Italy. 4 Oak Ridge National Laboratory Oak Ridge, Tennessee

Cosmological Gamma-Ray Bursts. Edward Fenimore, NIS-2 Richard Epstein, NIS-2 Cheng Ho, NIS-2 Johannes Intzand, NIS-2

Constant of Motion for a One- Dimensional and nth-order Autonomous System and Its Relation to the Lagrangian and Hamiltonian

Reasoning with Uncertainty

Statistical Learning Reading Assignments

N. Tsoupas, E. Rodger, J. Claus, H.W. Foelsche, and P. Wanderer Brookhaven National Laboratory Associated Universities, Inc. Upton, New York 11973

0 s Los Alamos,New Mexico 87545

H. Kamada, A. Nogga, and W. Glockle, Institute fur Theoretische Physik 11, Ruhr-Universitat Bochum, D-44780

Performance Comparison of K-Means and Expectation Maximization with Gaussian Mixture Models for Clustering EE6540 Final Project

DS ALAMOS SCIENTIFIC LABORATORY

Algorithmisches Lernen/Machine Learning

by KAPL, Inc. a Lockheed Martin company KAPL-P EFFECTS OF STATOR AND ROTOR CORE OVALITY ON INDUCTION MACHINE BEHAVIOR September 1996 NOTICE

Transcription:

8 LA-UR- '9 8 3 4 5 S 0 A proved forpublic release; dtribution is unlimited. Title: Author(s): SI-Combiner: Making Sense of Results from Multiple Trained Programs Bobi K. Den Hartog, ESA-EPE John W. Elling, ESA-EPE Roger M. Kieckhafer, University of Nebraska Submitted to: Proceedings of the Fourth Joint Conference on Information Sciences, Research Triangle Park, N.C. October 23-28 1998 Los Alamos NATIONAL LABORATORY Los Alamos National Laboratory, an affirmative action/equal opportunity employer, is operated by the University of California for the US. Department of Energy under contract W-7405-ENG-36. By acceptance of this article, the publisher recognizes that the U.S. Government retains a nonexclusive, royalty-free license to publish or reproduce the published form of this contribution, or to allow others to do so,for US. Government purposes. Los Alamos National Laboratory requests that the publisher identify this article as work performed under the auspices of the US. Department of Energy. The Los Alamos National Laboratorystrongly supports academic freedom and a researcher's right to publish; as an institution, however, the Laboratory does not endorse the viewpoint of a publication or guarantee its technical correctness. Form 836 (10/96)

DISCLAIMER This report was prepared as an account of work sponsod by an agency of the United States Covernmtnt Neither the United States Governmeat nor my agency thereof, nor any of their employees. makes my wmanty, exprrsr or imphi, or 8sSumes ally ICgaI liabiiity or rrtponsity for the acatmcy, completul~or pltfulncss of any infomation, apparatus, product, or disci& or rrprrsenu that its usc wouid not infringe privately owned rights R e f c r ~ c eba& to any spec i f i c ~ o m m pmduct. d proass, or semice by wade name, uldanulq inanufacturn, or otherwise does not nec#zarily constitute or imply its cndorrewnt.rccommeadation, or favoring by the United States Gavernmmt or my agency thmof. lke vim and opinions of authors exprcsted htrrin Q not naxssady state or reflect those of the United States Govanmtm or any agency thereof..

DISCLAIMER Portions of this document may be illegible in electronic image products. Images are produced from the best available original document.

. The SI-Combiner: Making Sense of Results from Multiple Trained Programs Bobi K. Den Hartog and John W. Elling, Engineering Sciences and Applications, Energy Processes and Engineering, Mail Stop J580,Los Alamos National Laboratory, Los Alamos, N.M. 87545 Roger Kieckhafer, University of Nebraska Abstract Many problems, such as Aroclor Interpretation, are ill-conditioned problems in which trained programs, or methods, must operate in scenarios outside their training ranges because it is intractable to train them completely. Consequently,they fail in ways related to the scenarios. Importantly, when multiple trained methou3 fail divergently, their patterns of failures provide insights into the true results. The SI-Combiner solves this problem of Integrating Multiple Learned Models (IMLM) by automatically learning and using these insights to produce a solution more accurate than any single trained program. In application, the Aroclor Interpretation SI-Combiner improved on the accuracy of the most accurate individual trained program in the suite. This paper presents a new fuzzy IMLM method called the SI-Combiner and its application to Aroclor Interpretation. Additionally, this paper shows the improvement in accuracy that the SICombiner s components show against Multicategory Classif cation (MCC),DempsterShafer (DS), and the best individual trained program in the Aroclor Interpretation suite (imlr). Method The SI-Combiner uses a suite of trained programs, or methods, which must operate in conditions that are outside of their training domains and in which they may fail. The SICombiner incorporates application-specific knowledge, called scenarios, about conditions in the testing domains that can distort the inputs and thereby cause divergent behavior among the methods. Since scenarios are defined as signal conditions, during periodic retraining the method training set (which does not cover the testing domain) can be synthetically extended and distorted to provide training samples throughout the entire testing domain. From the extended training set, the SICombiner learns two types of information: Scenario Identijication (SI) rules, which capture how patterns of method opinions give information about a sample s scenario, and Method Weighting rules, which capture for each scenario, the accuracy of a method s opinion. Armed with thesi rules and the learned biases in the Method Weighting rules, the SI-Combiner first analyzes a test sample and assigns values between zero and one for each possible scenario for the sample. Higher numbers indicate higher likelihood that the sample belongs to that scenario. This SI step is key and gives the SICombiner its name. Finally, the SI-Combiner blends the scenariobased accuracy information, the method opinions and the scenario information producing final results, confidences, context information and explanations. Application Chemistry Certain chemical compounds named Aroclor 1242TM, Aroclor1254 and Aroclor I260 have become environmental concerns as carcinogens. Detecting and quantifying Aroclors in chromatograms Aroclor Interpretation examines a chromatogram (such as shown in Figure 1) and determines the presence or absence of each Aroclor. For each Aroclor present, the concentration in micrograms per milliliter (ug/ml) is measured. Each peak represents a separate component. The x-axis value for a peak reveals the component s identity, while the y-axis value indicates the concentration. The y-axis value, however, has no units: interpretation of a test chromatogram with unknown content must be based on comparisons with standard chromatograms (or standards) with known Aroclor concentrations. I Aroclor is a trademark of Monsanto Corporation.

The Training Domain For Aroclor Interpretation Before processing a batch of tests, operators produce a set of five standard chromatograms with standard concentrations for each of the three Aroclors to create fifteen standards. By comparing the chromatogram of a test to the fifteen standard chromatograms, a chemist or an automated pattern recognition method estimates the concentration or quantity of an Aroclor believed present. These single, pure Aroclors constitute the training domain. The Training Domain Cannot Cover The Testing Domain When the automated method or the chemist encounters a sample with conditions like those in the training domain, its chromatogram can be accurately compared with the standards. However, many real-life samples contain multiple Aroclors, no Aroclors, aged or weathered Aroclors, and materials that are not Aroclors but appear in the chromatogram. The lower chromatogram in Figure 1 contains the same Aroclor content as the upper chromatogram, but the added presence of oil distorts the baseline and creates nonaroclor peaks. It is intractable to train automated methods or humans on all possible conditions encountered in the testing domain. However, they must work on samples with conditions outside their training domain. The SI-Combiner enables this by exploiting the variations in conditions through the testing domain. Automated Aroclor Interpretation The Aroclor Interpretation Data Interpretation Module (DIM)[ 11is shown in Figure 2. SI-Combiner Uses Opinions From Trained Methods Five automated pattern recognition methods were implemented in the Aroclor Interpretation DIM. Each method, like a chemist, receives fifteen standards for each retraining. Four methods produce quantity estimates for each Aroclor: Iterative Multiple Linear Regression (imlr) 0 Neural Network (NN) Principle Components Regression (PCR) Peak Area Analysis (PA) The fifth method produces a value between zero and one for each Aroclor reflecting its belief that the Aroclor is present: Neural Network for Identification (NNI) [2] Aroclor Interpretation DIM chromatogram ---, PA.1 NNI.1 SI-Combiner Scenario Identification Aroclor Quantification Aroclor Presence ~~ 1 f i M 1 Presence results ~ 1 f i M 1 Quuntifcation results Figure 2: Aroclor Interpretation DIM including the SI-Combiner. Each method produces an opinion for each of the three target Aroclors, so that the SI-Combiner receives an opinion set containing fifteen views of a sample. Scenarios Are Signal Conditions That Distort Method Opinions Methods disagree due to conditions in the signal. In Aroclor Interpretation, and in other problems, these conditions can be approximately identified. Scenario elements are 1. conditions in the method s inputs that affect the methods responses, which in turn, are 2. related to patterns of opinion sets. For Aroclor Interpretation, five scenario elements are used to partition the testing domain into scenarios. These elements include one contamination descriptor, three presence descriptors (none, single and multiple),and one overconcentration descriptor. The effect of

., 5 4 contamination is seen in the lower chromatogram of Figure 1. An opinion set is fifteen opinions, in a specified order, for a sample. Scenario changes cause observable changes in these patterns. Evidence of these observable changes in patterns of opinion sets, which are directly linked to changes in scenario, is shown in Figure 3. For simplicity, the opinion sets contain information for one Aroclor at a time, so fivetuples are plotted instead of fifteen-tuples. In these radar plots, each opinion is mapped on a separate spoke, creating a spatial view of the relationships among the opinions. The left plot of Figure 3 shows opinion sets for samples in the training domain, i.e., single uncontaminated Aroclors. The radial symmetry reflects the identical performances by the four quantification Methods for each test sample, which are accurate since they are operating in their training domain. MLR PA MLR NN PA NN Figure 3: Opinion set patterns vary according to scenario The plot on the right side of Figure 3 shows the opinions sets for samples with multiple pure Aroclors. The pattern for samples in this scenario shows distortion along the upper left axis. Each of the ten scenarios for Aroclor Interpretation has a pattern. These patterns are learned automatically and matched approximately by the SI step. Art@ciizlly Extend the SI-Combiner s Training Domain To learn how method accuracy depends on scenario, the SI-Combiner needs training data consisting of method opinions for samples in all scenarios. The specific definition of scenario requires that the method opinions vary according to the scenario that scenarios are due to conditions in the input signal. This allows the explicit manipulation of signals to synthesize artificial training data. The SI-Combiner artificially extends the Method training domain through the Synthetic Chromatogram Generator. The Synthetic Chromatogram Generator uses the fifteen genuine standards that constitute the Method Training Set and a Noise Engine to generate chromatograms in all scenarios. These chromatograms are representative; much of the testing domain is still not covered by the training samples. Automatically Learning SI And Method Weighting Rules Two types of rule sets are in the SI-Combiner: 1. Scenario Identification: fuzzy classification produces scenario weights, one per scenario, which measure the test sample s similarities to training examples in each scenario. The key to the SI-Combiner is the Scenario Identification (SI) step, as indicated in the technique s name. 2. Method-Weighting: For Aroclor Interpretation, there are two types of accuracy measured since two questions need to be answered. The first Method-Weightingrule set is the Aroclor Presence Detection (AP) rule set, which is used to answer the question Is the Aroclor Present?. The second MethodWeighting rule set is the Aroclor Quantification (AQ) rule set, which characterizes each quantitative method s bias in each scenario, and is used to answer What quantity of Aroclor is present? Approximately Match Opinion Patterns from Test Samples to Training Samples The SI-Combiner exploits very sparse and representative training data, through fuzzy logic, by approximately matching an SI rule. Every testing sample will match every rule to some degree, and a rule s firing weight clearly reflects how closely the test sample resembles a sample seen in the training set. As shown in Figure 2, the SI-Combiner works in steps. First, SI produces fuzzy belief weights for each scenario by approximately matching the SI rules. Weight Method Opinions By Scenario Likelihood And Method Accuracy The SI-Combiner s second step, AP, classifies each of three Aroclors as present or absent in a sample. The third step, AQ, produces quantitative values for the concentration of each Aroclor in a sample. AP and AQ use automatically learned empirical biases for each of the Methods in each scenario. Through fuzzy logic, AP and AQ combine scenario weights, automatically learned biases for each of the Methods in each scenario, and Methods results to determine results for a

sample. AP and AQ use well-established techniques internally; their novel features are the use of SI S scenario weights, and the nature of the bias estimations learned by each step. Results The SI-Combiner improves on two existing approaches to IMLM, which combine either multiple experts each viewing aportion of the problem, or 0 selection of one expert s view of the entire problem. The SI-Combiner, in contrast, blends multiple experts, each viewing the whole problem, according to likely accuracies for a given sample. The SI-Combiner uses: all available information biases found through empirical learning application-specific knowledge to predict and exploit variations in the scenarios Components of the SI-Combiner Figure 4 summarizes these results. Scenario ldentwcalion class contarmnatm COmMner YP YulUcateguy UaodRcapion (Mot) cambner I ~ M Mw: S bv NMle EW-lram sxie +MccmsdaulR.dmpm*rar hs - c w r * d d d e s Amelor P Detectian: m Em7 T W False POsllNe FalseNea;dwe Aroclu QwadkaUon: MLR AveraSW PeakArea 15% 1% 55% 55% 6% commnervs.dempdashater(dsi ~ r n ~ r n D S W comknerm p w e s on ktaj fednvque by Figure 4: Summary of SI-Combiner improvements 58% 76% 14% 920/. 95% For SI, the SI-Combiner was compared to Multicategory Classification (MCC)[3]. The SICombiner handled the nonlinear, poorly separated space far better than MCC did. Furthermore, as a fuzzy classifier, the SI-Combiner accrues evidence of a test sample s membership in all scenarios, which is passed on without loss to the SI-Combiner s method-weighting steps. SI rule firing weights are available for explanations, accountability and tracability. For AP, the SI-Combiner was compared to Dempster-Shafer (DS)[4]. As summarized in Figure 4, the SI-Combiner produced fewer errors than DS did, and referred fewer samples with indeterminate results to the operator. Furthermore, the SI-Combiner uses empirical biases; DS uses errors reported by the Methods, which, like opinions, are not guaranteed accurate for samples outside the training domain. For AQ, the SI-Combiner greatly outperformed a scenario-ignorant combining method of Averaging and the human representative Method of PA. The SI-Combiner improved on the best individual method, imlr. Conclusion The SI-Combiner uses a suite of trained methods, which must operate in conditions that are outside of their training domains and in which they may fail. The SI-Combiner incorporates application-specific knowledge, called scenarios, about conditions in the testing domains that can distort the inputs and thereby cause divergent behavior among the methods. This problem is a version of Integrating Multiple Learned Models (IMLM). The SI-Combiner improves on existing approaches to IMLM, and its components perform better than traditional approaches to the same tasks. The SI-Combiner is a new and powerful solution for IMLM problems. Acknowledgments This work is supported in part by the U.S. Department of Energy under the Defense Programs Technology Transfer Initiative Program. Portions of this work were performed under a Cooperative Research and Development Agreement with Varian Chromatography Systems Business, Lockheed Missiles and Space Company, and Neuralware, Inc. Bibliography 1J.W.Elling, S.M.Mniszewslu, J.D.Zahrt, L.N.Klatt. Automated Chromatographic Data Interpretation Using an Expert System. Journal of Chromatographic Science, Vol. 32 No. 6. 2 G.R.Magelssen, J.W.Elling ChromatographyPattern Recognition using Iterative Probabilistic Neural Networks, Journal of Chromatography A, to be published. 3 J.T.Tou, R.C.Gonzalez. Pattern Recognition Principles, Addison-Wesley 1974. 4 L.A.Klein, Sensor and Data Fusion Concepts and Applications, SPIE Optical Engineering Press, 1993