REGIONAL TRAINING COURSE ON METHODS AND TOOLS TO IDENTIFY SOURCES OF AIR POLLUTION

Similar documents
GAMINGRE 8/1/ of 7

Technical note on seasonal adjustment for M0

Saharan Dust Impact on PM2.5 in central Italy

Technical note on seasonal adjustment for Capital goods imports

Annual Average NYMEX Strip Comparison 7/03/2017

Data report for elemental analysis of IMPROVE samples collected during April, May, June 2009 UC Davis Submitted June 18, 2010 SUMMARY

Data report for elemental analysis of IMPROVE samples collected during OCTOBER, NOVEMBER, DECEMBER 2005 UC Davis SUMMARY

Microsoft Excel Directions

Summary of Seasonal Normal Review Investigations CWV Review

Data report for elemental analysis of IMPROVE samples collected during April, May, June 2006 UC Davis - Submitted May 14, 2008 SUMMARY

Computing & Telecommunications Services

CWV Review London Weather Station Move

Computing & Telecommunications Services Monthly Report January CaTS Help Desk. Wright State University (937)

Half Cell / redox potentials. Context. Task. Evaluation

DOCUMENT HISTORY. Initials Section/s Modified Brief Description of Modifications

Unit 1 Part 2 Atomic Structure and The Periodic Table Introduction to the Periodic Table UNIT 1 ATOMIC STRUCTURE AND THE PERIODIC TABLE

How to use the guidance tool (Producing Guidance and Verification)

Element Cube Project (x2)

John G. Watson Judith C. Chow Mark C. Green Xiaoliang Wang Desert Research Institute, Reno, NV

Who is polluting the Columbia River Gorge?

The Periodic Table. Periodic Properties. Can you explain this graph? Valence Electrons. Valence Electrons. Paramagnetism

Appendix BAL Baltimore, Maryland 2003 Annual Report on Freeway Mobility and Reliability

Your World is not Red or Green. Good Practice in Data Display and Dashboard Design

Worldwide Open Proficiency Test for X Ray Fluorescence Laboratories PTXRFIAEA13. Determination of Major, Minor and Trace Elements in a Clay Sample

Mr. XYZ. Stock Market Trading and Investment Astrology Report. Report Duration: 12 months. Type: Both Stocks and Option. Date: Apr 12, 2011

Chemistry 1 First Lecture Exam Fall Abbasi Khajo Levine Mathias Mathias/Ortiz Metlitsky Rahi Sanchez-Delgado Vasserman

Day Length Web Activity

The European Activation File: EAF-2005 decay data library

Section II: Assessing Chart Performance. (Jim Benneyan)

State of Passive Microwave Polar Stereographic Products. Walt Meier and polar stereographic product team

High-resolution study of Gamow- Teller transitions in pf-shell nuclei. Tatsuya ADACHI

Climatography of the United States No

Suan Sunandha Rajabhat University

REPORT ON LABOUR FORECASTING FOR CONSTRUCTION

The science of learning from data.

BROOKLYN COLLEGE Department of Chemistry. Chemistry 1 Second Lecture Exam Nov. 27, Name Page 1 of 5

SYSTEM BRIEF DAILY SUMMARY

= observed volume on day l for bin j = base volume in jth bin, and = residual error, assumed independent with mean zero.

Chemistry 2 Exam Roane State Academic Festival. Name (print neatly) School

YACT (Yet Another Climate Tool)? The SPI Explorer

Ch. 9 NOTES ~ Chemical Bonding NOTE: Vocabulary terms are in boldface and underlined. Supporting details are in italics.

A look into the factor model black box Publication lags and the role of hard and soft data in forecasting GDP

Electron Configuration. The electron configuration of an atom tells us how the electrons are distributed among the various atomic orbitals.

Radiometric Dating (tap anywhere)

DESC Technical Workgroup. CWV Optimisation Production Phase Results. 17 th November 2014

The Agilent 7700x ICP-MS Advantage for Drinking Water Analysis

Chem Exam 1. September 26, Dr. Susan E. Bates. Name 9:00 OR 10:00

Lab Day and Time: Instructions. 1. Do not open the exam until you are told to start.

Site Description: Tower Site

SYSTEM BRIEF DAILY SUMMARY

Ions. How are ions made from neutral atoms? Is the number of protons the same in the atom and the ion?

Tracking Accuracy: An Essential Step to Improve Your Forecasting Process

2. Graphing Practice. Warm Up

2 (27) 3 (26) 4 (21) 5 (18) 6 (8) Total (200) Periodic Table

In Centre, Online Classroom Live and Online Classroom Programme Prices

Climatography of the United States No

Climatography of the United States No

Chemical mass closure of atmospheric aerosol collected over Athens, Greece.

The Source Finder (SoFi)

Where Was Mars At Your Birth?

Haze Communication using the CAMNET and IMPROVE Archives: Case Study at Acadia National Park

Accelerated Chemistry Pre-Course Assignment

1 of 5 14/10/ :21

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

The Cryosphere Radiative Effect in CESM. Justin Perket Mark Flanner CESM Polar Climate Working Group Meeting Wednesday June 19, 2013

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Chapter 3: Stoichiometry

Quantitative XRF Analysis. algorithms and their practical use

CWV Optimisation. TWG 12 th February 2014

CLASS TEST GRADE 11. PHYSICAL SCIENCES: CHEMISTRY Test 4: Matter and materials 1

Site Description: Tower Site

Guide to the Extended Step-Pyramid Periodic Table

Climatography of the United States No

Thermo Scientific icap RQ ICP-MS: Typical limits of detection

Climatography of the United States No

Climatography of the United States No

Jayalath Ekanayake Jonas Tappolet Harald Gall Abraham Bernstein. Time variance and defect prediction in software projects: additional figures

Climatography of the United States No

9/20/2017. Elements are Pure Substances that cannot be broken down into simpler substances by chemical change (contain Only One Type of Atom)

Time Series Analysis

CHEM 130 Exp. 8: Molecular Models

Instructions. 1. Do not open the exam until you are told to start.

Climatography of the United States No

Forecasting using R. Rob J Hyndman. 1.3 Seasonality and trends. Forecasting using R 1

FEB DASHBOARD FEB JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC

Made the FIRST periodic table

Metallurgical Chemistry. An Audio Course for Students

A. Element 1. The number of protons and neutrons of an atom.

02/05/09 Last 4 Digits of USC ID: Dr. Jessica Parr

Improve Forecasts: Use Defect Signals

Spin Cut-off Parameter of Nuclear Level Density and Effective Moment of Inertia

Climatography of the United States No

CHEM 171 EXAMINATION 1. October 9, Dr. Kimberly M. Broekemeier. NAME: Key

Transcription:

REGIONAL TRAINING COURSE ON METHODS AND TOOLS TO IDENTIFY SOURCES OF AIR POLLUTION IAEA RER 1013: Supporting Air Quality Management Source apportionment by Positive Matrix Factorization Marta Almeida Email: smarta@ctn.ist.utl.pt Instituto Superior Técnico Campus Tecnológico e Nuclear Sacavém, Portugal 02 06 June, 2014

Concept EPA Positive Matrix Factorization (PMF) a multivariate tool Objective: Identify number of factors p The species profile of each source f Amount of mass contributed by each factor to each individual sample - g Matrix of speciated sample data Matrix factor contributions Matrix factor profiles i SAMPLE Al As Br 09-01-2011 12:00 6.4609 0.0008 0.0108 12-01-2011 12:25 7.0842 0.0009 0.0093 14-01-2011 12:45 3.3487 0.0004 0.0053 18-01-2011 13:45 12.2642 0.0017 0.0087 24-01-2011 12:40 0.1984 0.0010 0.0028 26-01-2011 13:20 0.2612 0.0003 0.0023 j x ij = p k=1 g ik f kj + e ij Residual for each sample/species

Object function Q Results are constrained so that no sample can have negative source contribution; PMF allows each data point to be individually weighed; The PMF solution minimizes the object function Q, based upon the data uncertainties (u). Q = n m i=1 j=1 x ij p k=1 u ij g ik f kj 2

Getting started EPA Positive Matrix Factorization (PMF) Download the programme http://www.epa.gov/heasd/research/pmf.html

Getting started run as administrator

Order of the operations Input/Output files Analyze Input Data Base Model Results Bootstrap Model Results Fpeak Model Results - Concentration file - Input file -Output file - Configuration file - Concentration/ Uncertainty - Concentration Scatter plot - Concentration Time Series - Data exceptions - Residual Analysis - O/P Scatter Plot - O/P Time Series - Profiles/ Contributions - Aggregate Contributions - Box Plots - Summary - Profiles/ Contributions - G-Space - Diagnostics - G-Space Plot - Factor Pie Chart - Diagnostics

Input/Output Files

Samples Parameters Input/Output Files Input files 1. Concentration of chemical species 2. Uncertainties Accepted formats 1. Tab-delimited (.txt) 2. Comma-separated value (.csv) 3. MS Excel (.xls) Blank cells are not accepted Factor analysis technique provides a robust solution when the number of samples minus the number of variable is at least 30 (Henry, 1991)

Input/Output Files Types of Uncertainties files 1. Sample specific Uncertainty file provides na estimate of the uncertainty for each sample of each species; 2. Equation-based file provides species-specific parameters that PMF uses to calculate uncertainties for each sample. Method Detection Limit (MDL) Error fraction (in percentage) If concentration Detection limit => Unc = 5 6 MDL If concentration Detection limit =>Unc = (Error Fraction concentration 2 + MDL 2

Input/Output Files Missing values in the data base Concentration Uncertainty Concentration < MDL ½ MDL 5/6 MDL Missing values Geometric Mean 4 Geometric Mean

Input/Output Files *_diag *_contrib *_profile *_resid *_strength *_profile_boot *_fpeak Output Files Contains a record of the user inputs and model diagnostic information Contains the contributions for each base run Contains the profiles for each base run *_ Contains the residuals for each base run Contains the factor strength for each nase run Contains the number of bootstrap runs mapped to each base run, each bootstrap profile that was mapped to the base profile and all bootstrap statistics Contains the profiles and contributions of each fpeak run

Order of the operations Input/Output files Analyze Input Data Base Model Results Bootstrap Model Results Fpeak Model Results - Concentration file - Input file -Output file - Configuration file - Concentration/ Uncertainty - Concentration Scatter plot - Concentration Time Series - Data exceptions - Residual Analysis - O/P Scatter Plot - O/P Time Series - Profiles/ Contributions - Aggregate Contributions - Box Plots - Summary - Profiles/ Contributions - G-Space - Diagnostics - G-Space Plot - Factor Pie Chart - Diagnostics

Analyze input data Objective: - To analyze the concentration and uncertainty data before running the model; - Help the user decide whether certain species should be excluded or down-weighted; - Help the user decide whether certain samples should be excluded.

Analyze input data Concentration/Uncertainty The following statistics are calculated for each species: Signal-to-noise ratio (S/N) S N j = n i=1 x ij s ij 2 n s ij 2 i=1 Minimum concentration value 25th percentile Median 75th percentile Maximum concentration value

Analyze input data Concentration/Uncertainty Based on these statistics + knowledge of the data set the user can categorise a specie as: S/N>2 STRONG - 0.2<S/N<2 S/N<0.2 WEEK BAD uncertainty is tripled Specie is excluded

Analyze input data Scatter plots Useful pre-pmf analysis tool; The user should examine scatter plots to look for expected relations

Analyze input data Scatter plots Useful pre-pmf analysis tool; The user should examine scatter plots to look for expected relations

Analyze input data Concentration Time Series Useful to detect unusual events; Help to exclude extreme events from the model.

Order of the operations Input/Output files Analyze Input Data Base Model Results Bootstrap Model Results Fpeak Model Results - Concentration file - Input file -Output file - Configuration file - Concentration/ Uncertainty - Concentration Scatter plot - Concentration Time Series - Data exceptions - Residual Analysis - O/P Scatter Plot - O/P Time Series - Profiles/ Contributions - Aggregate Contributions - Box Plots - Summary - Profiles/ Contributions - G-Space - Diagnostics - G-Space Plot - Factor Pie Chart - Diagnostics Model Execution

Model execution 2. 1.

Model execution Base run summary Q values - assessment of how well the model fit the input data Q (robust) calculated excluding outliers Q (true) calculated including all points Theoretical Q nm-p(n+m) (n - no. of species; m no. of samples; p - no. of factors) Q (true)>1.5 Q (robust) => indicate that peak events may be disproportionately influencing the model ; Theoretical Q Q(true) and Q (robust) Only converged solutions should be investigated further (Causes for non-convergence - Uncertainties too low, specified incorrectly or inappropriate input parameters)

Order of the operations Input/Output files Analyze Input Data Base Model Results Bootstrap Model Results Fpeak Model Results - Concentration file - Input file -Output file - Configuration file - Concentration/ Uncertainty - Concentration Scatter plot - Concentration Time Series - Data exceptions - Residual Analysis - O/P Scatter Plot - O/P Time Series - Profiles/ Contributions - Aggregate Contributions - Box Plots - Summary - Profiles/ Contributions - G-Space - Diagnostics - G-Space Plot - Factor Pie Chart - Diagnostics

Base model results Residuals Objective: - To determine how well the model fit each species - Poor fit when: large residuals ( r >3) Non-normal curve

Base model results d jkl = r ijk r ijl 2 i D kl = j D jkl

Base model results

Base model results Observed/Predicted scatter plot Objective: - To determine how well the model fit each species - Evaluate if a species should be down weighted or excluded from the model

Base model results Observed/Predicted time series Objective: - To determine how well the model fit each species - Evaluate if a species should be down weighted or excluded from the model

Base model results Profiles/Contribs Objective: - Display the factors resolved by PMF; - Two graphs are displayed: - Profile graph - Mass of each species apportioned to the factor - Percent of each species apportionned to the factor - Contribution of the factor to the total mass by sample

Base model results Identifications of sources Species Source F1 Br, CO 3 2-, Ca 2+ Local soil F2 Cr, Ni, Sb, V, NO 3-, SO 4 2- Anthropogenic source F3 Sb, EC Anthropogenic source F4 Br, Zn, Cl -, NO 3-, SO 4 2-, Na +, NH 4+, K +, Mg 2+ F5 Al, As, Ce, Cr, Cu, Fe, Mn, Mi, Si, Sm, Ti, V, Pb, Kns Sea Sahara dust

V Pb Zn EC OC CO32- Cl- NO3- SO42- Na+ NH4+ K+ Kns Mg2+ Ca2+ PM10 Source contribution ( g.m -3 ) V Pb Zn EC OC CO32- Cl- NO3- SO42- Na+ NH4+ K+ Kns Mg2+ Ca2+ PM10 Concentration ( g.m -3 ) V Pb Zn EC OC CO32- Cl- NO3- SO42- Na+ NH4+ K+ Kns Mg2+ Ca2+ PM10 V Pb Zn EC OC CO32- Cl- NO3- SO42- Na+ NH4+ K+ Kns Mg2+ Ca2+ PM10 V Pb Zn EC OC CO32- Cl- NO3- SO42- Na+ NH4+ K+ Kns Mg2+ Ca2+ PM10 % of species [PM 10 ] gravimetry ( g.m -3 ) % of species Base model results 10 1 10 0 10-1 10-2 10-3 10-4 a) Al As Br Ce Cr Cu Fe Mn Ni Sb Si Sm Ti Sea 1 Anthr. EC Sea 2 Sahara 100 80 60 40 20 0 Conc. of species 10 1 b) 10 0 10-1 10-2 10-3 10-4 Al As Br Ce Cr Cu Fe Mn Ni Sb Si Sm Ti % of species 100 80 60 40 20 0 10 1 10 0 10-1 10-2 10-3 10-4 c) Al As Br Ce Cr Cu Fe Mn Ni Sb Si Sm Ti 100 80 60 40 20 0 10 1 10 0 10-1 10-2 10-3 10-4 d) Al As Br Ce Cr Cu Fe Mn Ni Sb Si Sm Ti 100 80 60 40 20 0 10 1 10 0 e) 10-1 10-2 10-3 10-4 600 500 400 300 200 100 0 Al As Br Ce Cr Cu Fe Mn Ni Sb Si Sm Ti g) 100 80 60 40 20 0 600 f) 500 400 300 y = 0.98x+4.4 200 r = 0.99 100 0 0 100 200 300 400 500 600 [PM 10 ] model ( g.m -3 ) PM 10 measured by gravimetry Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

Concentration ( g.m -3 ) Base model results a) b) 140 120 1 2 3 4 100 80 60 40 20 c) 0 PM10 Local soil Anthr. EC Sea Sahara

Base model results Aggregate Contribs Objective: - Display the factors contribution discriminated: - by year; - season; - weekday/wekeend

Base model results G-Space plot Objective: - Display 1 factor contribution versus the other; - Determine if the solution 1- has filled the solution space edge of the scatter plot will correspond to the axis 2 - has some rotational ambiguity appears na oblique edge on a space plot

Base model results Factor Pie Chart Objective: - Display the distribution of each species among the factors resolved by PMF

Order of the operations Input/Output files Analyze Input Data Base Model Results Bootstrap Model Results Fpeak Model Results - Concentration file - Input file -Output file - Configuration file - Concentration/ Uncertainty - Concentration Scatter plot - Concentration Time Series - Data exceptions - Residual Analysis - O/P Scatter Plot - O/P Time Series - Profiles/ Contributions - Aggregate Contributions - Box Plots - Summary - Profiles/ Contributions - G-Space - Diagnostics - G-Space Plot - Factor Pie Chart - Diagnostics Bootstrap runs

Bootstrap runs Objective: Estimate the stability and uncertainty of the solution How? 1 Randomly selects non-overlapping blocks of samples and creats a new input data file of the selected samples; 2 - PMF is run on the new data set and each bootstrap factor is mapped to a base run factor with which the bootstrap factor has the highest correlation, above a user-specific threshold; 3 If no base factors have a correlation above the thresholdfor a given bootstrap factor, the factor is considered unmapped;

Bootstrap runs Selected base run the base run to be used to map each bootstrap run Number of bootstraps It is recommended at least 100 bootstrap runs to ensure the robustness of the statistics Minimum Correlation R-Values the minimum Pearson correlation coefficient that will be used in the assignement of a bootstrap run factor to a base run factor. Default value 0.6 (can decrease if a large number of factors are unmapped) Seed similar to base runs, starting point for iterations Block size the number of samples that will be selected in each step for resampling

Order of the operations Input/Output files Analyze Input Data Base Model Results Bootstrap Model Results Fpeak Model Results - Concentration file - Input file -Output file - Configuration file - Concentration/ Uncertainty - Concentration Scatter plot - Concentration Time Series - Data exceptions - Residual Analysis - O/P Scatter Plot - O/P Time Series - Profiles/ Contributions - Aggregate Contributions - Box Plots - Summary - Profiles/ Contributions - G-Space - Diagnostics - G-Space Plot - Factor Pie Chart - Diagnostics

Bootstrap Model Results How many bootstrap factors were matched to each base factor

Bootstrap Model Results How many bootstrap factors were matched to each base factor

Bootstrap Model Results

Bootstrap Model Results Uncertainty of the solution - Go to output file *_diag.txt. - In Bootstrap run uncertainty statistics - Variability in concentration of species we find output of the bootstrap factor by factor. - The uncertainty is the absolute difference between the 75 th and 25 th percentiles divided by two.

Order of the operations Input/Output files Analyze Input Data Base Model Results Bootstrap Model Results Fpeak Model Results - Concentration file - Input file -Output file - Configuration file - Concentration/ Uncertainty - Concentration Scatter plot - Concentration Time Series - Data exceptions - Residual Analysis - O/P Scatter Plot - O/P Time Series - Profiles/ Contributions - Aggregate Contributions - Box Plots - Summary - Profiles/ Contributions - G-Space - Diagnostics - G-Space Plot - Factor Pie Chart - Diagnostics Fpeak runs

Fpeak runs Rotation ambiguity - The user should inspect the G-space plots for each pair of factors in the original solution to determine if an incorrect rotation has occured. - In this case the solution must to be rotated to the real solution using Fpeak

Fpeak runs Values should be between -5 and 5 (Fpeak 0)

Fpeak runs Values should be between -5 and 5 (Fpeak 0)

Order of the operations Input/Output files Analyze Input Data Base Model Results Bootstrap Model Results Fpeak Model Results - Concentration file - Input file -Output file - Configuration file - Concentration/ Uncertainty - Concentration Scatter plot - Concentration Time Series - Data exceptions - Residual Analysis - O/P Scatter Plot - O/P Time Series - Profiles/ Contributions - Aggregate Contributions - Box Plots - Summary - Profiles/ Contributions - G-Space - Diagnostics - G-Space Plot - Factor Pie Chart - Diagnostics

Fpeak model results

Fpeak model results

Thank you for your attention