A critical view on ISO standard 13528

Similar documents
Acceptance sampling theory applied to EQA sample homogeneity testing

Sample Homogeneity Testing

The AAFCO Proficiency Testing Program Statistics and Reporting

Homogeneity of EQA samples requirements according to ISO/IEC 17043

ŞİŞLİ-İSTANBUL. NİŞANTAŞI-İSTANBUL

Protocol for the design, conducts and interpretation of collaborative studies (Resolution Oeno 6/2000)

Statistical Reports in the Magruder Program

Practical Statistics for the Analytical Scientist Table of Contents

Journal of Consumer Protection and Food Safety Journal für Verbraucherschutz und Lebensmittelsicherheit

Reference Materials and Proficiency Testing. CropLife International Meeting October 2015 Gina M. Clapper

Results of Proficiency Test Rubber/Compounds May 2005

Analysis of interlaboratory comparison when the measurements are not normally distributed

Proposed Procedures for Determining the Method Detection Limit and Minimum Level

OF ANALYSIS FOR DETERMINATION OF PESTICIDES RESIDUES IN FOOD (CX/PR 15/47/10) European Union Competence European Union Vote

Schedule for a proficiency test

Review Inter-laboratory studies in analytical chemistry. Edelgard Hund, D.Luc Massart, Johanna Smeyers-Verbeke

UNDERESTIMATING UNCERTAINTY

Precision estimated by series of analysis ISO and Approach Duplicate Approach

EFFECT OF THE UNCERTAINTY OF THE STABILITY DATA ON THE SHELF LIFE ESTIMATION OF PHARMACEUTICAL PRODUCTS

HOW TO ASSESS THE MEASUREMENT UNCERTAINTY

Requirements of a reference measurement procedure and how they relate to a certified reference material for ctni that is fit for purpose

CHEK Proficiency study 644. Solvents in adhesive. Date 15 October 2016 Version 1 Status Final Report number CHEK

OA03 UNCERTAINTY OF MEASUREMENT IN CHEMICAL TESTING IN ACCORDANCE WITH THE STANDARD SIST EN ISO/IEC Table of contents

Method Validation and Accreditation

Measurement Uncertainty - How to Calculate It In The Medical Laboratory

Philipp Koskarti

Chapter 1 Statistical Inference

CHEK Proficiency study 642. Migration of formaldehyde from melamine kitchenware. Date 7 July 2016 Version 1 Status Final Report number CHEK

Uncertainty of Measurement (Analytical) Maré Linsky 14 October 2015

Analysis For quality control the reference material is analysed at the same time and in the same manner as other samples.

PROTOCOL FOR THE DESIGN, CONDUCT AND INTERPRETATION OF METHOD-PERFORMANCE STUDIES

EPAs New MDL Procedure What it Means, Why it Works, and How to Comply

Estimating MU for microbiological plate count using intermediate reproducibility duplicates method

Use of a ISO template for quantitative schemes : the issues. Annette Thomas

EURL Food Contact Material. ILC 2009/02 BPA in 50% ethanol

Unit 4. Statistics, Detection Limits and Uncertainty. Experts Teaching from Practical Experience

Scoring systems for quantitative schemes what are the different principles?

Proficiency testing: Aqueous ethanol. Test and measurement Workshop Marcellé Archer. 20 September 2011

Ivo Leito University of Tartu

Instrumentation (cont.) Statistics vs. Parameters. Descriptive Statistics. Types of Numerical Data

3.1.1 The method can detect, identify, and potentially measure the amount of (quantify) an analyte(s):

Analysis For quality control the reference material is analysed at the same time and in the same manner as other samples.

MEASUREMENT UNCERTAINTY PREPARED FOR ENAO ASSESSOR CALIBRATION COURSE OCTOBER/NOVEMBER Prepared by MJ Mc Nerney for ENAO Assessor Calibration

Report on the 2011 Proficiency Test for the determination of T-2 and HT-2 toxins in wheat flour

Validation and Standardization of (Bio)Analytical Methods

CIND e InterCinD 15 years of statistical elaboration and innovation

Analysis For quality control the reference material is analysed at the same time and in the same manner as other samples.

Analysis For quality control the reference material is analysed at the same time and in the same manner as other samples.

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

ISO/TS TECHNICAL SPECIFICATION. Water quality Guidance on analytical quality control for chemical and physicochemical water analysis

A basic introduction to reference materials. POPs Strategy

International Symposium Standardisation of non-targeted methods for food authentication, Session II: Standardisation of Analytical Methods

Method Validation. Role of Validation. Two levels. Flow of method validation. Method selection

ON USING BOOTSTRAP APPROACH FOR UNCERTAINTY ESTIMATION

A case study on how to maintain confidence of thermal properties test: Thermal conductivity of building insulation materials

Bayesian methods for sample size determination and their use in clinical trials

Laboratoria De Nayer

And how to do them. Denise L Seman City of Youngstown

Results of Proficiency Test OPP, PCP and TeCP in textile December 2016

Useful Reading (continued)

CHEK Proficiency study 664. Solvents in adhesive. Date 3 July 2017 Version 1. Report number CHEK Final CHEK proficiency study July 2017

Analytical Method Validation: An Updated Review

DECISION LIMITS FOR THE CONFIRMATORY QUANTIFICATION OF THRESHOLD SUBSTANCES

NATIONAL ASSOCIATION OF TESTING AUTHORITIES (NATA) REQUIREMENTS FOR ACCREDITATION OF ICP-MS TECHNIQUES

Uncertainty of Measurement

Chapter 23: Inferences About Means

NVLAP TEM PROFICIENCY Individual Laboratory Report LAB ERROR POINT SUMMARY

Hypothesis Testing with the Bootstrap. Noa Haas Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods

Joint Research Centre (JRC)

Workshop on Understanding and Evaluating Radioanalytical Measurement Uncertainty November 2007

CHEK Proficiency study 671. CMI, MI and free formaldehyde in body lotion. Date 30 November 2017 Version 1. Report number CHEK

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE

Homogeneity Assessment for Grass Samples Used for Organically Bound Tritium Proficiency Test

Reference Materials for Trade and Development: Quality and Comparability

STUDY OF THE APPLICABILTY OF CONTENT UNIFORMITY AND DISSOLUTION VARIATION TEST ON ROPINIROLE HYDROCHLORIDE TABLETS

Response Surface Methodology

Bayesian Model Diagnostics and Checking

-However, this definition can be expanded to include: biology (biometrics), environmental science (environmetrics), economics (econometrics).

Development of a harmonised method for specific migration into the new simulant for dry foods established in Regulation 10/2011

Assessing Test Quality from EQA/PT Surveys. James O. Westgard University of Wisconsin Medical School Sten A. Westgard Westgard QC, Inc.

TNI V1M Standard Update Guidance on Detection and Quantitation

Analysis of variance

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Biol/Chem 4900/4912. Forensic Internship Lecture 5

This procedure describes the monitoring activities in a laboratory quality control (QC) program to ensure the quality of test results.

ISO INTERNATIONAL STANDARD. Statistical methods for use in proficiency testing by interlaboratory comparisons

Measurement Uncertainty: A practical guide to understanding what your results really mean.

Technical Report Collaborative Study of CORESTA Ignition Propensity Monitor Test Piece CM IP 2 for the Determination of Ignition Propensity

Glossary for the Triola Statistics Series

European Brewery Convention

Validation Scheme for Qualitative Analytical Methods

A Discussion of Measurement of Uncertainty for Life Science Laboratories

International interlaboratory study of forensic ethanol standards

Standard Terminology Relating to Quality and Statistics 1

It is important that the batch numbers of the reference material and on the certificate are identical.

Confidence intervals and Hypothesis testing

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations

PROFICIENCY TESTING: EXPERIENCES FROM CROATIA ON THE ISSUE OF HEAVY METALS DETERMINATION IN MARINE SEDIMENTS

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Transcription:

A critical view on ISO standard 13528 Wim Coucke EQALM symposium, Dublin, 20th of October 2017 WIV-ISP Rue Juliette Wytsmanstraat 14 1050 Brussels Belgium T+32 2 642 55 23 F +32 2 642 56 45 wim.coucke@wiv-isp.be www.wiv-isp.be

ISO 13528(2012): aims Detailed descriptions of statistical methods Procedures can be applied to demonstrate that the measurements meet specified criteria for acceptable performance Applicable to either quantitative measurements or qualitative observations Applicable, especially for newly established proficiency testing schemes.

Sample quality: Homogeneity ISO 13528: At least 10 samples at random Analyse samples in random order in duplicate Analytical variability of method<0.5σ pt inter-sample standard deviation <0.3σ pt Inferential test, reject sample if evidence of heterogeneity

Sample quality: homogeneity Probability of non-estimable sample standard deviation increases with ratio analytical variability/sample variability If σ s = 0.3σ pt and σ a = 0.5σ pt, duplicates: 21% If σ s = 0.4σ pt and σ a = 0.5σ pt, duplicates: 11% Probability can be reduced by taking more replicates per sample.

Sample quality: homogeneity Probability of non-estimable sample standard deviation increases with ratio analytical variability/sample variability If σ s = 0.3σ pt and σ a = 0.5σ pt, triplicates: 8% If σ s = 0.4σ pt and σ a = 0.5σ pt, triplicates: 2%

Sample quality: homogeneity Inferential test: = +, reject batch if > Test only rejects if sample heterogeneity is too large Accepts batch in all other cases

Sample quality: homogeneity Probability of accepting batch (%) 0 20 40 60 80 100 s s <0.3σ pt σ a =0.1σ pt > σ a =0.1σ pt > σ a =0.5σ pt 0.0 0.2 0.3 0.4 0.6 0.8 1.0 Theoretical sample standard deviation as a fraction of σ pt (σ s /σ pt )

Sample quality: homogeneity What about qualitative schemes? ISO 13528: Appropriate sample of proficiency test items, all of which should demonstrate the expected property value What is the frequency of nonconforming units in a batch if a certain number of samples are all conforming and we don t have any prior knowledge about the effectiveness of our preparation procedure? 95% lower limit of confidence: Sample size 10 50 57 100 95% upperlimit of confidence of rate of nonconforming units 23.8 5.7 5 2.9 Use acceptance sampling theory or Bayesian approach with informative prior

Sample quality: Stability ISO 13528: take at least 2 samples, measure them in the beginning and at the end. If difference of means <0.3σ pt, accept batch for stability Comment: Probability of meeting criterion in case of perfect stability: ~(,. ), ~(0,. #( <0.3!" = ~(,. +. ) ), ~(0,0.5!" ) # <0.3!" >0 +#( > 0.3!" < 0) =0.45

Sample quality: stability Probability of acceptance in case of perfect stability (%) 40 50 60 70 80 90 100 σ a =0.1σ pt σ a =0.25σ pt σ a =0.5σ pt 5 10 15 20 Number of samples Solution: use very precise method or assess relation between date of analysis and reported value

Normal distribution of data ISO 13528 not very clear: 6.6.1: not always necessary to verify normal distribution, approximate symmetry is enough, only if outlier searching techniques are used (point 6.6.1). 9.4.4: proficiency testing provider may wish to check the normality of the distribution if there is a very large number of participants. In fact: Threshold values of Z scores are based on the normal distribution, so better to check. Robust estimators of variability assume a normal distribution with outliers Recommended way of checking: normal quantile plot after removing outliers.

Data quality: multimodality ISO 13528: mentions unimodality two times and evaluates robust estimation procedures with respect to their robustness against multimodality In fact: Multimodality may show bad repartitioning of peer groups or lack of harmonisation Multimodality is an interesting finding, not something we should protect ourselves against

Data quality: multimodality Histogram or density plot for multimodality? Enough data are needed Hartigan s dip test Silverman test (more powerful) After adjusting for outliers and same values appearing multiple times!

Estimates of deviation: Based on reported data: MAD / niqr / Algorithm A / Qn / Q Comment: - Algorithm A: originally designed for finding assigned value. - Qn: only when all values are different

Estimates of deviation Z scores: ( )*( ---- Z scores: ( ) *( +,-+ ( Comments: Z scores are corrected for uncertainty in estimation of central location Why is there no corrected for uncertainty in estimation of deviance when standard deviation is calculated based on the reported results?

Estimates of deviation Z scores: ( )*( ---- Z scores: ( ) *( +,-+ ( Possible solution: compare Z score with quantile of t distribution Give corresponding inverse quantile of normal distribution eg: Z-score of 4, n=10. Quantile of t-distribution: 0.998448; corresponding inverse quantile of normal Distribution: 2.96 0.000 0.010 0.020 3 4 5 Z-score of 4, n=20. Corresponding value: 3.36

Assessing quality checks for estimates of location and deviation ISO 13528: The proficiency testing provider should apply a procedure to monitor interlaboratory agreement, to track changes in performance and ensure the reasonableness of statistical procedures.... Estimates of variability should be plotted on graphs sequentially or as a time-series

How to do as well: monitor interlaboratory agreement Characteristic function:./ = 0 +1 95% confidence bounds Standard deviation 0.0 0.1 0.2 0.3 0.4 0 1 2 3 4 Assigned value

ISO 13528(2012): aims Detailed descriptions of statistical methods Procedures can be applied to demonstrate that the measurements meet specified criteria for acceptable performance Applicable to either quantitative measurements or qualitative observations Applicable, especially for newly established proficiency testing schemes.

Conclusion Very complete in some cases, too superficial in others ISO 13528 lacks sufficient description for long-term evaluation qualitative schemes

Literature Cárdenas S, Valcárcel M. Analytical features in qualitative analysis. TrAC Trends Anal Chem. 2005;24:477 87. Coucke W, China B, Delattre I, Lenga Y, Van Blerk M, Van Campenhout C, et al. Comparison of different approaches to evaluate External Quality Assessment Data. Clin Chim Acta. 2012;413:582 6. Coucke W, Van Blerk M, Libeer J-C, Van Campenhout C, Albert A. A new statistical method for evaluating long-term analytical performance of laboratories applied to an external quality assessment scheme for flow cytometry. Clin Chem Lab Med. 2010;48:645 50. Coucke, W., Charlier, C., Lambert, W., Martens, F., Neels, H., Tytgat, J.,... & Verstraete, A. G. (2015). Application of the characteristic function to evaluate and compare analytical variability in an external quality assessment scheme for serum ethanol. Clinical chemistry, 61(7), 948-954. Duewer DL. A comparison of location estimators for interlaboratory data contaminated with value and uncertainty outliers. Accreditation Qual Assur. 2008;13:193 216. Ehrmeyer S, Laessig R. Alternative statistical approach to evaluating interlaboratory performance. Clin Chem. 1985;31:106 8.

Literature Ellison SLR, Fearn T. Characterising the performance of qualitative analytical methods: Statistics and terminology. TrAC Trends Anal Chem. 2005;24:468 76. Hartigan, J. A., & Hartigan, P. M. (1985). The dip test of unimodality. The annals of Statistics, 70-84. Langton SD, Chevennement R, Nagelkerke N, Lombard B. Analysing collaborative trials for qualitative microbiological methods: accordance and concordance. Int J Food Microbiol. 2002;79:175 81. Ram B. J. A recursive version of Grubbs test for detecting multiple outliers in environmental and chemical data. Clin Biochem. 2010;43:1030 3. Schilling, E. G., & Neubauer, D. V. (2017). Acceptance sampling in quality control. CRC Press. Searle, S. R., Casella, G., & McCulloch, C. E. (2009). Variance components (Vol. 391). John Wiley & Sons. Silverman, B. W. (1981). Using kernel density estimates to investigate multimodality. Journal of the Royal Statistical Society. Series B (Methodological), 97-99.