Quantile POD for Hit-Miss Data

Similar documents
POD Tutorial Part I I Review of ahat versus a Strategies

Model-Assisted Probability of Detection for Ultrasonic Structural Health Monitoring

Markov Chain Monte Carlo methods

Bayesian Inference. Chapter 1. Introduction and basic concepts

Bayesian Defect Signal Analysis

Statistical Practice

Bayesian Regression Linear and Logistic Regression

A Note on Bayesian Inference After Multiple Imputation

Practical Methods to Simplify the Probability of Detection Process

Parameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1

Principles of Bayesian Inference

Bayesian inference. Fredrik Ronquist and Peter Beerli. October 3, 2007

PARAMETER ESTIMATION: BAYESIAN APPROACH. These notes summarize the lectures on Bayesian parameter estimation.

Bayesian Estimation for the Generalized Logistic Distribution Type-II Censored Accelerated Life Testing

Principles of Bayesian Inference

Estimating Probability of Detection Curves Related to Eddy Current Sender Receiver Probes

eqr094: Hierarchical MCMC for Bayesian System Reliability

Principles of Bayesian Inference

INVERTED KUMARASWAMY DISTRIBUTION: PROPERTIES AND ESTIMATION

POD(a) = Pr (Y(a) > '1').

Approximate Bayesian computation for spatial extremes via open-faced sandwich adjustment

Principles of Bayesian Inference

Bagging During Markov Chain Monte Carlo for Smoother Predictions

Using prior knowledge in frequentist tests

Bayesian Modeling of Accelerated Life Tests with Random Effects

Heriot-Watt University

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

Bayesian Methods in Multilevel Regression

Slice Sampling with Adaptive Multivariate Steps: The Shrinking-Rank Method

McGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-675.

Introduction. Chapter 1

Bayesian Methods for Estimating the Reliability of Complex Systems Using Heterogeneous Multilevel Information

Lecture 5: Bayes pt. 1

INTRODUCTION TO BAYESIAN INFERENCE PART 2 CHRIS BISHOP

Monte Carlo in Bayesian Statistics

Default Priors and Effcient Posterior Computation in Bayesian

Stat 5101 Lecture Notes

Simultaneous Prediction Intervals for the (Log)- Location-Scale Family of Distributions

Penalized Loss functions for Bayesian Model Choice

McGill University. Department of Epidemiology and Biostatistics. Bayesian Analysis for the Health Sciences. Course EPIB-682.

Strong Lens Modeling (II): Statistical Methods

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

AFRL-ML-WP-TR

IMPROVED METHODOLOGY FOR INSPECTION RELIABILITY ASSESSMENT FOR DETECTING SYNTHETIC HARD ALPHA INCLUSIONS IN TITANIUM

Bayes: All uncertainty is described using probability.

BAYESIAN ESTIMATION OF LINEAR STATISTICAL MODEL BIAS

Statistical Methods in Particle Physics

MEASUREMENT UNCERTAINTY AND SUMMARISING MONTE CARLO SAMPLES

A Bayesian Approach to Phylogenetics

n =10,220 observations. Smaller samples analyzed here to illustrate sample size effect.

STA 4273H: Statistical Machine Learning

(5) Multi-parameter models - Gibbs sampling. ST440/540: Applied Bayesian Analysis

Nonparametric Bayesian Methods (Gaussian Processes)

The Bayesian Approach to Multi-equation Econometric Model Estimation

Bayesian inference for factor scores

Nonparametric Bayesian Methods - Lecture I

Lecture 5. G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1

STA 4273H: Sta-s-cal Machine Learning

Modeling Uncertainty in the Earth Sciences Jef Caers Stanford University

Bayesian model selection for computer model validation via mixture model estimation

Bayesian Econometrics

Bayesian Statistical Methods. Jeff Gill. Department of Political Science, University of Florida

Part 8: GLMs and Hierarchical LMs and GLMs

STAT 499/962 Topics in Statistics Bayesian Inference and Decision Theory Jan 2018, Handout 01

Bayesian rules of probability as principles of logic [Cox] Notation: pr(x I) is the probability (or pdf) of x being true given information I

Introduction to Bayesian Inference

David Giles Bayesian Econometrics

Supplement to A Hierarchical Approach for Fitting Curves to Response Time Measurements

Dynamic System Identification using HDMR-Bayesian Technique

Advanced Statistical Methods. Lecture 6

Bayesian Life Test Planning for the Weibull Distribution with Given Shape Parameter

ST 740: Markov Chain Monte Carlo

Bayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework

Bayesian networks for multilevel system reliability

Statistical Inference

σ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) =

Fractional Imputation in Survey Sampling: A Comparative Review

STA 4273H: Statistical Machine Learning

PART I INTRODUCTION The meaning of probability Basic definitions for frequentist statistics and Bayesian inference Bayesian inference Combinatorics

Bayesian Methods for Machine Learning

Review. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

1 Bayesian Linear Regression (BLR)

Statistics in Environmental Research (BUC Workshop Series) II Problem sheet - WinBUGS - SOLUTIONS

Tools for Parameter Estimation and Propagation of Uncertainty

Bayesian Inference. p(y)

Bayesian Inference. Anders Gorm Pedersen. Molecular Evolution Group Center for Biological Sequence Analysis Technical University of Denmark (DTU)

Plausible Values for Latent Variables Using Mplus

Bayesian Inference for the Multivariate Normal

Markov Chain Monte Carlo

Introduction to Systems Analysis and Decision Making Prepared by: Jakub Tomczak

Model comparison. Christopher A. Sims Princeton University October 18, 2016

Vector Autoregression

Subject CS1 Actuarial Statistics 1 Core Principles

BAYESIAN METHODS FOR VARIABLE SELECTION WITH APPLICATIONS TO HIGH-DIMENSIONAL DATA

Introduction to Probability and Statistics (Continued)

Bayesian Networks in Educational Assessment

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Comparison of Three Calculation Methods for a Bayesian Inference of Two Poisson Parameters

Bayesian modelling. Hans-Peter Helfrich. University of Bonn. Theodor-Brinkmann-Graduate School

Bayesian Estimation of DSGE Models 1 Chapter 3: A Crash Course in Bayesian Inference

Transcription:

Quantile POD for Hit-Miss Data Yew-Meng Koh a and William Q. Meeker a a Center for Nondestructive Evaluation, Department of Statistics, Iowa State niversity, Ames, Iowa 50010 Abstract. Probability of detection (POD) is commonly used to measure an NDE inspection procedure s performance. Due to inherent variability in the inspection procedure caused by variability in factors such as operators and crack morphology, for some purposes it is important to model POD as a random function. Traditionally, inspection variabilities are pooled and an estimate of the mean POD is reported. In some applications it is important to know how poor typical inspections might be and this question can be answered by estimating a quantile of the POD distribution. This paper shows how to fit a proper model to hit-miss data and considers estimation of the mean POD as well as quantiles of the POD distribution for binary (hit-miss) NDE data. We also show how to compute credible intervals for these quantities using a Bayesian estimation approach. Keywords: Bayesian Estimation, Binary Regression, Random Effects PACS: 0.50.-r, 81.70 INTRODCTION Background and Motivation Probability of Detection (POD) is the most commonly used metric to characterize the performance of an NDE procedure. POD curves provide important inputs for making accept/reject decisions, scheduling inspections, doing lifing calculations, and performing risk analyses. There are many sources of variability that can affect the outcome of an NDE inspection, such as operator-to-operator differences, calibration, and crack morphology. For example, if there are important differences in the performance of different operators, different operators will have different POD curves. Traditionally, the POD that has been reported has been what we call the mean POD. That is, what has been reported is a POD that estimates the average POD over all sources of variability. sually, along with this estimate, a set of pointwise lower 95% confidence intervals for POD, reflecting the uncertainty from limited data, are also reported. We have seen confusion between the lower confidence bound for mean POD and the concept of a quantile POD. They are not the same. i, Meeker, and Spencer [1] and i, Meeker, and Thompson [] introduce the concept of a quantile POD for quantitative signal-response NDE data (sometimes known as â vs a data). In this paper we extend these ideas and show how to estimate a quantile of the POD distribution from hit-miss inspection data. To illustrate our methods, we use a subset of the Have Cracks, Will Travel (HCWT) data described in ewis et al. [3]. In particular, we use the hit-miss data that were obtained from the examination of 95 cracks of various sizes in aircraft components by 14 different operators using eddy current surface scans. Figure 1 shows the number of inspections in the dataset versus crack length on a logarithmic axis. A few issues to be noted about this dataset are: Not all cracks were inspected by all inspectors. Some cracks were inspected twice by the same inspector. More information about these data can be found in Singh [4]. With the capabilities of modern statistical modelling methods, the unbalance noted in the last two points causes no serious problems in the data analysis and we are able to estimate the different components of variability and other unknown model parameters without difficulty.

FIGRE 1. Number of inspections vs crack length in the dataset (on a log axis). OGISTIC REGRESSION MODE Basic ogistic Regression Model The binary logistic regression model for estimation of Hit-Miss POD data is presented in MI-HDBK-183A [5]. Here we review the basic ideas. et Y denote a binary random variable, with We let 1 for a hit Y =. 0 for a miss x = log(crack length). Then the logistic binary regression model is where exp( z) ( z) logis 1 exp( z) POD( x) = Pr( Y = 1; x) = ( x) exp( x) 0 1 logis 0 1 0 1 1 exp( x) The logistic binary regression model can also be expressed as [POD( x)] = x 1 logis 0 1 where the function ( p ) = log( p / (1 p )). 1 logis An Alternative ogistic Regression Model Spencer [6] suggested a generalization to the binary regression hit-miss model

POD( x) = ( ) ( x). (1) logis 0 1 that allows the POD to have a lower limit (upper limit) for the POD that is greater than 0 (less than 1). Here, and are unknown parameters with 0 < 1. Generally, will be close to 0, and will be close to 1. The inclusion of these two parameters allows for the possibilities that: The POD never goes to zero as crack length decreases, but tails off to a threshold small positive value, generally due to hits caused by noise artifacts that cause a signal that is larger than that caused by a small crack. The POD could have its asymptote at a value that is less than 1, generally caused by having a positive probability of a miss for large cracks, perhaps caused by a gross operator error. This model was also used in Knopp and Zeng [7]. In our analysis of the HCWT data, we found both these additional terms to be important. ogistic Regression Model with Random Effects Inspections and POD studies are carried out Across different cracks (which, even if they are of the same size, will have different response distributions due to crack-to-crack variability), By different operators (e.g., with different skills and amounts of experience). Also, due to the fact that some operators inspected an entire set of cracks twice, an additional estimable source of variability arises from the setup (e.g., through inspection-system setup and initial calibration, etc) of these sets of cracks. We note, also, that the cracks were from two types of components (A and B). An indicator variable z is introduced, with the following definition: 0 if theflaw is fromsample A z =. 1 if theflaw is fromsample B We assume that the cracks, operators and setups used in the study are sampled from a larger population of cracks, operators and setups. We say that operators, cracks and setups are random effects. In the random effects model, the probability of a hit for an operator effect, a crack effect and a setup effect, can be expressed as POD( x) = Pr( Y( x) = 1,, ) = ( ) ( x z ), () logis from which we have 1 POD( x) = x z. logis (3) In this model, we note that, and are random and we will assume that N(0, ), N(0, ) and N are mutually independent. Here,,,,,,, and (0, ) are the parameters to be estimated from the experimental data. The model in (3) is similar to that described in i, Meeker, and Thompson (014), except that the observed data is hit-miss instead of signal-response ( â versus a ) data and we added a model term because some operators saw some cracks more than once. We note here that in this application, x is taken as the log-transformed crack length. In this paper, we follow the usual convention used in most data analysis literature and use natural (base e ) logarithms.

BAYESIAN MOTIVATION, ESTIMATION, AND RESTS Motivation for sing Bayesian Estimation Following the approach in i, Meeker, and Thompson [], we use a Bayesian method for estimation for the following reasons: When diffuse prior distributions are used, the Bayesian analysis yields results that are close to those that would be produced by a maximum likelihood based analysis. That is, it has been shown that Bayesian credible bounds, when used with diffuse prior information and reasonable sample sizes, have good frequentist properties (i.e., 95% credible bounds will contain the true POD with approximate probability 0.95). Bayesian estimation can still be carried out with models that are more complicated than (3) (e.g. for models which are non-linear in the parameters and/or random effects). The Bayesian framework provides a straightforward procedure to estimate important functions of the parameters, such as mean POD and quantile POD and to obtain corresponding credible bounds, even with random effects. When valid informative prior information is available, it can be integrated with the data to improve precision. Bayesian methods for statistical inference are described in numerous places, including Carlin and ouis [8] and Gelman et al. [9]. Software packages are available to do the computations. Bayes Theorem et q = (,,,,,,, ) ' be the vector of the parameters in model (3). A likelihood-based analysis estimates these parameters by using the observed data and identifying the value of q which maximises the likelihood of the data [denoted by f ( y q) ]. A Bayesian analysis of the data, however, integrates prior knowledge about parameters with the available data. This prior knowledge is specified by using prior probability distributions [described by a joint density function (q) ]. When there is little or no knowledge about the parameters, it is possible to use diffuse prior distributions that are relatively flat over that part of the parameter space that has non-negligible likelihood. In this paper, we use diffuse prior distributions. The conditional distribution of the parameters, given the data, q Y = y, is called the posterior distribution of q. According to Bayes theorem, the posterior density is g(q y) f( y q) (q), (4) where f ( y q) is the likelihood of the data y when the true parameter vector is q. Bayes theorem then allows us to update the information that we have about the parameter vector after observing the data. The combined information is reflected in the joint posterior distribution of the parameters. Prior Specification When there is little or no available prior information about the parameters in q, we can use a diffuse prior distribution. By doing this, the joint posterior will be approximately proportional to the likelihood and the influence of the prior specification will be small as long as there is a reasonable amount of data. For and (unrestricted parameters), we used a normal distribution with a large variance. For and, we used niform (0,0.5) and niform (0.5,1) prior distributions respectively. For, and (positive parameters), we used a gamma distribution with a large variance. The Joint Posterior Distribution and Bayesian Parameter Estimates In the modern approach to Bayesian inference, one uses draws from the joint posterior distributions of the parameters and the random effects. These draws can be obtained by using a Markov Chain Monte Carlo (MCMC)

procedure. A sufficiently large number of these draws allows computation of parameter estimates, estimates of functions of parameters, and credible intervals for these quantities. * * We denote the k =1,..., M draws from the joint posterior distribution of q by q 1,...,q M. To estimate a scalar function of the parameters g (q), draws from the marginal posterior distribution of g(q) are obtained by evaluating at * * q 1,...,q M. A Bayes point estimate is given by the median of the set of simulated values { g,..., g }. A 90% * * 1 M * * credible interval for g(q) is given by the 0.05 and 0.95 empirical quantiles of the { g1,..., gm} values. In the following section we give expressions for the particular g (q) functions needed for POD analysis in our random effects model. INFERENCE FOR FNCTIONS OF PARAMETERS OF THE POD DISTRIBTION We note that for any randomly selected operator and crack of log crack length x and crack membership z from the population of operators and cracks, POD = ( ) logis ( x z ), is a random variable because, and are random. For most applications of POD, one would not be interested in specific crack-operator-setup combinations. Instead, one would typically be interested in situations in which cracks, operators and setups are random draws from a population of operators, cracks and setups. We would like to make inferences about the mean POD (averaging over many inspections) or perhaps some quantile of the POD distribution. sually, this would be a lower-tail quantile to indicate the chance of having an operator-crack combination that might result in a poor POD. Mean of the POD Distribution It can be shown that the mean of the POD distribution (averaging over all of the random effects) for a given logcracklength x and crack membership z can be expressed as 1 s logis x z g(,,,,,,,, x, z) = 1 ds (5) nor 1 1 t where ( ) = e dt nor is the standard normal (Gaussian) cumulative distribution function, used here due to the normal distribution assumption for the random effects. For given values of the parameters and logcracklength x, (5) can be evaluated using numerical integration. Figure shows an estimate and 95% lower and 95% upper credible bounds for the mean POD for the eddy current surface scans, based on the HCWT data.

FIGRE. Mean of the POD distribution with credible bounds. Quantiles of the POD Distribution The p quantile of the POD distribution for a given log(cracklength) x and crack membership z, denoted by g ( ) p x, can be found by solving for g p, which gives us g x z 1 p logis nor = p 1 g p x logis x z nor p ( ) = ( ) ( ). (6) Obtaining the Bayesian estimate and credible bounds for g ( ) p x is achieved by the same procedure that was used for the POD mean, but using the function gp( x ) instead of g(,,,,,,, x, z). Again, repeating the computations over a range of values of x would give us estimates of the quantiles of the POD distribution (and credible bounds for these quantiles) as a function of log crack length x. Figure 3 shows the logistic model Bayes estimates of the 0.05 quantiles of the POD distribution as a function of crack area, along with 95% lower and 95% upper credible bounds for the 0.05 quantile of the POD for the eddy current surface scans, based on the HCWT data. Comparing the estimates in Figures and 3 shows that the mean POD gives an overly optimistic reflection of POD, relative to the quantile POD that reflects the possibility of getting a poor inspection result on any given inspection.

FIGRE 3. Bayes estimates and 95% lower and 95% upper credible bounds for the 0.05 quantile of the POD distribution. CONCSIONS Some NDE applications report inspection data in binary (hit-miss) form. A common metric for inspection procedure performance is the POD. Different sources of variability in the inspection process suggest that we model the POD as a random variable. That is, due to differences in factors such as operator and crack morphology, the POD for a crack of a given size may vary from inspection to inspection. For some applications, especially when safety is involved, there may be interest in how poor of an inspection one might have, due to the inspection-to-inspection variability. Instead of estimating the POD mean (averaged over all sources of variability such as operator and crack morphology), it might be more appropriate to compute estimates of quantiles of the POD distribution. There are considerable differences between these two summaries of the POD distribution. ACKNOWEDGEMENTS This material is based upon work supported by the Air Force Research aboratory under Contract # FA8650-04- C-58 at Iowa State niversity s Center for Nondestructive Evaluation. REFERENCES 1. i, M., W. Q. Meeker, and F. W. Spencer, Distinguishing between uncertainty and variability in nondestructive evaluation, Review of Progress in Quantitative Nondestructive Evaluation, Vol. 31, pp. 175 173, (01).. i, M., W. Q. Meeker, and R. B. Thompson, Physical model assisted probability of detection in nondestructive evaluation for detecting of flaws in titanium forgings, Technometrics, 014 (in press). 3. ewis, W. H., B. D. Dodd, W. H. Sproat, and J. M. Hamilton (1978), Reliability of non-destructive evaluations - final report, SAF SA-AC/MEE 76-6-38-1. Available online at http://www.dtic.mil/cgibin/gettrdoc?ocation= &doc=gettrdoc.pdf&ad=ada07097, accessed 9 Oct 01. 4. R. Singh, Three decades of NDI reliability assessment, Report No. Karta-3510-99-01, 000. Available online at http://www.cnde.iastate.edu/mapod/reference970-1999.pdf, accessed 1 Feb 013. 5. MI-HDBK-183A, Nondestructive evaluation system reliability assessment, 009. Available online at http://www.statisticalengineering.com/mh183/mi-hdbk-183a(009).pdf, accessed 16 May 01.

6. F. W. Spencer, Identifying sources of variation for reliability analysis of field inspections, Sandia National aboratories Report, (SAND 98-0980C), 1998. 7. Knopp, J. S. and. Zeng, Statistical analysis of hit/miss data, Materials Evaluation, 3:3 39, 013. 8. Carlin, B. P. and T. A. ouis, Bayesian Methods for Data Analysis (3 ed.), Chapman & Hall/CRC Texts in Statistical Science, 008. 9. Gelman, A., J. B. Carlin, H. S. Stern, and D. B. Rubin (004), Bayesian Data Analysis ( ed.), Chapman & Hall/CRC, 004.