
Graduate Theses and Dissertations, Iowa State University, 2012. Recommended citation: Zoh, Roger, "Using the negative log-gamma distribution for Bayesian system reliability assessment" (2012). Graduate Theses and Dissertations. 12951. http://lib.dr.iastate.edu/etd/12951

Using the negative log-gamma distribution for Bayesian system reliability assessment

by

Roger Saï Zoh

A dissertation submitted to the graduate faculty in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY

Major: Statistics

Program of Study Committee: Alyson Wilson, Major Professor; Max Morris; Mark Kaiser; Bill Meeker; Daniel Nordman; Scott Vander Wiel

Iowa State University, Ames, Iowa, 2012

Copyright © Roger Saï Zoh, 2012. All rights reserved.

DEDICATION

I would like to dedicate this thesis to my parents Tiemoko and Madeleine, without whose support and prayers I would not have been able to complete this work. I would also like to thank my friends and family for their guidance and words of encouragement during the writing of this work.

TABLE OF CONTENTS

LIST OF TABLES
LIST OF FIGURES
ACKNOWLEDGEMENTS
ABSTRACT
CHAPTER 1. Bayesian Assessment
  1.1 Properties of the Negative Log-Gamma Distribution
    1.1.1 Previous work
    1.1.2 The NLG as a prior distribution
    1.1.3 Properties of the posterior distribution
  1.2 Example Revisited
  1.3 Results and Discussion
CHAPTER 2. Negative Log-Gamma Prior Distributions
  2.1 Motivating Example
  2.2 Previous Work
  2.3 Independent End Points and Independent Decrement Priors
    2.3.1 Independent end points prior
    2.3.2 Independent decrement prior
  2.4 Motivating Example Revisited
CHAPTER 3. Specifying Negative Log-Gamma Prior Distributions
  3.1 Another Motivating Example
  3.2 Independent End Point Prior Distributions
    3.2.1 Three component quantiles, two times
    3.2.2 Three system quantiles, two times
    3.2.3 Three component quantiles, three times
    3.2.4 Three system quantiles, three times
    3.2.5 Four component quantiles, two times
    3.2.6 Four system quantiles, two times
    3.2.7 Four component quantiles, three times
  3.3 Independent Decrement Prior Distributions
    3.3.1 Three component quantiles, two times
    3.3.2 Three component quantiles, three times
    3.3.3 Four component quantiles, two times
    3.3.4 Four component quantiles, three times
  3.4 Discussion
CHAPTER 4. Component Selection
  4.1 Motivating Example
  4.2 A Finite Mixture Prior Approach to Component Decrement Modeling
    4.2.1 Model specification
    4.2.2 Determination of prior based on system quantiles
  4.3 Motivating Example Revisited
  4.4 Missile System Data Revisited
CHAPTER 5. Conclusions and Future Directions
  5.1 Summary
  5.2 Recommendations for Future Work
APPENDIX A. ADDITIONAL MATERIAL
  A.1 Distribution function of R(t), independent end points prior
    A.1.1 Show that Equation 2.3 is a non-negative, non-decreasing function of r(t)
    A.1.2 Show that Equation 2.3 converges to 1 as r(t) → 1
  A.2 Distribution function of R(t), independent decrement prior
    A.2.1 Show that Equation 2.7 is a non-negative, non-decreasing function of r(t)
    A.2.2 Show that Equation 2.7 converges to 1 as r(t) → 1
  A.3 Joint prior distribution of a and b, independent end points prior
  A.4 Joint prior distribution of a and b, independent decrement prior
APPENDIX B. MCMC Implementation for Point Mass Mixture Prior on System Decrement
BIBLIOGRAPHY

LIST OF TABLES

Table 1.1  Series system reliability example from Hamada et al. (2008)
Table 2.1  Summary of the flight test data. The data are formatted as number of tests (number of failures).
Table 3.1  Component test data. Data are formatted as number of successful tests/number of tests.
Table 4.1  System and component test data. Data are formatted as number of successful tests/number of tests.

LIST OF FIGURES

Figure 1.1  NLG probability density functions for various values of the shape (α) and scale (β) parameters
Figure 1.2  Q1 = quantile from the posterior using NLG(2, 0.18) and Q2 = quantile from the posterior using Beta(5.6, 0.66) as component priors
Figure 2.1  Histogram of the prior distribution of R_i(0) (component) and R(0) (system) assuming that a_i ∼ Normal(0, 1000²) and b_i ∼ Normal(0, 1000²)
Figure 2.2  Posterior distribution for Component 1 and 2 reliability with two variants of diffuse normal priors
Figure 2.3  Component and system priors as induced by Prior 3
Figure 2.4  Posterior distributions for Component 2 reliability using three priors
Figure 2.5  Posterior of Component 1 and Component 2, logit transform of reliability, using Prior 1 and Prior 2
Figure 2.6  Posterior of Component 1 and Component 2, logit transform of reliability, using Prior 3
Figure 2.7  Posterior distributions for Component 1 reliability using three priors
Figure 2.8  Posterior distributions for system reliability using three priors
Figure 3.1  Posterior distribution of component and system reliability
Figure 3.2  Probability density function for a NLG(1/4, 1)
Figure 3.3  Contour plot of the joint prior distribution of a and b and the marginal prior distributions of a and b (independent end points prior)
Figure 3.4  Paths from example prior distribution. For the sample path in dash, b = 20.74 and a = 21.25
Figure 3.5  Contour plot of the joint posterior distribution of a and b and the marginal posterior densities of a, b, and logit(r(1)) for Component 1
Figure 3.6  Contour plot of the joint posterior distribution of a and b and the marginal posterior densities of a, b, and logit(r(1)) for Component 3
Figure 3.7  Unbounded NLG densities
Figure 3.8  Prior distribution and posterior distributions of Component 1 and Component 3 reliability for three component quantiles at two times (independent end point prior)
Figure 3.9  Prior distribution for a component and system reliability, and posterior distribution for Component 4 for three system quantiles at two times (independent end point prior)
Figure 3.10  Component 1 prior and posterior distributions for reliability for three component quantiles at three times (independent end point prior)
Figure 3.11  Prior and posterior distribution for Component 1 and Component 4 reliability over time for three system quantiles at three times (independent end points prior)
Figure 3.12  Component prior distribution and Component 1 and Component 3 posterior distributions for four component quantiles at two times (independent end points prior)
Figure 3.13  System and component reliability prior distributions and Component 1 and 4 posterior distributions for four system quantiles at two times (independent end points prior)
Figure 3.14  Prior distribution for component reliability and posterior distributions for Components 1, 3, and 4 with four component quantiles at three times (independent end points priors)
Figure 3.15  Posterior distributions for Component 1, Component 3, and Component 4 reliability
Figure 3.16  Scatterplots of the prior and posterior distributions of a and b
Figure 3.17  Component 1 and Component 3 prior and posterior distributions for three component quantiles and two times (independent decrement prior)
Figure 3.18  Comparison of two priors obtained by slightly different component quantile constraints
Figure 3.19  System prior distribution, and prior and posterior distribution for Components 1, 3, and 4 for four quantiles and two times (independent decrement priors)
Figure 3.20  Prior and posterior distribution of Component 1 and 3 reliability for four quantiles at three times (independent decrement priors)
Figure 4.1  Prior distribution of system reliability for Analysis 1 and Analysis 2
Figure 4.2  Quantile-quantile plots comparing the prior distribution of system reliability at times 0, 0.5, 1, and 1.5
Figure 4.3  Posterior distribution of system reliability for Analysis 1 and Analysis 2
Figure 4.4  Quantile-quantile plot comparing the posterior distribution of system reliability at times 0, 1, 1.5, and 2 for Analysis 1 and Analysis 2
Figure 4.5  Posterior distribution of system reliability for Analysis 1, 2, and 3
Figure 4.6  Quantile-quantile plot comparing the posterior distribution of system reliability at times 0, 1, 1.5, and 2 for Analysis 1 and 3
Figure 4.7  Posterior distributions for Components 1, 2, and system reliability for π = 0.4: (left) mixture prior and (right) non-mixture prior
Figure 4.8  Posterior distributions of Components 1, 2, and system reliability over time for π = 5/6

ACKNOWLEDGEMENTS

I would like to take this opportunity to express my thanks to those who helped me with various aspects of conducting research and the writing of this thesis. First and foremost, I thank Dr. Alyson Wilson for her guidance, patience, and support throughout this research and the writing of this thesis. Her dedication, insights, and words of encouragement have often inspired me and renewed my hopes for completing my graduate education. I would also like to thank my committee members for their efforts and contributions to this work: Dr. Max Morris, Dr. Mark Kaiser, Dr. Bill Meeker, Dr. Daniel Nordman, and Dr. Scott Vander Wiel. I would especially like to thank Dr. Earl Lawrence and Dr. Scott Vander Wiel for their guidance and dedication throughout this project. They found time in their busy schedules to work with me every week, and their support made this research possible.

ABSTRACT

Modeling system reliability over time when binary data are collected both at the system and component level has been the subject of many papers. In a series system, it is often assumed that component reliability is linear in time through some link function. Often little or no information exists on the parameters of the linear regression, and in a Bayesian analysis they are modeled using diffuse priors. This can have unintended consequences for the analysis, specifically for the extrapolation of component and system reliabilities. In this work, we consider negative log-gamma (NLG) distributions for specifying prior information on reliability. We first show how our method can be implemented to model the reliability of a series system at a given time and extend to the case where we are interested in modeling reliability over time. We then discuss methods of estimation for the parameters of the NLG prior based on quantiles obtained from expert knowledge. Finally, we propose a component selection approach to help identify active and inactive components. The component selection approach leads to reasonable estimates of trend in the reliability of a large system when only a few components among many actually contribute to the trend.

CHAPTER 1. Bayesian Assessment

When assessing the reliability of a system, data are often collected at system, subsystem, and component levels. This dissertation considers a single simple system structure: a series system, where all components must be working for the system to work. Binary data are collected over time at the system and component levels, and prior information may be available about the system and component reliabilities. A Bayesian approach is used to analyze the data. The primary focus of the dissertation is the examination of a flexible class of prior distributions for use with this problem.

To introduce the problem, consider the following example from Hamada et al. (2008). Table 1.1 contains binary data, both successes and failures, on system and component tests from a three-component series system. The data are assumed to be collected at a single point in time. Let R_i denote the reliability of Component i (the probability of a successful Component i test) and R_0 denote the reliability of the system. Because of the series system structure, R_0 = R_1 R_2 R_3 = \prod_{i=1}^{3} R_i. The likelihood for the data collected about Component i is Binomial(n_i, R_i), and the system likelihood is Binomial(n_0, R_0). R_1, R_2, and R_3 are unknown parameters, for which we need to specify a prior distribution. Let \pi(R_1, R_2, R_3) denote the joint prior distribution for the vector of unknown parameters. Their joint posterior is proportional to

p(R_1, R_2, R_3 \mid x) \propto \left[ \prod_{i=1}^{3} R_i^{x_i} (1 - R_i)^{n_i - x_i} \right] \left( \prod_{i=1}^{3} R_i \right)^{x_s} \left( 1 - \prod_{i=1}^{3} R_i \right)^{n_s - x_s} \pi(R_1, R_2, R_3).   (1.1)

In this dissertation, we will consider how to specify \pi(R_1, R_2, R_3) and how to generalize the prior specification when R_i is a function of time.
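To make Equation 1.1 concrete, here is a minimal sketch (in Python, with illustrative function names) that evaluates the unnormalized log-posterior for the three-component series system using the counts from Table 1.1; the flat Beta(1, 1) placeholder prior is an assumption standing in for the prior specifications developed later.

```python
import numpy as np
from scipy import stats

# Data from Table 1.1: component successes x_i out of n_i tests, plus system data.
x = np.array([8, 7, 3])
n = np.array([10, 9, 4])
x_s, n_s = 10, 12

def log_posterior(R):
    """Unnormalized log-posterior of Equation 1.1 under a flat (Beta(1, 1)) prior on each R_i."""
    R = np.asarray(R, dtype=float)
    if np.any((R <= 0.0) | (R >= 1.0)):
        return -np.inf
    R0 = np.prod(R)  # series system: R_0 = R_1 * R_2 * R_3
    log_lik = np.sum(stats.binom.logpmf(x, n, R)) + stats.binom.logpmf(x_s, n_s, R0)
    log_prior = 0.0  # placeholder flat prior; beta and NLG choices are discussed below
    return log_lik + log_prior

print(log_posterior([0.9, 0.85, 0.8]))
```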

Table 1.1: Series system reliability example from Hamada et al. (2008)

               Successes (x)   Failures   Units Tested (n)
Component 1          8             2            10
Component 2          7             2             9
Component 3          3             1             4
System              10             2            12

1.1 Properties of the Negative Log-Gamma Distribution

1.1.1 Previous work

One common specification for the prior distribution π(R_1, R_2, R_3) is the product of independent beta distributions. Unfortunately, the beta family is not closed under products, and, as Parker (1972) points out, the induced system prior is a complex distribution and not a member of the beta family. The negative log-gamma (NLG) distribution is another alternative. The NLG has been considered in several papers where modeling component and system reliability is of interest. Parker (1972) observes that NLG distributions are induced priors for component reliability when a rectangular (Uniform(0,1)) prior distribution is assumed for a series system's reliability. Mastran (1976) and Mastran and Singpurwalla (1978) arrive at a NLG prior for component reliability by assuming an exponential distribution for the time-to-failure data and further assuming a gamma distribution for its rate parameter. Specifically, let T be the time-to-failure of a component, with T ∼ Exponential(θ), where θ is the failure rate, and θ ∼ Gamma(α, β). Then the component reliability is R = P(T > t) = exp(−tθ) ∼ NLG(α, tβ). Waller et al. (1977) and Martz and Waller (1982) describe a method for estimating the parameters of a NLG prior specified on component reliability based on quantiles obtained from expert opinion. Waller et al. (1977) and Martz and Waller (1982) also provide a detailed discussion of the properties of the NLG distribution. More recently, Allella et al. (2001) considers modeling the reliability of a complex system in the presence of imprecise component reliability data due to random environmental conditions. They model component reliability using a NLG distribution.

The scale of the component NLG distribution is assumed to depend on the cumulative hazard function describing the uncertainty in the component data collected. Allella et al. (2001) justifies the choice of the NLG distribution by the modeling flexibility offered by the NLG distribution and the robustness of the system NLG prior assumption with respect to component-level departures from the NLG. Allella et al. (2005) assumes an NLG distribution for component reliability in a complex system, with the goal of finding an optimal allocation of component reliability subject to some system reliability cost constraints.

In most of the references cited above, the modeling of component or system reliability was done assuming either that time-to-failure data were collected, or that the mean and variance of the prior distribution of component or system reliability were provided. However, many authors have considered Bayesian approaches for modeling system and component reliability using binary data. Cole (1975) considers an analysis of system reliability that combines both system and component test data assuming beta priors. Martz and Waller (1988) considers a Bayesian approach to modeling system reliability when binary data are collected at the component, subsystem, and system level. Martz and Waller (1988) uses beta priors and a bottom-up approach where priors are updated moving from the bottom level (component) to the top level (system) of the whole system. This approach requires approximating the product of independent beta distributions by another beta distribution along the way. Martz et al. (1990) extends Martz and Waller (1988) to the case of a complex series and parallel system. In most of the cases discussed where binary data were collected, the prior considered for the component reliability is the beta distribution. However, the assumption of beta-distributed component reliability requires an approximation in the assessment of the induced prior distribution on the system reliability, adding a layer of difficulty to the modeling task. In what follows, we illustrate the modeling advantages presented by the NLG prior over the assumption of beta priors for both component and system reliability.

1.1.2 The NLG as a prior distribution

Consider the special case in which the system reliability R_0 ∼ Uniform(0, 1). Let Z = −log(R_0) and Z_i = −log(R_i), which implies Z = \sum_{i=1}^{C} Z_i.

Since Z ∼ Exponential(1), if Z_i ∼ Gamma(α_i, 1), where \sum_{i=1}^{C} \alpha_i = 1 and α_i > 0, the relationship is satisfied. The α_i can be chosen in many ways. One choice is α_i = 1/C, so that the induced priors on all of the components are the same. The resulting distribution on R_i is then a NLG(α_i, 1), with p.d.f.

\pi(R_i) = \frac{1}{\Gamma(\alpha_i)} [-\log(R_i)]^{\alpha_i - 1}.

The kth moment of a NLG(α, β) distribution is given by

E(R^k) = \left( \frac{1}{1 + k\beta} \right)^{\alpha}, \quad k = 1, 2, \ldots

Now consider the generalization to an informative prior distribution for R_0. Let Z = −log(R_0), with Z ∼ Gamma(α, β), α > 0 and β > 0. If Z_i ∼ Gamma(α_i, β), where \sum_{i=1}^{C} \alpha_i = α, then the system-component relationship is again satisfied, and R_i ∼ NLG(α_i, β). The system reliability density function is given by

\pi(R_0) = \frac{1}{\Gamma(\alpha)\beta^{\alpha}} [-\log(R_0)]^{\alpha - 1} R_0^{\frac{1}{\beta} - 1},   (1.2)

and the component reliability expression is obtained by replacing α with α_i in Equation 1.2. Equation 1.2 is the p.d.f. of a NLG(α, β), where α > 0 is the shape parameter and β > 0 is the scale parameter. Taking the first derivative of Equation 1.2 with respect to R_0, we have

\pi'(R_0) = \frac{1}{\Gamma(\alpha)\beta^{\alpha}} [-\log(R_0)]^{\alpha - 2} R_0^{\frac{1}{\beta} - 2} \left[ \left( \frac{1}{\beta} - 1 \right)(-\log(R_0)) - (\alpha - 1) \right],   (1.3)

which implies a critical point at R_0 = \exp\left( \frac{\beta(\alpha - 1)}{\beta - 1} \right). We plot the probability density functions for various choices of the α and β parameters in Figure 1.1.

1.1.3 Properties of the posterior distribution

NLG priors are not conjugate to the binomial likelihood, but the moments of the posterior distribution of R can be obtained in closed form (see Lawrence and Vander Wiel (2005)). Suppose that R has a NLG prior as given in Equation 1.2, and that n units are tested, resulting in m successes and n − m failures. The posterior distribution of R is then proportional to

p(R \mid n, m) \propto \frac{1}{\Gamma(\alpha)\beta^{\alpha}} [-\log(R)]^{\alpha - 1} R^{\frac{1}{\beta} + m - 1} (1 - R)^{n - m} = \frac{1}{\Gamma(\alpha)\beta^{\alpha}} [-\log(R)]^{\alpha - 1} R^{\frac{1}{\beta} + m - 1} \sum_{j=0}^{n-m} \binom{n-m}{j} (-1)^j R^j.

Figure 1.1: NLG probability density functions for various values of the shape (α) and scale (β) parameters, with α, β ∈ {0.25, 1, 3}.
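A minimal sketch, assuming the NLG(α, β) density of Equation 1.2 (with β a scale parameter, so that −log R ∼ Gamma(α, scale = β)), that can be used to reproduce curves like those in Figure 1.1 and to check the density and moment formula numerically; the helper name nlg_pdf is illustrative.

```python
import numpy as np
from scipy import stats, integrate

def nlg_pdf(r, alpha, beta):
    """NLG(alpha, beta) density of Equation 1.2: if Z = -log(R) ~ Gamma(alpha, scale=beta),
    then R = exp(-Z) has density gamma_pdf(-log r) / r on (0, 1)."""
    r = np.asarray(r, dtype=float)
    return stats.gamma.pdf(-np.log(r), alpha, scale=beta) / r

# Check that the density integrates to 1 and that E[R] = (1 + beta)^(-alpha)
# for the parameter values shown in Figure 1.1.
for alpha in (0.25, 1.0, 3.0):
    for beta in (0.25, 1.0, 3.0):
        total, _ = integrate.quad(nlg_pdf, 0.0, 1.0, args=(alpha, beta))
        mean, _ = integrate.quad(lambda r: r * nlg_pdf(r, alpha, beta), 0.0, 1.0)
        print(alpha, beta, round(total, 4), round(mean, 4), round((1.0 + beta) ** (-alpha), 4))
```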

The normalizing constant is given by a linear combination of NLG(α, β/(βm + 1)) moments:

K(\alpha, \beta, m, n) = \int_0^1 \frac{(1 + \beta m)^{\alpha}}{\Gamma(\alpha)\beta^{\alpha}} [-\log(r)]^{\alpha - 1} r^{\frac{1}{\beta} + m - 1} \sum_{j=0}^{n-m} \binom{n-m}{j} (-1)^j r^j \, dr = \sum_{j=0}^{n-m} \binom{n-m}{j} (-1)^j \int_0^1 \frac{(1 + \beta m)^{\alpha}}{\Gamma(\alpha)\beta^{\alpha}} [-\log(r)]^{\alpha - 1} r^{\frac{1}{\beta} + m - 1} r^j \, dr.

Noting the kernel of a NLG(α, β/(βm + 1)) in the integrand, the above equation becomes

K(\alpha, \beta, m, n) = \sum_{j=0}^{n-m} \binom{n-m}{j} (-1)^j \left( 1 + \frac{\beta j}{\beta m + 1} \right)^{-\alpha}.

The posterior moments are also given by linear combinations of NLG moments:

E(R^q \mid n, m) = \frac{1}{K(\alpha, \beta, m, n)} \sum_{j=0}^{n-m} \binom{n-m}{j} (-1)^j \int_0^1 \frac{(1 + \beta m)^{\alpha}}{\Gamma(\alpha)\beta^{\alpha}} [-\log(r)]^{\alpha - 1} r^{\frac{1}{\beta} + m - 1} r^{j+q} \, dr.

Noting again the kernel of a NLG(α, β/(βm + 1)), the above equation becomes

E(R^q \mid n, m) = \frac{1}{K(\alpha, \beta, m, n)} \sum_{j=0}^{n-m} \binom{n-m}{j} (-1)^j \left( 1 + \frac{\beta(j + q)}{\beta m + 1} \right)^{-\alpha}.

1.2 Example Revisited

Consider again the posterior distribution given in Equation 1.1. Suppose that we have prior information about system reliability. We believe that the median system reliability is 0.73 and the 90th percentile of system reliability is 0.91. We want to derive prior distributions for the component reliability. Assume that we want to use the same independent beta distributions to describe each component's reliability. The parameters of the beta prior distributions for the component reliabilities can be obtained through simulation. The problem of finding ã and b̃, the values of the parameters of the component reliability prior, can be seen as a minimization problem.

We need to find ã and b̃ so that

d(ã, b̃) = \left( q(0.5, ã, b̃) - 0.73 \right)^2 + \left( q(0.9, ã, b̃) - 0.91 \right)^2

is minimized. Note that q(p, ã, b̃) represents the pth quantile of the system reliability. The distribution of the system reliability is approximated using Monte Carlo simulation as the product of independent samples from a Beta(ã, b̃) distribution. Here, ã = 5.6 and b̃ = 0.66. The joint prior distribution on the vector of parameters (R_1, R_2, R_3), assuming beta priors, is proportional to

\pi(R_1, R_2, R_3) \propto R_1^{4.6} (1 - R_1)^{-0.34} R_2^{4.6} (1 - R_2)^{-0.34} R_3^{4.6} (1 - R_3)^{-0.34}.

Suppose instead that we model system reliability using a NLG distribution. To determine the parameters of the NLG, we perform the same minimization, although the system reliability now has a known functional form. We find that the system-level NLG prior consistent with the given quantiles is NLG(2, 0.18). Assuming exchangeable components, each component prior is R_i ∼ NLG(2/3, 0.18). The joint prior distribution on the vector of parameters (R_1, R_2, R_3) is proportional to

\pi(R_1, R_2, R_3) \propto [-\log(R_1)]^{-1/3} R_1^{\frac{1}{0.18} - 1} [-\log(R_2)]^{-1/3} R_2^{\frac{1}{0.18} - 1} [-\log(R_3)]^{-1/3} R_3^{\frac{1}{0.18} - 1}.

The NLG prior specification can easily accommodate cases where it is unreasonable to assume exchangeable components in the prior specification; the independent beta distributions would require additional simulation and approximation (see Cole (1975), Martz and Waller (1988)). Using both beta and NLG priors and the data in Table 1.1, a Markov chain Monte Carlo (MCMC) algorithm can easily be implemented for both models to simulate from their respective posterior distributions.
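A hedged sketch of the quantile-matching step for the NLG system prior, assuming only that quantiles of R ∼ NLG(α, β) follow from the gamma quantiles of −log R; the function names are illustrative, and the optimizer recovers values close to the NLG(2, 0.18) reported above.

```python
import numpy as np
from scipy import stats, optimize

def nlg_quantile(p, alpha, beta):
    """p-th quantile of R ~ NLG(alpha, beta): since -log(R) ~ Gamma(alpha, scale=beta),
    the p-th quantile of R is exp(-q), where q is the (1 - p)-th gamma quantile."""
    return np.exp(-stats.gamma.ppf(1.0 - p, alpha, scale=beta))

def loss(theta):
    alpha, beta = np.exp(theta)  # optimize on the log scale to keep alpha, beta > 0
    return ((nlg_quantile(0.5, alpha, beta) - 0.73) ** 2 +
            (nlg_quantile(0.9, alpha, beta) - 0.91) ** 2)

res = optimize.minimize(loss, x0=np.log([1.0, 0.5]), method="Nelder-Mead")
alpha_hat, beta_hat = np.exp(res.x)
print(alpha_hat, beta_hat)                      # roughly 2 and 0.18
print(nlg_quantile(0.5, alpha_hat, beta_hat))   # ~0.73
print(nlg_quantile(0.9, alpha_hat, beta_hat))   # ~0.91
```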

1.3 Results and Discussion

Both models provide very similar results. Quantile-quantile plots for the posterior distributions of system reliability under the two models are given in Figure 1.2a, along with the Component 2 posterior reliability distribution in Figure 1.2b. The posterior distributions are quite similar, as the quantiles fall on a straight line for the system and Component 2 reliabilities.

Figure 1.2: Q1 = quantile from the posterior using NLG(2, 0.18) and Q2 = quantile from the posterior using Beta(5.6, 0.66) as component priors. (a) Q-Q plot of the posterior distribution of R_0 under the NLG(2, 0.18) and Beta(5.6, 0.66) priors, along with the line y = x. (b) Q-Q plot of the posterior distribution of R_2 under the NLG(2, 0.18) and Beta(5.6, 0.66) priors, along with the line y = x.

Given prior information on the system reliability in a series system, choosing to express a prior belief using a beta distribution or a NLG distribution has little effect on the conclusions of the analysis for the data in Table 1.1. In fact, a NLG distribution can be chosen to closely match a beta distribution by quantile matching or by mean and variance matching (see Allella et al. (2001)). Also, Allella et al. (2001) shows that inference about system reliability is robust to moderate departures from the assumption of an NLG prior for component reliability. Specifying a NLG prior for the component or system reliability provides some clear modeling advantages over beta distributions. The NLG prior eliminates the need to determine prior parameters by simulation or numerical approximation, since the NLG family is closed under products. The NLG prior easily accommodates series systems with a large number of components without increasing the computational burden for assessment of the system reliability. The method illustrated through this example can also be extended to the case where we collect data over time on the components and the system.
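As a numerical sanity check on the closed-form posterior moments of Section 1.1.3, the following sketch compares the moment formula with direct numerical integration of the unnormalized NLG-binomial posterior; the parameter and data values are purely illustrative.

```python
import numpy as np
from scipy import integrate
from scipy.special import comb

def moment_closed_form(q, alpha, beta, m, n):
    """E(R^q | n, m) for a NLG(alpha, beta) prior with m successes in n trials (Section 1.1.3)."""
    j = np.arange(n - m + 1)
    w = comb(n - m, j) * (-1.0) ** j
    num = np.sum(w * (1.0 + beta * (j + q) / (beta * m + 1.0)) ** (-alpha))
    den = np.sum(w * (1.0 + beta * j / (beta * m + 1.0)) ** (-alpha))
    return num / den

def moment_numeric(q, alpha, beta, m, n):
    """Same moment by numerically integrating the unnormalized posterior density."""
    def unnorm(r):
        return (-np.log(r)) ** (alpha - 1.0) * r ** (1.0 / beta + m - 1.0) * (1.0 - r) ** (n - m)
    num, _ = integrate.quad(lambda r: r ** q * unnorm(r), 0.0, 1.0)
    den, _ = integrate.quad(unnorm, 0.0, 1.0)
    return num / den

# Illustrative check: NLG(2, 0.18) prior with 10 successes in 12 trials.
print(moment_closed_form(1, 2.0, 0.18, 10, 12), moment_numeric(1, 2.0, 0.18, 10, 12))
```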

CHAPTER 2. Negative Log-Gamma Prior Distributions

The previous chapter examined NLG prior distributions when reliability was considered at a single time point. In this chapter, we extend the discussion to assume that reliability is changing over time.

2.1 Motivating Example

Consider the following example, which is abstracted from a Department of Defense missile system. Binary data were collected on a series system with six components. We denote each of the components as C_i, for i = 1, ..., 6, and the system as C_0. The component test data were clustered in time; i.e., many tests were done at time t = 0, and the remaining tests were performed around t = 150. The system data were collected at various times between t = 0 and t = 105. The data are displayed in Table 2.1. The table shows the number of tests with the number of failures in parentheses. For Component 1, 190 tests were done at t = 0, and none of them resulted in a failure. For the system data, 72 tests were done between t = 0 and t = 104, and 12 resulted in failure. Let X_ijk denote the kth test result on the ith component at time t_j, where i = 0, 1, ..., 6, j = 1, 2, ..., T_i, and k = 1, 2, ..., N_i, where T_i denotes the number of distinct times at which data were collected for component i, and N_i denotes the total number of tests at time t_i. We assume X_ijk ∼ Bernoulli(R_i(t_j)) and that logit(R_i(t)) = a_i + b_i t, for i = 1, 2, ..., 6. To complete the model specification, we need to choose prior distributions for a_i and b_i. Suppose that we choose a diffuse independent normal distribution for the intercept, a_i ∼ Normal(0, 1000²). We will consider two specifications for the slope b_i. The first is b_i ∼ Normal(0, 1000²), which was considered in Weaver and Hamada (2008). The second specification is b_i ∼ Normal(0, 10²), which was used by Anderson-Cook et al. (2009).

Table 2.1: Summary of the flight test data. The data are formatted as number of tests (number of failures).

Component    Age = 0      130 < Age < 154
    1        190 (0)       92 (2)
    2        196 (0)       77 (0)
    3        123 (2)       82 (2)
    4        197 (1)       79 (54)
    5        136 (3)       82 (21)
    6        136 (1)       82 (18)
System       Ages 0-104:   72 (12)

Both prior specifications induce a prior distribution for component reliability at time 0 with most of its mass concentrated near 0 and 1. The induced prior for system reliability at time 0 is a distribution with most of its mass at 0. Figure 2.1 is a histogram of the distributions of component and system reliability at time 0. The posterior distributions for C_1 are shown in Figures 2.2a and 2.2b. The posterior distributions for C_2 are shown in Figures 2.2c and 2.2d. Note the steep drop of the lower bound of the credible interval for C_1 after we stop observing data. This is undesirable behavior in the posterior distribution; ideally, we would like to be able to predict at least a short distance into the future with a credible interval narrower than (0, 1). For Component 2, we observed no failures. The two priors lead to very different conclusions about reliability at times after t = 150. When the prior for b_2 is Normal(0, 1000²), the posterior distribution for reliability is concentrated at 1 with no uncertainty. When the prior for b_2 is Normal(0, 10²), the posterior distribution for reliability has uncertainty that spans (0, 1) shortly after we stop observing data. Another counterintuitive feature of Figures 2.2c and 2.2d is that the more diffuse prior for b_2 leads to a narrower posterior distribution on the reliability scale. In what follows, we seek to remedy the inadequacies presented in the posterior distributions obtained from the diffuse prior specifications by considering NLG priors.
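A small Monte Carlo sketch, under the logit-link model and the diffuse Normal(0, 1000²) priors described above, showing how the induced prior on reliability at time 0 piles up near 0 and 1 for a component and near 0 for the six-component series system, as in Figure 2.1 (sample sizes and names are illustrative).

```python
import numpy as np
from scipy.special import expit

rng = np.random.default_rng(0)
n_draws = 100_000

# Diffuse prior on the intercept of logit(R_i(t)) = a_i + b_i * t; only a_i matters at t = 0.
a = rng.normal(0.0, 1000.0, size=n_draws)
r_component_0 = expit(a)  # induced prior on a single component's reliability at t = 0

# Induced prior on the six-component series-system reliability at t = 0.
a_six = rng.normal(0.0, 1000.0, size=(n_draws, 6))
r_system_0 = np.prod(expit(a_six), axis=1)

# Component reliability: almost all prior mass within 0.01 of 0 or 1 (cf. Figure 2.1a).
print(np.mean((r_component_0 < 0.01) | (r_component_0 > 0.99)))
# System reliability: most prior mass near 0 (cf. Figure 2.1b).
print(np.mean(r_system_0 < 0.01))
```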

Figure 2.1: Histogram of the prior distribution of R_i(0) (component) and R_0(0) (system), assuming that a_i ∼ Normal(0, 1000²) and b_i ∼ Normal(0, 1000²). (a) Prior component reliability distribution at age 0; (b) prior system reliability distribution at age 0.

Figure 2.2: Posterior distribution for Component 1 and Component 2 reliability with two variants of diffuse normal priors. (a) Component 1 with b_1 ∼ Normal(0, 1000²); (b) Component 1 with b_1 ∼ Normal(0, 10²); (c) Component 2 with b_2 ∼ Normal(0, 1000²); (d) Component 2 with b_2 ∼ Normal(0, 10²).

2.2 Previous Work

Many authors have discussed how various sources of information can be combined in the Bayesian assessment of system and component reliability. One approach to combining system and component reliability data is the top-down method. This approach specifies a prior for the system, updates system reliability using system data, derives a prior for the components based on updated system reliability, updates component reliability using component data, and finally derives a posterior reliability for the system. Mastran (1976) uses the top-down method in the assessment of a series system using failure time and binary data at the system and component levels. Mastran and Singpurwalla (1978) extends the work of Mastran (1976) to more complex systems. Martz and Almond (1997) also uses the top-down method in the quantification of fault tree reliability. Another approach is the bottom-up method. This approach specifies priors at the component level, updates component reliability using component data, and continues this process using data collected at intermediate levels to finally obtain the system reliability.

Martz and Waller (1988) proposes assessing system reliability in a series system of independent binomial subsystems and components using a bottom-up approach. Martz et al. (1990) extends this work to systems with more complex component structures. Both the top-down and bottom-up approaches rely on approximations of the priors and posteriors at each stage. More recent work has relied on a fully Bayesian approach that simultaneously models system and component reliability. The fully Bayesian approach relies on Markov chain Monte Carlo (MCMC) methods for simulation of posterior distributions. MCMC algorithms enable the analysis of more complex systems with a wider variety of data. Johnson et al. (2003) considers a hierarchical model to assess the reliability of a system represented as a block diagram with binary data; Hamada et al. (2004) extends this to a fault tree and considers the optimal design for collecting additional data at different levels in the fault tree. Graves and Hamada (2005) considers modeling system reliability over time with a variety of data collected on the components. Wilson et al. (2006) also considers the assessment of component and system reliability over time when failure time data, failure count data, binary data, and degradation data are collected on components. Anderson-Cook et al. (2009) combines component and system data collected over time to model the system reliability and determine which components degrade with time. The analysis assumes that component reliability is linear in time through some link function, and priors are formulated on the parameters of the regression.

2.3 Independent End Points and Independent Decrement Priors

Recall the general problem statement. Consider a series system with C components. Let X_ijk denote the kth test result on the ith component at time t_j, where i = 0, 1, ..., C, j = 1, 2, ..., T_i, and k = 1, 2, ..., N_i, where T_i denotes the number of distinct times at which data were collected for component i, and N_i denotes the total number of tests at time t_i. We assume X_ijk ∼ Bernoulli(R_i(t_j)) and R_i(t) = f(a_i + b_i t), t > 0, for i = 1, 2, ..., C. We will consider various choices for the link function f(·), where f : ℝ → (0, 1) is defined on the whole real line. We assume that f(·) is a monotone and continuous function on ℝ. For a series system with C independent components, the system reliability R_0 is

R_0(t) = \prod_{i=1}^{C} f(a_i + b_i t).

For the moment, we consider the specification of a prior distribution for a single component. For simplicity, we suppress the subscript notation and write R(t) = f(a + bt).

2.3.1 Independent end points prior

We need to specify a joint prior distribution for (a, b). Our first specification chooses prior distributions on reliability at two distinct times t = t_0 and t = t_1. We describe our prior knowledge about reliability at these two time points using independent NLG distributions, and consequently, we call this specification an independent end points prior distribution. The distribution of R(t), the reliability at some other time t, is not NLG distributed. With a little algebra, we can analytically compute the distribution of R(t). The joint prior distribution for (R(t_0), R(t_1)) is

\pi(R(t_0), R(t_1)) = \frac{1}{\Gamma(\alpha_0)\beta_0^{\alpha_0}} [-\log(R(t_0))]^{\alpha_0 - 1} R(t_0)^{\frac{1}{\beta_0} - 1} \cdot \frac{1}{\Gamma(\alpha_1)\beta_1^{\alpha_1}} [-\log(R(t_1))]^{\alpha_1 - 1} R(t_1)^{\frac{1}{\beta_1} - 1}.

A prior distribution for (a, b) can be derived using standard change-of-variables techniques. Define Y_0 = f^{-1}(R(t_0)) and Y_1 = f^{-1}(R(t_1)), where f(·) is the link function and f^{-1}(·) its inverse. We can then write

a = \frac{t_1 Y_0 - t_0 Y_1}{t_1 - t_0} \quad \text{and} \quad b = \frac{Y_1 - Y_0}{t_1 - t_0}.

In what follows, it will sometimes be useful to parameterize in terms of (R(t_0), R(t_1)), sometimes in terms of (a, b), and sometimes in terms of (Y_0, Y_1). For example, R(t) = f(\eta(t) Y_1 + (1 - \eta(t)) Y_0), where \eta(t) = \frac{t - t_0}{t_1 - t_0}.

Let Y = logit(R). If R ∼ NLG(α, β), we can derive the density of Y as

f(y \mid \alpha, \beta) = \frac{\exp(-y)}{\Gamma(\alpha)\beta^{\alpha}} \left( \frac{1}{1 + \exp(-y)} \right)^{\frac{1}{\beta} + 1} [\log(1 + \exp(-y))]^{\alpha - 1},   (2.1)

for -\infty < y < \infty.

Define γ = (α_0, α_1, β_0, β_1). Since we assume that R(t_0) and R(t_1) are independent, so are Y_0 = logit(R(t_0)) and Y_1 = logit(R(t_1)). From Equation 2.1, the joint distribution of Y_0 and Y_1 is

f(y_0, y_1 \mid \gamma) = C(\gamma) \exp(-y_0) \left( \frac{1}{1 + \exp(-y_0)} \right)^{\frac{1}{\beta_0} + 1} [\log(1 + \exp(-y_0))]^{\alpha_0 - 1} \exp(-y_1) \left( \frac{1}{1 + \exp(-y_1)} \right)^{\frac{1}{\beta_1} + 1} [\log(1 + \exp(-y_1))]^{\alpha_1 - 1},   (2.2)

where C(γ) is the normalizing constant depending on α_0, α_1, β_0, β_1. Since Y(t) = \frac{(t - t_0)Y_1 + (t_1 - t)Y_0}{t_1 - t_0}, we can derive the marginal distribution of Y(t) by convolution as

f(y(t) \mid \gamma) = \int_{-\infty}^{\infty} f\left( \frac{(t_1 - t_0)y(t) - (t_1 - t)y_0}{t - t_0}, \, y_0 \,\middle|\, \gamma \right) \frac{t_1 - t_0}{t - t_0} \, dy_0.

If y_p(t) = logit(r_p(t)) is the pth quantile of Y(t), then

\int_{-\infty}^{y_p(t)} f(y(t) \mid \gamma) \, dy(t) = \int_{-\infty}^{y_p(t)} \int_{-\infty}^{\infty} f\left( \frac{(t_1 - t_0)y(t) - (t_1 - t)y_0}{t - t_0}, \, y_0 \,\middle|\, \gamma \right) \frac{t_1 - t_0}{t - t_0} \, dy_0 \, dy(t)

is exactly equal to p. We can exchange the order of the integrals above by Tonelli's theorem to obtain

\int_{-\infty}^{y_p(t)} f(y(t) \mid \gamma) \, dy(t) = \int_{-\infty}^{\infty} \int_{-\infty}^{y_p(t)} f\left( \frac{(t_1 - t_0)y(t) - (t_1 - t)y_0}{t - t_0}, \, y_0 \,\middle|\, \gamma \right) \frac{t_1 - t_0}{t - t_0} \, dy(t) \, dy_0.

Using the change of variables u_0 = \log(1 + \exp(-y_0)) and setting

U(u_0, y_p(t)) = \log\left( 1 + \exp\left( -\frac{(t_1 - t_0)y_p(t) - (t_1 - t)y_0}{t - t_0} \right) \right),

the above integral becomes

\int_0^{\infty} \text{dgamma}(u_0, \alpha_0, \beta_0) \left( 1 - \text{pgamma}\left( U(u_0, r(t)), \alpha_1, \beta_1 \right) \right) du_0,   (2.3)

where dgamma(u, α, β) and pgamma(u, α, β) denote, respectively, the density function and the distribution function of a Gamma(α, β) random variable. In Equation 2.3,

U(u_0, r(t)) = \log\left( 1 + \exp\left( -\frac{(t_1 - t_0)\,\text{logit}(r(t)) + (t_1 - t)\log(\exp(u_0) - 1)}{t - t_0} \right) \right),

where r(t) is the reliability at time t. Equation 2.3 can be used to easily compute quantiles of the distribution of Y(t) = logit(R(t)), eliminating the need for simulation.
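A sketch of how Equation 2.3 can be evaluated and inverted numerically, with scipy's gamma density, survival function, and quadrature standing in for the dgamma/pgamma notation above; the end-point prior parameters shown are illustrative assumptions, not elicited values.

```python
import numpy as np
from scipy import integrate, optimize, stats

def logit(r):
    return np.log(r) - np.log1p(-r)

def cdf_Rt(r, t, t0, t1, a0, b0, a1, b1):
    """P(R(t) <= r) under the independent end points prior (Equation 2.3).
    (a0, b0) and (a1, b1) are the NLG shape/scale parameters for R(t0) and R(t1)."""
    y = logit(r)

    def integrand(u0):
        # U(u0, r(t)) from Equation 2.3; log(exp(u0) - 1) written stably as u0 + log(1 - exp(-u0)).
        log_em1 = u0 + np.log1p(-np.exp(-u0))
        c = ((t1 - t0) * y + (t1 - t) * log_em1) / (t - t0)
        U = np.logaddexp(0.0, -c)  # log(1 + exp(-c)), computed without overflow
        return stats.gamma.pdf(u0, a0, scale=b0) * stats.gamma.sf(U, a1, scale=b1)

    value, _ = integrate.quad(integrand, 0.0, np.inf)
    return value

def quantile_Rt(p, t, t0, t1, a0, b0, a1, b1):
    """p-th prior quantile of R(t), found by inverting the distribution function."""
    return optimize.brentq(lambda r: cdf_Rt(r, t, t0, t1, a0, b0, a1, b1) - p,
                           1e-9, 1.0 - 1e-9)

# Illustrative end-point priors: R(0) ~ NLG(1, 0.05) and R(1) ~ NLG(1, 0.10).
print(quantile_Rt(0.5, t=2.0, t0=0.0, t1=1.0, a0=1.0, b0=0.05, a1=1.0, b1=0.10))
```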

2.3.2 Independent decrement prior

Our second specification chooses a NLG prior for reliability at time t_0 and describes the reliability at time t_1 > t_0 as a fraction δ (between 0 and 1) of the reliability at time t_0, so that R(t_1) = δR(t_0). We specify a NLG prior distribution for R(t_0) and another NLG prior for δ, the fraction of reliability at time t_0 remaining by time t_1. We call this specification the independent decrement prior distribution. The independent decrement prior implies decreasing reliability over time. R(t_1) has an exact NLG distribution only if the NLG prior distributions for R(t_0) and δ have the same scale parameter. For other values of t, R(t) is not NLG distributed, but the quantiles of its distribution can be computed analytically. R(t_0) and δ are assumed independent, and their joint distribution is

\pi(R(t_0), \delta) = \frac{1}{\Gamma(\alpha_0)\beta_0^{\alpha_0}} [-\log(R(t_0))]^{\alpha_0 - 1} R(t_0)^{\frac{1}{\beta_0} - 1} \cdot \frac{1}{\Gamma(\alpha_\delta)\beta_\delta^{\alpha_\delta}} [-\log(\delta)]^{\alpha_\delta - 1} \delta^{\frac{1}{\beta_\delta} - 1}.

Since δ = R(t_1)/R(t_0), with γ = (α_0, α_δ, β_0, β_δ), we can derive the joint prior distribution for (R(t_0), R(t_1)) as

f(R(t_0), R(t_1) \mid \gamma) = C(\gamma) \, R(t_0)^{\left( \frac{1}{\beta_0} - \frac{1}{\beta_\delta} \right) - 1} [-\log(R(t_0))]^{\alpha_0 - 1} \, R(t_1)^{\frac{1}{\beta_\delta} - 1} \, [-\log(R(t_1)) + \log(R(t_0))]^{\alpha_\delta - 1} \, I(R(t_1) < R(t_0)),

where C(γ) is the normalizing constant depending on α_0, α_δ, β_0, β_δ. Let Y_0 = logit(R(t_0)) and Y_1 = logit(R(t_1)) = logit(δR(t_0)). The joint distribution of Y_0 and Y_1 is

f(y_0, y_1 \mid \gamma) = C(\gamma) \exp(-y_0) \left( \frac{1}{1 + \exp(-y_0)} \right)^{\left( \frac{1}{\beta_0} - \frac{1}{\beta_\delta} \right) + 1} [\log(1 + \exp(-y_0))]^{\alpha_0 - 1} \exp(-y_1) \left( 1 + \exp(-y_1) \right)^{-\frac{1}{\beta_\delta} - 1} \left[ -\log(1 + \exp(-y_0)) + \log(1 + \exp(-y_1)) \right]^{\alpha_\delta - 1} I(y_1 < y_0).

Let Y(t) = logit(R(t)) = \log\left( \frac{R(t)}{1 - R(t)} \right) = \frac{(t - t_0)Y_1 + (t_1 - t)Y_0}{t_1 - t_0}.

Using convolution, the joint distribution of Y(t) and Y_0 can be derived from the joint distribution of Y_0 and Y_1 as

f(y_0, y(t) \mid \gamma) = C(\gamma) \exp(-y_0) \left( \frac{1}{1 + \exp(-y_0)} \right)^{\left( \frac{1}{\beta_0} - \frac{1}{\beta_\delta} \right) + 1} [\log(1 + \exp(-y_0))]^{\alpha_0 - 1} \exp\left( -\frac{(t_1 - t_0)y(t) - (t_1 - t)y_0}{t - t_0} \right) \left( 1 + \exp\left( -\frac{(t_1 - t_0)y(t) - (t_1 - t)y_0}{t - t_0} \right) \right)^{-\frac{1}{\beta_\delta} - 1} \left[ -\log(1 + \exp(-y_0)) + \log\left( 1 + \exp\left( -\frac{(t_1 - t_0)y(t) - (t_1 - t)y_0}{t - t_0} \right) \right) \right]^{\alpha_\delta - 1} \frac{t_1 - t_0}{t - t_0} \, I(y(t) < y_0).   (2.4)

The marginal distribution of Y(t) is obtained by integrating out Y_0 in Equation 2.4. Then if y_p(t) denotes the pth quantile of Y(t), we have

\int_{-\infty}^{y_p(t)} f(y(t) \mid \gamma) \, dy(t) = \int_{-\infty}^{y_p(t)} \int_{-\infty}^{\infty} f(y(t), y_0 \mid \gamma) \, dy_0 \, dy(t) = \int_{-\infty}^{\infty} \int_{-\infty}^{y_p(t)} f(y(t), y_0 \mid \gamma) \, dy(t) \, dy_0 = p.

Using the change of variables v_0 = \log(1 + \exp(-y_0)) and letting

V(v_0, y_p(t)) = \log\left( \frac{1 + \exp\left( -\frac{(t_1 - t_0)y_p(t) - (t_1 - t)y_0}{t - t_0} \right)}{1 + \exp(-y_0)} \right),   (2.5)

we then have

\int_{-\infty}^{\infty} \int_{-\infty}^{y_p(t)} f(y(t), y_0 \mid \gamma) \, dy(t) \, dy_0 = \int_0^{\infty} \text{dgamma}(v_0, \alpha_0, \beta_0) \left( 1 - \text{pgamma}\left( V(v_0, y_p(t)), \alpha_\delta, \beta_\delta \right) \right) dv_0.   (2.6)

Equation 2.6 is the distribution function for R(t), expressed as a function of y_p(t). Note that if r_p(t) is the pth quantile of R(t), then logit(r_p(t)) is also the pth quantile of Y(t). We can write Equation 2.6 as a function of reliability at time t:

\int_0^{\infty} \text{dgamma}(v_0, \alpha_0, \beta_0) \left( 1 - \text{pgamma}\left( V(v_0, \text{logit}(r_p(t))), \alpha_\delta, \beta_\delta \right) \right) dv_0.   (2.7)

Equation 2.7 can be used to compute quantiles of the distribution of R(t). Equations 2.3 and 2.7 are useful in the quantile-matching approach we use to obtain prior distributions from elicited prior quantiles. These derivations are discussed in more detail in Appendix A.
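To make the independent decrement construction concrete, here is a small Monte Carlo sketch (with purely illustrative NLG parameters) that draws R(t_0) and the decrement δ, converts the two end points to (a, b) on the logit scale, and looks at the implied prior on reliability at a later time.

```python
import numpy as np
from scipy.special import expit, logit

rng = np.random.default_rng(1)

def rnlg(size, alpha, beta):
    """Draws from NLG(alpha, beta): R = exp(-Z) with Z ~ Gamma(alpha, scale=beta)."""
    return np.exp(-rng.gamma(alpha, beta, size=size))

t0, t1 = 0.0, 50.0
# Illustrative prior parameters (not the elicited values used in the dissertation).
r0 = rnlg(10_000, alpha=1.0, beta=0.02)       # R(t0)
delta = rnlg(10_000, alpha=0.05, beta=0.5)    # decrement: R(t1) = delta * R(t0)
r1 = delta * r0

# Convert the two end points to intercept and slope on the logit scale.
y0, y1 = logit(r0), logit(r1)
b = (y1 - y0) / (t1 - t0)
a = y0 - b * t0

# Implied prior on reliability at a later time, e.g. t = 200.
r200 = expit(a + b * 200.0)
print(np.percentile(r200, [5, 50, 95]))
```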

2.4 Motivating Example Revisited

In Section 2.1, we considered the analysis of a set of missile system data. In this section, we revisit this example and discuss results using three prior specifications:

Prior 1: logit(R_i(t)) = a_i + b_i t for each component i, with a_i ∼ Normal(0, 1000²) and b_i ∼ Normal(0, 1000²).

Prior 2: logit(R_i(t)) = a_i + b_i t for each component i, with a_i ∼ Normal(0, 1000²) and b_i ∼ Normal(0, 10²).

Prior 3: logit(R_i(t)) = a_i + b_i t for each component i, with an independent decrement prior. The 5th percentile of the prior for R_i(0) is 0.96, the 95th percentile of R_i(0) is 0.999, the 5th percentile of R_i(50) is 0.95, and the 5th percentile of R_i(200) is 0.90. This implies α_0 = 1.143, β_0 = 0.012, α_δ = 0.014, and β_δ = 0.439, where R(50) = δR(0).

The parameters for the prior of R_i(0) are easily obtained given two different quantiles of the distribution (see Cook (2010), Martz and Waller (1982), and Chapter 3). Estimation of the parameters for the prior distribution of δ_i requires the use of Equation 2.6. Plots of Prior 3 for a single component and the system are given in Figure 2.3. Unlike Priors 1 and 2, Prior 3 does not induce a distribution that has most of its mass concentrated near 0 and 1.

We have already discussed the posterior distributions for Component 2 using Priors 1 and 2 in Section 2.1 (see Figures 2.4a and 2.4b). Consider the logit transformation of Component 2 reliability. Prior 1 (Figure 2.5a) shows the large uncertainty present in the posterior distribution on the logit-transformed scale, characterized by extremely large positive slopes. Prior 2 (Figure 2.5b) does a slightly better job of constraining the slopes and the values of y(t), but the values of y(t) are still substantially large relative to the range (−6, 6). Values of y(t) outside of the interval (−6, 6) are essentially mapped to 0 and 1 on the reliability scale. Therefore, the large uncertainty in the posterior of Y(t) is not reflected in the uncertainty on the reliability scale. Understanding the distribution of the slope and intercept is key to understanding the behavior of prior and posterior distributions on the reliability scale. Prior 3 puts a sharp constraint on the values of the intercept (a), slope (b), and y(t) (Figure 2.6a), which yields a very different posterior distribution (Figure 2.4c).

Figure 2.3: Component and system priors as induced by Prior 3. (a) Prior 3 for a single component; (b) Prior 3 for the system; (c) Prior 3 component reliability distribution at age 0 (R_1(0)); (d) Prior 3 system reliability distribution at age 0 (R_0(0)).

Figure 2.4: Posterior distributions for Component 2 reliability using three priors. (a) Prior 1; (b) Prior 2; (c) Prior 3.

Figure 2.5: Posterior of Component 1 and Component 2, logit transform of reliability, using Prior 1 and Prior 2. (a) Component 2 posterior summaries (Prior 1); (b) Component 2 posterior summaries (Prior 2); (c) Component 1 posterior summaries (Prior 1).

Figure 2.6: Posterior of Component 1 and Component 2, logit transform of reliability, using Prior 3. (a) Component 2 logit-of-reliability posterior summaries (Prior 3); (b) Component 1 logit-of-reliability posterior summaries (Prior 3).

The posteriors of Component 1 reliability under Priors 1 and 2 were also discussed in Section 2.1 (see Figures 2.7a and 2.7b). On the logit scale (Figure 2.5c), we observe the large negative slope. The lack of failures at time 0 implies reliability close to 1 at time 0, and extremely large values of the intercept a. Recall that b = logit(R_1(0)δ_1) − logit(R_1(0)), where a = logit(R_1(0)). Since we observed 2 failures at a later time, this implies the presence of some trend in reliability, and the posterior of δ_1 assigns moderate weight to values of δ_1 well below 1. The logit function has a large slope for values of reliability in the neighborhood of 1. This means that b = logit(R_1(0)δ_1) − logit(R_1(0)) is extremely large in magnitude if R_1(0) is extremely close to 1 and δ_1 is moderate. Prior 3, on the other hand, puts a constraint on the reliability at time 0, which has the effect of excluding values of reliability at time 0 very close to 1. This prevents large slopes (Figure 2.6b) and the rapidly dropping lower bound of the 95% posterior credible interval (Figure 2.7c).

Figure 2.7: Posterior distributions for Component 1 reliability using three priors. (a) Prior 1; (b) Prior 2; (c) Prior 3.

Figure 2.8: Posterior distributions for system reliability using three priors. (a) Prior 1; (b) Prior 2; (c) Prior 3.

Despite the differences illustrated between Component 1 and Component 2 posterior reliability distributions for the three priors considered, the posterior distributions of system reliability are very similar (Figure 2.8). In fact, the posterior distributions for Components 3, 4, 5, and 6 are very similar under the three priors considered. For each of these components, we observed at least one failure at time 0 and a considerable number of failures at later times (see Table 2.1). These four components account for most of the trend observed in the system reliability over time, which explains why the posterior distribution for system reliability is similar for all three priors.

CHAPTER 3. Specifying Negative Log-Gamma Prior Distributions

3.1 Another Motivating Example

Consider a system with four independent components in series, where binary data are collected at three separate time points for each of the four components. The data are reported in Table 3.1. The primary interest is the estimation of component and system reliability over the range of time where test data were collected. In addition, however, extrapolation of both component and system reliability to later times may be of interest.

Table 3.1: Component test data. Data are formatted as number of successful tests/number of tests.

               t = 0.0    t = 0.5    t = 1.0
Component 1    100/100    100/100    100/100
Component 2    100/100    100/100    100/100
Component 3    100/100    100/100     99/100
Component 4     99/100     98/100     97/100

Let X_ijk denote the kth test result on the ith component at time t_j (j = 0, 1, 2), where i = 1, ..., 4, t_0 = 0, t_1 = 0.5, t_2 = 1, and k = 1, 2, ..., 100. We assume X_ijk ∼ Bernoulli(R_i(t_j)) and that logit(R_i(t)) = a_i + b_i t, for i = 1, 2, ..., 4. The system reliability at time t_j is then equal to R_0(t_j) = \prod_{i=1}^{4} R_i(t_j). The data in Table 3.1 have a binomial likelihood proportional to

l(a, b) \propto \prod_{i=1}^{4} \prod_{j=0}^{2} (R_i(t_j))^{x_{ij}} (1 - R_i(t_j))^{100 - x_{ij}} = \prod_{i=1}^{4} \prod_{j=0}^{2} \left( \frac{1}{1 + \exp(-(a_i + b_i t_j))} \right)^{x_{ij}} \left( 1 - \frac{1}{1 + \exp(-(a_i + b_i t_j))} \right)^{100 - x_{ij}},

where x_{ij} = \sum_{k} x_{ijk} is the number of successes for Component i at time t_j, a = (a_1, a_2, a_3, a_4), and b = (b_1, b_2, b_3, b_4).
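A sketch of this likelihood in code, using the counts from Table 3.1; the function and variable names are illustrative, and scipy's expit plays the role of the inverse-logit link.

```python
import numpy as np
from scipy.special import expit
from scipy.stats import binom

# Successes out of 100 tests for Components 1-4 at times t = 0.0, 0.5, 1.0 (Table 3.1).
successes = np.array([[100, 100, 100],
                      [100, 100, 100],
                      [100, 100,  99],
                      [ 99,  98,  97]])
n_tests = 100
times = np.array([0.0, 0.5, 1.0])

def log_likelihood(a, b):
    """Binomial log-likelihood for logit(R_i(t)) = a_i + b_i * t, i = 1, ..., 4."""
    a = np.asarray(a, dtype=float)[:, None]  # shape (4, 1)
    b = np.asarray(b, dtype=float)[:, None]
    R = expit(a + b * times[None, :])        # component reliabilities, shape (4, 3)
    return np.sum(binom.logpmf(successes, n_tests, R))

# Example evaluation at a reliability of about 0.98 for every component and time.
print(log_likelihood(a=[4.0, 4.0, 4.0, 4.0], b=[0.0, 0.0, 0.0, 0.0]))
```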

In the Bayesian paradigm, the likelihood and prior are combined to obtain the posterior distribution for the unknown parameters. Inference is then based on the resulting posterior distribution. Unfortunately, informative priors on the vectors of parameters a and b are difficult to obtain, as the parameters do not have an easy interpretation. However, specifying flat priors on a and b, especially when no or few failures are observed, can lead to two issues.

1. Issue 1: Inappropriate credible intervals. There are two ways that this issue presents. First, the width of the posterior credible interval for reliability increases rapidly at times shortly after we stop observing data; the bounds of the credible interval are at (0, 1). Second, there is no uncertainty reflected in the posterior credible intervals; both bounds are at 1, or both bounds are at 0. These intervals are problematic if extrapolation is of interest, and they may impact estimation as well.

2. Issue 2: Posterior inference is sensitive to small changes in the data. Inference about reliability changes greatly when moving from a case with no failures to a case with one failure. Extreme sensitivity of inference to small changes in data may cast serious doubt on the validity of the inference.

We examine these issues by considering priors specified on the reliability scale. NLG distributions can be used to formulate prior distributions on component and system reliabilities (see Mastran (1976), Waller et al. (1977), Mastran and Singpurwalla (1978), Martz and Waller (1982), Allella et al. (2001), and Allella et al. (2005)). In most of these references, the NLG arises when lifetime data are obtained about the component or system. Few authors have considered using NLG priors to directly model component and system reliability. Parker (1972) uses an NLG prior for component reliability, assuming a Uniform(0, 1) prior distribution on the reliability of a series system. The remainder of this section illustrates undesirable behaviors of the posterior distribution when the system has Uniform(0, 1) distributions independently at t_0 and t_1; Section 3.2 describes alternative methods for selecting independent end points prior distributions; Section 3.3 describes alternative methods for selecting independent decrement prior distributions; Section 3.4 summarizes results and draws conclusions.