Modeling Rate of Occurrence of Failures with Log-Gaussian Process Models: A Case Study for Prognosis and Health Management of a Fleet of Vehicles

Similar documents
Reliability Monitoring Using Log Gaussian Process Regression

Gaussian processes. Chuong B. Do (updated by Honglak Lee) November 22, 2008

Neutron inverse kinetics via Gaussian Processes

STAT 518 Intro Student Presentation

Nonparametric Bayesian Methods (Gaussian Processes)

Nonparameteric Regression:

STA414/2104. Lecture 11: Gaussian Processes. Department of Statistics

Gaussian Process Regression

Optimization of Gaussian Process Hyperparameters using Rprop

Gaussian process for nonstationary time series prediction

ABSTRACT INTRODUCTION

GWAS V: Gaussian processes

PILCO: A Model-Based and Data-Efficient Approach to Policy Search

Learning Gaussian Process Models from Uncertain Data

STA 4273H: Sta-s-cal Machine Learning

Variational Principal Components

Relevance Vector Machines for Earthquake Response Spectra

Gaussian Processes (10/16/13)

Introduction. Chapter 1

Recent Advances in Bayesian Inference Techniques

Pattern Recognition and Machine Learning

Nonparmeteric Bayes & Gaussian Processes. Baback Moghaddam Machine Learning Group

Probabilistic & Unsupervised Learning

Probabilistic Regression Using Basis Function Models

A Data-driven Approach for Remaining Useful Life Prediction of Critical Components

COMP 551 Applied Machine Learning Lecture 20: Gaussian processes

STA 4273H: Statistical Machine Learning

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1

An Introduction to Gaussian Processes for Spatial Data (Predictions!)

Efficient Resonant Frequency Modeling for Dual-Band Microstrip Antennas by Gaussian Process Regression

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Pattern Recognition and Machine Learning. Bishop Chapter 2: Probability Distributions

Gaussian Process for Internal Model Control

Computer Vision Group Prof. Daniel Cremers. 4. Gaussian Processes - Regression

A Process over all Stationary Covariance Kernels

Assessing Reliability Using Developmental and Operational Test Data

Machine Learning Srihari. Gaussian Processes. Sargur Srihari

Computer Vision Group Prof. Daniel Cremers. 2. Regression (cont.)

STA414/2104 Statistical Methods for Machine Learning II

Gaussian Processes: We demand rigorously defined areas of uncertainty and doubt

Lecture 5: GPs and Streaming regression

Statistical Techniques in Robotics (16-831, F12) Lecture#21 (Monday November 12) Gaussian Processes

Computer Vision Group Prof. Daniel Cremers. 9. Gaussian Processes - Regression

Statistical Techniques in Robotics (16-831, F12) Lecture#20 (Monday November 12) Gaussian Processes

Gaussian Process Regression Model in Spatial Logistic Regression

Universität Potsdam Institut für Informatik Lehrstuhl Maschinelles Lernen. Bayesian Learning. Tobias Scheffer, Niels Landwehr

Heterogeneous mixture-of-experts for fusion of locally valid knowledge-based submodels

Gaussian Processes in Machine Learning

Tokamak profile database construction incorporating Gaussian process regression

Regression with Input-Dependent Noise: A Bayesian Treatment

STA 4273H: Statistical Machine Learning

An Integral Measure of Aging/Rejuvenation for Repairable and Non-repairable Systems

CSC2541 Lecture 2 Bayesian Occam s Razor and Gaussian Processes

COPYRIGHTED MATERIAL CONTENTS. Preface Preface to the First Edition

Model Based Clustering of Count Processes Data

Introduction to Gaussian Processes

The Variational Gaussian Approximation Revisited

BAYESIAN DECISION THEORY

Modeling human function learning with Gaussian processes

Random Field Models for Applications in Computer Vision

Advanced analysis and modelling tools for spatial environmental data. Case study: indoor radon data in Switzerland

Bayesian Inference of Noise Levels in Regression

A graph contains a set of nodes (vertices) connected by links (edges or arcs)

Introduction to emulators - the what, the when, the why

Bayesian Machine Learning

STA 4273H: Statistical Machine Learning

Probabilistic Graphical Models Lecture 20: Gaussian Processes

component risk analysis

Machine Learning. Bayesian Regression & Classification. Marc Toussaint U Stuttgart

Confidence Estimation Methods for Neural Networks: A Practical Comparison

Introduction to machine learning and pattern recognition Lecture 2 Coryn Bailer-Jones

Evaluating the value of structural heath monitoring with longitudinal performance indicators and hazard functions using Bayesian dynamic predictions

Introduction to Gaussian Process

Subject CS1 Actuarial Statistics 1 Core Principles

Prediction of Data with help of the Gaussian Process Method

Bayesian Linear Regression. Sargur Srihari

Bayesian Inference: Principles and Practice 3. Sparse Bayesian Models and the Relevance Vector Machine

ECE521 week 3: 23/26 January 2017

Effect of Characterization Test Matrix on Design Errors: Repetition or Exploration?

CSC2515 Winter 2015 Introduction to Machine Learning. Lecture 2: Linear regression

Why experimenters should not randomize, and what they should do instead

These slides follow closely the (English) course textbook Pattern Recognition and Machine Learning by Christopher Bishop

Gaussian with mean ( µ ) and standard deviation ( σ)

Hierarchical sparse Bayesian learning for structural health monitoring. with incomplete modal data

Prediction of double gene knockout measurements

Failure Prognostics with Missing Data Using Extended Kalman Filter

Optimum Test Plan for 3-Step, Step-Stress Accelerated Life Tests

Model Selection for Gaussian Processes

Machine Learning Lecture 2

Bayesian Knowledge Fusion in Prognostics and Health Management A Case Study

Linear Dynamical Systems

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of

CS 540: Machine Learning Lecture 1: Introduction

Using Gaussian Processes for Variance Reduction in Policy Gradient Algorithms *

Discrete Simulation of Power Law Noise

Introduction to Graphical Models

Maximum Direction to Geometric Mean Spectral Response Ratios using the Relevance Vector Machine

Gaussian Processes. 1 What problems can be solved by Gaussian Processes?

Transcription:

International Journal of Performability Engineering, Vol. 9, No. 6, November 2013, pp. 701-713. RAMS Consultants Printed in India Modeling Rate of Occurrence of Failures with Log-Gaussian Process Models: A Case Study for Prognosis and Health Management of a Fleet of Vehicles MARTIN WAYNE and MOHAMMAD MODARRES University of Maryland, Center for Risk and Reliability, Department of Mechanical Engineering College Park, MD 20742, U.S.A. 1. Introduction (Received on May 03, 2013, Revised on May 05, and June 10, 2013) Abstract: Gaussian Process Regression (GPR) is a flexible non-parametric regression technique that provides an alternative solution to the model selection problem commonly seen in parametric models. GPR models are able to easily model complex non-linear relationships that are often present in rate of occurrence of failure data. The predictive distributions that result from the regression models are then used to provide valuable insights into the behavior of the data. An example of the application of this approach has been demonstrated for modeling the rate of occurrence of failure of a fleet of vehicles on a monthly basis. The Log-GPR model applied in this context is useful for detecting significant reliability problems that may occur over time. In order to assess the generalization capability of the model to accurately represent a test data set, a goodness of fit test is also presented. Keywords: Gaussian process, rate of occurrence of failure prediction, time-dependent rate of occurrence of failure, Bayesian analysis, rate of occurrence of failure of vehicles Predicting and monitoring the reliability of equipment has been a topic of great interest for many years throughout a variety of industries. It is of particular importance for fleets of complex repairable systems. High or suddenly increased rates of occurrence of failures within the fleet can have a number of negative consequences, including reduced performance and increased operating costs. Reliability monitoring in this way can also contribute to the establishment of preventative maintenance policies that may help to increase the reliability of the fleet. Reliability monitoring in this context involves examining the occurrence of failures over time for the entire fleet of vehicles, which reduces to a time-series problem. The problem is often complicated by the complexities of the fleet itself. The vehicles can see different amounts of usage throughout the year, and they are also potentially spread over large geographic areas. This exposes the individual vehicles to a wide variety of environmental conditions and stresses, which serve to introduce complex non-linear relationships into the data. These non-linear relationships are often difficult to model using standard parametric techniques, and this is the main benefit of the flexible modeling technique known as Log-Gaussian Process Regression (Log-GPR) presented here. Standard GPR is a nonparametric alternative to traditional Bayesian regression that can easily model the complex non-linear relationships present in many real world settings. The technique provides a straightforward approach for modeling dependencies within the data, such as those that would be found with a complex time-series. The Log-GPR approach used here applies the standard GPR approach to the Log-transformed data to * Corresponding author s email: modarres@umd.edu 701

702 Martin Wayne, and Mohammad Modarres develop the model. The flexibility of GPR (and hence Log-GPR) makes it extremely useful in this context. Past work in this area has often involved parametric modeling of probabilistic processes. A negative binomial model using constrained likelihood estimates was proposed in [1] and [2]. Mining equipment reliability was modeled in [3] and [4], first using a combination of graphical and statistical methods and then extending to a power law process model. A modified Duane model applying reliability decay was used in [5]. Statistical process control methods have also been a consistent approach in this area. Cumulative sum control chart procedures for maintenance data were developed in [6], while more general control chart procedures for Exponential and Weibull time between failures were presented in [7]. An application of this procedure to the Exponential process was provided in [8]. Other work in process control modeling has used Shewhart charts while assuming a Markov process to model a number of possible out-of-control states. Cost and maintenance are considered jointly under this framework in [9], and a variable parameter control chart is developed in [10]. More recent work has also examined the impacts of general deterioration while developing an integrated process control and preventative maintenance approach [11], and a progressive mean control chart approach is presented in [12]. The use of GPR itself has grown in recent years due to advances in computational capabilities, but it has basic theoretical origins that date to Weiner and Kolmogorov in the 1940 s. GPR has also been widely used in geostatistics since the 1970 s, where it is more commonly known as kriging [13]. GPR can be shown as a limiting form of a large class of Artificial Neural Network (ANN) models [14]. It is also a more general form of both the Relevance Vector Machine (RVM) found in the machine learning literature [15],[16] and the more traditional Auto Regressive Moving Average (ARMA) model [17]. GPR has also seen recent applications involving complex response functions that may otherwise be difficult to model. Computer simulations are increasingly used as a substitute for testing and experimentation, and the output from such simulations is modeled with GPR in [18]. GPR has also been used in [19] and [20] to develop surrogate models for complex component response functions in the context of system reliability analysis. This paper is organized as follows. Section 2 introduces Log-GPR in the context of modeling and monitoring the reliability of a fleet of vehicles over time. A goodness of fit test for model assessment is presented in Section 3, and conclusions are provided in Section 4. 2. Gaussian Process Regression for Fleet Reliability Monitoring We illustrate the application of reliability monitoring using Log-GPR by examining the monthly number of failures that are observed for a fleet of similar vehicles. The vehicles are spread over two different locations, and two different jobs are performed at each location. The odometer reading and location of each vehicle are both recorded, along with the specific model of the vehicle (e.g., specific drivetrain, components, etc.) and the job type (e.g., long trips with heavy loads, short trips, etc.). The number of failures is impacted by differences in usage and fleet size, and we control for this by normalizing with respect to the monthly miles accumulated. The data are therefore rate of occurrence of failures with units of number of failures per mile. Data are collected for the entire fleet, and the overall rate of occurrence of failure is found by dividing the total number of failures by the total number of miles driven during the month.

Modeling Rate of Occurrence of Failures with Log-Gaussian Process Models: A Case Study 703 for Prognosis and Health Management of a Fleet of Vehicles Analyzing data at this level can be useful for indicating any unforeseen issues that may be occurring in the fleet, such as parts wear-out or a defective lot of replacement parts. It can provide insights into any gradual changes in the rate of occurrence of failure that may occur over time, which may prove useful for fleet management. Log-GPR modeling can also be used as a means of examining the reliability variation that may exist within the fleet for a given month. Detailed inputs for each vehicle, such as the odometer reading, location, and job associated with each can be used to develop an additional Log-GPR model. This can provide additional information on the cause of a specific problem, which allows fleet managers to direct the appropriate resources and address the problem in a more efficient manner. 2.1 Log-GPR Approach To motivate development of the Log-GPR approach used to model the rate of occurrence of failures, we first present the standard GPR approach. These models are powerful methods that provide an extremely flexible framework for regression. The underlying problem assumes a set of samples where each of the inputs X i are associated with the target values Y i. The problem is then to infer the function f such that Y = f(x). To apply GPR to rate of occurrence of failures, it is useful to consider Gaussian Processes as a collection of random variables that are multivariate normally distributed. This can be described more formally as: (1) where the target data vector Y follows a multivariate normal distribution with mean zero and covariance matrix K(X,X). The covariance matrix is defined element-wise by the kernel function k(x,x ) for any two points x and x in the input vector X. The kernel function can be defined by any function that meets the necessary properties for a covariance function. Further details on these properties can be found in [15] and [21]. In the context of the covariance matrix K, the properties can be reduced to the sufficient condition that the eigenvalues of the resulting matrix are positive. From the definition in Eq. (1), the joint distribution of the observed Y values and any future values of interest represented by Y* is given by, (2) where the X values are the inputs associated with the observed target Y s and the X* are the inputs associated with the Y* values of interest. The problem is then to estimate the Y* given the values of X, Y, and X*. Standard results (see for example Appendix A.2 in [15]) then provide the following predictive distributional result. (3) Under squared error loss the best estimate for Y* is then given by the mean of the predictive distribution Note that specification of the kernel function completely determines the model and allows for predictions to be made with simple matrix algebra. The kernel function itself is (4)

704 Martin Wayne, and Mohammad Modarres specified with numerous potential hyperparameters, which can be determined through optimization techniques such as maximum likelihood and cross validation. The computational complexity lies with computing the required matrix inverse in Eqs. (3) and (4), and this can be easily handled with techniques such as Cholesky decomposition. It is also worth mentioning that the mean of the model in Eqs. (1) and (2) is assumed to be zero for simplicity. This is not necessarily a limitation of the technique though, as the predicted result in Eq. (4) is not zero. The GPR model can also be easily extended to handle non-zero mean functions (see Section 2.7 in [15] for further details and discussion). When the data are strictly positive as in the case of the rate of occurrence of failures, the assumption of normality in Eqs. (1) and (2) is violated. A log-transform can be used to map the data into the appropriate range, and the underlying GP model is now a Log-GP model given by (5) From Eq. (5), standard results for lognormal distributions provide the mean predicted value as where μ and Σ are the mean and covariance of the normal distribution for the transformed data from Eqs. (3) and (4). The corresponding (1-α )% probability intervals can then be found from ( exp[ µ Σ z 1 α / 2], exp[ µ + Σ z1 α / 2] ) (7) where the z 1-α/2 value is the corresponding standard normal percentile. 2.2 Reliability Monitoring with Log-GPR Model A general step-by-step approach for applying Log-GPR is as follows: 1. Segment data into training/test sets for later goodness-of-fit testing. 2. Examine data for any trends that may exist, and select potential kernel function based on observed trends and additional information regarding data (e.g., noise). 3. Determine hyperparameter estimates using training data and desired technique (e.g., maximum likelihood, cross validation). 4. Compute model predictions using training data. 5. Examine model performance using goodness-of-fit test with test data. To enable later tests for model assessment, the rate of occurrence of failures over a period of 41 months were partitioned into 30 training data points and 11 randomly chosen test data points. The training data points were used to develop the Log-GPR model described in Eqs. (5)-(7), with the X inputs being the month number and the Log(Y) being the log of the fleet rate of occurrence of failure each month. The test data were held out of the initial model and applied in goodness of fit tests presented in Section 3. When choosing the kernel function to describe the covariance matrix for the data, we followed the approach discussed in [15]. The data were examined for any indications of the appropriate kernel functions, and choices were made based on the observed properties. For the monthly rate of occurrence of failure data a possible periodic trend was noted in the data, leading to a periodic kernel function component. A smoothed model that could handle the noise present within the data was also desired, leading to the addition of squared exponential (6)

Modeling Rate of Occurrence of Failures with Log-Gaussian Process Models: A Case Study 705 for Prognosis and Health Management of a Fleet of Vehicles and noise components. The resulting kernel function is shown in Eq. (8), and it can be seen to consist of the sum of squared exponential, periodic, and general noise components. The delta function in (8) is defined as (8). (9) The θ values in Eq. (8) are hyperparameters that can be found using cross validation or maximum likelihood techniques [15], similar to parameter estimation in a traditional parametric model. The estimated values of the hyperparameters in Table 1 were computed using Matlab code in GPML [22] with conjugate gradient optimization on the marginal likelihood. Note that approximate confidence intervals on the estimated hyperparameters could be calculated using Fisher Information to provide additional insight into the model uncertainty if desired. The resulting rate of occurrence of failure model for the data is shown in Figure 1. The solid line is the predicted value, and the dashed lines indicate the 95% probability interval associated with the prediction. The training data points that were used to develop the model are shown as circles, and the + symbols represent test data points that were withheld during the model development. The plot shows that the model appears to perform reasonably well at predicting the test data points, and formal statistical tests are presented in Section 3. Table 1: Hyperparameter Estimates for Covariance Function Parameter Value θ 1 4.38 θ 2 0.31 θ 3 0.13 θ 4 1.03 θ 5 0.26 σ n 0.12 The increasing trend that exists in the data from months 10 25 is due the use of the fleet in a harsher environment than in the earlier and later months. The predicted future behavior of the model is also shown for 24 months into the future, which can provide valuable information for maintenance planning for the fleet of vehicles being considered in the GPR model. The uncertainty can be seen to increase with further extrapolation beyond the range of the data, as is the case with regression in general.

706 Martin Wayne, and Mohammad Modarres Figure 1: Log-GPR Model for Monthly Rate of Occurrence of Failure To demonstrate an additional capability of the Log-GPR model for reliability monitoring, we use the previous model as a baseline and examine the rate of occurrence of failure for the following month. Figure 2 shows the following month s rate of occurrence of failure with respect to the model developed using the previous 41 months of data. The rate of occurrence of failure for the following month is plotted as an x. The resulting rate of occurrence of failure is above the 95% upper probability bound, which provides motivation to investigate the cause of the increased number of failures. Because of the geographic and usage variation that exists within the fleet, it is reasonable to investigate whether the increase in failures is due to a specific use or location, or possibly even age related. This investigation can be accomplished using Log-GPR on the detailed data that is also collected each month. Also note that the model would traditionally be updated in a continuous process monitoring approach, but this should only be considered after understanding the cause of the variation that is present in the new data.

Modeling Rate of Occurrence of Failures with Log-Gaussian Process Models: A Case Study 707 for Prognosis and Health Management of a Fleet of Vehicles Figure 2: Process Control Result for Monthly Rate of Occurrence of Failure Unlike the numbered inputs in the time series model previously developed, the input in this case will be a vector of information that describes each individual vehicle in the fleet for the month. The detailed inputs collected for each vehicle are a mix of quantitative and qualitative values consisting of the odometer reading, vehicle model, location, and job type. The vehicle model, location, and job type are qualitative values that require indicator variables for their specific classes. Standard methods for regression with qualitative variables can be applied in this context; a more in-depth discussion of modeling with quantitative variables can be found in many references, see for example chapter 11 in [23]. There are seven distinct possibilities for the vehicle model, along with two distinct possibilities for both the location and job type. This results in nine total values in the input vector. The odometer reading is a quantitative value and is therefore represented by a single input denoted as X 1. The vehicle model is represented by indicator variables X 2 through X 7, and location and job type are represented by indicator variables X 8 and X 9 respectively. The overall input vector X is then just X = [X 1 X 2 X 9 ], and the Log(Y) values are the observed log-rate of occurrence of failures for each vehicle during the month. There were 1392 vehicles in the fleet for the month being examined. 1192 training data points were used to build the model, with 200 randomly chosen test data points held out for goodness of fit assessment. The kernel function used in this case was a combination of a rational quadratic form with a noise term. The rational quadratic kernel corresponds to an infinite mixture of squared exponential kernels [15], making it an intuitive choice for smoothing with multi-dimensional inputs. It was also necessary to have a kernel that would allow for noise in the data, leading to the kernel function shown in Eq. (10). ( x, x ) 1 ( x x )' P ( x x ) θ11 i j i j 10 1 + + σ n 2θ 11 ( x x ) k i j = θ i, The delta function is again defined as j, (10)

708 Martin Wayne, and Mohammad Modarres (11) The matrix P is a diagonal 9-dimensional matrix with hyperparameters θ 1 through θ 9 along the diagonal and zeros elsewhere. Each hyperparameter in P corresponds to a specific term in the input vector, and these values can be chosen as before through crossvalidation or maximum likelihood approaches. An additional benefit of this kernel function is that it allows for automatic relevance determination, where the optimal values of the hyperparameters within P define the relevance of the contribution of the specific input within the overall model structure. Because the inverse of P is used, large hyperparameter values indicate little relevance to the model while small values indicate input covariates of higher importance to predicting the desired rate of occurrence of failure. The optimal values of the hyperparameters in Table 2 were again computed using Matlab code in GPML [22] with conjugate gradient optimization. Approximate confidence intervals on the estimated hyperparameters could again be calculated using Fisher Information to provide additional insight into the model uncertainty. Table 2: Hyperparameter Values for Covariance Function Parameter Value θ 1 2373.141 θ 2 10.609 θ 3 0.289 θ 4 0.743 θ 5 2.493 θ 6 0.616 θ 7 0.242 θ 8 0.386 θ 9 0.100 θ 10 3.503 θ 11 0.001 σ n 0.008 To examine the model s ability to provide reasonable predictions for general data inputs, the model results for the first 20 test data points are shown in Figure 3. The rate of occurrence of failures is plotted with respect to the vehicle number for display purposes. The x and dashed lines indicate the mean and 95% probability interval associated with the model prediction, with the actual test set values represented by the bars. The plot shows that the model appears to perform reasonably well at predicting the rate of occurrence of failure for the test data points, and formal statistical tests are presented in Section 3. It also shows that a small number of vehicles are experiencing higher rates of occurrence of failures relative to other vehicles in the fleet. Commonality between these vehicles, such as location or mileage can then be examined within the model to determine any underlying connections or causes. This can help to appropriately direct resources for better health management of the fleet.

Modeling Rate of Occurrence of Failures with Log-Gaussian Process Models: A Case Study 709 for Prognosis and Health Management of a Fleet of Vehicles Figure 3: Rate of Occurrence of Failure Test Data Result for Detailed Inputs To demonstrate this concept, Figure 4 shows the rate of occurrence of failure as a function of the odometer reading of the vehicle. The solid centerline indicates the mean rate of occurrence of failure, while the shaded area is the 95% probability interval. This relationship indicates the behavior of the rate of occurrence of failure with respect to the age of the vehicle while the vehicle model, location, and job type are held constant. This plot can generally be used for prognosis purposes, showing the success of any preventative maintenance programs. Successful programs should show no significant increases in the rate of occurrence of failure of the vehicle as it accumulates more miles. As shown in the results, the rate of occurrence of failure as a function of total mileage is indeed fairly constant, slightly increasing at mileages near 50,000 miles and then decreasing again near 75,000 miles. These results indicate that the preventative maintenance techniques are well within control for the vehicle. In Table 2 it is also worth noting that the value of the first hyperparameter corresponding to the odometer reading is large. There is an inverse relationship between the hyperparameter and input variable, which provides further evidence through automatic relevance determination [15] that there is little impact due to vehicle mileage. While aging does not appear to be a major problem within these data, the slight increase in the failure intensity that occurs near 50,000 miles is reason for further examination. This may be an early indication of some wear-related problems that could be mitigated through additional preventative maintenance procedures.

710 Martin Wayne, and Mohammad Modarres Figure 4: Log-GPR Results: Rate of Occurrence of Failure vs. Total Vehicle Mileage 3. Goodness of Fit for Model Generalization The kernel function is crucial in the regression problem. It generally describes the degree of similarity between any two input points, and it also completely determines the predictive results of the model. This is a somewhat intuitive property, in that it is reasonable to expect that when the inputs are also close to one another, the observed outputs are also close in some sense. It is therefore desirable to have some means of assessing the appropriateness of the chosen kernel function. Use of the likelihood function for model selection and assessment is a rigorous method for comparing multiple model forms under consideration [21], but it provides no indication of the overall ability of the model to adequately generalize and provide reasonable predictions. All model forms under consideration may simply be inadequate for describing the data, which may not be obvious when considering the likelihood alone. To overcome this limitation, a Goodness of Fit (GoF) procedure can be used to assess the generalization capability of the proposed GPR model. A drawback of this approach is the required segmentation of the available data into training and test sets, but it does offer an additional procedure for assessing the utility of the model in a general setting beyond that of the data that were used to develop the original model. Let z = [z 1 z 2 z n ] represent the nx1 dimensional log-transformed test data with predicted mean μ and covariance Σ. From the assumption of the Log-GPR model developed using the training data, the z i values are a set of correlated normally distributed random variables. We can decompose Σ into UΛU, where U is a matrix whose columns are the eigenvectors of Σ, Λ is a matrix with the eigenvalues of Σ as the diagonal and zeros elsewhere, and U is the transpose of U. The transform z* =U -1 z will then be normally distributed with expected value and covariance given by (12) (13)

Modeling Rate of Occurrence of Failures with Log-Gaussian Process Models: A Case Study 711 for Prognosis and Health Management of a Fleet of Vehicles z* is therefore a vector of independent normal random variables. For the i th term in the vector z, the quantity 1 ( U ) * z i µ i Λi, i will follow a chi-squared distribution with one degree of freedom, where (U -1 μ and Λ i,i are the corresponding mean and variance of the z * i respectively. Summing over all points in the test data set then yields the statistic n 1 ( U ) * z i µ i Λ i= 1 i, i which follows a chi-squared distribution with n degrees of freedom. Figure 5 shows a histogram of the distribution for 10,000 simulated values of the chi-squared statistic in Eq. (15) for 10-dimensional correlated multivariate normal random variables. The theoretical chi-squared distribution with 10 degrees of freedom is overlaid as a solid line. The agreement between the two distributions indicates the validity of the chi-squared test in this application. Computing this statistic for the test data allows for computation of a p- value. A small p-value will indicate that the test data do not agree well with the underlying model that has been developed from the training data. 2 2,, (14) (15) Figure 5: Distribution of Simulated Chi-Square Test Statistic For the monthly rate of occurrence of failure test data in Section 2.2, the computed chisquared test statistic was found to be 9.57, which yields a p-value of approximately 0.92. For the detailed vehicle test data, the chi-squared test statistic was computed to be 160.70, which yields a p-value of 0.98. This indicates that there is no evidence against either of the models developed from the training data. The models each do generalize extremely well; but as with any model of this type, caution should be used when extrapolating far beyond the ranges of the data that were used to develop the models.

712 Martin Wayne, and Mohammad Modarres 4. Conclusions A Log-GPR technique for modeling the rate of occurrence of failure of complex systems has been demonstrated in the context of a fleet of vehicles. The technique has been shown to be a powerful non-parametric method for prognosis and health management while handling complex dependencies and non-linear relationships within data. The model and associated predictions are completely determined through the specification of the kernel function, and the Bayesian framework of GPR provides a straightforward method for developing the results. A GoF procedure has also been presented for examining the generalization capability of the chosen model. Further work in the area of model selection would prove useful though. While the GoF procedure presented here has been shown to be useful for model assessment, more fully automated procedures for specifying the appropriate kernel function would make widespread implementation of the technique much more practical. References [1] Triner, D.A. The Assessment of Fleet Equipment Reliability, Reliability Engineering, 1986; 14(1):63-74. [2] Triner, D.A. Confidence Limits for Fleet Equipment Reliability, Reliability Engineering, 1986; 16(4):255-264. [3] Kumar, U., B. Klefsjo, and S. Granholm. Reliability Investigation for a Fleet of Load-Haul-Dump Machines in a Swedish Mine, Reliability Engineering and System Safety, 1989; 26(4):341-361. [4] Kumar, U. and B. Klefsjo. Reliability Analysis of Hydraulic Systems of LHD Using the Power Law Process Model, Reliability Engineering and System Safety, 1992; 35(3): 217-224. [5] Ritchie, P. Modeling and Monitoring the Decay of Equipment Reliability, Proceedings of the Annual RAM Symposium, 1991:312-316. [6] English, C. In-Service Reliability Estimates from Maintenance Data, Reliability Engineering and System Safety, 1988; 21(3):163-173. [7] Xie, M., T.N. Goh, and P. Ranjan. Some Effective Control Chart Procedures for Reliability Monitoring, Reliability Engineering and System Safety, 2002; 77(2):143-150. [8] Wu, C., C. Yang, and M. Shu. Quality Technology and Quantitative Management Reliability Monitoring and Performance Measuring for the Exponential Failure Process, Second Int. Conf. on Innovative Computing, Information and Control, 2007:77. [9] Tagaras, G. An Integrated Cost Model for the Joint Optimization of Process Control and Maintenance, Journal of the Operational Research Society, 1988; 39(8):757-766. [10] Panagiotidou, S. and G. Nenes. An Economically Designed, Integrated Quality and Maintenance Model Using an Adaptive Shewhart Chart, Reliability Engineering and System Safety, 2009; 94(3):732-741. [11] Panagiotidou, S. and G. Tagaras. Optimal Integrated Process Control and Maintenance Under General Deterioration, Reliability Engineering and System Safety, 2012; 104:58-70. [12] Abbas, N., R. Zafar, M. Riaz, and Z. Hussain. Progressive Mean Control Chart for Monitoring Process Location Parameter, Quality & Reliability Engineering International, 2013; 29(3):357-367.

Modeling Rate of Occurrence of Failures with Log-Gaussian Process Models: A Case Study 713 for Prognosis and Health Management of a Fleet of Vehicles [13] Matheron, G. The Intrinsic Random Functions and Their Applications, Advances in Applied Probability, 1973; 5:439-468. [14] Neal, R.M. Bayesian Learning via Stochastic Dynamics. Fifth Neural Information Processing Systems Conference, 1993:475-482. [15] Rasmussen, C. and C. Williams. Gaussian Processes for Machine Learning. MIT Press, Cambridge, MA, 2006. [16] Tipping, M. Sparse Bayesian Learning and the Relevance Vector Machine. Journal of Machine Learning Research, 2001; 1:211-244. [17] Williams, C. and C. Rasmussen. Gaussian Processes for Regression. Advances in Neural Information Processing Systems, 1996; 8:514-520. [18] Kennedy, M., C. Anderson, S. Conti, and A. O'Hagan. Case Studies in Gaussian Process Modeling of Computer Codes, Reliability Engineering and System Safety, 2006; 91(10):1301-1309. [19] Bichon, B, M. Eldred, L. Swiler, S. Mahadevan, and J. Mcfarland. Efficient Global Reliability Analysis for Nonlinear Implicit Performance Functions, AIA Journal, 2008; 46(10):2459-2468. [20] Bichon, B., J. McFarland, and S. Mahadevan. Efficient Surrogate Models for Reliability Analysis of Systems with Multiple Failure Modes, Reliability Engineering and System Safety, 2011; 96(10):1386-1395. [21] Bishop, C. Pattern Recognition and Machine Learning. Springer, New York, NY, 2006. [22] GPML, http://www.gaussianprocess.org/gpml/code/matlab/doc/index.html. [23] Neter, J., M. Kutner, W. Wasserman, and C. Nachtsheim. Applied Linear Statistical Models. WCB McGraw-Hill, Boston, MA, 1996. Martin Wayne works for the U.S. Army Material Systems Analysis Activity and is currently a Ph.D. student at the University of Maryland, College Park. Mohammad Modarres received his M.S in mechanical engineering from M.I.T in 1977 and Ph.D. in nuclear engineering also from M.I.T in 1980. Dr. Modarres is currently Minta Martin Professor of Engineering and Director of the Reliability Engineering Program at the University of Maryland, College Park. He has served as a consultant or a board member to several governmental agencies including the Nuclear Regulatory Commission, Department of Energy, National Academy of Sciences, and several national laboratories in areas related to nuclear safety, probabilistic risk assessment, probabilistic fracture mechanics and physics of failure. Dr. Modarres has over 300 papers in archival journals and proceedings of conferences. He has published a number of textbooks, edited books and book chapters in various areas of nuclear safety, risk and reliability engineering. He is a University of Maryland Distinguished Scholar-Teacher and a fellow of the American Nuclear Society. Dr. Modarres works in probabilistic risk assessment and management, uncertainty analysis, probabilistic physics of failure and probabilistic fracture mechanics modeling.