BAYESIAN MODELING OF DYNAMIC SOFTWARE GROWTH CURVE MODELS
|
|
- Augusta Bailey
- 5 years ago
- Views:
Transcription
1 BAYESIAN MODELING OF DYNAMIC SOFTWARE GROWTH CURVE MODELS Zhaohui Liu, Nalini Ravishanker, University of Connecticut Bonnie K. Ray, IBM Watson Research Center Department of Mathematical Sciences, IBM Watson Research Center P.O. Box 218 Yorktown Hts., NY Key Words: hierarchical Bayes, reliability growth, software engineering 1. Introduction Models for characterizing the reliability of software have traditionally focused on using the observed number of failures 1 or time between observed failures to estimate how defects will be uncovered over time. These models, which are called growth curve (GC) models, are most often used during the latter stages of development, as an aid in determining when the software is ready to be released or to predict the failure rate in the field. The models rely strictly on characterizing the instantaneous rate of failure as a function of the number of failures detected upto time t. Most often, both the failure detection rate and the total number of expected failures are assumed to remain constant over time. Additional information, such as data concerning the operational profile at different testing stages or expert opinion concerning the expected reliability of the current product, is seldom incorporated into the models. In this paper, we develop an extended GC methodology for estimating failure rate that allows for evolution of the growth curve parameters as a function of the dynamic operational profile of the product. The model is fit using hierarchical Bayesian methods, which allow for incorporation of available prior information that may be relevant to estimating the ultimate failure rate. In Section 1 We use the terms failure, fault, and defect interchangeably in this paper, to denote a unique error in the code that causes the software not to function properly. 2, we provide some additional background on the software reliability GC models that form the basis of our extended modeling framework. Section 3 discusses a Bayesian framework for model estimation. Section 4 gives an illustration of software failure data from the system test stage of two releases of an IBM middleware product for a large operating system. Section 5 concludes. 2. Background and Model Formulation 2.1 Background A wide variety of software reliability growth models have been postulated in the literature, including those of Jelinski and Moranda (1972), Goel and Okumoto (1979), and Yamada et al. (1983). Many of these models assume that the underlying software failure process can be described using a nonhomogeneous Poisson process (NHPP). If it is assumed that no new defects are introduced at each repair, and that there is a finite, but Poisson distributed, random number N of defects remaining in the software at time t =0,then failure times observed upto time t can be taken to be the first n order statistics from N independent and identically distributed (i.i.d) observations having probability distribution f(t). Different choices of f( ) determine the exact shape of the failure rate function. For example, in the Jelinski-Moranda model, failure times are assumed to follow an exponential distribution having f(t) = β exp( βt),β,t > 0, which gives rise to a NHPP with mean rate of occurrence upto time t of m(t) = θ[1 exp( βt)], where θ denotes the mean of N. This is just the 2850
2 GC model proposed by Goel and Okumoto (1979). The parameter β can be interpreted as the defect discovery rate, while m (t) =λ(t) is called the failure rate function. See Musa, Iannino, and Okumoto (1987) and Kuo and Yang (1996) for further discussion of commonly used models for characterizing failure times and/or number of failures upto time t. The above mentioned models assume that both θ and β remain constant over the observed time interval. However, many different actions may occur during the testing or release phases of the software that make this assumption questionable. For example, it is typically assumed that code remains frozen during testing, i.e., the number of defects in a system under test does not change. However, code drops, affecting the total lines of code (LOC) in the software, are not uncommon during function testing or even later phases of software development. Thus, the size of code and, consequently, the number of failures in a large system, can vary widely during testing. Other variables relevant to the observed in-process defect discovery rate are the number of test cases run, the size of the test team, etc. Defect discovery in the field may be affected by such things as the rate at which customers install and exercise the software. If these influences are not incorporated into models used for characterizing reliability, resulting models are likely to have increased variability and poor predictive performance. A model containing covariate information could also be useful for what-if scenarios, for example to estimate when 90% of defects would be found under various test strategies. The importance of research into new models for software reliability growth that incorporate covariate information was noted in a 1996 National Academy of Science Panel Report on Statistical Methods in Software Engineering. Although dynamic models for software reliability were investigated by Singpurwalla and Soyer (1985), the models proposed there had failure time distributions changing randomly, as opposed to being driven by operational characteristics of the software development process. Incorporation of deterministic or stochastic covariate information into the software reliability modeling framework was mentioned by Singpurwalla and Wilson (1999, Section 7.1), but to the best of our knowledge, there have been no published applications which use this type of information. 2.2 Model Formulation We concentrate here on failure times assumed to follow a Weibull distribution, f(t) =αβt α 1 exp( βt α ),α,β,t>0, which reduces to an exponential distribution when α = 1. The Weibull distribution gives rise to a NHPP with mean rate of occurrence upto time t of m(t) =θ[1 exp( βt α )], and has been used to model discovery of defects in the field, where the α parameter was interpreted as being related to the customer usage rate (Kenney, 1993). Here, we call α a measure of the exercise rate of the software. We allow θ, β and α to change over time following a log-linear model. Let X t,i denotes the i th covariate value at time t and let k denote the number of covariates used to model each parameter. We have log(θ t )=η 0 + k θ i=1 η ix t,i, log(β t )=γ 0 + k β i=1 γ ix t,i, log(α t )=κ 0 + k α i=1 κ ix t,i. Thus the effect of the covariate is to scale the associated parameter by an amount corresponding to the exponentiated linear model. For example, the baseline defect discovery rate exp(γ 0 ) is scaled by the amount exp( k β i=1 γ ix t,i ). Under the assumption that X tj,i is constant in the interval (t j,t j+1 ), the Weibull distribution with stochastic covariate information gives rise to a NHPP with mean rate of occurrence in the interval (t j,t j+1 )of m j+1 = θ tj [exp( β tj t αt j j+1 ) exp( β t j t αt j j )] (1) It is not necessary that the covariates be the same for each parameter. As discussed above, LOC may be a reasonable covariate for θ, whereas test cases may be a reasonable covariate for β or α. The time intervals (t j,t j+1 ) can also be of varying length. In the next section, we describe a Bayesian estimation method for fitting these extended growth curve models. 2851
3 3. Bayesian modeling framework The Bayesian framework provides a way to incorporate reliability information from historical releases and test information for the current release completed previous to the time-frame of the model, for example code and function test defect information (Jeske et al., 2000) and from expert opinion (Singpurwalla and Wilson, 1999, Section 4.3) through the use of informative prior distributions on model parameters. When few failures have been observed, such as might be expected early in system testing (when growth curve models are most useful for planning purposes), MLE estimates may be difficult to obtain. Additionally, Jeske and Pham (2001) show that standard maximum likelihood estimation (MLE) techniques do not yield asymptotically efficient estimates for the exponential GC models in some instances.the incorporation of strong prior information mitigates these problems, while providing a context for expressing subjective judgement as to how a release is expected to perform. The following subsection gives details of the Bayesian estimation method and discusses choice of prior distributions. 3.1 Sampling-based Bayesian estimation The most common format for reporting software defects at IBM is that of recording the defect information, along with the day on which the defect was logged. Usually, more than one defect is found in a single day. This type of data gives rise to defect counts per unit time, as opposed to individual failure times, and necessitates using the NHPP form of the likelihood for model estimation. We formulate our model as a hierarchical Bayesian model and estimate it using sampling-based procedures (Gelfand and Smith, 1990). Unfortunately, the NHPP framework does not result in standard conditional posterior distributions, as were obtained by Kuo and Yang (1996) and Singpurwalla and Wilson (1999, Chapter 4) for failure time data. Thus we use a Metropolis-Hastings algorithm to sample from those conditional densities having non-standard distributions. Let Y tj represent the number of defects observed in the interval (t j 1,t j ). In general, given data Y tj,j = 1,,n, along with parameters Ψ = ( η, γ, κ), the Bayesian model specification requires a likelihood f(ỹn; Ψ) and a prior π(ψ). By Bayes theorem, we then obtain the posterior density as π(ψ Ỹn) f(ỹn;ψ)π(ψ). The likelihood function for the NHPP with mean function as in (1) is computed as f(ỹn;ψ)= n j=1 m Yt j j exp( m j ) Y tj!. (2) We assume that the priors for all parameters are independent and Gaussian. The hyperparameters for the distributions of η 0,γ 0, and κ 0 are selected to incorporate prior information provided in the form of an expected mean and variance for the total number of defects, the defect density rate, and the exercise rate. This information can come from experts, e.g. product development managers who may give an expected mean and upper and lower bound for these parameters, or from estimation results for the constant parameter Weibull model fitted to defect data from previous releases. The choice of which prior information to use depends on planned use of the model. For instance, if the model is fit to partially collected in-process data as a way of determining, e.g., when 90% of in-process defects will be found, prior information should be based on complete in-process data for a previous release of the same product or for a product having similar characteristics, such as similar LOC, functionality, operating environment, etc. If the GC model is fit as a way of projecting field defect discovery rates, previous release field data should be used for prior determination (see Jeske et al., 2000). For the remaining parameters, which reflect the dependence on covariate information, we use independent Gaussian priors centered at zero and having large variance relative to the specified prior of the baseline parameters, η 0,γ 0,κ 0. This results in non-informative prior information reflecting considerable uncertainty about the effect of the covariates. The rationale is that development managers will rarely be able to provide any reasonable guess as to how these parameters change and that only a simple model with no covariate information is fit to historical data. In the MCMC framework, extensive posterior and predictive analysis is facilitated through the 2852
4 use of numerical summary statistics and graphical displays of samples from the joint posterior and predictive distributions. For example, it is straightforward to obtain a confidence interval for T p, the time at which p percent of the remaining defects are expected to be discovered, which is not the case for GC models fit using MLE techniques. In the next section, we illustrate these ideas using system test defect data from an IBM product. 4. Data Example We use system test data from two releases of an IBM middleware product for a large operating system to illustrate our method. The two releases were of similar size and functionality. Data from the first release represent a complete system test cycle, whereas system test had just begun when the data for the second release was obtained. 4.1 Orthogonal Defect Classification Almost all of the published literature on GC modeling has focused on characterizing a product s reliability based on all defects considered together, not distinguished by type. However, several papers in the last few years have pointed out the information that can be gained in terms of understanding the evolving software process by consideration of the type of defects observed. In particular, IBM uses a scheme called Orthogonal Defect Classification (ODC) to distinguish different types of defects found during the development process. The failure process associated with each of the defect types can provide additional insight into the overall reliability of the product. Chillarege et al. (1992) defined seven standard defect types and established a cause and effect relationshipbetween type of defect and reliability growth. To exploit this, Biryani and Chillarege (1994) used separate growth curve models to track defects of different types over time. However, as mentioned in Chillarege et al. (1992), dependence relationships may exist between defects of different types, the rationale being that certain defects cannot be discovered until defects of a different type are first found. Bhandari et al. (1992) developed a reliability growth model that explicitly incorporates such relationships in the case of two defect types. The primary example given in Bhandari et al. (1992) is Cumulative Counts Cumulative Counts Plot of CH and AI (Release 2) Weeks Figure 1: Cumulative Assignment/Initialization and Checking defects per week during System Test for second release of product that of Assignment/Initialization(AI) defects that must be discovered before certain Checking (CH) defects are found. It is reasonable to believe then that an increase in the discovery rate for CH defects might result as more AI defects are found. Feedback relationships between CH and AI defect discovery may also exist. Although the model of Bhandari et al. (1992) is useful, extension to allow for dependence on more than one defect type is mathematically intractable. The dependence of discovery of defects of type l on an arbitrary number of other defect types can be handled in our extended GC framework by modeling the reliability growth of type l defects using covariates X tj,i to represent the number of defects of Type i found upto time t j, i =1,,k,i =l. Here, we illustrate the use of such covariates in the extended GC model for AI and CH defects. Figure 1 shows the cumulative number of AI and CH defects per week for the second release of the project described above. The plot for CH defects suggests that checking defects do tend to be discovered a little later than AI defects in the development process. We compare four different models for characterizing the cumulative growth of AI and CH defects over the second release system test time frame. Model 1 is the simple exponential model. Model 2 is an exponential model in which the de- 2853
5 fect discovery rate is allowed to vary as a function of the number of defects of the other type discovered in the previous time interval. Model 3 is a simple Weibull model, while Model 4 is a Weibull model allowing for varying defect discovery rates as in Model 2. System test data from the earlier release of the product was used to obtain meaningful priors for θ and γ 0 through fitting of the simple exponential model using maximum likelihood estimation. As discussed in the previous section, the prior distribution for γ 1 is taken to be Normal (0, 1). The prior distribution for α was set to lognormal (1, 1), to indicate a prior belief in a simple exponential model relative to the Weibull model. Results based on other values of the hyperparameters, in particular differing standard deviations, did not significantly alter the results. Normal proposal densities were used to generate samples in the Metropolis-Hastings step. The means and variances of the proposal densities were determined using the last 500 samples from an initial run of 1000 iterations of the modified Gibbs algorithm. The algorithm was then run a second time for 5000 iterations, each time taking 25 replications within each MH step. The last 2500 iterations were used to compute the results. All computations were done using the R programming language, freely available from Table 1 shows the means, standard deviations, and 95% credible sets for the parameters of each model for both AI and CH type defects. We see that Model 2, which allows the defect discovery rate to change as a function of the number of discovered defects of the other type in the previous time period, yields posterior means of γ 1 which are significantly different from zero for CH defects but not for AI defects. This indicates that there is indeed a feedforward relationshipbetween AI and CH defects, as hypothesized, but not vice versa. This relationshipagain shows upin Model 4, although not as significantly as for Model 2. Based on the 95% credible set, the α parameter of Models 3 and 4 does not differ significantly from one, indicating that the exponential model is sufficient for this data. However, the mean α value for the CH Weibull models is larger than one, while for AI defects it is very close to one, indicating that there may be some small exercise rate effect for CH defects. It is also interesting to note that the total expected defects of each type is higher when the defect discovery rate is allowed to change, in this case increasing as more defects of the other type are found. This suggests that models which fail to incorporate the feedback relationship may underestimate the total defects for a software development project. 5. Discussion We have shown how time-dependent covariate information can easily be incorporated into reliability growth curve modeling using sampling-based Bayesian methods. The Bayesian framework also allows incorporation of useful prior information. Although we have focused on time-dependent covariates here, the method also applies to the case in which failure data from several products is available, along with product-specific information, such as LOC and operating environment, and a common model is fit to the data with parameters varying across products as a function of the covariate information. This is closer to the regression model framework of classical reliability modeling, although the issue of appropriate scaling must be addressed when the defect rates vary dramatically in size. 6. References 1. Chillarege, R., Bhandari, I., Chaar, J., Halliday, M., Moebus, D., Ray, B., and Wong, M. (1992) Orthogonal defect classification-a concept for in-process measurement. IEEE Transactions on Software Engineering, 18, Chillarege, R. and Biryani, S. (1994). Identifying risk using ODC based growth curve models. Proceedings of the Fifth International Symposium of Software Reliability Engineering, Gelfand, A.E. and Smith, A.F.M. (1990). Sampling based approaches to calculating marginal densities, Journal of the American Statistical Association, 85,
6 Table 1: Parameter estimates for Exponential and Weibull GC models of AI and CH system test defects AI CH θ γ 0 γ 1 α θ γ 0 γ 1 α Mean Model 1 Std.dev Lower Bound Upper Bound Mean Model 2 Std.dev Lower Bound Upper Bound Mean Model 3 Std.dev Lower Bound Upper Bound Mean Model 4 Std.dev Lower Bound Upper Bound Goel, A. L. and Okumoto, K., (1979). Timedependent error-detection rate model for software reliability and other performance measures, IEEE Transactions on Reliability, R- 28(1), Jelinski, Z. and Moranda, P. (1972). Software Reliability Research, in Statistical Computer Performance Evaluation, e.d. W. Freiberger, New York: Academic Press, Jeske, D., Qureshi, M., and Muldoon, E. (2000). A Bayesian methodology for estimating the failure rate of software, International Journal of Reliability, Quality, and Safety Engineering, 7, Jeske, D., and Pham, H. (2001). On the maximum likelihood estimates for the Goel-Okumoto software reliability model, The American Statistician, 55, Kenney, G. (1993). Estimating defects in commercial software during operational use. IEEE Transactions on Reliability, 42, Kuo, L. and Yang, T.Y. (1996). Bayesian computation for nonhomogeneous Poisson processes in software reliability, Journal of the American Statistical Association, 91, Musa, J., Iannino, A., and Okumoto, K. (1987). Software Reliability: Measurement, Prediction, Application, McGraw-Hill: New York. 11. Robert, C. and Casella, G.(1999). Monte Carlo Statistical Methods, Springer: New York. 12. Singpurwalla, N. and Soyer, R. (1985). Assessing (software) reliablity growth using a random coefficient autoregressive process and its ramifications. IEEE Transacations on Software Engineering, SE-11 12: Singpurwalla, N. and Wilson, S. (1999). Statistical Methods in Software Engineering: Reliablity and Risk, Springer: New York. 14. Yamada, S., Ohba, M., and Osaki, S. (1983). S-shaped reliability growth modeling for software error detection, IEEE Transactions on Reliability, 32,
\ fwf The Institute for Integrating Statistics in Decision Sciences
# \ fwf The Institute for Integrating Statistics in Decision Sciences Technical Report TR-2007-8 May 22, 2007 Advances in Bayesian Software Reliability Modelling Fabrizio Ruggeri CNR IMATI Milano, Italy
More informationVariational Bayesian Approach for Interval Estimation of NHPP-based Software Reliability Models
Variational Bayesian Approach for Interval Estimation of NHPP-based Software Reliability Models Hiroyuki Okamura, Michael Grottke, Tadashi Dohi, and Kishor S. Trivedi Department of Information Engineering
More informationBayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence
Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns
More informationSoftware Reliability Growth Modelling using a Weighted Laplace Test Statistic
Software Reliability Growth Modelling using a Weighted Laplace Test Statistic Yan Luo Torsten Bergander A. Ben Hamza Concordia Institute for Information Systems Engineering Concordia University, Montréal,
More informationeqr094: Hierarchical MCMC for Bayesian System Reliability
eqr094: Hierarchical MCMC for Bayesian System Reliability Alyson G. Wilson Statistical Sciences Group, Los Alamos National Laboratory P.O. Box 1663, MS F600 Los Alamos, NM 87545 USA Phone: 505-667-9167
More informationTime-varying failure rate for system reliability analysis in large-scale railway risk assessment simulation
Time-varying failure rate for system reliability analysis in large-scale railway risk assessment simulation H. Zhang, E. Cutright & T. Giras Center of Rail Safety-Critical Excellence, University of Virginia,
More informationChap 4. Software Reliability
Chap 4. Software Reliability 4.2 Reliability Growth 1. Introduction 2. Reliability Growth Models 3. The Basic Execution Model 4. Calendar Time Computation 5. Reliability Demonstration Testing 1. Introduction
More informationConfidence Intervals for Reliability Growth Models with Small Sample Sizes. Reliability growth models, Order statistics, Confidence intervals
Confidence Intervals for Reliability Growth Models with Small Sample Sizes John Quigley, Lesley Walls University of Strathclyde, Glasgow, Scotland Key Words Reliability growth models, Order statistics,
More informationBayesian Methods for Estimating the Reliability of Complex Systems Using Heterogeneous Multilevel Information
Statistics Preprints Statistics 8-2010 Bayesian Methods for Estimating the Reliability of Complex Systems Using Heterogeneous Multilevel Information Jiqiang Guo Iowa State University, jqguo@iastate.edu
More informationSupplement to A Hierarchical Approach for Fitting Curves to Response Time Measurements
Supplement to A Hierarchical Approach for Fitting Curves to Response Time Measurements Jeffrey N. Rouder Francis Tuerlinckx Paul L. Speckman Jun Lu & Pablo Gomez May 4 008 1 The Weibull regression model
More informationIntroduction to Reliability Theory (part 2)
Introduction to Reliability Theory (part 2) Frank Coolen UTOPIAE Training School II, Durham University 3 July 2018 (UTOPIAE) Introduction to Reliability Theory 1 / 21 Outline Statistical issues Software
More informationImprecise Software Reliability
Munich, September 2009 Denitions and assumptions of the software reliability models Well-known models Standard denitions of software reliability A fault in a software is an incorrect step, process, or
More informationOn a Method for Mending Time to Failure Distributions
On a Method for Mending Time to Failure Distributions Michael Grottke and Kishor S. Trivedi Department of Electrical & Computer Engineering Duke University, Durham, NC 27708-0291, USA {grottke, kst}@ee.duke.edu
More informationPrinciples of Bayesian Inference
Principles of Bayesian Inference Sudipto Banerjee University of Minnesota July 20th, 2008 1 Bayesian Principles Classical statistics: model parameters are fixed and unknown. A Bayesian thinks of parameters
More informationEstimation of reliability parameters from Experimental data (Parte 2) Prof. Enrico Zio
Estimation of reliability parameters from Experimental data (Parte 2) This lecture Life test (t 1,t 2,...,t n ) Estimate θ of f T t θ For example: λ of f T (t)= λe - λt Classical approach (frequentist
More informationMarkov Chain Monte Carlo methods
Markov Chain Monte Carlo methods Tomas McKelvey and Lennart Svensson Signal Processing Group Department of Signals and Systems Chalmers University of Technology, Sweden November 26, 2012 Today s learning
More informationReliability Monitoring Using Log Gaussian Process Regression
COPYRIGHT 013, M. Modarres Reliability Monitoring Using Log Gaussian Process Regression Martin Wayne Mohammad Modarres PSA 013 Center for Risk and Reliability University of Maryland Department of Mechanical
More informationVariability within multi-component systems. Bayesian inference in probabilistic risk assessment The current state of the art
PhD seminar series Probabilistics in Engineering : g Bayesian networks and Bayesian hierarchical analysis in engeering g Conducted by Prof. Dr. Maes, Prof. Dr. Faber and Dr. Nishijima Variability within
More informationMultistate Modeling and Applications
Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)
More informationBayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework
HT5: SC4 Statistical Data Mining and Machine Learning Dino Sejdinovic Department of Statistics Oxford http://www.stats.ox.ac.uk/~sejdinov/sdmml.html Maximum Likelihood Principle A generative model for
More informationBayesian Methods for Machine Learning
Bayesian Methods for Machine Learning CS 584: Big Data Analytics Material adapted from Radford Neal s tutorial (http://ftp.cs.utoronto.ca/pub/radford/bayes-tut.pdf), Zoubin Ghahramni (http://hunch.net/~coms-4771/zoubin_ghahramani_bayesian_learning.pdf),
More informationComputer Vision Group Prof. Daniel Cremers. 11. Sampling Methods
Prof. Daniel Cremers 11. Sampling Methods Sampling Methods Sampling Methods are widely used in Computer Science as an approximation of a deterministic algorithm to represent uncertainty without a parametric
More informationMASTER. Parameter estimation for software reliability models. Meyfroyt, P.H.A. Award date: Link to publication
MASTER Parameter estimation for software reliability models Meyfroyt, P.H.A. Award date: 01 Link to publication Disclaimer This document contains a student thesis (bachelor's or master's), as authored
More informationComputer Vision Group Prof. Daniel Cremers. 14. Sampling Methods
Prof. Daniel Cremers 14. Sampling Methods Sampling Methods Sampling Methods are widely used in Computer Science as an approximation of a deterministic algorithm to represent uncertainty without a parametric
More informationStatistical approach to software reliability certification
Statistical approach to software reliability certification Corro Ramos, I.; Di Bucchianico, A.; van Hee, K.M. Published: 1/1/28 Document Version Publisher s PDF, also known as Version of Record (includes
More informationEvaluating the value of structural heath monitoring with longitudinal performance indicators and hazard functions using Bayesian dynamic predictions
Evaluating the value of structural heath monitoring with longitudinal performance indicators and hazard functions using Bayesian dynamic predictions C. Xing, R. Caspeele, L. Taerwe Ghent University, Department
More informationDevelopment of Stochastic Artificial Neural Networks for Hydrological Prediction
Development of Stochastic Artificial Neural Networks for Hydrological Prediction G. B. Kingston, M. F. Lambert and H. R. Maier Centre for Applied Modelling in Water Engineering, School of Civil and Environmental
More informationModelling trends in the ocean wave climate for dimensioning of ships
Modelling trends in the ocean wave climate for dimensioning of ships STK1100 lecture, University of Oslo Erik Vanem Motivation and background 2 Ocean waves and maritime safety Ships and other marine structures
More informationBayesian Analysis for Step-Stress Accelerated Life Testing using Weibull Proportional Hazard Model
Noname manuscript No. (will be inserted by the editor) Bayesian Analysis for Step-Stress Accelerated Life Testing using Weibull Proportional Hazard Model Naijun Sha Rong Pan Received: date / Accepted:
More informationSTA 4273H: Statistical Machine Learning
STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistical Sciences! rsalakhu@cs.toronto.edu! h0p://www.cs.utoronto.ca/~rsalakhu/ Lecture 7 Approximate
More informationHIERARCHICAL MODELS IN EXTREME VALUE THEORY
HIERARCHICAL MODELS IN EXTREME VALUE THEORY Richard L. Smith Department of Statistics and Operations Research, University of North Carolina, Chapel Hill and Statistical and Applied Mathematical Sciences
More informationABSTRACT. improving the underlying reliability. Traditional models for assessing reliability
ABSTRACT Title of dissertation: METHODOLOGY FOR ASSESSING RELIABILITY GROWTH USING MULTIPLE INFORMATION SOURCES Martin Wayne, Doctor of Philosophy, 2013 Dissertation directed by: Professor Mohammad Modarres
More informationABC methods for phase-type distributions with applications in insurance risk problems
ABC methods for phase-type with applications problems Concepcion Ausin, Department of Statistics, Universidad Carlos III de Madrid Joint work with: Pedro Galeano, Universidad Carlos III de Madrid Simon
More informationBayesian Inference. Chapter 4: Regression and Hierarchical Models
Bayesian Inference Chapter 4: Regression and Hierarchical Models Conchi Ausín and Mike Wiper Department of Statistics Universidad Carlos III de Madrid Master in Business Administration and Quantitative
More informationBayesian Inference. Chapter 4: Regression and Hierarchical Models
Bayesian Inference Chapter 4: Regression and Hierarchical Models Conchi Ausín and Mike Wiper Department of Statistics Universidad Carlos III de Madrid Advanced Statistics and Data Mining Summer School
More informationBayesian Estimation of DSGE Models 1 Chapter 3: A Crash Course in Bayesian Inference
1 The views expressed in this paper are those of the authors and do not necessarily reflect the views of the Federal Reserve Board of Governors or the Federal Reserve System. Bayesian Estimation of DSGE
More informationIntroduction to Engineering Reliability
Introduction to Engineering Reliability Robert C. Patev North Atlantic Division Regional Technical Specialist (978) 318-8394 Topics Reliability Basic Principles of Reliability Analysis Non-Probabilistic
More informationBayesian Modeling of Accelerated Life Tests with Random Effects
Bayesian Modeling of Accelerated Life Tests with Random Effects Ramón V. León Avery J. Ashby Jayanth Thyagarajan Joint Statistical Meeting August, 00 Toronto, Canada Abstract We show how to use Bayesian
More informationBayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets
Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets Athanasios Kottas Department of Applied Mathematics and Statistics,
More informationIterative Markov Chain Monte Carlo Computation of Reference Priors and Minimax Risk
Iterative Markov Chain Monte Carlo Computation of Reference Priors and Minimax Risk John Lafferty School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 lafferty@cs.cmu.edu Abstract
More informationHastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model
UNIVERSITY OF TEXAS AT SAN ANTONIO Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model Liang Jing April 2010 1 1 ABSTRACT In this paper, common MCMC algorithms are introduced
More informationFault Tolerant Computing CS 530 Software Reliability Growth. Yashwant K. Malaiya Colorado State University
Fault Tolerant Computing CS 530 Software Reliability Growth Yashwant K. Malaiya Colorado State University 1 Software Reliability Growth: Outline Testing approaches Operational Profile Software Reliability
More informationσ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) =
Until now we have always worked with likelihoods and prior distributions that were conjugate to each other, allowing the computation of the posterior distribution to be done in closed form. Unfortunately,
More informationOnline appendix to On the stability of the excess sensitivity of aggregate consumption growth in the US
Online appendix to On the stability of the excess sensitivity of aggregate consumption growth in the US Gerdie Everaert 1, Lorenzo Pozzi 2, and Ruben Schoonackers 3 1 Ghent University & SHERPPA 2 Erasmus
More informationBayesian Reliability Demonstration
Bayesian Reliability Demonstration F.P.A. Coolen and P. Coolen-Schrijner Department of Mathematical Sciences, Durham University Durham DH1 3LE, UK Abstract This paper presents several main aspects of Bayesian
More informationFailure Correlation in Software Reliability Models
Failure Correlation in Software Reliability Models Katerina Goševa Popstojanova, Member IEEE Duke University, Durham Kishor S. Trivedi, Fellow IEEE Duke University, Durham Key Words Software reliability,
More informationHow likely is Simpson s paradox in path models?
How likely is Simpson s paradox in path models? Ned Kock Full reference: Kock, N. (2015). How likely is Simpson s paradox in path models? International Journal of e- Collaboration, 11(1), 1-7. Abstract
More informationBayesian Regression Linear and Logistic Regression
When we want more than point estimates Bayesian Regression Linear and Logistic Regression Nicole Beckage Ordinary Least Squares Regression and Lasso Regression return only point estimates But what if we
More informationParameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn
Parameter estimation and forecasting Cristiano Porciani AIfA, Uni-Bonn Questions? C. Porciani Estimation & forecasting 2 Temperature fluctuations Variance at multipole l (angle ~180o/l) C. Porciani Estimation
More informationA discussion of software reliability growth models with time-varying learning effects
American Journal of Software Engineering and Applications 13; (3): 9-14 Published online July, 13 (http://www.sciencepublishinggroup.com/j/ajsea) doi: 1.11648/j.ajsea.133.1 A discussion of software reliability
More informationSubject CS1 Actuarial Statistics 1 Core Principles
Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and
More informationStat 542: Item Response Theory Modeling Using The Extended Rank Likelihood
Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Jonathan Gruhl March 18, 2010 1 Introduction Researchers commonly apply item response theory (IRT) models to binary and ordinal
More informationMarkov chain Monte Carlo
Markov chain Monte Carlo Karl Oskar Ekvall Galin L. Jones University of Minnesota March 12, 2019 Abstract Practically relevant statistical models often give rise to probability distributions that are analytically
More informationExpectation propagation for signal detection in flat-fading channels
Expectation propagation for signal detection in flat-fading channels Yuan Qi MIT Media Lab Cambridge, MA, 02139 USA yuanqi@media.mit.edu Thomas Minka CMU Statistics Department Pittsburgh, PA 15213 USA
More informationMaking rating curves - the Bayesian approach
Making rating curves - the Bayesian approach Rating curves what is wanted? A best estimate of the relationship between stage and discharge at a given place in a river. The relationship should be on the
More informationOn Reparametrization and the Gibbs Sampler
On Reparametrization and the Gibbs Sampler Jorge Carlos Román Department of Mathematics Vanderbilt University James P. Hobert Department of Statistics University of Florida March 2014 Brett Presnell Department
More informationBayesian Inference. Chapter 1. Introduction and basic concepts
Bayesian Inference Chapter 1. Introduction and basic concepts M. Concepción Ausín Department of Statistics Universidad Carlos III de Madrid Master in Business Administration and Quantitative Methods Master
More informationModelling geoadditive survival data
Modelling geoadditive survival data Thomas Kneib & Ludwig Fahrmeir Department of Statistics, Ludwig-Maximilians-University Munich 1. Leukemia survival data 2. Structured hazard regression 3. Mixed model
More informationAlgorithm-Independent Learning Issues
Algorithm-Independent Learning Issues Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr CS 551, Spring 2007 c 2007, Selim Aksoy Introduction We have seen many learning
More informationMarkov Chain Monte Carlo methods
Markov Chain Monte Carlo methods By Oleg Makhnin 1 Introduction a b c M = d e f g h i 0 f(x)dx 1.1 Motivation 1.1.1 Just here Supresses numbering 1.1.2 After this 1.2 Literature 2 Method 2.1 New math As
More informationThe random counting variable. Barbara Russo
The random counting variable Barbara Russo Counting Random Variable } Until now we have seen the point process through two sets of random variables } T i } X i } We introduce a new random variable the
More informationDefault Priors and Effcient Posterior Computation in Bayesian
Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature
More informationParametric and Topological Inference for Masked System Lifetime Data
Parametric and for Masked System Lifetime Data Rang Louis J M Aslett and Simon P Wilson Trinity College Dublin 9 th July 2013 Structural Reliability Theory Interest lies in the reliability of systems composed
More informationChapter 4 HOMEWORK ASSIGNMENTS. 4.1 Homework #1
Chapter 4 HOMEWORK ASSIGNMENTS These homeworks may be modified as the semester progresses. It is your responsibility to keep up to date with the correctly assigned homeworks. There may be some errors in
More informationMCMC Sampling for Bayesian Inference using L1-type Priors
MÜNSTER MCMC Sampling for Bayesian Inference using L1-type Priors (what I do whenever the ill-posedness of EEG/MEG is just not frustrating enough!) AG Imaging Seminar Felix Lucka 26.06.2012 , MÜNSTER Sampling
More informationBayesian non-parametric model to longitudinally predict churn
Bayesian non-parametric model to longitudinally predict churn Bruno Scarpa Università di Padova Conference of European Statistics Stakeholders Methodologists, Producers and Users of European Statistics
More informationModelling Operational Risk Using Bayesian Inference
Pavel V. Shevchenko Modelling Operational Risk Using Bayesian Inference 4y Springer 1 Operational Risk and Basel II 1 1.1 Introduction to Operational Risk 1 1.2 Defining Operational Risk 4 1.3 Basel II
More informationIntroduction to Markov Chain Monte Carlo & Gibbs Sampling
Introduction to Markov Chain Monte Carlo & Gibbs Sampling Prof. Nicholas Zabaras Sibley School of Mechanical and Aerospace Engineering 101 Frank H. T. Rhodes Hall Ithaca, NY 14853-3801 Email: zabaras@cornell.edu
More information(5) Multi-parameter models - Gibbs sampling. ST440/540: Applied Bayesian Analysis
Summarizing a posterior Given the data and prior the posterior is determined Summarizing the posterior gives parameter estimates, intervals, and hypothesis tests Most of these computations are integrals
More informationThe comparative studies on reliability for Rayleigh models
Journal of the Korean Data & Information Science Society 018, 9, 533 545 http://dx.doi.org/10.7465/jkdi.018.9..533 한국데이터정보과학회지 The comparative studies on reliability for Rayleigh models Ji Eun Oh 1 Joong
More informationBayesian Prediction of Code Output. ASA Albuquerque Chapter Short Course October 2014
Bayesian Prediction of Code Output ASA Albuquerque Chapter Short Course October 2014 Abstract This presentation summarizes Bayesian prediction methodology for the Gaussian process (GP) surrogate representation
More informationA note on Reversible Jump Markov Chain Monte Carlo
A note on Reversible Jump Markov Chain Monte Carlo Hedibert Freitas Lopes Graduate School of Business The University of Chicago 5807 South Woodlawn Avenue Chicago, Illinois 60637 February, 1st 2006 1 Introduction
More informationFast Likelihood-Free Inference via Bayesian Optimization
Fast Likelihood-Free Inference via Bayesian Optimization Michael Gutmann https://sites.google.com/site/michaelgutmann University of Helsinki Aalto University Helsinki Institute for Information Technology
More information10. Exchangeability and hierarchical models Objective. Recommended reading
10. Exchangeability and hierarchical models Objective Introduce exchangeability and its relation to Bayesian hierarchical models. Show how to fit such models using fully and empirical Bayesian methods.
More information7. Estimation and hypothesis testing. Objective. Recommended reading
7. Estimation and hypothesis testing Objective In this chapter, we show how the election of estimators can be represented as a decision problem. Secondly, we consider the problem of hypothesis testing
More informationNow consider the case where E(Y) = µ = Xβ and V (Y) = σ 2 G, where G is diagonal, but unknown.
Weighting We have seen that if E(Y) = Xβ and V (Y) = σ 2 G, where G is known, the model can be rewritten as a linear model. This is known as generalized least squares or, if G is diagonal, with trace(g)
More informationSTAT 499/962 Topics in Statistics Bayesian Inference and Decision Theory Jan 2018, Handout 01
STAT 499/962 Topics in Statistics Bayesian Inference and Decision Theory Jan 2018, Handout 01 Nasser Sadeghkhani a.sadeghkhani@queensu.ca There are two main schools to statistical inference: 1-frequentist
More informationCOPYRIGHTED MATERIAL CONTENTS. Preface Preface to the First Edition
Preface Preface to the First Edition xi xiii 1 Basic Probability Theory 1 1.1 Introduction 1 1.2 Sample Spaces and Events 3 1.3 The Axioms of Probability 7 1.4 Finite Sample Spaces and Combinatorics 15
More informationContents. Part I: Fundamentals of Bayesian Inference 1
Contents Preface xiii Part I: Fundamentals of Bayesian Inference 1 1 Probability and inference 3 1.1 The three steps of Bayesian data analysis 3 1.2 General notation for statistical inference 4 1.3 Bayesian
More informationA comparison of inverse transform and composition methods of data simulation from the Lindley distribution
Communications for Statistical Applications and Methods 2016, Vol. 23, No. 6, 517 529 http://dx.doi.org/10.5351/csam.2016.23.6.517 Print ISSN 2287-7843 / Online ISSN 2383-4757 A comparison of inverse transform
More informationInternational Journal of Scientific & Engineering Research, Volume 5, Issue 2, February ISSN
International Journal of Scientific & Engineering Research, Volume 5, Issue 2, ebruary-2014 981 Detection Inflection s-shaped model Using SPRT based on Order Statistics Dr. R. Satya Prasad 1 and Y.Sangeetha
More informationA BAYESIAN SOLUTION TO INCOMPLETENESS
A BAYESIAN SOLUTION TO INCOMPLETENESS IN PROBABILISTIC RISK ASSESSMENT 14th International Probabilistic Safety Assessment & Management Conference PSAM-14 September 17-21, 2018 Los Angeles, United States
More informationON THE FAILURE RATE ESTIMATION OF THE INVERSE GAUSSIAN DISTRIBUTION
ON THE FAILURE RATE ESTIMATION OF THE INVERSE GAUSSIAN DISTRIBUTION ZHENLINYANGandRONNIET.C.LEE Department of Statistics and Applied Probability, National University of Singapore, 3 Science Drive 2, Singapore
More informationAn Integral Measure of Aging/Rejuvenation for Repairable and Non-repairable Systems
An Integral Measure of Aging/Rejuvenation for Repairable and Non-repairable Systems M.P. Kaminskiy and V.V. Krivtsov Abstract This paper introduces a simple index that helps to assess the degree of aging
More informationBayesian Life Test Planning for the Weibull Distribution with Given Shape Parameter
Statistics Preprints Statistics 10-8-2002 Bayesian Life Test Planning for the Weibull Distribution with Given Shape Parameter Yao Zhang Iowa State University William Q. Meeker Iowa State University, wqmeeker@iastate.edu
More informationA hidden semi-markov model for the occurrences of water pipe bursts
A hidden semi-markov model for the occurrences of water pipe bursts T. Economou 1, T.C. Bailey 1 and Z. Kapelan 1 1 School of Engineering, Computer Science and Mathematics, University of Exeter, Harrison
More informationFailure Correlation in Software Reliability Models
Failure Correlation in Software Reliability Models Katerina Goševa Popstojanova and Kishor Trivedi Center for Advanced Computing and Communication Department of Electrical and Computer Engineering Duke
More informationModeling and Interpolation of Non-Gaussian Spatial Data: A Comparative Study
Modeling and Interpolation of Non-Gaussian Spatial Data: A Comparative Study Gunter Spöck, Hannes Kazianka, Jürgen Pilz Department of Statistics, University of Klagenfurt, Austria hannes.kazianka@uni-klu.ac.at
More informationPrinciples of Bayesian Inference
Principles of Bayesian Inference Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry & Department
More informationTwo Statistical Problems in X-ray Astronomy
Two Statistical Problems in X-ray Astronomy Alexander W Blocker October 21, 2008 Outline 1 Introduction 2 Replacing stacking Problem Current method Model Further development 3 Time symmetry Problem Model
More informationSoftware Reliability & Testing
Repairable systems Repairable system A reparable system is obtained by glueing individual non-repairable systems each around a single failure To describe this gluing process we need to review the concept
More informationA Bayesian Approach to Phylogenetics
A Bayesian Approach to Phylogenetics Niklas Wahlberg Based largely on slides by Paul Lewis (www.eeb.uconn.edu) An Introduction to Bayesian Phylogenetics Bayesian inference in general Markov chain Monte
More informationB.H. Far
SENG 521 Software Reliability & Software Quality Chapter 6: Software Reliability Models Department of Electrical & Computer Engineering, University of Calgary B.H. Far (far@ucalgary.ca) http://www.enel.ucalgary.ca/people/far/lectures/seng521
More informationContents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1
Contents Preface to Second Edition Preface to First Edition Abbreviations xv xvii xix PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1 1 The Role of Statistical Methods in Modern Industry and Services
More informationStatistical Inference for Stochastic Epidemic Models
Statistical Inference for Stochastic Epidemic Models George Streftaris 1 and Gavin J. Gibson 1 1 Department of Actuarial Mathematics & Statistics, Heriot-Watt University, Riccarton, Edinburgh EH14 4AS,
More informationStructural Reliability
Structural Reliability Thuong Van DANG May 28, 2018 1 / 41 2 / 41 Introduction to Structural Reliability Concept of Limit State and Reliability Review of Probability Theory First Order Second Moment Method
More informationEstimating the marginal likelihood with Integrated nested Laplace approximation (INLA)
Estimating the marginal likelihood with Integrated nested Laplace approximation (INLA) arxiv:1611.01450v1 [stat.co] 4 Nov 2016 Aliaksandr Hubin Department of Mathematics, University of Oslo and Geir Storvik
More informationINVERTED KUMARASWAMY DISTRIBUTION: PROPERTIES AND ESTIMATION
Pak. J. Statist. 2017 Vol. 33(1), 37-61 INVERTED KUMARASWAMY DISTRIBUTION: PROPERTIES AND ESTIMATION A. M. Abd AL-Fattah, A.A. EL-Helbawy G.R. AL-Dayian Statistics Department, Faculty of Commerce, AL-Azhar
More informationMonte Carlo in Bayesian Statistics
Monte Carlo in Bayesian Statistics Matthew Thomas SAMBa - University of Bath m.l.thomas@bath.ac.uk December 4, 2014 Matthew Thomas (SAMBa) Monte Carlo in Bayesian Statistics December 4, 2014 1 / 16 Overview
More informationECO 513 Fall 2009 C. Sims HIDDEN MARKOV CHAIN MODELS
ECO 513 Fall 2009 C. Sims HIDDEN MARKOV CHAIN MODELS 1. THE CLASS OF MODELS y t {y s, s < t} p(y t θ t, {y s, s < t}) θ t = θ(s t ) P[S t = i S t 1 = j] = h ij. 2. WHAT S HANDY ABOUT IT Evaluating the
More information