Reliability of Safety-Critical Systems Chapter 9. Average frequency of dangerous failures

Similar documents
Reliability of Safety-Critical Systems Chapter 8. Probability of Failure on Demand using IEC formulas

Chapter 12. Spurious Operation and Spurious Trips

Reliability of Safety-Critical Systems 5.1 Reliability Quantification with RBDs

Chapter 8. Calculation of PFD using Markov

Reliability of Safety-Critical Systems Chapter 8. Probability of Failure on Demand using fault trees

Chapter 8. Calculation of PFD using FTA

Reliability of Safety-Critical Systems 5.4 Petrinets

System Reliability Thory 6. Common-Cause Failures

Non-observable failure progression

Choice of Demand Mode for Subsea Safety Systems

Imperfect Testing and its Influence on Availability of Safety Instrumented Systems

Mean fault time for estimation of average probability of failure on demand.

AVERAGE PROBABILITY OF FAILURE ON DEMAND ESTIMATION FOR BURNER MANAGEMENT SYSTEMS

Considering Security Aspects in Safety Environment. Dipl.-Ing. Evzudin Ugljesa

9. Reliability theory

Modeling Common Cause Failures in Diverse Components with Fault Tree Applications

Evaluating the PFD of Safety Instrumented Systems with Partial Stroke Testing

Chapter 9 Part II Maintainability

Single-part-type, multiple stage systems. Lecturer: Stanley B. Gershwin

Terminology and Concepts

Introduction to repairable systems STK4400 Spring 2011

An Integral Measure of Aging/Rejuvenation for Repairable and Non-repairable Systems

Introduction to Reliability Theory (part 2)

Examination paper for TMA4275 Lifetime Analysis

Failure modeling and maintenance optimization for a railway line

Reliable Computing I

The random counting variable. Barbara Russo

Reliability Growth in JMP 10

CHAPTER 10 RELIABILITY

The exponential distribution and the Poisson process

STOCHASTIC MODELS FOR RELIABILITY, AVAILABILITY, AND MAINTAINABILITY

Combination of safety integrity levels (SILs): A study of IEC61508 merging rules

Concept of Reliability

Reliability of Technical Systems

A hidden semi-markov model for the occurrences of water pipe bursts

Failure rate in the continuous sense. Figure. Exponential failure density functions [f(t)] 1

Chapter 5 Component Importance

Engineering Risk Benefit Analysis

AN INTEGRAL MEASURE OF AGING/REJUVENATION FOR REPAIRABLE AND NON REPAIRABLE SYSTEMS

Fault-Tolerant Computing

Markov Reliability and Availability Analysis. Markov Processes

Reliability Engineering I

Availability and Maintainability. Piero Baraldi

Reliability analysis of power systems EI2452. Lifetime analysis 7 May 2015

0utline. 1. Tools from Operations Research. 2. Applications

Practical Applications of Reliability Theory

MIT Manufacturing Systems Analysis Lectures 6 9: Flow Lines

Lecture 7. Poisson and lifetime processes in risk analysis

Common Cause Failure (CCF)

The optimal delay of the second test is therefore approximately 210 hours earlier than =2.

Availability and Reliability Analysis for Dependent System with Load-Sharing and Degradation Facility

CHAPTER 3 MATHEMATICAL AND SIMULATION TOOLS FOR MANET ANALYSIS

FAULT TOLERANT SYSTEMS

arxiv: v1 [math.ho] 25 Feb 2008

Stochastic Modelling Unit 1: Markov chain models

Availability. M(t) = 1 - e -mt

EE 445 / 850: Final Examination

Time-varying failure rate for system reliability analysis in large-scale railway risk assessment simulation

Chapter 5. System Reliability and Reliability Prediction.

57:022 Principles of Design II Final Exam Solutions - Spring 1997

Dependable Systems. ! Dependability Attributes. Dr. Peter Tröger. Sources:

Multi-State Availability Modeling in Practice

e 4β e 4β + e β ˆβ =0.765

Markov Models for Reliability Modeling

CHAPTER 9 AVAILABILITY DEMONSTRATION PLANS CONTENTS

57:022 Principles of Design II Midterm Exam #2 Solutions

Safety Instrumented Systems & Functional safety of safety-related systems. A substantial new opportunity for Pfannenberg Signaling products

The story of the film so far... Mathematics for Informatics 4a. Continuous-time Markov processes. Counting processes

ANALYSIS OF INDEPENDENT PROTECTION LAYERS AND SAFETY INSTRUMENTED SYSTEM FOR OIL GAS SEPARATOR USING BAYESIAN METHODS

Duration Analysis. Joan Llull

Reliability Analysis of a Fuel Supply System in Automobile Engine

Quantitative Reliability Analysis

Q = (c) Assuming that Ricoh has been working continuously for 7 days, what is the probability that it will remain working at least 8 more days?

A new condition based maintenance model with random improvements on the system after maintenance actions: Optimizing by monte carlo simulation

Reliability of Technical Systems

Monte Carlo Simulation for Reliability and Availability analyses

Slides 9: Queuing Models

Evaluation and Validation

CHAPTER 3 ANALYSIS OF RELIABILITY AND PROBABILITY MEASURES

Assignment 3 with Reference Solutions

Let (Ω, F) be a measureable space. A filtration in discrete time is a sequence of. F s F t

Queueing Theory. VK Room: M Last updated: October 17, 2013.

Evaluating Safety Integrity Level in presence of uncertainty

These notes will supplement the textbook not replace what is there. defined for α >0

Safety analysis and standards Analyse de sécurité et normes Sicherheitsanalyse und Normen

Reliability Analysis of Electronic Systems using Markov Models

Reliability of Technical Systems

IE 336 Seat # Name (one point) < KEY > Closed book. Two pages of hand-written notes, front and back. No calculator. 60 minutes.

Chapter 6 Queueing Models. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation

Queueing Systems: Lecture 3. Amedeo R. Odoni October 18, Announcements

Partial stroke tests as a procedure for the extension of the proof test interval

Quantitative evaluation of Dependability

Probability Methods in Civil Engineering Prof. Rajib Maity Department of Civil Engineering Indian Institute of Technology, Kharagpur

Probability and Probability Distributions. Dr. Mohammed Alahmed

Monte Carlo (MC) simulation for system reliability

Markov Repairable Systems with History-Dependent Up and Down States

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Quantification of the safety level of a safety-critical control system K. Rástočný 1, J. Ilavský 1

.IEEEEEEIIEII. fflllffo. I ll, 7 AO-AO CALIFORNIA UN IV BERKELEY OPERATIONS RESEARCH CENTER FIG 12/1

Markov Processes and Queues

Transcription:

Reliability of Safety-Critical Systems Chapter 9. Average frequency of dangerous failures Mary Ann Lundteigen and Marvin Rausand mary.a.lundteigen@ntnu.no &marvin.rausand@ntnu.no RAMS Group Department of Production and Quality Engineering NTNU (Version 1.1 per July 2015) M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 1 / 38

Reliability of Safety-Critical Systems Slides related to the book Reliability of Safety-Critical Systems Theory and Applications Wiley, 2014 Theory and Applications Marvin Rausand Homepage of the book: http://www.ntnu.edu/ross/ books/sis M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 2 / 38

Purpose The main purpose of this slides series is to: Introduce the alternative failure measure used with SIS, called average frequency of dangerous failures in key standards like IEC 61508 and IEC 61511. More specifically, we will address: When this measure is more suited than the average probability of failure on demand, PFD avg. The underlying theory and assumptions for this failure measure. Some of the analytical formulas developed for this purpose. Other methods may be presented in other slides series, such as the use of Markov methods and PetriNets. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 3 / 38

Motivation and clarifications average frequency of dangerous failure is frequency measure The measure is used for many applications, such as for railway signaling systems and for machinery systems. For railway industry, it is sometimes referred to as the hazard rate, as the result of a SIS failure (e.g., setting a green light signal when it should be red) is likely to be hazardous. The main attribute of systems using a failure frequency measure is that they operate in the continuous demand or high-demand, according to the classification in IEC 61508. The measure is often abbreviated PFH, even if the old term "Probability of having a dangerous failure per hour has been removed from IEC 61508. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 4 / 38

Characteristics of a high-/continuous demand system What characterizes a high-demand vs continuous demand system (SIS)? The SIS is subject to frequent demands, either more often than once per year (high-demand) or always (continuous demand) The system may be repairable or non-repairable In case of a repairable system: The system may or may not be subject to regular proof testing If never tested, there is a possibility that a redundant system is down, even if the response to the demand is successful M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 5 / 38

Frequency is not just a frequency The failure frequency can be different things, and a starting point is to distinguish frequency measures for non-repairable and for repairable systems. For repairable systems, we often refer to the following types of frequency measures: Probability density function Failure rate function Average failure rate For repairable systems, we may use: Rate of occurrence of failures (ROCOF) Average ROCOF Vesely Failure Rate M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 6 / 38

Frequency measure for non-repairable systems Detailed about the frequency measures for repairable systems are: Probability density function, f (t), which is the unconditional probability that the item fails in a small interval t per t time units. Failure rate function, z(t), which is the conditional probability that the item fails in a small test interval t given that the item has survived til the starting point of the interval, t, per t time units. Average failure rate, which is z(0,τ ) = 1 τ τ 0 z(t)dt F (τ ) τ Note that the approximation above only applies to rather short intervals. For large values of τ, we note that the average failure rate tends towards zero, which is not realistic (even if desired...). M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 7 / 38

Frequency measure for non-repairable systems Which measure would be PFH for non-repairable systems? The failure rate function would perhaps be the best alternative (?) Once the distribution of z(t) or F(t) is selected, we may calculate the PFH Consider a single item. We then get for: Exponentially distributed time to failure, with parameter λ: z(0,τ ) = 1 τ τ 0 z(t)dt = 1 τ τ 0 λdt = λ Weilbul distributed time to failure, with parameters λ and α z(0,τ ) = 1 τ τ 0 z(t)dt = 1 τ τ 0 αλ α t α 1 dt = λ α τ α 1 M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 8 / 38

What would be such a non-repairable system? Most safety-critical systems in our context are repairable. Non-repairable systems are less common. One example could be a space mission system (never to return to earth) or a destructive system (destroye, once used). We will not spend more time on non-repairable systems in this lecture. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 9 / 38

Frequency measure for repairable systems Suitable frequency measure for a repairable system is ROCOF, w(t) ROCOF can be estimated on the basis of the following curve. Note that the curve may be linear, convex or concave, depending on whether the ROCOF is constant, increasing or decreasing with time. N(t) 10 9 8 7 6 5 4 3 2 1 0 0 t 1 t 2 t 3 t 4 t 5 t 6 t 7 t 8 t 9 t 10 Time M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 10 / 38

Frequency measure for repairable systems ROCOF may be calculated as: w(t) = d E[N (t + t) N (t)] E[N (t)] = lim dt t 0 t E[N (t + t) N (t)] (for small t) t Pr[Failure in (t,t+ t)] = (Assuming 1 failure in t) t Note that the ROCOF is unconditional failure frequency, as it does not consider what happened before t. Any further calculation requires that we make an assumption about the distribution of the occurrence of failures. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 11 / 38

Frequency measure for repairable systems For small values of t, we get: w(t) Mean (or observed) number of failures in (t, t + t) t (Assuming 1 failure in t) This result is useful if we want to estimate the ROCOF on the basis of a given curve (N(t) or E(N(t)). M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 12 / 38

Frequency measure for repairable systems We now assume that the failure rate, λ, of an item is constant (i.e., exponentially distributed time to failure for each item) In this case, the failures occur according to a homogeneous Poisson process with distribution: Pr(N (t) = n) = (λt)n e λt for n = 0, 1... n! The mean number of failures in an interval (0, t) is: E[N (t)] = n Pr(N (t) = n) = λt The ROCOF then becomes (for not too large values of t): n=0 w(t) = E[N (t)] t M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 13 / 38 = λ

Frequency measure for repairable systems Assuming the homogeneous Poisson process results in a ROCOF that has the same value as the rate λ, however the concepts have different meaning! M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 14 / 38

Frequency measure for repairable systems Vesely Failure Rate, here denoted λ V (t), is an alternative to the ROCOF failure frequency measure. It is defind as: λ V (t) = Pr(Failure in (t, t + t] X (t) = 1) t Pr(Failure in (t, t + t]) 1 Pr(X (t) = 1) t = w(t) A(t) This failure rate has the following attributes: It is a conditional probability of a failure in (t, t + t], given that the item is functioning at time t. It is not considered how many times the item has failed prior to t, just with what probability the item is functioning at time t. The ROCOF is therefore a fraction of the Vesely Failure Rate. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 15 / 38

PFH as a frequency measure PFH has a meaning that partly overlap with ROCOF: PFH may be regarded as ROCOF with respect to dangerous failures It is suggested in the textbook that the term PFH can be used as as a time dependent parameter as well as an average value, just as we have for ROCOF. IEC 61508 defines PFH as a frequency per hour. In principle, we could decide to use PFH per minute, per second, per year and so on, however tables in IEC 61508 is set up using per hour. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 16 / 38

The meaning of dangerous in PFH PFH is the (average) failure rate of dangerous failures per hour. But what is meant by dangerous in this context. Interpretation 1: It considers all failures that terminates the ability of a SIS to carry out its safety functions Interpretation 2: It considers those dangerous failures that results in a hazardous event within the EUC Confused? In the second option, we disregard certain types of dangerous failures, the dangerous detected (DD) failures IF the EUC is brought to a safe state as a response to the failures. This may be implemented for sensors and logic solvers, but now always applicable to final elements. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 17 / 38

The meaning of dangerous in PFH Why are DD failures a concern: In high-demand mode, demands occur so often that it may not be time to initiate or complete repair in before The probability that a demand with constant rate λ de occurs before the DD failure has been repaired, with mean time to restoration (MTTR), is: Pr(t < MTTR) λ de MTTR If the MTTR is 24 hours and λ DE is once every week, the probability that a demand occurs before restoration is approximately 14%. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 18 / 38

PFH of a SIF The PFH of a SIF is the sum of the PFH calculated for each subsystem in the same time interval, often (0,τ ) where τ is the proof test interval: PFH SIF (0,τ ) = PFH S (0,τ ) + PFH LS (0,τ ) + PFH FE (0,τ ) Where the notation S means sensors, LS means logic solver and FE means final element. Even with constant failure rate of items, the PFH at the subsystem level (and thereby also the SIF level) may be time dependent if one or more of the subsystems are redundant. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 19 / 38

Simplified formulas: PFH for a single system A single system may enter either a DD or DU failure state. In a time interval DD, we may experience first one (that is repaired) and then the other. Assumption: Only one failure can occur during a period of in the magnitude of the proof test interval. Since the system is a single system, we will shut down/stop the EUC upon a DD failure. The assumption means that we only consider the contribution from DU failrues: where PFH 1oo1 G = E[N G (τ )] τ E[N G (τ ) = 0 Pr(N (τ ) = 0) + 1 Pr(N (τ ) = 1) = (1 e λ DU τ ) λ DU τ which results in: PFH 1oo1 G = 1 e λ DU τ τ λ DU M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 20 / 38

Simplified formulas: PFH for a series structure The failure rate of a series structure is the sum of the failure rates: PFH noon G = n i=1 λ DU,i M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 21 / 38

Simplified formulas: PFH for a 1oo2 system Assumption: We maintain focus on DU failures only. No more than one DANGEROUS GROUP FAILURE during a period of a proof test interval, or another defined time interval of a magnitude of a year or so: where: PFH 1oo2 G = E[N G (τ )] τ E[N G (τ ) = 0 Pr(N (τ ) = 0) + 1 Pr(N (τ ) = 1) = (1 e λ DU τ ) 2 (λ DU τ ) 2 This results in: PFH 1oo2 G = λ 2 DU τ M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 22 / 38

Simplified formulas: PFH of a koon system Assumption: We maintain focus on DU failures only. No more than one DANGEROUS GROUP FAILURE during a period of a proof test interval, or another defined time interval of a magnitude of a year or so: where: This results in: PFH koon G = E[N G (τ )] τ E[N G (τ ) = 0 Pr(N (τ ) = 0) + 1 Pr(N (τ ) = 1) = Pr(M n k + 1) n ( ) n (λ DU τ ) j j j=n k+1 Pr(M = n k + 1) ( PFHG koon Pr(M = n k + 1) = = τ n n k + 1 ) λ n k+1 DU τ n k M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 23 / 38

Inclusion of DD failures Assumption: We study a 1oo2 system. Two possible situations for the first failure: (a) A DU failure occurs first, OR (b) A DD failure occurs first. The next (second) failure will result in a group failure. We assume that (n-k+1) DD failures always results in an automatic transition to the safe state. The approach is as before: First determine Pr(DGF in (0,τ )) (which under our assumptions is the same as E[(N G τ )]). Then determine: Pr(DGF in(0,τ ) PFH G = τ M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 24 / 38

Inclusion of DD failures: Option (a) Option (a) - (approximation found using Maple): Pr(DGF in(0,τ )) = τ 0 2λ DU e 2λ DU t (1 e (λ DU +λ DD )(τ t) )dt (λ DU τ ) 2 + λ DU λ DD τ 2 M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 25 / 38

Inclusion of DD failures: Option (b) Option (b) - approach 1 (approximation found using Maple): Pr(DGF in(0,τ )) = Option (b) - approach 2: τ 0 [ ] 2λ DD e 2λ DDt 1 e λ DU MTTR dt 2λ DU λ DD τ MTTR Pr(DGF in(0,τ )) = (1 e 2λ DDτ )(1 e λ DU MTTR ) 2λ DU λ DD τ MTTR Note the assumption that the second failure of interest is a DU failure only (as a double DD failure is assumed to result a commanded transition to the safe state) M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 26 / 38

Inclusion of DD failures: Options (a) AND (b) A DGF occurs if option (a) or option (b) occurs. This means that: Pr(DGF in(0,τ )) = (λ DU τ ) 2 + λ DU λ DD τ 2 + 2λ DU λ DD τ MTTR The PFH becomes then: PFH G = Pr(DGF in(0,τ ) τ = λ 2 DU τ + λ DUλ DD τ + 2λ DU λ DD MTTR M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 27 / 38

Section 9.4.2 - low high-demand mode When the demand rate is just slightly higher than one demand per year (e.g., once every 7-10 months), it is likely that any detected (DD) failure is repaired before the next demand. In this case, we may disregard the DD-failures in the PFH formulas. Pr(DGF in(0,τ )) = (λ DU τ ) 2 The PFH becomes then: PFH G = Pr(DGF in(0,τ ) τ = λ 2 DU τ M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 28 / 38

Section 9.4.2 - inclusion of CCFs The inclusion of CCFs is rather straight forward with the standard beta factor model: PFHG koon = PFHG,i koon + PFHG,c koon PFH G,i corresponds to the equation(s) developed on previous slides, but with λ DU replaced by (1 β)λ DU and λ DD replaced by (1 β D )λ DD, and: PFH G,c = βλ DU You may notice that β D λ DD was omitted. The main reason is that a double DD failure is assumed to result in a commanded transition to the safe state. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 29 / 38

IEC 61508 formulas: Assumptions The main assumption in IEC 61508 is that a group failure (DGF) occurs if a D (DU or DD) failure occurs first (independent failure), and then a DU failure. If a DD failure occurs as the last failure (independent or CCF), it results in a transition to the safe state. This last assumptions is questionable since it is impossible to know when the DD failure is the last of n failures, if one or more of the previous failures are (hidden) DU failures. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 30 / 38

IEC 61508 formulas: Basic approach The basic idea is to find what we previously have referred to as dangerous group failure (DGF). DGF is however not exactly the same as we determined it with PFD-formulas, because of the assumption that the the contribution DD-failure as the last failure is disregarded. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 31 / 38

IEC 61508 formulas: Examples Consider a 1oo2 system (without including CCFs): where t CE = λdu λd ( τ 2 PFH 1oo2 G = 2 λ D (1 e λ DU t CE ) 2λ D λ DU t CE λdd + MRT ) + λd MTTR. With CCFs included, we get: PFH 1oo2 G = 2 λ D (1 e (1 β )λ DU t CE ) + βλdu 2λ (I ) D λ DU t CE + βλ DU where and λ (i) D = (1 β)λ DU +(1 β D )λ DD. Note again that CCF contribution from DD failures are disregarded due to the initial assumption. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 32 / 38

IEC 61508 formulas We may use the same assumptions to derive the formula for 2oo3 system: PFH 2oo3 G = 3 λ (I ) D (1 e 2(1 β )λ DU t CE ) + βλdu 6λ D λ DU t CE + βλdu For a 1003 system, the system fails when the third failure occurs. This must happen in the mean downtime after two failures, which is: The equation becomes: t GE2 = λ DU λ D ( τ 3 + MRT ) + λ DD λ D MTTR PFHG 1oo3 = 3 λ (I ) D (1 ) e 2λ(I D t CE )(1 e (1 β )λ DU t GE2 ) + βλdu = 6λ 2 D (1 β)λ DU t CE t GE2 + βλdu Note that the group equivalent mean downtime is never part of PFH formulas (therefore no t GE which is previously used to denote this special case. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 33 / 38

IEC 61508 formula- alternatives We can also use an alternative approach to derive the IEC 61508 formulas, using the relationship: PFH G = PFD avg t GE However, it should be noted the assumption about DD failure to be disregarded as the last failure is not captured here. M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 34 / 38

IEC 61508 formula- alternatives For a koon system, we get: n k+1 PFH G = ( (n i + 1))(λ (I ) n k D )n k (1 β)λ DU ( t GEi )t CE + βλ DU i=1 Note that there is an error in formula (9.59) in the SIS book, so the one above has been developed for the purpose of the slides. It is in line with the new version of section 8.4, formula 8.48, found under errata for the textbook. i=2 M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 35 / 38

Markov methods The use of Markov approach to solve for PFH is very similar to what we did for PFD. 1. Define the system states 2. Set up the state transition diagram 3. Calculate the steady state probabilities 1 4. Determine PFH by considering all "jumps" into the dangerous states 2 1 Alternatively, one can calculate the time dependent probabilities. 2 Alternatively, if time dependent probabilities have been calculated: Integrate and average the "jumps" into the dangerous states during a defined time interval M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 36 / 38

Example Consider a 2oo4 system of independent channels that may fail with a dangerous failure rate λ D. State State description 0 Four channels are functioning 1 Three channels are functioning, one has failed 2 Two channels are functioning, two have failed 3 One channel is functioning and three are failed The state transition diagram becomes: 4λ D 3λ D 2λ D λ D 4OK 3OK 1F 2OK 2F 1OK 3F 4F 0 1 2 3 4 μ 2μ 3μ 4μ Non-dangerous states Dangerous states Non-critical Critical M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 37 / 38

Example continued PFH becomes: PFH G = P 2 2λ D where P 2 is (by using MAPLE) is: P 2 = 6λ 2 D µ2 µ 2 + µ 3 λ D + 6µ 2 λ 2 D + 4µλ3 D + λ4 D M.A.Lundteigen (RAMS Group) Reliability of Safety-Critical Systems (Version 1.1) 38 / 38