Fault-Tolerant Computing

Size: px
Start display at page:

Download "Fault-Tolerant Computing"

Transcription

1 Fault-Tolerant Computing Motivation, Background, and Tools Slide 1

2 About This Presentation This presentation has been prepared for the graduate course ECE 257A (Fault-Tolerant Computing) by Behrooz Parhami, Professor of Electrical and Computer Engineering at University of California, Santa Barbara. The material contained herein can be used freely in classroom teaching or any other educational setting. Unauthorized uses are prohibited. Behrooz Parhami Edition Released Revised Revised First Slide 2

3 for Dependability Slide 3

4 Slide 4

5 Impairments to Dependability Hazard Bug Fault Degradation Defect ERROR Malfunction Crash Intrusion Slide 5 Flaw

6 The Fault-Error- Cycle Includes both components and design Aspect Structure State Behavior Impairment Fault Error Fault 0 Correct signal 0 0 Replaced with NAND? Schematic diagram of the Newcastle hierarchical model and the impairments within one level. Slide 6

7 The Four-Universe Model Universe Physical Logical Informational External Impairment Fault Error Crash Cause-effect diagram for Avižienis four-universe model of impairments to dependability. Slide 7

8 Unrolling the Fault-Error- Cycle Abstraction Impairment Aspect Structure State Behavior Impairment Fault Error First Cycle Second Cycle Component Logic Information System Service Result Defect Fault Error Malfunction Degradation Low- Level Mid- Level High- Level Cause-effect diagram for an extended six-level view of impairments to dependability. Slide 8

9 Multilevel Model Ideal Component Logic Information System Legend: Legned: Initial Entry Entry Deviation Defective Faulty Erroneous Malfunctioning Low-Level Impaired Mid-Level Impaired Service Result Remedy Tolerance Degraded Failed High-Level Impaired Slide 9

10 Analogy for the Multilevel Model An analogy for our multi-level model of dependable computing. Defects, faults, errors, malfunctions, degradations, and failures are represented by pouring water from above. Valves represent avoidance and tolerance techniques. The goal is to avoid overflow. Wall heights represent inter-level latencies Concentric reservoirs are analogs of the six model levels, with defect being innermost Inlet valves represent avoidance techniques I I I I I I I I I I I Drain valves represent tolerance techniques I Slide 10

11 Why Our Concern with Dependability? Reliability of n-transistor system, each having failure rate λ R(t) = e nλt There are only 3 ways of making systems more reliable Reduce λ Reduce n Reduce t Alternative: Change the reliability formula by introducing redundancy in system e n λ t nt Slide 11

12 Highly Dependable Computer Systems Long-life systems: Fail-slow, Rugged, High-reliability Spacecraft with multiyear missions, systems in inaccessible locations Methods: Replication (spares), error coding, monitoring, shielding Safety-critical systems: Fail-safe, Sound, High-integrity Flight control computers, nuclear-plant shutdown, medical monitoring Methods: Replication with voting, time redundancy, design diversity High-availability: Fail-soft, Robust, High-availability Telephone switching centers, transaction processing, e-commerce Methods: HW/info redundancy, backup schemes, hot-swap, recovery Just as performance enhancement techniques gradually migrate from supercomputers to desktops, so too dependability enhancement methods find their way from exotic systems into personal computers Slide 12

13 Aspects of Dependability Safety Resilience Risk, consequence Testability Security Controllability, observability Availability Pointwise av., Interval av., MTBF, MTTR RELIABILITY Reliability, MTTF = MTFF Maintainability Performability Robustness Integrity Performability, MCBF Slide 13 Serviceability

14 Concepts from Probability Theory Probability density function: pdf f(t) = prob[t x t + dt] / dt = df(t) / dt Cumulative distribution function: CDF t F(t) = prob[x t] = 0 f(x) dx Liftimes of 20 identical systems Expected value of x + E x = xf(x) dx = k x k f(x k ) Variance of x 2 + σ x = (x E x ) 2 f(x) dx = k (x k E x ) 2 f(x k ) Covariance of x and y ψ x,y = E [(x E x )(y E y )] = E [xy] E x E y Time CDF pdf f(t) Time Time F(t) Slide 14

15 Some Simple Probability Distributions F(x) 1 CDF CDF CDF CDF f(x) pdf pdf pdf Uniform Exponential Normal Binomial Slide 15

16 Reliability and MTTF Reliability: R(t) Probability that system remains in the Good state through the interval [0, t] R(t + dt) = R(t) [1 z(t) dt] Hazard function Start State Good Two-state nonrepairable system Failed R(t) = 1 F(t) CDF of the system lifetime, or its unreliability Constant hazard function z(t) = λ R(t) = e λt (system failure rate is independent of its age) Mean time to failure: MTTF + + MTTF = 0 tf(t) dt = 0 R(t) dt Expected value of lifetime Area under the reliability curve (easily provable) Exponential reliability law Slide 16

17 Distributions of Interest Exponential: z(t) = λ R(t) = e λt MTTF = 1/λ Rayleigh: z(t) = 2λ(λt) R(t) = e ( λt)2 MTTF = (1/λ) π / 2 Discrete versions Geometric R(k) = q k Weibull: z(t) = αλ(λt) α 1 R(t) = e ( λt)α MTTF = (1/λ) Γ(1 + 1/α) Discrete Weibull Erlang: MTTF = k/λ Gamma: Erlang and exponential are special cases Normal: Reliability and MTTF formulas are complicated Binomial Slide 17

18 Reliability difference: R 2 R 1 Comparing Reliabilities Reliability gain: R 2 / R 1 Reliability improvement factor RIF 2/1 = [1 R 1 (t M )] / [1 R 2 (t M )] Example: [1 0.9] / [1 0.99] = 10 Reliability improv. index RII = log R 1 (t M ) / log R 2 (t M ) Mission time extension MTE 2/1 (r G ) = T 2 (r G ) T 1 (r G ) System Reliability (R) 1.0 R 2(t M) r R (t ) 1 G M Reliability functions for Systems 1/2 R (t) 2 R (t) 1 Mission time improv. factor: MTIF 2/1 (r G ) = T 2 (r G ) / T 1 (r G ) 0.0 T 1(r ) t T (r ) G M 2 G MTTF 2 Time (t) MTTF 1 Slide 18

19 Availability, MTTR, and MTBF (Interval) Availability: A(t) Fraction of time that system is in the Up state during the interval [0, t] Steady-state availability: A = lim t A(t) Pointwise availability: a(t) Probability that system available at time t t A(t) = (1/t) 0 a(x) dx Start State Two-state repairable system Up Repair Down Availability = Reliability, when there is no repair Availability is a function not only of how rarely a system fails (reliability) but also of how quickly it can be repaired (time to repair) MTTF MTTF μ A = = = MTTF + MTTR MTBF λ + μ In general, μ >> λ, leading to A 1 Repair rate 1/μ = MTTR (Will justify this equation later) Slide 19

20 System Up and Down Times Short repair time implies good Maintainability (serviceability) Start State Up Repair Down Time to first failure Time between failures Repair time Up Down 0 t 1 t' 1 t 2 t' 2 t Time Slide 20

21 Performability and MCBF Performability: P Composite measure, incorporating both performance and reliability Simple example Worth of Up2 twice that of Up1 t p Upi = probability system is in state Upi P = 2p Up2 + p Up1 Start State Repair Up 2 Up 1 Partial failure Three-state degradable system Partial repair Down Question: What is system availability here? p Up2 = 0.92, p Up1 = 0.06, p Down = 0.02, P = 1.90 (system performance equiv. To that of 1.9 processors on average) Performability improvement factor of this system (akin to RIF) relative to a fail-hard system that goes down when either processor fails: PIF = ( ) / (2 1.90) = 1.6 Slide 21

22 System Up, Partially Up, and Down Times Important to prevent direct transitions to the Down state (coverage) Start State Repair Up 2 Up 1 Partial failure Partial repair Down Up Partial Partially Up Total Partial Repair Down 0 t 1 t 2 t' 2 t' 1 t 3 t' 3 t Time Slide 22

23 Integrity and Safety Risk: Prob. of being in Unsafe Failed state There may be multiple unsafe states, each with a different consequence (cost) Simple analysis Lump Safe Failed state with Good state; proceed as in reliability analysis More detailed analysis Even though Safe Failed state is more desirable than Unsafe Failed, it is still not as desirable as the Good state; so keeping it separate makes sense Start State Good Three-state fail-safe system Safe Failed Unsafe Failed For example, if a repair transition is introduced between Safe Failed and Good states, we can tackle questions such as the expected outage of the system in safe mode, and thus its availability Slide 23

Fault-Tolerant Computing

Fault-Tolerant Computing Fault-Tolerant Computing Motivation, Background, and Tools Slide 1 About This Presentation This presentation has been prepared for the graduate course ECE 257A (Fault-Tolerant Computing) by Behrooz Parhami,

More information

UNIVERSITY OF MASSACHUSETTS Dept. of Electrical & Computer Engineering. Fault Tolerant Computing ECE 655

UNIVERSITY OF MASSACHUSETTS Dept. of Electrical & Computer Engineering. Fault Tolerant Computing ECE 655 UNIVERSITY OF MASSACHUSETTS Dept. of Electrical & Computer Engineering Fault Tolerant Computing ECE 655 Part 1 Introduction C. M. Krishna Fall 2006 ECE655/Krishna Part.1.1 Prerequisites Basic courses in

More information

Terminology and Concepts

Terminology and Concepts Terminology and Concepts Prof. Naga Kandasamy 1 Goals of Fault Tolerance Dependability is an umbrella term encompassing the concepts of reliability, availability, performability, safety, and testability.

More information

CHAPTER 10 RELIABILITY

CHAPTER 10 RELIABILITY CHAPTER 10 RELIABILITY Failure rates Reliability Constant failure rate and exponential distribution System Reliability Components in series Components in parallel Combination system 1 Failure Rate Curve

More information

Dependable Systems. ! Dependability Attributes. Dr. Peter Tröger. Sources:

Dependable Systems. ! Dependability Attributes. Dr. Peter Tröger. Sources: Dependable Systems! Dependability Attributes Dr. Peter Tröger! Sources:! J.C. Laprie. Dependability: Basic Concepts and Terminology Eusgeld, Irene et al.: Dependability Metrics. 4909. Springer Publishing,

More information

Reliable Computing I

Reliable Computing I Instructor: Mehdi Tahoori Reliable Computing I Lecture 5: Reliability Evaluation INSTITUTE OF COMPUTER ENGINEERING (ITEC) CHAIR FOR DEPENDABLE NANO COMPUTING (CDNC) National Research Center of the Helmholtz

More information

CHAPTER 3 ANALYSIS OF RELIABILITY AND PROBABILITY MEASURES

CHAPTER 3 ANALYSIS OF RELIABILITY AND PROBABILITY MEASURES 27 CHAPTER 3 ANALYSIS OF RELIABILITY AND PROBABILITY MEASURES 3.1 INTRODUCTION The express purpose of this research is to assimilate reliability and its associated probabilistic variables into the Unit

More information

EE 445 / 850: Final Examination

EE 445 / 850: Final Examination EE 445 / 850: Final Examination Date and Time: 3 Dec 0, PM Room: HLTH B6 Exam Duration: 3 hours One formula sheet permitted. - Covers chapters - 5 problems each carrying 0 marks - Must show all calculations

More information

Dependable Computer Systems

Dependable Computer Systems Dependable Computer Systems Part 3: Fault-Tolerance and Modelling Contents Reliability: Basic Mathematical Model Example Failure Rate Functions Probabilistic Structural-Based Modeling: Part 1 Maintenance

More information

Fault Tolerance. Dealing with Faults

Fault Tolerance. Dealing with Faults Fault Tolerance Real-time computing systems must be fault-tolerant: they must be able to continue operating despite the failure of a limited subset of their hardware or software. They must also allow graceful

More information

Part 3: Fault-tolerance and Modeling

Part 3: Fault-tolerance and Modeling Part 3: Fault-tolerance and Modeling Course: Dependable Computer Systems 2012, Stefan Poledna, All rights reserved part 3, page 1 Goals of fault-tolerance modeling Design phase Designing and implementing

More information

Modeling and Simulation of Communication Systems and Networks Chapter 1. Reliability

Modeling and Simulation of Communication Systems and Networks Chapter 1. Reliability Modeling and Simulation of Communication Systems and Networks Chapter 1. Reliability Prof. Jochen Seitz Technische Universität Ilmenau Communication Networks Research Lab Summer 2010 1 / 20 Outline 1 1.1

More information

Signal Handling & Processing

Signal Handling & Processing Signal Handling & Processing The output signal of the primary transducer may be too small to drive indicating, recording or control elements directly. Or it may be in a form which is not convenient for

More information

Page 1. Outline. Modeling. Experimental Methodology. ECE 254 / CPS 225 Fault Tolerant and Testable Computing Systems. Modeling and Evaluation

Page 1. Outline. Modeling. Experimental Methodology. ECE 254 / CPS 225 Fault Tolerant and Testable Computing Systems. Modeling and Evaluation Page 1 Outline ECE 254 / CPS 225 Fault Tolerant and Testable Computing Systems Modeling and Evaluation Copyright 2004 Daniel J. Sorin Duke University Experimental Methodology and Modeling Modeling Random

More information

9. Reliability theory

9. Reliability theory Material based on original slides by Tuomas Tirronen ELEC-C720 Modeling and analysis of communication networks Contents Introduction Structural system models Reliability of structures of independent repairable

More information

Chapter Learning Objectives. Probability Distributions and Probability Density Functions. Continuous Random Variables

Chapter Learning Objectives. Probability Distributions and Probability Density Functions. Continuous Random Variables Chapter 4: Continuous Random Variables and Probability s 4-1 Continuous Random Variables 4-2 Probability s and Probability Density Functions 4-3 Cumulative Functions 4-4 Mean and Variance of a Continuous

More information

Concept of Reliability

Concept of Reliability Concept of Reliability Prepared By Dr. M. S. Memon Department of Industrial Engineering and Management Mehran University of Engineering and Technology Jamshoro, Sindh, Pakistan RELIABILITY Reliability

More information

Slides 8: Statistical Models in Simulation

Slides 8: Statistical Models in Simulation Slides 8: Statistical Models in Simulation Purpose and Overview The world the model-builder sees is probabilistic rather than deterministic: Some statistical model might well describe the variations. An

More information

Page 1. Outline. Experimental Methodology. Modeling. ECE 254 / CPS 225 Fault Tolerant and Testable Computing Systems. Modeling and Evaluation

Page 1. Outline. Experimental Methodology. Modeling. ECE 254 / CPS 225 Fault Tolerant and Testable Computing Systems. Modeling and Evaluation Outline Fault Tolerant and Testable Computing Systems Modeling and Evaluation Copyright 2011 Daniel J. Sorin Duke University Experimental Methodology and Modeling Random Variables Probabilistic Models

More information

Evaluation and Validation

Evaluation and Validation Evaluation and Validation Peter Marwedel TU Dortmund, Informatik 12 Germany Graphics: Alexandra Nolte, Gesine Marwedel, 2003 2011 06 18 These slides use Microsoft clip arts. Microsoft copyright restrictions

More information

Engineering Risk Benefit Analysis

Engineering Risk Benefit Analysis Engineering Risk Benefit Analysis 1.155, 2.943, 3.577, 6.938, 10.816, 13.621, 16.862, 22.82, ESD.72, ESD.721 RPRA 3. Probability Distributions in RPRA George E. Apostolakis Massachusetts Institute of Technology

More information

Reliability Engineering I

Reliability Engineering I Happiness is taking the reliability final exam. Reliability Engineering I ENM/MSC 565 Review for the Final Exam Vital Statistics What R&M concepts covered in the course When Monday April 29 from 4:30 6:00

More information

Key Words: Lifetime Data Analysis (LDA), Probability Density Function (PDF), Goodness of fit methods, Chi-square method.

Key Words: Lifetime Data Analysis (LDA), Probability Density Function (PDF), Goodness of fit methods, Chi-square method. Reliability prediction based on lifetime data analysis methodology: The pump case study Abstract: The business case aims to demonstrate the lifetime data analysis methodology application from the historical

More information

Quantitative evaluation of Dependability

Quantitative evaluation of Dependability Quantitative evaluation of Dependability 1 Quantitative evaluation of Dependability Faults are the cause of errors and failures. Does the arrival time of faults fit a probability distribution? If so, what

More information

Evaluation and Validation

Evaluation and Validation Evaluation and Validation Jian-Jia Chen (slides are based on Peter Marwedel) TU Dortmund, Informatik 12 Germany Springer, 2010 2018 年 01 月 17 日 These slides use Microsoft clip arts. Microsoft copyright

More information

CHAPTER 3 MATHEMATICAL AND SIMULATION TOOLS FOR MANET ANALYSIS

CHAPTER 3 MATHEMATICAL AND SIMULATION TOOLS FOR MANET ANALYSIS 44 CHAPTER 3 MATHEMATICAL AND SIMULATION TOOLS FOR MANET ANALYSIS 3.1 INTRODUCTION MANET analysis is a multidimensional affair. Many tools of mathematics are used in the analysis. Among them, the prime

More information

Quantitative evaluation of Dependability

Quantitative evaluation of Dependability Quantitative evaluation of Dependability 1 Quantitative evaluation of Dependability Faults are the cause of errors and failures. Does the arrival time of faults fit a probability distribution? If so, what

More information

57:022 Principles of Design II Final Exam Solutions - Spring 1997

57:022 Principles of Design II Final Exam Solutions - Spring 1997 57:022 Principles of Design II Final Exam Solutions - Spring 1997 Part: I II III IV V VI Total Possible Pts: 52 10 12 16 13 12 115 PART ONE Indicate "+" if True and "o" if False: + a. If a component's

More information

B.H. Far

B.H. Far SENG 521 Software Reliability & Software Quality Chapter 6: Software Reliability Models Department of Electrical & Computer Engineering, University of Calgary B.H. Far (far@ucalgary.ca) http://www.enel.ucalgary.ca/people/far/lectures/seng521

More information

These notes will supplement the textbook not replace what is there. defined for α >0

These notes will supplement the textbook not replace what is there. defined for α >0 Gamma Distribution These notes will supplement the textbook not replace what is there. Gamma Function ( ) = x 0 e dx 1 x defined for >0 Properties of the Gamma Function 1. For any >1 () = ( 1)( 1) Proof

More information

Tradeoff between Reliability and Power Management

Tradeoff between Reliability and Power Management Tradeoff between Reliability and Power Management 9/1/2005 FORGE Lee, Kyoungwoo Contents 1. Overview of relationship between reliability and power management 2. Dakai Zhu, Rami Melhem and Daniel Moss e,

More information

Lecture 5 Probability

Lecture 5 Probability Lecture 5 Probability Dr. V.G. Snell Nuclear Reactor Safety Course McMaster University vgs 1 Probability Basic Ideas P(A)/probability of event A 'lim n64 ( x n ) (1) (Axiom #1) 0 # P(A) #1 (1) (Axiom #2):

More information

Monte Carlo Simulation for Reliability and Availability analyses

Monte Carlo Simulation for Reliability and Availability analyses Monte Carlo Simulation for Reliability and Availability analyses Monte Carlo Simulation: Why? 2 Computation of system reliability and availability for industrial systems characterized by: many components

More information

Chapter 9 Part II Maintainability

Chapter 9 Part II Maintainability Chapter 9 Part II Maintainability 9.4 System Repair Time 9.5 Reliability Under Preventive Maintenance 9.6 State-Dependent Systems with Repair C. Ebeling, Intro to Reliability & Maintainability Chapter

More information

Ching-Han Hsu, BMES, National Tsing Hua University c 2015 by Ching-Han Hsu, Ph.D., BMIR Lab. = a + b 2. b a. x a b a = 12

Ching-Han Hsu, BMES, National Tsing Hua University c 2015 by Ching-Han Hsu, Ph.D., BMIR Lab. = a + b 2. b a. x a b a = 12 Lecture 5 Continuous Random Variables BMIR Lecture Series in Probability and Statistics Ching-Han Hsu, BMES, National Tsing Hua University c 215 by Ching-Han Hsu, Ph.D., BMIR Lab 5.1 1 Uniform Distribution

More information

Safety and Reliability of Embedded Systems

Safety and Reliability of Embedded Systems (Sicherheit und Zuverlässigkeit eingebetteter Systeme) Fault Tree Analysis Mathematical Background and Algorithms Prof. Dr. Liggesmeyer, 0 Content Definitions of Terms Introduction to Combinatorics General

More information

Stochastic Renewal Processes in Structural Reliability Analysis:

Stochastic Renewal Processes in Structural Reliability Analysis: Stochastic Renewal Processes in Structural Reliability Analysis: An Overview of Models and Applications Professor and Industrial Research Chair Department of Civil and Environmental Engineering University

More information

Introduction to Reliability Theory (part 2)

Introduction to Reliability Theory (part 2) Introduction to Reliability Theory (part 2) Frank Coolen UTOPIAE Training School II, Durham University 3 July 2018 (UTOPIAE) Introduction to Reliability Theory 1 / 21 Outline Statistical issues Software

More information

System Simulation Part II: Mathematical and Statistical Models Chapter 5: Statistical Models

System Simulation Part II: Mathematical and Statistical Models Chapter 5: Statistical Models System Simulation Part II: Mathematical and Statistical Models Chapter 5: Statistical Models Fatih Cavdur fatihcavdur@uludag.edu.tr March 20, 2012 Introduction Introduction The world of the model-builder

More information

Quiz #2 A Mighty Fine Review

Quiz #2 A Mighty Fine Review Quiz #2 A Mighty Fine Review February 27: A reliable adventure; a day like all days filled with those events that alter and change the course of history and you will be there! What is a Quiz #2? Three

More information

Considering Security Aspects in Safety Environment. Dipl.-Ing. Evzudin Ugljesa

Considering Security Aspects in Safety Environment. Dipl.-Ing. Evzudin Ugljesa Considering Security spects in Safety Environment Dipl.-ng. Evzudin Ugljesa Overview ntroduction Definitions of safety relevant parameters Description of the oo4-architecture Calculation of the FD-Value

More information

Reliability and Availability Simulation. Krige Visser, Professor, University of Pretoria, South Africa

Reliability and Availability Simulation. Krige Visser, Professor, University of Pretoria, South Africa Reliability and Availability Simulation Krige Visser, Professor, University of Pretoria, South Africa Content BACKGROUND DEFINITIONS SINGLE COMPONENTS MULTI-COMPONENT SYSTEMS AVAILABILITY SIMULATION CONCLUSION

More information

I I FINAL, 01 Jun 8.4 to 31 May TITLE AND SUBTITLE 5 * _- N, '. ', -;

I I FINAL, 01 Jun 8.4 to 31 May TITLE AND SUBTITLE 5 * _- N, '. ', -; R AD-A237 850 E........ I N 11111IIIII U 1 1I!til II II... 1. AGENCY USE ONLY Leave 'VanK) I2. REPORT DATE 3 REPORT TYPE AND " - - I I FINAL, 01 Jun 8.4 to 31 May 88 4. TITLE AND SUBTITLE 5 * _- N, '.

More information

Closed book and notes. 60 minutes. Cover page and four pages of exam. No calculators.

Closed book and notes. 60 minutes. Cover page and four pages of exam. No calculators. IE 230 Seat # Closed book and notes. 60 minutes. Cover page and four pages of exam. No calculators. Score Exam #3a, Spring 2002 Schmeiser Closed book and notes. 60 minutes. 1. True or false. (for each,

More information

Satisfiability A Lecture in CE Freshman Seminar Series: Ten Puzzling Problems in Computer Engineering. Apr Satisfiability Slide 1

Satisfiability A Lecture in CE Freshman Seminar Series: Ten Puzzling Problems in Computer Engineering. Apr Satisfiability Slide 1 Satisfiability A Lecture in CE Freshman Seminar Series: Ten Puzzling Problems in Computer Engineering Apr. 206 Satisfiability Slide About This Presentation This presentation belongs to the lecture series

More information

Practical Applications of Reliability Theory

Practical Applications of Reliability Theory Practical Applications of Reliability Theory George Dodson Spallation Neutron Source Managed by UT-Battelle Topics Reliability Terms and Definitions Reliability Modeling as a tool for evaluating system

More information

Chapter 15. System Reliability Concepts and Methods. William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University

Chapter 15. System Reliability Concepts and Methods. William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University Chapter 15 System Reliability Concepts and Methods William Q. Meeker and Luis A. Escobar Iowa State University and Louisiana State University Copyright 1998-2008 W. Q. Meeker and L. A. Escobar. Based on

More information

BMIR Lecture Series on Probability and Statistics Fall, 2015 Uniform Distribution

BMIR Lecture Series on Probability and Statistics Fall, 2015 Uniform Distribution Lecture #5 BMIR Lecture Series on Probability and Statistics Fall, 2015 Department of Biomedical Engineering and Environmental Sciences National Tsing Hua University s 5.1 Definition ( ) A continuous random

More information

IoT Network Quality/Reliability

IoT Network Quality/Reliability IoT Network Quality/Reliability IEEE PHM June 19, 2017 Byung K. Yi, Dr. of Sci. Executive V.P. & CTO, InterDigital Communications, Inc Louis Kerofsky, PhD. Director of Partner Development InterDigital

More information

Chapter 5. System Reliability and Reliability Prediction.

Chapter 5. System Reliability and Reliability Prediction. Chapter 5. System Reliability and Reliability Prediction. Problems & Solutions. Problem 1. Estimate the individual part failure rate given a base failure rate of 0.0333 failure/hour, a quality factor of

More information

ECE 313 Probability with Engineering Applications Fall 2000

ECE 313 Probability with Engineering Applications Fall 2000 Exponential random variables Exponential random variables arise in studies of waiting times, service times, etc X is called an exponential random variable with parameter λ if its pdf is given by f(u) =

More information

B.H. Far

B.H. Far SENG 637 Dependability, Reliability & Testing of Software Systems Chapter 3: System Reliability Department of Electrical & Computer Engineering, University of Calgary B.H. Far (far@ucalgary.ca) http://www.enel.ucalgary.ca/people/far/lectures/seng637/

More information

Comparative Distributions of Hazard Modeling Analysis

Comparative Distributions of Hazard Modeling Analysis Comparative s of Hazard Modeling Analysis Rana Abdul Wajid Professor and Director Center for Statistics Lahore School of Economics Lahore E-mail: drrana@lse.edu.pk M. Shuaib Khan Department of Statistics

More information

Stochastic Analysis of a Two-Unit Cold Standby System with Arbitrary Distributions for Life, Repair and Waiting Times

Stochastic Analysis of a Two-Unit Cold Standby System with Arbitrary Distributions for Life, Repair and Waiting Times International Journal of Performability Engineering Vol. 11, No. 3, May 2015, pp. 293-299. RAMS Consultants Printed in India Stochastic Analysis of a Two-Unit Cold Standby System with Arbitrary Distributions

More information

Computer Simulation of Repairable Processes

Computer Simulation of Repairable Processes SEMATECH 1996 Applied Reliability Tools Workshop (ARTWORK IX) Santa Fe Computer Simulation of Repairable Processes Dave Trindade, Ph.D. Senior AMD Fellow Applied Statistics Introduction Computer simulation!

More information

3 Continuous Random Variables

3 Continuous Random Variables Jinguo Lian Math437 Notes January 15, 016 3 Continuous Random Variables Remember that discrete random variables can take only a countable number of possible values. On the other hand, a continuous random

More information

Non-observable failure progression

Non-observable failure progression Non-observable failure progression 1 Age based maintenance policies We consider a situation where we are not able to observe failure progression, or where it is impractical to observe failure progression:

More information

Evaluation criteria for reliability in computer systems

Evaluation criteria for reliability in computer systems Journal of Electrical and Electronic Engineering 5; 3(-): 83-87 Published online February, 5 (http://www.sciencepublishinggroup.com/j/jeee) doi:.648/j.jeee.s.53.8 ISSN: 39-63 (Print); ISSN: 39-65 (Online)

More information

Reliability of Safety-Critical Systems 5.1 Reliability Quantification with RBDs

Reliability of Safety-Critical Systems 5.1 Reliability Quantification with RBDs Reliability of Safety-Critical Systems 5.1 Reliability Quantification with RBDs Mary Ann Lundteigen and Marvin Rausand mary.a.lundteigen@ntnu.no &marvin.rausand@ntnu.no RAMS Group Department of Production

More information

Quantitative Reliability Analysis

Quantitative Reliability Analysis Quantitative Reliability Analysis Moosung Jae May 4, 2015 System Reliability Analysis System reliability analysis is conducted in terms of probabilities The probabilities of events can be modelled as logical

More information

B.H. Far

B.H. Far SENG 521 Software Reliability & Software Quality Chapter 8: System Reliability Department of Electrical & Computer Engineering, University of Calgary B.H. Far (far@ucalgary.ca) http://www.enel.ucalgary.ca/people/far/lectures/seng521

More information

Dependability Analysis

Dependability Analysis Software and Systems Verification (VIMIMA01) Dependability Analysis Istvan Majzik Budapest University of Technology and Economics Fault Tolerant Systems Research Group Budapest University of Technology

More information

Combinational Techniques for Reliability Modeling

Combinational Techniques for Reliability Modeling Combinational Techniques for Reliability Modeling Prof. Naga Kandasamy, ECE Department Drexel University, Philadelphia, PA 19104. January 24, 2009 The following material is derived from these text books.

More information

ELE 491 Senior Design Project Proposal

ELE 491 Senior Design Project Proposal ELE 491 Senior Design Project Proposal These slides are loosely based on the book Design for Electrical and Computer Engineers by Ford and Coulston. I have used the sources referenced in the book freely

More information

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr.

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr. Simulation Discrete-Event System Simulation Chapter 4 Statistical Models in Simulation Purpose & Overview The world the model-builder sees is probabilistic rather than deterministic. Some statistical model

More information

Availability. M(t) = 1 - e -mt

Availability. M(t) = 1 - e -mt Availability Availability - A(t) the probability that the system is operating correctly and is available to perform its functions at the instant of time t More general concept than reliability: failure

More information

IEOR 6711: Stochastic Models I, Fall 2003, Professor Whitt. Solutions to Final Exam: Thursday, December 18.

IEOR 6711: Stochastic Models I, Fall 2003, Professor Whitt. Solutions to Final Exam: Thursday, December 18. IEOR 6711: Stochastic Models I, Fall 23, Professor Whitt Solutions to Final Exam: Thursday, December 18. Below are six questions with several parts. Do as much as you can. Show your work. 1. Two-Pump Gas

More information

Time-varying failure rate for system reliability analysis in large-scale railway risk assessment simulation

Time-varying failure rate for system reliability analysis in large-scale railway risk assessment simulation Time-varying failure rate for system reliability analysis in large-scale railway risk assessment simulation H. Zhang, E. Cutright & T. Giras Center of Rail Safety-Critical Excellence, University of Virginia,

More information

An Integral Measure of Aging/Rejuvenation for Repairable and Non-repairable Systems

An Integral Measure of Aging/Rejuvenation for Repairable and Non-repairable Systems An Integral Measure of Aging/Rejuvenation for Repairable and Non-repairable Systems M.P. Kaminskiy and V.V. Krivtsov Abstract This paper introduces a simple index that helps to assess the degree of aging

More information

JRF (Quality, Reliability and Operations Research): 2013 INDIAN STATISTICAL INSTITUTE INSTRUCTIONS

JRF (Quality, Reliability and Operations Research): 2013 INDIAN STATISTICAL INSTITUTE INSTRUCTIONS JRF (Quality, Reliability and Operations Research): 2013 INDIAN STATISTICAL INSTITUTE INSTRUCTIONS The test is divided into two sessions (i) Forenoon session and (ii) Afternoon session. Each session is

More information

Software Reliability.... Yes sometimes the system fails...

Software Reliability.... Yes sometimes the system fails... Software Reliability... Yes sometimes the system fails... Motivations Software is an essential component of many safety-critical systems These systems depend on the reliable operation of software components

More information

The random counting variable. Barbara Russo

The random counting variable. Barbara Russo The random counting variable Barbara Russo Counting Random Variable } Until now we have seen the point process through two sets of random variables } T i } X i } We introduce a new random variable the

More information

Chapter 4 Continuous Random Variables and Probability Distributions

Chapter 4 Continuous Random Variables and Probability Distributions Chapter 4 Continuous Random Variables and Probability Distributions Part 1: Continuous Random Variables Section 4.1 Continuous Random Variables Section 4.2 Probability Distributions & Probability Density

More information

Failure rate in the continuous sense. Figure. Exponential failure density functions [f(t)] 1

Failure rate in the continuous sense. Figure. Exponential failure density functions [f(t)] 1 Failure rate (Updated and Adapted from Notes by Dr. A.K. Nema) Part 1: Failure rate is the frequency with which an engineered system or component fails, expressed for example in failures per hour. It is

More information

Reliability, Redundancy, and Resiliency

Reliability, Redundancy, and Resiliency Review of probability theory Component reliability Confidence Redundancy Reliability diagrams Intercorrelated failures System resiliency Resiliency in fixed fleets Perspective on the term project 1 2010

More information

Cyber Physical Power Systems Power in Communications

Cyber Physical Power Systems Power in Communications 1 Cyber Physical Power Systems Power in Communications Information and Communications Tech. Power Supply 2 ICT systems represent a noticeable (about 5 % of total t demand d in U.S.) fast increasing load.

More information

FAULT TOLERANT DESIGN: AN INTRODUCTION

FAULT TOLERANT DESIGN: AN INTRODUCTION FAULT TOLERANT DESIGN: AN INTRODUCTION ELENA DUBROVA Department of Microelectronics and Information Technology Royal Institute of Technology Stockholm, Sweden Kluwer Academic Publishers Boston/Dordrecht/London

More information

Markov Reliability and Availability Analysis. Markov Processes

Markov Reliability and Availability Analysis. Markov Processes Markov Reliability and Availability Analysis Firma convenzione Politecnico Part II: Continuous di Milano e Time Veneranda Discrete Fabbrica State del Duomo di Milano Markov Processes Aula Magna Rettorato

More information

10 Introduction to Reliability

10 Introduction to Reliability 0 Introduction to Reliability 10 Introduction to Reliability The following notes are based on Volume 6: How to Analyze Reliability Data, by Wayne Nelson (1993), ASQC Press. When considering the reliability

More information

Reliability of Technical Systems

Reliability of Technical Systems Main Topics 1. Introduction, Key Terms, Framing the Problem 2. Reliability Parameters: Failure Rate, Failure Probability, etc. 3. Some Important Reliability Distributions 4. Component Reliability 5. Software

More information

Applied Statistics and Probability for Engineers. Sixth Edition. Chapter 4 Continuous Random Variables and Probability Distributions.

Applied Statistics and Probability for Engineers. Sixth Edition. Chapter 4 Continuous Random Variables and Probability Distributions. Applied Statistics and Probability for Engineers Sixth Edition Douglas C. Montgomery George C. Runger Chapter 4 Continuous Random Variables and Probability Distributions 4 Continuous CHAPTER OUTLINE Random

More information

Chapter 4 Continuous Random Variables and Probability Distributions

Chapter 4 Continuous Random Variables and Probability Distributions Applied Statistics and Probability for Engineers Sixth Edition Douglas C. Montgomery George C. Runger Chapter 4 Continuous Random Variables and Probability Distributions 4 Continuous CHAPTER OUTLINE 4-1

More information

Probability Midterm Exam 2:15-3:30 pm Thursday, 21 October 1999

Probability Midterm Exam 2:15-3:30 pm Thursday, 21 October 1999 Name: 2:15-3:30 pm Thursday, 21 October 1999 You may use a calculator and your own notes but may not consult your books or neighbors. Please show your work for partial credit, and circle your answers.

More information

CDA6530: Performance Models of Computers and Networks. Chapter 2: Review of Practical Random Variables

CDA6530: Performance Models of Computers and Networks. Chapter 2: Review of Practical Random Variables CDA6530: Performance Models of Computers and Networks Chapter 2: Review of Practical Random Variables Two Classes of R.V. Discrete R.V. Bernoulli Binomial Geometric Poisson Continuous R.V. Uniform Exponential,

More information

PAS04 - Important discrete and continuous distributions

PAS04 - Important discrete and continuous distributions PAS04 - Important discrete and continuous distributions Jan Březina Technical University of Liberec 30. října 2014 Bernoulli trials Experiment with two possible outcomes: yes/no questions throwing coin

More information

STAT 509 Section 3.4: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s.

STAT 509 Section 3.4: Continuous Distributions. Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s. STAT 509 Section 3.4: Continuous Distributions Probability distributions are used a bit differently for continuous r.v. s than for discrete r.v. s. A continuous random variable is one for which the outcome

More information

Chapter 8. Calculation of PFD using Markov

Chapter 8. Calculation of PFD using Markov Chapter 8. Calculation of PFD using Markov Mary Ann Lundteigen Marvin Rausand RAMS Group Department of Mechanical and Industrial Engineering NTNU (Version 0.1) Lundteigen& Rausand Chapter 8.Calculation

More information

Fault-Tolerant Computer System Design ECE 60872/CS 590. Topic 2: Discrete Distributions

Fault-Tolerant Computer System Design ECE 60872/CS 590. Topic 2: Discrete Distributions Fault-Tolerant Computer System Design ECE 60872/CS 590 Topic 2: Discrete Distributions Saurabh Bagchi ECE/CS Purdue University Outline Basic probability Conditional probability Independence of events Series-parallel

More information

Step-Stress Models and Associated Inference

Step-Stress Models and Associated Inference Department of Mathematics & Statistics Indian Institute of Technology Kanpur August 19, 2014 Outline Accelerated Life Test 1 Accelerated Life Test 2 3 4 5 6 7 Outline Accelerated Life Test 1 Accelerated

More information

System Simulation Part II: Mathematical and Statistical Models Chapter 5: Statistical Models

System Simulation Part II: Mathematical and Statistical Models Chapter 5: Statistical Models System Simulation Part II: Mathematical and Statistical Models Chapter 5: Statistical Models Fatih Cavdur fatihcavdur@uludag.edu.tr March 29, 2014 Introduction Introduction The world of the model-builder

More information

Fundamentals of Reliability Engineering and Applications

Fundamentals of Reliability Engineering and Applications Fundamentals of Reliability Engineering and Applications E. A. Elsayed elsayed@rci.rutgers.edu Rutgers University Quality Control & Reliability Engineering (QCRE) IIE February 21, 2012 1 Outline Part 1.

More information

ANALYSIS FOR A PARALLEL REPAIRABLE SYSTEM WITH DIFFERENT FAILURE MODES

ANALYSIS FOR A PARALLEL REPAIRABLE SYSTEM WITH DIFFERENT FAILURE MODES Journal of Reliability and Statistical Studies; ISSN (Print): 0974-8024, (Online):2229-5666, Vol. 5, Issue 1 (2012): 95-106 ANALYSIS FOR A PARALLEL REPAIRABLE SYSTEM WITH DIFFERENT FAILURE MODES M. A.

More information

Risk Analysis of Highly-integrated Systems

Risk Analysis of Highly-integrated Systems Risk Analysis of Highly-integrated Systems RA II: Methods (FTA, ETA) Fault Tree Analysis (FTA) Problem description It is not possible to analyse complicated, highly-reliable or novel systems as black box

More information

Reliability of Safety-Critical Systems Chapter 9. Average frequency of dangerous failures

Reliability of Safety-Critical Systems Chapter 9. Average frequency of dangerous failures Reliability of Safety-Critical Systems Chapter 9. Average frequency of dangerous failures Mary Ann Lundteigen and Marvin Rausand mary.a.lundteigen@ntnu.no &marvin.rausand@ntnu.no RAMS Group Department

More information

Reliability Education Opportunity: Reliability Analysis of Field Data

Reliability Education Opportunity: Reliability Analysis of Field Data Reliability Education Opportunity: Reliability Analysis of Field Data Vasiliy Krivtsov, PhD Sr. Staff Technical Specialist Reliability & Risk Analysis Ford Motor Company 25th Anniversary of Reliability

More information

Aviation Infrastructure Economics

Aviation Infrastructure Economics Aviation Short Course Aviation Infrastructure Economics October 14-15, 15, 2004 The Aerospace Center Building 901 D St. SW, Suite 850 Washington, DC 20024 Lecture BWI/Andrews Conference Rooms Instructor:

More information

Application of the Stress-Strength Interference Model to the Design of a Thermal- Hydraulic Passive System for Advanced Reactors

Application of the Stress-Strength Interference Model to the Design of a Thermal- Hydraulic Passive System for Advanced Reactors Application of the Stress-Strength Interference Model to the Design of a Thermal- Hydraulic Passive System for Advanced Reactors Luciano Burgazzi Italian National Agency for New Technologies, Energy and

More information

1. Reliability and survival - basic concepts

1. Reliability and survival - basic concepts . Reliability and survival - basic concepts. Books Wolstenholme, L.C. "Reliability modelling. A statistical approach." Chapman & Hall, 999. Ebeling, C. "An introduction to reliability & maintainability

More information

Introduction to repairable systems STK4400 Spring 2011

Introduction to repairable systems STK4400 Spring 2011 Introduction to repairable systems STK4400 Spring 2011 Bo Lindqvist http://www.math.ntnu.no/ bo/ bo@math.ntnu.no Bo Lindqvist Introduction to repairable systems Definition of repairable system Ascher and

More information

PROBABILITY DISTRIBUTIONS

PROBABILITY DISTRIBUTIONS Review of PROBABILITY DISTRIBUTIONS Hideaki Shimazaki, Ph.D. http://goo.gl/visng Poisson process 1 Probability distribution Probability that a (continuous) random variable X is in (x,x+dx). ( ) P x < X

More information