Assessment of the Reactor Trip System Dependability

Similar documents
Comparative Reliability Analysis of Reactor Trip System Architectures: Industrial Case

UNIVERSITY OF MASSACHUSETTS Dept. of Electrical & Computer Engineering. Fault Tolerant Computing ECE 655

Multi-State Availability Modeling in Practice

Time Dependent Analysis with Common Cause Failure Events in RiskSpectrum

Fault Tolerant Computing CS 530 Software Reliability Growth. Yashwant K. Malaiya Colorado State University

System Reliability Analysis. CS6323 Networks and Systems

Quantitative evaluation of Dependability

Safety and Reliability of Embedded Systems

Modeling Common Cause Failures in Diverse Components with Fault Tree Applications

Quantitative evaluation of Dependability

Safety and Reliability of Embedded Systems. (Sicherheit und Zuverlässigkeit eingebetteter Systeme) Fault Tree Analysis Obscurities and Open Issues

Reliability Analysis of k-out-of-n Systems with Phased- Mission Requirements

Fault Tolerance. Dealing with Faults

Reliability Analysis of Electronic Systems using Markov Models

Considering Security Aspects in Safety Environment. Dipl.-Ing. Evzudin Ugljesa

The d-or Gate Problem in Dynamic Fault Trees and its Solution in Markov Analysis

Reliability of Safety-Critical Systems 5.4 Petrinets

Evaluation criteria for reliability in computer systems

ANALYSIS FOR A PARALLEL REPAIRABLE SYSTEM WITH DIFFERENT FAILURE MODES

Reliability Analysis of an Anti-lock Braking System using Stochastic Petri Nets

Mean fault time for estimation of average probability of failure on demand.

A New Reliability Allocation Method Based on FTA and AHP for Nuclear Power Plant!

Reduction of Detected Acceptable Faults for Yield Improvement via Error-Tolerance

SYMMETRIC PROJECTION METHODS FOR DIFFERENTIAL EQUATIONS ON MANIFOLDS

Markov Models for Reliability Modeling

Dependable Computer Systems

6.1 Dependability Modeling. General Rules. Analysis

Availability and Reliability Analysis for Dependent System with Load-Sharing and Degradation Facility

Technology Mapping for Reliability Enhancement in Logic Synthesis

Development of Multi-Unit Dependency Evaluation Model Using Markov Process and Monte Carlo Method

Population models from PEPA descriptions

Module No. # 03 Lecture No. # 11 Probabilistic risk analysis

Reliability and Economic Analysis of a Power Generating System Comprising One Gas and One Steam Turbine with Random Inspection

AP1000 European 19. Probabilistic Risk Assessment Design Control Document

The Test Vector Problem and Limitations to Evolving Digital Circuits

Treatment of Expert Opinion Diversity in Bayesian Belief Network Model for Nuclear Digital I&C Safety Software Reliability Assessment

Terminology and Concepts

9. Reliability theory

No.5 Node Grouping in System-Level Fault Diagnosis 475 identified under above strategy is precise in the sense that all nodes in F are truly faulty an

Analysis for Parallel Repairable System with Degradation Facility

STOCHASTIC MODELS FOR RELIABILITY, AVAILABILITY, AND MAINTAINABILITY

Fault Tolerance Technique in Huffman Coding applies to Baseline JPEG

Estimating the Region of Attraction of Ordinary Differential Equations by Quantified Constraint Solving

EXISTENCE VERIFICATION FOR SINGULAR ZEROS OF REAL NONLINEAR SYSTEMS

A First Course on Kinetics and Reaction Engineering Supplemental Unit S5. Solving Initial Value Differential Equations

Steady-State and Transient Probabilities Calculation for Engineering Models

Part 3: Fault-tolerance and Modeling

A BENEFIT ASSESSMENT OF USING IN-CORE NEUTRON DETECTOR SIGNALS IN CORE PROTECTION CALCULATOR SYSTEM (CPCS)

A DESIGN DIVERSITY METRIC AND RELIABILITY ANALYSIS FOR REDUNDANT SYSTEMS. Subhasish Mitra, Nirmal R. Saxena and Edward J.

Reliability of Technical Systems

Machine Learning 2010

Degradable Agreement in the Presence of. Byzantine Faults. Nitin H. Vaidya. Technical Report #

Reliable Computing I

Dependability Analysis

A Practical Implementation of Maximum Likelihood Voting *

Common-cause failures as major issue in safety of control systems

Radiation Effects on Electronics. Dr. Brock J. LaMeres Associate Professor Electrical & Computer Engineering Montana State University

Availability. M(t) = 1 - e -mt

DVClub Europe Formal fault analysis for ISO fault metrics on real world designs. Jörg Große Product Manager Functional Safety November 2016

ASSESSMENT OF DYNAMIC PRA TECHNIQUES WITH INDUSTRY- AVERAGE COMPONENT PERFORMANCE DATA

Interconnect Yield Model for Manufacturability Prediction in Synthesis of Standard Cell Based Designs *

Improving the Efficiency of Dynamic Fault Tree Analysis by Considering Gate FDEP as Static

Chapter 8. Calculation of PFD using FTA

Dependable Systems. ! Dependability Attributes. Dr. Peter Tröger. Sources:

Comparative Analysis of Two-Unit Hot Standby Hardware-Software Systems with Impact of Imperfect Fault Coverages

Safety analysis and standards Analyse de sécurité et normes Sicherheitsanalyse und Normen

Chapter 5. System Reliability and Reliability Prediction.

CHAPTER 3 : MATHEMATICAL MODELLING PRINCIPLES

A BAYESIAN SOLUTION TO INCOMPLETENESS

Dynamic Fault Tree Analysis Based On The Structure Function

B.H. Far

Reliability evaluation of a repairable system under fuzziness

A Generalized Fault Coverage Model for Linear Time- Invariant Systems

A new FMECA model for reliability computations in electrical distribution systems

A SYMBOLIC-NUMERIC APPROACH TO THE SOLUTION OF THE BUTCHER EQUATIONS

Common Cause Failure (CCF)

FPGA Implementation of a Predictive Controller

Field data reliability analysis of highly reliable item

of an algorithm for automated cause-consequence diagram construction.

Transient behaviour in highly dependable Markovian systems: new regimes, many paths.

Quantification of the safety level of a safety-critical control system K. Rástočný 1, J. Ilavský 1

An Efficient Calculation Algorithm in Continuous-Time Markov Analysis Using Large-Scale Numerical Calculation

Assessing system reliability through binary decision diagrams using bayesian techniques.

Fault-Tolerant Computing

Reliability of Safety-Critical Systems Chapter 8. Probability of Failure on Demand using fault trees

Fault-Tolerant Computing

Human Spaceflight Value Study Was the Shuttle a Good Deal?

Failure prognostics in a particle filtering framework Application to a PEMFC stack

Failure Correlation in Software Reliability Models

Graph and Controller Design for Disturbance Attenuation in Consensus Networks

Lecture 5 Fault Modeling

Modeling Input Space for Testing Scientific Computational Software: A Case Study

Evaluation and Validation

Thermal Calculation for Linear Regulator

Automation in Complex Systems MIE090

IEOR 4106: Introduction to Operations Research: Stochastic Models Spring 2011, Professor Whitt Class Lecture Notes: Tuesday, March 1.

Data Dependence in Combining Classifiers

The Basic Approach in Designing of the Functional. Safety Index for Transport Infrastructure

ALGEBRAIC FLUX CORRECTION FOR FINITE ELEMENT DISCRETIZATIONS OF COUPLED SYSTEMS

EECS150 - Digital Design Lecture 26 - Faults and Error Correction. Types of Faults in Digital Designs

Transcription:

Assessment of the Reactor Trip System Dependability Two Markov Chains - based Cases Vyacheslav Kharchenko Department of Computer Systems and Networks National aerospace university KhAI 1 Kharkiv, Ukraine v.kharchenko@csn.khai.edu Oleg Odarushchenko Research and Production Corporation Radiy Kirovograd, Ukraine odarushchenko@gmail.com Elena Odarushchneko Poltava National Technical University Department of Applied Mathematics, Informatics and Mathematical Modeling skifs2007@mail.ru Dmitriy Butenko AltexSoft Kharkiv, Ukraine twilightwnd@gmail.com Valentina Butenko 1 v. odarushchenko@csn.khai.edu Abstract The diversity approach is commonly used to ensure dependability attributes of such critical computer-based systems as NPP I&C systems, including the Reactor Trip System. There are two basic problems emerging in this approach the choice of product-process diversity kinds and assessment of multi-version systems dependability. This paper presents a study for dependability assessment of two architectures of a typical NPP I&C system, the Reactor Trip System: a diverse twochannel system with three parallel tracks on voting logic 2-outof-3 in each channel and a diverse three-channel system with two parallel tracks on voting logic 1-out-of-2 in each channel. The multi-fragmentation approach is proposed to provide a detailed description of RTS hardware-software interconnection. The resulting models are solved using a number of approaches and tools to verify the results. Obtained results can help to make informed decision between the observed RTS architectures. Keywords Reactor Trip Systems, Markov chains, multifargmentation, stiffness I. INTRODUCTION The diversity approach is commonly used to ensure reliability, availability, safety and other dependability attributes of critical computer-based systems, which implies the creation of several diverse design options for redundant channels or parts of the channels (hardware-software products) [1]. There are two basic problems emerging in this approach the choice of product-process diversity kinds and assessment of multi-version systems dependability [2], [3]. Provision of an accurate dependability assessment of multi-version complex safety-critical systems, such as NPPs I&C system, is of high importance for the development process. Assessment errors may lead to either wrong or suboptimal decisions. Dependability parameters of such systems are assessed using probabilistic models. State-space methods, such as Markov chains (MC) [4] are preferred to reliability block diagrams (RBD) and fault trees as they can better handle such complex situations as failure/repair dependencies and shared repair resources [5]. One of the main computational difficulties of MC applications is the size of the model s state space. System modelers are often interested in transient measures, which provide more useful information than steady-state measures. Modeling components interaction and interdependencies expands the model significantly, thus making the precise computation of system transient measures almost infeasible. Whilst numerical methods and imitation modeling can be applied to handling this problem, they are also limited by model size and such difficulties as stiffness [6] and sparsity [7]. Stiffness is a well-known undesirable property of many practical MCs [8] as it poses a problem of finding transient solutions. Using solution methods that do not preserve sparsity, is unacceptable for most large problems [7]. Several techniques were developed to deal efficiently with MC largeness and stiffness [6, 8, 9] and a number of tools can be used to find the MC solution [10, 12]. Such variety of tools and approaches is extremely helpful in the process of system modeling but this also poses a difficulty when it comes to choosing the most appropriate method for a specific assessment. As every tool is limited in its properties and applicability, a careful selection is needed for the tools used to solve large and stiff MCs accurately and efficiently. It should be noted that stiffness usually requires the modeler to focus on a number of math details to avoid the use of inefficient approaches, methods [12], and tools. This view goes against the recommendations of one of the leading standards in the safety area, IEC 61508-6 [13]. This standard asserts that methods for solving Markov models have been developed long ago and trying to improve these methods does not seem

sensible 1. The previous works show [10] that solving a large and/or stiff Markov model requires a careful selection of the solution method/tool. Using an inappropriate method/tool for the solution of a non-trivial MC may lead to significant errors. In this paper, we present the study for dependability assessment of a typical NPP instrumentation and control system (I&Cs), the Reactor Trip System (RTS) based on a digital platform produced by RPC Radiy. This is an FPGAbased system. We observe two possible RTS architectures: a two-channel system with three parallel tracks on voting logic 2-out-of-3 in each channel or a three-channel system with two parallel tracks on voting logic 1-out-of-2 in each channel. Each system uses diverse software versions. We use the multi-fragmentation (MFM) approach to model the system hardware-software interconnection and behavior, and a number of approaches and tools to find the transient solution and verify the results. The availability functions of both architectures are analyzed and compared. The results of such comparison can help to make an informed decision about the preferred architecture for a particular system. The paper consists of the following sections: Section 2 provides the general description of the multi-fragmentation approach, which was used to model both RTS architectures; Section 3 presents the description of the digital FPGA-based platform and an overview of RTS architectures; Section 4 presents the Markov models of those architectures taking physical faults into account; Section 5 presents the MFM for both architectures; Section 6 presents the solution results using several approaches and tools. In Section 7 we present the conclusions. II. MULTI-FRAGMENTATION APPROACH The main idea of the MFM approach is to present the complex hardware-software behavior of the researched system, taking into account the known residual amount of software faults. Using this approach the model is divided into N fragments that are with the same structure but may differ in one or more parameter values [21]. The model fragments describe the functioning of system hardware part and transitions between fragments are show the process of software failures detection. The approach is based on the following assumptions. The number of software defects is a known finite value. The failure rate of design faults λ d(i) is proportional to their residual amount n i in i different software versions [14]. This is a well-known assumption of concave software reliability grows models (SRGM) [17-18]. It shows an incremental change of the software failure rate after detected design fault elimination (λ d(i) vary by a constant Δλ d(i) ). All detected defects of software components are repaired immediately and no new defects are introduced. The mean time between failures and mean time to repair are exponentially distributed [15]. 1 IEC 61508 Part 6 states on p. 58: "Efficient algorithms have been developed and implemented in software packages a long time ago in order to solve above equations. Then, when using this approach, the analyst can focus only on the building of the models and not on the underlying mathematics" Software testing datasets are updated after each test. The testing is performed on the complete body of input data. The number of fragments N in the MFM depends on the number of expected undetected software faults n i in i different software versions (1). m N fr ( n i 1) i1 The multi-fragmentation approach is described below. 1) Calculating the software failure λ d(i) and recovery (i) rates and evaluating the quantity of expected undetected software faults n i in i different software versions. 2) Calculating the fragments number N fr using (1). 3) Studying the system behavior, defining physical failure and recovery events and constructing system states and transitions based on those events. 4) Defining design failure and recovery events and constructing the system states and transitions based on those events. 5) Building the initial fragment (F 0 ) using the states and transitions developed in step 3. The F 0 is a MC that illustrates the system operation process taking physical faults into account. It should be noted that every fragment also implicitly describes the amount of undetected software faults n i in i different software versions, thus F 1 can be defined by the following set: F0 {( n1, n2,, nm ) n1 i, n2 j,, n3 k, i, j, k N} where (n 1, n 2,, n m ), n 1 =i, n 2 =j,, n m = k is the amount (i, j,, k) of undetected software faults in m different software versions. 6) Generating all the fragments using (3). Each fragment is defined similarly to the F 1 (2): F F, F,, F n(0, N ), n } { 0 1 n fr N 7) Transitions between the fragments occur based on the design failure and recovery events (step 4). In case of a fault in n i software version the system proceeds from initial fragment F 0 to one of the states in the set M i - the set of software failure states, and recovers to the fragment F i. According to the taken assumptions, after each design failure elimination the λ d(i) ) is changed by a constant Δλ d(i) [14]. Thus, the fragment F i contains corresponds to a reduced value of λ d(i). The Fig. 1 shows the general multi-fragmental Markov (MFM) model built using this approach.

failure of any module implies the general failed state of the track. From sensors From discrete sensors AIM DIM LM To diagnostic indicators AOM DOM Fig. 2. The structure diagram of a typical track Analogue Signals Discrete Signals The RDB for the two-channel three-track architecture is presented on Fig. 3. Fig. 4 presents the RBD for the threechannel two-track architecture. In both architectures, all tracks in the channels have identical hardware structures, but the software run on the system channels is diverse [10], i.e. nonidentical but functionally equivalent software copies are deployed on the system channels. Each channel independently receives information from sensors and other NPP systems. The channels, each being capable of forming a reactor trip signal, are independent. Reliability index P pfi.j determines hardware reliability of the track T i.j (defined by physical faults), where i indicates main (T 1.j ) or diverse (T 2.j ) channels, and j indicates the track number. Reliability index P dfi determines software reliability of the main or diverse channels (defined by software faults), where i indicates the channel. Reliability index P mi, determines reliability of the majority element m i, where i (1,4 ). Fig. 1. General multi-fragmental Markov model III. RESEARCH ARCHITEXTURE DESCRIPTION Technology FPGA (Field Programmable Gates Arrays) is one of the most intensively developed and applied in I&Cs of nuclear power plants (NPPs) and other safety & mission critical domains [3]. The FPGA-based track is a basic component of both RTS architectures in the scope of this piece. Each track can contain up to 7 module types: analog and digital input modules (AIM, DIM); analog and digital output modules (AOM, DOM); logic module (LM); optical communication module (OCM); and analog input for neutron flux measurement module (AIFM). All modules are based on FPGA chips. The modules can be placed in 16 different positions on the track (two reserved positions for LM), using LVDS and fiber optical lines for internal/external communications. Such flexible redundancy management helps to ensure the high availability of the system. In this paper, we consider the tracks consisting of five modules: LM, DIM, DOM, AIM and AOM. The Fig. 2 presents the structure diagram of a typical track. It is assumed that the corresponding components of all the tracks in the channels are identical, i.e. DIM on the 1 st track is identical to the same module on other tracks in the channels, etc. The failure of the LM leads to the failure of the whole track, and failures of the DIM, DOM, AIM, AOM result in track malfunction. Therefore, it was assumed that P pf 1.1 P pf 1.2 P pf 1.3 P pf 2.1 P pf 2.2 P pf 2.3 P pf 1.1 P m1 2/3 P m2 2/3 P pf 1.2 P df 2 P pf 2.1 P pf 2.2 P pf 3.1 P pf 3.2 P df 2 P df 2 P df 2 P m1 P m2 P m3 P m3 Fig. 3. Reliability-block diagram of two-channel tree-track RTS P m4 2/3 Fig. 4. Reliability-block diagram of three-channel two-track RTS

IV. RTS S MARKOV MODELS. PHYSICAL FAULTS This section presents the Markov models (MM) for the considered architectures taking physical faults into account. Fig. 5 presents the MM for the two-channel three-track RTS, and Fig. 6 for the three-channel two-track RTS. The following assumptions were used during the MMs development: each element of the research system at an arbitrary moment of time can only be in one of the two states working and failure ; the systems majority and control elements provide non-stop correct functioning. Both models use the following parameters: λ p(i,, (i,, where i 3 and j 3 - the failure and repair rates for the failures caused by physical faults in the track T i,j. As each track consists of five module types, the λ p(i, and (i, of the track T i,j can be calculated using (1) and (2) respectively: ( i, p AIM ( i, 6λ p S 1 S 2 (3; 3) (3; 2) (2; 3) DIM ( i, DOM ( i, LM ( i, 2λ p 3λ p AOM ( i, ( i, ( i, /( p DOM ( i, AIM ( i, p / / DOM ( i, AIM ( i, S 3 S 4 DIM ( i, (3; 1) (1; 3) (2; 2) / LM ( i, AOM ( i, 3λ p 4λ p DIM ( i, / / 2λ p S 5 S 6 (2; 1) (1; 2) LM ( i, AOM ( i, ), (1; 1) where {λ DIM(i,, λ DOM(i,, λ LM(i,, λ AIM(i,, λ AOM(i, } and {μ DIM(i,, μ DOM(i,, μ LM(i,, μ AIM(i,, μ AOM(i, } are failure and repair rates caused by physical faults of DIM, DOM, LM, AIM, AOM, respectively. As all corresponding components of the tracks are identical, their failure and repair rates for the failures caused by physical faults are also equal. Thus, values of λ p(i,, (i, are equal for all T i,j tracks. A. Two-channel Three-track RTS Observing the two-channel three-track RTS, we define the following set S p ={(3; 3), (3; 2), (2; 3), (2; 2), (3; 1), (1; 3), (2; 1), (1; 2), (1; 1)}. Each member of this set presents the quantity of remaining working tracks in the main and diverse channels. The system operation process is as follows. At time t 0 all tracks in both channels operate correctly in S 1. At a random moment t n, a failure is caused by a physical fault in the track T i,j (for instance, in the main channel) and is subsequently detected by a majority component. The system moves to the S 2 state with a rate of 6λ p and repairs back to the state S 1 with a rate of. The system functioning after state S 2 can unfold in two possible ways. Fig. 5. Markov model of two-channel tree-track Reactor Trip System If during the T i,j track repair one of the remaining two tracks fails in the same channel (main channel), the system moves to the state S 3 with a rate of 2λ p and recovers back to the state S 2 with a rate of. System moves from state S 3 to state S 5 if during the repair of two tracks in one (main) channel, a failure occurs in one of the tracks of another (diverse) channel. System moves from S 2 to the S 4 if during the T i,j track repair one of the tracks fails in another channel (diverse); the system moves into the state with a rate of 3λ p and recovers back to the state with a rate of. If at the time of the last failed track repair, any new track in any channel fails, the system moves from state S 4 to state S 5 with a rate of 4λ p and repairs back to S 4 with a rate of. It should be noted that the priority recovering strategy is repairing back to two working channels. System moves from state S 5 to S 6 with a rate of 2λ p and recovers back to S 5 with a rate of the last track fails in the remaining functioning channel. The MM consist of following states: SF w = { S 1, S 5 } system working states; SF f={s 6 } system failure state. B. Single-channel Three-track RTS For the three-channel two-track RTS, we define the following set S p = {(2; 2; 2), (1; 2; 2), (2; 1; 2), (2; 2; 1),, (0; 0; 1), (0; 1; 0), (0; 0; 1)}, members of present the quantity of working tracks in each channel. The system operation process is as follows. At t 0 all tracks in the three channels operate correctly and deliver expected level of service (state S 1 ). At a random moment t n a failure occurs due to a physical fault, and the system moves to the state S 2 with a failure rate of 6λ p and repairs back to the state S 1 with a rate of. After the state S 2 the model presents two different cases of system operation process. If during the T i,j track repair the remaining track in the same channel fails, the system moves to the state S 4 with a rate of λ p and recovers back to the state S 2 with a rate of. The system moves from S 4 to S 6 with a rate of 4λ p if the physical failure occurs in one of the remaining two channels. If the same channel manifests yet another failure, the system moves from state S 6 to S 8 with a rate of λ p, where S 8 is a non-working state of the system. If during the T i,j track repair the failure occurs in one of the remaining channels excluding already failed tracks, the system moves from S 2 to S 3 with a rate of 4λ p and repairs back with a rate of. If during the last repair (state S 3 ) one more channel manifests a failure, the system proceeds from S 3 to S 5, representing the case when only one track still works in every channel,. The system transitions from S 5 to S 7 with a rate of 3λ p if the last working track fails in one of the channels, and transitions to state S 9 if the failure occurs in yet another channel. The state S 9 is a nonworking state. We use the same recovering strategy as in the previous model (Fig. 4), which is represented by transitions between states S 6 and S 3 (Fig. 5).

(0; 1; 1), (1; 0; 1), (1; 1; 0) S 1 S 7 S 8 S 9 (2; 2; 2) 6 (1; 2; 2), (2; 1; 2), (2; 2; 1) S 2 (1; 1; 2), (2; 1; 1), (0; 2; 2), (2; 0; 2), (1; 2; 1) (2; 2; 0) S 3 S 4 (0; 1; 2), (0; 2; 1), (1; 1; 1) S 5 S 6 (1; 0; 2), (2; 0; 1), (1; 2; 0) (2; 1; 0) (0; 0; 1), (0; 1; 0), (0; 0; 1) (0; 0; 2), (0; 2; 0), (0; 0; 2) Fig. 6. Markov model of three-channel two-track Reactor Trip System V. RTS S MARKOV MODELS. DESIGN FAULTS The Fig. 7 presents the uniform MFM of both RTS architectures taking into account design faults in diverse software versions. F 0 ={(2; 2) n 1 =2; n 2 =2} M 1 λ d λ d Δλ d M 2 M 3 F 2 ={(1; 1) n 1 =1; n 2 =1} M 4 2λ d F 1 ={(2; 1), (1; 2) n 1 =2; n 2 =1 or n 1 =1; n 2 =2} M 5 F 4 ={(1; 0), (0; 1) n 1 =1; n 2 =0 or n 1 =0; n 2 =1} λ d Δλ d 2(λ d Δλ d ) λ d F 3 ={(2; 0), (0; 2) n 1 =2; n 2 =0 or n 1 =0; n 2 =2} The main difference between those architectures is the type of F 0 fragment. The MM presented in Fig. 5 is used as F 0 for the two-channel three-track RTS, and MM in Fig. 6 for the three-channel two-track RTS. Thus, the detailed MFM model for the first RTS architecture (two-channel three-track) is presented on the Fig. 8 and detailed MFM for three-track twochannel architecture on Fig.9. The system failed states are marked with grey color. The following assumptions were used to build the MFM. Not more than two undetected software design faults are expected in the resulting software. The failure and repair rates for the failures caused by software design faults are equal (6-7). The steps of the failure rate decrease (after the channel recovery) are equal for both channels Δλ d1 = Δλ d2 [15], [16]: S23 d1 d 2 d d1 d 2, 2 d ( i) d1 d 2; d d /( ) i1 d ( i) S1 2 6 S2 S3 S4 S5 S7 S8 S9 S10 S11 S12 6 S13 S18 S19 S20 S21 S22 6 S24 S25 S27 S28 S14 S15 S26 2( Δ) 2( Δ) 2( Δ) 2( Δ) 2( Δ) S16 S6 S17 S29 S30 S31 S32 S33 S34 6 S35 S36 S37 S38 S40 S41 S42 S43 S44 S45 S46 S47 S48 S49 S50 6 2 S51 2 2 2 S52 S53 S54 Δ Δ Δ Δ Δ S55 S39 M 6 F 5 ={(0; 0) n 1 =0; n 2 =0} Fig. 7 Uniform multi-fragmental model for Reactor Trip System architectures S56 S57 S58 S59 S60 S61 6 S62 S63 S64 S65 Fig. 8 The multi-fragmental model for two-channel three-track RTS architecture S66

S1 2 6 S2 S4 S3 2 2 2 2 2 2 S10 S11 S12 S13 S14 S15 S16 S17 6 S18 S26 S27 S28 S29 S30 S31 S32 S36 S38 S40 S6 S5 S20 S19 S22 S21 S49 6 S50 S8 S7 S24 S23 S9 S25 S42 S43 S44 S45 S46 S47 S48 S52 S51 S54 S53 S58 S59 S60 S61 S62 S63 S64 S65 S66 S67 S68 S69 S70 S71 6 S41 S33 S34 S35 S37 S39 2( Δ) 2( Δ) 2( Δ) 2( Δ) 2( Δ) 2( Δ) 2( Δ) S72 S88 6 Δ S73 S75 S74 S77 S76 S79 S78 S81 S82 S83 S84 S85 S86 S87 6 S89 Δ Δ S91 S90 S93 S92 Δ Δ Δ S95 S94 S80 Δ S96 S56 S55 Fig. 9 The multi-fragmental model for three-channel two-track RTS architecture VI. MODELS SOLUTION One of the main computational difficulties affecting the use of numerical methods is a model s stiffness. There is no commonly adopted definition of stiffness but a few of the most widely used ones are summarized below. du The Cauchy problem F( x, u) is said to be stiff on the dx interval [x 0,X], if there exists an x from this interval for which the following condition holds: max Re( i ) i1, n s( x) 1, (8) min Re( i ) i1, n where s(x) denotes the stiffness index and λ i are the eigenvalues of the Jacobian matrix ( Re i 0, i 1,2,, n )[9]. S57 The stiffness index of an MC was also defined in [6], [7] as the product of the largest total exit rate from a state and the length of solution interval (=λ i t), where λ i are the eigenvalues of the Jacobian matrix. A system of differential equations (DE) is said to be stiff on the interval [0,t) if there exists a solution component of the system that has variation on that interval that is large enough compared to 1/t. Thus, the length of the solution interval also becomes a measure of stiffness [7]. Many approaches have been developed to deal efficiently with MC stiffness [6, 8, 9]. They can be split into two groups - stiffness-tolerance (STA) and stiffness-avoidance approaches (SAA) [8]. Within STA, specialized numerical methods are used to provide highly accurate results in spite of stiffness. The SAA solution, on the other hand, is based on an approximation algorithm, which initially converts a stiff MC to a non-stiff chain that typically has a significantly smaller state space [8]. An advantage of this approach is that it can deal effectively with large stiff MCs. Achieving high accuracy, however, may be problematic with SAA, In this section, we provide the results of MFMs (Fig.8 and Fig.9) solution using two approaches stiffness-avoidance and stiffness-tolerance. The MFMs (Fig. 8 and Fig. 9) were solved for the values: λ p = 10-4, = 1, λ d = 5 10-5, = 0.01, Δλ d = 2.5 10-5 and an accuracy requirement of 10-6. Both models are of moderate-stiffness [10], with s(x)= 1.667 10-3 (8), with max Re(λ i ) = 1.00063 and min Re(λ i ) = 0.0006. The assessment of availability function A(t) using STA for both MFMs was performed in mathematical packages Mathematica and Matlab using a built-in function implementing the implicit Runge-Kutta method. The solution was computed on the time interval of [0; 10 000] hours with a time step of h=200 hours. The MFMs were also solved using the SAA approach, essentially an aggregation/disaggregation technique developed by K. S. Trivedi, A. Bobbio and A. Reibmann [8]. The comparison of the results obtained by STA and SAA approaches for both RTS architectures are presented on Fig. 10. Table 1 presents the average difference ω between STA and SAA A(t) results for both RTS architectures under study. TABLE I. Approaches Matlab Mathematica AVERAGE DIFFERENCE BETWEEN STA AND SAA RESULTS FOR BOTH RTS ARCHITECTURES RTS architectures / Average ω Two-channel threetrack Three-channel twotrack 0,0001035 0,0003035 Matlab SAA 0,0004001 0,0005417 Mathematica SAA 0,0004002 0,0007179

Fig. 10. Comparison of the A(t) results obtained using stiffness-avoidance and stiffness-tolerance approaches for three-channel RTS architecture VII. CONCLUSIONS We have discussed two models of the RTS architectures: the diverse two-channel three-track system and diverse threechannel two-track system. The models were built using multifragmental approach, which helps to illustrate the variation of system parameters. Based on the assessment results we can conclude that three-channel two-track diverse RTS provides more reliable functioning during the studied time period. The average difference between STA results showed the importance of its verification by the means of SAA technique. It should also be noted that in the process of selection from several math packages the usability becomes an important concern. REFERENCE [1] L. Pullum, Software Fault Tolerance Techniques and Implementation, Artech House Computing Library, 2001. [2] R. Wood, R. Belles, M Cetiner, et al, Diversity Strategies for NPP I&C Systems, NUREG/CR-7007 ORNL/TM-2009/302, 2009. [3] V. Kharchenko, A. Siora, V. Sklyar et al, Multi-Diversity Versus Common Cause Failures FPGA-Based Multi-Version NPP I&C Systems, Proc. of the 76 th Conf. NPIC&HMIT, Las-Vegas, Nevada, USA, 2010. [4] K. S. Trivedi, G. Ciardo, B. Dasarathy, M. Grottke, R. Matias, A. Rindos, B. Varshaw, Achieving and Assuring High Availability. IEEE International symposium, IPDPS 2008, pp. 1-7. [5] S. Archana, R. Srinivasan, K. S. Trivedi, Availability Models in Practice, Proc. Int. Workshop on Fault-Tolerant Control and Computing (FTCC-1), May 22-23, Seoul, Korea, 2000. [6] M. Malhotra, J. K. Muppala, K. S. Trivedi, Stiffness-Tolerant Methods for Transient Analysis of Stiff Markov Chains, Microelectronic Reliability, vol.34(11), 1994, pp.1825-1841. [7] A. Reibman, K. S. Trivedi, Numerical Transient Analysis of Markov models, Comput. Opns. Res. vol.15(1), 1988, pp. 19-36. [8] A. Bobbio, K. S. Trivedi, A Aggregation Technique for Transient Analysis of Stiff Markov Chains, IEEE Trans. on Comp., C-35, 1986, pp. 803-814. [9] O. Arushanyan, S. Zaletkin, Numerical Solution of Ordinary Differential Equations using FORTRAN, Moscow State University, Moscow, 1990, p. 336. [10] V. Kharchenko, O. Odarushchenko, V. Odarushchenko, P. Popov, Availability Assessment of Computer Systems Described by Stiff Markov Chains: Case Study, Springer Verlang, Berlin-Heidelberg, CCIS, vol. 412, 2013, pp. 112-135. [11] K. S. Trivedi, R. Sahner, SHARPE at the Age of Twenty Two, ACM SIGMETRICS Performance Evaluation Review, vol.36(4), 2009, pp. 52-57. [12] E. Hairer, G. Wanner, Solving Ordinary Differential Equations II: Stiff and Differential-Algebraic Problems, (Bank, R., Graham, R. I., Stoer, J., Varga, R., Yserentant, H. eds.), Springer-Verlang, 2010, p. 631. [13] IEC 61508 (6 part), Functional Safety of Electrical/Electronic/ Programmable Electronic Safety-Related Systems, 2010. [14] Z. Jelinski, P. L. Moranda, Software Reliability Research. Statistical Computer Performance Evaluation, W. Freiberger, ed., Academic press, New York, 1972, pp. 365 484. [15] J. D. Musa, Software Reliability. Measurement. Prediction. Application, McGraw-Hill Company, 1987, p. 395. [16] V. Kharchenko, O. Odarushchenko, Y. Ponochovny, E. Odarushchenko, O. Kharibin, V. Odarushchenko, High Availability Systems and Technologies (Kharchenko, V. ed.), Lectures, National aerospace university KhAI Press, 2012, p. 249. [17] P. Ganesh, A Survey of Software Reliability Models, arxiv Preprint, 2002, p. 12. [18] M. R. Lyu, Handbook of Software Reliability Engineering, IEEE Computer Society Press, 1996, p. 819.