Rare Event Simulation using Monte Carlo Methods

Similar documents
Practical Statistics for Geographers and Earth Scientists

Statistical Hypothesis Testing with SAS and R

VIBRATIONS AND WAVES. George C. King. School of Physics & Astronomy, The University of Manchester, Manchester, UK

STRESS IN ASME PRESSURE VESSELS, BOILERS, AND NUCLEAR COMPONENTS

PRACTICAL RAMAN SPECTROSCOPY AN INTRODUCTION

Proceedings of the 2015 Winter Simulation Conference L. Yilmaz, W. K. V. Chan, I. Moon, T. M. K. Roeder, C. Macal, and M. D. Rossetti, eds.

FORENSIC ANALYTICAL TECHNIQUES

Organic Spectroscopy Workbook

A First Course in Stochastic Models. Henk C. Tijms Vrije Universiteit, Amsterdam, The Netherlands

Introduction to Finite Element Analysis

Field Geophysics THIRD EDITION. John Milsom University College London

STOCHASTIC DYNAMICS OF STRUCTURES

Workbook for Organic Synthesis: The Disconnection Approach

Principles and Applications of Photochemistry

Nuclear and Particle Physics

Colloids and Interfaces with Surfactants and Polymers An Introduction

A First Course in Stochastic Models. Henk C. Tijms Vrije Universiteit, Amsterdam, The Netherlands

Basic Geological Mapping

THE STATIC network reliability problem, which consists

AN INTRODUCTION TO PROBABILITY AND STATISTICS

William R. Wade Fourth Edition

Introduction to Rare Event Simulation

An Introduction to the Optical Spectroscopy of Inorganic Solids

BASICS OF ANALYTICAL CHEMISTRY AND CHEMICAL EQUILIBRIA

STATISTICAL ANALYSIS WITH MISSING DATA

Robustesse des techniques de Monte Carlo dans l analyse d événements rares

Arrow Pushing in Organic Chemistry

BASIC STRUCTURAL DYNAMICS

Statistical Theory and Modeling for Turbulent Flows

COMBINATION OF CONDITIONAL MONTE CARLO AND APPROXIMATE ZERO-VARIANCE IMPORTANCE SAMPLING FOR NETWORK RELIABILITY ESTIMATION

Nuclear Magnetic Resonance Data

Physics for Scientists & Engineers with Modern Physics Douglas C. Giancoli Fourth Edition

Introductory Chemistry Essentials Nivaldo J. Tro Fourth Edition

A First Course in Finite Elements

Bayesian Models for Categorical Data

Introductory Statistics Neil A. Weiss Ninth Edition

FRACTIONAL CALCULUS IN PHYSICS

VARIATIONS INTRODUCTION TO THE CALCULUS OF. 3rd Edition. Introduction to the Calculus of Variations Downloaded from

Geometrical Properties of Differential Equations Downloaded from by on 05/09/18. For personal use only.

ASYMPTOTIC ROBUSTNESS OF ESTIMATORS IN RARE-EVENT SIMULATION. Bruno Tuffin. IRISA/INRIA, Campus de Beaulieu Rennes Cedex, France

Discriminant Analysis and Statistical Pattern Recognition

COMPARATIVE STATICS ANALYSIS in ECONOMICS

Athena A C A D E M I C. V. Babu

Analysis and Improvements of Path-Based Methods for Monte Carlo Reliability Evaluation of Static Models

Chemistry by Computer. An Overview of the Applications of Computers in Chemistry

Elementary Statistics in Social Research Essentials Jack Levin James Alan Fox Third Edition

The Practice Book for Conceptual Physics. Paul G. Hewitt Eleventh Edition

A Hostile model for network reliability analysis

PROTEIN SEQUENCING AND IDENTIFICATION USING TANDEM MASS SPECTROMETRY

Applied Regression Modeling

Infrared Thermography Errors and Uncertainties

Collection of problems in probability theory

QUANTUM MECHANICS. For Electrical Engineers. Quantum Mechanics Downloaded from

Elementary Linear Algebra with Applications Bernard Kolman David Hill Ninth Edition

A First Course in Probability Sheldon Ross Ninth Edition

TRANSPORT PHENOMENA AND UNIT OPERATIONS

Functional Integrals: Approximate Evaluation and Applications

Two -Dimensional Digital Signal Processing II

Symmetry and Structure

Student Workbook for Physics for Scientists and Engineers: A Strategic Approach with Modern Physics Randall D. Knight Third Edition

Earth Life System. An Introduction to the

Understanding Bioanalytical Chemistry

TEACH YOURSELF THE BASICS OF ASPEN PLUS

Nuclear Chemistry. Principles of. Principles of Nuclear Chemistry Downloaded from

A FIRST COURSE IN INTEGRAL EQUATIONS

BIBLIOTHECA MATHEMATICA

APPLIED STRUCTURAL EQUATION MODELLING FOR RESEARCHERS AND PRACTITIONERS. Using R and Stata for Behavioural Research

ENERGY FUNCTION ANALYSIS FOR POWER SYSTEM STABILI1Y

SpringerBriefs in Mathematics

Astronomy with a Budget Telescope

in this web service Cambridge University Press

Introduction to rare event simulation

Applied Structural Equation Modelling for Researchers and Practitioners Using R and Stata for Behavioural Research

Probability Theory, Random Processes and Mathematical Statistics

Geophysical Interpretation using Integral Equations

Modern Power Systems Analysis

Stochastic Optimization Methods

COUNTING. Solutions Manual. 2nd Edition. Counting Downloaded from by on 02/19/18. For personal use only.

ANALYSIS OF ELECTRIC MACHINERY AND DRIVE SYSTEMS

Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies throughout the world

Springer Series on Atomic, Optical, and Plasma Physics

An Introduction to Advanced Quantum Physics

Numerical Integration of Stochastic Differential Equations

Landolt-Börnstein / New Series

STOCHASTIC PROCESSES FOR PHYSICISTS. Understanding Noisy Systems

SPECIAL FUNCTIONS AN INTRODUCTION TO THE CLASSICAL FUNCTIONS OF MATHEMATICAL PHYSICS

Analysis and Control of Age-Dependent Population Dynamics

The Manchester Physics Series

ORE MICROSCOPY AND ORE PETROGRAPHY

Spatial and Syndromic Surveillance for Public Health

HOW TO FIND CHEMICAL INFORMATION

ORGANIC REACTION MECHANISMS * 1991

INTRODUCTION TO SOL-GEL PROCESSING

This content has been downloaded from IOPscience. Please scroll down to see the full text.

IMAGE: AN INTEGRATED MODEL TO ASSESS THE GREENHOUSE EFFECT

Numerical Methods for the Solution of Ill-Posed Problems

ATLANTIS STUDIES IN MATHEMATICS VOLUME 3 SERIES EDITOR: J. VAN MILL

Ownership of Satellites

Enantiomer Separation

CISM International Centre for Mechanical Sciences

Transcription:

Rare Event Simulation using Monte Carlo Methods Edited by Gerardo Rubino And Bruno Tuffin INRIA, Rennes, France A John Wiley and Sons, Ltd., Publication

Rare Event Simulation using Monte Carlo Methods

Rare Event Simulation using Monte Carlo Methods Edited by Gerardo Rubino And Bruno Tuffin INRIA, Rennes, France A John Wiley and Sons, Ltd., Publication

This edition first published 2009 2009 John Wiley & Sons Ltd. Registered office John Wiley & Sons Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, United Kingdom For details of our global editorial offices, for customer services and for information about how to apply for permission to reuse the copyright material in this book please see our website at www.wiley.com. The right of the author to be identified as the author of this work has been asserted in accordance with the Copyright, Designs and Patents Act 1988. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted by the UK Copyright, Designs and Patents Act 1988, without the prior permission of the publisher. Wiley also publishes its books in a variety of electronic formats. Some content that appears in print may not be available in electronic books. Designations used by companies to distinguish their products are often claimed as trademarks. All brand names and product names used in this book are trade names, service marks, trademarks or registered trademarks of their respective owners. The publisher is not associated with any product or vendor mentioned in this book. This publication is designed to provide accurate and authoritative information in regard to the subject matter covered. It is sold on the understanding that the publisher is not engaged in rendering professional services. If professional advice or other expert assistance is required, the services of a competent professional should be sought. Library of Congress Cataloging-in-Publication Data Rare event simulation using Monte Carlo methods/edited by Gerardo Rubino, Bruno Tuffin. p. cm. Includes bibliographical references and index. ISBN 978-0-470-77269-0 (cloth) 1. Limit theorems (Probability theory) 2. Monte Carlo method. 3. System analysis Data processing. 4. Digital computer simulation. I. Rubino, Gerardo, 1955- II. Tuffin, Bruno. QA273.67.R37 2009 519.2 dc22 2009004186 A catalogue record for this book is available from the British Library ISBN: 978-0-470-77269-0 Typeset in 10/12pt Times by Laserwords Private Limited, Chennai, India Printed in the UK by TJ International, Padstow, Cornwall.

Contents Contributors Preface vii ix 1 Introduction to Rare Event Simulation 1 Gerardo Rubino and Bruno Tuffin PART I THEORY 15 2 Importance Sampling in Rare Event Simulation 17 Pierre L Ecuyer, Michel Mandjes and Bruno Tuffin 3 Splitting Techniques 39 Pierre L Ecuyer, François Le Gland, Pascal Lezaud and Bruno Tuffin 4 Robustness Properties and Confidence Interval Reliability Issues 63 Peter W. Glynn, Gerardo Rubino and Bruno Tuffin PART II APPLICATIONS 85 5 Rare Event Simulation for Queues 87 José Blanchet and Michel Mandjes 6 Markovian Models for Dependability Analysis 125 Gerardo Rubino and Bruno Tuffin 7 Rare Event Analysis by Monte Carlo Techniques in Static Models 145 Héctor Cancela, Mohamed El Khadiri and Gerardo Rubino 8 Rare Event Simulation and Counting Problems 171 José Blanchet and Daniel Rudoy

vi CONTENTS 9 Rare Event Estimation for a Large-Scale Stochastic Hybrid System with Air Traffic Application 193 Henk A. P. Blom, G. J. (Bert) Bakker and Jaroslav Krystul 10 Particle Transport Applications 215 Thomas Booth 11 Rare Event Simulation Methodologies in Systems Biology 243 Werner Sandmann Index 267

Contributors We list the contributors to this volume in alphabetical order, with their respective electronic addresses. José Blanchet, University of Columbia, New York, USA jose.blanchet@columbia.edu Henk Blom, NLR, The Netherlands blom@nlr.nl Thomas Booth, Los Alamos National Laboratory, USA teb@lanl.gov Héctor Cancela, University of the Republic, Montevideo, Uruguay cancela@fing.edu.uy Mohamed El Khadiri, University of Nantes/IUT of Saint-Nazaire, France mohamed.elkhadiri@iutsn.univ-nantes.fr Peter Glynn, University of Stanford, USA glynn@stanford.edu Pierre L Ecuyer, University of Montréal, Canada lecuyer@iro.umontreal.ca François Legland, INRIA, France legland@irisa.fr Pascal Lezaud, DGAC, France lezaud@cena.fr Michel Mandjes, CWI, The Netherlands mmandjes@science.uva.nl Gerardo Rubino, INRIA,France gerardo.rubino@inria.fr

viii CONTRIBUTORS Daniel Rudoy, Harvard University, Cambridge, USA rudoy@seas.harvard.edu Werner Sandmann, Bamberg University werner.sandmann@wiai.uni-bamberg.de Bruno Tuffin, INRIA, France bruno.tuffin@inria.fr

Preface Rare event simulation has attracted a great deal of attention since the first development of Monte Carlo techniques on computers, at Los Alamos during the production of the first nuclear bomb. It has found numerous applications in fields such as physics, biology, telecommunications, transporting systems, and insurance risk analysis. Despite the amount of work on the topic in the last sixty years, there are still domains needing to be explored because of new applications. A typical illustration is the area of telecommunications, where, with the advent of the Internet, light-tailed processes traditionally used in queuing networks now have to be replaced by heavy-tailed ones, and new developments of rare event simulation theory are required. Surprisingly, we found that not much was written on the subject, in fact only one book was devoted to it, with a special focus on large-deviations theory. The idea of writing this book therefore started from a collaborative project managed in France by Institut National de Recherche en Informatique et Automatique (INRIA) in 2005 2006 (see http://www.irisa.fr/armor/rare/), with groups of researchers from INRIA, the University of Nice, the CWI in the Netherlands, Bamberg University in Germany, and the University of Montréal in Canada. In order to cover the broad range of applications in greater depth, we decided to request contributions from authors who were not members of the project. This book is the result of that effort. As editors, we would like to thank the contributors for their effort in writing the chapters of this book. We are also grateful to John Wiley & Sons staff members, in particular Susan Barclay and Heather Kay, for their assistance and patience. Gerardo Rubino and Bruno Tuffin

1 Introduction to rare event simulation Gerardo Rubino and Bruno Tuffin This monograph deals with the analysis by simulation of rare situations in systems of quite different types, that is, situations that happen very infrequently, but important enough to justify their study. A rare event is an event occurring with a very small probability, the definition of small depending on the application domain. These events are of interest in many areas. Typical examples come, for instance, from transportation systems, where catastrophic failures must be rare enough. For instance, a representative specification for civil aircraft is that the probability of failure must be less than, say, 10 9 during an average-length flight (a flight of about 8 hours). Transportation systems are called critical in the dependability area because of the existence of these types of failures, that is, failures that can lead to loss of human life if they occur. Aircraft, trains, subways, all these systems belong to this class. The case of cars is less clear, mainly because the probability of a catastrophic failure is, in many contexts, much higher. Security systems in nuclear plants are also examples of critical systems. Nowadays we also call critical other systems where catastrophic failures may lead to significant loss of money rather than human lives (banking information systems, for example). In telecommunications, modern networks often offer very high speed links. Since information travels in small units or messages (packets in the Internet world, cells in asynchronous transfer mode infrastructures, etc.), the saturation of the memory of a node in the network, even during a small amount of time, may induce a huge amount of losses (in most cases, any unit arriving at Rare Event Simulation using Monte Carlo Methods 2009 John Wiley & Sons, Ltd Edited by G. Rubino and B. Tuffin

2 INTRODUCTION a saturated node is lost). For this reason, the designer wants the overflow of such a buffer to be a rare event, with probabilities of the order of 10 9. Equivalently, the probability of ruin is a central issue for the overall wealth of an insurance company: the (time) evolution of the reserves of the company is represented by a stochastic process, with initial value R 0 ; the reserves may decrease due to incoming claims, but also have a linear positive drift thanks to the premiums paid by customers. A critical issue is to estimate the probability of ruin, that is, the probability of the reserve process reaching zero. In biological systems, molecular reactions may occur on different time scales, and reactions with extremely small occurrence rates are therefore rare events. As a consequence, this stiffness requires the introduction of specific techniques to solve the embedded differential equations. This chapter introduces the general area of rare event simulation, recalls the basic background elements necessary to understand the technical content of the following chapters, and gives an overview of the contents of those chapters. 1.1 Basics in Monte Carlo Solving scientific problems often requires the computation of sums or integrals, or the solution of equations. Direct computations, also called analytic techniques, become quickly useless due to their stringent requirements in terms of complexity and/or assumptions on the model. In that case, approximation techniques can sometimes be used. On the other hand, standard numerical analysis procedures also require assumptions (even if less stringent) on the model, and suffer from inefficiency as soon as the mathematical dimension of the problem increases. A typical illustration is when using quadrature rules for numerical integration. Considering, for instance, the trapezoidal rule with n points in dimension s, the speed of convergence to the exact value is usually O(n 2/s ), therefore slow when s is large. The number of points necessary to reach a given precision increases exponentially with the dimension. To cope with those problems, we can use Monte Carlo simulation techniques, which are statistical approximation techniques, instead of the above mentioned deterministic ones. Let us start with the basic concepts behind Monte Carlo techniques. Suppose that the probability γ of some event A is to be estimated. A model of the system is simulated n times (we say that we build an n-sample of the model) and at each realization we record whether A happens or not. In the simplest (and most usual) case, the n samples are independent (stochastically speaking) of each other. If X i is the (Bernoulli) random variable X i = (A occurs in the nth sample) (i.e., X i = 1ifA occurs in sample i, 0 if not), we estimate γ by γ = (X 1 + + X n )/n. Observe that E(X i ) = γ and Var(X i ) = γ(1 γ)which we denote by σ 2 (for basic results on probability theory and to verify, for instance, typical results on Bernoulli random variables, the reader can consult textbooks such as [6]). How far will the given estimator γ be from the actual value γ? To answer this question, we can apply the central limit theorem, which says that X 1 + +X n

INTRODUCTION 3 is approximately normal (if n is large enough ) [6]. Let us scale things first: E(X 1 + +X n ) = nγ and Var(X 1 + +X n ) = nσ 2, so the random variable Z = [nσ 2 ] 1/2 (X 1 + +X n nγ ) has mean 0 and variance 1. The central limit theorem says that as n, the distribution of Z tends to the standard normal distribution N (0, 1), whose cdf is (x) = (2π) 1/2 x exp( u2 /2)du. We then assume that n is large enough so that Z N (0, 1) in distribution. This means, for instance, that P( z Z z) = 2 (z) 1(using ( z) = 1 (z)), which is equivalent to writing (( P γ zσ, γ + zσ ) ) γ 2 (z) 1. n n The random interval I = ( γ zσn 1/2 ) is called a confidence interval for γ, with level 2 (z) 1. We typically consider, for instance, z = 1.96 because 2 (1.96) 1 = 0.95 (or z = 2.56, for which 2 (2.56) 1 = 0.99). The preceding observations lead to P(γ ( γ 1.96σn 1/2 )) 0.95. In general, for a confidence interval with level α, 0<α<1, we take ( γ 1 ((1 + α)/2)σ n 1/2 ). From the practical point of view, we build our n-sample (i.e., we perform our n system simulations), we estimate γ by γ and, since σ 2 is unknown, we estimate it using σ 2 = n γ(1 γ )/(n 1). The reason for dividing by n 1 and not by n is to have an unbiased estimator (which means E( σ 2 ) = σ 2,asE( γ)= γ ), although from the practical point of view this is not relevant, since n will be usually large enough. Finally, the result of our estimation work will take the form I = ( γ 1.96 σn 1/2 ), which says that γ is, with high probability (our confidence level), inside this interval and by high we mean 0.95. The speed of convergence is measured by the size of the confidence interval, that is, 2zσn 1/2. This decreases as the inverse square root of the sample size, independently of the mathematical dimension s of the problem, and therefore faster than standard numerical techniques, even for small values of s. Now, suppose that A is a rare event, that is to say, that γ 1. For very small numbers, the absolute error (given by the size of the confidence interval, or by half this size) is not of sufficient interest: the accuracy of the simulation process is captured by the relative error instead, that is, the absolute error divided by the actual value: RE = zn 1/2 σ/γ. This leads immediately to the main problem with rare events, because if γ 1, then γ(1 γ) RE = z nγ z n γ 1 (unless n is huge enough). To illustrate this, let us assume that we want a relative error less than 10%, and that γ = 10 9. The constraint RE 0.1 translates into n 3.84 10 11. In words, this means that we need a few hundred billion experiments to get a modest 10% relative error in the answer. If the system being simulated is complex enough, this will be impossible, and something different must be done in order to provide the required estimation. More formally, if we