There is no straightforward approach for choosing the warmup period l.

Similar documents
Output Analysis (2, Chapters 10 &11 Law)

Chapter 11 Output Analysis for a Single Model. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation


A Question. Output Analysis. Example. What Are We Doing Wrong? Result from throwing a die. Let X be the random variable

1 Inferential Methods for Correlation and Regression Analysis

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

1 Introduction to reducing variance in Monte Carlo simulations

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

Sample Size Determination (Two or More Samples)

Estimation for Complete Data

Properties and Hypothesis Testing

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Basis for simulation techniques

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

A statistical method to determine sample size to estimate characteristic value of soil parameters

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Statistical Analysis on Uncertainty for Autocorrelated Measurements and its Applications to Key Comparisons

Stat 200 -Testing Summary Page 1

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Random Variables, Sampling and Estimation

Stochastic Simulation

6.3 Testing Series With Positive Terms

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.

Simulation. Two Rule For Inverting A Distribution Function

Chapter 13, Part A Analysis of Variance and Experimental Design

Chapter 6 Sampling Distributions

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

GG313 GEOLOGICAL DATA ANALYSIS

Statistics 511 Additional Materials

Lecture 2: Monte Carlo Simulation

Common Large/Small Sample Tests 1/55

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

Stat 421-SP2012 Interval Estimation Section

Mathematical Statistics - MS

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Confidence Intervals

CS/ECE 715 Spring 2004 Homework 5 (Due date: March 16)

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

1.010 Uncertainty in Engineering Fall 2008

Data Analysis and Statistical Methods Statistics 651

c. Explain the basic Newsvendor model. Why is it useful for SC models? e. What additional research do you believe will be helpful in this area?

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Math 113 Exam 3 Practice

Dr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines)

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Analysis of Experimental Data

Topic 9: Sampling Distributions of Estimators

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

Expectation and Variance of a random variable

Lecture 3: August 31

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Topic 9: Sampling Distributions of Estimators

1036: Probability & Statistics

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Sequences. Notation. Convergence of a Sequence

Statistics 20: Final Exam Solutions Summer Session 2007

Statistical inference: example 1. Inferential Statistics

MA238 Assignment 4 Solutions (part a)

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation

SDS 321: Introduction to Probability and Statistics

Chapter 8: Estimating with Confidence

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Output Analysis and Run-Length Control

The standard deviation of the mean

Approximations and more PMFs and PDFs

(7 One- and Two-Sample Estimation Problem )

Last Lecture. Wald Test

Y i n. i=1. = 1 [number of successes] number of successes = n

Reliability and Queueing

Topic 9: Sampling Distributions of Estimators

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

GUIDELINES ON REPRESENTATIVE SAMPLING

This is an introductory course in Analysis of Variance and Design of Experiments.

Stat 319 Theory of Statistics (2) Exercises

32 estimating the cumulative distribution function

Math 113 Exam 3 Practice

Error & Uncertainty. Error. More on errors. Uncertainty. Page # The error is the difference between a TRUE value, x, and a MEASURED value, x i :

G. R. Pasha Department of Statistics Bahauddin Zakariya University Multan, Pakistan

Transcription:

B. Maddah INDE 504 Discrete-Evet Simulatio Output Aalysis () Statistical Aalysis for Steady-State Parameters I a otermiatig simulatio, the iterest is i estimatig the log ru steady state measures of performace. However, iitial coditios could bias the estimatio. To avoid this bias, the most commo techique is to warm up the model, also called iitial-data deletio. Idetify idex l (for discrete-time processes) or time t 0 (for cotiuous-time processes) beyod which the output appears ot to be driftig ay more statistically. The, cosider data beyod the cut-off poit to estimate the desired measure of performace. For example, suppose the objective is to estimate the steady state mea of a process based o observed values, Y 1, Y,.Y m, i a sigle replicatio. The, after removig l iitial trasiet data, a estimator for Yi i l 1 the mea is Y ( m, l) m l m There is o straightforward approach for choosig the warmup period l.. 1

The simplest method for fidig l is a graphical procedure based o multiple replicatios. (May replicatios are used because it s difficult to estimate l from just oe replicatio.) The idea is to plot a estimate for E[Y i ] versus i ad choose l by eyeballig a poit where the curves flattes out. To estimate E[Y i ], first a average across replicatios is take leadig toy i ad the a movig average with widow w is applied to for further smoothig leadig to Yi ( w ). A plot of Yi ( w ) is the used to estimate l as show.

Note fially that there is o easy way to specify the ru legth i a otermiatig simulatio 1. You ca (i) rely o a graphical procedure to judge if steady state is reached or (ii) keep icreasig the legth of the simulatio util the variability i the output measure is reasoably small. Replicatio/Deletio Approaches for Estimatig Meas i a Notermiatig Simulatio Assume that a appropriate warm-up period l has bee idetified. The replicatio/deletio techique starts with replicatios each havig m observatios ad delete the first l observatios of each replicatio. The, it proceeds just like the case of termiatig simulatio to estimate a CI for the mea based o the ormal approximatio ad the t-distributio. Other Approaches for Estimatig Meas i a Notermiatig Simulatio Other approaches for estimatig meas i a otermiatig simulatio are based o havig oe log ru (versus may short replicatios i Replicatio/Deletio). 1 Whitt (1998) [Maagemet Sciece 35: 1341 1366] offers simple formulas for estimatig the ru legth of queueig simulatios. See also Maddah et al. (017) [Europea Joural of Operatioal Research, 6: 60-619] for extesive testig ad validatio of Whitt s formulas. 3

Batch Meas The batch meas approach cosiders a sigle ru with m observatios ad divides the observatios ito batches of legth k each (m = k). (Warm-up period data may be elimiated first.) Let Yj ( k ) be the mea of the observatios i batch j. 4

The batch meas approach is based o the approximate assumptio that Yj ( k ) are iid ormal ad that they provide a ubiased estimator for the mea. The, a cofidece iterval for the mea is foud as Y ( m) t 1, / S ( ) where Y( m) j1 Y ( k) j, S ( ) j1 ( Y ( k) Y ( m)) j. 1 The mai advatage of the batch meas approach is its simplicity ad the major drawback is the difficulty of choosig a appropriate batch size that reduces correlatio. See BCNN (pp. 447-450) book for correlatio tests, a example, ad further details. 5

Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe o the basis of a statistical aalysis. For example, whe comparig two systems, oe system ca be better o some replicatios ad worse o others. Tellig which system is better requires ca be oly doe approximately based o statistical aalysis. Example 1 Suppose a bak is cosiderig two possible ATM systems. o Zippy: M/M/1 queue with arrival rate 1 ad oe fast server with mea service time 0.9 miutes. o Kluky: M/M/ queue with arrival rate 1 ad two slow servers with mea service time 1.8 miutes each. The performace measure of iterest is the mea delay of the first 100 customers, d(100) (system starts empty). 6

The true measures (evaluated based o exact aalysis) are d Z (100) = 4.13 ad d K (100) = 3.70. So Kluky is better. Simulatig these two systems ad comparig the output of oe ru at a time i 100 replicatios, gives the followig. I 53 Out of 100 replicatios, d Z (100) < d K (100). That is, P{wrog aswer} = 0.53. This meas that a aalysis based o a sigle ru is likely to lead to wrog aswers. To improve the compariso oe could simulate each system for replicatio ad the base the decisio o comparig the averages of the replicatios. Doig 100 comparisos this way (with a total of 100 replicatios) gave the followig results for differet s. 7

A dot plot illustrates what s happeig. This example highlights the eed for methods to assess the ucertaity ad give statistical bouds or guaratees for coclusios ad decisios. Cofidece Itervals for the Differece of Systems Cosider two alterative systems with performace measures i, i = 1, (the mea of somethig). Suppose that i replicatios are made for system i. 8

Let X ij be the observatio from system i o replicatio j. The idea is to use the X ij s to build a cofidece iterval for If the iterval misses 0, we coclude there is a statistical differece betwee the two systems. We ca also have a idea of how sigificat is the differece betwee the two systems. There are two approaches for buildig cofidece itervals for Paired-t ad two-sample-t. Paired-t cofidece Iterval Assume replicatios of each system are performed. Let Z j = X 1j X j. The, the Z j s are IID. The paired-t approach works by buildig a cofidece iterval based o the sample mea ad variace for Z, Z ( Z Z ( )) j j1 j1 Z Z ( ), S ( ) j 1. Assume that the Z j s are ormally distributed (which may be justified by the cetral limit theorem). The, the followig is a approximate 100(1 ) percet cofidece iterval for 9

SZ ( ) Z( ) t 1, /. Oe importat fact about the paired-t cofidece iterval is that X 1j ad X j eed ot to be idepedet. This could be useful whe utilizig a variace reductio techique. Example Cosider comparig two alterative orderig policies i a (s, S) system ((0, 40) ad (0, 80)). The measure of performace of iterest is the average mothly cost i a 10 moth horizo. The simulatio results of five replicatios of the two systems are as follows. The, Z(5) 4.98 ad S Z (5) 1.19, ad usig a 10% sigificace level, t 4, 0.05 a approximate 90% t-paired cofidece iterval is 1.19 4.98.13 4.98 3.33 (1.65,8.31). 5 10

Sice the iterval does ot cotai 0, this implies that the two (s, S) policies are sesibly differet. I additio, the secod policy seems to be better (as it has lower cost). A two-sample cofidece iterval This method allows samples of uequal sizes (i.e. 1 ) from both systems. However, it requires that X 1j ad X j be idepedet. This ca be doe by usig differet radom umber streams for obtaiig X 1j ad X j. The method works by estimatig the mea ad variace for each sample separately. That is, oe first estimates 1 1 X ( X X ( )) 1 j 1 j 1 1 j1 j1 1 1 1 1 1 1 1 X ( ), S ( ), X ( X X ( )) j j j1 j1 1 X ( ), S ( ). A approximate 100(1 ) % cofidece iterval for is S1 ( 1) S ( ) X1( 1) X ( ) t fˆ, /, 1 where S1 ( 1 ) / 1 S ( ) / fˆ. S1 ( 1 ) / 1 /( 1 1) S ( ) / /( 1) 11

Example 3 Applyig the two-sample cofidece iterval approach for the ivetory system i Example, gives X (5) 15.57, X (5) 10.59, S (5) 4, S (5) 3.76. 1 1 The, f ˆ 7.99 8, ad t ˆ,0.95 1.86, ad approximate 90% cofidece iterval is f 4.98 1.86(1.46) 4.98.3 (.66,7.3). This also idicates that the two policies are sesibly differet with the secod policy beig better. Paired-t versus two-sample cofidece iterval Comparig steady-state measures of two system The approaches for buildig cofidece itervals require iid observatios X 11, X 1,, X 11, ad X1, X,, X each system. These ca be obtaied easily for termiatig simulatio., from 1

For o-termiatig steady-state simulatios such observatios ca be obtaied by the methods discussed before such as replicatio/deletio ad batch meas. Cofidece Itervals for the Differece of > Systems If more that two systems are to be compared, the cofidece itervals for the differece of the performace measure betwee differet pairs of the systems are required. A key issue here is the validity of idividual cofidece itervals for multiple comparisos. Specifically, if c cofidece itervals are to be developed with a overall cofidece level of 1 the each iterval must have a cofidece level 1 c. (This is called the Boferroi iequality.) This implies that makig multiple comparisos require a large umber of replicatios to achieve the high cofidece level 1 c. Comparig with a stadard Here, oe of the systems (say system 1) is the stadard (e.g., the existig cofiguratio). We eed to compare each of the other k systems with the stadard, ad obtai cofidece itervals o 13

k The, to obtai a overall cofidece of 1 each iterval should have a cofidece level 1 k 1). Example 4 The simulatio of five differet (s, S) policies i a ivetory system gave the followig results i 5 replicatios. The performace measure of iterest is the average mothly cost over a horizo o 10 moths. The first policy is cosidered the stadard system. The desired overall cofidece level is 90%. So, itervals for k k =,3,4,5, are developed with a cofidece level (1 0.1/4)% = 97.5%. The cofidece itervals are as follows. 14

Here we see that system 4 ad 5 are obviously worse tha the stadard, while system ad 3 are ot ecessarily better. All pair-wise compariso Here all systems are compared pair-wise. This would be doe if all alteratives receive equal cosideratio. Cofidece itervals for i 1 i, i 1 = 1,, k, i = 1,, k, i 1 i. The total umber of comparisos to be made is k(k 1)/. So to obtai a overall cofidece of 1 the cofidece level for every iterval should be 1 k(k 1)/]. See BCNN book (pp. 475-476) for a example of all pairwise compariso. 15