More on Input Distributions

Similar documents
B. Maddah INDE 504 Discrete-Event Simulation. Output Analysis (1)

Overall Plan of Simulation and Modeling I. Chapters

Using SPSS for One Way Analysis of Variance

Unit 5a: Comparisons via Simulation. Kwok Tsui (and Seonghee Kim) School of Industrial and Systems Engineering Georgia Institute of Technology

Using Microsoft Excel

Simulation. Where real stuff starts

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr.

CPSC 531: System Modeling and Simulation. Carey Williamson Department of Computer Science University of Calgary Fall 2017

Virtual Beach Building a GBM Model

Simulation. Where real stuff starts

Solution manual of Simulation with Arena 6th Edition by W. David Kelton, Randall Sadowski and Nancy Zupick

Business Statistics. Lecture 5: Confidence Intervals

Thank you for your interest in the Support Resistance Strength Analyzer!

Chapter 10 Verification and Validation of Simulation Models. Banks, Carson, Nelson & Nicol Discrete-Event System Simulation

Description of the ED library Basic Atoms

Session-Based Queueing Systems

The data for this lab comes from McDonald Forest. We will be working with spatial data representing the forest boundary, streams, roads, and stands.

Cover Page. The handle holds various files of this Leiden University dissertation

Contents. 13. Graphs of Trigonometric Functions 2 Example Example

Motion II. Goals and Introduction

Queueing Review. Christos Alexopoulos and Dave Goldsman 10/25/17. (mostly from BCNN) Georgia Institute of Technology, Atlanta, GA, USA

Chapter 6: CPU Scheduling

P1 Chapter 3 :: Equations and Inequalities

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests:

ISyE 6644 Fall 2014 Test #2 Solutions (revised 11/7/16)

Scheduling I. Today. Next Time. ! Introduction to scheduling! Classical algorithms. ! Advanced topics on scheduling

let s examine pupation rates. With the conclusion of that data collection, we will go on to explore the rate at which new adults appear, a process

14 Random Variables and Simulation

The entire data set consists of n = 32 widgets, 8 of which were made from each of q = 4 different materials.

LAB 2 - ONE DIMENSIONAL MOTION

LAB 3 - VELOCITY AND ACCELERATION

Lab I. 2D Motion. 1 Introduction. 2 Theory. 2.1 scalars and vectors LAB I. 2D MOTION 15

Mathematica Project 3

Selecting the Best Simulated System. Dave Goldsman. Georgia Tech. July 3 4, 2003

Computer simulation of radioactive decay

EXPERIMENT: REACTION TIME

STA Module 10 Comparing Two Proportions

An area chart emphasizes the trend of each value over time. An area chart also shows the relationship of parts to a whole.

AMS 132: Discussion Section 2

Lab 1 Uniform Motion - Graphing and Analyzing Motion

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Taguchi Method and Robust Design: Tutorial and Guideline

module, with the exception that the vials are larger and you only use one initial population size.

Computer Science, Informatik 4 Communication and Distributed Systems. Simulation. Discrete-Event System Simulation. Dr.

Fog Monitor 100 (FM 100) Extinction Module. Operator Manual

Module 5: CPU Scheduling

1. Double-click the ArcMap icon on your computer s desktop. 2. When the ArcMap start-up dialog box appears, click An existing map and click OK.

Lab I. 2D Motion. 1 Introduction. 2 Theory. 2.1 scalars and vectors LAB I. 2D MOTION 15

The Geodatabase Working with Spatial Analyst. Calculating Elevation and Slope Values for Forested Roads, Streams, and Stands.

Watershed Modeling Orange County Hydrology Using GIS Data

1 Introduction to Minitab

Jaguar DFT Optimizations and Transition State Searches

Introduction to Computer Tools and Uncertainties

Chapter 11. Output Analysis for a Single Model Prof. Dr. Mesut Güneş Ch. 11 Output Analysis for a Single Model

Conformational Analysis of n-butane

Mnova Software for Analyzing Reaction Monitoring NMR Spectra

IOE 202: lectures 11 and 12 outline

Spatial Data Analysis in Archaeology Anthropology 589b. Kriging Artifact Density Surfaces in ArcGIS

Financial Econometrics Review Session Notes 3

Queueing Review. Christos Alexopoulos and Dave Goldsman 10/6/16. (mostly from BCNN) Georgia Institute of Technology, Atlanta, GA, USA

Exercises Solutions. Automation IEA, LTH. Chapter 2 Manufacturing and process systems. Chapter 5 Discrete manufacturing problems

Practice Problems Section Problems

CPU scheduling. CPU Scheduling

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Scheduling I. Today Introduction to scheduling Classical algorithms. Next Time Advanced topics on scheduling

A Scientific Model for Free Fall.

Introduction to ArcMap

Using SkyTools to log Texas 45 list objects

Construction Operation Simulation

WEATHER AND CLIMATE COMPLETING THE WEATHER OBSERVATION PROJECT CAMERON DOUGLAS CRAIG

Output Data Analysis for a Single System

Geography 281 Map Making with GIS Project Four: Comparing Classification Methods

41. Sim Reactions Example

Specific Differences. Lukas Meier, Seminar für Statistik

Experiment: Oscillations of a Mass on a Spring

Graphical User Interfaces for Emittance and Correlation Plot. Henrik Loos

Momentum & Collisions

Physics 201 Lab 2 Air Drag Simulation

OECD QSAR Toolbox v.4.1. Tutorial illustrating new options for grouping with metabolism

2 Prediction and Analysis of Variance

Safety: BE SURE TO KEEP YOUR SMART CART UPSIDE-DOWN WHEN YOU RE NOT ACTIVELY USING IT TO RECORD DATA.

Getting a Good Grip on Grouping ACEIT User Conference

Chem 1 Kinetics. Objectives. Concepts

Virtual Beach Making Nowcast Predictions

Using the Budget Features in Quicken 2008

57:022 Principles of Design II Final Exam Solutions - Spring 1997

TDDI04, K. Arvidsson, IDA, Linköpings universitet CPU Scheduling. Overview: CPU Scheduling. [SGG7] Chapter 5. Basic Concepts.

Announcements Wednesday, August 30

Supplementary Technical Details and Results

Distribution Fitting (Censored Data)

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Multiple Testing. Gary W. Oehlert. January 28, School of Statistics University of Minnesota

Analysis of Software Artifacts

Homework 6: Image Completion using Mixture of Bernoullis

v Prerequisite Tutorials GSSHA WMS Basics Watershed Delineation using DEMs and 2D Grid Generation Time minutes

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

How many states. Record high temperature

Clojure Concurrency Constructs, Part Two. CSCI 5828: Foundations of Software Engineering Lecture 13 10/07/2014

Systematic Uncertainty Max Bean John Jay College of Criminal Justice, Physics Program

Gravity Modelling Forward Modelling Of Synthetic Data

Transcription:

More on Input Distributions Importance of Using the Correct Distribution Replacing a distribution with its mean Arrivals Waiting line Processing order System Service mean interarrival time = 1 minute mean service time = 0.99 minute Exit Case 1: Exponential interarrival and service times (M/M/1 queue, assume actual system) Long-run average number in queue 98 Case : Constant interarrival and service times Average number in queue = 0 Conclusion: One must also capture the variability in the input processes. Using the Wrong Distribution Single-server queueing system with exponential interarrival times Weibull, Exponential, Normal, and Lognormal distributions were fit to 00 observed service times (see histogram). Proportion Histogram 0.0 0.15 0.10 0.05 0.00 0.10 0.50 0.90 1.30 1.70.10.50.90 Interval Midpoint 15 intervals of width 0. between 0 and 3 Histogram Density/Proportion It can be shown that the Weibull distribution provides the best fit for the data [see Figure below and Section 6.7 in Law and Kelton(000)]. 0.5 0.0 0.15 0.10 0.05 0.00 Density/Histogram Overplot 0.10 0.50 0.90 1.30 1.70.10.50.90 Interval Midpoint 15 intervals of width 0. between 0 and 3 1 - Weibull - Lognormal 3 - Exponential Best Fit 4 Histogram of Service-Time Data. 5 Density / histogram overplot for the service-time data. 6

Simulation Results Based on 100,000 Delays Lab Session Summary Model Building Service-time distribution Average delay Percentage error Weibull* 4.36 _ exponential 6.71 53.9 Lab Session Summary Create Modules, Connect Modules to generate process flow Define attributes, TNOW Define Input arrival rates, service rates Define resource schedule rules Shifts, breaks, failure frequency, expression builder. Define Project parameters normal 6.04 38.5 Resource utilizations, queue length, time in queue lognormal 7.19 64.9 Define run parameters Run length, time units, replications Define animation parameters * Best fit 7 Run simulation Generate Reports 9 Output Analysis Time frame of simulations Strategy for data collection and analysis Confidence intervals Comparing two alternatives What We ll Do... Comparing many alternatives via the Arena Process Analyzer (PAN) Searching for an optimal alternative with OptQuestyu 11 Types of Statistics Reported Many output statistics are one of three types: Tally avg., max, min of a discrete list of numbers Used for discrete-time output processes like waiting times in queue, total times in system Time-persistent time-average, max, min of a plot of something where the x-axis is continuous time Used for continuous-time output processes like queue lengths, WIP, server-busy functions (for utilizations) Counter accumulated sums of something, usually just counts of how many times something happened Often used to count entities passing through a point in the model 1

Introduction Time Frame of Simulations Strategy for Data Collection and Analysis Random input leads to random output (RIRO) Run a simulation (once) what does it mean? Was this run typical or not? Variability from run to run (of the same model)? Need statistical analysis of output data From a single model configuration Compare two or more different configurations Search for an optimal configuration Terminating: Specific starting, stopping conditions Run length will be well-defined (and finite; Known starting and stopping conditions) Steady-state: Long-run (technically forever) Theoretically, initial conditions don t matter (but practically they usually do) Not clear how to terminate a simulation run (theoretically infinite) Interested in system response over long period of time This is really a question of intent of the study For terminating case, make IID replications Run > Setup > Replication Parameters: Number of Replications field Check both boxes for Initialize Between Replications Separate results for each replication Category by Replication report Model 5-; Daily Profit, Daily Late Wait Jobs; 10 replications Each run is an observation of the Two RVs Statistical analysis of output is often ignored Has major impact on how output analysis is done This is a big mistake no idea of precision of results 13 Sometimes it s not clear which is appropriate 14 Is 10 replications sufficient? 15 Strategy for Data Collection and Analysis (cont d.) Category Overview report will have some statistical-analysis results of the output across the replications Turn off animation altogether for max speed Run > Run Control > Batch Run (No Animation) 16 How Many Replications? The confidence level of simulation output drawn from a set of simulation runs depends on the size of data set. The larger the number of runs, the higher is the associated confidence. However, more simulation runs also require more effort and resources for large systems. Thus, the main goal must be in finding the smallest number of simulation runs that will provide the desirable confidence. 17 Confidence Intervals for Terminating Systems Replication summary outputs as the basic data: Possibly most useful part 95% confidence interval on expected values This information (except standard deviation) is in Category Overview report One way to reduce the half-width of CI is to increase n, the sample size (in this case the number of replications) 18

Half Width and Number of Replications Half Width and Number of Replications (cont d.) Half Width and Number of Replications (cont d.) Prefer smaller confidence intervals precision Notation: Confidence interval: Half-width = Can t control t or s Must increase n how much? Want this to be small, say < h where h is prespecified 19 Set half-width = h, solve for Not really solved for n (t, s depend on n) Approximation: Replace t by z, corresponding normal critical value Pretend that current s will hold for larger samples Get s = sample standard deviation from initial number n 0 of replications Easier but different approximation: n grows quadratically as h decreases Application to Model 5- From initial 10 replications, 95% half-width on Daily Profit was ±$50.0... let s get this down to ±$0 or less First formula: n 1.96 (70.17 /0 ) = 47.3, so 48 Second formula: n 10(50.0 /0 ) = 63.0, so 63 Run the model for 100 replications Got 49.63 ± 13.81, satisfying criterion (Against 479.89 ± 50.0 for 10 replications) h 0 = half width from initial number n 0 of replications 0 1 Interpretation of Confidence Intervals Introduction Interval with random (data-dependent) endpoints that s supposed to have stated probability of containing, or covering, the expected value Target expected value is a fixed, but unknown, number Expected value = average of infinite number of replications Not an interval that contains, say, 95% of the data That s a prediction interval useful too, but different Usual formulas assume normally-distributed data Never true in simulation Might be approximately true if output is an average, rather than an extreme Central limit theorem Finding the Best System (Comparing Alternative Solutions) Simulations are typically performed to compare two or more alternate solutions Consider comparisons among a small number of systems (say to 30) The method that is appropriate depends on the type of comparison desired and properties of the output data Consider situations when there is no known functional relationship among alternative systems 4

Comparing Two Systems Two Systems (cont d) Paired t Test Example: We evaluate two different re-start strategies that an airline can evoke following a major traffic disruption, e.g., a snowstorm in the Northeast. We want to minimize a certain cost function associated with the re-start. Xi = cost from i th run of strategy 1. Assume: X 1,X,...,X n i.i.d. normal with unknown mean µ X and unknown variance. Y i = cost from i th run of strategy. Assume: Y 1, Y,..., Ym i.i.d. normal with unknown mean µ Y and unknown variance. Goal: Obtain confidence interval for µ X -µ Y. 5 How can we justify these assumptions? Independent data by controlling random numbers between replications. Identically distributed by performing replications under identical conditions. Normally distributed data (since we add up many subcosts to get overall costs for both strategies). 6 H 0 = µ X -µ Y = 0 Take n observations from both strategies Set D i = X i Y i for i = 1,,, n D 1 n = n n D i 1 S D = n ( D n-1 i -D n ) (Sample variance of D s) 100(1-α)% confidence interval: µ X -µ Y D n ± t α/, n-1 (Sample mean of D s) S D n 7 Paired t Test Cont Modified t Test (Welch Approach) Modified t Test (Welch Approach) Cont Find the CI IF CI includes 0 H 0 is true. (Improved system is not better!) x µ X -µ Y 0 Strong evidence that µ X < µ Y System Y is better than X ( 0 µ x ) X -µ Y Weak evidence that one system is better than the other 0 x µ X -µ Y Does not pair observations n X n Y X n = n n 1 X i and Y 1 m = n m Y m (Sample mean of X s and Y s) 1 S X = n ( X n-1 i - X n ) (Sample variance of X s) 1 S Y = m ( Y m-1 i - Y m ) (Sample variance of Y s) 100(1-α)% confidence interval: Where µ X -µ Y X n - Y m ± t α/, ν ν = S X n S X n S + Y m S + Y m S X n S Y m + n + 1 m + 1 Strong evidence that µ X > µ Y System X is better than Y 8 9 30

Comparing Two Alternatives Comparing Two Alternatives (cont d.) Compare Means via the Output Analyzer Usually compare alternative system scenarios, configurations, layouts, sensitivity analysis Model 6-3 Model 6-1, but add file Daily Profit.dat to Statistic module, Output column, Daily Profit row Saves this output statistic to this file for each replication Two versions Base case all inputs as originally defined More-bookings case Change Max Load from 4 to 8 hours (allow more bookings per day... increase utilization, profit? Maybe) 31 Reasonable but not-quite-right idea Make confidence intervals on expected outputs from each alternative, see if they overlap Base case: 49.63 ± 13.81, or [478.8, 506.44] More-bookings case: 564.53 ±.59, or [541.94, 587.1] No Overlap But this doesn t allow for a precise, efficient statistical conclusion 3 Output Analyzer is a separate application that operates on.dat files produced by Arena Launch separately from Windows, not from Arena To save output values (Expressions) of entries in Statistic data module (Type = Output) enter filename.dat in Output File column Just did for Daily Profit, not Daily Late Wait Jobs Will overwrite this file name next time either change the name here or out in Windows before the next run.dat files are binary can only be read by Output Analyzer 33 Compare Means via the Output Analyzer (cont d.) Compare Means via the Output Analyzer (cont d.) Evaluating Many Alternatives (K > ) Start Output Analyzer, open a new data group Basically, a list of.dat files of current interest Can save data group for later use.dgr file extension Add button to select (Open).dat files for the data group Analyze > Compare Means menu option Add data files A and B for the two alternatives Select Lumped for Replications field Title, confidence level, accept Paired-t Test, Scale Display 34 Results: Confidence interval on difference misses 0, so conclude that there is a (statistically) significant difference 35 Hypothesis Tests. Of course, it is well known that we will eventually reject H 0 : µ 1 = = µ k if the sample size is large enough. Simultaneous Confidence Intervals Bonferroni Tukey-Kramer Ranking and Selection Procedures Multiple Comparison Procedures 36

Evaluating Many Alternatives (K > ) Cont All Pair-wise Comparison Example R&S selects the best system, or a subset of systems that includes the best. Guarantee probability of correct selection. MCPs treat the comparison problem as an inference problem. Account for simultaneous errors. Both are relevant in simulation: Normally distributed data by batching. Independence by controlling random numbers. Multiple-stage sampling by retaining seeds. Form simultaneous CI for µ i -µ j for all i j Systems are simulated independently i.i.d. outputs Y i1,, Y ini Y i = 1 n i n i Y ij j=1 S k = 1 k n i 1-1 n i Y ij - Y i j=1 (Sample mean Y s) (Sample variance of Y s) Tukey s simultaneous confidence interval s are: tk, ν Y i - Y j ± 1 S ni + n 1 where ν = k (n i 1) j Compare 4 alternate designs for the architecture of a new computer system. Response time is the performance measure of interest. Smaller response time is preferred. For each architecture we take n = 6 replications Following is the summary data: Y 1 = 7 Y = 85 Y 3 = 76 Y 4 = 6 S = 100.9 Determine, with confidence 0.95, bounds on the difference between the expected response time of each architecture 37 Coverage 1 - α for any values of the n i 38 39 t - Table Solution Evaluating Many Alternatives with the Process Analyzer (PAN) ν/k 3 5 4.60 10 3.88 15 3.67 0 3.58 30 3.49 t - Table 4 5. 4.33 4.08 3.96 3.85 5 5.67 4.65 4.37 4.3 4.10 6 6.03 4.91 4.60 4.45 4.30 Tukey s CI with half width: 3.96 100.9 1 6 + 1 6 t = 3.96 from the t - table = 16 For instance, a CI for µ µ 4 is 85 6 ± 16 or, 3 ± 16 milliseconds. Since this CI does not contain 0, and since shorter response time is Better, we can screen out architecture for further consideration! With (many) more than two alternatives to compare, two problems are Simple mechanics of making many parameter changes, making many runs, keeping track of many output files Statistical methods for drawing reliable and useful conclusions Process Analyzer (PAN) addresses these PAN operates on program (.p) files produced when.doe file is run (or just checked) Start PAN from Arena (Tools > Process Analyzer) or via Windows 40 41 PAN runs on its own, separate from Arena 4

PAN Scenarios PAN Projects and Runs Model 6-4 for PAN Experiments A scenario in PAN is a combination of: A program (.p) file Set of input controls that you choose Chosen from Variables and Resource capacities think ahead You fill in specific numerical values Set of output responses that you choose Chosen from automatic Arena outputs or your own Variables Values initially empty to be filled in after run(s) To create a new scenario in PAN, double-click where indicated, get Scenario Properties dialog Specify Name, Tool Tip Text,.p file, controls, responses Values of controls initially as in the model, but you can change them in PAN this is the real utility of PAN Duplicate (right-click, Duplicate) scenarios, then edit for a new one Think of a scenario as a row 43 A project in PAN is a collection of scenarios Program files can be the same.p file, or.p files from different model.doe files Controls, responses can be the same or differ across scenarios in a project usually will be mostly the same Think of a project as a collection of scenario rows a table Can save as a PAN (.pan extension) file Select scenarios in project to run (maybe all) PAN runs selected models with specified controls PAN fills in output-response values in table Equivalent to setting up, running them all by hand but much easier, faster, less error-prone 44 Same as Model 6-3 except remove Output File entry in Statistic module PAN will keep track of outputs itself, so this is faster Controls set up a formal 3 factorial experiment Responses Control (factor) Max Load Max Wait Wait Allowance Daily Profit Daily Late Wait Jobs Level 0 1 0.5 + Level 40 7.0 3 = 8 Scenarios Also do Base Case Not required to do a designed experiment with PAN, but this is more informative than haphazard fooling around 45 Running Model 6-4 with PAN Statistical Comparisons with PAN Statistical Comparisons with PAN (cont d.) Scenarios Select all to run (click on left of row, Ctrl-Click or Shift-Click for more) To execute, or Run > Go or F5 What to make of all this? Statistical meaningfulness? 46 Model 6-4 alternatives were made with 100 replications each Better than one replication, but what about statistical validity of comparisons, selection of the best? Select Total Cost column, Insert > Chart (or or right-click on column, then Insert Chart) Chart Type: Box and Whisker Next, Total Cost; Next defaults Next, Identify Best Scenarios Bigger is Better, Error Tolerance = 0 (not the default) Show Best Scenarios; Finish 47 Vertical boxes: 95% confidence intervals Red scenarios statistically significantly better than blues More precisely, red scenarios are 95% sure to contain the best one Narrow down red set more replications, or Error Tolerance > 0 More details in book 48

A Follow-Up PAN Experiment A Follow-Up PAN Experiment (cont d.) Searching for an Optimal Alternative with OptQuest From 3 factorial experiment, it s clear that Max Load matters the most, and bigger appears better It s factor 1, varying between and + in each scenario as ordered there, creating clear down/up/down/up pattern Could also see this by computing main effects estimates Consult an experimental-design text Eliminate other two factors (fix them at their base-case levels) and study Max Load alone Let it be 0,, 4,..., 40 Set up a second PAN experiment to do this, created chart as before 49 Here, profit-maximizing Max Load is about 30 But Daily Late Wait Jobs keeps increasing (worsening) as Max Load increases At profit-maximizing Max Load = 30, it s 0.908 job/day, which seems bad since we only take 5 wait jobs/day Would like to require that it be at most 0.75 job/day... still want to maximize Daily Profit Allow other two factors back into the picture... 50 The scenarios we ve considered with PAN are just a few of many possibilities Seek input controls maximizing Daily Profit while keeping Daily Late Wait Jobs 0.75 Formulate as an optimization problem: Maximize Daily Profit Objective function is the simulation model Subject to 0 Max Load 40 Constraints on the Could also have 1 Max Wait 7 input control constraints on linear 0.5 Wait Allowance.0 (decision) variables combinations of input control variables (but we Daily Late Wait Jobs < 0.75 An output requirement, don t in this problem) not an input constraint Reasonable starting place best acceptable scenario so far: (the base case, actually) Where to go from here? Explore all of feasible three-dimensional space exhaustively? No. 51 OptQuest Using OptQuest Using OptQuest (cont d.) OptQuest searches intelligently for an optimum Like PAN, OptQuest Runs as a separate application can be launched from Arena Takes over the running of your model Asks that you identify the input controls and the output (just one) response objective Unlike PAN, OptQuest Allows you to specify constraints on the input controls Allows you to specify requirements on outputs Decides itself what input-control-value combinations to try Uses internal heuristic algorithms to decide how to change the input controls to move toward an optimum configuration You specify stopping criterion for the search Tools > OptQuest for Arena New session (File > New or Ctrl+N or ) Make sure the desired model window is active Select controls Variables, Resource levels Max Load, Lower Bound = 0, Upper Bound = 40, Conts. Max Wait, Lower Bound = 1, Upper Bound = 7, Discrete (Input Step Size 1) Wait Allowance, Lower Bound = 0.5, Upper Bound =, Conts. Constraints none here other than earlier Bounds Objective and Requirement Daily Profit Response Select Maximize Objective Daily Late Wait Jobs Response Select Requirement, enter 0.75 for Upper Bound Options window computational limits, procedures Time tab run for 0 minutes Precision tab vary number of replications from 10 to 100 Preferences tab various settings (accept defaults) Can revisit Controls, Constraints, Objective and Requirements, or Options windows via Run via wizard (first time through a new project), or Run > Start or View > Status and Solutions and View > Performance Graph to watch progress Can t absolutely guarantee a true optimum Usually finds far better configuration than possible by hand 5 53 54