quan col... Probabilistic Forecasts of Bike-Sharing Systems for Journey Planning

Size: px
Start display at page:

Download "quan col... Probabilistic Forecasts of Bike-Sharing Systems for Journey Planning"

Transcription

1 Probabilistic Forecasts of Bike-Sharing Systems for Journey Planning Nicolas Gast 1 Guillaume Massonnet 1 Daniël Reijsbergen 2 Mirco Tribastone 3 1 INRIA Grenoble 2 University of Edinburgh 3 Institute for Advanced Studies Lucca CIKM, 21 October / 27

2 Preamble Goal: provide users with information about bicycle/slot availability at a bicycle station in the future. Two types of predictions/forecasts: point forecast e.g., there will be 2 bikes available in fifteen minutes probabilistic forecast e.g., probabilities of finding 0, 1, 2 or 3 bikes in fifteen minutes are 25%, 25%, 30% and 20% respectively. Focus of literature about bike-sharing system analysis has been on point predictions. (Example: MoReBikeS Challenge.) Our paper: methodology for evaluating accuracy of forecasting methods, with focus on trip feasibility. 2 / 27

3 Probabilistic Forecasts of Bike-Sharing Systems 1 Motivation 2 Probabilistic Model 3 Prediction Algorithms Last Value Historical Mean Queueing Model 4 Evaluating Predictor Performance 5 Results 6 Conclusions 3 / 27

4 Probabilistic Forecasts of Bike-Sharing Systems 1 Motivation 2 Probabilistic Model 3 Prediction Algorithms Last Value Historical Mean Queueing Model 4 Evaluating Predictor Performance 5 Results 6 Conclusions 4 / 27

5 Bike Sharing Systems quan.....col Bike-sharing system: users rent bicycles for trips between stations. There are over 700 bike sharing systems across the world. Biggest systems worldwide: Wuhan and Hangzhou, with 90,000 and 60,600 bicycles respectively. Melbourne has 600. Biggest in Europe: Ve lib system in Paris, with 18,000 bicycles and image source: Wikimedia Commons 1,230 stations. 5 / 27

6 Data Some early experiments (1965 Amsterdam, 1993 Cambridge) failed because too many bikes went missing, others (1974 La Rochelle) worked because tourists had to hand in passport. Modern systems rely on IT-driven customer identification data The data can be used for multiple purposes, including: 1. Analysis of historical system performance, mostly in terms of the feasibility of trips (inhibited by empty or full stations). 2. Impact of bike redistribution strategies. 3. Improvement of the user experience: data of the current system state can be used to make predictions about trip feasibility. We will focus on Item 3. 6 / 27

7 Probabilistic Forecasts of Bike-Sharing Systems 1 Motivation 2 Probabilistic Model 3 Prediction Algorithms Last Value Historical Mean Queueing Model 4 Evaluating Predictor Performance 5 Results 6 Conclusions 7 / 27

8 Basic Bike Model I Known bicycle network topology: we focus on a single station. Y t,i : number of bikes at the station at time t on day i, t T, i I. T is the set of times in a single day, could be interval on R +, in practice discrete either up to seconds, or set of polling times (e.g., every 20 seconds). I is the set of days being considered. 8 / 27

9 Basic Bike Model II Assume that for some subset J I, probability measure governing Y t,i, i J, is the same; e.g., all weekdays the same, or all weekend days, and within the same season. Let Y t be the occupancy process on any day in J. 9 / 27

10 Research Question Let the bike station have fixed capacity c (in reality, this fluctuates in time). We are interested in probabilistic forecasts of future bike availability: P(Y t+h = y Y t = x), x, y {0,..., c}, t, h R +. Particular interest: y = 0, y = c. Alternatively, one could use a single-value or point prediction Ŷ t+h (x, t, h) of future bike availability. Literature focus has been on point predictions. 10 / 27

11 Probabilistic Forecasts of Bike-Sharing Systems 1 Motivation 2 Probabilistic Model 3 Prediction Algorithms Last Value Historical Mean Queueing Model 4 Evaluating Predictor Performance 5 Results 6 Conclusions 11 / 27

12 Classical Predictors Last Value Predictor: Does not depend on h. Historical Predictor: P(Y t+h = y Y t = x) = { 1 if y = x, 0 otherwise. P(Y t+h = y Y t = x) = 1 I {i I : y t+h,i = y}. Does not depend on x. 12 / 27

13 Queueing Model Predictor Idea: model a bike station as a single queue. Poisson arrival process with intensity λ(t) if non-full, departure intensity µ(t) if non-empty. This results in a time-inhomogeneous Continuous-Time Markov Chain with generator Q(t): ( h ) P(Y t+h = y Y t = x) = exp Q(t + s)ds. 0 x,y Assumption: λ(t) and µ(t) are piecewise constant functions of t. 13 / 27

14 Parameter Estimation We use the data to learn λ(t) and µ(t). Goodness-of-fit to Poisson distribution gives an idea about performance of queueing model predictor. CDFs CDFs # arrivals # arrivals (a) Station Pecqueur (b) Station Gare du Nord 14 / 27

15 Probabilistic Forecasts of Bike-Sharing Systems 1 Motivation 2 Probabilistic Model 3 Prediction Algorithms Last Value Historical Mean Queueing Model 4 Evaluating Predictor Performance 5 Results 6 Conclusions 15 / 27

16 Root Mean Square Error Given the fact that various prediction algorithms exist, the question is how to compare their performance. Literature benchmark: the Root Mean Square Error (RMSE), which can only be applied to point predictions. Given a vector x t,i of predictions and y t,i of observations for station n at time t on day i, with A the set of prediction/observation pairs, the RMSE is defined as 1 (x t,i y t,i ) A 2 (t,i) A for a given prediction method and time window h. 16 / 27

17 Remarks about the RMSE RMSE may depend more strongly on the variance inherent in the bike availability process than the performance of the prediction algorithm. RMSE only works for point predictions. Not suitable for evaluating trip feasibility predictions. Can we construct better performance metrics? 17 / 27

18 Proper Scoring Rules A scoring rule S(P, i) assigns to a combination of a probabilistic forecast P and an outcome i a score in R { }. The idea is that better prediction methods give higher scores. A scoring rule is said to be proper when the true probability distribution Q is the one that maximises the expected score. That is, for all forecasts P, E(S(Q, X )) E(S(P, X )). If the inequality is strict when P Q, S is said to be strictly proper. 18 / 27

19 Proper Scoring Rules: Example (I) Example: given a prediction, the user is advised to head to a station if the probability of finding an available bike is higher than 80%. Idea: we assign a score of 1 to correct predictions and a score of 0 to wrong predictions. 1 if P(X > 0) > 0.8 and x > 0, 0 if P(X > 0) > 0.8 and x = 0, Formally: S(P, x) = 0 if P(X > 0) 0.8 and x > 0, 1 if P(X > 0) 0.8 and x = 0. Is this a proper scoring rule? 19 / 27

20 Proper Scoring Rules: Example (II) Example: given a prediction, the user is advised to head to a station if the probability of finding an available bike is higher than 80%. Idea: we assign a score of 1 to correct predictions and a score of 0 to wrong predictions. This is not a proper scoring rule. For example, if the true probability of finding an available bike is always 70%, then a perfect predictor would be wrong 70% of the time, and yield an average score of 0.3. A predictor that incorrectly forecasts the probability of finding an available bike to be over 80% would be right 70% of the time, resulting in an average score of / 27

21 Proper Scoring Rules: Example (III) Say that given a prediction, the user is advised to head to a station if the probability of finding an available bike is higher than 80%. Proper scoring rule: 1 if P(X > 0) > 0.8 and x > 0, 4 if P(X > 0) > 0.8 and x = 0, S(P, x) = 1 4 if P(X > 0) 0.8 and x > 0, 1 if P(X > 0) 0.8 and x = 0. Includes a penalty of 4 for incorrectly recommending to go. 21 / 27

22 Probabilistic Forecasts of Bike-Sharing Systems 1 Motivation 2 Probabilistic Model 3 Prediction Algorithms Last Value Historical Mean Queueing Model 4 Evaluating Predictor Performance 5 Results 6 Conclusions 22 / 27

23 Classical Scoring Rules: Brier score. Spherical score. Logarithmic score. Brier-score (availability) Queue (QMP) historic (HP) last value (LVP) Spherical-score (availability) Queue (QMP) historic (HP) last value (LVP) prediction horizon (in hours) (a) Brier score prediction horizon (in hours) (b) Spherical score 23 / 27

24 Trip Feasibility Scoring Rules Proper scoring rules can be defined for different penalties U of incorrectly recommending to go. score of predictor (for the success) Queue QMP Historic HP Last-value LVP always go prediction horizon (in hours) (a) U=0 score of predictor (for the success) Queue QMP Historic HP Last-value LVP always go prediction horizon (in hours) (b) U= 5 score of predictor (for the success) Queue QMP Historic HP 0.0 Last-value LVP always go prediction horizon (in hours) (c) U= / 27

25 Probabilistic Forecasts of Bike-Sharing Systems 1 Motivation 2 Probabilistic Model 3 Prediction Algorithms Last Value Historical Mean Queueing Model 4 Evaluating Predictor Performance 5 Results 6 Conclusions 25 / 27

26 Conclusions & Future Work From the perspective of the user, probabilistic forecasts are more informative than point predictions. To compare forecasting algorithms, a good evaluation methodology is required. We introduced an evaluation methodology based on proper scoring rules. Particularly suited for trip feasibility predictions. Future work: include state-of-the-art prediction algorithms (based on time series analysis) in comparison. 26 / 27

27 Thank you for your attention. 27 / 27

Moment-based Availability Prediction for Bike-Sharing Systems

Moment-based Availability Prediction for Bike-Sharing Systems Moment-based Availability Prediction for Bike-Sharing Systems Jane Hillston Joint work with Cheng Feng and Daniël Reijsbergen LFCS, School of Informatics, University of Edinburgh http://www.quanticol.eu

More information

The exponential distribution and the Poisson process

The exponential distribution and the Poisson process The exponential distribution and the Poisson process 1-1 Exponential Distribution: Basic Facts PDF f(t) = { λe λt, t 0 0, t < 0 CDF Pr{T t) = 0 t λe λu du = 1 e λt (t 0) Mean E[T] = 1 λ Variance Var[T]

More information

Aberrant Behavior Detection in Time Series for Monitoring Business-Critical Metrics (DRAFT)

Aberrant Behavior Detection in Time Series for Monitoring Business-Critical Metrics (DRAFT) Aberrant Behavior Detection in Time Series for Monitoring Business-Critical Metrics (DRAFT) Evan Miller IMVU, Inc. emiller@imvu.com Oct. 28, 2007 1 Abstract Detecting failures swiftly is a key process

More information

Performance Modelling of Computer Systems

Performance Modelling of Computer Systems Performance Modelling of Computer Systems Mirco Tribastone Institut für Informatik Ludwig-Maximilians-Universität München Fundamentals of Queueing Theory Tribastone (IFI LMU) Performance Modelling of Computer

More information

Continuous Random Variables

Continuous Random Variables 1 Continuous Random Variables Example 1 Roll a fair die. Denote by X the random variable taking the value shown by the die, X {1, 2, 3, 4, 5, 6}. Obviously the probability mass function is given by (since

More information

Stochastic Processes

Stochastic Processes Stochastic Processes Stochastic Process Non Formal Definition: Non formal: A stochastic process (random process) is the opposite of a deterministic process such as one defined by a differential equation.

More information

Discrete Probability Refresher

Discrete Probability Refresher ECE 1502 Information Theory Discrete Probability Refresher F. R. Kschischang Dept. of Electrical and Computer Engineering University of Toronto January 13, 1999 revised January 11, 2006 Probability theory

More information

YEQT, , Eindhoven

YEQT, , Eindhoven in in YEQT, 11-11-2015, Eindhoven out 1 / 39 in in out What are vehicle sharing systems? 2 / 39 in Vehicle Sharing Systems in out Figure: Autolib and Velib stations in Paris. 3 / 39 in Vehicle Sharing

More information

Queueing Theory and Simulation. Introduction

Queueing Theory and Simulation. Introduction Queueing Theory and Simulation Based on the slides of Dr. Dharma P. Agrawal, University of Cincinnati and Dr. Hiroyuki Ohsaki Graduate School of Information Science & Technology, Osaka University, Japan

More information

Fundamental Tools - Probability Theory IV

Fundamental Tools - Probability Theory IV Fundamental Tools - Probability Theory IV MSc Financial Mathematics The University of Warwick October 1, 2015 MSc Financial Mathematics Fundamental Tools - Probability Theory IV 1 / 14 Model-independent

More information

Lecture 2: Repetition of probability theory and statistics

Lecture 2: Repetition of probability theory and statistics Algorithms for Uncertainty Quantification SS8, IN2345 Tobias Neckel Scientific Computing in Computer Science TUM Lecture 2: Repetition of probability theory and statistics Concept of Building Block: Prerequisites:

More information

1 Review of Winnow Algorithm

1 Review of Winnow Algorithm COS 511: Theoretical Machine Learning Lecturer: Rob Schapire Lecture # 17 Scribe: Xingyuan Fang, Ethan April 9th, 2013 1 Review of Winnow Algorithm We have studied Winnow algorithm in Algorithm 1. Algorithm

More information

Guidelines for Solving Probability Problems

Guidelines for Solving Probability Problems Guidelines for Solving Probability Problems CS 1538: Introduction to Simulation 1 Steps for Problem Solving Suggested steps for approaching a problem: 1. Identify the distribution What distribution does

More information

Name of the Student:

Name of the Student: SUBJECT NAME : Probability & Queueing Theory SUBJECT CODE : MA 6453 MATERIAL NAME : Part A questions REGULATION : R2013 UPDATED ON : November 2017 (Upto N/D 2017 QP) (Scan the above QR code for the direct

More information

FORECASTING: A REVIEW OF STATUS AND CHALLENGES. Eric Grimit and Kristin Larson 3TIER, Inc. Pacific Northwest Weather Workshop March 5-6, 2010

FORECASTING: A REVIEW OF STATUS AND CHALLENGES. Eric Grimit and Kristin Larson 3TIER, Inc. Pacific Northwest Weather Workshop March 5-6, 2010 SHORT-TERM TERM WIND POWER FORECASTING: A REVIEW OF STATUS AND CHALLENGES Eric Grimit and Kristin Larson 3TIER, Inc. Pacific Northwest Weather Workshop March 5-6, 2010 Integrating Renewable Energy» Variable

More information

Continuous-Time Markov Chain

Continuous-Time Markov Chain Continuous-Time Markov Chain Consider the process {X(t),t 0} with state space {0, 1, 2,...}. The process {X(t),t 0} is a continuous-time Markov chain if for all s, t 0 and nonnegative integers i, j, x(u),

More information

Chapter 1: Revie of Calculus and Probability

Chapter 1: Revie of Calculus and Probability Chapter 1: Revie of Calculus and Probability Refer to Text Book: Operations Research: Applications and Algorithms By Wayne L. Winston,Ch. 12 Operations Research: An Introduction By Hamdi Taha, Ch. 12 OR441-Dr.Khalid

More information

Chapter 2 Queueing Theory and Simulation

Chapter 2 Queueing Theory and Simulation Chapter 2 Queueing Theory and Simulation Based on the slides of Dr. Dharma P. Agrawal, University of Cincinnati and Dr. Hiroyuki Ohsaki Graduate School of Information Science & Technology, Osaka University,

More information

Time: 1 hour 30 minutes

Time: 1 hour 30 minutes Paper Reference(s) 6684/01 Edexcel GCE Statistics S2 Bronze Level B4 Time: 1 hour 30 minutes Materials required for examination papers Mathematical Formulae (Green) Items included with question Nil Candidates

More information

Y i = η + ɛ i, i = 1,...,n.

Y i = η + ɛ i, i = 1,...,n. Nonparametric tests If data do not come from a normal population (and if the sample is not large), we cannot use a t-test. One useful approach to creating test statistics is through the use of rank statistics.

More information

CS 237: Probability in Computing

CS 237: Probability in Computing CS 237: Probability in Computing Wayne Snyder Computer Science Department Boston University Lecture 13: Normal Distribution Exponential Distribution Recall that the Normal Distribution is given by an explicit

More information

A A Multi-Echelon Inventory Model for a Reparable Item with one-for. for- one Replenishment

A A Multi-Echelon Inventory Model for a Reparable Item with one-for. for- one Replenishment A A Multi-Echelon Inventory Model for a Reparable Item with one-for for- one Replenishment Steve Graves, 1985 Management Science, 31(10) Presented by Hongmin Li This summary presentation is based on: Graves,

More information

Page 0 of 5 Final Examination Name. Closed book. 120 minutes. Cover page plus five pages of exam.

Page 0 of 5 Final Examination Name. Closed book. 120 minutes. Cover page plus five pages of exam. Final Examination Closed book. 120 minutes. Cover page plus five pages of exam. To receive full credit, show enough work to indicate your logic. Do not spend time calculating. You will receive full credit

More information

HW7 Solutions. f(x) = 0 otherwise. 0 otherwise. The density function looks like this: = 20 if x [10, 90) if x [90, 100]

HW7 Solutions. f(x) = 0 otherwise. 0 otherwise. The density function looks like this: = 20 if x [10, 90) if x [90, 100] HW7 Solutions. 5 pts.) James Bond James Bond, my favorite hero, has again jumped off a plane. The plane is traveling from from base A to base B, distance km apart. Now suppose the plane takes off from

More information

Grundlagen der Künstlichen Intelligenz

Grundlagen der Künstlichen Intelligenz Grundlagen der Künstlichen Intelligenz Uncertainty & Probabilities & Bandits Daniel Hennes 16.11.2017 (WS 2017/18) University Stuttgart - IPVS - Machine Learning & Robotics 1 Today Uncertainty Probability

More information

BINOMIAL DISTRIBUTION

BINOMIAL DISTRIBUTION BINOMIAL DISTRIBUTION The binomial distribution is a particular type of discrete pmf. It describes random variables which satisfy the following conditions: 1 You perform n identical experiments (called

More information

Structural Uncertainty in Health Economic Decision Models

Structural Uncertainty in Health Economic Decision Models Structural Uncertainty in Health Economic Decision Models Mark Strong 1, Hazel Pilgrim 1, Jeremy Oakley 2, Jim Chilcott 1 December 2009 1. School of Health and Related Research, University of Sheffield,

More information

Warwick Business School Forecasting System. Summary. Ana Galvao, Anthony Garratt and James Mitchell November, 2014

Warwick Business School Forecasting System. Summary. Ana Galvao, Anthony Garratt and James Mitchell November, 2014 Warwick Business School Forecasting System Summary Ana Galvao, Anthony Garratt and James Mitchell November, 21 The main objective of the Warwick Business School Forecasting System is to provide competitive

More information

Probability and Distributions

Probability and Distributions Probability and Distributions What is a statistical model? A statistical model is a set of assumptions by which the hypothetical population distribution of data is inferred. It is typically postulated

More information

Course Outline Introduction to Transportation Highway Users and their Performance Geometric Design Pavement Design

Course Outline Introduction to Transportation Highway Users and their Performance Geometric Design Pavement Design Course Outline Introduction to Transportation Highway Users and their Performance Geometric Design Pavement Design Speed Studies - Project Traffic Queuing Intersections Level of Service in Highways and

More information

Queueing Theory II. Summary. ! M/M/1 Output process. ! Networks of Queue! Method of Stages. ! General Distributions

Queueing Theory II. Summary. ! M/M/1 Output process. ! Networks of Queue! Method of Stages. ! General Distributions Queueing Theory II Summary! M/M/1 Output process! Networks of Queue! Method of Stages " Erlang Distribution " Hyperexponential Distribution! General Distributions " Embedded Markov Chains M/M/1 Output

More information

Queueing Theory. VK Room: M Last updated: October 17, 2013.

Queueing Theory. VK Room: M Last updated: October 17, 2013. Queueing Theory VK Room: M1.30 knightva@cf.ac.uk www.vincent-knight.com Last updated: October 17, 2013. 1 / 63 Overview Description of Queueing Processes The Single Server Markovian Queue Multi Server

More information

Entropy Rate of Stochastic Processes

Entropy Rate of Stochastic Processes Entropy Rate of Stochastic Processes Timo Mulder tmamulder@gmail.com Jorn Peters jornpeters@gmail.com February 8, 205 The entropy rate of independent and identically distributed events can on average be

More information

Computer Systems Modelling

Computer Systems Modelling Computer Systems Modelling Computer Laboratory Computer Science Tripos, Part II Lent Term 2010/11 R. J. Gibbens Problem sheet William Gates Building 15 JJ Thomson Avenue Cambridge CB3 0FD http://www.cl.cam.ac.uk/

More information

56:171 Operations Research Final Examination December 15, 1998

56:171 Operations Research Final Examination December 15, 1998 56:171 Operations Research Final Examination December 15, 1998 Write your name on the first page, and initial the other pages. Answer both Parts A and B, and 4 (out of 5) problems from Part C. Possible

More information

Answers to selected exercises

Answers to selected exercises Answers to selected exercises A First Course in Stochastic Models, Henk C. Tijms 1.1 ( ) 1.2 (a) Let waiting time if passengers already arrived,. Then,, (b) { (c) Long-run fraction for is (d) Let waiting

More information

Overall Plan of Simulation and Modeling I. Chapters

Overall Plan of Simulation and Modeling I. Chapters Overall Plan of Simulation and Modeling I Chapters Introduction to Simulation Discrete Simulation Analytical Modeling Modeling Paradigms Input Modeling Random Number Generation Output Analysis Continuous

More information

Machine Learning for Large-Scale Data Analysis and Decision Making A. Week #1

Machine Learning for Large-Scale Data Analysis and Decision Making A. Week #1 Machine Learning for Large-Scale Data Analysis and Decision Making 80-629-17A Week #1 Today Introduction to machine learning The course (syllabus) Math review (probability + linear algebra) The future

More information

Forecasting Using Time Series Models

Forecasting Using Time Series Models Forecasting Using Time Series Models Dr. J Katyayani 1, M Jahnavi 2 Pothugunta Krishna Prasad 3 1 Professor, Department of MBA, SPMVV, Tirupati, India 2 Assistant Professor, Koshys Institute of Management

More information

Bernoulli Counting Process with p=0.1

Bernoulli Counting Process with p=0.1 Stat 28 October 29, 21 Today: More Ch 7 (Sections 7.4 and part of 7.) Midterm will cover Ch 7 to section 7.4 Review session will be Nov. Exercises to try (answers in book): 7.1-, 7.2-3, 7.3-3, 7.4-7 Where

More information

Stochastic models in product form: the (E)RCAT methodology

Stochastic models in product form: the (E)RCAT methodology Stochastic models in product form: the (E)RCAT methodology 1 Maria Grazia Vigliotti 2 1 Dipartimento di Informatica Università Ca Foscari di Venezia 2 Department of Computing Imperial College London Second

More information

Exercises Stochastic Performance Modelling. Hamilton Institute, Summer 2010

Exercises Stochastic Performance Modelling. Hamilton Institute, Summer 2010 Exercises Stochastic Performance Modelling Hamilton Institute, Summer Instruction Exercise Let X be a non-negative random variable with E[X ]

More information

THE ROYAL STATISTICAL SOCIETY 2009 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULAR FORMAT MODULE 3 STOCHASTIC PROCESSES AND TIME SERIES

THE ROYAL STATISTICAL SOCIETY 2009 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULAR FORMAT MODULE 3 STOCHASTIC PROCESSES AND TIME SERIES THE ROYAL STATISTICAL SOCIETY 9 EXAMINATIONS SOLUTIONS GRADUATE DIPLOMA MODULAR FORMAT MODULE 3 STOCHASTIC PROCESSES AND TIME SERIES The Society provides these solutions to assist candidates preparing

More information

Queues and Queueing Networks

Queues and Queueing Networks Queues and Queueing Networks Sanjay K. Bose Dept. of EEE, IITG Copyright 2015, Sanjay K. Bose 1 Introduction to Queueing Models and Queueing Analysis Copyright 2015, Sanjay K. Bose 2 Model of a Queue Arrivals

More information

Transient behaviour in highly dependable Markovian systems: new regimes, many paths.

Transient behaviour in highly dependable Markovian systems: new regimes, many paths. Transient behaviour in highly dependable Markovian systems: new regimes, many paths. Daniël Reijsbergen Pieter-Tjerk de Boer Werner Scheinhardt University of Twente RESIM, June 22nd, 2010 Outline Problem

More information

Discrete and continuous

Discrete and continuous Discrete and continuous A curve, or a function, or a range of values of a variable, is discrete if it has gaps in it - it jumps from one value to another. In practice in S2 discrete variables are variables

More information

Review of Probabilities and Basic Statistics

Review of Probabilities and Basic Statistics Alex Smola Barnabas Poczos TA: Ina Fiterau 4 th year PhD student MLD Review of Probabilities and Basic Statistics 10-701 Recitations 1/25/2013 Recitation 1: Statistics Intro 1 Overview Introduction to

More information

Since D has an exponential distribution, E[D] = 0.09 years. Since {A(t) : t 0} is a Poisson process with rate λ = 10, 000, A(0.

Since D has an exponential distribution, E[D] = 0.09 years. Since {A(t) : t 0} is a Poisson process with rate λ = 10, 000, A(0. IEOR 46: Introduction to Operations Research: Stochastic Models Chapters 5-6 in Ross, Thursday, April, 4:5-5:35pm SOLUTIONS to Second Midterm Exam, Spring 9, Open Book: but only the Ross textbook, the

More information

AUTOMATED TEMPLATE MATCHING METHOD FOR NMIS AT THE Y-12 NATIONAL SECURITY COMPLEX

AUTOMATED TEMPLATE MATCHING METHOD FOR NMIS AT THE Y-12 NATIONAL SECURITY COMPLEX AUTOMATED TEMPLATE MATCHING METHOD FOR NMIS AT THE Y-1 NATIONAL SECURITY COMPLEX J. A. Mullens, J. K. Mattingly, L. G. Chiang, R. B. Oberer, J. T. Mihalczo ABSTRACT This paper describes a template matching

More information

Random variables. DS GA 1002 Probability and Statistics for Data Science.

Random variables. DS GA 1002 Probability and Statistics for Data Science. Random variables DS GA 1002 Probability and Statistics for Data Science http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall17 Carlos Fernandez-Granda Motivation Random variables model numerical quantities

More information

Random variable X is a mapping that maps each outcome s in the sample space to a unique real number x, x. X s. Real Line

Random variable X is a mapping that maps each outcome s in the sample space to a unique real number x, x. X s. Real Line Random Variable Random variable is a mapping that maps each outcome s in the sample space to a unique real number,. s s : outcome Sample Space Real Line Eamples Toss a coin. Define the random variable

More information

Homework 10 (due December 2, 2009)

Homework 10 (due December 2, 2009) Homework (due December, 9) Problem. Let X and Y be independent binomial random variables with parameters (n, p) and (n, p) respectively. Prove that X + Y is a binomial random variable with parameters (n

More information

2129 NORTH MAIN STREET HOTE PROJECT ULI SHARED PARKING STUDY City of Santa Ana, California

2129 NORTH MAIN STREET HOTE PROJECT ULI SHARED PARKING STUDY City of Santa Ana, California 2129 NORTH MAIN STREET HOTE PROJECT ULI SHARED PARKING STUDY City of Santa Ana, California Prepared for: NET DEVELOPMENT COMPANY 3130 Airway Avenue Costa Mesa, CA 92626 Prepared by: RK ENGINEERING GROUP,

More information

1.225J J (ESD 205) Transportation Flow Systems

1.225J J (ESD 205) Transportation Flow Systems 1.225J J (ESD 25) Transportation Flow Systems Lecture 9 Simulation Models Prof. Ismail Chabini and Prof. Amedeo R. Odoni Lecture 9 Outline About this lecture: It is based on R16. Only material covered

More information

Probability and Statistics Concepts

Probability and Statistics Concepts University of Central Florida Computer Science Division COT 5611 - Operating Systems. Spring 014 - dcm Probability and Statistics Concepts Random Variable: a rule that assigns a numerical value to each

More information

Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation.

Mark your answers ON THE EXAM ITSELF. If you are not sure of your answer you may wish to provide a brief explanation. CS 189 Spring 2015 Introduction to Machine Learning Midterm You have 80 minutes for the exam. The exam is closed book, closed notes except your one-page crib sheet. No calculators or electronic items.

More information

Model Based Clustering of Count Processes Data

Model Based Clustering of Count Processes Data Model Based Clustering of Count Processes Data Tin Lok James Ng, Brendan Murphy Insight Centre for Data Analytics School of Mathematics and Statistics May 15, 2017 Tin Lok James Ng, Brendan Murphy (Insight)

More information

Markov processes and queueing networks

Markov processes and queueing networks Inria September 22, 2015 Outline Poisson processes Markov jump processes Some queueing networks The Poisson distribution (Siméon-Denis Poisson, 1781-1840) { } e λ λ n n! As prevalent as Gaussian distribution

More information

Online Supplement to Delay-Based Service Differentiation with Many Servers and Time-Varying Arrival Rates

Online Supplement to Delay-Based Service Differentiation with Many Servers and Time-Varying Arrival Rates Online Supplement to Delay-Based Service Differentiation with Many Servers and Time-Varying Arrival Rates Xu Sun and Ward Whitt Department of Industrial Engineering and Operations Research, Columbia University

More information

Introduction to queuing theory

Introduction to queuing theory Introduction to queuing theory Queu(e)ing theory Queu(e)ing theory is the branch of mathematics devoted to how objects (packets in a network, people in a bank, processes in a CPU etc etc) join and leave

More information

Lecture 4: Bernoulli Process

Lecture 4: Bernoulli Process Lecture 4: Bernoulli Process Hyang-Won Lee Dept. of Internet & Multimedia Eng. Konkuk University Lecture 4 Hyang-Won Lee 1 / 12 Stochastic Process A stochastic process is a mathematical model of a probabilistic

More information

Recurrent Latent Variable Networks for Session-Based Recommendation

Recurrent Latent Variable Networks for Session-Based Recommendation Recurrent Latent Variable Networks for Session-Based Recommendation Panayiotis Christodoulou Cyprus University of Technology paa.christodoulou@edu.cut.ac.cy 27/8/2017 Panayiotis Christodoulou (C.U.T.)

More information

Design and Implementation of Speech Recognition Systems

Design and Implementation of Speech Recognition Systems Design and Implementation of Speech Recognition Systems Spring 2013 Class 7: Templates to HMMs 13 Feb 2013 1 Recap Thus far, we have looked at dynamic programming for string matching, And derived DTW from

More information

CPSC 531: System Modeling and Simulation. Carey Williamson Department of Computer Science University of Calgary Fall 2017

CPSC 531: System Modeling and Simulation. Carey Williamson Department of Computer Science University of Calgary Fall 2017 CPSC 531: System Modeling and Simulation Carey Williamson Department of Computer Science University of Calgary Fall 2017 Motivating Quote for Queueing Models Good things come to those who wait - poet/writer

More information

Examination paper for TMA4265 Stochastic Processes

Examination paper for TMA4265 Stochastic Processes Department of Mathematical Sciences Examination paper for TMA4265 Stochastic Processes Academic contact during examination: Andrea Riebler Phone: 456 89 592 Examination date: December 14th, 2015 Examination

More information

57:022 Principles of Design II Final Exam Solutions - Spring 1997

57:022 Principles of Design II Final Exam Solutions - Spring 1997 57:022 Principles of Design II Final Exam Solutions - Spring 1997 Part: I II III IV V VI Total Possible Pts: 52 10 12 16 13 12 115 PART ONE Indicate "+" if True and "o" if False: + a. If a component's

More information

Stochastic Networks and Parameter Uncertainty

Stochastic Networks and Parameter Uncertainty Stochastic Networks and Parameter Uncertainty Assaf Zeevi Graduate School of Business Columbia University Stochastic Processing Networks Conference, August 2009 based on joint work with Mike Harrison Achal

More information

Clustering Techniques and their applications at ECMWF

Clustering Techniques and their applications at ECMWF Clustering Techniques and their applications at ECMWF Laura Ferranti European Centre for Medium-Range Weather Forecasts Training Course NWP-PR: Clustering techniques and their applications at ECMWF 1/32

More information

Stochastic Models in Computer Science A Tutorial

Stochastic Models in Computer Science A Tutorial Stochastic Models in Computer Science A Tutorial Dr. Snehanshu Saha Department of Computer Science PESIT BSC, Bengaluru WCI 2015 - August 10 to August 13 1 Introduction 2 Random Variable 3 Introduction

More information

Some Finitely Additive (Statistical) Decision Theory or How Bruno de Finetti might have channeled Abraham Wald

Some Finitely Additive (Statistical) Decision Theory or How Bruno de Finetti might have channeled Abraham Wald Some Finitely Additive (Statistical) Decision Theory or How Bruno de Finetti might have channeled Abraham Wald T.Seidenfeld ------------ Based on our T.R.: What Finite Additivity Can Add to Decision Theory

More information

CDA6530: Performance Models of Computers and Networks. Chapter 2: Review of Practical Random Variables

CDA6530: Performance Models of Computers and Networks. Chapter 2: Review of Practical Random Variables CDA6530: Performance Models of Computers and Networks Chapter 2: Review of Practical Random Variables Two Classes of R.V. Discrete R.V. Bernoulli Binomial Geometric Poisson Continuous R.V. Uniform Exponential,

More information

Information Retrieval

Information Retrieval Information Retrieval Online Evaluation Ilya Markov i.markov@uva.nl University of Amsterdam Ilya Markov i.markov@uva.nl Information Retrieval 1 Course overview Offline Data Acquisition Data Processing

More information

Time: 1 hour 30 minutes

Time: 1 hour 30 minutes Paper Reference(s) 6684/01 Edexcel GCE Statistics S2 Gold Level G3 Time: 1 hour 30 minutes Materials required for examination papers Mathematical Formulae (Green) Items included with question Nil Candidates

More information

An Operational Evaluation of the Ground Delay Program (GDP) Parameters Selection Model (GPSM)

An Operational Evaluation of the Ground Delay Program (GDP) Parameters Selection Model (GPSM) An Operational Evaluation of the Ground Delay Program (GDP) Parameters Selection Model (GPSM) Lara Shisler, Christopher Provan Mosaic ATM Dave Clark MIT Lincoln Lab William Chan, Shon Grabbe NASA Ames

More information

Statistics 150: Spring 2007

Statistics 150: Spring 2007 Statistics 150: Spring 2007 April 23, 2008 0-1 1 Limiting Probabilities If the discrete-time Markov chain with transition probabilities p ij is irreducible and positive recurrent; then the limiting probabilities

More information

Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis.

Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis. Service Engineering Class 11 Non-Parametric Models of a Service System; GI/GI/1, GI/GI/n: Exact & Approximate Analysis. G/G/1 Queue: Virtual Waiting Time (Unfinished Work). GI/GI/1: Lindley s Equations

More information

S n = x + X 1 + X X n.

S n = x + X 1 + X X n. 0 Lecture 0 0. Gambler Ruin Problem Let X be a payoff if a coin toss game such that P(X = ) = P(X = ) = /2. Suppose you start with x dollars and play the game n times. Let X,X 2,...,X n be payoffs in each

More information

Modeling and Simulating Rainfall

Modeling and Simulating Rainfall Modeling and Simulating Rainfall Kenneth Shirley, Daniel Osgood, Andrew Robertson, Paul Block, Upmanu Lall, James Hansen, Sergey Kirshner, Vincent Moron, Michael Norton, Amor Ines, Calum Turvey, Tufa Dinku

More information

MEI STRUCTURED MATHEMATICS 4767

MEI STRUCTURED MATHEMATICS 4767 OXFORD CAMBRIDGE AND RSA EXAMINATIONS Advanced Subsidiary General Certificate of Education Advanced General Certificate of Education MEI STRUCTURED MATHEMATICS 4767 Statistics 2 Thursday 9 JUNE 2005 Morning

More information

Time: 1 hour 30 minutes

Time: 1 hour 30 minutes Paper Reference(s) 6684/01 Edexcel GCE Statistics S Silver Level S4 Time: 1 hour 30 minutes Materials required for examination papers Mathematical Formulae (Green) Items included with question Nil Candidates

More information

16 : Markov Chain Monte Carlo (MCMC)

16 : Markov Chain Monte Carlo (MCMC) 10-708: Probabilistic Graphical Models 10-708, Spring 2014 16 : Markov Chain Monte Carlo MCMC Lecturer: Matthew Gormley Scribes: Yining Wang, Renato Negrinho 1 Sampling from low-dimensional distributions

More information

Lecture 1: Brief Review on Stochastic Processes

Lecture 1: Brief Review on Stochastic Processes Lecture 1: Brief Review on Stochastic Processes A stochastic process is a collection of random variables {X t (s) : t T, s S}, where T is some index set and S is the common sample space of the random variables.

More information

Probability Models in Electrical and Computer Engineering Mathematical models as tools in analysis and design Deterministic models Probability models

Probability Models in Electrical and Computer Engineering Mathematical models as tools in analysis and design Deterministic models Probability models Probability Models in Electrical and Computer Engineering Mathematical models as tools in analysis and design Deterministic models Probability models Statistical regularity Properties of relative frequency

More information

Fluid limits for Markov chains I

Fluid limits for Markov chains I Fluid limits for Markov chains I James Norris Durham Symposium on Markov Processes, Mixing Times and Cutoff 2017 Markov chains Consider a discrete-time Markov chain X = (X n ) n 0 with state-space E and

More information

Antti Salonen PPU Le 2: Forecasting 1

Antti Salonen PPU Le 2: Forecasting 1 - 2017 1 Forecasting Forecasts are critical inputs to business plans, annual plans, and budgets Finance, human resources, marketing, operations, and supply chain managers need forecasts to plan: output

More information

Incentives and redistribution in homogeneous bike-sharing systems with stations of finite capacity

Incentives and redistribution in homogeneous bike-sharing systems with stations of finite capacity Incentives and redistribution in homogeneous bike-sharing systems with stations of finite capacity Christine Fricker, Nicolas Gast To cite this version: Christine Fricker, Nicolas Gast. Incentives and

More information

Signal Processing - Lecture 7

Signal Processing - Lecture 7 1 Introduction Signal Processing - Lecture 7 Fitting a function to a set of data gathered in time sequence can be viewed as signal processing or learning, and is an important topic in information theory.

More information

PPU411 Antti Salonen. Forecasting. Forecasting PPU Forecasts are critical inputs to business plans, annual plans, and budgets

PPU411 Antti Salonen. Forecasting. Forecasting PPU Forecasts are critical inputs to business plans, annual plans, and budgets - 2017 1 Forecasting Forecasts are critical inputs to business plans, annual plans, and budgets Finance, human resources, marketing, operations, and supply chain managers need forecasts to plan: output

More information

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable Distributions of Functions of Random Variables 5.1 Functions of One Random Variable 5.2 Transformations of Two Random Variables 5.3 Several Random Variables 5.4 The Moment-Generating Function Technique

More information

Antti Salonen KPP Le 3: Forecasting KPP227

Antti Salonen KPP Le 3: Forecasting KPP227 - 2015 1 Forecasting Forecasts are critical inputs to business plans, annual plans, and budgets Finance, human resources, marketing, operations, and supply chain managers need forecasts to plan: output

More information

Point Process Control

Point Process Control Point Process Control The following note is based on Chapters I, II and VII in Brémaud s book Point Processes and Queues (1981). 1 Basic Definitions Consider some probability space (Ω, F, P). A real-valued

More information

Logistical and Transportation Planning Methods 1.203J/6.281J/13.665J/15.073J/16.76J/ESD.216J

Logistical and Transportation Planning Methods 1.203J/6.281J/13.665J/15.073J/16.76J/ESD.216J Logistical and Transportation Planning Methods 1.203J/6.281J/13.665J/15.073J/16.76J/ESD.216J Massachusetts Institute of Technology Cambridge, Massachusetts Quiz #1 October 26, 2005 OPEN BOOK TWO HOURS

More information

Algebra 1 Summer Review Packet DUE THE FIRST DAY OF SCHOOL

Algebra 1 Summer Review Packet DUE THE FIRST DAY OF SCHOOL Name: Algebra 1 Summer Review Packet DUE THE FIRST DAY OF SCHOOL About Algebra 1: Algebra 1 teaches students to think, reason and communicate mathematically. Students use variables to determine solutions

More information

MATH 151, FINAL EXAM Winter Quarter, 21 March, 2014

MATH 151, FINAL EXAM Winter Quarter, 21 March, 2014 Time: 3 hours, 8:3-11:3 Instructions: MATH 151, FINAL EXAM Winter Quarter, 21 March, 214 (1) Write your name in blue-book provided and sign that you agree to abide by the honor code. (2) The exam consists

More information

Modelling residual wind farm variability using HMMs

Modelling residual wind farm variability using HMMs 8 th World IMACS/MODSIM Congress, Cairns, Australia 3-7 July 2009 http://mssanz.org.au/modsim09 Modelling residual wind farm variability using HMMs Ward, K., Korolkiewicz, M. and Boland, J. School of Mathematics

More information

Birth-Death Processes

Birth-Death Processes Birth-Death Processes Birth-Death Processes: Transient Solution Poisson Process: State Distribution Poisson Process: Inter-arrival Times Dr Conor McArdle EE414 - Birth-Death Processes 1/17 Birth-Death

More information

City of Hermosa Beach Beach Access and Parking Study. Submitted by. 600 Wilshire Blvd., Suite 1050 Los Angeles, CA

City of Hermosa Beach Beach Access and Parking Study. Submitted by. 600 Wilshire Blvd., Suite 1050 Los Angeles, CA City of Hermosa Beach Beach Access and Parking Study Submitted by 600 Wilshire Blvd., Suite 1050 Los Angeles, CA 90017 213.261.3050 January 2015 TABLE OF CONTENTS Introduction to the Beach Access and Parking

More information

Statistics 2. The Normal Distribution

Statistics 2. The Normal Distribution Chapter assessment Statistics The Normal Distribution 1. Soup tins have a capacity of 65 ml. The volume of soup, X ml, dispensed into each tin is Normally distributed with mean 610 and standard deviation.

More information

MAS1302 Computational Probability and Statistics

MAS1302 Computational Probability and Statistics MAS1302 Computational Probability and Statistics April 23, 2008 3. Simulating continuous random behaviour 3.1 The Continuous Uniform U(0,1) Distribution We have already used this random variable a great

More information

Management Programme. MS-08: Quantitative Analysis for Managerial Applications

Management Programme. MS-08: Quantitative Analysis for Managerial Applications MS-08 Management Programme ASSIGNMENT SECOND SEMESTER 2013 MS-08: Quantitative Analysis for Managerial Applications School of Management Studies INDIRA GANDHI NATIONAL OPEN UNIVERSITY MAIDAN GARHI, NEW

More information