Combining SP probabilities and RP discrete choice in departure time modelling: joint MNL and ML estimations

Similar documents
Spitsmijden Reward Experiments. Jasper Knockaert VU University Amsterdam

(u v) = f (u,v) Equation 1

Predicting flight on-time performance

Accounting for inertia in modal choices:

Optimal Adaptive Departure Time Choices with Real-Time Traveler Information Considering Arrival Reliability

Limited Dependent Variable Models II

Graduate Econometrics I: What is econometrics?

Anticipatory Pricing to Manage Flow Breakdown. Jonathan D. Hall University of Toronto and Ian Savage Northwestern University

A Model of Human Capital Accumulation and Occupational Choices. A simplified version of Keane and Wolpin (JPE, 1997)

c) What are cumulative curves, and how are they constructed? (1 pt) A count of the number of vehicles over time at one location (1).

A Joint Tour-Based Model of Vehicle Type Choice and Tour Length

Supplementary Technical Details and Results

Single-part-type, multiple stage systems

SUPPLEMENT TO UNCOVERING THE DISTRIBUTION OF MOTORISTS PREFERENCES FOR TRAVEL TIME AND RELIABILITY

Discrete panel data. Michel Bierlaire

Departure time choice equilibrium problem with partial implementation of congestion pricing

Longitudinal and Multilevel Methods for Multinomial Logit David K. Guilkey

The influence of adverse weather conditions on probability of congestion on Dutch motorways

x y

Route Choice Analysis: Data, Models, Algorithms and Applications Emma Frejinger Thesis Supervisor: Michel Bierlaire

1.225 Transportation Flow Systems Quiz (December 17, 2001; Duration: 3 hours)

The Danish Value of Time Study

METHODOLOGICAL ISSUES IN MODELING TIME-OF-TRAVEL

Understanding Travel Time to Airports in New York City Sierra Gentry Dominik Schunack

Traffic Flow Theory & Simulation

arxiv: v1 [cs.gt] 16 Jul 2012

Recovery of inter- and intra-personal heterogeneity using mixed logit models

Analysis and Design of Urban Transportation Network for Pyi Gyi Ta Gon Township PHOO PWINT ZAN 1, DR. NILAR AYE 2

Rationally heterogeneous forecasters

Calculating indicators with PythonBiogeme

CIV3703 Transport Engineering. Module 2 Transport Modelling

Binary choice. Michel Bierlaire

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model

Midterm, Form A 07MAR2000

How to Estimate, Take Into Account, and Improve Travel Time Reliability in Transportation Networks

INTRODUCTION TO TRANSPORTATION SYSTEMS

Math 218 Supplemental Instruction Spring 2008 Final Review Part A

MEZZO: OPEN SOURCE MESOSCOPIC. Centre for Traffic Research Royal Institute of Technology, Stockholm, Sweden

Extracting mobility behavior from cell phone data DATA SIM Summer School 2013

(a) Find the mean and standard deviation of X. (5)

TRANSPORT MODE CHOICE AND COMMUTING TO UNIVERSITY: A MULTINOMIAL APPROACH

Are You The Beatles Or The Rolling Stones?

Investigating uncertainty in BPR formula parameters

Capturing Correlation in Route Choice Models using Subnetworks

The morning commute in urban areas with heterogeneous trip lengths. ISTTT22, Northwestern University Raphaël Lamotte, Nikolas Geroliminis

Statistics for Business and Economics

Homework 4 solutions

Recovery of inter- and intra-personal heterogeneity using mixed logit models

CTR Employer Survey Report

Time-Varying Causal. Treatment Effects

TEEN DRIVER SEAT BELT OBSERVATION FORM

ROUNDTABLE ON SOCIAL IMPACTS OF TIME AND SPACE-BASED ROAD PRICING Luis Martinez (with Olga Petrik, Francisco Furtado and Jari Kaupilla)

Using prospect theory to investigate the low value of travel time for small time changes

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2. Recap: MNL. Recap: MNL

Resolving Over-constrained Temporal Problems with Uncertain Durations. Peng Yu March 7, 2014

A sampling of alternatives approach for route choice models

Stochastic prediction of train delays with dynamic Bayesian networks. Author(s): Kecman, Pavle; Corman, Francesco; Peterson, Anders; Joborn, Martin

The prediction of passenger flow under transport disturbance using accumulated passenger data

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo

Be able to define the following terms and answer basic questions about them:

TRIP-TIMING DECISIONS AND CONGESTION WITH HOUSEHOLD SCHEDULING PREFERENCES. André de PALMA Robin LINDSEY Nathalie PICARD.

ANALYSIS OF CAR BEHAVIOR IN MATSUYAMA CITY

Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data. Fred Mannering University of South Florida

Metro SafeTrack Impact on Individual Travel Behavior & Regional Traffic Conditions. 1. Introduction. 2. Focus of this Volume & Issue

Lesson 5: Increasing and Decreasing Functions

MASSACHUSETTS INSTITUTE OF TECHNOLOGY Department of Electrical Engineering and Computer Science

Weather and Travel Time Decision Support

Houston County School System Mathematics

TEEN DRIVER ELECTRONIC DEVICE OBSERVATION FORM

Examining travel time variability using AVI data

When is the concept of generalized transport costs useless? The effects of the change in the value of time

Randomization-Based Inference With Complex Data Need Not Be Complex!

MARKOV MODEL WITH COSTS In Markov models we are often interested in cost calculations.

A Study on Factors for Travel Time Variability in Dhaka City Corporation Area

Problem #1 #2 #3 #4 #5 #6 Total Points /6 /8 /14 /10 /8 /10 /56

On the Modeling of Airport Arrival and Departure Delay Distributions

MAT SYS 5120 (Winter 2012) Assignment 5 (not to be submitted) There are 4 questions.

Houston County School System Mathematics

Statistical Tests. Matthieu de Lapparent

Tatsuya UEMURA*, Researcher Yasuaki MATSUDA**, Director for Road Research Yasuhiko KAJIYA*, Director Kazuhiro TANJI***, Chief

Design and Analysis of Algorithms April 16, 2015 Massachusetts Institute of Technology Profs. Erik Demaine, Srini Devadas, and Nancy Lynch Quiz 2

Thresholds in choice behaviour and the size of travel time savings

Multidimensional Spatial Heterogeneity in Ecosystem Service Values: Advancing the Frontier

Outline. The binary choice model. The multinomial choice model. Extensions of the basic choice model

EXPERT GROUP S SUMMARY

A Guide to Modern Econometric:

Assessing Time-Varying Causal Effect. Moderation

The exam is closed book, closed calculator, and closed notes except your one-page crib sheet.

Outdoor Recreation Valuation (ORVal) User Guide

Bayesian Social Learning with Random Decision Making in Sequential Systems

Estimation of mixed generalized extreme value models

BENEFIT EVALUATION OF ROAD SNOW REMOVAL IN THE TOHOKU REGION

Review of Probabilities and Basic Statistics

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Predicting the Treatment Status

Optimal Adaptive Routing and Traffic Assignment in Stochastic Time-Dependent Networks. Song Gao

Overfitting, Bias / Variance Analysis

FINANCE & GRAPHS Questions

Encapsulating Urban Traffic Rhythms into Road Networks

Chapter 7 Forecasting Demand

Transcription:

Combining SP probabilities and RP discrete choice in departure time modelling: joint MNL and ML estimations Jasper Knockaert Yinyen Tseng Erik Verhoef Jan Rouwendal

Introduction Central question: How do RP and SP compare? Context of departure time and mode choice model: Experiment with rewards Micro data (~panel) Discrete choice theory Joint RP-SP estimation

Spitsmijden: Where & When? EVI Zwaardslootseweg (N206) THE HAGUE EVI Zoetermeerse rijbaan EVI A12 EVI Katwijkerlaan Reference: 2 weeks Reward: 10 weeks No Reward: 1 week Sep 18 Okt 2 2006 Dec 4 Dec 11

Spitsmijden: What? Money: 3 types of reward, 232 participants 3 0 3 7 0 7 7 3 0 3 7 7h 8h 9h 10h

Spitsmijden: What? Yeti smartphone (108 participants): Traffic information Saving credits to keep phone at end of experiment credit no credit credit information only 7h 8h 9h 10h

Spitsmijden: What? Reward adapted on basis of initial behaviour Less frequent drivers are rewarded less frequent Rewards' regimes varied over participants over the duration of the experiment Exclude the bias caused by learning impact or effects of change of season (Bikes,.)

Discrete choice theory Each choice alternative j has for each choice m by respondent n a random utility U jmn : U jmn = β x jmn + ε jmn (+ μ η n ) We use linear utility specification: generic explanatory variables: travel time, scheduling, reward alternative specific coefficients standard normal distributed variable η n for inter personal choice heterogeneity (panel mixed logit)

Table 0.1: Definition of choice variables use in model estimates (choice situation m corresponds to a working day in RP or a choice set in SP ) attribute unit definition t jm hour car travel time corresponding to time interval j in choice situation m m jmn euro marginal loss of monetary reward for participant n of rush hour car travel in time interval j in choice situation m credit marginal loss of Yeti credit for participant n of rush hour car travel in time interval y jmn Y mn j in choice situation m credit number of credits needed by participant n to collect Yeti in choice situation m (varies across choice sets in SP but fixed across choice situations in RP ) p n hour unrewarded passage time for participant n (based on RP observations in periods before/after experiment) p j hour actual passage time of rush hour car travel in time interval j w m C maximum temperature observed on day m C mode mode specific constant η n panel specified error term: η n N (0,1)

Table 0.5: Deterministic utility V RP jmn of alternative j faced on day m by participant n alternative j deterministic utility V RP jmn mixed logit private car at time p j bike other mode β euro m jmn + β yeti y jmn + β tt t jm +β sde max (p n p j,0) + β sdl max (p j p n,0) +µη n β weather w m + C bike C mode

Table 0.6: Panel mixed logit model (RP dataset) robust statistics decision variable coeff. std err t-stat p-value β euro [euro] (monetary reward) 0,350 0,0198 17,70 0,00 β sde [hour] (early arrival) 1,95 0,126 15,51 0,00 β sdl [hour] (late arrival) 2,24 0,0963 23,23 0,00 β yeti (smartphone reward) 2,18 0,168 12,96 0,00 β t [hour] (motorway car travel time) 1,54 0,372 4,15 0,00 β weather [max temp in C] (bike) 0,0999 0,0243 4,11 0,00 C py (constant for public transport) 2,24 0,173 12,97 0,00 C bike (constant for bike) 4,98 0,522 9,55 0,00 C offpeak (constant for car not 7h 10h) 1,51 0,142 10,66 0,00 C rideshare (constant for carpool) 3,78 0,231 16,39 0,00 C home (constant for telework) 3,20 0,173 18,50 0,00 C other (constant for other) 4,01 0,358 11,22 0,00 µ (inter personal heterogeneity) 1,69 0,0796 21,27 0,00 estimation statistics observations 12371 individuals 322 Halton draws 4000 log-likelihood 27043,676

SP survey Completed by participants to reward experiment (before trial started) 6 choice sets randomly selected out of a 17 set design Choice alternatives: 3 or 5 departure time intervals for private car 4 other non-car alternatives (including no-travel)

Table 1 Example of the choice set presentation for monetary reward participants Choice variant The reward you would receive if you travel by car in the following time slot: before 7.00 am you receive 7,00 (5 minute delay in the traffic jam) between 7.00 and 7.30 am you receive 3,50 (20 minute delay in the traffic jam) between 7.30 and 8.30 am you receive no reward (35 minute delay in the traffic jam) between 8.30 and 9.00 am you receive 3,50 (20 minute delay in the traffic jam) after 9.00 am you receive 7,00 (5 minute delay in the traffic jam) The travel time of public transport is 10 minutes longer than the travel time in the central peak. If you travel by public transport, bike, or work from home, you would receive the reward of 7,00 Suppose the experiment lasts for 50 days. How often would you travel in the following possibilities: By car By public By bike Work from Other before 7.00 am :... day(s) transport home between 7.00 and 7.30 am :... day(s) between 7.30 and 8.30 am :... day(s) between 8.30 and 9.00 am :... day(s) after 9.00 am :... day(s)... day(s)... day(s)... day(s)... day(s) Table 2 Example of the choice set presentation for yeti credit participants Choice variant The reward you would receive if you travel by car in the following time slot: before 7.00 am you receive 1 credit (5 minute delay in the traffic jam) between 7.00 and 7.30 am you receive 0.5 credit (20 minute delay in the traffic jam) between 7.30 and 8.30 am you receive no credit (35 minute delay in the traffic jam) between 8.30 and 9.00 am you receive 0.5 credit (20 minute delay in the traffic jam) after 9.00 am you receive 1 credit (5 minute delay in the traffic jam) The travel time of public transport is 10 minutes longer than the travel time in the central peak. If you travel by public transport, bike, or work from home, you would receive 1 credit. You would earn the Yeti if you could save up to 15 credits. Suppose the experiment lasts for 25 days. How often would you travel in the following possibilities: By car By public By bike Work from Other before 7.00 am :... day(s) transport home between 7.00 and 7.30 am :... day(s) between 7.30 and 8.30 am :... day(s) between 8.30 and 9.00 am :... day(s)... day(s)... day(s)... day(s)... day(s) after 9.00 am :... day(s) 5

Ratio scale Classic setup in estimation: P mn = j y jmn P jmn = j P jmn^y jmn LL mn = ln P mn Ratio scale format departs from classic setup Obviously, not 25 (or 50) independent choices Formulate as one choice to get correct results Python Biogeme log likelihood specification: P mn = j P jmn^p jmn LL mn = ln P mn (= j p jmn ln P jmn )

Table 0.2: Deterministic utility V SP jmn of alternative j faced in choice set m by participant n alternative j deterministic utility V SP jmn mixed logit private car at time p j public transport other mode β euro m jmn + β yeti y jmn /Y mn + β tt t jm +β sde max (p n p j,0) + β sdl max (p j p n,0) +µη n C pt + β pt t jm C mode

Table 0.3: Multinomial logit model (SP dataset) robust statistics model coefficient value std err t-stat p-value β euro [euro] (monetary reward) 0,104 0,0219 4,77 0,00 β sde [hour] (early arrival) 1,25 0,0535 23,30 0,00 β sdl [hour] (late arrival) 1,37 0,0636 21,61 0,00 β yeti (smartphone reward) 19,7 2,07 9,54 0,00 β pt [hour] (excess pt travel time) 0,0507 0,346 0,15 0,88 β t [hour] (car congestion delay) 0,932 0,313 2,98 0,00 C pt (constant for public transport) 1,56 0,258 6,05 0,00 C bike (constant for bike) 2,57 0,105 24,58 0,00 C home (constant for telework) 2,82 0,0977 28,87 0,00 C other (constant for other modes) 2,98 0,114 26,21 0,00 estimation statistics observations 1530 log-likelihood 2681,507

Table 0.4: Panel mixed logit model (SP dataset) robust statistics model coefficient value std err t-stat p-value β euro [euro] (monetary reward) 0,115 0,0246 4,67 0,00 β sde [hour] (early arrival) 1,25 0,106 11,73 0,00 β sdl [hour] (late arrival) 1,37 0,121 11,37 0,00 β yeti (smartphone reward) 20,6 2,83 7,27 0,00 β pt [hour] (excess pt travel time) 0,0653 0,339 0,19 0,85 β t [hour] (car congestion delay) 0,745 0,286 2,61 0,01 C pt (constant for public transport) 1,79 0,301 5,96 0,00 C bike (constant for bike) 2,81 0,213 13,20 0,00 C home (constant for telework) 3,06 0,190 16,13 0,00 C other (constant for other modes) 3,22 0,233 13,84 0,00 µ (inter personal heterogeneity) 1,48 0,104 14,21 0,00 estimation statistics observations 1530 individuals 286 Halton draws 1000 log-likelihood 2591,148

Joint RP-SP estimation Common approach: scale the utility U jmn of one of both: U joint = s RP U RP + s SP U SP Panel: consistency across all observations for same respondent n

Table 0.7: Multinomial logit model (joint RP-SP dataset) 1/2 robust statistics model coefficient value std err t-stat p-value joint coefficients β euro [euro] (monetary reward) 0,324 0,00680 47,65 0,00 revealed preference coefficients β sde [hour] (early arrival) 1,85 0,0339 54,50 0,00 β sdl [hour] (late arrival) 2,20 0,0374 58,82 0,00 β yeti (smartphone reward) 2,29 0,0743 30,78 0,00 β t [hour] (motorway car travel time) 1,67 0,237 7,05 0,00 β weather [max temp in C] (bike) 0,115 0,0170 6,78 0,00 C pt (constant for public transport) 1,88 0,0433 43,32 0,00 C bike (constant for bike) 4,85 0,288 16,84 0,00 C offpeak (constant for car not 7h 10h) 1,15 0,0389 29,47 0,00 C rideshare (constant for carpool) 3,42 0,0758 45,07 0,00 C home (constant for telework) 2,84 0,0606 46,79 0,00 C other (constant for other modes) 3,65 0,0835 43,68 0,00

Table 0.8: Multinomial logit model (joint RP-SP dataset) 2/2 robust statistics model coefficient value std err t-stat p-value stated preference coefficients β sde [hour] (early arrival) 3,95 0,794 4,98 0,00 β sdl [hour] (late arrival) 4,37 0,875 4,99 0,00 β yeti (smartphone reward) 58,3 8,70 6,70 0,00 β pt [hour] (excess pt travel time) 0,207 1,11 0,19 0,85 β t [hour] (car congestion delay) 3,07 1,50 2,05 0,04 C pt (constant for public transport) 5,00 1,30 3,85 0,00 C bike (constant for bike) 8,27 1,76 4,71 0,00 C home (constant for telework) 9,06 1,91 4,76 1,00 C other (constant for other modes) 9,56 2,01 4,76 0,00 scale parameters s RP 1 fixed s SP 0,312 0,0640 4,88 0,00 estimation statistics SP observations 1530 RP observations 12371 log-likelihood 31335,698

Table 0.9: Panel mixed logit model (joint RP-SP dataset) 1/2 robust statistics model coefficient value std err t-stat p-value joint coefficients β euro [euro] (monetary reward) 0,350 0,0198 17,68 0,00 revealed preference coefficients β sde [hour] (early arrival) 1,95 0,125 15,55 0,00 β sdl [hour] (late arrival) 2,23 0,0962 23,22 0,00 β yeti (smartphone reward) 2,18 0,168 13,00 0,00 β t [hour] (motorway car travel time) 1,54 0,371 4,14 0,00 β weather [max temp in C] (bike) 0,0999 0,0243 4,12 0,00 C pt (constant for public transport) 2,23 0,173 12,87 0,00 C bike (constant for bike) 4,97 0,522 9,52 0,00 C offpeak (constant for car not 7h 10h) 1,49 0,141 10,61 0,00 C rideshare (constant for carpool) 3,77 0,232 16,25 0,00 C home (constant for telework) 3,19 0,174 18,36 0,00 C other (constant for other modes) 4,00 0,359 11,14 0,00 µ (inter personal heterogeneity) 1,70 0,0784 21,69 0,00

Table 0.10: Panel mixed logit model (joint RP-SP dataset) 2/2 robust statistics model coefficient value std err t-stat p-value stated preference coefficients β sde [hour] (early arrival) 3,84 0,678 5,66 0,00 β sdl [hour] (late arrival) 4,23 0,746 5,67 0,00 β yeti (smartphone reward) 49,4 8,18 6,04 0,00 β pt [hour] (excess pt travel time) 0,540 1,09 0,50 0,62 β t [hour] (car congestion delay) 3,36 1,23 2,73 0,01 C pt (constant for public transport) 4,90 1,16 4,21 0,00 C bike (constant for bike) 8,18 1,54 5,31 0,00 C home (constant for telework) 8,92 1,61 5,54 0,00 C other (constant for other modes) 9,38 1,74 5,38 0,00 µ (inter personal heterogeneity) 1,75 0,447 3,92 0,00 scale coefficients s RP 1 fixed s SP 0,336 0,0633 5,31 0,00 estimation statistics SP observations 1530 RP observations 12371 individuals 322 Halton draws 250 log-likelihood 29681,480

Conclusions Ratio scale allows for consistent discrete choice model estimation SP-RP on microdata scheduling model: Time and scheduling valuation about factor 2 higher in SP than RP Consistent panel specification makes sense Paper will appear at http://www.tinbergen.nl