Estimation of mixed generalized extreme value models

Size: px
Start display at page:

Download "Estimation of mixed generalized extreme value models"

Transcription

1 Estimation of mixed generalized extreme value models Michel Bierlaire Operations Research Group ROSO Institute of Mathematics EPFL Katholieke Universiteit Leuven, November 2004 p.1

2 Introduction It is our choices that show what we truly are, far more than our abilities Albus Dumbledore Katholieke Universiteit Leuven, November 2004 p.2

3 Introduction Katholieke Universiteit Leuven, November 2004 p.3

4 Introduction Nobel Prize 2000 to D. Mc Fadden for his development of theory and methods for analyzing discrete choice Katholieke Universiteit Leuven, November 2004 p.4

5 Introduction Discrete choice models: P (i C n ) where C n = {1,..., J} Random utility models: U in = V in + ε in and P (i C n ) = P (U in U jn, j = 1,..., J) Utility is a latent concept Katholieke Universiteit Leuven, November 2004 p.5

6 Multinomial Logit Model Assumption: ε in are the maximum of many r.v. capturing unobservable attributes (e.g. mood, experience), measurement and specification errors. Gumbel theorem: the maximum of many i.i.d. random variables (with a tail) approximately follows a Gumbel distribution. ε in Gumbel(0, µ) Katholieke Universiteit Leuven, November 2004 p.6

7 Multinomial Logit Model Gumbel(η, µ), with µ > 0 : f(t) = µe µ(t η) e e µ(t η) If ε Gumbel(η, µ), then P (c ε) = F (c) = c f(t)dt = e e µ(t η) Katholieke Universiteit Leuven, November 2004 p.7

8 Multinomial Logit Model If then where E[ε] = η + γ µ ε Gumbel(η, µ) and Var[ε] = π2 6µ 2 γ = lim k k i=1 1 i ln k Euler constant Katholieke Universiteit Leuven, November 2004 p.8

9 Multinomial Logit Model The difference of two Gumbel distribution is logistic We have P (i {i, j}) = P (V i + ε i V j + ε j ) = P (V i V j ε j ε i ) We obtain the multinomial logit model P (i C n ) = e V in j C n e V jn Katholieke Universiteit Leuven, November 2004 p.9

10 Multinomial Logit Model Multinomial logit model: ε in i.i.d. Gumbel Gumbel is an Extreme Value distribution ε in is the maximum of many r.v. capturing unobservable attributes, measurement and specification errors. Key assumption: Independence Katholieke Universiteit Leuven, November 2004 p.10

11 Relaxing the independence assumption U 1n. = V 1n. + ε 1n. U Jn V Jn ε Jn that is U n = V n + ε n and ε n is a vector of random variables. Katholieke Universiteit Leuven, November 2004 p.11

12 Relaxing the independence assumption ε n N(0, Σ): multinomial probit model No closed form for the multifold integral Numerical integration is computationally infeasible Extensions of multinomial logit model Nested logit model Generalized Extreme Value (GEV) models Katholieke Universiteit Leuven, November 2004 p.12

13 GEV models Family of models proposed by McFadden (1978) Idea: a model is generated by a function From G, we can build G : R J R The cumulative distribution function (CDF) of ε n The probability model The expected maximum utility Not equivalent to GEV in statistics Katholieke Universiteit Leuven, November 2004 p.13

14 GEV models 1. G is homogeneous of degree µ > 0, that is G(αx) = α µ G(x) 2. lim x i + G(x 1,..., x i,..., x J ) = +, for each i = 1,..., J, 3. the kth partial derivative with respect to k distincts x i is non negative if k is odd and non positive if k is even, i.e., for all (disctincts) indices i 1,..., i k {1,..., J}, we have ( 1) k k G (x) 0, x R J x i1... x +. ik Katholieke Universiteit Leuven, November 2004 p.14

15 GEV models Density function: F (ε 1,..., ε J ) = e G(e ε 1,...,e ε J ) Probability: P (i C) = ev i +ln G i (ev 1,...,e V J ) Pj C ev j +ln G j (ev 1,...,e V J ) with G i = G x i. This is a closed form Expected maximum utility: V C = Euler s constant. ln G(...)+γ µ where γ is Note: P (i C) = V C V i. Katholieke Universiteit Leuven, November 2004 p.15

16 GEV models Example: G(e V 1,..., e V J ) = J i=1 eµv i P (i) = e V i+ln G i (e V 1,...,e V J ) with G j C ev j+ln G j (e V 1,...,e V J ) i(x) = µx µ 1 i e V i+ln G i (e V 1,...,e V J ) = e V i+ln µ+(µ 1) ln e V i = e ln µ+µv i P (i) = e ln µ+µv i = eµvi j C eln µ+µv j j C eµv j Multinomial Logit Model Katholieke Universiteit Leuven, November 2004 p.16

17 GEV models Multinomial logit model Nested logit model Cross-nested logit model and more... Katholieke Universiteit Leuven, November 2004 p.17

18 GEV models Closed form probability model Provides a great deal of flexibility Formulation not in term of correlations Require heavy proofs Katholieke Universiteit Leuven, November 2004 p.18

19 Properties of GEV Let R J i be p subspaces spanning R J. For any vector y R J, [y] i denotes the projection of y on R J i. It is assumed that the projection is such that all entries of [y] i are strictly positive. Let G i : R J i + R, i = 1,..., p be µ-gev functions. Then, the function G : R J + R : y G(y) = p i=1 α i G i ([y] i ) is also a µ-gev function if α i > 0, i = 1,..., p. Katholieke Universiteit Leuven, November 2004 p.19

20 Properties of GEV Let G : R J + R be a µ-gev function. Then G β is a (µβ)-gev function if 0 < β 1. Katholieke Universiteit Leuven, November 2004 p.20

21 Properties of GEV Let R J i be p subspaces spanning R J. For any vector y R J, denote by [y] i the projection of y on R J i. Let G i : R J i + R, i = 1,..., p be µ i -GEV functions. Then, the function G : R J + R : y G(y) = p i=1 α i G i ([y] i ) µ µ i is a µ-gev function if α i > 0 and 0 < µ µ i, i = 1,..., p. Katholieke Universiteit Leuven, November 2004 p.21

22 Properties of GEV Moreover, P (i C) = e V i+ln G i (e V 1,...,e V J ) j C ev j+ln G j (e V 1,...,e V J ) So, α i e V i+ln G i (e V 1,...,e V J ) = e V i+ln G i (e V 1 + ln α i,...,e V J + ln α i ) α i P (i C) j B α jp (j C) = evi+ln Gi(eV1+ln αi,...,evj +ln αi ) j B ev j+ln G j (e V 1 +ln αj,...,e V J +ln αj ) Katholieke Universiteit Leuven, November 2004 p.22

23 Applications These properties have practical consequences Network GEV Sampling strategy Katholieke Universiteit Leuven, November 2004 p.23

24 Network GEV Extension of the tree representation for Nested Logit Investigate new GEV models Provide the proof once for all Katholieke Universiteit Leuven, November 2004 p.24

25 Network GEV Let (V, E) be a network with link parameters α (i,j) 0 Assumptions: 1. No circuit. 2. One node without predecessor: root. 3. J nodes without successor: alternatives. 4. For each node v i, there exists at least one path from the root to v i such that P k=1 α (i k 1,i k ) > 0. Katholieke Universiteit Leuven, November 2004 p.25

26 Network GEV For each node v i, we define a set of indices I i {1,..., J} of J i relevant alternatives, a homogeneous function G i : R J i R, and a parameter µ i. Recursive definition of I i : I i = {i} for alternatives, I i = j succ(i) I j for all other nodes. Katholieke Universiteit Leuven, November 2004 p.26

27 Network GEV Recursive definition of G i : For alternatives: G i : R R : G i (x i ) = x µ i i For all others: G i : R J i R : G i (x) = j succ(i) i = 1,..., J α (i,j) G j (x) µ i µ j Katholieke Universiteit Leuven, November 2004 p.27

28 Network GEV Example: Cross-Nested Logit i=4,5 α 0i (α i1 y µ i 1 + α i2 y µ i 2 + α i3 y µ i y µ 1 1 y µ 2 2 y µ 3 3 G = m 3 ) µ 0 µ i ( j C α 51 y µ α 52 y µ α 53 y µ 5 3 ) µ µm α jm y µ m j Katholieke Universiteit Leuven, November 2004 p.28

29 Network GEV Daly & Bierlaire (2003) GEV calculus Possibility to define new GEV models No more proof needed for Network GEV Katholieke Universiteit Leuven, November 2004 p.29

30 Sampling Population probability of choice i C and socio-eco char: P i (z, β )p(z). Probability of being sampled: R(i, z) exogeneous sample: R(i, z) = R(z) choice-based sample: R(i, z) = R(i) Sampling of alternative: A(z) = {j C R(j, z) > 0} Let s draw B, a subset of A(z), with probability S(B i, z). Analyze choice as if it were limited to B. Katholieke Universiteit Leuven, November 2004 p.30

31 Sampling Contribution to the likelihood: P (i z, B, β) = P i (z, β)r(i, z)s(b i, z) j B P j(z, β)r(j, z)s(b j, z) If P i (z, β) is given by a GEV model, we obtain P (i z, B, β) = R(i, z)s(b i, z)e V i+ln G i (e V 1,...,e V J ) j B R(j, z)s(b j, z)ev j+ln G j (e V 1,...,e V J ) Because of the property, let α(i, z) = ln R(i, z) + ln S(B i, z) P (i z, B, β) = e V i+ln G i (e V 1 +α(i,z),...,e V J +α(i,z) ) j B ev j+ln G j (e V 1 +α(i,z),...,e V J +α(i,z) ) Katholieke Universiteit Leuven, November 2004 p.31

32 Sampling The model can be estimated as if a pure random sampling strategy was used Only the constants are affected. Restrictions apply on the sampling of alternatives Katholieke Universiteit Leuven, November 2004 p.32

33 Mixed GEV GEV models cannot handle all possible correlation structures Cannot capture heteroscedasticity and heterogeneity Necessity of mixing the model Katholieke Universiteit Leuven, November 2004 p.33

34 Mixed GEV U n = V n + ε n ε n compliant with GEV theory V n contains random parameters. V n = β T X n where β N( β, Σ) Using the Cholesky factorization, we have β = β + P ζ where Σ = P P T and ζ are i.i.d. standard normal variates. Katholieke Universiteit Leuven, November 2004 p.34

35 Mixed GEV McFadden & Train(2000) Under mild regularity conditions, any discrete choice model derived from random utility maximization has choice probabilities that can be approximated as closely as one pleases by a Mixed MNL model. Why bother with Mixed GEV? Katholieke Universiteit Leuven, November 2004 p.35

36 Mixed GEV GEV has closed form formulation Mixed models require simulated maximum likelihood estimation Capture as much as possible of the correlation using GEV Use the mixing distribution for the rest Issue: estimation Katholieke Universiteit Leuven, November 2004 p.36

37 BIOGEME Motivations GEV family must be explored Complicated implementation No appropriate software package Most researchers use commercial packages: LIMDEP, ALOGIT, HieLoW or Gauss, Matlab, SAS Freeware: Kenneth Train (but based on Gauss) Katholieke Universiteit Leuven, November 2004 p.37

38 BIOGEME Objectives Maximum likelihood estimation of a wide variety of GEV models Use various nonlinear optimization algorithms Open source Designed for researchers Flexible and easily extensible Katholieke Universiteit Leuven, November 2004 p.38

39 BIOGEME BIerlaire s Optimization toolbox for GEV Models Estimation Development : Version 0.0: July 2, Version 0.7: December 15, 2003 Version 0.8: March 19, 2004 Version 1.0: September 17, 2004 Katholieke Universiteit Leuven, November 2004 p.39

40 BIOGEME Input files mymodel.mod: model specification sample.dat: sample data mymodel.par: general control of the package Output files mymodel.html : estimated parameters + statistics mymodel.sta : sample statistics technical and debugging reports Katholieke Universiteit Leuven, November 2004 p.40

41 GEV models Available in BIOGEME Multinomial logit model Nested logit model Cross-nested logit model Network GEV model Katholieke Universiteit Leuven, November 2004 p.41

42 Heterogeneity GEV models are homoscedastic Assume there are two different groups such that and Var(ε in2 ) = α 2 Var(ε in1 ) Then we prefer the model U in1 = V in1 + ε in1 U in2 = V in2 + ε in2 αu in1 = αv in1 + αε in1 U in2 = V in2 + ε in2 Katholieke Universiteit Leuven, November 2004 p.42

43 Heterogeneity If V in1 is linear-in-parameters, that is V in1 = j β j x jin1 then αv in1 = j αβ j x jin1 is nonlinear. Katholieke Universiteit Leuven, November 2004 p.43

44 Nonlinear utility funtions Other types of nonlinearities Box-Cox Box-Tukey transforms β (x + α)λ 1 λ where β, α and λ must be estimated Continuous market segmentation. Example: the cost parameter varies with income, ( ) λ β cost = ˆβ inc cost with λ = β cost inc ref inc inc β cost Katholieke Universiteit Leuven, November 2004 p.44

45 Mixed GEV models V n contains random parameters. V n = f(β f, β N, β U, X n ) where β f are deterministic β N N( β N, Σ) (β U ) i U(a i, b i ) Because f is nonlinear, other distributions than normal and uniform are possible Katholieke Universiteit Leuven, November 2004 p.45

46 Mixed GEV models Lognormal: if β is normal, then e β is lognormal Triangular: if β 1 and β 2 are uniform [0,1], then 1 2 (β 1 + β 2 ) is triangular. S B distribution: if β is normal, then e β 1 + e β is a S B distribution between 0 and 1. Katholieke Universiteit Leuven, November 2004 p.46

47 Mixed GEV models S B distribution Katholieke Universiteit Leuven, November 2004 p.47

48 Mixed GEV models Example: specification with correlated normally and lognormally distributed random coefficients V in = β 1 X i1n + β 2 X i2n where β 1 and β 2 are generated from ( β1 ln β 2 ) = ( β1 β 2 ) + [ p11 0 p 21 p 22 and ζ 1 and ζ 2 are independent N(0, 1). ] [ ζ1 ζ 2 ] Katholieke Universiteit Leuven, November 2004 p.48

49 Panel data Several observations are available for each individual Need to capture the individual-specific effects At each instance t, we have V nt = V (β nt, β n, X nt ) where β n are random parameters constant across t for a given individual n. Katholieke Universiteit Leuven, November 2004 p.49

50 Miscellaneous features Functionalities can be combined Model specification language BETA1 [ BETA1_S ] * x11 + exp( BETA2 [ BETA2_S ] ) * x12 Simulation with Halton draws Several optimization packages Constrained likelihood estimation Katholieke Universiteit Leuven, November 2004 p.50

51 Miscellaneous features Robust variance-covariance ( sandwich ) Output in HTML format Katholieke Universiteit Leuven, November 2004 p.51

52 And there is more than in BIOGEME... BIOSIM for forecasting by sample enumeration BIOROUTE for route choice models BIOLOOP to generate large-scale models chi2.xls to perform χ 2 tests Katholieke Universiteit Leuven, November 2004 p.52

53 Short course Lausanne, March 20-24, 2005 M. Ben Akiva D. McFadden M. Bierlaire D. Bolduc Katholieke Universiteit Leuven, November 2004 p.53

The Network GEV model

The Network GEV model The Network GEV model Michel Bierlaire January, 2002 Discrete choice models play a major role in transportation demand analysis. Their nice and strong theoretical properties, and their flexibility to capture

More information

Binary choice. Michel Bierlaire

Binary choice. Michel Bierlaire Binary choice Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale de Lausanne M. Bierlaire (TRANSP-OR ENAC EPFL)

More information

BIOGEME: a free package for the estimation of discrete choice models

BIOGEME: a free package for the estimation of discrete choice models BIOGEME: a free package for the estimation of discrete choice models Michel Bierlaire, EPFL Conference paper STRC 2003 STRC 3rd Swiss Transport Research Conference Monte Verita, Ascona, March 2003 2 BIOGEME:

More information

Divers aspects de la modélisation de la demande en transport

Divers aspects de la modélisation de la demande en transport Divers aspects de la modélisation de la demande en transport Michel Bierlaire Laboratoire Transport et Mobilité, Ecole Polytechnique Fédérale de Lausanne Divers aspects de la modélisation de la demande

More information

Normalization and correlation of cross-nested logit models

Normalization and correlation of cross-nested logit models Normalization and correlation of cross-nested logit models E. Abbe, M. Bierlaire, T. Toledo Lab. for Information and Decision Systems, Massachusetts Institute of Technology Inst. of Mathematics, Ecole

More information

Hiroshima University 2) The University of Tokyo

Hiroshima University 2) The University of Tokyo September 25-27, 2015 The 14th Summer course for Behavior Modeling in Transportation Networks @The University of Tokyo Advanced behavior models Recent development of discrete choice models Makoto Chikaraishi

More information

Introduction to mixtures in discrete choice models

Introduction to mixtures in discrete choice models Introduction to mixtures in discrete choice models p. 1/42 Introduction to mixtures in discrete choice models Michel Bierlaire michel.bierlaire@epfl.ch Transport and Mobility Laboratory Introduction to

More information

A general and operational representation of GEV models

A general and operational representation of GEV models A general and operational representation of GEV models Andrew Daly Michel Bierlaire Report RO-030502 May 2, 2003 Abstract Generalised Extreme Value models provide an interesting theoretical framework to

More information

Normalization and correlation of cross-nested logit models

Normalization and correlation of cross-nested logit models Normalization and correlation of cross-nested logit models E. Abbe M. Bierlaire T. Toledo August 5, 25 Abstract We address two fundamental issues associated with the use of cross-nested logit models. On

More information

Capturing Correlation in Route Choice Models using Subnetworks

Capturing Correlation in Route Choice Models using Subnetworks Capturing Correlation in Route Choice Models using Subnetworks Emma Frejinger and Michel Bierlaire Transport and Mobility Laboratory (TRANSP-OR), EPFL Capturing Correlation with Subnetworks in Route Choice

More information

Hiroshima University 2) Tokyo University of Science

Hiroshima University 2) Tokyo University of Science September 23-25, 2016 The 15th Summer course for Behavior Modeling in Transportation Networks @The University of Tokyo Advanced behavior models Recent development of discrete choice models Makoto Chikaraishi

More information

A sampling of alternatives approach for route choice models

A sampling of alternatives approach for route choice models Seminar, Namur, February 2009, A sampling of alternatives approach for route choice models p. 1/34 A sampling of alternatives approach for route choice models Emma Frejinger Centre for Transport Studies,

More information

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2. Recap: MNL. Recap: MNL

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2. Recap: MNL. Recap: MNL Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2 Tetsuya Matsubayashi University of North Texas November 9, 2010 Learn multiple responses models that do not require the assumption

More information

Normalization and correlation of cross-nested logit models

Normalization and correlation of cross-nested logit models Normalization and correlation of cross-nested logit models E. Abbe M. Bierlaire T. Toledo Paper submitted to Transportation Research B Final version November 2006 Abstract We address two fundamental issues

More information

The estimation of Generalized Extreme Value models from choice-based samples

The estimation of Generalized Extreme Value models from choice-based samples The estimation of Generalized Extreme Value models from choice-based samples M. Bierlaire D. Bolduc D. McFadden August 10, 2006 Abstract We consider an estimation procedure for discrete choice models in

More information

Lecture-20: Discrete Choice Modeling-I

Lecture-20: Discrete Choice Modeling-I Lecture-20: Discrete Choice Modeling-I 1 In Today s Class Introduction to discrete choice models General formulation Binary choice models Specification Model estimation Application Case Study 2 Discrete

More information

Route choice models: Introduction and recent developments

Route choice models: Introduction and recent developments Route choice models: Introduction and recent developments Michel Bierlaire transp-or.epfl.ch Transport and Mobility Laboratory, EPFL Route choice models: Introduction and recent developments p.1/40 Route

More information

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1 Tetsuya Matsubayashi University of North Texas November 2, 2010 Random utility model Multinomial logit model Conditional logit model

More information

Lecture 6: Discrete Choice: Qualitative Response

Lecture 6: Discrete Choice: Qualitative Response Lecture 6: Instructor: Department of Economics Stanford University 2011 Types of Discrete Choice Models Univariate Models Binary: Linear; Probit; Logit; Arctan, etc. Multinomial: Logit; Nested Logit; GEV;

More information

Practical note on specification of discrete choice model. Toshiyuki Yamamoto Nagoya University, Japan

Practical note on specification of discrete choice model. Toshiyuki Yamamoto Nagoya University, Japan Practical note on specification of discrete choice model Toshiyuki Yamamoto Nagoya University, Japan 1 Contents Comparison between bary logit model and bary probit model Comparison between multomial logit

More information

Masking Identification of Discrete Choice Models under Simulation Methods *

Masking Identification of Discrete Choice Models under Simulation Methods * Masking Identification of Discrete Choice Models under Simulation Methods * Lesley Chiou 1 and Joan L. Walker 2 Abstract We present examples based on actual and synthetic datasets to illustrate how simulation

More information

A Dynamic Programming Approach for Quickly Estimating Large Scale MEV Models

A Dynamic Programming Approach for Quickly Estimating Large Scale MEV Models A Dynamic Programming Approach for Quickly Estimating Large Scale MEV Models Tien Mai Emma Frejinger Mogens Fosgerau Fabien Bastin June 2015 A Dynamic Programming Approach for Quickly Estimating Large

More information

6 Mixed Logit. 6.1 Choice Probabilities

6 Mixed Logit. 6.1 Choice Probabilities 6 Mixed Logit 6.1 Choice Probabilities Mixed logit is a highly flexible model that can approximate any random utility model (McFadden & Train, 2000). It obviates the three limitations of standard logit

More information

Binary choice 3.3 Maximum likelihood estimation

Binary choice 3.3 Maximum likelihood estimation Binary choice 3.3 Maximum likelihood estimation Michel Bierlaire Output of the estimation We explain here the various outputs from the maximum likelihood estimation procedure. Solution of the maximum likelihood

More information

Modeling Binary Outcomes: Logit and Probit Models

Modeling Binary Outcomes: Logit and Probit Models Modeling Binary Outcomes: Logit and Probit Models Eric Zivot December 5, 2009 Motivating Example: Women s labor force participation y i = 1 if married woman is in labor force = 0 otherwise x i k 1 = observed

More information

Estimating hybrid choice models with the new version of Biogeme

Estimating hybrid choice models with the new version of Biogeme Estimating hybrid choice models with the new version of Biogeme p. 1/34 Estimating hybrid choice models with the new version of Biogeme Michel Bierlaire transp-or.epfl.ch Transport and Mobility Laboratory,

More information

Chapter 3 Choice Models

Chapter 3 Choice Models Chapter 3 Choice Models 3.1 Introduction This chapter describes the characteristics of random utility choice model in a general setting, specific elements related to the conjoint choice context are given

More information

Discrete panel data. Michel Bierlaire

Discrete panel data. Michel Bierlaire Discrete panel data Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale de Lausanne M. Bierlaire (TRANSP-OR ENAC

More information

Logit kernel (or mixed logit) models for large multidimensional choice problems: identification and estimation

Logit kernel (or mixed logit) models for large multidimensional choice problems: identification and estimation Logit kernel (or mixed logit) models for large multidimensional choice problems: identification and estimation John L. Bowman, Ph. D., Research Affiliate, Massachusetts Institute of Technology, 5 Beals

More information

The 17 th Behavior Modeling Summer School

The 17 th Behavior Modeling Summer School The 17 th Behavior Modeling Summer School September 14-16, 2017 Introduction to Discrete Choice Models Giancarlos Troncoso Parady Assistant Professor Urban Transportation Research Unit Department of Urban

More information

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

More information

Duration Analysis. Joan Llull

Duration Analysis. Joan Llull Duration Analysis Joan Llull Panel Data and Duration Models Barcelona GSE joan.llull [at] movebarcelona [dot] eu Introduction Duration Analysis 2 Duration analysis Duration data: how long has an individual

More information

Introduction to Discrete Choice Models

Introduction to Discrete Choice Models Chapter 7 Introduction to Dcrete Choice Models 7.1 Introduction It has been mentioned that the conventional selection bias model requires estimation of two structural models, namely the selection model

More information

STA 4273H: Statistical Machine Learning

STA 4273H: Statistical Machine Learning STA 4273H: Statistical Machine Learning Russ Salakhutdinov Department of Statistics! rsalakhu@utstat.toronto.edu! http://www.utstat.utoronto.ca/~rsalakhu/ Sidney Smith Hall, Room 6002 Lecture 11 Project

More information

STA 414/2104: Machine Learning

STA 414/2104: Machine Learning STA 414/2104: Machine Learning Russ Salakhutdinov Department of Computer Science! Department of Statistics! rsalakhu@cs.toronto.edu! http://www.cs.toronto.edu/~rsalakhu/ Lecture 9 Sequential Data So far

More information

Panel Data Seminar. Discrete Response Models. Crest-Insee. 11 April 2008

Panel Data Seminar. Discrete Response Models. Crest-Insee. 11 April 2008 Panel Data Seminar Discrete Response Models Romain Aeberhardt Laurent Davezies Crest-Insee 11 April 2008 Aeberhardt and Davezies (Crest-Insee) Panel Data Seminar 11 April 2008 1 / 29 Contents Overview

More information

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo Lecture 1 Behavioral Models Multinomial Logit: Power and limitations Cinzia Cirillo 1 Overview 1. Choice Probabilities 2. Power and Limitations of Logit 1. Taste variation 2. Substitution patterns 3. Repeated

More information

A Mixed Spatially Correlated Logit Model: Formulation and Application to Residential Choice Modeling

A Mixed Spatially Correlated Logit Model: Formulation and Application to Residential Choice Modeling A Mixed Spatially Correlated Logit Model: Formulation and Application to Residential Choice Modeling Chandra R. Bhat and Jessica Guo Department of Civil Engineering, ECJ Hall, Suite 6.8 The University

More information

An Overview of Choice Models

An Overview of Choice Models An Overview of Choice Models Dilan Görür Gatsby Computational Neuroscience Unit University College London May 08, 2009 Machine Learning II 1 / 31 Outline 1 Overview Terminology and Notation Economic vs

More information

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables

More information

A Joint Tour-Based Model of Vehicle Type Choice and Tour Length

A Joint Tour-Based Model of Vehicle Type Choice and Tour Length A Joint Tour-Based Model of Vehicle Type Choice and Tour Length Ram M. Pendyala School of Sustainable Engineering & the Built Environment Arizona State University Tempe, AZ Northwestern University, Evanston,

More information

Single-level Models for Binary Responses

Single-level Models for Binary Responses Single-level Models for Binary Responses Distribution of Binary Data y i response for individual i (i = 1,..., n), coded 0 or 1 Denote by r the number in the sample with y = 1 Mean and variance E(y) =

More information

Calculating indicators with Biogeme

Calculating indicators with Biogeme Calculating indicators with Biogeme Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale de Lausanne September 25,

More information

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated

More information

disc choice5.tex; April 11, ffl See: King - Unifying Political Methodology ffl See: King/Tomz/Wittenberg (1998, APSA Meeting). ffl See: Alvarez

disc choice5.tex; April 11, ffl See: King - Unifying Political Methodology ffl See: King/Tomz/Wittenberg (1998, APSA Meeting). ffl See: Alvarez disc choice5.tex; April 11, 2001 1 Lecture Notes on Discrete Choice Models Copyright, April 11, 2001 Jonathan Nagler 1 Topics 1. Review the Latent Varible Setup For Binary Choice ffl Logit ffl Likelihood

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 1 Jakub Mućk Econometrics of Panel Data Meeting # 1 1 / 31 Outline 1 Course outline 2 Panel data Advantages of Panel Data Limitations of Panel Data 3 Pooled

More information

Probabilistic Choice Models

Probabilistic Choice Models Probabilistic Choice Models James J. Heckman University of Chicago Econ 312 This draft, March 29, 2006 This chapter examines dierent models commonly used to model probabilistic choice, such as eg the choice

More information

Logistic regression. 11 Nov Logistic regression (EPFL) Applied Statistics 11 Nov / 20

Logistic regression. 11 Nov Logistic regression (EPFL) Applied Statistics 11 Nov / 20 Logistic regression 11 Nov 2010 Logistic regression (EPFL) Applied Statistics 11 Nov 2010 1 / 20 Modeling overview Want to capture important features of the relationship between a (set of) variable(s)

More information

Sample Problems. Note: If you find the following statements true, you should briefly prove them. If you find them false, you should correct them.

Sample Problems. Note: If you find the following statements true, you should briefly prove them. If you find them false, you should correct them. Sample Problems 1. True or False Note: If you find the following statements true, you should briefly prove them. If you find them false, you should correct them. (a) The sample average of estimated residuals

More information

How Indecisiveness in Choice Behaviour affects the Magnitude of Parameter Estimates obtained in Discrete Choice Models. Abstract

How Indecisiveness in Choice Behaviour affects the Magnitude of Parameter Estimates obtained in Discrete Choice Models. Abstract How Indecisiveness in Choice Behaviour affects the Magnitude of Parameter Estimates obtained in Discrete Choice Models Abstract Parameter estimates ( βˆ ) obtained in discrete choice models are confounded

More information

Data-analysis and Retrieval Ordinal Classification

Data-analysis and Retrieval Ordinal Classification Data-analysis and Retrieval Ordinal Classification Ad Feelders Universiteit Utrecht Data-analysis and Retrieval 1 / 30 Strongly disagree Ordinal Classification 1 2 3 4 5 0% (0) 10.5% (2) 21.1% (4) 42.1%

More information

I. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as

I. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as Econ 513, USC, Fall 2005 Lecture 15 Discrete Response Models: Multinomial, Conditional and Nested Logit Models Here we focus again on models for discrete choice with more than two outcomes We assume that

More information

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2016 Instructor: Victor Aguirregabiria

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2016 Instructor: Victor Aguirregabiria ECOOMETRICS II (ECO 24S) University of Toronto. Department of Economics. Winter 26 Instructor: Victor Aguirregabiria FIAL EAM. Thursday, April 4, 26. From 9:am-2:pm (3 hours) ISTRUCTIOS: - This is a closed-book

More information

Lecture 14 More on structural estimation

Lecture 14 More on structural estimation Lecture 14 More on structural estimation Economics 8379 George Washington University Instructor: Prof. Ben Williams traditional MLE and GMM MLE requires a full specification of a model for the distribution

More information

Development and estimation of a semicompensatory model with a flexible error structure

Development and estimation of a semicompensatory model with a flexible error structure Development and estimation of a semicompensatory model with a flexible error structure Sigal Kaplan, Shlomo Bekhor, Yoram Shiftan Transportation Research Institute, Technion Workshop on Discrete Choice

More information

Generalized Linear Models (GLZ)

Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) Generalized Linear Models (GLZ) are an extension of the linear modeling process that allows models to be fit to data that follow probability distributions other than the

More information

The estimation of multivariate extreme value models from choice-based samples

The estimation of multivariate extreme value models from choice-based samples The estimation of multivariate extreme value models from choice-based samples M. Bierlaire, D. Bolduc and D. McFadden Transport and Mobility Laboratory, EPFL Département d économique, Université Laval,

More information

Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business

Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business Conditional Least Squares and Copulae in Claims Reserving for a Single Line of Business Michal Pešta Charles University in Prague Faculty of Mathematics and Physics Ostap Okhrin Dresden University of Technology

More information

ISQS 5349 Spring 2013 Final Exam

ISQS 5349 Spring 2013 Final Exam ISQS 5349 Spring 2013 Final Exam Name: General Instructions: Closed books, notes, no electronic devices. Points (out of 200) are in parentheses. Put written answers on separate paper; multiple choices

More information

Chapter 5 continued. Chapter 5 sections

Chapter 5 continued. Chapter 5 sections Chapter 5 sections Discrete univariate distributions: 5.2 Bernoulli and Binomial distributions Just skim 5.3 Hypergeometric distributions 5.4 Poisson distributions Just skim 5.5 Negative Binomial distributions

More information

Bayesian linear regression

Bayesian linear regression Bayesian linear regression Linear regression is the basis of most statistical modeling. The model is Y i = X T i β + ε i, where Y i is the continuous response X i = (X i1,..., X ip ) T is the corresponding

More information

Non-linear panel data modeling

Non-linear panel data modeling Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1

More information

Route Choice Analysis: Data, Models, Algorithms and Applications Emma Frejinger Thesis Supervisor: Michel Bierlaire

Route Choice Analysis: Data, Models, Algorithms and Applications Emma Frejinger Thesis Supervisor: Michel Bierlaire p. 1/15 Emma Frejinger Thesis Supervisor: Michel Bierlaire p. 2/15 Which route would a given traveler take to go from one location to another in a transportation network? Car trips (uni-modal networks)

More information

Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment

Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment BINARY CHOICE MODELS Y ( Y ) ( Y ) 1 with Pr = 1 = P = 0 with Pr = 0 = 1 P Examples: decision-making purchase of durable consumer products unemployment Estimation with OLS? Yi = Xiβ + εi Problems: nonsense

More information

July 31, 2009 / Ben Kedem Symposium

July 31, 2009 / Ben Kedem Symposium ing The s ing The Department of Statistics North Carolina State University July 31, 2009 / Ben Kedem Symposium Outline ing The s 1 2 s 3 4 5 Ben Kedem ing The s Ben has made many contributions to time

More information

A short introduc-on to discrete choice models

A short introduc-on to discrete choice models A short introduc-on to discrete choice models BART Kenneth Train, Discrete Choice Models with Simula-on, Chapter 3. Ques-ons Impact of cost, commu-ng -me, walk -me, transfer -me, number of transfers, distance

More information

Preferred citation style. Schüssler, N. (2009) Challenges of route choice models derived from GPS, Workshop on Discrete Choice Models, EPF

Preferred citation style. Schüssler, N. (2009) Challenges of route choice models derived from GPS, Workshop on Discrete Choice Models, EPF Preferred citation style Schüssler, N. (2009) Challenges of route choice models derived from GPS, Workshop on Discrete Choice Models, EPF Lausanne, Lausanne, August 2009. Challenges of route choice models

More information

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis CIMAT Taller de Modelos de Capture y Recaptura 2010 Known Fate urvival Analysis B D BALANCE MODEL implest population model N = λ t+ 1 N t Deeper understanding of dynamics can be gained by identifying variation

More information

Destination Choice Model including panel data using WiFi localization in a pedestrian facility

Destination Choice Model including panel data using WiFi localization in a pedestrian facility Destination Choice Model including panel data using WiFi localization in a pedestrian facility Loïc Tinguely, Antonin Danalet, Matthieu de Lapparent & Michel Bierlaire Transport and Mobility Laboratory

More information

A general forulation of the cross-nested logit odel Michel Bierlaire, Dpt of Matheatics, EPFL, Lausanne Phone: Fax:

A general forulation of the cross-nested logit odel Michel Bierlaire, Dpt of Matheatics, EPFL, Lausanne Phone: Fax: A general forulation of the cross-nested logit odel Michel Bierlaire, EPFL Conference paper STRC 2001 Session: Choices A general forulation of the cross-nested logit odel Michel Bierlaire, Dpt of Matheatics,

More information

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns

More information

General Regression Model

General Regression Model Scott S. Emerson, M.D., Ph.D. Department of Biostatistics, University of Washington, Seattle, WA 98195, USA January 5, 2015 Abstract Regression analysis can be viewed as an extension of two sample statistical

More information

MLE and GMM. Li Zhao, SJTU. Spring, Li Zhao MLE and GMM 1 / 22

MLE and GMM. Li Zhao, SJTU. Spring, Li Zhao MLE and GMM 1 / 22 MLE and GMM Li Zhao, SJTU Spring, 2017 Li Zhao MLE and GMM 1 / 22 Outline 1 MLE 2 GMM 3 Binary Choice Models Li Zhao MLE and GMM 2 / 22 Maximum Likelihood Estimation - Introduction For a linear model y

More information

Statistical Tools for Making Predictions. LCPC Training Week 5 October 2010 Dr Colin Caprani

Statistical Tools for Making Predictions. LCPC Training Week 5 October 2010 Dr Colin Caprani www.colincaprani.com 1 Introduction About me: Did PhD in Bridge Traffic Loading (BTL) in UCD, 2001-2005 Concentrated on statistical analysis of BTL Continuing research in the BTL (and new areas) About

More information

Probabilistic Choice Models

Probabilistic Choice Models Econ 3: James J. Heckman Probabilistic Choice Models This chapter examines different models commonly used to model probabilistic choice, such as eg the choice of one type of transportation from among many

More information

Simulation. Li Zhao, SJTU. Spring, Li Zhao Simulation 1 / 19

Simulation. Li Zhao, SJTU. Spring, Li Zhao Simulation 1 / 19 Simulation Li Zhao, SJTU Spring, 2017 Li Zhao Simulation 1 / 19 Introduction Simulation consists of drawing from a density, calculating a statistic for each draw, and averaging the results. Simulation

More information

Rank-order conjoint experiments: efficiency and design s

Rank-order conjoint experiments: efficiency and design s Faculty of Business and Economics Rank-order conjoint experiments: efficiency and design s Bart Vermeulen, Peter Goos and Martina Vandebroek DEPARTMENT OF DECISION SCIENCES AND INFORMATION MANAGEMENT (KBI)

More information

Recovery of inter- and intra-personal heterogeneity using mixed logit models

Recovery of inter- and intra-personal heterogeneity using mixed logit models Recovery of inter- and intra-personal heterogeneity using mixed logit models Stephane Hess Kenneth E. Train Abstract Most applications of discrete choice models in transportation now utilise a random coefficient

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

Calculating indicators with PythonBiogeme

Calculating indicators with PythonBiogeme Calculating indicators with PythonBiogeme Michel Bierlaire May 17, 2017 Report TRANSP-OR 170517 Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique

More information

(I AL BL 2 )z t = (I CL)ζ t, where

(I AL BL 2 )z t = (I CL)ζ t, where ECO 513 Fall 2011 MIDTERM EXAM The exam lasts 90 minutes. Answer all three questions. (1 Consider this model: x t = 1.2x t 1.62x t 2 +.2y t 1.23y t 2 + ε t.7ε t 1.9ν t 1 (1 [ εt y t = 1.4y t 1.62y t 2

More information

CSE 250a. Assignment Noisy-OR model. Out: Tue Oct 26 Due: Tue Nov 2

CSE 250a. Assignment Noisy-OR model. Out: Tue Oct 26 Due: Tue Nov 2 CSE 250a. Assignment 4 Out: Tue Oct 26 Due: Tue Nov 2 4.1 Noisy-OR model X 1 X 2 X 3... X d Y For the belief network of binary random variables shown above, consider the noisy-or conditional probability

More information

Mixed Models for Longitudinal Ordinal and Nominal Outcomes

Mixed Models for Longitudinal Ordinal and Nominal Outcomes Mixed Models for Longitudinal Ordinal and Nominal Outcomes Don Hedeker Department of Public Health Sciences Biological Sciences Division University of Chicago hedeker@uchicago.edu Hedeker, D. (2008). Multilevel

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017 Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent

More information

Multinomial Discrete Choice Models

Multinomial Discrete Choice Models hapter 2 Multinomial Discrete hoice Models 2.1 Introduction We present some discrete choice models that are applied to estimate parameters of demand for products that are purchased in discrete quantities.

More information

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College Original December 2016, revised July 2017 Abstract Lewbel (2012)

More information

(Behavioral Model and Duality Theory)

(Behavioral Model and Duality Theory) 2018 (Behavioral Model and Duality Theory) fukuda@plan.cv.titech.ac.jp 2 / / (Rational Inattention) 6 max u(x 1,x 2 ) x 1,x 2 s.t. p 1 x 1 + p 2 x 2 = I u = V (p 1,p 2,I) I = M(p 1,p 2, ū) x H j (p 1,p

More information

Chapter 14 Logistic Regression, Poisson Regression, and Generalized Linear Models

Chapter 14 Logistic Regression, Poisson Regression, and Generalized Linear Models Chapter 14 Logistic Regression, Poisson Regression, and Generalized Linear Models 許湘伶 Applied Linear Regression Models (Kutner, Nachtsheim, Neter, Li) hsuhl (NUK) LR Chap 10 1 / 29 14.1 Regression Models

More information

Econometric Choice Formulations: Alternative Model Structures, Estimation Techniques, and Emerging Directions

Econometric Choice Formulations: Alternative Model Structures, Estimation Techniques, and Emerging Directions Econometric Choice Formulations: Alternative Model Structures, Estimation Techniues, and Emerging Directions Resource paper for workshop entitled Econometric Models of Choice: Formulation and Estimation

More information

Are You The Beatles Or The Rolling Stones?

Are You The Beatles Or The Rolling Stones? N. Ortelli, G. Lederrey, A. Alahi, M. Bierlaire Transport and Mobility Laboratory School of Architecture, Civil, and Environmental Engineering ENAC École Polytechnique Fédérale de Lausanne 13th Workshop

More information

LIST OF FORMULAS FOR STK1100 AND STK1110

LIST OF FORMULAS FOR STK1100 AND STK1110 LIST OF FORMULAS FOR STK1100 AND STK1110 (Version of 11. November 2015) 1. Probability Let A, B, A 1, A 2,..., B 1, B 2,... be events, that is, subsets of a sample space Ω. a) Axioms: A probability function

More information

The outline for Unit 3

The outline for Unit 3 The outline for Unit 3 Unit 1. Introduction: The regression model. Unit 2. Estimation principles. Unit 3: Hypothesis testing principles. 3.1 Wald test. 3.2 Lagrange Multiplier. 3.3 Likelihood Ratio Test.

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and

More information

Fall, 2007 Nonlinear Econometrics. Theory: Consistency for Extremum Estimators. Modeling: Probit, Logit, and Other Links.

Fall, 2007 Nonlinear Econometrics. Theory: Consistency for Extremum Estimators. Modeling: Probit, Logit, and Other Links. 14.385 Fall, 2007 Nonlinear Econometrics Lecture 2. Theory: Consistency for Extremum Estimators Modeling: Probit, Logit, and Other Links. 1 Example: Binary Choice Models. The latent outcome is defined

More information

Part I Behavioral Models

Part I Behavioral Models Part I Behavioral Models 2 Properties of Discrete Choice Models 2.1 Overview This chapter describes the features that are common to all discrete choice models. We start by discussing the choice set, which

More information

Chapter 11. Regression with a Binary Dependent Variable

Chapter 11. Regression with a Binary Dependent Variable Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score

More information

Estimating Single-Agent Dynamic Models

Estimating Single-Agent Dynamic Models Estimating Single-Agent Dynamic Models Paul T. Scott Empirical IO Fall, 2013 1 / 49 Why are dynamics important? The motivation for using dynamics is usually external validity: we want to simulate counterfactuals

More information

Integrated schedule planning with supply-demand interactions for a new generation of aircrafts

Integrated schedule planning with supply-demand interactions for a new generation of aircrafts Integrated schedule planning with supply-demand interactions for a new generation of aircrafts Bilge Atasoy, Matteo Salani and Michel Bierlaire Abstract We present an integrated schedule planning model

More information

LINEAR MODELS FOR CLASSIFICATION. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception

LINEAR MODELS FOR CLASSIFICATION. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception LINEAR MODELS FOR CLASSIFICATION Classification: Problem Statement 2 In regression, we are modeling the relationship between a continuous input variable x and a continuous target variable t. In classification,

More information