Introduction to mixtures in discrete choice models

Size: px
Start display at page:

Download "Introduction to mixtures in discrete choice models"

Transcription

1 Introduction to mixtures in discrete choice models p. 1/42 Introduction to mixtures in discrete choice models Michel Bierlaire Transport and Mobility Laboratory

2 Introduction to mixtures in discrete choice models p. 2/42 Outline Mixtures Capturing correlation Alternative specific variance Taste heterogeneity Latent classes

3 Introduction to mixtures in discrete choice models p. 3/42 Mixtures In statistics, a mixture probability distribution function is a convex combination of other probability distribution functions. If f(ε, θ) is a distribution function, and if w(θ) is a non negative function such that w(θ)dθ = 1 then θ g(ε) = θ w(θ)f(ε, θ)dθ is also a distribution function. We say that g is a w-mixture of f. If f is a logit model, g is a continuous w-mixture of logit If f is a MEV model, g is a continuous w-mixture of MEV

4 Introduction to mixtures in discrete choice models p. 4/42 Mixtures Discrete mixtures are also possible. If w i, i = 1,...,n are non negative weights such that then g(ε) = n w i = 1 i=1 n w i f(ε, θ i ) i=1 is also a distribution function where θ i, i = 1,...,n are parameters. We say that g is a discrete w-mixture of f.

5 Introduction to mixtures in discrete choice models p. 5/42 Example: discrete mixture of normal distributions 2.5 N(5,0.16) N(8,1) 0.6 N(5,0.16) N(8,1)

6 Introduction to mixtures in discrete choice models p. 6/42 Example: discrete mixture of binary logit models P(1 s=1,x) P(1 s=2,x) 0. 4 P(1 s=1,x) P(1 s=2,x)

7 Introduction to mixtures in discrete choice models p. 7/42 Mixtures General motivation: generate flexible distributional forms For discrete choice: correlation across alternatives alternative specific variances taste heterogeneity...

8 Introduction to mixtures in discrete choice models p. 8/42 Continuous Mixtures of logit Combining probit and logit Error decomposed into two parts U in = V in + ξ + ν Normal distribution (probit): flexibility i.i.d EV (logit): tractability

9 Introduction to mixtures in discrete choice models p. 9/42 Logit Utility: ν i.i.d. extreme value Probability: U auto = βx auto + ν auto U bus = βx bus + ν bus U subway = βx subway + ν subway Λ(auto X) = e βx auto e βx auto + e βx bus + e βx subway

10 Introduction to mixtures in discrete choice models p. 10/42 Normal mixture of logit Utility: U auto = βx auto + ξ auto + ν auto U bus = βx bus + ξ bus + ν bus U subway = βx subway + ξ subway + ν subway ν i.i.d. extreme value, ξ N(0, Σ) Probability: Λ(auto X, ξ) = e βx auto+ξ auto e βx auto+ξ auto + e βx bus +ξ bus + e βx subway +ξ subway P(auto X) = ξ Λ(auto X, ξ)f(ξ)dξ

11 Introduction to mixtures in discrete choice models p. 11/42 Capturing correlations: nesting Utility: U auto = βx auto + ν auto U bus = βx bus + σ transit η transit + ν bus U subway = βx subway + σ transit η transit + ν subway ν i.i.d. extreme value, η transit N(0, 1), σ 2 transit =cov(bus,subway) Probability: Λ(auto X, η transit ) = e βx auto e βx auto + e βx bus +σ transit η transit + e βx subway+σ transit η transit P(auto X) = η Λ(auto X, ξ)f(η)dη

12 Introduction to mixtures in discrete choice models p. 12/42 Nesting structure Example: residential telephone ASC_BM ASC_SM ASC_LF ASC_EF BETA_C σ M σ F BM ln(cost(bm)) η M 0 SM ln(cost(sm)) η M 0 LF ln(cost(lf)) 0 η F EF ln(cost(ef)) 0 η F MF ln(cost(mf)) 0 η F

13 Introduction to mixtures in discrete choice models p. 13/42 Nesting structure Identification issues: If there are two nests, only one σ is identified If there are more than two nests, all σ s are identified Walker (2001) Results with 5000 draws..

14 NL NML NML NML NML σ F = 0 σ M = 0 σ F = σ M L Value Scaled Value Scaled Value Scaled Value Scaled Value Scaled ASC BM ASC EF ASC LF ASC SM B LOGCOST FLAT MEAS σ F σ M σ 2 F + σ2 M

15 Introduction to mixtures in discrete choice models p. 14/42 Comments The scale of the parameters is different between NL and the mixture model Normalization can be performed in several ways σ F = 0 σ M = 0 σ F = σ M Final log likelihood should be the same But... estimation relies on simulation Only an approximation of the log likelihood is available Final log likelihood with draws: Unnormalized: σ M = σ F : σ F = 0: σ M = 0:

16 Introduction to mixtures in discrete choice models p. 15/42 Cross nesting 5 5Nest 1 5Nest Bus Train Car Ped. Bike U bus = V bus +ξ 1 +ε bus U train = V train +ξ 1 +ε train U car = V car +ξ 1 +ξ 2 +ε car U ped = V ped +ξ 2 +ε ped U bike = V bike +ξ 2 +ε bike P(car) = P(car ξ 1, ξ 2 )f(ξ 1 )f(ξ 2 )dξ 2 dξ 1 ξ 2 ξ 1

17 Introduction to mixtures in discrete choice models p. 16/42 Identification issue Not all parameters can be identified For logit, one ASC has to be constrained to zero Identification of NML is important and tricky See Walker, Ben-Akiva & Bolduc (2007) for a detailed analysis

18 Introduction to mixtures in discrete choice models p. 17/42 Alternative specific variance Error terms in logit are i.i.d. and, in particular, have the same variance U in = β T x in + ASC i + ε in ε in i.i.d. extreme value Var(ε in ) = π 2 /6µ 2 In order allow for different variances, we use mixtures U in = β T x in + ASC i + σ i ξ i + ε in where ξ i N(0, 1) Variance: Var(σ i ξ i + ε in ) = σ 2 i + π2 6µ 2

19 Introduction to mixtures in discrete choice models p. 18/42 Alternative specific variance Identification issue: Not all σs are identified One of them must be constrained to zero Not necessarily the one associated with the ASC constrained to zero In theory, the smallest σ must be constrained to zero In practice, we don t know a priori which one it is Solution: 1. Estimate a model with a full set of σs 2. Identify the smallest one and constrain it to zero.

20 Introduction to mixtures in discrete choice models p. 19/42 Alternative specific variance Example with Swissmetro ASC_CAR ASC_SBB ASC_SM B_COST B_FR B_TIME Car cost 0 time Train cost freq. time Swissmetro cost freq. time + alternative specific variance

21 Logit ASV ASV norm. L Value Scaled Value Scaled Value Scaled ASC CAR ASC SM B COST B FR B TIME SIGMA CAR SIGMA TRAIN SIGMA SM

22 Introduction to mixtures in discrete choice models p. 20/42 Taste heterogeneity Population is heterogeneous Taste heterogeneity is captured by segmentation Deterministic segmentation is desirable but not always possible Distribution of a parameter in the population

23 Introduction to mixtures in discrete choice models p. 21/42 Random parameters U i = β t T i + β c C i + ε i U j = β t T j + β c C j + ε j Let β t N( β t, σt 2 ), or, equivalently, β t = β t + σ t ξ, with ξ N(0, 1). U i = β t T i + σ t ξt i + β c C i + ε i U j = β t T j + σ t ξt j + β c C j + ε j If ε i and ε j are i.i.d. EV and ξ is given, we have P(i ξ) = e β t T i +σ t ξt i +β c C i, and e β t T i +σ t ξt i +β c C i + e βt T j +σ t ξt j +β c C j P(i) = P(i ξ)f(ξ)dξ. ξ

24 Introduction to mixtures in discrete choice models p. 22/42 Random parameters Example with Swissmetro ASC_CAR ASC_SBB ASC_SM B_COST B_FR B_TIME Car cost 0 time Train cost freq. time Swissmetro cost freq. time B_TIME randomly distributed across the population, normal distribution

25 Introduction to mixtures in discrete choice models p. 23/42 Random parameters Logit RC L ASC_CAR_SP ASC_SM_SP B_COST B_FR B_TIME S_TIME Prob(B_TIME 0) 8.8% χ

26 Introduction to mixtures in discrete choice models p. 24/42 Random parameters 25 Distribution of B_TIME

27 Introduction to mixtures in discrete choice models p. 25/42 Random parameters Example with Swissmetro ASC_CAR ASC_SBB ASC_SM B_COST B_FR B_TIME Car cost 0 time Train cost freq. time Swissmetro cost freq. time B_TIME randomly distributed across the population, log normal distribution

28 Introduction to mixtures in discrete choice models p. 26/42 Random parameters [Utilities] 11 SBB_SP TRAIN_AV_SP ASC_SBB_SP * one + B_COST * TRAIN_COST + B_FR * TRAIN_FR 21 SM_SP SM_AV ASC_SM_SP * one + B_COST * SM_COST + B_FR * SM_FR 31 Car_SP CAR_AV_SP ASC_CAR_SP * one + B_COST * CAR_CO [GeneralizedUtilities] 11 - exp( B_TIME [ S_TIME ] ) * TRAIN_TT 21 - exp( B_TIME [ S_TIME ] ) * SM_TT 31 - exp( B_TIME [ S_TIME ] ) * CAR_TT

29 Introduction to mixtures in discrete choice models p. 27/42 Random parameters Logit RC-norm. RC-logn ASC_CAR_SP ASC_SM_SP B_COST B_FR B_TIME S_TIME Prob(β > 0) 8.8% 0.0% χ

30 Introduction to mixtures in discrete choice models p. 28/42 Random parameters 40 Lognormal Distribution of B_TIME Normal Distribution of B_TIME Lognormal mean Normal mean MNL

31 Introduction to mixtures in discrete choice models p. 29/42 Random parameters Example with Swissmetro ASC_CAR ASC_SBB ASC_SM B_COST B_FR B_TIME Car cost 0 time Train cost freq. time Swissmetro cost freq. time B_TIME randomly distributed across the population, discrete distribution P(β time = ˆβ) = ω 1 P(β time = 0) = ω 2 = 1 ω 1

32 Introduction to mixtures in discrete choice models p. 30/42 Random parameters [DiscreteDistributions] B_TIME < B_TIME_1 ( W1 ) B_TIME_2 ( W2 ) > [LinearConstraints] W1 + W2 = 1.0

33 Introduction to mixtures in discrete choice models p. 31/42 Random parameters Logit RC-norm. RC-logn. RC-disc ASC_CAR_SP ASC_SM_SP B_COST B_FR B_TIME S_TIME W W Prob(β > 0) 8.8% 0.0% 0.0% χ

34 Introduction to mixtures in discrete choice models p. 32/42 Latent classes Latent classes capture unobserved heterogeneity They can represent different: Choice sets Decision protocols Tastes Model structures etc.

35 Introduction to mixtures in discrete choice models p. 33/42 Latent classes P(i) = S Λ(i s)q(s) s=1 Λ(i s) is the class-specific choice model probability of choosing i given that the individual belongs to class s Q(s) is the class membership model probability of belonging to class s

36 Introduction to mixtures in discrete choice models p. 34/42 Example: residential location Hypothesis Lifestyle preferences exist (e.g., suburb vs. urban) Lifestyle differences lead to differences in considerations, criterion, and preferences for residential location choices Infer lifestyle preferences from choice behavior using latent class choice model Latent classes = lifestyle Choice model = location decisions

37 Example: residential location Introduction to mixtures in discrete choice models p. 35/42

38 Introduction to mixtures in discrete choice models p. 36/42 Latent lifestyle segmentation Class 1 Class 2 Class 3 Suburban, school, Transit, school, High density, urban auto affluent, less affluent, activity, older, more established younger families non-family, professionals families

39 Introduction to mixtures in discrete choice models p. 37/42 Summary Logit mixtures models Computationally more complex than MEV Allow for more flexibility than MEV Continuous mixtures: alternative specific variance, nesting structures, random parameters P(i) = ξ Λ(i ξ)f(ξ)dξ Discrete mixtures: well-defined latent classes of decision makers S P(i) = Λ(i s)q(s). s=1

40 Introduction to mixtures in discrete choice models p. 38/42 Tips for applications Be careful: simulation can mask specification and identification issues Do not forget about the systematic portion

Swissmetro. March 3, Model Specification with Generic Attributes

Swissmetro. March 3, Model Specification with Generic Attributes Swissmetro March 3, 2015 Model Specification with Generic Attributes Model file: MNL_SM_generic.mod The dataset consists of survey data collected on the trains between St. Gallen and Geneva in Switzerland.

More information

Calculating indicators with Biogeme

Calculating indicators with Biogeme Calculating indicators with Biogeme Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale de Lausanne September 25,

More information

The estimation of multivariate extreme value models from choice-based samples

The estimation of multivariate extreme value models from choice-based samples The estimation of multivariate extreme value models from choice-based samples M. Bierlaire, D. Bolduc and D. McFadden Transport and Mobility Laboratory, EPFL Département d économique, Université Laval,

More information

Estimation of mixed generalized extreme value models

Estimation of mixed generalized extreme value models Estimation of mixed generalized extreme value models Michel Bierlaire michel.bierlaire@epfl.ch Operations Research Group ROSO Institute of Mathematics EPFL Katholieke Universiteit Leuven, November 2004

More information

Capturing Correlation in Route Choice Models using Subnetworks

Capturing Correlation in Route Choice Models using Subnetworks Capturing Correlation in Route Choice Models using Subnetworks Emma Frejinger and Michel Bierlaire Transport and Mobility Laboratory (TRANSP-OR), EPFL Capturing Correlation with Subnetworks in Route Choice

More information

Binary choice. Michel Bierlaire

Binary choice. Michel Bierlaire Binary choice Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale de Lausanne M. Bierlaire (TRANSP-OR ENAC EPFL)

More information

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1 Tetsuya Matsubayashi University of North Texas November 2, 2010 Random utility model Multinomial logit model Conditional logit model

More information

The 17 th Behavior Modeling Summer School

The 17 th Behavior Modeling Summer School The 17 th Behavior Modeling Summer School September 14-16, 2017 Introduction to Discrete Choice Models Giancarlos Troncoso Parady Assistant Professor Urban Transportation Research Unit Department of Urban

More information

Lecture-20: Discrete Choice Modeling-I

Lecture-20: Discrete Choice Modeling-I Lecture-20: Discrete Choice Modeling-I 1 In Today s Class Introduction to discrete choice models General formulation Binary choice models Specification Model estimation Application Case Study 2 Discrete

More information

The estimation of Generalized Extreme Value models from choice-based samples

The estimation of Generalized Extreme Value models from choice-based samples The estimation of Generalized Extreme Value models from choice-based samples M. Bierlaire D. Bolduc D. McFadden August 10, 2006 Abstract We consider an estimation procedure for discrete choice models in

More information

INTRODUCTION TO TRANSPORTATION SYSTEMS

INTRODUCTION TO TRANSPORTATION SYSTEMS INTRODUCTION TO TRANSPORTATION SYSTEMS Lectures 5/6: Modeling/Equilibrium/Demand 1 OUTLINE 1. Conceptual view of TSA 2. Models: different roles and different types 3. Equilibrium 4. Demand Modeling References:

More information

Estimating Single-Agent Dynamic Models

Estimating Single-Agent Dynamic Models Estimating Single-Agent Dynamic Models Paul T. Scott Empirical IO Fall, 2013 1 / 49 Why are dynamics important? The motivation for using dynamics is usually external validity: we want to simulate counterfactuals

More information

Statistical Tests. Matthieu de Lapparent

Statistical Tests. Matthieu de Lapparent Statistical Tests Matthieu de Lapparent matthieu.delapparent@epfl.ch Transport and Mobility Laboratory, School of Architecture, Civil and Environmental Engineering, Ecole Polytechnique Fédérale de Lausanne

More information

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo Lecture 1 Behavioral Models Multinomial Logit: Power and limitations Cinzia Cirillo 1 Overview 1. Choice Probabilities 2. Power and Limitations of Logit 1. Taste variation 2. Substitution patterns 3. Repeated

More information

Discrete panel data. Michel Bierlaire

Discrete panel data. Michel Bierlaire Discrete panel data Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale de Lausanne M. Bierlaire (TRANSP-OR ENAC

More information

Masking Identification of Discrete Choice Models under Simulation Methods *

Masking Identification of Discrete Choice Models under Simulation Methods * Masking Identification of Discrete Choice Models under Simulation Methods * Lesley Chiou 1 and Joan L. Walker 2 Abstract We present examples based on actual and synthetic datasets to illustrate how simulation

More information

Normalization and correlation of cross-nested logit models

Normalization and correlation of cross-nested logit models Normalization and correlation of cross-nested logit models E. Abbe, M. Bierlaire, T. Toledo Lab. for Information and Decision Systems, Massachusetts Institute of Technology Inst. of Mathematics, Ecole

More information

Practical note on specification of discrete choice model. Toshiyuki Yamamoto Nagoya University, Japan

Practical note on specification of discrete choice model. Toshiyuki Yamamoto Nagoya University, Japan Practical note on specification of discrete choice model Toshiyuki Yamamoto Nagoya University, Japan 1 Contents Comparison between bary logit model and bary probit model Comparison between multomial logit

More information

Choice Theory. Matthieu de Lapparent

Choice Theory. Matthieu de Lapparent Choice Theory Matthieu de Lapparent matthieu.delapparent@epfl.ch Transport and Mobility Laboratory, School of Architecture, Civil and Environmental Engineering, Ecole Polytechnique Fédérale de Lausanne

More information

A Joint Tour-Based Model of Vehicle Type Choice and Tour Length

A Joint Tour-Based Model of Vehicle Type Choice and Tour Length A Joint Tour-Based Model of Vehicle Type Choice and Tour Length Ram M. Pendyala School of Sustainable Engineering & the Built Environment Arizona State University Tempe, AZ Northwestern University, Evanston,

More information

Lecture 6: Discrete Choice: Qualitative Response

Lecture 6: Discrete Choice: Qualitative Response Lecture 6: Instructor: Department of Economics Stanford University 2011 Types of Discrete Choice Models Univariate Models Binary: Linear; Probit; Logit; Arctan, etc. Multinomial: Logit; Nested Logit; GEV;

More information

Route choice models: Introduction and recent developments

Route choice models: Introduction and recent developments Route choice models: Introduction and recent developments Michel Bierlaire transp-or.epfl.ch Transport and Mobility Laboratory, EPFL Route choice models: Introduction and recent developments p.1/40 Route

More information

Estimating Single-Agent Dynamic Models

Estimating Single-Agent Dynamic Models Estimating Single-Agent Dynamic Models Paul T. Scott New York University Empirical IO Course Fall 2016 1 / 34 Introduction Why dynamic estimation? External validity Famous example: Hendel and Nevo s (2006)

More information

An Overview of Choice Models

An Overview of Choice Models An Overview of Choice Models Dilan Görür Gatsby Computational Neuroscience Unit University College London May 08, 2009 Machine Learning II 1 / 31 Outline 1 Overview Terminology and Notation Economic vs

More information

Hiroshima University 2) The University of Tokyo

Hiroshima University 2) The University of Tokyo September 25-27, 2015 The 14th Summer course for Behavior Modeling in Transportation Networks @The University of Tokyo Advanced behavior models Recent development of discrete choice models Makoto Chikaraishi

More information

Exploratory analysis of endogeneity in discrete choice models. Anna Fernández Antolín Amanda Stathopoulos Michel Bierlaire

Exploratory analysis of endogeneity in discrete choice models. Anna Fernández Antolín Amanda Stathopoulos Michel Bierlaire Exploratory analysis of endogeneity in discrete choice models Anna Fernández Antolín Amanda Stathopoulos Michel Bierlaire Transportation and Mobility Laboratory, ENAC, EPFL April 2014 Transportation and

More information

Lecture 10 Demand for Autos (BLP) Bronwyn H. Hall Economics 220C, UC Berkeley Spring 2005

Lecture 10 Demand for Autos (BLP) Bronwyn H. Hall Economics 220C, UC Berkeley Spring 2005 Lecture 10 Demand for Autos (BLP) Bronwyn H. Hall Economics 220C, UC Berkeley Spring 2005 Outline BLP Spring 2005 Economics 220C 2 Why autos? Important industry studies of price indices and new goods (Court,

More information

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2. Recap: MNL. Recap: MNL

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2. Recap: MNL. Recap: MNL Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2 Tetsuya Matsubayashi University of North Texas November 9, 2010 Learn multiple responses models that do not require the assumption

More information

Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data. Fred Mannering University of South Florida

Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data. Fred Mannering University of South Florida Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data Fred Mannering University of South Florida Highway Accidents Cost the lives of 1.25 million people per year Leading cause

More information

July 31, 2009 / Ben Kedem Symposium

July 31, 2009 / Ben Kedem Symposium ing The s ing The Department of Statistics North Carolina State University July 31, 2009 / Ben Kedem Symposium Outline ing The s 1 2 s 3 4 5 Ben Kedem ing The s Ben has made many contributions to time

More information

Are You The Beatles Or The Rolling Stones?

Are You The Beatles Or The Rolling Stones? N. Ortelli, G. Lederrey, A. Alahi, M. Bierlaire Transport and Mobility Laboratory School of Architecture, Civil, and Environmental Engineering ENAC École Polytechnique Fédérale de Lausanne 13th Workshop

More information

Lecture 14 More on structural estimation

Lecture 14 More on structural estimation Lecture 14 More on structural estimation Economics 8379 George Washington University Instructor: Prof. Ben Williams traditional MLE and GMM MLE requires a full specification of a model for the distribution

More information

Development and estimation of a semicompensatory model with a flexible error structure

Development and estimation of a semicompensatory model with a flexible error structure Development and estimation of a semicompensatory model with a flexible error structure Sigal Kaplan, Shlomo Bekhor, Yoram Shiftan Transportation Research Institute, Technion Workshop on Discrete Choice

More information

Gibbs Sampling in Endogenous Variables Models

Gibbs Sampling in Endogenous Variables Models Gibbs Sampling in Endogenous Variables Models Econ 690 Purdue University Outline 1 Motivation 2 Identification Issues 3 Posterior Simulation #1 4 Posterior Simulation #2 Motivation In this lecture we take

More information

Some Customers Would Rather Leave Without Saying Goodbye

Some Customers Would Rather Leave Without Saying Goodbye Some Customers Would Rather Leave Without Saying Goodbye Eva Ascarza, Oded Netzer, and Bruce G. S. Hardie Web Appendix A: Model Estimation In this appendix we describe the hierarchical Bayesian framework

More information

6 Mixed Logit. 6.1 Choice Probabilities

6 Mixed Logit. 6.1 Choice Probabilities 6 Mixed Logit 6.1 Choice Probabilities Mixed logit is a highly flexible model that can approximate any random utility model (McFadden & Train, 2000). It obviates the three limitations of standard logit

More information

Part I Behavioral Models

Part I Behavioral Models Part I Behavioral Models 2 Properties of Discrete Choice Models 2.1 Overview This chapter describes the features that are common to all discrete choice models. We start by discussing the choice set, which

More information

Model Selection and Geometry

Model Selection and Geometry Model Selection and Geometry Pascal Massart Université Paris-Sud, Orsay Leipzig, February Purpose of the talk! Concentration of measure plays a fundamental role in the theory of model selection! Model

More information

Limited Dependent Variables and Panel Data

Limited Dependent Variables and Panel Data and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement

More information

Random taste heterogeneity in discrete choice models: Flexible nonparametric finite mixture distributions

Random taste heterogeneity in discrete choice models: Flexible nonparametric finite mixture distributions Random taste heterogeneity in discrete choice models: Flexible nonparametric finite mixture distributions Transportation Research Part B: Methodological, Volume 106, December 2017, Pages 76-101 Akshay

More information

BIOGEME: a free package for the estimation of discrete choice models

BIOGEME: a free package for the estimation of discrete choice models BIOGEME: a free package for the estimation of discrete choice models Michel Bierlaire, EPFL Conference paper STRC 2003 STRC 3rd Swiss Transport Research Conference Monte Verita, Ascona, March 2003 2 BIOGEME:

More information

Integrating advanced discrete choice models in mixed integer linear optimization

Integrating advanced discrete choice models in mixed integer linear optimization Integrating advanced discrete choice models in mixed integer linear optimization Meritxell Pacheco Shadi Sharif Azadeh, Michel Bierlaire, Bernard Gendron Transport and Mobility Laboratory (TRANSP-OR) École

More information

STAT 535 Lecture 5 November, 2018 Brief overview of Model Selection and Regularization c Marina Meilă

STAT 535 Lecture 5 November, 2018 Brief overview of Model Selection and Regularization c Marina Meilă STAT 535 Lecture 5 November, 2018 Brief overview of Model Selection and Regularization c Marina Meilă mmp@stat.washington.edu Reading: Murphy: BIC, AIC 8.4.2 (pp 255), SRM 6.5 (pp 204) Hastie, Tibshirani

More information

Introduction to Discrete Choice Models

Introduction to Discrete Choice Models Chapter 7 Introduction to Dcrete Choice Models 7.1 Introduction It has been mentioned that the conventional selection bias model requires estimation of two structural models, namely the selection model

More information

Hypothesis Testing. Econ 690. Purdue University. Justin L. Tobias (Purdue) Testing 1 / 33

Hypothesis Testing. Econ 690. Purdue University. Justin L. Tobias (Purdue) Testing 1 / 33 Hypothesis Testing Econ 690 Purdue University Justin L. Tobias (Purdue) Testing 1 / 33 Outline 1 Basic Testing Framework 2 Testing with HPD intervals 3 Example 4 Savage Dickey Density Ratio 5 Bartlett

More information

Standard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j

Standard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j Standard Errors & Confidence Intervals β β asy N(0, I( β) 1 ), where I( β) = [ 2 l(β, φ; y) ] β i β β= β j We can obtain asymptotic 100(1 α)% confidence intervals for β j using: β j ± Z 1 α/2 se( β j )

More information

Incorporating Spatial Dependencies in Random Parameter Discrete Choice Models

Incorporating Spatial Dependencies in Random Parameter Discrete Choice Models Incorporating Spatial Dependencies in Random Parameter Discrete Choice Models Abolfazl (Kouros) Mohammadian Department of Civil and Materials Engineering University of Illinois at Chicago 842 W. Taylor

More information

Variance reduction. Michel Bierlaire. Transport and Mobility Laboratory. Variance reduction p. 1/18

Variance reduction. Michel Bierlaire. Transport and Mobility Laboratory. Variance reduction p. 1/18 Variance reduction p. 1/18 Variance reduction Michel Bierlaire michel.bierlaire@epfl.ch Transport and Mobility Laboratory Variance reduction p. 2/18 Example Use simulation to compute I = 1 0 e x dx We

More information

Pattern Recognition. Parameter Estimation of Probability Density Functions

Pattern Recognition. Parameter Estimation of Probability Density Functions Pattern Recognition Parameter Estimation of Probability Density Functions Classification Problem (Review) The classification problem is to assign an arbitrary feature vector x F to one of c classes. The

More information

Logit kernel (or mixed logit) models for large multidimensional choice problems: identification and estimation

Logit kernel (or mixed logit) models for large multidimensional choice problems: identification and estimation Logit kernel (or mixed logit) models for large multidimensional choice problems: identification and estimation John L. Bowman, Ph. D., Research Affiliate, Massachusetts Institute of Technology, 5 Beals

More information

Sparse Kernel Density Estimation Technique Based on Zero-Norm Constraint

Sparse Kernel Density Estimation Technique Based on Zero-Norm Constraint Sparse Kernel Density Estimation Technique Based on Zero-Norm Constraint Xia Hong 1, Sheng Chen 2, Chris J. Harris 2 1 School of Systems Engineering University of Reading, Reading RG6 6AY, UK E-mail: x.hong@reading.ac.uk

More information

A Bayesian Probit Model with Spatial Dependencies

A Bayesian Probit Model with Spatial Dependencies A Bayesian Probit Model with Spatial Dependencies Tony E. Smith Department of Systems Engineering University of Pennsylvania Philadephia, PA 19104 email: tesmith@ssc.upenn.edu James P. LeSage Department

More information

Hiroshima University 2) Tokyo University of Science

Hiroshima University 2) Tokyo University of Science September 23-25, 2016 The 15th Summer course for Behavior Modeling in Transportation Networks @The University of Tokyo Advanced behavior models Recent development of discrete choice models Makoto Chikaraishi

More information

Joint Modeling of Longitudinal Item Response Data and Survival

Joint Modeling of Longitudinal Item Response Data and Survival Joint Modeling of Longitudinal Item Response Data and Survival Jean-Paul Fox University of Twente Department of Research Methodology, Measurement and Data Analysis Faculty of Behavioural Sciences Enschede,

More information

A sampling of alternatives approach for route choice models

A sampling of alternatives approach for route choice models Seminar, Namur, February 2009, A sampling of alternatives approach for route choice models p. 1/34 A sampling of alternatives approach for route choice models Emma Frejinger Centre for Transport Studies,

More information

Normalization and correlation of cross-nested logit models

Normalization and correlation of cross-nested logit models Normalization and correlation of cross-nested logit models E. Abbe M. Bierlaire T. Toledo August 5, 25 Abstract We address two fundamental issues associated with the use of cross-nested logit models. On

More information

(Behavioral Model and Duality Theory)

(Behavioral Model and Duality Theory) 2018 (Behavioral Model and Duality Theory) fukuda@plan.cv.titech.ac.jp 2 / / (Rational Inattention) 6 max u(x 1,x 2 ) x 1,x 2 s.t. p 1 x 1 + p 2 x 2 = I u = V (p 1,p 2,I) I = M(p 1,p 2, ū) x H j (p 1,p

More information

I. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as

I. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as Econ 513, USC, Fall 2005 Lecture 15 Discrete Response Models: Multinomial, Conditional and Nested Logit Models Here we focus again on models for discrete choice with more than two outcomes We assume that

More information

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns

More information

Machine Learning. Gaussian Mixture Models. Zhiyao Duan & Bryan Pardo, Machine Learning: EECS 349 Fall

Machine Learning. Gaussian Mixture Models. Zhiyao Duan & Bryan Pardo, Machine Learning: EECS 349 Fall Machine Learning Gaussian Mixture Models Zhiyao Duan & Bryan Pardo, Machine Learning: EECS 349 Fall 2012 1 The Generative Model POV We think of the data as being generated from some process. We assume

More information

Riemann Manifold Methods in Bayesian Statistics

Riemann Manifold Methods in Bayesian Statistics Ricardo Ehlers ehlers@icmc.usp.br Applied Maths and Stats University of São Paulo, Brazil Working Group in Statistical Learning University College Dublin September 2015 Bayesian inference is based on Bayes

More information

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

More information

A Lagrangian relaxation method for solving choice-based mixed linear optimization models that integrate supply and demand interactions

A Lagrangian relaxation method for solving choice-based mixed linear optimization models that integrate supply and demand interactions A Lagrangian relaxation method for solving choice-based mixed linear optimization models that integrate supply and demand interactions Meritxell Pacheco Shadi Sharif Azadeh, Michel Bierlaire, Bernard Gendron

More information

Quantifying Stochastic Model Errors via Robust Optimization

Quantifying Stochastic Model Errors via Robust Optimization Quantifying Stochastic Model Errors via Robust Optimization IPAM Workshop on Uncertainty Quantification for Multiscale Stochastic Systems and Applications Jan 19, 2016 Henry Lam Industrial & Operations

More information

Expectation Maximization

Expectation Maximization Expectation Maximization Bishop PRML Ch. 9 Alireza Ghane c Ghane/Mori 4 6 8 4 6 8 4 6 8 4 6 8 5 5 5 5 5 5 4 6 8 4 4 6 8 4 5 5 5 5 5 5 µ, Σ) α f Learningscale is slightly Parameters is slightly larger larger

More information

Bayes methods for categorical data. April 25, 2017

Bayes methods for categorical data. April 25, 2017 Bayes methods for categorical data April 25, 2017 Motivation for joint probability models Increasing interest in high-dimensional data in broad applications Focus may be on prediction, variable selection,

More information

Distributed detection of topological changes in communication networks. Riccardo Lucchese, Damiano Varagnolo, Karl H. Johansson

Distributed detection of topological changes in communication networks. Riccardo Lucchese, Damiano Varagnolo, Karl H. Johansson 1 Distributed detection of topological changes in communication networks Riccardo Lucchese, Damiano Varagnolo, Karl H. Johansson Thanks to... 2 The need: detecting changes in topological networks 3 The

More information

Divers aspects de la modélisation de la demande en transport

Divers aspects de la modélisation de la demande en transport Divers aspects de la modélisation de la demande en transport Michel Bierlaire Laboratoire Transport et Mobilité, Ecole Polytechnique Fédérale de Lausanne Divers aspects de la modélisation de la demande

More information

Estimating hybrid choice models with the new version of Biogeme

Estimating hybrid choice models with the new version of Biogeme Estimating hybrid choice models with the new version of Biogeme p. 1/34 Estimating hybrid choice models with the new version of Biogeme Michel Bierlaire transp-or.epfl.ch Transport and Mobility Laboratory,

More information

Machine Learning Techniques for Computer Vision

Machine Learning Techniques for Computer Vision Machine Learning Techniques for Computer Vision Part 2: Unsupervised Learning Microsoft Research Cambridge x 3 1 0.5 0.2 0 0.5 0.3 0 0.5 1 ECCV 2004, Prague x 2 x 1 Overview of Part 2 Mixture models EM

More information

The Network GEV model

The Network GEV model The Network GEV model Michel Bierlaire January, 2002 Discrete choice models play a major role in transportation demand analysis. Their nice and strong theoretical properties, and their flexibility to capture

More information

How Indecisiveness in Choice Behaviour affects the Magnitude of Parameter Estimates obtained in Discrete Choice Models. Abstract

How Indecisiveness in Choice Behaviour affects the Magnitude of Parameter Estimates obtained in Discrete Choice Models. Abstract How Indecisiveness in Choice Behaviour affects the Magnitude of Parameter Estimates obtained in Discrete Choice Models Abstract Parameter estimates ( βˆ ) obtained in discrete choice models are confounded

More information

Discrete Choice Models I

Discrete Choice Models I Discrete Choice Models I 1 Introduction A discrete choice model is one in which decision makers choose among a set of alternatives. 1 To fit within a discrete choice framework, the set of alternatives

More information

Calculating indicators with PythonBiogeme

Calculating indicators with PythonBiogeme Calculating indicators with PythonBiogeme Michel Bierlaire May 17, 2017 Report TRANSP-OR 170517 Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique

More information

Outline. Clustering. Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models

Outline. Clustering. Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models Collaboration with Rudolf Winter-Ebmer, Department of Economics, Johannes Kepler University

More information

A short introduc-on to discrete choice models

A short introduc-on to discrete choice models A short introduc-on to discrete choice models BART Kenneth Train, Discrete Choice Models with Simula-on, Chapter 3. Ques-ons Impact of cost, commu-ng -me, walk -me, transfer -me, number of transfers, distance

More information

Hierarchical Linear Models

Hierarchical Linear Models Hierarchical Linear Models Statistics 220 Spring 2005 Copyright c 2005 by Mark E. Irwin The linear regression model Hierarchical Linear Models y N(Xβ, Σ y ) β σ 2 p(β σ 2 ) σ 2 p(σ 2 ) can be extended

More information

Bayesian non-parametric model to longitudinally predict churn

Bayesian non-parametric model to longitudinally predict churn Bayesian non-parametric model to longitudinally predict churn Bruno Scarpa Università di Padova Conference of European Statistics Stakeholders Methodologists, Producers and Users of European Statistics

More information

A Multinomial Probit Model with Latent Factors

A Multinomial Probit Model with Latent Factors A Multinomial Probit Model with Latent Factors Identification and Interpretation without a Measurement System Rémi Piatek Department of Economics University of Copenhagen Miriam Gensowski Department of

More information

Estimation of Static Discrete Choice Models Using Market Level Data

Estimation of Static Discrete Choice Models Using Market Level Data Estimation of Static Discrete Choice Models Using Market Level Data NBER Methods Lectures Aviv Nevo Northwestern University and NBER July 2012 Data Structures Market-level data cross section/time series/panel

More information

Lecture 14. Clustering, K-means, and EM

Lecture 14. Clustering, K-means, and EM Lecture 14. Clustering, K-means, and EM Prof. Alan Yuille Spring 2014 Outline 1. Clustering 2. K-means 3. EM 1 Clustering Task: Given a set of unlabeled data D = {x 1,..., x n }, we do the following: 1.

More information

Lecture 5: Spatial probit models. James P. LeSage University of Toledo Department of Economics Toledo, OH

Lecture 5: Spatial probit models. James P. LeSage University of Toledo Department of Economics Toledo, OH Lecture 5: Spatial probit models James P. LeSage University of Toledo Department of Economics Toledo, OH 43606 jlesage@spatial-econometrics.com March 2004 1 A Bayesian spatial probit model with individual

More information

Ordered Response and Multinomial Logit Estimation

Ordered Response and Multinomial Logit Estimation Ordered Response and Multinomial Logit Estimation Quantitative Microeconomics R. Mora Department of Economics Universidad Carlos III de Madrid Outline Introduction 1 Introduction 2 3 Introduction The Ordered

More information

Posterior Regularization

Posterior Regularization Posterior Regularization 1 Introduction One of the key challenges in probabilistic structured learning, is the intractability of the posterior distribution, for fast inference. There are numerous methods

More information

STA 216, GLM, Lecture 16. October 29, 2007

STA 216, GLM, Lecture 16. October 29, 2007 STA 216, GLM, Lecture 16 October 29, 2007 Efficient Posterior Computation in Factor Models Underlying Normal Models Generalized Latent Trait Models Formulation Genetic Epidemiology Illustration Structural

More information

GARCH Models Estimation and Inference

GARCH Models Estimation and Inference GARCH Models Estimation and Inference Eduardo Rossi University of Pavia December 013 Rossi GARCH Financial Econometrics - 013 1 / 1 Likelihood function The procedure most often used in estimating θ 0 in

More information

10-701/15-781, Machine Learning: Homework 4

10-701/15-781, Machine Learning: Homework 4 10-701/15-781, Machine Learning: Homewor 4 Aarti Singh Carnegie Mellon University ˆ The assignment is due at 10:30 am beginning of class on Mon, Nov 15, 2010. ˆ Separate you answers into five parts, one

More information

Bayesian Model Diagnostics and Checking

Bayesian Model Diagnostics and Checking Earvin Balderama Quantitative Ecology Lab Department of Forestry and Environmental Resources North Carolina State University April 12, 2013 1 / 34 Introduction MCMCMC 2 / 34 Introduction MCMCMC Steps in

More information

Accounting for Complex Sample Designs via Mixture Models

Accounting for Complex Sample Designs via Mixture Models Accounting for Complex Sample Designs via Finite Normal Mixture Models 1 1 University of Michigan School of Public Health August 2009 Talk Outline 1 2 Accommodating Sampling Weights in Mixture Models 3

More information

Normalization and correlation of cross-nested logit models

Normalization and correlation of cross-nested logit models Normalization and correlation of cross-nested logit models E. Abbe M. Bierlaire T. Toledo Paper submitted to Transportation Research B Final version November 2006 Abstract We address two fundamental issues

More information

Limited Dependent Variable Models II

Limited Dependent Variable Models II Limited Dependent Variable Models II Fall 2008 Environmental Econometrics (GR03) LDV Fall 2008 1 / 15 Models with Multiple Choices The binary response model was dealing with a decision problem with two

More information

Discrete Choice Modeling

Discrete Choice Modeling [Part 6] 1/55 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent Class 11 Mixed Logit 12 Stated Preference

More information

Preferred citation style. Schüssler, N. (2009) Challenges of route choice models derived from GPS, Workshop on Discrete Choice Models, EPF

Preferred citation style. Schüssler, N. (2009) Challenges of route choice models derived from GPS, Workshop on Discrete Choice Models, EPF Preferred citation style Schüssler, N. (2009) Challenges of route choice models derived from GPS, Workshop on Discrete Choice Models, EPF Lausanne, Lausanne, August 2009. Challenges of route choice models

More information

Sparse Linear Models (10/7/13)

Sparse Linear Models (10/7/13) STA56: Probabilistic machine learning Sparse Linear Models (0/7/) Lecturer: Barbara Engelhardt Scribes: Jiaji Huang, Xin Jiang, Albert Oh Sparsity Sparsity has been a hot topic in statistics and machine

More information

39th Annual ISMS Marketing Science Conference University of Southern California, June 8, 2017

39th Annual ISMS Marketing Science Conference University of Southern California, June 8, 2017 Permuted and IROM Department, McCombs School of Business The University of Texas at Austin 39th Annual ISMS Marketing Science Conference University of Southern California, June 8, 2017 1 / 36 Joint work

More information

Outline. Overview of Issues. Spatial Regression. Luc Anselin

Outline. Overview of Issues. Spatial Regression. Luc Anselin Spatial Regression Luc Anselin University of Illinois, Urbana-Champaign http://www.spacestat.com Outline Overview of Issues Spatial Regression Specifications Space-Time Models Spatial Latent Variable Models

More information

The Generalized Multinomial Logit Model

The Generalized Multinomial Logit Model The Generalized Multinomial Logit Model By Denzil G. Fiebig University of New South Wales Michael P. Keane University of Technology Sydney Jordan Louviere University of Technology Sydney Nada Wasi University

More information

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January

More information

(I AL BL 2 )z t = (I CL)ζ t, where

(I AL BL 2 )z t = (I CL)ζ t, where ECO 513 Fall 2011 MIDTERM EXAM The exam lasts 90 minutes. Answer all three questions. (1 Consider this model: x t = 1.2x t 1.62x t 2 +.2y t 1.23y t 2 + ε t.7ε t 1.9ν t 1 (1 [ εt y t = 1.4y t 1.62y t 2

More information

Numerical Methods-Lecture XIII: Dynamic Discrete Choice Estimation

Numerical Methods-Lecture XIII: Dynamic Discrete Choice Estimation Numerical Methods-Lecture XIII: Dynamic Discrete Choice Estimation (See Rust 1989) Trevor Gallen Fall 2015 1 / 1 Introduction Rust (1989) wrote a paper about Harold Zurcher s decisions as the superintendent

More information