Binary choice. Michel Bierlaire
1 Binary choice Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale de Lausanne M. Bierlaire (TRANSP-OR ENAC EPFL) Binary choice 1 / 64
2 Outline
1 Model specification (Error term)
2 Applying the model
3 Maximum likelihood estimation
4 Output of the estimation (Summary statistics)
5 Back to the scale
6 Appendix
3 Simple example
Choice between Auto and Transit
[Table: 21 observations in two side-by-side blocks, with columns #, Time auto, Time transit, Choice (C = car, T = transit). The travel times are missing from this transcript; the Choice entries read T T T C C C T T T C C C C T T C T C T T C.]
5 Model specification: Binary choice model
Specification of the utilities
$$U_C = \beta_1 T_C + \varepsilon_C, \qquad U_T = \beta_1 T_T + \varepsilon_T$$
where $T_C$ is the travel time with car (min) and $T_T$ the travel time with transit (min).
Choice model
$$P(C \mid \{C,T\}) = \Pr(U_C \geq U_T) = \Pr(\beta_1 T_C + \varepsilon_C \geq \beta_1 T_T + \varepsilon_T) = \Pr(\beta_1 (T_C - T_T) \geq \varepsilon_T - \varepsilon_C) = \Pr(\varepsilon \leq \beta_1 (T_C - T_T))$$
where $\varepsilon = \varepsilon_T - \varepsilon_C$.
6 Model specification: Error term
Three assumptions about the random variables $\varepsilon_T$ and $\varepsilon_C$
1 What is their mean?
2 What is their variance?
3 What is their distribution?
Note: for binary choice, it would be sufficient to make assumptions about $\varepsilon = \varepsilon_T - \varepsilon_C$. But we want to generalize later on.
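7 Model specification: Error term: The mean
Change of variables
Define $E[\varepsilon_C] = \beta_C$ and $E[\varepsilon_T] = \beta_T$. Define $\varepsilon'_C = \varepsilon_C - \beta_C$ and $\varepsilon'_T = \varepsilon_T - \beta_T$, so that $E[\varepsilon'_C] = E[\varepsilon'_T] = 0$.
Choice model
$$P(C \mid \{C,T\}) = \Pr(\beta_1 (T_C - T_T) \geq \varepsilon_T - \varepsilon_C)$$
$$= \Pr(\beta_1 (T_C - T_T) \geq \varepsilon'_T + \beta_T - \varepsilon'_C - \beta_C)$$
$$= \Pr(\beta_1 (T_C - T_T) + (\beta_C - \beta_T) \geq \varepsilon'_T - \varepsilon'_C)$$
$$= \Pr(\beta_1 (T_C - T_T) + \beta_0 \geq \varepsilon')$$
where $\beta_0 = \beta_C - \beta_T$ and $\varepsilon' = \varepsilon'_T - \varepsilon'_C$.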
8 Model specification: Error term: The mean
Specification
- The means of the error terms can be included as parameters of the deterministic part.
- Only the mean of the difference of the error terms is identified.
Alternative Specific Constant
Equivalent specifications:
$$U_C = \beta_1 T_C + \varepsilon_C, \quad U_T = \beta_1 T_T + \beta_T + \varepsilon_T$$
or
$$U_C = \beta_1 T_C + \beta_C + \varepsilon_C, \quad U_T = \beta_1 T_T + \varepsilon_T$$
In practice: associate an alternative specific constant with all alternatives but one.
9 Model specification: Error term: The mean
Note: adding the same constant to all utility functions does not affect the choice model
$$\Pr(U_C \geq U_T) = \Pr(U_C + K \geq U_T + K) \quad \forall K \in \mathbb{R}.$$
The bottom line: if the deterministic part of the utility functions contains an Alternative Specific Constant (ASC) for all alternatives but one, the mean of the error terms can be assumed to be zero without loss of generality.
10 Model specification: Error term: The variance
Utility is ordinal
Utilities can be scaled up or down without changing the choice probability
$$\Pr(U_C \geq U_T) = \Pr(\alpha U_C \geq \alpha U_T) \quad \forall \alpha > 0$$
Link with the variance
$$\mathrm{Var}(\alpha U_C) = \alpha^2 \mathrm{Var}(U_C), \qquad \mathrm{Var}(\alpha U_T) = \alpha^2 \mathrm{Var}(U_T)$$
Variance is not identified
- As any $\alpha$ can be selected arbitrarily, any variance can be assumed.
- There is no way to identify the variance of the error terms from data.
- The scale has to be arbitrarily decided.
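The two invariance properties (adding a constant $K$ to both utilities, multiplying both by $\alpha > 0$) can be checked numerically. The sketch below uses hypothetical utility draws and arbitrary values of `K` and `alpha`; none of these numbers come from the slides.

```python
import random

def chooses_car(u_car, u_transit):
    """Utility-maximizing choice: car is chosen when U_C >= U_T."""
    return u_car >= u_transit

random.seed(0)
# Hypothetical utility pairs (U_C, U_T), for illustration only.
draws = [(random.gauss(0, 1), random.gauss(0, 1)) for _ in range(1000)]

K = 3.7       # arbitrary constant added to both utilities
alpha = 2.5   # arbitrary positive scale factor

baseline = [chooses_car(uc, ut) for uc, ut in draws]
shifted = [chooses_car(uc + K, ut + K) for uc, ut in draws]
scaled = [chooses_car(alpha * uc, alpha * ut) for uc, ut in draws]

# Both comparisons are expected to hold: the choices do not change.
print(baseline == shifted, baseline == scaled)
```

Because the choices are identical, neither the common level nor the scale of the utilities can be recovered from observed choices, which is why both must be fixed by convention.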
11 Model specification: Error term: The distribution
Assumption 1: $\varepsilon_T$ and $\varepsilon_C$ are the sum of many random variables capturing unobservable attributes (e.g. mood, experience), measurement and specification errors.
Central-limit theorem: the sum of many i.i.d. random variables approximately follows a normal distribution $N(\mu, \sigma^2)$.
Assumed distribution: $\varepsilon_C \sim N(0,1)$, $\varepsilon_T \sim N(0,1)$.
12 Model specification: Error term: The Normal distribution $N(\mu, \sigma^2)$
Probability density function (pdf)
$$f(t) = \frac{1}{\sigma\sqrt{2\pi}} e^{-\frac{(t-\mu)^2}{2\sigma^2}}$$
Cumulative distribution function (CDF)
$$P(c \geq \varepsilon) = F(c) = \int_{-\infty}^{c} f(t)\,dt$$
No closed form.
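13 Model specification: Error term: The distribution
$\varepsilon = \varepsilon_T - \varepsilon_C$
From the properties of the normal distribution, we have
$$\varepsilon_C \sim N(0,1), \quad \varepsilon_T \sim N(0,1) \quad \Rightarrow \quad \varepsilon = \varepsilon_T - \varepsilon_C \sim N(0,2)$$
As the variance is arbitrary, we may also assume
$$\varepsilon_C \sim N(0,0.5), \quad \varepsilon_T \sim N(0,0.5) \quad \Rightarrow \quad \varepsilon = \varepsilon_T - \varepsilon_C \sim N(0,1)$$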
14 Model specification: Error term: The binary probit model
Choice model
$$P(C \mid \{C,T\}) = \Pr(\beta_1 (T_C - T_T) + \beta_0 \geq \varepsilon) = F_\varepsilon(\beta_1 (T_C - T_T) + \beta_0)$$
The binary probit model
$$P(C \mid \{C,T\}) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\beta_1 (T_C - T_T) + \beta_0} e^{-\frac{1}{2}t^2}\,dt$$
Not a closed form expression.
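Although the probit probability has no closed form, it is easy to evaluate numerically through the error function. The following is a minimal sketch assuming $\varepsilon \sim N(0,1)$; the function names and the numerical inputs are hypothetical and chosen only for illustration.

```python
import math

def standard_normal_cdf(x):
    """CDF of N(0, 1), expressed with the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def probit_prob_car(t_car, t_transit, beta0, beta1):
    """P(C | {C, T}) = F(beta1 * (T_C - T_T) + beta0) with eps ~ N(0, 1)."""
    return standard_normal_cdf(beta1 * (t_car - t_transit) + beta0)

# Hypothetical inputs: car is 10 minutes faster, beta0 = 0, beta1 = -0.1,
# so the argument of the CDF is -0.1 * (20 - 30) = 1.
p_car = probit_prob_car(20.0, 30.0, beta0=0.0, beta1=-0.1)
print(round(p_car, 4))
```

With equal travel times and $\beta_0 = 0$ the function returns 0.5, as the symmetry of the normal distribution requires.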
15 Model specification: Error term: The distribution
Assumption 2: $\varepsilon_T$ and $\varepsilon_C$ are the maximum of many random variables capturing unobservable attributes (e.g. mood, experience), measurement and specification errors.
Gumbel theorem: the maximum of many i.i.d. random variables approximately follows an Extreme Value distribution $EV(\eta, \mu)$.
Assumed distribution: $\varepsilon_C \sim EV(0,1)$, $\varepsilon_T \sim EV(0,1)$.
16 Model specification: Error term: The Extreme Value distribution $EV(\eta, \mu)$
Probability density function (pdf)
$$f(t) = \mu e^{-\mu(t-\eta)} e^{-e^{-\mu(t-\eta)}}$$
Cumulative distribution function (CDF)
$$P(c \geq \varepsilon) = F(c) = \int_{-\infty}^{c} f(t)\,dt = e^{-e^{-\mu(c-\eta)}}$$
17 Model specification: Error term: The Extreme Value distribution
[Figures: pdf of EV(0,1); CDF of EV(0,1).]
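18 Model specification: Error term: The Extreme Value distribution
Properties: if $\varepsilon \sim EV(\eta, \mu)$ then
$$E[\varepsilon] = \eta + \frac{\gamma}{\mu} \quad \text{and} \quad \mathrm{Var}[\varepsilon] = \frac{\pi^2}{6\mu^2}$$
where $\gamma$ is Euler's constant.
Euler's constant
$$\gamma = \lim_{k \to \infty} \left( \sum_{i=1}^{k} \frac{1}{i} - \ln k \right) = -\int_0^\infty e^{-x} \ln x \, dx$$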
19 Model specification: Error term: The distribution
$\varepsilon = \varepsilon_T - \varepsilon_C$
From the properties of the extreme value distribution, we have
$$\varepsilon_C \sim EV(0,1), \quad \varepsilon_T \sim EV(0,1) \quad \Rightarrow \quad \varepsilon \sim \mathrm{Logistic}(0,1)$$
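The claim that the difference of two independent EV(0,1) variables is Logistic(0,1) can be illustrated by simulation. This is a sketch, not a proof: the draws use inversion of the EV CDF $F(c) = e^{-e^{-\mu(c-\eta)}}$, and the seed, sample size, and evaluation points are arbitrary choices.

```python
import math
import random

def draw_ev(rng, eta=0.0, mu=1.0):
    """Draw from EV(eta, mu) by inverting F(c) = exp(-exp(-mu * (c - eta)))."""
    u = rng.random()
    while u == 0.0:  # avoid log(0)
        u = rng.random()
    return eta - math.log(-math.log(u)) / mu

def logistic_cdf(c, eta=0.0, mu=1.0):
    """CDF of Logistic(eta, mu)."""
    return 1.0 / (1.0 + math.exp(-mu * (c - eta)))

rng = random.Random(12345)
n_draws = 200_000
# eps = eps_T - eps_C, with both terms drawn independently from EV(0, 1).
diffs = [draw_ev(rng) - draw_ev(rng) for _ in range(n_draws)]

def empirical_cdf(c):
    """Share of simulated differences that are at most c."""
    return sum(d <= c for d in diffs) / n_draws

for c in (-1.0, 0.0, 1.5):
    print(c, round(empirical_cdf(c), 3), round(logistic_cdf(c), 3))
```

The empirical and theoretical CDF values should agree up to simulation noise, which is of the order of 0.001 for this sample size.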
20 Model specification: Error term: The Logistic distribution $\mathrm{Logistic}(\eta, \mu)$
Probability density function (pdf)
$$f(t) = \frac{\mu e^{-\mu(t-\eta)}}{\left(1 + e^{-\mu(t-\eta)}\right)^2}$$
Cumulative distribution function (CDF)
$$P(c \geq \varepsilon) = F(c) = \int_{-\infty}^{c} f(t)\,dt = \frac{1}{1 + e^{-\mu(c-\eta)}}$$
with $\mu > 0$.
21 Model specification: Error term: The binary logit model
Choice model
$$P(C \mid \{C,T\}) = \Pr(\beta_1 (T_C - T_T) + \beta_0 \geq \varepsilon) = F_\varepsilon(\beta_1 (T_C - T_T) + \beta_0)$$
The binary logit model
$$P(C \mid \{C,T\}) = \frac{1}{1 + e^{-(\beta_1 (T_C - T_T) + \beta_0)}} = \frac{e^{\beta_1 T_C + \beta_0}}{e^{\beta_1 T_C + \beta_0} + e^{\beta_1 T_T}}$$
$$P(C \mid \{C,T\}) = \frac{e^{V_C}}{e^{V_C} + e^{V_T}}$$
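The two ways of writing the binary logit probability (ratio of exponentials, or logistic CDF of the utility difference) can be coded side by side. The sketch below is illustrative; the function names and the utility values are hypothetical.

```python
import math

def logit_prob_car(v_car, v_transit):
    """P(C | {C, T}) = e^{V_C} / (e^{V_C} + e^{V_T}).

    The largest utility is subtracted first so that exp() cannot overflow.
    """
    m = max(v_car, v_transit)
    e_car = math.exp(v_car - m)
    e_transit = math.exp(v_transit - m)
    return e_car / (e_car + e_transit)

def logit_prob_car_from_difference(v_car, v_transit):
    """Same probability written as the logistic CDF of V_C - V_T."""
    return 1.0 / (1.0 + math.exp(-(v_car - v_transit)))

# Hypothetical deterministic utilities.
v_c, v_t = -1.2, -0.7
print(logit_prob_car(v_c, v_t), logit_prob_car_from_difference(v_c, v_t))
```

Only the difference $V_C - V_T$ matters, which is the logit counterpart of the identification results on the mean of the error terms.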
23 Applying the model: Back to the example
Choice between Auto and Transit
[Table: the 21 observations of the simple example, with columns #, Time auto, Time transit, Choice (C = car, T = transit). The travel times are missing from this transcript.]
24 Applying the model: First individual
Parameters: let's assume that $\beta_0 = 0.5$ and $\beta_1 = -0.1$.
Variables: consider the first observation, $T_{C1} = 52.9$ and $T_{T1} = 4.4$. Choice = transit: $y_{auto,1} = 0$, $y_{transit,1} = 1$.
Choice: what is the probability given by the model that this individual indeed chooses transit?
25 Applying the model: First individual
Utility functions
$$V_{C1} = \beta_1 T_{C1} = -5.29, \qquad V_{T1} = \beta_1 T_{T1} + \beta_0 = 0.06$$
Choice model
$$P_1(\text{transit}) = \frac{e^{V_{T1}}}{e^{V_{T1}} + e^{V_{C1}}} = \frac{e^{0.06}}{e^{0.06} + e^{-5.29}} \approx 1$$
Comments
- The model fits the observation very well.
- Consistent with the assumption that travel time is the only explanatory variable.
26 Applying the model: Second individual
Parameters: let's assume that $\beta_0 = 0.5$ and $\beta_1 = -0.1$.
Variables: $T_{C2} = 4.1$ and $T_{T2} = 28.5$. Choice = transit: $y_{auto,2} = 0$, $y_{transit,2} = 1$.
Choice: what is the probability given by the model that this individual indeed chooses transit?
27 Applying the model: Second individual
Utility functions
$$V_{C2} = \beta_1 T_{C2} = -0.41, \qquad V_{T2} = \beta_1 T_{T2} + \beta_0 = -2.35$$
Choice model
$$P_2(\text{transit}) = \frac{e^{V_{T2}}}{e^{V_{T2}} + e^{V_{C2}}} = \frac{e^{-2.35}}{e^{-2.35} + e^{-0.41}} = 0.13$$
Comments
- The model poorly fits the observation.
- But the assumption is that travel time is the only explanatory variable.
- Still, the probability is not small.
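The two worked examples can be reproduced in a few lines. The sketch assumes $\beta_0 = 0.5$ and $\beta_1 = -0.1$: the negative sign of $\beta_1$ is the one that reproduces the reported $V_{T1} = 0.06$ and the reported probability 0.13 for the second individual. The constant is attached to transit, as in the utility functions of the example.

```python
import math

BETA0 = 0.5    # alternative specific constant, attached to transit
BETA1 = -0.1   # travel time coefficient (negative sign: assumption, see lead-in)

def prob_transit(t_car, t_transit):
    """Binary logit probability of transit for given travel times (min)."""
    v_car = BETA1 * t_car
    v_transit = BETA1 * t_transit + BETA0
    return math.exp(v_transit) / (math.exp(v_transit) + math.exp(v_car))

p1 = prob_transit(52.9, 4.4)    # first individual
p2 = prob_transit(4.1, 28.5)    # second individual
print(round(p1, 3), round(p2, 3), round(p1 * p2, 3))
```

The first probability is about 0.995 (reported as 1 on the slide) and the second about 0.126 (reported as 0.13); their product is the joint probability of reproducing both observations.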
28 Applying the model: Likelihood
Two observations: the probability that the model reproduces both observations is
$$P_1(\text{transit}) P_2(\text{transit}) = 0.13$$
All observations: the probability that the model reproduces all observations is
$$P_1(\text{transit}) P_2(\text{transit}) \cdots P_{21}(\text{auto})$$
Likelihood of the sample
$$\mathcal{L}^* = \prod_{n} \left( P_n(\text{auto})^{y_{auto,n}} P_n(\text{transit})^{y_{transit,n}} \right)$$
where $y_{j,n}$ is 1 if individual $n$ has chosen alternative $j$, 0 otherwise.
29 Applying the model: Likelihood
Likelihood: probability that the model fits all observations. It is a function of the parameters.
Examples: [Table with columns $\beta_0$, $\beta_1$, $\mathcal{L}^*$; entries missing from this transcript.]
30-31 Applying the model: Likelihood function
[Figures: the likelihood function over $(\beta_0, \beta_1)$, full view and zoom.]
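33 Maximum likelihood estimation
Estimators for the parameters: parameters that achieve the maximum likelihood
$$\max_\beta \prod_n \left( P_n(\text{auto};\beta)^{y_{auto,n}} P_n(\text{transit};\beta)^{y_{transit,n}} \right)$$
Log likelihood: alternatively, we prefer to maximize the log likelihood
$$\max_\beta \ln \prod_n \left( P_n(\text{auto})^{y_{auto,n}} P_n(\text{transit})^{y_{transit,n}} \right) = \max_\beta \sum_n \ln\left( y_{auto,n} P_n(\text{auto}) + y_{transit,n} P_n(\text{transit}) \right)$$
Exactly one of the two indicators equals 1 for each $n$, so each term is also equal to $y_{auto,n} \ln P_n(\text{auto}) + y_{transit,n} \ln P_n(\text{transit})$.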
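A log likelihood function for this model can be sketched as follows. Only the first two observations are used, because they are the only ones whose travel times appear in the text; the parameter values $\beta_0 = 0.5$, $\beta_1 = -0.1$ follow the worked example (negative sign of $\beta_1$ assumed), and the tuple layout of an observation is a hypothetical choice.

```python
import math

def prob_transit(t_car, t_transit, beta0, beta1):
    """Binary logit probability of transit, constant attached to transit."""
    v_car = beta1 * t_car
    v_transit = beta1 * t_transit + beta0
    return math.exp(v_transit) / (math.exp(v_transit) + math.exp(v_car))

def log_likelihood(beta0, beta1, observations):
    """Sum over observations of the log probability of the chosen alternative.

    Each observation is (time by car, time by transit, choice),
    with choice "T" for transit and "C" for car.
    """
    total = 0.0
    for t_car, t_transit, choice in observations:
        p_transit = prob_transit(t_car, t_transit, beta0, beta1)
        total += math.log(p_transit if choice == "T" else 1.0 - p_transit)
    return total

# The two observations whose travel times are given in the example.
first_two = [(52.9, 4.4, "T"), (4.1, 28.5, "T")]
ll = log_likelihood(0.5, -0.1, first_two)
print(round(ll, 3), round(math.exp(ll), 3))
```

Exponentiating the result gives back the product of the two probabilities, roughly 0.125. Working with the sum of logarithms avoids the numerical underflow that a product of many small probabilities would cause.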
34-35 Maximum likelihood estimation
[Figures: two plots with axes $\beta_0$ and $\beta_1$.]
36 Maximum likelihood estimation: Solving the optimization problem
Unconstrained nonlinear optimization
- Iterative methods
- Designed to identify a local maximum
- When the function is concave, a local maximum is also a global maximum
- For binary logit, the log likelihood is concave
- Use the derivatives of the objective function
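As an illustration of an iterative, derivative-based method, here is a sketch of Newton's method with step halving for the two-parameter binary logit. It is not the CFSQP algorithm used in Biogeme. The data set is hypothetical (the travel times of the example are not available), and the constant is attached to transit as in the worked example.

```python
import math

# Hypothetical observations: (time by car, time by transit, 1 if transit chosen).
DATA = [(40, 10, 1), (30, 15, 1), (25, 20, 0), (20, 15, 1), (20, 25, 1),
        (15, 25, 0), (10, 30, 0), (30, 20, 1), (20, 40, 0), (25, 22, 0)]

def log_sigmoid(z):
    """Numerically stable ln(1 / (1 + e^{-z}))."""
    return -math.log1p(math.exp(-z)) if z >= 0 else z - math.log1p(math.exp(z))

def log_likelihood(b0, b1):
    total = 0.0
    for t_car, t_transit, y in DATA:
        z = b0 + b1 * (t_transit - t_car)          # V_T - V_C
        total += log_sigmoid(z) if y == 1 else log_sigmoid(-z)
    return total

def gradient_and_hessian(b0, b1):
    g0 = g1 = h00 = h01 = h11 = 0.0
    for t_car, t_transit, y in DATA:
        d = t_transit - t_car
        p = math.exp(log_sigmoid(b0 + b1 * d))     # P_n(transit)
        g0 += y - p
        g1 += (y - p) * d
        w = p * (1.0 - p)
        h00 -= w
        h01 -= w * d
        h11 -= w * d * d
    return (g0, g1), (h00, h01, h11)

def newton(b0=0.0, b1=0.0, tol=1e-8, max_iter=50):
    for _ in range(max_iter):
        (g0, g1), (h00, h01, h11) = gradient_and_hessian(b0, b1)
        if math.hypot(g0, g1) < tol:               # stopping criterion: gradient ~ 0
            break
        det = h00 * h11 - h01 * h01
        # Newton step s solves H s = -g for the 2x2 Hessian H.
        s0 = -(h11 * g0 - h01 * g1) / det
        s1 = -(-h01 * g0 + h00 * g1) / det
        step, current = 1.0, log_likelihood(b0, b1)
        # Halve the step while the log likelihood clearly decreases.
        while (log_likelihood(b0 + step * s0, b1 + step * s1) < current - 1e-12
               and step > 1e-10):
            step *= 0.5
        b0, b1 = b0 + step * s0, b1 + step * s1
    return b0, b1

beta0_hat, beta1_hat = newton()
print(beta0_hat, beta1_hat, log_likelihood(beta0_hat, beta1_hat))
```

Because the log likelihood is concave, the point where the gradient vanishes is the global maximum; the step halving is a simple safeguard, one of several possible globalization strategies.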
37-44 Maximum likelihood estimation: Algorithm CFSQP in Biogeme
[Sequence of eight figures in the $(\beta_0, \beta_1)$ plane.]
45 Maximum likelihood estimation: Nonlinear optimization
Things to be aware of
- Iterative methods terminate when a given stopping criterion is verified, based on the fact that, if $\beta^*$ is the optimum, $\nabla \mathcal{L}(\beta^*) = 0$.
- Stopping criteria usually vary across optimization packages, which may produce slightly different solutions.
- They usually rely on a parameter defining the required precision.
- Most methods are sensitive to the conditioning of the problem. A well-conditioned problem is a problem for which all parameters have almost the same magnitude.
46-47 Maximum likelihood estimation: Nonlinear optimization
[Figures: two pairs of panels titled "Time in min." and "Time in sec."]
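One way to read the minutes-versus-seconds comparison: changing the unit of travel time leaves the model unchanged but rescales the time coefficient by a factor of 60, so the parameters end up with very different magnitudes. The sketch below checks this on the first observation of the example, assuming $\beta_0 = 0.5$ and $\beta_1 = -0.1$ per minute (negative sign assumed).

```python
import math

def prob_transit(t_car, t_transit, beta0, beta1):
    """Binary logit probability of transit, constant attached to transit."""
    return 1.0 / (1.0 + math.exp(-(beta0 + beta1 * (t_transit - t_car))))

# Travel times in minutes, coefficient per minute.
p_minutes = prob_transit(52.9, 4.4, 0.5, -0.1)

# Same travel times in seconds: the coefficient must be divided by 60.
p_seconds = prob_transit(52.9 * 60, 4.4 * 60, 0.5, -0.1 / 60)

print(p_minutes, p_seconds)
```

The two probabilities coincide up to rounding error, but in seconds the time coefficient is about 300 times smaller in magnitude than the constant, which is the kind of imbalance that degrades conditioning.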
48 Maximum likelihood estimation: Nonlinear optimization
Things to be aware of
- Convergence may be very slow or even fail if the likelihood function is flat.
- It happens when the model is not identifiable:
  - structural flaw in the model (e.g. full set of alternative specific constants);
  - lack of variability in the data (e.g. all prices are the same across the sample).
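51 Output of the estimation
Solution of $\max_{\beta \in \mathbb{R}^K} \mathcal{L}(\beta)$: the estimates $\beta^*$ and the final log likelihood $\mathcal{L}(\beta^*)$.
Case study: $\beta_0^*$, $\beta_1^*$ and $\mathcal{L}(\beta_0^*, \beta_1^*)$ [values missing from this transcript].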
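52 Output of the estimation: Second derivatives
Information about the quality of the estimators
$$\nabla^2 \mathcal{L}(\beta^*) = \begin{pmatrix} \frac{\partial^2 \mathcal{L}}{\partial \beta_1^2} & \frac{\partial^2 \mathcal{L}}{\partial \beta_1 \partial \beta_2} & \cdots & \frac{\partial^2 \mathcal{L}}{\partial \beta_1 \partial \beta_K} \\ \frac{\partial^2 \mathcal{L}}{\partial \beta_2 \partial \beta_1} & \frac{\partial^2 \mathcal{L}}{\partial \beta_2^2} & \cdots & \frac{\partial^2 \mathcal{L}}{\partial \beta_2 \partial \beta_K} \\ \vdots & \vdots & \ddots & \vdots \\ \frac{\partial^2 \mathcal{L}}{\partial \beta_K \partial \beta_1} & \frac{\partial^2 \mathcal{L}}{\partial \beta_K \partial \beta_2} & \cdots & \frac{\partial^2 \mathcal{L}}{\partial \beta_K^2} \end{pmatrix}$$
$-\nabla^2 \mathcal{L}(\beta^*)^{-1}$ is a consistent estimator of the variance-covariance matrix of the estimates.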
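For the binary logit with linear-in-parameters utilities $V_{in} = \sum_k \theta_k x_{ink}$, as assumed at the end of the appendix, the second derivatives have a simple closed form. The following is a sketch of that derivation, starting from the gradient given in the appendix:

```latex
% Gradient (appendix):
%   \partial L / \partial \theta_k = \sum_n (y_{in} - P_n(i)) (x_{ink} - x_{jnk})
% Derivative of the logit probability with respect to \theta_l:
\frac{\partial P_n(i)}{\partial \theta_l} = P_n(i)\,P_n(j)\,(x_{inl} - x_{jnl})
% Hence the entries of the Hessian:
\frac{\partial^2 \mathcal{L}}{\partial \theta_k \partial \theta_l}
  = -\sum_{n=1}^{N} P_n(i)\,P_n(j)\,(x_{ink} - x_{jnk})(x_{inl} - x_{jnl})
```

Each term is minus a nonnegative weight times an outer product, so the Hessian is negative semidefinite. This is consistent with the concavity of the binary logit log likelihood, and it makes $-\nabla^2 \mathcal{L}(\beta^*)^{-1}$ positive definite whenever the Hessian is nonsingular.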
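53 Output of the estimation: Statistics
Statistics on the parameters: [Table with columns Parameter, Value, Std Err., t-test and rows $\beta_0$, $\beta_1$; entries missing from this transcript.]
Summary statistics: $\mathcal{L}(\beta^*)$, $\mathcal{L}(0)$ and $-2(\mathcal{L}(0) - \mathcal{L}(\beta^*))$ [values missing from this transcript]; $\rho^2 = 0.576$; $\bar{\rho}^2$ [value missing from this transcript].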
54 Output of the estimation: Summary statistics
Null log likelihood $\mathcal{L}(0)$: sample log likelihood with a trivial model where all parameters are zero, that is, a model always predicting
$$P(1 \mid \{1,2\}) = P(2 \mid \{1,2\}) = \frac{1}{2}$$
Purely a function of sample size:
$$\mathcal{L}(0) = \log\left(\frac{1}{2^N}\right) = -N \log(2)$$
55 Output of the estimation: Summary statistics
Likelihood ratio: $-2(\mathcal{L}(0) - \mathcal{L}(\beta^*))$
$$\log\left(\frac{\mathcal{L}^*(0)}{\mathcal{L}^*(\beta^*)}\right) = \log(\mathcal{L}^*(0)) - \log(\mathcal{L}^*(\beta^*)) = \mathcal{L}(0) - \mathcal{L}(\beta^*)$$
Likelihood ratio test
- $H_0$: the two models are equivalent.
- Under $H_0$, $-2(\mathcal{L}(0) - \mathcal{L}(\beta^*))$ is asymptotically distributed as $\chi^2$ with $K$ degrees of freedom.
- Similar to the F test in regression models.
56 Output of the estimation: Summary statistics
Rho squared $\rho^2$
$$\rho^2 = 1 - \frac{\mathcal{L}(\beta^*)}{\mathcal{L}(0)}$$
Similar to the $R^2$ in regression models.
Rho bar squared $\bar{\rho}^2$
$$\bar{\rho}^2 = 1 - \frac{\mathcal{L}(\beta^*) - K}{\mathcal{L}(0)}$$
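The summary statistics follow directly from the final log likelihood, the sample size, and the number of parameters. The sketch below uses $N = 21$ and $K = 2$ as in the example; the final log likelihood of $-6.0$ is a hypothetical placeholder, since the actual value is not available here.

```python
import math

def summary_statistics(ll_beta, n_obs, n_params):
    """Null log likelihood, likelihood ratio, rho squared and rho bar squared
    for a binary choice model."""
    ll_null = -n_obs * math.log(2.0)               # L(0) = -N log(2)
    likelihood_ratio = -2.0 * (ll_null - ll_beta)  # -2 (L(0) - L(beta*))
    rho2 = 1.0 - ll_beta / ll_null
    rho2_bar = 1.0 - (ll_beta - n_params) / ll_null
    return ll_null, likelihood_ratio, rho2, rho2_bar

# N = 21 observations, K = 2 parameters; ll_beta = -6.0 is hypothetical.
ll_null, lr, rho2, rho2_bar = summary_statistics(-6.0, n_obs=21, n_params=2)
print(round(ll_null, 3), round(lr, 3), round(rho2, 3), round(rho2_bar, 3))
```

The null log likelihood depends only on $N$ (about $-14.556$ for 21 observations), while $\bar{\rho}^2$ penalizes the fit by the number of estimated parameters.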
58 Back to the scale
Comparing models
- Arbitrary scale may be problematic when comparing models.
- Binary probit: $\sigma^2 = \mathrm{Var}(\varepsilon_i - \varepsilon_j) = 1$.
- Binary logit: $\mathrm{Var}(\varepsilon_i - \varepsilon_j) = \pi^2/(3\mu^2) = \pi^2/3$ (with $\mu = 1$).
- $\mathrm{Var}(\alpha U) = \alpha^2 \mathrm{Var}(U)$.
- Scaled logit coefficients are $\pi/\sqrt{3}$ times larger than scaled probit coefficients.
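The scale factor between the two normalizations is the ratio of the standard deviations of $\varepsilon_i - \varepsilon_j$. A minimal sketch, with a hypothetical probit coefficient used only to show the conversion:

```python
import math

var_probit = 1.0                   # Var(eps_i - eps_j) under the probit normalization
var_logit = math.pi ** 2 / 3.0     # Var(eps_i - eps_j) for logit with mu = 1

# Ratio of standard deviations: pi / sqrt(3).
scale_factor = math.sqrt(var_logit / var_probit)

def probit_to_logit_scale(beta_probit):
    """Rescale a probit coefficient so that it is comparable to a logit one."""
    return beta_probit * scale_factor

# Hypothetical probit coefficient, for illustration only.
print(round(scale_factor, 4), round(probit_to_logit_scale(-0.03), 4))
```

The rescaled probit coefficients are only expected to be close to the logit ones, not identical, because the normal and logistic distributions differ in shape even when their variances match.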
59 Back to the scale: Comparing models
Estimation results
Note: $\pi/\sqrt{3} \approx 1.814$
[Table with columns Probit, Logit, Probit $\times\ \pi/\sqrt{3}$ and rows $\mathcal{L}$, $\beta_0$, $\beta_1$; entries missing from this transcript.]
61 Appendix: Maximum likelihood for binary logit
- Let $C_n = \{i, j\}$.
- Let $y_{in} = 1$ if $i$ is chosen by $n$, 0 otherwise.
- Let $y_{jn} = 1$ if $j$ is chosen by $n$, 0 otherwise.
- Obviously, $y_{in} = 1 - y_{jn}$.
Log-likelihood of the sample
$$\sum_{n=1}^{N} \left( y_{in} \ln \frac{e^{V_{in}}}{e^{V_{in}} + e^{V_{jn}}} + y_{jn} \ln \frac{e^{V_{jn}}}{e^{V_{in}} + e^{V_{jn}}} \right)$$
62 Appendix: Maximum likelihood for binary logit
$$P_n(i) = \frac{e^{V_{in}}}{e^{V_{in}} + e^{V_{jn}}}$$
$$\ln P_n(i) = V_{in} - \ln(e^{V_{in}} + e^{V_{jn}})$$
$$\frac{\partial \ln P_n(i)}{\partial V_{in}} = 1 - \frac{e^{V_{in}}}{e^{V_{in}} + e^{V_{jn}}} = 1 - P_n(i) = P_n(j)$$
$$\frac{\partial \ln P_n(i)}{\partial V_{jn}} = -\frac{e^{V_{jn}}}{e^{V_{in}} + e^{V_{jn}}} = -P_n(j)$$
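63 Appendix: Maximum likelihood for binary logit
$$\frac{\partial \mathcal{L}}{\partial \theta} = \frac{\partial \mathcal{L}}{\partial V_{in}} \frac{\partial V_{in}}{\partial \theta} + \frac{\partial \mathcal{L}}{\partial V_{jn}} \frac{\partial V_{jn}}{\partial \theta}$$
$$\frac{\partial \mathcal{L}}{\partial V_{in}} = \sum_{n=1}^{N} \left( y_{in} \frac{\partial \ln P_n(i)}{\partial V_{in}} + y_{jn} \frac{\partial \ln P_n(j)}{\partial V_{in}} \right)$$
$$= \sum_{n=1}^{N} \left( y_{in} P_n(j) - y_{jn} P_n(i) \right)$$
$$= \sum_{n=1}^{N} \left( y_{in} (1 - P_n(i)) - (1 - y_{in}) P_n(i) \right)$$
$$= \sum_{n=1}^{N} \left( y_{in} - P_n(i) \right)$$
$$\frac{\partial \mathcal{L}}{\partial V_{jn}} = \sum_{n=1}^{N} \left( y_{jn} - P_n(j) \right)$$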
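64 Appendix: Maximum likelihood for binary logit
$$\frac{\partial \mathcal{L}}{\partial \theta} = \sum_{n=1}^{N} \left( (y_{in} - P_n(i)) \frac{\partial V_{in}}{\partial \theta} + (y_{jn} - P_n(j)) \frac{\partial V_{jn}}{\partial \theta} \right) = \sum_{n=1}^{N} (y_{in} - P_n(i)) \left( \frac{\partial V_{in}}{\partial \theta} - \frac{\partial V_{jn}}{\partial \theta} \right)$$
If $V_{in} = \sum_k \theta_k x_{ink}$, then
$$\frac{\partial \mathcal{L}}{\partial \theta_k} = \sum_{n=1}^{N} (y_{in} - P_n(i)) (x_{ink} - x_{jnk})$$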
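The closed-form gradient for linear-in-parameters utilities can be checked against central finite differences. The sketch below uses a hypothetical data set in which alternative $i$ is transit with attributes (1, travel time) and alternative $j$ is car with attributes (0, travel time); the parameter values are also illustrative.

```python
import math

def prob_i(theta, x_i, x_j):
    """Binary logit P_n(i) with V = sum_k theta_k * x_k for each alternative."""
    z = sum(t * (a - b) for t, a, b in zip(theta, x_i, x_j))   # V_in - V_jn
    return 1.0 / (1.0 + math.exp(-z))

def log_likelihood(theta, data):
    total = 0.0
    for x_i, x_j, y_i in data:
        p = prob_i(theta, x_i, x_j)
        total += math.log(p) if y_i == 1 else math.log(1.0 - p)
    return total

def gradient(theta, data):
    """dL/dtheta_k = sum_n (y_in - P_n(i)) * (x_ink - x_jnk)."""
    grad = [0.0] * len(theta)
    for x_i, x_j, y_i in data:
        residual = y_i - prob_i(theta, x_i, x_j)
        for k in range(len(theta)):
            grad[k] += residual * (x_i[k] - x_j[k])
    return grad

def finite_difference_gradient(theta, data, h=1e-6):
    """Central finite differences, used only as an independent check."""
    grad = []
    for k in range(len(theta)):
        up, down = list(theta), list(theta)
        up[k] += h
        down[k] -= h
        grad.append((log_likelihood(up, data) - log_likelihood(down, data)) / (2.0 * h))
    return grad

# Hypothetical data: (attributes of transit, attributes of car, 1 if transit chosen).
data = [((1.0, 10.0), (0.0, 40.0), 1), ((1.0, 25.0), (0.0, 15.0), 0),
        ((1.0, 20.0), (0.0, 25.0), 0), ((1.0, 25.0), (0.0, 20.0), 1)]
theta = [0.5, -0.1]

analytic = gradient(theta, data)
numeric = finite_difference_gradient(theta, data)
print(analytic, numeric)
```

Agreement between the two gradients is a standard sanity check before handing the analytic derivatives to an optimization algorithm.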
More informationNon-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models
Optimum Design for Mixed Effects Non-Linear and generalized Linear Models Cambridge, August 9-12, 2011 Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models
More informationGeneralized Linear Models Introduction
Generalized Linear Models Introduction Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin Generalized Linear Models For many problems, standard linear regression approaches don t work. Sometimes,
More informationAsymptotic Statistics-III. Changliang Zou
Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (
More informationThe Logit Model: Estimation, Testing and Interpretation
The Logit Model: Estimation, Testing and Interpretation Herman J. Bierens October 25, 2008 1 Introduction to maximum likelihood estimation 1.1 The likelihood function Consider a random sample Y 1,...,
More informationLinear Classification. CSE 6363 Machine Learning Vassilis Athitsos Computer Science and Engineering Department University of Texas at Arlington
Linear Classification CSE 6363 Machine Learning Vassilis Athitsos Computer Science and Engineering Department University of Texas at Arlington 1 Example of Linear Classification Red points: patterns belonging
More informationDiagnostics can identify two possible areas of failure of assumptions when fitting linear models.
1 Transformations 1.1 Introduction Diagnostics can identify two possible areas of failure of assumptions when fitting linear models. (i) lack of Normality (ii) heterogeneity of variances It is important
More informationChapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression
BSTT523: Kutner et al., Chapter 1 1 Chapter 1: Linear Regression with One Predictor Variable also known as: Simple Linear Regression Bivariate Linear Regression Introduction: Functional relation between
More informationStatistics 203: Introduction to Regression and Analysis of Variance Course review
Statistics 203: Introduction to Regression and Analysis of Variance Course review Jonathan Taylor - p. 1/?? Today Review / overview of what we learned. - p. 2/?? General themes in regression models Specifying
More informationEcon 583 Final Exam Fall 2008
Econ 583 Final Exam Fall 2008 Eric Zivot December 11, 2008 Exam is due at 9:00 am in my office on Friday, December 12. 1 Maximum Likelihood Estimation and Asymptotic Theory Let X 1,...,X n be iid random
More informationFall, 2007 Nonlinear Econometrics. Theory: Consistency for Extremum Estimators. Modeling: Probit, Logit, and Other Links.
14.385 Fall, 2007 Nonlinear Econometrics Lecture 2. Theory: Consistency for Extremum Estimators Modeling: Probit, Logit, and Other Links. 1 Example: Binary Choice Models. The latent outcome is defined
More informationPanel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:
Panel Data Exercises Manuel Arellano Exercise 1 Using panel data, a researcher considers the estimation of the following system: y 1t = α 1 + βx 1t + v 1t. (t =1,..., T ) y Nt = α N + βx Nt + v Nt where
More informationcovariance between any two observations
1 Ordinary Least Squares (OLS) 1.1 Single Linear Regression Model assumptions of Classical Linear Regression Model (CLRM) (i) true relationship y i = α + βx i + ε i, i = 1,..., N where α, β = population
More informationMasking Identification of Discrete Choice Models under Simulation Methods *
Masking Identification of Discrete Choice Models under Simulation Methods * Lesley Chiou 1 and Joan L. Walker 2 Abstract We present examples based on actual and synthetic datasets to illustrate how simulation
More informationModels for Heterogeneous Choices
APPENDIX B Models for Heterogeneous Choices Heteroskedastic Choice Models In the empirical chapters of the printed book we are interested in testing two different types of propositions about the beliefs
More informationLinear model A linear model assumes Y X N(µ(X),σ 2 I), And IE(Y X) = µ(x) = X β, 2/52
Statistics for Applications Chapter 10: Generalized Linear Models (GLMs) 1/52 Linear model A linear model assumes Y X N(µ(X),σ 2 I), And IE(Y X) = µ(x) = X β, 2/52 Components of a linear model The two
More informationClassification. Chapter Introduction. 6.2 The Bayes classifier
Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode
More informationThe Network GEV model
The Network GEV model Michel Bierlaire January, 2002 Discrete choice models play a major role in transportation demand analysis. Their nice and strong theoretical properties, and their flexibility to capture
More informationHeteroskedasticity. Part VII. Heteroskedasticity
Part VII Heteroskedasticity As of Oct 15, 2015 1 Heteroskedasticity Consequences Heteroskedasticity-robust inference Testing for Heteroskedasticity Weighted Least Squares (WLS) Feasible generalized Least
More informationMultivariate Regression
Multivariate Regression The so-called supervised learning problem is the following: we want to approximate the random variable Y with an appropriate function of the random variables X 1,..., X p with the
More informationIntroduction to Signal Detection and Classification. Phani Chavali
Introduction to Signal Detection and Classification Phani Chavali Outline Detection Problem Performance Measures Receiver Operating Characteristics (ROC) F-Test - Test Linear Discriminant Analysis (LDA)
More informationLOGISTIC REGRESSION Joseph M. Hilbe
LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of
More informationApplied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid
Applied Economics Regression with a Binary Dependent Variable Department of Economics Universidad Carlos III de Madrid See Stock and Watson (chapter 11) 1 / 28 Binary Dependent Variables: What is Different?
More information12 Modelling Binomial Response Data
c 2005, Anthony C. Brooms Statistical Modelling and Data Analysis 12 Modelling Binomial Response Data 12.1 Examples of Binary Response Data Binary response data arise when an observation on an individual
More informationEXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING
EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING DATE AND TIME: August 30, 2018, 14.00 19.00 RESPONSIBLE TEACHER: Niklas Wahlström NUMBER OF PROBLEMS: 5 AIDING MATERIAL: Calculator, mathematical
More informationKernel Logistic Regression and the Import Vector Machine
Kernel Logistic Regression and the Import Vector Machine Ji Zhu and Trevor Hastie Journal of Computational and Graphical Statistics, 2005 Presented by Mingtao Ding Duke University December 8, 2011 Mingtao
More informationLikelihood Ratio Test in High-Dimensional Logistic Regression Is Asymptotically a Rescaled Chi-Square
Likelihood Ratio Test in High-Dimensional Logistic Regression Is Asymptotically a Rescaled Chi-Square Yuxin Chen Electrical Engineering, Princeton University Coauthors Pragya Sur Stanford Statistics Emmanuel
More informationMSH3 Generalized linear model
Contents MSH3 Generalized linear model 5 Logit Models for Binary Data 173 5.1 The Bernoulli and binomial distributions......... 173 5.1.1 Mean, variance and higher order moments.... 173 5.1.2 Normal limit....................
More informationMaximum Likelihood and. Limited Dependent Variable Models
Maximum Likelihood and Limited Dependent Variable Models Michele Pellizzari IGIER-Bocconi, IZA and frdb May 24, 2010 These notes are largely based on the textbook by Jeffrey M. Wooldridge. 2002. Econometric
More informationMLE and GMM. Li Zhao, SJTU. Spring, Li Zhao MLE and GMM 1 / 22
MLE and GMM Li Zhao, SJTU Spring, 2017 Li Zhao MLE and GMM 1 / 22 Outline 1 MLE 2 GMM 3 Binary Choice Models Li Zhao MLE and GMM 2 / 22 Maximum Likelihood Estimation - Introduction For a linear model y
More informationExercises Chapter 4 Statistical Hypothesis Testing
Exercises Chapter 4 Statistical Hypothesis Testing Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans December 5, 013 Christophe Hurlin (University of Orléans) Advanced Econometrics
More informationParametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1
Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson
More informationRandom Matrix Eigenvalue Problems in Probabilistic Structural Mechanics
Random Matrix Eigenvalue Problems in Probabilistic Structural Mechanics S Adhikari Department of Aerospace Engineering, University of Bristol, Bristol, U.K. URL: http://www.aer.bris.ac.uk/contact/academic/adhikari/home.html
More informationBayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence
Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns
More informationGeneralized Linear Models
Generalized Linear Models Advanced Methods for Data Analysis (36-402/36-608 Spring 2014 1 Generalized linear models 1.1 Introduction: two regressions So far we ve seen two canonical settings for regression.
More informationA Discrete choice framework for acceleration and direction change behaviors in walking pedestrians
A Discrete choice framework for acceleration and direction change behaviors in walking pedestrians G. Antonini 1 and M. Bierlaire 1 The walking process is interpreted as a sequence of decisions about where
More informationSCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models
SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION
More informationBinary Logistic Regression
The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b
More informationGARCH Models Estimation and Inference
GARCH Models Estimation and Inference Eduardo Rossi University of Pavia December 013 Rossi GARCH Financial Econometrics - 013 1 / 1 Likelihood function The procedure most often used in estimating θ 0 in
More informationPart I Behavioral Models
Part I Behavioral Models 2 Properties of Discrete Choice Models 2.1 Overview This chapter describes the features that are common to all discrete choice models. We start by discussing the choice set, which
More informationOutline. Supervised Learning. Hong Chang. Institute of Computing Technology, Chinese Academy of Sciences. Machine Learning Methods (Fall 2012)
Outline Hong Chang Institute of Computing Technology, Chinese Academy of Sciences Machine Learning Methods (Fall 2012) Outline Outline I 1 Linear Models for Regression Linear Regression Probabilistic Interpretation
More informationI. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as
Econ 513, USC, Fall 2005 Lecture 15 Discrete Response Models: Multinomial, Conditional and Nested Logit Models Here we focus again on models for discrete choice with more than two outcomes We assume that
More informationNormalization and correlation of cross-nested logit models
Normalization and correlation of cross-nested logit models E. Abbe M. Bierlaire T. Toledo August 5, 25 Abstract We address two fundamental issues associated with the use of cross-nested logit models. On
More informationTesting Restrictions and Comparing Models
Econ. 513, Time Series Econometrics Fall 00 Chris Sims Testing Restrictions and Comparing Models 1. THE PROBLEM We consider here the problem of comparing two parametric models for the data X, defined by
More information