Probabilistic Choice Models
|
|
- Mark Watts
- 6 years ago
- Views:
Transcription
1 Econ 3: James J. Heckman Probabilistic Choice Models This chapter examines different models commonly used to model probabilistic choice, such as eg the choice of one type of transportation from among many choices available to the consumer. Section discusses derivation and limitations of conditional logit models. Section 2 discusses probit models and Section 3 discusses the nested logit (generalized extreme value models), which address some of the limitations of the conditional logit models. The Conditional Logit Model In this section we investigate conditional logit models. We discuss its derivation from a random utility model (Section.2) with Extreme Value Type I distributed shocks. The relevant properties of the Extreme Value Type I distribution are discussed in Section.. We also derive the conditional logit model from the Luce axioms (Section.3). In Section.4, we discuss some of the limitations of the conditional logit models.. TheExtremeValueTypeIDistribution Suppose ε is independent (not necessarily identical) Extreme Value Type I random variable. Then the CDF of ε is: Pr(ε <c)f (c) ( ( (c + α i ))) where α i is a parameter of the Extreme Value Type I CDF. Also, by the assumption of independence, we can write: ny ny F (ε, ε 2,, ε n ) F (ε i ) ( ( (ε i + α i ))) i McFadden (974) mistakenly referred to this as the Weibull distribution, and this misclassification shows up in the literature. i
2 The Extreme Value Type I distribution has two useful features. First, the difference between two Extreme Value Type I random variables is a logit. Second, Extreme Value Type Is are closed under maximization, since (assuming independence): ³ Pr max{ε i } ε i ny Pr(ε i ε) i ny ( ( (ε + α i ))) i à à à nx!! ( (ε + α i )) i ( ( ε))! nx ( α i ) i () P Consider n ( α i ). We can solve for α in the following equation: i which implies: nx ( α i ) ( α) i à nx! α log ( α i ). We can then substitute this value of α into equation () to get: ³ Pr max{ε i } ε i i ( ( ( ε)) ( α)) ( ( (ε + α))) which is indeed a Extreme Value Type I random variable. 2
3 .2 Random Utility Model An individual with characteristics s has a choice set x,where x B, B is a feasible set. We write: Pr (x s, B) as the probability that a person of characteristics s chooses x from the feasible set. We also suppose that: U (s, x) v(s, x)+ε(s, x) where ε is independent Extreme Value Type I. From our information on Extreme Value Type Is in section, we know that ε i + v i,(andthusu i ), has an Extreme Value Type I distribution with parameter α i v i,asshown below: F Ui (ε) Pr(ε i + v i < ε) Pr(ε i < ε v i ) ( ( (ε + α i v i ))) Let us now suppose that there are two goods and two corresponding utilities. Consumers govern their choices by the obvious decision rule: choose good one if U >U 2. More generally, if there are n goods, then good j will be selected if U j argmax {U i } n i. Specifically, in our two good case: Pr ( is chosen) Pr(U >U 2 )Pr(ε + v > ε 2 + v 2 ) Imposing that ε is independent Extreme Value Type I, we can be much more precise about this probability: Pr(ε + v > ε 2 + v 2 ) Pr(ε + v v 2 > ε 2 ) Z Z f(ε ) Z ε +v v 2 f(ε 2 )dε 2 dε f(ε )( (ε + v v 2 + α 2 )) dε (2) 3
4 Observe that F (ε )( (ε + α )), which implies: f(ε ) F (ε ) ε ( (ε + α )) ( (ε + α )) (ε + α )(( (ε + α ))) Substituting this into equation(2), gives us: Pr ( ischosen) Z (ε + α )(( (ε + α ))) ( (ε + v v 2 + α 2 )) dε e α Z e ε e [ ( ε )][( α ) (v v 2 +α 2 )] dε ( α ) ( α )+ (v v 2 + α 2 ) he i [ ( ε )][( α ) (v v 2 +α 2 )] ( α ) ( α )+ (v v 2 + α 2 ) (v α ) (v α )+(v 2 α 2 ) This result generalizes, because the max over (n ) choices is still an Extreme Value Type I, so we can make a two stage maximization argument, as follows : 4
5 Pr(ε + v > ε i + v i, where ṽ j v j α j. i, 2,,n) Pr ε + v > max (ε i + v i ) i2,,n (v α ) (v α ) + (v 2 α 2 )+ +(v n α n ) (ṽ ) np (ṽ i ) i This type of model of probabilistic choice is called a conditional or multinomial logit model. The difference between conditional" and multinomial" is simply that in the conditional" logit case, the values of the variables (usually choice characteristics) vary across the choices, while the parameters are common across the choices. In the multinomial" logit case, the values of the variables are common across choices for the same person (usually individual characteristics) but the parameters vary across choices. For eg we have in the linear v i case, the probability of individual j making choice i from among m choices is: Conditional Logit case: P ij (β0 c ij ) mp,wherec ij is the vector of values (β 0 c kj ) k of characteristics of choice i as perceived by individual j. Multinomial Logit case: P ij (α0 i s j) mp,wheres j is a vector of individual (α 0 k s j) k characteristics for individual j. Note that we can easily combine the two cases under one model, as described below: Generalized case: We can combine the conditional and multinomial logit models by generalizing either one of the two types of models. For eg, we could permit the coefficients in the multinomial logit case to depend on choice characteristics, ie have: α i φ i + c 0 ijθ 5
6 Then we get the generalized case, where the probability of choice i by individual j depends on both individual as well as choice characteristics (as well as interaction terms) 2 : P ij (α0 i s j) mp (α 0 k s j) k (φ0 is j + θ 0 c ij s j ) mp (φ 0 ks j + θ 0 c kj s j ) k We could similarly modify the coefficients in the conditional logit case to obtain the generalized version..3 Derivation of Logit from the Luce Axioms We will now show how the conditional logit can be derived from the random utility model discussed in Section.2 and the Luce Axioms presented below..3. Luce Axioms Axiom : Independence of Irrelevant Alternatives(IIA) Suppose that x, y B, s S. Then, Pr (x s, {x, y})pr(y s, B) Pr(y s, {x, y})pr(x s, B) or, we have: Pr (x s, {x, y}) Pr (y s, {x, y}) Pr (x s, B) Pr (y s, B). The term on the left is the odds ratio; the ratio of probabilities of choosing x to y given characteristics s and {x, y}. This axiom has been named Independence of Irrelevant Alternatives for an obvious reason the odds of our choice are not effected by adding additional alternatives. Note that this assumes that the additional choices entering in B affect probability of choosing x in the same manner as they affect the probability of choosing y; implicitly we are assuming that the additional choices have equivalent relationship with choice x and choice y. We will see how this assumption is a limitation, in Section.4 below. 2 Note that by allowing sj (the vector of individual characteristics) to have an intercept column of s, the model would have pure choice characteristic effects, in addition to the interaction terms. 6
7 Axiom 2: Positivity This axiom states that the probability of choosing any one of the choices is strictly greater than zero: Pr (y s, B) > 0 y B.3.2 Derivation of logit With the Luce assumptions set out in the preceeding section, we can now proceed to our derivation of the logit. Define P yx Pr(y s, {x, y}). Then by Axiom above, we know: Pyx P xy Pr (x s, B) Pr(y s, B) (3) Summing over y, we get: Pr (x s, B) X y B Pyx P xy Pr (x s, B) Again using Axiom, for z B: P y B Pyx P xy (4) Pyz P zy Pxz P zx Pr (z s, B) Pr(y s, B) Pr (z s, B) Pr(x s, B) (5a) (5b) Substituting these in equation (3), we get: Pyx P xy Pr (y s, B) Pr (x s, B) Pyz P zy Pxz P zy Pr (z s, B) Pr (z s, B) P yz P zy P xz P zx (6) 7
8 Now, in terms of the random utility model discussed in Section.2, define the mean utility of a person with characteristics s choosing x from set {x, z} as: v(s, x, z) ln P xz P zx P xz P zx (v(s, x, z)) Define a comparable ression for P yz P zy. Replacing this into equation (6) produces: P yx P xy Then from equation (4), we get: Pr (x s, B) P y B ³ (v(s, y, z)) (v(s, x, z)) (v(s, y, z)) (v(s, x, z)) (v(s,x,z)) Py B P y B ( (v(s, y, z))) (v(s, x, z)) ( (v(s, y, z))). Assume additionally, additive separability of v(s, x, z) as follows: v(s, x, z) v(s, x) v(s, z) Note that this is equivalent to assuming irrelevance of the benchmark. From 8
9 this assumption, we get: Pr (x s, B) P y B (v(s, x) v(s, z)) ( (v(s, y) v(s, z))) v(s, x)( v(s, z)) ³ P ( v(s, z)) y B (v(s, y)) v(s, x) Py B (v(s, y)) (7) which gives the multinomial logit. McFadden (974) shows that Luce Axioms and a condition on ε ( Translation Completeness ) produce the Extreme Value Type I (which he mistakenly referred to as the Weibull)..4 Consequences of Independence: Limitations of Logit Models We just showed that: so that: P i P (v i) i (v i) P i P j (v i ) P i (v i) (v j ) P i (v i) (v i) (v j ) (v i v j ) ln Pi P j v i v j A common specification for v i is v i z i β. Thus: Pi ln Pi P j ln (z i z j ) β β P j z j or, changes in characteristics z j have a common effect on the ratio of log probabilities. This allows for estimation of the probabilities of purchasing a new good. (One could obtain an estimate of β from the existing goods. This estimate can then be combined with the characteristics, z new,ofthe new good to estimate the probability of selection, as in equation 7). 9
10 Further, from equation (7): and: Pr (2 {, 2, 3}) Pr (2 {, 2}) ev 2 e v + e v 2 e v 2 e v + e v 2 + e v 3 < Pr (2 {, 2}) This leads us to a restrictive property of the conditional logit model - we have assumed independence of the ε i, when in fact, they may be correlated. This is illustrated by McFadden s famous red bus, blue bus problem: Suppose we are modelling transportation choice and our alternatives consist of {car, bus, train}. If the alternatives are replaced by {car, red bus, blue bus}, then we have violated our assumption of dissimilar alternatives; if U 2 >U, then the event U 3 >U is more likely. One can see by the preceding equation that adding more bus colors continually decreases the probability that car travel is chosen. We can deal with the problem of similar alternatives by using the nested logit model (Nested Logit) or the random coefficient probit model. 2 Probit: Random Coefficients In this section (as in Section.4 above), we make v i a simple linear function of the choice characteristics alone (as discussed in Section.2, we can easily generalize this to include individual characteristics as well as interactions). Then we have, utility from choice i is: U i Z i β + η i where: η i N(0, σ 2 i ), η i Z i, β, η j, i, j. Moreover, β is a random variable, with β ( β, P β), sothat: It follows that: U i Z i β + Zi β β + ηi U U 2 0 (Z Z 2 ) β +(Z Z 2 ) β β +(η η 2 ) 0 U U 3 0 (Z Z 3 ) β +(Z Z 3 ) β β +(η η 3 ) 0. 0
11 Further: Var (U U 2 ) E{[(U U 2 ) E (U U 2 )] 0 [(U U 2 ) E (U U 2 )]} E [(Z Z 2 )(β β)+(η η 2 )] 0 [(Z Z 2 )(β β)+(η η 2 )] ª E (Z Z 2 )(β β)(β β) 0 (Z Z 2 ) 0 +(η η 2 )(η η 2 ) 0ª (Z Z 2 ) X β (Z Z 2 ) 0 + σ 2 + σ 2 2 (since σ 2 0). Similarly: Thus: so: Var (U U 3 )(Z Z 3 ) X β (Z Z 3 ) 0 + σ 2 + σ 2 3 Cov (U U 2,U U 3 )(Z Z 2 ) X β (Z Z 3 ) 0 + σ 2 ρ Corr (U U 2,U U 3 ) (Z Z 2 ) P β (Z Z 3 ) 0 + σ 2 p Var (U U 2 ) Var (U U 3 ) We now seek to derive the probability of choosing good in a three good case: Pr ( {, 2, 3}) Pr(U U 2 0 and U U 3 0). From before, we know that: U U 2 N (Z Z 2 ) β, Var (U U 2 ) U U 3 N (Z Z 3 ) β, Var (U U 3 ). Thus: Pr (U U 2 0 and U U 3 0) Prh pvar (U U 2 )t +(Z Z 2 ) β 0
12 and p Var (U U 3 )t 2 +(Z Z 3 ) β i 0 where t and t 2 are standard normal. Thus, the above equation reduces to: à Pr t (Z Z 2 ) p β Var (U U 2 ) and t 2 (Z Z 3 ) p β! Var (U U 3 ) Pr à t (Z Z 2 ) β p Var (U U 2 ) and t 2 (Z Z 3 ) p β! Var (U U 3 ) As t and t 2 may be correlated, we integrate over the joint density to get the probability: Z a Z b t2-2ρt t 2 +t 2 2 Pr (choosing ) 2π p -ρ e 2 -ρ 2 2 dt 2 dt where: a (Z Z 2 ) β p Var (U U 2 ), and b (Z Z 3 ) β p Var (U U 3 ) Now consider adding a third good to the two good case, under two alternative scenarios. If the third good has identical characteristics as the first, then Z 2 Z 3. If there is no stochastic component (no utility innovation), then σ 2 σ 2 2 σ Therefore, in this case: Pr ( chosen) Pr(U U 2 0 and U U 3 0) Pr(U U 2 0) Thus, there is no change in the probability of choosing good despite the addition of a third good. Again focusing on the two good case, we 2
13 observe: Pr ( {, 2}) Pr(U U 2 0) Ã Pr t (Z Z 2 ) p β! Var (U U 2 ) 2π Z (Z Z2) β [(Z Z 2 )Σ β (Z Z 2 ) 0 +σ 2 +σ2 2 ]/2 t2 2 dt which can be evaluated to derive the desired probability. Here P we consider a McFadden-Luce type of set up, where one imposes β 0. Defining σ p σ 2 + σ2 2, we observe that the probability of choosing good in the two-good case is: Z (Z Z2) β σ t2 dt 2π 2 Adding a third good to the scene with identical characteristics, (Z 2 Z 3 ), yields the probability for good being purchased as: Z (Z Z 2 ) β σ Ã Z (Z Z 2 ) β σ Ã t 2 2π p ρ 2ρt t 2 + t 2!! 2 dt 2 2 ρ 2 dt 2 One can show that, upon evaluation of these integrals, the probability derived from addition of the third good is less than the probability in the two good case. This leads us to a similar problem as the multinomial/conditional logit - adding alternatives decreases the probability of choice, despite the fact that the alternatives are quite similar. Thus, in the probit case we are able to avoid the limitation of the logit models with regard to addition of an identical good, through the covariance structure of the random coefficients. As illustrated in case 2, probit models without random coefficients suffer from the same limitation. Note that while the richer covariance structure is able to capture the relationship between choices in the probit model, applications involving many choices are practically limited as evaluation of higher-order multivariate normal integrals is difficult (refer discussion in Greene, Section a). 3
14 3 Nested Logit: Generalized Extreme Value (GEV) Model Consider a function G(y,y 2,,y J ), where G satisfies: i. Non-negativity: G (y,y 2,,y J ) 0 (y,y 2,,y J ) 0. ii. Homogeneous of degree : G (αy, αy 2,, αy J )αg(y,y 2,,y J ). iii. Derivative property: k G y y 2 y J 0 if k even 0 if k odd. If G satisfies these conditions, then we get the following probability: P (y i {y,y 2,..., y J }) P i y ig i (y,y 2,,y J ) G (y,y 2,,y J ), where P i is a probability that can be derived from utility maximization. We can use the theorem above to derive a special case of the nested logit model. Define: G ((v ), (v 2 ),, (v J )) σ v3 vj (v ) (v )+ ((v 2 )) σ + +((v J )) σ σ Observe that σ 0is the ordinary logit model. (With G definedinthis way, we are assuming that ε is uncorrelated with all of the other ε j, while the remaining ε i may be correlated. The parameter σ is a kind of measure of correlation between the remaining ε i. It is this correlation structure that would allow the GEV model to tackle the limitation of the ordinary conditional/multinomial logit models. ). This function obviously meets the conditions for the GEV model. For, i. Non-negativity: obvious as 0 < σ < ii. Homogeneity: G (α (v ), α (v 2 ),, α (v J )) 4
15 α (v )+ (α (v 2 )) σ + +(α (v J )) σ σ α (v )+ α σ ((v 2 )) ³ σ + + α σ ( (v J )) σ σ α (v )+α Ã α (v )+ + + αg ((v ), (v 2 ),, (v J )) vj vj + + σ! σ iii. By inspection, one can see that this derivative property will hold. (It is obvious when differentiating with respect to (v. For other derivatives, the fact that 0 < σ < gives the needed alternation in sign. Note that y i in the definition of the property is analogous to (v i here.) Thus, we can now proceed to derive our probabilities. First, consider: Pr ( {, 2}) e v ³ σ e v + e v 2 σ e v e v + e which is simply our binomial logit model. Also note that in the three good case: Ã! σ v3 G 2 ( σ) + σ Ã! σ σ v3 + 5
16 Now suppose that we eliminate choice (by letting v ). Then: σ v3 (v 2 ) + Pr (2 {2, 3}) σ v3 + Observe that: + v3 σ Pr ( {, 2, 3}) e v e v + ³ e v 2 σ + e v3 σ e v σ v 2 e v + e ³ +e v 3 v 2 σ e v e v + e v 2 e v 3 Ã+ e v 2 σ! σ (8) σ Letting σ, and supposing e v 2 >ev 3, we get: e v 3 e v 3 < e v 2 and thus from equation (8), we have: e v 2 σ 0 as σ Pr ( {, 2, 3}) ev e v + e v 2 (9) 6
17 Conversely, if e v 3 >ev 2, just reverse the roles of v 2 and v 3 so: Pr ( {, 2, 3}) e v e v + e v 2 e v 3 e v 2 ev e v + e v 3 (0) Combining equations (9) and (0), we get, as σ : Pr ( {, 2, 3}) e v e v +max{e v 2,e v 3} () Equations (9), (0) & () imply that in this GEV model, the probability of choice on addition of a choice 3 identical to choice 2, does not necessarily fall, as was the case in the ordinary conditional/multinomial logit case. Equations (9) shows that if the added choice 3 is highly correlated to choice 2(σ ) but yields less utility, then the probability in the three choice case reduces to the binomial logit (the probability in the two choice case), with choice 3 dropping out, as one would intuitively ect. What about the probability of choice 2 how does this change when we add an identical choice 3 in this GEV model? To answer this, consider: ( Ã! v Ã!) 2 v σ 3 Ã! σv 2 e v 2 ( σ) + Pr (2 {, 2, 3}) (! Ã v 2 e v + Ã v 2!( Ã v 2 ( Ã v 2 e v + ( Ã v 2 e v + Ã v 3 +! Ã v 3 +! Ã v 3 + Ã σ v 2! Ã v 3 +!) σ!) σ!!) σ!) σ Ã Ã v 2! Ã v 3 + When σ 0, ie when there is no correlation between choice 2 and choice 3, we have ordinary conditional/multinomial logit. Suppose v 2 >v 3 and σ. By appealing to the result derived in equation (), we get:!! σ 7
18 P (2 {, 2, 3}) v + v3. + v3 + + σ v3 σ (2) We know for v 2 >v 3 : e v 3 e v 3 < e v 2 e 0, as σ and thus, from equation (2), we get: (v 2 ) Pr (2 {, 2, 3}) (v )+(v 2 ), as σ (One could derive a similar result be assuming that v 3 >v 2 ). This equation tells us that in the GEV model, if choices 2 and 3 are very similar, if utility from 2 is greater than that from 3, then choice 3 gets disregarded (same as in Equation 9 earlier), which agrees with our intuition. Finally, supposing that v 2 v 3,weget: G e v + + σ e v + 2 (v )+2 σ (v 2 ). Thus: σ 8
19 (v 2 )2 σ Pr (2 {, 2, 3}) (v )+2 σ (v 2 ) v 2 2 σ v +2v 2 σ lim Pr (2 {, 2, 3}) (v 2 ) 2 (v ) + (v 2 ) This final equation tells us if the characteristics are identical in the nested logit model, then the probability, in the three choice case, of choosing one of the two identical choices is equal half the probability of the two choice case, which is again what is intuitively ected. Thus the nested logit (GEV) model is able to avoid the key limitation of the conditional/multinomial logit imposed by the IIA assumption. References [] Maddala, Limited-Dependent and Qualitative Variables in Econometrics,983, chapters 2 & 3. [2] Greene, Econometric Analysis, 2000, chapter 9. 9
Probabilistic Choice Models
Probabilistic Choice Models James J. Heckman University of Chicago Econ 312 This draft, March 29, 2006 This chapter examines dierent models commonly used to model probabilistic choice, such as eg the choice
More informationI. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as
Econ 513, USC, Fall 2005 Lecture 15 Discrete Response Models: Multinomial, Conditional and Nested Logit Models Here we focus again on models for discrete choice with more than two outcomes We assume that
More informationGoals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model
Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1 Tetsuya Matsubayashi University of North Texas November 2, 2010 Random utility model Multinomial logit model Conditional logit model
More informationAn Overview of Choice Models
An Overview of Choice Models Dilan Görür Gatsby Computational Neuroscience Unit University College London May 08, 2009 Machine Learning II 1 / 31 Outline 1 Overview Terminology and Notation Economic vs
More informationIntroduction to Discrete Choice Models
Chapter 7 Introduction to Dcrete Choice Models 7.1 Introduction It has been mentioned that the conventional selection bias model requires estimation of two structural models, namely the selection model
More informationLimited Dependent Variable Models II
Limited Dependent Variable Models II Fall 2008 Environmental Econometrics (GR03) LDV Fall 2008 1 / 15 Models with Multiple Choices The binary response model was dealing with a decision problem with two
More informationEconometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit
Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables
More informationHiroshima University 2) The University of Tokyo
September 25-27, 2015 The 14th Summer course for Behavior Modeling in Transportation Networks @The University of Tokyo Advanced behavior models Recent development of discrete choice models Makoto Chikaraishi
More informationGoals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2. Recap: MNL. Recap: MNL
Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2 Tetsuya Matsubayashi University of North Texas November 9, 2010 Learn multiple responses models that do not require the assumption
More informationNotes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market
Notes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market Heckman and Sedlacek, JPE 1985, 93(6), 1077-1125 James Heckman University of Chicago
More informationh=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio
Multivariate Response Models The response variable is unordered and takes more than two values. The term unordered refers to the fact that response 3 is not more favored than response 2. One choice from
More informationA Model of Human Capital Accumulation and Occupational Choices. A simplified version of Keane and Wolpin (JPE, 1997)
A Model of Human Capital Accumulation and Occupational Choices A simplified version of Keane and Wolpin (JPE, 1997) We have here three, mutually exclusive decisions in each period: 1. Attend school. 2.
More informationState-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Fin. Econometrics / 53
State-space Model Eduardo Rossi University of Pavia November 2014 Rossi State-space Model Fin. Econometrics - 2014 1 / 53 Outline 1 Motivation 2 Introduction 3 The Kalman filter 4 Forecast errors 5 State
More informationLecture 6: Discrete Choice: Qualitative Response
Lecture 6: Instructor: Department of Economics Stanford University 2011 Types of Discrete Choice Models Univariate Models Binary: Linear; Probit; Logit; Arctan, etc. Multinomial: Logit; Nested Logit; GEV;
More informationBinary choice. Michel Bierlaire
Binary choice Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale de Lausanne M. Bierlaire (TRANSP-OR ENAC EPFL)
More informationChapter 3 Choice Models
Chapter 3 Choice Models 3.1 Introduction This chapter describes the characteristics of random utility choice model in a general setting, specific elements related to the conjoint choice context are given
More informationChoice Theory. Matthieu de Lapparent
Choice Theory Matthieu de Lapparent matthieu.delapparent@epfl.ch Transport and Mobility Laboratory, School of Architecture, Civil and Environmental Engineering, Ecole Polytechnique Fédérale de Lausanne
More informationLecture Notes 721: Discrete Choice Modeling. Petra E. Todd
Lecture Notes 721: Discrete Choice Modeling Petra E. Todd Fall, 2016 2 Contents i ii CONTENTS Chapter 1 Discrete choice modeling Consider the classical regression model: y = xβ + ε E(ε x = 0 ε N(0, 2 This
More informationBinary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment
BINARY CHOICE MODELS Y ( Y ) ( Y ) 1 with Pr = 1 = P = 0 with Pr = 0 = 1 P Examples: decision-making purchase of durable consumer products unemployment Estimation with OLS? Yi = Xiβ + εi Problems: nonsense
More informationEcon 673: Microeconometrics
Econ 673: Microeconometrics Chapter 4: Properties of Discrete Choice Models Fall 2008 Herriges (ISU) Chapter 4: Discrete Choice Models Fall 2008 1 / 29 Outline 1 2 Deriving Choice Probabilities 3 Identification
More informationUsing Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models
Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models James J. Heckman and Salvador Navarro The University of Chicago Review of Economics and Statistics 86(1)
More informationState-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Financial Econometrics / 49
State-space Model Eduardo Rossi University of Pavia November 2013 Rossi State-space Model Financial Econometrics - 2013 1 / 49 Outline 1 Introduction 2 The Kalman filter 3 Forecast errors 4 State smoothing
More informationECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2016 Instructor: Victor Aguirregabiria
ECOOMETRICS II (ECO 24S) University of Toronto. Department of Economics. Winter 26 Instructor: Victor Aguirregabiria FIAL EAM. Thursday, April 4, 26. From 9:am-2:pm (3 hours) ISTRUCTIOS: - This is a closed-book
More informationLecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo
Lecture 1 Behavioral Models Multinomial Logit: Power and limitations Cinzia Cirillo 1 Overview 1. Choice Probabilities 2. Power and Limitations of Logit 1. Taste variation 2. Substitution patterns 3. Repeated
More informationGrothendieck s Inequality
Grothendieck s Inequality Leqi Zhu 1 Introduction Let A = (A ij ) R m n be an m n matrix. Then A defines a linear operator between normed spaces (R m, p ) and (R n, q ), for 1 p, q. The (p q)-norm of A
More informationdisc choice5.tex; April 11, ffl See: King - Unifying Political Methodology ffl See: King/Tomz/Wittenberg (1998, APSA Meeting). ffl See: Alvarez
disc choice5.tex; April 11, 2001 1 Lecture Notes on Discrete Choice Models Copyright, April 11, 2001 Jonathan Nagler 1 Topics 1. Review the Latent Varible Setup For Binary Choice ffl Logit ffl Likelihood
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationQuantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be
Quantile methods Class Notes Manuel Arellano December 1, 2009 1 Unconditional quantiles Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be Q τ (Y ) q τ F 1 (τ) =inf{r : F
More informationCourse 1 Solutions November 2001 Exams
Course Solutions November Exams . A For i =,, let R = event that a red ball is drawn form urn i i B = event that a blue ball is drawn from urn i. i Then if x is the number of blue balls in urn, ( R R)
More informationFinal Exam. Economics 835: Econometrics. Fall 2010
Final Exam Economics 835: Econometrics Fall 2010 Please answer the question I ask - no more and no less - and remember that the correct answer is often short and simple. 1 Some short questions a) For each
More informationSimulation. Li Zhao, SJTU. Spring, Li Zhao Simulation 1 / 19
Simulation Li Zhao, SJTU Spring, 2017 Li Zhao Simulation 1 / 19 Introduction Simulation consists of drawing from a density, calculating a statistic for each draw, and averaging the results. Simulation
More informationMath 4263 Homework Set 1
Homework Set 1 1. Solve the following PDE/BVP 2. Solve the following PDE/BVP 2u t + 3u x = 0 u (x, 0) = sin (x) u x + e x u y = 0 u (0, y) = y 2 3. (a) Find the curves γ : t (x (t), y (t)) such that that
More informationNon-linear panel data modeling
Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1
More informationReview: mostly probability and some statistics
Review: mostly probability and some statistics C2 1 Content robability (should know already) Axioms and properties Conditional probability and independence Law of Total probability and Bayes theorem Random
More informationSimulation-Based Estimation with Applications in S-PLUS to Probabilistic Discrete Choice Models and Continuous-Time Financial Models.
Simulation-Based Estimation with Applications in S-PLUS to Probabilistic Discrete Choice Models and Continuous-Time Financial Models Eric Zivot Department of Economics, University of Washington and Insightful
More informationECON 594: Lecture #6
ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was
More informationStochastic Processes. M. Sami Fadali Professor of Electrical Engineering University of Nevada, Reno
Stochastic Processes M. Sami Fadali Professor of Electrical Engineering University of Nevada, Reno 1 Outline Stochastic (random) processes. Autocorrelation. Crosscorrelation. Spectral density function.
More informationControlling for Correlation Across Choice Occasions and Sites in a Repeated Mixed Logit Model of Recreation Demand *
Controlling for Correlation Across Choice Occasions and Sites in a Repeated Mixed Logit Model of Recreation Demand * by Joseph A. Herriges ** DEPARTMENT OF ECONOMICS, IOWA STATE UNIVERSITY, 60 HEADY HALL,
More informationRandom Variables and Expectations
Inside ECOOMICS Random Variables Introduction to Econometrics Random Variables and Expectations A random variable has an outcome that is determined by an experiment and takes on a numerical value. A procedure
More informationEconometrics Summary Algebraic and Statistical Preliminaries
Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L
More informationLecture-20: Discrete Choice Modeling-I
Lecture-20: Discrete Choice Modeling-I 1 In Today s Class Introduction to discrete choice models General formulation Binary choice models Specification Model estimation Application Case Study 2 Discrete
More informationMaximum Likelihood and. Limited Dependent Variable Models
Maximum Likelihood and Limited Dependent Variable Models Michele Pellizzari IGIER-Bocconi, IZA and frdb May 24, 2010 These notes are largely based on the textbook by Jeffrey M. Wooldridge. 2002. Econometric
More informationECON 721 Lecture Notes
ECON 721 Lecture Notes P. Todd Spring, 2007 1 Introduction to discrete choice modeling Classical regression model: y xβ + ε E(ε x 0 ε N(0,σ 2 This model is untenable if y is discrete. To model discrete
More informationVALUATION USING HOUSEHOLD PRODUCTION LECTURE PLAN 15: APRIL 14, 2011 Hunt Allcott
VALUATION USING HOUSEHOLD PRODUCTION 14.42 LECTURE PLAN 15: APRIL 14, 2011 Hunt Allcott PASTURE 1: ONE SITE Introduce intuition via PowerPoint slides Draw demand curve with nearby and far away towns. Question:
More informationNormalization and correlation of cross-nested logit models
Normalization and correlation of cross-nested logit models E. Abbe, M. Bierlaire, T. Toledo Lab. for Information and Decision Systems, Massachusetts Institute of Technology Inst. of Mathematics, Ecole
More informationThe 17 th Behavior Modeling Summer School
The 17 th Behavior Modeling Summer School September 14-16, 2017 Introduction to Discrete Choice Models Giancarlos Troncoso Parady Assistant Professor Urban Transportation Research Unit Department of Urban
More informationDiscrete Dependent Variable Models
Discrete Dependent Variable Models James J. Heckman University of Chicago This draft, April 10, 2006 Here s the general approach of this lecture: Economic model Decision rule (e.g. utility maximization)
More informationEstimation of Treatment Effects under Essential Heterogeneity
Estimation of Treatment Effects under Essential Heterogeneity James Heckman University of Chicago and American Bar Foundation Sergio Urzua University of Chicago Edward Vytlacil Columbia University March
More informationFor a stochastic process {Y t : t = 0, ±1, ±2, ±3, }, the mean function is defined by (2.2.1) ± 2..., γ t,
CHAPTER 2 FUNDAMENTAL CONCEPTS This chapter describes the fundamental concepts in the theory of time series models. In particular, we introduce the concepts of stochastic processes, mean and covariance
More informationA Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008
A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated
More informationStatistics II. Management Degree Management Statistics IIDegree. Statistics II. 2 nd Sem. 2013/2014. Management Degree. Simple Linear Regression
Model 1 2 Ordinary Least Squares 3 4 Non-linearities 5 of the coefficients and their to the model We saw that econometrics studies E (Y x). More generally, we shall study regression analysis. : The regression
More informationA DARK GREY P O N T, with a Switch Tail, and a small Star on the Forehead. Any
Y Y Y X X «/ YY Y Y ««Y x ) & \ & & } # Y \#$& / Y Y X» \\ / X X X x & Y Y X «q «z \x» = q Y # % \ & [ & Z \ & { + % ) / / «q zy» / & / / / & x x X / % % ) Y x X Y $ Z % Y Y x x } / % «] «] # z» & Y X»
More informationMultinomial Discrete Choice Models
hapter 2 Multinomial Discrete hoice Models 2.1 Introduction We present some discrete choice models that are applied to estimate parameters of demand for products that are purchased in discrete quantities.
More informationMA5219 LOGIC AND FOUNDATION OF MATHEMATICS I. Lecture 1: Programs. Tin Lok Wong 13 August, 2018
MA5219 LOGIC AND FOUNDATION OF MATHEMATICS I Lecture 1: Programs Tin Lok Wong 13 August, 2018 The aim of this lecture is to characterize S N : some algorithm can tell whether n N belongs to S}. By an algorithm
More informationAxioms of Kleene Algebra
Introduction to Kleene Algebra Lecture 2 CS786 Spring 2004 January 28, 2004 Axioms of Kleene Algebra In this lecture we give the formal definition of a Kleene algebra and derive some basic consequences.
More informationRewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35
Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate
More informationWhat can we learn about correlations from multinomial probit estimates?
What can we learn about correlations from multinomial probit estimates? Chiara Monfardini J.M.C. Santos Silva February 2006 Abstract It is well known that, in a multinomial probit, only the covariance
More informationAssortment Optimization under Variants of the Nested Logit Model
Assortment Optimization under Variants of the Nested Logit Model James M. Davis School of Operations Research and Information Engineering, Cornell University, Ithaca, New York 14853, USA jmd388@cornell.edu
More informationESTIMATION THEORY. Chapter Estimation of Random Variables
Chapter ESTIMATION THEORY. Estimation of Random Variables Suppose X,Y,Y 2,...,Y n are random variables defined on the same probability space (Ω, S,P). We consider Y,...,Y n to be the observed random variables
More informationA New Generalized Gumbel Copula for Multivariate Distributions
A New Generalized Gumbel Copula for Multivariate Distributions Chandra R. Bhat* The University of Texas at Austin Department of Civil, Architectural & Environmental Engineering University Station, C76,
More informationPanel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:
Panel Data Exercises Manuel Arellano Exercise 1 Using panel data, a researcher considers the estimation of the following system: y 1t = α 1 + βx 1t + v 1t. (t =1,..., T ) y Nt = α N + βx Nt + v Nt where
More informationTobit and Selection Models
Tobit and Selection Models Class Notes Manuel Arellano November 24, 2008 Censored Regression Illustration : Top-coding in wages Suppose Y log wages) are subject to top coding as is often the case with
More informationEcon 424 Time Series Concepts
Econ 424 Time Series Concepts Eric Zivot January 20 2015 Time Series Processes Stochastic (Random) Process { 1 2 +1 } = { } = sequence of random variables indexed by time Observed time series of length
More informationECON 4160, Autumn term Lecture 1
ECON 4160, Autumn term 2017. Lecture 1 a) Maximum Likelihood based inference. b) The bivariate normal model Ragnar Nymoen University of Oslo 24 August 2017 1 / 54 Principles of inference I Ordinary least
More informationAsymptotic Statistics-III. Changliang Zou
Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (
More informationComments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006
Comments on: Panel Data Analysis Advantages and Challenges Manuel Arellano CEMFI, Madrid November 2006 This paper provides an impressive, yet compact and easily accessible review of the econometric literature
More informationImbens/Wooldridge, Lecture Notes 11, NBER, Summer 07 1
Imbens/Wooldridge, Lecture Notes 11, NBER, Summer 07 1 What s New in Econometrics NBER, Summer 2007 Lecture 11, Wednesday, Aug 1st, 9.00-10.30am Discrete Choice Models 1. Introduction In this lecture we
More informationAssortment Optimization under Variants of the Nested Logit Model
WORKING PAPER SERIES: NO. 2012-2 Assortment Optimization under Variants of the Nested Logit Model James M. Davis, Huseyin Topaloglu, Cornell University Guillermo Gallego, Columbia University 2012 http://www.cprm.columbia.edu
More informationLatent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent
Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary
More informationECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Winter 2017 TOPIC 3: MULTINOMIAL CHOICE MODELS
ECONOMETRICS II (ECO 2401) Victor Aguirregabiria Winter 2017 TOPIC 3: MULTINOMIAL CHOICE MODELS 1. Introduction 2. Nonparametric model 3. Random Utility Models - De nition; - Common Speci cation and Normalizations;
More informationGreat Expectations. Part I: On the Customizability of Generalized Expected Utility*
Great Expectations. Part I: On the Customizability of Generalized Expected Utility* Francis C. Chu and Joseph Y. Halpern Department of Computer Science Cornell University Ithaca, NY 14853, U.S.A. Email:
More informationPart I Behavioral Models
Part I Behavioral Models 2 Properties of Discrete Choice Models 2.1 Overview This chapter describes the features that are common to all discrete choice models. We start by discussing the choice set, which
More informationLecture 4: Least Squares (LS) Estimation
ME 233, UC Berkeley, Spring 2014 Xu Chen Lecture 4: Least Squares (LS) Estimation Background and general solution Solution in the Gaussian case Properties Example Big picture general least squares estimation:
More informationWageningen Summer School in Econometrics. The Bayesian Approach in Theory and Practice
Wageningen Summer School in Econometrics The Bayesian Approach in Theory and Practice September 2008 Slides for Lecture on Qualitative and Limited Dependent Variable Models Gary Koop, University of Strathclyde
More informationFair Divsion in Theory and Practice
Fair Divsion in Theory and Practice Ron Cytron (Computer Science) Maggie Penn (Political Science) Lecture 6-b: Arrow s Theorem 1 Arrow s Theorem The general question: Given a collection of individuals
More informationLecture notes: Rust (1987) Economics : M. Shum 1
Economics 180.672: M. Shum 1 Estimate the parameters of a dynamic optimization problem: when to replace engine of a bus? This is a another practical example of an optimal stopping problem, which is easily
More informationLecture Notes 1: Decisions and Data. In these notes, I describe some basic ideas in decision theory. theory is constructed from
Topics in Data Analysis Steven N. Durlauf University of Wisconsin Lecture Notes : Decisions and Data In these notes, I describe some basic ideas in decision theory. theory is constructed from The Data:
More informationAdditional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix)
Additional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix Flavio Cunha The University of Pennsylvania James Heckman The University
More information4.8 Instrumental Variables
4.8. INSTRUMENTAL VARIABLES 35 4.8 Instrumental Variables A major complication that is emphasized in microeconometrics is the possibility of inconsistent parameter estimation due to endogenous regressors.
More informationHypothesis Testing. Econ 690. Purdue University. Justin L. Tobias (Purdue) Testing 1 / 33
Hypothesis Testing Econ 690 Purdue University Justin L. Tobias (Purdue) Testing 1 / 33 Outline 1 Basic Testing Framework 2 Testing with HPD intervals 3 Example 4 Savage Dickey Density Ratio 5 Bartlett
More informationReduced Error Logistic Regression
Reduced Error Logistic Regression Daniel M. Rice, Ph.D. Rice Analytics, St. Louis, MO Classification Society Annual Conference June 11, 2009 Copyright, Rice Analytics 2009, All Rights Reserved Rice Analytics
More informationCS286.2 Lecture 8: A variant of QPCP for multiplayer entangled games
CS286.2 Lecture 8: A variant of QPCP for multiplayer entangled games Scribe: Zeyu Guo In the first lecture, we saw three equivalent variants of the classical PCP theorems in terms of CSP, proof checking,
More information2. We care about proportion for categorical variable, but average for numerical one.
Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is
More informationAGEC 661 Note Fourteen
AGEC 661 Note Fourteen Ximing Wu 1 Selection bias 1.1 Heckman s two-step model Consider the model in Heckman (1979) Y i = X iβ + ε i, D i = I {Z iγ + η i > 0}. For a random sample from the population,
More informationA Constructive Proof of the Nash Bargaining Solution
Working Paper 14/0 /01 A Constructive Proof of the Nash Bargaining Solution Luís Carvalho ISCTE - Instituto Universitário de Lisboa BRU-IUL Unide-IUL) Av. Forças Armadas 1649-126 Lisbon-Portugal http://bru-unide.iscte.pt/
More informationConstrained Assortment Optimization for the Nested Logit Model
Constrained Assortment Optimization for the Nested Logit Model Guillermo Gallego Department of Industrial Engineering and Operations Research Columbia University, New York, New York 10027, USA gmg2@columbia.edu
More informationLogistic Regression. Robot Image Credit: Viktoriya Sukhanova 123RF.com
Logistic Regression These slides were assembled by Eric Eaton, with grateful acknowledgement of the many others who made their course materials freely available online. Feel free to reuse or adapt these
More informationMaximum Likelihood Methods
Maximum Likelihood Methods Some of the models used in econometrics specify the complete probability distribution of the outcomes of interest rather than just a regression function. Sometimes this is because
More informationCS8803: Statistical Techniques in Robotics Byron Boots. Hilbert Space Embeddings
CS8803: Statistical Techniques in Robotics Byron Boots Hilbert Space Embeddings 1 Motivation CS8803: STR Hilbert Space Embeddings 2 Overview Multinomial Distributions Marginal, Joint, Conditional Sum,
More informationFall, 2007 Nonlinear Econometrics. Theory: Consistency for Extremum Estimators. Modeling: Probit, Logit, and Other Links.
14.385 Fall, 2007 Nonlinear Econometrics Lecture 2. Theory: Consistency for Extremum Estimators Modeling: Probit, Logit, and Other Links. 1 Example: Binary Choice Models. The latent outcome is defined
More informationSHEBA. Experiential Learning For Non Compensatory Choice. Khaled Boughanmi Asim Ansari Rajeev Kohli. Columbia Business School
SHEBA Experiential Learning For Non Compensatory Choice Khaled Boughanmi Asim Ansari Rajeev Kohli Columbia Business School 1 Research Questions 1. How is learning under non-compensatory decision rules
More informationSolutions to Homework Set #5 (Prepared by Lele Wang) MSE = E [ (sgn(x) g(y)) 2],, where f X (x) = 1 2 2π e. e (x y)2 2 dx 2π
Solutions to Homework Set #5 (Prepared by Lele Wang). Neural net. Let Y X + Z, where the signal X U[,] and noise Z N(,) are independent. (a) Find the function g(y) that minimizes MSE E [ (sgn(x) g(y))
More informationM E 320 Professor John M. Cimbala Lecture 10
M E 320 Professor John M. Cimbala Lecture 10 Today, we will: Finish our example problem rates of motion and deformation of fluid particles Discuss the Reynolds Transport Theorem (RTT) Show how the RTT
More informationReview (Probability & Linear Algebra)
Review (Probability & Linear Algebra) CE-725 : Statistical Pattern Recognition Sharif University of Technology Spring 2013 M. Soleymani Outline Axioms of probability theory Conditional probability, Joint
More informationNinth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"
Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric
More informationTechnical Note: Capacitated Assortment Optimization under the Multinomial Logit Model with Nested Consideration Sets
Technical Note: Capacitated Assortment Optimization under the Multinomial Logit Model with Nested Consideration Sets Jacob Feldman Olin Business School, Washington University, St. Louis, MO 63130, USA
More information1 Exercises for lecture 1
1 Exercises for lecture 1 Exercise 1 a) Show that if F is symmetric with respect to µ, and E( X )
More informationMathematics for Economics ECON MA/MSSc in Economics-2017/2018. Dr. W. M. Semasinghe Senior Lecturer Department of Economics
Mathematics for Economics ECON 53035 MA/MSSc in Economics-2017/2018 Dr. W. M. Semasinghe Senior Lecturer Department of Economics MATHEMATICS AND STATISTICS LERNING OUTCOMES: By the end of this course unit
More informationL2: Review of probability and statistics
Probability L2: Review of probability and statistics Definition of probability Axioms and properties Conditional probability Bayes theorem Random variables Definition of a random variable Cumulative distribution
More informationP1: GEM/IKJ P2: GEM/IKJ QC: GEM/ABE T1: GEM CB495-05Drv CB495/Train KEY BOARDED August 20, :28 Char Count= 0
5 Probit 5.1 Choice Probabilities The logit model is limited in three important ways. It cannot represent random taste variation. It exhibits restrictive substitution patterns due to the IIA property.
More information