Probabilistic Choice Models
|
|
- Loraine Watkins
- 5 years ago
- Views:
Transcription
1 Probabilistic Choice Models James J. Heckman University of Chicago Econ 312 This draft, March 29, 2006
2 This chapter examines dierent models commonly used to model probabilistic choice, such as eg the choice of one type of transportation from among many choices available to the consumer. Section 1 discusses derivation and limitations of conditional logit models. Section 2 discusses probit models and Section 3 discusses the nested logit (generalized extreme value models), which address some of the limitations of the conditional logit models. 1 The Conditional Logit Model In this section we investigate conditional logit models. We discuss its derivation from a random utility model (Section 1.2) with Extreme Value Type I distributed shocks. The relevant 1
3 properties of the Extreme Value Type I distribution are discussed in Section 1.1. We also derive the conditional logit model from the Luce axioms (Section 1.3). In Section 1.4, we discuss some of the limitations of the conditional logit models. 1.1 The Extreme Value Type I Distribution Suppose is independent (not necessarily identical) Extreme ValueTypeIrandomvariable.ThentheCDFof is: 1 Pr( )= () =exp( exp ( ( + ))) 1 McFadden (1974) mistakenly referred to this as the Weibull distribution, and this misclassification shows up in the literature. 2
4 where is a parameter of the Extreme Value Type I CDF. Also, by the assumption of independence, we can write: ( 1 2 )= Y ( )= Y exp ( exp ( ( + ))) =1 =1 The Extreme Value Type I distribution has two useful features. First, the dierence between two Extreme Value Type I random variables is a logit. Second, Extreme Value Type Is are closed under maximization, since (assuming independence): 3
5 ³ Pr max{ } Y = Pr( ) = =1 Y exp ( exp ( ( + ))) =1 Ã Ã X!! = exp exp ( ( + )) =1 Ã! X = exp exp () exp( ) (1) =1 4
6 P Consider exp( ) We can solve for in the following equation: =1 which implies: X exp( )=exp() =1 =log à X! exp( ) =1 We can then substitute this value of into equation (1) to get: ³ Pr max{ } = exp( (exp ()) exp()) = exp( exp ( ( + ))) which is indeed a Extreme Value Type I random variable. 5
7 1.2 Random Utility Model An individual with characteristics has a choice set ;with element, is a feasible set. We write: Pr ( ) as the probability that a person of characteristics chooses from the feasible set. We also suppose that: ( ) =( )+( ) where is independent Extreme Value Type I. From our information on Extreme Value Type Is in section 1, we know that +,(andthus ),hasanextremevaluetypeidistribution 6
8 with parameter, as shown below: () =Pr( + ) = Pr( ) = exp( exp ( ( + ))) Letusnowsupposethattherearetwogoodsandtwocorresponding utilities. Consumers govern their choices by the obvious decision rule: choose good one if 1 2 More generally, if there are goods, then good will be selected if argmax { } =1. Specifically, in our two good case: Pr (1 is chosen) =Pr( 1 2 )=Pr( ) 7
9 Imposing that is independent Extreme Value Type I, we can be much more precise about this probability: Pr ( ) (2) = Pr( ) = = Z Z ( 1 ) μz ( 2 ) 2 1 ( 1 )exp( exp ( )) 1 Observe that ( 1 )=exp(exp ( )) which implies: ( 1 ) = ( 1) 1 = exp (exp ( )) (exp ( )) = exp ( )(exp(exp ( ))) 8
10 Substituting this into equation(2), gives us: Pr (1 ischosen) = Z exp ( )(exp( exp ( ))) = 1 exp ( exp ( )) 1 Z 1 [ exp( 1 )][exp( 1 )exp ( )] 1 9
11 = exp( 1 ) = = 1 exp ( 1 )+exp( ) [ exp( 1 )][exp( 1 )exp ( )] exp ( 1 ) exp ( 1 )+exp( ) exp( 1 1 ) exp( 1 1 )+exp( 2 2 ) This result generalizes, because the max over ( 1) choices is still an Extreme Value Type I, so we can make a two stage maximization argument, as follows: 10
12 Pr ( =12 ) μ = Pr max ( + ) =2 = = exp( 1 1 ) exp( 1 1 )+exp( 2 2 )+ +exp( ) exp( 1 ) P exp ( ) =1 where = This type of model of probabilistic choice is called a conditional or multinomial logit model. The dierence between conditional and multinomial is simply that in the conditional 11
13 logit case, the values of the variables (usually choice characteristics) vary across the choices, while the parameters are common across the choices. In the multinomial logit case, the values of the variables are common across choices for the same person (usually individual characteristics) but the parameters vary across choices. For e.g. we have in the linear case, the probability of individual making choice from among m choices is: Conditional Logit case: P = exp(0 ) P exp( 0 ) =1,where is the vector of values of characteristics of choice as perceived by individual. 12
14 Multinomial Logit case: P = exp(0 ) P exp( 0 ) =1,where is a vector of individual characteristics for individual. Note that we can easily combine the two cases under one model, as described below: Generalized case: We can combine the conditional and multinomial logit models by generalizing either one of the two types of models. For eg, we could permit the coecients in the multinomial logit case to depend on choice characteristics, ie have: =
15 Then we get the generalized case, where the probability of choice by individual depends on both individual as well as choice characteristics (as well as interaction terms) 2 : = exp(0 ) P exp( 0 ) =1 = exp(0 + 0 ) P exp( ) =1 We could similarly modify the coecients in the conditional logit case to obtain the generalized version. 2 Note that by allowing (the vector of individual characteristics) to have an intercept column of 1s, the model would have pure choice characteristic eects, in addition to the interaction terms. 14
16 1.3 Derivation of Logit from the Luce Axioms We will now show how the conditional logit can be derived from the random utility model discussed in Section 1.2 and theluceaxiomspresentedbelow Luce Axioms Axiom 1: Independence of Irrelevant Alternatives(IIA) Suppose that Then, Pr ( { })Pr( ) =Pr( { })Pr( ) 15
17 or, we have: Pr ( { }) Pr ( { }) = Pr ( ) Pr ( ) The term on the left is the odds ratio; the ratio of probabilities of choosing to given characteristics and { } This axiom has been named Independence of Irrelevant Alternatives for an obvious reason the odds of our choice are not eected by adding additional alternatives. Note that this assumes that the additional choices entering in B aect probability of choosing in the same manner as they aect the probability of choosing ; implicitly we are assuming that the additional choices have equivalent relationship with choice and choice. We will see how this assumption is a limita- 16
18 tion, in Section 1.4 below. Axiom 2: Positivity This axiom states that the probability of choosing any one of the choices is strictly greater than zero: Pr ( ) Derivation of logit With the Luce assumptions set out in the preceeding section, we can now proceed to our derivation of the logit. Define =Pr( { }). Then by Axiom 1 above, we know: μ Pr ( ) =Pr( ) (3) 17
19 Summing over, weget: Pr ( ) X μ = 1 = Pr ( ) = P 1 μ (4) Again using Axiom 1, for : μ Pr ( ) =Pr( ) μ Pr ( ) =Pr( ) (5a) (5b) Substituting these in equation (3), we get: 18
20 μ μ = Pr ( ) Pr ( ) = μ Pr ( ) Pr ( ) = (6) Now, in terms of the random utility model discussed in Section 1.2, define the mean utility of a person with characteristics choosing from set { } as: ( ) ln = =exp(( )) 19
21 Define a comparable expression for. Replacing this into equation (6) produces: = Then from equation (4), we get: Pr ( ) = = = exp (( )) exp (( )) 1 μ P exp (( )) exp (( )) 1 ³ 1 P exp(()) (exp (( ))) P exp (( )) (exp (( ))) 20
22 Assume additionally, additive separability of ( ) as follows: ( ) =( ) ( ) Note that this is equivalent to assuming irrelevance of the benchmark. From this assumption, we get: Pr ( ) = = = P exp (( ) ( )) (exp (( ) ( ))) exp ( )exp(( )) ³ P exp (( )) exp (( )) exp ( ) P exp (( )) (7) 21
23 which gives the multinomial logit. McFadden (1974) shows that Luce Axioms and a condition on ( Translation Completeness ) produce the Extreme Value Type I (which he mistakenly referred to as the Weibull). 1.4 Consequences of Independence: Limitations of Logit Models We just showed that: = exp( ) P exp( ) 22
24 so that: = exp( ) P exp( ) exp( ) P exp( ) = exp( μ ) exp( ) =exp( ) ln A common specification for is = Thus: μ μ ln ln =( ) = = or, changes in characteristics have a common eect on the ratio of log probabilities. This allows for estimation of the probabilities of purchasing a new good. (One could obtain an 23
25 estimate of from the existing goods. This estimate can then be combined with the characteristics,, of the new good to estimate the probability of selection, as in equation 7). Further, from equation (7): and: Pr (2 {1 2}) = Pr (2 {1 2 3}) = Pr (2 {1 2}) This leads us to a restrictive property of the conditional logit model - we have assumed independence of the when in fact, they may be correlated. This is illustrated by McFadden s famous red bus, blue bus problem: Suppose we are modelling 24
26 transportation choice and our alternatives consist of {car, bus, train}. If the alternatives are replaced by {car, red bus, blue bus}, then we have violated our assumption of dissimilar alternatives; if 2 1 then the event 3 1 is more likely. One can see by the preceding equation that adding more bus colors continually decreases the probability that car travel is chosen. We can deal with the problem of similar alternatives by using the nested logit model (Nested Logit) or the random coecient probit model. 2 Probit: Random Coecients In this section (as in Section 1.4 above), we make asimple linear function of the choice characteristics alone (as discussed 25
27 in Section 1.2, we can easily generalize this to include individual characteristics as well as interactions). Then we have, utility from choice is: = + where: (0 2 ), Moreover, is a random variable, with ( P ), sothat: It follows that: = ( 1 2 ) +( 1 2 ) +( 1 2 ) ( 1 3 ) +( 1 3 ) +( 1 3 ) 0 26
28 Further: Var ( 1 2 ) = ( (1 2 ) ( 1 2 ) 0 (1 2 ) ( 1 2 ) ) = [( 1 2 )( )+( 1 2 )] 0 [( 1 2 )( )+( 1 2 )] ª ½ (1 = 2 )( )( ) 0 ( 1 2 ) 0 +( 1 2 )( 1 2 ) 0 ¾ (since 12 =0). Similarly: = ( 1 2 ) X ( 1 2 ) Var ( 1 3 )=( 1 3 ) X ( 1 3 )
29 Thus: so: Cov ( )=( 1 2 ) X ( 1 3 ) = Corr ( )= ( 1 2 ) P ( 1 3 ) p Var (1 2 ) Var ( 1 3 ) We now seek to derive the probability of choosing good 1 in a three good case: Pr (1 {1 2 3}) =Pr( and 1 3 0) From before, we know that: 1 2 ( 1 2 ) Var ( 1 2 ) 1 3 ( 1 3 ) Var ( 1 3 ) 28
30 Thus: Pr ( and 1 3 0) p Var (1 = Pr 2 ) 1 +( 1 2 ) 0 and p Var ( 1 3 ) 2 +( 1 3 ) 0 where 1 and 2 are standard normal. Thus, the above equation reduces to: à Pr 1 ( 1 2 ) p Var (1 2 ) and 2 ( 1 3 ) p! Var (1 3 ) =Pr à 1 ( 1 2 ) p Var (1 2 ) and 2 ( 1 3 ) p! Var (1 3 ) 29
31 As 1 and 2 may be correlated, we integrate over the joint density to get the probability: = Pr (choosing 1) Z Z 1 2 p where: = ( 1 2 ) p and = ( Var (1 2 ) 1 3 ) p Var (1 3 ) Now consider adding a third good to the two good case, under two alternative scenarios. 30
32 Case 1: Non-random utility, random coecients. If the third good has identical characteristics as the first, then 2 = 3 If there is no stochastic component (no utility innovation), then 2 1 = 2 2 = 2 3 =0 Therefore, in this case: Pr (1 chosen) = Pr( and 1 3 0) = Pr( 1 2 0) Thus, there is no change in the probability of choosing good 1 despite the addition of a third good. Again fo- 31
33 cusing on the two good case, we observe: Pr (1 {12}) = Pr( 1 2 0) Ã = Pr 1 ( 1 2 ) p! Var (1 2 ) = Z 1 2 (12) [( 1 2 ) ( 1 2 ) ]12 exp μ which can be evaluated to derive the desired probability. Case 2: Random utility, non-random coecients. Here we consider a McFadden-Luce type of set up, where one imposes P =0 Defining = p we observe 32
34 that the probability of choosing good 1 in the two-good case is: Z (12) μ μ 1 exp Adding a third good to the scene with identical characteristics, ( 2 = 3 ), yields the probability for good 1 being purchased as: Z ( 1 2 ) Ã Z ( 1 2 ) Ã μ!! 1 2 p 2 1 exp One can show that, upon evaluation of these integrals, the probability derived from addition of the third good is less than the probability in the two good case. This leads us to a similar problem as the multinomial/conditional 33
35 logit adding alternatives decreases the probability of choice, despite the fact that the alternatives are quite similar. Thus, in the probit case we are able to avoid the limitation of the logit models with regard to addition of an identical good, through the covariance structure of the random coecients. As illustrated in case 2, probit models without random coefficients suer from the same limitation. Note that while the richer covariance structure is able to capture the relationship between choices in the probit model, applications involving many choices are practically limited as evaluation of higherorder multivariate normal integrals is dicult (refer discussion in Greene, Section a). 34
36 3 Nested Logit: Generalized Extreme Value (GEV) Model Consider a function ( 1 2 ) where satisfies: i. Non-negativity: ( 1 2 ) 0 ( 1 2 ) 0 ii. Homogeneous of degree 1: ( 1 2 )= ( 1 2 ) iii. Derivative property: if even 0 if odd. 35
37 If satisfies these conditions, then we get the following probability: ( { 1 2 }) = ( 1 2 ) ( 1 2 ) where is a probability that can be derived from utility maximization. We can use the theorem above to derive 36
38 a special case of the nested logit model. Define: (exp( 1 ) exp( 2 ) exp( )) μ μ 2 exp +exp exp( 1 )+ 1 μ + +exp 1 = exp( 1 )+ 3 1 (exp( 2 )) (exp( )) Observe that =0is the ordinary logit model. (With defined in this way, we are assuming that 1 is uncorrelated with all of the other while the remaining may be correlated. The parameter is a kind of measure of correlation between the remaining. It is this correla- 37
39 tion structure that would allow the GEV model to tackle the limitation of the ordinary conditional/multinomial logit models. ). This function obviously meets the conditions for the GEV model. For, i. Non-negativity: obvious as 0 1 ii. Homogeneity: ( exp( 1 )exp( 2 ) exp( )) = exp( 1 )+ ( exp( 2 )) (exp ( )) μ = exp( 1 )+ 1 1 (exp( 2 )) ³ 1 1 (exp ( ))
40 μ μ = exp( 1 )+ exp = Ã exp( 1 )+ μ exp = (exp( 1 ) exp( 2 ) exp( )) + + μ + +exp μ μ exp 1 iii. By inspection, one can see that this derivative property will hold. (It is obvious when dierentiating with respect to exp( 1. For other derivatives, the fact that 0 1 gives the needed alternation in sign. Note that in the definition of the property is analogous to exp( here.) 1 1 1! 39
41 Thus, we can now proceed to derive our probabilities. First, consider: Pr (1 {1 2}) = ³ = which is simply our binomial logit model. Also note that in the three good case: Ã μ μ! = (1 ) exp +exp exp μ Ã μ μ! = exp exp +exp μ
42 Now suppose that we eliminate choice 1 (by letting 1 ). Then: μ μ μ exp( 2 )exp exp + exp Pr (2 {23}) = μ μ exp + exp 1 1 μ 2 exp 1 = μ μ 2 3 exp + exp
43 Observe that: Pr (1 {1 2 3}) = = 1 + ³ ³ = Ã1+ 1 μ 3 2 1! 1 (8) 1 Letting 1 and supposing 2 3 we get: 42
44 μ 3 2 1= μ 3 2 andthusfromequation(8),wehave: as 1 Pr (1 {123}) (9) Conversely, if 3 2 just reverse the roles of 2 and 3 so: Pr (1 {1 2 3}) μ 3 2 = (10) Combining equations (9) and (10), we get, as 1: Pr (1 {1 2 3}) 1 1 +max{ 2 3} (11) 43
45 Equations (9), (10) & (11) imply that in this GEV model, the probability of choice 1 on addition of a choice 3 identical to choice 2, does not necessarily fall, as was the case in the ordinary conditional/multinomial logit case. Equations (9) shows that if the added choice 3 is highly correlated to choice 2( 1) but yields less utility, then the probability in the three choice case reduces to the binomial logit (the probability in the two choice case), with choice 3 dropping out, as one would intuitively expect. What about the probability of choice 2 how does this change when we add an identical choice 3 in this GEV model? To 44
46 answer this, consider: = = = Pr (2 {123}) " ½ μ μ ¾ # μ (1 ) exp +exp exp 2 1 μ μ ¾ ½exp +exp 1 1 μ ½ μ μ ¾ exp exp +exp μ μ ¾ ½exp +exp 1 1 μ 2 exp 1 ½ μ μ ¾! 1 μ μ μ Ã 1 + exp +exp exp +exp
47 When =0, ie when there is no correlation between choice 2 and choice 3, we have ordinary conditional/multinomial logit. Suppose 2 3 and 1. By appealing to the result derived in equation (11), we get: (2 {123}) μ 2 exp = 1 μ μ 2 3 (12) exp +exp 1 1 μ μ exp +exp 1 1 μ μ exp 1 + exp +exp
48 We know for 2 3 : μ μ as 1 and thus, from equation (12), we get: Pr (2 {1 2 3}) exp( 2 ) as 1 exp( 1 )+exp( 2 ) (One could derive a similar result be assuming that 3 2 ) This equation tells us that in the GEV model, if choices 2 and 3 are very similar, if utility from 2 is greater than that from 3, then choice 3 gets disregarded (same as in Equation 9 earlier), which agrees with our intuition. 47
49 Finally, supposing that 2 = 3,weget: μ = exp +exp 1 μ 1 = exp 1 = exp( 1 )+2 1 exp ( 2 ) μ Thus: = exp ( 2 )2 Pr (2 {123}) = exp( 1 )+2 1 exp ( 2 ) exp 2 = 2 exp 1 +2exp 2 lim Pr (2 {123}) 1 exp( 2 ) 1 2 exp( 1 )+exp( 2 ) 48
50 This final equation tells us if the characteristics are identical in the nested logit model, then the probability, in the three choice case, of choosing one of the two identical choices is equal half the probability of the two choice case, which is again what is intuitively expected. Thus the nested logit (GEV) model is able to avoid the key limitation of the conditional/multinomial logit imposed by the IIA assumption. References [1] Maddala, Limited-Dependent and Qualitative Variables in Econometrics,1983, chapters 2 & 3. [2] Greene, Econometric Analysis, 2000, chapter
Probabilistic Choice Models
Econ 3: James J. Heckman Probabilistic Choice Models This chapter examines different models commonly used to model probabilistic choice, such as eg the choice of one type of transportation from among many
More informationDiscrete Dependent Variable Models
Discrete Dependent Variable Models James J. Heckman University of Chicago This draft, April 10, 2006 Here s the general approach of this lecture: Economic model Decision rule (e.g. utility maximization)
More informationAn Overview of Choice Models
An Overview of Choice Models Dilan Görür Gatsby Computational Neuroscience Unit University College London May 08, 2009 Machine Learning II 1 / 31 Outline 1 Overview Terminology and Notation Economic vs
More informationGoals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model
Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1 Tetsuya Matsubayashi University of North Texas November 2, 2010 Random utility model Multinomial logit model Conditional logit model
More informationLimited Dependent Variable Models II
Limited Dependent Variable Models II Fall 2008 Environmental Econometrics (GR03) LDV Fall 2008 1 / 15 Models with Multiple Choices The binary response model was dealing with a decision problem with two
More informationIntroduction to Discrete Choice Models
Chapter 7 Introduction to Dcrete Choice Models 7.1 Introduction It has been mentioned that the conventional selection bias model requires estimation of two structural models, namely the selection model
More informationAbility Bias, Errors in Variables and Sibling Methods. James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006
Ability Bias, Errors in Variables and Sibling Methods James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006 1 1 Ability Bias Consider the model: log = 0 + 1 + where =income, = schooling,
More informationh=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio
Multivariate Response Models The response variable is unordered and takes more than two values. The term unordered refers to the fact that response 3 is not more favored than response 2. One choice from
More informationRewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35
Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate
More informationLabor Supply and the Two-Step Estimator
Labor Supply and the Two-Step Estimator James J. Heckman University of Chicago Econ 312 This draft, April 7, 2006 In this lecture, we look at a labor supply model and discuss various approaches to identify
More informationUsing Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models
Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models James J. Heckman and Salvador Navarro The University of Chicago Review of Economics and Statistics 86(1)
More informationI. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as
Econ 513, USC, Fall 2005 Lecture 15 Discrete Response Models: Multinomial, Conditional and Nested Logit Models Here we focus again on models for discrete choice with more than two outcomes We assume that
More informationNotes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market
Notes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market Heckman and Sedlacek, JPE 1985, 93(6), 1077-1125 James Heckman University of Chicago
More informationHypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006
Hypothesis Testing Part I James J. Heckman University of Chicago Econ 312 This draft, April 20, 2006 1 1 A Brief Review of Hypothesis Testing and Its Uses values and pure significance tests (R.A. Fisher)
More informationEconometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit
Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables
More informationGoals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2. Recap: MNL. Recap: MNL
Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2 Tetsuya Matsubayashi University of North Texas November 9, 2010 Learn multiple responses models that do not require the assumption
More informationBinary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment
BINARY CHOICE MODELS Y ( Y ) ( Y ) 1 with Pr = 1 = P = 0 with Pr = 0 = 1 P Examples: decision-making purchase of durable consumer products unemployment Estimation with OLS? Yi = Xiβ + εi Problems: nonsense
More informationEcon 673: Microeconometrics
Econ 673: Microeconometrics Chapter 4: Properties of Discrete Choice Models Fall 2008 Herriges (ISU) Chapter 4: Discrete Choice Models Fall 2008 1 / 29 Outline 1 2 Deriving Choice Probabilities 3 Identification
More informationFour Parameters of Interest in the Evaluation. of Social Programs. James J. Heckman Justin L. Tobias Edward Vytlacil
Four Parameters of Interest in the Evaluation of Social Programs James J. Heckman Justin L. Tobias Edward Vytlacil Nueld College, Oxford, August, 2005 1 1 Introduction This paper uses a latent variable
More informationMultinomial Discrete Choice Models
hapter 2 Multinomial Discrete hoice Models 2.1 Introduction We present some discrete choice models that are applied to estimate parameters of demand for products that are purchased in discrete quantities.
More informationEcon 424 Time Series Concepts
Econ 424 Time Series Concepts Eric Zivot January 20 2015 Time Series Processes Stochastic (Random) Process { 1 2 +1 } = { } = sequence of random variables indexed by time Observed time series of length
More informationChapter 3 Choice Models
Chapter 3 Choice Models 3.1 Introduction This chapter describes the characteristics of random utility choice model in a general setting, specific elements related to the conjoint choice context are given
More informationHiroshima University 2) The University of Tokyo
September 25-27, 2015 The 14th Summer course for Behavior Modeling in Transportation Networks @The University of Tokyo Advanced behavior models Recent development of discrete choice models Makoto Chikaraishi
More informationLecture 6: Discrete Choice: Qualitative Response
Lecture 6: Instructor: Department of Economics Stanford University 2011 Types of Discrete Choice Models Univariate Models Binary: Linear; Probit; Logit; Arctan, etc. Multinomial: Logit; Nested Logit; GEV;
More informationA Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008
A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated
More informationA Model of Human Capital Accumulation and Occupational Choices. A simplified version of Keane and Wolpin (JPE, 1997)
A Model of Human Capital Accumulation and Occupational Choices A simplified version of Keane and Wolpin (JPE, 1997) We have here three, mutually exclusive decisions in each period: 1. Attend school. 2.
More informationAnalogy Principle. Asymptotic Theory Part II. James J. Heckman University of Chicago. Econ 312 This draft, April 5, 2006
Analogy Principle Asymptotic Theory Part II James J. Heckman University of Chicago Econ 312 This draft, April 5, 2006 Consider four methods: 1. Maximum Likelihood Estimation (MLE) 2. (Nonlinear) Least
More informationLecture-20: Discrete Choice Modeling-I
Lecture-20: Discrete Choice Modeling-I 1 In Today s Class Introduction to discrete choice models General formulation Binary choice models Specification Model estimation Application Case Study 2 Discrete
More informationECON 594: Lecture #6
ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was
More informationProbit Estimation in gretl
Probit Estimation in gretl Quantitative Microeconomics R. Mora Department of Economics Universidad Carlos III de Madrid Outline Introduction 1 Introduction 2 3 The Probit Model and ML Estimation The Probit
More informationIntroduction to Linear Regression Analysis
Introduction to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018 Econometrics: What is it? Interaction of economic theory, observed data and statistical methods. The science of testing
More information2. We care about proportion for categorical variable, but average for numerical one.
Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is
More informationStructural Econometrics: Dynamic Discrete Choice. Jean-Marc Robin
Structural Econometrics: Dynamic Discrete Choice Jean-Marc Robin 1. Dynamic discrete choice models 2. Application: college and career choice Plan 1 Dynamic discrete choice models See for example the presentation
More informationAdditional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix)
Additional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix Flavio Cunha The University of Pennsylvania James Heckman The University
More informationVolume Title: The Interpolation of Time Series by Related Series. Volume URL:
This PDF is a selection from an out-of-print volume from the National Bureau of Economic Research Volume Title: The Interpolation of Time Series by Related Series Volume Author/Editor: Milton Friedman
More informationIntroduction to Computational Finance and Financial Econometrics Probability Theory Review: Part 2
Introduction to Computational Finance and Financial Econometrics Probability Theory Review: Part 2 Eric Zivot July 7, 2014 Bivariate Probability Distribution Example - Two discrete rv s and Bivariate pdf
More informationNinth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"
Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric
More informationLecture Notes 1: Decisions and Data. In these notes, I describe some basic ideas in decision theory. theory is constructed from
Topics in Data Analysis Steven N. Durlauf University of Wisconsin Lecture Notes : Decisions and Data In these notes, I describe some basic ideas in decision theory. theory is constructed from The Data:
More informationNormalization and correlation of cross-nested logit models
Normalization and correlation of cross-nested logit models E. Abbe M. Bierlaire T. Toledo August 5, 25 Abstract We address two fundamental issues associated with the use of cross-nested logit models. On
More informationQuantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be
Quantile methods Class Notes Manuel Arellano December 1, 2009 1 Unconditional quantiles Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be Q τ (Y ) q τ F 1 (τ) =inf{r : F
More informationReduced Error Logistic Regression
Reduced Error Logistic Regression Daniel M. Rice, Ph.D. Rice Analytics, St. Louis, MO Classification Society Annual Conference June 11, 2009 Copyright, Rice Analytics 2009, All Rights Reserved Rice Analytics
More informationMaximum Likelihood (ML) Estimation
Econometrics 2 Fall 2004 Maximum Likelihood (ML) Estimation Heino Bohn Nielsen 1of32 Outline of the Lecture (1) Introduction. (2) ML estimation defined. (3) ExampleI:Binomialtrials. (4) Example II: Linear
More informationControlling for Correlation Across Choice Occasions and Sites in a Repeated Mixed Logit Model of Recreation Demand *
Controlling for Correlation Across Choice Occasions and Sites in a Repeated Mixed Logit Model of Recreation Demand * by Joseph A. Herriges ** DEPARTMENT OF ECONOMICS, IOWA STATE UNIVERSITY, 60 HEADY HALL,
More informationHiroshima University 2) Tokyo University of Science
September 23-25, 2016 The 15th Summer course for Behavior Modeling in Transportation Networks @The University of Tokyo Advanced behavior models Recent development of discrete choice models Makoto Chikaraishi
More informationMaximum Likelihood and. Limited Dependent Variable Models
Maximum Likelihood and Limited Dependent Variable Models Michele Pellizzari IGIER-Bocconi, IZA and frdb May 24, 2010 These notes are largely based on the textbook by Jeffrey M. Wooldridge. 2002. Econometric
More informationAssortment Optimization under Variants of the Nested Logit Model
Assortment Optimization under Variants of the Nested Logit Model James M. Davis School of Operations Research and Information Engineering, Cornell University, Ithaca, New York 14853, USA jmd388@cornell.edu
More informationLecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo
Lecture 1 Behavioral Models Multinomial Logit: Power and limitations Cinzia Cirillo 1 Overview 1. Choice Probabilities 2. Power and Limitations of Logit 1. Taste variation 2. Substitution patterns 3. Repeated
More informationEconometric Modelling Prof. Rudra P. Pradhan Department of Management Indian Institute of Technology, Kharagpur
Econometric Modelling Prof. Rudra P. Pradhan Department of Management Indian Institute of Technology, Kharagpur Module No. # 01 Lecture No. # 28 LOGIT and PROBIT Model Good afternoon, this is doctor Pradhan
More informationGreat Expectations. Part I: On the Customizability of Generalized Expected Utility*
Great Expectations. Part I: On the Customizability of Generalized Expected Utility* Francis C. Chu and Joseph Y. Halpern Department of Computer Science Cornell University Ithaca, NY 14853, U.S.A. Email:
More informationEntry for Daniel McFadden in the New Palgrave Dictionary of Economics
Entry for Daniel McFadden in the New Palgrave Dictionary of Economics 1. Introduction Daniel L. McFadden, the E. Morris Cox Professor of Economics at the University of California at Berkeley, was the 2000
More informationHypothesis Testing. Econ 690. Purdue University. Justin L. Tobias (Purdue) Testing 1 / 33
Hypothesis Testing Econ 690 Purdue University Justin L. Tobias (Purdue) Testing 1 / 33 Outline 1 Basic Testing Framework 2 Testing with HPD intervals 3 Example 4 Savage Dickey Density Ratio 5 Bartlett
More informationGeneralized Linear Models for Non-Normal Data
Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture
More informationECLT 5810 Linear Regression and Logistic Regression for Classification. Prof. Wai Lam
ECLT 5810 Linear Regression and Logistic Regression for Classification Prof. Wai Lam Linear Regression Models Least Squares Input vectors is an attribute / feature / predictor (independent variable) The
More informationChapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a
Chapter 9 Regression with a Binary Dependent Variable Multiple Choice ) The binary dependent variable model is an example of a a. regression model, which has as a regressor, among others, a binary variable.
More informationAnswer Key for STAT 200B HW No. 7
Answer Key for STAT 200B HW No. 7 May 5, 2007 Problem 2.2 p. 649 Assuming binomial 2-sample model ˆπ =.75, ˆπ 2 =.6. a ˆτ = ˆπ 2 ˆπ =.5. From Ex. 2.5a on page 644: ˆπ ˆπ + ˆπ 2 ˆπ 2.75.25.6.4 = + =.087;
More informationSolutions to Odd-Numbered End-of-Chapter Exercises: Chapter 14
Introduction to Econometrics (3 rd Updated Edition) by James H. Stock and Mark W. Watson Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 14 (This version July 0, 014) 015 Pearson Education,
More informationConstrained Assortment Optimization for the Nested Logit Model
Constrained Assortment Optimization for the Nested Logit Model Guillermo Gallego Department of Industrial Engineering and Operations Research Columbia University, New York, New York 10027, USA gmg2@columbia.edu
More informationNormalization and correlation of cross-nested logit models
Normalization and correlation of cross-nested logit models E. Abbe M. Bierlaire T. Toledo Paper submitted to Transportation Research B Final version November 2006 Abstract We address two fundamental issues
More informationdisc choice5.tex; April 11, ffl See: King - Unifying Political Methodology ffl See: King/Tomz/Wittenberg (1998, APSA Meeting). ffl See: Alvarez
disc choice5.tex; April 11, 2001 1 Lecture Notes on Discrete Choice Models Copyright, April 11, 2001 Jonathan Nagler 1 Topics 1. Review the Latent Varible Setup For Binary Choice ffl Logit ffl Likelihood
More informationL2: Review of probability and statistics
Probability L2: Review of probability and statistics Definition of probability Axioms and properties Conditional probability Bayes theorem Random variables Definition of a random variable Cumulative distribution
More informationComparing IRT with Other Models
Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used
More informationEconomics 583: Econometric Theory I A Primer on Asymptotics
Economics 583: Econometric Theory I A Primer on Asymptotics Eric Zivot January 14, 2013 The two main concepts in asymptotic theory that we will use are Consistency Asymptotic Normality Intuition consistency:
More informationMaximum Likelihood Methods
Maximum Likelihood Methods Some of the models used in econometrics specify the complete probability distribution of the outcomes of interest rather than just a regression function. Sometimes this is because
More informationAssortment Optimization under Variants of the Nested Logit Model
WORKING PAPER SERIES: NO. 2012-2 Assortment Optimization under Variants of the Nested Logit Model James M. Davis, Huseyin Topaloglu, Cornell University Guillermo Gallego, Columbia University 2012 http://www.cprm.columbia.edu
More informationChoice Theory. Matthieu de Lapparent
Choice Theory Matthieu de Lapparent matthieu.delapparent@epfl.ch Transport and Mobility Laboratory, School of Architecture, Civil and Environmental Engineering, Ecole Polytechnique Fédérale de Lausanne
More informationA COEFFICIENT OF DETERMINATION FOR LOGISTIC REGRESSION MODELS
A COEFFICIENT OF DETEMINATION FO LOGISTIC EGESSION MODELS ENATO MICELI UNIVESITY OF TOINO After a brief presentation of the main extensions of the classical coefficient of determination ( ), a new index
More informationThe estimation of Generalized Extreme Value models from choice-based samples
The estimation of Generalized Extreme Value models from choice-based samples M. Bierlaire D. Bolduc D. McFadden August 10, 2006 Abstract We consider an estimation procedure for discrete choice models in
More information1/sqrt(B) convergence 1/B convergence B
The Error Coding Method and PICTs Gareth James and Trevor Hastie Department of Statistics, Stanford University March 29, 1998 Abstract A new family of plug-in classication techniques has recently been
More information16/018. Efficiency Gains in Rank-ordered Multinomial Logit Models. June 13, 2016
16/018 Efficiency Gains in Rank-ordered Multinomial Logit Models Arie Beresteanu and Federico Zincenko June 13, 2016 Efficiency Gains in Rank-ordered Multinomial Logit Models Arie Beresteanu and Federico
More informationEstimation of mixed generalized extreme value models
Estimation of mixed generalized extreme value models Michel Bierlaire michel.bierlaire@epfl.ch Operations Research Group ROSO Institute of Mathematics EPFL Katholieke Universiteit Leuven, November 2004
More informationPart I Behavioral Models
Part I Behavioral Models 2 Properties of Discrete Choice Models 2.1 Overview This chapter describes the features that are common to all discrete choice models. We start by discussing the choice set, which
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationNormalization and correlation of cross-nested logit models
Normalization and correlation of cross-nested logit models E. Abbe, M. Bierlaire, T. Toledo Lab. for Information and Decision Systems, Massachusetts Institute of Technology Inst. of Mathematics, Ecole
More informationLinear Classification: Probabilistic Generative Models
Linear Classification: Probabilistic Generative Models Sargur N. University at Buffalo, State University of New York USA 1 Linear Classification using Probabilistic Generative Models Topics 1. Overview
More informationThe 17 th Behavior Modeling Summer School
The 17 th Behavior Modeling Summer School September 14-16, 2017 Introduction to Discrete Choice Models Giancarlos Troncoso Parady Assistant Professor Urban Transportation Research Unit Department of Urban
More informationEPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7
Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review
More information5.5 Yitzhaki Weights as a Version of Theil Weights (1950)
5.5 Yitzhaki Weights as a Version of Theil Weights (195) Another intuition behind the Yitzhaki weights (Yitzhaki, 1989). Sample Size (finite sample) Take Case () =() (propensity score is the instrument).
More informationECON Introductory Econometrics. Lecture 11: Binary dependent variables
ECON4150 - Introductory Econometrics Lecture 11: Binary dependent variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 11 Lecture Outline 2 The linear probability model Nonlinear probability
More informationLDA, QDA, Naive Bayes
LDA, QDA, Naive Bayes Generative Classification Models Marek Petrik 2/16/2017 Last Class Logistic Regression Maximum Likelihood Principle Logistic Regression Predict probability of a class: p(x) Example:
More informationReview (Probability & Linear Algebra)
Review (Probability & Linear Algebra) CE-725 : Statistical Pattern Recognition Sharif University of Technology Spring 2013 M. Soleymani Outline Axioms of probability theory Conditional probability, Joint
More informationMultivariate Versus Multinomial Probit: When are Binary Decisions Made Separately also Jointly Optimal?
Multivariate Versus Multinomial Probit: When are Binary Decisions Made Separately also Jointly Optimal? Dale J. Poirier and Deven Kapadia University of California, Irvine March 10, 2012 Abstract We provide
More informationDepartment of Economics
Department of Economics Two-stage Bargaining Solutions Paola Manzini and Marco Mariotti Working Paper No. 572 September 2006 ISSN 1473-0278 Two-stage bargaining solutions Paola Manzini and Marco Mariotti
More informationECONOMETRICS HONOR S EXAM REVIEW SESSION
ECONOMETRICS HONOR S EXAM REVIEW SESSION Eunice Han ehan@fas.harvard.edu March 26 th, 2013 Harvard University Information 2 Exam: April 3 rd 3-6pm @ Emerson 105 Bring a calculator and extra pens. Notes
More informationOrder choice probabilities in random utility models
Order choice probabilities in random utility models André de Palma Karim Kilani May 1, 2015 Abstract We prove a general identity which states that any element of a tuple (ordered set) can be obtained as
More informationDiscrete choice models Michel Bierlaire Intelligent Transportation Systems Program Massachusetts Institute of Technology URL: http
Discrete choice models Michel Bierlaire Intelligent Transportation Systems Program Massachusetts Institute of Technology E-mail: mbi@mit.edu URL: http://web.mit.edu/mbi/www/michel.html May 23, 1997 Abstract
More informationA short introduc-on to discrete choice models
A short introduc-on to discrete choice models BART Kenneth Train, Discrete Choice Models with Simula-on, Chapter 3. Ques-ons Impact of cost, commu-ng -me, walk -me, transfer -me, number of transfers, distance
More informationFinal Exam. Economics 835: Econometrics. Fall 2010
Final Exam Economics 835: Econometrics Fall 2010 Please answer the question I ask - no more and no less - and remember that the correct answer is often short and simple. 1 Some short questions a) For each
More informationThe Instability of Correlations: Measurement and the Implications for Market Risk
The Instability of Correlations: Measurement and the Implications for Market Risk Prof. Massimo Guidolin 20254 Advanced Quantitative Methods for Asset Pricing and Structuring Winter/Spring 2018 Threshold
More informationSpecification Test on Mixed Logit Models
Specification est on Mixed Logit Models Jinyong Hahn UCLA Jerry Hausman MI December 1, 217 Josh Lustig CRA Abstract his paper proposes a specification test of the mixed logit models, by generalizing Hausman
More informationOutline. Supervised Learning. Hong Chang. Institute of Computing Technology, Chinese Academy of Sciences. Machine Learning Methods (Fall 2012)
Outline Hong Chang Institute of Computing Technology, Chinese Academy of Sciences Machine Learning Methods (Fall 2012) Outline Outline I 1 Linear Models for Regression Linear Regression Probabilistic Interpretation
More informationOrdered Response and Multinomial Logit Estimation
Ordered Response and Multinomial Logit Estimation Quantitative Microeconomics R. Mora Department of Economics Universidad Carlos III de Madrid Outline Introduction 1 Introduction 2 3 Introduction The Ordered
More informationIntroduction: structural econometrics. Jean-Marc Robin
Introduction: structural econometrics Jean-Marc Robin Abstract 1. Descriptive vs structural models 2. Correlation is not causality a. Simultaneity b. Heterogeneity c. Selectivity Descriptive models Consider
More informationdqd: A command for treatment effect estimation under alternative assumptions
UC3M Working Papers Economics 14-07 April 2014 ISSN 2340-5031 Departamento de Economía Universidad Carlos III de Madrid Calle Madrid, 126 28903 Getafe (Spain) Fax (34) 916249875 dqd: A command for treatment
More informationLab 07 Introduction to Econometrics
Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand
More informationWageningen Summer School in Econometrics. The Bayesian Approach in Theory and Practice
Wageningen Summer School in Econometrics The Bayesian Approach in Theory and Practice September 2008 Slides for Lecture on Qualitative and Limited Dependent Variable Models Gary Koop, University of Strathclyde
More informationA New Generalized Gumbel Copula for Multivariate Distributions
A New Generalized Gumbel Copula for Multivariate Distributions Chandra R. Bhat* The University of Texas at Austin Department of Civil, Architectural & Environmental Engineering University Station, C76,
More informationAGEC 661 Note Fourteen
AGEC 661 Note Fourteen Ximing Wu 1 Selection bias 1.1 Heckman s two-step model Consider the model in Heckman (1979) Y i = X iβ + ε i, D i = I {Z iγ + η i > 0}. For a random sample from the population,
More informationEconomics 201B Economic Theory (Spring 2017) Bargaining. Topics: the axiomatic approach (OR 15) and the strategic approach (OR 7).
Economics 201B Economic Theory (Spring 2017) Bargaining Topics: the axiomatic approach (OR 15) and the strategic approach (OR 7). The axiomatic approach (OR 15) Nash s (1950) work is the starting point
More informationCS 340 Lec. 18: Multivariate Gaussian Distributions and Linear Discriminant Analysis
CS 3 Lec. 18: Multivariate Gaussian Distributions and Linear Discriminant Analysis AD March 11 AD ( March 11 1 / 17 Multivariate Gaussian Consider data { x i } N i=1 where xi R D and we assume they are
More informationDiscriminant analysis and supervised classification
Discriminant analysis and supervised classification Angela Montanari 1 Linear discriminant analysis Linear discriminant analysis (LDA) also known as Fisher s linear discriminant analysis or as Canonical
More information