Multinomial Discrete Choice Models

Similar documents
ECON 594: Lecture #6

Introduction to Discrete Choice Models

Chapter 3 Choice Models

Testing for Regime Switching: A Comment

MC3: Econometric Theory and Methods. Course Notes 4

Very simple marginal effects in some discrete choice models *

MIXED MNL MODELS FOR DISCRETE RESPONSE

Probabilistic Choice Models

A Course on Advanced Econometrics

Limited Dependent Variable Models II

6 Mixed Logit. 6.1 Choice Probabilities

Lecture-20: Discrete Choice Modeling-I

Wageningen Summer School in Econometrics. The Bayesian Approach in Theory and Practice

Discrete Dependent Variable Models

A Model of Human Capital Accumulation and Occupational Choices. A simplified version of Keane and Wolpin (JPE, 1997)

h=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio

GMM-based inference in the AR(1) panel data model for parameter values where local identi cation fails

Introduction: structural econometrics. Jean-Marc Robin

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo

Supplemental Material 1 for On Optimal Inference in the Linear IV Model

Lecture Notes on Measurement Error

Quasi-Random Simulation of Discrete Choice Models

Economics 241B Estimation with Instruments

16/018. Efficiency Gains in Rank-ordered Multinomial Logit Models. June 13, 2016

Environmental Econometrics

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008

Index. Pagenumbersfollowedbyf indicate figures; pagenumbersfollowedbyt indicate tables.

Linear models. Linear models are computationally convenient and remain widely used in. applied econometric research

Time Series Models and Inference. James L. Powell Department of Economics University of California, Berkeley

Linear Models in Econometrics

A Note on Demand Estimation with Supply Information. in Non-Linear Models

A Joint Tour-Based Model of Vehicle Type Choice and Tour Length

Specification Test on Mixed Logit Models

Lecture 6: Discrete Choice: Qualitative Response

TRB 2005: For Presentation and Publication Paper # Final Submission: April 1, 2005 Word Count: 8,191

Bayesian Inference for DSGE Models. Lawrence J. Christiano

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Winter 2017 TOPIC 3: MULTINOMIAL CHOICE MODELS

I. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as

Random Utility Models, Attention Sets and Status Quo Bias

Simple Estimators for Semiparametric Multinomial Choice Models

EconS Vertical Integration

Econ 673: Microeconometrics

14.13 Lecture 8. Xavier Gabaix. March 2, 2004

Masking Identification of Discrete Choice Models under Simulation Methods *

Chapter 11. Regression with a Binary Dependent Variable

1 Correlation between an independent variable and the error

Probabilistic Choice Models

Online Appendix to: Marijuana on Main Street? Estimating Demand in Markets with Limited Access

The Kuhn-Tucker Problem

Normal Probability Plot Probability Probability

ECON4515 Finance theory 1 Diderik Lund, 5 May Perold: The CAPM

xtunbalmd: Dynamic Binary Random E ects Models Estimation with Unbalanced Panels

P1: GEM/IKJ P2: GEM/IKJ QC: GEM/ABE T1: GEM CB495-05Drv CB495/Train KEY BOARDED August 20, :28 Char Count= 0

Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models

Efficient estimation of Probit models with correlated errors

4.8 Instrumental Variables

Constrained Assortment Optimization for the Nested Logit Model

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

An Overview of Choice Models

Limited Dependent Variables and Panel Data

GMM Based Tests for Locally Misspeci ed Models

Estimating Single-Agent Dynamic Models

Women. Sheng-Kai Chang. Abstract. In this paper a computationally practical simulation estimator is proposed for the twotiered

Nonlinear Programming (NLP)

ECONOMETRIC MODELS. The concept of Data Generating Process (DGP) and its relationships with the analysis of specication.

PANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1

Chapter 1. GMM: Basic Concepts

Barnali Gupta Miami University, Ohio, U.S.A. Abstract

1 Differentiated Products: Motivation

Tests for Cointegration, Cobreaking and Cotrending in a System of Trending Variables

ECON0702: Mathematical Methods in Economics

Maximum Likelihood (ML) Estimation

Speci cation of Conditional Expectation Functions

Bayesian IV: the normal case with multiple endogenous variables

GMM estimation of spatial panels

Bayesian Inference for DSGE Models. Lawrence J. Christiano

EconS Microeconomic Theory II Homework #9 - Answer key

Gibbs Sampling in Endogenous Variables Models

2014 Preliminary Examination

A New Computational Algorithm for Random Coe cients Model with Aggregate-level Data

Entry for Daniel McFadden in the New Palgrave Dictionary of Economics

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit

MKTG 555: Marketing Models

Introduction to Maximum Likelihood Estimation

Econ 673: Microeconometrics

Math Spring 2017 Mathematical Models in Economics

Simulation Estimation of Mixed Discrete Choice Models Using Randomized and Scrambled Halton Sequences

1 Regression with Time Series Variables

Eco517 Fall 2004 C. Sims MIDTERM EXAM

Assortment Optimization under Variants of the Nested Logit Model

Questions in Decision Theory

Estimation of Static Discrete Choice Models Using Market Level Data

Lecture Notes based on Koop (2003) Bayesian Econometrics

Choice Theory. Matthieu de Lapparent

Spatial Discrete Choice Models

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2014 Instructor: Victor Aguirregabiria

8 Nominal and Ordinal Logistic Regression

Tutorial 1: Linear Algebra

Bisection Ideas in End-Point Conditioned Markov Process Simulation

Chapter 6: Endogeneity and Instrumental Variables (IV) estimator

Transcription:

hapter 2 Multinomial Discrete hoice Models 2.1 Introduction We present some discrete choice models that are applied to estimate parameters of demand for products that are purchased in discrete quantities. In these models the different products from the market are considered to be different choice alternatives. The models we are concerned with assume that in a given period one product or none of the products is purchased by a consumer, which we regard as the decision-maker unit. The set of all choice alternatives that a consumer faces is assumed to be nite and to contain all products available in the market. Denote an alternative by m> and the choice set of all alternatives by i4> 5> ===> Mj. In a discrete choice model a consumer l who chooses alternative m has a random utility x that is not directly observed by the researcher. In many situations, however, the probability that m is chosen is observed with a small sampling error in the form of the market share. Hence it is typically assumed that l maximizes his utility choosing m such that x is the highest of all x o >u@4> ===> M= The probability that l chooses m can be computed if we know the underlying distributions in the utility. This chapter describes brieày the main features of some well-known multinomial discrete choice models: the standard logit, the mixed logit and the probit. An early source of the logit and probit was the application of probability models to biological experiments of binary choice in the rst part of the last century. The rst applications in economics used the binomial probit model (e.g., Farrell (1954)), which appeared to be the winner in the dispute among biologists on the ability of the logit and probit to model binary choice. Theil

6 HAPTER 2. MULTINOMIAL DISRETE HOIE MODELS (1969) generalized the binomial logit to the multinomial logit opening up several further developments and applications. At the beginning of the 70 s Mc- Fadden and his collaborators, who studied some transportation research problems, generalized the logit model in several directions and made it scientifically respectable by providing a theoretical framework in the utility theory of discrete choice (e.g., McFadden (1973), McFadden and Reid (1975), Mc- Fadden (1977)). For more detailed historical background we refer to ramer (1991, p. 39-42) and Anderson, de Palma and Thisse (1992, chapter 2). 2.2 The standard logit In this section we present brieày the standard logit model. We discuss its major theoretical advantage and practical disadvantage. Then we derive some basic formulae related to the asymptotic properties of the maximum likelihood estimator of the model. 2.2.1 Model speci cation In the standard logit model the utility is a linear function of the attributes of the alternative: x @ {. % > (2.1) where { is a N 4 vector containing the characteristics of consumer l and alternative m> is a N 4 vector of parameters and the variables % >m @ 4>===>M> are assumed to be random with independent standard extreme value distribution of type I whose cumulative distribution function is and density function is I +%, @ h{s + h{s +%,, (2.2) i +%, @ h{s +%, h{s + h{s +%,, = (2.3) Under the utility maximization principle, the probability that l chooses alternative m is v @Su+x x o > for all u @4>===>M, = Using the utility expression +5=4, > the probability is further equal to v @Su % o %. { { o> for all u @4>===>M =

2.2. THE STANDARD LOGIT 7 This kind of probabilities are typically computed as a M-dimensional integral using the joint distribution of the random terms: v @ 0o$0n% q3% i +% o q >===>% a, d% ===d% a = for all o' cca Since the % s are independent, we have v @ 0 o $0 n% q3% i +% o q, i +% a, d% ===d% a > for all o' cca which is further equal to i +%, U 0o$0n% q3% o q o' \ i +% o, d% ===d% a = o' The larger, +M 4,-dimensional part of this integral is equal to the product of +M 4, one-dimensional integrals of the densities so we can use the cumulative distribution functions: v @ i +%, \ I %. { { o d% = U o' Using the expressions of these functions +5=5, and +5=6, > we obtain # a[ v @ h{s +%, h{s h{s %. { { o $ d% = U o' This integral has a closed form that can be computed in the following way: v @ 4 S a h{s o' { { o # a[ h{s h{s %. { { o $ " > o' 0'3" which then leads to v @ h{s+{ S, a h{s (2.4) o' +{ o,=

8 HAPTER 2. MULTINOMIAL DISRETE HOIE MODELS Due to this attractive closed form of the probability in +5=7, > the standard logit model is popular in disciplines where choice models are employed. In the following chapters we will apply the standard logit model in contexts when the characteristics vary solely with the products and not with the consumers. That is, { @ { for all m @4> ===> M= Due to this, the probability that consumer l chooses product m is equal to v @ h{s+{, S a o' h{s +{ o, (2.5) for all consumers. Hence this is the probability that product m is purchased in the market. This quantity in a market with a large number of consumers is equal to the market share of product m= Though the expression of the probability that an alternative is chosen is simple and therefore computationally accessible to practitioners, the standard logit also has a serious disadvantage. This in fact comes from the simplicity of the utility model, namely that the utilities of each consumer depend on a scalar function of the characteristics, { > and not on the whole vector of characteristics { = This way the ratio of probabilities of two alternatives m and t does not depend on the existence and attributes of any other alternatives since v v^ @ h{s+{, h{s = (2.6) { ^ This property is known as independence from irrelevant alternatives. It is an unattractive property since if we add a new alternative to the set of all alternatives that happens to be a close substitute for m but not for t> we would expect for v to go down much more than v^= This way the ratio of the two probabilities should also go down. We illustrate this problem with a simple example. Assume that in a market there are two products having attributes { @+4> 4> 4, and { 2 @+=8> 4=8> 4, and that these are the only two alternatives of choice. Let @+4> 4> 4, = Then v @ v 2 @ = Now introduce a third product having attributes { 2 @+4> 4> 4, = In this new situation v @ v 2 @ v @ > that is, standard logit predicts that consumers will substitute for products 1 and 2 to the same extent. This is wrong since roughly half of the population prefers product 1 which is a close substitute to product 2, so normally a part of the consumers who preferred 1 shouldswitchto3andthosewhopreferred2shouldstaywith2.

2.2. THE STANDARD LOGIT 9 2.2.2 Log-likelihood and rst-order properties Since later on we use results related to the estimation of the standard logit model, here we provide some of those results. The estimation of the standard logit parameters is based on the idea that we are in a multinomial choice situation when the probability that an alternative m is chosen is known, v = Then if we observe a large number of choices, the frequency of alternative m> that is, the number of times when m was chosen divided by the total number of choices should be approximately equal to v = Denote the frequency of alternative m by i and let q be the total number of choices, that is, the number of consumers. Then qi q+i > ===> i a, is assumed to have a multinomial distribution with parameters v > ===> v a = Therefore the probability that alternatives 4> ===> M occur with frequencies i > ===> i a > respectively, is in fact the likelihood function corresponding to the standard logit model:? O @ F v s vs a a > where F is a coef cient corresponding to the multinomial distribution. Since this does not depend on the parameters of interest, we ignore it when writing the log-likelihood, which then is equal to oq O @ q a[ i oq v = (2.7) ' Estimation of the parameters can be done by maximizing the log-likelihood function. Let V denote the diagonal matrix with the diagonal v @+v > ===> v a, = We can write oq O @ q i oq v= Hence the rst-order derivative of the loglikelihood can be written as since oq v @ V oq O @ @ q q i oq v v V 3 i 3 v v = In order to derive we note that v @ { v v [v > where [ @^{ === { a` > which leads to v @ V vv [= (2.8)

10 HAPTER 2. MULTINOMIAL DISRETE HOIE MODELS This can be used to obtain the rst order derivatives of the log-likelihood: oq O @ q [ V vv V 3 i= Now we can compute the asymptotic information matrix: oq O L +m[, @ E oq O @ q 2 [ V vv V 3 E ii V 3 V vv [= The multinomial distribution has the property that E^ii `@? V. 4? vv = Substituting this into the above expression, after some calculus, we obtain L +m[, @q [ V vv [= We use this formula in chapter 6 in order to construct designs with improved ef ciency for choice experiments. We refer to some other formulae derived here, like +5=;, > in chapter 5 when discussing ef cient estimation of the standard logit model with price competition. 2.3 The mixed logit The mixed logit is a model that does not satisfy the independence from irrelevant alternatives property. It does so due to the fact that in the utility speci cation the coef cients are considered to be random. This way each consumer will have a different coef cient in the utility. This speci cation has the appealing feature that it parameterizes consumers attitude towards the characteristics of the alternatives through the variance of the random parameters. Similarly to the standard logit, the rst economic applications of the mixed logit appeared to beinthe eld of transportation research (Boyd and Mellman (1980), ardell and Dunbar (1980)). For more detailed overviews of mixed logit models we refer to McFadden and Train (1998) and Brownstone and Train (1999). In this section we present the model and the main asymptotic properties of its maximum likelihood estimator. 2.3.1 Model speci cation In a similar choice framework as in the case of the standard logit, now the utility of consumer l choosing alternative m is de ned as x @ {. % > (2.9)

2.3. THE MIXED LOGIT 11 where the only difference to +5=4, is that here is a parameter vector speci c to consumer l. We assume that Q+>,> l@4> ===> q> iid, where is a N 4 vector of parameters and is a symmetric and positive-de nite matrix of parameters. The error term % is assumed to be independent of = For computational convenience, we write the random parameter vector in the following form: @.y > (2.10) where isanymatrixsuchthat @ and y is a N 4 random vector with a standard normal distribution. Notice that we omitted the subscript l in +5=43, on the right hand side since the random parameters have the same distribution. If we knew y we could use formula +5=8, since we would be in a standard logit situation. In this case the probability that alternative m is chosen would be k l h{s { +.y, Su+mmy,@ S a h{s o' ^{ o +.y,`= (2.11) The unconditional probability that alternative m is chosen is given by k l Ug h{s { +.y, v @ E ^Su+mmy,` @ S a h{s o' ^{ o +.y,`! +y, dy > (2.12) where! +y, is the density function of y = Note that, since +5=9, is not satis ed in this case, the mixed logit does not satisfy the independence from irrelevant alternatives property. The probability expression from +5=45, is an integral that generally cannot be computed analytically. Hence it is typically estimated by Monte arlo or quasi-monte arlo simulations. These estimation techniques are discussed in detail in the next chapter, therefore here we just mention the essence of them. We draw several values of the random vector y > compute the function in the integral at each value, and compute the average of these quantities. The average obtained is the (quasi-) Monte arlo estimator of the integral. In order to illustrate how in the mixed logit model consumers attitude toward different characteristics is modeled, consider a simpler version of the covariance matrix, namely when it is a diagonal matrix: @ 3 E 2 3... 3 2 g 4 F D =

12 HAPTER 2. MULTINOMIAL DISRETE HOIE MODELS In this case we can take 3 E @ 3... 4 F D > 3 g so we can write the utility from +5=<, > using +5=43, as g[ x @ {. & y & { &. % = &' Writing the utility in this way allows us to interpret intuitively the effect of the random part of the parameter. Each consumer l is associated with a vector y of which elements represent the consumer s valuation of the characteristics. If l has a taste that values more alternatives with large values of a characteristic, then he will have a large positive value in y corresponding to that characteristic. For example, in the case of cars, if consumer l prefers large cars and size is characteristic k in the model, then this consumer will have a large positive y Å = We are now able to explain why the mixed logit generates reasonable substitution patters. Going on with the example on cars, if a small car enters the market, the model predicts that the above consumer will not be very willing to switch to the new small car. This is because, for this consumer Å y Å is disproportionately larger than the corresponding quantity for the other characteristics and therefore the term Å y Å { Å is more sensitive to variations in { Å than in the case of the other characteristics. This will keep the utility of l high for large { Å s and small for low values of them. The above reasoning implies that the higher the variance parameters of consumers attitudes, & > the less likely it is that a consumer preferring a given alternative will substitute for a non-similar alternative. We illustrate this further with the example from the end of section 2.2.1. In addition to the standard logitcaseherewetake+ > 2 >,@ +4> 4> 4, = When only products 1 and 2 are in the market, their probabilities are v @ v 2 @ > irrespective of the 2 value of = When product 3 enters the market then v 2 becomes 0.35 for @4> 0.42 for @8> 0.47 for @43and0.49for @63= So for large variance there is no substitution of product 2 for product 3, while for small variance the substitution is very similar to that generated by the standard logit.

2.3. THE MIXED LOGIT 13 2.3.2 Log-likelihood and rst-order properties In this subsection we derive some useful formulae for the mixed logit model. Denote the conditional probability from +5=44, by +y, = Hence v @ +y,! +y, dy= U g The choice model in this case can be formulated in a similar way as for standard logit. Hence the log-likelihood function of this model is oq O @ q a[ i oq v > ' where i again denotes the observed frequency of product m= For saving notation, we write integrals like U U g +,!+y,dy as U U +,d (e.g., U g +y,!+y,dy U d). For maximum likelihood estimation we are interested in the rst order derivatives of the log-likelihood. Since we need these resultsforthecasewherethecovariancematrixoftherandomcoef cients is diagonal, we derive the formulae only for this case. For computational convenience we change the notation of the random part of the coef cients from y to Y>where Y is the diagonal matrix with diagonal y and @+ > ===> g, = Similarly to the standard logit case, O v @ q V 3 i and O v @ q V 3 i= (2.13) Hence we need the following formulae: v @ d and v @ d> where @+ > ===> a, = In a similar way as for the standard logit we obtain @ [ and @ [Y> where is the diagonal matrix with diagonal = These yield v @ [d and v @ [Y d= (2.14)

14 HAPTER 2. MULTINOMIAL DISRETE HOIE MODELS We substitute these into +5=46, and obtain O @ q [ d V 3 i and (2.15) O @ q Y[ d V 3 i= The asymptotic information matrix is de ned as: 5 9 L +>m[, @ 7 E O O O E O E O O E O O For simplifying this expression we use the fact that i isassumedtohavethe multinomial distribution with Ei @ v= Then i has the property that Eii @? V. 4? vv = Using this and the formulae from +5=48, we obtain 6 : 8 = P L +>m[, @q V 3 P T V 3 P P V 3 T TV 3 T > (2.16) where P @ [d and T @ [Y d= Formula +5=49, is used in chapter 7 for designing choice experiments for the mixed logit model. We refer to formulae +5=47, in chapter 5 when discussing ef cient estimation of the mixed logit model of price competition. 2.4 The probit The probit is a model similar to the mixed logit in which all random elements, coef cients and errors, are assumed to have a normal distribution. One of the rst presentations of the multinomial probit model in an econometric context was by McFadden (1976). In this section we present brieày the model and provide a way of estimating the choice probabilities implied, commonly known as the GHK simulator. Unlike for the standard and mixed logit, for the probit we do not present the likelihood function and the rst-order properties. For a concise exposition of these we refer to Wansbeek and Wedel (2001).

2.4. THE PROBIT 15 2.4.1 Model speci cation Assume that the utilities corresponding to M products for consumer l are x @ {. % > where, as for the mixed logit, Q+>,> l @4>===>q> iid, and is a N 4 vector of parameters and is a symmetric and positive-de nite matrix of parameters. The error term % here is assumed to be normally distributed without requiring it to be independent of = Due to the attractive property of the normal distribution that the sum of two normally distributed random variables is normal, by choosing appropriate variables and > the vector of the above utilities can be represented as x @.h Q +>, (2.17) where = M 4> =M M such that @> and h Q +3>L a, = If the random terms h are independent, then we can omit the subscript l> and with each consumer associate a realization of the random vector h. Notice that if in the mixed logit utility +5=<, we replace the extreme value distribution assumption of the residuals by the normal distribution then we obtain a particular case of +5=4:, = The reason that the latter speci cation is more general is that here no independence is assumed between the residual and the random coef cients. Out of the M products the one with the highest utility is chosen. The probability that a given product m is chosen is equal to: v @Su+x x o > ;u, @Su+Y x 3, @ Su +Y. Y h 3, with 5 6 4 3 === 4 === 3 3 4 === 4 === 3 Y @ 9 : 7.... 8 3 3 === 4 === 4 where the column with -1 sisthem th column. Denote W Y and choose such that W is lower triangular. This is possible since WW @ Y Y > which is positive de nite. Then v @Su+Wh y, > where y Y = (2.18)

16 HAPTER 2. MULTINOMIAL DISRETE HOIE MODELS If we denote 5 W @ 9 7 w 3 === 3 w 2 w 22 === 3... w a w a2 === w aa 6 : 8 > then v @ Su h y >h 2 y 2 w 2 h >=== > w w 22 h a y a w a h === w aa3 h a3 w aa = Let G @ +h > ===> h a, 5 U a = h y > ===> h a y a w a h === w aa3 h a3 = w w aa Then the formula of the probability that m is chosen becomes v @! +h, ===! +h a, dh...dh a (2.19) where! is the standard normal density function. ( 2.4.2 Estimation of the probabilities To obtain a computationally more tractable version of the probability, we notice that the h s have truncated normal distribution. If h e and h Q +3> 4, then we can draw the h s fasterifwedrawx uniform on ^3> 4` and let h @ 3 +x +e,, > where is the standard normal distribution function. It turns out that based on this idea we can transform the integral +5=4<, to an integral whose arguments belong to the unit cube, like the integral +6=4, from the chapter 3. In order to do the transformation we can use the idea from the above para-

2.4. THE PROBIT 17 graphinthem-dimensional case. This implies the transformation h @ x 3 y # w +x > ===> x a, h 2 @ x 3 y2 w 2 h +x, 2 # 2 +x >===>x a, (2.20) w 22. h a @ x 3 ya w a h === w aa3 h a3 a # +x > ===> x a, w aa where x > ===> x a 5 ^3> 4` andinthelastformulah is viewed as a function of x > ===> x for m @4> ===> M 4. Then +5=4<, becomes dfc o a! +# +x > ===> x a,, ===! +# a +x > ===> x a,, mmm dx...dx a where mmm is the Jacobian, that is, mmm @ YÆ YÄ. YÆ YÄ a YÆ a YÄ.... YÆ a YÄ a @ EÆ E YÆ a YÄ..... 3 a 3 a e 33 aa3 e a3 EÆ a E aa > which is the determinant of an upper-triangular matrix, so it is equal to the product of the diagonal terms, i.e.,: a 3 a e 33 aa3 e a3 aa mmm @ =! +# +===,,! +# a +===,, Using this +5=4<, becomes y ya w a h === w aa3 h a3 dx...dx a = dfc o a w w aa

18 HAPTER 2. MULTINOMIAL DISRETE HOIE MODELS Then we obtain the probability formula y y2 w 2 h v @ w dfc o a3 w aa w 22 ya w a h === w aa3 h a3 dx ===dx a3 = (2.21) The integral from this formula can be interpreted as the expectation of the integrand function at a uniformly distributed random M 4-vector. As we show in the next chapter, this integral can be estimated with Monte arlo simulation by drawing a sample of the random vector and computing its mean. The estimator obtained this way is called the GHK simulator (Börsch-Supan and Hajivassiliou (1993)) or RIS (i.e., recursive importance sampling) simulator based on truncated normal density (Vijverberg (1997)). Vijverberg also discusses other RIS simulators. A number of other types of simulators are presented in Hajivassiliou, McFadden and Ruud (1996).