Quasi-Maximum Likelihood: Applications

Size: px
Start display at page:

Download "Quasi-Maximum Likelihood: Applications"

Transcription

1 Chapter 10 Quasi-Maximum Likelihood: Applications 10.1 Binary Choice Models In many economic applications the dependent variables of interest may assume only finitely many integer values, each labeling a category of data. he simplest case of discrete dependent variables is the binary variable that takes on the values one and zero. For example, the ownership of a durable goods and the choice of participating a particular event may be represented by a binary dependent variable. Such variables are different from standard continuous variables such as GDP and income. It is then natural to take into account the data characteristics in the specification of quasilikelihood function. Conditional on the explanatory variables x t, the binary variable y t is such that { 1, with probability IP(y y t = t =1 x t ), 0, with probability 1 IP(y t =1 x t ). he density function of y t given x t is of the Bernoulli type: g(y t x t )=IP(y t =1 x t ) yt [1 IP(y t =1 x t )] 1 yt. A standard modeling approach is to find a function F such that F (x t ; θ) approximates the conditional probability IP(y t =1 x t ). he quasi-likelihood function is then: f(y t x t ; θ) =F (x t ; θ) yt [1 F (x t ; θ)] 1 yt. 267

2 268 CHAPER 10. QUASI-MAXIMUM LIKELIHOOD: APPLICAIONS he QMLE θ is then obtained by maximizing L (θ) = 1 [ yt log F (x t ; θ)+(1 y t )log(1 F (x t ; θ)) ]. As 0 IP(y t =1 x t ) 1, it is natural to choose F as a function bounded between zero and one, such as a distribution functions. When F (x t ; θ) =Φ(x tθ) withφthe standard normal distribution function: u 1 Φ(u) = e v2 /2 dv, 2π we have the probit model. For Φ(u) =p, itsinverseφ 1 (p) isknownastheprobit transformation. WhenF (x t ; θ) =G(x tθ)withg the logistic distribution function: 1 eu G(u) = = 1+e u 1+e u. it is the logit model. he logistic distribution has mean zero and variance π 2 /3, and it is more peaked around its mean and has slightly thicker tails than the standard normal distribution. For G(u) =p, itsinverse ( p ) G 1 (p) =log, 1 p is known as the logit transformation. ItiseasytoverifythatG (u) =G(u)[1 G(u)] which is convenient for estimating a logit model. and For the logit model, θ L (θ) = 1 = 1 2 θ L (θ) = 1 [ G (x y tθ) t G(x t θ) (1 y t) [y t G(x tθ)]x t, G(x t θ)[1 G(x t θ)]x t x t. G (x tθ) 1 G(x t θ) Given that G(x tθ)[1 G(x tθ)] > 0 for all θ, this matrix is is negative definite, so that the quasi-log-likelihood function is globally concave on the parameter space. When x t contains a constant term, the first order condition implies ] x t 1 y t = 1 G(x t θ );

3 10.1. BINARY CHOICE MODELS 269 that is, the average of fitted values is the relative frequency of y t =1. For the probit model, θ L (θ) = 1 = 1 [ φ(x y tθ) t Φ(x t θ) (1 y t) φ(x tθ) 1 Φ(x t θ) y t Φ(x t θ) Φ(x t θ)[1 Φ(x t θ)]φ(x tθ)x t, ] x t where φ is the standard normal density function. It can also be verified that 2 θ L (θ) = 1 [ y t φ(x tθ)+x tθφ(x tθ) Φ 2 (x t θ) +(1 y t ) φ(x tθ) x tθ[1 Φ(x tθ)] [1 Φ(x t θ)]2 which is also negative definite; see e.g., Amemiya (1985, pp ). ] φ(x tθ)x t x t, Clearly, the conditional mean of y t is just the conditional probability P (y t =1 x t ), and its conditional variance is var(y t x t )=IP(y t =1 x t )[1 IP(y t =1 x t )], which changes with x t.hus,f(x t ; θ) is also an approximation to the conditional mean function; F (x t ; θ)[1 F (x t ; θ)] is an approximation to the conditional variance function. Writing y t = F (x t ; θ)+e t, we can see that the probit and logit models are in effect different nonlinear mean specifications with conditional heteroskedasticity. Although θ may be estimated using the NLS method, the resulting estimator cannot be efficient because it ignores conditional heteroskedasticity. Even a weighted NLS estimator that takes into account the conditional variance is still inefficient because it does not consider the Bernoulli feature of y t. Note that the linear probability model in which F (x t ; θ) =x tθ (Section 4.4) is not appropriate because the fitted values may be outside the range of [0, 1]. It should be noted that the marginal response of the choice probability to the change of a particular variable x tj in a probit model is Φ(x t θ) x tj = ϕ(x tθ)θ j,

4 270 CHAPER 10. QUASI-MAXIMUM LIKELIHOOD: APPLICAIONS and the marginal response in a logit model is G(x tθ) = G(x t x θ)[1 G(x t θ)]θ j, tj which change with x t. By contrast, the marginal response to the change of a regressor in a linear regression model is the associated coefficient which is invariant with respect to t. hus, it is typical to evaluate the marginal response based on a particular value of x t,suchasx t = 0 or x t = x, the sample average of x t. Note that when x t = 0, ϕ(0) 0.4 andg(0)[1 G(0)] = his suggests that the QMLE for the logit model is approximately 1.6 times the QMLE for the probit model when x t are close to zero. An alternative view of the binary choice model is to assume that the observed variable y t is determined by the latent (index) variable yt : { 1, y y t = t > 0, 0, yt 0, where y t = x t β + e t.hus, IP(y t =1 x t )=IP(y t > 0 x t )=IP(e t > x t β x t ). he probability on the right-hand side is also IP(e t < x t β x t ) provided that e t is symmetric about zero. he probit specification Φ or the logit specification G can then be viewed as specifications of the conditional distribution of e t. his suggests that a possible modification of these two models is to consider an asymmetric distribution function for e t Models for Multiple Choices A more general discrete choice model would be needed when one faces multiple choices of job, event or transportation alternatives. In this case, the dependent variable y t takes on J + 1 integer values (0, 1,...,J) which correspond to different categories that are not overlapping and do not have a natural ordering. We have 0, with probability IP(y t =0 x t ), 1, with probability IP(y t =1 x t ), y t =. J, with probability IP(y t = J x t ). he number of choices J may depend on t because, for example, an individual who does not own a car can not have the choice of driving to work. We shall not consider this complication here, however.

5 10.2. MODELS FOR MULIPLE CHOICES Multinomial Logit Models Define the new binary variable d t,j for j =0, 1,...,J as d t,j = { 1, if y t = j, 0, otherwise. Note that J j=0 d t,j = 1. he density function of d t,0,...,d t,j given x t is then g(d t,0,...,d t,j x t )= J IP(y t = j x t ) d t,j. j=0 he conditional probabilities can be approximated by some functions F (x t ; θ j ), where the regressors are t-specific (individual specific) but not choice specific, yet their responses to the choices j may be different. Note that only J conditional probabilities need to be specified; otherwise, there would be an identification problem because the sum of J + 1 probabilities must be one. Common specifications of the conditional probabilities are F (x t ; θ 1,...,θ J )=G t,0 = 1 1+ J k=1 exp(x t θ k ), exp(x tθ j ) F (x t ; θ 1,...,θ J )=G t,j = 1+ J k=1 exp(x t θ k ), which lead to the so-called multinomial logit model. he quasi-log-likelihood function of this model reads ( ) L (θ 1,...,θ J )= 1 J d t,j x t θ j 1 J log 1+ exp(x t θ k ). j=1 he gradient vector with respect to θ j is θj L (θ 1,...,θ J )= 1 k=1 (d t,j G t,j )x t, j =1,...,J. Setting these equations to zero we can solve for the QMLE of θ j. he first order condition again implies that, when x t contains a constant term, the predicted relative frequency of the j th category equals its actual relative frequency. It can also be verified that the (i, j) th off-diagonal block of the Hessian matrix is θj θ i L (θ 1,...,θ J )= 1 (G t,j G t,i )x t x t, i j, i, j =1,...,J,

6 272 CHAPER 10. QUASI-MAXIMUM LIKELIHOOD: APPLICAIONS and the j th diagonal block is θj θ j L (θ 1,...,θ J )= 1 G t,j (1 G t,j )x t x t, j =1,...,J. We may then use these information to compute the asymptotic covariance matrix for the QMLEs. As in the binomial logit model, one should be careful about the the marginal response of the choice probability, G t,j, to the change of the variables x t.itiseasytoverifythat J xt G t,0 = G t,0 G t,i θ i, i=1 xt G t,j = G t,j (θ j ) J G t,i θ i, j =1,...,J. i=1 hus, when x t changes, all coefficient vectors θ i, i = 1,...,J, enter the marginal response of G t,j Conditional Logit Model A model closely related to the multinomial logit model is McFadden s conditional logit model, in which the regressors are choice-specific. For example, when an individual makes choice among several commuting modes, the regressors may include the in-vehicle time and waiting time that vary with the vehicle he/she chooses. As such, we shall consider x t =(x t,0 x t,1... x t,j ),wherex t,j is the vector of regressors characterizing the t th individual s attributes with respect to the j th choice. As in Section , we define the binary variable d t,j for j =0, 1,...,J as { 1, if yt = j, d t,j = 0, otherwise, and the density function of d t,0,...,d t,j given x t is J g(d t,0,...,d t,j x t )= IP(y t = j x t ) d t,j. j=0 In the conditional logit model, the conditional probabilities are approximated by G t,j = exp(x t,j θ) J k=0 exp(x t,k θ), j =0, 1,...,J.

7 10.2. MODELS FOR MULIPLE CHOICES 273 Note that θ is now common for all j, so that we may have J + 1 specifications for the conditional probabilities without causing an identification problem. his model would be convenient if one wants to predict the probability of a new choice. Similar to the preceding subsection, the quasi-log-likelihood function is ( L (θ) = 1 J d t,j x t,jθ 1 J ) log exp(x t,k θ). j=0 he first order condition is k=0 θ L (θ) = 1 J (d t,j G t,j )x t,j = 0, j=0 from which the QMLE for θ can be easily calculated. he Hessian matrix of the quasilog-likelihood function is 2 θ L (θ) = 1 J J J G t,j x t,jx t,j G t,j x t,j j=0 j=0 j=0 G t,j x t,j = 1 J G t,j (x t,j x t )(x t,j x t ), j=0 where x t = J j=0 G t,j x t,j is the weighted average of x t,j. It is then clear that the Hessian matrix is negative definite so that the quasi-log-likelihood function is globally concave. he marginal response of G t,j to x t,i, is xt,i G t,j = G t,j G t,i θ, i j, i =0,...,J, and the marginal response of G t,j to x t,j is xt,j G t,j = G t,j (1 G t,j )θ, j =0,...,J. his shows that each choice probability is affected not only by x t,j but also the regressors for other choices, x t,i. he conditional logit model may be understood under a random utility framework (McFadden, 1974). Given the random utility of the choice j: U t,j = x t,jθ + ε t,j, j =0, 1,...,J, the alternative i would be chosen if IP(y t = i x t )=IP(U t,i >U t,j, for all j i x t ).

8 274 CHAPER 10. QUASI-MAXIMUM LIKELIHOOD: APPLICAIONS Suppose that ε t,j are independent random variables across j such that they have the type I extreme value distribution: exp[ exp( ε t,j )]. o see how this setup leads to a conditional logit model, consider the simple case that there are 3 choices (J = 2). Letting μ t,j = x t,jθ we have IP(y t =2 x t )=IP(ε t,2 + μ t,2 μ t,1 >ε t,1 and ε t,2 + μ t,2 μ t,0 >ε t,0 ) ( εt,2 +μ t,2 μ t,1 ) = f(ε t,2 ) f(ε t,1 )dε t,1 ( εt,2 +μ t,2 μ t,0 f(ε t,0 )dε t,0 ) dε t,2, where f(ε t,j ) is the type I extreme value density function: exp( ε t,j )exp[ exp( ε t,j )]. hen, IP(y t =2 x t )= = exp( ε t,2 )exp[ exp( ε t,2 )] exp[ exp( ε t,2 μ t,2 + μ t,1 )] exp[ exp( ε t,2 μ t,2 + μ t,0 )] dε t,2 exp(μ t,2 ) exp(μ t,0 )+exp(μ t,1 )+exp(μ t,2 ). his is precisely the conditional logit specification of IP(y t =2 x t ). he specifications for other conditional probabilities can be obtained similarly. hus, the conditional logit model hinges on the condition that the choices must be quite different such that they are independent of each other. his is also known as the condition of independence of irrelevant alternatives (IIA). he IIA condition may not hold in applications and therefore must be tested empirically Ordered Data In some applications, the multiple categories of interest may have a natural ordering, such as the levels of education, opinion, and price. In particular, y t is said to be ordered if 0, yt c 1 1, c 1 <yt c 2 y t =. J, c J <yt ; otherwise, it is unordered. hevariabley t in Section is unordered.

9 10.3. MODELS OF COUN DAA 275 Setting yt = x t θ + e t, the conditional probabilities are: IP(y t = j x t )=IP(c j <y t <c j+1 x t ) = F (c j+1 x tθ) F (c j x tθ), j =0, 1,...,J, where c 0 =, c J+1 =, andf is the distribution of e t. When F is the logistic distribution function G, thisistheordered logit model; whenf is the standard normal distribution function Φ, it is the ordered probit model. he quasi-log-likelihood function of the ordered model is L (θ) = 1 J log[f (c j+1 x t θ) F (c j x t θ)], j=0 from which we can easily compute its gradient and Hessian matrix. Pratt (1981) showed that the Hessian matrix is negative definite so that the quasi-log-likelihood function is globally concave. he marginal responses of the choice probabilities to the change of the variables can be easily calculated. For the ordered probit model, these responses are φ(c 1 x tθ)θ, j =0 [ xt IP(y t = j x t )= φ(cj x tθ) φ(c j+1 x tθ) ] θ, j =1,...,J 1, φ(c J x tθ)θ, j = J. See Exercise 10.5 for the marginal responses of the ordered logit model Models of Count Data In practice, there are integer-valued dependent variables that are not categorical, such as the number of accidents of a plant or the number of patents filed by a company. Such data, also known as count data, differ from categorical data in that they represent the intensity of the occurrence of an event. It is typical to postulate a Poisson distribution for the conditional probability of the count data y t =0, 1, 2,...: IP(y t x t )=exp( λ t ) λyt t y t!, where the parameter λ t, which is also the conditional mean, has a log-linear form: log λ t = x tθ.

10 276 CHAPER 10. QUASI-MAXIMUM LIKELIHOOD: APPLICAIONS he quasi-log-likelihood function is then L (θ) = 1 = 1 [ λt + y t log(λ t ) log(y t!) ] [ exp(x t θ)+y t (x t θ) log(y t!)]. he gradient vector is θ L (θ) = 1 [y t exp(x tθ)]x t, and the Hessian matrix is 2 θ L (θ) = 1 exp(x tθ)x t x t. Since exp(x tθ) are non-negative, the Hessian matrix is also negative definite on the parameter space. A drawback of the Poisson specification is that it in effect restricts the conditional variance to be the same as the conditional mean λ t.writing y t =exp(x tθ)+e t, we simply have a nonlinear specification of the conditional mean function without restricting the conditional variance. Such a specification does not consider the feature that y t are discrete-valued count data, however Models of Limited Dependent Variables here are also numerous economic applications in which a dependent variable can only be observed within a limited range. Such a variable is known as a limited dependent variable. For example, in the study of the household expenditure on durable goods, obin (1958) noticed that positive expenditures can be observed only when households spend more than the cheapest available price; potential expenditures below the minimum price are not observable but appear as zeros. In this case, the expenditure data are said to be censored. On the other hand, data may be truncated if they are completely lost outside a given range; for example, income data may be truncated when they are below certain level. A proper specification for a limited dependent variable must take data censoring or truncation into account, as will be apparent in subsequent sub-sections.

11 10.4. MODELS OF LIMIED DEPENDEN VARIABLES runcated Regression Models First consider the dependent variable y t which is truncated from below at c, i.e., y t >c. he conditional density of truncated y t is g(y t y t >c,x t )=g(y t x t )/ IP(y t >c x t ), and we may approximate g(y t x t )using f(y t x t ; β,σ 2 )= Setting u t = y t /σ, wehave IP(y t >c x t )= 1 σ ( 1 exp (y t x tβ) 2 ) 2πσ 2 2σ 2 c ( yt x t φ β ) dy σ t = 1 ( σ φ yt x ) tβ. σ = φ(u t )du t (c x t β)/σ ( c x =1 Φ t β ). σ It follows that the truncated density function g(y t y t >c,x t ) can be approximated by f(y t y t >c,x t ; β,σ 2 )= φ[(y t x tβ)/σ] σ [ 1 Φ ( (c x t β)/σ)], which, of course, depends on the truncation parameter c. he quasi-log-likelihood function is therefore 1 2 [log(2π)+log(σ2 )] 1 2σ 2 (y t x tβ) 2 1 [ ( c x log 1 Φ t β ) ]. σ Reparameterizing by α = β/σ and γ = σ 1,wehave L (θ) = log(2π) 2 +log(γ) 1 2 (γy t x t α)2 1 log [ 1 Φ(γc x t α)], where θ =(α γ). he first-order condition is 1 [ ] (γy t x t θ L (θ) = α) φ(γc x t α) 1 Φ(γc x t α) x t 1 γ 1 [ ] (γy t x t α)y t φ(γc x t α) 1 Φ(γc x α)c t = 0, from which we can solve for the QMLE of θ. It can be verified that L (θ) is globally concave in θ.

12 278 CHAPER 10. QUASI-MAXIMUM LIKELIHOOD: APPLICAIONS When f(y t y t >c,x t ; θ o ) is the correct specification for g(y t y t >c,x t ), it is not too difficult to derive the conditional mean of truncated y t as IE(y t y t >c,x t )= y t f(y t y t >c,x t ; θ o )dy t c = x t β o + σ φ ( (c x t β o )/σ ) o o 1 Φ ( (c x t β ); o)/σ o (10.1) which differs from x tβ o by a nonlinear term; see Exercise hus, the OLS estimator of regressing y t on x t would be inconsistent for β o when y t are truncated. his also shows why a proper model must consider data truncation. Define the hazard function λ as λ(u) =φ(u)/[1 Φ(u)], which is also known as the inverse Mill s ratio. Setting u t =(c x tβ o )/σ o, the truncated mean can be expressed as IE(y t y t >c,x t )=x tβ o + σ o λ(u t ). It can also be shown that the truncated variance is var(y t y t >c,x t )=σo[ 2 1 λ 2 ] (u t )+λ(u t )u t, instead of σo; 2 see Exercise he marginal response of IE(y t y t >c,x t ) to a change of x t is β o [ 1 λ 2 (u t )+λ(u t )u t ] = β o σ 2 o var(y t y t >c,x t ), which depends on the truncated variance. Consider now the case that the dependent variable y t is truncated from above at c, i.e., y t <c. he truncated density function g(y t y t <c,x t ) can be approximated by f(y t y t <c,x t ; β,σ 2 )= φ[(y t x tβ)/σ] σ Φ ( (c x t β)/σ). hen, analogous to (10.1), we have IE(y t y t <c,x t )=x φ ( (c x ) tβ o σ tβ o )/σ o o Φ ( (c x t β o )/σ ), o with the inverse Mill s ratio λ(u) = φ(u)/φ(u).

13 10.4. MODELS OF LIMIED DEPENDEN VARIABLES Censored Regression Models Consider the following censored dependent variable: { yt, yt > 0, y t = 0, yt 0, where yt = x t β + e t is an index variable. It does not matter whether the threshold value of yt is zero or a non-zero constant c (e.g., the cheapest available price) because c can always be absorbed into the constant term in the specification of yt. Let g and g denote, respectively, the densities of y t and yt conditional on x t. When yt > 0, g(y t x t )=g (yt x t ), and when yt 0, censoring yields the conditional probability IP(y t =0 x t )= 0 g (y t x t )dy t. In the standard obit model (obin, 1958), g (yt x t ) is approximated by ( f(yt x t ; 1 β,σ2 )= exp (y t x tβ) 2 ) 2πσ 2 2σ 2 = 1 ( y σ φ t x ) tβ. σ For yt 0, IP{y t =0 x t } is approximated by 1 0 x φ((yt x σ tβ)/σ) dyt t β/σ = φ(v t )dv t =1 Φ ( x t β Letting α = β/σ and γ = σ 1, the model would be correctly specified for {yt x t } if there exists θ o =(α o γ o ) such that f(yt x t ; θ o )=g (yt x t ). Given the specification above, the quasi-log-likelihood function is 1 2 [log(2π)+log(σ2 )] + 1 {t:y t=0} log(1 Φ(x t β/σ)) 1 2 σ ). {t:y t>0} [(y t x t β)/σ]2, where 1 is the number of t such that y t > 0. he QMLE of θ =(α γ) can then be obtained by maximizing L (θ) = 1 log(1 Φ(x t α)) + 1 log γ 1 (γy 2 t x t α)2. {t:y t=0} he first order condition is θ L (θ) = 1 {t:y t=0} {t:y t>0} φ(x t α) 1 Φ(x t α)x t + {t:y t>0} (γy t x tα)x t 1 γ {t:y t>0} (γy t x t α)y t. = 0,

14 280 CHAPER 10. QUASI-MAXIMUM LIKELIHOOD: APPLICAIONS from which the QMLE can be computed. Again, L (θ) is globally concave in α and γ. he conditional mean of censored y t is IE(y t x t )=IE(y t y t > 0, x t )IP(y t > 0 x t )+IE(y t y t =0, x t )IP(y t =0 x t ) =IE(y t y t > 0, x t )IP(y t > 0 x t ). When f(yt x t ; β,σ2 ) is correctly specified for g (yt x t ), ( y IP(yt > 0 x t )=IP t x ) tβ o > x tβ o σ o σ x t =Φ(x t β o /σ o ). o his leads to the following conditional density: g(y t y t > 0, x t )= g (y t x t ) IP(y t > 0 x t ) = φ[(y t x tβ o )/σ o ] σ o Φ(x t β o /σ o ). hen, analogous to (10.1), we have IE(y t y t > 0, x t )= 0 y t g(y t y t > 0, x t )dy t = x tβ o + σ o φ(x t β o /σ o ) Φ(x t β o /σ o ); (10.2) see Exercise It follows that IE(y t x t )=x tβ o Φ(x tβ o /σ o )+σ o φ(x tβ o /σ o ). his shows that x tβ can not be the correct specification of the conditional mean of censored y t. Similar to truncated regression models, regressing y t on x t results in an inconsistent estimator for β o. It is also easy to calculate that the marginal response of IE(y t x t ) to a change of x t is β o Φ(x t β o /σ o ). Consider the case that y t is censored from above: { y t, yt < 0, y t = 0, y t 0, where y t = x tβ + e t.whenf(y t x t ; β,σ 2 ) is correctly specified for g (y t x t ), hen, IP(y t < 0 x t )=1 Φ(x tβ o /σ o ). g(y t y t < 0, x t )= g (y t x t ) IP(y t < 0 x t ) = φ[(y t x t β o )/σ o ] σ o [1 Φ(x t β o /σ o )],

15 10.4. MODELS OF LIMIED DEPENDEN VARIABLES 281 and hence, analogous to (10.2), IE(y t y t < 0, x t )=x tβ o σ o φ(x tβ o /σ o ) 1 Φ(x t β o /σ o ). Consequently, IE(y t x t )=x tβ o [1 Φ(x tβ o /σ o )] σ o φ(x tβ o /σ o ). and its marginal response to a change of x t is β o [1 Φ(x t β o /σ o )] Models of Sample Selection It is also common to encounter the problem of sample selection or incidental truncation in empirical studies. For example, in the study on how wages and individual characteristics determine working hours, the data of working hours are not resulted from random sampling but from those who select to work. In other words, only those whose market wages are above their reservation wages (the minimum wages for which they are willing to work) may be observed. hus, the decision of working must be related to working hours. Consider two variables y 1 and y t,wherey 1 is a binary choice variable. he selection problem arises when the former affects the latter. hese two variables are { 1, y y 1,t = 1,t > 0, 0, y1,t 0. y 2,t = y2,t, if y 1,t =1, where y1,t = x 1,t β + e 1,t and y 2,t = x 2,t γ + e 2,t are index variables. We observe y 1,t but not the index variable y1,t. Also, we observe y 2,t when y 1,t =1sothaty 2,t are incidentally truncated when y 1,t = 0. It is typical to model (y1,t y 2,t ) basedona bivariate normal distribution: (( ) [ ]) x N 1,t β σ 2 x 2,t γ, 1 σ 12. σ 12 σ2 2 When this is a correct specification for the conditional distribution of (y1,t y 2,t ),the corresponding true parameters are β o, γ o, σ1,o 2, σ2 2,o and σ 12,o. Computing the QMLE based on this bivariate specification is quite complicated. Heckman (1976) suggest a simpler, two-step estimator for the sample-selection model.

16 282 CHAPER 10. QUASI-MAXIMUM LIKELIHOOD: APPLICAIONS For notation simplicity, we shall, in what follows, omit the conditioning variables x 1,t and x 2,t. First recall that the conditional mean of y 2,t given y 1,t is IE(y 2,t y 1,t) =x 2,tγ o + σ 12,o (y 1,t x 1,tβ o )/σ 2 1,o. he result of the truncated regression model also shows that, when the truncation parameter c =0, IE(y 1,t y 1,t > 0) = x 1,tβ o + σ 1,o λ(x 1,tβ o /σ 1,o ), with λ(u) = φ(u)/φ(u). Ptting these results together we have IE(y2,t y 1,t > 0) = x 2,t γ o + σ ( 12,o x 1,t β ) o λ. σ 1,o σ 1,o his motivates the following specification: y 2,t = x 2,t γ + σ 12 σ 1 λ(x 1,t β/σ 1 )+e t, which is a complicated nonlinear regression with heteroskedastic errors. Heckman s two-step estimator is computed as follows. 1. Estimate α = β/σ 1 based on the probit model of y 1,t and denote the estimator as α. 2. Regress y 2,t on x 2,t and λ(x 1,t α ). As the QMLE α in the first step is consistent for α o, the second step is equivalent to regressing y 2,t on x 2,t and λ(x 1,t α o). hus, the OLS estimator of regressing y 2,t on x 2,t alone, which ignores the λ term, is inconsistent and may have severe sample-selection bias in finite samples.

17 10.4. MODELS OF LIMIED DEPENDEN VARIABLES 283 Exercises 10.1 In the logit model with the k 1 parameter vector θ, consider the null hypothesis that an s 1subvectorθ is zero, where s<k. Write down the Wald statistic with a hteroskedasticity-consistent covariance matrix estimator In the logit model with the k 1 parameter vector θ, consider the null hypothesis that an s 1 subvector θ is zero, where s<k. Write down the LM statistic with a hteroskedasticity-consistent covariance matrix estimator Construct a PSE test for testing the null hypothesis of a logit model with F (x t ; θ) = G(x tθ) against the alternative hypothesis of a probit model with F (x t ; θ) = Φ(x tθ) In the conditional logit model with the k 1 parameter vector θ, consider the null hypothesis that an s 1 subvector θ is zero, where s<k. Write down the Wald statistic with a hteroskedasticity-consistent covariance matrix estimator For the ordered logit model, find the marginal responses of the choice probabilities to the change of the variables In the Poisson regression model with the parameter vector θ, consider the null hypothesis Rθ = r, withr a q k given matrix. Write down the Wald statistic with a hteroskedasticity-consistent covariance matrix estimator In the Poisson regression model with the parameter vector θ, consider the null hypothesis Rθ = r, withr a q k given matrix. Write down the LM statistic with a hteroskedasticity-consistent covariance matrix estimator In the truncated regression model with the truncation parameter c, show that IE(y t y t >c,x t ) is given by (10.1) In the truncated regression model with the truncation parameter c, show that var(y t y t >c,x t )=σo[ 2 1 λ 2 ] (u t )+λ(u t )u t,whereλ(ut )=φ(u t )/[1 Φ(u t )] and u t =(c x tβ o )/σ o In the obit model, show that IE(y t y t > 0, x t ) is given by (10.2).

18 284 CHAPER 10. QUASI-MAXIMUM LIKELIHOOD: APPLICAIONS References Amemiya, akeshi (1985). Advanced Econometrics, Cambridge, MA: Harvard University Press. Heckman, James J. (1976). Common structure of statistical medels of truncation, sample selection, and limited dependent variables and a simple estimator for such models, Annals of Economic and Social Measurement, 5, McFadden, Daniel L. (1973). Conditional logit analysis of qualitative choice behavior, in P. Zarembka (ed.), Frontier of Econometrics, New York: Academic Press. obin, James (1958). Estimation of relationships for limited dependent variables, Econometrica, 26,

Quasi Maximum Likelihood Theory

Quasi Maximum Likelihood Theory Quasi Maximum Likelihood heory CHUNG-MING KUAN Department of Finance & CREA June 17, 2010 C.-M. Kuan (National aiwan Univ.) Quasi Maximum Likelihood heory June 17, 2010 1 / 119 Lecture Outline 1 Kullback-Leibler

More information

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables

More information

Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions

Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions Econ 513, USC, Department of Economics Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions I Introduction Here we look at a set of complications with the

More information

ECON 594: Lecture #6

ECON 594: Lecture #6 ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was

More information

I. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as

I. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as Econ 513, USC, Fall 2005 Lecture 15 Discrete Response Models: Multinomial, Conditional and Nested Logit Models Here we focus again on models for discrete choice with more than two outcomes We assume that

More information

Economics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama

Economics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama Problem Set #1 (Random Data Generation) 1. Generate =500random numbers from both the uniform 1 ( [0 1], uniformbetween zero and one) and exponential exp ( ) (set =2and let [0 1]) distributions. Plot the

More information

Truncation and Censoring

Truncation and Censoring Truncation and Censoring Laura Magazzini laura.magazzini@univr.it Laura Magazzini (@univr.it) Truncation and Censoring 1 / 35 Truncation and censoring Truncation: sample data are drawn from a subset of

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

Non-linear panel data modeling

Non-linear panel data modeling Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1

More information

Limited Dependent Variables and Panel Data

Limited Dependent Variables and Panel Data and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement

More information

Maximum Likelihood Methods

Maximum Likelihood Methods Maximum Likelihood Methods Some of the models used in econometrics specify the complete probability distribution of the outcomes of interest rather than just a regression function. Sometimes this is because

More information

Discrete Dependent Variable Models

Discrete Dependent Variable Models Discrete Dependent Variable Models James J. Heckman University of Chicago This draft, April 10, 2006 Here s the general approach of this lecture: Economic model Decision rule (e.g. utility maximization)

More information

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools Syllabus By Joan Llull Microeconometrics. IDEA PhD Program. Fall 2017 Chapter 1: Introduction and a Brief Review of Relevant Tools I. Overview II. Maximum Likelihood A. The Likelihood Principle B. The

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Censoring and truncation b)

More information

Econometrics II. Seppo Pynnönen. Spring Department of Mathematics and Statistics, University of Vaasa, Finland

Econometrics II. Seppo Pynnönen. Spring Department of Mathematics and Statistics, University of Vaasa, Finland Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2018 Part III Limited Dependent Variable Models As of Jan 30, 2017 1 Background 2 Binary Dependent Variable The Linear Probability

More information

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated

More information

h=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio

h=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio Multivariate Response Models The response variable is unordered and takes more than two values. The term unordered refers to the fact that response 3 is not more favored than response 2. One choice from

More information

The Quasi-Maximum Likelihood Method: Theory

The Quasi-Maximum Likelihood Method: Theory Chapter 9 he Quasi-Maximum Likelihood Method: heory As discussed in preceding chapters, estimating linear and nonlinear regressions by the least squares method results in an approximation to the conditional

More information

Applied Health Economics (for B.Sc.)

Applied Health Economics (for B.Sc.) Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative

More information

Economics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation

Economics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation University of Illinois Fall 2016 Department of Economics Roger Koenker Economics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation The simplest of this general class of models is Tobin s (1958)

More information

Poisson Regression. Ryan Godwin. ECON University of Manitoba

Poisson Regression. Ryan Godwin. ECON University of Manitoba Poisson Regression Ryan Godwin ECON 7010 - University of Manitoba Abstract. These lecture notes introduce Maximum Likelihood Estimation (MLE) of a Poisson regression model. 1 Motivating the Poisson Regression

More information

Econometrics Master in Business and Quantitative Methods

Econometrics Master in Business and Quantitative Methods Econometrics Master in Business and Quantitative Methods Helena Veiga Universidad Carlos III de Madrid This chapter deals with truncation and censoring. Truncation occurs when the sample data are drawn

More information

A Guide to Modern Econometric:

A Guide to Modern Econometric: A Guide to Modern Econometric: 4th edition Marno Verbeek Rotterdam School of Management, Erasmus University, Rotterdam B 379887 )WILEY A John Wiley & Sons, Ltd., Publication Contents Preface xiii 1 Introduction

More information

Lecture 3. Truncation, length-bias and prevalence sampling

Lecture 3. Truncation, length-bias and prevalence sampling Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in

More information

Generalized Method of Moment

Generalized Method of Moment Generalized Method of Moment CHUNG-MING KUAN Department of Finance & CRETA National Taiwan University June 16, 2010 C.-M. Kuan (Finance & CRETA, NTU Generalized Method of Moment June 16, 2010 1 / 32 Lecture

More information

Estimation of Dynamic Regression Models

Estimation of Dynamic Regression Models University of Pavia 2007 Estimation of Dynamic Regression Models Eduardo Rossi University of Pavia Factorization of the density DGP: D t (x t χ t 1, d t ; Ψ) x t represent all the variables in the economy.

More information

Testing for Regime Switching: A Comment

Testing for Regime Switching: A Comment Testing for Regime Switching: A Comment Andrew V. Carter Department of Statistics University of California, Santa Barbara Douglas G. Steigerwald Department of Economics University of California Santa Barbara

More information

PhD/MA Econometrics Examination January 2012 PART A

PhD/MA Econometrics Examination January 2012 PART A PhD/MA Econometrics Examination January 2012 PART A ANSWER ANY TWO QUESTIONS IN THIS SECTION NOTE: (1) The indicator function has the properties: (2) Question 1 Let, [defined as if using the indicator

More information

Econometric Analysis of Panel Data. Final Examination: Spring 2013

Econometric Analysis of Panel Data. Final Examination: Spring 2013 Econometric Analysis of Panel Data Professor William Greene Phone: 212.998.0876 Office: KMC 7-90 Home page:www.stern.nyu.edu/~wgreene Email: wgreene@stern.nyu.edu URL for course web page: people.stern.nyu.edu/wgreene/econometrics/paneldataeconometrics.htm

More information

Ordered Response and Multinomial Logit Estimation

Ordered Response and Multinomial Logit Estimation Ordered Response and Multinomial Logit Estimation Quantitative Microeconomics R. Mora Department of Economics Universidad Carlos III de Madrid Outline Introduction 1 Introduction 2 3 Introduction The Ordered

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Maximum Likelihood Estimation Additional Topics Raphael Cunha Program in Statistics and Methodology PRISM Department of Political Science The Ohio State University cunha.6@osu.edu March 28, 2014 Overview

More information

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

More information

Generalized Linear Models for Non-Normal Data

Generalized Linear Models for Non-Normal Data Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture

More information

More on Roy Model of Self-Selection

More on Roy Model of Self-Selection V. J. Hotz Rev. May 26, 2007 More on Roy Model of Self-Selection Results drawn on Heckman and Sedlacek JPE, 1985 and Heckman and Honoré, Econometrica, 1986. Two-sector model in which: Agents are income

More information

Econometric Analysis of Panel Data. Final Examination: Spring 2018

Econometric Analysis of Panel Data. Final Examination: Spring 2018 Department of Economics Econometric Analysis of Panel Data Professor William Greene Phone: 212.998.0876 Office: KMC 7-90 Home page: people.stern.nyu.edu/wgreene Email: wgreene@stern.nyu.edu URL for course

More information

Linear model A linear model assumes Y X N(µ(X),σ 2 I), And IE(Y X) = µ(x) = X β, 2/52

Linear model A linear model assumes Y X N(µ(X),σ 2 I), And IE(Y X) = µ(x) = X β, 2/52 Statistics for Applications Chapter 10: Generalized Linear Models (GLMs) 1/52 Linear model A linear model assumes Y X N(µ(X),σ 2 I), And IE(Y X) = µ(x) = X β, 2/52 Components of a linear model The two

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Testing and Model Selection

Testing and Model Selection Testing and Model Selection This is another digression on general statistics: see PE App C.8.4. The EViews output for least squares, probit and logit includes some statistics relevant to testing hypotheses

More information

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College Original December 2016, revised July 2017 Abstract Lewbel (2012)

More information

Introduction to GSEM in Stata

Introduction to GSEM in Stata Introduction to GSEM in Stata Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Introduction to GSEM in Stata Boston College, Spring 2016 1 /

More information

Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models

Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models Massimiliano Bratti & Alfonso Miranda In many fields of applied work researchers need to model an

More information

Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment

Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment BINARY CHOICE MODELS Y ( Y ) ( Y ) 1 with Pr = 1 = P = 0 with Pr = 0 = 1 P Examples: decision-making purchase of durable consumer products unemployment Estimation with OLS? Yi = Xiβ + εi Problems: nonsense

More information

8 Nominal and Ordinal Logistic Regression

8 Nominal and Ordinal Logistic Regression 8 Nominal and Ordinal Logistic Regression 8.1 Introduction If the response variable is categorical, with more then two categories, then there are two options for generalized linear models. One relies on

More information

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College December 2016 Abstract Lewbel (2012) provides an estimator

More information

Discrete Choice Modeling

Discrete Choice Modeling [Part 6] 1/55 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent Class 11 Mixed Logit 12 Stated Preference

More information

Lecture 11/12. Roy Model, MTE, Structural Estimation

Lecture 11/12. Roy Model, MTE, Structural Estimation Lecture 11/12. Roy Model, MTE, Structural Estimation Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy model The Roy model is a model of comparative advantage: Potential earnings

More information

Panel Data Seminar. Discrete Response Models. Crest-Insee. 11 April 2008

Panel Data Seminar. Discrete Response Models. Crest-Insee. 11 April 2008 Panel Data Seminar Discrete Response Models Romain Aeberhardt Laurent Davezies Crest-Insee 11 April 2008 Aeberhardt and Davezies (Crest-Insee) Panel Data Seminar 11 April 2008 1 / 29 Contents Overview

More information

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1 Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, Discrete Changes 1 JunXuJ.ScottLong Indiana University 2005-02-03 1 General Formula The delta method is a general

More information

Tobit and Selection Models

Tobit and Selection Models Tobit and Selection Models Class Notes Manuel Arellano November 24, 2008 Censored Regression Illustration : Top-coding in wages Suppose Y log wages) are subject to top coding as is often the case with

More information

Limited Dependent Variable Models II

Limited Dependent Variable Models II Limited Dependent Variable Models II Fall 2008 Environmental Econometrics (GR03) LDV Fall 2008 1 / 15 Models with Multiple Choices The binary response model was dealing with a decision problem with two

More information

Chapter 11. Regression with a Binary Dependent Variable

Chapter 11. Regression with a Binary Dependent Variable Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score

More information

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Linear

More information

1. Basic Model of Labor Supply

1. Basic Model of Labor Supply Static Labor Supply. Basic Model of Labor Supply.. Basic Model In this model, the economic unit is a family. Each faimily maximizes U (L, L 2,.., L m, C, C 2,.., C n ) s.t. V + w i ( L i ) p j C j, C j

More information

Appendix A: The time series behavior of employment growth

Appendix A: The time series behavior of employment growth Unpublished appendices from The Relationship between Firm Size and Firm Growth in the U.S. Manufacturing Sector Bronwyn H. Hall Journal of Industrial Economics 35 (June 987): 583-606. Appendix A: The time

More information

Using EViews Vox Principles of Econometrics, Third Edition

Using EViews Vox Principles of Econometrics, Third Edition Using EViews Vox Principles of Econometrics, Third Edition WILLIAM E. GRIFFITHS University of Melbourne R. CARTER HILL Louisiana State University GUAY С LIM University of Melbourne JOHN WILEY & SONS, INC

More information

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples ST3241 Categorical Data Analysis I Generalized Linear Models Introduction and Some Examples 1 Introduction We have discussed methods for analyzing associations in two-way and three-way tables. Now we will

More information

Control Function and Related Methods: Nonlinear Models

Control Function and Related Methods: Nonlinear Models Control Function and Related Methods: Nonlinear Models Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. General Approach 2. Nonlinear

More information

Regression with time series

Regression with time series Regression with time series Class Notes Manuel Arellano February 22, 2018 1 Classical regression model with time series Model and assumptions The basic assumption is E y t x 1,, x T = E y t x t = x tβ

More information

Chapter 6 Stochastic Regressors

Chapter 6 Stochastic Regressors Chapter 6 Stochastic Regressors 6. Stochastic regressors in non-longitudinal settings 6.2 Stochastic regressors in longitudinal settings 6.3 Longitudinal data models with heterogeneity terms and sequentially

More information

Limited Dependent Variables and Panel Data

Limited Dependent Variables and Panel Data Limited Dependent Variables and Panel Data Logit, Probit and Friends Benjamin Bittschi Sebastian Koch Outline Binary dependent variables Logit Fixed Effects Models Probit Random Effects Models Censored

More information

Course Notes: Statistical and Econometric Methods

Course Notes: Statistical and Econometric Methods Course Notes: Statistical and Econometric Methods h(t) 4 2 0 h 1 (t) h (t) 2 h (t) 3 1 2 3 4 5 t h (t) 4 1.0 0.5 0 1 2 3 4 t F(t) h(t) S(t) f(t) sn + + + + + + + + + + + + + + + + + + + + + + + + - - -

More information

(θ θ ), θ θ = 2 L(θ ) θ θ θ θ θ (θ )= H θθ (θ ) 1 d θ (θ )

(θ θ ), θ θ = 2 L(θ ) θ θ θ θ θ (θ )= H θθ (θ ) 1 d θ (θ ) Setting RHS to be zero, 0= (θ )+ 2 L(θ ) (θ θ ), θ θ = 2 L(θ ) 1 (θ )= H θθ (θ ) 1 d θ (θ ) O =0 θ 1 θ 3 θ 2 θ Figure 1: The Newton-Raphson Algorithm where H is the Hessian matrix, d θ is the derivative

More information

QED. Queen s Economics Department Working Paper No Russell Davidson Queen s University. James G. MacKinnon Queen s University

QED. Queen s Economics Department Working Paper No Russell Davidson Queen s University. James G. MacKinnon Queen s University QED Queen s Economics Department Working Paper No. 514 Convenient Specification Tests for Logit and Probit Models Russell Davidson Queen s University James G. MacKinnon Queen s University Department of

More information

Gibbs Sampling in Endogenous Variables Models

Gibbs Sampling in Endogenous Variables Models Gibbs Sampling in Endogenous Variables Models Econ 690 Purdue University Outline 1 Motivation 2 Identification Issues 3 Posterior Simulation #1 4 Posterior Simulation #2 Motivation In this lecture we take

More information

Eksamen på Økonomistudiet 2006-II Econometrics 2 June 9, 2006

Eksamen på Økonomistudiet 2006-II Econometrics 2 June 9, 2006 Eksamen på Økonomistudiet 2006-II Econometrics 2 June 9, 2006 This is a four hours closed-book exam (uden hjælpemidler). Please answer all questions. As a guiding principle the questions 1 to 4 have equal

More information

Linear Regression With Special Variables

Linear Regression With Special Variables Linear Regression With Special Variables Junhui Qian December 21, 2014 Outline Standardized Scores Quadratic Terms Interaction Terms Binary Explanatory Variables Binary Choice Models Standardized Scores:

More information

Lecture 6: Discrete Choice: Qualitative Response

Lecture 6: Discrete Choice: Qualitative Response Lecture 6: Instructor: Department of Economics Stanford University 2011 Types of Discrete Choice Models Univariate Models Binary: Linear; Probit; Logit; Arctan, etc. Multinomial: Logit; Nested Logit; GEV;

More information

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models David Rosenberg New York University April 12, 2015 David Rosenberg (New York University) DS-GA 1003 April 12, 2015 1 / 20 Conditional Gaussian Regression Gaussian Regression Input

More information

Lecture-20: Discrete Choice Modeling-I

Lecture-20: Discrete Choice Modeling-I Lecture-20: Discrete Choice Modeling-I 1 In Today s Class Introduction to discrete choice models General formulation Binary choice models Specification Model estimation Application Case Study 2 Discrete

More information

Modeling Binary Outcomes: Logit and Probit Models

Modeling Binary Outcomes: Logit and Probit Models Modeling Binary Outcomes: Logit and Probit Models Eric Zivot December 5, 2009 Motivating Example: Women s labor force participation y i = 1 if married woman is in labor force = 0 otherwise x i k 1 = observed

More information

Simultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations

Simultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations Simultaneous Equation Models. Introduction: basic definitions 2. Consequences of ignoring simultaneity 3. The identification problem 4. Estimation of simultaneous equation models 5. Example: IS LM model

More information

MC3: Econometric Theory and Methods. Course Notes 4

MC3: Econometric Theory and Methods. Course Notes 4 University College London Department of Economics M.Sc. in Economics MC3: Econometric Theory and Methods Course Notes 4 Notes on maximum likelihood methods Andrew Chesher 25/0/2005 Course Notes 4, Andrew

More information

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017 Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent

More information

Lecture 6: Dynamic Models

Lecture 6: Dynamic Models Lecture 6: Dynamic Models R.G. Pierse 1 Introduction Up until now we have maintained the assumption that X values are fixed in repeated sampling (A4) In this lecture we look at dynamic models, where the

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation NELS 88 Table 2.3 Adjusted odds ratios of eighth-grade students in 988 performing below basic levels of reading and mathematics in 988 and dropping out of school, 988 to 990, by basic demographics Variable

More information

DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005

DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005 DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005 The lectures will survey the topic of count regression with emphasis on the role on unobserved heterogeneity.

More information

Functional Form. Econometrics. ADEi.

Functional Form. Econometrics. ADEi. Functional Form Econometrics. ADEi. 1. Introduction We have employed the linear function in our model specification. Why? It is simple and has good mathematical properties. It could be reasonable approximation,

More information

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo Lecture 1 Behavioral Models Multinomial Logit: Power and limitations Cinzia Cirillo 1 Overview 1. Choice Probabilities 2. Power and Limitations of Logit 1. Taste variation 2. Substitution patterns 3. Repeated

More information

Multinomial Discrete Choice Models

Multinomial Discrete Choice Models hapter 2 Multinomial Discrete hoice Models 2.1 Introduction We present some discrete choice models that are applied to estimate parameters of demand for products that are purchased in discrete quantities.

More information

Parametric Identification of Multiplicative Exponential Heteroskedasticity

Parametric Identification of Multiplicative Exponential Heteroskedasticity Parametric Identification of Multiplicative Exponential Heteroskedasticity Alyssa Carlson Department of Economics, Michigan State University East Lansing, MI 48824-1038, United States Dated: October 5,

More information

Women. Sheng-Kai Chang. Abstract. In this paper a computationally practical simulation estimator is proposed for the twotiered

Women. Sheng-Kai Chang. Abstract. In this paper a computationally practical simulation estimator is proposed for the twotiered Simulation Estimation of Two-Tiered Dynamic Panel Tobit Models with an Application to the Labor Supply of Married Women Sheng-Kai Chang Abstract In this paper a computationally practical simulation estimator

More information

Standard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j

Standard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j Standard Errors & Confidence Intervals β β asy N(0, I( β) 1 ), where I( β) = [ 2 l(β, φ; y) ] β i β β= β j We can obtain asymptotic 100(1 α)% confidence intervals for β j using: β j ± Z 1 α/2 se( β j )

More information

Practical Econometrics. for. Finance and Economics. (Econometrics 2)

Practical Econometrics. for. Finance and Economics. (Econometrics 2) Practical Econometrics for Finance and Economics (Econometrics 2) Seppo Pynnönen and Bernd Pape Department of Mathematics and Statistics, University of Vaasa 1. Introduction 1.1 Econometrics Econometrics

More information

Dynamic Models Part 1

Dynamic Models Part 1 Dynamic Models Part 1 Christopher Taber University of Wisconsin December 5, 2016 Survival analysis This is especially useful for variables of interest measured in lengths of time: Length of life after

More information

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes Maximum Likelihood Estimation Econometrics II Department of Economics Universidad Carlos III de Madrid Máster Universitario en Desarrollo y Crecimiento Económico Outline 1 3 4 General Approaches to Parameter

More information

Lecture 14 More on structural estimation

Lecture 14 More on structural estimation Lecture 14 More on structural estimation Economics 8379 George Washington University Instructor: Prof. Ben Williams traditional MLE and GMM MLE requires a full specification of a model for the distribution

More information

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2016 Instructor: Victor Aguirregabiria

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2016 Instructor: Victor Aguirregabiria ECOOMETRICS II (ECO 24S) University of Toronto. Department of Economics. Winter 26 Instructor: Victor Aguirregabiria FIAL EAM. Thursday, April 4, 26. From 9:am-2:pm (3 hours) ISTRUCTIOS: - This is a closed-book

More information

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns

More information

2. We care about proportion for categorical variable, but average for numerical one.

2. We care about proportion for categorical variable, but average for numerical one. Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is

More information

Multivariate Versus Multinomial Probit: When are Binary Decisions Made Separately also Jointly Optimal?

Multivariate Versus Multinomial Probit: When are Binary Decisions Made Separately also Jointly Optimal? Multivariate Versus Multinomial Probit: When are Binary Decisions Made Separately also Jointly Optimal? Dale J. Poirier and Deven Kapadia University of California, Irvine March 10, 2012 Abstract We provide

More information

Lecture 4: Exponential family of distributions and generalized linear model (GLM) (Draft: version 0.9.2)

Lecture 4: Exponential family of distributions and generalized linear model (GLM) (Draft: version 0.9.2) Lectures on Machine Learning (Fall 2017) Hyeong In Choi Seoul National University Lecture 4: Exponential family of distributions and generalized linear model (GLM) (Draft: version 0.9.2) Topics to be covered:

More information

LOGISTIC REGRESSION Joseph M. Hilbe

LOGISTIC REGRESSION Joseph M. Hilbe LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models Advanced Methods for Data Analysis (36-402/36-608 Spring 2014 1 Generalized linear models 1.1 Introduction: two regressions So far we ve seen two canonical settings for regression.

More information

Environmental Econometrics

Environmental Econometrics Environmental Econometrics Syngjoo Choi Fall 2008 Environmental Econometrics (GR03) Fall 2008 1 / 37 Syllabus I This is an introductory econometrics course which assumes no prior knowledge on econometrics;

More information

Estimation in the Fixed Effects Ordered Logit Model. Chris Muris (SFU)

Estimation in the Fixed Effects Ordered Logit Model. Chris Muris (SFU) Estimation in the Fixed Effects Ordered Logit Model Chris Muris (SFU) Outline Introduction Model and main result Cut points Estimation Simulations and illustration Conclusion Setting 1. Fixed-T panel.

More information

4.8 Instrumental Variables

4.8 Instrumental Variables 4.8. INSTRUMENTAL VARIABLES 35 4.8 Instrumental Variables A major complication that is emphasized in microeconometrics is the possibility of inconsistent parameter estimation due to endogenous regressors.

More information

An Introduction to Econometrics. A Self-contained Approach. Frank Westhoff. The MIT Press Cambridge, Massachusetts London, England

An Introduction to Econometrics. A Self-contained Approach. Frank Westhoff. The MIT Press Cambridge, Massachusetts London, England An Introduction to Econometrics A Self-contained Approach Frank Westhoff The MIT Press Cambridge, Massachusetts London, England How to Use This Book xvii 1 Descriptive Statistics 1 Chapter 1 Prep Questions

More information

Econometrics Master in Business and Quantitative Methods

Econometrics Master in Business and Quantitative Methods Econometrics Master in Business and Quantitative Methods Helena Veiga Universidad Carlos III de Madrid Models with discrete dependent variables and applications of panel data methods in all fields of economics

More information