14. Binary Outcomes. A. Colin Cameron Pravin K. Trivedi Copyright 2006
|
|
- Bethany Cameron
- 5 years ago
- Views:
Transcription
1 14. Binary Outcomes A. Colin Cameron Pravin K. Trivedi Copyright 2006 These slides were prepared in They cover material similar to Sections of our subsequent book Microeconometrics: Methods and Applications, Cambridge University Press, 2005.
2 INTRODUCTION Discrete choice or qualitative response models are for + that takes only a nite number of discrete values. Here we consider binary outcome models where only two values are taken, f and Particularly logit and probit models. These are relatively straightforward nonlinear models. Later consider complications such as discrete variable endogeneity, panel, etc. Reason for this course. 2
3 OUTLINE General Results Probit, logit, LPM and OLS models. Latent variable formulations, especially random utility model. Choice-based samples. Semi-parametric estimation. Grouped or aggregate data. 3
4 STATISTICAL MOTIVATION The coin toss example of introductory statistics. Let R denote the probability of a head E+ ' on one coin toss. Then Prd+ ' o'r and Prd+ 'fo' R. For? tosses + is the of? independent realizations of head or tail. The MLE for R is the sample mean +, i.e. the proportion of tosses that are heads. 4
5 BERNOULLI DISTRIBUTION Prd+ ' o'r and Prd+ 'fo' R Compact expression for density se+ 'R + E R + This is Bernoulli density which is the binomial with one trial per observation. Moments Ed+o ' R nf E R 'R Vd+o 'E R R nef R E R 'RE R. Note that R can be interpreted as Ed+o or as Prd+ ' o 5
6 ECONOMIC EXAMPLES Leading economic applications are labor supply: + ' if work and + 'fotherwise transportation mode choice: + ' if commute to work by public transit and + 'fotherwise. Independent trials may be reasonable. Assuming a constant probability R for each trial is not. It will vary with an individual s characteristics. So generalize to let R be a function of regressors. 6
7 BINARY OUTCOME MODELS Regression model formed by parameterizing R to depend on regressors and parameters. Usually specify single-index model (d+ m o'r ' 8 E 3 Usually chose 8 E to be a cumulative distribution function (cdf). Then f 8 E, f R. logistic cdf gives logit model. standard normal cdf gives probit model. 7
8 MLE Density se+ 'R + E R + c R ' 8 E 3, *? se+ '+ *? 8 E 3 ne + *?E 8 E 3 Log-likelihood function is OE ' [? ' + *? 8 E 3 ne + *?E 8 E 3 Let 8 3 E5 'Y8E5*Y5. MLE solves [? ' +, + 8 E E 3 n + 8 E E 3 ' f 8
9 DISTRIBUTION OF MLE The MLE f.o.c. simplify to [? + 8 E 3 ' 8 E 3 E 8 E E 3 ' f General ML result if density correctly speci ed e 0/ 1 % f c (dy 2 O*YY 3 o f For binary outcome MLE e 0/ 1 7 f c # [? ' $ 6 8 E 3 fe 8 E 3 f 8 3 E 3 f
10 MISSPECIFIED MODEL For binary data the dgp density is always Bernoulli as hd+ ' o'r, hd+ 'fo' hd+ ' o' 8E 3 Therefore only possible misspeci cation of dgp is if R 9' 8 E 3. Clearly inconsistent estimator if R 9' 8 E 3 as then (d+ 8 E 3 o 9' f leading to left-hand side of f.o.c. not having expected value f. 10
11 WEIGHTED NLS INTERPRETATION Since (d+m o '8 E 3 9d+m o '8 E 3 E 8 E 3 Y(d+m o*yq ' 8 3 E 3 the MLE rst-order conditions imply [? + (d+ m oy(d+ m o ' 9d+ m o Y ' f Residuals are orthogonal to regressors upon weighting to adjust for heteroskedasticity. i.e. nonlinear WLS. 11
12 Similarly e f c # [? ' 9d+ m o is of nonlinear WLS form. Y(d+ m o Yq Y(d+ m o Yq 3 f $ 6 These properties are more generally those of the quasi-mle where the speci ed density is in the linear exponential family. 8 12
13 LOGIT MODEL The logit model speci es R '\E 3 ' e 3 ne 3 c \E5 'e 5 *E n e 5 ' *E n e 5 is the logistic cdf. The derivative \ 3 E5 ' \E5E \E5 is the logistic density. For this reason also called logistic regression model. 13
14 The logit ML conditions simplify to [? ' E+ \E 3 ' f The logit MLE has [? e /RJLW 1 f c ' \E 3 fe \E 3 f 3 14
15 PROBIT MODEL The probit model speci es R 'xe 3 xe5 ' U 5 4 Er_r is the standard normal. The derivative x 3 E5 'E5 'E * s 2Zi TE5 2 *2 is the standard normal density function. There is no simpli cation to the f.o.c., unlike logit case. 15
16 The probit MLE has distribution e 3URELW 1 7 f c # [? ' E 3 f 2 $ 6 xe 3 fe xe 3 f
17 LINEAR PROBABILITY MODEL (LPM) The LPM speci es R ' 3 The LPM MLE f.o.c. conditions are [? ' E 3 ' fc The LPM MLE has distribution e /30 1 7c # [? ' $ 6 3 fe 3 f
18 The LPM model has the obvious weakness of permitting probabilities outside the (0, 1) interval. Furthermore, the estimator can be numerically unstable if 3 close to f or 18
19 OLS The LPM is better estimated by OLS, which also speci es (d+ m o' 3. Allow for the intrinsic heteroskedasticity of binary data e 1 k f c Ej 3 j j 3 ljej 3 j l where for l use el ' 'LDJdE+ 3 e 2 o or el ' 'LDJd 3 e E 3 e o 19
20 COEFFICIENT INTERPRETATION Coef cients in different models are not directly comparable due to different scaling. Instead compare across models effect of a one unit change in regressors on Pd+ ' m o 'Ed+ ' m o. Now (d+m o '8 E 3, Y(d+m o*y ' 8 3 E 3 where 8 3 E5 'Y8E5*Y5. Thus the effect depends on the functional form of 8 and the evaluation point, in addition to parameter. 20
21 For Y(d+m o*y ' 8 3 E 3 consider 8 3 E 3. Logit: 8 3 E5 f2d since 8 3 E5 '\E5E \E5 is maximized for f \E5 when \E5 'fd. Probit model 8 3 E5 fef since 8 3 E5 'E5which is at maximum of * s 2Z * fe at 5 'f. LPM or OLS: 8 3 E5 '. This suggests for slope parameters theruleofthumb e /RJLW * e e 2/6 e 3URELW * 2D e 2/6 e /RJLW * S e 3URELW 21
22 Amemiya (1981, p.1488) demonstrates this works quite well, for f 8 E 3 fb. 22
23 ODDS RATIO FOR LOGIT For the logit model R ' i TE 3 *E n i TE 3, R R ' i TE 3, *? R R ' 3 R*E R is the odds ratio which measures the probability that + ' relative to the probability that + 'f. E.g. Pharmaceutical drug study where + ' denotes survival and + 'fdenotes death. An odds ratio of 2 means that the odds of survival are twice those of death. 23
24 Statistical analyses and packages use R*E R ' i TE 3. Suppose the regressor increases by one unit. Then 3 increases to 3 nq. And i TE 3 increases to i TE 3 nq ' i TE 3 i T Eq. Thus the odds ratio has increased by a multiple i T Eq. E.g. a logit slope parameter of f means that a one unit change in the regressor increases the odds ratio by a multiple i TEf * f fd. The relative probability of survival has increased by fd percent. This interpretation widely used in applied biostatistics. 24
25 For economists it is more natural to interpret q as a semi-elasticity for the odds ratio, since *? R*E R ' 3. Then a logit slope parameter of f means that a one unit change in the regressor increases the odds ratio by a multiple f. This coincides exactly with the interpretation used in statistics for very small of q, since then i TEq 'q. 25
26 WHICH BINARY CHOICE MODEL? Which model logit, probit or linear probability? Theoretically it depends on the data generating process (dgp). Unlike other applications of ML there is no problem in specifying the distribution the only possible distribution for a Efc variable is the Bernoulli. The problem lies in specifying a functional form for the parameter of this distribution. If the dgp has R '\E 3 f then a logit model should 26
27 be used, and estimators based on other models such as probit are potentially inconsistent. Similar conclusions hold if instead for the dgp has R 'xe 3 f or R ' 3 f. 27
28 Aside: Misspeci cation consequences, however, are not as large as this. As long as in the true model the probability or mean is of the single-index form R ' 8 E 3 f, then choosing the wrong function 8 effects all slope parameters equally, and the ratio of slope parameters is constant across the models. See Ruud (1986, 1993). 28
29 WHY LOGIT MODEL? Logit model is the binary model used by statisticians: F.o.c. and asymptotic distribution are relatively simple. Logit model corresponds ises the canonical link function for the binomial, a generalized linear model. Coef cients can be interpreted in terms of the logodds ratio. Easy generalization to multinomial logit. A discriminant analysis interpretation can be given. 29
30 DISCRIMINANT ANALYSIS Aside: In discriminant analysis both + and are random variables is observed but + is not observed given only we determine whether + equals0or1. E.g. Classify what type of humanoid (+ 'for )askull belongs to given various dimensions ( ) of the skull. If m+ is multivariate normal distributed, the posterior probability of +m is similar to logit model probability. 30
31 WHY PROBIT MODEL? The probit model is often used by economists. It is motivated by a latent normal random variable. So ties in with tobit models and multinomial probit. Empirically, either logit and probit can be used little difference between results from probit and logit analysis, once rescale parameter estimates. Greatest difference is in prediction of probabilities close to f or. 31
32 WHY OLS? The LPM should not be used as probabilities outside the (0, 1) interval and be numerically unstable. Nonetheless OLS can be useful for preliminary data analysis. In practice standard errors of slope coef cients are often quite similar across logit, probit and OLS (even using the incorrect r 2 Ej 3 j in the case of OLS). Final results should, however, use probit or logit. 32
33 DETERMINING MODEL ADEQUACY Several measures of model adequacy have been proposed. Many are very speci c to binary outcome models. There is no single best measure. See Amemiya (1981) and Maddala (1983). Approaches: R-squared measures. Compare e+ with + Compare predicted e 3Ud+ ' owith actual 3Ud+ ' o. 33
34 McFADDEN S R-SQUARED There are many --squareds for binary models as - 2 in linear model has many interpretations. McFadden proposed two. We favor McFadden (1974) where - 2 ' O s O f c O s ' log-likelihood in the tted model O f is the log-likelihood in the intercept-only model. This - 2 should be only used for discrete choice models. 34
35 In other nonlinear models instead use - 2 ' EO 4@ O s *EO 4@ O f c where O 4@ is the maximum possible value of the log-likelihood. For binary outcome models O 4@ 'f. For some other models O 4@ can be unbounded restricting use of this. 35
36 PREDICTION THAT ) ' Many measures compare predicted e+ with +. The problem is in de ning a rule for when e+ '. Obvious is e+ ' when er ' 8 E 3 e : fd. But this can e.g. yield e+ 'fall the time if most of the sample has + 'f The receiver operating characteristics (ROC) curve considers what happens as the cuttoff e+ :Svaries. 36
37 PREDICTION OF Pr[) ' o Can compare predicted e 3U[) ' o with Pr[) ' o. But testing whether on on average the predicted probabilities equal the sample frequencies is not helpful over the entire sample, since for the logit model with an intercept the f.o.c. imply S? ' + \E 3 e 'f,sothat S?' er '7+. Useful for subsamples. 37
38 LATENT VARIABLE APPROACHES Let + be an unobserved underlying continuous latent variable. When + crosses a threshold + ' is generated. Theobservedbinaryoutcome+ is still Bernoulli. The latent variable + determines the functional form of the parameter of this Bernoulli distribution. Two latent variable approaches: index function random utility. 38
39 INDEX FUNCTION Interest is in explaining the underlying unobserved continuous random variable +. The natural regression model for + is the index function model + ' 3 n This cannot be estimated as + is not observed. Instead we observe + if + : f + ' f if + f The choice of 0 as the threshold is a normalization. If 39
40 instead + :Sis used as the threshold, exactly the same slope parameter estimates are obtained if includes a constant term. Examples: + is a person s tendency to work and we observe only whether or not the person works (+ ' ) + is a person s propensity to commute by public transit and we observe only whether or not the public transit is used (+ ' ). 40
41 INDEX FUNCTION... Then hd+ ' o 'hd+ : fo 'hd 3 n :fo 'hd 3 o ' 8 E 3 c where 8 is the cdf of (which equals the cdf of in the usual case of density symmetric about f in which case 8 E ' 8 E.) The index function therefore gives a way to interpret the function 8. 41
42 The probit model arises from standard normal errors in the index model. The logit model arises from logistic errors in the index model. Index function approach Gives a way to interpret the parameter as the change in + when changes by one unit, but this is of limited use as + is essentially unitless. Cangeneralize to ordered models, multivariate models and limited dependent variable models. 42
43 RANDOM UTILITY MODELS In the random utility formulation a consumer selects the choice with highest utility. The discrete variable + takesvalue if choice has higher utility takesvaluefif choice f has higher utility. 43
44 The random utility model speci es the utilities of alternatives f and to be L f ' > f n 0 f L ' > n 0 where > f and > are deterministic components of utility, whose dependence on regressors is detailed below. 0 f and 0 are random components of utility. 44
45 The alternative with highest utility is chosen. So the observed choice is hd+ ' o 'hdl:l f o 'hd> n 0 :> f n 0 f o 'hd0 f 0 > > f o ' 8 E> > f c where 8 is the cdf of E0 f 0. Different distributions of 0 f and 0 give different discrete choice models. 45
46 Binary probit arises if 0 f and 0 are normal, as is readily seen by noting that then E0 f 0 is normally distributed, upon normalization of the variance of E0 f 0 to unity. Binary logit model arises if 0 f and 0 are type I extreme value distributed, de ned soon, as then the difference E0 f 0 can be shown to be logistic distributed. The random component 0 in utility model is needed. Otherwise, choice would be deterministic, with alternative 1 always chosen if > :> f. 46
47 REGRESSORS The simplest applications let > f ' 3 f and > ' 3 where are individual characteristics that do not vary with the choice. More generally we let where > ' 3 3 n 3 c 'fc c 3 are characteristics that vary with alternatives and are characteristics that do not vary with the alternatives. 47
48 BRUTE FORCE: LOGIT This yields hd+ ' o'8 EE3 3 f 3 n 3 E f For alternative-invariant regressors only the difference E f canbeidenti ed. 48
49 BRUTE FORCE In the binary case it is suf cient to work with the distribution of 0 f 0. But in more general multinomial models presented later it is necessary to work directly with the distributions of 0 f and 0. We present the algebra here for completeness. 49
50 In general hd+ ' o'hd0 f 0 > > f o 0 n > > f o 'hd0 f ' U 4 U 0 n> > f 4 4 ' U 4 4 se0 q U 0 n> > f 4 se0 f c0 _0 f _0 r se0 f _0 f _0 c where in the last line 0 f and 0 areassumedtobe independent. 50
51 BRUTE FORCE: LOGIT Assume 0 are iid log Weibull (or type I extreme value) distributed with density se0 'e 0 i TEe 0 Then substituting for se0 f yields hd+ ' o' U q 4 U r 0 n> 4 se0 > f 4 e 0 f i TEe 0 f_0 f ' U 4 4 se0 di TEe 0 fo 0 n> > f 4 _0 ' U 4 4 se0 i TEe E0 n> > f _0 _0 51
52 Now substitute for se0 yields hd+ ' o ' U 4 4 e 0 i TEe 0 i TEe E0 n> > f _0 ' U q r 4 4 e 0 i TEe 0 n e E0 n> > f _0 ' U q r 4 4 e 0 i TEe 0 n e 0 e E> > f _0 ' U q r 4 4 e 0 i T e 0 E n e E> > f _0 Since U 4 0 i _0 ' it follows that U 4 4 e 0 i TE@e _0 ' *@. 52
53 Using this result ' ne E> > f yields hd+ ' o ' E n e E> > f e > ' e > f n e > ' e> > f ne > > f This last result implies the logit model hd+ ' o' e 3 ne 3 c where 3 'E3 3 f 3 n 3 E f. 53
54 BRUTE FORCE: PROBIT To instead obtain the probit model assume E0 f c0 are bivariate normal with means zero, variances j 2 f and j2 and covariance j f. Then similar analysis yields hd+ ' o 'hd> n 3 0 :> f n 0 f o 4 E > ' x > f F Ct D j 2 f n j2 2j 2 This implies the probit model hd+ ' o'xe 3 *j c 54
55 where 3 'E3 3 f 3 n 3 E f and j 2 ' j 2 f n j2 2j 2. It is simpler to use the result that E0 f 0 is normally distributed with mean zero and variance j 2. 55
56 CHOICE-BASED SAMPLING Choice-based sampling arises when one choice is over-sampled. This occurs when relatively few people make a choice, so a random sample would pick up too few + '. E.g. Oversampling bus riders by sampling at bus stops. MLE based on the conditional density of ) given j leads to inconsistent estimates. Instead MLE needs to be based on the joint density of E)c j. 56
57 SEMI-PARAMETRIC REGRESSION Here as usual + is Bernoulli with parameter R modelled in the single-index form 8 E 3. But a functional form for 8 is not speci ed. Instead both and 8 are estimated from the data. This is a very active area, and a leading example of semiparametric estimation. 57
58 GROUPED DATA Suppose regressor vector, ' c c?c takes only A distinct values, where A is much smaller then?. Then for each value of the regressors we have multiple observations on +. This type of grouped data is called many observations per cell. It can arise particularly in experimental data. Economics data are rarely of grouped form, unless the regressors are just a few indicator variables. 58
59 Let, ' c c Ac be the A distinct values,? be the number of observations on + for the distinct value of,so S A '? '?, R be the proportion of times + ' when ' R ' hd+ ' m ' o'8e 3 Berkson s minimum chi-square estimatorestimates the transformed model 8 ER ' 3 n by weighted least squares with weights j 2 ' R E R? d8 3 E8 ER o 2 59
60 Then e 0& 1 k SA ' Ej 2 3 SA ' c Ej 2 3 l S A ' Ej 2 8 ER where the asymptotic theory requires? $4. This is simpletoimplement, as it only requires an OLS package. Yetitisfullyef cient, as has same asymptotic distribution as the MLE which would treat each observation separately. 60
61 For the logit model this estimator is especially simple. The transformed model becomes *? R R ' 3 n c which is estimated by weighted least squares with weights j 2 '? R E R The transformed dependent variable is the log-odds ratio. Advantage of min chi-square estimator is computational simplicity. This is moot now. 61
62 AGGREGATE DATA Example: Suppose R equals the unemployment rate in region and equals the average level of schooling in region. One possible model is LS regression of R on 7. Because f R, many studies instead use the logodds ratio to transform to a dependent variable which lies in E4c 4, estimating the model *? R R ' 7 3 n eoojo 62
63 This looks similar to the minimum chi-square estimator, butisnot. Berkson s estimator is appropriate if all regressors in the cell take the same value. Here instead regressors can take different values, as different people in region have different schooling. If apply minimum chi-square the resulting estimator cannot be interpreted as being an estimate of the parameter in an underlying binary choice model, say hd+ ' o'8e 3, asignoring the heterogeneity leads to an inconsistent estimator of. 63
64 To see this suppose + ' if + : f, data are only observed by region, and 1d> c P o in region. Then in region hd+ ' mc % c P o'hd 3 n : fmc c P o 3 n > 3 'h s & j 2 n 3 P : fmc c P # > 3 'x s $ j 2 n 3 P since 3 n 1d> 3 cj2 n 3 P o. 64
65 This suggests estimating the model x E7+ ' 7 3 s j 2 n 3 P Also see Allenby and Berry, Levinsohn and Pakes. 65
66 APPLICATION: LABOR SUPPLY Use data of Mroz (1987) on 753 married women from the 1976 Panel Survey of Income Dynamics (PSID). Dependent variable LFP is an indicator variable that equals 1 if worked in previous year, and 0 if no work. For this sample 428 (or 57%) worked. 66
67 The regressors are a constant term and 1. KL6: Number of children less than six 2. K618: Number of children more than six 3. AGE: Age 4. ED: Education (years of schooling completed) 5. NLINCOME: annual nonlabor income of wife measuredin$10,000 s. 67
68 Variable Coeff t-stat u7 Logit Probit u7 Logit Probit. SeD. 2 e22 ef fb fb gus 2b. eh Hb2 H.D.H gs H f 2 fss fh fh f fb C. f 2 fd. fe es ed ed.( fd2 2DH DS S. S SD uu. SHe eb 2fb ed ee es *? u ede H ede22 68
69 Consider, for example, the effect of education. The logit coef cient is 1.7 times the probit as. fh *fss (vs. rule of thumb multiplier of 1.6). The OLS coef cient is 0.3 times the probit as f fh *f 2 (vs. rule of thumb multiplier of 0.4). All models give similar results regarding statistical signi cance (though the reported OLS standard errors here are the incorrect ones). 69
16. Tobit and Selection A. Colin Cameron Pravin K. Trivedi Copyright 2006
16. Tobit and Selection A. Colin Cameron Pravin K. Trivedi Copyright 2006 These slides were prepared in 1999. They cover material similar to Sections 16.2-16.3 and 16.5 of our subsequent book Microeconometrics:
More information21. Panel: Linear A. Colin Cameron Pravin K. Trivedi Copyright 2006
21. Panel: Linear A. Colin Cameron Pravin K. Trivedi Copyright 2006 These slides were prepared in 1999. They cover material similar to Chapter 21 of our subsequent book Microeconometrics: Methods and Applications,
More informationECON 594: Lecture #6
ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was
More informationh=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio
Multivariate Response Models The response variable is unordered and takes more than two values. The term unordered refers to the fact that response 3 is not more favored than response 2. One choice from
More informationNinth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"
Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric
More informationECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria
ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria SOLUTION TO FINAL EXAM Friday, April 12, 2013. From 9:00-12:00 (3 hours) INSTRUCTIONS:
More information11. Bootstrap Methods
11. Bootstrap Methods c A. Colin Cameron & Pravin K. Trivedi 2006 These transparencies were prepared in 20043. They can be used as an adjunct to Chapter 11 of our subsequent book Microeconometrics: Methods
More informationA Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,
A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type
More informationEconometric Analysis of Cross Section and Panel Data
Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND
More informationChapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a
Chapter 9 Regression with a Binary Dependent Variable Multiple Choice ) The binary dependent variable model is an example of a a. regression model, which has as a regressor, among others, a binary variable.
More informationECONOMETRICS FIELD EXAM Michigan State University May 9, 2008
ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008 Instructions: Answer all four (4) questions. Point totals for each question are given in parenthesis; there are 00 points possible. Within
More informationGeneralized Linear Models for Non-Normal Data
Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture
More informationSTAT5044: Regression and Anova
STAT5044: Regression and Anova Inyoung Kim 1 / 18 Outline 1 Logistic regression for Binary data 2 Poisson regression for Count data 2 / 18 GLM Let Y denote a binary response variable. Each observation
More informationDiscrete Dependent Variable Models
Discrete Dependent Variable Models James J. Heckman University of Chicago This draft, April 10, 2006 Here s the general approach of this lecture: Economic model Decision rule (e.g. utility maximization)
More informationLecture 3.1 Basic Logistic LDA
y Lecture.1 Basic Logistic LDA 0.2.4.6.8 1 Outline Quick Refresher on Ordinary Logistic Regression and Stata Women s employment example Cross-Over Trial LDA Example -100-50 0 50 100 -- Longitudinal Data
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More information1 Outline. Introduction: Econometricians assume data is from a simple random survey. This is never the case.
24. Strati cation c A. Colin Cameron & Pravin K. Trivedi 2006 These transparencies were prepared in 2002. They can be used as an adjunct to Chapter 24 (sections 24.2-24.4 only) of our subsequent book Microeconometrics:
More information4.8 Instrumental Variables
4.8. INSTRUMENTAL VARIABLES 35 4.8 Instrumental Variables A major complication that is emphasized in microeconometrics is the possibility of inconsistent parameter estimation due to endogenous regressors.
More informationModels, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley
Models, Testing, and Correction of Heteroskedasticity James L. Powell Department of Economics University of California, Berkeley Aitken s GLS and Weighted LS The Generalized Classical Regression Model
More informationECON Introductory Econometrics. Lecture 11: Binary dependent variables
ECON4150 - Introductory Econometrics Lecture 11: Binary dependent variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 11 Lecture Outline 2 The linear probability model Nonlinear probability
More informationEconomics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama
Problem Set #1 (Random Data Generation) 1. Generate =500random numbers from both the uniform 1 ( [0 1], uniformbetween zero and one) and exponential exp ( ) (set =2and let [0 1]) distributions. Plot the
More informationMultinomial Discrete Choice Models
hapter 2 Multinomial Discrete hoice Models 2.1 Introduction We present some discrete choice models that are applied to estimate parameters of demand for products that are purchased in discrete quantities.
More information1. The Multivariate Classical Linear Regression Model
Business School, Brunel University MSc. EC550/5509 Modelling Financial Decisions and Markets/Introduction to Quantitative Methods Prof. Menelaos Karanasos (Room SS69, Tel. 08956584) Lecture Notes 5. The
More informationLimited Dependent Variable Models II
Limited Dependent Variable Models II Fall 2008 Environmental Econometrics (GR03) LDV Fall 2008 1 / 15 Models with Multiple Choices The binary response model was dealing with a decision problem with two
More informationEnvironmental Econometrics
Environmental Econometrics Syngjoo Choi Fall 2008 Environmental Econometrics (GR03) Fall 2008 1 / 37 Syllabus I This is an introductory econometrics course which assumes no prior knowledge on econometrics;
More informationLimited Dependent Variables and Panel Data
and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement
More informationA simple alternative to the linear probability model for binary choice models with endogenous regressors
A simple alternative to the linear probability model for binary choice models with endogenous regressors Christopher F Baum, Yingying Dong, Arthur Lewbel, Tao Yang Boston College/DIW Berlin, U.Cal Irvine,
More informationCRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.
CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Linear
More informationControl Function and Related Methods: Nonlinear Models
Control Function and Related Methods: Nonlinear Models Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. General Approach 2. Nonlinear
More informationEconometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit
Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables
More informationModels of Qualitative Binary Response
Models of Qualitative Binary Response Probit and Logit Models October 6, 2015 Dependent Variable as a Binary Outcome Suppose we observe an economic choice that is a binary signal. The focus on the course
More informationComparing IRT with Other Models
Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used
More informationA Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008
A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated
More informationGroup comparisons in logit and probit using predicted probabilities 1
Group comparisons in logit and probit using predicted probabilities 1 J. Scott Long Indiana University May 27, 2009 Abstract The comparison of groups in regression models for binary outcomes is complicated
More informationLecture Notes based on Koop (2003) Bayesian Econometrics
Lecture Notes based on Koop (2003) Bayesian Econometrics A.Colin Cameron University of California - Davis November 15, 2005 1. CH.1: Introduction The concepts below are the essential concepts used throughout
More informationChapter 11. Regression with a Binary Dependent Variable
Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score
More informationNon-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models
Optimum Design for Mixed Effects Non-Linear and generalized Linear Models Cambridge, August 9-12, 2011 Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models
More information2. We care about proportion for categorical variable, but average for numerical one.
Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is
More informationSemiparametric Generalized Linear Models
Semiparametric Generalized Linear Models North American Stata Users Group Meeting Chicago, Illinois Paul Rathouz Department of Health Studies University of Chicago prathouz@uchicago.edu Liping Gao MS Student
More informationEPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7
Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review
More informationPartial effects in fixed effects models
1 Partial effects in fixed effects models J.M.C. Santos Silva School of Economics, University of Surrey Gordon C.R. Kemp Department of Economics, University of Essex 22 nd London Stata Users Group Meeting
More informationPanel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43
Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression
More informationA Course on Advanced Econometrics
A Course on Advanced Econometrics Yongmiao Hong The Ernest S. Liu Professor of Economics & International Studies Cornell University Course Introduction: Modern economies are full of uncertainties and risk.
More informationECE521 week 3: 23/26 January 2017
ECE521 week 3: 23/26 January 2017 Outline Probabilistic interpretation of linear regression - Maximum likelihood estimation (MLE) - Maximum a posteriori (MAP) estimation Bias-variance trade-off Linear
More informationRecent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data
Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)
More informationLecture 11/12. Roy Model, MTE, Structural Estimation
Lecture 11/12. Roy Model, MTE, Structural Estimation Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy model The Roy model is a model of comparative advantage: Potential earnings
More informationLecture-20: Discrete Choice Modeling-I
Lecture-20: Discrete Choice Modeling-I 1 In Today s Class Introduction to discrete choice models General formulation Binary choice models Specification Model estimation Application Case Study 2 Discrete
More informationSyllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools
Syllabus By Joan Llull Microeconometrics. IDEA PhD Program. Fall 2017 Chapter 1: Introduction and a Brief Review of Relevant Tools I. Overview II. Maximum Likelihood A. The Likelihood Principle B. The
More informationIntroduction: structural econometrics. Jean-Marc Robin
Introduction: structural econometrics Jean-Marc Robin Abstract 1. Descriptive vs structural models 2. Correlation is not causality a. Simultaneity b. Heterogeneity c. Selectivity Descriptive models Consider
More informationTruncation and Censoring
Truncation and Censoring Laura Magazzini laura.magazzini@univr.it Laura Magazzini (@univr.it) Truncation and Censoring 1 / 35 Truncation and censoring Truncation: sample data are drawn from a subset of
More informationIntroduction to Econometrics
Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle
More informationIV estimators and forbidden regressions
Economics 8379 Spring 2016 Ben Williams IV estimators and forbidden regressions Preliminary results Consider the triangular model with first stage given by x i2 = γ 1X i1 + γ 2 Z i + ν i and second stage
More informationMC3: Econometric Theory and Methods. Course Notes 4
University College London Department of Economics M.Sc. in Economics MC3: Econometric Theory and Methods Course Notes 4 Notes on maximum likelihood methods Andrew Chesher 25/0/2005 Course Notes 4, Andrew
More informationA time series plot: a variable Y t on the vertical axis is plotted against time on the horizontal axis
TIME AS A REGRESSOR A time series plot: a variable Y t on the vertical axis is plotted against time on the horizontal axis Many economic variables increase or decrease with time A linear trend relationship
More informationApplied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid
Applied Economics Regression with a Binary Dependent Variable Department of Economics Universidad Carlos III de Madrid See Stock and Watson (chapter 11) 1 / 28 Binary Dependent Variables: What is Different?
More informationA Guide to Modern Econometric:
A Guide to Modern Econometric: 4th edition Marno Verbeek Rotterdam School of Management, Erasmus University, Rotterdam B 379887 )WILEY A John Wiley & Sons, Ltd., Publication Contents Preface xiii 1 Introduction
More informationProbabilistic modeling. The slides are closely adapted from Subhransu Maji s slides
Probabilistic modeling The slides are closely adapted from Subhransu Maji s slides Overview So far the models and algorithms you have learned about are relatively disconnected Probabilistic modeling framework
More informationRewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35
Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate
More informationGMM-based inference in the AR(1) panel data model for parameter values where local identi cation fails
GMM-based inference in the AR() panel data model for parameter values where local identi cation fails Edith Madsen entre for Applied Microeconometrics (AM) Department of Economics, University of openhagen,
More informationSingle-level Models for Binary Responses
Single-level Models for Binary Responses Distribution of Binary Data y i response for individual i (i = 1,..., n), coded 0 or 1 Denote by r the number in the sample with y = 1 Mean and variance E(y) =
More informationOrdered Response and Multinomial Logit Estimation
Ordered Response and Multinomial Logit Estimation Quantitative Microeconomics R. Mora Department of Economics Universidad Carlos III de Madrid Outline Introduction 1 Introduction 2 3 Introduction The Ordered
More informationMaximum Likelihood Estimation
Maximum Likelihood Estimation Additional Topics Raphael Cunha Program in Statistics and Methodology PRISM Department of Political Science The Ohio State University cunha.6@osu.edu March 28, 2014 Overview
More informationLecture Notes on Measurement Error
Steve Pischke Spring 2000 Lecture Notes on Measurement Error These notes summarize a variety of simple results on measurement error which I nd useful. They also provide some references where more complete
More informationAppendix A: The time series behavior of employment growth
Unpublished appendices from The Relationship between Firm Size and Firm Growth in the U.S. Manufacturing Sector Bronwyn H. Hall Journal of Industrial Economics 35 (June 987): 583-606. Appendix A: The time
More information7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis
Lecture 6: Logistic Regression Analysis Christopher S. Hollenbeak, PhD Jane R. Schubart, PhD The Outcomes Research Toolbox Review Homework 2 Overview Logistic regression model conceptually Logistic regression
More informationNELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation
NELS 88 Table 2.3 Adjusted odds ratios of eighth-grade students in 988 performing below basic levels of reading and mathematics in 988 and dropping out of school, 988 to 990, by basic demographics Variable
More informationWageningen Summer School in Econometrics. The Bayesian Approach in Theory and Practice
Wageningen Summer School in Econometrics The Bayesian Approach in Theory and Practice September 2008 Slides for Lecture on Qualitative and Limited Dependent Variable Models Gary Koop, University of Strathclyde
More informationJeffrey M. Wooldridge Michigan State University
Fractional Response Models with Endogenous Explanatory Variables and Heterogeneity Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Fractional Probit with Heteroskedasticity 3. Fractional
More informationECLT 5810 Linear Regression and Logistic Regression for Classification. Prof. Wai Lam
ECLT 5810 Linear Regression and Logistic Regression for Classification Prof. Wai Lam Linear Regression Models Least Squares Input vectors is an attribute / feature / predictor (independent variable) The
More informationTesting and Model Selection
Testing and Model Selection This is another digression on general statistics: see PE App C.8.4. The EViews output for least squares, probit and logit includes some statistics relevant to testing hypotheses
More informationBinary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment
BINARY CHOICE MODELS Y ( Y ) ( Y ) 1 with Pr = 1 = P = 0 with Pr = 0 = 1 P Examples: decision-making purchase of durable consumer products unemployment Estimation with OLS? Yi = Xiβ + εi Problems: nonsense
More informationECONOMETFUCS FIELD EXAM Michigan State University May 11, 2007
ECONOMETFUCS FIELD EXAM Michigan State University May 11, 2007 Instructions: Answer all four (4) questions. Point totals for each question are given in parenthesis; there are 100 points possible. Within
More informationBayesian Modeling of Conditional Distributions
Bayesian Modeling of Conditional Distributions John Geweke University of Iowa Indiana University Department of Economics February 27, 2007 Outline Motivation Model description Methods of inference Earnings
More informationApplied Health Economics (for B.Sc.)
Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative
More informationGibbs Sampling in Latent Variable Models #1
Gibbs Sampling in Latent Variable Models #1 Econ 690 Purdue University Outline 1 Data augmentation 2 Probit Model Probit Application A Panel Probit Panel Probit 3 The Tobit Model Example: Female Labor
More informationChapter 6: Endogeneity and Instrumental Variables (IV) estimator
Chapter 6: Endogeneity and Instrumental Variables (IV) estimator Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans December 15, 2013 Christophe Hurlin (University of Orléans)
More informationMax. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes
Maximum Likelihood Estimation Econometrics II Department of Economics Universidad Carlos III de Madrid Máster Universitario en Desarrollo y Crecimiento Económico Outline 1 3 4 General Approaches to Parameter
More informationChapter 1. GMM: Basic Concepts
Chapter 1. GMM: Basic Concepts Contents 1 Motivating Examples 1 1.1 Instrumental variable estimator....................... 1 1.2 Estimating parameters in monetary policy rules.............. 2 1.3 Estimating
More informationIdentification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case
Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College December 2016 Abstract Lewbel (2012) provides an estimator
More informationGibbs Sampling in Endogenous Variables Models
Gibbs Sampling in Endogenous Variables Models Econ 690 Purdue University Outline 1 Motivation 2 Identification Issues 3 Posterior Simulation #1 4 Posterior Simulation #2 Motivation In this lecture we take
More informationA better way to bootstrap pairs
A better way to bootstrap pairs Emmanuel Flachaire GREQAM - Université de la Méditerranée CORE - Université Catholique de Louvain April 999 Abstract In this paper we are interested in heteroskedastic regression
More informationDiscrete Choice Modeling
[Part 6] 1/55 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent Class 11 Mixed Logit 12 Stated Preference
More informationWhat s New in Econometrics? Lecture 14 Quantile Methods
What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression
More informationEconomics 241B Estimation with Instruments
Economics 241B Estimation with Instruments Measurement Error Measurement error is de ned as the error resulting from the measurement of a variable. At some level, every variable is measured with error.
More informationIntroduction to GSEM in Stata
Introduction to GSEM in Stata Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Introduction to GSEM in Stata Boston College, Spring 2016 1 /
More informationApplied Microeconometrics (L5): Panel Data-Basics
Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics
More informationEconometrics II. Seppo Pynnönen. Spring Department of Mathematics and Statistics, University of Vaasa, Finland
Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2018 Part III Limited Dependent Variable Models As of Jan 30, 2017 1 Background 2 Binary Dependent Variable The Linear Probability
More informationWe begin by thinking about population relationships.
Conditional Expectation Function (CEF) We begin by thinking about population relationships. CEF Decomposition Theorem: Given some outcome Y i and some covariates X i there is always a decomposition where
More informationProbability and Estimation. Alan Moses
Probability and Estimation Alan Moses Random variables and probability A random variable is like a variable in algebra (e.g., y=e x ), but where at least part of the variability is taken to be stochastic.
More informationMarginal Specifications and a Gaussian Copula Estimation
Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required
More informationPATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS Parametric Distributions Basic building blocks: Need to determine given Representation: or? Recall Curve Fitting Binary Variables
More informationEconomics 620, Lecture 18: Nonlinear Models
Economics 620, Lecture 18: Nonlinear Models Nicholas M. Kiefer Cornell University Professor N. M. Kiefer (Cornell University) Lecture 18: Nonlinear Models 1 / 18 The basic point is that smooth nonlinear
More informationA Joint Tour-Based Model of Vehicle Type Choice and Tour Length
A Joint Tour-Based Model of Vehicle Type Choice and Tour Length Ram M. Pendyala School of Sustainable Engineering & the Built Environment Arizona State University Tempe, AZ Northwestern University, Evanston,
More informationCENSORED DATA AND CENSORED NORMAL REGRESSION
CENSORED DATA AND CENSORED NORMAL REGRESSION Data censoring comes in many forms: binary censoring, interval censoring, and top coding are the most common. They all start with an underlying linear model
More informationLOGISTIC REGRESSION Joseph M. Hilbe
LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of
More informationSimultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser
Simultaneous Equations with Error Components Mike Bronner Marko Ledic Anja Breitwieser PRESENTATION OUTLINE Part I: - Simultaneous equation models: overview - Empirical example Part II: - Hausman and Taylor
More informationmultilevel modeling: concepts, applications and interpretations
multilevel modeling: concepts, applications and interpretations lynne c. messer 27 october 2010 warning social and reproductive / perinatal epidemiologist concepts why context matters multilevel models
More informationSigmaplot di Systat Software
Sigmaplot di Systat Software SigmaPlot Has Extensive Statistical Analysis Features SigmaPlot is now bundled with SigmaStat as an easy-to-use package for complete graphing and data analysis. The statistical
More informationNew Developments in Econometrics Lecture 16: Quantile Estimation
New Developments in Econometrics Lecture 16: Quantile Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. Review of Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile
More informationNotes on Mathematical Expectations and Classes of Distributions Introduction to Econometric Theory Econ. 770
Notes on Mathematical Expectations and Classes of Distributions Introduction to Econometric Theory Econ. 77 Jonathan B. Hill Dept. of Economics University of North Carolina - Chapel Hill October 4, 2 MATHEMATICAL
More information