14. Binary Outcomes. A. Colin Cameron Pravin K. Trivedi Copyright 2006

Size: px
Start display at page:

Download "14. Binary Outcomes. A. Colin Cameron Pravin K. Trivedi Copyright 2006"

Transcription

1 14. Binary Outcomes A. Colin Cameron Pravin K. Trivedi Copyright 2006 These slides were prepared in They cover material similar to Sections of our subsequent book Microeconometrics: Methods and Applications, Cambridge University Press, 2005.

2 INTRODUCTION Discrete choice or qualitative response models are for + that takes only a nite number of discrete values. Here we consider binary outcome models where only two values are taken, f and Particularly logit and probit models. These are relatively straightforward nonlinear models. Later consider complications such as discrete variable endogeneity, panel, etc. Reason for this course. 2

3 OUTLINE General Results Probit, logit, LPM and OLS models. Latent variable formulations, especially random utility model. Choice-based samples. Semi-parametric estimation. Grouped or aggregate data. 3

4 STATISTICAL MOTIVATION The coin toss example of introductory statistics. Let R denote the probability of a head E+ ' on one coin toss. Then Prd+ ' o'r and Prd+ 'fo' R. For? tosses + is the of? independent realizations of head or tail. The MLE for R is the sample mean +, i.e. the proportion of tosses that are heads. 4

5 BERNOULLI DISTRIBUTION Prd+ ' o'r and Prd+ 'fo' R Compact expression for density se+ 'R + E R + This is Bernoulli density which is the binomial with one trial per observation. Moments Ed+o ' R nf E R 'R Vd+o 'E R R nef R E R 'RE R. Note that R can be interpreted as Ed+o or as Prd+ ' o 5

6 ECONOMIC EXAMPLES Leading economic applications are labor supply: + ' if work and + 'fotherwise transportation mode choice: + ' if commute to work by public transit and + 'fotherwise. Independent trials may be reasonable. Assuming a constant probability R for each trial is not. It will vary with an individual s characteristics. So generalize to let R be a function of regressors. 6

7 BINARY OUTCOME MODELS Regression model formed by parameterizing R to depend on regressors and parameters. Usually specify single-index model (d+ m o'r ' 8 E 3 Usually chose 8 E to be a cumulative distribution function (cdf). Then f 8 E, f R. logistic cdf gives logit model. standard normal cdf gives probit model. 7

8 MLE Density se+ 'R + E R + c R ' 8 E 3, *? se+ '+ *? 8 E 3 ne + *?E 8 E 3 Log-likelihood function is OE ' [? ' + *? 8 E 3 ne + *?E 8 E 3 Let 8 3 E5 'Y8E5*Y5. MLE solves [? ' +, + 8 E E 3 n + 8 E E 3 ' f 8

9 DISTRIBUTION OF MLE The MLE f.o.c. simplify to [? + 8 E 3 ' 8 E 3 E 8 E E 3 ' f General ML result if density correctly speci ed e 0/ 1 % f c (dy 2 O*YY 3 o f For binary outcome MLE e 0/ 1 7 f c # [? ' $ 6 8 E 3 fe 8 E 3 f 8 3 E 3 f

10 MISSPECIFIED MODEL For binary data the dgp density is always Bernoulli as hd+ ' o'r, hd+ 'fo' hd+ ' o' 8E 3 Therefore only possible misspeci cation of dgp is if R 9' 8 E 3. Clearly inconsistent estimator if R 9' 8 E 3 as then (d+ 8 E 3 o 9' f leading to left-hand side of f.o.c. not having expected value f. 10

11 WEIGHTED NLS INTERPRETATION Since (d+m o '8 E 3 9d+m o '8 E 3 E 8 E 3 Y(d+m o*yq ' 8 3 E 3 the MLE rst-order conditions imply [? + (d+ m oy(d+ m o ' 9d+ m o Y ' f Residuals are orthogonal to regressors upon weighting to adjust for heteroskedasticity. i.e. nonlinear WLS. 11

12 Similarly e f c # [? ' 9d+ m o is of nonlinear WLS form. Y(d+ m o Yq Y(d+ m o Yq 3 f $ 6 These properties are more generally those of the quasi-mle where the speci ed density is in the linear exponential family. 8 12

13 LOGIT MODEL The logit model speci es R '\E 3 ' e 3 ne 3 c \E5 'e 5 *E n e 5 ' *E n e 5 is the logistic cdf. The derivative \ 3 E5 ' \E5E \E5 is the logistic density. For this reason also called logistic regression model. 13

14 The logit ML conditions simplify to [? ' E+ \E 3 ' f The logit MLE has [? e /RJLW 1 f c ' \E 3 fe \E 3 f 3 14

15 PROBIT MODEL The probit model speci es R 'xe 3 xe5 ' U 5 4 Er_r is the standard normal. The derivative x 3 E5 'E5 'E * s 2Zi TE5 2 *2 is the standard normal density function. There is no simpli cation to the f.o.c., unlike logit case. 15

16 The probit MLE has distribution e 3URELW 1 7 f c # [? ' E 3 f 2 $ 6 xe 3 fe xe 3 f

17 LINEAR PROBABILITY MODEL (LPM) The LPM speci es R ' 3 The LPM MLE f.o.c. conditions are [? ' E 3 ' fc The LPM MLE has distribution e /30 1 7c # [? ' $ 6 3 fe 3 f

18 The LPM model has the obvious weakness of permitting probabilities outside the (0, 1) interval. Furthermore, the estimator can be numerically unstable if 3 close to f or 18

19 OLS The LPM is better estimated by OLS, which also speci es (d+ m o' 3. Allow for the intrinsic heteroskedasticity of binary data e 1 k f c Ej 3 j j 3 ljej 3 j l where for l use el ' 'LDJdE+ 3 e 2 o or el ' 'LDJd 3 e E 3 e o 19

20 COEFFICIENT INTERPRETATION Coef cients in different models are not directly comparable due to different scaling. Instead compare across models effect of a one unit change in regressors on Pd+ ' m o 'Ed+ ' m o. Now (d+m o '8 E 3, Y(d+m o*y ' 8 3 E 3 where 8 3 E5 'Y8E5*Y5. Thus the effect depends on the functional form of 8 and the evaluation point, in addition to parameter. 20

21 For Y(d+m o*y ' 8 3 E 3 consider 8 3 E 3. Logit: 8 3 E5 f2d since 8 3 E5 '\E5E \E5 is maximized for f \E5 when \E5 'fd. Probit model 8 3 E5 fef since 8 3 E5 'E5which is at maximum of * s 2Z * fe at 5 'f. LPM or OLS: 8 3 E5 '. This suggests for slope parameters theruleofthumb e /RJLW * e e 2/6 e 3URELW * 2D e 2/6 e /RJLW * S e 3URELW 21

22 Amemiya (1981, p.1488) demonstrates this works quite well, for f 8 E 3 fb. 22

23 ODDS RATIO FOR LOGIT For the logit model R ' i TE 3 *E n i TE 3, R R ' i TE 3, *? R R ' 3 R*E R is the odds ratio which measures the probability that + ' relative to the probability that + 'f. E.g. Pharmaceutical drug study where + ' denotes survival and + 'fdenotes death. An odds ratio of 2 means that the odds of survival are twice those of death. 23

24 Statistical analyses and packages use R*E R ' i TE 3. Suppose the regressor increases by one unit. Then 3 increases to 3 nq. And i TE 3 increases to i TE 3 nq ' i TE 3 i T Eq. Thus the odds ratio has increased by a multiple i T Eq. E.g. a logit slope parameter of f means that a one unit change in the regressor increases the odds ratio by a multiple i TEf * f fd. The relative probability of survival has increased by fd percent. This interpretation widely used in applied biostatistics. 24

25 For economists it is more natural to interpret q as a semi-elasticity for the odds ratio, since *? R*E R ' 3. Then a logit slope parameter of f means that a one unit change in the regressor increases the odds ratio by a multiple f. This coincides exactly with the interpretation used in statistics for very small of q, since then i TEq 'q. 25

26 WHICH BINARY CHOICE MODEL? Which model logit, probit or linear probability? Theoretically it depends on the data generating process (dgp). Unlike other applications of ML there is no problem in specifying the distribution the only possible distribution for a Efc variable is the Bernoulli. The problem lies in specifying a functional form for the parameter of this distribution. If the dgp has R '\E 3 f then a logit model should 26

27 be used, and estimators based on other models such as probit are potentially inconsistent. Similar conclusions hold if instead for the dgp has R 'xe 3 f or R ' 3 f. 27

28 Aside: Misspeci cation consequences, however, are not as large as this. As long as in the true model the probability or mean is of the single-index form R ' 8 E 3 f, then choosing the wrong function 8 effects all slope parameters equally, and the ratio of slope parameters is constant across the models. See Ruud (1986, 1993). 28

29 WHY LOGIT MODEL? Logit model is the binary model used by statisticians: F.o.c. and asymptotic distribution are relatively simple. Logit model corresponds ises the canonical link function for the binomial, a generalized linear model. Coef cients can be interpreted in terms of the logodds ratio. Easy generalization to multinomial logit. A discriminant analysis interpretation can be given. 29

30 DISCRIMINANT ANALYSIS Aside: In discriminant analysis both + and are random variables is observed but + is not observed given only we determine whether + equals0or1. E.g. Classify what type of humanoid (+ 'for )askull belongs to given various dimensions ( ) of the skull. If m+ is multivariate normal distributed, the posterior probability of +m is similar to logit model probability. 30

31 WHY PROBIT MODEL? The probit model is often used by economists. It is motivated by a latent normal random variable. So ties in with tobit models and multinomial probit. Empirically, either logit and probit can be used little difference between results from probit and logit analysis, once rescale parameter estimates. Greatest difference is in prediction of probabilities close to f or. 31

32 WHY OLS? The LPM should not be used as probabilities outside the (0, 1) interval and be numerically unstable. Nonetheless OLS can be useful for preliminary data analysis. In practice standard errors of slope coef cients are often quite similar across logit, probit and OLS (even using the incorrect r 2 Ej 3 j in the case of OLS). Final results should, however, use probit or logit. 32

33 DETERMINING MODEL ADEQUACY Several measures of model adequacy have been proposed. Many are very speci c to binary outcome models. There is no single best measure. See Amemiya (1981) and Maddala (1983). Approaches: R-squared measures. Compare e+ with + Compare predicted e 3Ud+ ' owith actual 3Ud+ ' o. 33

34 McFADDEN S R-SQUARED There are many --squareds for binary models as - 2 in linear model has many interpretations. McFadden proposed two. We favor McFadden (1974) where - 2 ' O s O f c O s ' log-likelihood in the tted model O f is the log-likelihood in the intercept-only model. This - 2 should be only used for discrete choice models. 34

35 In other nonlinear models instead use - 2 ' EO 4@ O s *EO 4@ O f c where O 4@ is the maximum possible value of the log-likelihood. For binary outcome models O 4@ 'f. For some other models O 4@ can be unbounded restricting use of this. 35

36 PREDICTION THAT ) ' Many measures compare predicted e+ with +. The problem is in de ning a rule for when e+ '. Obvious is e+ ' when er ' 8 E 3 e : fd. But this can e.g. yield e+ 'fall the time if most of the sample has + 'f The receiver operating characteristics (ROC) curve considers what happens as the cuttoff e+ :Svaries. 36

37 PREDICTION OF Pr[) ' o Can compare predicted e 3U[) ' o with Pr[) ' o. But testing whether on on average the predicted probabilities equal the sample frequencies is not helpful over the entire sample, since for the logit model with an intercept the f.o.c. imply S? ' + \E 3 e 'f,sothat S?' er '7+. Useful for subsamples. 37

38 LATENT VARIABLE APPROACHES Let + be an unobserved underlying continuous latent variable. When + crosses a threshold + ' is generated. Theobservedbinaryoutcome+ is still Bernoulli. The latent variable + determines the functional form of the parameter of this Bernoulli distribution. Two latent variable approaches: index function random utility. 38

39 INDEX FUNCTION Interest is in explaining the underlying unobserved continuous random variable +. The natural regression model for + is the index function model + ' 3 n This cannot be estimated as + is not observed. Instead we observe + if + : f + ' f if + f The choice of 0 as the threshold is a normalization. If 39

40 instead + :Sis used as the threshold, exactly the same slope parameter estimates are obtained if includes a constant term. Examples: + is a person s tendency to work and we observe only whether or not the person works (+ ' ) + is a person s propensity to commute by public transit and we observe only whether or not the public transit is used (+ ' ). 40

41 INDEX FUNCTION... Then hd+ ' o 'hd+ : fo 'hd 3 n :fo 'hd 3 o ' 8 E 3 c where 8 is the cdf of (which equals the cdf of in the usual case of density symmetric about f in which case 8 E ' 8 E.) The index function therefore gives a way to interpret the function 8. 41

42 The probit model arises from standard normal errors in the index model. The logit model arises from logistic errors in the index model. Index function approach Gives a way to interpret the parameter as the change in + when changes by one unit, but this is of limited use as + is essentially unitless. Cangeneralize to ordered models, multivariate models and limited dependent variable models. 42

43 RANDOM UTILITY MODELS In the random utility formulation a consumer selects the choice with highest utility. The discrete variable + takesvalue if choice has higher utility takesvaluefif choice f has higher utility. 43

44 The random utility model speci es the utilities of alternatives f and to be L f ' > f n 0 f L ' > n 0 where > f and > are deterministic components of utility, whose dependence on regressors is detailed below. 0 f and 0 are random components of utility. 44

45 The alternative with highest utility is chosen. So the observed choice is hd+ ' o 'hdl:l f o 'hd> n 0 :> f n 0 f o 'hd0 f 0 > > f o ' 8 E> > f c where 8 is the cdf of E0 f 0. Different distributions of 0 f and 0 give different discrete choice models. 45

46 Binary probit arises if 0 f and 0 are normal, as is readily seen by noting that then E0 f 0 is normally distributed, upon normalization of the variance of E0 f 0 to unity. Binary logit model arises if 0 f and 0 are type I extreme value distributed, de ned soon, as then the difference E0 f 0 can be shown to be logistic distributed. The random component 0 in utility model is needed. Otherwise, choice would be deterministic, with alternative 1 always chosen if > :> f. 46

47 REGRESSORS The simplest applications let > f ' 3 f and > ' 3 where are individual characteristics that do not vary with the choice. More generally we let where > ' 3 3 n 3 c 'fc c 3 are characteristics that vary with alternatives and are characteristics that do not vary with the alternatives. 47

48 BRUTE FORCE: LOGIT This yields hd+ ' o'8 EE3 3 f 3 n 3 E f For alternative-invariant regressors only the difference E f canbeidenti ed. 48

49 BRUTE FORCE In the binary case it is suf cient to work with the distribution of 0 f 0. But in more general multinomial models presented later it is necessary to work directly with the distributions of 0 f and 0. We present the algebra here for completeness. 49

50 In general hd+ ' o'hd0 f 0 > > f o 0 n > > f o 'hd0 f ' U 4 U 0 n> > f 4 4 ' U 4 4 se0 q U 0 n> > f 4 se0 f c0 _0 f _0 r se0 f _0 f _0 c where in the last line 0 f and 0 areassumedtobe independent. 50

51 BRUTE FORCE: LOGIT Assume 0 are iid log Weibull (or type I extreme value) distributed with density se0 'e 0 i TEe 0 Then substituting for se0 f yields hd+ ' o' U q 4 U r 0 n> 4 se0 > f 4 e 0 f i TEe 0 f_0 f ' U 4 4 se0 di TEe 0 fo 0 n> > f 4 _0 ' U 4 4 se0 i TEe E0 n> > f _0 _0 51

52 Now substitute for se0 yields hd+ ' o ' U 4 4 e 0 i TEe 0 i TEe E0 n> > f _0 ' U q r 4 4 e 0 i TEe 0 n e E0 n> > f _0 ' U q r 4 4 e 0 i TEe 0 n e 0 e E> > f _0 ' U q r 4 4 e 0 i T e 0 E n e E> > f _0 Since U 4 0 i _0 ' it follows that U 4 4 e 0 i TE@e _0 ' *@. 52

53 Using this result ' ne E> > f yields hd+ ' o ' E n e E> > f e > ' e > f n e > ' e> > f ne > > f This last result implies the logit model hd+ ' o' e 3 ne 3 c where 3 'E3 3 f 3 n 3 E f. 53

54 BRUTE FORCE: PROBIT To instead obtain the probit model assume E0 f c0 are bivariate normal with means zero, variances j 2 f and j2 and covariance j f. Then similar analysis yields hd+ ' o 'hd> n 3 0 :> f n 0 f o 4 E > ' x > f F Ct D j 2 f n j2 2j 2 This implies the probit model hd+ ' o'xe 3 *j c 54

55 where 3 'E3 3 f 3 n 3 E f and j 2 ' j 2 f n j2 2j 2. It is simpler to use the result that E0 f 0 is normally distributed with mean zero and variance j 2. 55

56 CHOICE-BASED SAMPLING Choice-based sampling arises when one choice is over-sampled. This occurs when relatively few people make a choice, so a random sample would pick up too few + '. E.g. Oversampling bus riders by sampling at bus stops. MLE based on the conditional density of ) given j leads to inconsistent estimates. Instead MLE needs to be based on the joint density of E)c j. 56

57 SEMI-PARAMETRIC REGRESSION Here as usual + is Bernoulli with parameter R modelled in the single-index form 8 E 3. But a functional form for 8 is not speci ed. Instead both and 8 are estimated from the data. This is a very active area, and a leading example of semiparametric estimation. 57

58 GROUPED DATA Suppose regressor vector, ' c c?c takes only A distinct values, where A is much smaller then?. Then for each value of the regressors we have multiple observations on +. This type of grouped data is called many observations per cell. It can arise particularly in experimental data. Economics data are rarely of grouped form, unless the regressors are just a few indicator variables. 58

59 Let, ' c c Ac be the A distinct values,? be the number of observations on + for the distinct value of,so S A '? '?, R be the proportion of times + ' when ' R ' hd+ ' m ' o'8e 3 Berkson s minimum chi-square estimatorestimates the transformed model 8 ER ' 3 n by weighted least squares with weights j 2 ' R E R? d8 3 E8 ER o 2 59

60 Then e 0& 1 k SA ' Ej 2 3 SA ' c Ej 2 3 l S A ' Ej 2 8 ER where the asymptotic theory requires? $4. This is simpletoimplement, as it only requires an OLS package. Yetitisfullyef cient, as has same asymptotic distribution as the MLE which would treat each observation separately. 60

61 For the logit model this estimator is especially simple. The transformed model becomes *? R R ' 3 n c which is estimated by weighted least squares with weights j 2 '? R E R The transformed dependent variable is the log-odds ratio. Advantage of min chi-square estimator is computational simplicity. This is moot now. 61

62 AGGREGATE DATA Example: Suppose R equals the unemployment rate in region and equals the average level of schooling in region. One possible model is LS regression of R on 7. Because f R, many studies instead use the logodds ratio to transform to a dependent variable which lies in E4c 4, estimating the model *? R R ' 7 3 n eoojo 62

63 This looks similar to the minimum chi-square estimator, butisnot. Berkson s estimator is appropriate if all regressors in the cell take the same value. Here instead regressors can take different values, as different people in region have different schooling. If apply minimum chi-square the resulting estimator cannot be interpreted as being an estimate of the parameter in an underlying binary choice model, say hd+ ' o'8e 3, asignoring the heterogeneity leads to an inconsistent estimator of. 63

64 To see this suppose + ' if + : f, data are only observed by region, and 1d> c P o in region. Then in region hd+ ' mc % c P o'hd 3 n : fmc c P o 3 n > 3 'h s & j 2 n 3 P : fmc c P # > 3 'x s $ j 2 n 3 P since 3 n 1d> 3 cj2 n 3 P o. 64

65 This suggests estimating the model x E7+ ' 7 3 s j 2 n 3 P Also see Allenby and Berry, Levinsohn and Pakes. 65

66 APPLICATION: LABOR SUPPLY Use data of Mroz (1987) on 753 married women from the 1976 Panel Survey of Income Dynamics (PSID). Dependent variable LFP is an indicator variable that equals 1 if worked in previous year, and 0 if no work. For this sample 428 (or 57%) worked. 66

67 The regressors are a constant term and 1. KL6: Number of children less than six 2. K618: Number of children more than six 3. AGE: Age 4. ED: Education (years of schooling completed) 5. NLINCOME: annual nonlabor income of wife measuredin$10,000 s. 67

68 Variable Coeff t-stat u7 Logit Probit u7 Logit Probit. SeD. 2 e22 ef fb fb gus 2b. eh Hb2 H.D.H gs H f 2 fss fh fh f fb C. f 2 fd. fe es ed ed.( fd2 2DH DS S. S SD uu. SHe eb 2fb ed ee es *? u ede H ede22 68

69 Consider, for example, the effect of education. The logit coef cient is 1.7 times the probit as. fh *fss (vs. rule of thumb multiplier of 1.6). The OLS coef cient is 0.3 times the probit as f fh *f 2 (vs. rule of thumb multiplier of 0.4). All models give similar results regarding statistical signi cance (though the reported OLS standard errors here are the incorrect ones). 69

16. Tobit and Selection A. Colin Cameron Pravin K. Trivedi Copyright 2006

16. Tobit and Selection A. Colin Cameron Pravin K. Trivedi Copyright 2006 16. Tobit and Selection A. Colin Cameron Pravin K. Trivedi Copyright 2006 These slides were prepared in 1999. They cover material similar to Sections 16.2-16.3 and 16.5 of our subsequent book Microeconometrics:

More information

21. Panel: Linear A. Colin Cameron Pravin K. Trivedi Copyright 2006

21. Panel: Linear A. Colin Cameron Pravin K. Trivedi Copyright 2006 21. Panel: Linear A. Colin Cameron Pravin K. Trivedi Copyright 2006 These slides were prepared in 1999. They cover material similar to Chapter 21 of our subsequent book Microeconometrics: Methods and Applications,

More information

ECON 594: Lecture #6

ECON 594: Lecture #6 ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was

More information

h=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio

h=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio Multivariate Response Models The response variable is unordered and takes more than two values. The term unordered refers to the fact that response 3 is not more favored than response 2. One choice from

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria SOLUTION TO FINAL EXAM Friday, April 12, 2013. From 9:00-12:00 (3 hours) INSTRUCTIONS:

More information

11. Bootstrap Methods

11. Bootstrap Methods 11. Bootstrap Methods c A. Colin Cameron & Pravin K. Trivedi 2006 These transparencies were prepared in 20043. They can be used as an adjunct to Chapter 11 of our subsequent book Microeconometrics: Methods

More information

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i, A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

Chapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a

Chapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a Chapter 9 Regression with a Binary Dependent Variable Multiple Choice ) The binary dependent variable model is an example of a a. regression model, which has as a regressor, among others, a binary variable.

More information

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008 ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008 Instructions: Answer all four (4) questions. Point totals for each question are given in parenthesis; there are 00 points possible. Within

More information

Generalized Linear Models for Non-Normal Data

Generalized Linear Models for Non-Normal Data Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture

More information

STAT5044: Regression and Anova

STAT5044: Regression and Anova STAT5044: Regression and Anova Inyoung Kim 1 / 18 Outline 1 Logistic regression for Binary data 2 Poisson regression for Count data 2 / 18 GLM Let Y denote a binary response variable. Each observation

More information

Discrete Dependent Variable Models

Discrete Dependent Variable Models Discrete Dependent Variable Models James J. Heckman University of Chicago This draft, April 10, 2006 Here s the general approach of this lecture: Economic model Decision rule (e.g. utility maximization)

More information

Lecture 3.1 Basic Logistic LDA

Lecture 3.1 Basic Logistic LDA y Lecture.1 Basic Logistic LDA 0.2.4.6.8 1 Outline Quick Refresher on Ordinary Logistic Regression and Stata Women s employment example Cross-Over Trial LDA Example -100-50 0 50 100 -- Longitudinal Data

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

1 Outline. Introduction: Econometricians assume data is from a simple random survey. This is never the case.

1 Outline. Introduction: Econometricians assume data is from a simple random survey. This is never the case. 24. Strati cation c A. Colin Cameron & Pravin K. Trivedi 2006 These transparencies were prepared in 2002. They can be used as an adjunct to Chapter 24 (sections 24.2-24.4 only) of our subsequent book Microeconometrics:

More information

4.8 Instrumental Variables

4.8 Instrumental Variables 4.8. INSTRUMENTAL VARIABLES 35 4.8 Instrumental Variables A major complication that is emphasized in microeconometrics is the possibility of inconsistent parameter estimation due to endogenous regressors.

More information

Models, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley

Models, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley Models, Testing, and Correction of Heteroskedasticity James L. Powell Department of Economics University of California, Berkeley Aitken s GLS and Weighted LS The Generalized Classical Regression Model

More information

ECON Introductory Econometrics. Lecture 11: Binary dependent variables

ECON Introductory Econometrics. Lecture 11: Binary dependent variables ECON4150 - Introductory Econometrics Lecture 11: Binary dependent variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 11 Lecture Outline 2 The linear probability model Nonlinear probability

More information

Economics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama

Economics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama Problem Set #1 (Random Data Generation) 1. Generate =500random numbers from both the uniform 1 ( [0 1], uniformbetween zero and one) and exponential exp ( ) (set =2and let [0 1]) distributions. Plot the

More information

Multinomial Discrete Choice Models

Multinomial Discrete Choice Models hapter 2 Multinomial Discrete hoice Models 2.1 Introduction We present some discrete choice models that are applied to estimate parameters of demand for products that are purchased in discrete quantities.

More information

1. The Multivariate Classical Linear Regression Model

1. The Multivariate Classical Linear Regression Model Business School, Brunel University MSc. EC550/5509 Modelling Financial Decisions and Markets/Introduction to Quantitative Methods Prof. Menelaos Karanasos (Room SS69, Tel. 08956584) Lecture Notes 5. The

More information

Limited Dependent Variable Models II

Limited Dependent Variable Models II Limited Dependent Variable Models II Fall 2008 Environmental Econometrics (GR03) LDV Fall 2008 1 / 15 Models with Multiple Choices The binary response model was dealing with a decision problem with two

More information

Environmental Econometrics

Environmental Econometrics Environmental Econometrics Syngjoo Choi Fall 2008 Environmental Econometrics (GR03) Fall 2008 1 / 37 Syllabus I This is an introductory econometrics course which assumes no prior knowledge on econometrics;

More information

Limited Dependent Variables and Panel Data

Limited Dependent Variables and Panel Data and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement

More information

A simple alternative to the linear probability model for binary choice models with endogenous regressors

A simple alternative to the linear probability model for binary choice models with endogenous regressors A simple alternative to the linear probability model for binary choice models with endogenous regressors Christopher F Baum, Yingying Dong, Arthur Lewbel, Tao Yang Boston College/DIW Berlin, U.Cal Irvine,

More information

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Linear

More information

Control Function and Related Methods: Nonlinear Models

Control Function and Related Methods: Nonlinear Models Control Function and Related Methods: Nonlinear Models Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. General Approach 2. Nonlinear

More information

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables

More information

Models of Qualitative Binary Response

Models of Qualitative Binary Response Models of Qualitative Binary Response Probit and Logit Models October 6, 2015 Dependent Variable as a Binary Outcome Suppose we observe an economic choice that is a binary signal. The focus on the course

More information

Comparing IRT with Other Models

Comparing IRT with Other Models Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used

More information

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated

More information

Group comparisons in logit and probit using predicted probabilities 1

Group comparisons in logit and probit using predicted probabilities 1 Group comparisons in logit and probit using predicted probabilities 1 J. Scott Long Indiana University May 27, 2009 Abstract The comparison of groups in regression models for binary outcomes is complicated

More information

Lecture Notes based on Koop (2003) Bayesian Econometrics

Lecture Notes based on Koop (2003) Bayesian Econometrics Lecture Notes based on Koop (2003) Bayesian Econometrics A.Colin Cameron University of California - Davis November 15, 2005 1. CH.1: Introduction The concepts below are the essential concepts used throughout

More information

Chapter 11. Regression with a Binary Dependent Variable

Chapter 11. Regression with a Binary Dependent Variable Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score

More information

Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models

Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models Optimum Design for Mixed Effects Non-Linear and generalized Linear Models Cambridge, August 9-12, 2011 Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models

More information

2. We care about proportion for categorical variable, but average for numerical one.

2. We care about proportion for categorical variable, but average for numerical one. Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is

More information

Semiparametric Generalized Linear Models

Semiparametric Generalized Linear Models Semiparametric Generalized Linear Models North American Stata Users Group Meeting Chicago, Illinois Paul Rathouz Department of Health Studies University of Chicago prathouz@uchicago.edu Liping Gao MS Student

More information

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review

More information

Partial effects in fixed effects models

Partial effects in fixed effects models 1 Partial effects in fixed effects models J.M.C. Santos Silva School of Economics, University of Surrey Gordon C.R. Kemp Department of Economics, University of Essex 22 nd London Stata Users Group Meeting

More information

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43 Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression

More information

A Course on Advanced Econometrics

A Course on Advanced Econometrics A Course on Advanced Econometrics Yongmiao Hong The Ernest S. Liu Professor of Economics & International Studies Cornell University Course Introduction: Modern economies are full of uncertainties and risk.

More information

ECE521 week 3: 23/26 January 2017

ECE521 week 3: 23/26 January 2017 ECE521 week 3: 23/26 January 2017 Outline Probabilistic interpretation of linear regression - Maximum likelihood estimation (MLE) - Maximum a posteriori (MAP) estimation Bias-variance trade-off Linear

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Lecture 11/12. Roy Model, MTE, Structural Estimation

Lecture 11/12. Roy Model, MTE, Structural Estimation Lecture 11/12. Roy Model, MTE, Structural Estimation Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy model The Roy model is a model of comparative advantage: Potential earnings

More information

Lecture-20: Discrete Choice Modeling-I

Lecture-20: Discrete Choice Modeling-I Lecture-20: Discrete Choice Modeling-I 1 In Today s Class Introduction to discrete choice models General formulation Binary choice models Specification Model estimation Application Case Study 2 Discrete

More information

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools Syllabus By Joan Llull Microeconometrics. IDEA PhD Program. Fall 2017 Chapter 1: Introduction and a Brief Review of Relevant Tools I. Overview II. Maximum Likelihood A. The Likelihood Principle B. The

More information

Introduction: structural econometrics. Jean-Marc Robin

Introduction: structural econometrics. Jean-Marc Robin Introduction: structural econometrics Jean-Marc Robin Abstract 1. Descriptive vs structural models 2. Correlation is not causality a. Simultaneity b. Heterogeneity c. Selectivity Descriptive models Consider

More information

Truncation and Censoring

Truncation and Censoring Truncation and Censoring Laura Magazzini laura.magazzini@univr.it Laura Magazzini (@univr.it) Truncation and Censoring 1 / 35 Truncation and censoring Truncation: sample data are drawn from a subset of

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle

More information

IV estimators and forbidden regressions

IV estimators and forbidden regressions Economics 8379 Spring 2016 Ben Williams IV estimators and forbidden regressions Preliminary results Consider the triangular model with first stage given by x i2 = γ 1X i1 + γ 2 Z i + ν i and second stage

More information

MC3: Econometric Theory and Methods. Course Notes 4

MC3: Econometric Theory and Methods. Course Notes 4 University College London Department of Economics M.Sc. in Economics MC3: Econometric Theory and Methods Course Notes 4 Notes on maximum likelihood methods Andrew Chesher 25/0/2005 Course Notes 4, Andrew

More information

A time series plot: a variable Y t on the vertical axis is plotted against time on the horizontal axis

A time series plot: a variable Y t on the vertical axis is plotted against time on the horizontal axis TIME AS A REGRESSOR A time series plot: a variable Y t on the vertical axis is plotted against time on the horizontal axis Many economic variables increase or decrease with time A linear trend relationship

More information

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid Applied Economics Regression with a Binary Dependent Variable Department of Economics Universidad Carlos III de Madrid See Stock and Watson (chapter 11) 1 / 28 Binary Dependent Variables: What is Different?

More information

A Guide to Modern Econometric:

A Guide to Modern Econometric: A Guide to Modern Econometric: 4th edition Marno Verbeek Rotterdam School of Management, Erasmus University, Rotterdam B 379887 )WILEY A John Wiley & Sons, Ltd., Publication Contents Preface xiii 1 Introduction

More information

Probabilistic modeling. The slides are closely adapted from Subhransu Maji s slides

Probabilistic modeling. The slides are closely adapted from Subhransu Maji s slides Probabilistic modeling The slides are closely adapted from Subhransu Maji s slides Overview So far the models and algorithms you have learned about are relatively disconnected Probabilistic modeling framework

More information

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35 Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate

More information

GMM-based inference in the AR(1) panel data model for parameter values where local identi cation fails

GMM-based inference in the AR(1) panel data model for parameter values where local identi cation fails GMM-based inference in the AR() panel data model for parameter values where local identi cation fails Edith Madsen entre for Applied Microeconometrics (AM) Department of Economics, University of openhagen,

More information

Single-level Models for Binary Responses

Single-level Models for Binary Responses Single-level Models for Binary Responses Distribution of Binary Data y i response for individual i (i = 1,..., n), coded 0 or 1 Denote by r the number in the sample with y = 1 Mean and variance E(y) =

More information

Ordered Response and Multinomial Logit Estimation

Ordered Response and Multinomial Logit Estimation Ordered Response and Multinomial Logit Estimation Quantitative Microeconomics R. Mora Department of Economics Universidad Carlos III de Madrid Outline Introduction 1 Introduction 2 3 Introduction The Ordered

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Maximum Likelihood Estimation Additional Topics Raphael Cunha Program in Statistics and Methodology PRISM Department of Political Science The Ohio State University cunha.6@osu.edu March 28, 2014 Overview

More information

Lecture Notes on Measurement Error

Lecture Notes on Measurement Error Steve Pischke Spring 2000 Lecture Notes on Measurement Error These notes summarize a variety of simple results on measurement error which I nd useful. They also provide some references where more complete

More information

Appendix A: The time series behavior of employment growth

Appendix A: The time series behavior of employment growth Unpublished appendices from The Relationship between Firm Size and Firm Growth in the U.S. Manufacturing Sector Bronwyn H. Hall Journal of Industrial Economics 35 (June 987): 583-606. Appendix A: The time

More information

7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis

7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis Lecture 6: Logistic Regression Analysis Christopher S. Hollenbeak, PhD Jane R. Schubart, PhD The Outcomes Research Toolbox Review Homework 2 Overview Logistic regression model conceptually Logistic regression

More information

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation NELS 88 Table 2.3 Adjusted odds ratios of eighth-grade students in 988 performing below basic levels of reading and mathematics in 988 and dropping out of school, 988 to 990, by basic demographics Variable

More information

Wageningen Summer School in Econometrics. The Bayesian Approach in Theory and Practice

Wageningen Summer School in Econometrics. The Bayesian Approach in Theory and Practice Wageningen Summer School in Econometrics The Bayesian Approach in Theory and Practice September 2008 Slides for Lecture on Qualitative and Limited Dependent Variable Models Gary Koop, University of Strathclyde

More information

Jeffrey M. Wooldridge Michigan State University

Jeffrey M. Wooldridge Michigan State University Fractional Response Models with Endogenous Explanatory Variables and Heterogeneity Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Fractional Probit with Heteroskedasticity 3. Fractional

More information

ECLT 5810 Linear Regression and Logistic Regression for Classification. Prof. Wai Lam

ECLT 5810 Linear Regression and Logistic Regression for Classification. Prof. Wai Lam ECLT 5810 Linear Regression and Logistic Regression for Classification Prof. Wai Lam Linear Regression Models Least Squares Input vectors is an attribute / feature / predictor (independent variable) The

More information

Testing and Model Selection

Testing and Model Selection Testing and Model Selection This is another digression on general statistics: see PE App C.8.4. The EViews output for least squares, probit and logit includes some statistics relevant to testing hypotheses

More information

Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment

Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment BINARY CHOICE MODELS Y ( Y ) ( Y ) 1 with Pr = 1 = P = 0 with Pr = 0 = 1 P Examples: decision-making purchase of durable consumer products unemployment Estimation with OLS? Yi = Xiβ + εi Problems: nonsense

More information

ECONOMETFUCS FIELD EXAM Michigan State University May 11, 2007

ECONOMETFUCS FIELD EXAM Michigan State University May 11, 2007 ECONOMETFUCS FIELD EXAM Michigan State University May 11, 2007 Instructions: Answer all four (4) questions. Point totals for each question are given in parenthesis; there are 100 points possible. Within

More information

Bayesian Modeling of Conditional Distributions

Bayesian Modeling of Conditional Distributions Bayesian Modeling of Conditional Distributions John Geweke University of Iowa Indiana University Department of Economics February 27, 2007 Outline Motivation Model description Methods of inference Earnings

More information

Applied Health Economics (for B.Sc.)

Applied Health Economics (for B.Sc.) Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative

More information

Gibbs Sampling in Latent Variable Models #1

Gibbs Sampling in Latent Variable Models #1 Gibbs Sampling in Latent Variable Models #1 Econ 690 Purdue University Outline 1 Data augmentation 2 Probit Model Probit Application A Panel Probit Panel Probit 3 The Tobit Model Example: Female Labor

More information

Chapter 6: Endogeneity and Instrumental Variables (IV) estimator

Chapter 6: Endogeneity and Instrumental Variables (IV) estimator Chapter 6: Endogeneity and Instrumental Variables (IV) estimator Advanced Econometrics - HEC Lausanne Christophe Hurlin University of Orléans December 15, 2013 Christophe Hurlin (University of Orléans)

More information

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes Maximum Likelihood Estimation Econometrics II Department of Economics Universidad Carlos III de Madrid Máster Universitario en Desarrollo y Crecimiento Económico Outline 1 3 4 General Approaches to Parameter

More information

Chapter 1. GMM: Basic Concepts

Chapter 1. GMM: Basic Concepts Chapter 1. GMM: Basic Concepts Contents 1 Motivating Examples 1 1.1 Instrumental variable estimator....................... 1 1.2 Estimating parameters in monetary policy rules.............. 2 1.3 Estimating

More information

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College December 2016 Abstract Lewbel (2012) provides an estimator

More information

Gibbs Sampling in Endogenous Variables Models

Gibbs Sampling in Endogenous Variables Models Gibbs Sampling in Endogenous Variables Models Econ 690 Purdue University Outline 1 Motivation 2 Identification Issues 3 Posterior Simulation #1 4 Posterior Simulation #2 Motivation In this lecture we take

More information

A better way to bootstrap pairs

A better way to bootstrap pairs A better way to bootstrap pairs Emmanuel Flachaire GREQAM - Université de la Méditerranée CORE - Université Catholique de Louvain April 999 Abstract In this paper we are interested in heteroskedastic regression

More information

Discrete Choice Modeling

Discrete Choice Modeling [Part 6] 1/55 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent Class 11 Mixed Logit 12 Stated Preference

More information

What s New in Econometrics? Lecture 14 Quantile Methods

What s New in Econometrics? Lecture 14 Quantile Methods What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression

More information

Economics 241B Estimation with Instruments

Economics 241B Estimation with Instruments Economics 241B Estimation with Instruments Measurement Error Measurement error is de ned as the error resulting from the measurement of a variable. At some level, every variable is measured with error.

More information

Introduction to GSEM in Stata

Introduction to GSEM in Stata Introduction to GSEM in Stata Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Introduction to GSEM in Stata Boston College, Spring 2016 1 /

More information

Applied Microeconometrics (L5): Panel Data-Basics

Applied Microeconometrics (L5): Panel Data-Basics Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics

More information

Econometrics II. Seppo Pynnönen. Spring Department of Mathematics and Statistics, University of Vaasa, Finland

Econometrics II. Seppo Pynnönen. Spring Department of Mathematics and Statistics, University of Vaasa, Finland Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2018 Part III Limited Dependent Variable Models As of Jan 30, 2017 1 Background 2 Binary Dependent Variable The Linear Probability

More information

We begin by thinking about population relationships.

We begin by thinking about population relationships. Conditional Expectation Function (CEF) We begin by thinking about population relationships. CEF Decomposition Theorem: Given some outcome Y i and some covariates X i there is always a decomposition where

More information

Probability and Estimation. Alan Moses

Probability and Estimation. Alan Moses Probability and Estimation Alan Moses Random variables and probability A random variable is like a variable in algebra (e.g., y=e x ), but where at least part of the variability is taken to be stochastic.

More information

Marginal Specifications and a Gaussian Copula Estimation

Marginal Specifications and a Gaussian Copula Estimation Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required

More information

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS Parametric Distributions Basic building blocks: Need to determine given Representation: or? Recall Curve Fitting Binary Variables

More information

Economics 620, Lecture 18: Nonlinear Models

Economics 620, Lecture 18: Nonlinear Models Economics 620, Lecture 18: Nonlinear Models Nicholas M. Kiefer Cornell University Professor N. M. Kiefer (Cornell University) Lecture 18: Nonlinear Models 1 / 18 The basic point is that smooth nonlinear

More information

A Joint Tour-Based Model of Vehicle Type Choice and Tour Length

A Joint Tour-Based Model of Vehicle Type Choice and Tour Length A Joint Tour-Based Model of Vehicle Type Choice and Tour Length Ram M. Pendyala School of Sustainable Engineering & the Built Environment Arizona State University Tempe, AZ Northwestern University, Evanston,

More information

CENSORED DATA AND CENSORED NORMAL REGRESSION

CENSORED DATA AND CENSORED NORMAL REGRESSION CENSORED DATA AND CENSORED NORMAL REGRESSION Data censoring comes in many forms: binary censoring, interval censoring, and top coding are the most common. They all start with an underlying linear model

More information

LOGISTIC REGRESSION Joseph M. Hilbe

LOGISTIC REGRESSION Joseph M. Hilbe LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of

More information

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser Simultaneous Equations with Error Components Mike Bronner Marko Ledic Anja Breitwieser PRESENTATION OUTLINE Part I: - Simultaneous equation models: overview - Empirical example Part II: - Hausman and Taylor

More information

multilevel modeling: concepts, applications and interpretations

multilevel modeling: concepts, applications and interpretations multilevel modeling: concepts, applications and interpretations lynne c. messer 27 october 2010 warning social and reproductive / perinatal epidemiologist concepts why context matters multilevel models

More information

Sigmaplot di Systat Software

Sigmaplot di Systat Software Sigmaplot di Systat Software SigmaPlot Has Extensive Statistical Analysis Features SigmaPlot is now bundled with SigmaStat as an easy-to-use package for complete graphing and data analysis. The statistical

More information

New Developments in Econometrics Lecture 16: Quantile Estimation

New Developments in Econometrics Lecture 16: Quantile Estimation New Developments in Econometrics Lecture 16: Quantile Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. Review of Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile

More information

Notes on Mathematical Expectations and Classes of Distributions Introduction to Econometric Theory Econ. 770

Notes on Mathematical Expectations and Classes of Distributions Introduction to Econometric Theory Econ. 770 Notes on Mathematical Expectations and Classes of Distributions Introduction to Econometric Theory Econ. 77 Jonathan B. Hill Dept. of Economics University of North Carolina - Chapel Hill October 4, 2 MATHEMATICAL

More information