Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment

Similar documents
Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit

ECON 594: Lecture #6

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1

Modeling Binary Outcomes: Logit and Probit Models

Limited Dependent Variable Models II

Limited Dependent Variables and Panel Data

Binary Dependent Variables

Chapter 11. Regression with a Binary Dependent Variable

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid

Maximum Likelihood and. Limited Dependent Variable Models

Limited Dependent Variables and Panel Data

Logistic Regression. Continued Psy 524 Ainsworth

7. Assumes that there is little or no multicollinearity (however, SPSS will not assess this in the [binary] Logistic Regression procedure).

Statistical Modelling with Stata: Binary Outcomes

Lecture 3.1 Basic Logistic LDA

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han

Chapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a

Applied Health Economics (for B.Sc.)

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

covariance between any two observations

Chap 2. Linear Classifiers (FTH, ) Yongdai Kim Seoul National University

Binary Outcomes. Objectives. Demonstrate the limitations of the Linear Probability Model (LPM) for binary outcomes

h=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio

LOGISTIC REGRESSION Joseph M. Hilbe

Ordered Response and Multinomial Logit Estimation

Discrete Dependent Variable Models

LOGISTIC REGRESSION. Lalmohan Bhar Indian Agricultural Statistics Research Institute, New Delhi

Instructions: Closed book, notes, and no electronic devices. Points (out of 200) in parentheses

Spring RMC Professional Development Series January 14, Generalized Linear Mixed Models (GLMMs): Concepts and some Demonstrations

ECONOMETRICS HONOR S EXAM REVIEW SESSION

Statistics: A review. Why statistics?

Lecture 4: Heteroskedasticity

Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions

Week 7: Binary Outcomes (Scott Long Chapter 3 Part 2)

Lecture 6: Discrete Choice: Qualitative Response

2. We care about proportion for categorical variable, but average for numerical one.

Econometrics II. Seppo Pynnönen. Spring Department of Mathematics and Statistics, University of Vaasa, Finland

Discrete Choice Modeling

Truncation and Censoring

STA 216, GLM, Lecture 16. October 29, 2007

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas

Binary Logistic Regression

Binary Models with Endogenous Explanatory Variables

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Lecture 11/12. Roy Model, MTE, Structural Estimation

Estimation in the Fixed Effects Ordered Logit Model. Chris Muris (SFU)

POLI 618 Notes. Stuart Soroka, Department of Political Science, McGill University. March 2010

Basic Medical Statistics Course

Homework Solutions Applied Logistic Regression

I. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as

Non-maximum likelihood estimation and statistical inference for linear and nonlinear mixed models

Introduction to Econometrics

Partial effects in fixed effects models

Logistic Regression: Regression with a Binary Dependent Variable

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Logistic & Tobit Regression

Parametric identification of multiplicative exponential heteroskedasticity ALYSSA CARLSON

Panel Models for Non-Continuous Dependent Variables

CS6220: DATA MINING TECHNIQUES

Single-level Models for Binary Responses

Double-Hurdle Models with Dependent Errors and Heteroscedasticity

Binary choice. Michel Bierlaire

Non-linear panel data modeling

Topic 5: Non-Linear Relationships and Non-Linear Least Squares

Stat 642, Lecture notes for 04/12/05 96

Linear Classification: Probabilistic Generative Models

Models with Limited Dependent Variables

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Comparing groups using predicted probabilities

Using the same data as before, here is part of the output we get in Stata when we do a logistic regression of Grade on Gpa, Tuce and Psi.

Generalized Linear Models for Non-Normal Data

Lecture 1: intro. to regresions

Group comparisons in logit and probit using predicted probabilities 1

Economics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama

ECON Introductory Econometrics. Lecture 11: Binary dependent variables

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:

Binomial Model. Lecture 10: Introduction to Logistic Regression. Logistic Regression. Binomial Distribution. n independent trials

MS&E 226: Small Data

Lecture notes to Chapter 11, Regression with binary dependent variables - probit and logit regression

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2. Recap: MNL. Recap: MNL

disc choice5.tex; April 11, ffl See: King - Unifying Political Methodology ffl See: King/Tomz/Wittenberg (1998, APSA Meeting). ffl See: Alvarez

Probabilistic Choice Models

Heteroscedasticity 1

Lecture 10: Introduction to Logistic Regression

Parametric Identification of Multiplicative Exponential Heteroskedasticity

Extensions to the Basic Framework II

MS&E 226: Small Data

Introductory Econometrics

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

Experimental Design and Statistical Methods. Workshop LOGISTIC REGRESSION. Jesús Piedrafita Arilla.

Logistic Regression. Sargur N. Srihari. University at Buffalo, State University of New York USA

Comprehensive Examination Quantitative Methods Spring, 2018


26:010:557 / 26:620:557 Social Science Research Methods

Assessing the Calibration of Dichotomous Outcome Models with the Calibration Belt

Data-analysis and Retrieval Ordinal Classification

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

Transcription:

BINARY CHOICE MODELS Y ( Y ) ( Y ) 1 with Pr = 1 = P = 0 with Pr = 0 = 1 P Examples: decision-making purchase of durable consumer products unemployment Estimation with OLS? Yi = Xiβ + εi Problems: nonsense predictions ( < 0, > 1) functional form: linear parameters always have the same effect ε heteroskedastic 1

2

Binary Variables E( y ) = µ Var( y ) = µ (1 µ ) because: 2 2 2 Var( y): E ( y µ ) = E( y 2 yµ + µ ) = 2 2 E( y ) 2 µ E( y) + µ 2 since y = 1 or 0 E( y ) = E( y ) = µ 2 2 Var (y) = µ 2 µ + µ = µ (1 µ ) Heteroskedasticity Variance increases with xβ... ( µ ) 3

LATENT VARIABLES e.g. employment decision (0 or 1) Must be based on a latent propensity to work Y i * : (based on labor supply) Y = X β + ε * i i i Y i 1 if Y * > 0 = i 0 if Y * 0 i other values are possible thresholds as well 4

( ) ( ) P= Pr Y = 1 = F Xβ with F as symmetric cumulative probability density. F( Xβ ) X β + F( Xβ ) lim = 1 lim = 0 X β Therefore: Y = 1 if Y * > 0 X β + ε > 0 X β > ε ε > X β i i i i Pr ( Y = 1) = Pr ( ε > Xβ) = F( Xβ) Pr ( Y = 0) = 1 F( Xβ ) 5

6

MAXIMUM LIKELIHOOD ESTIMATION The likelihood of receiving a certain result Y i in the sample can be maximised via the choice of an adequate parameter β. ( ) L = Pr Y = y, Y = y,, Y = y = N i= 1 1 1 2 2 ( β) F( X β) = 1 F X i = i Y = 0 Y = 1 i N = 1 F X i= 1 i 1 yi ( β) F( X β) i ( ) ( β) ( β) { } i i i i ln L = 1 y ln 1 F X + y ln F X MAX β Log-likelihood function is globally concave ==> unique global maximum N i N y i 7

The form of F (.) Probit-Model normal distribution X β 2 1 t F ( X β) =F ( X β) = exp dt 2p 2 Logit-Model standard logistic distribution ( β ) F X exp = 1 + exp ( X β ) ( X β ) Both yield very similar results 8

GOODNESS OF FIT 2 ln L Pseudo R :1, ln L0 ln L value of log-likelihood in a model without covariates 0 + variations (McFadden, etc.) Problem: Pseudo 2 R is only 1 if X β ± Observed/predicted table (estat classification in STATA after Logit) Amount of Right Predictions : ˆ * Y i = 1 if Pˆi > P (usually 0.5), separately for each subgroup Comparison of Y ˆi and Y i Hosmer-Lemeshow-Test for misspecification: tests, for a number of subgroups, whether or not the mean predicted probability for P=1 is the same as in the sample. (mostly conducted for deciles; has to be the same for the whole sample) 9

ESTIMATION PROBLEMS Maximum Likelihood is o consistent, o asymptotic efficient and asymptotically normally distributed problems with small samples (should be >100) ln L is globally concave, nevertheless estimation problems (no convergence of the iteration procedure, singularity of the hessian matrix in the 2 nd derivative) multicollinearity of variables dummy-variable explains outcome entirely ( huge coefficients) + linear combinations 10

d i : if d i =1, then y i =1 i= 1 { i ( iβ d i) ( i) ( iβ d i) } N ln L = y ln F X + d + 1 y ln 1 F X + d MAX β i { } ( β d) ( β) ( ) ( β) = ln F X + + y ln F X + 1 yln 1 F X i i i i i d =1 d =0 i δ is only estimated in the first term, maximizing ln L means δ increases to maximum ==> + scaling of variables check standard errors 11

PROPERTIES OF THE ESTIMATOR only small troubles with OLS if some assumptions are violated. (e.g. still consistent estimates if error is heteroscedastic or suffers from autocorrelation) with Probit/Logit: only β is identified instead of β. σ General assumption: σ=1. heteroscedasticity: o if independent of RHS-variables, no problem but: 2 if σi = exp( γ1 + γ2x1 i), β cannot be consistently estimated o hetpro (STATA) as simple method to test heteroscedasticity; the general assumption is that errors correlate with one of the variables 12

INTERPRETATION OF THE COEFFICIENT Non-linear procedure impact of variables depend on their position marginal effects are necessary dp dx 1 = β f 1 ( Xβ) Which position should be used to evaluate the marginal effect? all i, then 0 sample mean relevant combinations of variables 13

14

15

DUMMY-VARIABLES Discrete change X k ( Y X) Pr = 1 X k ( Y X X ) ( Y X X ) = Pr = 1, = 0 Pr = 1, = 1 k k STATA provides automated processes for marginal effects tables in marginal effects are always better Scott Long (Univ. of Indiana) developed own procedures for interpretations (see website) 16

ODDS RATIO FOR LOGIT Odds: ( ) ( ) ( ) ( ) Pr Y = 1 X Pr Y = 1 X = Pr Y = 0 X 1 Pr Y = 1 X how often happens 1, relative to 0, variation between 0 and ln(odds): variation between and : ( Y = X) ( Y X) Pr 1 ln 1 Pr = 1 = X β equivalent to logit: ( X β ) ( X β ) exp Pr ( Y = 1 X) = yields an interesting interpretation. 1 + exp 17

ODDS ( Ω ) P Ω ( X) = = 1 P e X β Two realisations of X are given: X 1 und X 0 Odds ratio: Ω Ω ( X1 ) ( X ) 0 = e ( X X ) 1 0 β If 1, ( 0) j e β > β j >, X j increases odds of observing Y = 1 18

PANEL MODELS For fixed T and N, the estimator is inconsistent. Incidental parameter problem, fixed effect α i not estimable. In linear model (OLS), it was possible to eliminate α i. Conditional maximum likelihood model: look for a sufficient statistic t i for α i, such that: conditional on t i, an individual likelihood s contribution no longer depends on α i, but on β. In linear models, a sufficient statistic for α i is y. This is not the case for a Probit model: Fixed Effects Probit is inconsistent. Fixed Effects Logit and Random Effects Probit is possible. 19

Ordered Probit/Logit Ordinal variable e.g. school grades * Latent variable Y (, ) is measured only in N values Yi = m if * τ m 1 Y < i τm, m 1 N τ threshold, cutpoints Thresholds are unknown must be estimated as well! 20

For ordered probit: ( Y = ) =Φ( Xβ ) ( Y = ) =Φ( τ1 Xβ) Φ( Xβ) ( Y = ) =Φ( τ Xβ) Φ( τ Xβ) Pr 0 0 Pr 1 Pr 2 2 1 ( Y = N) = Φ( τ Xβ) Pr 1 N 1 a change in X shifts the entire probability density tot he left/right Attention: Interpretation is difficult and only explicit at the borders! 21

22

23

24