Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model

Size: px
Start display at page:

Download "Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model"

Transcription

1 Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1 Tetsuya Matsubayashi University of North Texas November 2, 2010 Random utility model Multinomial logit model Conditional logit model Independence of Irrelevant Alternatives Nested logit model (next week) Mixed logit model (next week) Multinomial probit model (next week) 1 / 47 2 / 47 Multinomial Dependent Variable Random Utility Model Vote: Bush, Clinton, or Perot in the 1992 presidential election Travel: car, bus, or train Occupation: blue-color job, white-color job, professional job etc. A decision maker chooses one alternative from a choice set. The choice set is characterized as follows: Alternatives must be mutually exclusive. The choice set must be exhaustive. The number of alternatives must be definite. 3 / 47 4 / 47

2 Random Utility Model Random Utility Model Thus, the model is expressed as: The random utility model assumes that a decision maker i attaches a utility to each alternative, U im, m 1... M. The random utility model assumes that the utility consists of two components: Systematic component, which we can observe. Random component, which we cannot observe. U im V im + ɛ im where U im is a decision maker i s utility for alternative m, V im is the systematic component for a decision maker i associated with choice m, and ɛ im denotes the random component of utility for a decision maker i associated with choice m. For example, the utility of voting for Obama increases as ideological proximity increases. 5 / 47 6 / 47 Random Utility Model Random Utility Model We assume that the systematic component for the utility is a linear function of some exogenous variables V im x im β if variables are choice-specific and V im x i β m if variables are individual-specific. In the case of presidential vote choice, candidates traits are choice specific variables and individuals demographic characteristics are individual-specific variables.p The utility of individual i for choice m is rewritten as: U im x im β + ɛ im U im x i β m + ɛ im The model assumes that the decision maker chooses choice m if and only if: U im > U ij j m 7 / 47 8 / 47

3 Multinomial Logit Model: Set Up Set Up We begin with the random utility model using individual-specific variables: U im x i β m + ɛ im where x i denotes a vector of individual-specific characteristics. β m is a vector of choice specific parameters. Thus, the effect of x i varies across the choices. Suppose a vote choice model in the 1992 presidential election. Income is a key exogenous variable. The utilities are written as Bush: U i1 β 10 + β 11 Income i + ɛ i1 Clinton: U i2 β 20 + β 21 Income i + ɛ i2 Perot: U i3 β 30 + β 31 Income i + ɛ i3 9 / / 47 Set Up Set Up Suppose that individual i chooses one of two alternatives. The probability of choosing alternative 1 is the probability that the utility 1 exceeds the utility from alternative 2: Pr(y i 1) Pr(U i1 > U i2 ) This is a binary choice model. Pr(V i1 + ɛ i1 > V i2 + ɛ i2 ) Pr(ɛ i2 ɛ i1 < V i1 V i2 ) Suppose that individual i chooses one of three alternatives. The probability of choosing alternative 1 is the probability that the utility 1 exceeds the utility from alternative 2 and the utility from alternative 3: P(y i 1) Pr[(U i1 > U i2 ) and (U i1 > U i3 )] Pr[(V i1 + ɛ i1 > V i2 + ɛ i2 ) and (V i1 + ɛ i1 > V i3 + ɛ i3 )] Pr[(ɛ i2 ɛ i1 < V i1 V i2 ) and (ɛ i3 ɛ i1 < V i1 V i3 )] 11 / / 47

4 Set Up Distributional Assumption When there are J choices, the probability of choice m is P(y i m) Pr(U im > U ij ) j m For example, the probability of voting for Bush equals the probability that the utility gained from voting for Bush exceeds the utilities from voting for Clinton and Perot. First, random components are independently and identically distributed (IID). In other words, the random components of the utility of all alternatives are uncorrelated with the unobserved components of utility for all other alternatives, and each of these unobserved components has identical distribution. Second, random components are distributed according to type I extreme value. 13 / / 47 Distributional Assumption The Probability Density Function The CDF of type I extreme value distribution is F (ɛ im ) e e ɛ im The PDF of type I extreme value distribution is f (ɛ im ) e ɛ im e e ɛ im The choice of the distribution is motivated by the simplicity, tractability, and usefulness of the resulting model. This distribution has mode 0, mean.58, and standard deviation pdf x 15 / / 47

5 The Cumulative Distribution Function Distributional Assumption cdf The difference between two extreme value variables is distributed logistic. That is, if ɛ im and ɛ in are iid extreme value, then, ɛ imn follows the logistic distribution: eɛ imn F (ɛ imn) 1 + e ɛ imn x 17 / / 47 Distributional Assumption Identification The choice probability is: P im (ɛ in ɛ im < V im V in )f (ɛ im )dɛ im Some algebraic manipulation of this integral results in a succinct, closed form expression: P im Pr(y i m) e V im J J1 ev ij e x i β m J J1 ex i β J It is convenient to code the outcomes as j 0, 1,..., J so there are J + 1 alternatives in this notation. In the current set up, the ˆβ m are unidentified. For any vector of constants q, we find the same probabilities whether we use β m or β where β β m + q. We could add an arbitrary constant to all the coefficients in the model, yet get the same probabilities. which is the logit choice probability. (See Train, 2003, 78-9 for this derivation.) 19 / / 47

6 Identification Identification Consider the following example with 3 choices: P(y i m) e x(β 1+q) e x(β k +q) e xβ 1 e xq e x(β1+q) + e x(β2+q) + e x(β 3+q) e xβ 1 e xq e xβ 1 e xq + e xβ 2 e xq + e xβ 3 e xq e xβ 1 e xq ( e xβ k )e xq e xβ 1 e xβ k Therefore, the model cannot distinguish the true parameters from the parameters plus an arbitrary constant. As with ordered response models, we need an assumption or normalization which will identify the parameters. A convenient normalization that solves the identification problem is to assume that one of the sets of coefficients (the coefficients for one of the choices) are all zero. 21 / / 47 Identification Identification Specifically, assume that all β 0 0 for category zero. More generally, for j 0, 1,..., J. P(y i 0) P(y i j) e x 0 e x 0 + J k1 exβ k J k1 exβ k e xβ j 1 + J k1 exβ k The first alternative becomes the reference category to which all of the results are compared. In this form, it is clear that when J 1, we have the binary logit as a special case of the multinomial logit: P(y i 1) e xβ e xβ 1 23 / / 47

7 Estimation Estimation Estimation of this model is relatively easy since the log-likelihood is globally concave. To specify the likelihood, first define d ij 1 if individual i chooses alternative j, and d ij 0 otherwise. This means there are J + 1 d ij s, each indicating a choice. Use these to select the appropriate terms in the likelihood function. As with ordered response models, there is a different probability expression for each selected outcome. The likelihood function for individual i is L i P d i0 0 P d i1 1 P d i P d ij J Since we assume these are independent, the joint likelihood is the product of the likelihood of each outcome: L N i1 P d i0 0 Pd i1 1 Pd i P d ij J 25 / / 47 Estimation Estimation The log-likelihood is lnl where β 0 0. N i1 m0 N i1 m0 J d im lnp m ( J d im ln e x i β m 1 + J J1 ex i β j ) Estimate the vote choice model in the 1992 Presidential election: vote i β m0 + β m1 Economy i + β m2 Democrat i +β m3 Republican i + β m4 Income i + ɛ im where vote i has three categories (Bush, Clinton, Perot). Use multinom in nnet library. 27 / / 47

8 Interpreting Coefficients Marginal Effects There are two sets of coefficients for each independent variable. The signs of coefficients can be interpreted in a direct manner. For example, a negative coefficient indicates the the independent variable reduces the probability of voting for a candidate compared to the baseline candidate. Statistical inference is done as usual. We can calculate the marginal effect of one continuous independent variable on the probabilities of the outcome categories. P m x k P m (β km P m (β m β) J P j β kj ) j0 This is the weighted sum of β k where the weights are the outcome probabilities. This tells us the effect on the probabilities of choosing m if a variable increases by small amount. 29 / / 47 Predicted Probabilities Odds Ratio Predicted probabilities can be computed with the following equation: ˆP m e x ˆβ m 1 + J j1 ex ˆβ j The values of the key independent variable change, while the other variables are held constant. Odds ratios are useful when you want to know the odds of choosing one alternative relative to the other. We first write: Ω mn (x i ) P im P in where Ω mn (x i ) is the odds of outcome m versus outcome n given x i. x i includes all independent variables. 31 / / 47

9 Odds Ratio We continue: Ω mn (x i ) P im P in ex i β m e x i β n e x i [β m β n] e x i βm J J1 ex i β J e x i βn J J1 ex i β J An individual with characteristics specified in x i is e x i [β m β n] more likely to choose m over n. If you want to use the odds ratio as opposed to the baseline category, the equation is simplified to: Ω m1 (x i ) e x i β m An individual with characteristics specified in x i is e x i β m more likely to choose m over the baseline category. 33 / 47 Odds Ratio You can assess how a change in a particular independent variable affects the odds ratio of m to the baseline category. The effect is computed by: Ω m1 (x i, x ik + δ) Ω m1 (x i, x ik ) e β km δ where x ik is the k th independent variable for individual i and β km is the coefficient associated with the k th independent variable for alternative m. For a change of δ in x ik, the odds of outcome m versus the baseline category are expected to change by a factor of e β km δ, holding all other variables constant. The factor change in the odds for a change in x ik does not depend on the level of x ik or on the level of any other variable. 34 / 47 Conditional Logit Model Conditional Logit Model In the MNL model, each explanatory variable denotes individual-specific characteristics and has a different effect on each outcome. The utility for the MNL model is expressed as U im x i β m + ɛ im The conditional logit model is slightly different from MNL since it considers the impact of choice-specific attributes instead of individual-specific attributes. The utility for the CL model is written as U im z im γ + ɛ im where z im denotes a vector of choice-specific attributes. In the case of the vote choice model in 1992, z im would be a perceived candidate trait, for example. Importantly, the parameters are not choice-specific attributes; there is only one for each attribute. In the three-candidate race, the utilities are expressed as Bush: U i1 β 1 honesty i1 + ɛ i1 Clinton: U i2 β 1 honesty i2 + ɛ i2 Perot: U i3 β 1 honesty i3 + ɛ i3 The utility gets larger when perceived honesty increases. 35 / / 47

10 Data for Conditional Logit Model Conditional Logit Model outcome i outcome chosen honesty age The probability that individual i chooses alternative m in the CL model is e z imγ Pr(y i m) J J1 ez ij γ which should be compared to the MNL model: where β 1 0. Pr(y i m) e x i β m J J1 ex i β J 37 / / 47 (Mixed) Conditional Logit Model (Mixed) Conditional Logit Model It is possible to include both individual-specific and choice-specific attributes in the model. The utility is given by U im x i β m + z im γ + ɛ im where x i contains individual-specific attributes for individual i and z im contains choice-specific attributes for outcome m. The probability that individual i chooses alternative m is: where β 1 0. P(y i m) ex i β m+z im γ J J1 ex i β j +z ij γ 39 / / 47

11 Interpretation Independence of Irrelevant Alternatives You can interpret the coefficients in exactly the same way as you do in the MNL model for the individual-specific variables. For choice-specific variables, the signs of the coefficients indicate how an increase in z affects the likelihood that the individual chooses one alternative. You can also use the same techniques (e.g., predicted probabilities) to make an interpretation. In the multinomial logit model, the equation for the odds of m versus n is P(y i m) P(y i n) exi βm e x i β n evim e V in This equation indicates that the odds are determined without reference to the other outcomes that might be available. This property is called as the independence of irrelevant alternatives or IIA. This is a consequence of assuming independence of ɛ ij in the random utility model. 41 / / 47 Independence of Irrelevant Alternatives Independence of Irrelevant Alternatives Think about McFadden s famous example. A person has two choices for commuting to work: a private car that is chosen with P(car) 1/2 and a red bus with P(red bus) 1/2. The implied odds of taking the car versus the red bus is 1. Suppose a new bus company is started that is identical the current service except that the buses are blue. IIA requires that the new probabilities are P(car) 1/3, P(red bus).1/3, and P(blue bus) 1/3. This is necessary so that the odds of a car versus a red bus remain 1. However, if the only thing to distinguish the new bus service from the old is the color of the bus, we would not expect car travelers to start taking the bus (i.e., the utility does not change). Instead, the share of red bus riders would be split, resulting in P(car) 1/2 P(red bus) 1/4, and P(blue bus) 1/4. The new, implied odds for car versus red bus are 2 1/2 1/4, which violates the IIA assumption! The IIA assumption requires that if a new alternative becomes available, then all probabilities for the prior choices must adjust in precisely the amount necessary to retain the original odds among all pairs of outcomes. 43 / / 47

12 Independence of Irrelevant Alternatives Testing IIA We assumed the the disturbances were distributed identically and independently according to Type 1 Extreme Value distribution. The violation of IIA indicates that the errors ɛ ij are not independent across alternatives j. The non-independence causes us to overestimate the probability of choosing alternatives that are similar to each other. A Hausman-type test is available to assess the property of IIA. If the IIA property holds, then the parameter estimates obtained on the subset of alternatives will not be significantly different from those obtained on the full set of alternatives. 45 / / 47 Testing IIA The Hausman test proceeds as follows: 1 Estimate coefficients ˆβ F and covariance matrix ˆV F with all J alternatives. 2 Estimate coefficients ˆβ R and covariance matrix ˆV R with reduced alternatives. 3 Compare both estimates based on Hausman statistic: ( ˆβ R ˆβ F ) [ ˆV R ˆV F ] 1 ( ˆβ R ˆβ F ) which follows χ 2 distribution with k degrees of freedom where k is the number of elements in the β vector. 4 If the test statistic is larger than a critical value, we reject the null hypothesis that the IIA property holds. See Fry and Harris (1998) for alternative tests. 47 / 47

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2. Recap: MNL. Recap: MNL

Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2. Recap: MNL. Recap: MNL Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2 Tetsuya Matsubayashi University of North Texas November 9, 2010 Learn multiple responses models that do not require the assumption

More information

POLI 7050 Spring 2008 February 27, 2008 Unordered Response Models I

POLI 7050 Spring 2008 February 27, 2008 Unordered Response Models I POLI 7050 Spring 2008 February 27, 2008 Unordered Response Models I Introduction For the next couple weeks we ll be talking about unordered, polychotomous dependent variables. Examples include: Voter choice

More information

disc choice5.tex; April 11, ffl See: King - Unifying Political Methodology ffl See: King/Tomz/Wittenberg (1998, APSA Meeting). ffl See: Alvarez

disc choice5.tex; April 11, ffl See: King - Unifying Political Methodology ffl See: King/Tomz/Wittenberg (1998, APSA Meeting). ffl See: Alvarez disc choice5.tex; April 11, 2001 1 Lecture Notes on Discrete Choice Models Copyright, April 11, 2001 Jonathan Nagler 1 Topics 1. Review the Latent Varible Setup For Binary Choice ffl Logit ffl Likelihood

More information

Discrete Choice Models I

Discrete Choice Models I Discrete Choice Models I 1 Introduction A discrete choice model is one in which decision makers choose among a set of alternatives. 1 To fit within a discrete choice framework, the set of alternatives

More information

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo Lecture 1 Behavioral Models Multinomial Logit: Power and limitations Cinzia Cirillo 1 Overview 1. Choice Probabilities 2. Power and Limitations of Logit 1. Taste variation 2. Substitution patterns 3. Repeated

More information

Limited Dependent Variable Models II

Limited Dependent Variable Models II Limited Dependent Variable Models II Fall 2008 Environmental Econometrics (GR03) LDV Fall 2008 1 / 15 Models with Multiple Choices The binary response model was dealing with a decision problem with two

More information

Lecture-20: Discrete Choice Modeling-I

Lecture-20: Discrete Choice Modeling-I Lecture-20: Discrete Choice Modeling-I 1 In Today s Class Introduction to discrete choice models General formulation Binary choice models Specification Model estimation Application Case Study 2 Discrete

More information

Probabilistic Choice Models

Probabilistic Choice Models Econ 3: James J. Heckman Probabilistic Choice Models This chapter examines different models commonly used to model probabilistic choice, such as eg the choice of one type of transportation from among many

More information

A short introduc-on to discrete choice models

A short introduc-on to discrete choice models A short introduc-on to discrete choice models BART Kenneth Train, Discrete Choice Models with Simula-on, Chapter 3. Ques-ons Impact of cost, commu-ng -me, walk -me, transfer -me, number of transfers, distance

More information

I. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as

I. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as Econ 513, USC, Fall 2005 Lecture 15 Discrete Response Models: Multinomial, Conditional and Nested Logit Models Here we focus again on models for discrete choice with more than two outcomes We assume that

More information

The Logit Model: Estimation, Testing and Interpretation

The Logit Model: Estimation, Testing and Interpretation The Logit Model: Estimation, Testing and Interpretation Herman J. Bierens October 25, 2008 1 Introduction to maximum likelihood estimation 1.1 The likelihood function Consider a random sample Y 1,...,

More information

Introduction to Discrete Choice Models

Introduction to Discrete Choice Models Chapter 7 Introduction to Dcrete Choice Models 7.1 Introduction It has been mentioned that the conventional selection bias model requires estimation of two structural models, namely the selection model

More information

Probabilistic Choice Models

Probabilistic Choice Models Probabilistic Choice Models James J. Heckman University of Chicago Econ 312 This draft, March 29, 2006 This chapter examines dierent models commonly used to model probabilistic choice, such as eg the choice

More information

Single-level Models for Binary Responses

Single-level Models for Binary Responses Single-level Models for Binary Responses Distribution of Binary Data y i response for individual i (i = 1,..., n), coded 0 or 1 Denote by r the number in the sample with y = 1 Mean and variance E(y) =

More information

An Overview of Choice Models

An Overview of Choice Models An Overview of Choice Models Dilan Görür Gatsby Computational Neuroscience Unit University College London May 08, 2009 Machine Learning II 1 / 31 Outline 1 Overview Terminology and Notation Economic vs

More information

Applied Health Economics (for B.Sc.)

Applied Health Economics (for B.Sc.) Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative

More information

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary

More information

The 17 th Behavior Modeling Summer School

The 17 th Behavior Modeling Summer School The 17 th Behavior Modeling Summer School September 14-16, 2017 Introduction to Discrete Choice Models Giancarlos Troncoso Parady Assistant Professor Urban Transportation Research Unit Department of Urban

More information

Week 7: Binary Outcomes (Scott Long Chapter 3 Part 2)

Week 7: Binary Outcomes (Scott Long Chapter 3 Part 2) Week 7: (Scott Long Chapter 3 Part 2) Tsun-Feng Chiang* *School of Economics, Henan University, Kaifeng, China April 29, 2014 1 / 38 ML Estimation for Probit and Logit ML Estimation for Probit and Logit

More information

Linear Regression With Special Variables

Linear Regression With Special Variables Linear Regression With Special Variables Junhui Qian December 21, 2014 Outline Standardized Scores Quadratic Terms Interaction Terms Binary Explanatory Variables Binary Choice Models Standardized Scores:

More information

Fixed Effects Models for Panel Data. December 1, 2014

Fixed Effects Models for Panel Data. December 1, 2014 Fixed Effects Models for Panel Data December 1, 2014 Notation Use the same setup as before, with the linear model Y it = X it β + c i + ɛ it (1) where X it is a 1 K + 1 vector of independent variables.

More information

POLI 7050 Spring 2008 March 5, 2008 Unordered Response Models II

POLI 7050 Spring 2008 March 5, 2008 Unordered Response Models II POLI 7050 Spring 2008 March 5, 2008 Unordered Response Models II Introduction Today we ll talk about interpreting MNL and CL models. We ll start with general issues of model fit, and then get to variable

More information

Maximum Likelihood and. Limited Dependent Variable Models

Maximum Likelihood and. Limited Dependent Variable Models Maximum Likelihood and Limited Dependent Variable Models Michele Pellizzari IGIER-Bocconi, IZA and frdb May 24, 2010 These notes are largely based on the textbook by Jeffrey M. Wooldridge. 2002. Econometric

More information

h=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio

h=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio Multivariate Response Models The response variable is unordered and takes more than two values. The term unordered refers to the fact that response 3 is not more favored than response 2. One choice from

More information

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables

More information

Models of Qualitative Binary Response

Models of Qualitative Binary Response Models of Qualitative Binary Response Probit and Logit Models October 6, 2015 Dependent Variable as a Binary Outcome Suppose we observe an economic choice that is a binary signal. The focus on the course

More information

Binary Logistic Regression

Binary Logistic Regression The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b

More information

Review of Statistics

Review of Statistics Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and

More information

Chapter 3 Choice Models

Chapter 3 Choice Models Chapter 3 Choice Models 3.1 Introduction This chapter describes the characteristics of random utility choice model in a general setting, specific elements related to the conjoint choice context are given

More information

Binary choice. Michel Bierlaire

Binary choice. Michel Bierlaire Binary choice Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale de Lausanne M. Bierlaire (TRANSP-OR ENAC EPFL)

More information

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns

More information

Lecture 6: Discrete Choice: Qualitative Response

Lecture 6: Discrete Choice: Qualitative Response Lecture 6: Instructor: Department of Economics Stanford University 2011 Types of Discrete Choice Models Univariate Models Binary: Linear; Probit; Logit; Arctan, etc. Multinomial: Logit; Nested Logit; GEV;

More information

Econ 673: Microeconometrics

Econ 673: Microeconometrics Econ 673: Microeconometrics Chapter 4: Properties of Discrete Choice Models Fall 2008 Herriges (ISU) Chapter 4: Discrete Choice Models Fall 2008 1 / 29 Outline 1 2 Deriving Choice Probabilities 3 Identification

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Econometric Analysis of Games 1

Econometric Analysis of Games 1 Econometric Analysis of Games 1 HT 2017 Recap Aim: provide an introduction to incomplete models and partial identification in the context of discrete games 1. Coherence & Completeness 2. Basic Framework

More information

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION

More information

Standard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j

Standard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j Standard Errors & Confidence Intervals β β asy N(0, I( β) 1 ), where I( β) = [ 2 l(β, φ; y) ] β i β β= β j We can obtain asymptotic 100(1 α)% confidence intervals for β j using: β j ± Z 1 α/2 se( β j )

More information

Instrumental Variables and the Problem of Endogeneity

Instrumental Variables and the Problem of Endogeneity Instrumental Variables and the Problem of Endogeneity September 15, 2015 1 / 38 Exogeneity: Important Assumption of OLS In a standard OLS framework, y = xβ + ɛ (1) and for unbiasedness we need E[x ɛ] =

More information

2. We care about proportion for categorical variable, but average for numerical one.

2. We care about proportion for categorical variable, but average for numerical one. Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is

More information

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression Rebecca Barter April 20, 2015 Fisher s Exact Test Fisher s Exact Test

More information

UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables

UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables To be provided to students with STAT2201 or CIVIL-2530 (Probability and Statistics) Exam Main exam date: Tuesday, 20 June 1

More information

Review of Multinomial Distribution If n trials are performed: in each trial there are J > 2 possible outcomes (categories) Multicategory Logit Models

Review of Multinomial Distribution If n trials are performed: in each trial there are J > 2 possible outcomes (categories) Multicategory Logit Models Chapter 6 Multicategory Logit Models Response Y has J > 2 categories. Extensions of logistic regression for nominal and ordinal Y assume a multinomial distribution for Y. 6.1 Logit Models for Nominal Responses

More information

Bias Variance Trade-off

Bias Variance Trade-off Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]

More information

A Bayesian Probit Model with Spatial Dependencies

A Bayesian Probit Model with Spatial Dependencies A Bayesian Probit Model with Spatial Dependencies Tony E. Smith Department of Systems Engineering University of Pennsylvania Philadephia, PA 19104 email: tesmith@ssc.upenn.edu James P. LeSage Department

More information

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical

More information

Comparing IRT with Other Models

Comparing IRT with Other Models Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used

More information

Multiple regression: Categorical dependent variables

Multiple regression: Categorical dependent variables Multiple : Categorical Johan A. Elkink School of Politics & International Relations University College Dublin 28 November 2016 1 2 3 4 Outline 1 2 3 4 models models have a variable consisting of two categories.

More information

Advanced Quantitative Methods: limited dependent variables

Advanced Quantitative Methods: limited dependent variables Advanced Quantitative Methods: Limited Dependent Variables I University College Dublin 2 April 2013 1 2 3 4 5 Outline Model Measurement levels 1 2 3 4 5 Components Model Measurement levels Two components

More information

Maximum Likelihood Methods

Maximum Likelihood Methods Maximum Likelihood Methods Some of the models used in econometrics specify the complete probability distribution of the outcomes of interest rather than just a regression function. Sometimes this is because

More information

Review of One-way Tables and SAS

Review of One-way Tables and SAS Stat 504, Lecture 7 1 Review of One-way Tables and SAS In-class exercises: Ex1, Ex2, and Ex3 from http://v8doc.sas.com/sashtml/proc/z0146708.htm To calculate p-value for a X 2 or G 2 in SAS: http://v8doc.sas.com/sashtml/lgref/z0245929.htmz0845409

More information

[y i α βx i ] 2 (2) Q = i=1

[y i α βx i ] 2 (2) Q = i=1 Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation

More information

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63 1 / 63 Panel Data Models Chapter 5 Financial Econometrics Michael Hauser WS17/18 2 / 63 Content Data structures: Times series, cross sectional, panel data, pooled data Static linear panel data models:

More information

CONTINUOUS RANDOM VARIABLES

CONTINUOUS RANDOM VARIABLES the Further Mathematics network www.fmnetwork.org.uk V 07 REVISION SHEET STATISTICS (AQA) CONTINUOUS RANDOM VARIABLES The main ideas are: Properties of Continuous Random Variables Mean, Median and Mode

More information

Part I Behavioral Models

Part I Behavioral Models Part I Behavioral Models 2 Properties of Discrete Choice Models 2.1 Overview This chapter describes the features that are common to all discrete choice models. We start by discussing the choice set, which

More information

Outline. The binary choice model. The multinomial choice model. Extensions of the basic choice model

Outline. The binary choice model. The multinomial choice model. Extensions of the basic choice model Outline The binary choice model Illustration Specification of the binary choice model Interpreting the results of binary choice models ME output The multinomial choice model Illustration Specification

More information

Can a Pseudo Panel be a Substitute for a Genuine Panel?

Can a Pseudo Panel be a Substitute for a Genuine Panel? Can a Pseudo Panel be a Substitute for a Genuine Panel? Min Hee Seo Washington University in St. Louis minheeseo@wustl.edu February 16th 1 / 20 Outline Motivation: gauging mechanism of changes Introduce

More information

Item Response Theory for Conjoint Survey Experiments

Item Response Theory for Conjoint Survey Experiments Item Response Theory for Conjoint Survey Experiments Devin Caughey Hiroto Katsumata Teppei Yamamoto Massachusetts Institute of Technology PolMeth XXXV @ Brigham Young University July 21, 2018 Conjoint

More information

Classification. Chapter Introduction. 6.2 The Bayes classifier

Classification. Chapter Introduction. 6.2 The Bayes classifier Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode

More information

Lecture 5: Spatial probit models. James P. LeSage University of Toledo Department of Economics Toledo, OH

Lecture 5: Spatial probit models. James P. LeSage University of Toledo Department of Economics Toledo, OH Lecture 5: Spatial probit models James P. LeSage University of Toledo Department of Economics Toledo, OH 43606 jlesage@spatial-econometrics.com March 2004 1 A Bayesian spatial probit model with individual

More information

Instrumental Variables

Instrumental Variables Instrumental Variables Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Noncompliance in Experiments Stat186/Gov2002 Fall 2018 1 / 18 Instrumental Variables

More information

Lecture 11. Probability Theory: an Overveiw

Lecture 11. Probability Theory: an Overveiw Math 408 - Mathematical Statistics Lecture 11. Probability Theory: an Overveiw February 11, 2013 Konstantin Zuev (USC) Math 408, Lecture 11 February 11, 2013 1 / 24 The starting point in developing the

More information

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University. Panel GLMs Department of Political Science and Government Aarhus University May 12, 2015 1 Review of Panel Data 2 Model Types 3 Review and Looking Forward 1 Review of Panel Data 2 Model Types 3 Review

More information

Lecture 14: Introduction to Poisson Regression

Lecture 14: Introduction to Poisson Regression Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why

More information

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week

More information

Nevo on Random-Coefficient Logit

Nevo on Random-Coefficient Logit Nevo on Random-Coefficient Logit March 28, 2003 Eric Rasmusen Abstract Overheads for logit demand estimation for G604. These accompany the Nevo JEMS article. Indiana University Foundation Professor, Department

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

Logit Regression and Quantities of Interest

Logit Regression and Quantities of Interest Logit Regression and Quantities of Interest Stephen Pettigrew March 4, 2015 Stephen Pettigrew Logit Regression and Quantities of Interest March 4, 2015 1 / 57 Outline 1 Logistics 2 Generalized Linear Models

More information

EC402 - Problem Set 3

EC402 - Problem Set 3 EC402 - Problem Set 3 Konrad Burchardi 11th of February 2009 Introduction Today we will - briefly talk about the Conditional Expectation Function and - lengthily talk about Fixed Effects: How do we calculate

More information

1 Hotz-Miller approach: avoid numeric dynamic programming

1 Hotz-Miller approach: avoid numeric dynamic programming 1 Hotz-Miller approach: avoid numeric dynamic programming Rust, Pakes approach to estimating dynamic discrete-choice model very computer intensive. Requires using numeric dynamic programming to compute

More information

UC Berkeley Math 10B, Spring 2015: Midterm 2 Prof. Sturmfels, April 9, SOLUTIONS

UC Berkeley Math 10B, Spring 2015: Midterm 2 Prof. Sturmfels, April 9, SOLUTIONS UC Berkeley Math 10B, Spring 2015: Midterm 2 Prof. Sturmfels, April 9, SOLUTIONS 1. (5 points) You are a pollster for the 2016 presidential elections. You ask 0 random people whether they would vote for

More information

Truncation and Censoring

Truncation and Censoring Truncation and Censoring Laura Magazzini laura.magazzini@univr.it Laura Magazzini (@univr.it) Truncation and Censoring 1 / 35 Truncation and censoring Truncation: sample data are drawn from a subset of

More information

Econometrics for PhDs

Econometrics for PhDs Econometrics for PhDs Amine Ouazad April 2012, Final Assessment - Answer Key 1 Questions with a require some Stata in the answer. Other questions do not. 1 Ordinary Least Squares: Equality of Estimates

More information

Ordinary Least Squares Regression

Ordinary Least Squares Regression Ordinary Least Squares Regression Goals for this unit More on notation and terminology OLS scalar versus matrix derivation Some Preliminaries In this class we will be learning to analyze Cross Section

More information

Introduction to mixtures in discrete choice models

Introduction to mixtures in discrete choice models Introduction to mixtures in discrete choice models p. 1/42 Introduction to mixtures in discrete choice models Michel Bierlaire michel.bierlaire@epfl.ch Transport and Mobility Laboratory Introduction to

More information

INTRODUCTION TO TRANSPORTATION SYSTEMS

INTRODUCTION TO TRANSPORTATION SYSTEMS INTRODUCTION TO TRANSPORTATION SYSTEMS Lectures 5/6: Modeling/Equilibrium/Demand 1 OUTLINE 1. Conceptual view of TSA 2. Models: different roles and different types 3. Equilibrium 4. Demand Modeling References:

More information

Summary of Chapters 7-9

Summary of Chapters 7-9 Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two

More information

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid Applied Economics Regression with a Binary Dependent Variable Department of Economics Universidad Carlos III de Madrid See Stock and Watson (chapter 11) 1 / 28 Binary Dependent Variables: What is Different?

More information

Final Exam. Economics 835: Econometrics. Fall 2010

Final Exam. Economics 835: Econometrics. Fall 2010 Final Exam Economics 835: Econometrics Fall 2010 Please answer the question I ask - no more and no less - and remember that the correct answer is often short and simple. 1 Some short questions a) For each

More information

Chapter 11. Regression with a Binary Dependent Variable

Chapter 11. Regression with a Binary Dependent Variable Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score

More information

Comments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006

Comments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006 Comments on: Panel Data Analysis Advantages and Challenges Manuel Arellano CEMFI, Madrid November 2006 This paper provides an impressive, yet compact and easily accessible review of the econometric literature

More information

Generalized logit models for nominal multinomial responses. Local odds ratios

Generalized logit models for nominal multinomial responses. Local odds ratios Generalized logit models for nominal multinomial responses Categorical Data Analysis, Summer 2015 1/17 Local odds ratios Y 1 2 3 4 1 π 11 π 12 π 13 π 14 π 1+ X 2 π 21 π 22 π 23 π 24 π 2+ 3 π 31 π 32 π

More information

Lecture notes to Chapter 11, Regression with binary dependent variables - probit and logit regression

Lecture notes to Chapter 11, Regression with binary dependent variables - probit and logit regression Lecture notes to Chapter 11, Regression with binary dependent variables - probit and logit regression Tore Schweder October 28, 2011 Outline Examples of binary respons variables Probit and logit - examples

More information

Statistical Tests. Matthieu de Lapparent

Statistical Tests. Matthieu de Lapparent Statistical Tests Matthieu de Lapparent matthieu.delapparent@epfl.ch Transport and Mobility Laboratory, School of Architecture, Civil and Environmental Engineering, Ecole Polytechnique Fédérale de Lausanne

More information

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools Syllabus By Joan Llull Microeconometrics. IDEA PhD Program. Fall 2017 Chapter 1: Introduction and a Brief Review of Relevant Tools I. Overview II. Maximum Likelihood A. The Likelihood Principle B. The

More information

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2015-16 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This

More information

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Remember: Z.05 = 1.645, Z.01 = 2.33 We will only cover one-sided hypothesis testing (cases 12.3, 12.4.2, 12.5.2,

More information

Homework 1 Solutions

Homework 1 Solutions 36-720 Homework 1 Solutions Problem 3.4 (a) X 2 79.43 and G 2 90.33. We should compare each to a χ 2 distribution with (2 1)(3 1) 2 degrees of freedom. For each, the p-value is so small that S-plus reports

More information

CHAPTER 5. Logistic regression

CHAPTER 5. Logistic regression CHAPTER 5 Logistic regression Logistic regression is the standard way to model binary outcomes (that is, data y i that take on the values 0 or 1). Section 5.1 introduces logistic regression in a simple

More information

Introduction to Estimation Methods for Time Series models. Lecture 1

Introduction to Estimation Methods for Time Series models. Lecture 1 Introduction to Estimation Methods for Time Series models Lecture 1 Fulvio Corsi SNS Pisa Fulvio Corsi Introduction to Estimation () Methods for Time Series models Lecture 1 SNS Pisa 1 / 19 Estimation

More information

Ordered Response and Multinomial Logit Estimation

Ordered Response and Multinomial Logit Estimation Ordered Response and Multinomial Logit Estimation Quantitative Microeconomics R. Mora Department of Economics Universidad Carlos III de Madrid Outline Introduction 1 Introduction 2 3 Introduction The Ordered

More information

Gibbs Sampling in Latent Variable Models #1

Gibbs Sampling in Latent Variable Models #1 Gibbs Sampling in Latent Variable Models #1 Econ 690 Purdue University Outline 1 Data augmentation 2 Probit Model Probit Application A Panel Probit Panel Probit 3 The Tobit Model Example: Female Labor

More information

Statistical Analysis of the Item Count Technique

Statistical Analysis of the Item Count Technique Statistical Analysis of the Item Count Technique Kosuke Imai Department of Politics Princeton University Joint work with Graeme Blair May 4, 2011 Kosuke Imai (Princeton) Item Count Technique UCI (Statistics)

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression ST 430/514 Recall: A regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates)

More information

Limited Dependent Variables and Panel Data

Limited Dependent Variables and Panel Data and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement

More information

ECON 594: Lecture #6

ECON 594: Lecture #6 ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was

More information

Bivariate Distributions

Bivariate Distributions STAT/MATH 395 A - PROBABILITY II UW Winter Quarter 17 Néhémy Lim Bivariate Distributions 1 Distributions of Two Random Variables Definition 1.1. Let X and Y be two rrvs on probability space (Ω, A, P).

More information

The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen

The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen January 23-24, 2012 Page 1 Part I The Single Level Logit Model: A Review Motivating Example Imagine we are interested in voting

More information

Estimation of mixed generalized extreme value models

Estimation of mixed generalized extreme value models Estimation of mixed generalized extreme value models Michel Bierlaire michel.bierlaire@epfl.ch Operations Research Group ROSO Institute of Mathematics EPFL Katholieke Universiteit Leuven, November 2004

More information

Statistics 203 Introduction to Regression Models and ANOVA Practice Exam

Statistics 203 Introduction to Regression Models and ANOVA Practice Exam Statistics 203 Introduction to Regression Models and ANOVA Practice Exam Prof. J. Taylor You may use your 4 single-sided pages of notes This exam is 7 pages long. There are 4 questions, first 3 worth 10

More information

What can we learn about correlations from multinomial probit estimates?

What can we learn about correlations from multinomial probit estimates? What can we learn about correlations from multinomial probit estimates? Chiara Monfardini J.M.C. Santos Silva February 2006 Abstract It is well known that, in a multinomial probit, only the covariance

More information