Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model
|
|
- Colleen Scott
- 5 years ago
- Views:
Transcription
1 Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1 Tetsuya Matsubayashi University of North Texas November 2, 2010 Random utility model Multinomial logit model Conditional logit model Independence of Irrelevant Alternatives Nested logit model (next week) Mixed logit model (next week) Multinomial probit model (next week) 1 / 47 2 / 47 Multinomial Dependent Variable Random Utility Model Vote: Bush, Clinton, or Perot in the 1992 presidential election Travel: car, bus, or train Occupation: blue-color job, white-color job, professional job etc. A decision maker chooses one alternative from a choice set. The choice set is characterized as follows: Alternatives must be mutually exclusive. The choice set must be exhaustive. The number of alternatives must be definite. 3 / 47 4 / 47
2 Random Utility Model Random Utility Model Thus, the model is expressed as: The random utility model assumes that a decision maker i attaches a utility to each alternative, U im, m 1... M. The random utility model assumes that the utility consists of two components: Systematic component, which we can observe. Random component, which we cannot observe. U im V im + ɛ im where U im is a decision maker i s utility for alternative m, V im is the systematic component for a decision maker i associated with choice m, and ɛ im denotes the random component of utility for a decision maker i associated with choice m. For example, the utility of voting for Obama increases as ideological proximity increases. 5 / 47 6 / 47 Random Utility Model Random Utility Model We assume that the systematic component for the utility is a linear function of some exogenous variables V im x im β if variables are choice-specific and V im x i β m if variables are individual-specific. In the case of presidential vote choice, candidates traits are choice specific variables and individuals demographic characteristics are individual-specific variables.p The utility of individual i for choice m is rewritten as: U im x im β + ɛ im U im x i β m + ɛ im The model assumes that the decision maker chooses choice m if and only if: U im > U ij j m 7 / 47 8 / 47
3 Multinomial Logit Model: Set Up Set Up We begin with the random utility model using individual-specific variables: U im x i β m + ɛ im where x i denotes a vector of individual-specific characteristics. β m is a vector of choice specific parameters. Thus, the effect of x i varies across the choices. Suppose a vote choice model in the 1992 presidential election. Income is a key exogenous variable. The utilities are written as Bush: U i1 β 10 + β 11 Income i + ɛ i1 Clinton: U i2 β 20 + β 21 Income i + ɛ i2 Perot: U i3 β 30 + β 31 Income i + ɛ i3 9 / / 47 Set Up Set Up Suppose that individual i chooses one of two alternatives. The probability of choosing alternative 1 is the probability that the utility 1 exceeds the utility from alternative 2: Pr(y i 1) Pr(U i1 > U i2 ) This is a binary choice model. Pr(V i1 + ɛ i1 > V i2 + ɛ i2 ) Pr(ɛ i2 ɛ i1 < V i1 V i2 ) Suppose that individual i chooses one of three alternatives. The probability of choosing alternative 1 is the probability that the utility 1 exceeds the utility from alternative 2 and the utility from alternative 3: P(y i 1) Pr[(U i1 > U i2 ) and (U i1 > U i3 )] Pr[(V i1 + ɛ i1 > V i2 + ɛ i2 ) and (V i1 + ɛ i1 > V i3 + ɛ i3 )] Pr[(ɛ i2 ɛ i1 < V i1 V i2 ) and (ɛ i3 ɛ i1 < V i1 V i3 )] 11 / / 47
4 Set Up Distributional Assumption When there are J choices, the probability of choice m is P(y i m) Pr(U im > U ij ) j m For example, the probability of voting for Bush equals the probability that the utility gained from voting for Bush exceeds the utilities from voting for Clinton and Perot. First, random components are independently and identically distributed (IID). In other words, the random components of the utility of all alternatives are uncorrelated with the unobserved components of utility for all other alternatives, and each of these unobserved components has identical distribution. Second, random components are distributed according to type I extreme value. 13 / / 47 Distributional Assumption The Probability Density Function The CDF of type I extreme value distribution is F (ɛ im ) e e ɛ im The PDF of type I extreme value distribution is f (ɛ im ) e ɛ im e e ɛ im The choice of the distribution is motivated by the simplicity, tractability, and usefulness of the resulting model. This distribution has mode 0, mean.58, and standard deviation pdf x 15 / / 47
5 The Cumulative Distribution Function Distributional Assumption cdf The difference between two extreme value variables is distributed logistic. That is, if ɛ im and ɛ in are iid extreme value, then, ɛ imn follows the logistic distribution: eɛ imn F (ɛ imn) 1 + e ɛ imn x 17 / / 47 Distributional Assumption Identification The choice probability is: P im (ɛ in ɛ im < V im V in )f (ɛ im )dɛ im Some algebraic manipulation of this integral results in a succinct, closed form expression: P im Pr(y i m) e V im J J1 ev ij e x i β m J J1 ex i β J It is convenient to code the outcomes as j 0, 1,..., J so there are J + 1 alternatives in this notation. In the current set up, the ˆβ m are unidentified. For any vector of constants q, we find the same probabilities whether we use β m or β where β β m + q. We could add an arbitrary constant to all the coefficients in the model, yet get the same probabilities. which is the logit choice probability. (See Train, 2003, 78-9 for this derivation.) 19 / / 47
6 Identification Identification Consider the following example with 3 choices: P(y i m) e x(β 1+q) e x(β k +q) e xβ 1 e xq e x(β1+q) + e x(β2+q) + e x(β 3+q) e xβ 1 e xq e xβ 1 e xq + e xβ 2 e xq + e xβ 3 e xq e xβ 1 e xq ( e xβ k )e xq e xβ 1 e xβ k Therefore, the model cannot distinguish the true parameters from the parameters plus an arbitrary constant. As with ordered response models, we need an assumption or normalization which will identify the parameters. A convenient normalization that solves the identification problem is to assume that one of the sets of coefficients (the coefficients for one of the choices) are all zero. 21 / / 47 Identification Identification Specifically, assume that all β 0 0 for category zero. More generally, for j 0, 1,..., J. P(y i 0) P(y i j) e x 0 e x 0 + J k1 exβ k J k1 exβ k e xβ j 1 + J k1 exβ k The first alternative becomes the reference category to which all of the results are compared. In this form, it is clear that when J 1, we have the binary logit as a special case of the multinomial logit: P(y i 1) e xβ e xβ 1 23 / / 47
7 Estimation Estimation Estimation of this model is relatively easy since the log-likelihood is globally concave. To specify the likelihood, first define d ij 1 if individual i chooses alternative j, and d ij 0 otherwise. This means there are J + 1 d ij s, each indicating a choice. Use these to select the appropriate terms in the likelihood function. As with ordered response models, there is a different probability expression for each selected outcome. The likelihood function for individual i is L i P d i0 0 P d i1 1 P d i P d ij J Since we assume these are independent, the joint likelihood is the product of the likelihood of each outcome: L N i1 P d i0 0 Pd i1 1 Pd i P d ij J 25 / / 47 Estimation Estimation The log-likelihood is lnl where β 0 0. N i1 m0 N i1 m0 J d im lnp m ( J d im ln e x i β m 1 + J J1 ex i β j ) Estimate the vote choice model in the 1992 Presidential election: vote i β m0 + β m1 Economy i + β m2 Democrat i +β m3 Republican i + β m4 Income i + ɛ im where vote i has three categories (Bush, Clinton, Perot). Use multinom in nnet library. 27 / / 47
8 Interpreting Coefficients Marginal Effects There are two sets of coefficients for each independent variable. The signs of coefficients can be interpreted in a direct manner. For example, a negative coefficient indicates the the independent variable reduces the probability of voting for a candidate compared to the baseline candidate. Statistical inference is done as usual. We can calculate the marginal effect of one continuous independent variable on the probabilities of the outcome categories. P m x k P m (β km P m (β m β) J P j β kj ) j0 This is the weighted sum of β k where the weights are the outcome probabilities. This tells us the effect on the probabilities of choosing m if a variable increases by small amount. 29 / / 47 Predicted Probabilities Odds Ratio Predicted probabilities can be computed with the following equation: ˆP m e x ˆβ m 1 + J j1 ex ˆβ j The values of the key independent variable change, while the other variables are held constant. Odds ratios are useful when you want to know the odds of choosing one alternative relative to the other. We first write: Ω mn (x i ) P im P in where Ω mn (x i ) is the odds of outcome m versus outcome n given x i. x i includes all independent variables. 31 / / 47
9 Odds Ratio We continue: Ω mn (x i ) P im P in ex i β m e x i β n e x i [β m β n] e x i βm J J1 ex i β J e x i βn J J1 ex i β J An individual with characteristics specified in x i is e x i [β m β n] more likely to choose m over n. If you want to use the odds ratio as opposed to the baseline category, the equation is simplified to: Ω m1 (x i ) e x i β m An individual with characteristics specified in x i is e x i β m more likely to choose m over the baseline category. 33 / 47 Odds Ratio You can assess how a change in a particular independent variable affects the odds ratio of m to the baseline category. The effect is computed by: Ω m1 (x i, x ik + δ) Ω m1 (x i, x ik ) e β km δ where x ik is the k th independent variable for individual i and β km is the coefficient associated with the k th independent variable for alternative m. For a change of δ in x ik, the odds of outcome m versus the baseline category are expected to change by a factor of e β km δ, holding all other variables constant. The factor change in the odds for a change in x ik does not depend on the level of x ik or on the level of any other variable. 34 / 47 Conditional Logit Model Conditional Logit Model In the MNL model, each explanatory variable denotes individual-specific characteristics and has a different effect on each outcome. The utility for the MNL model is expressed as U im x i β m + ɛ im The conditional logit model is slightly different from MNL since it considers the impact of choice-specific attributes instead of individual-specific attributes. The utility for the CL model is written as U im z im γ + ɛ im where z im denotes a vector of choice-specific attributes. In the case of the vote choice model in 1992, z im would be a perceived candidate trait, for example. Importantly, the parameters are not choice-specific attributes; there is only one for each attribute. In the three-candidate race, the utilities are expressed as Bush: U i1 β 1 honesty i1 + ɛ i1 Clinton: U i2 β 1 honesty i2 + ɛ i2 Perot: U i3 β 1 honesty i3 + ɛ i3 The utility gets larger when perceived honesty increases. 35 / / 47
10 Data for Conditional Logit Model Conditional Logit Model outcome i outcome chosen honesty age The probability that individual i chooses alternative m in the CL model is e z imγ Pr(y i m) J J1 ez ij γ which should be compared to the MNL model: where β 1 0. Pr(y i m) e x i β m J J1 ex i β J 37 / / 47 (Mixed) Conditional Logit Model (Mixed) Conditional Logit Model It is possible to include both individual-specific and choice-specific attributes in the model. The utility is given by U im x i β m + z im γ + ɛ im where x i contains individual-specific attributes for individual i and z im contains choice-specific attributes for outcome m. The probability that individual i chooses alternative m is: where β 1 0. P(y i m) ex i β m+z im γ J J1 ex i β j +z ij γ 39 / / 47
11 Interpretation Independence of Irrelevant Alternatives You can interpret the coefficients in exactly the same way as you do in the MNL model for the individual-specific variables. For choice-specific variables, the signs of the coefficients indicate how an increase in z affects the likelihood that the individual chooses one alternative. You can also use the same techniques (e.g., predicted probabilities) to make an interpretation. In the multinomial logit model, the equation for the odds of m versus n is P(y i m) P(y i n) exi βm e x i β n evim e V in This equation indicates that the odds are determined without reference to the other outcomes that might be available. This property is called as the independence of irrelevant alternatives or IIA. This is a consequence of assuming independence of ɛ ij in the random utility model. 41 / / 47 Independence of Irrelevant Alternatives Independence of Irrelevant Alternatives Think about McFadden s famous example. A person has two choices for commuting to work: a private car that is chosen with P(car) 1/2 and a red bus with P(red bus) 1/2. The implied odds of taking the car versus the red bus is 1. Suppose a new bus company is started that is identical the current service except that the buses are blue. IIA requires that the new probabilities are P(car) 1/3, P(red bus).1/3, and P(blue bus) 1/3. This is necessary so that the odds of a car versus a red bus remain 1. However, if the only thing to distinguish the new bus service from the old is the color of the bus, we would not expect car travelers to start taking the bus (i.e., the utility does not change). Instead, the share of red bus riders would be split, resulting in P(car) 1/2 P(red bus) 1/4, and P(blue bus) 1/4. The new, implied odds for car versus red bus are 2 1/2 1/4, which violates the IIA assumption! The IIA assumption requires that if a new alternative becomes available, then all probabilities for the prior choices must adjust in precisely the amount necessary to retain the original odds among all pairs of outcomes. 43 / / 47
12 Independence of Irrelevant Alternatives Testing IIA We assumed the the disturbances were distributed identically and independently according to Type 1 Extreme Value distribution. The violation of IIA indicates that the errors ɛ ij are not independent across alternatives j. The non-independence causes us to overestimate the probability of choosing alternatives that are similar to each other. A Hausman-type test is available to assess the property of IIA. If the IIA property holds, then the parameter estimates obtained on the subset of alternatives will not be significantly different from those obtained on the full set of alternatives. 45 / / 47 Testing IIA The Hausman test proceeds as follows: 1 Estimate coefficients ˆβ F and covariance matrix ˆV F with all J alternatives. 2 Estimate coefficients ˆβ R and covariance matrix ˆV R with reduced alternatives. 3 Compare both estimates based on Hausman statistic: ( ˆβ R ˆβ F ) [ ˆV R ˆV F ] 1 ( ˆβ R ˆβ F ) which follows χ 2 distribution with k degrees of freedom where k is the number of elements in the β vector. 4 If the test statistic is larger than a critical value, we reject the null hypothesis that the IIA property holds. See Fry and Harris (1998) for alternative tests. 47 / 47
Goals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2. Recap: MNL. Recap: MNL
Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 2 Tetsuya Matsubayashi University of North Texas November 9, 2010 Learn multiple responses models that do not require the assumption
More informationPOLI 7050 Spring 2008 February 27, 2008 Unordered Response Models I
POLI 7050 Spring 2008 February 27, 2008 Unordered Response Models I Introduction For the next couple weeks we ll be talking about unordered, polychotomous dependent variables. Examples include: Voter choice
More informationdisc choice5.tex; April 11, ffl See: King - Unifying Political Methodology ffl See: King/Tomz/Wittenberg (1998, APSA Meeting). ffl See: Alvarez
disc choice5.tex; April 11, 2001 1 Lecture Notes on Discrete Choice Models Copyright, April 11, 2001 Jonathan Nagler 1 Topics 1. Review the Latent Varible Setup For Binary Choice ffl Logit ffl Likelihood
More informationDiscrete Choice Models I
Discrete Choice Models I 1 Introduction A discrete choice model is one in which decision makers choose among a set of alternatives. 1 To fit within a discrete choice framework, the set of alternatives
More informationLecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo
Lecture 1 Behavioral Models Multinomial Logit: Power and limitations Cinzia Cirillo 1 Overview 1. Choice Probabilities 2. Power and Limitations of Logit 1. Taste variation 2. Substitution patterns 3. Repeated
More informationLimited Dependent Variable Models II
Limited Dependent Variable Models II Fall 2008 Environmental Econometrics (GR03) LDV Fall 2008 1 / 15 Models with Multiple Choices The binary response model was dealing with a decision problem with two
More informationLecture-20: Discrete Choice Modeling-I
Lecture-20: Discrete Choice Modeling-I 1 In Today s Class Introduction to discrete choice models General formulation Binary choice models Specification Model estimation Application Case Study 2 Discrete
More informationProbabilistic Choice Models
Econ 3: James J. Heckman Probabilistic Choice Models This chapter examines different models commonly used to model probabilistic choice, such as eg the choice of one type of transportation from among many
More informationA short introduc-on to discrete choice models
A short introduc-on to discrete choice models BART Kenneth Train, Discrete Choice Models with Simula-on, Chapter 3. Ques-ons Impact of cost, commu-ng -me, walk -me, transfer -me, number of transfers, distance
More informationI. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as
Econ 513, USC, Fall 2005 Lecture 15 Discrete Response Models: Multinomial, Conditional and Nested Logit Models Here we focus again on models for discrete choice with more than two outcomes We assume that
More informationThe Logit Model: Estimation, Testing and Interpretation
The Logit Model: Estimation, Testing and Interpretation Herman J. Bierens October 25, 2008 1 Introduction to maximum likelihood estimation 1.1 The likelihood function Consider a random sample Y 1,...,
More informationIntroduction to Discrete Choice Models
Chapter 7 Introduction to Dcrete Choice Models 7.1 Introduction It has been mentioned that the conventional selection bias model requires estimation of two structural models, namely the selection model
More informationProbabilistic Choice Models
Probabilistic Choice Models James J. Heckman University of Chicago Econ 312 This draft, March 29, 2006 This chapter examines dierent models commonly used to model probabilistic choice, such as eg the choice
More informationSingle-level Models for Binary Responses
Single-level Models for Binary Responses Distribution of Binary Data y i response for individual i (i = 1,..., n), coded 0 or 1 Denote by r the number in the sample with y = 1 Mean and variance E(y) =
More informationAn Overview of Choice Models
An Overview of Choice Models Dilan Görür Gatsby Computational Neuroscience Unit University College London May 08, 2009 Machine Learning II 1 / 31 Outline 1 Overview Terminology and Notation Economic vs
More informationApplied Health Economics (for B.Sc.)
Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative
More informationLatent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent
Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary
More informationThe 17 th Behavior Modeling Summer School
The 17 th Behavior Modeling Summer School September 14-16, 2017 Introduction to Discrete Choice Models Giancarlos Troncoso Parady Assistant Professor Urban Transportation Research Unit Department of Urban
More informationWeek 7: Binary Outcomes (Scott Long Chapter 3 Part 2)
Week 7: (Scott Long Chapter 3 Part 2) Tsun-Feng Chiang* *School of Economics, Henan University, Kaifeng, China April 29, 2014 1 / 38 ML Estimation for Probit and Logit ML Estimation for Probit and Logit
More informationLinear Regression With Special Variables
Linear Regression With Special Variables Junhui Qian December 21, 2014 Outline Standardized Scores Quadratic Terms Interaction Terms Binary Explanatory Variables Binary Choice Models Standardized Scores:
More informationFixed Effects Models for Panel Data. December 1, 2014
Fixed Effects Models for Panel Data December 1, 2014 Notation Use the same setup as before, with the linear model Y it = X it β + c i + ɛ it (1) where X it is a 1 K + 1 vector of independent variables.
More informationPOLI 7050 Spring 2008 March 5, 2008 Unordered Response Models II
POLI 7050 Spring 2008 March 5, 2008 Unordered Response Models II Introduction Today we ll talk about interpreting MNL and CL models. We ll start with general issues of model fit, and then get to variable
More informationMaximum Likelihood and. Limited Dependent Variable Models
Maximum Likelihood and Limited Dependent Variable Models Michele Pellizzari IGIER-Bocconi, IZA and frdb May 24, 2010 These notes are largely based on the textbook by Jeffrey M. Wooldridge. 2002. Econometric
More informationh=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio
Multivariate Response Models The response variable is unordered and takes more than two values. The term unordered refers to the fact that response 3 is not more favored than response 2. One choice from
More informationEconometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit
Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables
More informationModels of Qualitative Binary Response
Models of Qualitative Binary Response Probit and Logit Models October 6, 2015 Dependent Variable as a Binary Outcome Suppose we observe an economic choice that is a binary signal. The focus on the course
More informationBinary Logistic Regression
The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b
More informationReview of Statistics
Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and
More informationChapter 3 Choice Models
Chapter 3 Choice Models 3.1 Introduction This chapter describes the characteristics of random utility choice model in a general setting, specific elements related to the conjoint choice context are given
More informationBinary choice. Michel Bierlaire
Binary choice Michel Bierlaire Transport and Mobility Laboratory School of Architecture, Civil and Environmental Engineering Ecole Polytechnique Fédérale de Lausanne M. Bierlaire (TRANSP-OR ENAC EPFL)
More informationBayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence
Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns
More informationLecture 6: Discrete Choice: Qualitative Response
Lecture 6: Instructor: Department of Economics Stanford University 2011 Types of Discrete Choice Models Univariate Models Binary: Linear; Probit; Logit; Arctan, etc. Multinomial: Logit; Nested Logit; GEV;
More informationEcon 673: Microeconometrics
Econ 673: Microeconometrics Chapter 4: Properties of Discrete Choice Models Fall 2008 Herriges (ISU) Chapter 4: Discrete Choice Models Fall 2008 1 / 29 Outline 1 2 Deriving Choice Probabilities 3 Identification
More informationNinth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"
Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric
More informationEconometric Analysis of Games 1
Econometric Analysis of Games 1 HT 2017 Recap Aim: provide an introduction to incomplete models and partial identification in the context of discrete games 1. Coherence & Completeness 2. Basic Framework
More informationSCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models
SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION
More informationStandard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j
Standard Errors & Confidence Intervals β β asy N(0, I( β) 1 ), where I( β) = [ 2 l(β, φ; y) ] β i β β= β j We can obtain asymptotic 100(1 α)% confidence intervals for β j using: β j ± Z 1 α/2 se( β j )
More informationInstrumental Variables and the Problem of Endogeneity
Instrumental Variables and the Problem of Endogeneity September 15, 2015 1 / 38 Exogeneity: Important Assumption of OLS In a standard OLS framework, y = xβ + ɛ (1) and for unbiasedness we need E[x ɛ] =
More information2. We care about proportion for categorical variable, but average for numerical one.
Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is
More informationSTAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression
STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression Rebecca Barter April 20, 2015 Fisher s Exact Test Fisher s Exact Test
More informationUQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables
UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables To be provided to students with STAT2201 or CIVIL-2530 (Probability and Statistics) Exam Main exam date: Tuesday, 20 June 1
More informationReview of Multinomial Distribution If n trials are performed: in each trial there are J > 2 possible outcomes (categories) Multicategory Logit Models
Chapter 6 Multicategory Logit Models Response Y has J > 2 categories. Extensions of logistic regression for nominal and ordinal Y assume a multinomial distribution for Y. 6.1 Logit Models for Nominal Responses
More informationBias Variance Trade-off
Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]
More informationA Bayesian Probit Model with Spatial Dependencies
A Bayesian Probit Model with Spatial Dependencies Tony E. Smith Department of Systems Engineering University of Pennsylvania Philadephia, PA 19104 email: tesmith@ssc.upenn.edu James P. LeSage Department
More informationStatistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018
Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical
More informationComparing IRT with Other Models
Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used
More informationMultiple regression: Categorical dependent variables
Multiple : Categorical Johan A. Elkink School of Politics & International Relations University College Dublin 28 November 2016 1 2 3 4 Outline 1 2 3 4 models models have a variable consisting of two categories.
More informationAdvanced Quantitative Methods: limited dependent variables
Advanced Quantitative Methods: Limited Dependent Variables I University College Dublin 2 April 2013 1 2 3 4 5 Outline Model Measurement levels 1 2 3 4 5 Components Model Measurement levels Two components
More informationMaximum Likelihood Methods
Maximum Likelihood Methods Some of the models used in econometrics specify the complete probability distribution of the outcomes of interest rather than just a regression function. Sometimes this is because
More informationReview of One-way Tables and SAS
Stat 504, Lecture 7 1 Review of One-way Tables and SAS In-class exercises: Ex1, Ex2, and Ex3 from http://v8doc.sas.com/sashtml/proc/z0146708.htm To calculate p-value for a X 2 or G 2 in SAS: http://v8doc.sas.com/sashtml/lgref/z0245929.htmz0845409
More information[y i α βx i ] 2 (2) Q = i=1
Least squares fits This section has no probability in it. There are no random variables. We are given n points (x i, y i ) and want to find the equation of the line that best fits them. We take the equation
More informationPanel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63
1 / 63 Panel Data Models Chapter 5 Financial Econometrics Michael Hauser WS17/18 2 / 63 Content Data structures: Times series, cross sectional, panel data, pooled data Static linear panel data models:
More informationCONTINUOUS RANDOM VARIABLES
the Further Mathematics network www.fmnetwork.org.uk V 07 REVISION SHEET STATISTICS (AQA) CONTINUOUS RANDOM VARIABLES The main ideas are: Properties of Continuous Random Variables Mean, Median and Mode
More informationPart I Behavioral Models
Part I Behavioral Models 2 Properties of Discrete Choice Models 2.1 Overview This chapter describes the features that are common to all discrete choice models. We start by discussing the choice set, which
More informationOutline. The binary choice model. The multinomial choice model. Extensions of the basic choice model
Outline The binary choice model Illustration Specification of the binary choice model Interpreting the results of binary choice models ME output The multinomial choice model Illustration Specification
More informationCan a Pseudo Panel be a Substitute for a Genuine Panel?
Can a Pseudo Panel be a Substitute for a Genuine Panel? Min Hee Seo Washington University in St. Louis minheeseo@wustl.edu February 16th 1 / 20 Outline Motivation: gauging mechanism of changes Introduce
More informationItem Response Theory for Conjoint Survey Experiments
Item Response Theory for Conjoint Survey Experiments Devin Caughey Hiroto Katsumata Teppei Yamamoto Massachusetts Institute of Technology PolMeth XXXV @ Brigham Young University July 21, 2018 Conjoint
More informationClassification. Chapter Introduction. 6.2 The Bayes classifier
Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode
More informationLecture 5: Spatial probit models. James P. LeSage University of Toledo Department of Economics Toledo, OH
Lecture 5: Spatial probit models James P. LeSage University of Toledo Department of Economics Toledo, OH 43606 jlesage@spatial-econometrics.com March 2004 1 A Bayesian spatial probit model with individual
More informationInstrumental Variables
Instrumental Variables Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Noncompliance in Experiments Stat186/Gov2002 Fall 2018 1 / 18 Instrumental Variables
More informationLecture 11. Probability Theory: an Overveiw
Math 408 - Mathematical Statistics Lecture 11. Probability Theory: an Overveiw February 11, 2013 Konstantin Zuev (USC) Math 408, Lecture 11 February 11, 2013 1 / 24 The starting point in developing the
More informationReview of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.
Panel GLMs Department of Political Science and Government Aarhus University May 12, 2015 1 Review of Panel Data 2 Model Types 3 Review and Looking Forward 1 Review of Panel Data 2 Model Types 3 Review
More informationLecture 14: Introduction to Poisson Regression
Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why
More informationModelling counts. Lecture 14: Introduction to Poisson Regression. Overview
Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week
More informationNevo on Random-Coefficient Logit
Nevo on Random-Coefficient Logit March 28, 2003 Eric Rasmusen Abstract Overheads for logit demand estimation for G604. These accompany the Nevo JEMS article. Indiana University Foundation Professor, Department
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science
UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator
More informationLogit Regression and Quantities of Interest
Logit Regression and Quantities of Interest Stephen Pettigrew March 4, 2015 Stephen Pettigrew Logit Regression and Quantities of Interest March 4, 2015 1 / 57 Outline 1 Logistics 2 Generalized Linear Models
More informationEC402 - Problem Set 3
EC402 - Problem Set 3 Konrad Burchardi 11th of February 2009 Introduction Today we will - briefly talk about the Conditional Expectation Function and - lengthily talk about Fixed Effects: How do we calculate
More information1 Hotz-Miller approach: avoid numeric dynamic programming
1 Hotz-Miller approach: avoid numeric dynamic programming Rust, Pakes approach to estimating dynamic discrete-choice model very computer intensive. Requires using numeric dynamic programming to compute
More informationUC Berkeley Math 10B, Spring 2015: Midterm 2 Prof. Sturmfels, April 9, SOLUTIONS
UC Berkeley Math 10B, Spring 2015: Midterm 2 Prof. Sturmfels, April 9, SOLUTIONS 1. (5 points) You are a pollster for the 2016 presidential elections. You ask 0 random people whether they would vote for
More informationTruncation and Censoring
Truncation and Censoring Laura Magazzini laura.magazzini@univr.it Laura Magazzini (@univr.it) Truncation and Censoring 1 / 35 Truncation and censoring Truncation: sample data are drawn from a subset of
More informationEconometrics for PhDs
Econometrics for PhDs Amine Ouazad April 2012, Final Assessment - Answer Key 1 Questions with a require some Stata in the answer. Other questions do not. 1 Ordinary Least Squares: Equality of Estimates
More informationOrdinary Least Squares Regression
Ordinary Least Squares Regression Goals for this unit More on notation and terminology OLS scalar versus matrix derivation Some Preliminaries In this class we will be learning to analyze Cross Section
More informationIntroduction to mixtures in discrete choice models
Introduction to mixtures in discrete choice models p. 1/42 Introduction to mixtures in discrete choice models Michel Bierlaire michel.bierlaire@epfl.ch Transport and Mobility Laboratory Introduction to
More informationINTRODUCTION TO TRANSPORTATION SYSTEMS
INTRODUCTION TO TRANSPORTATION SYSTEMS Lectures 5/6: Modeling/Equilibrium/Demand 1 OUTLINE 1. Conceptual view of TSA 2. Models: different roles and different types 3. Equilibrium 4. Demand Modeling References:
More informationSummary of Chapters 7-9
Summary of Chapters 7-9 Chapter 7. Interval Estimation 7.2. Confidence Intervals for Difference of Two Means Let X 1,, X n and Y 1, Y 2,, Y m be two independent random samples of sizes n and m from two
More informationApplied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid
Applied Economics Regression with a Binary Dependent Variable Department of Economics Universidad Carlos III de Madrid See Stock and Watson (chapter 11) 1 / 28 Binary Dependent Variables: What is Different?
More informationFinal Exam. Economics 835: Econometrics. Fall 2010
Final Exam Economics 835: Econometrics Fall 2010 Please answer the question I ask - no more and no less - and remember that the correct answer is often short and simple. 1 Some short questions a) For each
More informationChapter 11. Regression with a Binary Dependent Variable
Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score
More informationComments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006
Comments on: Panel Data Analysis Advantages and Challenges Manuel Arellano CEMFI, Madrid November 2006 This paper provides an impressive, yet compact and easily accessible review of the econometric literature
More informationGeneralized logit models for nominal multinomial responses. Local odds ratios
Generalized logit models for nominal multinomial responses Categorical Data Analysis, Summer 2015 1/17 Local odds ratios Y 1 2 3 4 1 π 11 π 12 π 13 π 14 π 1+ X 2 π 21 π 22 π 23 π 24 π 2+ 3 π 31 π 32 π
More informationLecture notes to Chapter 11, Regression with binary dependent variables - probit and logit regression
Lecture notes to Chapter 11, Regression with binary dependent variables - probit and logit regression Tore Schweder October 28, 2011 Outline Examples of binary respons variables Probit and logit - examples
More informationStatistical Tests. Matthieu de Lapparent
Statistical Tests Matthieu de Lapparent matthieu.delapparent@epfl.ch Transport and Mobility Laboratory, School of Architecture, Civil and Environmental Engineering, Ecole Polytechnique Fédérale de Lausanne
More informationSyllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools
Syllabus By Joan Llull Microeconometrics. IDEA PhD Program. Fall 2017 Chapter 1: Introduction and a Brief Review of Relevant Tools I. Overview II. Maximum Likelihood A. The Likelihood Principle B. The
More informationWISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A
WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2015-16 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This
More informationMarketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)
Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Remember: Z.05 = 1.645, Z.01 = 2.33 We will only cover one-sided hypothesis testing (cases 12.3, 12.4.2, 12.5.2,
More informationHomework 1 Solutions
36-720 Homework 1 Solutions Problem 3.4 (a) X 2 79.43 and G 2 90.33. We should compare each to a χ 2 distribution with (2 1)(3 1) 2 degrees of freedom. For each, the p-value is so small that S-plus reports
More informationCHAPTER 5. Logistic regression
CHAPTER 5 Logistic regression Logistic regression is the standard way to model binary outcomes (that is, data y i that take on the values 0 or 1). Section 5.1 introduces logistic regression in a simple
More informationIntroduction to Estimation Methods for Time Series models. Lecture 1
Introduction to Estimation Methods for Time Series models Lecture 1 Fulvio Corsi SNS Pisa Fulvio Corsi Introduction to Estimation () Methods for Time Series models Lecture 1 SNS Pisa 1 / 19 Estimation
More informationOrdered Response and Multinomial Logit Estimation
Ordered Response and Multinomial Logit Estimation Quantitative Microeconomics R. Mora Department of Economics Universidad Carlos III de Madrid Outline Introduction 1 Introduction 2 3 Introduction The Ordered
More informationGibbs Sampling in Latent Variable Models #1
Gibbs Sampling in Latent Variable Models #1 Econ 690 Purdue University Outline 1 Data augmentation 2 Probit Model Probit Application A Panel Probit Panel Probit 3 The Tobit Model Example: Female Labor
More informationStatistical Analysis of the Item Count Technique
Statistical Analysis of the Item Count Technique Kosuke Imai Department of Politics Princeton University Joint work with Graeme Blair May 4, 2011 Kosuke Imai (Princeton) Item Count Technique UCI (Statistics)
More informationSimple Linear Regression
Simple Linear Regression ST 430/514 Recall: A regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates)
More informationLimited Dependent Variables and Panel Data
and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement
More informationECON 594: Lecture #6
ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was
More informationBivariate Distributions
STAT/MATH 395 A - PROBABILITY II UW Winter Quarter 17 Néhémy Lim Bivariate Distributions 1 Distributions of Two Random Variables Definition 1.1. Let X and Y be two rrvs on probability space (Ω, A, P).
More informationThe Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen
The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen January 23-24, 2012 Page 1 Part I The Single Level Logit Model: A Review Motivating Example Imagine we are interested in voting
More informationEstimation of mixed generalized extreme value models
Estimation of mixed generalized extreme value models Michel Bierlaire michel.bierlaire@epfl.ch Operations Research Group ROSO Institute of Mathematics EPFL Katholieke Universiteit Leuven, November 2004
More informationStatistics 203 Introduction to Regression Models and ANOVA Practice Exam
Statistics 203 Introduction to Regression Models and ANOVA Practice Exam Prof. J. Taylor You may use your 4 single-sided pages of notes This exam is 7 pages long. There are 4 questions, first 3 worth 10
More informationWhat can we learn about correlations from multinomial probit estimates?
What can we learn about correlations from multinomial probit estimates? Chiara Monfardini J.M.C. Santos Silva February 2006 Abstract It is well known that, in a multinomial probit, only the covariance
More information