Economics 472. Lecture 16. Binary Dependent Variable Models
|
|
- Aubrie Crawford
- 5 years ago
- Views:
Transcription
1 University of Illinois Fall 998 Department of Economics Roger Koenker Economics 472 Lecture 6 Binary Dependent Variable Models Let's begin with a model for an observed proportion, or frequency. We would like to explain variation in the proportion p i as a function of covariates x i.we could simply specify that p i = x 0 i æ + error and run it as OLS regression. But this has certain problems. For example, we might ænd that ^p i =2 ë0; ë: So we typically consider transformations gèp i è=x 0 i æ + error where g is usually called the ëlink" function. A typical example of g is the logit function gèpè = logitèpè =logèp=, pè this corresponds to the logistic df. The transformation may be seen to induce a certain degree of heteroscedasticity into the model. Suppose each observation ^p i is based on a moderately large sample of n i observations with ^p i! p i : We may then use the æ-method to compute the variability of logitè^p i è, so V ègè^p i èè = èg 0 èp i èè 2 V è^p i è gèpè = logèp=è, pèè ç g 0 èpè =, p d p æ p dp, p V è^p i è = p iè, p i è n i V èlogitè^p i èè = n i p i è, p i è ç = pè, pè Thus GLS would suggest running the weighted regression of logitè^p i èonx i with weights n i p i è, p i è. Of course, we could, based on considerations so far, replace logitè^p i èwithany other quantile-type transformation from ë0,ë to jr. For example, we might use æ, è^p i è in which case the same logic suggests regressing æ, è^p i èonx i with weights n iç 2 èæ, èp i èè p i è, p i è An immediate problem presents itself, however, if we would like to apply the foregoing to data in which some of the observed p i are either 0 or. Since the foregoing approach seems rather ad hoc any way based as it is an approximate normality of the ^p i wemightaswell leap in the briar patch of mle. But to keep things quite close to the
2 2 regression setting we will posit the following latent variable model. We posit the model for the latent èunobservedè variable y æ i and assume that the observed binary response variable y i is generated as, y æ i = so letting the df of u i be denoted by F, x0 i æ + u i ç if y æ y i = i é 0 0 otherwise P èy =è = P èu i é,x 0 i æè=, F è,x0 i æè P èy =0è = F è,x 0 i æè For F symmetric F èzè+f è,zè =sofèzè =fè,zè and we have P èy =è = F èx 0 i æè P èy =0è =, F èx 0 i æè and we may write the likelihood of seeing Y the sample fy i ;x i Y è: Lèæè = è, F èx 0 i æèè = i:y i =0 ny i= F y i i è, F i è,y i i:y i = i =;:::;ng as F èx 0 i æè Now we need to make somechoice of F. There are several popular choices: èiè: Logit p = F èzè = ez è +e R logèp=, pè =z so Ey z i = p i = F èx i æè è logitèp i è=x 0 i æ z èiiè: Probit F èzè =æèzè =, çèxèdx æ, èpè =x 0 i æ èiiiè: Cauchy F èzè + 2 ç, tan, èzè F, èpè =tanèçèèp, èè = 2 x0 i æ: èivè: Complementary log log èvè: log-log F, èpè =logè, logè, pèè = x 0 i æ F, èpè =, logè, logèpèè = x i æ
3 probablility scale log-log logit scale probit logit cauchy -4 c-loglog -6 Figure. Comparison of æve link functions: The horizontal axis is on the logistic scale so the logit link appears as the 45 degree line. Symmetry around the y-axis indicates symmetry of the distribution corresponding with the link as in the logit, probit and Cauchy cases. The log-log forms are asymmetric in this respect. Note that while the probit and logit are quite similar the Cauchy link is much more long tailed.
4 4 ëèplots of link functions for binary dep models postscriptè"fig.ps",horizontal=f,width=6.5,height=6.5, font=7,pointsize=2è eps_.2 u_è:999èè000 plotèlogèuèè-uèè,logèuèè-uèè,type="l",axes=f, xlab="",ylab=""è tics_cè.00,.0,.è tics_cètics,-ticsè ytics_0*tics segmentsèlogèticsèè-ticsèè,ytics,logèticsèè-ticsèè,ytics+epsè textèlogèticsèè-ticsèè,ytics+3*eps,pasteèformatèroundètics,3èèèè textèlogèticsë2ëèè-ticsë2ëèè,.3,"probablility scale"è tics_cè2,4,6è tics_cètics,-ticsè ytics_0*tics segmentsètics,ytics,tics,ytics-epsè textètics,ytics-3*eps,pasteèformatèroundèticsèèèè textèticsë2ë,-.3,"logit scale"è segmentsè-eps,ytics,eps,yticsè textè-3*eps,tics,pasteèformatèroundèticsèèèè ablineèh=0è ablineèv=0è linesèlogèuèè-uèè,qnormèuè,lty=2è linesèlogèuèè-uèè,logè-logè-uèè,lty=3è linesèlogèuèè-uèè,-logè-logèuèè,lty=4è linesèlogèuèè-uèè,tanèpi*èu-.5èè,lty=5è textè-cè6,6,5,5,2è,-cè,3,4,6,4è,cè"log-log","probit","logit","c-loglog","cauchy"èè frameèè
5 5 In regression we are used to the idea that Interpretation of the = æ i provided we really have a linear model in x i, but under our symmetry assumption here the situation is slightly more complicated. Now, so now for logit we have so while for probit we have Eèy i jx i è=æ P èy i =è+0æ P èy i =0è=F èx i = fèx 0 æèæ j ez F èzè = +e z fèzè =F èzèè, F èzèè fèzè =çèzè = p 2ç e,z2 =2 and for Cauchy fèzè = çè + zè 2 We can compare these, for example, at z =0whereweget factor from fè0è logit è4 probit p = 2ç Cauchy =ç and roughly speaking the whole ^æ-vector should scale by these factors so e.g., 4 ælogit j ç p b probit j 2ç so æ logit j ç :60æ probit j Diagnostic For the Logistic Link Function Let gèpè = logitèpè in the usual one observation per cell logit model, and suppose we've ætted the model logitèp i è=xæ
6 6 but we'd like to know if there is some more general form for the density which works better. Pregibon suggests, following Box-Cox, gèpè = pæ,æ, æ, æ, è, pèæ,æ, æ + æ note as æ; æ! 0weget = log p, logè, pè = logèp=, pè: æ =0è symmetry, æ governs fatness of tails. Expanding g in æ; æ we get èwith diligenceè gèpè = g 0 èpè+æg æ 0 èpè+æg æ 0èpè g æ 0 èpè = 2 ëlog2 èpè, log 2 è, pèë g æ 0èpè =, 2 ëlog2 èpè + log 2 è, pèë LM tests of signiæcance of g æ ;g æ ; in an expanded model in which we include g æè^pè 0 andgæ 0 è^pè where these variables are constructed from a preliminary logistic regression, can be used to evaluate the reasonableness of the logit speciæcation. Semiparametric Methods for Binary Choice Models It is worthwhile to explore what happens when we relax the assumptions of the prior analysis, in particular the assumptions that a.è we know the form of the df F, and b.è that the u i 's in the latent variable formulation are iid. Recall that in ordinary linear regression we can justify OLS methods with the minimal assumption that u is mean independent of the covariates x, i.e., that Eèujxè = 0. We will see that this condition is not suæcient to identify the parameters æ in the latent variable form of the binary choice model. The following example is taken from Horowitz è998è. Suppose we have the simple logistic model, where u i is iid logistic, i.e., has df y æ i = x i æ + u i F èuè ==è + e,u è It is clear that multiplying the latent variable equation through by ç levels observable choices unchanged, so the ærst observation about identiæcation in this model is that we can only identify æ ëup to scale". This is essentially the reason we are entitled to impose the assumption that u has a df with known scale. Now let æ be another parameter vector such that æ 6= çæ for any choice of the scalar ç: It is easy to construct new random variables, say v, whose dfs will now depend upon x, and for which èæè F vjx èx 0 æè==è + expè,xæèè
7 and Eèvjxè=0 Thus, æ and the v's would generate the same observable probabilities as æ and the u's and both would have mean independent errors with respect to x. The argument is most easily seen by drawing a picture. Suppose we have the original èæ; uè model with nice logistic densities at each x, the and a line representing xæ and we could imagine recentering the logistic densities so that they were centered with respect to the xæ line. Now on the left side of the picture imagine stretching the right tail of the density until the mean matches xæ, similarly we can stretch the left tail an the right side of the picture í as long as the stretching doesn't move mass across the xæ line è*è is satisæed. What this shows is that mean independence is the wrong idea for thinking about binary choice models. What is appropriate? The example illustrates that the right concept is median independence. As long as medianèujxè =0 we do get identiæcation under two rather mild conditions. A simple way toseehow to exploit this is to recall that under the general quantile regression model, Q y èçjxè xæ equivariance to monotone transformations implies that for the rather drastic transformation Ièx é0è we have Q Ièyé0èèçjxè =Ièxæ é 0è but Ièy é0è is just the observable binary variable so this suggests the following estimation strategy X ç I èy i, Ièx i æé0èè min jjæjj= where y i is the binary variable. This problem is rather tricky computationally but it has a natural interpretation í we want tochose æ so that as often as possible Ièx i æé0è predicts correctly. Manski è975è introduced this idea under the rather unfortunate name ëmaximum score" estimator, writing it as X è2yi, èè2ièx 0 iæ ç 0è, è max jjæjj= In this form we try to maximize the number of matches, rather than minimizing the number of unmatches but the two problems are equivalent. The large sample theory of this estimator is rather complicated, but an interesting aspect of the quantile regression formulation is that it enables us by estimating the model for various values of ç to explore the problem of ëheteroscedasticity" in the binary choice model. 7 Discrete Choice Models í Some Theory The theory of discrete choice has a long history in both psychology and economics. McFadden's version of the Thurstone è927è model may be very concisely expressed as follows:
8 8 m choices y æ i = çutility ofi th choice if y æ y i = i =maxfy æ ;:::;yæ mg 0 otherwise Suppose we can express the ëutility of the i th choice" as, y æ i = vèx i è+u i where x i is a vector of attributes of the i th choice, and fu i g are iid draws from some df F. Note that in contrast to classical economic models of choice, here utility has a random component. This randomness has an important role to play, because it allows us to develop simple models with ëcommon tastes" in which not everyone make exactly the same choices. Thm. Iftheu i are iid with F èuè =F èu i éuè=e,e,u, then P èy i =jx i è= ev i P e v i where v i ç vèx i è. Remark. F èæè is often called the Type extreme value distribution. Proof. y æ i =maxfçg è u i + v i év j + v j for all j 6= i or u j éu i + v i, v j. So, conditioning on u i and then integrating with respect to the marginal density ofu i, P èy æ i =jx iè= Z Y F èu i + v i, v j èfèu i èdu i Note if F èuè takes the hypothesized form, then fèuè =e,e,u æ e,u = e,u,e,u so Y i F èu i + v i, v j èfèu i è = Y j expè, expè,u i, v i + v j èè expè,u i, expè,u i èè = expè,u i, e,u i è + X j6=i e v j e v i èè let so P èy æ i =jx iè = X ç i = logè + e v j =e v i X m è=logè e v j =e v i è j6=i j= Z expè,u i, e,èu i,ç iè èdu i Z = e,ç i expè,~u i, e,~u i èd~u i ~u i = u i, ç i = e,ç i = e v i P n j= ev j Extensions: y æ ij = utility ofi th person for j th choice y æ ij + x ij æ + z i æ + u ij
9 9 Then assuming u ij iid F yields p ij = P èy ij =jx ij;zi è= P e x ij æ+z i æ ex ijæ+z i æ E.g., here x ij is a vector of choice speciæc individual characteristics like travel time to work by thei th mode of transport, and z i is a vector of individual characteristics, like income, age, etc. Critique of IIA í Independence of Irrelevant Alternatives Luce derived aversion of the above model from the assumption that the odds of choosing alternatives i and j shouldn't depend on the characteristics of a 3 rd alternative k. Clearly here P i = P èy i =è P j P èy j =è = evi e v j which is independent ofv k. One should resist the temptation to relate this to similarly named concepts in the theory of voting. For some purchases this is a desirable feature of choice model, but in other circumstances it might be considered a ëbug." Debreu in a famous critique of the IIA property suggested that it might be unreasonable to think that the choice between car and bus transportation would be invariant to the introduction of a new form of bus which diæered from the original one only in terms of color. In this red-bus-blue-bus example we would expect that the draws of u i 's for the two bus modes would be highly correlated, not independent. Recently, there has been considerable interest in multinomial probit models of this type in which correlation can be easily incorporated. References Pregibon, D. è980è Goodness of link tests for generalized linear models, Applied Stat, 29, 5-24.
P.B. Stark. January 29, 1998
Statistics 210B, Spring 1998 Class Notes P.B. Stark stark@stat.berkeley.edu www.stat.berkeley.eduèçstarkèindex.html January 29, 1998 Second Set of Notes 1 More on Testing and Conædence Sets See Lehmann,
More informationEconometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit
Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables
More informationh=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio
Multivariate Response Models The response variable is unordered and takes more than two values. The term unordered refers to the fact that response 3 is not more favored than response 2. One choice from
More informationLecture 6: Discrete Choice: Qualitative Response
Lecture 6: Instructor: Department of Economics Stanford University 2011 Types of Discrete Choice Models Univariate Models Binary: Linear; Probit; Logit; Arctan, etc. Multinomial: Logit; Nested Logit; GEV;
More informationEconomics Midterm Answer Key. Q1 èiè In this question we have a Marshallian demand function with arguments Cèp;mè =
Economics 7 997 Midterm Answer Key PART A Q èiè In this question we have a Marshallian demand function with arguments Cè;mè = Cè; w; w Lè. We can determine this function from the solution to max fc;lg
More informationThe method of teepet decent i probably the bet known procedure for ænding aymptotic behavior of integral of the form Z è1è Ièè = gèzè e f èzè dz; C wh
UNIFORM ASYMPTOTIC EXPANSIONS R. Wong Department of Mathematic City Univerity of Hong Kong Tat Chee Ave Kowloon, Hong Kong for NATOèASI Special Function 2000 1 The method of teepet decent i probably the
More informationalso has x æ as a local imizer. Of course, æ is typically not known, but an algorithm can approximate æ as it approximates x æ èas the augmented Lagra
Introduction to sequential quadratic programg Mark S. Gockenbach Introduction Sequential quadratic programg èsqpè methods attempt to solve a nonlinear program directly rather than convert it to a sequence
More informationGoals. PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1. Multinomial Dependent Variable. Random Utility Model
Goals PSCI6000 Maximum Likelihood Estimation Multiple Response Model 1 Tetsuya Matsubayashi University of North Texas November 2, 2010 Random utility model Multinomial logit model Conditional logit model
More informationEconomics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation
University of Illinois Fall 2016 Department of Economics Roger Koenker Economics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation The simplest of this general class of models is Tobin s (1958)
More informationECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria
ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria SOLUTION TO FINAL EXAM Friday, April 12, 2013. From 9:00-12:00 (3 hours) INSTRUCTIONS:
More informationIntroduction to Discrete Choice Models
Chapter 7 Introduction to Dcrete Choice Models 7.1 Introduction It has been mentioned that the conventional selection bias model requires estimation of two structural models, namely the selection model
More informationdisc choice5.tex; April 11, ffl See: King - Unifying Political Methodology ffl See: King/Tomz/Wittenberg (1998, APSA Meeting). ffl See: Alvarez
disc choice5.tex; April 11, 2001 1 Lecture Notes on Discrete Choice Models Copyright, April 11, 2001 Jonathan Nagler 1 Topics 1. Review the Latent Varible Setup For Binary Choice ffl Logit ffl Likelihood
More informationECON 594: Lecture #6
ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was
More informationLimited Dependent Variable Models II
Limited Dependent Variable Models II Fall 2008 Environmental Econometrics (GR03) LDV Fall 2008 1 / 15 Models with Multiple Choices The binary response model was dealing with a decision problem with two
More informationA Scaled Diæerence Chi-square Test Statistic. Albert Satorra. Universitat Pompeu Fabra. and. Peter M. Bentler. August 3, 1999
A Scaled Diæerence Chi-square Test Statistic for Moment Structure Analysis æ Albert Satorra Universitat Pompeu Fabra and Peter M. Bentler University of California, Los Angeles August 3, 1999 æ Research
More information2 I. Pritsker and R. Varga Wesay that the triple èg; W;æè has the rational approximation property if, iè iiè for any f èzè which is analytic in G and
Rational approximation with varying weights in the complex plane Igor E. Pritsker and Richard S. Varga Abstract. Given an open bounded set G in the complex plane and a weight function W èzè which is analytic
More informationMultinomial Discrete Choice Models
hapter 2 Multinomial Discrete hoice Models 2.1 Introduction We present some discrete choice models that are applied to estimate parameters of demand for products that are purchased in discrete quantities.
More informationGeneralized Linear Models for Non-Normal Data
Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture
More informationProbabilistic Choice Models
Probabilistic Choice Models James J. Heckman University of Chicago Econ 312 This draft, March 29, 2006 This chapter examines dierent models commonly used to model probabilistic choice, such as eg the choice
More informationSome Preliminary Market Research: A Googoloscopy. Parametric Links for Binary Response. Some Preliminary Market Research: A Googoloscopy
Some Preliminary Market Research: A Googoloscopy Parametric Links for Binary Response Roger Koenker and Jungmo Yoon University of Illinois, Urbana-Champaign UseR! 2006 Link GoogleHits Logit 2,800,000 Probit
More informationMaximum Likelihood and. Limited Dependent Variable Models
Maximum Likelihood and Limited Dependent Variable Models Michele Pellizzari IGIER-Bocconi, IZA and frdb May 24, 2010 These notes are largely based on the textbook by Jeffrey M. Wooldridge. 2002. Econometric
More informationContinuity of Bçezier patches. Jana Pçlnikovça, Jaroslav Plaçcek, Juraj ç Sofranko. Faculty of Mathematics and Physics. Comenius University
Continuity of Bezier patches Jana Plnikova, Jaroslav Placek, Juraj Sofranko Faculty of Mathematics and Physics Comenius University Bratislava Abstract The paper is concerned about the question of smooth
More informationProbabilistic Choice Models
Econ 3: James J. Heckman Probabilistic Choice Models This chapter examines different models commonly used to model probabilistic choice, such as eg the choice of one type of transportation from among many
More informationECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2014 Instructor: Victor Aguirregabiria
ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2014 Instructor: Victor guirregabiria SOLUTION TO FINL EXM Monday, pril 14, 2014. From 9:00am-12:00pm (3 hours) INSTRUCTIONS:
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More informationIn the previous chapters we have presented synthesis methods for optimal H 2 and
Chapter 8 Robust performance problems In the previous chapters we have presented synthesis methods for optimal H 2 and H1 control problems, and studied the robust stabilization problem with respect to
More informationSPRING Final Exam. May 5, 1999
IE 230: PROBABILITY AND STATISTICS IN ENGINEERING SPRING 1999 Final Exam May 5, 1999 NAME: Show all your work. Make sure that any notation you use is clear and well deæned. For example, if you use a new
More informationEconomics 536 Lecture 20 Binary Response Models
University of Illinois Fall 2016 Department of Economics Roger Koenker Economics 536 Lecture 20 Binary Response Models Let s begin with a model for an observed proportion, or frequency. We would like to
More informationEconomics 536 Lecture 7. Introduction to Specification Testing in Dynamic Econometric Models
University of Illinois Fall 2016 Department of Economics Roger Koenker Economics 536 Lecture 7 Introduction to Specification Testing in Dynamic Econometric Models In this lecture I want to briefly describe
More informationwhere A and B are constants of integration, v t = p g=k is the terminal velocity, g is the acceleration of gravity, and m is the mass of the projectil
Homework 5 Physics 14 Methods in Theoretical Physics Due: Wednesday February 16, 4 Reading Assignment: Thornton and Marion, Ch..4. 1. è1d motion, Newtonian resistance.è In class, we considered the motion
More informationAnalysis of Categorical Data. Nick Jackson University of Southern California Department of Psychology 10/11/2013
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/2013 1 Overview Data Types Contingency Tables Logit Models Binomial Ordinal Nominal 2 Things not
More informationThe total current injected in the system must be zero, therefore the sum of vector entries B j j is zero, è1;:::1èb j j =: Excluding j by means of è1è
1 Optimization of networks 1.1 Description of electrical networks Consider an electrical network of N nodes n 1 ;:::n N and M links that join them together. Each linkl pq = l k joints the notes numbered
More informationStat 642, Lecture notes for 04/12/05 96
Stat 642, Lecture notes for 04/12/05 96 Hosmer-Lemeshow Statistic The Hosmer-Lemeshow Statistic is another measure of lack of fit. Hosmer and Lemeshow recommend partitioning the observations into 10 equal
More informationChapter 11. Regression with a Binary Dependent Variable
Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score
More informationLinear Regression With Special Variables
Linear Regression With Special Variables Junhui Qian December 21, 2014 Outline Standardized Scores Quadratic Terms Interaction Terms Binary Explanatory Variables Binary Choice Models Standardized Scores:
More informationSingle-level Models for Binary Responses
Single-level Models for Binary Responses Distribution of Binary Data y i response for individual i (i = 1,..., n), coded 0 or 1 Denote by r the number in the sample with y = 1 Mean and variance E(y) =
More informationECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2016 Instructor: Victor Aguirregabiria
ECOOMETRICS II (ECO 24S) University of Toronto. Department of Economics. Winter 26 Instructor: Victor Aguirregabiria FIAL EAM. Thursday, April 4, 26. From 9:am-2:pm (3 hours) ISTRUCTIOS: - This is a closed-book
More informationNon-linear panel data modeling
Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1
More informationParametric identification of multiplicative exponential heteroskedasticity ALYSSA CARLSON
Parametric identification of multiplicative exponential heteroskedasticity ALYSSA CARLSON Department of Economics, Michigan State University East Lansing, MI 48824-1038, United States (email: carls405@msu.edu)
More informationSemiparametric Generalized Linear Models
Semiparametric Generalized Linear Models North American Stata Users Group Meeting Chicago, Illinois Paul Rathouz Department of Health Studies University of Chicago prathouz@uchicago.edu Liping Gao MS Student
More informationWorking Paper No Maximum score type estimators
Warsaw School of Economics Institute of Econometrics Department of Applied Econometrics Department of Applied Econometrics Working Papers Warsaw School of Economics Al. iepodleglosci 64 02-554 Warszawa,
More informationLecture-20: Discrete Choice Modeling-I
Lecture-20: Discrete Choice Modeling-I 1 In Today s Class Introduction to discrete choice models General formulation Binary choice models Specification Model estimation Application Case Study 2 Discrete
More informationI. Multinomial Logit Suppose we only have individual specific covariates. Then we can model the response probability as
Econ 513, USC, Fall 2005 Lecture 15 Discrete Response Models: Multinomial, Conditional and Nested Logit Models Here we focus again on models for discrete choice with more than two outcomes We assume that
More informationEconomics 620, Lecture 19: Introduction to Nonparametric and Semiparametric Estimation
Economics 620, Lecture 19: Introduction to Nonparametric and Semiparametric Estimation Nicholas M. Kiefer Cornell University Professor N. M. Kiefer (Cornell University) Lecture 19: Nonparametric Analysis
More informationFORM 1 Physics 240 í Fall Exam 1 CONTI COULTER GRAFE SKALSEY ZORN. Please æll in the information requested on the scantron answer sheet
FORM 1 Physics 240 í Fall 1999 Exam 1 Name: Discussion section instructor and time: ècircle ONE TIMEè CONTI COULTER GRAFE SKALSEY ZORN MW 10-11 èè23è MW 8-9 èè4è MW 2-3 èè7è MW 11-12 èè6è MW 9-10 èè12è
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationChapter 3 Choice Models
Chapter 3 Choice Models 3.1 Introduction This chapter describes the characteristics of random utility choice model in a general setting, specific elements related to the conjoint choice context are given
More informationSTAT331. Cox s Proportional Hazards Model
STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations
More informationISQS 5349 Spring 2013 Final Exam
ISQS 5349 Spring 2013 Final Exam Name: General Instructions: Closed books, notes, no electronic devices. Points (out of 200) are in parentheses. Put written answers on separate paper; multiple choices
More informationFinal Exam. Name: Solution:
Final Exam. Name: Instructions. Answer all questions on the exam. Open books, open notes, but no electronic devices. The first 13 problems are worth 5 points each. The rest are worth 1 point each. HW1.
More information2. We care about proportion for categorical variable, but average for numerical one.
Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is
More informationThe Logit Model: Estimation, Testing and Interpretation
The Logit Model: Estimation, Testing and Interpretation Herman J. Bierens October 25, 2008 1 Introduction to maximum likelihood estimation 1.1 The likelihood function Consider a random sample Y 1,...,
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 12: Logistic regression (v1) Ramesh Johari ramesh.johari@stanford.edu Fall 2015 1 / 30 Regression methods for binary outcomes 2 / 30 Binary outcomes For the duration of this
More informationLECTURE 17. Algorithms for Polynomial Interpolation
LECTURE 7 Algorithms for Polynomial Interpolation We have thus far three algorithms for determining the polynomial interpolation of a given set of data.. Brute Force Method. Set and solve the following
More informationproblem of detection naturally arises in technical diagnostics, where one is interested in detecting cracks, corrosion, or any other defect in a sampl
In: Structural and Multidisciplinary Optimization, N. Olhoæ and G. I. N. Rozvany eds, Pergamon, 1995, 543í548. BOUNDS FOR DETECTABILITY OF MATERIAL'S DAMAGE BY NOISY ELECTRICAL MEASUREMENTS Elena CHERKAEVA
More informationInstructions: Closed book, notes, and no electronic devices. Points (out of 200) in parentheses
ISQS 5349 Final Spring 2011 Instructions: Closed book, notes, and no electronic devices. Points (out of 200) in parentheses 1. (10) What is the definition of a regression model that we have used throughout
More informationWhat s New in Econometrics? Lecture 14 Quantile Methods
What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression
More information2 2. EQUATIONS AND ALGORITHMS 2.1. WAITING ELEMENTS Brittle-elastic bar. Consider a stretched rod from a homogeneous elastic-brittle material. If load
DYNAMICS OF DAMAGE IN TWO-DIMENSIONAL STRUCTURES WITH WAITING LINKS Andrej Cherkaev and Liya Zhornitskaya Keywords: Dynamics of damage, Failure, Structures. Abstract The paper deals with simulation of
More informationSpatial inference. Spatial inference. Accounting for spatial correlation. Multivariate normal distributions
Spatial inference I will start with a simple model, using species diversity data Strong spatial dependence, Î = 0.79 what is the mean diversity? How precise is our estimate? Sampling discussion: The 64
More informationDiscrete Dependent Variable Models
Discrete Dependent Variable Models James J. Heckman University of Chicago This draft, April 10, 2006 Here s the general approach of this lecture: Economic model Decision rule (e.g. utility maximization)
More informationW 1 æw 2 G + 0 e? u K y Figure 5.1: Control of uncertain system. For MIMO systems, the normbounded uncertainty description is generalized by assuming
Chapter 5 Robust stability and the H1 norm An important application of the H1 control problem arises when studying robustness against model uncertainties. It turns out that the condition that a control
More informationThe Generalized Roy Model and Treatment Effects
The Generalized Roy Model and Treatment Effects Christopher Taber University of Wisconsin November 10, 2016 Introduction From Imbens and Angrist we showed that if one runs IV, we get estimates of the Local
More informationReferences. Regression standard errors in clustered samples. diagonal matrix with elements
Stata Technical Bulletin 19 diagonal matrix with elements ( q=ferrors (0) if r > 0 W ii = (1 q)=f errors (0) if r < 0 0 otherwise and R 2 is the design matrix X 0 X. This is derived from formula 3.11 in
More informationconsistency is faster than the usual T 1=2 consistency rate. In both cases more general error distributions were considered as well. Consistency resul
LIKELIHOOD ANALYSIS OF A FIRST ORDER AUTOREGRESSIVE MODEL WITH EPONENTIAL INNOVATIONS By B. Nielsen & N. Shephard Nuæeld College, Oxford O1 1NF, UK bent.nielsen@nuf.ox.ac.uk neil.shephard@nuf.ox.ac.uk
More informationSimple Estimators for Semiparametric Multinomial Choice Models
Simple Estimators for Semiparametric Multinomial Choice Models James L. Powell and Paul A. Ruud University of California, Berkeley March 2008 Preliminary and Incomplete Comments Welcome Abstract This paper
More informationEPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7
Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review
More informationAn Overview of Choice Models
An Overview of Choice Models Dilan Görür Gatsby Computational Neuroscience Unit University College London May 08, 2009 Machine Learning II 1 / 31 Outline 1 Overview Terminology and Notation Economic vs
More informationSTA216: Generalized Linear Models. Lecture 1. Review and Introduction
STA216: Generalized Linear Models Lecture 1. Review and Introduction Let y 1,..., y n denote n independent observations on a response Treat y i as a realization of a random variable Y i In the general
More informationLecture 8. Roy Model, IV with essential heterogeneity, MTE
Lecture 8. Roy Model, IV with essential heterogeneity, MTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Heterogeneity When we talk about heterogeneity, usually we mean heterogeneity
More informationModels, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley
Models, Testing, and Correction of Heteroskedasticity James L. Powell Department of Economics University of California, Berkeley Aitken s GLS and Weighted LS The Generalized Classical Regression Model
More informationLecture 1. Introduction. The importance, ubiquity, and complexity of embedded systems are growing
Lecture 1 Introduction Karl Henrik Johansson The importance, ubiquity, and complexity of embedded systems are growing tremendously thanks to the revolution in digital technology. This has created a need
More informationTesting and Model Selection
Testing and Model Selection This is another digression on general statistics: see PE App C.8.4. The EViews output for least squares, probit and logit includes some statistics relevant to testing hypotheses
More informationGeneralized linear models
Generalized linear models Douglas Bates November 01, 2010 Contents 1 Definition 1 2 Links 2 3 Estimating parameters 5 4 Example 6 5 Model building 8 6 Conclusions 8 7 Summary 9 1 Generalized Linear Models
More informationLecture 3: Multiple Regression
Lecture 3: Multiple Regression R.G. Pierse 1 The General Linear Model Suppose that we have k explanatory variables Y i = β 1 + β X i + β 3 X 3i + + β k X ki + u i, i = 1,, n (1.1) or Y i = β j X ji + u
More informationProblem Set 3: Bootstrap, Quantile Regression and MCMC Methods. MIT , Fall Due: Wednesday, 07 November 2007, 5:00 PM
Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods MIT 14.385, Fall 2007 Due: Wednesday, 07 November 2007, 5:00 PM 1 Applied Problems Instructions: The page indications given below give you
More informationMaximum Likelihood, Logistic Regression, and Stochastic Gradient Training
Maximum Likelihood, Logistic Regression, and Stochastic Gradient Training Charles Elkan elkan@cs.ucsd.edu January 17, 2013 1 Principle of maximum likelihood Consider a family of probability distributions
More informationClassification. Chapter Introduction. 6.2 The Bayes classifier
Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode
More information,..., θ(2),..., θ(n)
Likelihoods for Multivariate Binary Data Log-Linear Model We have 2 n 1 distinct probabilities, but we wish to consider formulations that allow more parsimonious descriptions as a function of covariates.
More informationMultivariate Regression: Part I
Topic 1 Multivariate Regression: Part I ARE/ECN 240 A Graduate Econometrics Professor: Òscar Jordà Outline of this topic Statement of the objective: we want to explain the behavior of one variable as a
More informationIDENTIFICATION OF THE BINARY CHOICE MODEL WITH MISCLASSIFICATION
IDENTIFICATION OF THE BINARY CHOICE MODEL WITH MISCLASSIFICATION Arthur Lewbel Boston College December 19, 2000 Abstract MisclassiÞcation in binary choice (binomial response) models occurs when the dependent
More informationLECTURE 12. Special Solutions of Laplace's Equation. 1. Separation of Variables with Respect to Cartesian Coordinates. Suppose.
50 LECTURE 12 Special Solutions of Laplace's Equation 1. Sepaation of Vaiables with Respect to Catesian Coodinates Suppose è12.1è èx; yè =XèxèY èyè is a solution of è12.2è Then we must have è12.3è 2 x
More informationR. Koenker Spring 2017 Economics 574 Problem Set 1
R. Koenker Spring 207 Economics 574 Problem Set.: Suppose X, Y are random variables with joint density f(x, y) = x 2 + xy/3 x [0, ], y [0, 2]. Find: (a) the joint df, (b) the marginal density of X, (c)
More informationCorrelation and Regression
Correlation and Regression October 25, 2017 STAT 151 Class 9 Slide 1 Outline of Topics 1 Associations 2 Scatter plot 3 Correlation 4 Regression 5 Testing and estimation 6 Goodness-of-fit STAT 151 Class
More informationK. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij =
K. Model Diagnostics We ve already seen how to check model assumptions prior to fitting a one-way ANOVA. Diagnostics carried out after model fitting by using residuals are more informative for assessing
More information8 Nominal and Ordinal Logistic Regression
8 Nominal and Ordinal Logistic Regression 8.1 Introduction If the response variable is categorical, with more then two categories, then there are two options for generalized linear models. One relies on
More informationLecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo
Lecture 1 Behavioral Models Multinomial Logit: Power and limitations Cinzia Cirillo 1 Overview 1. Choice Probabilities 2. Power and Limitations of Logit 1. Taste variation 2. Substitution patterns 3. Repeated
More informationExtensions to the Basic Framework II
Topic 7 Extensions to the Basic Framework II ARE/ECN 240 A Graduate Econometrics Professor: Òscar Jordà Outline of this topic Nonlinear regression Limited Dependent Variable regression Applications of
More informationLast few slides from last time
Last few slides from last time Example 3: What is the probability that p will fall in a certain range, given p? Flip a coin 50 times. If the coin is fair (p=0.5), what is the probability of getting an
More informationLoglikelihood and Confidence Intervals
Stat 504, Lecture 2 1 Loglikelihood and Confidence Intervals The loglikelihood function is defined to be the natural logarithm of the likelihood function, l(θ ; x) = log L(θ ; x). For a variety of reasons,
More informationA short introduc-on to discrete choice models
A short introduc-on to discrete choice models BART Kenneth Train, Discrete Choice Models with Simula-on, Chapter 3. Ques-ons Impact of cost, commu-ng -me, walk -me, transfer -me, number of transfers, distance
More informationApplied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid
Applied Economics Regression with a Binary Dependent Variable Department of Economics Universidad Carlos III de Madrid See Stock and Watson (chapter 11) 1 / 28 Binary Dependent Variables: What is Different?
More informationAnswer all questions from part I. Answer two question from part II.a, and one question from part II.b.
B203: Quantitative Methods Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. Part I: Compulsory Questions. Answer all questions. Each question carries
More informationNew Developments in Econometrics Lecture 16: Quantile Estimation
New Developments in Econometrics Lecture 16: Quantile Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. Review of Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile
More informationComparing IRT with Other Models
Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used
More informationA Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,
A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type
More informationChapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a
Chapter 9 Regression with a Binary Dependent Variable Multiple Choice ) The binary dependent variable model is an example of a a. regression model, which has as a regressor, among others, a binary variable.
More informationBayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence
Bayesian Inference in GLMs Frequentists typically base inferences on MLEs, asymptotic confidence limits, and log-likelihood ratio tests Bayesians base inferences on the posterior distribution of the unknowns
More informationA Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008
A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated
More informationeasy to make g by æ èp,1è=q where æ generates Zp. æ We can use a secure prime modulus p such that èp, 1è=2q is also prime or each prime factor of èp,
Additional Notes to ëultimate Solution to Authentication via Memorable Password" May 1, 2000 version Taekyoung Kwon æ May 25, 2000 Abstract This short letter adds informative discussions to our previous
More informationGeneralized Models: Part 1
Generalized Models: Part 1 Topics: Introduction to generalized models Introduction to maximum likelihood estimation Models for binary outcomes Models for proportion outcomes Models for categorical outcomes
More information