Discrete Choice Modeling
|
|
- Gwendoline Rose
- 5 years ago
- Views:
Transcription
1 [Part 6] 1/55 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent Class 11 Mixed Logit 12 Stated Preference 13 Hybrid Choice William Greene Stern School of Business New York University
2 [Part 6] 2/55 Application: Major Derogatory Reports AmEx Credit Card Holders N = 1310 (of 13,777) Number of major derogatory reports in 1 year Issues: Nonrandom selection Excess zeros
3 [Part 6] 3/55 Histogram for Credit Data Histogram for MAJORDRG NOBS= 1310, Too low: 0, Too high: 0 Bin Lower limit Upper limit Frequency Cumulative Frequency ======================================================================== (.8038) 1053(.8038) (.1038) 1189(.9076) (.0382) 1239(.9458) (.0183) 1263(.9641) (.0130) 1280(.9771) (.0076) 1290(.9847) (.0038) 1295(.9885) (.0046) 1301(.9931) (.0000) 1301(.9931) (.0015) 1303(.9947) (.0008) 1304(.9954) (.0031) 1308(.9985) (.0008) 1309(.9992) (.0000) 1309(.9992) (.0008) 1310(1.0000)
4 [Part 6] 4/55
5 [Part 6] 5/55 Doctor Visits
6 [Part 6] 6/55 Basic Modeling for Counts of Events E.g., Visits to site, number of purchases, number of doctor visits Regression approach Quantitative outcome measured Discrete variable, model probabilities Poisson probabilities loglinear model Prob[Y = j x ] = λ = exp( β'x i ) = E[y i i i i exp(-λ i)λ j! x ] i j i
7 Poisson Model for Doctor Visits [Part 6] 7/ Poisson Regression Dependent variable DOCVIS Log likelihood function Restricted log likelihood Chi squared [ 6 d.f.] Significance level McFadden Pseudo R-squared Estimation based on N = 27326, K = 7 Information Criteria: Normalization=1/N Normalized Unnormalized AIC Chi- squared = RsqP=.0818 G - squared = RsqD=.0601 Overdispersion tests: g=mu(i) : Overdispersion tests: g=mu(i)^2: Variable Coefficient Standard Error b/st.er. P[ Z >z] Mean of X Constant.77267*** AGE.01763*** EDUC *** FEMALE.29287*** MARRIED HHNINC *** HHKIDS ***
8 [Part 6] 8/55 Partial Effects Partial derivatives of expected val. with respect to the vector of characteristics. Effects are averaged over individuals. Observations used for means are All Obs. Conditional Mean at Sample Point Scale Factor for Marginal Effects E[y x x Variable Coefficient Standard Error b/st.er. P[ Z >z] Mean of X AGE.05613*** EDUC *** FEMALE.93237*** MARRIED HHNINC *** HHKIDS *** i i i ] = λ β i
9 [Part 6] 9/55 Poisson Model Specification Issues Equi-Dispersion: Var[y i x i ] = E[y i x i ]. Overdispersion: If i = exp[ x i + ε i ], E[y i x i ] = γexp[ x i ] Var[y i ] > E[y i ] (overdispersed) ε i ~ log-gamma Negative binomial model ε i ~ Normal[0, 2 ] Normal-mixture model ε i is viewed as unobserved heterogeneity ( frailty ). Normal model may be more natural. Estimation is a bit more complicated.
10 [Part 6] 10/55 exp(-λ i)λ Prob[Y i = j xi] = j! λ = exp( β'x ) = E[y i i i Moment Equations : x ] Estimation: Nonlinear Least Squares: i 1 i 1 Inefficient but robust if nonpoisson Maximum Likelihood: Moment Equations : i N j i Min Max N i 1 y N y x i i i i i 1 i i 2 log log( y!) Efficient, also robust to some kinds of NonPoissonness y N i i i x y i i i i
11 Poisson Model for Doctor Visits [Part 6] 11/ Poisson Regression Dependent variable DOCVIS Log likelihood function Restricted log likelihood Chi squared [ 6 d.f.] Significance level McFadden Pseudo R-squared Estimation based on N = 27326, K = 7 Information Criteria: Normalization=1/N Normalized Unnormalized AIC Chi- squared = RsqP=.0818 G - squared = RsqD=.0601 Overdispersion tests: g=mu(i) : Overdispersion tests: g=mu(i)^2: Variable Coefficient Standard Error b/st.er. P[ Z >z] Mean of X Constant.77267*** AGE.01763*** EDUC *** FEMALE.29287*** MARRIED HHNINC *** HHKIDS ***
12 Variable Coefficient Standard Error b/st.er. P[ Z >z] Mean of X Standard Negative Inverse of Second Derivatives Constant.77267*** AGE.01763*** EDUC *** FEMALE.29287*** MARRIED HHNINC *** HHKIDS *** Robust Sandwich Constant.77267*** AGE.01763*** EDUC *** FEMALE.29287*** MARRIED HHNINC *** HHKIDS *** Cluster Correction Constant.77267*** AGE.01763*** EDUC *** FEMALE.29287*** MARRIED HHNINC *** HHKIDS *** [Part 6] 12/55
13 [Part 6] 13/55 Negative Binomial Specification Prob(Y i =j x i ) has greater mass to the right and left of the mean Conditional mean function is the same as the Poisson: E[y i x i ] = λ i =Exp( x i ), so marginal effects have the same form. Variance is Var[y i x i ] = λ i (1 + α λ i ), α is the overdispersion parameter; α = 0 reverts to the Poisson. Poisson is consistent when NegBin is appropriate. Therefore, this is a case for the ROBUST covariance matrix estimator. (Neglected heterogeneity that is uncorrelated with x i.)
14 NegBin Model for Doctor Visits [Part 6] 14/ Negative Binomial Regression Dependent variable DOCVIS Log likelihood function NegBin LogL Restricted log likelihood Poisson LogL Chi squared [ 1 d.f.] Reject Poisson model Significance level McFadden Pseudo R-squared Estimation based on N = 27326, K = 8 Information Criteria: Normalization=1/N Normalized Unnormalized AIC NegBin form 2; Psi(i) = theta Variable Coefficient Standard Error b/st.er. P[ Z >z] Mean of X Constant.80825*** AGE.01806*** EDUC *** FEMALE.32596*** MARRIED HHNINC *** HHKIDS *** Dispersion parameter for count data model Alpha ***
15 [Part 6] 15/55 Marginal Effects Scale Factor for Marginal Effects POISSON Variable Coefficient Standard Error b/st.er. P[ Z >z] Mean of X AGE.05613*** EDUC *** FEMALE.93237*** MARRIED HHNINC *** HHKIDS *** Scale Factor for Marginal Effects NEGATIVE BINOMIAL AGE.05767*** EDUC *** FEMALE *** MARRIED HHNINC *** HHKIDS ***
16 [Part 6] 16/55
17 [Part 6] 17/55
18 Model Formulations [Part 6] 18/55 Poisson exp( i) i Prob[ Y yi xi], (1 y ) exp( x ), y 0,1,..., i 1,..., N i i i E[ y x ] Var[ y x ] i i i i y i E[y i x i ]=λ i
19 NegBin-P Model [Part 6] 19/ Negative Binomial (P) Model Dependent variable DOCVIS Log likelihood function Restricted log likelihood Chi squared [ 1 d.f.] Variable Coefficient Standard Error b/st.er. NB-2 NB-1 Poisson Constant.60840*** AGE.01710*** EDUC *** FEMALE.36386*** MARRIED.03670* HHNINC *** HHKIDS *** Dispersion parameter for count data model Alpha *** Negative Binomial. General form, NegBin P P ***
20 [Part 6] 20/55
21 [Part 6] 21/55 Zero Inflation ZIP Models Two regimes: (Recreation site visits) Zero (with probability 1). (Never visit site) Poisson with Pr(0) = exp[- x i ]. (Number of visits, including zero visits this season.) Unconditional: Pr[0] = P(regime 0) + P(regime 1)*Pr[0 regime 1] Pr[j j >0] = P(regime 1)*Pr[j regime 1] Two inflation Number of children These are latent class models
22 [Part 6] 22/55 Zero Inflation Models exp(-λ i)λi Prob(y i = j x i) =, λ i = exp( βx i) j! j Zero Inflation = ZIP Prob(0 regime) = F( γz ) i ZIP - tau = ZIP(τ) [Not generally used] Prob(0 regime) = F( βx ) i
23 [Part 6] 23/55 Notes on Zero Inflation Models Poisson is not nested in ZIP. γ = 0 in ZIP does not produce Poisson; it produces ZIP with P(regime 0) = ½. Standard tests are not appropriate Use Vuong statistic. ZIP model almost always wins. Zero Inflation models extend to NB models ZINB is a standard model Creates two sources of overdispersion Generally difficult to estimate
24 ZIP Model [Part 6] 24/ Zero Altered Poisson Regression Model Logistic distribution used for splitting model. ZAP term in probability is F[tau x Z(i) ] Comparison of estimated models Pr[0 means] Number of zeros Log-likelihood Poisson Act.= Prd.= Z.I.Poisson Act.= Prd.= Vuong statistic for testing ZIP vs. unaltered model is Distributed as standard normal. A value greater than favors the zero altered Z.I.Poisson model. A value less than rejects the ZIP model. Variable Coefficient Standard Error b/st.er. P[ Z >z] Mean of X Poisson/NB/Gamma regression model Constant *** AGE.01100*** EDUC *** FEMALE.10943*** MARRIED *** HHNINC *** HHKIDS *** Zero inflation model Constant *** FEMALE *** EDUC.04114***
25 [Part 6] 25/55 Marginal Effects for Different Models Scale Factor for Marginal Effects POISSON Variable Coefficient Standard Error b/st.er. P[ Z >z] Mean of X AGE.05613*** EDUC *** FEMALE.93237*** MARRIED HHNINC *** HHKIDS *** Scale Factor for Marginal Effects NEGATIVE BINOMIAL - 2 AGE.05767*** EDUC *** FEMALE *** MARRIED HHNINC *** HHKIDS *** Scale Factor for Marginal Effects ZERO INFLATED POISSON AGE.03427*** EDUC *** FEMALE.97958*** MARRIED *** HHNINC *** HHKIDS ***
26 [Part 6] 26/55 Zero Inflation Models Zero Inflation = ZIP exp(-λ i)λi Prob(y i = j x i) =, λ i = exp( βxi) j! Prob(0 regime) = F( γz ) i j
27 An Unidentified ZINB Model [Part 6] 27/55
28 [Part 6] 28/55
29 [Part 6] 29/55
30 [Part 6] 30/55 Partial Effects for Different Models
31 [Part 6] 31/55 The Vuong Statistic for Nonnested Models Model 0: logl = logf (y x, ) = m i,0 0 i i 0 i,0 Model 0 is the Zero Inflation Model Model 1: logl = logf (y x, ) = m i,1 1 i i 1 i,1 Model 1 is the Poisson model (Not nested. =0 implies the splitting probability is 1/2, not 1) 0 i i 0 Define ai mi,0 mi,1 log f 1 (y i x i, 1 ) [a] V s / n a f (y x, ) 1 n f 0(y i x i, 0) n i 1 log n f 1(y i x i, 1) 1 n f 0(y i x i, 0) f 0(y i x i, 0) i 1 log log n 1 f 1(y i x i, 1) f 1(y i x i, 1) Limiting distribution is standard normal. Large + favors model 0, large - favors model 1, < V < 1.96 is inconclusive. 2
32 [Part 6] 32/55
33 [Part 6] 33/55 Two part model: A Hurdle Model Model 1: Probability model for more than zero occurrences Model 2: Model for number of occurrences given that the number is greater than zero. Applications common in health economics Usage of health care facilities Use of drugs, alcohol, etc.
34 [Part 6] 34/55
35 [Part 6] 35/55 Hurdle Model Two Part Model Prob[y > 0] = F( γ'x) Prob[y=j] Prob[y=j] Prob[y = j y > 0] = = Prob[y>0] 1 Pr ob[y 0 x] A Poisson Hurdle Model with Logit Hurdle exp( γ'x) Prob[y>0]= 1+exp( γ'x ) j exp(- ) Prob[y=j y>0,x]=, =exp( β'x) j![1 exp(- )] F( γ'x)exp( β'x) E[y x] =0 Prob[y=0]+Prob[y>0] E[y y>0] = 1-exp[-exp( β'x )] Marginal effects involve both parts of the model.
36 [Part 6] 36/55 Hurdle Model for Doctor Visits
37 [Part 6] 37/55 Partial Effects
38 [Part 6] 38/55
39 [Part 6] 39/55
40 [Part 6] 40/55
41 [Part 6] 41/55 Application of Several of the Models Discussed in this Section
42 [Part 6] 42/55
43 [Part 6] 43/55 See also: van Ophem H Modeling selectivity in count data models. Journal of Business and Economic Statistics 18: Winkelmann finds that there is no correlation between the decisions A significant correlation is expected [T]he correlation comes from the way the relation between the decisions is modeled.
44 [Part 6] 44/55 Probit Participation Equation Poisson-Normal Intensity Equation
45 [Part 6] 45/55 Bivariate-Normal Heterogeneity in Participation and Intensity Equations Gaussian Copula for Participation and Intensity Equations
46 [Part 6] 46/55 Correlation between Heterogeneity Terms Correlation between Counts
47 [Part 6] 47/55 Panel Data Models Heterogeneity; λ it = exp(β x it + c i ) Fixed Effects Poisson: Standard, no incidental parameters issue Negative Binomial Hausman, Hall, Griliches (1984) put FE in variance, not the mean Use brute force to get a conventional FE model Random Effects Poisson Log-gamma heterogeneity becomes an NB model Contemporary treatments are using normal heterogeneity with simulation or quadrature based estimators NB with random effects is equivalent to two effects one time varying one time invariant. The model is probably overspecified Random Parameters: Mixed models, latent class models, hiererchical all extended to Poisson and NB
48 [Part 6] 48/55 Poisson (log)normal Mixture
49 [Part 6] 49/55
50 [Part 6] 50/55 A Peculiarity of the FENB Model True FE model has λ i =exp(α i +x it β). Cannot be fit if there are time invariant variables. Hausman, Hall and Griliches (Econometrica, 1984) has α i appearing in θ. Produces different results Implies that the FEM can contain time invariant variables.
51 [Part 6] 51/55
52 [Part 6] 52/55 See: Allison and Waterman (2002), Guimaraes (2007) Greene, Econometric Analysis (2012)
53 [Part 6] 53/55
54 [Part 6] 54/55 Bivariate Random Effects
55 [Part 6] 55/55
The 2010 Medici Summer School in Management Studies. William Greene Department of Economics Stern School of Business
The 2010 Medici Summer School in Management Studies William Greene Department of Economics Stern School of Business Econometric Models When There Are Unusual Events Part 5: Binary Outcomes Agenda General
More informationDiscrete Choice Modeling
[Part 4] 1/43 Discrete Choice Modeling 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 Count Data 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent
More informationMohammed. Research in Pharmacoepidemiology National School of Pharmacy, University of Otago
Mohammed Research in Pharmacoepidemiology (RIPE) @ National School of Pharmacy, University of Otago What is zero inflation? Suppose you want to study hippos and the effect of habitat variables on their
More informationSpatial Discrete Choice Models
Spatial Discrete Choice Models Professor William Greene Stern School of Business, New York University SPATIAL ECONOMETRICS ADVANCED INSTITUTE University of Rome May 23, 2011 Spatial Correlation Spatially
More informationParametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1
Parametric Modelling of Over-dispersed Count Data Part III / MMath (Applied Statistics) 1 Introduction Poisson regression is the de facto approach for handling count data What happens then when Poisson
More informationLecture-19: Modeling Count Data II
Lecture-19: Modeling Count Data II 1 In Today s Class Recap of Count data models Truncated count data models Zero-inflated models Panel count data models R-implementation 2 Count Data In many a phenomena
More informationChapter 11 The COUNTREG Procedure (Experimental)
Chapter 11 The COUNTREG Procedure (Experimental) Chapter Contents OVERVIEW................................... 419 GETTING STARTED.............................. 420 SYNTAX.....................................
More informationFunctional Form and Heterogeneity in Models for Count Data
Foundations and Trends R in Econometrics Vol. 1, No. 2 (2005) 113 218 c 2007 W. Greene DOI: 10.1561/0800000008 Functional Form and Heterogeneity in Models for Count Data William Greene Department of Economics,
More informationNon-linear panel data modeling
Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1
More informationEconometric Analysis of Panel Data. Final Examination: Spring 2013
Econometric Analysis of Panel Data Professor William Greene Phone: 212.998.0876 Office: KMC 7-90 Home page:www.stern.nyu.edu/~wgreene Email: wgreene@stern.nyu.edu URL for course web page: people.stern.nyu.edu/wgreene/econometrics/paneldataeconometrics.htm
More informationEconometric Analysis of Panel Data. Final Examination: Spring 2018
Department of Economics Econometric Analysis of Panel Data Professor William Greene Phone: 212.998.0876 Office: KMC 7-90 Home page: people.stern.nyu.edu/wgreene Email: wgreene@stern.nyu.edu URL for course
More informationInference and Regression
Name Inference and Regression Final Examination, 2015 Department of IOMS This course and this examination are governed by the Stern Honor Code. Instructions Please write your name at the top of this page.
More informationNELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation
NELS 88 Table 2.3 Adjusted odds ratios of eighth-grade students in 988 performing below basic levels of reading and mathematics in 988 and dropping out of school, 988 to 990, by basic demographics Variable
More informationDEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005
DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005 The lectures will survey the topic of count regression with emphasis on the role on unobserved heterogeneity.
More informationMODELING COUNT DATA Joseph M. Hilbe
MODELING COUNT DATA Joseph M. Hilbe Arizona State University Count models are a subset of discrete response regression models. Count data are distributed as non-negative integers, are intrinsically heteroskedastic,
More informationLab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p )
Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p. 376-390) BIO656 2009 Goal: To see if a major health-care reform which took place in 1997 in Germany was
More informationGeneralized Multilevel Models for Non-Normal Outcomes
Generalized Multilevel Models for Non-Normal Outcomes Topics: 3 parts of a generalized (multilevel) model Models for binary, proportion, and categorical outcomes Complications for generalized multilevel
More informationCARLETON ECONOMIC PAPERS
CEP 16-01 The Role of Model Specification in Estimating Health Care Demand Hossein Kavand Carleton University Marcel-Cristian Voia Carleton University January 2016 CARLETON ECONOMIC PAPERS Department of
More informationGeneralized Linear Models for Count, Skewed, and If and How Much Outcomes
Generalized Linear Models for Count, Skewed, and If and How Much Outcomes Today s Class: Review of 3 parts of a generalized model Models for discrete count or continuous skewed outcomes Models for two-part
More informationReview of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.
Panel GLMs Department of Political Science and Government Aarhus University May 12, 2015 1 Review of Panel Data 2 Model Types 3 Review and Looking Forward 1 Review of Panel Data 2 Model Types 3 Review
More informationLimited Dependent Variables and Panel Data
and Panel Data June 24 th, 2009 Structure 1 2 Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement
More informationLongitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois
Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 217, Chicago, Illinois Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control
More informationAnalysis of Count Data A Business Perspective. George J. Hurley Sr. Research Manager The Hershey Company Milwaukee June 2013
Analysis of Count Data A Business Perspective George J. Hurley Sr. Research Manager The Hershey Company Milwaukee June 2013 Overview Count data Methods Conclusions 2 Count data Count data Anything with
More informationGeneralized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.
Texts in Statistical Science Generalized Linear Mixed Models Modern Concepts, Methods and Applications Walter W. Stroup CRC Press Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint
More informationNinth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"
Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric
More informationTesting and Model Selection
Testing and Model Selection This is another digression on general statistics: see PE App C.8.4. The EViews output for least squares, probit and logit includes some statistics relevant to testing hypotheses
More informationLecture 3.1 Basic Logistic LDA
y Lecture.1 Basic Logistic LDA 0.2.4.6.8 1 Outline Quick Refresher on Ordinary Logistic Regression and Stata Women s employment example Cross-Over Trial LDA Example -100-50 0 50 100 -- Longitudinal Data
More informationPoisson Regression. Ryan Godwin. ECON University of Manitoba
Poisson Regression Ryan Godwin ECON 7010 - University of Manitoba Abstract. These lecture notes introduce Maximum Likelihood Estimation (MLE) of a Poisson regression model. 1 Motivating the Poisson Regression
More informationComparing IRT with Other Models
Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used
More informationECONOMETRICS I Take Home Final Examination
Department of Economics ECONOMETRICS I Take Home Final Examination Fall 2016 Professor William Greene Phone: 212.998.0876 Office: KMC 7-90 URL: people.stern.nyu.edu/wgreene e-mail: wgreene@stern.nyu.edu
More informationYou can specify the response in the form of a single variable or in the form of a ratio of two variables denoted events/trials.
The GENMOD Procedure MODEL Statement MODEL response = < effects > < /options > ; MODEL events/trials = < effects > < /options > ; You can specify the response in the form of a single variable or in the
More informationEconometric Analysis of Cross Section and Panel Data
Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND
More informationGeneralized Linear Models for Non-Normal Data
Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture
More informationBinary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment
BINARY CHOICE MODELS Y ( Y ) ( Y ) 1 with Pr = 1 = P = 0 with Pr = 0 = 1 P Examples: decision-making purchase of durable consumer products unemployment Estimation with OLS? Yi = Xiβ + εi Problems: nonsense
More informationUnderstanding the multinomial-poisson transformation
The Stata Journal (2004) 4, Number 3, pp. 265 273 Understanding the multinomial-poisson transformation Paulo Guimarães Medical University of South Carolina Abstract. There is a known connection between
More informationClass Notes: Week 8. Probit versus Logit Link Functions and Count Data
Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While
More informationSTAT5044: Regression and Anova
STAT5044: Regression and Anova Inyoung Kim 1 / 18 Outline 1 Logistic regression for Binary data 2 Poisson regression for Count data 2 / 18 GLM Let Y denote a binary response variable. Each observation
More informationEconomics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama
Problem Set #1 (Random Data Generation) 1. Generate =500random numbers from both the uniform 1 ( [0 1], uniformbetween zero and one) and exponential exp ( ) (set =2and let [0 1]) distributions. Plot the
More informationLecture 2: Poisson and logistic regression
Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK S 3 RI, 11-12 December 2014 introduction to Poisson regression application to the BELCAP study introduction
More informationA Guide to Modern Econometric:
A Guide to Modern Econometric: 4th edition Marno Verbeek Rotterdam School of Management, Erasmus University, Rotterdam B 379887 )WILEY A John Wiley & Sons, Ltd., Publication Contents Preface xiii 1 Introduction
More informationDiscrete Choice Modeling
[Part 11] 1/52 Discrete Choice Modeling 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 Count Data 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent
More informationST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples
ST3241 Categorical Data Analysis I Generalized Linear Models Introduction and Some Examples 1 Introduction We have discussed methods for analyzing associations in two-way and three-way tables. Now we will
More informationA Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008
A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated
More informationSelection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models
Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models Massimiliano Bratti & Alfonso Miranda In many fields of applied work researchers need to model an
More informationCounts using Jitters joint work with Peng Shi, Northern Illinois University
of Claim Longitudinal of Claim joint work with Peng Shi, Northern Illinois University UConn Actuarial Science Seminar 2 December 2011 Department of Mathematics University of Connecticut Storrs, Connecticut,
More informationMLE for a logit model
IGIDR, Bombay September 4, 2008 Goals The link between workforce participation and education Analysing a two-variable data-set: bivariate distributions Conditional probability Expected mean from conditional
More informationHierarchical Hurdle Models for Zero-In(De)flated Count Data of Complex Designs
for Zero-In(De)flated Count Data of Complex Designs Marek Molas 1, Emmanuel Lesaffre 1,2 1 Erasmus MC 2 L-Biostat Erasmus Universiteit - Rotterdam Katholieke Universiteit Leuven The Netherlands Belgium
More informationLatent class models for utilisation of health care
HEDG Working Paper 05/01 Latent class models for utilisation of health care Teresa Bago d Uva June 2005 ISSN 1751-1976 york.ac.uk/res/herc/hedgwp Latent Class Models for utilisation of health care Teresa
More informationZERO INFLATED POISSON REGRESSION
STAT 6500 ZERO INFLATED POISSON REGRESSION FINAL PROJECT DEC 6 th, 2013 SUN JEON DEPARTMENT OF SOCIOLOGY UTAH STATE UNIVERSITY POISSON REGRESSION REVIEW INTRODUCING - ZERO-INFLATED POISSON REGRESSION SAS
More informationGeneralized linear models
Generalized linear models Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Generalized linear models Boston College, Spring 2016 1 / 1 Introduction
More informationUnobserved Heterogeneity and the Statistical Analysis of Highway Accident Data. Fred Mannering University of South Florida
Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data Fred Mannering University of South Florida Highway Accidents Cost the lives of 1.25 million people per year Leading cause
More informationCopula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011
Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011 Outline Ordinary Least Squares (OLS) Regression Generalized Linear Models
More informationModeling Longitudinal Count Data with Excess Zeros and Time-Dependent Covariates: Application to Drug Use
Modeling Longitudinal Count Data with Excess Zeros and : Application to Drug Use University of Northern Colorado November 17, 2014 Presentation Outline I and Data Issues II Correlated Count Regression
More informationSingle-level Models for Binary Responses
Single-level Models for Binary Responses Distribution of Binary Data y i response for individual i (i = 1,..., n), coded 0 or 1 Denote by r the number in the sample with y = 1 Mean and variance E(y) =
More informationLongitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 2017, Boston, Massachusetts
Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 217, Boston, Massachusetts Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control
More informationIntroduction to GSEM in Stata
Introduction to GSEM in Stata Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Introduction to GSEM in Stata Boston College, Spring 2016 1 /
More informationLecture 5: Poisson and logistic regression
Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK S 3 RI, 3-5 March 2014 introduction to Poisson regression application to the BELCAP study introduction
More informationInference and Regression
Name Inference and Regression Final Examination, 2016 Department of IOMS This course and this examination are governed by the Stern Honor Code. Instructions Please write your name at the top of this page.
More informationDeal with Excess Zeros in the Discrete Dependent Variable, the Number of Homicide in Chicago Census Tract
Deal with Excess Zeros in the Discrete Dependent Variable, the Number of Homicide in Chicago Census Tract Gideon D. Bahn 1, Raymond Massenburg 2 1 Loyola University Chicago, 1 E. Pearson Rm 460 Chicago,
More informationHomework 5: Answer Key. Plausible Model: E(y) = µt. The expected number of arrests arrests equals a constant times the number who attend the game.
EdPsych/Psych/Soc 589 C.J. Anderson Homework 5: Answer Key 1. Probelm 3.18 (page 96 of Agresti). (a) Y assume Poisson random variable. Plausible Model: E(y) = µt. The expected number of arrests arrests
More informationOverdispersion Workshop in generalized linear models Uppsala, June 11-12, Outline. Overdispersion
Biostokastikum Overdispersion is not uncommon in practice. In fact, some would maintain that overdispersion is the norm in practice and nominal dispersion the exception McCullagh and Nelder (1989) Overdispersion
More informationUNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator
UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS Duration - 3 hours Aids Allowed: Calculator LAST NAME: FIRST NAME: STUDENT NUMBER: There are 27 pages
More informationCRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.
CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Linear
More informationLinear Regression. Data Model. β, σ 2. Process Model. ,V β. ,s 2. s 1. Parameter Model
Regression: Part II Linear Regression y~n X, 2 X Y Data Model β, σ 2 Process Model Β 0,V β s 1,s 2 Parameter Model Assumptions of Linear Model Homoskedasticity No error in X variables Error in Y variables
More informationEstimation of hurdle models for overdispersed count data
The Stata Journal (2011) 11, Number 1, pp. 82 94 Estimation of hurdle models for overdispersed count data Helmut Farbmacher Department of Economics University of Munich, Germany helmut.farbmacher@lrz.uni-muenchen.de
More informationUsing the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1
Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, Discrete Changes 1 JunXuJ.ScottLong Indiana University 2005-02-03 1 General Formula The delta method is a general
More informationZero inflated negative binomial-generalized exponential distribution and its applications
Songklanakarin J. Sci. Technol. 6 (4), 48-491, Jul. - Aug. 014 http://www.sst.psu.ac.th Original Article Zero inflated negative binomial-generalized eponential distribution and its applications Sirinapa
More informationGeneralized linear models
Generalized linear models Douglas Bates November 01, 2010 Contents 1 Definition 1 2 Links 2 3 Estimating parameters 5 4 Example 6 5 Model building 8 6 Conclusions 8 7 Summary 9 1 Generalized Linear Models
More informationA simple bivariate count data regression model. Abstract
A simple bivariate count data regression model Shiferaw Gurmu Georgia State University John Elder North Dakota State University Abstract This paper develops a simple bivariate count data regression model
More informationTruncation and Censoring
Truncation and Censoring Laura Magazzini laura.magazzini@univr.it Laura Magazzini (@univr.it) Truncation and Censoring 1 / 35 Truncation and censoring Truncation: sample data are drawn from a subset of
More informationRobustness of Logit Analysis: Unobserved Heterogeneity and Misspecified Disturbances
Discussion Paper: 2006/07 Robustness of Logit Analysis: Unobserved Heterogeneity and Misspecified Disturbances J.S. Cramer www.fee.uva.nl/ke/uva-econometrics Amsterdam School of Economics Department of
More informationSimplified Implementation of the Heckman Estimator of the Dynamic Probit Model and a Comparison with Alternative Estimators
DISCUSSION PAPER SERIES IZA DP No. 3039 Simplified Implementation of the Heckman Estimator of the Dynamic Probit Model and a Comparison with Alternative Estimators Wiji Arulampalam Mark B. Stewart September
More informationLecture 14 More on structural estimation
Lecture 14 More on structural estimation Economics 8379 George Washington University Instructor: Prof. Ben Williams traditional MLE and GMM MLE requires a full specification of a model for the distribution
More informationEconometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit
Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables
More informationDiscrete Choice Modeling William Greene Stern School of Business, New York University. Lab Session 2 Assignment
Discrete Choice Modeling William Greene Stern School of Business, New York University Part 1. Binary Choice Modeling Lab Session 2 Assignment This exercise will involve estimating and analyzing binary
More informationEndogenous Treatment Effects for Count Data Models with Endogenous Participation or Sample Selection
Endogenous Treatment Effects for Count Data Models with Endogenous Participation or Sample Selection Massimilano Bratti & Alfonso Miranda Institute of Education University of London c Bratti&Miranda (p.
More informationRecent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data
Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)
More informationEconometric Analysis of Count Data
Econometric Analysis of Count Data Springer-Verlag Berlin Heidelberg GmbH Rainer Winkelmann Econometric Analysis of Count Data Third, Revised and Enlarged Edition With l3 Figures and 20 Tables, Springer
More informationControl Function and Related Methods: Nonlinear Models
Control Function and Related Methods: Nonlinear Models Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. General Approach 2. Nonlinear
More informationParameters Estimation Methods for the Negative Binomial-Crack Distribution and Its Application
Original Parameters Estimation Methods for the Negative Binomial-Crack Distribution and Its Application Pornpop Saengthong 1*, Winai Bodhisuwan 2 Received: 29 March 2013 Accepted: 15 May 2013 Abstract
More informationMixed models in R using the lme4 package Part 5: Generalized linear mixed models
Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates Madison January 11, 2011 Contents 1 Definition 1 2 Links 2 3 Example 7 4 Model building 9 5 Conclusions 14
More informationSTA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).
STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) T In 2 2 tables, statistical independence is equivalent to a population
More informationA Dynamic Hurdle Model for Zero-Inflated Count Data: With an Application to Health Care Utilization
University of Zurich Department of Economics Working Paper Series ISSN 1664-7041 (print) ISSN 1664-705X (online) Working Paper No. 151 A Dynamic Hurdle Model for Zero-Inflated Count Data: With an Application
More information36-463/663: Multilevel & Hierarchical Models
36-463/663: Multilevel & Hierarchical Models (P)review: in-class midterm Brian Junker 132E Baker Hall brian@stat.cmu.edu 1 In-class midterm Closed book, closed notes, closed electronics (otherwise I have
More informationTobit and Selection Models
Tobit and Selection Models Class Notes Manuel Arellano November 24, 2008 Censored Regression Illustration : Top-coding in wages Suppose Y log wages) are subject to top coding as is often the case with
More informationChapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a
Chapter 9 Regression with a Binary Dependent Variable Multiple Choice ) The binary dependent variable model is an example of a a. regression model, which has as a regressor, among others, a binary variable.
More informationInstitute of Actuaries of India
Institute of Actuaries of India Subject CT3 Probability and Mathematical Statistics For 2018 Examinations Subject CT3 Probability and Mathematical Statistics Core Technical Syllabus 1 June 2017 Aim The
More informationEPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7
Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review
More information,..., θ(2),..., θ(n)
Likelihoods for Multivariate Binary Data Log-Linear Model We have 2 n 1 distinct probabilities, but we wish to consider formulations that allow more parsimonious descriptions as a function of covariates.
More informationChapter 1. Modeling Basics
Chapter 1. Modeling Basics What is a model? Model equation and probability distribution Types of model effects Writing models in matrix form Summary 1 What is a statistical model? A model is a mathematical
More informationModeling Binary Outcomes: Logit and Probit Models
Modeling Binary Outcomes: Logit and Probit Models Eric Zivot December 5, 2009 Motivating Example: Women s labor force participation y i = 1 if married woman is in labor force = 0 otherwise x i k 1 = observed
More informationModels for Binary Outcomes
Models for Binary Outcomes Introduction The simple or binary response (for example, success or failure) analysis models the relationship between a binary response variable and one or more explanatory variables.
More informationLinear Regression With Special Variables
Linear Regression With Special Variables Junhui Qian December 21, 2014 Outline Standardized Scores Quadratic Terms Interaction Terms Binary Explanatory Variables Binary Choice Models Standardized Scores:
More informationThe Fixed-Effects Zero-Inflated Poisson Model with an Application to Health Care Utilization Majo, M.C.; van Soest, Arthur
Tilburg University The Fixed-Effects Zero-Inflated Poisson Model with an Application to Health Care Utilization Majo, M.C.; van Soest, Arthur Publication date: 2011 Link to publication Citation for published
More informationEconometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction
Econometrics I Professor William Greene Stern School of Business Department of Economics 1-1/40 http://people.stern.nyu.edu/wgreene/econometrics/econometrics.htm 1-2/40 Overview: This is an intermediate
More informationMixed models in R using the lme4 package Part 5: Generalized linear mixed models
Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates 2011-03-16 Contents 1 Generalized Linear Mixed Models Generalized Linear Mixed Models When using linear mixed
More informationPartial effects in fixed effects models
1 Partial effects in fixed effects models J.M.C. Santos Silva School of Economics, University of Surrey Gordon C.R. Kemp Department of Economics, University of Essex 22 nd London Stata Users Group Meeting
More informationApplied Econometrics Lecture 1
Lecture 1 1 1 Università di Urbino Università di Urbino PhD Programme in Global Studies Spring 2018 Outline of this module Beyond OLS (very brief sketch) Regression and causality: sources of endogeneity
More informationSyllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools
Syllabus By Joan Llull Microeconometrics. IDEA PhD Program. Fall 2017 Chapter 1: Introduction and a Brief Review of Relevant Tools I. Overview II. Maximum Likelihood A. The Likelihood Principle B. The
More information-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model
-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model Mark B. Stewart University of Warwick January 2006 1 The model The latent equation for the random effects
More information