Comparing groups using predicted probabilities

Similar documents
Group comparisons in logit and probit using predicted probabilities 1

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Single-level Models for Binary Responses

Interpreting and using heterogeneous choice & generalized ordered logit models

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1

ECON 594: Lecture #6

h=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio

Speci cation of Conditional Expectation Functions

Limited Dependent Variable Models II

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Finansiell Statistik, GN, 15 hp, VT2008 Lecture 17-1: Regression with dichotomous outcome variable - Logistic Regression

Using predictions to compare groups in regression models for binary outcomes

Economics Introduction to Econometrics - Fall 2007 Final Exam - Answers

Comparing groups in binary regression models using predictions

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

Binary Outcomes. Objectives. Demonstrate the limitations of the Linear Probability Model (LPM) for binary outcomes

Introduction: structural econometrics. Jean-Marc Robin

1 A Non-technical Introduction to Regression

General Linear Model (Chapter 4)

MC3: Econometric Theory and Methods. Course Notes 4

Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment

Tables and Figures. This draft, July 2, 2007

GMM-based inference in the AR(1) panel data model for parameter values where local identi cation fails

We begin by thinking about population relationships.

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008

ECON Interactions and Dummies

Chapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a

7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis

Generalized Linear Models for Non-Normal Data

Econometrics I Lecture 7: Dummy Variables

Wageningen Summer School in Econometrics. The Bayesian Approach in Theory and Practice

Estimating Explained Variation of a Latent Scale Dependent Variable Underlying a Binary Indicator of Event Occurrence

1 The Multiple Regression Model: Freeing Up the Classical Assumptions

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois

STA 216, GLM, Lecture 16. October 29, 2007

Marginal effects and extending the Blinder-Oaxaca. decomposition to nonlinear models. Tamás Bartus

Lecture 4: Linear panel models

Introduction to Econometrics

Appendix A. Math Reviews 03Jan2007. A.1 From Simple to Complex. Objectives. 1. Review tools that are needed for studying models for CLDVs.

Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 2017, Boston, Massachusetts

Fitting heterogeneous choice models with oglm

Binary Logistic Regression

Lecture 3.1 Basic Logistic LDA

ECON 482 / WH Hong Binary or Dummy Variables 1. Qualitative Information

Strati cation in Multivariate Modeling

Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, Last revised January 20, 2018

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS

Lecture Notes 12 Advanced Topics Econ 20150, Principles of Statistics Kevin R Foster, CCNY Spring 2012

DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005

Online Appendix to: Marijuana on Main Street? Estimating Demand in Markets with Limited Access

Regression with Qualitative Information. Part VI. Regression with Qualitative Information

The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation

Chapter 11. Regression with a Binary Dependent Variable

Limited Dependent Variables and Panel Data

Semiparametric Generalized Linear Models

11. Bootstrap Methods

Empirical Methods in Applied Microeconomics

Regression. Econometrics II. Douglas G. Steigerwald. UC Santa Barbara. D. Steigerwald (UCSB) Regression 1 / 17

Compare Predicted Counts between Groups of Zero Truncated Poisson Regression Model based on Recycled Predictions Method

Comparing Logit & Probit Coefficients Between Nested Models (Old Version)

13 Endogeneity and Nonparametric IV

Ch 7: Dummy (binary, indicator) variables

Rethinking Inequality Decomposition: Comment. *London School of Economics. University of Milan and Econpubblica

Logistic regression: Uncovering unobserved heterogeneity

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments

Lab 10 - Binary Variables

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014

Regression techniques provide statistical analysis of relationships. Research designs may be classified as experimental or observational; regression

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments

Editor Executive Editor Associate Editors Copyright Statement:

Lecture Notes Part 4: Nonlinear Models

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Binary Dependent Variables

A Note on the Closed-form Identi cation of Regression Models with a Mismeasured Binary Regressor

Econometrics Problem Set 10

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

Logistic Regression. Some slides from Craig Burkett. STA303/STA1002: Methods of Data Analysis II, Summer 2016 Michael Guerzhoy

Multiple Regression: Chapter 13. July 24, 2015

Simple Estimators for Semiparametric Multinomial Choice Models

Problem set 1 - Solutions

Notes on Generalized Method of Moments Estimation

1 Fixed E ects and Random E ects

Week 7: Binary Outcomes (Scott Long Chapter 3 Part 2)

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

CORRELATION. suppose you get r 0. Does that mean there is no correlation between the data sets? many aspects of the data may a ect the value of r

CIVL 7012/8012. Simple Linear Regression. Lecture 3

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit

Problem Set 10: Panel Data

Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions

Christopher Dougherty London School of Economics and Political Science

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Pseudo panels and repeated cross-sections

Making sense of Econometrics: Basics

Cohen s s Kappa and Log-linear Models

Answer Key: Problem Set 5

Economics 326 Methods of Empirical Research in Economics. Lecture 14: Hypothesis testing in the multiple regression model, Part 2

xtunbalmd: Dynamic Binary Random E ects Models Estimation with Unbalanced Panels

Transcription:

Comparing groups using predicted probabilities J. Scott Long Indiana University May 9, 2006 MAPSS - May 9, 2006 - Page 1

The problem Allison (1999): Di erences in the estimated coe cients tell us nothing about the di erences in the underlying impact of x on the two groups. MAPSS - May 9, 2006 - Page 2

Overview 1. Does the e ect of x on y di er across groups? 2. Comparing s across groups in the LRM 3. Problems with comparing s in logit and probit models 4. Tests and CIs for comparing Pr (y = 1) across groups 5. Example of gender di erences in tenure 6. Complications due to nonlinearity of models MAPSS - May 9, 2006 - Page 3

LRM - Chow test 1. For example, Men: y = m + m educ educ + m ageage + " Women: y = w + w educ educ + w ageage + " 2. Do men and women have the same return for education? 3. We compute: z = H 0 : m educ = w educ b m educ b w educ r Var b m educ + Var b w educ MAPSS - May 9, 2006 - Page 4

Logit and probit using latent y 1. Chow type test is invalid for logit & probit (Allison 1999). 2. Structural model with a latent y : y = + x + " 3. Error is normal for probit or logistic for logit: " f (0; " ) 4. We only observed a binary y: y = ( 1 if y > 0 0 if y 0 5. Graphically... MAPSS - May 9, 2006 - Page 5

Logit and probit using latent y MAPSS - May 9, 2006 - Page 6

y and Pr (y = 1) 1. Link to observed data: Pr (y = 1 j x) = Pr (y > 0 j x) = Pr (" < [ + x] j x) 2. Since y is not observed, is only identi ed up to a scale factor. MAPSS - May 9, 2006 - Page 7

Identi cation of and " 1. Model A: 2. Model B: ya = a + a xx + " a where a = 1 = 6 + 1x + " a 3. Where: yb = b + b x x + " b where b = 2 = 12 + 2x + " b b = 2 a b x = 2 a x b = 2 a MAPSS - May 9, 2006 - Page 8

Pr (y = 1 j x) in model A MAPSS - May 9, 2006 - Page 9

Pr (y = 1 j x) in model B MAPSS - May 9, 2006 - Page 10

The problem and a solution In terms of Pr(y = 1), these are empirically indistinguishable: Case 1: A change in x of 1 when a x = 1 and a = 1: Case 2: A change in x of 1 when b x = 2 and b = 2. MAPSS - May 9, 2006 - Page 11

Implications for group comparisons 1. Comparing propensity for tenure for men and women: Men : y = m + m articles articles + " m Women : y = w + w articles articles + " w 2. Assume the s are equal: m articles = w articles 3. Assume women have more unobserved heterogeneity: w > m 4. Now we estimate the model... MAPSS - May 9, 2006 - Page 12

Estimation 1. Probit software assumes that = 1. 2. For men, the estimated model is: y = m + m articles articles + " m m m m m = e m + em articlesarticles + e" m ; sd (e" m ) = 1 3. For women, the estimated model is: y = w + w articles articles + " w w w w w = e w + ew articlesarticles + e" w ; sd (e" w ) = 1 MAPSS - May 9, 2006 - Page 13

Problem with Chow-type tests 1. We want to test: H 0 : m articles = w articles 2. But, end up testing: H 0 : em articles = ew articles 3. And, e m articles = e w articles does not imply m articles = w articles unless m = w. MAPSS - May 9, 2006 - Page 14

Alternatives for testing group di erences 1. Allison s (1999) test of H 0 : m x = w x : (a) Disentangles the s and " s. (b) But requires that m z = w z for some z. 2. I propose testing H 0 : Pr (y = 1 j x) m = Pr (y = 1 j x) w since the probabilities are invariant to ". 3. Graphically... MAPSS - May 9, 2006 - Page 15

Invariance of Pr(y = 1 j x) MAPSS - May 9, 2006 - Page 16

Group comparisons of Pr(y = 1 j x) MAPSS - May 9, 2006 - Page 17

Testing (x) 1. De ne: (x) = Pr (y = 1 j x) m Pr (y = 1 j x) w 2. Then: z = c (x) rv ar hc (x) i 3. Or the con dence interval: Pr ( LB (x) UB ) = :95 4. Delta method is very fast; bootstrap is very slow; both are available with SPost s prvalue and prgen. MAPSS - May 9, 2006 - Page 18

Comparing groups di erences in Pr (y = 1) 1. Estimate: Pr (y = 1) = F ( w w + w x wx + m m + m x mx) where w = 1 for women, else 0; m = 1 for men, else 0; wx = w x; and mx = m x: 2. Then: Pr (y = 1 j x) w = F ( w + w x x) Pr (y = 1 j x) m = F ( m + m x x) 3. The gender di erence is: (x) = Pr (y = 1 j x) m Pr (y = 1 j x) w MAPSS - May 9, 2006 - Page 19

Example: gender di erences in tenure Women Men Variable Mean SD Mean SD Is tenured? tenure 0.11 0.31 0.13 0.34 Year year 3.97 2.38 3.78 2.25 Year-squared yearsq 21.46 23.14 19.39 21.50 Bachelor s selectivity select 5.00 1.48 4.99 1.37 Total articles articles 7.41 7.43 6.83 5.99 Distinguished job? distinguished 0.05 0.23 0.04 0.19 Prestige of job prestige 2.66 0.77 2.64 0.78 Person-years 1,121 1,824 Scientists 177 301 MAPSS - May 9, 2006 - Page 20

M1: articles only Women Men Variable e z e z constant -2.47-18.30-2.69-23.00 articles 0.042 1.04 4.26 0.10 1.10 9.93 Log-lik -375.17-663.03 N 1,121 1,824 MAPSS - May 9, 2006 - Page 21

M1: Pr(tenure j articles) 1 Men Women.8 Pr(tenure).6.4.2 0 0 10 20 30 40 50 Number of Articles (group_ten03a.do 11Apr2006) MAPSS - May 9, 2006 - Page 22

M1: group di erences in probabilities 1. We are interested in group di erences in predictions: 2. We can test: 3. Or: (articles) = Pr (y = 1 j articles) m Pr (y = 1 j articles) w H 0 : (articles) = 0 (articles) LowerBound, (articles) UpperBound 4. With one RHS variable, we can plot all comparisons. MAPSS - May 9, 2006 - Page 23

M1: (articles) with con dence intervals.8.7 Difference 95% confidence interval Pr(men) Pr(women).6.5.4.3.2.1 0.1 0 10 20 30 40 50 Number of Articles (group_ten03a.do 11Apr2006) MAPSS - May 9, 2006 - Page 24

Adding variables additional variables 1. Structural model: y = + x x + z z + " 2. Setting z = Z changes the intercept: y = + x x + z Z + " = [ + z Z] + x x + " = + x x + " 3. Di erent values of z lead to di erent probability curves: Pr (y = 1 j x; z = Z) = Pr (" < [ + x x]) = F ( + x x) MAPSS - May 9, 2006 - Page 25

E ect of levels of other variables 1.8 a* = (a + b_z 4) = 8 a* = (a + b_z 3) = 10 a* = (a + b_z 2) = 12 a* = (a + b_z 1) = 14 Pr(y=1 x,z ).6.4.2 0 0 2 4 6 8 10 x (group_parallel 2006 04 07) MAPSS - May 9, 2006 - Page 26

Comparing groups with added variables 1. For given z = Z : Men: Pr (y = 1 j x; Z) m = F ( m + m x x) Women: Pr (y = 1 j x; Z) w = F ( w + w x x) 2. Group di erences depend on z: (x; Z) = Pr (y = 1 j x; Z) m Pr (y = 1 j x; Z) w 3. (x; Z) for a given x depends on the level of other variables. MAPSS - May 9, 2006 - Page 27

M2: adding job type Women Men Variable e z e z constant -2.54-17.90-2.68-22.87 articles 0.06 1.06 5.00 0.10 1.11 10.01 distinguished -1.63 0.19-2.43-0.73 0.48-1.71 Log-lik -375.17-663.03 N 1,121 1,824 Chow tests: Allison tests: 2 articles =7.93, df=1, p<.01; 2 dist =1.38, df=1, p>.10. 2 articles =15.06, df=1, p<.01 (other e ects equal). 2 distinguished =3.54, df=1, p=.06 (other e ects equal). MAPSS - May 9, 2006 - Page 28

M2: Pr(tenure j articles) 1 Men distinguished Women distinguished.8 Pr(tenure).6.4.2 0 0 10 20 30 40 50 Number of Articles (group_ten04a.do 14Apr2006) MAPSS - May 9, 2006 - Page 29

M2: Pr(tenure j articles) 1.8 Men distinguished Women distinguished Men not distinguished Women not distinguished Pr(tenure).6.4.2 0 0 10 20 30 40 50 Number of Articles (group_ten04a.do 14Apr2006) MAPSS - May 9, 2006 - Page 30

M2: (articles if not distinguished) Pr(men) Pr(women).9.7.5.3.1 Scientists not in distinguished departments Male Female difference 95% confidence interval.1 0 10 20 30 40 50 Number of Articles (group_ten04a.do 11Apr2006) MAPSS - May 9, 2006 - Page 31

M2: (articles if distinguished) Pr(men) Pr(women).9.7.5.3.1 Scientists in distinguished departments Male Female difference 95% confidence interval.1 0 10 20 30 40 50 Number of Articles (group_ten04a.do 11Apr2006) MAPSS - May 9, 2006 - Page 32

M2: (articles) by job type Pr(men) Pr(women).9.7.5.3.1 Distinguished if not significant Not distinguished if not significant.1 0 10 20 30 40 50 Number of Articles (group_ten04a.do 11Apr2006) MAPSS - May 9, 2006 - Page 33

M3: full model Women Men Variable e z e z constant -4.21-6.68-5.82-11.55 year 0.77 6.12 1.07 9.08 yearsq -0.04-4.93-0.07-7.51 select 0.03 1.04 0.50 0.21 1.23 3.69 articles 0.04 1.04 2.98 0.07 1.08 6.84 prestige -0.35 0.71-2.29-0.38 0.69-3.64 Log-lik -338.85-579.23 N 1,121 1,824 Chow tests: Allison tests: 2 articles =1.07, df=1, p=.30. 2 prestige =0.03, df=1, p=.86. 2 articles =1.73, df=1, p=.19 (other e ects equal). 2 prestige =1.55, df=1, p=.21 (other e ects equal). MAPSS - May 9, 2006 - Page 34

M3: Pr(tenure if prestige = 5) 1.8 Year 7 with prestige 5 Men Women Pr(tenure).6.4.2 0 0 10 20 30 40 50 Number of Articles (group_ten05b.do 04May2006) MAPSS - May 9, 2006 - Page 35

M3: (articles if prestige = 5).7 Year 7 with prestige 5 Male Female difference 95% confidence interval Pr(men) Pr(women).5.3.1.1 0 10 20 30 40 50 Number of Articles (group_ten05b.do 04May2006) MAPSS - May 9, 2006 - Page 36

M3: (articles by prestige) Pr(men) Pr(women).5.4.3.2.1 Weak (prestige=1) Adequate (prestige=2) Good (prestige=3) Strong (prestige=4) Distinguished (prestige=5) 0 0 10 20 30 40 50 Number of Articles (group_ten05b.do 12Apr2006) MAPSS - May 9, 2006 - Page 37

M3: (prestige by articles).5.4 0 articles 10 articles 20 articles 30 articles 40 articles 50 articles Pr(men) Pr(women).3.2.1 0 1 2 3 4 5 Job Prestige (group_ten05c.do 12Apr2006) MAPSS - May 9, 2006 - Page 38

M3: (articles, prestige) MAPSS - May 9, 2006 - Page 39

Conclusions 1. Predictions o er a general approach for comparing groups. 2. The approach cannot distinguish between di erent processes generating the same probabilities. For example: Case 1: m x = w x m 6= w ) (x) = 0 Case 2: m x 6= w x m = w ) (x) = 0 3. But, it provides a very detailed understanding of group di erences. 4. Implications for interpreting nonlinear models. 5. In Stata, enter: findit spost_groups for examples. 6. www.indiana.edu/~jslsoc/research.htm MAPSS - May 9, 2006 - Page 40

References Allison, Paul D. 1999. Comparing Logit and Probit Coe cients Across Groups. Sociological Methods and Research 28:186-208. Chow, G.C. 1960. Tests of equality between sets of coe cients in two linear regressions. Econometrica 28:591-605. Long, J.S. and Freese, J. 2005. Regression Models for Categorical and Limited Dependent Variables with Stata. Second Edition. College Station, TX: Stata Press. Long, J. Scott, Paul D. Allison, and Robert McGinnis. 1993. Rank Advancement in Academic Careers: Sex Di erences and the E ects of Productivity. American Sociological Review 58:703-722. Xu, J. and J.S. Long, 2005, Con dence intervals for predicted outcomes in regression models for categorical outcomes. The Stata Journal 5: 537-559. MAPSS - May 9, 2006 - Page 41

Delta method for (x; ) 1. Start with the predicted probability: G() = Pr (y = 1 j x) = F (x) 2. A Taylor series expansion of G( ): b G( ) b G() + ( b ) T G 0 () 3. Where G 0 () = h @F (x) @ 0 = f(x)x T @F (x) @ 1 @F (x) @ K i T MAPSS - May 9, 2006 - Page 42

4. Under standard assumptions, G( b ) is distributed normally around G() with a variance Var h G( b ) i = G 0 ( b ) T V ar( b )G 0 ( b ) 5. For a di erence in probability, let G () = F (jx a ) F (jx b ) MAPSS - May 9, 2006 - Page 43

6. Then V ar h F ( jx b a ) F jx b i b = " @F (jxa ) @ T V ar( ) b @F (jx a) @ " @F (jxa ) @ T V ar( ) b @F (jx b) @ " @F (jxb ) @ T V ar( ) b @F (jx a) @ " @F (jxb ) + @ T V ar( ) b @F (jx b) @ # # # # MAPSS - May 9, 2006 - Page 44

Proof of invariance of Pr(y = 1) to " 1. If N (0; = 1) 2. Then: (") = 1 p 2 exp " 2 2 2 Pr (y = 1 j x) = 3. If N (0; = 2): Z +x 1 (") = 1 p 2 exp " 2 2 2! = 1 p 2 exp 1 p 2 exp! t 2! 2 " 2! 2 dt (1) = 1 2 p 2 exp " 2! 8 MAPSS - May 9, 2006 - Page 45

4. Then, Pr (y = 1 j x) = Z (+x) 1 1 2 p 2 exp t 2! dt (2) 8 5. Equations 1 and 2 simply involve a linear change of variables, so the probabilities are equal. 6. If z = g (x) and x = g 1 (z), then Z A B f (z) dz = Z g 1 (A) g 1 (B) f [g (z)] dz dx dx 7. See Long 1997: 49-50 for an alternative proof. MAPSS - May 9, 2006 - Page 46