Econometrics II. Seppo Pynnönen. Spring Department of Mathematics and Statistics, University of Vaasa, Finland
|
|
- Melinda Richard
- 5 years ago
- Views:
Transcription
1 Department of Mathematics and Statistics, University of Vaasa, Finland Spring 2018
2 Part III Limited Dependent Variable Models As of Jan 30, 2017
3 1 Background 2 Binary Dependent Variable The Linear Probability Model The Logit and Probit Model 3 Tobit Model Interpreting Tobit Estimates Predicting with Tobit Regression Checking Specification of Tobit Models
4 Limited dependent variables refer to variables whose range of values is substantially restricted. A binary variable takes only two values (0/1) is an example. Other examples are is a variable that takes a small number of integer values. Other kinds of limited variables are those whose values are truncated for some reasons. For example, number of passenger tickets in an airplane or some sports event, etc. Note however that not all truncated cases need special treatment. An example is wage, which must be positive. Typical truncated value variables are those that have in the limiting value a big concentration of observations.
5 1 Background 2 Binary Dependent Variable The Linear Probability Model The Logit and Probit Model 3 Tobit Model Interpreting Tobit Estimates Predicting with Tobit Regression Checking Specification of Tobit Models
6 1 Background 2 Binary Dependent Variable The Linear Probability Model The Logit and Probit Model 3 Tobit Model Interpreting Tobit Estimates Predicting with Tobit Regression Checking Specification of Tobit Models
7 1 Background 2 Binary Dependent Variable The Linear Probability Model The Logit and Probit Model 3 Tobit Model Interpreting Tobit Estimates Predicting with Tobit Regression Checking Specification of Tobit Models
8 Up until now in regression y = x β + u, (1) where x β = β 0 + β 1 x β k x k, y has had quantitative meaning (e.g. wage). What if y indicates a qualitative event (e.g., firm has gone to bankruptcy), such that y = 1 indicates the occurrence of the event ( success ) and y = 0 non-occurrence ( fail ), and we want to explain it by some explanatory variables?
9 The meaning of the regression y = x β + u, when y is a binary variable. Then, because E[u x] = 0, E[y x] = x β. (2) Because y is a random variable that can have only values 0 or 1, we can define probabilities for y as P(y = 1 x) and P(y = 0 x) = 1 P(y = 1 x), such that E[y x] = 0 P(y = 0 x) + 1 P(y = 1 x) = P(y = 1 x).
10 Thus, E[y x] = P(y = 1 x) indicates the success probability and regression in equation 2 models P(y = 1 x) = β 0 + β 1 x β k x k, (3) the probability of success. This is called the linear probability model (LPM). The slope coefficients indicate the marginal effect of corresponding x-variable on the success probability, i.e., change in the probability as x changes, or P(y = 1 x) = β j x j. (4)
11 In the OLS estimated model ŷ = β 0 + ˆβ 1 x ˆβ k x k (5) ŷ is the estimated or predicted probability of success. In order to correctly specify the binary variable, it may be useful to name the variable according to the success category (e.g., in a bankruptcy study, bankrupt = 1 for bankrupt firms and bankrupt = 0 for non-bankrupt firm [thus success is just a generic term]).
12 Example 1 (Married women participation in labor force (year 1975)) Linear probability model (See R-snippet for the R-commands): lm(formula = inlf ~ nwifeinc + educ + exper + I(exper^2) + age + kidslt6 + kidsge6, data = wkng) Residuals: Min 1Q Median 3Q Max Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) *** nwifeinc * educ e-07 *** exper e-12 *** I(exper^2) ** age e-10 *** kidslt e-14 *** kidsge Signif. codes: 0 *** ** 0.01 * Residual standard error: on 745 degrees of freedom Multiple R-squared: ,Adjusted R-squared: F-statistic: on 7 and 745 DF, p-value: < 2.2e-16
13 All others but kidsge6 are statistically significant with signs as might be expected. The coefficients indicate the marginal effects of the variables on the probability that inlf = 1. Thus e.g., an additional year of educ increases the probability by (other variables held fixed). Marginal effect of experince on marri women labor force participation Marginal effect of eduction on marrie women labor force participation Probability Probability Experience (years) Education (years)
14 Some issues with associated to the LPM. Dependent left hand side restricted to (0, 1), while right hand side (, ), which may result to probability predictions less than zero or larger than one. Heteroskedasticity of u, since by denoting p(x) = P(y = 1 x) = x β var[u x] = (1 p(x))p(x) (6) which is not a constant but depends on x, and hence violating Assumption 2.
15 1 Background 2 Binary Dependent Variable The Linear Probability Model The Logit and Probit Model 3 Tobit Model Interpreting Tobit Estimates Predicting with Tobit Regression Checking Specification of Tobit Models
16 The first of the above problems can be technically easily solved by mapping the linear function on the right hand side of equation (3) by a non-linear function to the range (0, 1). Such a function is generally called a link function. That is, instead we write equation (3) as P(y = 1 x) = G(x β). (7) Although any function G : R [0, 1] applies in principle, so called logit and probit transformations are in practice most popular (the former is based on logistic distribution and the latter normal distribution). Economists favor often the probit transformation such that G is the distribution function of the standard normal density, i.e., G(z) = Φ(z) = z 1 2π e 1 2 v 2 dv, (8)
17 In the logit tranformation G(z) = Both as S-shaped ez 1 + e z = e z = z e v dv. (9) (1 + e v 2 ) Probit transformation Logit transformation G(z) G(z) z z
18 The price, however, is that the interpretation of the marginal effects is not any more as straightforward as with the LPM. However, negative sign indicates decreasing effect on the probability and positive increasing. More precisely, using equation (7), the marginal change with respect to x j (keeping others unchanged) is P(y = 1 x β) g(x β)β j x j, (10) where g is the derivative function of G (g(x β) = (1/ 2π) exp ( (x β) 2 /2 ) for probit and g(x β) = exp( x β)/ (1 + exp( x β)) 2 for logit).
19 Typically the marginal effects are evaluated by unit changes in x j (i.e., x j = 1) at sample means of the x-variables with estimated β-coefficients [partial effect at the average (PEA)]. Another commonly used approach is to evaluate at the sample mean 1 n g(x i n ˆβ). (11) i=1
20 There are various pseudo R-suared measures for binary response models. One is McFadden measure. Another is squared correlation between ŷ i s (prediceted probability) and observed y i s (which have 0/1 values). Using R, the former can be computed as 1 (residual deviance)/(null deviance), where residual deviance is the value of the likelihood function of the fitted model, and null deviance is the value of the likelihood function when the intercept is included into the model.
21 Example 2 (Married women s labor force... ) Probit: (family = binomial(link = probit ) in glm) Call: glm(formula = inlf ~ nwifeinc + educ + exper + I(exper^2) + age + kidslt6 + kidsge6, family = binomial(link = "probit"), data = wkng) Deviance Residuals: Min 1Q Median 3Q Max Coefficients: Estimate Std. Error z value Pr(> z ) (Intercept) nwifeinc * educ e-07 *** exper e-11 *** I(exper^2) ** age e-10 *** kidslt e-13 *** kidsge Signif. codes: 0 *** ** 0.01 * Null deviance: on 752 degrees of freedom Residual deviance: on 745 degrees of freedom AIC: Pseudo R-square: / = 0.221
22 Logit: (family = binomial(link = logit ) in glm) glm(formula = inlf ~ nwifeinc + educ + exper + I(exper^2) + age + kidslt6 + kidsge6, family = binomial(link = "logit"), data = wkng) Deviance Residuals: Min 1Q Median 3Q Max Coefficients: Estimate Std. Error z value Pr(> z ) (Intercept) nwifeinc * educ e-07 *** exper e-10 *** I(exper^2) ** age e-09 *** kidslt e-12 *** kidsge Signif. codes: 0 *** ** 0.01 * Null deviance: on 752 degrees of freedom Residual deviance: on 745 degrees of freedom AIC: , Pseudo R-squared: / = Qualitatively the results are similar to those of the LPM. (R exercise: create similar graphs to those of the linear case for the marginal effects.)
23 1 Background 2 Binary Dependent Variable The Linear Probability Model The Logit and Probit Model 3 Tobit Model Interpreting Tobit Estimates Predicting with Tobit Regression Checking Specification of Tobit Models
24 Limited dependent variable is called a corner solution response variable if the variable is zero (say) for a nontrivial fraction in the population but is roughly continuously distributed over positive values. An example is the amount an individual is consuming alcohol in a given month. Nothing in principle prevents using a linear model for such a y. The problem is that fitted values may be negative.
25 In cases where it is important to have a model that implies nonnegative predicted values for y, the Tobit model is convenient. The Tobit model (typically) expresses the observed response, y, in terms of an underlying latent variable, y, y = x β + u (12) with and u x N(0, σ 2 ). y = max(0, y ) (13)
26 Accordingly y N(x β, σ 2 ) and y = y for y 0, but y = 0 for y < 0. Given sample of observations on y, the parameters can be estimated by the method of maximum likelihood. The log-likelihood function for observation i is l i (β, σ 2 ) = 1(y i = 0) log ( 1 Φ(x iβ/σ) ) (14) ( 1 +1(y i > 0) log σ φ ( (y i x iβ)/σ ) ) where 1(A) is an indicator function with value 1 if the condition A is true and zero otherwise, Φ( ) is the distribution function and φ( ) the density function of the N(0, 1) distribution. The maximization of the log-likelihood, l(β, σ) = i l i(β, σ), to obtain the ML estimates of β and σ is done by numerical methods.
27 Example 3 (Married women annual working hours) Married women working hours Frequency Hours
28 OLS results lm(formula = hours ~ nwifeinc + educ + exper + I(exper^2) + age + kidslt6 + kidsge6, data = wkng) Residuals: Min 1Q Median 3Q Max Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) e-06 *** nwifeinc educ * exper e-11 *** I(exper^2) * age e-12 *** kidslt e-13 *** kidsge Signif. codes: 0 *** ** 0.01 * Residual standard error: on 745 degrees of freedom Multiple R-squared: ,Adjusted R-squared: F-statistic: 38.5 on 7 and 745 DF, p-value: < 2.2e-16
29 Tobit regression vglm(formula = hours ~ nwifeinc + educ + exper + I(exper^2) + age + kidslt6 + kidsge6, family = tobit(lower = 0), data = wkng) Pearson residuals: Min 1Q Median 3Q Max mu loge(sd) Coefficients: Estimate Std. Error z value Pr(> z ) (Intercept): * (Intercept): < 2e-16 *** nwifeinc * educ *** exper e-14 *** I(exper^2) *** age e-13 *** kidslt e-15 *** kidsge Signif. codes: 0 *** ** 0.01 * Number of linear predictors: 2 Names of linear predictors: mu, loge(sd) Log-likelihood: on 1497 degrees of freedom Number of iterations: 6
30 (Intercept):2 is an extra statistic related to residual standard deviation. OLS generally results to biased estimation due to the censored y-values. Tobit regression accounts the biasing effect. However, we should make some adjustments to the Tobit coefficients before interpreting the magnitudes, as discussed below.
31 Interpreting Tobit Estimates 1 Background 2 Binary Dependent Variable The Linear Probability Model The Logit and Probit Model 3 Tobit Model Interpreting Tobit Estimates Predicting with Tobit Regression Checking Specification of Tobit Models
32 Interpreting Tobit Estimates Similar to regression, the interest is in the conditional expectation E[y x]. Given E[y y > 0, x] we can compute E[y x] as we can consider E[y y > 0, x] as the value of the binary random variable z which has value E[y y > 0, x] with with probability P[y > 0 x], when y > 0 and E[y y = 0, x] = 0 with probability P[y = 0 x] when y = 0. Accordingly using the law of iterated expectation (LIE) 2 E[y x] = E z [E[y z, x]] = P[y > 0 x] E[y y > 0, x] (17) 2 Generally, given random variables x, y, and z, E[x z] = E y [E[x y, z]] (15) and in particular E[x] = E y [E[x y]]. (16)
33 Interpreting Tobit Estimates Because y N(x β, σ 2 ) and y = y for y > 0 and y = 0 for y < 0, we have P(y > 0 x) = 1 Φ( x β/σ) = Φ(x β), such that E[y x] in (17) becomes E[y x] = Φ(x β/σ)e[y y > 0, x]. (18) To obtain E[y y > 0, x] we can use the general result for z N(0, 1): For any c E[z z > c] = φ(c)/ (1 Φ(c)) from which we obtain, by noting that y = x β + u and E[y y > 0, x] = x β + E[u u > x β], E[y y > 0, x] = x β + σ φ(xβ/σ), (19) where φ(c) = φ(c)/φ(c) [note: φ( c) = φ(c) and 1 Φ( c) = Φ(c)].
34 Interpreting Tobit Estimates Thus the marginal contribution of x j to the (conditional) expectation is x j E[y y > 0, x] = β j + β j φ (x β), (20) where φ ( ) is the derivative of φ( ). Because for standard normal distribution φ (z) = dφ(z)/dz = zφ(z) and Φ (z) = dφ(z)/dz = φ(z), we get finally ( E[y y > 0, x] = β j 1 x φ(x ( β/σ) x β/σ + φ(x )) β/σ). (21) j
35 Interpreting Tobit Estimates Equation (21) shows that the β j does not exactly reflect the marginal effect of x j on E[y y > 0, x]. It ( becomes adjusted ( by the factor )) 1 φ(x β/σ) x β/σ + φ(x β/σ). The marginal effect of x j on E[y x]: Combining equations (17) and (19), we have E[y x] = Φ(x β/σ)x β + σφ(x β), (22) where we have used the result Φ(z) φ(z) = φ(z).
36 Interpreting Tobit Estimates From equation (17) we can compute the marginal effect of x j by utilizing φ (z) = zφ(z), so that x j E[y x] = β j Φ(x β/σ) + β j φ(x β/σ)x β β j φ(x β)x β = β j Φ(x β/σ). (23) Again β becomes adjusted to some extend (causing difference from OLS). After estimating β and σ, Φ(x β/σ) is often evaluated at the mean n 1 i Φ(x i ˆβ/ˆσ).
37 Predicting with Tobit Regression 1 Background 2 Binary Dependent Variable The Linear Probability Model The Logit and Probit Model 3 Tobit Model Interpreting Tobit Estimates Predicting with Tobit Regression Checking Specification of Tobit Models
38 Predicting with Tobit Regression Predicteions of E[y x] in equation (22) can be obtained by replacing the parameters by their estimates ŷ = Φ(x ˆβ/ˆσ)x ˆβ + ˆσφ(x ˆβ/ˆσ), (24) where Φ is the standard normal cumulative distribution function and φ the standard normal density function (derivative function of Φ). Exercise: Using R, plot the predicted values for working hours as a function of education (educ) when the other explanatory are set to their means (for a solution, see R snippet for Example 3 on the course home page).
39 Predicting with Tobit Regression Remark 1 In OLS the R-square is the correlation of the observed values with the predicted values. Using this practice, one can compute an R-square for a Tobit model as well. For the OLS solution, R 2 = Saving the R vglm results into an object (above wkh.tbt), the predicted values can be extracted with the fitted() function. In R S4 object the sub-objects are called slots. The observed dependent values are in i.e., in our case wkh.tbt@y. Thus, for the Tobit model command cor(wkh.tbt@y, fitted(wkh)) 2 produces R 2 = 0.261, which is close to that of OLS.
40 Checking Specification of Tobit Models 1 Background 2 Binary Dependent Variable The Linear Probability Model The Logit and Probit Model 3 Tobit Model Interpreting Tobit Estimates Predicting with Tobit Regression Checking Specification of Tobit Models
41 Checking Specification of Tobit Models If we introduce a dummy variable w = 0 when y = 0 and w = 1 if y > 0, then E[w x] = P[w = 1 x] = Φ(x β/σ) is the probit model. Accordingly, if the Tobit model holds, we can expect that the (scaled) Tobit slope estimate ˆβ j /ˆσ of x j should be fairly close to that of probit estimate ˆγ j. Comparing closeness of the slope coefficients can be used as an informal specification check of appropriateness of the Tobit model. ================================= Tobit/sigma Probit (Intercept): nwifeinc educ exper I(exper^2) age kidslt kidsge (Insignificant in both models) ================================= The (scaled) slope coefficients of the Tobit model are fairly close to those of the probit model, suggesting appropriateness of the Tobit model.
Regression with Qualitative Information. Part VI. Regression with Qualitative Information
Part VI Regression with Qualitative Information As of Oct 17, 2017 1 Regression with Qualitative Information Single Dummy Independent Variable Multiple Categories Ordinal Information Interaction Involving
More informationEconometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit
Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit R. G. Pierse 1 Introduction In lecture 5 of last semester s course, we looked at the reasons for including dichotomous variables
More informationEconometrics I Lecture 7: Dummy Variables
Econometrics I Lecture 7: Dummy Variables Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 27 Introduction Dummy variable: d i is a dummy variable
More informationModeling Binary Outcomes: Logit and Probit Models
Modeling Binary Outcomes: Logit and Probit Models Eric Zivot December 5, 2009 Motivating Example: Women s labor force participation y i = 1 if married woman is in labor force = 0 otherwise x i k 1 = observed
More informationEstimating the return to education for married women mroz.csv: 753 observations and 22 variables
Return to education Estimating the return to education for married women mroz.csv: 753 observations and 22 variables 1. inlf =1 if in labor force, 1975 2. hours hours worked, 1975 3. kidslt6 # kids < 6
More informationECON 482 / WH Hong Binary or Dummy Variables 1. Qualitative Information
1. Qualitative Information Qualitative Information Up to now, we assume that all the variables has quantitative meaning. But often in empirical work, we must incorporate qualitative factor into regression
More informationLinear Regression Models P8111
Linear Regression Models P8111 Lecture 25 Jeff Goldsmith April 26, 2016 1 of 37 Today s Lecture Logistic regression / GLMs Model framework Interpretation Estimation 2 of 37 Linear regression Course started
More information5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is
Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do
More informationECON 594: Lecture #6
ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was
More informationLogistic & Tobit Regression
Logistic & Tobit Regression Different Types of Regression Binary Regression (D) Logistic transformation + e P( y x) = 1 + e! " x! + " x " P( y x) % ln$ ' = ( + ) x # 1! P( y x) & logit of P(y x){ P(y
More informationBinary Dependent Variables
Binary Dependent Variables In some cases the outcome of interest rather than one of the right hand side variables - is discrete rather than continuous Binary Dependent Variables In some cases the outcome
More informationLinear Regression With Special Variables
Linear Regression With Special Variables Junhui Qian December 21, 2014 Outline Standardized Scores Quadratic Terms Interaction Terms Binary Explanatory Variables Binary Choice Models Standardized Scores:
More informationLecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions
Econ 513, USC, Department of Economics Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions I Introduction Here we look at a set of complications with the
More informationIntroduction to the Generalized Linear Model: Logistic regression and Poisson regression
Introduction to the Generalized Linear Model: Logistic regression and Poisson regression Statistical modelling: Theory and practice Gilles Guillot gigu@dtu.dk November 4, 2013 Gilles Guillot (gigu@dtu.dk)
More informationBMI 541/699 Lecture 22
BMI 541/699 Lecture 22 Where we are: 1. Introduction and Experimental Design 2. Exploratory Data Analysis 3. Probability 4. T-based methods for continous variables 5. Power and sample size for t-based
More informationST430 Exam 1 with Answers
ST430 Exam 1 with Answers Date: October 5, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textook are permitted but you may use a calculator.
More informationGibbs Sampling in Latent Variable Models #1
Gibbs Sampling in Latent Variable Models #1 Econ 690 Purdue University Outline 1 Data augmentation 2 Probit Model Probit Application A Panel Probit Panel Probit 3 The Tobit Model Example: Female Labor
More informationEconometrics II Tutorial Problems No. 1
Econometrics II Tutorial Problems No. 1 Lennart Hoogerheide & Agnieszka Borowska 15.02.2017 1 Summary Binary Response Model: A model for a binary (or dummy, i.e. with two possible outcomes 0 and 1) dependent
More informationA Generalized Linear Model for Binomial Response Data. Copyright c 2017 Dan Nettleton (Iowa State University) Statistics / 46
A Generalized Linear Model for Binomial Response Data Copyright c 2017 Dan Nettleton (Iowa State University) Statistics 510 1 / 46 Now suppose that instead of a Bernoulli response, we have a binomial response
More informationWeek 7: Binary Outcomes (Scott Long Chapter 3 Part 2)
Week 7: (Scott Long Chapter 3 Part 2) Tsun-Feng Chiang* *School of Economics, Henan University, Kaifeng, China April 29, 2014 1 / 38 ML Estimation for Probit and Logit ML Estimation for Probit and Logit
More informationModel Specification and Data Problems. Part VIII
Part VIII Model Specification and Data Problems As of Oct 24, 2017 1 Model Specification and Data Problems RESET test Non-nested alternatives Outliers A functional form misspecification generally means
More informationClassification. Chapter Introduction. 6.2 The Bayes classifier
Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode
More informationStatistics 203 Introduction to Regression Models and ANOVA Practice Exam
Statistics 203 Introduction to Regression Models and ANOVA Practice Exam Prof. J. Taylor You may use your 4 single-sided pages of notes This exam is 7 pages long. There are 4 questions, first 3 worth 10
More informationSCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models
SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION
More informationChapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a
Chapter 9 Regression with a Binary Dependent Variable Multiple Choice ) The binary dependent variable model is an example of a a. regression model, which has as a regressor, among others, a binary variable.
More informationLogistic regression. 11 Nov Logistic regression (EPFL) Applied Statistics 11 Nov / 20
Logistic regression 11 Nov 2010 Logistic regression (EPFL) Applied Statistics 11 Nov 2010 1 / 20 Modeling overview Want to capture important features of the relationship between a (set of) variable(s)
More informationAnalysing categorical data using logit models
Analysing categorical data using logit models Graeme Hutcheson, University of Manchester The lecture notes, exercises and data sets associated with this course are available for download from: www.research-training.net/manchester
More information9 Generalized Linear Models
9 Generalized Linear Models The Generalized Linear Model (GLM) is a model which has been built to include a wide range of different models you already know, e.g. ANOVA and multiple linear regression models
More informationGeneralized linear models
Generalized linear models Douglas Bates November 01, 2010 Contents 1 Definition 1 2 Links 2 3 Estimating parameters 5 4 Example 6 5 Model building 8 6 Conclusions 8 7 Summary 9 1 Generalized Linear Models
More information12 Modelling Binomial Response Data
c 2005, Anthony C. Brooms Statistical Modelling and Data Analysis 12 Modelling Binomial Response Data 12.1 Examples of Binary Response Data Binary response data arise when an observation on an individual
More informationExam Applied Statistical Regression. Good Luck!
Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.
More informationCategorical Predictor Variables
Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively
More informationApplied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid
Applied Economics Regression with a Binary Dependent Variable Department of Economics Universidad Carlos III de Madrid See Stock and Watson (chapter 11) 1 / 28 Binary Dependent Variables: What is Different?
More informationCHAPTER 7. + ˆ δ. (1 nopc) + ˆ β1. =.157, so the new intercept is = The coefficient on nopc is.157.
CHAPTER 7 SOLUTIONS TO PROBLEMS 7. (i) The coefficient on male is 87.75, so a man is estimated to sleep almost one and one-half hours more per week than a comparable woman. Further, t male = 87.75/34.33
More informationCourse Econometrics I
Course Econometrics I 3. Multiple Regression Analysis: Binary Variables Martin Halla Johannes Kepler University of Linz Department of Economics Last update: April 29, 2014 Martin Halla CS Econometrics
More informationLogistic Regressions. Stat 430
Logistic Regressions Stat 430 Final Project Final Project is, again, team based You will decide on a project - only constraint is: you are supposed to use techniques for a solution that are related to
More informationSTA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).
STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) T In 2 2 tables, statistical independence is equivalent to a population
More informationGeneralized Linear Models. stat 557 Heike Hofmann
Generalized Linear Models stat 557 Heike Hofmann Outline Intro to GLM Exponential Family Likelihood Equations GLM for Binomial Response Generalized Linear Models Three components: random, systematic, link
More informationApplied Health Economics (for B.Sc.)
Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative
More informationTreatment Effects with Normal Disturbances in sampleselection Package
Treatment Effects with Normal Disturbances in sampleselection Package Ott Toomet University of Washington December 7, 017 1 The Problem Recent decades have seen a surge in interest for evidence-based policy-making.
More informationMultiple Regression Analysis. Part III. Multiple Regression Analysis
Part III Multiple Regression Analysis As of Sep 26, 2017 1 Multiple Regression Analysis Estimation Matrix form Goodness-of-Fit R-square Adjusted R-square Expected values of the OLS estimators Irrelevant
More informationLogistic Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University
Logistic Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Logistic Regression 1 / 38 Logistic Regression 1 Introduction
More informationTento projekt je spolufinancován Evropským sociálním fondem a Státním rozpočtem ČR InoBio CZ.1.07/2.2.00/
Tento projekt je spolufinancován Evropským sociálním fondem a Státním rozpočtem ČR InoBio CZ.1.07/2.2.00/28.0018 Statistical Analysis in Ecology using R Linear Models/GLM Ing. Daniel Volařík, Ph.D. 13.
More informationLab 10 - Binary Variables
Lab 10 - Binary Variables Spring 2017 Contents 1 Introduction 1 2 SLR on a Dummy 2 3 MLR with binary independent variables 3 3.1 MLR with a Dummy: different intercepts, same slope................. 4 3.2
More informationEconomics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation
University of Illinois Fall 2016 Department of Economics Roger Koenker Economics 536 Lecture 21 Counts, Tobit, Sample Selection, and Truncation The simplest of this general class of models is Tobin s (1958)
More informationControl Function and Related Methods: Nonlinear Models
Control Function and Related Methods: Nonlinear Models Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. General Approach 2. Nonlinear
More informationMatched Pair Data. Stat 557 Heike Hofmann
Matched Pair Data Stat 557 Heike Hofmann Outline Marginal Homogeneity - review Binary Response with covariates Ordinal response Symmetric Models Subject-specific vs Marginal Model conditional logistic
More information7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis
Lecture 6: Logistic Regression Analysis Christopher S. Hollenbeak, PhD Jane R. Schubart, PhD The Outcomes Research Toolbox Review Homework 2 Overview Logistic regression model conceptually Logistic regression
More informationMaking sense of Econometrics: Basics
Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/
More informationStandard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j
Standard Errors & Confidence Intervals β β asy N(0, I( β) 1 ), where I( β) = [ 2 l(β, φ; y) ] β i β β= β j We can obtain asymptotic 100(1 α)% confidence intervals for β j using: β j ± Z 1 α/2 se( β j )
More informationLimited Dependent Variables and Panel Data
Limited Dependent Variables and Panel Data Logit, Probit and Friends Benjamin Bittschi Sebastian Koch Outline Binary dependent variables Logit Fixed Effects Models Probit Random Effects Models Censored
More informationLatent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent
Latent Variable Models for Binary Data Suppose that for a given vector of explanatory variables x, the latent variable, U, has a continuous cumulative distribution function F (u; x) and that the binary
More informationSingle-level Models for Binary Responses
Single-level Models for Binary Responses Distribution of Binary Data y i response for individual i (i = 1,..., n), coded 0 or 1 Denote by r the number in the sample with y = 1 Mean and variance E(y) =
More informationNon-linear panel data modeling
Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1
More informationGeneralized linear models
Generalized linear models Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Generalized linear models Boston College, Spring 2016 1 / 1 Introduction
More informationLogistic Regression 21/05
Logistic Regression 21/05 Recall that we are trying to solve a classification problem in which features x i can be continuous or discrete (coded as 0/1) and the response y is discrete (0/1). Logistic regression
More informationGeneralized linear models for binary data. A better graphical exploratory data analysis. The simple linear logistic regression model
Stat 3302 (Spring 2017) Peter F. Craigmile Simple linear logistic regression (part 1) [Dobson and Barnett, 2008, Sections 7.1 7.3] Generalized linear models for binary data Beetles dose-response example
More informationToday. HW 1: due February 4, pm. Aspects of Design CD Chapter 2. Continue with Chapter 2 of ELM. In the News:
Today HW 1: due February 4, 11.59 pm. Aspects of Design CD Chapter 2 Continue with Chapter 2 of ELM In the News: STA 2201: Applied Statistics II January 14, 2015 1/35 Recap: data on proportions data: y
More informationLecture 10: Alternatives to OLS with limited dependent variables. PEA vs APE Logit/Probit Poisson
Lecture 10: Alternatives to OLS with limited dependent variables PEA vs APE Logit/Probit Poisson PEA vs APE PEA: partial effect at the average The effect of some x on y for a hypothetical case with sample
More informationMultiple Linear Regression
Multiple Linear Regression ST 430/514 Recall: a regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates).
More informationData-analysis and Retrieval Ordinal Classification
Data-analysis and Retrieval Ordinal Classification Ad Feelders Universiteit Utrecht Data-analysis and Retrieval 1 / 30 Strongly disagree Ordinal Classification 1 2 3 4 5 0% (0) 10.5% (2) 21.1% (4) 42.1%
More informationLecture 3.1 Basic Logistic LDA
y Lecture.1 Basic Logistic LDA 0.2.4.6.8 1 Outline Quick Refresher on Ordinary Logistic Regression and Stata Women s employment example Cross-Over Trial LDA Example -100-50 0 50 100 -- Longitudinal Data
More informationLecture notes to Chapter 11, Regression with binary dependent variables - probit and logit regression
Lecture notes to Chapter 11, Regression with binary dependent variables - probit and logit regression Tore Schweder October 28, 2011 Outline Examples of binary respons variables Probit and logit - examples
More informationFinal Exam. Name: Solution:
Final Exam. Name: Instructions. Answer all questions on the exam. Open books, open notes, but no electronic devices. The first 13 problems are worth 5 points each. The rest are worth 1 point each. HW1.
More informationThe general linear regression with k explanatory variables is just an extension of the simple regression as follows
3. Multiple Regression Analysis The general linear regression with k explanatory variables is just an extension of the simple regression as follows (1) y i = β 0 + β 1 x i1 + + β k x ik + u i. Because
More informationExercise 5.4 Solution
Exercise 5.4 Solution Niels Richard Hansen University of Copenhagen May 7, 2010 1 5.4(a) > leukemia
More informationSTA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).
STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) (b) (c) (d) (e) In 2 2 tables, statistical independence is equivalent
More informationEconometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018
Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate
More informationProblem set - Selection and Diff-in-Diff
Problem set - Selection and Diff-in-Diff 1. You want to model the wage equation for women You consider estimating the model: ln wage = α + β 1 educ + β 2 exper + β 3 exper 2 + ɛ (1) Read the data into
More informationNinth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"
Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric
More informationDensity Temp vs Ratio. temp
Temp Ratio Density 0.00 0.02 0.04 0.06 0.08 0.10 0.12 Density 0.0 0.2 0.4 0.6 0.8 1.0 1. (a) 170 175 180 185 temp 1.0 1.5 2.0 2.5 3.0 ratio The histogram shows that the temperature measures have two peaks,
More informationClass Notes: Week 8. Probit versus Logit Link Functions and Count Data
Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While
More informationSTA 450/4000 S: January
STA 450/4000 S: January 6 005 Notes Friday tutorial on R programming reminder office hours on - F; -4 R The book Modern Applied Statistics with S by Venables and Ripley is very useful. Make sure you have
More informationMcGill University. Faculty of Science. Department of Mathematics and Statistics. Statistics Part A Comprehensive Exam Methodology Paper
Student Name: ID: McGill University Faculty of Science Department of Mathematics and Statistics Statistics Part A Comprehensive Exam Methodology Paper Date: Friday, May 13, 2016 Time: 13:00 17:00 Instructions
More informationECON 5350 Class Notes Functional Form and Structural Change
ECON 5350 Class Notes Functional Form and Structural Change 1 Introduction Although OLS is considered a linear estimator, it does not mean that the relationship between Y and X needs to be linear. In this
More informationST430 Exam 2 Solutions
ST430 Exam 2 Solutions Date: November 9, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textbook are permitted but you may use a calculator. Giving
More informationTruncation and Censoring
Truncation and Censoring Laura Magazzini laura.magazzini@univr.it Laura Magazzini (@univr.it) Truncation and Censoring 1 / 35 Truncation and censoring Truncation: sample data are drawn from a subset of
More informationIntroduction to General and Generalized Linear Models
Introduction to General and Generalized Linear Models Generalized Linear Models - part III Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs.
More informationIntermediate Econometrics
Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the
More informationAdministration. Homework 1 on web page, due Feb 11 NSERC summer undergraduate award applications due Feb 5 Some helpful books
STA 44/04 Jan 6, 00 / 5 Administration Homework on web page, due Feb NSERC summer undergraduate award applications due Feb 5 Some helpful books STA 44/04 Jan 6, 00... administration / 5 STA 44/04 Jan 6,
More informationFormulary Applied Econometrics
Department of Economics Formulary Applied Econometrics c c Seminar of Statistics University of Fribourg Formulary Applied Econometrics 1 Rescaling With y = cy we have: ˆβ = cˆβ With x = Cx we have: ˆβ
More informationLecture 14: Introduction to Poisson Regression
Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why
More informationModelling counts. Lecture 14: Introduction to Poisson Regression. Overview
Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week
More information2. We care about proportion for categorical variable, but average for numerical one.
Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is
More informationReview of Multinomial Distribution If n trials are performed: in each trial there are J > 2 possible outcomes (categories) Multicategory Logit Models
Chapter 6 Multicategory Logit Models Response Y has J > 2 categories. Extensions of logistic regression for nominal and ordinal Y assume a multinomial distribution for Y. 6.1 Logit Models for Nominal Responses
More informationWould you have survived the sinking of the Titanic? Felix Pretis (Oxford) Econometrics Oxford University, / 38
Would you have survived the sinking of the Titanic? Felix Pretis (Oxford) Econometrics Oxford University, 2016 1 / 38 Introduction Econometrics: Computer Modelling Felix Pretis Programme for Economic Modelling
More informationNATIONAL UNIVERSITY OF SINGAPORE EXAMINATION. ST3241 Categorical Data Analysis. (Semester II: ) April/May, 2011 Time Allowed : 2 Hours
NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION Categorical Data Analysis (Semester II: 2010 2011) April/May, 2011 Time Allowed : 2 Hours Matriculation No: Seat No: Grade Table Question 1 2 3 4 5 6 Full marks
More informationOutline of GLMs. Definitions
Outline of GLMs Definitions This is a short outline of GLM details, adapted from the book Nonparametric Regression and Generalized Linear Models, by Green and Silverman. The responses Y i have density
More informationStatistical Inference. Part IV. Statistical Inference
Part IV Statistical Inference As of Oct 5, 2017 Sampling Distributions of the OLS Estimator 1 Statistical Inference Sampling Distributions of the OLS Estimator Testing Against One-Sided Alternatives Two-Sided
More informationSTA102 Class Notes Chapter Logistic Regression
STA0 Class Notes Chapter 0 0. Logistic Regression We continue to study the relationship between a response variable and one or more eplanatory variables. For SLR and MLR (Chapters 8 and 9), our response
More informationLog-linear Models for Contingency Tables
Log-linear Models for Contingency Tables Statistics 149 Spring 2006 Copyright 2006 by Mark E. Irwin Log-linear Models for Two-way Contingency Tables Example: Business Administration Majors and Gender A
More informationCourse Econometrics I
Course Econometrics I 4. Heteroskedasticity Martin Halla Johannes Kepler University of Linz Department of Economics Last update: May 6, 2014 Martin Halla CS Econometrics I 4 1/31 Our agenda for today Consequences
More informationGeneralized Linear Models
York SPIDA John Fox Notes Generalized Linear Models Copyright 2010 by John Fox Generalized Linear Models 1 1. Topics I The structure of generalized linear models I Poisson and other generalized linear
More informationBinary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment
BINARY CHOICE MODELS Y ( Y ) ( Y ) 1 with Pr = 1 = P = 0 with Pr = 0 = 1 P Examples: decision-making purchase of durable consumer products unemployment Estimation with OLS? Yi = Xiβ + εi Problems: nonsense
More informationi (x i x) 2 1 N i x i(y i y) Var(x) = P (x 1 x) Var(x)
ECO 6375 Prof Millimet Problem Set #2: Answer Key Stata problem 2 Q 3 Q (a) The sample average of the individual-specific marginal effects is 0039 for educw and -0054 for white Thus, on average, an extra
More information22s:152 Applied Linear Regression. Example: Study on lead levels in children. Ch. 14 (sec. 1) and Ch. 15 (sec. 1 & 4): Logistic Regression
22s:52 Applied Linear Regression Ch. 4 (sec. and Ch. 5 (sec. & 4: Logistic Regression Logistic Regression When the response variable is a binary variable, such as 0 or live or die fail or succeed then
More informationExercise sheet 6 Models with endogenous explanatory variables
Exercise sheet 6 Models with endogenous explanatory variables Note: Some of the exercises include estimations and references to the data files. Use these to compare them to the results you obtained with
More informationRegression Methods for Survey Data
Regression Methods for Survey Data Professor Ron Fricker! Naval Postgraduate School! Monterey, California! 3/26/13 Reading:! Lohr chapter 11! 1 Goals for this Lecture! Linear regression! Review of linear
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science
UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator
More informationApplied Regression Analysis
Applied Regression Analysis Chapter 3 Multiple Linear Regression Hongcheng Li April, 6, 2013 Recall simple linear regression 1 Recall simple linear regression 2 Parameter Estimation 3 Interpretations of
More information