Econ 371 Problem Set #6 Answer Sheet

10.1 This question focuses on the regression model results in Table […].

a. The first part of this question asks you to predict the number of lives that would be saved in New Jersey if the tax on a case of beer were increased by $1. With a $1 increase in the beer tax, the expected number of lives saved is 0.45 per 10,000 people. Since New Jersey has a population of 8.1 million (810 units of 10,000 people), the expected number of lives saved is 0.45 × 810 = 364.5. The 95% confidence interval is (0.45 ± 1.96 × 0.22) × 810 = [15.228, 713.77].

b. When New Jersey lowers its drinking age from 21 to 18, the expected fatality rate increases by […] deaths per 10,000. The 95% confidence interval for the change in death rate is […]. With a population of 8.1 million, the number of fatalities will increase by […], with a 95% confidence interval of […] × 810 = […].

c. When real income per capita in New Jersey increases by 1%, the expected fatality rate increases by 1.81 deaths per 10,000. The 90% confidence interval for the change in death rate is 1.81 ± 0.77 = [1.04, 2.58]. With a population of 8.1 million, the number of fatalities will increase by 1.81 × 810 ≈ 1,466, with a 90% confidence interval of [1.04, 2.58] × 810 = [840, 2092].

d. The low p-value (or high F-statistic) associated with the F-test of the hypothesis that the time effects are zero suggests that the time effects should be included in the regression.

e. The difference in the significance levels arises primarily because the estimated coefficient is higher in (5) than in (4). However, (5) leaves out two variables (unemployment rate and real income per capita) that are statistically significant. Thus, the estimated coefficient on Beer Tax in (5) may suffer from omitted variable bias. The results from (4) seem more reliable. In general, statistical significance should be used to measure reliability only if the regression is well specified (no important omitted variable bias, correct functional form, no simultaneous causality or selection bias, and so forth).

f. In this case, you would define a binary variable west, which equals 1 for the western states and 0 for the other states, and then include the interaction term between west and the unemployment rate (i.e., west × unemployment rate) in the regression equation corresponding to column (4). Suppose the coefficient on the unemployment rate is \(\beta\) and the coefficient on west × unemployment rate is \(\gamma\). Then \(\beta\) captures the effect of the unemployment rate in the eastern states, and \(\beta + \gamma\) captures the effect of the unemployment rate in the western states. The difference in the effect of the unemployment rate between the western and eastern states is \(\gamma\). Using the coefficient estimate \(\hat{\gamma}\) and its standard error \(SE(\hat{\gamma})\), you can calculate the t-statistic to test whether \(\gamma\) is statistically significant at a given significance level.

This question focuses on the regression model described in equation (10.11). You are asked to describe the slope and intercept for different entities and time periods. Notice that in this model neither the slope nor the intercept changes across time periods (i.e., this model does not have time fixed effects). The only thing that varies is the intercept, which differs across entities.

a. For Entity 1 in time Period 1, we have \(D2_1 = \cdots = Dn_1 = 0\), so that the model reduces to
\[ Y_{11} = \beta_0 + \beta_1 X_{11} + u_{11} \tag{1} \]
with an intercept of \(\beta_0\) and a slope of \(\beta_1\).

b. For Entity 1 in time Period 3, we still have \(D2_1 = \cdots = Dn_1 = 0\), so that the model reduces to
\[ Y_{13} = \beta_0 + \beta_1 X_{13} + u_{13} \tag{2} \]
with an intercept of \(\beta_0\) and a slope of \(\beta_1\).

c. For Entity 3 in time Period 1, we have \(D2_3 = D4_3 = \cdots = Dn_3 = 0\) and \(D3_3 = 1\), so that the model reduces to
\[ Y_{31} = \beta_0 + \gamma_3 + \beta_1 X_{31} + u_{31} \tag{3} \]
with an intercept of \(\beta_0 + \gamma_3\) and a slope of \(\beta_1\).

d. For Entity 3 in time Period 3, we still have \(D2_3 = D4_3 = \cdots = Dn_3 = 0\) and \(D3_3 = 1\), so that the model reduces to
\[ Y_{33} = \beta_0 + \gamma_3 + \beta_1 X_{33} + u_{33} \tag{4} \]
with an intercept of \(\beta_0 + \gamma_3\) and a slope of \(\beta_1\).

In this question, you are asked to comment on competing methods for estimating the effect of snow on traffic fatalities.

a. The first method adds a regressor containing the average snowfall for each state (\(AverageSnow_i\)). The problem with this regressor is that average snowfall does not vary over time, so it is perfectly collinear with the state fixed effects.

b. In the second approach, snowfall in each state and each year is used as a regressor. Since \(Snow_{it}\) does vary over time, this method can be used along with state fixed effects.

This question focuses on the estimated probit model results in equation (11.8).

a. You are first asked what the loan denial probability would be for a black applicant with a P/I ratio of 0.35. In this case, we have
\[ \Pr[y_i = 1 \mid \text{P/I ratio} = 0.35, \text{black} = 1] = \Phi(-0.59) = 27.76\%. \]

b. Now you are asked how this probability changes if the P/I ratio is reduced to 0.30. Now we have
\[ \Pr[y_i = 1 \mid \text{P/I ratio} = 0.30, \text{black} = 1] = \Phi(-0.73) = 23.27\%. \]
The denial probability is 4.4 percentage points lower than in (a).

c. In part (c), you are asked to repeat this exercise for a white loan applicant. In this case we have
\[ \Pr[y_i = 1 \mid \text{P/I ratio} = 0.35, \text{black} = 0] = \Phi(\,\cdot\,) = 9.7\% \tag{5} \]
and
\[ \Pr[y_i = 1 \mid \text{P/I ratio} = 0.30, \text{black} = 0] = \Phi(\,\cdot\,) = 7.5\% \tag{6} \]
so that the change is only 2.2 percentage points.

d. Finally, you are asked whether the marginal effect of the P/I ratio on the probability of mortgage denial depends on race. The results in parts (a)-(c) show that it does. In the probit functional form, the marginal effect depends on the level of the probability, which in turn depends on the race of the applicant.
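The probit probabilities in parts (a) to (c) above can be checked with a few lines of code. The sketch below is written in Python rather than Stata so that it runs without any data set. The three coefficients are an assumption: they are not printed in the text above, and are the values recalled for equation (11.8) that reproduce the index values (about -0.59 and -0.73) and the percentages reported in the answers. The function name is hypothetical.

```python
from statistics import NormalDist

# ASSUMED coefficients for the probit in equation (11.8); they do not appear
# in the answer sheet text and are used here only because they reproduce
# the reported index values and probabilities.
INTERCEPT = -2.26
B_PI_RATIO = 2.74
B_BLACK = 0.71


def probit_denial_probability(pi_ratio: float, black: int) -> float:
    """Return Phi(index), the predicted probability of loan denial."""
    index = INTERCEPT + B_PI_RATIO * pi_ratio + B_BLACK * black
    return NormalDist().cdf(index)  # standard normal CDF


if __name__ == "__main__":
    for black in (1, 0):
        p_high = probit_denial_probability(0.35, black)
        p_low = probit_denial_probability(0.30, black)
        print(
            f"black={black}: P(0.35)={p_high:.4f}  "
            f"P(0.30)={p_low:.4f}  change={p_high - p_low:.4f}"
        )
```

Because the index is not rounded to two decimals before applying the normal CDF, the printed probabilities can differ from the answers in the third or fourth decimal place. The change for the black applicant is roughly twice the change for the white applicant, which is the point made in part (d).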
In the probit model of equation (11.8), the coefficient on black is statistically significant at the 1% level.

This question asks you to repeat the previous question, now using the logit model results in equation (11.10). In this case, we have:
\[ \Pr[y_i = 1 \mid \text{P/I ratio} = 0.35, \text{black} = 1] = \Lambda(\,\cdot\,) = 27.28\% \tag{7} \]
\[ \Pr[y_i = 1 \mid \text{P/I ratio} = 0.30, \text{black} = 1] = \Lambda(\,\cdot\,) = 22.29\% \tag{8} \]
The denial probability in (8) is 4.99 percentage points lower than in (7).
\[ \Pr[y_i = 1 \mid \text{P/I ratio} = 0.35, \text{black} = 0] = \Lambda(\,\cdot\,) = 9.53\% \tag{9} \]
\[ \Pr[y_i = 1 \mid \text{P/I ratio} = 0.30, \text{black} = 0] = \Lambda(\,\cdot\,) = 7.45\% \tag{10} \]
so that the change is only 2.08 percentage points. The logit and probit results are similar.

The two empirical exercises in this homework use the same dataset: Smoking. The data can be downloaded from the Web site listed in the assignment (which you can also reach from the class website). A program that carries out all of the tasks for problems E11.1 and E11.2 is appended to this answer sheet.

E11.1 This first question asks you to estimate various linear probability models for the smoking data set.

a. This first question can be answered using the summarize command and the fact that \(SE(\hat{p}) = \hat{\sigma}_Y / \sqrt{N}\). Specifically, we have the following estimates of the probability of smoking (mean of smoker):

    Group             p̂       SE(p̂)
    All workers       […]     […]
    No smoking ban    […]     […]
    Smoking ban       […]     […]

b. This question asks you to determine whether the workplace smoking ban alters the probability of smoking, using a linear probability model (LPM). The LPM yields

    Variable          β̂       SE(β̂)
    Intercept         […]     […]
    Smoking Ban       […]     […]

The resulting t-statistic on the smoking ban dummy variable is 8.66, so the coefficient is statistically significant. Notice that the intercept is the same as \(\hat{p}\) in part (a) for workers without a smoking ban.

c. In this question, you are asked to estimate a more general LPM that includes a wide variety of variables, and to compare the estimated impact of a smoking ban in this case. The resulting regression parameter estimates are:

    Variable          β̂       SE(β̂)
    Intercept         […]     […]
    Smoking Ban       […]     […]
    female            […]     […]
    age               […]     […]
    age²              […]     […]
    hsdrop            […]     […]
    hsgrad            […]     […]
    colsome           […]     […]
    colgrad           […]     […]
    black             […]     […]
    hispanic          […]     […]

From the model in (c), the estimated difference is […], smaller than the effect in model (b). Evidently (b) suffers from omitted variable bias. That is, smkban may be correlated with the education/race/gender indicators or with age. For example, workers with a college degree are more likely to work in an office with a smoking ban than high-school dropouts, and college graduates are less likely to smoke than high-school dropouts.

d. The t-statistic is -5.27, so the coefficient is statistically significant at the 1% level.

e. The F-statistic (140.09) has a p-value below 0.01, so the education coefficients are jointly significant at the 1% level. The omitted education status is master's degree or higher. Thus the coefficients show the increase in probability relative to someone with a postgraduate degree. For example, the coefficient on Colgrad is 0.045, so the probability of smoking for a college graduate is 0.045 (4.5 percentage points) higher than for someone with a postgraduate degree. Similarly, the coefficient on HSdrop is 0.323, so the probability of smoking for a high-school dropout is 0.323 (32.3 percentage points) higher than for someone with a postgraduate degree. Because the coefficients are all positive and get smaller as educational attainment increases, the probability of smoking falls as educational attainment increases.

E11.2 This question continues the analysis of the smoking data set, now focusing on estimating probit models.

a. This first question asks you to use the same variables as in E11.1(c), but this time in a probit model. The resulting parameter estimates are:

    Variable          β̂       SE(β̂)
    Intercept         […]     […]
    Smoking Ban       […]     […]
    female            […]     […]
    age               […]     […]
    age²              […]     […]
    hsdrop            […]     […]
    hsgrad            […]     […]
    colsome           […]     […]
    colgrad           […]     […]
    black             […]     […]
    hispanic          […]     […]

b. The t-statistic is -5.47, very similar to the value for the linear probability model. Again, we reject the hypothesis that the coefficient on smkban is zero.

c. The F-statistic (now […]) is significant at the 1% level, as in the linear probability model.

d. In this case, you are asked to compute the smoking probability for Mr. A with and without a smoking ban in place and to compute the effect of the smoking ban. Evaluating the probit index at age 20, we have
\[ \Pr[y_i = 1 \mid \text{Mr. A, no ban}] = \Phi(-0.090) \]
\[ \Pr[y_i = 1 \mid \text{Mr. A, ban}] = \Phi(-0.249) \]
Therefore the workplace ban would reduce the probability of smoking by 0.062 (6.2 percentage points).

e. This question asks you to repeat your calculations using Ms. B, who is female, black, 40 years old, and a college graduate. Evaluating the probit index at age 40, we get:
\[ \Pr[y_i = 1 \mid \text{Ms. B, no ban}] = \Phi(-1.064) \]
\[ \Pr[y_i = 1 \mid \text{Ms. B, ban}] = \Phi(-1.222) \]
Therefore the workplace ban would reduce the probability of smoking by 0.034 (3.4 percentage points).
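The last step of E11.2(d) and (e), turning a probit index into a probability and differencing, can be sketched as follows. This is a Python illustration, not part of the appended Stata program. It uses only the four index values quoted in the answers, with their minus signs restored on the grounds that the ban lowers the probability. The dictionary and function names are hypothetical.

```python
from statistics import NormalDist

# Probit index values quoted in E11.2(d) and (e); the signs are taken to be
# negative because the ban must lower the predicted probability.
INDEX = {
    "Mr. A": {"no_ban": -0.090, "ban": -0.249},
    "Ms. B": {"no_ban": -1.064, "ban": -1.222},
}


def ban_effect(no_ban_index: float, ban_index: float) -> float:
    """Change in the smoking probability when the ban is introduced."""
    phi = NormalDist().cdf  # standard normal CDF
    return phi(ban_index) - phi(no_ban_index)


if __name__ == "__main__":
    for person, z in INDEX.items():
        effect = ban_effect(z["no_ban"], z["ban"])
        print(f"{person}: change in probability = {effect:.4f}")
```

With these rounded index values the effect is about -0.062 for Mr. A and about -0.033 for Ms. B. The second figure is slightly smaller in magnitude than the 3.4 percentage points reported above, a gap that may come from rounding the index to three decimals. The same shift in the index (the smkban coefficient) moves the probability less for Ms. B because her baseline probability lies further into the tail of the normal distribution.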

    * Problem Set #6
    #delimit ;
    clear;
    cap log close;

    * Specify the output file;
    log using Problemset6.log, replace;
    set more off;

    * Read in and summarize the data;
    use Smoking.dta;
    describe;
    summarize smoker;
    summarize smoker if smkban==0;
    summarize smoker if smkban==1;

    * Estimate the model for question E11.1b;
    reg smoker smkban, r;

    * Estimate the model for question E11.1c;
    generate age2 = age^2;
    reg smoker smkban female age age2 hsdrop hsgrad colsome colgrad black hispanic, r;
    test hsdrop hsgrad colsome colgrad;

    * Estimate the model for question E11.2a;
    probit smoker smkban female age age2 hsdrop hsgrad colsome colgrad black hispanic, r;
    test hsdrop hsgrad colsome colgrad;

    * Probit index (A1, A2) and probability (PA1, PA2) for Mr. A, without and with a ban;
    scalar A1 = (_b[_cons] + _b[age]*20 + _b[age2]*(20^2) + _b[hsdrop]);
    scalar A2 = (_b[_cons] + _b[age]*20 + _b[age2]*(20^2) + _b[hsdrop] + _b[smkban]);
    scalar PA1 = normal(A1);
    scalar PA2 = normal(A2);

    * Probit index (B1, B2) and probability (PB1, PB2) for Ms. B, without and with a ban;
    scalar B1 = (_b[_cons] + _b[age]*40 + _b[age2]*(40^2) + _b[colgrad]
        + _b[female] + _b[black]);
    scalar B2 = (_b[_cons] + _b[age]*40 + _b[age2]*(40^2) + _b[colgrad]
        + _b[female] + _b[black] + _b[smkban]);
    scalar PB1 = normal(B1);
    scalar PB2 = normal(B2);

    scalar list;
    log close;
    clear;
    exit;

    Problemset6.log

          log:  C:\Documents and Settings\jaherrig\My Documents\Classes\Economics 371\Stata\Problemset6.log
     log type:  text
    opened on:  18 Nov 2008, 13:03:55

    . set more off;

    . * Read in and summarize the data;
    . use Smoking.dta;
    . describe;

    Contains data from Smoking.dta
      obs:        10,000
     vars:        […]                          […] Feb […] […]:44
     size:       140,000 (86.6% of memory free)

                  storage  display
    variable name   type   format   variable label
    smoker          byte   %8.0g    =1 if a current smoker
    smkban          byte   %9.0g    =1 if there is a work area smoking bans
    age             byte   %9.0g    age in years
    hsdrop          byte   %9.0g    =1 if hs dropout
    hsgrad          byte   %9.0g    =1 if hs grad
    colsome         byte   %9.0g    =1 if some college
    colgrad         byte   %9.0g    =1 if college grad
    black           byte   %9.0g    =1 if black
    hispanic        byte   %9.0g    =1 if hispanic
    female          byte   %9.0g    =1 if female
    Sorted by:

    . summarize smoker;
        Variable |  Obs   Mean   Std. Dev.   Min   Max
          smoker |

    . summarize smoker if smkban==0;
        Variable |  Obs   Mean   Std. Dev.   Min   Max
          smoker |

    . summarize smoker if smkban==1;
        Variable |  Obs   Mean   Std. Dev.   Min   Max
          smoker |

    . * Estimate the model for question E11.1b;
    . reg smoker smkban,r;

    Linear regression        Number of obs =
                             F(  1,  9998) =
                             Prob > F      =
                             R-squared     =
                             Root MSE      =

                 |          Robust
          smoker |  Coef.   Std. Err.   t   P>|t|   [95% Conf. Interval]
          smkban |
           _cons |

    . * Estimate the model for question E11.1c;
    . generate age2 = age^2;
    . reg smoker smkban female age age2 hsdrop hsgrad colsome colgrad black
    > hispanic,r;

    Linear regression        Number of obs =
                             F( 10,  9989) =
                             Prob > F      =
                             R-squared     =
                             Root MSE      =

                 |          Robust
          smoker |  Coef.   Std. Err.   t   P>|t|   [95% Conf. Interval]
          smkban |
          female |
             age |
            age2 |
          hsdrop |
          hsgrad |
         colsome |
         colgrad |
           black |
        hispanic |
           _cons |

    . test hsdrop hsgrad colsome colgrad;
     ( 1)  hsdrop = 0
     ( 2)  hsgrad = 0
     ( 3)  colsome = 0
     ( 4)  colgrad = 0
           F(  4,  9989) =
                Prob > F =

    . * Estimate the model for question E11.2a;
    . probit smoker smkban female age age2 hsdrop hsgrad colsome colgrad black
    > hispanic,r;

    Iteration 0:   log pseudolikelihood =
    Iteration 1:   log pseudolikelihood =
    Iteration 2:   log pseudolikelihood =
    Iteration 3:   log pseudolikelihood =

    Probit regression        Number of obs  =
                             Wald chi2(10)  =
                             Prob > chi2    =
    Log pseudolikelihood =   Pseudo R2      =

                 |          Robust
          smoker |  Coef.   Std. Err.   z   P>|z|   [95% Conf. Interval]
          smkban |
          female |
             age |
            age2 |
          hsdrop |
          hsgrad |
         colsome |
         colgrad |
           black |
        hispanic |
           _cons |

    . test hsdrop hsgrad colsome colgrad;
     ( 1)  hsdrop = 0
     ( 2)  hsgrad = 0
     ( 3)  colsome = 0
     ( 4)  colgrad = 0
               chi2(  4) =
             Prob > chi2 =

    . scalar A1 = (_b[_cons] + _b[age]*20 + _b[age2]*(20^2) + _b[hsdrop]);
    . scalar A2 = (_b[_cons] + _b[age]*20 + _b[age2]*(20^2) + _b[hsdrop]
    > + _b[smkban]);
    . scalar PA1 = normal(A1);
    . scalar PA2 = normal(A2);
    . scalar B1 = (_b[_cons] + _b[age]*40 + _b[age2]*(40^2) + _b[colgrad]
    > + _b[female] + _b[black]);
    . scalar B2 = (_b[_cons] + _b[age]*40 + _b[age2]*(40^2) + _b[colgrad]
    > + _b[female] + _b[black] + _b[smkban]);
    . scalar PB1 = normal(B1);
    . scalar PB2 = normal(B2);

    . scalar list;
           PB2 =
           PB1 =
            B2 =
            B1 =
           PA2 =
           PA1 =
            A2 =
            A1 =

    . log close;
          log:  C:\Documents and Settings\jaherrig\My Documents\Classes\Economics 371\Stata\Problemset6.log
     log type:  text
    closed on:  18 Nov 2008, 13:03:[…]

Econ 371 Problem Set #6 Answer Sheet In this first question, you are asked to consider the following equation:

Econ 371 Problem Set #6 Answer Sheet In this first question, you are asked to consider the following equation: Econ 37 Problem Set #6 Answer Sheet 0. In this first question, you are asked to consider the following equation: Y it = β 0 + β X it + β 3 S t + u it. () You are asked how you might time-demean the data

More information

Econometrics Problem Set 10

Econometrics Problem Set 10 Econometrics Problem Set 0 WISE, Xiamen University Spring 207 Conceptual Questions Dependent variable: P ass Probit Logit LPM Probit Logit LPM Probit () (2) (3) (4) (5) (6) (7) Experience 0.03 0.040 0.006

More information

Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 10

Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 10 Introduction to Econometrics (3 rd Updated Edition) by James H. Stock and Mark W. Watson Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 10 (This version July 20, 2014) Stock/Watson - Introduction

More information

ECON Introductory Econometrics. Lecture 11: Binary dependent variables

ECON Introductory Econometrics. Lecture 11: Binary dependent variables ECON4150 - Introductory Econometrics Lecture 11: Binary dependent variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 11 Lecture Outline 2 The linear probability model Nonlinear probability

More information

Chapter 11. Regression with a Binary Dependent Variable

Chapter 11. Regression with a Binary Dependent Variable Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score

More information

Panel Data. STAT-S-301 Exercise session 5. November 10th, vary across entities but not over time. could cause omitted variable bias if omitted

Panel Data. STAT-S-301 Exercise session 5. November 10th, vary across entities but not over time. could cause omitted variable bias if omitted Panel Data STAT-S-301 Exercise session 5 November 10th, 2016 Panel data consist of observations on the same n entities at two or mor time periods (T). If two variables Y, and X are observed, the data is

More information

Homework Solutions Applied Logistic Regression

Homework Solutions Applied Logistic Regression Homework Solutions Applied Logistic Regression WEEK 6 Exercise 1 From the ICU data, use as the outcome variable vital status (STA) and CPR prior to ICU admission (CPR) as a covariate. (a) Demonstrate that

More information

2. We care about proportion for categorical variable, but average for numerical one.

2. We care about proportion for categorical variable, but average for numerical one. Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is

More information

Binary Dependent Variable. Regression with a

Binary Dependent Variable. Regression with a Beykent University Faculty of Business and Economics Department of Economics Econometrics II Yrd.Doç.Dr. Özgür Ömer Ersin Regression with a Binary Dependent Variable (SW Chapter 11) SW Ch. 11 1/59 Regression

More information

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid Applied Economics Regression with a Binary Dependent Variable Department of Economics Universidad Carlos III de Madrid See Stock and Watson (chapter 11) 1 / 28 Binary Dependent Variables: What is Different?

More information

Binary Dependent Variables

Binary Dependent Variables Binary Dependent Variables In some cases the outcome of interest rather than one of the right hand side variables - is discrete rather than continuous Binary Dependent Variables In some cases the outcome

More information

Regression with a Binary Dependent Variable (SW Ch. 9)

Regression with a Binary Dependent Variable (SW Ch. 9) Regression with a Binary Dependent Variable (SW Ch. 9) So far the dependent variable (Y) has been continuous: district-wide average test score traffic fatality rate But we might want to understand the

More information

Exam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h.

Exam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h. Exam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h. This is an open book examination where all printed and written resources, in addition to a calculator, are allowed. If you are

More information

Empirical Application of Simple Regression (Chapter 2)

Empirical Application of Simple Regression (Chapter 2) Empirical Application of Simple Regression (Chapter 2) 1. The data file is House Data, which can be downloaded from my webpage. 2. Use stata menu File Import Excel Spreadsheet to read the data. Don t forget

More information

ECON3150/4150 Spring 2016

ECON3150/4150 Spring 2016 ECON3150/4150 Spring 2016 Lecture 6 Multiple regression model Siv-Elisabeth Skjelbred University of Oslo February 5th Last updated: February 3, 2016 1 / 49 Outline Multiple linear regression model and

More information

Problem Set 4 ANSWERS

Problem Set 4 ANSWERS Economics 20 Problem Set 4 ANSWERS Prof. Patricia M. Anderson 1. Suppose that our variable for consumption is measured with error, so cons = consumption + e 0, where e 0 is uncorrelated with inc, educ

More information

Sociology 362 Data Exercise 6 Logistic Regression 2

Sociology 362 Data Exercise 6 Logistic Regression 2 Sociology 362 Data Exercise 6 Logistic Regression 2 The questions below refer to the data and output beginning on the next page. Although the raw data are given there, you do not have to do any Stata runs

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 7 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 68 Outline of Lecture 7 1 Empirical example: Italian labor force

More information

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10) Name Economics 170 Spring 2004 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment for the

More information

Lab 07 Introduction to Econometrics

Lab 07 Introduction to Econometrics Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand

More information

raise Coef. Std. Err. z P> z [95% Conf. Interval]

raise Coef. Std. Err. z P> z [95% Conf. Interval] 1 We will use real-world data, but a very simple and naive model to keep the example easy to understand. What is interesting about the example is that the outcome of interest, perhaps the probability or

More information

ECON Introductory Econometrics. Lecture 6: OLS with Multiple Regressors

ECON Introductory Econometrics. Lecture 6: OLS with Multiple Regressors ECON4150 - Introductory Econometrics Lecture 6: OLS with Multiple Regressors Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 6 Lecture outline 2 Violation of first Least Squares assumption

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics STAT-S-301 Panel Data (2016/2017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 Regression with Panel Data A panel dataset contains observations on multiple entities

More information

Problem set - Selection and Diff-in-Diff

Problem set - Selection and Diff-in-Diff Problem set - Selection and Diff-in-Diff 1. You want to model the wage equation for women You consider estimating the model: ln wage = α + β 1 educ + β 2 exper + β 3 exper 2 + ɛ (1) Read the data into

More information

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do

More information

Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers. (1 θ) if y i = 0. which can be written in an analytically more convenient way as

Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers. (1 θ) if y i = 0. which can be written in an analytically more convenient way as Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers 1. Consider a binary random variable y i that describes a Bernoulli trial in which the probability of observing y i = 1 in any draw is given

More information

Practice 2SLS with Artificial Data Part 1

Practice 2SLS with Artificial Data Part 1 Practice 2SLS with Artificial Data Part 1 Yona Rubinstein July 2016 Yona Rubinstein (LSE) Practice 2SLS with Artificial Data Part 1 07/16 1 / 16 Practice with Artificial Data In this note we use artificial

More information

THE AUSTRALIAN NATIONAL UNIVERSITY. Second Semester Final Examination November, Econometrics II: Econometric Modelling (EMET 2008/6008)

THE AUSTRALIAN NATIONAL UNIVERSITY. Second Semester Final Examination November, Econometrics II: Econometric Modelling (EMET 2008/6008) THE AUSTRALIAN NATIONAL UNIVERSITY Second Semester Final Examination November, 2014 Econometrics II: Econometric Modelling (EMET 2008/6008) Reading Time: 5 Minutes Writing Time: 90 Minutes Permitted Materials:

More information

Categorical Predictor Variables

Categorical Predictor Variables Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively

More information

Empirical Application of Panel Data Regression

Empirical Application of Panel Data Regression Empirical Application of Panel Data Regression 1. We use Fatality data, and we are interested in whether rising beer tax rate can help lower traffic death. So the dependent variable is traffic death, while

More information

Introduction to Econometrics. Regression with Panel Data

Introduction to Econometrics. Regression with Panel Data Introduction to Econometrics The statistical analysis of economic (and related) data STATS301 Regression with Panel Data Titulaire: Christopher Bruffaerts Assistant: Lorenzo Ricci 1 Regression with Panel

More information

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory

More information

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2015-16 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This

More information

University of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points

University of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points EEP 118 / IAS 118 Elisabeth Sadoulet and Kelly Jones University of California at Berkeley Fall 2008 Introductory Applied Econometrics Final examination Scores add up to 125 points Your name: SID: 1 1.

More information

Practice exam questions

Practice exam questions Practice exam questions Nathaniel Higgins nhiggins@jhu.edu, nhiggins@ers.usda.gov 1. The following question is based on the model y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3 + u. Discuss the following two hypotheses.

More information

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like.

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like. Measurement Error Often a data set will contain imperfect measures of the data we would ideally like. Aggregate Data: (GDP, Consumption, Investment are only best guesses of theoretical counterparts and

More information

Regression #8: Loose Ends

Regression #8: Loose Ends Regression #8: Loose Ends Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #8 1 / 30 In this lecture we investigate a variety of topics that you are probably familiar with, but need to touch

More information

Heteroskedasticity Example

Heteroskedasticity Example ECON 761: Heteroskedasticity Example L Magee November, 2007 This example uses the fertility data set from assignment 2 The observations are based on the responses of 4361 women in Botswana s 1988 Demographic

More information

Econometrics Homework 4 Solutions

Econometrics Homework 4 Solutions Econometrics Homework 4 Solutions Computer Question (Optional, no need to hand in) (a) c i may capture some state-specific factor that contributes to higher or low rate of accident or fatality. For example,

More information

(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections

(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections Answer Key Fixed Effect and First Difference Models 1. See discussion in class.. David Neumark and William Wascher published a study in 199 of the effect of minimum wages on teenage employment using a

More information

Lab 11 - Heteroskedasticity

Lab 11 - Heteroskedasticity Lab 11 - Heteroskedasticity Spring 2017 Contents 1 Introduction 2 2 Heteroskedasticity 2 3 Addressing heteroskedasticity in Stata 3 4 Testing for heteroskedasticity 4 5 A simple example 5 1 1 Introduction

More information

Lecture#12. Instrumental variables regression Causal parameters III

Lecture#12. Instrumental variables regression Causal parameters III Lecture#12 Instrumental variables regression Causal parameters III 1 Demand experiment, market data analysis & simultaneous causality 2 Simultaneous causality Your task is to estimate the demand function

More information

Econometrics Problem Set 4

Econometrics Problem Set 4 Econometrics Problem Set 4 WISE, Xiamen University Spring 2016-17 Conceptual Questions 1. This question refers to the estimated regressions in shown in Table 1 computed using data for 1988 from the CPS.

More information
