Econ 371 Problem Set #6 Answer Sheet In this first question, you are asked to consider the following equation:
|
|
- Candace Long
- 5 years ago
- Views:
Transcription
1 Econ 37 Problem Set #6 Answer Sheet 0. In this first question, you are asked to consider the following equation: Y it = β 0 + β X it + β 3 S t + u it. () You are asked how you might time-demean the data so as to estimate β. To answer this question, let s start much like when we had entity-effects by gathering the nuisance variable S t with the intercept; i.e., by rewriting () as: Y it = β 0 + β 3 S t + β X it + u it (2) = φ t + β X it + u it. (3) where φ t β 0 + β 3 S t becomes our time-specific intercept term. We can then consider averaging the equation over all individuals in given time period, yielding: Ȳ t n = n = n Y it (φ t + β X it + u it ) (φ t ) + n = φ t + β Xt + ū t (β X it ) + n (u it ) where X t n n (X it) and ū t n n (u it). Subtracting the above equation from equation (3) yields: Ỹ it Y it Ȳt (4) = [φ t + β X it + u it ] [ φ t + β Xt + ū t ] (5) = β [ Xit X t ] + [uit ū t ] (6) = β Xit + ũ it (7) where X it X it X t and ũ it u it ū t. Notice that equation (7) no longer contains the time-specific intercepts. We have time-demeaned the data. We can then estimate β by simply estimating the model Ỹ it = β Xit + ũ it. 0.2 The second question asks that you again consider a model with only time effects (as in equation ()). The question notes that the transition from equation () above to equation (3). Finally, the question suggests that the equation can also be written in the form: Y it = β 0 + δ 2 B2 t + + δ T BT t + β X it + u it (8) where Bs t = if t = s and = 0 otherwise. You are asked to determine the relationship between the parameters in equation (3) and in equation (8). One way to see the relationships between the parameters is to note that we can use the Bs ts to write equation (3) as: Y it = φ B t + φ 2 B2 t + + φ T BT t + β X it + u it (9) However, since all the Bs ts together sum to, we have that B t = ( B2 t BT t ). Substituting this into equation (9) gives us: Y it = φ ( B2 t BT t ) + φ 2 B2 t + + φ T BT t + β X it + u it (0) = φ φ B2 t φ BT t + φ 2 B2 t + + φ T BT t + β X it + u it () = φ + (φ 2 φ )B2 t + + (φ T φ )BT t + β X it + u it. (2)
2 Comparing equations (2) and (8), we can see that: β 0 = φ and δ t = (φ t φ ) for t = 2,..., T. In words, β 0 is just the time-specific intercept for period. The δ t s measure the difference between the time-specific intercept for period t and the time specific intercept for period. Finally, the problem with adding a δ B t term to equation (8) is that we would have perfect multicollinearity between the intercept and the Bs ts (since they sum to one). The OLS estimates could not be computed in this case. Dropping the overall intercept would solve this problem. 0.3 In this next question, you are asked to consider a simple panel data model of the form: Y it = β 0 + β X it + u it. (3) To this model, you are asked to add both entity-fixed effects and time-fixed-effects for several cases and to comment on the total number of parameters involved. The resulting model is of the form in equation (0.20) in the text. Note that, just under the equation, it lists the resulting parameters, with a total of + + (n ) + (T ) = n + T parameters (i.e., the overall intercept, slope coefficient, (n ) entity-fixed effects, and (T ) time-fixed effects). To answer, the question all you need to know is n and T. a. In this case, we have n = 9 and T = 7, so there are 6 parameters. b. In this case, we have n = 5 and T = 38, so there are 43 parameters. c. In this case, we have n = 04 and T = 3, so there are 07 parameters. d. In this case, we have n = 30 and T = 2, so there are 32 parameters. 0.4 You are asked to consider the following logit regression: P r(y = X) = Λ( X). (4) You are then asked to calculate the change in probability for X increasing by 0 for X = 40 and X = 60 and then explain why is there such a large difference in the change in probabilities. Remember from class (and the book) that Λ(z) = ez + e z =. + e z (5) Thus: P r(y = 40) = Λ( ) = Λ(5.7) = = e 5.7 (6) P r(y = 50) = Λ( ) = Λ(3.3) = = e 3.3 (7) P r(y = 60) = Λ( ) = Λ(0.9) = = e0.9 (8) P r(y = 70) = Λ( ) = Λ(.5) = = e.5 (9) Using these results, we have: and P r(y = 50) P r(y = 40) = = (20) P r(y = 70) P r(y = 60) = = (2) The large differences happens as a result of the non-linearity of the function, and the points at which they are calculated. 0.5 This question focuses on the three discrete choice models and the relationship among their parameters. a. In the first case, you are asked to approximate the probit and LPM slope coefficients based on the coefficient estimates from the logit model. Using the formulas provided one gets: and ˆβ probit = ˆβ logit = 0.625( 0.236) = (22) ˆβ linear = 0.25 ˆβ logit = 0.25( 0.236) = (23) Both approximations are relatively close to the estimated coefficients in their respective models, though the linear approximation is off by a larger amount in percentage terms. 2
3 b. In this second question, you are given the formulas for transforming the intercepts. These yield: and ˆβ 0,probit = ˆβ 0,logit = 0.625(5.297) = 9.56 (24) ˆβ 0,linear = 0.25 ˆβ 0,logit = 0.25(5.297) = (25) You are then asked to provide the fitted probabilities for various temperatures: LPM Probit Logit Temp. (X) Actual Approx. Actual Approx. Actual In terms of calculated probabilities, the approximation is closer for the probit model than for the linear probability model. Also, the logit and probit models yield very similar fitted probabilities. E.3 The empirical exercise in this homework uses the dataset: Insurance. A program that carries all of the tasks for this problem is appended to this answer sheet. Note that the questions in the homework set are slightly different from those in the book. a. In this first question, you are asked to estimate a probit model of the individual s health insurance status (insured) as a function of the discrete variable (selfemp). Then you are asked if your results suggest that self-employed individuals are less likely to have health insurance and whether the difference large. From the attached Stata output, we can see that the coefficient on selfemp that the effect of employment type is statistically significant. The program also uses the scalar command to compute the fitted probability for individuals for are self-employed (P=.689) versus for individuals who are not self-employed (P2=.867). The difference is That is, self-employed individuals are 2.76% less likely to have insurances, which is a fairly substantial difference. b. In the second question, you are asked to add additional explanatory variables, including the individual s age, race, and whether or not they have a college degree. Notice that in my code, I assumed that they have a college degree if they have a masters, PhD or other post-graduate degree. i. You are first asked why you might include these additional variables. Well, as in the linear regression model, we are also worried about omitted variables bias in the probit model. The problem is that, if we exclude important other factors and those factors are correlated with the included variable (in this case selfemp), then the coefficient on selfemp will end up being biased, capturing not only its own impact but also reflecting the impact of the excluded variables. For example, self-employed individuals may tend to be older (since it might take time to come up with an idea for a new company and the money to back its start-up). But older individuals may also be more likely to have insurance (in part because they may now be able to afford it and may have dependents who need to be covered). All else equal, self-employed may have higher insurance rates than we might expect simply because they tend to be older. ii. Next, you are asked which coefficients are the most important factors. Notice that all of the factors are statistically significant at any reasonable significance level. This is first step in determining whether a factor is important. Second, as we discussed in class, we need to consider both the size of the coefficient and the standard deviation of the corresponding variable. The table below provides this information: Variable β k σ Xk β k σ Xk selfemp age race bl race ot colgrad The results suggest that an individual s age has the biggest impact, followed by the individual s degree, self-employment status and then their racial background. iii. Finally, you are asked whether self-employment status still matters in this more complex model. The answer is yes. In fact, the self-employment effect is computed in the code for each individual and then summarized. We see that the mean change in the impact of self-employment is now 8.% 3
4 Problem Set #6 # delimit ; clear; cap log close; cd "R:\users\jaherrig\My Documents\Classes\Economics 37\Problem Sets"; Specify the output file log using Problemset6.log,replace; set more off; Read in and summarize the data use Insurance.dta; describe; summarize; Estimate the model for question E.3a probit insured selfemp,r; scalar P = normal(_b[_cons] + _b[selfemp]); scalar P2 = normal(_b[_cons]); scalar Diff = P-P2; scalar list; Estimate the model for question E.3b generate colgrad = deg_ba + deg_ma + deg_phd + deg_oth; probit insured selfemp age race_bl race_ot colgrad,r; generate PC = normal(_b[_cons] + _b[selfemp] + _b[age]age + _b[race_bl]race_bl + _b[race_ot]race_ot + _b[colgrad]colgrad); generate P2C = normal(_b[_cons] + _b[age]age + _b[race_bl]race_bl + _b[race_ot]race_ot + _b[colgrad]colgrad); generate DiffC = PC-P2C; summarize PC P2C DiffC; summarize selfemp age race_bl race_ot colgrad; Estimate the model for question E.3c
5 generate ageself = ageselfemp; probit insured selfemp ageself age race_bl race_ot colgrad,r; log close; clear; exit;
6 log: R:\users\jaherrig\My Documents\Classes\Economics 37\Problem Sets \Problemset6.log log type: text opened on: 3 Nov 2009, 2:06:47. set more off;.. > Read in and summarize the data > >. use Insurance.dta;. describe; Contains data from Insurance.dta obs: 8,802 vars: Dec :50 size: 809,784 (90.3% of memory free) storage display value variable name type format label variable label healthy age anylim male insured deg_nd deg_ged deg_hs deg_ba deg_ma deg_phd deg_oth married selfemp familysz reg_ne reg_mw reg_so reg_we race_bl race_ot race_wht Sorted by:. summarize; Variable Obs Mean Std. Dev. Min Max healthy age
7 anylim male insured deg_nd deg_ged deg_hs deg_ba deg_ma deg_phd deg_oth married selfemp familysz reg_ne reg_mw reg_so reg_we race_bl race_ot race_wht > Estimate the model for question E.3a > >. probit insured selfemp,r; Iteration 0: log pseudolikelihood = Iteration : log pseudolikelihood = Iteration 2: log pseudolikelihood = Probit regression Number of obs = 8802 Wald chi2() = Prob > chi2 = Log pseudolikelihood = Pseudo R2 = Robust insured Coef. Std. Err. z P> z [95% Conf. Interval] selfemp _cons scalar P = normal(_b[_cons] + _b[selfemp]);. scalar P2 = normal(_b[_cons]);. scalar Diff = P-P2;. scalar list; Diff = P2 = P =
8 .. > Estimate the model for question E.3b > >. generate colgrad = deg_ba + deg_ma + deg_phd + deg_oth;. probit insured selfemp age race_bl race_ot colgrad,r; Iteration 0: log pseudolikelihood = Iteration : log pseudolikelihood = Iteration 2: log pseudolikelihood = Iteration 3: log pseudolikelihood = Iteration 4: log pseudolikelihood = Probit regression Number of obs = 8802 Wald chi2(5) = Prob > chi2 = Log pseudolikelihood = Pseudo R2 = Robust insured Coef. Std. Err. z P> z [95% Conf. Interval] selfemp age race_bl race_ot colgrad _cons generate PC = normal(_b[_cons] + _b[selfemp] + _b[age]age > + _b[race_bl]race_bl + _b[race_ot]race_ot > + _b[colgrad]colgrad);. generate P2C = normal(_b[_cons] + _b[age]age > + _b[race_bl]race_bl + _b[race_ot]race_ot > + _b[colgrad]colgrad);. generate DiffC = PC-P2C;. summarize PC P2C DiffC; Variable Obs Mean Std. Dev. Min Max PC P2C DiffC summarize selfemp age race_bl race_ot colgrad; Variable Obs Mean Std. Dev. Min Max selfemp age race_bl race_ot colgrad
9 .. > Estimate the model for question E.3c > >. generate ageself = ageselfemp;. probit insured selfemp ageself age race_bl race_ot colgrad,r; Iteration 0: log pseudolikelihood = Iteration : log pseudolikelihood = Iteration 2: log pseudolikelihood = Iteration 3: log pseudolikelihood = Probit regression Number of obs = 8802 Wald chi2(6) = Prob > chi2 = Log pseudolikelihood = Pseudo R2 = Robust insured Coef. Std. Err. z P> z [95% Conf. Interval] selfemp ageself age race_bl race_ot colgrad _cons log close; log: R:\users\jaherrig\My Documents\Classes\Economics 37\Problem Sets \Problemset6.log log type: text closed on: 3 Nov 2009, 2:06:47
Econ 371 Problem Set #6 Answer Sheet. deaths per 10,000. The 90% confidence interval for the change in death rate is 1.81 ±
Econ 371 Problem Set #6 Answer Sheet 10.1 This question focuses on the regression model results in Table 10.1. a. The first part of this question asks you to predict the number of lives that would be saved
More informationHomework Solutions Applied Logistic Regression
Homework Solutions Applied Logistic Regression WEEK 6 Exercise 1 From the ICU data, use as the outcome variable vital status (STA) and CPR prior to ICU admission (CPR) as a covariate. (a) Demonstrate that
More informationBinary Dependent Variables
Binary Dependent Variables In some cases the outcome of interest rather than one of the right hand side variables - is discrete rather than continuous Binary Dependent Variables In some cases the outcome
More informationLecture 4: Multivariate Regression, Part 2
Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above
More informationEmpirical Application of Simple Regression (Chapter 2)
Empirical Application of Simple Regression (Chapter 2) 1. The data file is House Data, which can be downloaded from my webpage. 2. Use stata menu File Import Excel Spreadsheet to read the data. Don t forget
More informationFinal Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)
Name Economics 170 Spring 2004 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment for the
More informationECON Introductory Econometrics. Lecture 11: Binary dependent variables
ECON4150 - Introductory Econometrics Lecture 11: Binary dependent variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 11 Lecture Outline 2 The linear probability model Nonlinear probability
More information5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is
Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do
More informationChapter 11. Regression with a Binary Dependent Variable
Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score
More information2. We care about proportion for categorical variable, but average for numerical one.
Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is
More informationExam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h.
Exam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h. This is an open book examination where all printed and written resources, in addition to a calculator, are allowed. If you are
More informationLecture#12. Instrumental variables regression Causal parameters III
Lecture#12 Instrumental variables regression Causal parameters III 1 Demand experiment, market data analysis & simultaneous causality 2 Simultaneous causality Your task is to estimate the demand function
More informationLecture 4: Multivariate Regression, Part 2
Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above
More informationProblem set - Selection and Diff-in-Diff
Problem set - Selection and Diff-in-Diff 1. You want to model the wage equation for women You consider estimating the model: ln wage = α + β 1 educ + β 2 exper + β 3 exper 2 + ɛ (1) Read the data into
More informationProblem Set 10: Panel Data
Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005
More informationIntroduction to Econometrics
Introduction to Econometrics STAT-S-301 Panel Data (2016/2017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 Regression with Panel Data A panel dataset contains observations on multiple entities
More information(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections
Answer Key Fixed Effect and First Difference Models 1. See discussion in class.. David Neumark and William Wascher published a study in 199 of the effect of minimum wages on teenage employment using a
More informationi (x i x) 2 1 N i x i(y i y) Var(x) = P (x 1 x) Var(x)
ECO 6375 Prof Millimet Problem Set #2: Answer Key Stata problem 2 Q 3 Q (a) The sample average of the individual-specific marginal effects is 0039 for educw and -0054 for white Thus, on average, an extra
More informationIntroduction to Econometrics. Regression with Panel Data
Introduction to Econometrics The statistical analysis of economic (and related) data STATS301 Regression with Panel Data Titulaire: Christopher Bruffaerts Assistant: Lorenzo Ricci 1 Regression with Panel
More informationWeek 3: Simple Linear Regression
Week 3: Simple Linear Regression Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ALL RIGHTS RESERVED 1 Outline
More informationCourse Econometrics I
Course Econometrics I 3. Multiple Regression Analysis: Binary Variables Martin Halla Johannes Kepler University of Linz Department of Economics Last update: April 29, 2014 Martin Halla CS Econometrics
More informationWarwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation
Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory
More informationSTATISTICS 110/201 PRACTICE FINAL EXAM
STATISTICS 110/201 PRACTICE FINAL EXAM Questions 1 to 5: There is a downloadable Stata package that produces sequential sums of squares for regression. In other words, the SS is built up as each variable
More informationQuestion 1 [17 points]: (ch 11)
Question 1 [17 points]: (ch 11) A study analyzed the probability that Major League Baseball (MLB) players "survive" for another season, or, in other words, play one more season. They studied a model of
More informationCRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.
CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Linear
More informationECON3150/4150 Spring 2016
ECON3150/4150 Spring 2016 Lecture 4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo Last updated: January 26, 2016 1 / 49 Overview These lecture slides covers: The linear regression
More informationNonlinear Econometric Analysis (ECO 722) : Homework 2 Answers. (1 θ) if y i = 0. which can be written in an analytically more convenient way as
Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers 1. Consider a binary random variable y i that describes a Bernoulli trial in which the probability of observing y i = 1 in any draw is given
More informationEcon 1123: Section 2. Review. Binary Regressors. Bivariate. Regression. Omitted Variable Bias
Contact Information Elena Llaudet Sections are voluntary. My office hours are Thursdays 5pm-7pm in Littauer Mezzanine 34-36 (Note room change) You can email me administrative questions to ellaudet@gmail.com.
More informationAnswers to Problem Set #4
Answers to Problem Set #4 Problems. Suppose that, from a sample of 63 observations, the least squares estimates and the corresponding estimated variance covariance matrix are given by: bβ bβ 2 bβ 3 = 2
More informationBinary Dependent Variable. Regression with a
Beykent University Faculty of Business and Economics Department of Economics Econometrics II Yrd.Doç.Dr. Özgür Ömer Ersin Regression with a Binary Dependent Variable (SW Chapter 11) SW Ch. 11 1/59 Regression
More informationLab 11 - Heteroskedasticity
Lab 11 - Heteroskedasticity Spring 2017 Contents 1 Introduction 2 2 Heteroskedasticity 2 3 Addressing heteroskedasticity in Stata 3 4 Testing for heteroskedasticity 4 5 A simple example 5 1 1 Introduction
More informationECON Introductory Econometrics. Lecture 6: OLS with Multiple Regressors
ECON4150 - Introductory Econometrics Lecture 6: OLS with Multiple Regressors Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 6 Lecture outline 2 Violation of first Least Squares assumption
More informationJeffrey M. Wooldridge Michigan State University
Fractional Response Models with Endogenous Explanatory Variables and Heterogeneity Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Fractional Probit with Heteroskedasticity 3. Fractional
More information1 A Review of Correlation and Regression
1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then
More informationECON3150/4150 Spring 2016
ECON3150/4150 Spring 2016 Lecture 6 Multiple regression model Siv-Elisabeth Skjelbred University of Oslo February 5th Last updated: February 3, 2016 1 / 49 Outline Multiple linear regression model and
More informationPractice exam questions
Practice exam questions Nathaniel Higgins nhiggins@jhu.edu, nhiggins@ers.usda.gov 1. The following question is based on the model y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3 + u. Discuss the following two hypotheses.
More informationApplied Statistics and Econometrics
Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple
More information1 A Non-technical Introduction to Regression
1 A Non-technical Introduction to Regression Chapters 1 and Chapter 2 of the textbook are reviews of material you should know from your previous study (e.g. in your second year course). They cover, in
More informationLecture 7: OLS with qualitative information
Lecture 7: OLS with qualitative information Dummy variables Dummy variable: an indicator that says whether a particular observation is in a category or not Like a light switch: on or off Most useful values:
More informationFrom the help desk: Comparing areas under receiver operating characteristic curves from two or more probit or logit models
The Stata Journal (2002) 2, Number 3, pp. 301 313 From the help desk: Comparing areas under receiver operating characteristic curves from two or more probit or logit models Mario A. Cleves, Ph.D. Department
More informationSociology 362 Data Exercise 6 Logistic Regression 2
Sociology 362 Data Exercise 6 Logistic Regression 2 The questions below refer to the data and output beginning on the next page. Although the raw data are given there, you do not have to do any Stata runs
More informationEconometrics II Censoring & Truncation. May 5, 2011
Econometrics II Censoring & Truncation Måns Söderbom May 5, 2011 1 Censored and Truncated Models Recall that a corner solution is an actual economic outcome, e.g. zero expenditure on health by a household
More informationHeteroskedasticity Example
ECON 761: Heteroskedasticity Example L Magee November, 2007 This example uses the fertility data set from assignment 2 The observations are based on the responses of 4361 women in Botswana s 1988 Demographic
More informationEmpirical Application of Panel Data Regression
Empirical Application of Panel Data Regression 1. We use Fatality data, and we are interested in whether rising beer tax rate can help lower traffic death. So the dependent variable is traffic death, while
More informationExercise 7.4 [16 points]
STATISTICS 226, Winter 1997, Homework 5 1 Exercise 7.4 [16 points] a. [3 points] (A: Age, G: Gestation, I: Infant Survival, S: Smoking.) Model G 2 d.f. (AGIS).008 0 0 (AGI, AIS, AGS, GIS).367 1 (AG, AI,
More informationFinal Exam. 1. Definitions: Briefly Define each of the following terms as they relate to the material covered in class.
Name Answer Key Economics 170 Spring 2003 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment
More informationNonlinear Regression Functions
Nonlinear Regression Functions (SW Chapter 8) Outline 1. Nonlinear regression functions general comments 2. Nonlinear functions of one variable 3. Nonlinear functions of two variables: interactions 4.
More informationLab 6 - Simple Regression
Lab 6 - Simple Regression Spring 2017 Contents 1 Thinking About Regression 2 2 Regression Output 3 3 Fitted Values 5 4 Residuals 6 5 Functional Forms 8 Updated from Stata tutorials provided by Prof. Cichello
More informationStatistical Inference with Regression Analysis
Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Steven Buck Lecture #13 Statistical Inference with Regression Analysis Next we turn to calculating confidence intervals and hypothesis testing
More informationEconometrics Review questions for exam
Econometrics Review questions for exam Nathaniel Higgins nhiggins@jhu.edu, 1. Suppose you have a model: y = β 0 x 1 + u You propose the model above and then estimate the model using OLS to obtain: ŷ =
More information4 Instrumental Variables Single endogenous variable One continuous instrument. 2
Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................
More informationLongitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois
Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 217, Chicago, Illinois Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control
More informationEconometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018
Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate
More informationLecture 3.1 Basic Logistic LDA
y Lecture.1 Basic Logistic LDA 0.2.4.6.8 1 Outline Quick Refresher on Ordinary Logistic Regression and Stata Women s employment example Cross-Over Trial LDA Example -100-50 0 50 100 -- Longitudinal Data
More informationAppendix Table 1. Predictive Power of the Pre-Game Point Spread versus the Halftime Point Spread.
Appendix Table 1. Predictive Power of the Pre-Game Point Spread versus the Halftime Point Spread. Probit Regression Dependent Variable = Win (1) (2) (3) (4) (5) (6) Spread -.081 -.057 (.004) (.004) [-.033]
More informationECON Introductory Econometrics. Lecture 16: Instrumental variables
ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental
More informationApplied Statistics and Econometrics
Applied Statistics and Econometrics Lecture 7 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 68 Outline of Lecture 7 1 Empirical example: Italian labor force
More information2. (3.5) (iii) Simply drop one of the independent variables, say leisure: GP A = β 0 + β 1 study + β 2 sleep + β 3 work + u.
BOSTON COLLEGE Department of Economics EC 228 Econometrics, Prof. Baum, Ms. Yu, Fall 2003 Problem Set 3 Solutions Problem sets should be your own work. You may work together with classmates, but if you
More information4 Instrumental Variables Single endogenous variable One continuous instrument. 2
Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................
More information1 Warm-Up: 2 Adjusted R 2. Introductory Applied Econometrics EEP/IAS 118 Spring Sylvan Herskowitz Section #
Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Sylvan Herskowitz Section #10 4-1-15 1 Warm-Up: Remember that exam you took before break? We had a question that said this A researcher wants to
More informationMultilevel Modeling Day 2 Intermediate and Advanced Issues: Multilevel Models as Mixed Models. Jian Wang September 18, 2012
Multilevel Modeling Day 2 Intermediate and Advanced Issues: Multilevel Models as Mixed Models Jian Wang September 18, 2012 What are mixed models The simplest multilevel models are in fact mixed models:
More informationEconometrics Midterm Examination Answers
Econometrics Midterm Examination Answers March 4, 204. Question (35 points) Answer the following short questions. (i) De ne what is an unbiased estimator. Show that X is an unbiased estimator for E(X i
More informationHow To Do Piecewise Exponential Survival Analysis in Stata 7 (Allison 1995:Output 4.20) revised
WM Mason, Soc 213B, S 02, UCLA Page 1 of 15 How To Do Piecewise Exponential Survival Analysis in Stata 7 (Allison 1995:Output 420) revised 4-25-02 This document can function as a "how to" for setting up
More informationLab 07 Introduction to Econometrics
Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand
More informationQuantitative Methods Final Exam (2017/1)
Quantitative Methods Final Exam (2017/1) 1. Please write down your name and student ID number. 2. Calculator is allowed during the exam, but DO NOT use a smartphone. 3. List your answers (together with
More informationMediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013
Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013 In Chap. 11 of Statistical Analysis of Management Data (Gatignon, 2014), tests of mediation are
More informationMultiple Regression: Inference
Multiple Regression: Inference The t-test: is ˆ j big and precise enough? We test the null hypothesis: H 0 : β j =0; i.e. test that x j has no effect on y once the other explanatory variables are controlled
More informationWISE International Masters
WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are
More informationLatent class analysis and finite mixture models with Stata
Latent class analysis and finite mixture models with Stata Isabel Canette Principal Mathematician and Statistician StataCorp LLC 2017 Stata Users Group Meeting Madrid, October 19th, 2017 Introduction Latent
More informationLecture 5. In the last lecture, we covered. This lecture introduces you to
Lecture 5 In the last lecture, we covered. homework 2. The linear regression model (4.) 3. Estimating the coefficients (4.2) This lecture introduces you to. Measures of Fit (4.3) 2. The Least Square Assumptions
More informationraise Coef. Std. Err. z P> z [95% Conf. Interval]
1 We will use real-world data, but a very simple and naive model to keep the example easy to understand. What is interesting about the example is that the outcome of interest, perhaps the probability or
More informationMeta-Analysis in Stata, 2nd edition p.158 Exercise Silgay et al. (2004)
Stata LightStone Stata 14 Funnel StataPress Meta-Analysis in Stata, 2nd edition p.153 Harbord et al. metabias metabias Steichen (1998) Begg Egger Stata Stata metabias Begg Egger Harbord Peters ado metafunnel
More informationLab 10 - Binary Variables
Lab 10 - Binary Variables Spring 2017 Contents 1 Introduction 1 2 SLR on a Dummy 2 3 MLR with binary independent variables 3 3.1 MLR with a Dummy: different intercepts, same slope................. 4 3.2
More informationECON3150/4150 Spring 2015
ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2
More informationRegression #8: Loose Ends
Regression #8: Loose Ends Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #8 1 / 30 In this lecture we investigate a variety of topics that you are probably familiar with, but need to touch
More informationProblem Set 1 ANSWERS
Economics 20 Prof. Patricia M. Anderson Problem Set 1 ANSWERS Part I. Multiple Choice Problems 1. If X and Z are two random variables, then E[X-Z] is d. E[X] E[Z] This is just a simple application of one
More informationECON Introductory Econometrics. Lecture 7: OLS with Multiple Regressors Hypotheses tests
ECON4150 - Introductory Econometrics Lecture 7: OLS with Multiple Regressors Hypotheses tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 7 Lecture outline 2 Hypothesis test for single
More informationPlease discuss each of the 3 problems on a separate sheet of paper, not just on a separate page!
Econometrics - Exam May 11, 2011 1 Exam Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page! Problem 1: (15 points) A researcher has data for the year 2000 from
More informationGMM Estimation in Stata
GMM Estimation in Stata Econometrics I Department of Economics Universidad Carlos III de Madrid Master in Industrial Economics and Markets 1 Outline Motivation 1 Motivation 2 3 4 2 Motivation 3 Stata and
More informationUsing the same data as before, here is part of the output we get in Stata when we do a logistic regression of Grade on Gpa, Tuce and Psi.
Logistic Regression, Part III: Hypothesis Testing, Comparisons to OLS Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 14, 2018 This handout steals heavily
More informationCase of single exogenous (iv) variable (with single or multiple mediators) iv à med à dv. = β 0. iv i. med i + α 1
Mediation Analysis: OLS vs. SUR vs. ISUR vs. 3SLS vs. SEM Note by Hubert Gatignon July 7, 2013, updated November 15, 2013, April 11, 2014, May 21, 2016 and August 10, 2016 In Chap. 11 of Statistical Analysis
More informationName: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm
Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam June 8 th, 2016: 9am to 1pm Instructions: 1. This is exam is to be completed independently. Do not discuss your work with
More informationWrite your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).
STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods in Economics 2 Course code: EC2402 Examiner: Peter Skogman Thoursie Number of credits: 7,5 credits (hp) Date of exam: Saturday,
More informationTHE MULTIVARIATE LINEAR REGRESSION MODEL
THE MULTIVARIATE LINEAR REGRESSION MODEL Why multiple regression analysis? Model with more than 1 independent variable: y 0 1x1 2x2 u It allows : -Controlling for other factors, and get a ceteris paribus
More informationLogit estimates Number of obs = 5054 Wald chi2(1) = 2.70 Prob > chi2 = Log pseudolikelihood = Pseudo R2 =
August 2005 Stata Application Tutorial 4: Discrete Models Data Note: Code makes use of career.dta, and icpsr_discrete1.dta. All three data sets are available on the Event History website. Code is based
More informationLecture 8: Instrumental Variables Estimation
Lecture Notes on Advanced Econometrics Lecture 8: Instrumental Variables Estimation Endogenous Variables Consider a population model: y α y + β + β x + β x +... + β x + u i i i i k ik i Takashi Yamano
More informationA Journey to Latent Class Analysis (LCA)
A Journey to Latent Class Analysis (LCA) Jeff Pitblado StataCorp LLC 2017 Nordic and Baltic Stata Users Group Meeting Stockholm, Sweden Outline Motivation by: prefix if clause suest command Factor variables
More informationEstimating and Interpreting Effects for Nonlinear and Nonparametric Models
Estimating and Interpreting Effects for Nonlinear and Nonparametric Models Enrique Pinzón September 18, 2018 September 18, 2018 1 / 112 Objective Build a unified framework to ask questions about model
More informationcrreg: A New command for Generalized Continuation Ratio Models
crreg: A New command for Generalized Continuation Ratio Models Shawn Bauldry Purdue University Jun Xu Ball State University Andrew Fullerton Oklahoma State University Stata Conference July 28, 2017 Bauldry
More informationFTE Employment before FTE Employment after
1. (25 points) In 1992, there was an increase in the (state) minimum wage in one U.S. state (New Jersey) but not in a neighboring location (eastern Pennsylvania). The study provides you with the following
More informationHandout 12. Endogeneity & Simultaneous Equation Models
Handout 12. Endogeneity & Simultaneous Equation Models In which you learn about another potential source of endogeneity caused by the simultaneous determination of economic variables, and learn how to
More informationControl Function and Related Methods: Nonlinear Models
Control Function and Related Methods: Nonlinear Models Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. General Approach 2. Nonlinear
More informationEconometrics Problem Set 4
Econometrics Problem Set 4 WISE, Xiamen University Spring 2016-17 Conceptual Questions 1. This question refers to the estimated regressions in shown in Table 1 computed using data for 1988 from the CPS.
More informationINTRODUCTION TO BASIC LINEAR REGRESSION MODEL
INTRODUCTION TO BASIC LINEAR REGRESSION MODEL 13 September 2011 Yogyakarta, Indonesia Cosimo Beverelli (World Trade Organization) 1 LINEAR REGRESSION MODEL In general, regression models estimate the effect
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More informationBIOSTATS 640 Spring 2018 Unit 2. Regression and Correlation (Part 1 of 2) STATA Users
Unit Regression and Correlation 1 of - Practice Problems Solutions Stata Users 1. In this exercise, you will gain some practice doing a simple linear regression using a Stata data set called week0.dta.
More informationECON 594: Lecture #6
ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was
More informationAssessing the Calibration of Dichotomous Outcome Models with the Calibration Belt
Assessing the Calibration of Dichotomous Outcome Models with the Calibration Belt Giovanni Nattino The Ohio Colleges of Medicine Government Resource Center The Ohio State University Stata Conference -
More informationEconometrics. 8) Instrumental variables
30C00200 Econometrics 8) Instrumental variables Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Thery of IV regression Overidentification Two-stage least squates
More informationECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics August 2013
ECONOMICS AND ECONOMIC METHODS PRELIM EXAM Statistics and Econometrics August 2013 Instructions: Answer all six (6) questions. Point totals for each question are given in parentheses. The parts within
More information