Lecture 1: intro. to regresions

Size: px
Start display at page:

Download "Lecture 1: intro. to regresions"

Transcription

1 Lecture : intro. to regresions Class basics: Coach, Bro., Prof.,Not Dr.; grading (exams, HW, papers), Quizes Simple regression model (one X): Salary = b0 + b*experience + error Multiple regression model (two or more X) adding a dummy (0,) variable Salary = b0 + b*experience + b2*male+error Applic:Difference-in-Difference Estimation Wg=b0+b*LR+b2*after+b3(LR*after)+error

2 Lecture : some things to remember Y N X = ;. = ( ) = 0 i N i= X DevX Xi X N i= N N 2 ( Xi X) ( Xi X)( Yi Y) i= i= Var ( X) = ; Cov ( Y, X) = N = β + β X + μ i 0 i i N N ( X X)( Y Y) ˆ β ˆ β ˆ βx i i i= = ; 0 = Y N 2 ( Xi X) i=

3 Lecture : some things to remember--continued N ( X X)( Y Y) i i ˆ i= th β = (. var ) N 2 ( Xi X) i= generalizes for the coeff of the k iable to N ( ( X Y ( ( ki, i ˆ i= β k = N ( 2 X ki, where X k ( Yi) is the part of the var iable X k ( Y ) i= that is uncorrelated with the other var iable s in the mod el

4 Lecture 2: vectors and orthogonality Vector definition Vector sums and differences Dot product of vector defines orthogonality Regressions seen in terms of variable vectors Steps for Proofs in 388: definitions, substitutions, simplications, and expected value when appropriate

5 Lecture 2: some things to remember OLS coefficients come from minimizing the error, so that the error vector is orthogonal to the regression plane; for simple regrss: ˆ μ X = 0; ˆ μ = 0; For multivariate regression: ˆ = 0; ˆ = 0; ˆ = 0;...; ˆ = 0; μ μ μ μ X X X 2 3 k or in matrix terms, X ' ˆ μ = 0

6 Lecture 3: Multivariate Regrss, Frisch Thm Minimizing the (length of the) error vector yields normal equations for OLS Solving Normal equations yields OLS estimators Outcomes (Y) are decomposed into a part explained by X (the regression plane or model) and a part unexplained by X, the error In OLS, the coefficient for the kth regressor measures the partial effect of Xk on Y

7 Lecture 3: some things to remember Y = Yˆ+ ˆ μ, whereyˆ= X ˆ β and ˆ μ= Y X ˆ β X ˆ μ = yields ˆ β = X X X Y ' 0 ( ' ) ' ˆ = ( ' ) ' = X, substituting we get Y X X X X Y P Y ˆ = ( ' ) ' = X, X = X and μ Y X X X X Y M Y where M I P Let M = ( I P ); so that M X = X P X j x,..., x, x,... j j j x,..., x, x,... j j j+ j j+ then ˆ β k = ( M X )'( M Y) j j j ( M X )'( M X ) j j j j

8 Lecture 4: Variances and testing Use variances to apply the EMPIRICAL rule (which works for normals, and means in large samples): 68% within standard deviation of the mean 95% within 2 standard deviations of the mean 99% within 3 standard deviations of the mean (centered) R-square the variation in Y explained by the variation in the Xs

9 Lecture 4: some things to remember Standard deviation=sq. root of est. variance Ex.: s. d. of X = N i= ( X X) i ( N ) 2 Ex.2: s. d. of ˆ s N i= μ = ( ˆ μ 0) i ( N k) 2 ( statistic hypo. value) N(0,) or " z scores" = 2 about 95% of the time s tan dard deviation 2 PX Y Uncentered R-square = i= ( ˆ μ ) ˆ ( N 2) Ex.3: s. error of β ( simple OLS ) = = Centered R-square (usual) = N i 2 N N 2 2 ( Xi X) ( Xi X) i= i= Y 2 s P M X Y M Y 2 2

10 Lecture 5: Date Transf; Model Assump Non-singular transformations of X (i.e., postmultiplying by kxk matrix A) doesn t affect the projection of Y Pre-multiplied transformations by nxn matrix C is used to fix regression model when its assumptions fail Testing linear restrictions (the third transf.) Model assumptions:.y=xb+u 2. X X full rank 3. E(u X)=0

11 Lecture 5: some things to remember Only need the first three assumptions to prove betas are unbiased (use definition, substitution, simplification, expected value) ˆ ( X ' X) X ' β = β + μ Omitted variable bias: E( ˆ X) = + ( X ' X) X ' z β β α

12 Lecture 6: More Assump.; Theorems Assum.4: error has constant variances (no heteroskedasticity), and covariances between errors is zero (no serial correlation): Variance estimator of u, s-squared, is unibased OLS is BLUE (Gauss-Markov theorem) Assum. 5: normality of the error term, which implies normality of Y, which implies normality of the beta vector

13 Lecture 6: some things to remember Definitions of expected value and variance for matrices Useful theorem A: for Y ~ N ( θ, Σ ) then AY ~ N ( Aθ, AΣA ') Trace of matrix=sum of diagonal elements Positive definitive matrix C if z Cx>0 for any vector z with at least one non-zero value

14 Lecture 7: MR Hypothesis Testing Normality assumption used for hypothesis testing Classical frequentist approach: minimize Type I error (rejection of the null hypothesis when it is true), while guarding against Type 2 error (accepting the null hypothesis when the alternative hypothesis is true) One sided vs. two sided alternative hypo.

15 Lecture 7: some things to remember Steps in figuring out the size of type 2 error for the jth beta coefficient:. specify null and alternative hypotheses (one-sided) for that coefficient 2. use z-scores to figure out critical value for the jth coefficient (say at the 5% level) 3. using that critical value, and assuming the alternative hypo. Is true, calculate the associated z-score and look up its probability in a N(0,) table (or have STATA calculate it)

16 Lecture 8: MR testing linear combinations Likelihood=given the sample outcome, how probable was the outcome under the null hypothesis (i.e., as a fn. of the parm.) Three classical likelihood tests: LM: uses the likelihd at the restricted values Wald: uses the likelihd at the unrestr. Values Likelihd ratio test: compares restr. and unrestr values All three of these tests have the same asymptotic distribution as the F test

17 Lecture 8: some things to remember F = ( SSR SSR )/ r R U SSR /( n k) U STATA: test (x=0) (x3=0)(x4=0); SAS: test x,x3,x4;

18 Lecture 9: Asymptotics: Consistency LLN: for large samples, a mean converges in distribution to the mean of the distribution from which it was drawn CLT: for large samples, means tend to be normal, even if the underlying distribution is not normal Since betas are means, both rules apply

19 Lecture 9: some things to remember Betas are means: N N ˆ = ( Xi' Xi) ( Xi' Yi) N i= N i= β Consistency proofs usually are of 2 types: If estimator is always unbiased, show its variance goes to zero as N gets large, OR Use the LLN directly where applicable

20 Lecture 0: Large Sample MR inference Central Limit Theorem (CLT): the distribution of averages tends to be normal So betas in regression models with large samples tend to be normally distributed, even if the underlying errors are not. Instrumental Variable (IV) estimators are consistent if you have good instruments, and are asymptotically normal

21 Lecture 0: some things to remember Instrument variables (Z) need to be: Uncorrelated with the errors, but Highly correlated with the r.h.s variables ˆ ( ' ) ' β IV = Z X Z Y When you have surplus instruments, two stage least squares is the efficient way to use the additional information (see lectures 20/2)

22 Lecture : MR specification Issues Always have constants in your regressions, but solemn worry about interpreting them Slope coefficients can be normalized so that they are unit-free :.elasticities coefficients from log-log specifications 2.beta coefficients std dev change in Y for a s.d. change in X When your have interactions and quadratic terms (squares), take partial derivatives to help you interpret the results Predictions are stochastic, use the correct prediction interval in interpreting the outcomes

23 Lecture : some things to remember Calculus approximation of the extremal point for 2 the model is Y= β + βx+ β X + μ 0 2 ˆ β X max,min = 2* ˆ β2 Prediction variance for the model Y = x ˆ β + μ ˆP P P Is calculated as estimator of Var Yˆ s x X X x 2 ' ( P) = ( P( ' ) P + )

24 Lecture 2: Dummy Variables (0,) Dummy variables are the way we quantify qualitative things: status (gender, race, health etc.) or choice (married, emplyd, RM, tai chi, etc) Rules about dummy variables: If m groups, using m- dummy variables Dummy var. coeff. Represent shift (in intercept) of that group relative to the omitted group Useful for fixed effects model (panels with repeated observations on the same unit)

25 Lecture 2: some things to remember dummy variables: If m groups, using m- dummy variables Dummy coeff. represent shift (in intercept) of that group relative to the omitted group Dummy can be used for continuous var. too when you want to allow for more complexity The OLS estimator in panel data sets is a (matrix) weighted average of the between group estimator (using just group means) and within group estimator (using just deviations from means within groups

26 Lecture 3: Dummy Variable Interactions Tests for structural (coefficient) shifts in the model, across groups or over time, can be done with dummy variable interactions: Dummies by themselves ( interacted with the constant, a vector of ones) allows intercept shifts Dummies interacted with X allows slope shifts in the relevant X variable coefficient F-tests for such shifts are readily made in STATA or SAS

27 Lecture 3: some things to remember reg sales age covey dilbert age_cov age_dilbert; Test (covey=0) (dilbert=0); *tests intercept shifts; Test (age_cov=0) (age_dilbert=0); *tests slope shifts; Test (covey=0) (dilbert=0) (age_cov=0) (age_dilbert=0); *slope and slope shifts-tests for full structural shift in the equation;

28 Lecture 4: Dummy Dependent Variables Dummy dependent variable models are choice models, where the likelihood of having made a choice is modeled as a function of Xs: marriage, college, exercising, memorizing the proclamation on the family, etc. Linear version: LPM, heterosk and boundd Nonlinear versions: logits, probits, max likelihood estimation

29 Lecture 4: some things to remember General notation: Pr ob( Y = X ) = G( X β) i i i LPM version: exp( Xiβ) Logit version: Pr ob( Yi = Xi) = + exp( X β) Probit version: Log-liklhd ratio tests: Marginal effect: Pr ob( Y = X ) = X β i i i X i β Pr ob( Yi = Xi) = ϕ() v dv where ϕ()~ v N(0,) R U 2*(loglik log lik ) PY ( i = X ) Gz () Gz () Gz () = βj where = for LPM ; = G( z)*( G( z)) for Logit X z z z j i

30 Lecture 5: Heteroskedasticity Heteroskedasticity=variance changes between observations Coeff are still unbiased, but Standard errors (t-tests) may be wrong Modern approach: use robust standard errors: regress y x x2 x3, robust; Old approach: model the heterosked. Modern detection of het: LM tests Old detection of het: Goldfeld Quandt tests

31 Lecture 5: some things to remember Heteroskedasticity affects coeff covariance matrix, but does not bias the coefficients Tests LM tests: regress residuals on Xs then 2 N * R ~ chi square ( d. f. = number slope coeff.) Goldfeld-Q: divide data into 3 sections then SSR III SSR I / df / df III I III I ~ F( df, df )

32 Lecture 6: Weighted Least Squares Weighting to correct heteroskedasticity: give more weight to the observations with the better information (lower variance) by dividing each observation by its variance When we do weighted least squares we no longer do orthogonal projections (because our error distributions are no longer spherical), but oblique projections

33 Lecture 6: some things to remember. transform all the data approach 2 Y = Xβ+ μ, so μ~ N(0, Σ) whereσ σ I TY = TXβ+ Tμ so Tμ N σ I 2, ~ (0, ) so TΣ T ' = σ I, or T ' T = σ Σ. Hence, ˆ β = ( X ' Σ X) X ' Σ Y 2 2 T 2. transform the covariance matrix approach estimator of V( ˆ β) = ( X ' X) X ' ˆˆ μμ' X( X ' X)

34 Lecture 7: Time Series Data Asymptotics Time series: random var ordered in time: Can t replicate Non-independent Additional assumptions on process: Covariance stationarity Weak dependence (asymptotically indep.) Types of stochastic (time series) process: White noise Random walk Moving Average (MA(.)) Autoregressive (AR(.))

35 Lecture 7: some things to remember White noise e t Stationary Weakly dependent Random Walk y = y + e t t t Not Stationary Not Weakly dependent Moving Average Autoregressive y = α e + α e t 0 t t y y e t = ρ t + t Stationary Stationary Weakly dependent Weakly dependent (if ρ < )

36 Lecture 8: Persistent Time Series I(0)-integrated of order zero series,weakly dependent, examples: AR() with rho< Get consistency with large samples I()-integrated of order one series,not weakly dependent, examples: AR() with rho= (i.e., random walks) Consistency problems of estimators, even if N large Dynamically complete models: add lagged r.h.s. variables until no serial correlation in error

37 Lecture 8: some things to remember y = y + e ρ For AR() t t t IF ρ < then you can get consistent estimates in large samples by OLS IF ρ = then you need to first difference the data to consistent estimates in large samples by OLS Rule of Thumb: if ρ.9 then first difference (because of the unit root problem)

38 Lecture 9: Serially Correlated Errors Violates assumption IV: namely, serial correlation occurs when the residuals are correlated (positive error at t- increases prob. of positive error at t) Coeff. unbiased, but cov(coeff.) incorrect so that t-stats and tests of coeff are biased Common time series problem: μt = ρμt + et Detecting serial correlation: DW, regression Correcting serial correlation: two-stage differencing to increase efficiency, correct t-stats

39 Lecture 9: some things to remember Detecting AR(): Regression: DW statistic: ˆ μ = ρμˆ + e DW t t t T 2 ( ˆ μ ˆ t μt ) t= 2 = T 2 ˆ μt t= 2( ˆ ρ) Correcting AR() by differencing: y t= xtβ + μt; μt = ρμt + et y ˆ ρ y = ( x ˆ ρx ) β + e t t t t t

40 Lecture 20 & 2: Endo.; I.V., 2SLS Endogeneity: omitted var, measurement error, or simultaneous causation (S&D) Endogeneity violates assumption III, and everything is biased Fix: find instrumental variables (IV), Z, s.t. Correlated with Xs (r.h.s. regressors) Uncorrelated with the error

41 Lecture 20&2: things to remember IV in general: ˆ = ( ' ) ' β IV Z X Z Y V ˆ β = s ZX ZZZX 2 ( IV ) ( ' ) ' ( ' ) 2SLS as a type of IV: ˆ β ˆ ˆ ˆ 2SLS = ( X ' X) X' Y where X = V( V' V) V' X ˆ β ˆ SLS = X VVV V X XVVV VY 2 ( ' ( ' ) ' ) ' ( ' ) ' V( ˆ β ) = s ( X' V( V' V) V' X) 2SLS 2

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han Econometrics Honor s Exam Review Session Spring 2012 Eunice Han Topics 1. OLS The Assumptions Omitted Variable Bias Conditional Mean Independence Hypothesis Testing and Confidence Intervals Homoskedasticity

More information

ECONOMETRICS HONOR S EXAM REVIEW SESSION

ECONOMETRICS HONOR S EXAM REVIEW SESSION ECONOMETRICS HONOR S EXAM REVIEW SESSION Eunice Han ehan@fas.harvard.edu March 26 th, 2013 Harvard University Information 2 Exam: April 3 rd 3-6pm @ Emerson 105 Bring a calculator and extra pens. Notes

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Economics 308: Econometrics Professor Moody

Economics 308: Econometrics Professor Moody Economics 308: Econometrics Professor Moody References on reserve: Text Moody, Basic Econometrics with Stata (BES) Pindyck and Rubinfeld, Econometric Models and Economic Forecasts (PR) Wooldridge, Jeffrey

More information

Section I. Define or explain the following terms (3 points each) 1. centered vs. uncentered 2 R - 2. Frisch theorem -

Section I. Define or explain the following terms (3 points each) 1. centered vs. uncentered 2 R - 2. Frisch theorem - First Exam: Economics 388, Econometrics Spring 006 in R. Butler s class YOUR NAME: Section I (30 points) Questions 1-10 (3 points each) Section II (40 points) Questions 11-15 (10 points each) Section III

More information

Homoskedasticity. Var (u X) = σ 2. (23)

Homoskedasticity. Var (u X) = σ 2. (23) Homoskedasticity How big is the difference between the OLS estimator and the true parameter? To answer this question, we make an additional assumption called homoskedasticity: Var (u X) = σ 2. (23) This

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = 0 + 1 x 1 + x +... k x k + u 6. Heteroskedasticity What is Heteroskedasticity?! Recall the assumption of homoskedasticity implied that conditional on the explanatory variables,

More information

Intermediate Econometrics

Intermediate Econometrics Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the

More information

Econometrics Summary Algebraic and Statistical Preliminaries

Econometrics Summary Algebraic and Statistical Preliminaries Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L

More information

1 The Multiple Regression Model: Freeing Up the Classical Assumptions

1 The Multiple Regression Model: Freeing Up the Classical Assumptions 1 The Multiple Regression Model: Freeing Up the Classical Assumptions Some or all of classical assumptions were crucial for many of the derivations of the previous chapters. Derivation of the OLS estimator

More information

Multiple Regression Analysis: Heteroskedasticity

Multiple Regression Analysis: Heteroskedasticity Multiple Regression Analysis: Heteroskedasticity y = β 0 + β 1 x 1 + β x +... β k x k + u Read chapter 8. EE45 -Chaiyuth Punyasavatsut 1 topics 8.1 Heteroskedasticity and OLS 8. Robust estimation 8.3 Testing

More information

Christopher Dougherty London School of Economics and Political Science

Christopher Dougherty London School of Economics and Political Science Introduction to Econometrics FIFTH EDITION Christopher Dougherty London School of Economics and Political Science OXFORD UNIVERSITY PRESS Contents INTRODU CTION 1 Why study econometrics? 1 Aim of this

More information

Week 11 Heteroskedasticity and Autocorrelation

Week 11 Heteroskedasticity and Autocorrelation Week 11 Heteroskedasticity and Autocorrelation İnsan TUNALI Econ 511 Econometrics I Koç University 27 November 2018 Lecture outline 1. OLS and assumptions on V(ε) 2. Violations of V(ε) σ 2 I: 1. Heteroskedasticity

More information

Model Mis-specification

Model Mis-specification Model Mis-specification Carlo Favero Favero () Model Mis-specification 1 / 28 Model Mis-specification Each specification can be interpreted of the result of a reduction process, what happens if the reduction

More information

Regression Models - Introduction

Regression Models - Introduction Regression Models - Introduction In regression models there are two types of variables that are studied: A dependent variable, Y, also called response variable. It is modeled as random. An independent

More information

Review of Econometrics

Review of Econometrics Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,

More information

Econometrics. 9) Heteroscedasticity and autocorrelation

Econometrics. 9) Heteroscedasticity and autocorrelation 30C00200 Econometrics 9) Heteroscedasticity and autocorrelation Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Heteroscedasticity Possible causes Testing for

More information

LECTURE 10: MORE ON RANDOM PROCESSES

LECTURE 10: MORE ON RANDOM PROCESSES LECTURE 10: MORE ON RANDOM PROCESSES AND SERIAL CORRELATION 2 Classification of random processes (cont d) stationary vs. non-stationary processes stationary = distribution does not change over time more

More information

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 4 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 23 Recommended Reading For the today Serial correlation and heteroskedasticity in

More information

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35 Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate

More information

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication G. S. Maddala Kajal Lahiri WILEY A John Wiley and Sons, Ltd., Publication TEMT Foreword Preface to the Fourth Edition xvii xix Part I Introduction and the Linear Regression Model 1 CHAPTER 1 What is Econometrics?

More information

Heteroscedasticity and Autocorrelation

Heteroscedasticity and Autocorrelation Heteroscedasticity and Autocorrelation Carlo Favero Favero () Heteroscedasticity and Autocorrelation 1 / 17 Heteroscedasticity, Autocorrelation, and the GLS estimator Let us reconsider the single equation

More information

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model

More information

Introduction to Econometrics. Heteroskedasticity

Introduction to Econometrics. Heteroskedasticity Introduction to Econometrics Introduction Heteroskedasticity When the variance of the errors changes across segments of the population, where the segments are determined by different values for the explanatory

More information

Econometrics Review questions for exam

Econometrics Review questions for exam Econometrics Review questions for exam Nathaniel Higgins nhiggins@jhu.edu, 1. Suppose you have a model: y = β 0 x 1 + u You propose the model above and then estimate the model using OLS to obtain: ŷ =

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Heteroskedasticity and Autocorrelation

Heteroskedasticity and Autocorrelation Lesson 7 Heteroskedasticity and Autocorrelation Pilar González and Susan Orbe Dpt. Applied Economics III (Econometrics and Statistics) Pilar González and Susan Orbe OCW 2014 Lesson 7. Heteroskedasticity

More information

THE MULTIVARIATE LINEAR REGRESSION MODEL

THE MULTIVARIATE LINEAR REGRESSION MODEL THE MULTIVARIATE LINEAR REGRESSION MODEL Why multiple regression analysis? Model with more than 1 independent variable: y 0 1x1 2x2 u It allows : -Controlling for other factors, and get a ceteris paribus

More information

Linear Regression. Junhui Qian. October 27, 2014

Linear Regression. Junhui Qian. October 27, 2014 Linear Regression Junhui Qian October 27, 2014 Outline The Model Estimation Ordinary Least Square Method of Moments Maximum Likelihood Estimation Properties of OLS Estimator Unbiasedness Consistency Efficiency

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

Rockefeller College University at Albany

Rockefeller College University at Albany Rockefeller College University at Albany PAD 705 Handout: Suggested Review Problems from Pindyck & Rubinfeld Original prepared by Professor Suzanne Cooper John F. Kennedy School of Government, Harvard

More information

388 Index Differencing test ,232 Distributed lags , 147 arithmetic lag.

388 Index Differencing test ,232 Distributed lags , 147 arithmetic lag. INDEX Aggregation... 104 Almon lag... 135-140,149 AR(1) process... 114-130,240,246,324-325,366,370,374 ARCH... 376-379 ARlMA... 365 Asymptotically unbiased... 13,50 Autocorrelation... 113-130, 142-150,324-325,365-369

More information

Reading Assignment. Serial Correlation and Heteroskedasticity. Chapters 12 and 11. Kennedy: Chapter 8. AREC-ECON 535 Lec F1 1

Reading Assignment. Serial Correlation and Heteroskedasticity. Chapters 12 and 11. Kennedy: Chapter 8. AREC-ECON 535 Lec F1 1 Reading Assignment Serial Correlation and Heteroskedasticity Chapters 1 and 11. Kennedy: Chapter 8. AREC-ECON 535 Lec F1 1 Serial Correlation or Autocorrelation y t = β 0 + β 1 x 1t + β x t +... + β k

More information

Lecture 8: Instrumental Variables Estimation

Lecture 8: Instrumental Variables Estimation Lecture Notes on Advanced Econometrics Lecture 8: Instrumental Variables Estimation Endogenous Variables Consider a population model: y α y + β + β x + β x +... + β x + u i i i i k ik i Takashi Yamano

More information

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals (SW Chapter 5) Outline. The standard error of ˆ. Hypothesis tests concerning β 3. Confidence intervals for β 4. Regression

More information

Lecture 8. Using the CLR Model. Relation between patent applications and R&D spending. Variables

Lecture 8. Using the CLR Model. Relation between patent applications and R&D spending. Variables Lecture 8. Using the CLR Model Relation between patent applications and R&D spending Variables PATENTS = No. of patents (in 000) filed RDEP = Expenditure on research&development (in billions of 99 $) The

More information

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N 1) [4 points] Let Econ 836 Final Exam Y Xβ+ ε, X w+ u, w N w~ N(, σi ), u N u~ N(, σi ), ε N ε~ Nu ( γσ, I ), where X is a just one column. Let denote the OLS estimator, and define residuals e as e Y X.

More information

Economics 113. Simple Regression Assumptions. Simple Regression Derivation. Changing Units of Measurement. Nonlinear effects

Economics 113. Simple Regression Assumptions. Simple Regression Derivation. Changing Units of Measurement. Nonlinear effects Economics 113 Simple Regression Models Simple Regression Assumptions Simple Regression Derivation Changing Units of Measurement Nonlinear effects OLS and unbiased estimates Variance of the OLS estimates

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

Reliability of inference (1 of 2 lectures)

Reliability of inference (1 of 2 lectures) Reliability of inference (1 of 2 lectures) Ragnar Nymoen University of Oslo 5 March 2013 1 / 19 This lecture (#13 and 14): I The optimality of the OLS estimators and tests depend on the assumptions of

More information

Topic 7: Heteroskedasticity

Topic 7: Heteroskedasticity Topic 7: Heteroskedasticity Advanced Econometrics (I Dong Chen School of Economics, Peking University Introduction If the disturbance variance is not constant across observations, the regression is heteroskedastic

More information

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler Basic econometrics Tutorial 3 Dipl.Kfm. Introduction Some of you were asking about material to revise/prepare econometrics fundamentals. First of all, be aware that I will not be too technical, only as

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Answers to Problem Set #4

Answers to Problem Set #4 Answers to Problem Set #4 Problems. Suppose that, from a sample of 63 observations, the least squares estimates and the corresponding estimated variance covariance matrix are given by: bβ bβ 2 bβ 3 = 2

More information

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10) Name Economics 170 Spring 2004 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment for the

More information

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional Heteroskedasticity We now consider the implications of relaxing the assumption that the conditional variance V (u i x i ) = σ 2 is common to all observations i = 1,..., In many applications, we may suspect

More information

Chapter 11. Regression with a Binary Dependent Variable

Chapter 11. Regression with a Binary Dependent Variable Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 5 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 44 Outline of Lecture 5 Now that we know the sampling distribution

More information

Introductory Econometrics

Introductory Econometrics Based on the textbook by Wooldridge: : A Modern Approach Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna November 23, 2013 Outline Introduction

More information

Covers Chapter 10-12, some of 16, some of 18 in Wooldridge. Regression Analysis with Time Series Data

Covers Chapter 10-12, some of 16, some of 18 in Wooldridge. Regression Analysis with Time Series Data Covers Chapter 10-12, some of 16, some of 18 in Wooldridge Regression Analysis with Time Series Data Obviously time series data different from cross section in terms of source of variation in x and y temporal

More information

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning Økonomisk Kandidateksamen 2004 (I) Econometrics 2 Rettevejledning This is a closed-book exam (uden hjælpemidler). Answer all questions! The group of questions 1 to 4 have equal weight. Within each group,

More information

Multiple Linear Regression CIVL 7012/8012

Multiple Linear Regression CIVL 7012/8012 Multiple Linear Regression CIVL 7012/8012 2 Multiple Regression Analysis (MLR) Allows us to explicitly control for many factors those simultaneously affect the dependent variable This is important for

More information

ECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests

ECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests ECON4150 - Introductory Econometrics Lecture 5: OLS with One Regressor: Hypothesis Tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 5 Lecture outline 2 Testing Hypotheses about one

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 2 Jakub Mućk Econometrics of Panel Data Meeting # 2 1 / 26 Outline 1 Fixed effects model The Least Squares Dummy Variable Estimator The Fixed Effect (Within

More information

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017 Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent

More information

13. Time Series Analysis: Asymptotics Weakly Dependent and Random Walk Process. Strict Exogeneity

13. Time Series Analysis: Asymptotics Weakly Dependent and Random Walk Process. Strict Exogeneity Outline: Further Issues in Using OLS with Time Series Data 13. Time Series Analysis: Asymptotics Weakly Dependent and Random Walk Process I. Stationary and Weakly Dependent Time Series III. Highly Persistent

More information

Simple Linear Regression: The Model

Simple Linear Regression: The Model Simple Linear Regression: The Model task: quantifying the effect of change X in X on Y, with some constant β 1 : Y = β 1 X, linear relationship between X and Y, however, relationship subject to a random

More information

The regression model with one fixed regressor cont d

The regression model with one fixed regressor cont d The regression model with one fixed regressor cont d 3150/4150 Lecture 4 Ragnar Nymoen 27 January 2012 The model with transformed variables Regression with transformed variables I References HGL Ch 2.8

More information

Least Squares Estimation-Finite-Sample Properties

Least Squares Estimation-Finite-Sample Properties Least Squares Estimation-Finite-Sample Properties Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) Finite-Sample 1 / 29 Terminology and Assumptions 1 Terminology and Assumptions

More information

ECON 4551 Econometrics II Memorial University of Newfoundland. Panel Data Models. Adapted from Vera Tabakova s notes

ECON 4551 Econometrics II Memorial University of Newfoundland. Panel Data Models. Adapted from Vera Tabakova s notes ECON 4551 Econometrics II Memorial University of Newfoundland Panel Data Models Adapted from Vera Tabakova s notes 15.1 Grunfeld s Investment Data 15.2 Sets of Regression Equations 15.3 Seemingly Unrelated

More information

x i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations.

x i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations. Exercises for the course of Econometrics Introduction 1. () A researcher is using data for a sample of 30 observations to investigate the relationship between some dependent variable y i and independent

More information

Heteroskedasticity. Part VII. Heteroskedasticity

Heteroskedasticity. Part VII. Heteroskedasticity Part VII Heteroskedasticity As of Oct 15, 2015 1 Heteroskedasticity Consequences Heteroskedasticity-robust inference Testing for Heteroskedasticity Weighted Least Squares (WLS) Feasible generalized Least

More information

CHAPTER 6: SPECIFICATION VARIABLES

CHAPTER 6: SPECIFICATION VARIABLES Recall, we had the following six assumptions required for the Gauss-Markov Theorem: 1. The regression model is linear, correctly specified, and has an additive error term. 2. The error term has a zero

More information

7 Introduction to Time Series

7 Introduction to Time Series Econ 495 - Econometric Review 1 7 Introduction to Time Series 7.1 Time Series vs. Cross-Sectional Data Time series data has a temporal ordering, unlike cross-section data, we will need to changes some

More information

Environmental Econometrics

Environmental Econometrics Environmental Econometrics Jérôme Adda j.adda@ucl.ac.uk Office # 203 EEC. I Syllabus Course Description: This course is an introductory econometrics course. There will be 2 hours of lectures per week and

More information

Final Exam. 1. Definitions: Briefly Define each of the following terms as they relate to the material covered in class.

Final Exam. 1. Definitions: Briefly Define each of the following terms as they relate to the material covered in class. Name Answer Key Economics 170 Spring 2003 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment

More information

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like.

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like. Measurement Error Often a data set will contain imperfect measures of the data we would ideally like. Aggregate Data: (GDP, Consumption, Investment are only best guesses of theoretical counterparts and

More information

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 48

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 48 ECON2228 Notes 10 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 10 2014 2015 1 / 48 Serial correlation and heteroskedasticity in time series regressions Chapter 12:

More information

Lab 07 Introduction to Econometrics

Lab 07 Introduction to Econometrics Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand

More information

The Multiple Regression Model Estimation

The Multiple Regression Model Estimation Lesson 5 The Multiple Regression Model Estimation Pilar González and Susan Orbe Dpt Applied Econometrics III (Econometrics and Statistics) Pilar González and Susan Orbe OCW 2014 Lesson 5 Regression model:

More information

14.32 Final : Spring 2001

14.32 Final : Spring 2001 14.32 Final : Spring 2001 Please read the entire exam before you begin. You have 3 hours. No books or notes should be used. Calculators are allowed. There are 105 points. Good luck! A. True/False/Sometimes

More information

Econ 510 B. Brown Spring 2014 Final Exam Answers

Econ 510 B. Brown Spring 2014 Final Exam Answers Econ 510 B. Brown Spring 2014 Final Exam Answers Answer five of the following questions. You must answer question 7. The question are weighted equally. You have 2.5 hours. You may use a calculator. Brevity

More information

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008 ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008 Instructions: Answer all four (4) questions. Point totals for each question are given in parenthesis; there are 00 points possible. Within

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

Econometrics. 8) Instrumental variables

Econometrics. 8) Instrumental variables 30C00200 Econometrics 8) Instrumental variables Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Thery of IV regression Overidentification Two-stage least squates

More information

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors:

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors: Wooldridge, Introductory Econometrics, d ed. Chapter 3: Multiple regression analysis: Estimation In multiple regression analysis, we extend the simple (two-variable) regression model to consider the possibility

More information

The Simple Linear Regression Model

The Simple Linear Regression Model The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate

More information

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, Linear Regression In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, y = Xβ + ɛ, where y t = (y 1,..., y n ) is the column vector of target values,

More information

Regression with Qualitative Information. Part VI. Regression with Qualitative Information

Regression with Qualitative Information. Part VI. Regression with Qualitative Information Part VI Regression with Qualitative Information As of Oct 17, 2017 1 Regression with Qualitative Information Single Dummy Independent Variable Multiple Categories Ordinal Information Interaction Involving

More information

LECTURE 11. Introduction to Econometrics. Autocorrelation

LECTURE 11. Introduction to Econometrics. Autocorrelation LECTURE 11 Introduction to Econometrics Autocorrelation November 29, 2016 1 / 24 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists of choosing: 1. correct

More information

7 Introduction to Time Series Time Series vs. Cross-Sectional Data Detrending Time Series... 15

7 Introduction to Time Series Time Series vs. Cross-Sectional Data Detrending Time Series... 15 Econ 495 - Econometric Review 1 Contents 7 Introduction to Time Series 3 7.1 Time Series vs. Cross-Sectional Data............ 3 7.2 Detrending Time Series................... 15 7.3 Types of Stochastic

More information

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do

More information

Lecture 8a: Spurious Regression

Lecture 8a: Spurious Regression Lecture 8a: Spurious Regression 1 Old Stuff The traditional statistical theory holds when we run regression using (weakly or covariance) stationary variables. For example, when we regress one stationary

More information

Handout 11: Measurement Error

Handout 11: Measurement Error Handout 11: Measurement Error In which you learn to recognise the consequences for OLS estimation whenever some of the variables you use are not measured as accurately as you might expect. A (potential)

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

Linear Regression with Time Series Data

Linear Regression with Time Series Data Econometrics 2 Linear Regression with Time Series Data Heino Bohn Nielsen 1of21 Outline (1) The linear regression model, identification and estimation. (2) Assumptions and results: (a) Consistency. (b)

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

Introduction to Eco n o m et rics

Introduction to Eco n o m et rics 2008 AGI-Information Management Consultants May be used for personal purporses only or by libraries associated to dandelon.com network. Introduction to Eco n o m et rics Third Edition G.S. Maddala Formerly

More information

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley Review of Classical Least Squares James L. Powell Department of Economics University of California, Berkeley The Classical Linear Model The object of least squares regression methods is to model and estimate

More information

Binary Dependent Variables

Binary Dependent Variables Binary Dependent Variables In some cases the outcome of interest rather than one of the right hand side variables - is discrete rather than continuous Binary Dependent Variables In some cases the outcome

More information

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. B203: Quantitative Methods Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. Part I: Compulsory Questions. Answer all questions. Each question carries

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression Asymptotics Asymptotics Multiple Linear Regression: Assumptions Assumption MLR. (Linearity in parameters) Assumption MLR. (Random Sampling from the population) We have a random

More information

Chapter 6: Linear Regression With Multiple Regressors

Chapter 6: Linear Regression With Multiple Regressors Chapter 6: Linear Regression With Multiple Regressors 1-1 Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS 4. Measures of fit 5. Sampling distribution

More information

Introduction to Econometrics. Multiple Regression (2016/2017)

Introduction to Econometrics. Multiple Regression (2016/2017) Introduction to Econometrics STAT-S-301 Multiple Regression (016/017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 OLS estimate of the TS/STR relation: OLS estimate of the Test Score/STR relation:

More information

We begin by thinking about population relationships.

We begin by thinking about population relationships. Conditional Expectation Function (CEF) We begin by thinking about population relationships. CEF Decomposition Theorem: Given some outcome Y i and some covariates X i there is always a decomposition where

More information

In Class Review Exercises Vartanian: SW 540

In Class Review Exercises Vartanian: SW 540 In Class Review Exercises Vartanian: SW 540 1. Given the following output from an OLS model looking at income, what is the slope and intercept for those who are black and those who are not black? b SE

More information

Dynamic Panels. Chapter Introduction Autoregressive Model

Dynamic Panels. Chapter Introduction Autoregressive Model Chapter 11 Dynamic Panels This chapter covers the econometrics methods to estimate dynamic panel data models, and presents examples in Stata to illustrate the use of these procedures. The topics in this

More information

Graduate Econometrics Lecture 4: Heteroskedasticity

Graduate Econometrics Lecture 4: Heteroskedasticity Graduate Econometrics Lecture 4: Heteroskedasticity Department of Economics University of Gothenburg November 30, 2014 1/43 and Autocorrelation Consequences for OLS Estimator Begin from the linear model

More information

Simple Linear Regression Model & Introduction to. OLS Estimation

Simple Linear Regression Model & Introduction to. OLS Estimation Inside ECOOMICS Introduction to Econometrics Simple Linear Regression Model & Introduction to Introduction OLS Estimation We are interested in a model that explains a variable y in terms of other variables

More information