ECONOMETRICS Introduction & First Principles

Size: px
Start display at page:

Download "ECONOMETRICS Introduction & First Principles"

Transcription

1 ECONOMETRICS Introduction & First Principles DA V I D C. BR O A D S T O C K Research Institute of Economics & Management, China. OSEC Pre-Master Course, 2015.

2 COURSE OUTLINE Part 1. Introduction to econometrics: including review of types of data used and basic notation for cross sectional, time-series and panel data. OLS and the linear regression. Understanding test statistics with a review of Monte-Carlo simulation. Mis-specification testing Heteroskedasticity, heterogeneity and structural breaks. Panel data. Time series methods. Advanced topics.

3 CLASS SCHEDULE Dates/locations of econometrics classes TABLE : Class schedule Date Venue Lecture Lecturer 21 Mar Saturday-morning C205 Jingshi Lecture 1 David Broadstock 28 Mar Saturday-morning C205 Jingshi Lecture 2 David Broadstock 11 Apr Saturday-morning C205 Jingshi Lecture 3 David Broadstock 18 Apr Saturday-morning C205 Jingshi Lecture 4 David Broadstock 25 Apr Saturday-morning C205 Jingshi Lecture 5 «Guest» 09 May Saturday-morning C205 Jingshi Lecture 6 David Broadstock 16 May Saturday-morning C205 Jingshi Lecture 7 David Broadstock 23 May Saturday-morning C205 Jingshi Lecture 8 David Broadstock TBA TBA Final Exam

4 GRADING/CLASS REQUIREMENTS This course is aimed at reviewing the overall merits and approaches of (statistical) econometric methods. Attendance = 5% Homeworks = 25% In class presentation = 10% Final Exam = 60% Each week we will have a presentation by 1 group discussing a paper of my choosing - aiming at 15 minutes. These will be used to tackle issue relating to econometric practices from a wider viewpoint. We shall begin with the birth of Econometrica and the first winner of the Nobel Prize in Economics!

5 SOME ONLINE RESOURCES Use the web - it is invaluable! This course is aimed at reviewing the overall merits and approaches of (statistical) econometric methods. The website of the Wooldridge book (datasets available) Econometrics Journal s online resources: Under the above, link to a list of different econometrics software packages (many are free). You should also be able to implement basic operations in a spreadsheet, or find software within the university. You may also wish to check the course webpage (click on me), where lecture slides and homework materials will be posted.

6 INTRODUCTION Today s learning outcomes. To get familiar with each other To spark an interest to econometric thinking To develop an apprieciation of the different types of data in the real-world To understand the components of a simple linear model To understand the components of an estimated simple linear model To understand what the least squares criterion is To be aware of the different types of data we may work with To begin to consider the notion of causality (probably next week)

7 ECONOMETRICS - WHAT IS IT? Analysis of statistical relationships in economic data. A highly useful tool, with lots of applications in Business in Government in Research Centres in your 3rd year dissertation in postgraduate studies Takes an effort to learn, but once learned, it is very rewarding, both intellectually and in terms of your employability. If you don t learn Econometrics while at University, you ll probably never learn it!

8 HOW DO WE THINK OF THE PROBLEM Part 1/2. Electricity consumption (Y ) depends on income (X). However consumption also depends on several other economic variables that we ignore for now In economics we express our ideas about relationships between economic variables using the mathematical concept of a function For example, to express the relationship between food electricity consumption for individual i, Y i and his/her income X i we may write Y i = f (X i ) (1) However, when studying this relationship one recognizes that the actual consumption by an individual is the result of a systematic part and/or a random and unpredictable component u i that we call the random term Y i = f (X i ) + u i (2) The random term accounts for the many factors that affect sales that we have omitted from this simple model, and it also reflects the intrinsic uncertainty in the behaviour of individuals.

9 HOW DO WE THINK OF THE PROBLEM Part 2/2. To complete the specification of the econometric model, we must also say something about the form of the algebraic relationship among our economic variables In the example about the relationship of annual electricity expenditure and annual income, we assume that the systematic part of the demand relation is linear f (X i ) = β 1 + β 2 X i (3) The corresponding econometric model is Y i = β 1 + β 2 X i + u i (4) This together with some assumptions about u i and X i is called the simple linear regression model

10 THE PROBLEM OF INFERENCE We will spend some time on this. Given that the data has been generated by a model that has the form Y i = β 1 + β 2 X i + u i, what are the values of β 1 and β 2 in such a model? Using a sample of observations on Y and X we would like to estimate the parameters b 1 and b 2 We would like to test hypotheses on these parameters For example we want to know if b 1 and b 2 are somehow informative/stable Also, we would like to understand if we are using the correct model in the first place

11 SOME JARGON There is a lot... Y i f (X i ), f (X i ) = β 1 + β 2 X i (plus assumptions) is the linear regression model f (X i ) = β 1 + β 2 X i is called linear regression β 1 and β 2 are the parameters β 1 is the intercept parameter β 2 is the slope parameter. df (X) Since = d(β 1+β 2 X i ) = β d(x) d(x) 2, then β 2 tells us the increase of E(Y X) for each unit increase in X Y is the dependent variable X is the independent variable, or the regressor u is the error term

12 SOME MORE JARGON... get used to the terms Given two estimates b 1 and b 2, the quantity Ŷi = β 1 + β 2 X i is the predicted value for individual i The difference Y i Ŷi = û i is the residual Notice that the residuals and the errors are different things The error of a sample is the deviation of the sample from the (unobservable) true function value, while the residual of a sample is the difference between the sample and the estimated function value. (Wikipedia!)

13 BEFORE DISCUSSING ESTIMATION... lets think about data and variables a little There are a number of different types of data structures - we will not handle them all in this course, but here are some of the main ones to be aware of: Cross-sectional Panel Time series Discrete/categorical outcomes Count-data/truncated data Big-data

14 CROSS-SECTIONAL DATA Cross-sectional data is the simplest type of data available to us. We rarely work with this type of data in empirical research, but the teaching of econometrics and development of econometric theory still takes advantage of it: many innovative estimators start life without considering aspects of time or individual-type heterogeneity often considering these (necessary) refinements once the cross-sectional world is understood. Imagine we have a sample of households for a single year. Denote each household i, and the total number of households in the sample as I. Note, we could easily have a cross section of individuals, or countries or firms etc. We typically will denote the data Y i for the dependent variable and X i for independent variables. An example cross-sectional demand function might look something like: Q i = α + β P P i + β Y Y i + ε i

15 PLOTTING CROSS-SECTION DATA Below are some hypothetical data on price, income and quantity consumed - we will revisit next week why it might be interesting to work with hypothetical data sometimes P Q Y Q

16 PANEL DATA Panel data is an intuitively simple extension to cross-sectional data. In short, panel data includes information on individuals i = 1,..., I, similar in this sense to cross-sectional data, but in addition records this information over many time periods t = 1,..., T e.g. years, months quarters etc. Time periods are ideally (though not always) equidistant, meaning that the amount of time passed between periods 1 and 2 is the same as for 2 and 3 and all other sequential periods. We typically will denote the data Y it for the dependent variable and X it for independent variables. An example panel-data demand function might look something like: Q it = α i + β P P it + β Y Y it + ε it Note: There is an important difference between panel data (sometimes the name given to repeated cross-sections ) and longitudinal data (sometimes referred to as pure panel )

17 PANEL-DATA ILLUSTRATED Below is an example of energy demand for 17 countries OECD 17 log energy consumption per capita log(epc) Time

18 TIME SERIES DATA Time series data is data which concentrates on a single individual (again noting the general interpretation we have for the term individual ) over multiple time periods. It allows for more comprehensive understanding of trends in the world and the exploration of phenomena such as periods of economic boom, or sudden price collapses. We typically will denote the data Y t for the dependent variable and X t for independent variables. An example time-series demand function might look something like: Q t = α i + β P P t + β Y Y t + ε t A particular interest in time series is the treatment of high-frequency data which are increasingly commonplace.

19 TIME-SERIES DATA ILLUSTRATED Below is an example of UK gasoline demand UK real (weighted) gasoline price log(price) Time

20 DISCRETE/CATEGORICAL OUTCOMES Discrete or categorical variables can be found in cross-sections, panel-data and time series. These variables represent things that have a finite number of outcomes, for example the outcome of tossing a coin, or perhaps the decision by OPEC to change supply. Discrete variables can be simple consider for example a variable OPEC t intended to reflect the decision by OPEC to change (either increase or decrease) oil supply in period t. We could have: OPEC t = { 0 if supply remains unchanged 1 if supply changes We must be careful to consider if the discrete variable has an order. For instance choosing which color to have your car requires comparing many colors, which have no natural order. Alternatively a variable describing satisfaction (1=unhappy, 5=very happy) has an intuitive ordering to it. This can have important consequences for estimation.

21 COUNT-DATA/TRUNCATED DATA Count-data and truncated data are two other types of special variables, which can again have important consequences for estimation when present: Count data will generally be variables measured in integers (e.g. cannot be obtained in parts) and which may take any integer value from 0 to. Truncated data is often continuous over a range, for example the share of gasoline in total energy will be a maximum of 1 and a minimum of zero - but is perfectly continuous on this range and not restricted to integer values.

22 BIG DATA Big Data represents the Information assets characterized by such a High Volume, Velocity and Variety to require specific Technology and Analytical Methods for its transformation into Value [De Mauro et al. (2015)] Walmart handles more than 1 million customer transactions every hour, which are imported into databases estimated to contain more than 2.5 petabytes (2560 terabytes) of data the equivalent of 167 times the information contained in all the books in the US Library of Congress. [The Economist (2010)] I am presently trying to work with a 400GB dataset on energy consumption across 16,000 households - simply opening the data is beyond the capacity of most software and PC s - not to mention the programming ability of the average economist...

23 VARIABLE TYPES Most data types can contain one or more of the following We will review for general types of variables: Continuous variables These are variables that in their purest sense can take any value from minus infinity to plus infinity. We will in most cases assume this for the dependent variable. Bounded/truncated variables Variables of this form may not go above/below certain values. They can be bound from one or two sides. Can have important effects on density estimation for example (solved via reflection). Discrete As already discussed these variables depict selection among a finite set of outcomes and can lead to using quite specialized statistical approaches. Latent variables see 2 slides later

24 VARIABLE TYPES ILLUSTRATED Below are some examples of how different variables look A continuous normal variable A 'truncated from the left' normal variable A 'truncated from above and below' normal variable Frequency Frequency Frequency mean=0, sd= mean=0, sd=1, lower=0, upper=infinity mean=0, sd=1, lower=0,upper=1

25 LATENT VARIABLES Latent variables, and more generally latent information are things economists have used in various ways for some time. Beyond simply unobserved, latent variables often fall into the territory of being unobservable, in economics this might for example take the form of preferences This will at first seem counter-intuitive, how can the behavior of something immeasurable be quantified? The answers to this are interesting and highlight the more elegant aspects of economics and statistics - incorporating wisdom into rigorous mathematical structures. We will consider technological progress and underlying trends as some intuitive pedagogical examples. A relatively mainstream example would also be factor analysis and it s variants.

26 GUESSING THE PARAMETERS Art or science? Let us turn attention back to the the econometrics, and more specifically the nature of estimation. So, how to determine b 1 and b 2? Fit the line in a way, that our prediction mistakes are minimised in the best possible way. Does it mean we can minimize the sum of the residuals?

27 GUESSING THE PARAMETERS The least squares criterion Instead of choosing b 1 and b 2 to minimize the sum of the residuals, we choose b 1 and b 2 to minimize the sum of the squared residuals û û û û 2 n (5) = (Y i b 1 b 2 X i ) 2 + (Y 2 b 1 b 2 X 2 ) 2 + (Y 3 b 1 b 2 X 3 ) (Y n b 1 b 2 X n) 2 (6) Notice that the sum of the squared residuals can be zero if and only if all the residuals are zero

28 THE LEAST SQUARES CRITERION Our first estimator The study of the linear regression model is the focus of this course A good starting point for understanding the linear regression model are Chapter s 1 and 2 of Wooldridge This course requires that you are familiar with the material in the Review section at the end of the book. It is highly recommended that you read through it during the beginning of the term.

29 A RESEARCH FORMAT Before we go any further with the mechanics. It all starts with a problem (or question) Economic theory gives us a way of thinking about the problem: What economic variables are involved and what is the possible direction of the relationship(s)? The working economic model leads to an econometric model. We must choose a functional form and make some assumptions about the nature of the error term Sample data are obtained, and a desirable method of statistical analysis chosen, based on our initial assumptions, and our understanding of how the data were collected Estimates of the unknown parameters are obtained with the help of a statistical software package, predictions are made and hypothesis tests are performed Model diagnostics are performed to check the validity of the assumptions that were made. For example, were all of the right-hand side explanatory variables relevant? Was the correct functional form used? The economic consequences and the implications of the empirical results are analyzed and evaluated. What economic resource allocation and distribution results are implied, and what are their policy-choice implications? What remaining questions might be answered with further study or new and better data?

30 ORDINARY LEAST SQUARES (OLS) The workhorse of econometrics. Getting back to the idea of guessing the parameters, we have already introduced the least squares criterion In the next slides we review the mechanics of least squares estimation It is little more than standard optimization, but over a large number of observations We concentrate on the simple linear regression for illustration, i.e. the case where there is only one X variable, since the mechanics get much more involved with more than one X, after which matrix algebra becomes favorable

31 MINIMIZATION (1/6) Simple linear regression The minimization problem min (b 1,b 2 ) (Y i b 1 b 2 X i ) 2 (7) Leads to the following first order conditions: b 1 b 2 (Y i b 1 b 2 X i ) 2 = 0 (8a) (Y i b 1 b 2 X i ) 2 = 0 (8b)

32 MINIMIZATION (2/6) Simple linear regression By taking the derivatives of each term of the sums, we can re-write the first order conditions as b 1 (Y i b 1 b 2 X i ) 2 = 0 b 2 (Y i b 1 b 2 X i ) 2 = 0 Then, evaluating these two derivatives we obtain 2 2 (9a) (9b) (Y i b 1 b 2 X i ) = 0 (10a) X(Y i b 1 b 2 X i ) = 0 (10b)

33 MINIMIZATION (3/6) Simple linear regression The 2 term cancels from each equation giving: (Y i b 1 b 2 X i ) = 0 (11a) X(Y i b 1 b 2 X i ) = 0 Expanding out the brackets and re-arranging these we obtain the normal equations: (11b) Y i = nb 1 + b 2 X i Y i = b 1 X i + b 2 X i X 2 i (12a) (12b)

34 MINIMIZATION (4/6) Simple linear regression Now we can solve for b 1 and b 2 simultaneously by multiplying 12a by n X i and also by multiplying 12b by n to give: n X i Y i = nb 1 X i Y i = nb 1 X i + b 2 ( X i ) 2 X i + nb 2 X 2 i (13a) (13b)

35 MINIMIZATION (5/6) Simple linear regression Subtracting 13a from 13b n X i Y i From which it follows that: X i Y i = b 2 [n X 2 i ( ] X i ) 2 (14) b 2 = n n X iy i n X n i Y i [ n n X i 2 ( ] (15) n X i) 2

36 MINIMIZATION (6/6) Simple linear regression Now, given b 2 we can recover b 1 by recalling normal equation 12a and re-arranging to give: nb 1 = Y i b 2 X i (16a) n b 1 = Y i n b 2 n X i n = Ȳ b 2 X (16b)

37 MINIMIZATION SUMMARY The ordinary least squares estimator Note that the slope of the estimated relationship (b 2 ) is equivalent to cov(x, Y ) divided by var(x) : b 2 = cov(x,y ) var(x) Also note that the 2 nd equation implies that the estimated line always passes through the means of X i and Y i : Ȳ = b 1 + b 2 X [This will be important to remember for hypothesis testing and model validation] Since these formulas work for any values of the sample data, they are the least squares estimators.

38 THE MAIN ASSUMPTIONS To be reviewed in multiple regression SLR.1 Linear in parameters y = β 0 + β 1 x + u (17) SLR.2 Random sampling: We can take a random sample of size n, {x i, y i : i = 1, 2,..., n} from the population model SLR.3 Zero conditional mean: E(u x) = 0 SLR.4 Sample variation in the independent variable: In the sample, the independent variable s x i, i = 1,..., n, are not all equal to the same constant. This requires some variation in x in the population.

39 MEANING OF LINEAR REGRESSION Linearity in parameters This can be a little confusing, as it is possible to specify some non-linear relationships using the simple linear regression model Clarity over this confusion comes in terms of the role of the parameters. The parameters in the equation must enter in a linear fashion, however it is still possible for variables to enter into the equation non-linearly e.g. interaction terms, which we come back to in a future class Can anybody think of examples when we might wish to try and control for non-linear behaviour? Econometrics can deal with non-linear models (i.e. non-linear in parameters) but this is beyond the extent of this course.

40 MULTIPLE LINEAR REGRESSION When there is more than one X. Y i = b 1 b 2 X 1i b 3 X 2i b k X ki + u i (18) The X variables can be transformations e.g. X 1i = X 1i, X 2i = X 2 1i The only limitations on the number of included variables is the sample size (and desired degrees of freedom for inference - to be discussed next week) Ceteris paribus - specification of the standard Multiple linear regression allows for a holding other things fixed interpretation/control environment. However it does not require anything to be fixed during data collection in order to work.

41 MINIMIZATION (1/8) Multiple regression The minimization problem min (b 1,b 2,b 3 ) (Y i b 1 b 2 X 1 b 3 X 2 ) 2 (19) Leads to the following first order conditions: b 1 b 2 b 3 (Y i b 1 b 2 X 1 b 3 X 2 ) 2 = 0 (20a) (Y i b 1 b 2 X 1 b 3 X 2 ) 2 = 0 (20b) (Y i b 1 b 2 X 1 b 3 X 2 ) 2 = 0 (20c)

42 MINIMIZATION (2/8) Multiple regression By taking the derivatives of each term of the sums evaluating the three derivatives we obtain: 2 (Y i b 1 b 2 X 1 b 3 X 2 ) = 0 (21a) 2 X 1 (Y i b 1 b 2 X 1 b 3 X 2 ) = 0 2 X 2 (Y i b 1 b 2 X 1 b 3 X 2 ) = 0 (21b) (21c)

43 MINIMIZATION Multiple regression This is where Wooldridge stops the derivation for the multiple linear regression

44 MINIMIZATION (3/8) Multiple regression Canceling the 2 terms from each equation and re-arranging we obtain the normal equations: Y i = nb 1 + b 2 X 1 + b 3 X 1 Y i = b 1 X 2 Y i = b 1 X 1 + b 2 X1 2 + b 3 X 1 + b 2 X 2 X 1 X 2 X 1 X 2 + b 3 X 2 2 (22a) (22b) (22c)

45 MINIMIZATION Multiple regression When it comes to solving this set of equations it is convenient to apply matrix algebra or Cramer s rule It is not required for this course to use these methods It is not really required to derive the least squares estimators for the multiple regression model, though it is instructive to do so The following slides outline an approach to deriving the parameter estimates using the same notation as applied for the simple linear regression

46 MINIMIZATION (4/8) Multiple regression In order to solve this set of equations it is useful to recall the structure of the two (X) variable regression model: Ŷ = ˆb 1 + ˆb 2 X 1 + ˆb 3 X 2 + u (23) Averaging over the sample observations (noting also that ū = 0) gives: Ȳ = ˆb 1 + ˆb 2 X1 + ˆb 3 X2 (24) Now subtracting (24) from (23) gives the deviation form : y = ˆb 2 x 1 + ˆb 3 x 2 + u (25)

47 MINIMIZATION (5/8) Multiple regression The intercept b 1 disappears from the deviations form of the regression but is easily recovered by re-arranging the averages form of the equation to give: ˆb 1 = Ȳ ˆb 2 X1 ˆb 3 X2 (26) in order determine the values for b 2 and b 3, as in the case of the simple linear regression, we wish to minimise the sum of the squared residuals for the deviations form of the regression: min (b 2,b 3 ) (y b 2 x 1 b 3 x 2 ) 2 (27)

48 MINIMIZATION (6/8) Multiple regression Evaluating the first order derivatives gives the following first order conditions: b 2 b 3 which give respectively: (y i b 2 x 1 b 3 x 2 ) 2 = 0 (28a) (y i b 2 x 1 b 3 x 2 ) 2 = 0 (28b) x 1 y = b 2 x1 2 b 3 x 1 x 2 x 2 y = b 2 x 1 x 2 b 3 x 2 2 (29a) (29b)

49 MINIMIZATION (7/8) Multiple regression In order to eliminate one of the unknown parameters we multiply equation (24a) by n x 2 2 and (24b) by n x 1x 2 to give: x 1 y x2 2 = b 2 x 2 y x 1 x 2 = b 2 x 2 1 x2 2 b 3 x 1 x 2 (x 1 x 2 ) 2 b 3 x 1 x 2 x 2 2 x 2 2 (30a) (30b) Then subtract (25b) from (25a) to give: x 1 y x2 2 x 2 y [ x 1 x 2 = b 2 (x 1 x 2 ) 2 x 2 1 ] x2 2 (31)

50 MINIMIZATION (8/8) Multiple regression This can be re-arranged to give: n b 2 = x 1y n x 2 2 n x 2y n x 1x 2 [ n (x 1x 2 ) 2 ] (32) n x 1 2 n x 2 2 In a similar fashion we can find that: n b 3 = x 2y n [ n x 2 1 x 2 1 n x 1y n x 1x 2 n x 2 2 n (x 1x 2 ) 2 ] (33)

51 R 2 (GOODNESS-OF-FIT) The coefficient of determination - multiple regression Given SST (the total sum of squares), SSE (the explained sum of squares) and SSR (the sum of squared residuals) we can define R 2 the ratio of the explained variation compared to the total variation; thus, it is interpreted as the fraction of the sample variation in y that is explained by x. Where R 2 = SSE SST = 1 SSR SST SST = SSE = (34) (y i ȳ) 2 (35a) (ŷ i ȳ) 2 (35b) SSR = (ū i ) 2 (35c)

52 R 2 (GOODNESS-OF-FIT) The coefficient of determination - some notes R 2 provides an extremely useful measure of the ability of the specified regression equation to explain the variation in the independent variable R 2 never decreases and it usually increases when another independent variable is added to a regression This makes R 2 a poor tool for deciding whether one additional variable or many additional variables should be added to a model However, as we will see in week 4 and in later weeks, the R 2 does provide useful information for considering whether groups of independent variables are useful in explaining the dependent variable

53 THE MAIN ASSUMPTIONS In the context of the multiple regression In the context of the more general multiple regression we have a slightly different set of assumptions: MLR.1 Linear in parameters y = β 0 + β 1 x 1 + β 2 x β k x k + u (36) MLR.2 Random sampling: We have a random sample of n observatoins, {x 1, x 2,..., x k, y i : i = 1, 2,..., n} from the population model described by assumptions MLR.1. MLR.3 Zero conditional mean: E(u x 1, x 2,..., x k ) = 0 MLR.4 No perfect collinearity: In the sample (and therefore the population), none of the independent variables is constant, and there are no exact linear relationships among the independent variables. MLR.5 Homoskedasticity: Var(u x 1, x 2,..., x k ) = σ 2

54 MLR.1 LINEAR IN PARAMETERS The meaning of linear regression This can be a little confusing, as it is possible to specify some non-linear relationships using the simple linear regression model The answer comes in terms of the role of the parameters. The parameters in the equation must enter in a linear fashion, however it is still possible for variables to enter into the equation non-linearly Can anybody think of examples when we might wish to try and control for non-linear behaviour? Econometrics can deal with non-linear models (i.e. non-linear in parameters) but this is beyond the extent of this course.

55 MLR.2 RANDOM SAMPLING The meaning of linear regression We assume that the data are randomly drawn from the population For OLS to be unbiased, this assumption needs to hold for the population

56 MLR.3 ZERO CONDITIONAL MEAN Exogenous explanatory variables One way in which this assumption can fail is if the functional relationship between the explained and the explanatory variables, for example omitting a quadratic term when in fact it should be present Functional form mis-specification can be a problem and its detection is considered in Chapter 9 of Wooldridge Omitting an important variable that is correlated with any of x 1, x 2,..., x k of the included variables can also cause MLR.3 to fail as it will mean that there is still important information contained in the residual term Next week we will show how this can generate bias in the results, and in the following weeks we will consider what can be done to remedy it

57 MLR.4 NO PERFECT COLLINEARITY Sample properties In the sample (and therefore the population), none of the independent variables is constant, and there are no exact linear relationships among the independent variables. This says nothing about the relationship defined by MLR.3, (i.e. the relationship with u, rather it relates to the relationships among any of x 1, x 2,..., x k of the included variables. If one of the independent variables is an exact linear relationship of another, then we say that it is perfectly co-linear and it cannot be estimated by OLS. This assumption does still allow for correlation between x 1, x 2,..., x k - it is the co-movement in the variables which determines the value of the coefficients

58 RECALL REGRESSION ESTIMATOR Why X must take at least two values Regarding constant terms, from simple OLS recall that: n (Y Ȳ )(X X) b 2 = n (X X) (37) 2 If X had no variation at all, so that each X i would be equal to the mean of X, the OLS estimator would not be defined, as one can t divide by zero But, even if there is some variation, more of it would be better for the precision of the OLS estimator as will be seen later Regarding the slope parameters, consider the equation for b 2, given earlier, if x 2 = 1(x 1 ): n b 2 = x 1y n x 2 2 n x 2y n x 1x 2 [ n (x 1x 2 ) 2 ] (38) n x 1 2 n x 2 2

59 MLR.5 HOMOSKEDASTICITY Var(u x 1, x 2,..., x k ) = σ 2 This assumption means that the variance in the error term, u, conditional on the explanatory variables, is the same for all combinations of outcomes of the explanatory variables. If this variance changes with any of the explanatory variables, then the residual process is said to exhibit heteroskedasticity heteroskedasticity generated problems that can have important implications for model inference and will be reviewed in week 7 Assumptions MLR.1-MLR.5 are known as the Gauss-Markov assumptions for cross sectional regression. The set of assumptions developed so far are only appropriate for cross-sectional regression. Stating the assumptions for time series or for panel data is much more difficult, though there are similarities.

60 OLS VARIANCE AND COVARIANCE Parameter uncertainty It will prove important for us to understand the variance and co-variance of the OLS estimates for individual parameters. Below are definitions that we will again return to later in the course. σ 2 Var(b j ) = SST j (1 Rj 2 ) se(b 2 ) = Var(ˆb 2 ) (39a) (39b) where σ 2 is the error variance, defined in the following slide. Variance for any given parameter is therefore a combination of the (average) model uncertainty, inversely weighted by the ability of the model to describe the data.

61 ERROR VARIANCE Model uncertainty We wish to define σ 2 = E(u 2 ) in which the expectation is equivalent to n 1 n u2 i. However ui 2 is unobserved and so we replace it with the residuals from the estimated regression Instead of taking a direct average, the denominator is equal to n k 1 as opposed to n, and is referred to as the degrees of freedom ) ( n σ 2 = u2 i (n k 1) = SSR (n k 1) (40) You may wish to read Wooldridge regarding degrees of freedom

62 BLUE Best Linear Unbiased Estimator Best: refers to the estimator with the smallest variance. This is met insofar as it is the objective function for OLS Linear: means that b 1 and b 2 are linear estimators; that is, they are linear functions of the random variable Y Unbiased: An unbiased estimator is an estimator in which E( β j ) = β j Estimator: simply refers to the fact that we are looking at an estimator. Under the main MLR assumptions β j are the BLUE s of b j for all j ( j) hence the OLS estimators are BLUE (by the Gauss Markov Theorem) hence they are often referred to as the Gauss Markov assumptions. The importance of these assumptions is that when the standard set of assumptions holds we need not look for alternative unbiased estimators.

63 OMITTED VARIABLE BIAS 1/3 The consequences of leaving something important out Imagine the true PRF is: But however we estimate: Y = β 1 + β 1 X 1 + β 1 X 2 + u Ỹ = β 1 + β 1 X 1 Note that we use rather thanˆto emphasize that β j comes from an underspecified model We know that the OLS estimator of β 2 is; n β 2 = (Y i Ȳ )(X i X) n (X i X = 2 ) n (X i X)Y i n (X i X 2 )

64 OMITTED VARIABLE BIAS 2/3 The consequences of leaving something important out Since we know that Y = β 1 + β 2 X 1 + β 3 X 2 + u we can re-write the numerator of β 2 as follows: (X i X)(β 1 + β 2 X 1 + β 3 X 2 + u) = β 2 (X i X) 2 + β 3 (X i X)X 2 + = β 2 SST 1 + β 3 (X i X)X 2 + (X i X)u (X i X)u

65 OMITTED VARIABLE BIAS 3/3 The consequences of leaving something important out Dividing by SST 1 and taking the expectation conditional on the independent variables (noting that E(u) = 0) we have E( β 2 ) = β 2 + β 3 n (X i X)X 2 n (X i X) 2 n (X i X)X 2 n (X i X) 2 is equivalent to the slope coefficient from a regression of X 2 on X 1, which we could define as X 2 = δ 1 + δ 2 X 1. We can therefore see that E( β 2 ) = β 2 + β 3 δ2 E( β 2 ) β 2 = β 3 δ2 Where E( β 2 ) β 2 is defined as the bias

66 Thanks for listening! Any questions/comments are warmly welcomed.

67 INTERPRETATION OF MODEL OUTPUT The impact of different functional forms (given that y = β 0 + β 1 x) Model Dependent variable Independent variable Interpretation of β 1 level-level y x y = β 1 x level-log y ln (x) y = β % x log-level ln (y) x % y = (100 β 1 ) x log-log ln (y) ln (x) % y = β 1 % x

ECON The Simple Regression Model

ECON The Simple Regression Model ECON 351 - The Simple Regression Model Maggie Jones 1 / 41 The Simple Regression Model Our starting point will be the simple regression model where we look at the relationship between two variables In

More information

ECNS 561 Multiple Regression Analysis

ECNS 561 Multiple Regression Analysis ECNS 561 Multiple Regression Analysis Model with Two Independent Variables Consider the following model Crime i = β 0 + β 1 Educ i + β 2 [what else would we like to control for?] + ε i Here, we are taking

More information

Introductory Econometrics

Introductory Econometrics Based on the textbook by Wooldridge: : A Modern Approach Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna October 16, 2013 Outline Introduction Simple

More information

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors:

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors: Wooldridge, Introductory Econometrics, d ed. Chapter 3: Multiple regression analysis: Estimation In multiple regression analysis, we extend the simple (two-variable) regression model to consider the possibility

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Multiple Linear Regression CIVL 7012/8012

Multiple Linear Regression CIVL 7012/8012 Multiple Linear Regression CIVL 7012/8012 2 Multiple Regression Analysis (MLR) Allows us to explicitly control for many factors those simultaneously affect the dependent variable This is important for

More information

The Simple Linear Regression Model

The Simple Linear Regression Model The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate

More information

Economics 113. Simple Regression Assumptions. Simple Regression Derivation. Changing Units of Measurement. Nonlinear effects

Economics 113. Simple Regression Assumptions. Simple Regression Derivation. Changing Units of Measurement. Nonlinear effects Economics 113 Simple Regression Models Simple Regression Assumptions Simple Regression Derivation Changing Units of Measurement Nonlinear effects OLS and unbiased estimates Variance of the OLS estimates

More information

Intermediate Econometrics

Intermediate Econometrics Intermediate Econometrics Markus Haas LMU München Summer term 2011 15. Mai 2011 The Simple Linear Regression Model Considering variables x and y in a specific population (e.g., years of education and wage

More information

Multiple Regression Analysis. Part III. Multiple Regression Analysis

Multiple Regression Analysis. Part III. Multiple Regression Analysis Part III Multiple Regression Analysis As of Sep 26, 2017 1 Multiple Regression Analysis Estimation Matrix form Goodness-of-Fit R-square Adjusted R-square Expected values of the OLS estimators Irrelevant

More information

Econometrics I Lecture 3: The Simple Linear Regression Model

Econometrics I Lecture 3: The Simple Linear Regression Model Econometrics I Lecture 3: The Simple Linear Regression Model Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 32 Outline Introduction Estimating

More information

Making sense of Econometrics: Basics

Making sense of Econometrics: Basics Making sense of Econometrics: Basics Lecture 2: Simple Regression Egypt Scholars Economic Society Happy Eid Eid present! enter classroom at http://b.socrative.com/login/student/ room name c28efb78 Outline

More information

Lectures 5 & 6: Hypothesis Testing

Lectures 5 & 6: Hypothesis Testing Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across

More information

Homoskedasticity. Var (u X) = σ 2. (23)

Homoskedasticity. Var (u X) = σ 2. (23) Homoskedasticity How big is the difference between the OLS estimator and the true parameter? To answer this question, we make an additional assumption called homoskedasticity: Var (u X) = σ 2. (23) This

More information

Chapter 2: simple regression model

Chapter 2: simple regression model Chapter 2: simple regression model Goal: understand how to estimate and more importantly interpret the simple regression Reading: chapter 2 of the textbook Advice: this chapter is foundation of econometrics.

More information

Econometrics I KS. Module 1: Bivariate Linear Regression. Alexander Ahammer. This version: March 12, 2018

Econometrics I KS. Module 1: Bivariate Linear Regression. Alexander Ahammer. This version: March 12, 2018 Econometrics I KS Module 1: Bivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: March 12, 2018 Alexander Ahammer (JKU) Module 1: Bivariate

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model

Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model Most of this course will be concerned with use of a regression model: a structure in which one or more explanatory

More information

Lecture 3: Multiple Regression

Lecture 3: Multiple Regression Lecture 3: Multiple Regression R.G. Pierse 1 The General Linear Model Suppose that we have k explanatory variables Y i = β 1 + β X i + β 3 X 3i + + β k X ki + u i, i = 1,, n (1.1) or Y i = β j X ji + u

More information

ECON3150/4150 Spring 2015

ECON3150/4150 Spring 2015 ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2

More information

The general linear regression with k explanatory variables is just an extension of the simple regression as follows

The general linear regression with k explanatory variables is just an extension of the simple regression as follows 3. Multiple Regression Analysis The general linear regression with k explanatory variables is just an extension of the simple regression as follows (1) y i = β 0 + β 1 x i1 + + β k x ik + u i. Because

More information

Linear models. Linear models are computationally convenient and remain widely used in. applied econometric research

Linear models. Linear models are computationally convenient and remain widely used in. applied econometric research Linear models Linear models are computationally convenient and remain widely used in applied econometric research Our main focus in these lectures will be on single equation linear models of the form y

More information

The Simple Regression Model. Simple Regression Model 1

The Simple Regression Model. Simple Regression Model 1 The Simple Regression Model Simple Regression Model 1 Simple regression model: Objectives Given the model: - where y is earnings and x years of education - Or y is sales and x is spending in advertising

More information

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity LECTURE 10 Introduction to Econometrics Multicollinearity & Heteroskedasticity November 22, 2016 1 / 23 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists

More information

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47 ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with

More information

An Introduction to Parameter Estimation

An Introduction to Parameter Estimation Introduction Introduction to Econometrics An Introduction to Parameter Estimation This document combines several important econometric foundations and corresponds to other documents such as the Introduction

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Saturday, May 9, 008 Examination time: 3

More information

Introductory Econometrics

Introductory Econometrics Based on the textbook by Wooldridge: : A Modern Approach Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna November 23, 2013 Outline Introduction

More information

Econometrics Summary Algebraic and Statistical Preliminaries

Econometrics Summary Algebraic and Statistical Preliminaries Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L

More information

ACE 564 Spring Lecture 8. Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information. by Professor Scott H.

ACE 564 Spring Lecture 8. Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information. by Professor Scott H. ACE 564 Spring 2006 Lecture 8 Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information by Professor Scott H. Irwin Readings: Griffiths, Hill and Judge. "Collinear Economic Variables,

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Econometrics - 30C00200

Econometrics - 30C00200 Econometrics - 30C00200 Lecture 11: Heteroskedasticity Antti Saastamoinen VATT Institute for Economic Research Fall 2015 30C00200 Lecture 11: Heteroskedasticity 12.10.2015 Aalto University School of Business

More information

Intro to Applied Econometrics: Basic theory and Stata examples

Intro to Applied Econometrics: Basic theory and Stata examples IAPRI-MSU Technical Training Intro to Applied Econometrics: Basic theory and Stata examples Training materials developed and session facilitated by icole M. Mason Assistant Professor, Dept. of Agricultural,

More information

Making sense of Econometrics: Basics

Making sense of Econometrics: Basics Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/

More information

ECON3150/4150 Spring 2016

ECON3150/4150 Spring 2016 ECON3150/4150 Spring 2016 Lecture 6 Multiple regression model Siv-Elisabeth Skjelbred University of Oslo February 5th Last updated: February 3, 2016 1 / 49 Outline Multiple linear regression model and

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define

More information

OSU Economics 444: Elementary Econometrics. Ch.10 Heteroskedasticity

OSU Economics 444: Elementary Econometrics. Ch.10 Heteroskedasticity OSU Economics 444: Elementary Econometrics Ch.0 Heteroskedasticity (Pure) heteroskedasticity is caused by the error term of a correctly speciþed equation: Var(² i )=σ 2 i, i =, 2,,n, i.e., the variance

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 4 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 23 Recommended Reading For the today Serial correlation and heteroskedasticity in

More information

Statistical Inference with Regression Analysis

Statistical Inference with Regression Analysis Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Steven Buck Lecture #13 Statistical Inference with Regression Analysis Next we turn to calculating confidence intervals and hypothesis testing

More information

2. Linear regression with multiple regressors

2. Linear regression with multiple regressors 2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions

More information

1 Correlation between an independent variable and the error

1 Correlation between an independent variable and the error Chapter 7 outline, Econometrics Instrumental variables and model estimation 1 Correlation between an independent variable and the error Recall that one of the assumptions that we make when proving the

More information

Correlation and Regression

Correlation and Regression Correlation and Regression October 25, 2017 STAT 151 Class 9 Slide 1 Outline of Topics 1 Associations 2 Scatter plot 3 Correlation 4 Regression 5 Testing and estimation 6 Goodness-of-fit STAT 151 Class

More information

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler Basic econometrics Tutorial 3 Dipl.Kfm. Introduction Some of you were asking about material to revise/prepare econometrics fundamentals. First of all, be aware that I will not be too technical, only as

More information

The cover page of the Encyclopedia of Health Economics (2014) Introduction to Econometric Application in Health Economics

The cover page of the Encyclopedia of Health Economics (2014) Introduction to Econometric Application in Health Economics PHPM110062 Teaching Demo The cover page of the Encyclopedia of Health Economics (2014) Introduction to Econometric Application in Health Economics Instructor: Mengcen Qian School of Public Health What

More information

1 Correlation and Inference from Regression

1 Correlation and Inference from Regression 1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators

More information

Environmental Econometrics

Environmental Econometrics Environmental Econometrics Syngjoo Choi Fall 2008 Environmental Econometrics (GR03) Fall 2008 1 / 37 Syllabus I This is an introductory econometrics course which assumes no prior knowledge on econometrics;

More information

Lecture-1: Introduction to Econometrics

Lecture-1: Introduction to Econometrics Lecture-1: Introduction to Econometrics 1 Definition Econometrics may be defined as 2 the science in which the tools of economic theory, mathematics and statistical inference is applied to the analysis

More information

Simple Linear Regression: The Model

Simple Linear Regression: The Model Simple Linear Regression: The Model task: quantifying the effect of change X in X on Y, with some constant β 1 : Y = β 1 X, linear relationship between X and Y, however, relationship subject to a random

More information

Simple Linear Regression. Material from Devore s book (Ed 8), and Cengagebrain.com

Simple Linear Regression. Material from Devore s book (Ed 8), and Cengagebrain.com 12 Simple Linear Regression Material from Devore s book (Ed 8), and Cengagebrain.com The Simple Linear Regression Model The simplest deterministic mathematical relationship between two variables x and

More information

Lecture 16 - Correlation and Regression

Lecture 16 - Correlation and Regression Lecture 16 - Correlation and Regression Statistics 102 Colin Rundel April 1, 2013 Modeling numerical variables Modeling numerical variables So far we have worked with single numerical and categorical variables,

More information

EC4051 Project and Introductory Econometrics

EC4051 Project and Introductory Econometrics EC4051 Project and Introductory Econometrics Dudley Cooke Trinity College Dublin Dudley Cooke (Trinity College Dublin) Intro to Econometrics 1 / 23 Project Guidelines Each student is required to undertake

More information

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables. Regression Analysis BUS 735: Business Decision Making and Research 1 Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn how to estimate

More information

EMERGING MARKETS - Lecture 2: Methodology refresher

EMERGING MARKETS - Lecture 2: Methodology refresher EMERGING MARKETS - Lecture 2: Methodology refresher Maria Perrotta April 4, 2013 SITE http://www.hhs.se/site/pages/default.aspx My contact: maria.perrotta@hhs.se Aim of this class There are many different

More information

WISE International Masters

WISE International Masters WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are

More information

WISE International Masters

WISE International Masters WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are

More information

Simple Regression Model (Assumptions)

Simple Regression Model (Assumptions) Simple Regression Model (Assumptions) Lecture 18 Reading: Sections 18.1, 18., Logarithms in Regression Analysis with Asiaphoria, 19.6 19.8 (Optional: Normal probability plot pp. 607-8) 1 Height son, inches

More information

Section 3: Simple Linear Regression

Section 3: Simple Linear Regression Section 3: Simple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction

More information

2 Prediction and Analysis of Variance

2 Prediction and Analysis of Variance 2 Prediction and Analysis of Variance Reading: Chapters and 2 of Kennedy A Guide to Econometrics Achen, Christopher H. Interpreting and Using Regression (London: Sage, 982). Chapter 4 of Andy Field, Discovering

More information

Applied Statistics and Econometrics. Giuseppe Ragusa Lecture 15: Instrumental Variables

Applied Statistics and Econometrics. Giuseppe Ragusa Lecture 15: Instrumental Variables Applied Statistics and Econometrics Giuseppe Ragusa Lecture 15: Instrumental Variables Outline Introduction Endogeneity and Exogeneity Valid Instruments TSLS Testing Validity 2 Instrumental Variables Regression

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

Review of Econometrics

Review of Econometrics Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,

More information

Quantitative Economics for the Evaluation of the European Policy

Quantitative Economics for the Evaluation of the European Policy Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,

More information

Chapter 2 The Simple Linear Regression Model: Specification and Estimation

Chapter 2 The Simple Linear Regression Model: Specification and Estimation Chapter The Simple Linear Regression Model: Specification and Estimation Page 1 Chapter Contents.1 An Economic Model. An Econometric Model.3 Estimating the Regression Parameters.4 Assessing the Least Squares

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

Intermediate Econometrics

Intermediate Econometrics Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the

More information

Chapter 4: Regression Models

Chapter 4: Regression Models Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,

More information

ECO220Y Simple Regression: Testing the Slope

ECO220Y Simple Regression: Testing the Slope ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x

More information

THE MULTIVARIATE LINEAR REGRESSION MODEL

THE MULTIVARIATE LINEAR REGRESSION MODEL THE MULTIVARIATE LINEAR REGRESSION MODEL Why multiple regression analysis? Model with more than 1 independent variable: y 0 1x1 2x2 u It allows : -Controlling for other factors, and get a ceteris paribus

More information

1 A Non-technical Introduction to Regression

1 A Non-technical Introduction to Regression 1 A Non-technical Introduction to Regression Chapters 1 and Chapter 2 of the textbook are reviews of material you should know from your previous study (e.g. in your second year course). They cover, in

More information

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0 Introduction to Econometrics Midterm April 26, 2011 Name Student ID MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. (5,000 credit for each correct

More information

Two-Variable Regression Model: The Problem of Estimation

Two-Variable Regression Model: The Problem of Estimation Two-Variable Regression Model: The Problem of Estimation Introducing the Ordinary Least Squares Estimator Jamie Monogan University of Georgia Intermediate Political Methodology Jamie Monogan (UGA) Two-Variable

More information

Inference in Regression Model

Inference in Regression Model Inference in Regression Model Christopher Taber Department of Economics University of Wisconsin-Madison March 25, 2009 Outline 1 Final Step of Classical Linear Regression Model 2 Confidence Intervals 3

More information

Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE. Sampling Distributions of OLS Estimators

Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE. Sampling Distributions of OLS Estimators 1 2 Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE Hüseyin Taştan 1 1 Yıldız Technical University Department of Economics These presentation notes are based on Introductory

More information

Inference in Regression Analysis

Inference in Regression Analysis ECNS 561 Inference Inference in Regression Analysis Up to this point 1.) OLS is unbiased 2.) OLS is BLUE (best linear unbiased estimator i.e., the variance is smallest among linear unbiased estimators)

More information

Applied Microeconometrics (L5): Panel Data-Basics

Applied Microeconometrics (L5): Panel Data-Basics Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics

More information

Chapter 16. Simple Linear Regression and dcorrelation

Chapter 16. Simple Linear Regression and dcorrelation Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Regression Analysis. BUS 735: Business Decision Making and Research

Regression Analysis. BUS 735: Business Decision Making and Research Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn

More information

Least Squares Estimation-Finite-Sample Properties

Least Squares Estimation-Finite-Sample Properties Least Squares Estimation-Finite-Sample Properties Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) Finite-Sample 1 / 29 Terminology and Assumptions 1 Terminology and Assumptions

More information

Multivariate Regression Analysis

Multivariate Regression Analysis Matrices and vectors The model from the sample is: Y = Xβ +u with n individuals, l response variable, k regressors Y is a n 1 vector or a n l matrix with the notation Y T = (y 1,y 2,...,y n ) 1 x 11 x

More information

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and

More information

Essential of Simple regression

Essential of Simple regression Essential of Simple regression We use simple regression when we are interested in the relationship between two variables (e.g., x is class size, and y is student s GPA). For simplicity we assume the relationship

More information

Hypothesis testing Goodness of fit Multicollinearity Prediction. Applied Statistics. Lecturer: Serena Arima

Hypothesis testing Goodness of fit Multicollinearity Prediction. Applied Statistics. Lecturer: Serena Arima Applied Statistics Lecturer: Serena Arima Hypothesis testing for the linear model Under the Gauss-Markov assumptions and the normality of the error terms, we saw that β N(β, σ 2 (X X ) 1 ) and hence s

More information

Linear Models in Econometrics

Linear Models in Econometrics Linear Models in Econometrics Nicky Grant At the most fundamental level econometrics is the development of statistical techniques suited primarily to answering economic questions and testing economic theories.

More information

Warm-up Using the given data Create a scatterplot Find the regression line

Warm-up Using the given data Create a scatterplot Find the regression line Time at the lunch table Caloric intake 21.4 472 30.8 498 37.7 335 32.8 423 39.5 437 22.8 508 34.1 431 33.9 479 43.8 454 42.4 450 43.1 410 29.2 504 31.3 437 28.6 489 32.9 436 30.6 480 35.1 439 33.0 444

More information

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2. Updated: November 17, 2011 Lecturer: Thilo Klein Contact: tk375@cam.ac.uk Contest Quiz 3 Question Sheet In this quiz we will review concepts of linear regression covered in lecture 2. NOTE: Please round

More information

Applied Regression Analysis. Section 2: Multiple Linear Regression

Applied Regression Analysis. Section 2: Multiple Linear Regression Applied Regression Analysis Section 2: Multiple Linear Regression 1 The Multiple Regression Model Many problems involve more than one independent variable or factor which affects the dependent or response

More information

An overview of applied econometrics

An overview of applied econometrics An overview of applied econometrics Jo Thori Lind September 4, 2011 1 Introduction This note is intended as a brief overview of what is necessary to read and understand journal articles with empirical

More information

Econometrics. Week 11. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 11. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 11 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 30 Recommended Reading For the today Advanced Time Series Topics Selected topics

More information

Statistics II. Management Degree Management Statistics IIDegree. Statistics II. 2 nd Sem. 2013/2014. Management Degree. Simple Linear Regression

Statistics II. Management Degree Management Statistics IIDegree. Statistics II. 2 nd Sem. 2013/2014. Management Degree. Simple Linear Regression Model 1 2 Ordinary Least Squares 3 4 Non-linearities 5 of the coefficients and their to the model We saw that econometrics studies E (Y x). More generally, we shall study regression analysis. : The regression

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation

More information

The Simple Regression Model. Part II. The Simple Regression Model

The Simple Regression Model. Part II. The Simple Regression Model Part II The Simple Regression Model As of Sep 22, 2015 Definition 1 The Simple Regression Model Definition Estimation of the model, OLS OLS Statistics Algebraic properties Goodness-of-Fit, the R-square

More information

Ordinary Least Squares Regression

Ordinary Least Squares Regression Ordinary Least Squares Regression Goals for this unit More on notation and terminology OLS scalar versus matrix derivation Some Preliminaries In this class we will be learning to analyze Cross Section

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons Linear Regression with 1 Regressor Introduction to Econometrics Spring 2012 Ken Simons Linear Regression with 1 Regressor 1. The regression equation 2. Estimating the equation 3. Assumptions required for

More information

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage

More information

Heteroskedasticity ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD

Heteroskedasticity ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD Heteroskedasticity ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD Introduction For pedagogical reasons, OLS is presented initially under strong simplifying assumptions. One of these is homoskedastic errors,

More information