Economics 300. Econometrics Multiple Regression: Extensions and Issues

Size: px
Start display at page:

Download "Economics 300. Econometrics Multiple Regression: Extensions and Issues"

Transcription

1 Economics 300 Econometrics Multiple : Extensions and Dennis C. Plott University of Illinois at Chicago Department of Economics Fall 2014 Dennis C. Plott (UIC) ECON 300 Fall / 36

2 The Classical s The Classical s The classical assumptions must be met in order for OLS estimators to be the best (i.e., minimum variance) linear unbiased estimators (BLUE) The seven 1 classical assumptions are: 1. The population model is linear in parameters (i.e., β) and correctly specified. y = β 0 + β 1 x β k x k + u 1 five to seven, depends on source Dennis C. Plott (UIC) The Classical s ECON 300 Fall / 36

3 The Classical s The Classical s The classical assumptions must be met in order for OLS estimators to be the best (i.e., minimum variance) linear unbiased estimators (BLUE) The seven 1 classical assumptions are: 1. The population model is linear in parameters (i.e., β) and correctly specified. y = β 0 + β 1 x β k x k + u 1 five to seven, depends on source Dennis C. Plott (UIC) The Classical s ECON 300 Fall / 36

4 The Classical s The Classical s The classical assumptions must be met in order for OLS estimators to be the best (i.e., minimum variance) linear unbiased estimators (BLUE) The seven 1 classical assumptions are: 1. The population model is linear in parameters (i.e., β) and correctly specified. y = β 0 + β 1 x β k x k + u y = β 0 β 3 + β 1 x β k x k + u 1 five to seven, depends on source Dennis C. Plott (UIC) The Classical s ECON 300 Fall / 36

5 The Classical s The Classical s The classical assumptions must be met in order for OLS estimators to be the best (i.e., minimum variance) linear unbiased estimators (BLUE) The seven 1 classical assumptions are: 1. The population model is linear in parameters (i.e., β) and correctly specified. y = β 0 + β 1 x β k x k + u y = β 0 β 3 + β 1 x β k x k + u 1 five to seven, depends on source Dennis C. Plott (UIC) The Classical s ECON 300 Fall / 36

6 The Classical s The Classical s The classical assumptions must be met in order for OLS estimators to be the best (i.e., minimum variance) linear unbiased estimators (BLUE) The seven 1 classical assumptions are: 1. The population model is linear in parameters (i.e., β) and correctly specified. y = β 0 + β 1 x β k x k + u y = β 0 β 3 + β 1 x β k x k + u y = β 0 + β 2 1 x β k x k + u 1 five to seven, depends on source Dennis C. Plott (UIC) The Classical s ECON 300 Fall / 36

7 The Classical s The Classical s The classical assumptions must be met in order for OLS estimators to be the best (i.e., minimum variance) linear unbiased estimators (BLUE) The seven 1 classical assumptions are: 1. The population model is linear in parameters (i.e., β) and correctly specified. y = β 0 + β 1 x β k x k + u y = β 0 β 3 + β 1 x β k x k + u y = β 0 + β 2 1 x β k x k + u 1 five to seven, depends on source Dennis C. Plott (UIC) The Classical s ECON 300 Fall / 36

8 The Classical s The Classical s The classical assumptions must be met in order for OLS estimators to be the best (i.e., minimum variance) linear unbiased estimators (BLUE) The seven 1 classical assumptions are: 1. The population model is linear in parameters (i.e., β) and correctly specified. y = β 0 + β 1 x β k x k + u y = β 0 β 3 + β 1 x β k x k + u y = β 0 + β 2 1 x β k x k + u y = β 0 + β 1 x β kx k + u 1 five to seven, depends on source Dennis C. Plott (UIC) The Classical s ECON 300 Fall / 36

9 The Classical s The Classical s The classical assumptions must be met in order for OLS estimators to be the best (i.e., minimum variance) linear unbiased estimators (BLUE) The seven 1 classical assumptions are: 1. The population model is linear in parameters (i.e., β) and correctly specified. y = β 0 + β 1 x β k x k + u y = β 0 β 3 + β 1 x β k x k + u y = β 0 + β 2 1 x β k x k + u y = β 0 + β 1 x β kx k + u 2. {x i,y i } are a random sample; i.e., each individual in a population is equally likely to be picked. Also, all data points come from the same population. 1 five to seven, depends on source Dennis C. Plott (UIC) The Classical s ECON 300 Fall / 36

10 The Classical s The Classical s (Continued) 3. None of the independent variables is constant and there are no exact linear relationships; no perfect collinearity in the regressors. y = β 0 + β 1 x 1 + β 2 x 2 + u x 1 = γ 0 + γ 1 x 2 i.e., if I know x 2, then I know x 1. For example, housep = β 0 + β 1 sqft + β 2 sqm + u. 4. Zero conditional mean of errors! E(u i x i ) = E(u i ) = 0 If, for example, I regress wage on educ and I know that someone has 16 years of education, then this does not help me to predict if they are above, below, or on the population regression line. Dennis C. Plott (UIC) The Classical s ECON 300 Fall / 36

11 The Classical s The Classical s (Continued) 5. Homoskedastic Errors; i.e., constant variance 6. No Serial Correlation Var(u i x i ) = σ 2 = constant, i Cov(u i,u j ) = 0, i j This means errors are independent knowing one of the errors, does not help predict another error. 7. (Optional) Normally Distributed Errors u N(0,σ 2 u ) This also implies that the conditional distribution of y is normally distributed The normality assumption does not apply to the independent variables Dennis C. Plott (UIC) The Classical s ECON 300 Fall / 36

12 Symptoms Irrelevant See do-file. The t-statistics for the coefficients are not significant. Yet, the overall F-statistic is significant. Even though both independent variables have the same standard errors and almost identical correlations with y, their estimated effects are radically different. x 1 and x 2 are very highly correlated. The sample is relatively small. These are all indicators that multicollinearity might be a problem in these data. Dennis C. Plott (UIC) ECON 300 Fall / 36

13 Irrelevant Causes Improper use of dummy variables; e.g., failure to exclude one category. Including a variable that is computed from other variables in the equation (e.g., family income = husband s income + wife s income, and the regression includes all three income measures). In effect, including the same or almost the same variable twice (height in feet and height in inches.) The above all imply some sort of error on the researcher s part. Sometimes models include variables that are nonlinear or non-additive (interactive) functions of other variables in the model. For example, the independent variables might include both x and x 2. In such cases, the original variables and the variables computed from them can be highly correlated. Also, multicollinearity between nonlinear and non-additive terms could make it difficult to determine whether there is multicollinearity involving other variables. Dennis C. Plott (UIC) ECON 300 Fall / 36

14 Irrelevant Consequences Even extreme multicollinearity (so long as it is not perfect) does not violate OLS assumptions. OLS estimates are still unbiased and BLUE. Nevertheless, the greater the multicollinearity, the greater the standard errors. When high multicollinearity is present, confidence intervals for coefficients tend to be very wide and t-statistics tend to be very small. Coefficients will have to be larger in order to be statistically significant, i.e. it will be harder to reject the null when multicollinearity is present. Note, however, that large standard errors can be caused by things besides multicollinearity. When two independent variables are highly and positively correlated, their slope coefficient estimators will tend to be highly and negatively correlated. Further, a different sample will likely produce the opposite result. In other words, if you overestimate the effect of one parameter, you will tend to underestimate the effect of the other. Hence, coefficient estimates tend to be very shaky from one sample to the next. Dennis C. Plott (UIC) ECON 300 Fall / 36

15 Detection & Solution Irrelevant Detection is a matter of degree. There is no irrefutable test that it is or is not a problem. See symptoms; slide 5. In Stata, type vif after a regression. Solution The Variance Inflation Factor (VIF) shows us how much the variance of the coefficient estimate is being inflated by multicollinearity. For example, if the VIF for a variable were 9, its standard error would be three times as large as it would be if its VIF was 1. In such a case, the coefficient would have to be 3 times as large to be statistically significant. More and better data. Dennis C. Plott (UIC) ECON 300 Fall / 36

16 Irrelevant Recall homoskedasticity: Var(u i x i ) = σ 2 = constant, i. Notice the lack of a subscript on the sigma. If the error terms do not have constant variance, they are said to be heteroskedastic: Var(u i x i ) = σ 2 constant. Notice the subscript i on the sigma; i i.e., the variance will depend on observation of the independent variable. Non Sequitur: heteroskedastic comes from Ancient Greek with hetero (different) and skedasis (dispersion). Dennis C. Plott (UIC) ECON 300 Fall / 36

17 Irrelevant Occurrence Errors may increase as the value of an independent variable increases. For example, consider a model in which annual family income is the independent variables and annual family expenditures on vacations is the dependent variable. Families with low incomes will spend relatively little on vacations, and the variations in expenditures across such families will be small. But for families with large incomes, the amount of discretionary income will be higher. The mean amount spent on vacations will be higher, and there will also be greater variability among such families, resulting in heteroskedasticity. Note that, in this example, a high family income is a necessary but not sufficient condition for large vacation expenditures. Any time a high value for an independent variable is a necessary but not sufficient condition for an observation to have a high value on a dependent variable, heteroskedasticity is likely. Errors may also increase as the values of an independent variable becomes more extreme in either direction. For example, with attitudes that range from extremely negative to extremely positive. This will produce something that looks like an hourglass shape or bow-tie. Dennis C. Plott (UIC) ECON 300 Fall / 36

18 Irrelevant Occurrence (Continued) Measurement error can cause heteroskedasticity. Some respondents might provide more accurate responses than others. (Note: this problem arises from the violation of another assumption: variables are measured without error; i.e., zero conditional mean.) can also occur if there are sub-population differences or other interaction effects; e.g., the effect of income on expenditures differs for whites and blacks. (Again, the problem arises from violation of the assumption that no such differences exist or have already been incorporated into the model; i.e., the model is correctly specified.) Other model misspecifications can produce heteroskedasticity. For example, it may be that instead of using y, you should be using the log of y. Instead of using x, maybe you should be using x 2, or both x and x 2. Important variables may be omitted from the model. If the model were correctly specified, you might find that the patterns of heteroskedasticity disappear. Dennis C. Plott (UIC) ECON 300 Fall / 36

19 Consequences Irrelevant Note that heteroskedasticity is often a by-product of other violations of assumptions. These violations have their own consequences which we will deal with elsewhere. For now, we will assume that other assumptions except heteroskedasticity have been met. Then, does NOT result in biased parameter estimates. However, OLS estimates are no longer BLUE. That is, among all the unbiased estimators, OLS does not provide the estimate with the smallest variance. Depending on the nature of the heteroskedasticity, significance tests can be too high or too low. In addition, the standard errors are biased when heteroskedasticity is present. This in turn leads to bias in test statistics (e.g., t- and F-statistics) and confidence intervals. Dennis C. Plott (UIC) ECON 300 Fall / 36

20 Irrelevant Detection Visual Inspection Do a visual inspection of residuals plotted against fitted values; or, plot the independent variable suspected to be correlated with the variance of the error term. In Stata, after running a regression, you could use the rvfplot (residuals versus fitted values) or rvpplot command (residual versus predictor plot; e.g., plot the residuals versus one of the x variables included in the equation). Formal Tests (A bunch, differ in the form of heteroskedasticity tested and whether or not the error is Normally distributed.) Breusch-Pagan/Cook-Weisberg Test for designed to detect any linear form of heteroskedasticity. The null hypothesis is that the error variances are all equal versus the alternative that the error variances are a multiplicative function of one or more variables. In Stata, after running a regression, use the following command: estat hettest. White s General Test for Data Type In Stata, estat imtest, white. occurs most often in cross-sectional data. These are data where observations are all for the same time period (e.g., a particular month, day, or year) but are from different entities (e.g., people, firms, provinces, countries, etc.) Dennis C. Plott (UIC) ECON 300 Fall / 36

21 Irrelevant Solution Respecify the /Transform the As noted before, sometimes heteroskedasticity results from improper model specification. Note: specification is determined from theory. Research and think carefully about what belongs in the model prior to running regressions. There may be subgroup differences. Effects of variables may not be linear. Perhaps some important variables have been left out of the model. Use Robust Standard Errors or Cluster Stata includes options with most routines for estimating robust standard errors (you will also hear these referred to as Huber/White estimators or sandwich estimators of variance). As noted above, heteroskedasticity causes standard errors to be biased. OLS assumes that errors are both independent and identically distributed; robust standard errors relax either or both of those assumptions. Hence, when heteroskedasticity is present, robust standard errors tend to be more trustworthy. Dennis C. Plott (UIC) ECON 300 Fall / 36

22 Irrelevant Solution (Continued) The use of robust standard errors does not change coefficient estimates, but (because the standard errors are changed) the test statistics will give you reasonably accurate p-values. With Stata, robust standard errors can usually be computed via the addition of two parameters, robust and cluster. The robust option relaxes the assumption that the errors are identically distributed, while cluster relaxes the assumption that the error terms are independent of each other. Caution: Do not confuse robust standard errors with robust regression. Despite their similar names, they deal with different problems: Robust standard errors address the problem of errors that are not independent and identically distributed. The use of robust standard errors will not change the coefficient estimates provided by OLS, but they will change the standard errors and significance tests. Dennis C. Plott (UIC) ECON 300 Fall / 36

23 Irrelevant Irrelevant This refers to the case of including a variable in an equation when it does not belong there. In other words, not derived from (sound) economic theory; i.e., corr(x 1,x 2 ) 0 Assume that the true regression specification is: y i = β 0 + β 1 x 1 + u (1) But the researcher for some reason includes an extra variable: y i = β 0 + β 1 x 1 + β 2 x 2 + v (2) The misspecified equation s error term then becomes: v = u β 2 x 2 (3) Dennis C. Plott (UIC) Irrelevant ECON 300 Fall / 36

24 Irrelevant (Continued) Irrelevant So, the inclusion of an irrelevant variable will not cause bias (since the true coefficient of the irrelevant variable is zero, and so the second term will drop out of Equation 3) However, the inclusion of an irrelevant variable will: Increase the variance of the estimated coefficients, and this increased variance will tend to decrease the absolute magnitude of their t-scores. Decrease the R 2, but not the R 2 Dennis C. Plott (UIC) Irrelevant ECON 300 Fall / 36

25 Internal Validity Irrelevant Internal validity: the statistical inferences about causal effects are valid for the population being studied Five threats to internal validity 1. Omitted variable bias Wrong functional form 2. Sample selection bias 3. Simultaneous causality bias (or reversed causality) 4. Measurement error ( Errors-in-variables bias) All threats lead to a violation of the conditional independence assumption (zero conditional mean assumption) Dennis C. Plott (UIC) ECON 300 Fall / 36

26 Omitted Variable Bias Irrelevant Two reasons why an important explanatory variable might have been left out: 1. we forgot it is not available in the dataset, we are examining Either way, this may lead to omitted variable bias (or, more generally, specification bias) The reason for this is that when a variable is not included, it cannot be held constant Omitting a relevant variable usually is evidence that the entire equation is a suspect, because of the likely bias of the coefficients. Dennis C. Plott (UIC) ECON 300 Fall / 36

27 Irrelevant Omitted Variable Bias Consequences Suppose the true regression model is: y i = β 0 + β 1 x 1 + β 2 x 2 + u (4) where u is a classical error term and corr(x 1,x 2 ) 0 If x 2 is omitted, the equation becomes instead: y i = β 0 + β 1 x 1 + v (5) Where: v = u + β 2 x 2 (6) Hence, the explanatory variables in the estimated regression (5) are not independent of the error term (unless the omitted variable is uncorrelated with all the included variables) Dennis C. Plott (UIC) ECON 300 Fall / 36

28 Irrelevant Omitted Variable Bias Consequences (Continued) What happens if we estimate Equation (5) when Equation (4) is correct? Bias This means: E( ˆβ 1 ) β 1 (7) The amount of bias is a function of the impact of the omitted variable on the dependent variable times a function of the correlation between the included and the omitted variable So, the bias exists unless: 1. the true coefficient equals zero, or 2. the included and omitted variables are uncorrelated Dennis C. Plott (UIC) ECON 300 Fall / 36

29 Omitted Variable Bias Potential Solutions Irrelevant include variable if measurable (or proxy) use panel data (solves problem when there are time-constant individual effects) use instrumental variable regression run a randomized experiment Dennis C. Plott (UIC) ECON 300 Fall / 36

30 Irrelevant Endogeneity and Unobserved Heterogeneity Omitted variable bias is the most common illustration of what economists refer to as endogeneity. Endogenous variables are variables determined by other variables in the system, while exogenous variables are variables which can be considered external shocks to the system. Other important sources of endogeneity include reverse causality and measurement error. Functional misspecification is really a special case of omitted variable bias. To truly be able to make a causal claim, we need a truly exogenous variable that is, a variable which is not related to any of the other variables in the system, unobserved and observed. The problem with observational data is that there are an infinite number of unobserved variables which could render our observed relationship endogenous. This is the problem of unobserved heterogeneity in our sample. Dennis C. Plott (UIC) ECON 300 Fall / 36

31 Endogeneity and Unobserved Heterogeneity (Continued) Irrelevant As an example, let s look at a simple question. Do private schools improve student s test performance? Let s say we had a sample of public and private school students math test scores. We could look at the difference in the average score between groups. But it would be dangerous to assume that such a difference reflected the "treatment" of private schools, because it seems likely that more apt students are more likely to self-select into private schools. One standard solution is to control for all the observed measures that might lead to such self-selection which are available to us. The problem is that we are unlikely to effectively control for all of this selectivity, because some variables associated with the selection process are probably unobserved. Even if all the important variables were observed, we would only completely control for them if we correctly specified the functional form of their relationship to test scores. Dennis C. Plott (UIC) ECON 300 Fall / 36

32 Irrelevant Measurement Error (Errors in ) Many economists feel that the greatest drawback to econometrics is the fact that the data with which econometricians must work are so poor. A well-known quotation expressing this feeling is due to Josiah Stamp: The Government are very keen on amassing statistics they collect them, add them, raise them to the nth power, take the cube root and prepare wonderful diagrams. But what you must never forget is that every one of those figures comes in the first instance from the village watchman, who just puts down what he damn pleases. The errors-in-variables problem is concerned with the implication of using incorrectly measured variables, whether these measurement errors arise from the whims of the village watchman or from the use by econometricians of a proxy variable in place of an unobservable variable suggested by economic theory. Examples survey data on households: recall bias: how much time did you spend unemployed last year? rounding bias: how much money did you spend on food last week? Dennis C. Plott (UIC) ECON 300 Fall / 36

33 Irrelevant Measurement Error (Continued) Errors in measuring the dependent variables are incorporated in the disturbance term; their existence causes no problems. provided the measurement error in y is uncorrelated with (correctly measured) x When there are errors in measuring an independent variable, however, the zero conditional mean assumption is violated, since these measurement errors make this independent variable stochastic; the seriousness of this depends on whether or not this regressor is distributed independently of the disturbance. If only one of the explanatory variables is measured with error, it can be shown: the OLS estimator of the coefficient on that variable is biased towards zero (attenuation bias) this bias does NOT disappear in large samples (OLS is inconsistent) the OLS estimator of the coefficients on the other explanatory variables are also biased, in unknown directions If several explanatory variables are measured with error, it is very difficult to sign the biases for any of the coefficients Dennis C. Plott (UIC) ECON 300 Fall / 36

34 Irrelevant Measurement Error (Continued) Morgenstern (1963) wrote an entire book examining the accuracy of economic data. Some spectacular examples of data fudging by government agencies can be found in Streissler (1970, pp. 27 9). Example: a large overstatement of housing starts in Austria was compensated for by deliberately understating several subsequent housing start figures. Streissler claims that often the econometrician more or less completely misunderstands what the statistics he works with really mean. Griliches (1985) offers four responses to Morgenstern: 1. The data are not that bad. 2. The data are lousy but it doesn t matter. 3. The data are bad but we have learned how to live with them and adjust for their foibles. 4. That is all there is it is the only game in town and we have to make the best of it. In some instances it could be argued that economic agents respond to the measured rather than the true variables, implying that the original estimating equation should be specified in terms of the measured rather than the true values of the regressors. This eliminates the errors-in-variables problem. Dennis C. Plott (UIC) ECON 300 Fall / 36

35 Irrelevant Sample Selection Bias Often assumed in regression framework: simple random sampling of the population, but sometimes sample selects itself Sample selection bias arises when a selection process Influences the availability of data and that process is related to the dependent variable. Induces correlation between regressor and error term (again violates conditional mean independence) Example: Returns to Education Random sample from the population of workers, data on earnings and years of education Problem: factors that determine whether someone works are quite similar to the factors that determine how much that person earns when employed Potential Solutions Randomized controlled experiment Construct a model of the sample selection problem and estimate that model (Heckman s sample selection model (1979) which was cited in his 2000 Nobel) Dennis C. Plott (UIC) ECON 300 Fall / 36

36 Simultaneity (Reverse Causality) Irrelevant Example of reversed causality: low wages cause bad health or bad health causes low wages? Potential solutions: Randomized controlled experiment: because x is chosen at random by the experimenter, there is no feedback from the outcome variable to y (assuming perfect compliance) Use instrumental variables regression to estimate the causal effect of interest (use the variation in x that is exogenous) Develop and estimate a complete model of both directions of causality (idea behind large structural models, very difficult in practice). Dennis C. Plott (UIC) ECON 300 Fall / 36

37 Binary Linear Probability (LPM) Dummy In this class of models, we consider the case where the dependent variable can take the value of 0 or 1. They are often termed dichotomous variables. The literature on this type of model is extensive, it can include cases where there are more than 2 possible outcomes, however we are only covering an introductory section of this area of econometrics. These types of model tend to be associated with the cross-sectional econometrics rather than time series. There are many examples of this type of model in the finance literature. For instance, if we were to examine bank failures. { 1,if the bank fails y = 0, otherwise Dennis C. Plott (UIC) Binary ECON 300 Fall / 36

38 Data Binary Linear Probability (LPM) When examining the dummy dependent variables we need to ensure there are sufficient numbers of 0s and 1s. If we were assessing bank failures, we would need a sample of both banks that have failed and those that have not failed. This can create problems with sampling, as it is easier to find data for solvent banks relative to failed banks. Dennis C. Plott (UIC) Binary ECON 300 Fall / 36

39 Linear Probability (LPM) Binary Linear Probability (LPM) The Linear Probability, uses OLS to estimate the model, the coefficients, t-statistics, etc. are then interpreted in the usual way. This produces the usual linear regression line, which is fitted through the two sets of observations. Dennis C. Plott (UIC) Linear Probability (LPM) ECON 300 Fall / 36

40 Features of the LPM Binary Linear Probability (LPM) The dependent variable has two values, the value 1 has a probability of p and the value 0 has a probability of (1 p). This is known as the Bernoulli probability distribution. In this case the expected value of a random variable following a Bernoulli distribution is the probability the variable equals 1. Since the probability of p must lie between 0 and 1, then the expected value of the dependent variable must also lie between 0 and 1. Dennis C. Plott (UIC) Linear Probability (LPM) ECON 300 Fall / 36

41 Binary Linear Probability (LPM) Problems with LPM The error term is not normally distributed, it also follows the Bernoulli distribution. The variance of the error term is heteroskedastistic. The variance for the Bernoulli distribution is p(1 p), where p is the probability of a success. Possibly the most problematic aspect of the LPM is the non-fulfillment of the requirement that the estimated value of the dependent variable y lies between 0 and 1. One way around the problem is to assume that all values below 0 and above 1 are actually 0 or 1 respectively. An alternative remedy to the problem is to use a technique such as the Logit or Probit models. Dennis C. Plott (UIC) Linear Probability (LPM) ECON 300 Fall / 36

42 Problems with LPM (Continued) Binary Linear Probability (LPM) The final problem with the LPM is that it is a linear model and assumes that the probability of the dependent variable equaling 1 is linearly related to the explanatory variable. For example if we have a model where the dependent variable takes the value of 1 if a mortgage is granted to a bank customer and 0 otherwise, regressed on the customers income. The probability of being granted a mortgage will rise steadily at low income levels, but change hardly at all at high income levels. Dennis C. Plott (UIC) Linear Probability (LPM) ECON 300 Fall / 36

43 Binary Linear Probability (LPM) LPM Example with Interpretation highbp i = age i lincome i ( se) ˆ (0.124) (0.001) (0.076) { 1,if have high blood pressure highbp = 0, otherwise (0.021) diabetes i Interpretation Dennis C. Plott (UIC) Linear Probability (LPM) ECON 300 Fall / 36

44 Binary Linear Probability (LPM) LPM Example with Interpretation highbp i = age i lincome i ( se) ˆ (0.124) (0.001) (0.076) { 1,if have high blood pressure highbp = 0, otherwise (0.021) diabetes i Interpretation age: On average, a one year increase in age will increase the probability of having high blood pressure by 0.5 percentage points, ceteris paribus. Dennis C. Plott (UIC) Linear Probability (LPM) ECON 300 Fall / 36

45 Binary Linear Probability (LPM) LPM Example with Interpretation highbp i = age i lincome i ( se) ˆ (0.124) (0.001) (0.076) { 1,if have high blood pressure highbp = 0, otherwise (0.021) diabetes i Interpretation age: On average, a one year increase in age will increase the probability of having high blood pressure by 0.5 percentage points, ceteris paribus. lincome: On average, a 100% increase in income will reduced the probability of having high blood pressure by 4.1 percentage points, ceteris paribus. Dennis C. Plott (UIC) Linear Probability (LPM) ECON 300 Fall / 36

46 Binary Linear Probability (LPM) LPM Example with Interpretation highbp i = age i lincome i ( se) ˆ (0.124) (0.001) (0.076) { 1,if have high blood pressure highbp = 0, otherwise (0.021) diabetes i Interpretation age: On average, a one year increase in age will increase the probability of having high blood pressure by 0.5 percentage points, ceteris paribus. lincome: On average, a 100% increase in income will reduced the probability of having high blood pressure by 4.1 percentage points, ceteris paribus. diabetes: On average, people with diabetes have 19.2 percentage point higher probability of having high blood pressure than those without diabetes, ceteris paribus. Dennis C. Plott (UIC) Linear Probability (LPM) ECON 300 Fall / 36

Economics 300. Econometrics Multiple Regression: Extensions and Issues

Economics 300. Econometrics Multiple Regression: Extensions and Issues Economics 300 Econometrics Multiple : Extensions and Dennis C. Plott University of Illinois at Chicago Department of Economics www.dennisplott.com Fall 2014 Dennis C. Plott (UIC) ECON 300 Fall 2014 1 /

More information

Multiple Linear Regression CIVL 7012/8012

Multiple Linear Regression CIVL 7012/8012 Multiple Linear Regression CIVL 7012/8012 2 Multiple Regression Analysis (MLR) Allows us to explicitly control for many factors those simultaneously affect the dependent variable This is important for

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit

More information

Making sense of Econometrics: Basics

Making sense of Econometrics: Basics Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/

More information

WISE International Masters

WISE International Masters WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are

More information

ECNS 561 Multiple Regression Analysis

ECNS 561 Multiple Regression Analysis ECNS 561 Multiple Regression Analysis Model with Two Independent Variables Consider the following model Crime i = β 0 + β 1 Educ i + β 2 [what else would we like to control for?] + ε i Here, we are taking

More information

Iris Wang.

Iris Wang. Chapter 10: Multicollinearity Iris Wang iris.wang@kau.se Econometric problems Multicollinearity What does it mean? A high degree of correlation amongst the explanatory variables What are its consequences?

More information

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity LECTURE 10 Introduction to Econometrics Multicollinearity & Heteroskedasticity November 22, 2016 1 / 23 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists

More information

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage

More information

Econometrics Summary Algebraic and Statistical Preliminaries

Econometrics Summary Algebraic and Statistical Preliminaries Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L

More information

Linear Models in Econometrics

Linear Models in Econometrics Linear Models in Econometrics Nicky Grant At the most fundamental level econometrics is the development of statistical techniques suited primarily to answering economic questions and testing economic theories.

More information

Assessing Studies Based on Multiple Regression

Assessing Studies Based on Multiple Regression Assessing Studies Based on Multiple Regression Outline 1. Internal and External Validity 2. Threats to Internal Validity a. Omitted variable bias b. Functional form misspecification c. Errors-in-variables

More information

Applied Health Economics (for B.Sc.)

Applied Health Economics (for B.Sc.) Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative

More information

FinQuiz Notes

FinQuiz Notes Reading 10 Multiple Regression and Issues in Regression Analysis 2. MULTIPLE LINEAR REGRESSION Multiple linear regression is a method used to model the linear relationship between a dependent variable

More information

Econometrics - 30C00200

Econometrics - 30C00200 Econometrics - 30C00200 Lecture 11: Heteroskedasticity Antti Saastamoinen VATT Institute for Economic Research Fall 2015 30C00200 Lecture 11: Heteroskedasticity 12.10.2015 Aalto University School of Business

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Friday, June 5, 009 Examination time: 3 hours

More information

Lab 11 - Heteroskedasticity

Lab 11 - Heteroskedasticity Lab 11 - Heteroskedasticity Spring 2017 Contents 1 Introduction 2 2 Heteroskedasticity 2 3 Addressing heteroskedasticity in Stata 3 4 Testing for heteroskedasticity 4 5 A simple example 5 1 1 Introduction

More information

CHAPTER 6: SPECIFICATION VARIABLES

CHAPTER 6: SPECIFICATION VARIABLES Recall, we had the following six assumptions required for the Gauss-Markov Theorem: 1. The regression model is linear, correctly specified, and has an additive error term. 2. The error term has a zero

More information

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors:

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors: Wooldridge, Introductory Econometrics, d ed. Chapter 3: Multiple regression analysis: Estimation In multiple regression analysis, we extend the simple (two-variable) regression model to consider the possibility

More information

Notes 11: OLS Theorems ECO 231W - Undergraduate Econometrics

Notes 11: OLS Theorems ECO 231W - Undergraduate Econometrics Notes 11: OLS Theorems ECO 231W - Undergraduate Econometrics Prof. Carolina Caetano For a while we talked about the regression method. Then we talked about the linear model. There were many details, but

More information

Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity

Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity R.G. Pierse 1 Omitted Variables Suppose that the true model is Y i β 1 + β X i + β 3 X 3i + u i, i 1,, n (1.1) where β 3 0 but that the

More information

The Simple Linear Regression Model

The Simple Linear Regression Model The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate

More information

LECTURE 11. Introduction to Econometrics. Autocorrelation

LECTURE 11. Introduction to Econometrics. Autocorrelation LECTURE 11 Introduction to Econometrics Autocorrelation November 29, 2016 1 / 24 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists of choosing: 1. correct

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS Page 1 MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level

More information

PBAF 528 Week 8. B. Regression Residuals These properties have implications for the residuals of the regression.

PBAF 528 Week 8. B. Regression Residuals These properties have implications for the residuals of the regression. PBAF 528 Week 8 What are some problems with our model? Regression models are used to represent relationships between a dependent variable and one or more predictors. In order to make inference from the

More information

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, 2015-16 Academic Year Exam Version: A INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This

More information

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han Econometrics Honor s Exam Review Session Spring 2012 Eunice Han Topics 1. OLS The Assumptions Omitted Variable Bias Conditional Mean Independence Hypothesis Testing and Confidence Intervals Homoskedasticity

More information

ECON 497: Lecture 4 Page 1 of 1

ECON 497: Lecture 4 Page 1 of 1 ECON 497: Lecture 4 Page 1 of 1 Metropolitan State University ECON 497: Research and Forecasting Lecture Notes 4 The Classical Model: Assumptions and Violations Studenmund Chapter 4 Ordinary least squares

More information

Econ 1123: Section 5. Review. Internal Validity. Panel Data. Clustered SE. STATA help for Problem Set 5. Econ 1123: Section 5.

Econ 1123: Section 5. Review. Internal Validity. Panel Data. Clustered SE. STATA help for Problem Set 5. Econ 1123: Section 5. Outline 1 Elena Llaudet 2 3 4 October 6, 2010 5 based on Common Mistakes on P. Set 4 lnftmpop = -.72-2.84 higdppc -.25 lackpf +.65 higdppc * lackpf 2 lnftmpop = β 0 + β 1 higdppc + β 2 lackpf + β 3 lackpf

More information

Multiple Regression Analysis. Part III. Multiple Regression Analysis

Multiple Regression Analysis. Part III. Multiple Regression Analysis Part III Multiple Regression Analysis As of Sep 26, 2017 1 Multiple Regression Analysis Estimation Matrix form Goodness-of-Fit R-square Adjusted R-square Expected values of the OLS estimators Irrelevant

More information

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors Laura Mayoral IAE, Barcelona GSE and University of Gothenburg Gothenburg, May 2015 Roadmap Deviations from the standard

More information

THE MULTIVARIATE LINEAR REGRESSION MODEL

THE MULTIVARIATE LINEAR REGRESSION MODEL THE MULTIVARIATE LINEAR REGRESSION MODEL Why multiple regression analysis? Model with more than 1 independent variable: y 0 1x1 2x2 u It allows : -Controlling for other factors, and get a ceteris paribus

More information

Multiple Regression. Midterm results: AVG = 26.5 (88%) A = 27+ B = C =

Multiple Regression. Midterm results: AVG = 26.5 (88%) A = 27+ B = C = Economics 130 Lecture 6 Midterm Review Next Steps for the Class Multiple Regression Review & Issues Model Specification Issues Launching the Projects!!!!! Midterm results: AVG = 26.5 (88%) A = 27+ B =

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0 Introduction to Econometrics Midterm April 26, 2011 Name Student ID MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. (5,000 credit for each correct

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

ECONOMETRICS HONOR S EXAM REVIEW SESSION

ECONOMETRICS HONOR S EXAM REVIEW SESSION ECONOMETRICS HONOR S EXAM REVIEW SESSION Eunice Han ehan@fas.harvard.edu March 26 th, 2013 Harvard University Information 2 Exam: April 3 rd 3-6pm @ Emerson 105 Bring a calculator and extra pens. Notes

More information

FNCE 926 Empirical Methods in CF

FNCE 926 Empirical Methods in CF FNCE 926 Empirical Methods in CF Lecture 2 Linear Regression II Professor Todd Gormley Today's Agenda n Quick review n Finish discussion of linear regression q Hypothesis testing n n Standard errors Robustness,

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Saturday, May 9, 008 Examination time: 3

More information

Applied Quantitative Methods II

Applied Quantitative Methods II Applied Quantitative Methods II Lecture 4: OLS and Statistics revision Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 4 VŠE, SS 2016/17 1 / 68 Outline 1 Econometric analysis Properties of an estimator

More information

2. Linear regression with multiple regressors

2. Linear regression with multiple regressors 2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions

More information

Heteroskedasticity Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 30, 2015

Heteroskedasticity Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 30, 2015 Heteroskedasticity Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 30, 2015 These notes draw heavily from Berry and Feldman, and, to a lesser extent, Allison,

More information

ECON 497: Lecture Notes 10 Page 1 of 1

ECON 497: Lecture Notes 10 Page 1 of 1 ECON 497: Lecture Notes 10 Page 1 of 1 Metropolitan State University ECON 497: Research and Forecasting Lecture Notes 10 Heteroskedasticity Studenmund Chapter 10 We'll start with a quote from Studenmund:

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Homoskedasticity. Var (u X) = σ 2. (23)

Homoskedasticity. Var (u X) = σ 2. (23) Homoskedasticity How big is the difference between the OLS estimator and the true parameter? To answer this question, we make an additional assumption called homoskedasticity: Var (u X) = σ 2. (23) This

More information

EMERGING MARKETS - Lecture 2: Methodology refresher

EMERGING MARKETS - Lecture 2: Methodology refresher EMERGING MARKETS - Lecture 2: Methodology refresher Maria Perrotta April 4, 2013 SITE http://www.hhs.se/site/pages/default.aspx My contact: maria.perrotta@hhs.se Aim of this class There are many different

More information

The general linear regression with k explanatory variables is just an extension of the simple regression as follows

The general linear regression with k explanatory variables is just an extension of the simple regression as follows 3. Multiple Regression Analysis The general linear regression with k explanatory variables is just an extension of the simple regression as follows (1) y i = β 0 + β 1 x i1 + + β k x ik + u i. Because

More information

Chapter 1. An Overview of Regression Analysis. Econometrics and Quantitative Analysis. What is Econometrics? (cont.) What is Econometrics?

Chapter 1. An Overview of Regression Analysis. Econometrics and Quantitative Analysis. What is Econometrics? (cont.) What is Econometrics? Econometrics and Quantitative Analysis Using Econometrics: A Practical Guide A.H. Studenmund 6th Edition. Addison Wesley Longman Chapter 1 An Overview of Regression Analysis Instructor: Dr. Samir Safi

More information

ECON3150/4150 Spring 2015

ECON3150/4150 Spring 2015 ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2

More information

Introduction to Econometrics. Assessing Studies Based on Multiple Regression

Introduction to Econometrics. Assessing Studies Based on Multiple Regression Introduction to Econometrics The statistical analysis of economic (and related) data STATS301 Assessing Studies Based on Multiple Regression Titulaire: Christopher Bruffaerts Assistant: Lorenzo Ricci 1

More information

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2. Updated: November 17, 2011 Lecturer: Thilo Klein Contact: tk375@cam.ac.uk Contest Quiz 3 Question Sheet In this quiz we will review concepts of linear regression covered in lecture 2. NOTE: Please round

More information

8. Instrumental variables regression

8. Instrumental variables regression 8. Instrumental variables regression Recall: In Section 5 we analyzed five sources of estimation bias arising because the regressor is correlated with the error term Violation of the first OLS assumption

More information

Handout 12. Endogeneity & Simultaneous Equation Models

Handout 12. Endogeneity & Simultaneous Equation Models Handout 12. Endogeneity & Simultaneous Equation Models In which you learn about another potential source of endogeneity caused by the simultaneous determination of economic variables, and learn how to

More information

Ordinary Least Squares Regression

Ordinary Least Squares Regression Ordinary Least Squares Regression Goals for this unit More on notation and terminology OLS scalar versus matrix derivation Some Preliminaries In this class we will be learning to analyze Cross Section

More information

Econ 1123: Section 2. Review. Binary Regressors. Bivariate. Regression. Omitted Variable Bias

Econ 1123: Section 2. Review. Binary Regressors. Bivariate. Regression. Omitted Variable Bias Contact Information Elena Llaudet Sections are voluntary. My office hours are Thursdays 5pm-7pm in Littauer Mezzanine 34-36 (Note room change) You can email me administrative questions to ellaudet@gmail.com.

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

ECON Introductory Econometrics. Lecture 13: Internal and external validity

ECON Introductory Econometrics. Lecture 13: Internal and external validity ECON4150 - Introductory Econometrics Lecture 13: Internal and external validity Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 9 Lecture outline 2 Definitions of internal and external

More information

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated

More information

An overview of applied econometrics

An overview of applied econometrics An overview of applied econometrics Jo Thori Lind September 4, 2011 1 Introduction This note is intended as a brief overview of what is necessary to read and understand journal articles with empirical

More information

Linear Regression with Multiple Regressors

Linear Regression with Multiple Regressors Linear Regression with Multiple Regressors (SW Chapter 6) Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS 4. Measures of fit 5. Sampling distribution

More information

Problem 13.5 (10 points)

Problem 13.5 (10 points) BOSTON COLLEGE Department of Economics EC 327 Financial Econometrics Spring 2013, Prof. Baum, Mr. Park Problem Set 2 Due Monday 25 February 2013 Total Points Possible: 210 points Problem 13.5 (10 points)

More information

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models James J. Heckman and Salvador Navarro The University of Chicago Review of Economics and Statistics 86(1)

More information

Multiple Regression Analysis: Heteroskedasticity

Multiple Regression Analysis: Heteroskedasticity Multiple Regression Analysis: Heteroskedasticity y = β 0 + β 1 x 1 + β x +... β k x k + u Read chapter 8. EE45 -Chaiyuth Punyasavatsut 1 topics 8.1 Heteroskedasticity and OLS 8. Robust estimation 8.3 Testing

More information

6. Assessing studies based on multiple regression

6. Assessing studies based on multiple regression 6. Assessing studies based on multiple regression Questions of this section: What makes a study using multiple regression (un)reliable? When does multiple regression provide a useful estimate of the causal

More information

ECON 497 Midterm Spring

ECON 497 Midterm Spring ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain

More information

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data Panel data Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data - possible to control for some unobserved heterogeneity - possible

More information

1 Linear Regression Analysis The Mincer Wage Equation Data Econometric Model Estimation... 11

1 Linear Regression Analysis The Mincer Wage Equation Data Econometric Model Estimation... 11 Econ 495 - Econometric Review 1 Contents 1 Linear Regression Analysis 4 1.1 The Mincer Wage Equation................. 4 1.2 Data............................. 6 1.3 Econometric Model.....................

More information

Econ 510 B. Brown Spring 2014 Final Exam Answers

Econ 510 B. Brown Spring 2014 Final Exam Answers Econ 510 B. Brown Spring 2014 Final Exam Answers Answer five of the following questions. You must answer question 7. The question are weighted equally. You have 2.5 hours. You may use a calculator. Brevity

More information

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35 Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate

More information

FAQ: Linear and Multiple Regression Analysis: Coefficients

FAQ: Linear and Multiple Regression Analysis: Coefficients Question 1: How do I calculate a least squares regression line? Answer 1: Regression analysis is a statistical tool that utilizes the relation between two or more quantitative variables so that one variable

More information

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Functional form misspecification We may have a model that is correctly specified, in terms of including

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators

More information

Making sense of Econometrics: Basics

Making sense of Econometrics: Basics Making sense of Econometrics: Basics Lecture 7: Multicollinearity Egypt Scholars Economic Society November 22, 2014 Assignment & feedback Multicollinearity enter classroom at room name c28efb78 http://b.socrative.com/login/student/

More information

Topic 7: HETEROSKEDASTICITY

Topic 7: HETEROSKEDASTICITY Universidad Carlos III de Madrid César Alonso ECONOMETRICS Topic 7: HETEROSKEDASTICITY Contents 1 Introduction 1 1.1 Examples............................. 1 2 The linear regression model with heteroskedasticity

More information

ECON 4230 Intermediate Econometric Theory Exam

ECON 4230 Intermediate Econometric Theory Exam ECON 4230 Intermediate Econometric Theory Exam Multiple Choice (20 pts). Circle the best answer. 1. The Classical assumption of mean zero errors is satisfied if the regression model a) is linear in the

More information

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons Linear Regression with 1 Regressor Introduction to Econometrics Spring 2012 Ken Simons Linear Regression with 1 Regressor 1. The regression equation 2. Estimating the equation 3. Assumptions required for

More information

Review of Econometrics

Review of Econometrics Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,

More information

Introductory Econometrics

Introductory Econometrics Based on the textbook by Wooldridge: : A Modern Approach Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna October 16, 2013 Outline Introduction Simple

More information

ECON Introductory Econometrics. Lecture 16: Instrumental variables

ECON Introductory Econometrics. Lecture 16: Instrumental variables ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental

More information

1 A Non-technical Introduction to Regression

1 A Non-technical Introduction to Regression 1 A Non-technical Introduction to Regression Chapters 1 and Chapter 2 of the textbook are reviews of material you should know from your previous study (e.g. in your second year course). They cover, in

More information

Chapter 2: simple regression model

Chapter 2: simple regression model Chapter 2: simple regression model Goal: understand how to estimate and more importantly interpret the simple regression Reading: chapter 2 of the textbook Advice: this chapter is foundation of econometrics.

More information

Introduction to Econometrics. Heteroskedasticity

Introduction to Econometrics. Heteroskedasticity Introduction to Econometrics Introduction Heteroskedasticity When the variance of the errors changes across segments of the population, where the segments are determined by different values for the explanatory

More information

Instrumental Variables and the Problem of Endogeneity

Instrumental Variables and the Problem of Endogeneity Instrumental Variables and the Problem of Endogeneity September 15, 2015 1 / 38 Exogeneity: Important Assumption of OLS In a standard OLS framework, y = xβ + ɛ (1) and for unbiasedness we need E[x ɛ] =

More information

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i, A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type

More information

Linear Regression with Multiple Regressors

Linear Regression with Multiple Regressors Linear Regression with Multiple Regressors (SW Chapter 6) Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS 4. Measures of fit 5. Sampling distribution

More information

Christopher Dougherty London School of Economics and Political Science

Christopher Dougherty London School of Economics and Political Science Introduction to Econometrics FIFTH EDITION Christopher Dougherty London School of Economics and Political Science OXFORD UNIVERSITY PRESS Contents INTRODU CTION 1 Why study econometrics? 1 Aim of this

More information

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9. Section 7 Model Assessment This section is based on Stock and Watson s Chapter 9. Internal vs. external validity Internal validity refers to whether the analysis is valid for the population and sample

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle

More information

Lecture 4: Heteroskedasticity

Lecture 4: Heteroskedasticity Lecture 4: Heteroskedasticity Econometric Methods Warsaw School of Economics (4) Heteroskedasticity 1 / 24 Outline 1 What is heteroskedasticity? 2 Testing for heteroskedasticity White Goldfeld-Quandt Breusch-Pagan

More information

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 4 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 23 Recommended Reading For the today Serial correlation and heteroskedasticity in

More information

Motivation for multiple regression

Motivation for multiple regression Motivation for multiple regression 1. Simple regression puts all factors other than X in u, and treats them as unobserved. Effectively the simple regression does not account for other factors. 2. The slope

More information

Simple Regression Model (Assumptions)

Simple Regression Model (Assumptions) Simple Regression Model (Assumptions) Lecture 18 Reading: Sections 18.1, 18., Logarithms in Regression Analysis with Asiaphoria, 19.6 19.8 (Optional: Normal probability plot pp. 607-8) 1 Height son, inches

More information

ACE 564 Spring Lecture 8. Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information. by Professor Scott H.

ACE 564 Spring Lecture 8. Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information. by Professor Scott H. ACE 564 Spring 2006 Lecture 8 Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information by Professor Scott H. Irwin Readings: Griffiths, Hill and Judge. "Collinear Economic Variables,

More information

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook) Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook) 1 2 Panel Data Panel data is obtained by observing the same person, firm, county, etc over several periods. Unlike the pooled cross sections,

More information

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like.

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like. Measurement Error Often a data set will contain imperfect measures of the data we would ideally like. Aggregate Data: (GDP, Consumption, Investment are only best guesses of theoretical counterparts and

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

More on Specification and Data Issues

More on Specification and Data Issues More on Specification and Data Issues Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) Specification and Data Issues 1 / 35 Functional Form Misspecification Functional

More information

Intro to Applied Econometrics: Basic theory and Stata examples

Intro to Applied Econometrics: Basic theory and Stata examples IAPRI-MSU Technical Training Intro to Applied Econometrics: Basic theory and Stata examples Training materials developed and session facilitated by icole M. Mason Assistant Professor, Dept. of Agricultural,

More information

Job Training Partnership Act (JTPA)

Job Training Partnership Act (JTPA) Causal inference Part I.b: randomized experiments, matching and regression (this lecture starts with other slides on randomized experiments) Frank Venmans Example of a randomized experiment: Job Training

More information