Econometrics. 5) Dummy variables
|
|
- Everett Banks
- 5 years ago
- Views:
Transcription
1 30C00200 Econometrics 5) Dummy variables Timo Kuosmanen Professor, Ph.D.
2 Today s topics Qualitative factors as explanatory variables Binary qualitative factors Dummy variables and their interpretation Categorical variables Dummy variable trap Modeling ordinal scale data by dummy variables Slope dummies and their interpretation F-test of joint significance
3 Quantitative variables Thus far, our regressors (x) have been quantitative, interval or ratio-scale variables Examples Size of apartment in m 2 Year of construction Number of bedrooms
4 Qualitative factors In many cases, the dependent variable is influenced by qualitative factors Examples: Is there elevator in the building? Yes / no Is there sauna in the apartment? Yes / no Is there a balcony in the apartment? Yes / no
5 Dummy variables The qualitative factors can be coded numerically by using binary {0,1} numbers Example: Elevator = {1 if there is elevator; 0 if not} Sauna = {1 if there is sauna; 0 if not} Balcony = {1 if there is balcony; 0 if not} Such binary variables can be directly inserted to the regression model as explanatory variables (x), referred to as dummy variables.
6 Interpretation of dummy variables Example: Hedonic model with Elevator dummy: Price = β 1 + β 2 Size + + δ Elevator + ε i δ is the effect of elevator on price For apartments without elevator, the intercept is β 1 For apartments with elevator, the intercept is β 1 + δ
7 Example: hedonic model with elevator dummy SUMMARY OUTPUT Regression Statistics Multiple R 0,90764 R Square 0,82381 Adjusted R Square 0, Standard Error 80551,73 Observations 67 ANOVA df SS MS F Significance F Regression 4 1,88E+12 4,7E+11 72, ,12E-22 Residual 62 4,02E+11 6,49E+09 Total 66 2,28E+12 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept , ,42 2, , , size m2 7023,05 858,5364 8, ,93E , ,24 nr. Bedrooms , ,87-3, , ,2 Age -2553,42 558,2156-4, ,34E , ,57 Elevator 24616, ,74 1, , , ,46
8 Categorical variables Some qualitative factors have more than two possible values Example: Districts in Espoo Tapiola Espoonlahti Kivenlahti Leppävaara Olari Such qualitative factors with many categories can be coded numerically by using multiple dummy variables: D1 = {1 if district=tapiola; 0 otherwise} D2 = {1 if district=espoonlahti; 0 otherwise} D3 = {1 if district=kivenlahti; 0 otherwise}
9 SUMMARY OUTPUT Regression Statistics Multiple R 0, R Square 0, Adjusted R Square 0, Standard Error 53206,42 Observations 642 Example: hedonic model with a larger sample ANOVA df SS MS F Significance F Regression 14 5,67E+12 4,05E ,0872 1,7E-184 Residual 627 1,77E+12 2,83E+09 Total 641 7,45E+12 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 85388, ,18 4, ,8E , Size m2 4137, , , ,52E , ,29 Bedrooms ,7 5323,126-6,1794 1,16E ,4 Age -2584,75 151, ,096 4,51E , ,85 Elevator 6216, ,772 1, , , ,94 1st floor -9137, ,723-1, , ,7 2187,183 Top floor 9576, ,709 1, , , ,19 Energy class A-C 33899, ,676 3, , , ,58 Condition satisfactory -2680, ,14-0, , , ,98 Condition good 18120, ,51 1, , , ,51 Loc. Espoonlahti , ,91-1, , ,5 6219,35 Loc. Kivenlahti ,2 8488,473-2, , ,5-2072,91 Loc. Leppävaara -4761, ,203-0, , ,7 7915,191 Loc. Olari -714, ,031-0, , , ,94 Loc. Tapiola ,6 7110,665 17, ,58E ,2
10 Categorical variables When modeling qualitative factors with q different categories, we need to introduce q-1 dummy variables, and leave one level as a reference category, represented by the common intercept β 1 Other effects are expressed relative to the reference category D1 = {1 if district=tapiola; 0 otherwise} D2 = {1 if district=espoonlahti; 0 otherwise} D3 = {1 if district=kivenlahti; 0 otherwise} Price = β 1 + β 2 Size + + δ 1 D1 + δ 2 D2 + δ 3 D3 + ε i Intercept β 1 represent the regression equation for other districts of Espoo Parameter δ 1 captures the effect of Tapiola district
11 Dummy variable trap What if we included separate dummy for all categories? Example: D1 = {1 if district=tapiola; 0 otherwise} D2 = {1 if district=espoonlahti; 0 otherwise} D3 = {1 if district=kivenlahti; 0 otherwise} D6 = {1 if district= other ; 0 otherwise} Price = β 1 + β 2 Size + + δ 1 D1 + δ 2 D2 + δ 3 D3 + + δ 6 D6 + ε i But then, the effect of other districts is β 1 + δ 6 We could freely increase β 1 if we simultaneously decrease all δ parameters. Hence, the model is unidentified: the problem is referred to as the dummy variable trap.
12 Ordinal variables We can similarly use dummy variables for modeling ordinal variables. Example: condition of apartment 1) weak 2) satisfactory 3) good Variables measured on ordinal scale can be modeled using dummy variables, taking one of the levels (e.g. weak) as the reference category.
13 SUMMARY OUTPUT Regression Statistics Multiple R 0, R Square 0, Adjusted R Square 0, Standard Error 53206,42 Observations 642 Example: hedonic model with a larger sample ANOVA df SS MS F Significance F Regression 14 5,67E+12 4,05E ,0872 1,7E-184 Residual 627 1,77E+12 2,83E+09 Total 641 7,45E+12 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 85388, ,18 4, ,8E , Size m2 4137, , , ,52E , ,29 Bedrooms ,7 5323,126-6,1794 1,16E ,4 Age -2584,75 151, ,096 4,51E , ,85 Elevator 6216, ,772 1, , , ,94 1st floor -9137, ,723-1, , ,7 2187,183 Top floor 9576, ,709 1, , , ,19 Energy class A-C 33899, ,676 3, , , ,58 Condition satisfactory -2680, ,14-0, , , ,98 Condition good 18120, ,51 1, , , ,51 Loc. Espoonlahti , ,91-1, , ,5 6219,35 Loc. Kivenlahti ,2 8488,473-2, , ,5-2072,91 Loc. Leppävaara -4761, ,203-0, , ,7 7915,191 Loc. Olari -714, ,031-0, , , ,94 Loc. Tapiola ,6 7110,665 17, ,58E ,2
14 Quantitative dummy variables? Sometimes quantitative variables are modelled as dummy variables as well. Example: number of bedrooms in the hedonic model We implictly assumed that the change from 1 room apartment to 2 room apartment has the same effect on price as the change from 2 rooms to 3 rooms, etc. To capture the differences, we could use dummy variables: Assume the reference category is 3 bedrooms or more D1 = {1 if apartment has 1 bedroom; 0 otherwise} D2 = {1 if apartment has 2 bedrooms; 0 otherwise}
15 SUMMARY OUTPUT Regression Statistics Multiple R 0, R Square 0, Adjusted R Square 0, Standard Error 53524,35 Observations 642 ANOVA df SS MS F Significance F Regression 15 5,65E+12 3,77E ,5378 5E-182 Residual 626 1,79E+12 2,86E+09 Total 641 7,45E+12 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 23887, ,41 1, , , ,68 Size m2 3560, , , ,27E , ,866 1Bedroom 61666, ,29 5, ,18E , ,16 2Bedroom 18828, , , , ,58 Age -2643,66 153, ,257 7,07E ,5-2342,83 Elevator 7326, ,629 1, , , ,46 1st floor ,9 5827,307-2, , ,3-765,43 Top floor 7664, ,457 1, , , ,48 Energy class A-C 36051, ,075 3, , , ,35 Condition satisfactory -1955, ,55-0, , , ,75 Condition good 21884, ,75 1, , , ,63 Loc. Espoonlahti , ,76-1, , ,7 2839,068 Loc. Kivenlahti ,9 8549,151-2, , ,3-2548,37 Loc. Leppävaara -2237, ,609-0, , , ,86 Loc. Olari -352, ,148-0,0437 0, , ,36 Loc. Tapiola ,8 7170,01 17, ,46E ,
16 Slope dummies Thus far, we have assumed the dummy variables shift the regression line via the intercept, but do not affect the slopes. Dummy variables can also be used for modeling the effect on the slopes of quantitative variables (slope dummies). Example: the number of bedrooms can influence the marginal value of size Price = β 1 + β 2 Size + β 3 D1 Size + β 4 D2 Size + + ε i D1 = {1 if apartment has 1 bedroom; 0 otherwise} D2 = {1 if apartment has 2 bedrooms; 0 otherwise}
17 SUMMARY OUTPUT Regression Statistics Multiple R 0, R Square 0, Adjusted R Square 0, Standard Error 53914,47 Observations 642 ANOVA df SS MS F Significance F Regression 15 5,63E+12 3,75E ,0393 4,6E-180 Residual 626 1,82E+12 2,91E+09 Total 641 7,45E+12 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 47765, ,87 2, , , ,99 Size m2 3279, , , ,02E , ,91 Size X 1Bedroom 1295, ,891 4, ,53E , ,825 Size x 2Bedroom 143, ,5412 1, , , ,9625 Age -2626,49 153, ,0579 7,27E , ,12 Elevator 7783, ,02 1, , , ,83 1st floor ,3 5868,643-1,9615 0, ,27122 Top floor 7934, ,085 1, , , ,71 Energy class A-C 37274, ,31 3, , , ,18 Condition satisfactory -1648, ,8-0, , , ,82 Condition good 21748, ,1 1, , , ,34 Loc. Espoonlahti , ,61-1, , ,2 2368,139 Loc. Kivenlahti ,4 8612,746-2, , ,7-1879,98 Loc. Leppävaara -2373, ,595-0, , , ,3 Loc. Olari -116, ,084-0, , , ,15 Loc. Tapiola ,1 7201,74 17, ,17E , ,6
18 Dummies or separate regressions? Consider a model of single regressor x and a single dummy D: y = β 1 + β 2 x + δ D + γ D x + ε where δ is the coefficient of intercept dummy and γ is the coefficient of slope dummy. The model could be equivalently estimated in parts: split the sample in two subsamples (D=0, D=1), and run two separate regressions: For D=0: For D=1 y = β 1 + β 2 x + ε y = β 3 + β 4 x + ε [Note: β 3 = β 1 + δ; β 4 = β 2 + γ]
19 Dummies or separate regressions? Partitioning subsamples (D=0, D=1) to separate regression models is easy to implement, especially if there are many regressors. Pooling the sub-samples (D=0, D=1) in the same model enables us to test for significance of coefficients δ and γ by t-test. We can use the F-test for testing joint significance of the parameters δ and γ (see the next lecture)
20 Qualitative factors as dependent variables It is also possible to model binary {0,1} variables and ordinary variables as dependent variables. For example, regress variable y = elevator dummy on explanatory variables x (e.g., age, number of floors, etc.) Such models can be estimated using Probit and Logit approaches (based on maximum likelihood).
21 F-test of joint significance Statistical significance of the parameters β 1, β 2 can be tested by using t-test (one- and two-sided tests) p-value (two-sided tests only) confidence intervals (two-sided tests, 95% significance) In the multiple regression model, it is possible that some of the regressors are significant while others are not It is useful to test if the model as a whole is significant
22 SUMMARY OUTPUT Regression Statistics Multiple R 0, R Square 0, Adjusted R Square 0, Standard Error 53206,42 Observations 642 ANOVA df SS MS F Significance F Regression 14 5,67E+12 4,05E ,0872 1,7E-184 Residual 627 1,77E+12 2,83E+09 Total 641 7,45E+12 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 85388, ,18 4, ,8E , Size m2 4137, , , ,52E , ,29 Bedrooms ,7 5323,126-6,1794 1,16E ,4 Age -2584,75 151, ,096 4,51E , ,85 Elevator 6216, ,772 1, , , ,94 1st floor -9137, ,723-1, , ,7 2187,183 Top floor 9576, ,709 1, , , ,19 Energy class A-C 33899, ,676 3, , , ,58 Condition Satisfactory -2680, ,14-0, , , ,98 Condition Good 18120, ,51 1, , , ,51 Loc. Espoonlahti , ,91-1, , ,5 6219,35 Loc. Kivenlahti ,2 8488,473-2, , ,5-2072,91 Loc. Leppävaara -4761, ,203-0, , ,7 7915,191 Loc. Olari -714, ,031-0, , , ,94 22 Loc. Tapiola ,6 7110,665 17, ,58E ,2
23 F-test of joint significance Multiple regression model: y = β 1 + β 2 x 2 + β 3 x β K x K + ε H 0 : β 2 = β 3 = = β K =0 (true model: y = β 1 + ε) H 1 : β k 0 for some k = 2,,K Note: in the special case of a single regression model: y = β 1 + β 2 x 2 + ε H 0 : β 2 = 0 H 1 : β 2 0
24 F-test of joint significance Empirical fit of the estimated model measured by the coefficient of determination (R 2 ) When H 0 (y = β 1 + ε) is true, R 2 is equal to zero. Is R 2 of the estimated model significantly greater than zero?
25 Test statistic: F-test of joint significance ESS K R K F RSS n K R n K 2 / ( 1) / ( 1) 2 / ( ) (1 ) / ( ) If H0 holds, then the test statistic follows the F-distribution with (K-1) and (n-k) degrees of freedom: F[(K-1), (n-k)] In Excel: =FINV(prob.; df1; df2)
26 F-test of joint significance Plots of the density function of F distribution at different values of df
27 F-test of joint significance Critical values of the F distribution at different sample sizes n, single regressor case, 5% significance level: n critical F 20 4, , , , , , , ,843
28 F-test of joint significance In the single regression case, we could test for the hypothesis H 0 : β 2 = 0 using the F-test or the t-test. Which one is more powerful? In fact, the two tests are equivalent (See 2.12 for details) F 2 R / ( K 1) ( b ) (1 R ) / ( n K) s. e.( b ) t 2 Also the critical values of F-distribution are obtained as the squared values of the critical t
29 Next time Wed 7 March Topic: Model specification
Econometrics. 7) Endogeneity
30C00200 Econometrics 7) Endogeneity Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Common types of endogeneity Simultaneity Omitted variables Measurement errors
More informationEconometrics. 4) Statistical inference
30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution
More informationMULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics
MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES Business Statistics CONTENTS Multiple regression Dummy regressors Assumptions of regression analysis Predicting with regression analysis Old exam question
More informationCategorical Predictor Variables
Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively
More informationChapter 14 Student Lecture Notes 14-1
Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this
More informationCHAPTER 4 & 5 Linear Regression with One Regressor. Kazu Matsuda IBEC PHBU 430 Econometrics
CHAPTER 4 & 5 Linear Regression with One Regressor Kazu Matsuda IBEC PHBU 430 Econometrics Introduction Simple linear regression model = Linear model with one independent variable. y = dependent variable
More informationBasic Business Statistics, 10/e
Chapter 4 4- Basic Business Statistics th Edition Chapter 4 Introduction to Multiple Regression Basic Business Statistics, e 9 Prentice-Hall, Inc. Chap 4- Learning Objectives In this chapter, you learn:
More information2. Linear regression with multiple regressors
2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions
More informationECON 4230 Intermediate Econometric Theory Exam
ECON 4230 Intermediate Econometric Theory Exam Multiple Choice (20 pts). Circle the best answer. 1. The Classical assumption of mean zero errors is satisfied if the regression model a) is linear in the
More informationChapter 3 Multiple Regression Complete Example
Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be
More informationChapter 7 Student Lecture Notes 7-1
Chapter 7 Student Lecture Notes 7- Chapter Goals QM353: Business Statistics Chapter 7 Multiple Regression Analysis and Model Building After completing this chapter, you should be able to: Explain model
More information1 A Non-technical Introduction to Regression
1 A Non-technical Introduction to Regression Chapters 1 and Chapter 2 of the textbook are reviews of material you should know from your previous study (e.g. in your second year course). They cover, in
More informationAnalisi Statistica per le Imprese
, Analisi Statistica per le Imprese Dip. di Economia Politica e Statistica 4.3. 1 / 33 You should be able to:, Underst model building using multiple regression analysis Apply multiple regression analysis
More informationNew York University Department of Economics. Applied Statistics and Econometrics G Spring 2013
New York University Department of Economics Applied Statistics and Econometrics G31.1102 Spring 2013 Text: Econometric Analysis, 7 h Edition, by William Greene (Prentice Hall) Optional: A Guide to Modern
More informationFinal Exam - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your
More informationLab 07 Introduction to Econometrics
Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand
More informationChapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression
Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing
More information1.) Fit the full model, i.e., allow for separate regression lines (different slopes and intercepts) for each species
Lecture notes 2/22/2000 Dummy variables and extra SS F-test Page 1 Crab claw size and closing force. Problem 7.25, 10.9, and 10.10 Regression for all species at once, i.e., include dummy variables for
More informationSMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning
SMA 6304 / MIT 2.853 / MIT 2.854 Manufacturing Systems Lecture 10: Data and Regression Analysis Lecturer: Prof. Duane S. Boning 1 Agenda 1. Comparison of Treatments (One Variable) Analysis of Variance
More information" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2
Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationRockefeller College University at Albany
Rockefeller College University at Albany PAD 705 Handout: Suggested Review Problems from Pindyck & Rubinfeld Original prepared by Professor Suzanne Cooper John F. Kennedy School of Government, Harvard
More information22s:152 Applied Linear Regression
22s:152 Applied Linear Regression Chapter 7: Dummy Variable Regression So far, we ve only considered quantitative variables in our models. We can integrate categorical predictors by constructing artificial
More informationChapter 4. Regression Models. Learning Objectives
Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing
More informationLECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit
LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationA discussion on multiple regression models
A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value
More informationMultiple Regression. Peerapat Wongchaiwat, Ph.D.
Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com The Multiple Regression Model Examine the linear relationship between 1 dependent (Y) & 2 or more independent variables (X i ) Multiple Regression Model
More informationSection 5: Dummy Variables and Interactions
Section 5: Dummy Variables and Interactions Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Example: Detecting
More informationRegression with Qualitative Information. Part VI. Regression with Qualitative Information
Part VI Regression with Qualitative Information As of Oct 17, 2017 1 Regression with Qualitative Information Single Dummy Independent Variable Multiple Categories Ordinal Information Interaction Involving
More informationIntroduction to Linear Regression Analysis
Introduction to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018 Econometrics: What is it? Interaction of economic theory, observed data and statistical methods. The science of testing
More informationSIMPLE REGRESSION ANALYSIS. Business Statistics
SIMPLE REGRESSION ANALYSIS Business Statistics CONTENTS Ordinary least squares (recap for some) Statistical formulation of the regression model Assessing the regression model Testing the regression coefficients
More informationRegression Models. Chapter 4. Introduction. Introduction. Introduction
Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager
More informationLecture 3: Multiple Regression
Lecture 3: Multiple Regression R.G. Pierse 1 The General Linear Model Suppose that we have k explanatory variables Y i = β 1 + β X i + β 3 X 3i + + β k X ki + u i, i = 1,, n (1.1) or Y i = β j X ji + u
More informationThe Multiple Regression Model
Multiple Regression The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & or more independent variables (X i ) Multiple Regression Model with k Independent Variables:
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More information22s:152 Applied Linear Regression. Take random samples from each of m populations.
22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each
More informationLecture 2 Simple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: Chapter 1
Lecture Simple Linear Regression STAT 51 Spring 011 Background Reading KNNL: Chapter 1-1 Topic Overview This topic we will cover: Regression Terminology Simple Linear Regression with a single predictor
More informationInference for the Regression Coefficient
Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates
More informationEconometrics. 9) Heteroscedasticity and autocorrelation
30C00200 Econometrics 9) Heteroscedasticity and autocorrelation Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Heteroscedasticity Possible causes Testing for
More information22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA
22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each
More informationMBA Statistics COURSE #4
MBA Statistics 51-651-00 COURSE #4 Simple and multiple linear regression What should be the sales of ice cream? Example: Before beginning building a movie theater, one must estimate the daily number of
More information1 Correlation and Inference from Regression
1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is
More informationStatistics and Quantitative Analysis U4320
Statistics and Quantitative Analysis U3 Lecture 13: Explaining Variation Prof. Sharyn O Halloran Explaining Variation: Adjusted R (cont) Definition of Adjusted R So we'd like a measure like R, but one
More information2 Regression Analysis
FORK 1002 Preparatory Course in Statistics: 2 Regression Analysis Genaro Sucarrat (BI) http://www.sucarrat.net/ Contents: 1 Bivariate Correlation Analysis 2 Simple Regression 3 Estimation and Fit 4 T -Test:
More informationThe multiple regression model; Indicator variables as regressors
The multiple regression model; Indicator variables as regressors Ragnar Nymoen University of Oslo 28 February 2013 1 / 21 This lecture (#12): Based on the econometric model specification from Lecture 9
More informationLECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity
LECTURE 10 Introduction to Econometrics Multicollinearity & Heteroskedasticity November 22, 2016 1 / 23 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists
More informationEconometrics I Lecture 3: The Simple Linear Regression Model
Econometrics I Lecture 3: The Simple Linear Regression Model Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 32 Outline Introduction Estimating
More informationIntroduction To Logistic Regression
Introduction To Lecture 22 April 28, 2005 Applied Regression Analysis Lecture #22-4/28/2005 Slide 1 of 28 Today s Lecture Logistic regression. Today s Lecture Lecture #22-4/28/2005 Slide 2 of 28 Background
More informationLecture 5: Omitted Variables, Dummy Variables and Multicollinearity
Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity R.G. Pierse 1 Omitted Variables Suppose that the true model is Y i β 1 + β X i + β 3 X 3i + u i, i 1,, n (1.1) where β 3 0 but that the
More informationLecture 18 Miscellaneous Topics in Multiple Regression
Lecture 18 Miscellaneous Topics in Multiple Regression STAT 512 Spring 2011 Background Reading KNNL: 8.1-8.5,10.1, 11, 12 18-1 Topic Overview Polynomial Models (8.1) Interaction Models (8.2) Qualitative
More informationRegression Analysis II
Regression Analysis II Measures of Goodness of fit Two measures of Goodness of fit Measure of the absolute fit of the sample points to the sample regression line Standard error of the estimate An index
More informationHypothesis Testing hypothesis testing approach
Hypothesis Testing In this case, we d be trying to form an inference about that neighborhood: Do people there shop more often those people who are members of the larger population To ascertain this, we
More informationStatistics in medicine
Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu
More informationAnalysis of Variance
Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also
More informationLinear Regression with one Regressor
1 Linear Regression with one Regressor Covering Chapters 4.1 and 4.2. We ve seen the California test score data before. Now we will try to estimate the marginal effect of STR on SCORE. To motivate these
More informationSTA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.
STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory
More informationECON 5350 Class Notes Functional Form and Structural Change
ECON 5350 Class Notes Functional Form and Structural Change 1 Introduction Although OLS is considered a linear estimator, it does not mean that the relationship between Y and X needs to be linear. In this
More informationInference for Regression Simple Linear Regression
Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating
More information( ), which of the coefficients would end
Discussion Sheet 29.7.9 Qualitative Variables We have devoted most of our attention in multiple regression to quantitative or numerical variables. MR models can become more useful and complex when we consider
More informationChapter 13. Multiple Regression and Model Building
Chapter 13 Multiple Regression and Model Building Multiple Regression Models The General Multiple Regression Model y x x x 0 1 1 2 2... k k y is the dependent variable x, x,..., x 1 2 k the model are the
More informationProblems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B
Simple Linear Regression 35 Problems 1 Consider a set of data (x i, y i ), i =1, 2,,n, and the following two regression models: y i = β 0 + β 1 x i + ε, (i =1, 2,,n), Model A y i = γ 0 + γ 1 x i + γ 2
More informationSimple logistic regression
Simple logistic regression Biometry 755 Spring 2009 Simple logistic regression p. 1/47 Model assumptions 1. The observed data are independent realizations of a binary response variable Y that follows a
More informationLab 10 - Binary Variables
Lab 10 - Binary Variables Spring 2017 Contents 1 Introduction 1 2 SLR on a Dummy 2 3 MLR with binary independent variables 3 3.1 MLR with a Dummy: different intercepts, same slope................. 4 3.2
More informationSTATISTICS 141 Final Review
STATISTICS 141 Final Review Bin Zou bzou@ualberta.ca Department of Mathematical & Statistical Sciences University of Alberta Winter 2015 Bin Zou (bzou@ualberta.ca) STAT 141 Final Review Winter 2015 1 /
More informationEC312: Advanced Econometrics Problem Set 3 Solutions in Stata
EC312: Advanced Econometrics Problem Set 3 Solutions in Stata Nicola Limodio www.nicolalimodio.com N.Limodio1@lse.ac.uk The data set AIRQ contains observations for 30 standard metropolitan statistical
More informationLecture-1: Introduction to Econometrics
Lecture-1: Introduction to Econometrics 1 Definition Econometrics may be defined as 2 the science in which the tools of economic theory, mathematics and statistical inference is applied to the analysis
More informationLecture 9: Linear Regression
Lecture 9: Linear Regression Goals Develop basic concepts of linear regression from a probabilistic framework Estimating parameters and hypothesis testing with linear models Linear regression in R Regression
More informationSTAT5044: Regression and Anova. Inyoung Kim
STAT5044: Regression and Anova Inyoung Kim 2 / 47 Outline 1 Regression 2 Simple Linear regression 3 Basic concepts in regression 4 How to estimate unknown parameters 5 Properties of Least Squares Estimators:
More informationRegression With a Categorical Independent Variable: Mean Comparisons
Regression With a Categorical Independent Variable: Mean Lecture 16 March 29, 2005 Applied Regression Analysis Lecture #16-3/29/2005 Slide 1 of 43 Today s Lecture comparisons among means. Today s Lecture
More informationGeneralized Linear Models
York SPIDA John Fox Notes Generalized Linear Models Copyright 2010 by John Fox Generalized Linear Models 1 1. Topics I The structure of generalized linear models I Poisson and other generalized linear
More informationChapter 4: Regression Models
Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,
More informationChristopher Dougherty London School of Economics and Political Science
Introduction to Econometrics FIFTH EDITION Christopher Dougherty London School of Economics and Political Science OXFORD UNIVERSITY PRESS Contents INTRODU CTION 1 Why study econometrics? 1 Aim of this
More informationLinear Regression With Special Variables
Linear Regression With Special Variables Junhui Qian December 21, 2014 Outline Standardized Scores Quadratic Terms Interaction Terms Binary Explanatory Variables Binary Choice Models Standardized Scores:
More informationEconometric Modelling Prof. Rudra P. Pradhan Department of Management Indian Institute of Technology, Kharagpur
Econometric Modelling Prof. Rudra P. Pradhan Department of Management Indian Institute of Technology, Kharagpur Module No. # 01 Lecture No. # 28 LOGIT and PROBIT Model Good afternoon, this is doctor Pradhan
More informationLAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION
LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION In this lab you will learn how to use Excel to display the relationship between two quantitative variables, measure the strength and direction of the
More informationCh 3: Multiple Linear Regression
Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery
More informationCorrelation Analysis
Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the
More informationECONOMETRIC MODEL WITH QUALITATIVE VARIABLES
ECONOMETRIC MODEL WITH QUALITATIVE VARIABLES How to quantify qualitative variables to quantitative variables? Why do we need to do this? Econometric model needs quantitative variables to estimate its parameters
More informationLogistic Regression. Continued Psy 524 Ainsworth
Logistic Regression Continued Psy 524 Ainsworth Equations Regression Equation Y e = 1 + A+ B X + B X + B X 1 1 2 2 3 3 i A+ B X + B X + B X e 1 1 2 2 3 3 Equations The linear part of the logistic regression
More informationThe regression model with one fixed regressor cont d
The regression model with one fixed regressor cont d 3150/4150 Lecture 4 Ragnar Nymoen 27 January 2012 The model with transformed variables Regression with transformed variables I References HGL Ch 2.8
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS Page 1 MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level
More informationSleep data, two drugs Ch13.xls
Model Based Statistics in Biology. Part IV. The General Linear Mixed Model.. Chapter 13.3 Fixed*Random Effects (Paired t-test) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch
More informationLecture 10 Multiple Linear Regression
Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More informationChapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression
Chapter 12 12-1 North Seattle Community College BUS21 Business Statistics Chapter 12 Learning Objectives In this chapter, you learn:! How to use regression analysis to predict the value of a dependent
More informationMFin Econometrics I Session 5: F-tests for goodness of fit, Non-linearity and Model Transformations, Dummy variables
MFin Econometrics I Session 5: F-tests for goodness of fit, Non-linearity and Model Transformations, Dummy variables Thilo Klein University of Cambridge Judge Business School Session 5: Non-linearity,
More information* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course.
Name of the course Statistical methods and data analysis Audience The course is intended for students of the first or second year of the Graduate School in Materials Engineering. The aim of the course
More informationQuestion 1 carries a weight of 25%; Question 2 carries 20%; Question 3 carries 20%; Question 4 carries 35%.
UNIVERSITY OF EAST ANGLIA School of Economics Main Series PGT Examination 017-18 ECONOMETRIC METHODS ECO-7000A Time allowed: hours Answer ALL FOUR Questions. Question 1 carries a weight of 5%; Question
More informationChapter 26 Multiple Regression, Logistic Regression, and Indicator Variables
Chapter 26 Multiple Regression, Logistic Regression, and Indicator Variables 26.1 S 4 /IEE Application Examples: Multiple Regression An S 4 /IEE project was created to improve the 30,000-footlevel metric
More informationResearch Design - - Topic 13a Split Plot Design with Either a Continuous or Categorical Between Subjects Factor 2008 R.C. Gardner, Ph.D.
esearch Design - - Topic 3a Split Plot Design with Either a ontinuous or ategorical etween Subjects Factor 8.. Gardner Ph.D. General Description Example Purpose Two ategorical Factors Using Multiple egression
More informationChapter 3: Multiple Regression. August 14, 2018
Chapter 3: Multiple Regression August 14, 2018 1 The multiple linear regression model The model y = β 0 +β 1 x 1 + +β k x k +ǫ (1) is called a multiple linear regression model with k regressors. The parametersβ
More informationFinQuiz Notes
Reading 10 Multiple Regression and Issues in Regression Analysis 2. MULTIPLE LINEAR REGRESSION Multiple linear regression is a method used to model the linear relationship between a dependent variable
More informationChapter 8: Regression Models with Qualitative Predictors
Chapter 8: Regression Models with Qualitative Predictors Some predictors may be binary (e.g., male/female) or otherwise categorical (e.g., small/medium/large). These typically enter the regression model
More informationRegression With a Categorical Independent Variable
Regression With a Categorical Independent Variable Lecture 15 March 17, 2005 Applied Regression Analysis Lecture #15-3/17/2005 Slide 1 of 29 Today s Lecture» Today s Lecture» Midterm Note» Example Regression
More informationMaking sense of Econometrics: Basics
Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/
More informationCRP 272 Introduction To Regression Analysis
CRP 272 Introduction To Regression Analysis 30 Relationships Among Two Variables: Interpretations One variable is used to explain another variable X Variable Independent Variable Explaining Variable Exogenous
More informationContest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.
Updated: November 17, 2011 Lecturer: Thilo Klein Contact: tk375@cam.ac.uk Contest Quiz 3 Question Sheet In this quiz we will review concepts of linear regression covered in lecture 2. NOTE: Please round
More informationMultiple Regression Analysis
1 OUTLINE Analysis of Data and Model Hypothesis Testing Dummy Variables Research in Finance 2 ANALYSIS: Types of Data Time Series data Cross-Sectional data Panel data Trend Seasonal Variation Cyclical
More information