Regression. The Simple Linear Regression Model

Similar documents
Chapter 14 Simple Linear Regression

Chapter 15 - Multiple Regression

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

Statistics for Business and Economics

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Economics & Business

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Chapter 11: Simple Linear Regression and Correlation

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

Comparison of Regression Lines

Chapter 13: Multiple Regression

Statistics II Final Exam 26/6/18

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Basic Business Statistics, 10/e

Learning Objectives for Chapter 11

a. (All your answers should be in the letter!

Chapter 15 Student Lecture Notes 15-1

STAT 3008 Applied Regression Analysis

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

Correlation and Regression

Biostatistics 360 F&t Tests and Intervals in Regression 1

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

Lecture 6: Introduction to Linear Regression

Scatter Plot x

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Multiple Regression Analysis

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis

x i1 =1 for all i (the constant ).

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Economics 130. Lecture 4 Simple Linear Regression Continued

Negative Binomial Regression

STATISTICS QUESTIONS. Step by Step Solutions.

Regression Analysis. Regression Analysis

SIMPLE LINEAR REGRESSION and CORRELATION

18. SIMPLE LINEAR REGRESSION III

Lecture 4 Hypothesis Testing

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Systematic Error Illustration of Bias. Sources of Systematic Errors. Effects of Systematic Errors 9/23/2009. Instrument Errors Method Errors Personal

STAT 3340 Assignment 1 solutions. 1. Find the equation of the line which passes through the points (1,1) and (4,5).

28. SIMPLE LINEAR REGRESSION III

17 - LINEAR REGRESSION II

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Reminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1

Unit 10: Simple Linear Regression and Correlation

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

The SAS program I used to obtain the analyses for my answers is given below.

UNIVERSITY OF TORONTO. Faculty of Arts and Science JUNE EXAMINATIONS STA 302 H1F / STA 1001 H1F Duration - 3 hours Aids Allowed: Calculator

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

Topic- 11 The Analysis of Variance

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

Statistics MINITAB - Lab 2

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Introduction to Regression

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

e i is a random error

Chapter 9: Statistical Inference and the Relationship between Two Variables

experimenteel en correlationeel onderzoek

Topic 7: Analysis of Variance

β0 + β1xi and want to estimate the unknown

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

PubH 7405: REGRESSION ANALYSIS SLR: PARAMETER ESTIMATION

January Examinations 2015

PubH 7405: REGRESSION ANALYSIS. SLR: INFERENCES, Part II

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li

The Ordinary Least Squares (OLS) Estimator

Polynomial Regression Models

Tests of Single Linear Coefficient Restrictions: t-tests and F-tests. 1. Basic Rules. 2. Testing Single Linear Coefficient Restrictions

Professor Chris Murray. Midterm Exam

Statistics Chapter 4

x = , so that calculated

Diagnostics in Poisson Regression. Models - Residual Analysis

Chap 10: Diagnostics, p384

/ n ) are compared. The logic is: if the two

Chapter 5: Hypothesis Tests, Confidence Intervals & Gauss-Markov Result

SIMPLE LINEAR REGRESSION

First Year Examination Department of Statistics, University of Florida

Problem of Estimation. Ordinary Least Squares (OLS) Ordinary Least Squares Method. Basic Econometrics in Transportation. Bivariate Regression Analysis

University of California at Berkeley Fall Introductory Applied Econometrics Final examination

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors

CHAPTER 8. Exercise Solutions

Chapter 10. What is Regression Analysis? Simple Linear Regression Analysis. Examples

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

MD. LUTFOR RAHMAN 1 AND KALIPADA SEN 2 Abstract

A Comparative Study for Estimation Parameters in Panel Data Model

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Analysis of Variance and Design of Experiments-II

A METHOD FOR DETECTING OUTLIERS IN FUZZY REGRESSION

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Unbalanced Nested ANOVA - Sokal & Rohlf Example

Methods of Detecting Outliers in A Regression Analysis Model.

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Transcription:

Regresson Smple Lnear Regresson Model Least Squares Method Coeffcent of Determnaton Model Assumptons Testng for Sgnfcance Usng the Estmated Regresson Equaton for Estmaton and Predcton Resdual Analss: Valdatng Model Assumptons Outlers and Influental Oservatons The Smple Lnear Regresson Model Smple Lnear Regresson Model β 0 + β x + ε Smple Lnear Regresson Equaton E( β 0 + β x ^ Estmated Smple Lnear Regresson Equaton 0 + x

Least Squares Method Least Squares Crteron mn ( $ oserved value of the dependent varale ^ for the th oservaton estmated value of the dependent varale for the th oservaton The Least Squares Method Slope for the Estmated Regresson Equaton x ( x / n x ( x / n -Intercept for the Estmated Regresson Equaton 0 - x x value of ndependent varale for th oservaton value of dependent varale for th oservaton _ x mean value for ndependent varale _ mean value for dependent varale n total numer of oservatons

The Coeffcent of Determnaton Relatonshp Among SST, SSR, SSE SST SSR + SSE ( ( ^ + ( ^ Coeffcent of Determnaton r SSR/SST SST total sum of squares SSR sum of squares due to regresson SSE sum of squares due to error Model Assumptons Assumptons Aout the Error Term ε The error ε s a random varale wth mean of zero. The varance of ε, denoted σ, s the same for all values of the ndependent varale. The values of ε are ndependent. The error ε s a normall dstruted random varale.

Testng for Sgnfcance To test for a sgnfcant regresson relatonshp, we must conduct a hpothess test to determne whether the value of β s zero. Two tests are commonl used t Test F Test Both tests requre an estmate of σ, the varance of ε n the regresson model. An Estmate of σ Testng for Sgnfcance The mean square error (MSE provdes the estmate of σ, and the notaton s s also used. s MSE SSE/(n- SSE ( ˆ ( 0 x An Estmate of σ To estmate σ we take the square root of σ. The resultng s s called the standard error of the estmate. SSE s MSE n

Testng for Sgnfcance: t Test Hpotheses Test Statstc Rejecton Rule H 0 : β 0 H a : β 0 t s Reject H 0 f t < -t α/ or t > t α/ where t α/ s ased on a t dstruton wth n - degrees of freedom. Confdence Interval for β We can use a 95% confdence nterval for β to test the hpotheses just used n the t test. H 0 s rejected f the hpotheszed value of β s not ncluded n the confdence nterval for β. The form of a confdence nterval for β s: ± t α / where s the pont estmate t α / s s the margn of error t α / s the t value provdng an area of α/ n the upper tal of a t dstruton wth n - degrees of freedom s

Testng for Sgnfcance: F Test Hpotheses Test Statstc Rejecton Rule H 0 : β 0 H a : β 0 F MSR/MSE Reject H 0 f F > F α where F α s ased on an F dstruton wth d.f. n the numerator and n - d.f. n the denomnator. Some Cautons aout the Interpretaton of Sgnfcance Tests Rejectng H 0 : β 0 and concludng that the relatonshp etween x and s sgnfcant does not enale us to conclude that a cause-and-effect relatonshp s present etween x and. Just ecause we are ale to reject H 0 : β 0 and demonstrate statstcal sgnfcance does not enale us to conclude that there s a lnear relatonshp etween x and.

Usng the Estmated Regresson Equaton for Estmaton and Predcton Confdence Interval Estmate of E( p $ p ± t α / s $ p Predcton Interval Estmate of p p + t α/ s nd where the confdence coeffcent s - α and t α/ s ased on a t dstruton wth n - d.f. Resdual Analss If the assumptons aout the error term ε appear questonale, the hpothess tests aout the sgnfcance of the regresson relatonshp and the nterval estmaton results ma not e vald. The resduals provde the est nformaton aout ε. Much of the resdual analss s ased on an examnaton of graphcal plots.

Resdual Plot Aganst x If the assumpton that the varance of ε s the same for all values of x s vald, and the assumed regresson model s an adequate representaton of the relatonshp etween the varales: The resdual plot should gve an overall mpresson of a horzontal and of ponts Standardzed Resdual for Oservaton Standardzed Resduals ˆ s ˆ ˆ s s h ˆ ( x x h + n ( x x The standardzed resdual plot can provde nsght aout the assumpton that the error term ε has a normal dstruton. If ths assumpton s satsfed, the dstruton of the standardzed resduals should appear to come from a standard normal proalt dstruton.

Outlers and Influental Oservatons Detectng Outlers An outler s an oservaton that s unusual n comparson wth the other data. Mnta classfes an oservaton as an outler f ts standardzed resdual value s < - or > +. Ths standardzed resdual rule sometmes fals to dentf an unusuall large oservaton as eng an outler. Ths rule s shortcomng can e crcumvented usng studentzed deleted resduals. The th studentzed deleted resdual wll e larger than the th standardzed resdual.