SIMPLE LINEAR REGRESSION AND CORRELATION ANALYSIS

Similar documents
Regression, Inference, and Model Building

S Y Y = ΣY 2 n. Using the above expressions, the correlation coefficient is. r = SXX S Y Y

STP 226 ELEMENTARY STATISTICS

1 Inferential Methods for Correlation and Regression Analysis

Simple Regression. Acknowledgement. These slides are based on presentations created and copyrighted by Prof. Daniel Menasce (GMU) CS 700

Correlation Regression

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Linear Regression Models

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued)

11 Correlation and Regression

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

Chapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y).

Simple Linear Regression

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

(all terms are scalars).the minimization is clearer in sum notation:

Lecture 11 Simple Linear Regression

Dr. Maddah ENMG 617 EM Statistics 11/26/12. Multiple Regression (2) (Chapter 15, Hines)

Simple Regression Model

Correlation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

II. Descriptive Statistics D. Linear Correlation and Regression. 1. Linear Correlation

Chapter 13, Part A Analysis of Variance and Experimental Design

University of California, Los Angeles Department of Statistics. Simple regression analysis

Response Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable

Circle the single best answer for each multiple choice question. Your choice should be made clearly.

Simple Linear Regression

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

Chapter 12 Correlation

Ismor Fischer, 1/11/

Statistics 20: Final Exam Solutions Summer Session 2007

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

Grant MacEwan University STAT 252 Dr. Karen Buro Formula Sheet

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Correlation and Regression

TAMS24: Notations and Formulas

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

Assessment and Modeling of Forests. FR 4218 Spring Assignment 1 Solutions

Mathematical Notation Math Introduction to Applied Statistics

Linear Regression Analysis. Analysis of paired data and using a given value of one variable to predict the value of the other

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

REGRESSION MODELS ANOVA

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So,

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

Regression, Part I. A) Correlation describes the relationship between two variables, where neither is independent or a predictor.

Stat 139 Homework 7 Solutions, Fall 2015

Regression. Correlation vs. regression. The parameters of linear regression. Regression assumes... Random sample. Y = α + β X.

Describing the Relation between Two Variables

Properties and Hypothesis Testing

Paired Data and Linear Correlation

UNIT 11 MULTIPLE LINEAR REGRESSION

Measuring the fit of the model - SSR

Statistics 203 Introduction to Regression and Analysis of Variance Assignment #1 Solutions January 20, 2005

Lesson 11: Simple Linear Regression

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

A Relationship Between the One-Way MANOVA Test Statistic and the Hotelling Lawley Trace Test Statistic

Question 1: Exercise 8.2

n but for a small sample of the population, the mean is defined as: n 2. For a lognormal distribution, the median equals the mean.

STA6938-Logistic Regression Model

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Dept. of maths, MJ College.

University of California, Los Angeles Department of Statistics. Practice problems - simple regression 2 - solutions

Topic 9: Sampling Distributions of Estimators

bwght = cigs

Algebra of Least Squares

Formulas and Tables for Gerstman

Section 14. Simple linear regression.

Sample Size Determination (Two or More Samples)

WEIGHTED LEAST SQUARES - used to give more emphasis to selected points in the analysis. Recall, in OLS we minimize Q =! % =!

STP 226 EXAMPLE EXAM #1

Bivariate Sample Statistics Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 7

Statistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes

Statistical Properties of OLS estimators

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43

Agenda: Recap. Lecture. Chapter 12. Homework. Chapt 12 #1, 2, 3 SAS Problems 3 & 4 by hand. Marquette University MATH 4740/MSCS 5740

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

REGRESSION AND ANALYSIS OF VARIANCE. Motivation. Module structure

Good luck! School of Business and Economics. Business Statistics E_BK1_BS / E_IBA1_BS. Date: 25 May, Time: 12:00. Calculator allowed:

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x).

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 4

Pearson Edexcel Level 3 Advanced Subsidiary and Advanced GCE in Statistics

Least-Squares Regression

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

Solutions to Odd Numbered End of Chapter Exercises: Chapter 4

Common Large/Small Sample Tests 1/55

Lecture 1, Jan 19. i=1 p i = 1.

Unit 9 Regression and Correlation

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday

Ch 2: Simple Linear Regression

Chapter 1 Simple Linear Regression (part 6: matrix version)

Topic 9: Sampling Distributions of Estimators

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 4 - Summarizing Numerical Data

Correlation and Covariance

Chapter If n is odd, the median is the exact middle number If n is even, the median is the average of the two middle numbers

Topic 9: Sampling Distributions of Estimators

Linear Regression Demystified

Simple Linear Regression. Copyright 2012 Pearson Education, Inc. Publishing as Prentice Hall. Chapter Chapter. β0 β1. β β = 1. a. b.

ECON 3150/4150, Spring term Lecture 3

Transcription:

SIMPLE LINEAR REGRESSION AND CORRELATION ANALSIS

INTRODUCTION There are lot of statistical ivestigatio to kow whether there is a relatioship amog variables Two aalyses: (1) regressio aalysis; () correlatio aalysis Regressio aalysis: to fid out a regressio which ca be used to predict variable (called depedet, respose variable) from variables 1,,..., k (called idepedet, predictor variables) Correlatio aalysis: to fid out the stregth of assosiatio betwee variables 1,,..., k ad variable.

SIMPLE LINEAR REGRESSION We build liear model i such a way that the values of variable ca be predicted from values of oe variable There are ordered pairs of idepedet observatio ( i, i ), i = 1,,..., ; where i are i-th values of idepedet variable ad i are i-th values of depedet variable

Example

Scatter diagram

DATA MODEL ON POPULATION i i i i = the i-th values of variable α = costat, the mea of if all are zero, itercept β = costat, regressio coefficiet o ε i = radom disturbace, error, for idividual i α is estimated by a o sample β is estimated by b o sample ε i are estimated by e i

REGRESSION LINE Ŷ a b Ŷ is predictive values,predicted froma certai values of Usually the predictive values of are ot the same as the real values The differece betwee the predictive value ad the real value is called residual or error ad is called e i the sample e i i Ŷ i

LEAST SQUARES METHOD 1 i i i 1 i i ) b a ( e D a b By usig theorems o calculus, D will be at miimum if the values of a ad b are as follows:

EAMPLE

EAMPLE

SCATTER DIAGRAM

SUM OF SQUARES (SSs) SST Ŷ Ŷ = SSR + SSE SST = total sum of squares SSR = sum of squares due to regressio SSE = error sum of squares

COMPUTATION FORMULAS SST SSR a b SSE a b

EAMPLE

EAMPLE

SIGNIFICANT TEST OF REGRESSION LINE H o : regressio lie o is ot sigificat H 1 : regressio lie o is sigificat Those ull hypothesis is tested usig aova approach

EAMPLE

THE SIMPLE CORRELATION COEFFICIENT The product-momet correlatio coefficiet is a statistic descriptive of the magitude of the relatio betwee two variables Correlatio coefficiets are traditioally defied i such a way as to take values extedig from 1 to + 1

THE SIMPLE CORRELATION COEFFICIENT A egative value idicates a egative relatio; that is, decreases as icreases, ad icreases as decreases. A positive value idicates a positive relatio; that is, icreases as icreases, ad decreases as decreases.

THE SIMPLE CORRELATION COEFFICIENT The symbol r is commo practice used to deote the sample value of the correlatio coefficiet. The symbol ρ is commo practice used to deote the populatio value of the correlatio coefficiet. So, r ˆ

THE DEFINITION OF r The measure ρ of liear relatioship betwee two variables ad is estimated by the sample correlatio coefficiet r, where r xy b sxx syy sxy sxsy

THE DEFINITION OF r sxsy sxy syy sxx xy b r 1 1 ) )( ( s xy 1 1 s xx 1 1 s yy covariace betwee ad variace of variace of

COMPUTATION FORMULA r xy xy x y where x ad y

COMPUTATION FORMULA r xy The above formula is called Karl Pearso product momet correlatio coefficiet. This formula is widely used, because it is simply couted from raw data

THE INTERPRETATION

THE INTERPRETATION

EAMPLE 53305 951; 665; 76095 3755; 54.88636364 s 11 1 (665)(951) 53305 1 xy 61.17444 s 11 1 665 3755 1 xx 66.0454545 s 11 1 951 76095 1 yy 7.8139645 61.17444 s x 8.13661757 66.0454545 s y

SIGNIFICANT TEST USING t-test AND F-test Usig t-test: t r xy 1r xy with ( - ) degree of fredom Usig F-test: F r xy () 1r xy with 1ad ( - ) df

SIGNIFICANT TEST USING ANOVA H o : ρ xy = 0 H 1 : ρ xy 0 Those ull hypothesis is tested usig aova approach

SIGNIFICANT TEST USING ANOVA SST SSE (1 SSR (r r y )SST )SST

EAMPLE

EAMPLE b r xy b sxx syy (0.897136) 7.8139645 8.13661757 0.86 r xy s xy s x s y 54.88636364 (7.8139645 )(8.13661757 ) 0.86

EAMPLE r xy 665; 951; 53305 3755; 76095 r xy (1)(53305) (665)(951) (1)(3755) 665 (1)(76095) 951 745 8400.4419566 0.86

Exercises