Chapter 4 Describing the Relation between Two Variables


4.1 Scatter Diagrams and Correlation

The response variable is the variable whose value can be explained by the value of the explanatory or predictor variable.

A scatter diagram is a graph that shows the relationship between two quantitative variables measured on the same individual. Each individual in the data set is represented by a point in the scatter diagram.

I. Scatter plot

(x) # hours of sleep    6    8    10    2
(y) performance         3    5    4     1

II. Correlation coefficient (r)

The linear correlation coefficient, or Pearson product moment correlation coefficient, is a measure of the strength and direction of the linear relation between two quantitative variables. The Greek letter ρ (rho) represents the population correlation coefficient, and r represents the sample correlation coefficient. We present only the formula for the sample correlation coefficient.

Sample Linear Correlation Coefficient

$$r = \frac{\sum \left( \frac{x_i - \bar{x}}{s_x} \right) \left( \frac{y_i - \bar{y}}{s_y} \right)}{n - 1}$$

where
$\bar{x}$ is the sample mean of the explanatory variable
$s_x$ is the sample standard deviation of the explanatory variable
$\bar{y}$ is the sample mean of the response variable
$s_y$ is the sample standard deviation of the response variable
$n$ is the number of individuals in the sample

Properties of the Linear Correlation Coefficient

1. The linear correlation coefficient is always between -1 and 1, inclusive. That is, $-1 \le r \le 1$.
2. If r = +1, then a perfect positive linear relation exists between the two variables.
3. If r = -1, then a perfect negative linear relation exists between the two variables.
4. The closer r is to +1, the stronger is the evidence of positive association between the two variables.
5. The closer r is to -1, the stronger is the evidence of negative association between the two variables.
6. If r is close to 0, then little or no evidence exists of a linear relation between the two variables. So r close to 0 does not imply no relation, just no linear relation.
7. The linear correlation coefficient is a unitless measure of association. So the unit of measure for x and y plays no role in the interpretation of r.
8. The correlation coefficient is not resistant. Therefore, an observation that does not follow the overall pattern of the data could affect the value of the linear correlation coefficient.
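As a rough illustration of the formula, the sketch below computes r directly from z-score products. It assumes the sleep/performance values from the scatter plot example pair up as x = 6, 8, 10, 2 and y = 3, 5, 4, 1. The function name `sample_correlation` is made up for this example and is not part of the notes.

```python
import math

def sample_correlation(xs, ys):
    """Sample linear correlation coefficient r (sum of z-score products / (n - 1))."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length samples with at least 2 points")
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Sample standard deviations: divide the squared deviations by n - 1.
    s_x = math.sqrt(sum((x - x_bar) ** 2 for x in xs) / (n - 1))
    s_y = math.sqrt(sum((y - y_bar) ** 2 for y in ys) / (n - 1))
    # Multiply the z-scores of each (x, y) pair, add them up, divide by n - 1.
    total = sum(((x - x_bar) / s_x) * ((y - y_bar) / s_y) for x, y in zip(xs, ys))
    return total / (n - 1)

# Assumed reading of the scatter plot data (hours of sleep, performance).
hours_of_sleep = [6, 8, 10, 2]
performance = [3, 5, 4, 1]

r = sample_correlation(hours_of_sleep, performance)
print(round(r, 4))  # 0.8857

# Property 7: r is unitless, so measuring sleep in minutes leaves r unchanged.
minutes_of_sleep = [60 * h for h in hours_of_sleep]
r_minutes = sample_correlation(minutes_of_sleep, performance)
```

Under that reading of the data, r is about 0.886, a positive value fairly close to +1. Whether that is enough to claim a linear relation for only four points depends on the critical value for the sample size.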

EXAMPLE Determining the Linear Correlation Coefficient

Determine the linear correlation coefficient of the drilling data.

Testing for a Linear Relation

Step 1 Determine the absolute value of the correlation coefficient.
Step 2 Find the critical value in Table II from Appendix A for the given sample size.
Step 3 If the absolute value of the correlation coefficient is greater than the critical value, we say a linear relation exists between the two variables. Otherwise, no linear relation exists.

EXAMPLE Does a Linear Relation Exist?

4.2 Least-Squares Regression

EXAMPLE Finding an Equation that Describes Linearly Related Data

Using the following sample data:

(a) Find a linear equation that relates x (the explanatory variable) and y (the response variable) by selecting two points and finding the equation of the line containing the points.
(b) Graph the equation on the scatter diagram.
(c) Use the equation to predict y if x = 3.

The difference between the observed value of y and the predicted value of y is the error, or residual. Using the line from the last example, and the predicted value at x = 3:

residual = observed y - predicted y

Least-Squares Regression Criterion

If there is a positive or negative correlation between X and Y, find the best-fitting line for the data.

The least-squares regression line is the line that minimizes the sum of the squared errors (or residuals). This line minimizes the sum of the squared vertical distances between the observed values of y and those predicted by the line, ŷ ("y-hat"). We represent this as "minimize Σ residuals²" (minimizes the sum of the squared errors).
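To make the criterion concrete, the sketch below compares the sum of squared residuals for a line drawn through two chosen points with that of the line from the standard closed-form least-squares slope and intercept. The sample data for the example above is not reproduced in these notes, so the sketch reuses the sleep/performance values (assumed to be x = 6, 8, 10, 2 and y = 3, 5, 4, 1). All function names are hypothetical.

```python
def residuals(xs, ys, slope, intercept):
    """Residual = observed y - predicted y, for each data point."""
    return [y - (slope * x + intercept) for x, y in zip(xs, ys)]

def sum_squared_residuals(xs, ys, slope, intercept):
    """The quantity the least-squares criterion minimizes."""
    return sum(e ** 2 for e in residuals(xs, ys, slope, intercept))

def line_through(p, q):
    """Slope and intercept of the line containing points p and q."""
    (x1, y1), (x2, y2) = p, q
    slope = (y2 - y1) / (x2 - x1)
    return slope, y1 - slope * x1

def least_squares_line(xs, ys):
    """Closed-form least-squares slope and intercept."""
    n = len(xs)
    x_bar, y_bar = sum(xs) / n, sum(ys) / n
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    slope = s_xy / s_xx
    return slope, y_bar - slope * x_bar

# Assumed data: hours of sleep (x) and performance (y).
xs = [6, 8, 10, 2]
ys = [3, 5, 4, 1]

two_point = line_through((2, 1), (8, 5))   # line through two chosen points
best = least_squares_line(xs, ys)          # least-squares line

sse_two_point = sum_squared_residuals(xs, ys, *two_point)
sse_best = sum_squared_residuals(xs, ys, *best)
print(round(sse_two_point, 4), round(sse_best, 4))  # 5.8889 1.8857
```

The two-point line passes exactly through two of the observations, yet its total squared error is about three times that of the least-squares line. This is why the two-point approach is only a first approximation.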

The Least-Squares Regression Line

The equation of the least-squares regression line is given by

$$\hat{y} = b_1 x + b_0$$

where

$$b_1 = r \cdot \frac{s_y}{s_x}$$

is the slope of the least-squares regression line and

$$b_0 = \bar{y} - b_1 \bar{x}$$

is the y-intercept of the least-squares regression line.

Note: $\bar{x}$ is the sample mean and $s_x$ is the sample standard deviation of the explanatory variable x; $\bar{y}$ is the sample mean and $s_y$ is the sample standard deviation of the response variable y.

EXAMPLE Finding the Least-Squares Regression Line

Using the drilling data:
(a) Find the least-squares regression line.
(b) Predict the drilling time if drilling starts at 130 feet.
(c) Is the observed drilling time at 130 feet above or below average?
(d) Draw the least-squares regression line on the scatter diagram of the data.

Interpretation of Slope:

Interpretation of the y-intercept:

Caution: If the least-squares regression line is used to make predictions based on values of the explanatory variable that are much larger or much smaller than the observed values, we say the researcher is working outside the scope of the model. Never use a least-squares regression line to make predictions outside the scope of the model because we can't be sure the linear relation continues to exist.

Predictions When There is No Linear Relation: When the correlation coefficient indicates no linear relation between the explanatory and response variables, and the scatter diagram indicates no relation at all between the variables, then we use the mean value of the response variable as the predicted value, so that $\hat{y} = \bar{y}$.

Summary

1. Use StatCrunch to plot a scatter plot.
2. Use StatCrunch to calculate r.
3. Determine whether there is a positive or negative linear correlation between X and Y.
4. If there is a linear correlation between X and Y, use StatCrunch to find the least-squares regression line. Otherwise, do not find the least-squares regression line.
5. If there is a linear correlation between X and Y, use the least-squares regression line to find the best predicted Y for a given value of X.
6. If there is no linear correlation between X and Y, use StatCrunch to find $\bar{y}$; the best predicted Y is $\bar{y}$ for any X.
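The summary steps can be sketched in a few lines of Python as a rough stand-in for the StatCrunch workflow. The following are assumptions, not part of the notes:

- The function names are made up for this example.
- The critical value is passed in by the caller. The 0.950 used for n = 4 should be checked against Table II.
- "Scope of the model" is simplified to the observed range of x.
- The first data set is the assumed reading of the sleep/performance values. The second is invented purely to exercise the regression branch.

```python
import statistics

def correlation(xs, ys):
    """Sample linear correlation coefficient r."""
    n = len(xs)
    x_bar, y_bar = statistics.mean(xs), statistics.mean(ys)
    s_x, s_y = statistics.stdev(xs), statistics.stdev(ys)
    z_products = (((x - x_bar) / s_x) * ((y - y_bar) / s_y) for x, y in zip(xs, ys))
    return sum(z_products) / (n - 1)

def best_predicted_y(xs, ys, x_new, critical_value):
    """Follow the summary: regression line if |r| > critical value, else y-bar."""
    r = correlation(xs, ys)
    y_bar = statistics.mean(ys)
    if abs(r) <= critical_value:
        # No linear relation: the best predicted y is the mean of y for any x.
        return y_bar
    if not min(xs) <= x_new <= max(xs):
        # Simplified scope check: refuse to extrapolate beyond the observed x values.
        raise ValueError("x_new is outside the scope of the model")
    b1 = r * statistics.stdev(ys) / statistics.stdev(xs)  # slope
    b0 = y_bar - b1 * statistics.mean(xs)                 # y-intercept
    return b1 * x_new + b0

# Assumed sleep/performance data: r is about 0.886, below the assumed
# critical value 0.950 for n = 4, so the prediction falls back to y-bar.
sleep_prediction = best_predicted_y([6, 8, 10, 2], [3, 5, 4, 1], 7, 0.950)
print(sleep_prediction)  # 3.25

# Invented, perfectly linear data (y = 2x) to exercise the regression branch.
line_prediction = best_predicted_y([1, 2, 3, 4], [2, 4, 6, 8], 2.5, 0.950)
print(round(line_prediction, 6))  # 5.0
```

Passing the critical value as a parameter keeps the table lookup outside the code, so the sketch does not hard-code values it cannot verify.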