Describing the Relationship between Two Variables

Similar documents
LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

Correlation and Regression

GUIDED NOTES 2.2 LINEAR EQUATIONS IN ONE VARIABLE

Correlation and Regression (Excel 2007)

GUIDED NOTES 4.1 LINEAR FUNCTIONS

Characteristics of Linear Functions (pp. 1 of 8)

We will now find the one line that best fits the data on a scatter plot.

BIVARIATE DATA data for two variables

Linear Equations. Find the domain and the range of the following set. {(4,5), (7,8), (-1,3), (3,3), (2,-3)}

Contents. 9. Fractional and Quadratic Equations 2 Example Example Example

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis

Appendix A. Common Mathematical Operations in Chemistry

Math 243 OpenStax Chapter 12 Scatterplots and Linear Regression OpenIntro Section and

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet

regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist

Reteach 2-3. Graphing Linear Functions. 22 Holt Algebra 2. Name Date Class

Linear Regression and Correlation. February 11, 2009

Simple Linear Regression

Chapter 12 - Part I: Correlation Analysis

This module focuses on the logic of ANOVA with special attention given to variance components and the relationship between ANOVA and regression.

CREATED BY SHANNON MARTIN GRACEY 146 STATISTICS GUIDED NOTEBOOK/FOR USE WITH MARIO TRIOLA S TEXTBOOK ESSENTIALS OF STATISTICS, 3RD ED.

Obtaining Uncertainty Measures on Slope and Intercept

MAC Module 2 Modeling Linear Functions. Rev.S08

Lab 1 Uniform Motion - Graphing and Analyzing Motion

IT 403 Practice Problems (2-2) Answers

GUIDED NOTES 5.6 RATIONAL FUNCTIONS

Sect Polynomial and Rational Inequalities

Chapter 5: Data Transformation

Chapter 7: Exponents

Chapter 4 Describing the Relation between Two Variables

Bivariate Data Summary

Summer Review. For Students Entering. Algebra 2 & Analysis

Introduction to Spectroscopy: Analysis of Copper Ore

How do I apply concepts of rational and irrational numbers? Concepts Competencies Resources Assessments CC E.1. number system properties.

BIOSTATISTICS NURS 3324

STEP 1: Ask Do I know the SLOPE of the line? (Notice how it s needed for both!) YES! NO! But, I have two NO! But, my line is

Algebra I Course Online/Outcomes

Experiment: Oscillations of a Mass on a Spring

MEASUREMENT OF THE CHARGE TO MASS RATIO (e/m e ) OF AN ELECTRON

X On record with the USOE.

Lecture 7, Chapter 7 summary

α m ! m or v T v T v T α m mass

Richter Scale and Logarithms

36-309/749 Math Review 2014

Business Statistics. Lecture 9: Simple Regression

Danville District #118 Mathematics Transitional Algebra (PARCC Framework version) Curriculum and Scope and Sequence

x On record with the USOE.

Linear Motion with Constant Acceleration

Algebra Final Exam Review Packet

Remember that C is a constant and ë and n are variables. This equation now fits the template of a straight line:

M61 1 M61.1 PC COMPUTER ASSISTED DETERMINATION OF ANGULAR ACCELERATION USING TORQUE AND MOMENT OF INERTIA

Module 8: Linear Regression. The Applied Research Center

CHAPTER 4 DESCRIPTIVE MEASURES IN REGRESSION AND CORRELATION

Advanced Chemistry 2008

Physics 1020 Experiment 5. Momentum

Using Microsoft Excel

Dover-Sherborn High School Mathematics Curriculum Algebra I Level 1 CP/Honors

MEASUREMENT OF THE CHARGE TO MASS RATIO (e/m e ) OF AN ELECTRON

Intermediate Algebra Summary - Part I

Chapter 10 Correlation and Regression

Algebra II Summer Packet. Summer Name:

2-7 Solving Quadratic Inequalities. ax 2 + bx + c > 0 (a 0)

Physics 103 Newton s 2 nd Law On Atwood s Machine with Computer Based Data Collection

Prob and Stats, Sep 23

Radnor Middle School Course Overview. Math Introduction to Algebra 1. Prerequisite: Pre-Algebra, Course 3 Grade: 8

STAT 350 Final (new Material) Review Problems Key Spring 2016

Important note: Transcripts are not substitutes for textbook assignments. 1

x On record with the USOE.

Module 2A Turning Multivariable Models into Interactive Animated Simulations

An area chart emphasizes the trend of each value over time. An area chart also shows the relationship of parts to a whole.

Chapter 1 Linear Equations and Graphs

Solving Linear Equations (in one variable)

Chapter 7: Exponents

Geometry Summer Assignment 2018

2017 SUMMER REVIEW FOR STUDENTS ENTERING GEOMETRY

LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION

Sociology 6Z03 Review I

Lesson 3-1: Solving Linear Systems by Graphing

MATH 1150 Chapter 2 Notation and Terminology

Common Core Coach. Mathematics. First Edition

Scatter plot of data from the study. Linear Regression

Regression, Part I. - In correlation, it would be irrelevant if we changed the axes on our graph.

How many states. Record high temperature

correlated to the Utah 2007 Secondary Math Core Curriculum Algebra 1

Madison County Schools Suggested 8 th Grade Math Pacing Guide for Connected Mathematics

Chemical Kinetics: Integrated Rate Laws. ** updated Procedure for Spec 200 use **

A-G Algebra 1. Gorman Learning Center (052344) Basic Course Information

x On record with the USOE.

OHS Algebra 2 Summer Packet

Boyle s Law: A Multivariable Model and Interactive Animated Simulation

College Algebra. Linear Functions and their Applications (Continued) Dr. Nguyen September 24, 2018

GUIDED NOTES 6.4 GRAPHS OF LOGARITHMIC FUNCTIONS

Free Fall. v gt (Eq. 4) Goals and Introduction

Section 1.4. Meaning of Slope for Equations, Graphs, and Tables

COMMON CORE STATE STANDARDS TO BOOK CORRELATION

Summit Public Schools. Summit, New Jersey. Grade Level 8/ Content Area: Mathematics. Length of Course: Full Academic Year

Correlation and Regression Notes. Categorical / Categorical Relationship (Chi-Squared Independence Test)

Least-Squares Regression. Unit 3 Exploring Data

Transcription:

1 Describing the Relationship between Two Variables Key Definitions Scatter : A graph made to show the relationship between two different variables (each pair of x s and y s) measured from the same equation. Linear Relationship: A linear relationship will have all the points close together and no curves, dips, etc. in the graph. It will be an almost straight line up or down. Nonlinear Relationship: A nonlinear relationship will still have the points close together, but will have a curve or dip. No Relationship: Having no relationship in a scatter diagram means that the data doesn t have a particular pattern to it. The data is dispersed. Positively Associated: This is when you have one value increase, the other value will increase as well. The line on your diagram will start low and increase upwards. This is only for equations who have a linear relationship. Negatively Associated: This is when you have one value decrease, the other will decrease as well. The line on your diagram will start high and decrease downwards. This is only for equations who have a linear relationship. Linear Correlation Coefficient: A calculation that shows us the strength of the linear relation and the direction of the linear relation. Lurking Variable: This is something that shows a linear relation between the variables, but is not actually correlated. Residual: A residual is the space in between the observed y and the predicted y which is also known as the error. Slope: The rate that the line is increasing or decreasing at. Y-Intercept: The point on the line where x =. Coefficient of Determination: The percentage of total variation in the response variable that is explained in the least-squares regression line. Total Variation: The deviation between the observed and mean values. Explained Variation: The deviation between the predicted and mean values. Unexplained Variation: The deviation between the observed and predicted values. Scatter, Relations, and Association How a Scatter is Set Up: Your scatter diagram is like a usual graph from algebra where we graph x and y values, but there is no line connecting each point. On the graph, the x-axis is called the explanatory variables and the y-axis is called the response variables. Plot each point on the graph to create the graph.

2 Determining Meaning from the Scatter : After each point is plotted on the graph, you are able to determine if the equation has a linear relation, nonlinear relation, or no relation. You can also determine whether the equation is positively associated or negatively associated. Examples of Different Scatter s: 1 Positive, Linear Scatter 8 6 4 2 5 1 Negative, Linear Scatter 1 8 6 4 2 5 1 Nonlinear Scatter No Relation Scatter 2 15 1 5 5 1 12 1 8 6 4 2 5 1 Linear Correlation Coefficient How to Find the Correlation Coefficient: The following is the formula given to us on how to find the correlation coefficient: r = ( x i x )( y i y ) s x s y n 1 This formula takes a very long time to do by hand. Therefore, we use technology to help us find the answer. We do this using Excel. The following are step by step instructions: 1. First, input your data. Make sure you x-values and y-values are in separate columns. 2. In a blank cell, type in =CORREL(click and drag down to highlight the cells of the x-values, click and drag down to highlight the cells of the y-values) [The text italicized is instructions on what to input]. Press enter and the correlation coefficient answer will replace what you wrote.

3 Example of Finding the Correlation Coefficient: You are given the following data points to find the correlation coefficient: x -values y -values Correlation Coefficient: 82 13 =CORREL(A2:A11, B2:B11) 77 11.97296 9 121 71 1 62 95 68 98 74 1 84 118 94 116 88 114 How to Determine if the Correlation Coefficient Shows a Linear Relation: Using your textbook, go to the table in the back of your book that gives us critical values. Match up your sample size with the n on the chart. The number next to your n is the critical value. If the critical value is less than the absolute value of the correlation coefficient, we have a linear relation between the two variables. If it is not, there is not a linear relation. How to Interpret the Linear Correlation Coefficient: The correlation coefficient is only between -1 and 1. If your correlation coefficient is towards -1, that means you have a negative, linear relation. If your correlation coefficient is towards 1, that means you have a positive, linear relation. If your correlation coefficient is towards, that means you have no relation. Example on How to Determine and Interpret the Correlation Coefficient: Using the following data points, we need to interpret details of the equation from the correlation coefficient. Going into our textbook, we look for the critical value first which is.632 since our sample size is 1. From there we must see if the absolute value of our correlation coefficient is greater than the critical value:.9 >.632 Since our correlation coefficient is greater than the critical value, it is a strong linear relationship. Since our correlation coefficient is positive without the absolute values, it has a positive associativity. The Least-Squares Regression Line Finding the Linear Equation: We select two points from our data. From there, we find the slope for our equation using the slope formula: m = y 2 y 1 x 2 x 1 Note that y 2 and x 2 are the larger values. From there, we plug it into the point slope formula: y y 1 = m(x x 1 ) Make sure when using the point-slope formula use plug in only one of the points that you used to get the slope. We want to have the y and x remain unknown variable for the equation. Solve the equation for y to get your equation. Example of Finding the Linear Equation:

4 Using the data points given above in the correlation coefficient example, we can find the linear equation. First using the points (74, 1) and (68, 98), we will find the slope: 1 98 m = 74 68 = 2 6 = 1 3 =.33 Next, we plug the slope and one of the data points into the slope-point formula to find the equation: y 98 =.33 (x 68) y 98 =.33x 22.44 y =.33x 22.44 + 98 y =.33x + 75.56 Least Squares Method: A residual is the space in-between the observed y and the predicted y. The least squares method tries to make this distance and error as small as possible. To do this, we need to have the observed y (the linear equation) and the predicted y (the least-squares regression line). How to Find the Least-Squares Regression Line: To find the least-squares regression line, you need to find the slope and y-intercept first. We do not find it the same way we find the linear equation s slope and y-intercept. We do this by first finding the slope (we use the symbol b 1 ). To find the slope, you need to have the correlation coefficient, the standard deviation of the y-values, and the standard deviation of the x-values. The following is the formula for the slope: b 1 = r s y s x After finding the slope, you can now find the y-intercept. To find the y-intercept, you need to have the slope, mean of the x-values, and the mean of the y-values. The following is the formula: b = y b 1 x From there, you have all the information needed to put your information into the least-squares regression line formula: y = b 1 x + b Example of Finding the Least-Squares Regression Line: Using the data values from the example on the correlation coefficient, we know r =.9, x = 79, y = 16.6, s x = 1.35, and s y = 9.55. First, we find the slope: b 1 = r s y =.9 9.55 =.9 (.92) =.83 s x 1.35 Now that we know the slope, we can find the y-intercept: b = y b 1 x = 16.6.83(79) = 16.6 65.57 = 41.3 Now, we can put the information into our least-squares regression line formula: y = b 1 x + b y =.83x + 41.3 The Coefficient of Determination How to Find the Coefficient of Determination: To find the coefficient of determination for a leastsquares regression line, you take your linear correlation coefficient and square it. Therefore, the formula is the following: R 2 = r 2 Example of How to Find the Coefficient of Determination: Using the data from the linear correlation coefficient example, we know that r =.9. Now we just plug it into the formula:

5 R 2 = (.9) 2 =.81 Since the coefficient of determination is a percentage, we just turn our decimal into a percentage by multiplying it by 1%. Therefore, our answer is 81%. Symbol Guide Chapter Title Symbols Term Symbol Use Sample Mean x To identify the sample mean Sample Standard Deviation To identify the sample standard s deviation Sample Size n To identify the sample size Sum To identify when we add up everything Linear Correlation Coefficient To identify the correlation r coefficient Linear Line Slope m To identify the slope of a linear line Y-Intercept To identify the y-intercept of a b linear line Least-Squares Regression Line Slope To identify the least-squares b 1 regression line slope Least-Squares Regression Line Y-Intercept To identify the least-squares b regression line y-intercept Predicted y To identify the predicted y of a y least-squares regression line The Coefficient of Determination R 2 To identify the coefficient of determination