Correlation (pp. 1 of 6)

Similar documents
SOCIETY FOR AMERICAN BASEBALL RESEARCH The Pythagorean Won-Loss Formula in Baseball: An Introduction to Statistics and Modeling

SOCIETY FOR AMERICAN BASEBALL RESEARCH The Pythagorean Won-Loss Formula in Baseball: An Introduction to Statistics and Modeling

Data and slope and y-intercepts, Oh My! Linear Regression in the Common Core

Lesson 1 - Pre-Visit Safe at Home: Location, Place, and Baseball

The Pythagorean Won-Loss Theorem: An introduction to modeling

The Pythagorean Won-Loss Theorem: An introduction to modeling

q3_3 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Session 4 2:40 3:30. If neither the first nor second differences repeat, we need to try another

Pythagoras at the Bat: An Introduction to Statistics and Mathematical Modeling

Pythagoras at the Bat: An Introduction to Statistics and Mathematical Modeling

Pythagoras at the Bat: An Introduction to Stats and Modeling

13: SIMPLE LINEAR REGRESSION V

Pythagoras at the Bat: An Introduction to Stats and Modeling

12.2. Measures of Central Tendency: The Mean, Median, and Mode

Includes games of Wednesday, March 4, 2015 Major League Baseball

Name. The data below are airfares to various cities from Baltimore, MD (including the descriptive statistics).

Simple Linear Regression

Exponential Regression. Suppose we have paired sample data {{ x 1, y 1 }, { x 2, y 2 },..., { x n, y n }}, with all x i > 0,

Chapter 15: Other Regression Statistics and Pitfalls

Solving Quadratic Equations by Graphing 6.1. ft /sec. The height of the arrow h(t) in terms

Prof. Bodrero s Guide to Derivatives of Trig Functions (Sec. 3.5) Name:

Sem. 1 Review Ch. 1-3

Section 2.5 from Precalculus was developed by OpenStax College, licensed by Rice University, and is available on the Connexions website.

Includes games of Friday, February 23, 2018 Major League Baseball

Overview. 4.1 Tables and Graphs for the Relationship Between Two Variables. 4.2 Introduction to Correlation. 4.3 Introduction to Regression 3.

9-4. Quadratics and Projectiles. Vocabulary. Equations for the Paths of Projectiles. Activity. Lesson

Describing Bivariate Relationships

5.1 Bivariate Relationships

AP Statistics Bivariate Data Analysis Test Review. Multiple-Choice

Assignment #3; Exponential Functions

The Best, The Hottest, & The Luckiest: A Statistical Tale of Extremes

a) Do you see a pattern in the scatter plot, or does it look like the data points are

Math 243 OpenStax Chapter 12 Scatterplots and Linear Regression OpenIntro Section and

Mrs. Poyner/Mr. Page Chapter 3 page 1

Correlation Coefficient: the quantity, measures the strength and direction of a linear relationship between 2 variables.

Examining Relationships. Chapter 3

Quadratic Graphs and Their Properties

Some questions (c) 2012 by Region 10 Educational Service Center. Some questions (c) 2012 by STAAR Test Maker. Page 2 GO ON

One Variable and Two Variable Statistics

Scatterplots. 3.1: Scatterplots & Correlation. Scatterplots. Explanatory & Response Variables. Section 3.1 Scatterplots and Correlation

Unit 4 Linear Functions

The top two Wild Card teams from each league make the postseason and play each other in a one game playoff.

1-2 Analyzing Graphs of Functions and Relations

College Algebra. Word Problems

( )( ) of wins. This means that the team won 74 games.

20. Ignore the common effect question (the first one). Makes little sense in the context of this question.

Exercises 36 CHAPTER 2/ORGANIZATION AND DESCRIPTION OF DATA

Algebra II Notes Quadratic Functions Unit Applying Quadratic Functions. Math Background

Unit #2: Linear and Exponential Functions Lesson #13: Linear & Exponential Regression, Correlation, & Causation. Day #1

Central Tendency & Graphs of Data Long-Term Memory Review Grade 7, Standard 5.0 Review 1

Show that the set of ordered pairs (x, y) in the table below satisfied a quadratic relationship. Find. Think Pair Share

Year 10 Mathematics Semester 2 Bivariate Data Chapter 13

Comparing Linear, Exponential, and Quadratic Functions

Analyzing Lines of Fit

6.1.1 How can I make predictions?

DEFINITION Function A function from a set D to a set R is a rule that assigns a unique element in R to each element in D.

Topic 6 Part 1 [251 marks]

Algebra 1: Spring Semester Review

a. Yes, it is consistent. a. Positive c. Near Zero

American Tour: Climate Objective To introduce contour maps as data displays.

People, Tea, and Carbon Dioxide

Elementary Algebra SAMPLE Final Examination Spring 2015

3 9 Curve Fitting with Polynomials

Steps to take to do the descriptive part of regression analysis:

Linear Regression Communication, skills, and understanding Calculator Use

Math 12 - for 4 th year math students

SWBAT: Graph exponential functions and find exponential curves of best fit

a. Write what the survey would look like (Hint: there should be 2 questions and options to select for an answer!).

NAME: DATE: SECTION: MRS. KEINATH

ACTIVITY: Reading Thermometers. Work with a partner. The thermometers show the temperatures in four cities.

Modeling with quadratic functions Student Activity Sheet 5; use with Exploring Using y = ax 2 + bx + c to model data

City Number Pct. 1.2 STEMS AND LEAVES

CURRICULUM CATALOG. Algebra II (3135) VA

Intermediate Algebra Final Exam Review

Lesson 2: Exploring Quadratic Relations Quad Regression Unit 5 Quadratic Relations

Bivariate Data Summary

IE 336 Seat # Name (clearly) Closed book. One page of hand-written notes, front and back. No calculator. 60 minutes.

On a connection between the Bradley-Terry model and the Cox proportional hazards model

Percentile: Formula: To find the percentile rank of a score, x, out of a set of n scores, where x is included:

Cumulative chapters 1-3 review Period: 1. Tell whether each graph represents a function.

Interpreting Correlation & Examining Cause and Effect

Regression of Inflation on Percent M3 Change

First Quartile = 26 Third Quartile = 35 Interquartile Range = 9

Teaching a Prestatistics Course: Propelling Non-STEM Students Forward

Comparing Quantitative Variables

Chapter 8 Polynomials and Factoring

6.1 Quadratic Expressions, Rectangles, and Squares. 1. What does the word quadratic refer to? 2. What is the general quadratic expression?

Radical and Rational Functions

CHAPTER 5-1. Regents Exam Questions - PH Algebra Chapter 5 Page a, P.I. 8.G.13 What is the slope of line shown in the

Lesson 5 - Practice Problems

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Unit 3: Relations and Functions

(quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables)

AP Statistics Summer Assignment

Guidelines for comparing boxplots

x and y, called the coordinates of the point.

The following formulas related to this topic are provided on the formula sheet:

IF YOU HAVE DATA VALUES:

16.2 Solving Exponential Equations

Transcription:

Correlation (pp. 1 of 6) Car dealers want to know how mileage affects price on used Corvettes. Biologists are studying the effects of temperature on cricket chirps. Farmers are trying to determine if there is a relationship between the yield of a type of grain and the time between flowering and harvesting in that type of grain. In the collection and analysis of these data, it is important to determine not only how the two variables are related but also the strength of the relationship. Why would the strength of the relationship be important? 1. Study the scatterplots below and describe the relationship between the two variables. Graph A Graph B Graph C Graph D 2009, TESCCC 08/01/09 page 54 of 69

Correlation (pp. 2 of 6) 2. Use the scatterplots to compare the strength of the relationship between the variables. Let s Play Ball The following set of data gives the percentage of games won by teams in the American Football League and the average attendance (rounded to the nearest thousand) for each home game in the 1993 season. 3. Sketch a scatterplot and use the graph to determine if there is a relationship between the variables. Find a linear regression model for the relationship. Study the graph and predict the strength of the relationship. Team Percentage of Games Won Average Attendance per Home Game (in thousands) Baltimore Orioles 53 45 Boston Red Socks 49 30 California Angels 44 25 Chicago White Socks 58 32 Cleveland Indians 47 27 Detroit Tigers 53 24 Kansas City Royals 52 24 Milwaukee Brewers 43 21 Minnesota Twins 44 25 New York Yankees 54 30 Oakland Athletics 42 25 Seattle Mariners 51 25 Texas Rangers 53 28 Toronto Blue Jays 59 50 2009, TESCCC 08/01/09 page 55 of 69

Correlation (pp. 3 of 6) In order to make appropriate and accurate predictions and decisions from the data, we must go further than just analysis of the graphs. One factor to consider is the value of the correlation coefficient. This is a numeric value that assesses the strength of the relationship between the first and second coordinates in a set of points. The value of the correlation coefficient lays in the range -1 r 1. If the points in a scatterplot do not appear to have any relationship, the correlation value will be approximately zero. If the points in a scatterplot all fall on a regression line that increases from left to right, the linear correlation value will equal +1. If the points in a scatterplot all fall on a regression line that decreases from left to right, the linear correlation value will equal -1. r= +1 r= -1 The common correlation value classifications are given in the following list. Correlation can be found for both linear and non-linear data. 0 < r < 0.33 Weak, very weak, to no correlation as it approaches 0 0.34 < r < 0.67 Moderate correlation 0.68 < r < 1.00 Strong, very strong, to perfect at ± 1.00 4. Analyze the scatterplots on Graphs A, B, C, and D on the first page of this handout and estimate the correlation coefficient value for each graph. Explain your reasoning. 2009, TESCCC 08/01/09 page 56 of 69

Correlation (pp. 4 of 6) 5. Stronger correlation values mean that the model can better be used to make predictions and decisions from the data. Which sets of data in Graphs A, B, C, and D could best be used to make predictions? Explain your reasoning. Calculating the Correlation Value by Formula (Optional) The correlation coefficient, r, can be calculated manually for a set of (x, y) data points using the following formula. r x x y y S x n S y x = mean of the x variables x = individual domain variables y = mean of the y variables y = individual range variables S x = standard deviation of the x variables S = standard deviation of the y variables y sum of all values n = number of points Correlation Coefficient by Graphing Calculator Although the correlation coefficient can be calculated manually, it is much quicker and more reliable to use statistical software or the graphing calculator. When the regression equation is determined on the graphing calculator, the calculator can simultaneously give the r-value, if the diagnostic tool of the calculator is turned on. Turn on the diagnostics: CATALOG Scroll to DiagnosticOn ENTER ENTER 2009, TESCCC 08/01/09 page 57 of 69

Correlation (pp. 5 of 6) 6. Enter the data comparing percentage of games won to average attendance at baseball games in 1993 into the L1 and L2 lists in the graphing calculator. Graph a scatterplot of the data on the calculator. Run a linear regression. (Be sure to turn on the diagnostics.) What is the strength of the linear regression according to the r-value? Enter the linear function in y1. Use the graph and function model to describe the situation. 2009, TESCCC 08/01/09 page 58 of 69

Correlation (pp. 6 of 6) Independent Practice 1. Sketch a scatterplot to represent each of the following. Give a range between which the r- value would fall. a. Moderate, positive, linear correlation b. Moderate, negative, quadratic correlation c. Perfect, negative, linear correlation d. Weak, positive, linear correlation e. Strong, positive, exponential correlation 2. The table below gives the first semester and second semester scores for last year s Math Models course. Analyze the data and use the data to determine if the first semester scores for this year s class would make good predictors for the second semester scores. Do any scores not fit the predictions? If so, explain the possible reasons. 1 st Semester 2 nd Semester 81 80 75 82 71 83 61 57 96 100 56 30 85 68 18 56 70 40 77 87 71 65 91 86 88 82 79 57 77 75 68 47 2009, TESCCC 08/01/09 page 59 of 69