CHAPTER 3 Describing Relationships

Similar documents
CHAPTER 3 Describing Relationships

Scatterplots. 3.1: Scatterplots & Correlation. Scatterplots. Explanatory & Response Variables. Section 3.1 Scatterplots and Correlation

CHAPTER 1 Exploring Data

Chapter 6: Exploring Data: Relationships Lesson Plan

Describing Bivariate Relationships

Basic Practice of Statistics 7th

BIVARIATE DATA data for two variables

5.1 Bivariate Relationships

CHAPTER 10 Comparing Two Populations or Groups

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation

CHAPTER 2 Modeling Distributions of Data

The response variable depends on the explanatory variable.

F O R SOCI AL WORK RESE ARCH

Chapter 3: Describing Relationships

CHAPTER 10 Comparing Two Populations or Groups

CHAPTER 10 Comparing Two Populations or Groups

Chapter 3: Describing Relationships

Scatterplots and Correlations

3.1 Scatterplots and Correlation

CHAPTER 7 Sampling Distributions

AP Stats ~ 3A: Scatterplots and Correlation OBJECTIVES:

Bivariate Data Summary

Lecture 4 Scatterplots, Association, and Correlation

Lecture 4 Scatterplots, Association, and Correlation

Linear Regression and Correlation. February 11, 2009

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation

Chapter 6. September 17, Please pick up a calculator and take out paper and something to write with. Association and Correlation.

appstats8.notebook October 11, 2016

Notes 21: Scatterplots, Association, Causation

AP Statistics Two-Variable Data Analysis

Looking at Data Relationships. 2.1 Scatterplots W. H. Freeman and Company

Scatterplots and Correlation

1) A residual plot: A)

Lecture 3. The Population Variance. The population variance, denoted σ 2, is the sum. of the squared deviations about the population

Scatterplots. STAT22000 Autumn 2013 Lecture 4. What to Look in a Scatter Plot? Form of an Association

If the roles of the variable are not clear, then which variable is placed on which axis is not important.

Year 10 Mathematics Semester 2 Bivariate Data Chapter 13

Chapter 3: Examining Relationships

T i t l e o f t h e w o r k : L a M a r e a Y o k o h a m a. A r t i s t : M a r i a n o P e n s o t t i ( P l a y w r i g h t, D i r e c t o r )

Chapter 7: Sampling Distributions

Important note: Transcripts are not substitutes for textbook assignments. 1

AP Statistics L I N E A R R E G R E S S I O N C H A P 7

Chapter 7: Sampling Distributions

Objectives. 2.1 Scatterplots. Scatterplots Explanatory and response variables Interpreting scatterplots Outliers

MATH140 Exam 2 - Sample Test 1 Detailed Solutions

GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Unit 6 - Introduction to linear regression

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals

Unit 6 - Simple linear regression

CHAPTER 1 Exploring Data

The empirical ( ) rule

Chapter 12 Summarizing Bivariate Data Linear Regression and Correlation

Graphing Data. Example:

Chapter 1: Exploring Data

Edexcel GCE A Level Maths Further Maths 3 Matrices.

Measures of. U4 C 1.2 Dot plot and Histogram 2 January 15 16, 2015

Chapter 7. Scatterplots, Association, and Correlation. Copyright 2010 Pearson Education, Inc.

Chapter 7 Summary Scatterplots, Association, and Correlation

Simple Linear Regression

CHAPTER 5 Probability: What Are the Chances?

Chapter 7. Association, and Correlation. Scatterplots & Correlation. Scatterplots & Correlation. Stat correlation.

Chapter 8: Estimating with Confidence

AMS 7 Correlation and Regression Lecture 8

Nov 13 AP STAT. 1. Check/rev HW 2. Review/recap of notes 3. HW: pg #5,7,8,9,11 and read/notes pg smartboad notes ch 3.

ab = c a If the coefficients a,b and c are real then either α and β are real or α and β are complex conjugates

What students need to know for... ALGEBRA II

MFM2P 3.1 Interpreting Scatter Plots Name:

Review Math for Students Entering Geometry

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to:

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

{ }. The dots mean they continue in that pattern to both

Algebra 1 Summer Review Packet DUE THE FIRST DAY OF SCHOOL

Chapter 7. Scatterplots, Association, and Correlation. Scatterplots & Correlation. Scatterplots & Correlation. Stat correlation

Random Variables Chris Piech CS109, Stanford University. Piech, CS106A, Stanford University

AP STATISTICS Name: Period: Review Unit IV Scatterplots & Regressions

Winning a Candy Guessing Game using Volume Estimations

Central Tendency & Graphs of Data Long-Term Memory Review Grade 7, Standard 5.0 Review 1

3.2: Least Squares Regressions

SUMMER MATH PACKET College Algebra and Trigonometry A COURSE 235 and Pre-Calculus A COURSE 241

A C E. Answers Investigation 4. Applications

Chapter 6 Scatterplots, Association and Correlation

Scatterplots and Correlation

Chapter 1: Exploring Data

12.4. The Normal Distribution: A Problem-Solving Tool

7.0 Lesson Plan. Regression. Residuals

STATISTICS Relationships between variables: Correlation

Chapter 7. Scatterplots, Association, and Correlation

Review of Regression Basics

Chapter 5 Least Squares Regression

Level 1 Exam Appalachian State University

Q Scheme Marks AOs. Notes. Ignore any extra columns with 0 probability. Otherwise 1 for each. If 4, 5 or 6 missing B0B0.

For the given equation, first find the x-intercept by setting y = 0: Next, find the y-intercept by setting x = 0:

6.1.1 How can I make predictions?

STAB22 Statistics I. Lecture 7

Limits and Continuity. 2 lim. x x x 3. lim x. lim. sinq. 5. Find the horizontal asymptote (s) of. Summer Packet AP Calculus BC Page 4

Mrs. Poyner/Mr. Page Chapter 3 page 1

Bivariate data data from two variables e.g. Maths test results and English test results. Interpolate estimate a value between two known values.

probability of k samples out of J fall in R.

Transcription:

CHAPTER 3 Describing Relationships 3.1 Scatterplots and Correlation The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers

Scatterplots and Correlation Learning Objectives After this section, you should be able to: IDENTIFY explanatory and response variables in situations where one variable helps to explain or influences the other. MAKE a scatterplot to display the relationship between two quantitative variables. DESCRIBE the direction, form, and strength of a relationship displayed in a scatterplot and identify outliers in a scatterplot. INTERPRET the correlation. UNDERSTAND the basic properties of correlation, including how the correlation is influenced by outliers USE technology to calculate correlation. EXPLAIN why association does not imply causation. The Practice of Statistics, 5 th Edition 2

Explanatory and Response Variables Most statistical studies examine data on more than one variable. In many of these settings, the two variables play different roles. A response variable measures an outcome of a study. An explanatory variable may help explain or influence changes in a response variable. Note: In many studies, the goal is to show that changes in one or more explanatory variables actually cause changes in a response variable. However, other explanatory-response relationships don t involve direct causation. The Practice of Statistics, 5 th Edition 3

Displaying Relationships: Scatterplots A scatterplot shows the relationship between two quantitative variables measured on the same individuals. The values of one variable appear on the horizontal axis, and the values of the other variable appear on the vertical axis. Each individual in the data appears as a point on the graph. How to Make a Scatterplot 1. Decide which variable should go on each axis. Remember, the explanatory variable goes on the X-axis! 2. Label and scale your axes. 3. Plot individual data values. The Practice of Statistics, 5 th Edition 4

Describing Scatterplots To describe a scatterplot, follow the basic strategy of data analysis from Chapters 1 and 2. Look for patterns and important departures from those patterns. How to Examine a Scatterplot As in any graph of data, look for the overall pattern and for striking departures from that pattern. You can describe the overall pattern of a scatterplot by the direction, form, and strength of the relationship. An important kind of departure is an outlier, an individual value that falls outside the overall pattern of the relationship. The Practice of Statistics, 5 th Edition 5

Describing Scatterplots Two variables have a positive association when above-average values of one tend to accompany above-average values of the other and when below-average values also tend to occur together. Two variables have a negative association when above-average values of one tend to accompany below-average values of the other. Strength Describe the scatterplot. Direction Form There is a moderately strong, negative, curved relationship between the percent of students in a state who take the SAT and the mean SAT math score. Further, there are two distinct clusters of states and two possible outliers that fall outside the overall pattern. The Practice of Statistics, 5 th Edition 6

Example: Describing a scatterplot Direction: In general, it appears that teams that score more points per game have more wins and teams that score fewer points per game have fewer wins. We say that there is a positive association between points per game and wins. Form: There seems to be a linear pattern in the graph (that is, the overall pattern follows a straight line). Strength: Because the points do not vary much from the linear pattern, the relationship is fairly strong. There do not appear to be any values that depart from the linear pattern, so there are no outliers. The Practice of Statistics, 5 th Edition 7

Measuring Linear Association: Correlation A scatterplot displays the strength, direction, and form of the relationship between two quantitative variables. Linear relationships are important because a straight line is a simple pattern that is quite common. Unfortunately, our eyes are not good judges of how strong a linear relationship is. The correlation r measures the direction and strength of the linear relationship between two quantitative variables. r is always a number between -1 and 1 r > 0 indicates a positive association. r < 0 indicates a negative association. Values of r near 0 indicate a very weak linear relationship. The strength of the linear relationship increases as r moves away from 0 towards -1 or 1. The extreme values r = -1 and r = 1 occur only in the case of a perfect linear relationship. The Practice of Statistics, 5 th Edition 8

Measuring Linear Association: Correlation The Practice of Statistics, 5 th Edition 9

Calculating Correlation The formula for r is a bit complex. It helps us to see what correlation is, but in practice, you should use your calculator or software to find r. How to Calculate the Correlation r Suppose that we have data on variables x and y for n individuals. The values for the first individual are x 1 and y 1, the values for the second individual are x 2 and y 2, and so on. The means and standard deviations of the two variables are x-bar and s x for the x-values and y-bar and s y for the y-values. The correlation r between x and y is: r = 1 é æ ê ç n -1ë ê è r = 1 n -1 å x 1 - x s x æ ç è x i - x s x ö æ y - y ö 1 ç ø è s y ø + æ x - x ö æ 2 ç y - y ö 2 è s ç x ø è s y ø +...+ æ x - x ö æ n ç y - y ö ù n è s ç x ø è s ú y ø û ú ö æ y - y ö i ç ø è s y ø The Practice of Statistics, 5 th Edition 10

Facts About Correlation How correlation behaves is more important than the details of the formula. Here are some important facts about r. 1. Correlation makes no distinction between explanatory and response variables. 2. r does not change when we change the units of measurement of x, y, or both. 3. The correlation r itself has no unit of measurement. Cautions: Correlation requires that both variables be quantitative. Correlation does not describe curved relationships between variables, no matter how strong the relationship is. Correlation is not resistant. r is strongly affected by a few outlying observations. Correlation is not a complete summary of two-variable data. The Practice of Statistics, 5 th Edition 11

Correlation Practice For each graph, estimate the correlation r and interpret it in context. The Practice of Statistics, 5 th Edition 12

Scatterplots and Correlation Section Summary In this section, we learned how to IDENTIFY explanatory and response variables in situations where one variable helps to explain or influences the other. MAKE a scatterplot to display the relationship between two quantitative variables. DESCRIBE the direction, form, and strength of a relationship displayed in a scatterplot and identify outliers in a scatterplot. INTERPRET the correlation. UNDERSTAND the basic properties of correlation, including how the correlation is influenced by outliers USE technology to calculate correlation. EXPLAIN why association does not imply causation. The Practice of Statistics, 5 th Edition 13