What is the easiest way to lose points when making a scatterplot?

Similar documents
Related Example on Page(s) R , 148 R , 148 R , 156, 157 R3.1, R3.2. Activity on 152, , 190.

7. Do not estimate values for y using x-values outside the limits of the data given. This is called extrapolation and is not reliable.

Chapter 3: Describing Relationships

Warm-up Using the given data Create a scatterplot Find the regression line

Linear Regression Communication, skills, and understanding Calculator Use

AP Statistics Bivariate Data Analysis Test Review. Multiple-Choice

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation

Scatterplots and Correlation

Chapter 12 Summarizing Bivariate Data Linear Regression and Correlation

Chapter 5 Friday, May 21st

IF YOU HAVE DATA VALUES:

INFERENCE FOR REGRESSION

Unit 6 - Introduction to linear regression

Lecture 18: Simple Linear Regression

Analysis of Bivariate Data

3.1 Scatterplots and Correlation

Conditions for Regression Inference:

Determine is the equation of the LSRL. Determine is the equation of the LSRL of Customers in line and seconds to check out.. Chapter 3, Section 2

23. Inference for regression

Review of Regression Basics

The following formulas related to this topic are provided on the formula sheet:

Chapter 27 Summary Inferences for Regression

AMS 7 Correlation and Regression Lecture 8

Unit 6 - Simple linear regression

Test 3A AP Statistics Name:

AP Statistics Unit 2 (Chapters 7-10) Warm-Ups: Part 1

TEST 150 points

Chapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories.

MODELING. Simple Linear Regression. Want More Stats??? Crickets and Temperature. Crickets and Temperature 4/16/2015. Linear Model

Ch. 3 Review - LSRL AP Stats

AP STATISTICS Name: Period: Review Unit IV Scatterplots & Regressions

Chapter 3: Describing Relationships

Nov 13 AP STAT. 1. Check/rev HW 2. Review/recap of notes 3. HW: pg #5,7,8,9,11 and read/notes pg smartboad notes ch 3.

q3_3 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Examining Relationships. Chapter 3

appstats27.notebook April 06, 2017

AP Statistics L I N E A R R E G R E S S I O N C H A P 7

Chapter 9. Correlation and Regression

1. Use Scenario 3-1. In this study, the response variable is

CHAPTER 3 Describing Relationships

STATS DOESN T SUCK! ~ CHAPTER 16

The response variable depends on the explanatory variable.

Summarizing Data: Paired Quantitative Data

Inference for Regression Inference about the Regression Model and Using the Regression Line

Study Guide AP Statistics

MATH 80S Residuals and LSR Models October 3, 2011

Lecture 4 Scatterplots, Association, and Correlation

Chapter 3: Examining Relationships

Start with review, some new definitions, and pictures on the white board. Assumptions in the Normal Linear Regression Model

Correlation & Simple Regression

Inferences for Regression

Chapter 6: Exploring Data: Relationships Lesson Plan

Statistical View of Least Squares

SMAM 319 Exam1 Name. a B.The equation of a line is 3x + y =6. The slope is a. -3 b.3 c.6 d.1/3 e.-1/3

Practice Questions for Exam 1

Chapter 3: Examining Relationships

Sociology 6Z03 Review I

Lecture 4 Scatterplots, Association, and Correlation

Pre-Calculus Multiple Choice Questions - Chapter S8

22S39: Class Notes / November 14, 2000 back to start 1

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6

Approximate Linear Relationships

Multiple Regression an Introduction. Stat 511 Chap 9

AP Statistics Two-Variable Data Analysis

ST Correlation and Regression

Objectives. 2.3 Least-squares regression. Regression lines. Prediction and Extrapolation. Correlation and r 2. Transforming relationships

IT 403 Practice Problems (2-2) Answers

Chapter 3: Examining Relationships

Chapter 10 Correlation and Regression

THE PEARSON CORRELATION COEFFICIENT

Algebra 1 Fall Final Review

3.2: Least Squares Regressions

Correlation. Relationship between two variables in a scatterplot. As the x values go up, the y values go down.

Ch. 9 Pretest Correlation & Residuals

Section I: Multiple Choice Select the best answer for each question.

SMAM 314 Practice Final Examination Winter 2003

y n 1 ( x i x )( y y i n 1 i y 2

Mrs. Poyner/Mr. Page Chapter 3 page 1

Steps to take to do the descriptive part of regression analysis:

Chapter 7. Scatterplots, Association, and Correlation

Ch 13 & 14 - Regression Analysis

Chapter 5 Least Squares Regression

6.1.1 How can I make predictions?

Stat 101 Exam 1 Important Formulas and Concepts 1

Mini-Lecture 4.1 Scatter Diagrams and Correlation

BIVARIATE DATA data for two variables

CHAPTER 4 DESCRIPTIVE MEASURES IN REGRESSION AND CORRELATION

Chapter 7. Linear Regression (Pt. 1) 7.1 Introduction. 7.2 The Least-Squares Regression Line

Multiple Regression Examples

28. SIMPLE LINEAR REGRESSION III

Chapter 4 Describing the Relation between Two Variables

Sem. 1 Review Ch. 1-3

Direct variation. Consider y = mx.

Algebra 1 2nd Semester Exam Review 2015

Business 320, Fall 1999, Final

Simple linear regression: linear relationship between two qunatitative variables. Linear Regression. The regression line

Chapter 4: Regression Models

Quantitative Bivariate Data

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Sampling, Frequency Distributions, and Graphs (12.1)

Transcription:

Day #1: Read 141-142 3.1 Describing Relationships Why do we study relationships between two variables? Read 143-144 Page 144: Check Your Understanding Read 144-149 How do you know which variable to put on which axis? What is the easiest way to lose points when making a scatterplot? Alternate Example: Track and Field Day! The table below shows data for 13 students in a statistics class. Each member of the class ran a 40-yard sprint and then did a long jump (with a running start). Make a scatterplot of the relationship between sprint time (in seconds) and long jump distance (in inches). Sprint Time (s) 5.41 5.05 9.49 8.09 7.01 7.17 6.83 6.73 8.01 5.68 5.78 6.31 6.04 Long Jump Distance (in) 171 184 48 151 90 65 94 78 71 130 173 143 141 20

What four characteristics should you consider when interpreting a scatterplot? Does a strong association between two variables indicate a cause-and-effect relationship? Page 149: Check Your Understanding Read 149-150: Using technology to create scatterplots HW #1: page 158 (1, 5, 7, 9, 11) 21

Day #2: 3.1 Correlation / 3.2 Regression Lines Just like two distributions can have the same shape and center with different spreads, two associations can have the same directions and form, but very different strengths. Read 150-151 How do we measure the strength of a linear relationship between two quantitative variables? What are some characteristics of correlation? Activity page 152 Can you determine the form of a relationship using only the correlation? Is correlation a resistant measure of strength? Read 152-155 Do you need to know the formula for correlation? 22

Read 155-156 What are some additional facts about correlation? Read 164-166 What is a regression line? What is regression to the mean? What is the general form of a regression equation? What is the difference between y and ŷ? How do you interpret the slope and y intercept of a regression line? Alternate Example: Used Hondas The following scatterplot shows the number of miles driven (in thousands) and advertised price (in thousands) for 11 used Honda CR- Vs from the 2002-2006 model years. The regression line shown on the scatterplot is ŷ = 18773 86.18x. Interpret the slope and y intercept in context. 23

Read 166-167 What is extrapolation? Why is it dangerous? Page 167 Check Your Understanding HW #2 page 160 (15-18, 21, 27-32) page 191 (35, 37) Day #3: 3.2 Residuals Read 168-171 What is a residual? How do you interpret a residual? How can we determine the best regression line for a set of data? Is the least-squares regression line resistant to outliers? 24

Alternate Example: Track and Field Day! The table below shows data for 13 students in a statistics class. Each member of the class ran a 40-yard sprint and then did a long jump (with a running start). Sprint Time (s) 5.41 5.05 9.49 8.09 7.01 7.17 6.83 6.73 8.01 5.68 5.78 6.31 6.04 Long Jump Distance (in) 171 184 48 151 90 65 94 78 71 130 173 143 141 (a) Calculate the equation of the least-squares regression line using technology. (b) Interpret the slope in context. Does it make sense to interpret the y intercept? (c) Calculate and interpret the residual for the student who had a sprint time of 8.01 seconds. Read 172-174 How can you calculate the least-squares regression line from summary statistics? What is the relationship between the correlation and the slope? Read 174-177 What is a residual plot? What is the purpose of a residual plot? What two things do you look for in a residual plot? How can you tell if a linear model is appropriate? 25

Construct and interpret a residual plot for the Track-and-Field Day data. Suppose that a student was absent on the day these data were collected. Predict this student s long jump distance. Read 177-181 What is s? How do you calculate and interpret s? What is r 2? How do you calculate and interpret r 2? How are s and r 2 the same? Different? HW #3 page 191 (43, 47, 49, 53, 55, 57, 61) 26

Day #4: Chapter 3 Review (so far) The anticipation of the first blooms of spring flowers is one of the joys of April. One of the most beautiful is that of the Japanese cherry tree. Experience has taught us that, if the spring has been a warm one, the trees will bloom early, but if the spring has been cool, then the blossoms will appear later. Mr. Yamada is a gardener who has observed the date in April when the blossoms first appear for the last 24 years. His son, Hiro, went on the internet and found the average March temperature (in degrees Celsius) in his area for those years. For example, for the first year in the data table, the average March temperature was 4 C and the first bloom happened on the 14 th day of April. Temp Date 4 14 5.4 8 3.2 11 2.6 19 4.2 14 4.7 14 4.9 14 4 21 4.9 9 3.8 14 4 13 5.1 11 4.3 13 1.5 28 3.7 17 3.8 19 4.5 10 4.1 17 6.1 3 6.2 3 5.1 11 5 6 4.6 9 4 11 1. Why should temperature be the explanatory variable? 2. Draw a scatterplot and discuss the noticeable features. 3. Calculate the least squares regression line and graph it on the scatterplot. 4. Interpret the slope in the context of the problem. 5. Interpret the x- and y-intercepts in the context of the problem. Are these reasonable values in this context? 6. Calculate and interpret the value of the correlation coefficient. 7. If the temperature was measured in degrees Fahrenheit, how would this value change? 8. If r is high, can we conclude that warmer temperatures cause earlier blooms? 9. Calculate and interpret the residual for the first year in the data table. 10. Sketch the residual plot. What does it tell you? 11. Calculate and interpret the values of r 2 and s in the context of the problem. 12. If you were to use number of hours in April instead of number of days in April for the date of first bloom, how would the values of r 2 and s change? 13. Predict the date of first bloom for a year with an average March temperature of 3 C. How confident are you in your prediction? HW #4 page 193 (54, 56, 58, 59, 60) 27

Day #5: 3.2 Computer Output/ Regression Wisdom Read 181-183 Alternate Example: Fresh Flowers For their second semester project, two AP Statistics students decided to investigate the effect of sugar on the life of cut flowers. They went to the local grocery store and randomly selected 12 carnations. All the carnations seemed equally healthy when they were selected. When they got home, the students prepared 12 identical vases with exactly the same amount of water in each vase. They put 1 tablespoon of sugar in three vases, 2 tablespoons of sugar in three vases, and 3 tablespoons of sugar in three vases. In the remaining 3 vases, they put no sugar. After the vases were prepared and placed in the same location, the students randomly assigned one flower to each vase and observed how many hours each flower continued to look fresh. A scatterplot, residual plot, and computer output from the regression are shown. Predictor Coef SE Coef T P Constant 181.200 3.635 49.84 0.000 Sugar 15.200 1.943 7.82 0.000 S = 7.52596 R-Sq = 86.0% R-Sq(adj) = 84.5% (a) What is the equation of the least-squares regression line? Define any variables you use. (b) What is the correlation? (c) Is a linear model appropriate for this data? Explain. (d) Calculate and interpret the residual for the flower that got 3 tablespoons of sugar and lasted 222 hours. 28

Read 183-188 Does it matter which variable is x and which is y? What should you always do before calculating the correlation or least-squares regression line? How do outliers affect the correlation, least-squares regression line, and standard deviation of the residuals? Are all outliers influential? Does association imply causation? HW #5 page 194 (63, 68, 69, 71-78) 29