AP Statistics Two-Variable Data Analysis

Similar documents
Bivariate Data Summary

Describing Bivariate Relationships

5.1 Bivariate Relationships

BIVARIATE DATA data for two variables

CHAPTER. Scatterplots

Review of Regression Basics

Chapter 6 Scatterplots, Association and Correlation

Chapter 3: Examining Relationships

Nov 13 AP STAT. 1. Check/rev HW 2. Review/recap of notes 3. HW: pg #5,7,8,9,11 and read/notes pg smartboad notes ch 3.

Steps to take to do the descriptive part of regression analysis:

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis

Example: Can an increase in non-exercise activity (e.g. fidgeting) help people gain less weight?

Chapter 4 Data with Two Variables

Scatterplots and Correlation

Chapter 4 Data with Two Variables

Chapter 6: Exploring Data: Relationships Lesson Plan

Lecture 45 Sections Mon, Apr 14, 2008

Ch Inference for Linear Regression

Session 4 2:40 3:30. If neither the first nor second differences repeat, we need to try another

Least Squares Regression

y n 1 ( x i x )( y y i n 1 i y 2

Unit 6 - Introduction to linear regression

Regression Using an Excel Spreadsheet Using Technology to Determine Regression

Correlation A relationship between two variables As one goes up, the other changes in a predictable way (either mostly goes up or mostly goes down)

Least-Squares Regression. Unit 3 Exploring Data

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation

Lecture 4 Scatterplots, Association, and Correlation

Chapter 3: Examining Relationships

Lecture 4 Scatterplots, Association, and Correlation

Prob/Stats Questions? /32

CHAPTER 3 Describing Relationships

Name Class Date. Residuals and Linear Regression Going Deeper

THE PEARSON CORRELATION COEFFICIENT

Chapter 4 Describing the Relation between Two Variables

3.2: Least Squares Regressions

Chapter 3: Describing Relationships

Reminder: Univariate Data. Bivariate Data. Example: Puppy Weights. You weigh the pups and get these results: 2.5, 3.5, 3.3, 3.1, 2.6, 3.6, 2.

Section 2.2: LINEAR REGRESSION

Chapter 12 : Linear Correlation and Linear Regression

6.1.1 How can I make predictions?

Analysis of Bivariate Data

appstats8.notebook October 11, 2016

7. Do not estimate values for y using x-values outside the limits of the data given. This is called extrapolation and is not reliable.

Chapter 6. Exploring Data: Relationships

Chapter 3: Describing Relationships

Objectives. 2.3 Least-squares regression. Regression lines. Prediction and Extrapolation. Correlation and r 2. Transforming relationships

Related Example on Page(s) R , 148 R , 148 R , 156, 157 R3.1, R3.2. Activity on 152, , 190.

3.1 Scatterplots and Correlation

Scatterplots. 3.1: Scatterplots & Correlation. Scatterplots. Explanatory & Response Variables. Section 3.1 Scatterplots and Correlation

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation

MODELING. Simple Linear Regression. Want More Stats??? Crickets and Temperature. Crickets and Temperature 4/16/2015. Linear Model

Review of Regression Basics

Chapter 10 Correlation and Regression

Chapter 5 Friday, May 21st

Chapter 12 Summarizing Bivariate Data Linear Regression and Correlation

Linear Regression and Correlation. February 11, 2009

Important note: Transcripts are not substitutes for textbook assignments. 1

Unit 6 - Simple linear regression

Chapter 12: Linear Regression and Correlation

SAMPLE. Investigating the relationship between two numerical variables. Objectives

The response variable depends on the explanatory variable.

AP Statistics L I N E A R R E G R E S S I O N C H A P 7

Chapter 6. September 17, Please pick up a calculator and take out paper and something to write with. Association and Correlation.

Chapter 11. Correlation and Regression

If the roles of the variable are not clear, then which variable is placed on which axis is not important.

Unit 2, Ongoing Activity, Little Black Book of Algebra II Properties

Correlation & Regression

11 Correlation and Regression

AP Final Review II Exploring Data (20% 30%)

AP Stats ~ 3A: Scatterplots and Correlation OBJECTIVES:

Overview. 4.1 Tables and Graphs for the Relationship Between Two Variables. 4.2 Introduction to Correlation. 4.3 Introduction to Regression 3.

Correlation and Regression

s e, which is large when errors are large and small Linear regression model

Key Concepts. Correlation (Pearson & Spearman) & Linear Regression. Assumptions. Correlation parametric & non-para. Correlation

Introductory Statistics

SCATTERPLOTS. We can talk about the correlation or relationship or association between two variables and mean the same thing.

Chapter 5 Least Squares Regression

Determine is the equation of the LSRL. Determine is the equation of the LSRL of Customers in line and seconds to check out.. Chapter 3, Section 2

Chapter 8. Linear Regression /71

Chapter 7. Scatterplots, Association, and Correlation

a. Length of tube: Diameter of tube:

Chapter 9. Correlation and Regression

Chapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation

The following formulas related to this topic are provided on the formula sheet:

1) A residual plot: A)

AP STATISTICS Name: Period: Review Unit IV Scatterplots & Regressions

Mrs. Poyner/Mr. Page Chapter 3 page 1

Can you tell the relationship between students SAT scores and their college grades?

Algebra II Notes Quadratic Functions Unit Applying Quadratic Functions. Math Background

Introduction and Single Predictor Regression. Correlation

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y

Summarizing Data: Paired Quantitative Data

PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics

Stat 101 Exam 1 Important Formulas and Concepts 1

OHS Algebra 2 Summer Packet

Lecture 3. The Population Variance. The population variance, denoted σ 2, is the sum. of the squared deviations about the population

Examining Relationships. Chapter 3

1. Use Scenario 3-1. In this study, the response variable is

Test 3A AP Statistics Name:

Transcription:

AP Statistics Two-Variable Data Analysis

Key Ideas Scatterplots Lines of Best Fit The Correlation Coefficient Least Squares Regression Line Coefficient of Determination Residuals Outliers and Influential Points Transformations to Achieve Linearity

Two-Variable Data When looking at two-variable data, we are primarily interested in whether or not two variables have a linear relationship and how changes in one variable can predict changes in the other variable. Two-variable data is also called bivariate data.

Response, Explanatory A response variable measures an outcome of a study. An explanatory variable helps explain or influences changes in a response variable. Exercises: page 173 problems 3.1 3.4

Example STUDENT HOURS STUDIED SCORE ON EXAM A 0.5 65 B 2.5 80 C 3.0 77 D 1.5 60 E 1.25 68 F 0.75 70 G 4.0 83 H 2.25 85 I 1.5 70 J 6.0 96 K 3.25 84 L 2.5 84 M 0.0 51 N 1.75 63 O 2.0 71

Example, cont. A teacher wanted to know if additional studying resulted in higher grades. In other words, does studying have an effect on test performance? Draw a scatterplot putting one variable on the horizontal axis and the other on the vertical axis. If we have an explanatory variable, it should go on the horizontal axis and the response variable on the vertical axis.

Interpreting a Scatterplot Direction, form and strength of the relationship Striking deviations Outliers

Association? If the variable on the vertical axis tends to increase as the variable on the horizontal axis increases, we say that the two variables are positively associated. If one of them decreases as the other increases, we say they are negatively associated.

Calculator Tip To draw a scatterplot on your calculator, enter the data in two lists (L1 and L2). Go to STAT PLOT and choose the scatterplot icon. Enter L1 for Xlist and L2 for Ylist. Do ZOOM: ZoomStat. See Technology Toolbox on page 183

Exercises Page 179, problems 3.5 3.10

Correlation We are primarily interested in determining the extent to which two variables are linearly associated. The first statistic we have to determine a linear relationship is the Pearson product moment correlation, or more simply, the correlation coefficient, denoted by the letter r. The correlation coefficient is a measure of the strength of the linear relationship between two variables as well as an indicator of the direction of the linear relationship.

Formula If we have a sample of size n of paired data, say (x,y), and assuming that we have computed summary statistics for x and y (means and standard deviations), the correlation coefficient r is defined as follows: r 1 n 1 x x i y y i s x s y Find r for the previous example.

1 r 1 Properties of r If r is positive, it indicates that the variables are positively associated. If r is negative, the variables are negatively associated. If r = 0, it indicates that there is no linear association that would allow us to predict y from x. It doesn t mean that there is no relationship just not a linear one. It doesn t matter which variable you call x and which one you call y. r doesn t depend on units of measurements. r is not resistant to extreme values because it is based on the mean.

Guidelines There are no hard and fast rules about how strong a relationship is based on the numerical value of r. VALUE OF r -1 < r < -0.8 0.8 < r < 1-0.8 < r < -0.5 0.5 < r < 0.8 STRENGTH OF RELATIONSHIP strong moderate -0.5 < r < 0.5 weak

Calculator Tip You will first need to turn Diagnostic On. You can find this in CATALOG. Choose it and press ENTER twice. Enter values in L1 and L2. STAT: CALC: LinReg(a + bx) then ENTER Enter L1, L2 and press ENTER

Exercises Page 188-189, problems 3.13, 3.14

Correlation and Causation Association does not imply causation! Just because two things seem to go together does not mean that one caused the other. Some third variable, called a lurking variable, may be influencing them both.

YEAR Number of Methodist Ministers in New England 1860 63 8376 1865 48 6406 1870 53 7005 1875 64 8486 1880 72 9595 1885 80 10,643 1890 85 11,265 1895 76 10.071 1900 80 10,547 1905 83 11,008 1910 105 13,885 1915 140 18,559 1920 175 23,024 1925 183 24,185 1930 192 25,434 1935 221 29.238 Number of barrels of Cuban rum imported to Boston

Cautions Correlation requires that both variables be quantitative. Correlation does not describe curved relationships between variables, no matter how strong they are. Correlation is not a complete summary of two-variable data.

Line of Best Fit Once we have determined that two variables have a strong linear relationship, we can find a line of best fit so that we can predict values. This is called linear regression. In this situation, it matters which variable we call x and which one we call y. The line we are looking for is called the least squares regression line.

Exercises Page 193-195, problems 3.15-3.20 Section 3.1 Exercises: page 196-199, problems 3.21-3.28