Nov 13 AP STAT. 1. Check/rev HW 2. Review/recap of notes 3. HW: pg #5,7,8,9,11 and read/notes pg smartboad notes ch 3.

Similar documents
Describing Bivariate Relationships

5.1 Bivariate Relationships

Scatterplots and Correlation

AP Statistics Two-Variable Data Analysis

Bivariate Data Summary

3.2: Least Squares Regressions

BIVARIATE DATA data for two variables

Related Example on Page(s) R , 148 R , 148 R , 156, 157 R3.1, R3.2. Activity on 152, , 190.

Chapter 3: Describing Relationships

The response variable depends on the explanatory variable.

Chapter 3: Examining Relationships

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation

Chapter 3: Describing Relationships

Determine is the equation of the LSRL. Determine is the equation of the LSRL of Customers in line and seconds to check out.. Chapter 3, Section 2

Lecture 4 Scatterplots, Association, and Correlation

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation

Lecture 4 Scatterplots, Association, and Correlation

Objectives. 2.3 Least-squares regression. Regression lines. Prediction and Extrapolation. Correlation and r 2. Transforming relationships

Chapter 8. Linear Regression /71

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

AP Statistics L I N E A R R E G R E S S I O N C H A P 7

Scatterplots. 3.1: Scatterplots & Correlation. Scatterplots. Explanatory & Response Variables. Section 3.1 Scatterplots and Correlation

appstats8.notebook October 11, 2016

Review of Regression Basics

Prob/Stats Questions? /32

Chapter 12 Summarizing Bivariate Data Linear Regression and Correlation

The following formulas related to this topic are provided on the formula sheet:

Chapter 5 Least Squares Regression

6.1.1 How can I make predictions?

1. Create a scatterplot of this data. 2. Find the correlation coefficient.

What is the easiest way to lose points when making a scatterplot?

Chapter 7. Scatterplots, Association, and Correlation

Summarizing Data: Paired Quantitative Data

AP STATISTICS Name: Period: Review Unit IV Scatterplots & Regressions

Least-Squares Regression. Unit 3 Exploring Data

y n 1 ( x i x )( y y i n 1 i y 2

Relationships Regression

Looking at data: relationships

Chapter 6. September 17, Please pick up a calculator and take out paper and something to write with. Association and Correlation.

Reminder: Univariate Data. Bivariate Data. Example: Puppy Weights. You weigh the pups and get these results: 2.5, 3.5, 3.3, 3.1, 2.6, 3.6, 2.

Unit 6 - Introduction to linear regression

Steps to take to do the descriptive part of regression analysis:

MODELING. Simple Linear Regression. Want More Stats??? Crickets and Temperature. Crickets and Temperature 4/16/2015. Linear Model

Overview. 4.1 Tables and Graphs for the Relationship Between Two Variables. 4.2 Introduction to Correlation. 4.3 Introduction to Regression 3.

CHAPTER 3 Describing Relationships

Chapter 3: Examining Relationships

Chapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation

Inferences for Regression

Announcements. Lecture 18: Simple Linear Regression. Poverty vs. HS graduate rate

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to:

Unit 6 - Simple linear regression

Linear Regression Communication, skills, and understanding Calculator Use

Important note: Transcripts are not substitutes for textbook assignments. 1

Chapter 9. Correlation and Regression

Chapter 5 Friday, May 21st

Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall)

Mrs. Poyner/Mr. Page Chapter 3 page 1

9. Linear Regression and Correlation

a. Length of tube: Diameter of tube:

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis

Section I: Multiple Choice Select the best answer for each question.

Examining Relationships. Chapter 3

Introduce Exploration! Before we go on, notice one more thing. We'll come back to the derivation if we have time.

appstats27.notebook April 06, 2017

Chapter 6: Exploring Data: Relationships Lesson Plan

Basic Practice of Statistics 7th

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Stat 101: Lecture 6. Summer 2006

Mathematical Notation Math Introduction to Applied Statistics

Analysis of Bivariate Data

7. Do not estimate values for y using x-values outside the limits of the data given. This is called extrapolation and is not reliable.

Chapter 10 Correlation and Regression

AP Statistics Bivariate Data Analysis Test Review. Multiple-Choice

Chapter 4 Data with Two Variables

AP Stats ~ 3A: Scatterplots and Correlation OBJECTIVES:

Chapter 4 Describing the Relation between Two Variables

Correlation and Regression

Chapter 7 Summary Scatterplots, Association, and Correlation

Chapter 4 Data with Two Variables

IF YOU HAVE DATA VALUES:

Announcements. Lecture 10: Relationship between Measurement Variables. Poverty vs. HS graduate rate. Response vs. explanatory

AP Final Review II Exploring Data (20% 30%)

Stat 101 Exam 1 Important Formulas and Concepts 1

Test 3A AP Statistics Name:

NAME: DATE: SECTION: MRS. KEINATH

Chapter 2: Looking at Data Relationships (Part 3)

The Simple Linear Regression Model

M 140 Test 1 B Name (1 point) SHOW YOUR WORK FOR FULL CREDIT! Problem Max. Points Your Points Total 75

PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics

Conditions for Regression Inference:

Least Squares Regression

1) A residual plot: A)

Looking at Data Relationships. 2.1 Scatterplots W. H. Freeman and Company

Example: Can an increase in non-exercise activity (e.g. fidgeting) help people gain less weight?

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals

Business Statistics. Lecture 9: Simple Regression

YEAR 10 GENERAL MATHEMATICS 2017 STRAND: BIVARIATE DATA PART II CHAPTER 12 RESIDUAL ANALYSIS, LINEARITY AND TIME SERIES

Correlation & Simple Regression

Introduction to Regression

Transcription:

Nov 13 AP STAT 1. Check/rev HW 2. Review/recap of notes 3. HW: pg 179 184 #5,7,8,9,11 and read/notes pg 185 188 1

Chapter 3 Notes Review Exploring relationships between two variables. BIVARIATE DATA Is there a relationship and if so what is it? Categorical variables can now play a role Review: Response Variable and Explanatory Variable Response Var: Measures the outcome dependant variables Explanatory Var: What's being studied, what do we want explained, what explains the respnse variable. independent vairables. Though Independent and dependent mean something unrelated to this in statistics which is why we don't really use those terms here. Ex's: Alcohol and body temp. and Math SAT vs Verbal SAT whas there a relationship. Can you predict math score if you know the verbal score. 2

(do not consider the outlier rule) Scatter Plots and Correlation A SCATTER PLOT is the most effective way to display a relationship between quantitative data Be sure to label axes. Scale horizontal and vertical axes in uniform intervals Explanatory var is usally x, response is usually y Outliers are usually deviations that stand out from the rest of the data. Categoriacal data/info can be displayed with a different color or mark to determine if a pattern shows up (acts sort of like a third variable) can be used to make predictions based on pattern Directions/Associations Positive, negative, or none Form Are there Clusters Strength How close do they follow that form. 3

Correlation The correlation coefficient "r" Measures the strength of the relationship between the qualitative data 1 r 1 Closer it is to 1 or 1, the stronger the relationship. pos/neg is directional. Non resistant not a complete summary of the relationship its a start. Example pg 186: body weight vs backpack weight Body weight(lbs): 120 187 109 103 131 165 158 116 Backpack weight (lbs): 26 30 26 24 29 35 31 28 Make a scatter plot and determine r. 4

5

AP STAT Nov 14 1. Check/Review HW/compare with neighbor what Qs do you have whole group...(10 min) 2. Activity with scatter plot using a scatter plot to solve a mystery 3. HW: pg 193 197 #15,16,19,21,22,24 read and take notes 199 203 (notes quiz) mult 2.54 to convert in to cm 6

Nov 15 AP STAT objective: students will write regression and least squares regression equations to make predictions. 1. Notes quiz HKREILLY 2. check and rev HW probs 3. notes review 4. least sq regression equation (notes) 5: HW : pg 204 205 #29,31 pg 211 213 #33,36,37 7

pg199 203 Regression Line (our eyeballed line) Regression requires an explanatory variable and response variable. a line that describes how a response variable changes as an explanatory variable changes makes predictions mathematical model use y=a+bx slope doesn't tell you how important a relationship is our M &M mystery 8

Extrapolation using the regression equation to make predictions outside the scope of the data set/range. Usually inaccurate. Interpolation using regression line to make predictions within a range of data 9

Least Squares Regression Line this is the line that makes the sum of the vertical (y) distances of data points from the line as small as possible (a "true" best fit) a way to get a regression line that does not depend on a guess (eyeball) since we use form y= a + bx for LRL, use same format but for least squares regression we will use y(hat) or y = a + bx LSRL «where b= r(s y /S x ) and a= y bx y hat usually not exact as the actual observ but still provides good info. 10

deviations error 11

Let's write a LSRL Given: x(mean)= 19.65 Sx= 2.14 y(mean) = 169.29 Sy=10.69 r=.705 (from eyeball line). 12

In calc 1. data use mystery data. input into L1 and L2 2. Stat>Calc>#8> calculate 13

Looking Ahead: Residual plots being able to see the shape of the residuals is also telling about our prediction equation. the residual plot is a scatter plot of the residuals against the explanatory var(x/l1). Keeping in mind that the sum of the residuals should be zero or close to zero, the "trend" should be a horizontal line (slope=0). the residual plot tells us how well our regression equation fits our data. 14

If fanning of your residuals happens to take place, it indicates that your predicted y values (y hat) will be more inaccurate as your x increases. and vice versa 15

Lets make one: use list from M & M 1. L3: highlight it and type Y1(L1) enter it will fill your column 2. L4: highlight it and type L2 L3 3. now turn plot 1 off and turn plot 2 on nad select scatter plot with L1(x) and L4(y) 4. Turn off your Y= equations 5. Zoom #9 (zoom stat) 16

Typical prediction ERROR we use the standard deviation of the residuals. use 1 var stats wait for teacher to explain what to do. 17

Monday 11/20 AP STAT Objective: Students will use the coefficient of determination to describe LSRL and variation. 1. NOTES QUIZ: HKREILLY 2. Check/rev HW questions? 3. Review of notes 4. HW: pg 227 229 #44,48 Pg 230 233 #49,51,52,53,58 QUIZ WEDNESDAY 18

AP STAT Notes The coefficient of determination r 2 R sq is our correlation coefficient r...squared. This value essentially describes the percent variation that is due to the LSRL that we use to make predictions. The higher the value the better... For example: if r 2 =.82, then 82% of the variation is due to the given regression equation. The remaining "variation" of 18% would be due to the individual or subject. 82% is predictable from equation. The other 18%is variable for some other reason if r 2 =.60 that means our equation is relating the data "well" only about 60 percent of the time. the remaining 40 percent is individual which may say the LSRL isn't too great at helping make prediction or relating the two variables 19

Caution: Association is not causation. That is, just because a data set is characterized by having a large r squared value, it does not imply that x causes the changes in y. 20

4 Facts about Least squares regression 1. The distinction between explanatory and response variables is essential 2. There is a close connection between correlation and slope of the least squares regression line 3. The least squares regression line always passes through (x,y) 4. r 2 is the fraction of variation in the values of y that is explained by the least squares regression of y on x. other stuff: Since r 2 is a proportion, it is always a number between 0 and 1. If r 2 = 1, all of the data points fall perfectly on the regression line. The predictor x accounts for all of the variation in y! If r 2 = 0, the estimated regression line is perfectly horizontal. The predictor x accounts for none of the variation in y! 21

Nov 21 AP STAT Objective: Students will review topics in scatterplots, correlation, LSRL, and r sq 1. check HW and go over any questions 2. handout practice for quiz key will be on webpage 3. Be aware that the packet is not the end allreview all of sections 3.1,3.2 you will need to EXPLAIN answers on quiz 22

Nov 23 AP Stat 7:25 8:25 10:55 11:55 QUIZ! Pencil, calc spread out Have a great Thanksgiving weekend!!!???!!! There is HW on your calendar 23

Nov 27 AP Stat TEST WEDNESDAY 11/29 1. Section 3.3 Recap 2. Practice: pg 238 #62, pg 244 #70,72 24

Section 3.3 Correlation and Regression Wisdom -correlation and regression describe linear relationships -Extrapolation- unreliable predictions, predictions make no sense -Correlation- nonresistant (bc of SD- if points added changes SD) Outliers and Influential Observations: Subjective: how one point can change things LSRL or r. the graph/visual is important. Outlier: in real data gathering-check for error in collection/testing not all outliers are influential outliers are unusual data: in y direction, large residuals in x direction will not have large residuals Influential Data: influential data will have small residuals on x-not noticeable-thus scatter plot is crucial. pulls LSRL toward them (in x) so will not be noticeable on the residual plot- this means it will increase y intercept of line and decrease slope data point is influential if removing it changes the results in calculations If questionable point, find LSRL for both (with point and without point)- if changes more than slightly, then the point is influential 25

Making Decisions: Keep or throw away may need more data for that influential data or decide to exclude it. (Slow talker example) Lurking Variables: can make correlation and regression misleading not among explanatory or response variables but have influence in interpreting relationship between the initial variables/testing variables Lurking var can also show a nonsense relationship Lurking var can also disguise a relationship Remember Association does not imply causation. Other Odds and ends: remember that correlation (r) is based on z scores which are already standardized. Therefore, changing units of measure in an equation does not change the correlation 26

Review: pg 250-255 #77,78,79,85 27