BIVARIATE DATA data for two variables

Size: px
Start display at page:

Download "BIVARIATE DATA data for two variables"

Transcription

1 (Chapter 3) BIVARIATE DATA data for two variables INVESTIGATING RELATIONSHIPS We have compared the distributions of the same variable for several groups, using double boxplots and back-to-back stemplots. Now, we will compare the distributions of several variables for the same group of individuals. We can investigate linear relationships between the variables and provide a model that can be used to predict values of a response variable using values of an explanatory varaible. To gain insight into the relationship between two quantitative variables, the following graphs can be used: Scatterplot Residual plot Scatterplot used to investigate the relationship between two quantitative variables, to show how much one variable is related to the other A scatterplot is used to display the strength, direction, and form of the relationship between two quantitative variables measured on the same group of individuals. When the variables are related, the x-axis contains the explanatory variable, the reason/explanation for the outcome, and the y-axis contains the response variable, the outcome/resulting response. When the scatterplot shows a linear relationship, we can interpret the scatterplot by looking for an overall pattern and for deviations from that pattern [outliers (in the y direction) and influential observations (in the x direction)] and by describing the strength, direction, and form of the relationship. Look for patterns: Strength: Pattern of close points with little scatter indicates strong relationship while scattered, spread-out points indicates weak relationship Direction: Increasing positive direction associates larger values of one variable with larger values of the other variable while decreasing negative direction associates larger values of one variable with smaller values of the other variable Form: Linear, nonlinear (curved/exponential, cluster, etc.) Indicators of categorical information can be added to the points on the scatterplot using symbols.

2 Residual plot graph of the residuals (on the vertical axis) and the independent variable (on the horizontal axis) A residual plot is a graph of the residuals versus the x-values, with a horizontal regression line at y = 0. It is used to verify that a linear model is appropriate for a data set. Evidence of a linear relationship appears as a graph of points with no pattern, randomly scattered above and below the horizontal line, without shape or direction and without curves. A definite pattern indicates that a nonlinear model is appropriate instead. A residual is the difference between an observed value (an actual y-value from the data set) and a predicted value (a predicted y-value from the linear regression equation) y - y ˆ. The mean of the residuals is always zero. Residuals on calculator To create a list of residuals in calculator: 1. Enter x-values in L 1 and y-values in 2. 2 nd, LIST, RESID, Enter 3. STO, 2 nd, L 3 To view residual plot on calculator: 1. 2 nd, STAT PLOT, Enter 2. Choose scatterplot (first graph top row) 3. Assign Xlist as x-values and Ylist as residuals, Enter 4. ZOOM, 9 To show residuals in calculator: 1. 2 nd Catalog, Diagnostics On, Enter 2. STAT, CALC, LinReg a + bx, Enter 3. 2 nd STAT PLOT, ZOOM, 9 L 2

3 Correlation When the scatterplot shows a fairly straight line and there is no pattern in the residual plot, correlation coefficient r may be a useful numerical summary/measurement of the strength of the linear relationship (but it is not a complete description of two-variable data). Correlation coefficient r measures the strength (and indicates the direction) of a linear relationship between any two quantitative variables. Correlation r values range from -1 to 1, with a strong correlation near these extremes and a weaker correlation at values near 0. Correlation is 0 if it does not yield a linear scatterplot, but not necessarily that there is no relationship; correlation is 1 or -1 when the linear relationship is exact, but correlation 1 or -1 does not guarantee a linear relationship. Don t trust your eyes to interpret the strength of a linear relationship on a scatterplot; consider the scaling, zoom in/out. Correlation coefficient r: r = 1 Σ n 1 (x i x s x ) ( y i y s y ) where: x i each x-value y i each y-value standard deviation of x-values s x (Calculator: STAT, CALC, 2-Var Stats) Although standard deviation and, therefore, slope, are affected by what units you choose for x and y, correlation r is not affected by what units you choose for x and y nor which variable is labeled x and y. When you reverse x and y [ (x, y) (y, x)], correlation r stays the same but the regression line will be different. If you want to make predictions in the other direction (the effect of y on x), then you must create another model completely. Calculated using means and standard deviations, the correlation coefficient is not resistant, so use it cautiously when the data set contains outliers. With standardized values, r is equal to the slope of the regression line. Remember that evidence of a relationship is not evidence of causation and that lurking variables may be affecting the variables, resulting in what may only appear to be a correlation. Calculator to compute correlation r: 1) Enter data values in L 1 and L 2 (Calculator: STAT, Edit) 2) Define L 3 : L 3 = ( L 1 x ) ( L 2 y ) (Calculator to find variables: VARS, Statistics) 1 s x s y 3) Enter formula: sum(l n 1 3) (Calculator to find sum : 2 nd, LIST, MATH)

4 Least Squares Regression A least-squares regression line (y = a + bx) is a mathematical model for the overall pattern of a linear relationship between an explanatory variable and a response variable. Using the scatterplot and patterns (and some formulas/calculator), we can create an equation to summarize the linear explanatory/response relationship between the two quantitative variables by performing least-squares regression. This best-fitting straight line, known as the Least Squares Regression Line, LSRL, is the line that minimizes the sum of squares of the vertical distances of data points from the line. This regression line describes how response variable y changes as explanatory variable x changes. It is created using means x and y, standard deviations s x and s y, and correlation r, and it always passes through the point (x, y ). The LSRL is used to predict y-values for x-values, but beware of extrapolation, predicting for x-values that are beyond the scope of the original data set. Least Squares Regression Line, LSRL: y = a + bx where: b = r s y (b: slope) s x a = y bx (a: y-intercept) (Calculator: STAT, CALC, LinReg(a+bx)) The LSRL equation in point-slope form: ŷ = predicted y-value æ y - y = r S ö y ç (x - x). è ø Be sure to interpret the slope in context of the problem and to express it as, For each additional (unit) in (variable x), the predicted (variable y) should (increase/decrease) by (slope, in units) on average. S x SAMPLE: yˆ x For each additional inch in length, the predicted weight should increase by 1.5 pounds on average. Although correlation r and a regression line can be found for any two variables, they are only useful for linear relationships.

5 Coefficient of determination While the variation in the residuals allows us to assess how well the regression model fits the data, the coefficient of determination r 2 allows us to assess the usefulness of the LSRL as it is the ratio of the variation in the values of y that is explained by the least-squares regression of y on x. While correlation r is given as a decimal (between -1 and +1), the coefficient of determination r 2 is usually given as a percentage (with 100% as a perfect fit). Remember to interpret r 2 in context of the problem. We will write a sentence in the form, About (%) of the variability in (variable y) is explained by the LSRL of (variable y) on (variable x). Computer Regression With computer outputs, we focus on y-intercept a, slope b, independent variable x, dependent variable y, correlation coefficient r, and coefficient of determination r 2. SAMPLE 1: SAMPLE 2:

6 EXAMPLE: a. Write the LSRL for the following computer print-out. b. Write a sentence to interpret coefficient of determination. EXAMPLE: a. Write the LSRL for the following computer print-out.

7 b. Write a sentence to interpret coefficient of determination. EXAMPLE: An article in the August 1997 issue of the Journal of the American Medical Association reported on a survey that tracked emergency room visits at randomly selected hospitals nationwide. The data on the number of jet skis in use, the number of accidents, and the number of fatalities for the years follow: Year Number in use Accidents Fatalities , , , ,376 1, ,915 1, ,283 1, ,545 2, ,000 3, ,000 4, ,000 4, Examine the relationship between number of jet skis in use and number of accidents. 1) Make a scatter plot (title, any necessary breaks, labeled axes including units, etc.) (Calculator) Enter data: STAT, EDIT, explanatory variable in L 1, response variable in L 2. (Calculator) View scatter plot: 2 nd, STAT PLOT, etc. Interpret the scatter plot. --Address the strength, direction, and form. There is a very strong positive linear relationship between the number of jet skis in use and the number of accidents.

8 2) View and interpret the residual plot, a scatterplot of x versus residuals, to determine if LSRL is a good fit, the suitability of LRSL. The residual plot shows no distinct pattern, so we will proceed with linear regression. 3) Calculate correlation coefficient r, and determine if value is reasonable. --Formula for correlation coefficient: r = 1 n -1 -Work for correlation coefficient: å æ ç è x i - x S x ö æ y i - y ö ç ø è S y ø (Calculator) Find mean and standard deviation of each variable: STAT, CALC, 2-Var stats. r = 1 æ 92, ,227ö æ 376-1,947ö æ 900, ,227ö æ 4,010-1,947ö ç ç ç ç 10-1è 273,923 ø è 1,335 ø è 273,923 ø è 1,335 ø -Value of correlation coefficient: (Calculator) Find correlation coefficient r: STAT, CALC, LinReg(a+bx) (Make sure calculator is programmed for DiagnosticOn under 2 nd, CATALOG.) r = Address reasonableness by noting strength and direction. The correlation coefficient is reasonable because it shows a very strong positive linear correlation between number of jet skis in use and number of accidents. 4) Find LSRL and plot. --Use point-slope form for Least Squares Regression Line (see formula in above notes). (show formula work here ) we can also sometimes use a and b for Least Squares Regression Line (see formula in above notes). (Calculator) Find a and b: STAT, CALC, LinReg(a+bx) y ˆ = a + bx y ˆ = x Number of accidents = (number of jet skis in use) --Plot the LSRL onto scatter plot by making up any two x-values that are not part of data set and plugging into LSRL to find y. Plot and connect to form a line. (150,000, ) and (340,000, ) (150,000, 719.2) and (340,000, ) (Label points using a symbol/color to differentiate between actual points.) 5) Interpret slope.

9 --Put in terms of unit change: as x goes up by 1, y (increases/decreases) by b. ( For each additional (unit) in (x variable), the predicted (y variable) should (increase/decrease) by (slope, in units) on average. ) For each additional number of jet skis in use, the predicted number of accidents should increase by.0048 on average. 6) Compute and interpret the coefficient of determination r 2, the proportion of variation that is defined by the line, the ratio of explained error to total error. --Calculate SST and SSE: å SST (Sum of Squares for Total) (y - y ) 2 (1) Highlight L 3 and define as (L 2 - y ) 2. (Find y under VARS, Statistics) (2) Find sum of L 3. (2 nd, STAT, MATH, sum) SST is 16,042, SSE (Sum of Squares for Error) å (y - y ˆ ) 2 (1) Highlight L 4 and define as (L 2 -Y 1 (L 1 )) 2. (Find Y 1 under VARS, Y-VARS, Function) (2) Find sum of L 4. (2 nd, STAT, MATH, sum) SSE is 283, Use SST and SSE to find r 2 : r 2 = SST - SSE SST = SSP SST The coefficient of determination is Interpretation: About 98% of the variability in the number of accidents is explained by the LSRL of the number of accidents on the number of jet skis in use. 7) Predict the number of accidents when the number of jet skis in use is 200, Input 200,000 as the x-variable into the LSRL to make the prediction. y ˆ = x y ˆ = (200,000) y ˆ = *Be careful to not extrapolate beyond the range of data points. 8) Show how the residual value for 600,000 jet skis in use is calculated. --Residuals (errors): vertical distances from the points to the line (Observed value) (Predicted value of y) y - y ˆ When x = 600,000, y = 3002 (found from table of data values)

10 Residual: y - ˆ y 3002 Y 1 (600,000) (Calculator) 2 nd, STAT, RESID, STO( L 5 or ). Scroll to same line as the given x in L 1 the residual on that same line (600,000) to find Positive residuals indicate LSRL is below the y-value (actual y is above predicted y), and negative residuals indicate the LSRL is above the y-value.

Bivariate Data Summary

Bivariate Data Summary Bivariate Data Summary Bivariate data data that examines the relationship between two variables What individuals to the data describe? What are the variables and how are they measured Are the variables

More information

Describing Bivariate Relationships

Describing Bivariate Relationships Describing Bivariate Relationships Bivariate Relationships What is Bivariate data? When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response variables Plot the data

More information

5.1 Bivariate Relationships

5.1 Bivariate Relationships Chapter 5 Summarizing Bivariate Data Source: TPS 5.1 Bivariate Relationships What is Bivariate data? When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response variables

More information

CHAPTER 3 Describing Relationships

CHAPTER 3 Describing Relationships CHAPTER 3 Describing Relationships 3.1 Scatterplots and Correlation The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Scatterplots and Correlation Learning

More information

Session 4 2:40 3:30. If neither the first nor second differences repeat, we need to try another

Session 4 2:40 3:30. If neither the first nor second differences repeat, we need to try another Linear Quadratics & Exponentials using Tables We can classify a table of values as belonging to a particular family of functions based on the math operations found on any calculator. First differences

More information

Least-Squares Regression. Unit 3 Exploring Data

Least-Squares Regression. Unit 3 Exploring Data Least-Squares Regression Unit 3 Exploring Data Regression Line A straight line that describes how a variable,, changes as an variable,, changes unlike, requires an and variable used to predict the value

More information

AP Statistics Two-Variable Data Analysis

AP Statistics Two-Variable Data Analysis AP Statistics Two-Variable Data Analysis Key Ideas Scatterplots Lines of Best Fit The Correlation Coefficient Least Squares Regression Line Coefficient of Determination Residuals Outliers and Influential

More information

Prob/Stats Questions? /32

Prob/Stats Questions? /32 Prob/Stats 10.4 Questions? 1 /32 Prob/Stats 10.4 Homework Apply p551 Ex 10-4 p 551 7, 8, 9, 10, 12, 13, 28 2 /32 Prob/Stats 10.4 Objective Compute the equation of the least squares 3 /32 Regression A scatter

More information

6.1.1 How can I make predictions?

6.1.1 How can I make predictions? CCA Ch 6: Modeling Two-Variable Data Name: Team: 6.1.1 How can I make predictions? Line of Best Fit 6-1. a. Length of tube: Diameter of tube: Distance from the wall (in) Width of field of view (in) b.

More information

Least Squares Regression

Least Squares Regression Least Squares Regression Sections 5.3 & 5.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 14-2311 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

Relationships Regression

Relationships Regression Relationships Regression BPS chapter 5 2006 W.H. Freeman and Company Objectives (BPS chapter 5) Regression Regression lines The least-squares regression line Using technology Facts about least-squares

More information

Objectives. 2.3 Least-squares regression. Regression lines. Prediction and Extrapolation. Correlation and r 2. Transforming relationships

Objectives. 2.3 Least-squares regression. Regression lines. Prediction and Extrapolation. Correlation and r 2. Transforming relationships Objectives 2.3 Least-squares regression Regression lines Prediction and Extrapolation Correlation and r 2 Transforming relationships Adapted from authors slides 2012 W.H. Freeman and Company Straight Line

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Section 3.2 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 3 Describing Relationships 3.1 Scatterplots and Correlation 3.2 Section 3.2

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Section 3.2 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 3 Describing Relationships 3.1 Scatterplots and Correlation 3.2 Section 3.2

More information

s e, which is large when errors are large and small Linear regression model

s e, which is large when errors are large and small Linear regression model Linear regression model we assume that two quantitative variables, x and y, are linearly related; that is, the the entire population of (x, y) pairs are related by an ideal population regression line y

More information

Sociology 6Z03 Review I

Sociology 6Z03 Review I Sociology 6Z03 Review I John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review I Fall 2016 1 / 19 Outline: Review I Introduction Displaying Distributions Describing

More information

Steps to take to do the descriptive part of regression analysis:

Steps to take to do the descriptive part of regression analysis: STA 2023 Simple Linear Regression: Least Squares Model Steps to take to do the descriptive part of regression analysis: A. Plot the data on a scatter plot. Describe patterns: 1. Is there a strong, moderate,

More information

Scatterplots and Correlation

Scatterplots and Correlation Bivariate Data Page 1 Scatterplots and Correlation Essential Question: What is the correlation coefficient and what does it tell you? Most statistical studies examine data on more than one variable. Fortunately,

More information

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation AP Statistics Chapter 6 Scatterplots, Association, and Correlation Objectives: Scatterplots Association Outliers Response Variable Explanatory Variable Correlation Correlation Coefficient Lurking Variables

More information

IF YOU HAVE DATA VALUES:

IF YOU HAVE DATA VALUES: Unit 02 Review Ways to obtain a line of best fit IF YOU HAVE DATA VALUES: 1. In your calculator, choose STAT > 1.EDIT and enter your x values into L1 and your y values into L2 2. Choose STAT > CALC > 8.

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Chapter 3: Examining Relationships Most statistical studies involve more than one variable. Often in the AP Statistics exam, you will be asked to compare two data sets by using side by side boxplots or

More information

Summarizing Data: Paired Quantitative Data

Summarizing Data: Paired Quantitative Data Summarizing Data: Paired Quantitative Data regression line (or least-squares line) a straight line model for the relationship between explanatory (x) and response (y) variables, often used to produce a

More information

Chapter 8. Linear Regression /71

Chapter 8. Linear Regression /71 Chapter 8 Linear Regression 1 /71 Homework p192 1, 2, 3, 5, 7, 13, 15, 21, 27, 28, 29, 32, 35, 37 2 /71 3 /71 Objectives Determine Least Squares Regression Line (LSRL) describing the association of two

More information

Example: Can an increase in non-exercise activity (e.g. fidgeting) help people gain less weight?

Example: Can an increase in non-exercise activity (e.g. fidgeting) help people gain less weight? Example: Can an increase in non-exercise activity (e.g. fidgeting) help people gain less weight? 16 subjects overfed for 8 weeks Explanatory: change in energy use from non-exercise activity (calories)

More information

Nov 13 AP STAT. 1. Check/rev HW 2. Review/recap of notes 3. HW: pg #5,7,8,9,11 and read/notes pg smartboad notes ch 3.

Nov 13 AP STAT. 1. Check/rev HW 2. Review/recap of notes 3. HW: pg #5,7,8,9,11 and read/notes pg smartboad notes ch 3. Nov 13 AP STAT 1. Check/rev HW 2. Review/recap of notes 3. HW: pg 179 184 #5,7,8,9,11 and read/notes pg 185 188 1 Chapter 3 Notes Review Exploring relationships between two variables. BIVARIATE DATA Is

More information

Reminder: Univariate Data. Bivariate Data. Example: Puppy Weights. You weigh the pups and get these results: 2.5, 3.5, 3.3, 3.1, 2.6, 3.6, 2.

Reminder: Univariate Data. Bivariate Data. Example: Puppy Weights. You weigh the pups and get these results: 2.5, 3.5, 3.3, 3.1, 2.6, 3.6, 2. TP: To review Standard Deviation, Residual Plots, and Correlation Coefficients HW: Do a journal entry on each of the calculator tricks in this lesson. Lesson slides will be posted with notes. Do Now: Write

More information

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis 4.1 Introduction Correlation is a technique that measures the strength (or the degree) of the relationship between two variables. For example, we could measure how strong the relationship is between people

More information

Chapter 2: Looking at Data Relationships (Part 3)

Chapter 2: Looking at Data Relationships (Part 3) Chapter 2: Looking at Data Relationships (Part 3) Dr. Nahid Sultana Chapter 2: Looking at Data Relationships 2.1: Scatterplots 2.2: Correlation 2.3: Least-Squares Regression 2.5: Data Analysis for Two-Way

More information

3.2: Least Squares Regressions

3.2: Least Squares Regressions 3.2: Least Squares Regressions Section 3.2 Least-Squares Regression After this section, you should be able to INTERPRET a regression line CALCULATE the equation of the least-squares regression line CALCULATE

More information

Linear Regression Communication, skills, and understanding Calculator Use

Linear Regression Communication, skills, and understanding Calculator Use Linear Regression Communication, skills, and understanding Title, scale and label the horizontal and vertical axes Comment on the direction, shape (form), and strength of the relationship and unusual features

More information

Unit 6 - Simple linear regression

Unit 6 - Simple linear regression Sta 101: Data Analysis and Statistical Inference Dr. Çetinkaya-Rundel Unit 6 - Simple linear regression LO 1. Define the explanatory variable as the independent variable (predictor), and the response variable

More information

Describing the Relationship between Two Variables

Describing the Relationship between Two Variables 1 Describing the Relationship between Two Variables Key Definitions Scatter : A graph made to show the relationship between two different variables (each pair of x s and y s) measured from the same equation.

More information

y n 1 ( x i x )( y y i n 1 i y 2

y n 1 ( x i x )( y y i n 1 i y 2 STP3 Brief Class Notes Instructor: Ela Jackiewicz Chapter Regression and Correlation In this chapter we will explore the relationship between two quantitative variables, X an Y. We will consider n ordered

More information

Overview. 4.1 Tables and Graphs for the Relationship Between Two Variables. 4.2 Introduction to Correlation. 4.3 Introduction to Regression 3.

Overview. 4.1 Tables and Graphs for the Relationship Between Two Variables. 4.2 Introduction to Correlation. 4.3 Introduction to Regression 3. 3.1-1 Overview 4.1 Tables and Graphs for the Relationship Between Two Variables 4.2 Introduction to Correlation 4.3 Introduction to Regression 3.1-2 4.1 Tables and Graphs for the Relationship Between Two

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Chapter 3: Examining Relationships 3.1 Scatterplots 3.2 Correlation 3.3 Least-Squares Regression Fabric Tenacity, lb/oz/yd^2 26 25 24 23 22 21 20 19 18 y = 3.9951x + 4.5711 R 2 = 0.9454 3.5 4.0 4.5 5.0

More information

Unit 6 - Introduction to linear regression

Unit 6 - Introduction to linear regression Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,

More information

Ch Inference for Linear Regression

Ch Inference for Linear Regression Ch. 12-1 Inference for Linear Regression ACT = 6.71 + 5.17(GPA) For every increase of 1 in GPA, we predict the ACT score to increase by 5.17. population regression line β (true slope) μ y = α + βx mean

More information

Scatterplots. 3.1: Scatterplots & Correlation. Scatterplots. Explanatory & Response Variables. Section 3.1 Scatterplots and Correlation

Scatterplots. 3.1: Scatterplots & Correlation. Scatterplots. Explanatory & Response Variables. Section 3.1 Scatterplots and Correlation 3.1: Scatterplots & Correlation Scatterplots A scatterplot shows the relationship between two quantitative variables measured on the same individuals. The values of one variable appear on the horizontal

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression OI CHAPTER 7 Important Concepts Correlation (r or R) and Coefficient of determination (R 2 ) Interpreting y-intercept and slope coefficients Inference (hypothesis testing and confidence

More information

Chapter 5 Friday, May 21st

Chapter 5 Friday, May 21st Chapter 5 Friday, May 21 st Overview In this Chapter we will see three different methods we can use to describe a relationship between two quantitative variables. These methods are: Scatterplot Correlation

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

Important note: Transcripts are not substitutes for textbook assignments. 1

Important note: Transcripts are not substitutes for textbook assignments. 1 In this lesson we will cover correlation and regression, two really common statistical analyses for quantitative (or continuous) data. Specially we will review how to organize the data, the importance

More information

Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall)

Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall) Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall) We will cover Chs. 5 and 6 first, then 3 and 4. Mon,

More information

Chapter 12 Summarizing Bivariate Data Linear Regression and Correlation

Chapter 12 Summarizing Bivariate Data Linear Regression and Correlation Chapter 1 Summarizing Bivariate Data Linear Regression and Correlation This chapter introduces an important method for making inferences about a linear correlation (or relationship) between two variables,

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

S12 - HS Regression Labs Workshop. Linear. Quadratic (not required) Logarithmic. Exponential. Power

S12 - HS Regression Labs Workshop. Linear. Quadratic (not required) Logarithmic. Exponential. Power Summer 2006 I2T2 Probability & Statistics Page 181 S12 - HS Regression Labs Workshop Regression Types: Needed for Math B Linear Quadratic (not required) Logarithmic Exponential Power You can calculate

More information

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation Scatterplots and Correlation Name Hr A scatterplot shows the relationship between two quantitative variables measured on the same individuals. variable (y) measures an outcome of a study variable (x) may

More information

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION In this lab you will learn how to use Excel to display the relationship between two quantitative variables, measure the strength and direction of the

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

appstats8.notebook October 11, 2016

appstats8.notebook October 11, 2016 Chapter 8 Linear Regression Objective: Students will construct and analyze a linear model for a given set of data. Fat Versus Protein: An Example pg 168 The following is a scatterplot of total fat versus

More information

Name. The data below are airfares to various cities from Baltimore, MD (including the descriptive statistics).

Name. The data below are airfares to various cities from Baltimore, MD (including the descriptive statistics). Name The data below are airfares to various cities from Baltimore, MD (including the descriptive statistics). 178 138 94 278 158 258 198 188 98 179 138 98 N Mean Std. Dev. Min Q 1 Median Q 3 Max 12 166.92

More information

Determine is the equation of the LSRL. Determine is the equation of the LSRL of Customers in line and seconds to check out.. Chapter 3, Section 2

Determine is the equation of the LSRL. Determine is the equation of the LSRL of Customers in line and seconds to check out.. Chapter 3, Section 2 3.2c Computer Output, Regression to the Mean, & AP Formulas Be sure you can locate: the slope, the y intercept and determine the equation of the LSRL. Slope is always in context and context is x value.

More information

a. Length of tube: Diameter of tube:

a. Length of tube: Diameter of tube: CCA Ch 6: Modeling Two-Variable Data Name: 6.1.1 How can I make predictions? Line of Best Fit 6-1. a. Length of tube: Diameter of tube: Distance from the wall (in) Width of field of view (in) b. Make a

More information

AP Statistics. Chapter 9 Re-Expressing data: Get it Straight

AP Statistics. Chapter 9 Re-Expressing data: Get it Straight AP Statistics Chapter 9 Re-Expressing data: Get it Straight Objectives: Re-expression of data Ladder of powers Straight to the Point We cannot use a linear model unless the relationship between the two

More information

Chapter 4 Data with Two Variables

Chapter 4 Data with Two Variables Chapter 4 Data with Two Variables 1 Scatter Plots and Correlation and 2 Pearson s Correlation Coefficient Looking for Correlation Example Does the number of hours you watch TV per week impact your average

More information

CHAPTER 3 Describing Relationships

CHAPTER 3 Describing Relationships CHAPTER 3 Describing Relationships 3.1 Scatterplots and Correlation The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Scatterplots and Correlation Learning

More information

Looking at data: relationships

Looking at data: relationships Looking at data: relationships Least-squares regression IPS chapter 2.3 2006 W. H. Freeman and Company Objectives (IPS chapter 2.3) Least-squares regression p p The regression line Making predictions:

More information

Chapter 9. Correlation and Regression

Chapter 9. Correlation and Regression Chapter 9 Correlation and Regression Lesson 9-1/9-2, Part 1 Correlation Registered Florida Pleasure Crafts and Watercraft Related Manatee Deaths 100 80 60 40 20 0 1991 1993 1995 1997 1999 Year Boats in

More information

MINI LESSON. Lesson 2a Linear Functions and Applications

MINI LESSON. Lesson 2a Linear Functions and Applications MINI LESSON Lesson 2a Linear Functions and Applications Lesson Objectives: 1. Compute AVERAGE RATE OF CHANGE 2. Explain the meaning of AVERAGE RATE OF CHANGE as it relates to a given situation 3. Interpret

More information

Chapter 4 Data with Two Variables

Chapter 4 Data with Two Variables Chapter 4 Data with Two Variables 1 Scatter Plots and Correlation and 2 Pearson s Correlation Coefficient Looking for Correlation Example Does the number of hours you watch TV per week impact your average

More information

Lecture 46 Section Tue, Apr 15, 2008

Lecture 46 Section Tue, Apr 15, 2008 ar Koer ar Lecture 46 Section 13.3.2 Koer Hampden-Sydney College Tue, Apr 15, 2008 Outline ar Koer 1 2 3 4 5 ar Koer We are now ready to calculate least-squares regression line. formulas are a bit daunting,

More information

Review of Regression Basics

Review of Regression Basics Review of Regression Basics When describing a Bivariate Relationship: Make a Scatterplot Strength, Direction, Form Model: y-hat=a+bx Interpret slope in context Make Predictions Residual = Observed-Predicted

More information

Reteach 2-3. Graphing Linear Functions. 22 Holt Algebra 2. Name Date Class

Reteach 2-3. Graphing Linear Functions. 22 Holt Algebra 2. Name Date Class -3 Graphing Linear Functions Use intercepts to sketch the graph of the function 3x 6y 1. The x-intercept is where the graph crosses the x-axis. To find the x-intercept, set y 0 and solve for x. 3x 6y 1

More information

CHAPTER 4 DESCRIPTIVE MEASURES IN REGRESSION AND CORRELATION

CHAPTER 4 DESCRIPTIVE MEASURES IN REGRESSION AND CORRELATION STP 226 ELEMENTARY STATISTICS CHAPTER 4 DESCRIPTIVE MEASURES IN REGRESSION AND CORRELATION Linear Regression and correlation allows us to examine the relationship between two or more quantitative variables.

More information

Chapter 7. Scatterplots, Association, and Correlation

Chapter 7. Scatterplots, Association, and Correlation Chapter 7 Scatterplots, Association, and Correlation Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 29 Objective In this chapter, we study relationships! Instead, we investigate

More information

Chapter 6 Scatterplots, Association and Correlation

Chapter 6 Scatterplots, Association and Correlation Chapter 6 Scatterplots, Association and Correlation Looking for Correlation Example Does the number of hours you watch TV per week impact your average grade in a class? Hours 12 10 5 3 15 16 8 Grade 70

More information

UNIT 12 ~ More About Regression

UNIT 12 ~ More About Regression ***SECTION 15.1*** The Regression Model When a scatterplot shows a relationship between a variable x and a y, we can use the fitted to the data to predict y for a given value of x. Now we want to do tests

More information

Chapter 5 Least Squares Regression

Chapter 5 Least Squares Regression Chapter 5 Least Squares Regression A Royal Bengal tiger wandered out of a reserve forest. We tranquilized him and want to take him back to the forest. We need an idea of his weight, but have no scale!

More information

Section 2.2: LINEAR REGRESSION

Section 2.2: LINEAR REGRESSION Section 2.2: LINEAR REGRESSION OBJECTIVES Be able to fit a regression line to a scatterplot. Find and interpret correlation coefficients. Make predictions based on lines of best fit. Key Terms line of

More information

Prof. Bodrero s Guide to Derivatives of Trig Functions (Sec. 3.5) Name:

Prof. Bodrero s Guide to Derivatives of Trig Functions (Sec. 3.5) Name: Prof. Bodrero s Guide to Derivatives of Trig Functions (Sec. 3.5) Name: Objectives: Understand how the derivatives of the six basic trig functions are found. Be able to find the derivative for each of

More information

Related Example on Page(s) R , 148 R , 148 R , 156, 157 R3.1, R3.2. Activity on 152, , 190.

Related Example on Page(s) R , 148 R , 148 R , 156, 157 R3.1, R3.2. Activity on 152, , 190. Name Chapter 3 Learning Objectives Identify explanatory and response variables in situations where one variable helps to explain or influences the other. Make a scatterplot to display the relationship

More information

Correlation and Regression Notes. Categorical / Categorical Relationship (Chi-Squared Independence Test)

Correlation and Regression Notes. Categorical / Categorical Relationship (Chi-Squared Independence Test) Relationship Hypothesis Tests Correlation and Regression Notes Categorical / Categorical Relationship (Chi-Squared Independence Test) Ho: Categorical Variables are independent (show distribution of conditional

More information

Chapter 6: Exploring Data: Relationships Lesson Plan

Chapter 6: Exploring Data: Relationships Lesson Plan Chapter 6: Exploring Data: Relationships Lesson Plan For All Practical Purposes Displaying Relationships: Scatterplots Mathematical Literacy in Today s World, 9th ed. Making Predictions: Regression Line

More information

Chapter 6. Exploring Data: Relationships

Chapter 6. Exploring Data: Relationships Chapter 6 Exploring Data: Relationships For All Practical Purposes: Effective Teaching A characteristic of an effective instructor is fairness and consistenc in grading and evaluating student performance.

More information

Math 52 Linear Regression Instructions TI-83

Math 52 Linear Regression Instructions TI-83 Math 5 Linear Regression Instructions TI-83 Use the following data to study the relationship between average hours spent per week studying and overall QPA. The idea behind linear regression is to determine

More information

Linear Regression and Correlation. February 11, 2009

Linear Regression and Correlation. February 11, 2009 Linear Regression and Correlation February 11, 2009 The Big Ideas To understand a set of data, start with a graph or graphs. The Big Ideas To understand a set of data, start with a graph or graphs. If

More information

Inference for Regression Simple Linear Regression

Inference for Regression Simple Linear Regression Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating

More information

7. Do not estimate values for y using x-values outside the limits of the data given. This is called extrapolation and is not reliable.

7. Do not estimate values for y using x-values outside the limits of the data given. This is called extrapolation and is not reliable. AP Statistics 15 Inference for Regression I. Regression Review a. r à correlation coefficient or Pearson s coefficient: indicates strength and direction of the relationship between the explanatory variables

More information

Chapter 10 Correlation and Regression

Chapter 10 Correlation and Regression Chapter 10 Correlation and Regression 10-1 Review and Preview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple Regression 10-6 Modeling Copyright 2010, 2007, 2004

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

Chapter 7 Linear Regression

Chapter 7 Linear Regression Chapter 7 Linear Regression 1 7.1 Least Squares: The Line of Best Fit 2 The Linear Model Fat and Protein at Burger King The correlation is 0.76. This indicates a strong linear fit, but what line? The line

More information

Graphing Equations in Slope-Intercept Form 4.1. Positive Slope Negative Slope 0 slope No Slope

Graphing Equations in Slope-Intercept Form 4.1. Positive Slope Negative Slope 0 slope No Slope Slope-Intercept Form y = mx + b m = slope b = y-intercept Graphing Equations in Slope-Intercept Form 4.1 Positive Slope Negative Slope 0 slope No Slope Example 1 Write an equation in slope-intercept form

More information

Correlation A relationship between two variables As one goes up, the other changes in a predictable way (either mostly goes up or mostly goes down)

Correlation A relationship between two variables As one goes up, the other changes in a predictable way (either mostly goes up or mostly goes down) Two-Variable Statistics Correlation A relationship between two variables As one goes up, the other changes in a predictable way (either mostly goes up or mostly goes down) Positive Correlation As one variable

More information

Analysis of Bivariate Data

Analysis of Bivariate Data Analysis of Bivariate Data Data Two Quantitative variables GPA and GAES Interest rates and indices Tax and fund allocation Population size and prison population Bivariate data (x,y) Case corr&reg 2 Independent

More information

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc. Chapter 8 Linear Regression Copyright 2010 Pearson Education, Inc. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu: Copyright

More information

Lesson 4 Linear Functions and Applications

Lesson 4 Linear Functions and Applications In this lesson, we take a close look at Linear Functions and how real world situations can be modeled using Linear Functions. We study the relationship between Average Rate of Change and Slope and how

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

Lecture 11: Simple Linear Regression

Lecture 11: Simple Linear Regression Lecture 11: Simple Linear Regression Readings: Sections 3.1-3.3, 11.1-11.3 Apr 17, 2009 In linear regression, we examine the association between two quantitative variables. Number of beers that you drink

More information

Chapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation

Chapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation Chapter 14 Describing Relationships: Scatterplots and Correlation Chapter 14 1 Statistical versus Deterministic Relationships Distance versus Speed (when travel time is constant). Income (in millions of

More information

Year 10 Mathematics Semester 2 Bivariate Data Chapter 13

Year 10 Mathematics Semester 2 Bivariate Data Chapter 13 Year 10 Mathematics Semester 2 Bivariate Data Chapter 13 Why learn this? Observations of two or more variables are often recorded, for example, the heights and weights of individuals. Studying the data

More information

AP STATISTICS Name: Period: Review Unit IV Scatterplots & Regressions

AP STATISTICS Name: Period: Review Unit IV Scatterplots & Regressions AP STATISTICS Name: Period: Review Unit IV Scatterplots & Regressions Know the definitions of the following words: bivariate data, regression analysis, scatter diagram, correlation coefficient, independent

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

AMS 7 Correlation and Regression Lecture 8

AMS 7 Correlation and Regression Lecture 8 AMS 7 Correlation and Regression Lecture 8 Department of Applied Mathematics and Statistics, University of California, Santa Cruz Suumer 2014 1 / 18 Correlation pairs of continuous observations. Correlation

More information

Basic Practice of Statistics 7th

Basic Practice of Statistics 7th Basic Practice of Statistics 7th Edition Lecture PowerPoint Slides In Chapter 4, we cover Explanatory and response variables Displaying relationships: Scatterplots Interpreting scatterplots Adding categorical

More information

1) A residual plot: A)

1) A residual plot: A) 1) A residual plot: A) B) C) D) E) displays residuals of the response variable versus the independent variable. displays residuals of the independent variable versus the response variable. displays residuals

More information

3 9 Curve Fitting with Polynomials

3 9 Curve Fitting with Polynomials 3 9 Curve Fitting with Polynomials Relax! You will do fine today! We will review for quiz!!! (which is worth 10 points, has 20 questions, group, graphing calculator allowed, and will not be on your first

More information

The response variable depends on the explanatory variable.

The response variable depends on the explanatory variable. A response variable measures an outcome of study. > dependent variables An explanatory variable attempts to explain the observed outcomes. > independent variables The response variable depends on the explanatory

More information

Practice Questions for Exam 1

Practice Questions for Exam 1 Practice Questions for Exam 1 1. A used car lot evaluates their cars on a number of features as they arrive in the lot in order to determine their worth. Among the features looked at are miles per gallon

More information

THE PEARSON CORRELATION COEFFICIENT

THE PEARSON CORRELATION COEFFICIENT CORRELATION Two variables are said to have a relation if knowing the value of one variable gives you information about the likely value of the second variable this is known as a bivariate relation There

More information