AP Statistics. Chapter 9 Re-Expressing data: Get it Straight

1 AP Statistics Chapter 9 Re-Expressing data: Get it Straight

2 Objectives: Re-expression of data; Ladder of Powers

3 Straight to the Point We cannot use a linear model unless the relationship between the two variables is linear. Often, re-expression (transformation) can save the day, straightening bent relationships so that we can fit and use a simple linear model. Two simple ways to re-express data are with logarithms and reciprocals. Re-expressions can be seen in everyday life; everybody does it.

4 Straight to the Point The relationship between fuel efficiency (in miles per gallon) and weight (in pounds) for late model cars looks fairly linear at first:

5 Straight to the Point A look at the residuals plot shows a problem:

6 Straight to the Point We can re-express fuel efficiency as gallons per hundred miles (a reciprocal) and eliminate the bend in the original scatterplot:

7 Straight to the Point A look at the residuals plot for the new model seems more reasonable:
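The reciprocal re-expression on the previous slides can be sketched in a few lines. This is a minimal illustration, not the slide's actual data: the weights and the linear fuel-use relationship are hypothetical, built so that gallons per 100 miles is exactly linear in weight, and the helper names are assumptions.

```python
import numpy as np

def mpg_to_gal_per_100mi(mpg):
    """Re-express miles per gallon as gallons per 100 miles (a reciprocal)."""
    mpg = np.asarray(mpg, dtype=float)
    return 100.0 / mpg

def residuals(x, y):
    """Residuals from the least-squares line of y on x."""
    slope, intercept = np.polyfit(x, y, 1)
    return y - (intercept + slope * x)

# Hypothetical weights (lb); fuel use is constructed to be linear in weight.
weight = np.array([2000.0, 2500.0, 3000.0, 3500.0, 4000.0])
gal_per_100mi = 1.0 + 0.001 * weight
mpg = 100.0 / gal_per_100mi          # bends when plotted against weight

res_mpg = residuals(weight, mpg)                               # curved pattern
res_reexpressed = residuals(weight, mpg_to_gal_per_100mi(mpg)) # no pattern left

print("residuals, mpg vs weight:         ", np.round(res_mpg, 3))
print("residuals, gal/100 mi vs weight:  ", np.round(res_reexpressed, 3))
```

In this constructed case the mpg residuals are positive at both ends and negative in the middle, the bend a residuals plot would reveal, while the re-expressed residuals vanish.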

8 Goals of Re-expression Goal 1: Make the distribution of a variable (as seen in its histogram, for example) more symmetric. It's easier to summarize the center of a symmetric distribution, and we can use the mean and standard deviation. If the distribution is also unimodal, we can analyze it using the Normal model. The example here takes the log of the explanatory variable.

9 Goals of Re-expression Goal 2: Make the spread of several groups (as seen in side-by-side boxplots) more alike, even if their centers differ. Groups that share a common spread are easier to compare. Here taking the log makes the individual boxplots more symmetric and gives them spreads that are more nearly equal.
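A small numerical sketch of Goal 2, using two hypothetical groups where one is simply ten times the other. The data are invented for illustration; real groups will only approximately behave this way.

```python
import numpy as np

# Hypothetical groups: B is A scaled by 10, so its raw spread is 10 times larger.
group_a = np.array([1.0, 2.0, 4.0, 8.0])
group_b = 10.0 * group_a

raw_spreads = (group_a.std(ddof=1), group_b.std(ddof=1))
log_spreads = (np.log10(group_a).std(ddof=1), np.log10(group_b).std(ddof=1))

print("raw SDs:", raw_spreads)   # spreads differ by a factor of 10
print("log SDs:", log_spreads)   # spreads agree after taking logs
```

Taking logs turns a multiplicative difference between groups into a shift in center, which is why the spreads line up even though the centers still differ.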

10 Goals of Re-expression Goal 3: Make the form of a scatterplot more nearly linear. Linear scatterplots are easier to model. By re-expressing to straighten the scatterplot relationship, we can fit a linear model and use linear techniques to analyze it. The example here takes the log of the response variable.

11 Goals of Re-expression Goal 4: Make the scatter in a scatterplot spread out evenly rather than thickening at one end. Having an even scatter is a condition of many methods of Statistics, as we will see later. This is closely related to Goal 2, but often comes along with Goal 3, as seen below. Taking the log to straighten the data also evened out the spread.

12 The Ladder of Powers There is a family of simple re-expressions that move data toward our goals in a consistent way. This collection of re-expressions is called the Ladder of Powers. The Ladder of Powers orders the effects that the re-expressions have on data.

13 The Ladder of Powers
Power 2. Name: Square of data values. Comment: Try with unimodal distributions that are skewed to the left.
Power 1. Name: Raw data. Comment: Data with positive and negative values and no bounds are less likely to benefit from re-expression.
Power 1/2. Name: Square root of data values. Comment: Counts often benefit from a square root re-expression.
Power "0". Name: We'll use logarithms here. Comment: Measurements that cannot be negative often benefit from a log re-expression.
Power -1/2. Name: Reciprocal square root. Comment: An uncommon re-expression, but sometimes useful.
Power -1. Name: The reciprocal of the data. Comment: Ratios of two quantities (e.g., mph) often benefit from a reciprocal.

14 The Ladder of Powers The Ladder of Powers orders the effects that the re-expressions have on data. How it works: If you try taking the square root of all the values in a variable and it helps, but not enough, then move further down the ladder to the log or reciprocal square root. Those re-expressions will have a similar, but even stronger, effect on your data. If you go too far, you can always back up. Remember, when you take a negative power, the direction of the relationship will change. This is OK; you can always change the sign of the response variable if you want to keep the same direction.
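Moving down the ladder can be sketched as follows. This is an illustrative sketch on hypothetical, strictly positive, right-skewed data; the function names are assumptions, and the sign flip for negative powers is the "change the sign" option mentioned above, used here so the ordering of the values is preserved.

```python
import numpy as np

def reexpress(values, power):
    """Apply one rung of the Ladder of Powers; power 0 stands for the log.

    Negative powers are negated so the order of the data is preserved.
    """
    values = np.asarray(values, dtype=float)
    if power == 0:
        return np.log10(values)
    if power < 0:
        return -(values ** power)
    return values ** power

def skewness(values):
    """Moment-based skewness: positive = skewed right, 0 = symmetric."""
    values = np.asarray(values, dtype=float)
    centered = values - values.mean()
    return (centered ** 3).mean() / (centered ** 2).mean() ** 1.5

# Hypothetical right-skewed data.
data = np.array([1.0, 10.0, 100.0, 1000.0, 10000.0])

for power in (1, 0.5, 0, -0.5, -1):
    print(f"power {power:>4}: skewness = {skewness(reexpress(data, power)):.3f}")
```

For this batch the skewness shrinks from raw data to square root, reaches zero at the log, and turns negative below it, which is the "if you go too far, back up" situation.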

15 Plan B: Attack of the Logarithms When none of the data values is zero or negative, logarithms can be a helpful ally in the search for a useful model. Try taking the logs of both the x- and y-variables. Then re-express the data using some combination of x or log(x) vs. y or log(y).
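One way to try the four combinations is sketched below. The function name and the data are hypothetical, and ranking by the size of r is only a first screen: a residuals plot should still be checked before settling on a model.

```python
import numpy as np

def compare_log_combinations(x, y):
    """Correlation for each pairing of x or log(x) with y or log(y).

    Requires all values to be positive, since logs are taken.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if np.any(x <= 0) or np.any(y <= 0):
        raise ValueError("logarithms need strictly positive data")
    candidates = {
        ("x", "y"): (x, y),
        ("log x", "y"): (np.log10(x), y),
        ("x", "log y"): (x, np.log10(y)),
        ("log x", "log y"): (np.log10(x), np.log10(y)),
    }
    r = {name: np.corrcoef(u, v)[0, 1] for name, (u, v) in candidates.items()}
    best = max(r, key=lambda name: abs(r[name]))
    return best, r

# Hypothetical data that follow y = 3x^2 exactly.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = 3.0 * x ** 2

best, r = compare_log_combinations(x, y)
print("straightest pairing:", best)
for name, value in r.items():
    print(name, round(value, 4))
```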

16 Plan B: Attack of the Logarithms

17 Multiple Benefits We often choose a re-expression for one reason and then discover that it has helped other aspects of an analysis. For example, a re-expression that makes a histogram more symmetric might also straighten a scatterplot or stabilize variance.

18 Why Not Just Use a Curve? If there's a curve in the scatterplot, why not just fit a curve to the data?

19 Why Not Just Use a Curve? The mathematics and calculations for curves of best fit are considerably more difficult than for lines of best fit. Besides, straight lines are easy to understand. We know how to think about the slope and the y-intercept.

20 More Plan B: Modeling Nonlinear Data - Logarithms Two specific types of nonlinear growth: 1. Exponential function (form $y = ab^x$) 2. Power function (form $y = ax^b$). Equations of both forms can be transformed into linear forms. We can then use linear regression to model and analyze the transformed data, and perform an inverse transformation to obtain a model of the original data.

21 To Transform the Exponential Function, Use Its Inverse, the Logarithmic Function. Properties of Logarithms
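For reference, the standard properties of logarithms that the later derivations rely on can be written as follows, for positive $u$ and $v$ and any real $p$ (base 10 is assumed here, matching the rest of the chapter; the symbols $u$, $v$, and $p$ are chosen for this sketch):

```latex
\begin{align*}
\log(uv) &= \log u + \log v && \text{(product property)} \\
\log\!\left(\frac{u}{v}\right) &= \log u - \log v && \text{(quotient property)} \\
\log\left(u^{p}\right) &= p \log u && \text{(power property)} \\
10^{\log u} &= u && \text{(log and exponential are inverses)}
\end{align*}
```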

22 Using Logarithms to Transform Data Logarithms can be useful in straightening a scatterplot whose data values are greater than zero. Remember, you cannot take the logarithm of a nonpositive number. When you use transformed data to create a linear model, your regression equation is not in terms of (x,y) but in terms of the transformed variable(s) (log ŷ or log x).
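As a worked sketch of what "in terms of the transformed variable" means for an exponential model $y = ab^x$, taking the log of both sides and applying the product and power properties gives a line in $x$ and $\log y$. The names $b_0$ and $b_1$ for the intercept and slope are used here for illustration:

```latex
\begin{align*}
y &= a b^{x} \\
\log y &= \log a + \log\left(b^{x}\right) \\
\log y &= \underbrace{\log a}_{b_0} + \underbrace{(\log b)}_{b_1}\, x
\end{align*}
```

So a fitted line $\log \hat{y} = b_0 + b_1 x$ predicts $\log y$, not $y$; recovering $\hat{y}$ requires undoing the log.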

23 Logarithm Transformations

24 Example: Testing for Exponential Association Data

25 View Scatterplot Looks like it has a curved pattern, could possibly be an exponential relationship.

26 Your Turn: Is the following data exponential & if so, what is r?

27 Your Turn: Is the following data (Hours vs. Number) exponential & if so, what is r?

28 Exponential Regression Procedure
1. Verify the data is exponential. Graph the scatterplot, then transform the data to linear by taking the log of the response variable.
2. Calculate the LSRL for the transformed data: $\log \hat{y} = b_0 + b_1 x$ (linear model). Analyze using linear techniques: LSRL, $r$, $r^2$, and residuals.
3. Find the exponential model for the original data by inverse transformation of the LSRL, exponentiating both sides of the LSRL equation to base 10: $\hat{y} = C \cdot 10^{kx}$ (exponential model).
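The procedure above can be sketched in Python with NumPy instead of a calculator. The function names and the data are hypothetical (the data follow y = 5 · 2^x exactly), and the sketch omits the scatterplot and residual plot that the procedure calls for.

```python
import numpy as np

def fit_exponential(x, y):
    """Fit log10(y) = b0 + b1*x by least squares; returns (b0, b1, r)."""
    x = np.asarray(x, dtype=float)
    log_y = np.log10(np.asarray(y, dtype=float))   # step 1: transform the response
    b1, b0 = np.polyfit(x, log_y, 1)               # step 2: LSRL on transformed data
    r = np.corrcoef(x, log_y)[0, 1]
    return b0, b1, r

def predict_exponential(b0, b1, x):
    """Step 3: inverse transformation, y-hat = 10**(b0 + b1*x) = C * 10**(k*x)."""
    return 10.0 ** (b0 + b1 * np.asarray(x, dtype=float))

# Hypothetical data generated from y = 5 * 2**x.
x = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0])
y = 5.0 * 2.0 ** x

b0, b1, r = fit_exponential(x, y)
print("C = 10**b0 =", 10.0 ** b0)          # multiplier in the exponential model
print("growth factor = 10**b1 =", 10.0 ** b1)
print("prediction at x = 6:", predict_exponential(b0, b1, 6.0))
```

Predicting from the unrounded b0 and b1, as done here, avoids the rounding error that creeps in when a hand-copied exponential equation is used.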

29 Example: Data Annual crude oil production from 1880 to 1970 Year Mbbl ,412 2,150 3,803 7,674 16,690

30 What to do:
1. Graph scatterplot.
2. Transform data to linear (take the log of y).
3. Calculate LSRL of transformed data and graph.
4. Analyze transformed data ($r$, $r^2$, residual plot).
5. Perform inverse transformation (exponentiate LSRL to base 10).
6. Graph exponential model.

31 Back to the Data Annual crude oil production from 1880 to 1970 Year Mbbl ,412 2,150 3,803 7,674 16,690

32 Models of Data Data is exponential (scatterplot curved pattern and constant common ratio 2.1) Linear model log ŷ= x Exponential model ŷ=( ) x Use the model stored on the calculator to make predictions, not the exponential model equation. Predict oil production for Mbbl Predict oil production for Mbbl (extrapolation; be careful).

33 Your Turn: Exponential Regression

34 Models of Data Data is exponential (scatterplot curved pattern and constant common ratio 1.5) Linear Model Log ŷ = x Exponential Model ŷ = ( ) ( x )

35 Your Turn: Age vs Height

36 Models for Data Data is exponential (scatterplot curved pattern and constant common ratio 1.04) Linear Model Log ŷ = x Exponential Model ŷ = ( ) ( x ) If comparing Height vs Weight, could a common ratio be calculated? No, because the explanatory variable, Height, does not increase in equal increments. We have to calculate different models and see which best fits the data.
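The common-ratio check used on these slides can be sketched as below. The function name and data are hypothetical; the function refuses to compute ratios when the explanatory variable is not equally spaced, which is the situation described for Height vs Weight.

```python
import numpy as np

def common_ratios(x, y):
    """Ratios of consecutive y-values; only meaningful for equally spaced x."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    steps = np.diff(x)
    if not np.allclose(steps, steps[0]):
        raise ValueError("x does not increase in equal increments")
    return y[1:] / y[:-1]

# Hypothetical data growing by a factor of 1.5 per unit of x.
x = np.array([0.0, 1.0, 2.0, 3.0])
y = np.array([100.0, 150.0, 225.0, 337.5])

print(common_ratios(x, y))   # roughly constant ratios suggest exponential growth
```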

37 Transforming or Re-Expression Power Data

38 Power Function Model Power function general form: $y = ax^b$. When we apply the log transformation to the response variable y in an exponential growth model, we produce a linear relationship. To produce a linear relationship from a power function model, we apply the log transformation to both variables (x and y). Here is how it is done.
Power function model: $y = ax^b$
Take the log of both sides of the equation: $\log y = \log(ax^b)$
Using the product and power properties of logs, this results in a linear relationship between log y and log x:
$\log y = \log a + \log x^b$
$\log y = \log a + b \log x$
The power b in the power function model becomes the slope of the straight line that links log y to log x.

39 Inverse Transformation Obtaining a power function model for the original data from the LSRL on the transformed data. The LSRL will have the form $\log \hat{y} = a + b \log x$. Inverse transform the LSRL by exponentiating both sides of the equation to base 10:
$10^{\log \hat{y}} = 10^{a + b \log x}$
$\hat{y} = (10^a)(10^{b \log x})$
$\hat{y} = (10^a)(10^{\log x})^b$
$\hat{y} = (10^a)(x^b)$
which is in the form $y = Cx^b$, a power function (this cannot be done on the calculator; it must be done by hand).

40 Power Function Procedure
1. Graph scatterplot.
2. Determine it is a power function (i.e., not exponential).
3. Transform data to linear (take the log of y and x).
4. Calculate LSRL of transformed data and graph.
5. Analyze transformed data ($r$, $r^2$, residual plot).
6. Perform inverse transformation (exponentiate LSRL to base 10).
7. Graph power model.
8. Make predictions based on the power model.
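The numerical steps of the power function procedure can be sketched as follows. The function names and the data are hypothetical (the data follow y = 4x^1.5 exactly), and the plots in steps 1, 4, and 7 are left out.

```python
import numpy as np

def fit_power(x, y):
    """Fit log10(y) = a + b*log10(x) by least squares; returns (a, b, r)."""
    log_x = np.log10(np.asarray(x, dtype=float))   # step 3: log of x ...
    log_y = np.log10(np.asarray(y, dtype=float))   # ... and log of y
    b, a = np.polyfit(log_x, log_y, 1)             # step 4: LSRL on the logs
    r = np.corrcoef(log_x, log_y)[0, 1]            # step 5: strength of the line
    return a, b, r

def predict_power(a, b, x):
    """Step 6: inverse transformation, y-hat = (10**a) * x**b."""
    return (10.0 ** a) * np.asarray(x, dtype=float) ** b

# Hypothetical data generated from y = 4 * x**1.5.
x = np.array([1.0, 2.0, 4.0, 8.0, 16.0])
y = 4.0 * x ** 1.5

a, b, r = fit_power(x, y)
print("C = 10**a =", 10.0 ** a)     # multiplier in the power model
print("power b   =", b)             # slope of the log-log line
print("prediction at x = 9:", predict_power(a, b, 9.0))
```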

41 Example 1 The table shows the temperature of an instrument measured as its distance from a heat source is varied. Find a suitable model for Dist. vs Temp. LSRL: log(temp.) = log(dist.) log ŷ = log x Power model: Temp. = ( ) $(\text{Dist.})^{-.255}$ ŷ = $x^{-.255}$

42 Your Turn: The owner of a Video Game Store records the business costs and revenue for different years with the results listed. Find the best model. LSRL: log ŷ = log x Power model: ŷ = $x^{.4}$ or ŷ = $(1995)x^{.4}$

43 What Can Go Wrong? Don't expect your model to be perfect. Don't stray too far from the ladder. Don't choose a model based on $R^2$ alone:

44 What Can Go Wrong? Beware of multiple modes. Re-expression cannot pull separate modes together. Watch out for scatterplots that turn around. Re-expression can straighten many bent relationships, but not those that go up then down, or down then up.

45 What Can Go Wrong? Watch out for negative data values. It's impossible to re-express negative values by any power that is not a whole number on the Ladder of Powers or to re-express values that are zero for negative powers. Watch for data far from 1. Data values that are all very far from 1 may not be much affected by re-expression unless the range is very large. If all the data values are large (e.g., years), consider subtracting a constant to bring them back near 1.
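The "data far from 1" warning can be illustrated with a short sketch. The years and the constant 1949 are hypothetical choices; the idea is that if log(values) is almost perfectly linear in the values themselves, taking logs cannot change the shape of a plot by much.

```python
import numpy as np

years = np.arange(1950, 2001, dtype=float)   # hypothetical yearly data, far from 1
shifted = years - 1949.0                      # years since 1949: runs from 1 to 51

# Correlation between each variable and its own log:
# near 1 means the log re-expression barely bends the scale.
r_raw = np.corrcoef(years, np.log10(years))[0, 1]
r_shifted = np.corrcoef(shifted, np.log10(shifted))[0, 1]

print(f"r between year and log(year):       {r_raw:.6f}")
print(f"r between shifted and log(shifted): {r_shifted:.6f}")
```

After subtracting the constant, the log has a visibly nonlinear effect on the values, so it can actually straighten or symmetrize something.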

46 What have we learned? When the conditions for regression are not met, a simple re-expression of the data may help. A re-expression may make the: Distribution of a variable more symmetric. Spread across different groups more similar. Form of a scatterplot straighter. Scatter around the line in a scatterplot more consistent.

47 What have we learned? Taking logs is often a good, simple starting point. To search further, the Ladder of Powers or the log-log approach can help us find a good re-expression. Our models won't be perfect, but re-expression can lead us to a useful model.

AP Statistics. The only statistics you can trust are those you falsified yourself. RE- E X P R E S S I N G D A T A ( P A R T 2 ) C H A P 9

AP Statistics. The only statistics you can trust are those you falsified yourself. RE- E X P R E S S I N G D A T A ( P A R T 2 ) C H A P 9 AP Statistics 1 RE- E X P R E S S I N G D A T A ( P A R T 2 ) C H A P 9 The only statistics you can trust are those you falsified yourself. Sir Winston Churchill (1874-1965) (Attribution to Churchill is

More information

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc. Chapter 8 Linear Regression Copyright 2010 Pearson Education, Inc. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu: Copyright

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

appstats8.notebook October 11, 2016

appstats8.notebook October 11, 2016 Chapter 8 Linear Regression Objective: Students will construct and analyze a linear model for a given set of data. Fat Versus Protein: An Example pg 168 The following is a scatterplot of total fat versus

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Warm-up Using the given data Create a scatterplot Find the regression line

Warm-up Using the given data Create a scatterplot Find the regression line Time at the lunch table Caloric intake 21.4 472 30.8 498 37.7 335 32.8 423 39.5 437 22.8 508 34.1 431 33.9 479 43.8 454 42.4 450 43.1 410 29.2 504 31.3 437 28.6 489 32.9 436 30.6 480 35.1 439 33.0 444

More information

Chapter 8. Linear Regression /71

Chapter 8. Linear Regression /71 Chapter 8 Linear Regression 1 /71 Homework p192 1, 2, 3, 5, 7, 13, 15, 21, 27, 28, 29, 32, 35, 37 2 /71 3 /71 Objectives Determine Least Squares Regression Line (LSRL) describing the association of two

More information

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals Chapter 8 Linear Regression Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8-1 Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Fat Versus

More information

Chapter 10 Re-expressing Data: Get It Straight!

Chapter 10 Re-expressing Data: Get It Straight! Chapter 0 Re-expressing Data: Get It Straight! 23 Chapter 0 Re-expressing Data: Get It Straight!. s. a) The residuals plot shows no pattern. No re-expression is needed. b) The residuals plot shows a curved

More information

Chapter 9 Re-expressing Data: Get It Straight!

Chapter 9 Re-expressing Data: Get It Straight! Chapter 9 Re-expressing Data: Get It Straight! 53 Chapter 9 Re-expressing Data: Get It Straight!. s. a) The residuals plot shows no pattern. No re-expression is needed. b) The residuals plot shows a curved

More information

Ch Inference for Linear Regression

Ch Inference for Linear Regression Ch. 12-1 Inference for Linear Regression ACT = 6.71 + 5.17(GPA) For every increase of 1 in GPA, we predict the ACT score to increase by 5.17. population regression line β (true slope) μ y = α + βx mean

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Chapter 3: Examining Relationships Most statistical studies involve more than one variable. Often in the AP Statistics exam, you will be asked to compare two data sets by using side by side boxplots or

More information

HOLLOMAN S AP STATISTICS BVD CHAPTER 08, PAGE 1 OF 11. Figure 1 - Variation in the Response Variable

HOLLOMAN S AP STATISTICS BVD CHAPTER 08, PAGE 1 OF 11. Figure 1 - Variation in the Response Variable Chapter 08: Linear Regression There are lots of ways to model the relationships between variables. It is important that you not think that what we do is the way. There are many paths to the summit We are

More information

1. Create a scatterplot of this data. 2. Find the correlation coefficient.

1. Create a scatterplot of this data. 2. Find the correlation coefficient. How Fast Foods Compare Company Entree Total Calories Fat (grams) McDonald s Big Mac 540 29 Filet o Fish 380 18 Burger King Whopper 670 40 Big Fish Sandwich 640 32 Wendy s Single Burger 470 21 1. Create

More information

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation AP Statistics Chapter 6 Scatterplots, Association, and Correlation Objectives: Scatterplots Association Outliers Response Variable Explanatory Variable Correlation Correlation Coefficient Lurking Variables

More information

BIVARIATE DATA data for two variables

BIVARIATE DATA data for two variables (Chapter 3) BIVARIATE DATA data for two variables INVESTIGATING RELATIONSHIPS We have compared the distributions of the same variable for several groups, using double boxplots and back-to-back stemplots.

More information

MULTIPLE REGRESSION METHODS

MULTIPLE REGRESSION METHODS DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 816 MULTIPLE REGRESSION METHODS I. AGENDA: A. Residuals B. Transformations 1. A useful procedure for making transformations C. Reading:

More information

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to:

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to: STA 2023 Module 5 Regression and Correlation Learning Objectives Upon completing this module, you should be able to: 1. Define and apply the concepts related to linear equations with one independent variable.

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Section 3.2 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 3 Describing Relationships 3.1 Scatterplots and Correlation 3.2 Section 3.2

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 24, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

Transforming to Achieve Linearity

Transforming to Achieve Linearity Transforming to Achieve Linearity Essential Question: Why do we transform data to make it appear linear? In a previous chapter, we learned how to analyze relationships between two quantitative variables

More information

Correlation. Relationship between two variables in a scatterplot. As the x values go up, the y values go down.

Correlation. Relationship between two variables in a scatterplot. As the x values go up, the y values go down. Correlation Relationship between two variables in a scatterplot. As the x values go up, the y values go up. As the x values go up, the y values go down. There is no relationship between the x and y values

More information

Conditions for Regression Inference:

Conditions for Regression Inference: AP Statistics Chapter Notes. Inference for Linear Regression We can fit a least-squares line to any data relating two quantitative variables, but the results are useful only if the scatterplot shows a

More information

AP STATISTICS Name: Period: Review Unit IV Scatterplots & Regressions

AP STATISTICS Name: Period: Review Unit IV Scatterplots & Regressions AP STATISTICS Name: Period: Review Unit IV Scatterplots & Regressions Know the definitions of the following words: bivariate data, regression analysis, scatter diagram, correlation coefficient, independent

More information

Algebra II Chapter 5

Algebra II Chapter 5 Algebra II Chapter 5 5.1 Quadratic Functions The graph of a quadratic function is a parabola, as shown at rig. Standard Form: f ( x) = ax2 + bx + c vertex: (x, y) = b 2a, f b 2a a < 0 graph opens down

More information

22 Approximations - the method of least squares (1)

22 Approximations - the method of least squares (1) 22 Approximations - the method of least squares () Suppose that for some y, the equation Ax = y has no solutions It may happpen that this is an important problem and we can t just forget about it If we

More information

Chapter 7 Linear Regression

Chapter 7 Linear Regression Chapter 7 Linear Regression 1 7.1 Least Squares: The Line of Best Fit 2 The Linear Model Fat and Protein at Burger King The correlation is 0.76. This indicates a strong linear fit, but what line? The line

More information

Index I-1. in one variable, solution set of, 474 solving by factoring, 473 cubic function definition, 394 graphs of, 394 x-intercepts on, 474

Index I-1. in one variable, solution set of, 474 solving by factoring, 473 cubic function definition, 394 graphs of, 394 x-intercepts on, 474 Index A Absolute value explanation of, 40, 81 82 of slope of lines, 453 addition applications involving, 43 associative law for, 506 508, 570 commutative law for, 238, 505 509, 570 English phrases for,

More information

UNIT 12 ~ More About Regression

UNIT 12 ~ More About Regression ***SECTION 15.1*** The Regression Model When a scatterplot shows a relationship between a variable x and a y, we can use the fitted to the data to predict y for a given value of x. Now we want to do tests

More information

Using a Graphing Calculator

Using a Graphing Calculator Using a Graphing Calculator Unit 1 Assignments Bridge to Geometry Name Date Period Warm Ups Name Period Date Friday Directions: Today s Date Tuesday Directions: Today s Date Wednesday Directions: Today

More information

Relationships Regression

Relationships Regression Relationships Regression BPS chapter 5 2006 W.H. Freeman and Company Objectives (BPS chapter 5) Regression Regression lines The least-squares regression line Using technology Facts about least-squares

More information

Correlation: basic properties.

Correlation: basic properties. Correlation: basic properties. 1 r xy 1 for all sets of paired data. The closer r xy is to ±1, the stronger the linear relationship between the x-data and y-data. If r xy = ±1 then there is a perfect linear

More information

Chapter 3: Describing Relationships

Chapter 3: Describing Relationships Chapter 3: Describing Relationships Section 3.2 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 3 Describing Relationships 3.1 Scatterplots and Correlation 3.2 Section 3.2

More information

7. Do not estimate values for y using x-values outside the limits of the data given. This is called extrapolation and is not reliable.

7. Do not estimate values for y using x-values outside the limits of the data given. This is called extrapolation and is not reliable. AP Statistics 15 Inference for Regression I. Regression Review a. r à correlation coefficient or Pearson s coefficient: indicates strength and direction of the relationship between the explanatory variables

More information

Bivariate Data Summary

Bivariate Data Summary Bivariate Data Summary Bivariate data data that examines the relationship between two variables What individuals to the data describe? What are the variables and how are they measured Are the variables

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

The following formulas related to this topic are provided on the formula sheet:

The following formulas related to this topic are provided on the formula sheet: Student Notes Prep Session Topic: Exploring Content The AP Statistics topic outline contains a long list of items in the category titled Exploring Data. Section D topics will be reviewed in this session.

More information

Sociology 6Z03 Review I

Sociology 6Z03 Review I Sociology 6Z03 Review I John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review I Fall 2016 1 / 19 Outline: Review I Introduction Displaying Distributions Describing

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

INFERENCE FOR REGRESSION

INFERENCE FOR REGRESSION CHAPTER 3 INFERENCE FOR REGRESSION OVERVIEW In Chapter 5 of the textbook, we first encountered regression. The assumptions that describe the regression model we use in this chapter are the following. We

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

1 Some Statistical Basics.

1 Some Statistical Basics. Q Some Statistical Basics. Statistics treats random errors. (There are also systematic errors e.g., if your watch is 5 minutes fast, you will always get the wrong time, but it won t be random.) The two

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Chapter 6. Exploring Data: Relationships. Solutions. Exercises:

Chapter 6. Exploring Data: Relationships. Solutions. Exercises: Chapter 6 Exploring Data: Relationships Solutions Exercises: 1. (a) It is more reasonable to explore study time as an explanatory variable and the exam grade as the response variable. (b) It is more reasonable

More information

Exponential Functions

Exponential Functions CONDENSED LESSON 5.1 Exponential Functions In this lesson, you Write a recursive formula to model radioactive decay Find an exponential function that passes through the points of a geometric sequence Learn

More information

IF YOU HAVE DATA VALUES:

IF YOU HAVE DATA VALUES: Unit 02 Review Ways to obtain a line of best fit IF YOU HAVE DATA VALUES: 1. In your calculator, choose STAT > 1.EDIT and enter your x values into L1 and your y values into L2 2. Choose STAT > CALC > 8.

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

Determine is the equation of the LSRL. Determine is the equation of the LSRL of Customers in line and seconds to check out.. Chapter 3, Section 2

Determine is the equation of the LSRL. Determine is the equation of the LSRL of Customers in line and seconds to check out.. Chapter 3, Section 2 3.2c Computer Output, Regression to the Mean, & AP Formulas Be sure you can locate: the slope, the y intercept and determine the equation of the LSRL. Slope is always in context and context is x value.

More information

Chapter 7 Summary Scatterplots, Association, and Correlation

Chapter 7 Summary Scatterplots, Association, and Correlation Chapter 7 Summary Scatterplots, Association, and Correlation What have we learned? We examine scatterplots for direction, form, strength, and unusual features. Although not every relationship is linear,

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

Approximations - the method of least squares (1)

Approximations - the method of least squares (1) Approximations - the method of least squares () In many applications, we have to consider the following problem: Suppose that for some y, the equation Ax = y has no solutions It could be that this is an

More information

Sem. 1 Review Ch. 1-3

Sem. 1 Review Ch. 1-3 AP Stats Sem. 1 Review Ch. 1-3 Name 1. You measure the age, marital status and earned income of an SRS of 1463 women. The number and type of variables you have measured is a. 1463; all quantitative. b.

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Inference about the Regression Model and Using the Regression Line Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about

More information

MATH 1070 Introductory Statistics Lecture notes Relationships: Correlation and Simple Regression

MATH 1070 Introductory Statistics Lecture notes Relationships: Correlation and Simple Regression MATH 1070 Introductory Statistics Lecture notes Relationships: Correlation and Simple Regression Objectives: 1. Learn the concepts of independent and dependent variables 2. Learn the concept of a scatterplot

More information
