Vocabulary. Fitting a Line to Data. Lesson 2-2 Linear Models

Similar documents
2-3. Linear Regression and Correlation. Vocabulary

Essential Question How can you use a scatter plot and a line of fit to make conclusions about data?

2-6. _ k x and y = _ k. The Graph of. Vocabulary. Lesson

7-6. nth Roots. Vocabulary. Geometric Sequences in Music. Lesson. Mental Math

Determining Slope and y-intercept 8.4.C. Find the slope of the line using the points (0, 4) and (-3, 6).

3.2 Introduction to Functions

9-1. The Function with Equation y = ax 2. Vocabulary. Graphing y = x 2. Lesson

The slope, m, compares the change in y-values to the change in x-values. Use the points (2, 4) and (6, 6) to determine the slope.

Lesson 2-7 Inverse Variation Models

Bridge-Thickness Experiment. Student 2

Fair Game Review. Chapter 2. and y = 5. Evaluate the expression when x = xy 2. 4x. Evaluate the expression when a = 9 and b = 4.

Chapter 7. Linear Regression (Pt. 1) 7.1 Introduction. 7.2 The Least-Squares Regression Line

Exam 3 Practice Questions Psych , Fall 9

How can you construct and interpret a scatter plot? ACTIVITY: Constructing a Scatter Plot

Inequalities and Multiplication

ACTIVITY: Using a Table to Plot Points

Equations for Some Hyperbolas

Chapter 2: Functions and Models. 2.1 The Language of Functions. Definitions

Linear Equations and Arithmetic Sequences

LESSON #12 - FORMS OF A LINE COMMON CORE ALGEBRA II

Rate of Change and Slope. ESSENTIAL QUESTION How do you find a rate of change or a slope?

LESSON #11 - FORMS OF A LINE COMMON CORE ALGEBRA II

( 3x. Chapter Review. Review Key Vocabulary. Review Examples and Exercises 6.1 Properties of Square Roots (pp )

Solving Systems of Linear Equations by Graphing

Name Date. and y = 5.

4.1 Circles. Explore Deriving the Standard-Form Equation

Functions. Essential Question What is a function?

Functions. Essential Question What is a function? Work with a partner. Functions can be described in many ways.

Fair Game Review. Chapter = How many calculators are sold when the profit is $425? Solve the equation. Check your solution.

Parametric Equations for Circles and Ellipses

How can you write an equation of a line when you are given the slope and a point on the line? ACTIVITY: Writing Equations of Lines

8.4. If we let x denote the number of gallons pumped, then the price y in dollars can $ $1.70 $ $1.70 $ $1.70 $ $1.

c. Find the slope and y-intercept of the graph of the linear equation. Then sketch its graph.

Fair Game Review. Chapter 5. Input, x Output, y. 1. Input, x Output, y. Describe the pattern of inputs x and outputs y.

Algebra 1. Standard Create and Analyze Tables and Graphs. Categories Tables Graphs Context Problems Making Predictions

Modeling with Exponential and Logarithmic Functions 6.7. Essential Question How can you recognize polynomial, exponential, and logarithmic models?

3.4. Slope Intercept Form The Leaky Bottle. My Notes ACTIVITY

1.5. Analyzing Graphs of Functions. The Graph of a Function. What you should learn. Why you should learn it. 54 Chapter 1 Functions and Their Graphs

) approaches e

15.2 Graphing Logarithmic

1Write and graph. 2Solve problems. Now. Then. Why? New Vocabulary

Chapter 11. Correlation and Regression

Maintaining Mathematical Proficiency

74 Maths Quest 10 for Victoria

Equation of a Line. Equation of a Line

3.2 LOGARITHMIC FUNCTIONS AND THEIR GRAPHS

3.2 Understanding Relations and Functions-NOTES

13.2 Exponential Growth Functions

10.7. Interpret the Discriminant. For Your Notebook. x5 2b 6 Ï} b 2 2 4ac E XAMPLE 1. Use the discriminant KEY CONCEPT

Systems of Linear Equations

Equations and the Coordinate Plane

5 Linear Graphs and Equations

4.2 Start Thinking. 4.2 Warm Up. 4.2 Cumulative Review Warm Up

1.1. Use a Problem Solving Plan. Read a problem and make a plan. Goal p Use a problem solving plan to solve problems. VOCABULARY. Formula.

Vocabulary. The Pythagorean Identity. Lesson 4-3. Pythagorean Identity Theorem. Mental Math

Introduction Direct Variation Rates of Change Scatter Plots. Introduction. EXAMPLE 1 A Mathematical Model

Essential Question How can you use a quadratic function to model a real-life situation?

LESSON 4.3 GRAPHING INEQUALITIES

Solving Systems of Linear Equations by Graphing. ESSENTIAL QUESTION How can you solve a system of equations by graphing? 8.9 Slope-intercept form

9.1.1 What else can I solve?

Finding an Equation of a Line

Proportional Relationships. How can you use proportional relationships to solve real-world problems?

Chapter 9 BUILD YOUR VOCABULARY

4 Linear Functions 45

1. Radium has a half-life of 1600 years. How much radium will be left from a 1000-gram sample after 1600 years?

TOPIC ESSENTIAL QUESTION

13.1 Exponential Growth Functions

4-10. Modeling with Trigonometric Functions. Vocabulary. Lesson. Mental Math. build an equation that models real-world periodic data.

7-1. Exploring Exponential Models. Vocabulary. Review. Vocabulary Builder. Use Your Vocabulary. 1. Cross out the expressions that are NOT powers.

CONSUMER CHOICES Madison is thinking about leasing a car for. Example 1 Solve the system of equations by graphing.

Lesson Remember. Finding Domain and Range from a Graph EXAMPLE. Key Vocabulary

Graphing Linear Equations

Chapter 9 Regression. 9.1 Simple linear regression Linear models Least squares Predictions and residuals.

P.4 Lines in the Plane

9.1.1 What else can I solve?

Summary and Vocabulary

Learning Objective: We will construct and interpret scatterplots (G8M6L4)

Writing Equations in Point-Slope Form

11.1 Solving Linear Systems by Graphing

(x, y) ( 1, 2) (0, 1) (1, 0) (2, 1)

Algebra I. Relations and Functions. Slide 1 / 113 Slide 2 / 113. Slide 3 / 113. Slide 4 / 113. Slide 6 / 113. Slide 5 / 113.

Functions. Essential Question What are some of the characteristics of the graph of an exponential function? ) x e. f (x) = ( 1 3 ) x f.

10.4 Nonlinear Inequalities and Systems of Inequalities. OBJECTIVES 1 Graph a Nonlinear Inequality. 2 Graph a System of Nonlinear Inequalities.

Analytic Geometry 300 UNIT 9 ANALYTIC GEOMETRY. An air traffi c controller uses algebra and geometry to help airplanes get from one point to another.

MATH 021 UNIT 1 HOMEWORK ASSIGNMENTS

An Introduction to Systems of Equations

What You ll Learn Identify direct variation. Use direct variation to solve problems.

6.4 graphs OF logarithmic FUnCTIOnS

f(x) = 2x 2 + 2x - 4

Algebra Semester 1 Final Exam Review Name: Hour: Date:

Systems of Linear Equations

LESSON #24 - POWER FUNCTIONS COMMON CORE ALGEBRA II

Linear Functions. Essential Question How can you determine whether a function is linear or nonlinear?

Fair Game Review. Chapter 5. feet and the length is 2x feet. Find the. perimeter of the garden. 24x 5 3. Name Date. Simplify the expression. 6.

Algebra 1 Practice Test Modeling with Linear Functions Unit 6. Name Period Date

MATH GRADE 8 UNIT 4 LINEAR RELATIONSHIPS ANSWERS FOR EXERCISES. Copyright 2015 Pearson Education, Inc. 51

In grade 10, you used trigonometry to find sides and angles in triangles. For a right triangle, sin v hy

Lesson Goals. Unit 2 Functions Analyzing Graphs of Functions (Unit 2.2) Graph of a Function. Lesson Goals

NAME DATE PERIOD. Study Guide and Intervention. Ax + By = C, where A 0, A and B are not both zero, and A, B, and C are integers with GCF of 1.

7-7A. Describing a Function from its Graph. Vocabulary. Lesson

Transcription:

Lesson 2-2 Linear Models BIG IDEA The sum of squared deviations is a statistic for determining which of two lines fi ts the data better. A linear function is a set of ordered pairs (, ) satisfing an equation of the form = m + b, where m and b are constants. Recall that the graph of ever such function is a line with slope m and -intercept b. Fitting a Line to Data When data in a scatterplot lie near a line, we can create a linear model for the data, that is, a model of one variable as a linear function of the other. Even if the linear function does not contain all of the data points, it ma still be useful in describing the overall trend of the data or in predicting values of either the dependent or independent variable. In this lesson and the net, ou will learn several techniques for constructing linear models. One technique is to fit a line to data b ee, that is, to draw a line that seems close to the data. Vocabular linear function linear model interpolation etrapolation observed values predicted values residual sum of squared residuals Mental Math What is the sum of a set 25 elements whose mean is 300? GUIDED Eample 1 Jewelers emphasize that the price of a diamond is determined b cut, carat weight, color and clarit. The table at the right gives carat weights and approimate prices in U.S. dollars for twent diamond rings sold at a recent auction in Singapore. All rings are of the same qualit gold and contain a single diamond. The data are graphed below. Prices of Diamond Rings Sold in Singapore 1 200 0 0 0.1 0.2 0.3 0.4 Weight (carats of diamonds) Weight Price (U.S. $) 0.18 702.00 0.17 517.50 0.25 963.00 0.29 1290.00 0.27 1080.00 0.15 484.50 0.20 747.00 0.25 1017.00 0.21 724.50 0.17 529.50 0.35 1629.00 0.33 1417.50 0.26 994.50 0.16 513.00 0.12 334.50 0.18 664.50 0.15 430.50 0.16 507.00 0.16 498.00 0.23 829.50 Source: Journal of Statistics Education Linear Models 87

Chapter 2 The linear model is based on the weight of the diamond used. Although the data are not collinear, the line through (0.18, ) and (0.32, ) seems close to the points. It has been added to the graph. An equation of this graphed line is one linear model for these data. Is the size alone a good predictor of price? a. Find an equation of the graphed line which relates weight and price. b. Interpret the slope of the line in the contet of the problem. c. Use the model to estimate the price of a 0.3-carat diamond ring. d. Wh is the model not good for predicting the cost of a 0.05-carat diamond ring? e. Wh is the set of data not a function? Solution Hard Rock a. We use the points (0.18, ) and (0.32, ) to fi nd The largest rough diamond ever found an equation of the line: weighed 3106 carats. m = 2-1 2 - =?? 1 Substitute m = 5714.29 and (0.18, ) into the point-slope form of the equation for a line. - 1 = m( - 1 ) -? = 5714.29( -? ) =? b. The slope is the rate of change. The cost increases b about? dollars for ever 1-carat increase in weight. c. To predict the cost from the weight, substitute = 0.3 and solve for. When = 0.3, =?. According to the model, the cost of a 0.3-carat diamond ring will be about?. d. Substitute 0.05 for and solve for. The model predicts that a 0.05-carat diamond ring will cost?. This means that the seller would pa ou to take the ring! This is not plausible, so the model is not a good predictor. e. There is at least one -value that does not have a unique -value. There are two diamonds at 0.17 carats that have different prices, so the data set is not a function. In this eample, known diamond weights range from 0.12 to 0.35 carats. If ou use the model to predict the price of a 0.3-carat diamond, ou are making a prediction between known values. Prediction between known values is called interpolation. If ou calculate the price of a diamond weighing 0.05 carats, ou are making a prediction beond known values. Prediction beond known values is called etrapolation. Etrapolation is usuall more hazardous than interpolation, because it depends on an assumption that a relationship will continue past the known data. In this case, a diamond much smaller than those in the sample might be inepensive, but it will not be free. 88 Functions and Models

Measuring How Well a Line Models Data In Chapter 1, ou studied the sample variance, which was computed as the sum of the squared deviations from the mean divided b n - 1. A similar statistic, the residual, tells how far awa data are from our chosen model. At the right is a table showing a smaller sample of the diamond ring data on page 87. These data, collected from sources such as eperiments or surves, are called observed values. Below, the scatterplot of these data is shown together with the graph of the line with equation = 2 +. This equation seems to be a prett good model for the data. However, is it the best model? To compare models, we calculate residuals. The values predicted b a model are called predicted values. The observed value minus the predicted value is the residual. A residual is positive when the observed value is higher than what is predicted b the model. A residual is negative when the observed value is lower than what the model predicts. For instance, a 0.16-carat diamond ring sold for $507.00. This is lower than the price predicted b the model. predicted price = 2 (0.16) + = $784 The residual, or error, is residual = observed value - predicted value = 507-784 = 277. Diamond Ring Prices b Weight of Diamond Weight Price (U.S. dollars) 0.15 484.50 0.16 507.00 0.18 702.00 0.25 963.00 0.27 1080.00 0.33 1417.50 0.23 829.50 The actual cost was $277 less than predicted. On the graph, we see that the observed value is 277 units below the linear model = 2 +. That is wh the residual is negative. Ever data point has a residual, so n data points provide n residuals. The absolute value of a residual is the length of the vertical segment from the data point to the corresponding point on the linear model. Diamond Ring Prices b Weight of Diamond Predicted Price = 2 + (0.33, 1417.5) Predicted Value (0.16, 784) Residual = 507-784 = -277 (0.16, 507) Observed Value 200 0 0.16 0.20 0.24 0.28 0.32 0.36 Weight (carats) QY QY What is the residual for the 0.33-carat diamond? Linear Models 89

Chapter 2 On the graphs below, this line and another linear model are drawn to fit the seven points. Recall that when ou compute variance, ou add the squared deviations from the mean. Similarl, when modeling data with a line, we can measure the variation from the line b adding the squares of the residuals. You can use spreadsheets to calculate residuals and the sum of squared residuals, as shown below. You can also think of (residual) 2 as the area of a geometric square whose side has length equal to the absolute value of the residual. These geometric sqares are drawn on the graphs below. Linear Model 1 Squares are shown for a line that does not go through an data points. Diamond Ring Prices b Weight of Diamond Predicted Price = 2 + 0.16 0.20 0.24 0.28 0.32 0.36 Weight (carats) Linear Model 2 Squares are shown for a line through two of the data points. Diamond Ring Prices b Weight of Diamond Predicted Price = 3728.6 + 30.857 0.16 0.20 0.24 0.28 0.32 0.36 Weight (carats) Total area of the squares 237, Total area of the squares 59,870 The second line is a better model of the data because it has a smaller total area of the squares. The total area is the sum of squared residuals. Definition of Sum of Squared Residuals n Sum of squared residuals = (observed i - predicted i ) 2 i = 1 90 Functions and Models

The sum of squared residuals is a statistic that measures lack of fit. If ou compare two lines, the one with the larger sum of squared residuals is not as good a model as the one with the smaller sum of squared residuals. If ou have man possible lines ou could use to model data, compute the sum of squared residuals for each model. The line that gives the smallest value provides the best fit to the data. Activit This table shows the number of televisions per 100 Countr TVs per 100 Unemploed per 100 people in 1997 and the number of unemploed per Argentina 22.3 7.8 100 people in 2008 for nine countries from around Bulgaria 40.0 6.3 the world. Some people suggest that increased TV viewing leads to a less productive workforce. Are these statistics related? Step 1 Use a statistics utilit to create a scatterplot with number of TVs on the -ais and number of unemploed people on the -ais. Step 2 Add a movable line to the scatterplot. India Israel Netherlands New Zealand Poland South Africa 6.5 29.9 51.8 52.3 33.7 12.3 6.8 6.1 4.5 4.0 9.7 21.7 South Korea 34.7 3.2 Step 3 Move the line as close to = 0.3 + 17 Source: UNESCO Institute for Statistics, CIA World Fact Book as ou can. Your screen should look similar to the one at the right. Record the sum of squared residuals. Step 4 Move the line so that it goes through the points for Bulgaria and the Netherlands. Record the equation. Record the sum of squared residuals. Step 5 Tell which line is the better model of the data. Eplain our choice. Step 6 Move the line until ou get the smallest sum of squares that ou can. Record the equation and the sum of squared residuals. Questions COVERING THE IDEAS 1. Define linear function. 2. Refer to the diamond price data in Eample 1. a. Find an equation for the line that passes through the data points (0.35, 1629) and (0.12, 334.50). b. Write a sentence that states how cost increases for ever carat increase in weight according to our equation. c. Use our line to predict the price of a 0.17-carat diamond. d. Is this interpolation or etrapolation? 3. Fill in the Blanks For a data set, residuals are the differences between? values and? values. Linear Models 91

Chapter 2 4. When a residual is positive, is the observed value higher or lower than the predicted value? In 5 and 6, a diamond speculator used the line with equation = 5000-250 to estimate the price of diamond rings. 5. a. What would the speculator predict for the price of the 0.29-carat diamond ring? b. According to the data in Eample 1, what is the residual for the 0.29-carat diamond ring? 6. a. What would the speculator predict as a price for the 0.21-carat diamond ring? b. What is the residual for this diamond ring? 7. Does the phrase sum of squared residuals mean first sum the residuals and then square the sum or first square each residual and then sum the squares? 8. Consider the linear model f() = 5.2-3. a. Given a residual of 0.2 at = 1, find the observed value. b. Given a residual of 2.1 at = 4, find the observed value. c. Given a residual of 0 at = 3, find the observed value. APPLYING THE MATHEMATICS In 9 and 10, suppose Jane asked members of her famil how man states the had visited. Her data are in the table at the right. 9. Jane plotted the data and drew a line through the points for Ed and Grandma. She found its equation to be = 0.55 + 3.5. a. What is the slope of this line and what does it represent? b. Complete the spreadsheet below to calculate the residuals for this model. c. What is the sum of squared residuals for Jane s model? d. Jane used her equation to predict how man states she will have visited at age 30. Is this interpolation or etrapolation? e. What is Jane s prediction from Part d? Famil Member Age States Visited Jane 16 15 Cousin Xia 16 18 Dad 40 27 Grandma 70 42 Brother Ed 10 9 Uncle Ralph 45 19 1 2 3 4 5 6 7 A B C D Age 16 16 40 70 10 45 States Visited 15 57 2 18 27 42 9 19 Predicted 12.3 25.5 42.0 28.25 Residual 7.29 2.25 0 85.56 2 10. Xia thought that drawing the line through the points for Ed and Dad would look better because then there would be two points above the line and two below the line. Her line has equation = 0.6 + 3. 92 Functions and Models

a. What is the residual for Grandma using Xia s model? b. What is the squared residual for Dad using Xia s model? c. Calculate the sum of squared residuals for Xia s model. d. Xia uses the equation to predict how man states Grandma will have visited when she reaches 100. Is this interpolation or etrapolation? e. What is Xia s prediction from Part d? 11. The table at the right gives epected life Epected Life Span Mean Gestation spans and gestation periods for selected Animal (ears) (das) animals. Gestation for mammals is Black Bear 20 215 development time in the mother s uterus until birth. For humans, the mean time is Camel 40 390 266 das. Elephant 55 660 a. Make a scatterplot with Life Span as the Gorilla 50 251 independent variable. Gra Squirrel 9 42 b. Draw the line through the points for Hippopotamus 45 240 Lion and Elephant. Do ou think this Lion 12 100 line fits the data well? Moose 27 240 c. Find an equation for the line in Part b Wolf 10 63 in slope-intercept form. Source: BBC d. Interpret the slope of the line in Part b. e. What is the residual for the Moose under this model? f. Use the line in Part b to predict the gestation period for the Blue Whale, which has an estimated life span of 80 ears. (The actual gestation period is estimated at 11-12 months.) Is this interpolation or etrapolation? g. What is the sum of squared residuals for the line? REVIEW 12. Consider the function {( 2, 8), ( 1, 8), (0, 8), (3, 8), (8, 8)}. (Lesson 2-1) a. State its domain. b. Give its range. In 13 and 14, a sample of five math test scores is given. (Lesson 1-6) a. Find the standard deviation of the data set. b. Eplain a conclusion ou can draw about the spread of the data set b just looking at it. 13. 63, 77, 81, 83, 92 14. 94, 95, 95, 96, 100 EXPLORATION 15. Use statistical software to draw a line that has the smallest sum of squared residuals for the data in the table in Question 11. a. What is the equation? b. What is the sum of squared residuals? QY ANSWER $225.50 Linear Models 93