Historical Note. Regression. Line of Best Fit

Similar documents
Chapter 11. Correlation and Regression

H.Algebra 2 Summer Review Packet

The American School of Marrakesh. Algebra 2 Algebra 2 Summer Preparation Packet

Chapter 6. Exploring Data: Relationships

3.7 Linear and Quadratic Models

More Statistical Inference

Chapter 12 : Linear Correlation and Linear Regression

Introduction Direct Variation Rates of Change Scatter Plots. Introduction. EXAMPLE 1 A Mathematical Model

Using a graphic display calculator

determine whether or not this relationship is.

Session 4 2:40 3:30. If neither the first nor second differences repeat, we need to try another

Math 52 Linear Regression Instructions TI-83

Unit 3: Relations and Functions

Lesson 4 Linear Functions and Applications

3.2 Introduction to Functions

1 Introduction to Minitab

CORRELATION AND REGRESSION

Data transformation. Core: Data analysis. Chapter 5

Using Tables and Graphing Calculators in Math 11

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis

Algebra II Foundations

Regressions of Olympic Proportions

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics

Steps to take to do the descriptive part of regression analysis:

EDEXCEL ANALYTICAL METHODS FOR ENGINEERS H1 UNIT 2 - NQF LEVEL 4 OUTCOME 4 - STATISTICS AND PROBABILITY TUTORIAL 3 LINEAR REGRESSION

INTRODUCTION GOOD LUCK!

1.5. Analyzing Graphs of Functions. The Graph of a Function. What you should learn. Why you should learn it. 54 Chapter 1 Functions and Their Graphs

3 9 Curve Fitting with Polynomials

Graphing Equations in Slope-Intercept Form 4.1. Positive Slope Negative Slope 0 slope No Slope

UNIT 6 MODELING GEOMETRY Lesson 1: Deriving Equations Instruction

Exam 3 Practice Questions Psych , Fall 9

MPM2D - Practice Mastery Test #5

3.2 Understanding Relations and Functions-NOTES

MINI LESSON. Lesson 2a Linear Functions and Applications

y n 1 ( x i x )( y y i n 1 i y 2

Correlation and Residuals

74 Maths Quest 10 for Victoria

Dependence and scatter-plots. MVE-495: Lecture 4 Correlation and Regression

Chapter 1 Functions and Models

Essential Question How can you use a scatter plot and a line of fit to make conclusions about data?

Chapter 2 Linear Relations and Functions

Lesson 3 Average Rate of Change and Linear Functions

INVESTIGATE the Math

P.4 Lines in the Plane

Topic - 12 Linear Regression and Correlation

Six Sigma Black Belt Study Guides

Ch Inference for Linear Regression

DMA 50 Worksheet #1 Introduction to Graphs: Analyzing, Interpreting, and Creating Graphs

3.2 LOGARITHMIC FUNCTIONS AND THEIR GRAPHS

Notebook Tab 6 Pages 183 to ConteSolutions

Algebra 2 Chapter 2 Page 1

LEARNING WITH MINITAB Chapter 12 SESSION FIVE: DESIGNING AN EXPERIMENT

Final Exam Review. MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

Inferential Statistics and Distributions

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

1.2. Characteristics of Polynomial Functions. What are the key features of the graphs of polynomial functions?

A calculator may be used on the exam.

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Copyrighted by Gabriel Tang B.Ed., B.Sc. Page 1.

136 Maths Quest 10 for Victoria

MATH GRADE 8 UNIT 4 LINEAR RELATIONSHIPS ANSWERS FOR EXERCISES. Copyright 2015 Pearson Education, Inc. 51

8.4. If we let x denote the number of gallons pumped, then the price y in dollars can $ $1.70 $ $1.70 $ $1.70 $ $1.

MATH GRADE 8 UNIT 4 LINEAR RELATIONSHIPS EXERCISES

Ch 13 & 14 - Regression Analysis

Unit 7 Graphs and Graphing Utilities - Classwork

Introductory Statistics

Biostatistics in Research Practice - Regression I

LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION

y = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output

STATISTICAL DATA ANALYSIS IN EXCEL

Chapter 16. Simple Linear Regression and Correlation

Quadratic equationsgraphs

CHAPTER P Preparation for Calculus

LOOKING FOR RELATIONSHIPS

Essential Question How can you use a quadratic function to model a real-life situation?

3-1. Solving Systems Using Tables and Graphs. Concept Summary. Graphical Solutions of Linear Systems VOCABULARY TEKS FOCUS ESSENTIAL UNDERSTANDING

Business Statistics. Lecture 9: Simple Regression

Algebra II Notes Quadratic Functions Unit Applying Quadratic Functions. Math Background

Chapter 9. Correlation and Regression

Section 2.2: LINEAR REGRESSION

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1

Lesson Remember. Finding Domain and Range from a Graph EXAMPLE. Key Vocabulary

Regression Analysis. BUS 735: Business Decision Making and Research

Red Hot Half-Life Modeling Nuclear Decay

Review Topics for MATH 1400 Elements of Calculus Table of Contents

Least Squares Regression

Module 1 Linear Regression

10.1 Simple Linear Regression

Overview. 4.1 Tables and Graphs for the Relationship Between Two Variables. 4.2 Introduction to Correlation. 4.3 Introduction to Regression 3.

2-3. Linear Regression and Correlation. Vocabulary

UNIT 3 Relationships

MATH 021 UNIT 1 HOMEWORK ASSIGNMENTS

a 2 x y 1 x 1 y SOL AII.1a

Mathematical Notation Math Introduction to Applied Statistics

Unit #2: Linear and Exponential Functions Lesson #13: Linear & Exponential Regression, Correlation, & Causation. Day #1

Vocabulary. Fitting a Line to Data. Lesson 2-2 Linear Models

Simple linear regression: linear relationship between two qunatitative variables. Linear Regression. The regression line

Year 10 Mathematics Semester 2 Bivariate Data Chapter 13

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

Transcription:

11 4 Regression Objective 4. Compute the equation of the regression line. In studing relationships between two variables, collect the data and then construct a scatter plot. The purpose of the scatter plot, as indicated previousl, is to determine the nature of the relationship. The possibilities include a positive linear relationship, a negative linear relationship, a curvilinear relationship, or no discernible relationship. After the scatter plot is drawn, the net steps are to compute the value of the correlation coefficient and to test the significance of the relationship. If the value of the correlation coefficient is significant, the net step is to determine the equation of the regression line, which is the data s line of best fit. (Note: Determining the regression line when r is not significant and then making predictions using the regression line is meaningless.) The purpose of the regression line is to enable the researcher to see the trend and make predictions on the basis of the data. Line of Best Fit Figure 11 10 Scatter Plot with Three Lines Fit to the Data Figure 11 10 shows a scatter plot for the data of two variables. It shows that several lines can be drawn on the graph near the points. Given a scatter plot, one must be able to draw the line of best fit. Best fit means that the sum of the squares of the vertical distances from each point to the line is at a minimum. The reason one needs a line of best fit is that the values of will be predicted from the values of ; hence, the closer the points are to the line, the better the fit and the prediction will be. See Figure 11 11. Historical Note Francis Galton drew the line of best fit visuall. An assistant of Karl Pearson s named G. Yule devised the mathematical solution using the least-squares method, emploing a mathematical technique developed b Adrien Marie Legendre about 100 ears earlier. Figure 11 12 shows the relationship between the values of the correlation coefficient and the variabilit of the scores about the regression line. The closer the points fit the regression line, the higher the absolute value of r is and the closer it will be to 1 or 1. When all the points fall eactl on the line, r will equal 1 or 1, and this indicates

Section 11 4 Regression 479 a perfect linear relationship between the variables. Figure 11 12(a) shows r as positive and approimatel equal to 0.50; Figure 11 12(b) shows r approimatel equal to 0.90; and Figure 11 12(c) shows r equal to 1. For positive values of r, the line slopes upward from left to right. Figure 11 11 Line of Best Fit for a Set of Data Points d 6 d 7 Observed value d 5 d 3 d 1 d 2 d 4 Predicted value Figure 11 12 Relationship between the Correlation Coefficient and the Line of Best Fit (a) r = 0.50 (b) r = 0.90 (c) r = 1.00 (d) r = 0.50 (e) r = 0.90 (f) r = 1.00 Figure 11 12(d) shows r as negative and approimatel equal to 0.50; Figure 11 12(e) shows r approimatel equal to 0.90; and Figure 11 12(f) shows r equal to 1. For negative values of r, the line slopes downward from left to right. Determination of the Regression Line Equation In algebra, the equation of a line is usuall given as m b, where m is the slope of the line and b is the intercept. (Students who need an algebraic review of the properties of a line should refer to Appendi A Section A 3 before studing this section.) In statistics, the equation of the regression line is written as ' a b, where a is the ' intercept and b is the slope of the line. See Figure 11 13.

480 Chapter 11 Correlation and Regression Figure 11 13 A Line as Represented in Algebra and in Statistics Slope Intercept Intercept Slope = m + b = 0.5 + 5 = 2 = a + b = 5 + 0.5 = 2 = 4 = 4 5 m = = 2 4 = 0.5 5 b = = 2 4 = 0.5 (a) Algebra of a line (b) Statistical notation for a regression line Historical Note In 1795, Adrien Marie Legendre (1752 1833) measured the meridian arc on the earth s surface from Barcelona, Spain, to Dunkirk, England. This measure was used as the basis for the measure of the meter. Legendre developed the least-squares method around the ear 1805. There are several methods for finding the equation of the regression line. Two formulas are given here. These formulas use the same values that are used in computing the value of the correlation coefficient. The mathematical development of these formulas is beond the scope of this book. Formulas for the Regression Line ' a b a 2 n 2 2 n b n 2 2 where a is the ' intercept and b is the slope of the line. Rounding Rule for the Intercept and Slope places. Round the values of a and b to three decimal Eample 11 9 Find the equation of the regression line for the data in Eample 11 4, and graph the line on the scatter plot of the data. Solution The values needed for the equation are n 6, 345, 819, 47,634, and 2 20,399. Substituting in the formulas, one gets a 2 n 2 2 n 6 47,634 345 819 b 0.964 n 2 2 6 20,399 345 2 Hence, the equation of the regression line, ' a b, is ' 81.048 0.964 819 20,399 345 47,634 81.048 6 20,399 345 2

Section 11 4 Regression 481 The graph of the line is shown in Figure 11 14. Figure 11 14 Regression Line for Eample 11 9 150 140 Pressure 130 120 = 81.048 + 0.964 110 0 40 50 60 70 Age Note: When drawing the scatter plot and the regression line, it is sometimes desirable to truncate the graph (see Chapter 2). The reason is to show the line drawn in the range of the independent and dependent variables. For eample, the regression line in Figure 11 14 is drawn between the values of approimatel 43 and 82 and the ' values of approimatel 120 and 152. The range of the values in the original data shown in Eample 11 4 is 70 43 27 and the range of the ' values is 152 120 32. Notice that the ais has been truncated; the distance between 0 and 40 is not shown in the proper scale compared to the distance between 40 and 50, 50 and 60, etc. The ' ais has been similarl truncated. The important thing to remember is that when the ais and sometimes the ' ais have been truncated, do not use the ' intercept value, a, to graph the line. To be on the safe side when graphing the regression line, use a value for selected from the range of values. Eample 11 10 Find the equation of the regression line for the data in Eample 11 5, and graph the line on the scatter plot. Solution The values needed for the equation are n 7, 57, 511, 3745, and 2 579. Substituting in the formulas, one gets a 2 n 2 2 n 7 3745 57 511 b 3.622 n 2 2 7 579 57 2 Hence, the equation of the regression line, ' a b, is ' 102.493 3.622 The graph of the line is shown in Figure 11 15. 511 579 57 3745 102.493 7 579 57 2

482 Chapter 11 Correlation and Regression Figure 11 15 Regression Line for Eample 11 10 100 90 80 Final grade 70 60 50 40 = 102.493 3.622 30 0 5 10 15 Number of absences The sign of the correlation coefficient and the sign of the slope of the regression line will alwas be the same. That is, if r is positive, then b will be positive; if r is negative, then b will be negative. The reason is that the numerators of the formulas are the same and determine the signs of r and b, and the denominators are alwas positive. The regression line can be used to make predictions for the dependent variable. The method for making predictions is shown in the net eample. Eample 11 11 Using the equation of the regression line found in Eample 11 9, predict the blood pressure for a person who is 50 ears old. Solution Substituting 50 for in the regression line ' 81.048 0.964 gives ' 81.048 (0.964)(50) 129.248 (rounded to 129) In other words, the predicted sstolic blood pressure for a 50-ear-old person is 129. The value obtained in Eample 11 11 is a point prediction, and with point predictions, no degree of accurac or confidence can be determined. More information on prediction is given in Section 11 5, including prediction intervals. When r is not significantl different from 0, the best predictor of is the mean of the data values of. For valid predictions, the value of the correlation coefficient must be significant. Also, two other assumptions must be met. Assumptions for Valid Predictions in Regression 1. For an specific value of the independent variable, the value of the dependent variable must be normall distributed about the regression line. See Figure 11 16(a). 2. The standard deviation of each of the dependent variables must be the same for each value of the independent variable. See Figure 11 16(b).

Section 11 4 Regression 483 Figure 11 16 Assumptions for Predictions s = a + b µ µ 2 µ n µ 1 1 2 n (a) Dependent variable normall distributed (b) σ 1 = σ 2 =... = σ n When predictions are made beond the bounds of the data, the must be interpreted cautiousl. For eample, in 1979, some eperts predicted that the United States would run out of oil b the ear 2003. This prediction was based on the current consumption and on known oil reserves at that time. However, since then, the automobile industr has produced man new fuel-efficient vehicles. Also, there are man as et undiscovered oil fields. Finall, science ma someda discover a wa to run a car on something as unlikel but as common as peanut oil. In addition, the price of a gallon of gasoline was predicted to reach $10 a few ears later. Fortunatel this has not come to pass. Remember that when predictions are made, the are based on present conditions or on the premise that present trends will continue. This assumption ma or ma not prove true in the future. The steps for finding the value of the correlation coefficient and the regression line equation are summarized in the Procedure Table. Procedure Table Finding the Correlation Coefficient and the Regression Line Equation STEP 1 Make a table, as shown in Step 2. STEP 2 Find the values of, 2, and 2. Place them in the appropriate columns and sum each column. 2 2 2 2 STEP 3 Substitute in the formula to find the value of r. STEP 4 n r [n 2 2 ][n 2 2 When r is significant, substitute in the formulas to find the values of a and b for the regression line equation a b. a 2 n b n 2 2 n 2 2

484 Chapter 11 Correlation and Regression Eercises 11 31. What two things should be done before one performs a regression analsis? 11 32. What are the assumptions for regression analsis? 11 33. What is the general form for the regression line used in statistics? 11 34. What is the smbol for the slope? For the intercept? 11 35. What is meant b the line of best fit? 11 36. When all the points fall on the regression line, what is the value of the correlation coefficient? 11 37. What is the relationship of the sign of the correlation coefficient and the sign of the slope of the regression line? 11 38. As the value of the correlation coefficient increases from 0 to 1, or decreases from 0 to 1, how do the points of the scatter plot fit the regression line? 11 39. How is the value of the correlation coefficient related to the accurac of the predicted value for a specific value of? 11 40. For the data in Eercise 11 12, find the equation of the regression line, and predict ' when 7 ads. 11 41. For the data in Eercise 11 13, find the equation of the regression line, and predict ' when 35 ears. 11 42. For the data in Eercise 11 14, find the equation of the regression line, and predict ' when $1100. 11 43. For the data in Eercise 11 15, find the equation of the regression line, and predict ' when 4 ears. 11 44. For the data in Eercise 11 16, find the equation of the regression line, and predict ' when 47 ears. 11 45. For the data in Eercise 11 17, find the equation of the regression line, and predict ' when 104. 11 46. For the data in Eercise 11 18, find the equation of the regression line, and predict ' when 88. 11 47. For the data in Eercise 11 19, find the equation of the regression line, and predict ' when 35 hours. 11 48. For the data in Eercise 11 20, find the equation of the regression line, and predict ' when 80. 11 49. For the data in Eercise 11 21, find the equation of the regression line, and predict ' when 400 calories. 11 50. For the data in Eercise 11 22, find the equation of the regression line, and predict ' when 0.27 milligram. 11 51. For the data in Eercise 11 23, find the equation of the regression line, and predict ' when 9 EER. 11 52. For the data in Eercise 11 24, find the equation of the regression line, and predict ' when 1.8 pints. 11 53. For the data in Eercise 11 25, find the equation of the regression line, and predict ' when 104. 11 54. For the data in Eercise 11 26, find the equation of the regression line, and predict ' when 4 ears. 11 55. For the data in Eercise 11 27, find the equation of the regression line, and predict ' when 4 ears. Eplain that to me. Source: Reprinted with special permission of King Features Sndicate, Inc. For Eercises 11 56 through 11 61, do a complete regression analsis b performing the following steps. a. Draw a scatter plot. b. Compute the correlation coefficient. c. State the hpotheses. d. Test the hpotheses at 0.05. Use Table I. e. Determine the regression line equation. f. Plot the regression line on the scatter plot. g. Summarize the results. 11 56. The following data were obtained for the ears 1993 through 1998 and indicate the number of fireworks (in millions) used and the related injuries. Predict the number of injuries if 100 million fireworks are used during a given ear. Fireworks in use, 67.6 87.1 117 115 118 113 Related injuries, 12,100 12,600 12,500 10,900 7,800 7,000 Source: National Council of Fireworks Safet, American Protechnic Assoc. 11 57. The following data were obtained from a surve of the number of ears people smoked and the percentage of lung damage the sustained. Predict the percentage of lung damage for a person who has smoked for 30 ears.

Section 11 4 Regression 485 Years, 22 14 31 36 9 41 19 Damage, 20 14 54 63 17 71 23 11 58. A researcher wishes to determine whether a person s age is related to the number of hours he or she jogs per week. The data for the sample follow. Age, 34 22 48 56 62 Hours, 5.5 7 3.5 3 1 11 59. The following data were obtained from a sample of counties in Southwestern Pennslvania and indicate the number (in thousands) of tons of bituminous coal produced in each count and the number of emploees working in coal production in each count. Predict the number of emploees needed to produce 500 thousand tons of coal. The data are given here. Tons, 227 5410 5328 147 729 8095 No. of emploees, 110 731 1031 20 118 1162 Tons, 635 6157 No. of emploees, 103 752 11 60. A statistics instructor is interested in finding the strength of a relationship between the final eam grades of students enrolled in Statistics I and in Statistics II. The data are given here in percentages. Statistics I, 87 92 68 72 95 78 83 98 Statistics II, 83 88 70 74 90 74 83 99 11 61. An educator wants to see how the number of absences a student in her class has affects the student s final grade. The data obtained from a sample follow. No. of absences, 10 12 2 0 8 5 Final grade, 70 65 96 94 75 82 For Eercises 11 62 and 11 63 do a complete regression analsis and test the significance of r at 0.05 using the P-value method. 11 62. A phsician wishes to know whether there is a relationship between a father s weight (in pounds) and his newborn son s weight (in pounds). The data are given here. Father s weight, 176 160 187 210 196 142 205 215 Son s weight, 6.6 8.2 9.2 7.1 8.8 9.3 7.4 8.6 11 63. A car dealership owner wishes to determine if there is a relationship between the number of ears of eperience a salesperson has and the number of cars sold per month. The data are given here. Years of eperience, 3 9 2 5 1 No. of cars sold per month, 5 14 12 21 8 *11 64. For Eercises 11 12, 11 13, and 11 14, find the mean of the and variables, then substitute the mean of the variable into the corresponding regression line equations found in Eercises 11 40, 11 41, and 11 42 and find '. Compare the value of ' with _ for each eercise. Generalize the results. *11 65. The intercept value, a, can also be found b using the following equation: a _ b _ Verif this result b using the data in Eercises 11 15 and 11 43, 11 16 and 11 44, and 11 17 and 11 45. *11 66. The value of the correlation coefficient can also be found b using the formula r bs s Where s is the standard deviation of the value, and s is the standard deviation of the values. Verif this result for Eercises 11 18, 11 19, and 11 20. MINITAB Step b Step Technolog Step b Step Correlation and Regression These instructions use data from the age and blood pressure stud discussed in Eamples 11 1, 11 4, 11 7, and 11 9. Eample MT11 1 1. Enter the data into two columns of the worksheet. Name the columns Age and Pressure. 2. Select Stat>Basic Statistics>Correlation. 3. Double-click Pressure and double-click Age. The dependent variable should be first. 4. Check the bo for Displa p-values.

486 Chapter 11 Correlation and Regression Correlation Dialog Bo 5. Click [OK]. The correlation coefficient will be displaed in the session window. It is r 0.897. To make a scatterplot: 6. Select GRAPH>PLOT. 7. Double-click on Pressure for the [Y] variable and Age for the Predictor [X] variable. 8. The Displa tpe should be Smbol. If not, click the drop down list arrow and change it. 9. Click [OK]. A graph window will open showing the scatter plot. Plot Dialog Bo To determine the equation of the least-squares regression line: 10. Select Stat>Regression>Regression. 11. Double-click Pressure in the variable list to select it for the Response variable, Y. 12. Double-click Age in the variable list to select it for the Predictors variable, X. 13. Click on [Storage] then check the boes for Residuals and Fits. 14. Click [OK] twice. The session window will contain the equation of the regression line and additional analsis. Note that the P-value shown to the right of the Predictor Age, 0.015, indicates the significance of r. See the discussion after Eample 11 7.

Section 11 4 Regression 487 Regression Analsis: Pressure versus Age The regression equation is Pressure = 81.0 + 0.964 Age Predictor Coef SE Coef T P Constant 81.05 13.88 5.84 0.004 Age 0.9644 0.2381 4.05 0.015 S = 5.641 R-Sq = 80.4% R-Sq (adj) = 75.5% Analsis of Variance Source DF SS MS F P Regression 1 522.21 522.21 16.41 0.015 Residual Error 4 127.29 31.82 Total 5 649.50 TI-83 Step b Step Correlation and Regression To graph a scatter plot: 1. Enter the values in L 1 and the values in L 2. 2. Make sure the Window values are appropriate. Select an Xmin slightl less than the smallest data value and an Xma slightl larger than the largest data value. Do the same for Ymin and Yma. Also, ou ma need to change the Xsci and Yscl values, depending on the data. 3. Press 2nd [STAT PLOT] 1 for Plot 1. The other functions should be turned off. 4. Move the cursor to On and press ENTER on the Plot 1 menu. 5. Move the cursor to the graphic that looks like a scatter plot net to Tpe (first graph), and press ENTER. Make sure the X list is L 1, and the Y list is L 2. 6. Press GRAPH. Eample TI11 1 Draw a scatter plot for the data from Eample 11 1. 43 48 56 61 67 70 128 120 135 143 141 152 Input Input Output

488 Chapter 11 Correlation and Regression The input and output screens are shown. To find the equation of the regression line: 1. Press STAT and move the cursor to Calc. 2. Press 8 for LinReg(a+b) then ENTER. The values for a and b will be displaed. In order to have the calculator compute and displa the correlation coefficient and coefficient of determination as well as the equation of the line, ou must set the diagnostics displa mode to on. Follow these steps: 1. Press 2nd [CATALOG]. 2. Use the arrow kes to scroll down to Diagnostic On. 3. Press ENTER to cop the command to the home screen. 4. Press ENTER to eecute the command. Eample TI11 2 Find the equation of the regression line for the data in the previous eample, as shown in Eample 11 9. Input Output The input and output screens are shown. The equation of the regression line is ' 81.04808549 0.964381122. To plot the regression line on the scatter plot: 1. Press Y and CLEAR to clear an previous equations. 2. Press VARS and then 5 for Statistics. 3. Move the cursor to EQ and press 1 for RegEQ. The line will be in the Y= screen. 4. Press GRAPH. Eample TI11 3 Draw the regression line found in the previous eample on the scatter plot. Output Output

Section 11 4 Regression 489 The output screens are shown. To test the significance of b and : 1. Press STAT and move the cursor to TESTS. 2. Press E (ALPHA SIN) for LinRegTTest. Make sure the Xlist is L 1, the YList is L 2, and the Freq is 1. 3. Select the appropriate alternative hpothesis. 4. Move the cursor to Calculate and press ENTER. Eample TI11 4 Test the hpothesis from Eamples 11 4 and 11 7, H 0 : 0 for the data in Eample 11 1. Use 0.05. Input Output Output In this case, the t test value is 4.050983638. The P-value is 0.0154631742, which is significant. The decision is to reject the null hpothesis at 0.05, since 0.0154631742 0.05; r 0.8966728145, r 2 0.8040221364. Ecel Step b Step Scatter Plots Creating scatter plots in Ecel is straightforward using the Chart Wizard. 1. Click on the Chart Wizard icon (it looks like a colorful histogram). 2. Select chart tpe XY (Scatter) under the Standard Tpes tab. Click on [Net >]. 3. Enter the data range, and specif whether the data for each variable are stored in columns (as we have done in our eamples) or rows. Click on [Net >]. 4. The net dialog bo enables ou to set various options for displaing the plot. In most cases, the defaults will be oka. After entering the desired options (note that there are several tabs for this screen), click on [Net >]. 5. Use this final dialog bo to specif where the chart will be located. Click on [Finish].

490 Chapter 11 Correlation and Regression Correlation Coefficient The CORREL function returns the correlation coefficient. 1. Enter the data in columns A and B. 2. Select a blank cell, then click on the f button. 3. Under Function categor, select Statistical. From the Function name list, select CORREL. 4. Enter the data range (A1:AN, where N is the number of sample data pairs) for the first variable in Arra1. Enter the data range for the second variable in Arra2. The correlation coefficient will be displaed in the selected cell.