Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1

Similar documents
Regression and Correlation

Unit 6 - Introduction to linear regression

Unit 6 - Simple linear regression

Can you tell the relationship between students SAT scores and their college grades?

Sampling Distributions: Central Limit Theorem

This document contains 3 sets of practice problems.

Correlation and Linear Regression

Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall)

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation

9. Linear Regression and Correlation

Correlation & Simple Regression

Mrs. Poyner/Mr. Page Chapter 3 page 1

x3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators

Area1 Scaled Score (NAPLEX) .535 ** **.000 N. Sig. (2-tailed)

REVIEW 8/2/2017 陈芳华东师大英语系

Chapter 9 - Correlation and Regression

Chapter 12 : Linear Correlation and Linear Regression

Chapter 9. Correlation and Regression

Correlation and Regression (Excel 2007)

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126

3.2: Least Squares Regressions

11 Correlation and Regression

Hypothesis testing: Steps

Inferences for Regression

Intro to Linear Regression

9 Correlation and Regression

CHAPTER 5 LINEAR REGRESSION AND CORRELATION

Business Statistics. Lecture 10: Correlation and Linear Regression

Intro to Linear Regression

Information Sources. Class webpage (also linked to my.ucdavis page for the class):

Simple Linear Regression: One Qualitative IV

Mathematics for Economics MA course

Hypothesis testing: Steps

Review of Multiple Regression

23. Inference for regression

y response variable x 1, x 2,, x k -- a set of explanatory variables

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Multiple Linear Regression

Chapter 19 Sir Migo Mendoza

Quantitative Bivariate Data

SMAM 314 Exam 42 Name

Eco 391, J. Sandford, spring 2013 April 5, Midterm 3 4/5/2013

Simple Linear Regression Using Ordinary Least Squares

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS

Chapter 4. Regression Models. Learning Objectives

Math 147 Lecture Notes: Lecture 12

Simple Linear Regression: One Quantitative IV

Do not copy, post, or distribute

Example: Forced Expiratory Volume (FEV) Program L13. Example: Forced Expiratory Volume (FEV) Example: Forced Expiratory Volume (FEV)

Chapter 15. Correlation and Regression

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Test 3A AP Statistics Name:

Practice Questions for Exam 1

Inference for the Regression Coefficient

Answer Key: Problem Set 6

Final Exam - Solutions

Chapters 9 and 10. Review for Exam. Chapter 9. Correlation and Regression. Overview. Paired Data

Simple Linear Regression

Harvard University. Rigorous Research in Engineering Education

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

15.0 Linear Regression

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Sociology 593 Exam 2 Answer Key March 28, 2002

HOMEWORK (due Wed, Jan 23): Chapter 3: #42, 48, 74

Study Guide AP Statistics

Correlation & Regression. Dr. Moataza Mahmoud Abdel Wahab Lecturer of Biostatistics High Institute of Public Health University of Alexandria

Chapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories.

Business Statistics. Lecture 9: Simple Regression

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6

Finding Relationships Among Variables

Measuring Associations : Pearson s correlation

CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

ALGEBRA I SEMESTER EXAMS PRACTICE MATERIALS SEMESTER (1.1) Examine the dotplots below from three sets of data Set A

How can we explore the association between two quantitative variables?

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:

Exam Applied Statistical Regression. Good Luck!

y n 1 ( x i x )( y y i n 1 i y 2

1 A Review of Correlation and Regression

Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues

S.ID.C.8: Correlation Coefficient

Upon completion of this chapter, you should be able to:

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Correlation Coefficient: the quantity, measures the strength and direction of a linear relationship between 2 variables.

FSA Algebra I End-of-Course Review Packet

Sem. 1 Review Ch. 1-3

Inferences for Correlation

SECTION I Number of Questions 42 Percent of Total Grade 50

BIOSTATISTICS NURS 3324

Basic Statistics Exercises 66

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania

AP Statistics L I N E A R R E G R E S S I O N C H A P 7

Chapter 5 Least Squares Regression

Statistics 100 Exam 2 March 8, 2017

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except on problems 1 & 2. Work neatly.

Lecture 18: Simple Linear Regression

Econometrics Review questions for exam

Transcription:

9.1 Scatter Plots and Linear Correlation Answers 1. A high school psychologist wants to conduct a survey to answer the question: Is there a relationship between a student s athletic ability and his/her popularity? A newly wed wants to surprise her husband with a turkey dinner for supper. She checks her cookbook to find out the time needed to cook turkey which weighs 13 pounds. In the cookbook she finds the following chart: Weight (Pounds) Cooking Time (Hours) 6 4.0 8 4.5 11 5.0 15 6.0 19 7.25 23 8.0 She will use this data to determine a rule to use to find out the time needed to cook a turkey given the weight of the turkey. 2. a) Positive Correlation Concepts 1

b) Negative Correlation C) Perfect Correlation d) Zero Correlation 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 y x 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 Concepts 2

3. Weak Correlation Strong Correlation Concepts 3

4. The correlation coefficient is a measure of the linear relationship between two variables. 5. a) This is an example of perfect correlation. The value of r is one. b) The value of the Pearson correlation coefficient is 1.0. 6. a) The Pearson correlation coefficient is 0.568. b) The percentage of the variance r 2 in the scores of Quiz 2 associated with the variance in the scores of Quiz 1 is 32.3%. c) The correlation coefficient of the two quizzes is positive but its value of 0.568 indicates that the linear relationship between the two is not a strong relationship. The value of r 2 indicates that only 32.3% of the variance of Quiz 1 is shared with Quiz 2. 7. The three factors that affect the magnitude and accuracy are linearity, homogeneity of the group and the sample size. 8. a) A positive association b) No association c) A positive association 9. a) A scatterplot would be a good visual representation to determine what type of relationship exists between blood pressure and age. b) A scatterplot would not be a good visual representation because opinions are not quantitative data. c) A scatterplot would be a good visual representation to determine the strength and direction of the correlation. Concepts 4

10. The numbers -1.9 and 2.6 could not be values of the correlation coefficient since the values range from -1.0 to +1.0. A perfect correlation has a value of positive 1 or of negative 1. 11. The value -0.8 implies a stronger linear relationship. The value -0.8 is closer to -1.0 than 0.6 is to +1.0. 12. The correlation coefficient indicates a linear relationship between two variables. A correlation coefficient of zero means there is not a linear relationship. It does not mean that there is not a relationship between the variables. The relationship between the variables that resulted in a curvilinear relationship may be quadratic. 13. The figure insert is not displayed in the concept. 14. The correlation coefficient is 0.857 15. A scatterplot is a visual representation of two variables which are both quantitative. The scatterplot consists of plotted points that allow you to see the distribution of the data and to interpret the relationship between the variables. 16. a) The following scatterplot represents the marks of 11 students in an Algebra teat and in a Geometry test. What mark would a student who made 37 in the Algebra test make in the Geometry test? It was appropriate to insert a line of best fit for predicting. Concepts 5

b) The following scatterplot represents the height of girls at ages 2, 3, 4, 5, 6, 7, 8 and 9 years of age. Describe the shape, trend and strength of the relationship. No line of best fit will be inserted because the height of a girl does not depend upon her age. 17. The correlation coefficient would not change if all the weights were converted to ounces. 18. a) The correlation should be described as being STRONG not as high. b) A correlation value cannot be greater than one. c) The value of the correlation coefficient does not have units. It is a value only. Concepts 6

9.2 Least-Squares Regression Answers 1. a) b) This appears to be a linear relationship as no curve or arc is visible. c) The regression model to use is LinReg (a+bx). d) The regression coefficient is -2.95. This means that a change of 1 in the number times exercising per week is associated with a change of -2.95 units of the memory score. e) The regression equation is y 2.95x 13.65 f) g) The predicted memory test score for a person who exercises three times a week is 4.8 5. h) No, a data transformation is not necessary because there is no visible curve in the scatterplot. Concepts 7

Residuals Chapter 9 Regression and Correlation i) 10 Residuals 1.35-4.75 4.25 0.3 0.2-2.7 7.25-0.65-2.8-0.8 0.15-2.7-0.7 1.3 0.25 X Variable 1 Residual Plot 5 0-5 -10 0 1 2 3 4 5 X Variable 1 j) A transformation of the data is not necessary because the residual values are not extremely far from zero. Therefore the linear regression equation is very suitable for this dataset. 2. a) An estimate of the weight is 230 pounds. b) The slope of the regression line is 7. For each inch of height increase the weight increases 7 pounds. c) If the man is 70 inches tall his weight is 241 pounds. d) The residual is 48. Concepts 8

3. The units for are. The units for are. I am taking this to mean: The units for y are dollars. The units for x are years. 4. a) The slope of the regression line is 7/15 or 0.467 b) The estimated systolic blood pressure is 128. 5. The regression equation is y 2x 5 X Y y 4 15 13 4 11 13 7 19 19 10 21 25 10 29 25 6. a) The average GPA would be 2.9 or approximately 3. b) For a change of 1 in the GPA, the SAT score increases by a factor of 0.00365. c) The estimated difference in GPAs is 0.2. d) The SAT score of 200 would result in an average GPA of 1.3. If this were included in the dataset, the regression line equation would change slightly but not enough to alter any of the GPA estimations. Thus, the test score can be kept as part of the dataset. 7. The range of x-values to use is (2, 4, 6). The first three data points (2, 28); (4, 33) and (6, 45) result in a regression equation that gives the y value corresponding to the x value. 8. The slope of the line is 1.556. 9. The estimated height of the wife is 64 inches. 10. This means that a regression line is not the best fit for the dataset. 11. a) The regression equation is y =.052x+287.12. b) The length at 2 kg is 287.22 cm. The length at 3 kg is 287.28 cm. The length at 5 kg is 287.38 cm. The length at 105 kg is 292.58 cm. c) The length of the spring under no load is 287.12 cm. d) The constant in Hooke s law is the slope of the line. Concepts 9

12. The regression line y =1+4x is better for this data since y - y 2 is 4. 13. a) The answers to this part will vary. b) The answers to this part will vary 14. a) A correlation of -0.45 is a moderate to weak negative correlation. Therefore the x- variable and the y-variable relate by a weak negative correlation. 15. a) True b) The slope is negative since the correlation is negative c) The slope is -0.20 when x = 45 and y = -2.5 b) True c) False 16. a) Yˆ 29.98 0.576 x1 b) 7.73 c) The null hypothesis is that there is no correlation between the son s height and the dad s height. The slope is not zero as coefficient for dad/height is 0.57568. d) 30.56 Concepts 10

9.3 Inferences about Regression Answers 1. The predictor, or X, variable is the number of years of formal education. The reasoning behind this decision is that we are trying to determine and predict the financial benefits of further education (measured from the annual salary) by using the number of years of formal education 2. Yes. With an r-value of 0.67, these two variables appear to be moderately to strongly correlated. The nature of the relationship is a moderately strong, positive correlation. 3. y = 0.782x + 3.842 4. $14 399 5. a) H : 0, : 0 0 H a 6. 16.8% 7. 4.65% b) The critical values are set at t =±1.98. c) S 0.08, t 9.8 b d) Reject the null hypothesis since the test statistic value (t = 9.8) is greater than the critical value of 1.98. 8. S Y ˆ CI 3.14 95 (10.13, 22.57) 9. Answers will be vary. 10. Reject the null hypothesis since the test statistic value (t = 9.8) is greater than the critical value of 2.508. Concepts 11

9.4 Multiple Regression Answers 1. Three; Test 1, Test 2, and Test 3 2. The regression coefficient is 0.5560. This means that every 0.5560% change in Test 2 is associated with a 1.000% change in the final exam (assuming everything else is constant) 3. Y=0.0506X 1+-.5560X 2+0.2128X 3+10.7562 4. R 2 =0.4707. So 47.07% of the variance in the final exam can be accounted for by the combination of predictor variables. 5. F=13.621 with p=0.000. Therefore the probability of observing an R value have occurred by chance is not significant is very small (close to 0.000) 6. Test 2 since t = 3.885 is greater than the critical value (because of the low p-value of 0.003) 7. No because these two variables do not have test statistics that exceed the critical values. 8. a) i) 3.8; ii) 1.6 b) 0.2 9. a) 899, residual = 14 b) F (2, 46) = 97.27; p (T<97.27) = 0.9999 10. a) y=98.6-2.16x 1 + 0.134X 2 11. b) 98.6452; initial or base line temperature. c) As X 1 increases, January temperature would decrease. d) No. F (53, 3) = 0.1619; p (T<0.1619) = 0.5640 Set Weight (kg) Age Years Percent Life x 1 x 2 x 3 x 4 1 65 30 15 50 15.33 2 65 50 15 30 17.91 3 65 50 25 50 19.71 4 65 50 35 70 21.51 ŷ Concepts 12

Yˆ x x x ; response variable = college freshman year GPA, explanatory 12. 1 1 2 2 3 3 variables = total SAT scores (x 1), high school GPA (x 2), and high school rank (x 3). 13. Answers will vary. 14. There is no regression equation 15. a) Yˆ 1x1 2x2 3x3 b) 1 is the correlation coefficient of X 1 when the model fits a multiple regression linear equation; b 1 is the beta coefficient for the least squares estimate. Concepts 13