SMAM 314 Computer Assignment 5 due Nov 8,2012 Data Set 1. For each of the following data sets use Minitab to 1. Make a scatterplot.

Similar documents
SMAM 319 Exam1 Name. a B.The equation of a line is 3x + y =6. The slope is a. -3 b.3 c.6 d.1/3 e.-1/3

SMAM 319 Exam 1 Name. 1.Pick the best choice for the multiple choice questions below (10 points 2 each)

SMAM 314 Practice Final Examination Winter 2003

Multiple Regression Examples

SMAM 314 Exam 42 Name

1. An article on peanut butter in Consumer reports reported the following scores for various brands

Confidence Interval for the mean response

28. SIMPLE LINEAR REGRESSION III

Analysis of Bivariate Data

Notebook Tab 6 Pages 183 to ConteSolutions

Multiple Regression an Introduction. Stat 511 Chap 9

CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS

Histogram of Residuals. Residual Normal Probability Plot. Reg. Analysis Check Model Utility. (con t) Check Model Utility. Inference.

Simple Linear Regression. Steps for Regression. Example. Make a Scatter plot. Check Residual Plot (Residuals vs. X)

1. Least squares with more than one predictor

School of Mathematical Sciences. Question 1

INFERENCE FOR REGRESSION

Basic Business Statistics 6 th Edition

Examination paper for TMA4255 Applied statistics

2.4.3 Estimatingσ Coefficient of Determination 2.4. ASSESSING THE MODEL 23

(4) 1. Create dummy variables for Town. Name these dummy variables A and B. These 0,1 variables now indicate the location of the house.

Steps for Regression. Simple Linear Regression. Data. Example. Residuals vs. X. Scatterplot. Make a Scatter plot Does it make sense to plot a line?

Models with qualitative explanatory variables p216

Model Building Chap 5 p251

Six Sigma Black Belt Study Guides

Chapter 14. Multiple Regression Models. Multiple Regression Models. Multiple Regression Models

Simple Linear Regression: A Model for the Mean. Chap 7

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

Institutionen för matematik och matematisk statistik Umeå universitet November 7, Inlämningsuppgift 3. Mariam Shirdel

Inference for Regression Inference about the Regression Model and Using the Regression Line

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation

Chapter 26 Multiple Regression, Logistic Regression, and Indicator Variables

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6

Chapter 12: Multiple Regression

22S39: Class Notes / November 14, 2000 back to start 1

Ch 13 & 14 - Regression Analysis

1 Introduction to Minitab

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises

W&M CSCI 688: Design of Experiments Homework 2. Megan Rose Bryant

Stat 231 Final Exam. Consider first only the measurements made on housing number 1.

Correlation & Simple Regression

Inferences for Regression

Introduction to Regression

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

23. Inference for regression

Stat 529 (Winter 2011) A simple linear regression (SLR) case study. Mammals brain weights and body weights

School of Mathematical Sciences. Question 1. Best Subsets Regression

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

Basic Business Statistics, 10/e

Business 320, Fall 1999, Final

TMA4255 Applied Statistics V2016 (5)

1 Use of indicator random variables. (Chapter 8)

Stat 501, F. Chiaromonte. Lecture #8

Is economic freedom related to economic growth?

The simple linear regression model discussed in Chapter 13 was written as

Chapter 9 - Correlation and Regression

Multiple Regression. More Hypothesis Testing. More Hypothesis Testing The big question: What we really want to know: What we actually know: We know:

This document contains 3 sets of practice problems.

27. SIMPLE LINEAR REGRESSION II

SMAM 314 Exam 3 Name. 1. Mark the following statements true or false. (6 points 2 each)

Multiple Linear Regression

Correlation and Regression

MULTIPLE LINEAR REGRESSION IN MINITAB

Ph.D. Preliminary Examination Statistics June 2, 2014

Multiple Regression Methods

STATISTICS 110/201 PRACTICE FINAL EXAM

Apart from this page, you are not permitted to read the contents of this question paper until instructed to do so by an invigilator.

Examination paper for TMA4255 Applied statistics

(1) The explanatory or predictor variables may be qualitative. (We ll focus on examples where this is the case.)

Simple Linear Regression

STAT 212 Business Statistics II 1

Conditions for Regression Inference:

Chapter 5 Friday, May 21st

Chapter 10. Correlation and Regression. Lecture 1 Sections:

PART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable,

Q Lecture Introduction to Regression

Regression. Estimation of the linear function (straight line) describing the linear component of the joint relationship between two variables X and Y.

Concordia University (5+5)Q 1.

Contents. 2 2 factorial design 4

10 Model Checking and Regression Diagnostics

Orthogonal contrasts for a 2x2 factorial design Example p130

Chapter Goals. To understand the methods for displaying and describing relationship among variables. Formulate Theories.

Chapter 5 Introduction to Factorial Designs Solutions

General Linear Model (Chapter 4)

[4+3+3] Q 1. (a) Describe the normal regression model through origin. Show that the least square estimator of the regression parameter is given by

Question Possible Points Score Total 100

1 A Review of Correlation and Regression

CRP 272 Introduction To Regression Analysis

SMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning

STAT 572 Assignment 5 - Answers Due: March 2, 2007

Chapter 1. Linear Regression with One Predictor Variable

Contents. 9. Fractional and Quadratic Equations 2 Example Example Example

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test, October 2013

STA220H1F Term Test Oct 26, Last Name: First Name: Student #: TA s Name: or Tutorial Room:

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

STAB27-Winter Term test February 18,2006. There are 14 pages including this page. Please check to see you have all the pages.

Any of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure.

Pre-Calculus Multiple Choice Questions - Chapter S8

STAT 212: BUSINESS STATISTICS II Third Exam Tuesday Dec 12, 6:00 PM

MULTIPLE REGRESSION METHODS

Transcription:

SMAM 314 Computer Assignment 5 due Nov 8,2012 Data Set 1. For each of the following data sets use Minitab to 1. Make a scatterplot. 2. Fit the linear regression line. Regression Analysis: y versus x y = 3.62-0.0147 x Predictor Coef SE Coef T P Constant 3.62091 0.09949 36.39 0.000 x -0.014711 0.001436-10.24 0.000 S = 0.147890 R-Sq = 89.7% R-Sq(adj) = 88.9% Analysis of Variance Regression 1 2.2947 2.2947 104.92 0.000 Residual Error 12 0.2625 0.0219 Total 13 2.5571 Unusual Observations Obs x y Fit SE Fit Residual St Resid 3 36 3.4000 3.0913 0.0559 0.3087 2.25R

R denotes an observation with a large standardized residual. Answer the following questions for each data set on a separate sheet of paper (1) Based only on the scatterplot a. Does a straight line model appear to be reasonable? Explain. Based on the scatterplot a straight line model appears to be reasonable because the points lie close to a straight line. b. Does the date appear to be positively or negatively correlated? Explain. The data appears to be negatively correlated. As x increases y decreases. (2) What is the equation of linear regression line? Y=3.62-0.147x (3) Is the linear regression line statistically significant at α =.05? Yes. For both the t statistic and the anova table the pvalue is zero less than.05. (4) What is the percentage of the variation accounted for by the regression line? The regression line accounts for 89.7% of the variation. (5) What is the correlation coefficient? The correlation coefficent is r = 0.947. Usually an adequate model (1) has a scatterplot reasonably close to a straight line or the curve that was fitted. (2) Accounts for at least 80% of the variation. (3) Has a correlation coefficient with absolute value at least 0.9.(Only relevant for a straight line) (4) Is statistically significant at α =.05. This in itself might not be enough though. Write a paragraph of 3-5 sentences that indicates whether the model is adequate with specific reference to the issues above citing the relevant values in the computer printout. The straight line model appears to be adequate. The scatterplot is reasonably close to a straight line. The regression line accounts for 89.7%>80% of the variation. The absolute value of the correlation coefficient is 0.947>0.9. The p value for the hypothesis test of whether the slope differs from zero is zero indicating that it is highly significant. There is some additional work for data set 2. Please see after the data set. Data set 1: This data is the result of an investigation of how the propagation of an ultrasonic stress wave through a substance depends on the properties of the substance. Data on fracture strength (x as a percentage of ultimate tensile strength) and attenuation (y in neper/cm, the decrease in amplitude of the stress wave) in fiberglass-reinforced polyester composites. Row x y 1 12 3.3 2 30 3.2 3 36 3.4

4 40 3.0 5 45 2.8 6 57 2.9 7 62 2.7 8 67 2.6 9 71 2.5 10 78 2.6 11 93 2.2 12 94 2.0 13 100 2.3 14 105 2.1 Data set 2: For this data set the independent(prediction) variable x represents the percentage of wood in a batch of pulp. The dependent (response) variable y represents the tensile strength of Kraft paper measured in pounds per square inch made from the batch. Row x y 1 1.0 6.3 2 1.5 11.1 3 2.0 20.0 4 3.0 24.0 5 4.0 26.1 6 4.5 30.0 7 5.0 33.8 8 5.5 34.0 9 6.0 38.1 10 6.5 39.9 11 7.0 42.0 12 8.0 46.1 13 9.0 53.1 14 10.0 52.0 15 11.0 52.5 16 12.0 48.0 17 13.0 42.8 18 14.0 27.8 19 15.0 21.9. For each of the following data sets use Minitab to 3. Make a scatterplot.

4. Fit the linear regression line. Regression Analysis: y versus x y = 3.62-0.0147 x Predictor Coef SE Coef T P Constant 3.62091 0.09949 36.39 0.000 x -0.014711 0.001436-10.24 0.000 S = 0.147890 R-Sq = 89.7% R-Sq(adj) = 88.9% Analysis of Variance Regression 1 2.2947 2.2947 104.92 0.000 Residual Error 12 0.2625 0.0219 Total 13 2.5571 Unusual Observations Obs x y Fit SE Fit Residual St Resid 3 36 3.4000 3.0913 0.0559 0.3087 2.25R R denotes an observation with a large standardized residual.

Results for: Worksheet 2 Scatterplot of y vs x Regression Analysis: y versus x y = 21.3 + 1.77 x Predictor Coef SE Coef T P Constant 21.321 5.430 3.93 0.001 x 1.7710 0.6478 2.73 0.014 S = 11.8159 R-Sq = 30.5% R-Sq(adj) = 26.5% Analysis of Variance Regression 1 1043.4 1043.4 7.47 0.014 Residual Error 17 2373.5 139.6 Total 18 3416.9 Unusual Observations Obs x y Fit SE Fit Residual St Resid 19 15.0 21.90 47.89 5.70-25.99-2.51R R denotes an observation with a large standardized residual. Answer the following questions for each data set on a separate sheet of paper 1.Based only on the scatterplot a. Does a straight line model appear to be reasonable? Explain. No it does not. The points are clearly curvilinear. b.does the data appear to be positively or negatively correlated? Explain. You cannot really tell. Part of the way as x increases y increases. The points reach a maximum and then decrease. 2. What is the equation of linear regression line? Y=21.3+1.77x 3.Is the linear regression line statistically significant at α =.05? Yes it is. The relavant p value is.014. Note that it is not significant at α =.01 4.What is the percentage of the variation accounted for by the regression line? The regression line accounts for only 30.5% of the variation. 5. What is the correlation coefficient? The correlation coefficient is.552. Usually an adequate model

(5) has a scatterplot reasonably close to a straight line or the curve that was fitted. (6) Accounts for at least 80% of the variation. (7) Has a correlation coefficient with absolute value at least 0.9.(Only relevant for a straight line) (8) Is statistically significant at α =.05. This in itself might not be enough though. Write a paragraph of 3-5 sentences that indicates whether the model is adequate with specific reference to the issues above citing the relevant values in the computer printout. The regression line is quite inadequate. The points appear to lie on a parabola instead of a straight line.only 30.5% of the variation is accounted for much less than 80%. The correlation coefficient is 0.552 suggesting at best a weak linear relationship. However, although based on the p value of.014 the model accounts for a statistically significant amount of the variation the scatterplot, Rsquare and the correlation coefficient suggest that there is much room for improvement. There is some additional work for data set 2. Please see after the data set. For data set 2 fit use fitted line plot under the regression menu to fit a quadratic curve. Based on criteria 1, 2 and 4 for an adequate model does the quadratic model appear to be adequate. Explain. [Note: the correlation coefficient is not relevant here because it is only a measure of the linear association.] Polynomial Regression Analysis: y versus x y = - 6.674 + 11.76 x - 0.6345 x**2 S = 4.42040 R-Sq = 90.9% R-Sq(adj) = 89.7%

Analysis of Variance Regression 2 3104.25 1552.12 79.43 0.000 Error 16 312.64 19.54 Total 18 3416.89 Sequential Analysis of Variance Source DF SS F P Linear 1 1043.43 7.47 0.014 Quadratic 1 2060.82 105.47 0.000 Fitted Line: y versus x This is a definite improvement. The scatterplot is indeed a quadratic curve. The curve accounts for 90.9% of the variation and the quadratic term is highly significant.