Basic Statistics Exercises 66

Size: px
Start display at page:

Download "Basic Statistics Exercises 66"

Transcription

1 Basic Statistics Exercises 66

2 42. Suppose we are interested in predicting a person's height from the person's length of stride (distance between footprints). The following data is recorded for a random sample of 5 people: Length of Stride (inches) Height (inches) (a) Identify the dependent (response) variable and the independent (explanatory) variable for a regression analysis. The dependent (response) variable is Y = height, and the independent (explanatory) variable is stride length. (b) Does the data appear to be observational or experimental? Since the ages look random, it appears that the data is observational. (c) Use the formulas from Exercise 41 find the equation of the least squares line. ^ y = x (d) Use the least squares line to predict the height of a person whose length of stride is 15 inches. ^ y = (15) = 58.2 inches Basic Statistics Exercises 67

3 43. The prediction of a male's right-hand grip strength from age is to be studied. A 0.05 significance level is to be used with a simple linear regression. The following data is recorded for a random sample of males: Age (years) Grip Strength (lbs.) (a) Identify the dependent (response) variable and the independent (explanatory) variable for a regression analysis. The dependent (response) variable is Y = grip strength, and the independent (explanatory) variable is age. (b) Does the data appear to be observational or experimental? Since the ages look random, it appears that the data is observational. (c) In order to use SPSS to do the calculations needed for the statistical analysis, enter the data into an SPSS data file named age_grip containing two variables named age and grip. Then, go to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), go to the section titled Hypothesis Tests Involving Two Variables, and read the steps in the subsection titled Performing a Simple Linear Regression with Checks of Linearity, Homoscedasticity, and Normality Assumptions. Use these steps as a guide to obtaining the output needed for the remainder of this exercise. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Basic Statistics Exercise 43(c) Verify that your SPSS output contains all of the following: Basic Statistics Exercises 68

4 43. - continued Basic Statistics Exercises 69

5 43. - continued Basic Statistics Exercises 70

6 43. - continued (b) Use the SPSS output to find each of the following: n = r = (11)(4.824) 2 = (c) Use the SPSS output to find the equation of the least squares line. ^ The least squares line can be written y = x. (d) Write a one-sentence interpretation of the slope in the least squares line. Grip strength appears to increase on average by about 2 pounds with each increase of one year in age. (e) Find the coefficient of determination, and write a one-sentence interpretation. From the SPSS output, we find r 2 = About 59.3% of the variation in grip strength is explained by age. (f) Find the standard error of estimate. From the SPSS output, we find s = Basic Statistics Exercises 71

7 43. - continued (g) Use the SPSS output to make a statement concerning whether each of the following assumptions in a simple linear regression is satisfied: the linearity assumption The data points appear to be randomly distributed about the least squares line on the scatter plot, and the residuals plotted against the predicted values look random. Consequently, the linearity assumption appears to be satisfied. the uniform variance (homoscedasticity) assumption The variation in standardized residuals around the horizontal line looks reasonably uniform. the normality assumption The histogram of standardized residuals looks somewhat bell-shaped, and the points on the normal probability plot do not seem to depart too far from the diagonal line. Since the necessary assumptions appear to be satisfied, we feel it is appropriate to proceed with the regression analysis. Basic Statistics Exercises 72

8 43 - continued (h) A 0.05 significance level is chosen for a hypothesis test to see if there is any evidence that the linear relationship between age and grip strength is significant, that is, that the slope in the regression is significantly different from zero (0). Write the results of this hypothesis test two different ways: Write the results of the f test in the ANOVA table in a format suitable for a journal article to be submitted for publication. The f test in the ANOVA for the regression to predict grip strength from age is statistically significant at the 0.05 level (f 1, 10 = , f 1, 10; 0.05 = 4.96, < p < 0.01 OR p = 0.003). We conclude that the linear relationship between age and grip strength is significant, and the data suggest a positive relationship. Write the results of the t test about the slope in a format suitable for a journal article to be submitted for publication. With a t test, the sample slope (2.00 lbs.) is statistically significantly different from zero at the 0.05 level (t 10 = 3.814, t 10; = 2.228, < p < 0.01 OR p = 0.003). We can be 95% confident that the slope in the regression to predict grip strength from age is between and lbs. Considering the results of the hypothesis test, decide whether or not a 95% confidence interval for the slope in the regression would be of interest. If yes, find and interpret the confidence interval; if not, explain why. Since rejecting H 0 suggests that the hypothesized zero slope is not correct, a 95% confidence interval will provide us with some information about the slope, which estimates the average change in grip strength with an increase of one year in age. 2 (2.228)(8.390/ ), 2 + (2.228)(8.390/ ) Basic Statistics Exercises 73

9 43 - continued (i) A 0.05 significance level is chosen for a hypothesis test to see if there is any evidence that the mean grip strength for 20 year old right-handed males is different from 80 lbs. Write the results of this hypothesis test in a format suitable for a journal article to be submitted for publication. With a t test, the estimated mean grip strength (66 lbs.) is statistically significantly different from the hypothesized mean (80 lbs.) at the 0.05 level (t 10 = 5.304, t 10; = 2.228, p < 0.001). We can be 95% confident that the mean grip strength for 20 year old right-handed males is between and lbs. Considering the results of the hypothesis test, decide whether or not a 95% confidence interval for the mean grip strength for 20 year old right-handed males would be of interest. If yes, find and interpret the confidence interval; if not, explain why. Since rejecting H 0 suggests that the hypothesized mean grip strength for 20 year old right-handed males is not correct, a 95% confidence interval will provide us with some information about this mean (20) = (2.228)( /12 + (20 18) 2 /255.98), 66 + (2.228)(8.390/ 1/12 + (20 18) 2 /255.98) Basic Statistics Exercises 74

10 43 - continued (j) Find and interpret a 95% prediction interval for the grip strength of a 20 year old right-handed male (20) = (2.228)( /12 + (20 18) 2 /255.98), 66 + (2.228)(8.390/ 1 + 1/12 + (20 18) 2 /255.98) We are 95% confident that the grip strength for a randomly selected 20-year old right-handed male will be between and lbs. OR At least 95% of 20-year old right-handed males have a grip strength between and lbs. (k) For what age group of right-handed males will the confidence interval for mean grip strength and the prediction interval for a particular grip strength both have the smallest length? 18 year olds Basic Statistics Exercises 75

11 44. The prediction of score (0-100) on a test from hours of study is to be studied. A 0.05 significance level is to be used with a simple linear regression. The following data is recorded for a random sample of students: Study Time(hrs) Test Score(points) (a) Identify the dependent (response) variable and the independent (explanatory) variable for a regression analysis. The dependent (response) variable is Y = test score, and the independent (explanatory) variable is study time. (b) Does the data appear to be observational or experimental? Since the times do not look random, it appears that the data is experimental. (c) In order to use SPSS to do the calculations needed for the statistical analysis, enter the data into an SPSS data file named time_score containing two variables named time and score. Then, go to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), go to the section titled Hypothesis Tests Involving Two Variables, and read the steps in the subsection titled Performing a Simple Linear Regression with Checks of Linearity, Homoscedasticity, and Normality Assumptions. Use these steps as a guide to obtaining the output needed for the remainder of this exercise. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Basic Statistics Exercise 44(c) Verify that your SPSS output contains all of the following: a scatter plot displaying the least squares line; tables titled Descriptive Statistics, Correlations, Model Summary, ANOVA, and Coefficients; a normal probability plot; a histogram on which a bell-shaped curve has been superimposed; a plot of standardized predicted values versus standardized residuals. Basic Statistics Exercises 76

12 44 - continued (b) Use the SPSS output to find each of the following: n = r = (14)(2.928) 2 = (c) Use the SPSS output to find the equation of the least squares line. ^ The least squares line can be written y = x. (d) Write a one-sentence interpretation of the slope in the least squares line. Test score appears to increase on average by about points with each increase of one hour in study time. (e) Find the coefficient of determination, and write a one-sentence interpretation. From the SPSS output, we find r 2 = About 28.1% of the variation in test score is explained by study time. (f) Find the standard error of estimate. From the SPSS output, we find s = Basic Statistics Exercises 77

13 44. - continued (g) Use the SPSS output to make a statement concerning whether each of the following assumptions in a simple linear regression is satisfied: the linearity assumption Since the data points appear to be randomly distributed about the least squares line on the scatter plot, and the residuals plotted against the predicted values look random. Consequently, the linearity assumption appears to be satisfied. the uniform variance (homoscedasticity) assumption The variation in standardized residuals around the horizontal line looks reasonably uniform. the normality assumption The histogram of standardized residuals looks somewhat bell-shaped, even though the points on the normal probability plot do not seem to depart too far from the diagonal line. Since the necessary assumptions do not appear to be drastically violated, we feel it is appropriate to proceed with the regression analysis. Basic Statistics Exercises 78

14 44 - continued In the Word document named Basic_Statistics_Result_Summaries (created previously), begin a section titled Basic Statistics Exercises 44. In this section, create a subsection for each of parts (h), (i), and (j) which follow, and in each subsection created, write the summaries for the corresponding part. Print the page(s) and insert them immediately after this page. (h) A 0.05 significance level is chosen for a hypothesis test to see if there is any evidence that the linear relationship between study time and test score is significant, that is, that the slope in the regression is significantly different from zero (0). Write the results of this hypothesis test two different ways: Write the results of the f test in the ANOVA table in a format suitable for a journal article to be submitted for publication. Write the results of the t test about the slope in a format suitable for a journal article to be submitted for publication. Also, considering the results of this hypothesis test, decide whether or not a 95% confidence interval for the slope in the regression would be of interest. If yes, find and interpret the confidence interval; if not, explain why. (i) A 0.05 significance level is chosen for a hypothesis test to see if there is any evidence that the mean test score for students who study for 5 hours is different from 85 points. Write the results of this hypothesis test in a format suitable for a journal article to be submitted for publication. Considering the results of the hypothesis test, decide whether or not a 95% confidence interval for the mean test score for students who study for 5 hours would be of interest. If yes, find and interpret the confidence interval; if not, explain why. (j) Find and interpret a 95% prediction interval for the test score of a student who studied for 5 hours. (k) For what study time will the confidence interval for mean test score and the prediction interval for a particular test score both have the smallest length? 6 hours Basic Statistics Exercises 79

15 45. In a study of the impact of temperature during the summer months on the maximum amount of power that must be generated to meet demand each day, the prediction of daily peak power load (megawatts) from daily high temperature (degrees Fahrenheit) is of interest. Data for 25 randomly selected summer days is stored in the SPSS data file powerloads (which can be accessed from the appropriate link on the course syllabus web page). A 0.05 significance level is chosen for hypothesis testing. (a) Identify the dependent (response) variable and the independent (explanatory) variable for a regression analysis. The dependent (response) variable is Y = daily peak power load, and the independent (explanatory) variable is X = daily high temperature. (b) Does the data appear to be observational or experimental? Since the daily high temperature is random, the data is observational. (c) Use SPSS to do the calculations needed for a simple linear regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two Variables, and reading the steps in the subsection titled Performing a Simple Linear Regression with Checks of Linearity, Homoscedasticity, and Normality Assumptions. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Basic Statistics Exercise 45(c) Verify that your SPSS output contains all of the following: Basic Statistics Exercises 80

16 45. - continued Basic Statistics Exercises 81

17 45. - continued Basic Statistics Exercises 82

18 45. - continued (d) Use the SPSS output to make a statement concerning whether each of the following assumptions in a simple linear regression is satisfied: the linearity assumption The data points do not appear to be randomly distributed about the least squares line on the scatter plot; it seems that as temperature increases, power load increases at a faster rate. Also, the residuals plotted against the predicted values do not look random. Consequently, the linearity assumption does not appear to be satisfied. the uniform variance (homoscedasticity) assumption the normality assumption Since the linearity assumption does not appear to be satisfied, it is not possible (or even relevant) to consider the uniform variance and normality assumptions. We do not feel it is appropriate to proceed with the regression analysis. Basic Statistics Exercises 83

19 45 - continued (e) It is decided that a quadratic model to predict daily peak power load from daily high temperature will be considered to improve prediction. Write an equation which describes this model. Y = a + b 1 X + b 2 X 2 OR powerload = a + b 1 (temp) + b 2 (temp) 2 (f) Use SPSS to do the calculations needed for a quadratic regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Quadratic Regression with Checks of Model, Homoscedasticity, and Normality Assumptions. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Basic Statistics Exercise 45(f) Verify that your SPSS output contains all of the following: Basic Statistics Exercises 84

20 45 - continued Basic Statistics Exercises 85

21 45 - continued (g) Use the SPSS output to make a statement concerning whether each of the following assumptions in the quadratic regression is satisfied: the assumption of the quadratic model Since the residuals plotted against the predicted values look random, the assumption of the quadratic model appears to be correct. the uniform variance (homoscedasticity) assumption The variation in standardized residuals around the horizontal line looks reasonably uniform. the normality assumption The histogram of standardized residuals looks somewhat bell-shaped, and the points on the normal probability plot do not seem to depart to far from the diagonal line. Based on these observations, we feel it is appropriate to proceed with the quadratic regression analysis. (h) A 0.05 significance level is chosen for the following hypothesis tests: First, write the results of the f test in the ANOVA table for the quadratic regression in a format suitable for a journal article to be submitted for publication. The f test in the ANOVA for the regression to predict power load from temperature and squared temperature is statistically significant at the 0.05 level (f 2, 22 = , f 2, 22; 0.05 = 3.44, p < 0.001). Basic Statistics Exercises 86

22 45(h) - continued Next, complete the steps outlined below to calculate the f statistic for the hypothesis test see if there is any evidence that the addition of squared temperature (the quadratic term) after temperature (the linear term) is statistically significant; the, write the results of this f test in a format suitable for a journal article to be submitted for publication. SSR(temp, temp 2 ) = the regression sum of squares from the ANOVA table with both temperature and squared temperature in the model = SSR(temp) = the regression sum of squares from the ANOVA table with only temperature in the model = MSE(temp, temp 2 ) = the error mean square from the ANOVA table with both temperature and squared temperature in the model = numerator df for the f statistic = number of new terms added to the model = denominator df for the f statistic = df associated with MSE(temp, temp 2 ) = SSR(temp, temp 2 ) SSR(temp) f statistic = = = MSE(temp, temp 2 ) The addition of squared temperature after temperature to predict power load is statistically significant at the 0.05 level (f 1, 22 = , f 1, 22; 0.05 = 4.30, p < 0.001). Basic Statistics Exercises 87

23 45 - continued (i) Use the SPSS output to find the equation of the least squares quadratic. The least squares line can be written ^ load = (temp) (temp) 2. (j) Find the multiple R 2, and write a one-sentence interpretation. From the SPSS output, we find R 2 = About 95.9% of the variation in power load is explained by temperature and squared temperature. (k) Find the standard error of estimate. From the SPSS output, we find s = (l) Use the least squares quadratic to predict the daily peak power load on a day when the high temperature is 75 degrees Fahrenheit, and also on a day when the high temperature is 85 degrees Fahrenheit (75) (75) 2 = megawatts (85) (85) 2 = megawatts Basic Statistics Exercises 88

24 46. In a study to predict IgG (milligrams of immunoglobulin in blood), which is an indicator of long-term immunity, from maximal oxygen uptake (milliliters per kilogram), which is a measure of aerobic fitness level, data is taken on randomly selected subjects and stored in the SPSS data file aerobic (which can be accessed from the appropriate link on the course syllabus web page). A 0.05 significance level is chosen for hypothesis testing. (a) Identify the dependent (response) variable and the independent (explanatory) variable for a regression analysis. The dependent (response) variable is Y = IgG, and the independent (explanatory) variable is X = maximal oxygen uptake. (b) Does the data appear to be observational or experimental? Since the maximal oxygen uptake is random, the data is observational. (c) Use SPSS to do the calculations needed for a simple linear regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two Variables, and reading the steps in the subsection titled Performing a Simple Linear Regression with Checks of Linearity, Homoscedasticity, and Normality Assumptions. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Basic Statistics Exercise 46(c) Verify that your SPSS output contains all of the following: a scatter plot displaying the least squares line; tables titled Descriptive Statistics, Correlations, Model Summary, ANOVA, and Coefficients; a normal probability plot; a histogram on which a bell-shaped curve has been superimposed; a plot of standardized predicted values versus standardized residuals. Basic Statistics Exercises 89

25 46. - continued (d) Use the SPSS output to make a statement concerning whether each of the following assumptions in a simple linear regression is satisfied: the linearity assumption The data points do not appear to be randomly distributed about the least squares line on the scatter plot; it seems that as maximal oxygen uptake increases, IgG increases at a slower rate. Also, the residuals plotted against the predicted values do not look random. Consequently, the linearity assumption does not appear to be satisfied. the uniform variance (homoscedasticity) assumption the normality assumption Since the linearity assumption does not appear to be satisfied, it is not possible (or even relevant) to consider the uniform variance and normality assumptions. We do not feel it is appropriate to proceed with the regression analysis. Basic Statistics Exercises 90

26 46 - continued (e) It is decided that a quadratic model to predict IgG from maximum oxygen intake will be considered to improve prediction. Write an equation which describes this model. Y = a + b 1 X + b 2 X 2 OR IgG = a + b 1 (maxoxy) + b 2 (maxoxy) 2 (f) Use SPSS to do the calculations needed for a quadratic regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Quadratic Regression with Checks of Model, Homoscedasticity, and Normality Assumptions. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Basic Statistics Exercise 46(f) Verify that your SPSS output contains all of the following: tables titled Descriptive Statistics, Correlations, Model Summary, ANOVA, and Coefficients; a normal probability plot; a histogram on which a bell-shaped curve has been superimposed; a plot of standardized predicted values versus standardized residuals. Basic Statistics Exercises 91

27 46 - continued (g) Use the SPSS output to make a statement concerning whether each of the following assumptions in the quadratic regression is satisfied: the assumption of the quadratic model Since the residuals plotted against the predicted values look random, the assumption of the quadratic model appears to be correct. the uniform variance (homoscedasticity) assumption The variation in standardized residuals around the horizontal line looks reasonably uniform. the normality assumption The histogram of standardized residuals looks somewhat bell-shaped, and the points on the normal probability plot do not seem to depart to far from the diagonal line. Based on these observations, we feel it is appropriate to proceed with the quadratic regression analysis. (h) A 0.05 significance level is chosen for the following hypothesis tests: First, write the results of the f test in the ANOVA table for the quadratic regression in a format suitable for a journal article to be submitted for publication. The f test in the ANOVA for the regression to predict IgG from maximal oxygen intake and squared maximal oxygen intake is statistically significant at the 0.05 level (f 2, 27 = , f 2, 27; 0.05 = 3.35, p < 0.001). Basic Statistics Exercises 92

28 46(h) - continued Next, complete the steps outlined below to calculate the f statistic for the hypothesis test see if there is any evidence that the addition of squared maximal oxygen intake (the quadratic term) after maximal oxygen intake (the linear term) is statistically significant; the, write the results of this f test in a format suitable for a journal article to be submitted for publication. SSR(maxoxy, maxoxy 2 ) = the regression sum of squares from the ANOVA table with both maximal oxygen intake and squared maximal oxygen intake in the model = SSR(maxoxy) = the regression sum of squares from the ANOVA table with only maximal oxygen intake in the model = MSE(maxoxy, maxoxy 2 ) = the error mean square from the ANOVA table with both maximal oxygen intake and squared maximal oxygen intake in the model = numerator df for the f statistic = number of new terms added to the model = denominator df for the f statistic = df associated with MSE(maxoxy, maxoxy 2 ) = f statistic = SSR(maxoxy, maxoxy 2 ) SSR(maxoxy) = = MSE(maxoxy, maxoxy 2 ) The addition of squared maximum oxygen intake after maximum oxygen intake to predict IgG is statistically significant at the 0.05 level (f 1, 27 = , f 1, 27; 0.05 = 4.21, p < 0.001). Basic Statistics Exercises 93

29 46 - continued (i) Use the SPSS output to find the equation of the least squares quadratic. The least squares line can be written ^ IgG = (maxoxy) 0.536(maxoxy) 2. (j) Find the multiple R 2, and write a one-sentence interpretation. From the SPSS output, we find R 2 = About 93.8% of the variation in IgG is explained by maximum oxygen intake and squared maximum oxygen intake. (k) Find the standard error of estimate. From the SPSS output, we find s = (l) Use the least squares quadratic to predict the the IgG for a person whose maximal oxygen uptake is 40 milliliters per kilogram (40) 0.536(40) 2 = milligrams Basic Statistics Exercises 94

Unit 27 One-Way Analysis of Variance

Unit 27 One-Way Analysis of Variance Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

ASSIGNMENT 3 SIMPLE LINEAR REGRESSION. Old Faithful

ASSIGNMENT 3 SIMPLE LINEAR REGRESSION. Old Faithful ASSIGNMENT 3 SIMPLE LINEAR REGRESSION In the simple linear regression model, the mean of a response variable is a linear function of an explanatory variable. The model and associated inferential tools

More information

y = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output

y = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation y = a + bx y = dependent variable a = intercept b = slope x = independent variable Section 12.1 Inference for Linear

More information

CHAPTER 10. Regression and Correlation

CHAPTER 10. Regression and Correlation CHAPTER 10 Regression and Correlation In this Chapter we assess the strength of the linear relationship between two continuous variables. If a significant linear relationship is found, the next step would

More information

x3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators

x3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators Multiple Regression Relating a response (dependent, input) y to a set of explanatory (independent, output, predictor) variables x, x 2, x 3,, x q. A technique for modeling the relationship between variables.

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

SMAM 314 Exam 42 Name

SMAM 314 Exam 42 Name SMAM 314 Exam 42 Name Mark the following statements True (T) or False (F) (10 points) 1. F A. The line that best fits points whose X and Y values are negatively correlated should have a positive slope.

More information

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000 Lecture 14 Analysis of Variance * Correlation and Regression Outline Analysis of Variance (ANOVA) 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) Outline Lecture 14 Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) 11-1 Introduction 11- Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Simple Linear Regression

Simple Linear Regression 9-1 l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical Method for Determining Regression 9.4 Least Square Method 9.5 Correlation Coefficient and Coefficient

More information

Simple Linear Regression Using Ordinary Least Squares

Simple Linear Regression Using Ordinary Least Squares Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression

More information

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section: Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 You have until 10:20am to complete this exam. Please remember to put your name,

More information

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION In this lab you will learn how to use Excel to display the relationship between two quantitative variables, measure the strength and direction of the

More information

Inference for the Regression Coefficient

Inference for the Regression Coefficient Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates

More information

171:162 Design and Analysis of Biomedical Studies, Summer 2011 Exam #3, July 16th

171:162 Design and Analysis of Biomedical Studies, Summer 2011 Exam #3, July 16th Name 171:162 Design and Analysis of Biomedical Studies, Summer 2011 Exam #3, July 16th Use the selected SAS output to help you answer the questions. The SAS output is all at the back of the exam on pages

More information

Module 8: Linear Regression. The Applied Research Center

Module 8: Linear Regression. The Applied Research Center Module 8: Linear Regression The Applied Research Center Module 8 Overview } Purpose of Linear Regression } Scatter Diagrams } Regression Equation } Regression Results } Example Purpose } To predict scores

More information

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1 9.1 Scatter Plots and Linear Correlation Answers 1. A high school psychologist wants to conduct a survey to answer the question: Is there a relationship between a student s athletic ability and his/her

More information

LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION

LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION In this lab you will first learn how to display the relationship between two quantitative variables with a scatterplot and also how to measure the strength of

More information

Unit 6 - Introduction to linear regression

Unit 6 - Introduction to linear regression Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,

More information

Ch14. Multiple Regression Analysis

Ch14. Multiple Regression Analysis Ch14. Multiple Regression Analysis 1 Goals : multiple regression analysis Model Building and Estimating More than 1 independent variables Quantitative( 量 ) independent variables Qualitative( ) independent

More information

Analysing data: regression and correlation S6 and S7

Analysing data: regression and correlation S6 and S7 Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association

More information

Inference with Simple Regression

Inference with Simple Regression 1 Introduction Inference with Simple Regression Alan B. Gelder 06E:071, The University of Iowa 1 Moving to infinite means: In this course we have seen one-mean problems, twomean problems, and problems

More information

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Relax and good luck! STP 231 Example EXAM #2. Instructor: Ela Jackiewicz

Relax and good luck! STP 231 Example EXAM #2. Instructor: Ela Jackiewicz STP 31 Example EXAM # Instructor: Ela Jackiewicz Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned.

More information

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1 MAT 2379, Introduction to Biostatistics Sample Calculator Problems for the Final Exam Note: The exam will also contain some problems

More information

Fish act Water temp

Fish act Water temp A regression of the amount of calories in a serving of breakfast cereal vs. the amount of fat gave the following results: Calories = 97.53 + 9.6525(Fat). Which of the following is FALSE? a) It is estimated

More information

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments. Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

16.400/453J Human Factors Engineering. Design of Experiments II

16.400/453J Human Factors Engineering. Design of Experiments II J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3 Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details Section 10.1, 2, 3 Basic components of regression setup Target of inference: linear dependency

More information

AP Statistics - Chapter 2A Extra Practice

AP Statistics - Chapter 2A Extra Practice AP Statistics - Chapter 2A Extra Practice 1. A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study is A) yield

More information

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters Objectives 10.1 Simple linear regression Statistical model for linear regression Estimating the regression parameters Confidence interval for regression parameters Significance test for the slope Confidence

More information

Any of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure.

Any of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure. STATGRAPHICS Rev. 9/13/213 Calibration Models Summary... 1 Data Input... 3 Analysis Summary... 5 Analysis Options... 7 Plot of Fitted Model... 9 Predicted Values... 1 Confidence Intervals... 11 Observed

More information

Lectures on Simple Linear Regression Stat 431, Summer 2012

Lectures on Simple Linear Regression Stat 431, Summer 2012 Lectures on Simple Linear Regression Stat 43, Summer 0 Hyunseung Kang July 6-8, 0 Last Updated: July 8, 0 :59PM Introduction Previously, we have been investigating various properties of the population

More information

Chapter 12 - Lecture 2 Inferences about regression coefficient

Chapter 12 - Lecture 2 Inferences about regression coefficient Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous

More information

23. Inference for regression

23. Inference for regression 23. Inference for regression The Practice of Statistics in the Life Sciences Third Edition 2014 W. H. Freeman and Company Objectives (PSLS Chapter 23) Inference for regression The regression model Confidence

More information

University of California, Berkeley, Statistics 131A: Statistical Inference for the Social and Life Sciences. Michael Lugo, Spring 2012

University of California, Berkeley, Statistics 131A: Statistical Inference for the Social and Life Sciences. Michael Lugo, Spring 2012 University of California, Berkeley, Statistics 3A: Statistical Inference for the Social and Life Sciences Michael Lugo, Spring 202 Solutions to Exam Friday, March 2, 202. [5: 2+2+] Consider the stemplot

More information

1 A Review of Correlation and Regression

1 A Review of Correlation and Regression 1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then

More information

Analysis of Variance. Contents. 1 Analysis of Variance. 1.1 Review. Anthony Tanbakuchi Department of Mathematics Pima Community College

Analysis of Variance. Contents. 1 Analysis of Variance. 1.1 Review. Anthony Tanbakuchi Department of Mathematics Pima Community College Introductory Statistics Lectures Analysis of Variance 1-Way ANOVA: Many sample test of means Department of Mathematics Pima Community College Redistribution of this material is prohibited without written

More information

Simple Linear Regression: One Qualitative IV

Simple Linear Regression: One Qualitative IV Simple Linear Regression: One Qualitative IV 1. Purpose As noted before regression is used both to explain and predict variation in DVs, and adding to the equation categorical variables extends regression

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

UNIT 12 ~ More About Regression

UNIT 12 ~ More About Regression ***SECTION 15.1*** The Regression Model When a scatterplot shows a relationship between a variable x and a y, we can use the fitted to the data to predict y for a given value of x. Now we want to do tests

More information

Six Sigma Black Belt Study Guides

Six Sigma Black Belt Study Guides Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships

More information

STAT 350 Final (new Material) Review Problems Key Spring 2016

STAT 350 Final (new Material) Review Problems Key Spring 2016 1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,

More information

Essential Question: What are the standard intervals for a normal distribution? How are these intervals used to solve problems?

Essential Question: What are the standard intervals for a normal distribution? How are these intervals used to solve problems? Acquisition Lesson Planning Form Plan for the Concept, Topic, or Skill Normal Distributions Key Standards addressed in this Lesson: MM3D2 Time allotted for this Lesson: Standard: MM3D2 Students will solve

More information

Chapter 9. Correlation and Regression

Chapter 9. Correlation and Regression Chapter 9 Correlation and Regression Lesson 9-1/9-2, Part 1 Correlation Registered Florida Pleasure Crafts and Watercraft Related Manatee Deaths 100 80 60 40 20 0 1991 1993 1995 1997 1999 Year Boats in

More information

Lecture notes on Regression & SAS example demonstration

Lecture notes on Regression & SAS example demonstration Regression & Correlation (p. 215) When two variables are measured on a single experimental unit, the resulting data are called bivariate data. You can describe each variable individually, and you can also

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

9 Correlation and Regression

9 Correlation and Regression 9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the

More information

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet

WISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet WISE Regression/Correlation Interactive Lab Introduction to the WISE Correlation/Regression Applet This tutorial focuses on the logic of regression analysis with special attention given to variance components.

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

STAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis

STAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis STAT 3900/4950 MIDTERM TWO Name: Spring, 205 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis Instructions: You may use your books, notes, and SPSS/SAS. NO

More information

Using SPSS for One Way Analysis of Variance

Using SPSS for One Way Analysis of Variance Using SPSS for One Way Analysis of Variance This tutorial will show you how to use SPSS version 12 to perform a one-way, between- subjects analysis of variance and related post-hoc tests. This tutorial

More information

4:3 LEC - PLANNED COMPARISONS AND REGRESSION ANALYSES

4:3 LEC - PLANNED COMPARISONS AND REGRESSION ANALYSES 4:3 LEC - PLANNED COMPARISONS AND REGRESSION ANALYSES FOR SINGLE FACTOR BETWEEN-S DESIGNS Planned or A Priori Comparisons We previously showed various ways to test all possible pairwise comparisons for

More information

Do not copy, post, or distribute

Do not copy, post, or distribute 14 CORRELATION ANALYSIS AND LINEAR REGRESSION Assessing the Covariability of Two Quantitative Properties 14.0 LEARNING OBJECTIVES In this chapter, we discuss two related techniques for assessing a possible

More information

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan COSC 341 Human Computer Interaction Dr. Bowen Hui University of British Columbia Okanagan 1 Last Class Introduced hypothesis testing Core logic behind it Determining results significance in scenario when:

More information

Regression. Marc H. Mehlman University of New Haven

Regression. Marc H. Mehlman University of New Haven Regression Marc H. Mehlman marcmehlman@yahoo.com University of New Haven the statistician knows that in nature there never was a normal distribution, there never was a straight line, yet with normal and

More information

16.3 One-Way ANOVA: The Procedure

16.3 One-Way ANOVA: The Procedure 16.3 One-Way ANOVA: The Procedure Tom Lewis Fall Term 2009 Tom Lewis () 16.3 One-Way ANOVA: The Procedure Fall Term 2009 1 / 10 Outline 1 The background 2 Computing formulas 3 The ANOVA Identity 4 Tom

More information

y n 1 ( x i x )( y y i n 1 i y 2

y n 1 ( x i x )( y y i n 1 i y 2 STP3 Brief Class Notes Instructor: Ela Jackiewicz Chapter Regression and Correlation In this chapter we will explore the relationship between two quantitative variables, X an Y. We will consider n ordered

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of

More information

Lecture 30. DATA 8 Summer Regression Inference

Lecture 30. DATA 8 Summer Regression Inference DATA 8 Summer 2018 Lecture 30 Regression Inference Slides created by John DeNero (denero@berkeley.edu) and Ani Adhikari (adhikari@berkeley.edu) Contributions by Fahad Kamran (fhdkmrn@berkeley.edu) and

More information

Regression Analysis. BUS 735: Business Decision Making and Research

Regression Analysis. BUS 735: Business Decision Making and Research Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

Multiple linear regression S6

Multiple linear regression S6 Basic medical statistics for clinical and experimental research Multiple linear regression S6 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/42 Introduction Two main motivations for doing multiple

More information

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE Course Title: Probability and Statistics (MATH 80) Recommended Textbook(s): Number & Type of Questions: Probability and Statistics for Engineers

More information

K. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij =

K. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij = K. Model Diagnostics We ve already seen how to check model assumptions prior to fitting a one-way ANOVA. Diagnostics carried out after model fitting by using residuals are more informative for assessing

More information

This document contains 3 sets of practice problems.

This document contains 3 sets of practice problems. P RACTICE PROBLEMS This document contains 3 sets of practice problems. Correlation: 3 problems Regression: 4 problems ANOVA: 8 problems You should print a copy of these practice problems and bring them

More information

Factorial Independent Samples ANOVA

Factorial Independent Samples ANOVA Factorial Independent Samples ANOVA Liljenquist, Zhong and Galinsky (2010) found that people were more charitable when they were in a clean smelling room than in a neutral smelling room. Based on that

More information

Linear Correlation and Regression Analysis

Linear Correlation and Regression Analysis Linear Correlation and Regression Analysis Set Up the Calculator 2 nd CATALOG D arrow down DiagnosticOn ENTER ENTER SCATTER DIAGRAM Positive Linear Correlation Positive Correlation Variables will tend

More information

Statistics and Quantitative Analysis U4320

Statistics and Quantitative Analysis U4320 Statistics and Quantitative Analysis U3 Lecture 13: Explaining Variation Prof. Sharyn O Halloran Explaining Variation: Adjusted R (cont) Definition of Adjusted R So we'd like a measure like R, but one

More information

Simple Linear Regression. (Chs 12.1, 12.2, 12.4, 12.5)

Simple Linear Regression. (Chs 12.1, 12.2, 12.4, 12.5) 10 Simple Linear Regression (Chs 12.1, 12.2, 12.4, 12.5) Simple Linear Regression Rating 20 40 60 80 0 5 10 15 Sugar 2 Simple Linear Regression Rating 20 40 60 80 0 5 10 15 Sugar 3 Simple Linear Regression

More information

Independent Samples ANOVA

Independent Samples ANOVA Independent Samples ANOVA In this example students were randomly assigned to one of three mnemonics (techniques for improving memory) rehearsal (the control group; simply repeat the words), visual imagery

More information

Conditions for Regression Inference:

Conditions for Regression Inference: AP Statistics Chapter Notes. Inference for Linear Regression We can fit a least-squares line to any data relating two quantitative variables, but the results are useful only if the scatterplot shows a

More information

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables. Regression Analysis BUS 735: Business Decision Making and Research 1 Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn how to estimate

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Saturday, May 9, 008 Examination time: 3

More information

7. Do not estimate values for y using x-values outside the limits of the data given. This is called extrapolation and is not reliable.

7. Do not estimate values for y using x-values outside the limits of the data given. This is called extrapolation and is not reliable. AP Statistics 15 Inference for Regression I. Regression Review a. r à correlation coefficient or Pearson s coefficient: indicates strength and direction of the relationship between the explanatory variables

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

1. Use Scenario 3-1. In this study, the response variable is

1. Use Scenario 3-1. In this study, the response variable is Chapter 8 Bell Work Scenario 3-1 The height (in feet) and volume (in cubic feet) of usable lumber of 32 cherry trees are measured by a researcher. The goal is to determine if volume of usable lumber can

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

2. Outliers and inference for regression

2. Outliers and inference for regression Unit6: Introductiontolinearregression 2. Outliers and inference for regression Sta 101 - Spring 2016 Duke University, Department of Statistical Science Dr. Çetinkaya-Rundel Slides posted at http://bit.ly/sta101_s16

More information

Math Section MW 1-2:30pm SR 117. Bekki George 206 PGH

Math Section MW 1-2:30pm SR 117. Bekki George 206 PGH Math 3339 Section 21155 MW 1-2:30pm SR 117 Bekki George bekki@math.uh.edu 206 PGH Office Hours: M 11-12:30pm & T,TH 10:00 11:00 am and by appointment Linear Regression (again) Consider the relationship

More information

1. What does the alternate hypothesis ask for a one-way between-subjects analysis of variance?

1. What does the alternate hypothesis ask for a one-way between-subjects analysis of variance? 1. What does the alternate hypothesis ask for a one-way between-subjects analysis of variance? 2. What is the difference between between-group variability and within-group variability? 3. What does between-group

More information

Correlation and simple linear regression S5

Correlation and simple linear regression S5 Basic medical statistics for clinical and eperimental research Correlation and simple linear regression S5 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/41 Introduction Eample: Brain size and

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

Chapter 14 Student Lecture Notes 14-1

Chapter 14 Student Lecture Notes 14-1 Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this

More information

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

Lecture 18: Simple Linear Regression

Lecture 18: Simple Linear Regression Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

Information Sources. Class webpage (also linked to my.ucdavis page for the class):

Information Sources. Class webpage (also linked to my.ucdavis page for the class): STATISTICS 108 Outline for today: Go over syllabus Provide requested information I will hand out blank paper and ask questions Brief introduction and hands-on activity Information Sources Class webpage

More information

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered)

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered) Test 3 Practice Test A NOTE: Ignore Q10 (not covered) MA 180/418 Midterm Test 3, Version A Fall 2010 Student Name (PRINT):............................................. Student Signature:...................................................

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,

More information

Regression Analysis. Table Relationship between muscle contractile force (mj) and stimulus intensity (mv).

Regression Analysis. Table Relationship between muscle contractile force (mj) and stimulus intensity (mv). Regression Analysis Two variables may be related in such a way that the magnitude of one, the dependent variable, is assumed to be a function of the magnitude of the second, the independent variable; however,

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

Checking model assumptions with regression diagnostics

Checking model assumptions with regression diagnostics @graemeleehickey www.glhickey.com graeme.hickey@liverpool.ac.uk Checking model assumptions with regression diagnostics Graeme L. Hickey University of Liverpool Conflicts of interest None Assistant Editor

More information

Taguchi Method and Robust Design: Tutorial and Guideline

Taguchi Method and Robust Design: Tutorial and Guideline Taguchi Method and Robust Design: Tutorial and Guideline CONTENT 1. Introduction 2. Microsoft Excel: graphing 3. Microsoft Excel: Regression 4. Microsoft Excel: Variance analysis 5. Robust Design: An Example

More information

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2 Review 6 Use the traditional method to test the given hypothesis. Assume that the samples are independent and that they have been randomly selected ) A researcher finds that of,000 people who said that

More information

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises LINEAR REGRESSION ANALYSIS MODULE XVI Lecture - 44 Exercises Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Exercise 1 The following data has been obtained on

More information