Basic Statistics Exercises 66
|
|
- Claude Richardson
- 6 years ago
- Views:
Transcription
1 Basic Statistics Exercises 66
2 42. Suppose we are interested in predicting a person's height from the person's length of stride (distance between footprints). The following data is recorded for a random sample of 5 people: Length of Stride (inches) Height (inches) (a) Identify the dependent (response) variable and the independent (explanatory) variable for a regression analysis. The dependent (response) variable is Y = height, and the independent (explanatory) variable is stride length. (b) Does the data appear to be observational or experimental? Since the ages look random, it appears that the data is observational. (c) Use the formulas from Exercise 41 find the equation of the least squares line. ^ y = x (d) Use the least squares line to predict the height of a person whose length of stride is 15 inches. ^ y = (15) = 58.2 inches Basic Statistics Exercises 67
3 43. The prediction of a male's right-hand grip strength from age is to be studied. A 0.05 significance level is to be used with a simple linear regression. The following data is recorded for a random sample of males: Age (years) Grip Strength (lbs.) (a) Identify the dependent (response) variable and the independent (explanatory) variable for a regression analysis. The dependent (response) variable is Y = grip strength, and the independent (explanatory) variable is age. (b) Does the data appear to be observational or experimental? Since the ages look random, it appears that the data is observational. (c) In order to use SPSS to do the calculations needed for the statistical analysis, enter the data into an SPSS data file named age_grip containing two variables named age and grip. Then, go to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), go to the section titled Hypothesis Tests Involving Two Variables, and read the steps in the subsection titled Performing a Simple Linear Regression with Checks of Linearity, Homoscedasticity, and Normality Assumptions. Use these steps as a guide to obtaining the output needed for the remainder of this exercise. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Basic Statistics Exercise 43(c) Verify that your SPSS output contains all of the following: Basic Statistics Exercises 68
4 43. - continued Basic Statistics Exercises 69
5 43. - continued Basic Statistics Exercises 70
6 43. - continued (b) Use the SPSS output to find each of the following: n = r = (11)(4.824) 2 = (c) Use the SPSS output to find the equation of the least squares line. ^ The least squares line can be written y = x. (d) Write a one-sentence interpretation of the slope in the least squares line. Grip strength appears to increase on average by about 2 pounds with each increase of one year in age. (e) Find the coefficient of determination, and write a one-sentence interpretation. From the SPSS output, we find r 2 = About 59.3% of the variation in grip strength is explained by age. (f) Find the standard error of estimate. From the SPSS output, we find s = Basic Statistics Exercises 71
7 43. - continued (g) Use the SPSS output to make a statement concerning whether each of the following assumptions in a simple linear regression is satisfied: the linearity assumption The data points appear to be randomly distributed about the least squares line on the scatter plot, and the residuals plotted against the predicted values look random. Consequently, the linearity assumption appears to be satisfied. the uniform variance (homoscedasticity) assumption The variation in standardized residuals around the horizontal line looks reasonably uniform. the normality assumption The histogram of standardized residuals looks somewhat bell-shaped, and the points on the normal probability plot do not seem to depart too far from the diagonal line. Since the necessary assumptions appear to be satisfied, we feel it is appropriate to proceed with the regression analysis. Basic Statistics Exercises 72
8 43 - continued (h) A 0.05 significance level is chosen for a hypothesis test to see if there is any evidence that the linear relationship between age and grip strength is significant, that is, that the slope in the regression is significantly different from zero (0). Write the results of this hypothesis test two different ways: Write the results of the f test in the ANOVA table in a format suitable for a journal article to be submitted for publication. The f test in the ANOVA for the regression to predict grip strength from age is statistically significant at the 0.05 level (f 1, 10 = , f 1, 10; 0.05 = 4.96, < p < 0.01 OR p = 0.003). We conclude that the linear relationship between age and grip strength is significant, and the data suggest a positive relationship. Write the results of the t test about the slope in a format suitable for a journal article to be submitted for publication. With a t test, the sample slope (2.00 lbs.) is statistically significantly different from zero at the 0.05 level (t 10 = 3.814, t 10; = 2.228, < p < 0.01 OR p = 0.003). We can be 95% confident that the slope in the regression to predict grip strength from age is between and lbs. Considering the results of the hypothesis test, decide whether or not a 95% confidence interval for the slope in the regression would be of interest. If yes, find and interpret the confidence interval; if not, explain why. Since rejecting H 0 suggests that the hypothesized zero slope is not correct, a 95% confidence interval will provide us with some information about the slope, which estimates the average change in grip strength with an increase of one year in age. 2 (2.228)(8.390/ ), 2 + (2.228)(8.390/ ) Basic Statistics Exercises 73
9 43 - continued (i) A 0.05 significance level is chosen for a hypothesis test to see if there is any evidence that the mean grip strength for 20 year old right-handed males is different from 80 lbs. Write the results of this hypothesis test in a format suitable for a journal article to be submitted for publication. With a t test, the estimated mean grip strength (66 lbs.) is statistically significantly different from the hypothesized mean (80 lbs.) at the 0.05 level (t 10 = 5.304, t 10; = 2.228, p < 0.001). We can be 95% confident that the mean grip strength for 20 year old right-handed males is between and lbs. Considering the results of the hypothesis test, decide whether or not a 95% confidence interval for the mean grip strength for 20 year old right-handed males would be of interest. If yes, find and interpret the confidence interval; if not, explain why. Since rejecting H 0 suggests that the hypothesized mean grip strength for 20 year old right-handed males is not correct, a 95% confidence interval will provide us with some information about this mean (20) = (2.228)( /12 + (20 18) 2 /255.98), 66 + (2.228)(8.390/ 1/12 + (20 18) 2 /255.98) Basic Statistics Exercises 74
10 43 - continued (j) Find and interpret a 95% prediction interval for the grip strength of a 20 year old right-handed male (20) = (2.228)( /12 + (20 18) 2 /255.98), 66 + (2.228)(8.390/ 1 + 1/12 + (20 18) 2 /255.98) We are 95% confident that the grip strength for a randomly selected 20-year old right-handed male will be between and lbs. OR At least 95% of 20-year old right-handed males have a grip strength between and lbs. (k) For what age group of right-handed males will the confidence interval for mean grip strength and the prediction interval for a particular grip strength both have the smallest length? 18 year olds Basic Statistics Exercises 75
11 44. The prediction of score (0-100) on a test from hours of study is to be studied. A 0.05 significance level is to be used with a simple linear regression. The following data is recorded for a random sample of students: Study Time(hrs) Test Score(points) (a) Identify the dependent (response) variable and the independent (explanatory) variable for a regression analysis. The dependent (response) variable is Y = test score, and the independent (explanatory) variable is study time. (b) Does the data appear to be observational or experimental? Since the times do not look random, it appears that the data is experimental. (c) In order to use SPSS to do the calculations needed for the statistical analysis, enter the data into an SPSS data file named time_score containing two variables named time and score. Then, go to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), go to the section titled Hypothesis Tests Involving Two Variables, and read the steps in the subsection titled Performing a Simple Linear Regression with Checks of Linearity, Homoscedasticity, and Normality Assumptions. Use these steps as a guide to obtaining the output needed for the remainder of this exercise. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Basic Statistics Exercise 44(c) Verify that your SPSS output contains all of the following: a scatter plot displaying the least squares line; tables titled Descriptive Statistics, Correlations, Model Summary, ANOVA, and Coefficients; a normal probability plot; a histogram on which a bell-shaped curve has been superimposed; a plot of standardized predicted values versus standardized residuals. Basic Statistics Exercises 76
12 44 - continued (b) Use the SPSS output to find each of the following: n = r = (14)(2.928) 2 = (c) Use the SPSS output to find the equation of the least squares line. ^ The least squares line can be written y = x. (d) Write a one-sentence interpretation of the slope in the least squares line. Test score appears to increase on average by about points with each increase of one hour in study time. (e) Find the coefficient of determination, and write a one-sentence interpretation. From the SPSS output, we find r 2 = About 28.1% of the variation in test score is explained by study time. (f) Find the standard error of estimate. From the SPSS output, we find s = Basic Statistics Exercises 77
13 44. - continued (g) Use the SPSS output to make a statement concerning whether each of the following assumptions in a simple linear regression is satisfied: the linearity assumption Since the data points appear to be randomly distributed about the least squares line on the scatter plot, and the residuals plotted against the predicted values look random. Consequently, the linearity assumption appears to be satisfied. the uniform variance (homoscedasticity) assumption The variation in standardized residuals around the horizontal line looks reasonably uniform. the normality assumption The histogram of standardized residuals looks somewhat bell-shaped, even though the points on the normal probability plot do not seem to depart too far from the diagonal line. Since the necessary assumptions do not appear to be drastically violated, we feel it is appropriate to proceed with the regression analysis. Basic Statistics Exercises 78
14 44 - continued In the Word document named Basic_Statistics_Result_Summaries (created previously), begin a section titled Basic Statistics Exercises 44. In this section, create a subsection for each of parts (h), (i), and (j) which follow, and in each subsection created, write the summaries for the corresponding part. Print the page(s) and insert them immediately after this page. (h) A 0.05 significance level is chosen for a hypothesis test to see if there is any evidence that the linear relationship between study time and test score is significant, that is, that the slope in the regression is significantly different from zero (0). Write the results of this hypothesis test two different ways: Write the results of the f test in the ANOVA table in a format suitable for a journal article to be submitted for publication. Write the results of the t test about the slope in a format suitable for a journal article to be submitted for publication. Also, considering the results of this hypothesis test, decide whether or not a 95% confidence interval for the slope in the regression would be of interest. If yes, find and interpret the confidence interval; if not, explain why. (i) A 0.05 significance level is chosen for a hypothesis test to see if there is any evidence that the mean test score for students who study for 5 hours is different from 85 points. Write the results of this hypothesis test in a format suitable for a journal article to be submitted for publication. Considering the results of the hypothesis test, decide whether or not a 95% confidence interval for the mean test score for students who study for 5 hours would be of interest. If yes, find and interpret the confidence interval; if not, explain why. (j) Find and interpret a 95% prediction interval for the test score of a student who studied for 5 hours. (k) For what study time will the confidence interval for mean test score and the prediction interval for a particular test score both have the smallest length? 6 hours Basic Statistics Exercises 79
15 45. In a study of the impact of temperature during the summer months on the maximum amount of power that must be generated to meet demand each day, the prediction of daily peak power load (megawatts) from daily high temperature (degrees Fahrenheit) is of interest. Data for 25 randomly selected summer days is stored in the SPSS data file powerloads (which can be accessed from the appropriate link on the course syllabus web page). A 0.05 significance level is chosen for hypothesis testing. (a) Identify the dependent (response) variable and the independent (explanatory) variable for a regression analysis. The dependent (response) variable is Y = daily peak power load, and the independent (explanatory) variable is X = daily high temperature. (b) Does the data appear to be observational or experimental? Since the daily high temperature is random, the data is observational. (c) Use SPSS to do the calculations needed for a simple linear regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two Variables, and reading the steps in the subsection titled Performing a Simple Linear Regression with Checks of Linearity, Homoscedasticity, and Normality Assumptions. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Basic Statistics Exercise 45(c) Verify that your SPSS output contains all of the following: Basic Statistics Exercises 80
16 45. - continued Basic Statistics Exercises 81
17 45. - continued Basic Statistics Exercises 82
18 45. - continued (d) Use the SPSS output to make a statement concerning whether each of the following assumptions in a simple linear regression is satisfied: the linearity assumption The data points do not appear to be randomly distributed about the least squares line on the scatter plot; it seems that as temperature increases, power load increases at a faster rate. Also, the residuals plotted against the predicted values do not look random. Consequently, the linearity assumption does not appear to be satisfied. the uniform variance (homoscedasticity) assumption the normality assumption Since the linearity assumption does not appear to be satisfied, it is not possible (or even relevant) to consider the uniform variance and normality assumptions. We do not feel it is appropriate to proceed with the regression analysis. Basic Statistics Exercises 83
19 45 - continued (e) It is decided that a quadratic model to predict daily peak power load from daily high temperature will be considered to improve prediction. Write an equation which describes this model. Y = a + b 1 X + b 2 X 2 OR powerload = a + b 1 (temp) + b 2 (temp) 2 (f) Use SPSS to do the calculations needed for a quadratic regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Quadratic Regression with Checks of Model, Homoscedasticity, and Normality Assumptions. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Basic Statistics Exercise 45(f) Verify that your SPSS output contains all of the following: Basic Statistics Exercises 84
20 45 - continued Basic Statistics Exercises 85
21 45 - continued (g) Use the SPSS output to make a statement concerning whether each of the following assumptions in the quadratic regression is satisfied: the assumption of the quadratic model Since the residuals plotted against the predicted values look random, the assumption of the quadratic model appears to be correct. the uniform variance (homoscedasticity) assumption The variation in standardized residuals around the horizontal line looks reasonably uniform. the normality assumption The histogram of standardized residuals looks somewhat bell-shaped, and the points on the normal probability plot do not seem to depart to far from the diagonal line. Based on these observations, we feel it is appropriate to proceed with the quadratic regression analysis. (h) A 0.05 significance level is chosen for the following hypothesis tests: First, write the results of the f test in the ANOVA table for the quadratic regression in a format suitable for a journal article to be submitted for publication. The f test in the ANOVA for the regression to predict power load from temperature and squared temperature is statistically significant at the 0.05 level (f 2, 22 = , f 2, 22; 0.05 = 3.44, p < 0.001). Basic Statistics Exercises 86
22 45(h) - continued Next, complete the steps outlined below to calculate the f statistic for the hypothesis test see if there is any evidence that the addition of squared temperature (the quadratic term) after temperature (the linear term) is statistically significant; the, write the results of this f test in a format suitable for a journal article to be submitted for publication. SSR(temp, temp 2 ) = the regression sum of squares from the ANOVA table with both temperature and squared temperature in the model = SSR(temp) = the regression sum of squares from the ANOVA table with only temperature in the model = MSE(temp, temp 2 ) = the error mean square from the ANOVA table with both temperature and squared temperature in the model = numerator df for the f statistic = number of new terms added to the model = denominator df for the f statistic = df associated with MSE(temp, temp 2 ) = SSR(temp, temp 2 ) SSR(temp) f statistic = = = MSE(temp, temp 2 ) The addition of squared temperature after temperature to predict power load is statistically significant at the 0.05 level (f 1, 22 = , f 1, 22; 0.05 = 4.30, p < 0.001). Basic Statistics Exercises 87
23 45 - continued (i) Use the SPSS output to find the equation of the least squares quadratic. The least squares line can be written ^ load = (temp) (temp) 2. (j) Find the multiple R 2, and write a one-sentence interpretation. From the SPSS output, we find R 2 = About 95.9% of the variation in power load is explained by temperature and squared temperature. (k) Find the standard error of estimate. From the SPSS output, we find s = (l) Use the least squares quadratic to predict the daily peak power load on a day when the high temperature is 75 degrees Fahrenheit, and also on a day when the high temperature is 85 degrees Fahrenheit (75) (75) 2 = megawatts (85) (85) 2 = megawatts Basic Statistics Exercises 88
24 46. In a study to predict IgG (milligrams of immunoglobulin in blood), which is an indicator of long-term immunity, from maximal oxygen uptake (milliliters per kilogram), which is a measure of aerobic fitness level, data is taken on randomly selected subjects and stored in the SPSS data file aerobic (which can be accessed from the appropriate link on the course syllabus web page). A 0.05 significance level is chosen for hypothesis testing. (a) Identify the dependent (response) variable and the independent (explanatory) variable for a regression analysis. The dependent (response) variable is Y = IgG, and the independent (explanatory) variable is X = maximal oxygen uptake. (b) Does the data appear to be observational or experimental? Since the maximal oxygen uptake is random, the data is observational. (c) Use SPSS to do the calculations needed for a simple linear regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two Variables, and reading the steps in the subsection titled Performing a Simple Linear Regression with Checks of Linearity, Homoscedasticity, and Normality Assumptions. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Basic Statistics Exercise 46(c) Verify that your SPSS output contains all of the following: a scatter plot displaying the least squares line; tables titled Descriptive Statistics, Correlations, Model Summary, ANOVA, and Coefficients; a normal probability plot; a histogram on which a bell-shaped curve has been superimposed; a plot of standardized predicted values versus standardized residuals. Basic Statistics Exercises 89
25 46. - continued (d) Use the SPSS output to make a statement concerning whether each of the following assumptions in a simple linear regression is satisfied: the linearity assumption The data points do not appear to be randomly distributed about the least squares line on the scatter plot; it seems that as maximal oxygen uptake increases, IgG increases at a slower rate. Also, the residuals plotted against the predicted values do not look random. Consequently, the linearity assumption does not appear to be satisfied. the uniform variance (homoscedasticity) assumption the normality assumption Since the linearity assumption does not appear to be satisfied, it is not possible (or even relevant) to consider the uniform variance and normality assumptions. We do not feel it is appropriate to proceed with the regression analysis. Basic Statistics Exercises 90
26 46 - continued (e) It is decided that a quadratic model to predict IgG from maximum oxygen intake will be considered to improve prediction. Write an equation which describes this model. Y = a + b 1 X + b 2 X 2 OR IgG = a + b 1 (maxoxy) + b 2 (maxoxy) 2 (f) Use SPSS to do the calculations needed for a quadratic regression by going to the document titled Using SPSS for Windows (which can be accessed from the appropriate link on the course syllabus web page), going to the section titled Hypothesis Tests Involving Two or More Variables, and reading the steps in the subsection titled Performing a Quadratic Regression with Checks of Model, Homoscedasticity, and Normality Assumptions. Once you have successfully generated SPSS output, add a title to the top of the output in the following format: YOUR NAME Basic Statistics Exercise 46(f) Verify that your SPSS output contains all of the following: tables titled Descriptive Statistics, Correlations, Model Summary, ANOVA, and Coefficients; a normal probability plot; a histogram on which a bell-shaped curve has been superimposed; a plot of standardized predicted values versus standardized residuals. Basic Statistics Exercises 91
27 46 - continued (g) Use the SPSS output to make a statement concerning whether each of the following assumptions in the quadratic regression is satisfied: the assumption of the quadratic model Since the residuals plotted against the predicted values look random, the assumption of the quadratic model appears to be correct. the uniform variance (homoscedasticity) assumption The variation in standardized residuals around the horizontal line looks reasonably uniform. the normality assumption The histogram of standardized residuals looks somewhat bell-shaped, and the points on the normal probability plot do not seem to depart to far from the diagonal line. Based on these observations, we feel it is appropriate to proceed with the quadratic regression analysis. (h) A 0.05 significance level is chosen for the following hypothesis tests: First, write the results of the f test in the ANOVA table for the quadratic regression in a format suitable for a journal article to be submitted for publication. The f test in the ANOVA for the regression to predict IgG from maximal oxygen intake and squared maximal oxygen intake is statistically significant at the 0.05 level (f 2, 27 = , f 2, 27; 0.05 = 3.35, p < 0.001). Basic Statistics Exercises 92
28 46(h) - continued Next, complete the steps outlined below to calculate the f statistic for the hypothesis test see if there is any evidence that the addition of squared maximal oxygen intake (the quadratic term) after maximal oxygen intake (the linear term) is statistically significant; the, write the results of this f test in a format suitable for a journal article to be submitted for publication. SSR(maxoxy, maxoxy 2 ) = the regression sum of squares from the ANOVA table with both maximal oxygen intake and squared maximal oxygen intake in the model = SSR(maxoxy) = the regression sum of squares from the ANOVA table with only maximal oxygen intake in the model = MSE(maxoxy, maxoxy 2 ) = the error mean square from the ANOVA table with both maximal oxygen intake and squared maximal oxygen intake in the model = numerator df for the f statistic = number of new terms added to the model = denominator df for the f statistic = df associated with MSE(maxoxy, maxoxy 2 ) = f statistic = SSR(maxoxy, maxoxy 2 ) SSR(maxoxy) = = MSE(maxoxy, maxoxy 2 ) The addition of squared maximum oxygen intake after maximum oxygen intake to predict IgG is statistically significant at the 0.05 level (f 1, 27 = , f 1, 27; 0.05 = 4.21, p < 0.001). Basic Statistics Exercises 93
29 46 - continued (i) Use the SPSS output to find the equation of the least squares quadratic. The least squares line can be written ^ IgG = (maxoxy) 0.536(maxoxy) 2. (j) Find the multiple R 2, and write a one-sentence interpretation. From the SPSS output, we find R 2 = About 93.8% of the variation in IgG is explained by maximum oxygen intake and squared maximum oxygen intake. (k) Find the standard error of estimate. From the SPSS output, we find s = (l) Use the least squares quadratic to predict the the IgG for a person whose maximal oxygen uptake is 40 milliliters per kilogram (40) 0.536(40) 2 = milligrams Basic Statistics Exercises 94
Unit 27 One-Way Analysis of Variance
Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationASSIGNMENT 3 SIMPLE LINEAR REGRESSION. Old Faithful
ASSIGNMENT 3 SIMPLE LINEAR REGRESSION In the simple linear regression model, the mean of a response variable is a linear function of an explanatory variable. The model and associated inferential tools
More informationy = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output
12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation y = a + bx y = dependent variable a = intercept b = slope x = independent variable Section 12.1 Inference for Linear
More informationCHAPTER 10. Regression and Correlation
CHAPTER 10 Regression and Correlation In this Chapter we assess the strength of the linear relationship between two continuous variables. If a significant linear relationship is found, the next step would
More informationx3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators
Multiple Regression Relating a response (dependent, input) y to a set of explanatory (independent, output, predictor) variables x, x 2, x 3,, x q. A technique for modeling the relationship between variables.
More informationCorrelation & Simple Regression
Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.
More informationSMAM 314 Exam 42 Name
SMAM 314 Exam 42 Name Mark the following statements True (T) or False (F) (10 points) 1. F A. The line that best fits points whose X and Y values are negatively correlated should have a positive slope.
More informationLecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000
Lecture 14 Analysis of Variance * Correlation and Regression Outline Analysis of Variance (ANOVA) 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination
More informationLecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)
Outline Lecture 14 Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) 11-1 Introduction 11- Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination
More informationSimple Linear Regression
9-1 l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical Method for Determining Regression 9.4 Least Square Method 9.5 Correlation Coefficient and Coefficient
More informationSimple Linear Regression Using Ordinary Least Squares
Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression
More informationEcn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:
Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 You have until 10:20am to complete this exam. Please remember to put your name,
More informationLAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION
LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION In this lab you will learn how to use Excel to display the relationship between two quantitative variables, measure the strength and direction of the
More informationInference for the Regression Coefficient
Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates
More information171:162 Design and Analysis of Biomedical Studies, Summer 2011 Exam #3, July 16th
Name 171:162 Design and Analysis of Biomedical Studies, Summer 2011 Exam #3, July 16th Use the selected SAS output to help you answer the questions. The SAS output is all at the back of the exam on pages
More informationModule 8: Linear Regression. The Applied Research Center
Module 8: Linear Regression The Applied Research Center Module 8 Overview } Purpose of Linear Regression } Scatter Diagrams } Regression Equation } Regression Results } Example Purpose } To predict scores
More informationAnswer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1
9.1 Scatter Plots and Linear Correlation Answers 1. A high school psychologist wants to conduct a survey to answer the question: Is there a relationship between a student s athletic ability and his/her
More informationLAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION
LAB 3 INSTRUCTIONS SIMPLE LINEAR REGRESSION In this lab you will first learn how to display the relationship between two quantitative variables with a scatterplot and also how to measure the strength of
More informationUnit 6 - Introduction to linear regression
Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,
More informationCh14. Multiple Regression Analysis
Ch14. Multiple Regression Analysis 1 Goals : multiple regression analysis Model Building and Estimating More than 1 independent variables Quantitative( 量 ) independent variables Qualitative( ) independent
More informationAnalysing data: regression and correlation S6 and S7
Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association
More informationInference with Simple Regression
1 Introduction Inference with Simple Regression Alan B. Gelder 06E:071, The University of Iowa 1 Moving to infinite means: In this course we have seen one-mean problems, twomean problems, and problems
More informationChapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression
Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing
More informationReview of Multiple Regression
Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate
More informationRelax and good luck! STP 231 Example EXAM #2. Instructor: Ela Jackiewicz
STP 31 Example EXAM # Instructor: Ela Jackiewicz Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned.
More informationMAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics
MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1 MAT 2379, Introduction to Biostatistics Sample Calculator Problems for the Final Exam Note: The exam will also contain some problems
More informationFish act Water temp
A regression of the amount of calories in a serving of breakfast cereal vs. the amount of fat gave the following results: Calories = 97.53 + 9.6525(Fat). Which of the following is FALSE? a) It is estimated
More informationAnalysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.
Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a
More informationOne-Way ANOVA. Some examples of when ANOVA would be appropriate include:
One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement
More information16.400/453J Human Factors Engineering. Design of Experiments II
J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential
More informationInference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3
Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details Section 10.1, 2, 3 Basic components of regression setup Target of inference: linear dependency
More informationAP Statistics - Chapter 2A Extra Practice
AP Statistics - Chapter 2A Extra Practice 1. A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study is A) yield
More informationObjectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters
Objectives 10.1 Simple linear regression Statistical model for linear regression Estimating the regression parameters Confidence interval for regression parameters Significance test for the slope Confidence
More informationAny of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure.
STATGRAPHICS Rev. 9/13/213 Calibration Models Summary... 1 Data Input... 3 Analysis Summary... 5 Analysis Options... 7 Plot of Fitted Model... 9 Predicted Values... 1 Confidence Intervals... 11 Observed
More informationLectures on Simple Linear Regression Stat 431, Summer 2012
Lectures on Simple Linear Regression Stat 43, Summer 0 Hyunseung Kang July 6-8, 0 Last Updated: July 8, 0 :59PM Introduction Previously, we have been investigating various properties of the population
More informationChapter 12 - Lecture 2 Inferences about regression coefficient
Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous
More information23. Inference for regression
23. Inference for regression The Practice of Statistics in the Life Sciences Third Edition 2014 W. H. Freeman and Company Objectives (PSLS Chapter 23) Inference for regression The regression model Confidence
More informationUniversity of California, Berkeley, Statistics 131A: Statistical Inference for the Social and Life Sciences. Michael Lugo, Spring 2012
University of California, Berkeley, Statistics 3A: Statistical Inference for the Social and Life Sciences Michael Lugo, Spring 202 Solutions to Exam Friday, March 2, 202. [5: 2+2+] Consider the stemplot
More information1 A Review of Correlation and Regression
1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then
More informationAnalysis of Variance. Contents. 1 Analysis of Variance. 1.1 Review. Anthony Tanbakuchi Department of Mathematics Pima Community College
Introductory Statistics Lectures Analysis of Variance 1-Way ANOVA: Many sample test of means Department of Mathematics Pima Community College Redistribution of this material is prohibited without written
More informationSimple Linear Regression: One Qualitative IV
Simple Linear Regression: One Qualitative IV 1. Purpose As noted before regression is used both to explain and predict variation in DVs, and adding to the equation categorical variables extends regression
More informationMidterm 2 - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put
More informationUNIT 12 ~ More About Regression
***SECTION 15.1*** The Regression Model When a scatterplot shows a relationship between a variable x and a y, we can use the fitted to the data to predict y for a given value of x. Now we want to do tests
More informationSix Sigma Black Belt Study Guides
Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships
More informationSTAT 350 Final (new Material) Review Problems Key Spring 2016
1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,
More informationEssential Question: What are the standard intervals for a normal distribution? How are these intervals used to solve problems?
Acquisition Lesson Planning Form Plan for the Concept, Topic, or Skill Normal Distributions Key Standards addressed in this Lesson: MM3D2 Time allotted for this Lesson: Standard: MM3D2 Students will solve
More informationChapter 9. Correlation and Regression
Chapter 9 Correlation and Regression Lesson 9-1/9-2, Part 1 Correlation Registered Florida Pleasure Crafts and Watercraft Related Manatee Deaths 100 80 60 40 20 0 1991 1993 1995 1997 1999 Year Boats in
More informationLecture notes on Regression & SAS example demonstration
Regression & Correlation (p. 215) When two variables are measured on a single experimental unit, the resulting data are called bivariate data. You can describe each variable individually, and you can also
More information9. Linear Regression and Correlation
9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,
More information9 Correlation and Regression
9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the
More informationWISE Regression/Correlation Interactive Lab. Introduction to the WISE Correlation/Regression Applet
WISE Regression/Correlation Interactive Lab Introduction to the WISE Correlation/Regression Applet This tutorial focuses on the logic of regression analysis with special attention given to variance components.
More informationFinal Exam - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your
More informationSTAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis
STAT 3900/4950 MIDTERM TWO Name: Spring, 205 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis Instructions: You may use your books, notes, and SPSS/SAS. NO
More informationUsing SPSS for One Way Analysis of Variance
Using SPSS for One Way Analysis of Variance This tutorial will show you how to use SPSS version 12 to perform a one-way, between- subjects analysis of variance and related post-hoc tests. This tutorial
More information4:3 LEC - PLANNED COMPARISONS AND REGRESSION ANALYSES
4:3 LEC - PLANNED COMPARISONS AND REGRESSION ANALYSES FOR SINGLE FACTOR BETWEEN-S DESIGNS Planned or A Priori Comparisons We previously showed various ways to test all possible pairwise comparisons for
More informationDo not copy, post, or distribute
14 CORRELATION ANALYSIS AND LINEAR REGRESSION Assessing the Covariability of Two Quantitative Properties 14.0 LEARNING OBJECTIVES In this chapter, we discuss two related techniques for assessing a possible
More informationCOSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan
COSC 341 Human Computer Interaction Dr. Bowen Hui University of British Columbia Okanagan 1 Last Class Introduced hypothesis testing Core logic behind it Determining results significance in scenario when:
More informationRegression. Marc H. Mehlman University of New Haven
Regression Marc H. Mehlman marcmehlman@yahoo.com University of New Haven the statistician knows that in nature there never was a normal distribution, there never was a straight line, yet with normal and
More information16.3 One-Way ANOVA: The Procedure
16.3 One-Way ANOVA: The Procedure Tom Lewis Fall Term 2009 Tom Lewis () 16.3 One-Way ANOVA: The Procedure Fall Term 2009 1 / 10 Outline 1 The background 2 Computing formulas 3 The ANOVA Identity 4 Tom
More informationy n 1 ( x i x )( y y i n 1 i y 2
STP3 Brief Class Notes Instructor: Ela Jackiewicz Chapter Regression and Correlation In this chapter we will explore the relationship between two quantitative variables, X an Y. We will consider n ordered
More informationStatistics for Managers using Microsoft Excel 6 th Edition
Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of
More informationLecture 30. DATA 8 Summer Regression Inference
DATA 8 Summer 2018 Lecture 30 Regression Inference Slides created by John DeNero (denero@berkeley.edu) and Ani Adhikari (adhikari@berkeley.edu) Contributions by Fahad Kamran (fhdkmrn@berkeley.edu) and
More informationRegression Analysis. BUS 735: Business Decision Making and Research
Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn
More informationREVIEW 8/2/2017 陈芳华东师大英语系
REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p
More informationMultiple linear regression S6
Basic medical statistics for clinical and experimental research Multiple linear regression S6 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/42 Introduction Two main motivations for doing multiple
More informationFRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE
FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE Course Title: Probability and Statistics (MATH 80) Recommended Textbook(s): Number & Type of Questions: Probability and Statistics for Engineers
More informationK. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij =
K. Model Diagnostics We ve already seen how to check model assumptions prior to fitting a one-way ANOVA. Diagnostics carried out after model fitting by using residuals are more informative for assessing
More informationThis document contains 3 sets of practice problems.
P RACTICE PROBLEMS This document contains 3 sets of practice problems. Correlation: 3 problems Regression: 4 problems ANOVA: 8 problems You should print a copy of these practice problems and bring them
More informationFactorial Independent Samples ANOVA
Factorial Independent Samples ANOVA Liljenquist, Zhong and Galinsky (2010) found that people were more charitable when they were in a clean smelling room than in a neutral smelling room. Based on that
More informationLinear Correlation and Regression Analysis
Linear Correlation and Regression Analysis Set Up the Calculator 2 nd CATALOG D arrow down DiagnosticOn ENTER ENTER SCATTER DIAGRAM Positive Linear Correlation Positive Correlation Variables will tend
More informationStatistics and Quantitative Analysis U4320
Statistics and Quantitative Analysis U3 Lecture 13: Explaining Variation Prof. Sharyn O Halloran Explaining Variation: Adjusted R (cont) Definition of Adjusted R So we'd like a measure like R, but one
More informationSimple Linear Regression. (Chs 12.1, 12.2, 12.4, 12.5)
10 Simple Linear Regression (Chs 12.1, 12.2, 12.4, 12.5) Simple Linear Regression Rating 20 40 60 80 0 5 10 15 Sugar 2 Simple Linear Regression Rating 20 40 60 80 0 5 10 15 Sugar 3 Simple Linear Regression
More informationIndependent Samples ANOVA
Independent Samples ANOVA In this example students were randomly assigned to one of three mnemonics (techniques for improving memory) rehearsal (the control group; simply repeat the words), visual imagery
More informationConditions for Regression Inference:
AP Statistics Chapter Notes. Inference for Linear Regression We can fit a least-squares line to any data relating two quantitative variables, but the results are useful only if the scatterplot shows a
More informationRegression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.
Regression Analysis BUS 735: Business Decision Making and Research 1 Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn how to estimate
More informationChapter 16. Simple Linear Regression and Correlation
Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will
More informationSTOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Saturday, May 9, 008 Examination time: 3
More information7. Do not estimate values for y using x-values outside the limits of the data given. This is called extrapolation and is not reliable.
AP Statistics 15 Inference for Regression I. Regression Review a. r à correlation coefficient or Pearson s coefficient: indicates strength and direction of the relationship between the explanatory variables
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More information1. Use Scenario 3-1. In this study, the response variable is
Chapter 8 Bell Work Scenario 3-1 The height (in feet) and volume (in cubic feet) of usable lumber of 32 cherry trees are measured by a researcher. The goal is to determine if volume of usable lumber can
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science
UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator
More information2. Outliers and inference for regression
Unit6: Introductiontolinearregression 2. Outliers and inference for regression Sta 101 - Spring 2016 Duke University, Department of Statistical Science Dr. Çetinkaya-Rundel Slides posted at http://bit.ly/sta101_s16
More informationMath Section MW 1-2:30pm SR 117. Bekki George 206 PGH
Math 3339 Section 21155 MW 1-2:30pm SR 117 Bekki George bekki@math.uh.edu 206 PGH Office Hours: M 11-12:30pm & T,TH 10:00 11:00 am and by appointment Linear Regression (again) Consider the relationship
More information1. What does the alternate hypothesis ask for a one-way between-subjects analysis of variance?
1. What does the alternate hypothesis ask for a one-way between-subjects analysis of variance? 2. What is the difference between between-group variability and within-group variability? 3. What does between-group
More informationCorrelation and simple linear regression S5
Basic medical statistics for clinical and eperimental research Correlation and simple linear regression S5 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/41 Introduction Eample: Brain size and
More informationBasic Business Statistics 6 th Edition
Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based
More informationChapter 14 Student Lecture Notes 14-1
Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this
More informationChapter 4. Regression Models. Learning Objectives
Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing
More informationLecture 18: Simple Linear Regression
Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength
More informationy response variable x 1, x 2,, x k -- a set of explanatory variables
11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate
More informationInformation Sources. Class webpage (also linked to my.ucdavis page for the class):
STATISTICS 108 Outline for today: Go over syllabus Provide requested information I will hand out blank paper and ask questions Brief introduction and hands-on activity Information Sources Class webpage
More informationTest 3 Practice Test A. NOTE: Ignore Q10 (not covered)
Test 3 Practice Test A NOTE: Ignore Q10 (not covered) MA 180/418 Midterm Test 3, Version A Fall 2010 Student Name (PRINT):............................................. Student Signature:...................................................
More informationTable of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).
Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,
More informationRegression Analysis. Table Relationship between muscle contractile force (mj) and stimulus intensity (mv).
Regression Analysis Two variables may be related in such a way that the magnitude of one, the dependent variable, is assumed to be a function of the magnitude of the second, the independent variable; however,
More informationInference for Regression
Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu
More informationChecking model assumptions with regression diagnostics
@graemeleehickey www.glhickey.com graeme.hickey@liverpool.ac.uk Checking model assumptions with regression diagnostics Graeme L. Hickey University of Liverpool Conflicts of interest None Assistant Editor
More informationTaguchi Method and Robust Design: Tutorial and Guideline
Taguchi Method and Robust Design: Tutorial and Guideline CONTENT 1. Introduction 2. Microsoft Excel: graphing 3. Microsoft Excel: Regression 4. Microsoft Excel: Variance analysis 5. Robust Design: An Example
More informationReview 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2
Review 6 Use the traditional method to test the given hypothesis. Assume that the samples are independent and that they have been randomly selected ) A researcher finds that of,000 people who said that
More informationLINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises
LINEAR REGRESSION ANALYSIS MODULE XVI Lecture - 44 Exercises Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Exercise 1 The following data has been obtained on
More information