Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES

Size: px
Start display at page:

Download "Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES"

Transcription

1 M04_REND6289_10_IM_C04.QXD 5/7/08 2:49 PM Page 46 4 C H A P T E R Regression Models TEACHING SUGGESTIONS Teaching Suggestion 4.1: Which Is the Independent Variable? We find that students are often confused about which variable is independent and which is dependent in a regression model. For example, in Triple A s problem, clarify which variable is X and which is Y. Emphasize that the dependent variable (Y ) is what we are trying to predict based on the value of the independent (X) variable. Use examples such as the time required to drive to a store and the distance traveled, the totals number of units sold and the selling price of a product, and the cost of a computer and the processor speed. Teaching Suggestion 4.2: Statistical Correlation Does Not Always Mean Causality. Students should understand that a high R 2 doesn t always mean one variable will be a good predictor of the other. Explain that skirt lengths and stock market prices may be correlated, but raising one doesn t necessarily mean the other will go up or down. An interesting study indicated that, over a 10-year period, the salaries of college professors were highly correlated to the dollar sales volume of alcoholic beverages (both were actually correlated with inflation). Teaching Suggestion 4.3: Give students a set of data and have them plot the data and manually draw a line through the data. A discussion of which line is best can help them appreciate the least squares criterion. Teaching Suggestion 4.4: Select some randomly generated values for X and Y (you can use random numbers from the random number table in Chapter 15 or use the RAND function in Excel). Develop a regression line using Excel and discuss the coefficient of determination and the F-test. Students will see that a regression line can always be developed, but it may not necessarily be useful. Teaching Suggestion 4.5: A discussion of the long formulas and short-cut formulas that are provided in the appendix is helpful. The long formulas provide students with a better understanding of the meaning of the SSE and SST. Since many people use computers for regression problems, it helps to see the original formulas. The short-cut formulas are helpful if students are performing the computations on a calculator. ALTERNATIVE EXAMPLES Alternative Example 4.1: The sales manager of a large apartment rental complex feels the demand for apartments may be related to the number of newspaper ads placed during the previous month. She has collected the data shown in the accompanying table. Ads purchased, (X) Apartments leased, (Y) We can find a mathematical equation by using the least squares regression approach. Leases, Y Ads, X (X X) 2 (X X)(Y Ȳ) Y 80 X 184 (X X) (X X)(Y Ȳ) Y = = 10; X = = b 1 306/ b (23) The estimated regression equation is Ŷ X or Apartments leased ads placed If the number of ads is 30, we can estimate the number of apartments leased with the regression equation (30) or 13 apartments Alternative Example 4.2: Given the data on ads and apartment rentals in Alternative Example 4.1, find the coefficient of determination. The following have been computed in the table that follows: SST 150; SSE 29.02; SSR (Note: Round-off error may cause this to be slightly different than a computer solution.) 46

2 M04_REND6289_10_IM_C04.QXD 5/7/08 2:49 PM Page 47 CHAPTER 4 R EGRESSION M ODELS 47 Y X (Y Ȳ)2 Ŷ X (Y Ŷ) 2 (Ŷ Ȳ) SST SSE SSR From this the coefficient of determination is r 2 SSR/SST / Alternative Example 4.3: For Alternative Examples 4.1 and 4.2, dealing with ads, X, and apartments leased, Y, compute the correlation coefficient. Since r and the slope is positive ( 0.395), the positive square root of 0.81 is the correlation coefficient. r SOLUTIONS TO DISCUSSION QUESTIONS AND PROBLEMS 4-1. The term least-squares means that the regression line will minimize the sum of the squared errors (SSE). No other line will give a lower SSE Dummy variables are used when a qualitative factor such as the gender of an individual (male or female) is to be included in the model. Usually this is given a value of 1 when the condition is met (e.g. person is male) and 0 otherwise. When there are more than two levels or values for the qualitative factor, more than one dummy variable must be used. The number of dummy variables is one less than the number of possible values or categories. For example, if students are classified as freshmen, sophomores, juniors and seniors, three dummy variables would be necessary The coefficient of determination (r 2 ) is the square of the coefficient of correlation (r). Both of these give an indication of how well a regression model fits a particular set of data. An r 2 value of 1 would indicate a perfect fit of the regression model to the points. This would also mean that r would equal 1 or A scatter diagram is a plot of the data. This graphical image helps to determine if a linear relationship is present, or if another type of relationship would be more appropriate The adjusted r 2 value is used to help determine if a new variable should be added to a regression model. Generally, if the adjusted r 2 value increases when a new variable is added to a model, this new variable should be included in the model. If the adjusted r 2 value declines or does not increase when a new variable is added, then the variable should not be added to the model The F-test is used to determine if the overall regression model is helpful in predicting the value of the independent variable (Y). If the F-value is large and the p-value or significance level is low, then we can conclude that there is a linear relationship and the model is useful, as these results would probably not occur by chance. If the significance level is high, then the model is not useful and the results in the sample could be due to random variations The SSE is the sum of the squared errors in a regression model. SST SSE SSR When the residuals (errors) are plotted after a regression line is found, the errors should be random and should not show any significant pattern. If a pattern does exist, then the assumptions may not be met or another model (perhaps nonlinear) would be more appropriate a. Ŷ (70) 337 b. Ŷ (80) 380 c. Ŷ (90) a. Demand TV Appearances

3 M04_REND6289_10_IM_C04.QXD 5/7/08 2:49 PM Page CHAPTER 4 R EGRESSION M ODELS b. Demand TV Appearances Y X (X X) 2 (Y Ȳ)2 (X X)(Y Ȳ) Ŷ (Y Ŷ)2 (Ŷ Ȳ) Y 39.0 X Ȳ 6.5 X 5.5 SST SSE SSR SST 29.5; SSE 12; SSR 17.5 b / b (5.5) 1 The regression equation is Ŷ 1 1X. c. Ŷ 1 1X 1 1(6) See the table for the solution to problem 4-10 to obtain some of these numbers. MSE = SSE/(n k 1) = 12/(6 1 1) = 3 MSR = SSR/k = 17.7/1 = 17.5 F = MSR/MSE = 17.5/3 = 5.83 df 1 = k = 1 df 2 = n k 1 = = 4 F 0.05, 1, 4 = 7.71 Do not reject H 0 since Therefore, we cannot conclude there is a statistically significant relationship at the 0.05 level Using Excel, the regression equation is Ŷ 1 1X. F 5.83, the significance level is This is significant at the 0.10 level ( ), but it is not significant at the 0.05 level. There is marginal evidence that there is a relationship between demand for drums and TV appearances Fin. Test 1 Ave,(Y) (X) (X X) 2 (Y Ȳ)2 (X X)(Y Ȳ) Y (Y Ŷ)2 (Ŷ Ȳ) b 1 = 1143/ = b 0 = (711/9) (730/9) = 18.99

4 M04_REND6289_10_IM_C04.QXD 5/7/08 2:49 PM Page 49 CHAPTER 4 R EGRESSION M ODELS 49 a. Ŷ X b. Ŷ (83) c. r 2 = SSR/SST = /998 = 0.85; r 0.92; this means that 85% of the variability in the final average can be explained by the variability in the first test score See the table for the solution to problem 4-13 to obtain some of these numbers. MSE = SSE/(n k 1) = /(9 1 1) = MSR = SSR/k = /1 = F = MSR/MSE = /21.76 = 38.9 df 1 = k = 1 df 2 = n k 1 = = 7 F 0.05, 1, 7 = 5.59 Because , we can conclude (at the 0.05 level) that there is a statistically significant relationship between the first test grade and the final average F 38.86; the significance level (which is extremely small) so there is definitely a statistically significant relationship a. Ŷ 13, (1,860) $83,502. b. The predicted average selling price for a house this size would be $83,502. Some will sell for more and some will sell for less. There are other factors besides size that influence the price of the house. c. Some other variables that might be included are age of the house, number of bedrooms, and size of the lot. There are other factors in addition to these that one can identify. d. The coefficient of determination (r 2 ) (0.63) The multiple regression equation is Ŷ $90.00 $48.50X 1 $0.40X 2 a. Number of days on the road: X 1 5; Distance traveled: X miles The amount he may be expected to claim is Ŷ (5) $0.40(300) $ b. The reimbursement request, according to the model, appears to be too high. However, this does not mean that it is not justified. The accountants should question Thomas Williams about his expenses to see if there are other explanations for the high cost. c. A number of other variables should be included, such as the type of travel (air or car), conference fees if any, and expenses for entertainment of customers, and other transportation (cab and limousine) expenses. In addition, the coefficient of correlation is only 0.68 and r 2 (0.68) Thus, about 46% of the variability in the cost of the trip is explained by this model; the other 54% is due to other factors Using computer software to get the regression equation, we get Ŷ X where Ŷ predicted GPA and X SAT score. If a student scores 450 on the SAT, we get Ŷ (450) If a student scores 800 on the SAT, we get Ŷ (800) a. A linear model is reasonable from the graph below. Ridership (100,000s) Tourists (Millions) b. Ŷ X c. Ŷ (10) 20.99, or 2,099,000 people. d. If there are no tourists, the predicted ridership would be 5.06 (100,000s) or 506,000. Because X 0 is outside the range of values that were used to construct the regression model, this number may be questionable The F-value for the F-test is 52.6 and the significance level is extremely small ( ) which indicates that there is a statistically significant relationship between number of tourists and ridership. The coefficient of determination is 0.84 indicating that 84% of the variability in ridership from one year to the next could be explained by the variations in the number of tourists a. Ŷ 24, X X 2 where Ŷ predicted starting salary; X 1 GPA; X 2 1 if business major, 0 otherwise. b. Ŷ 24, (3.0) 6684(1) $40, c. The starting salary for business majors tends to be about $6,684 higher than non-business majors in this sample, even after adjusting for variations in GPA. d. The overall significance level is and r Thus, the model is significant at the 0.10 level and 69% of the variability in starting salary is explained by GPA and major. The model is useful in predicting starting salary a. Let Ŷ predicted selling price X 1 square footage X 2 number of bedrooms X 3 age The model with square footage: Ŷ X 1 ; r The model with number of bedrooms: Ŷ X 2 ; r The model with age: Ŷ X 3 ; r

5 M04_REND6289_10_IM_C04.QXD 5/7/08 2:49 PM Page CHAPTER 4 R EGRESSION M ODELS All of these models are significant at the 0.01 level or less. The best model uses age as the independent variable. The coefficient of determination is highest for this, and it is significant Ŷ X X 2 and r Ŷ (2000) (3) 95, Notice the r 2 value is the same as it was in the previous problem with just square footage as the independent variable. Adding the number of bedrooms did not add any significant information that was not already captured by the square footage. It should not be included in the model. The r 2 for this is lower than for age alone in the previous problem Ŷ X X X 3 and r Ŷ (2000) (3) (10) $110, Ŷ X where Y DJIA and X S&P. r 0.84 and r Ŷ (1100) (rounded) With one independent variable, beds, in the model, r With just admissions in the model, r When both variables are in the model, r Thus, the model with only admissions as the independent variable is the best. Adding the number of beds had virtually no impact on r 2, and the adjusted r 2 decreased slightly. Thus, the best model is Ŷ X where Y expense and X admissions Using Excel with Y MPG; X 1 horsepower; X 2 weight the models are: Ŷ X 1 ; r Ŷ X 2 ; r Thus, the model with horsepower as the independent variable is better since r 2 is higher Ŷ 57, X X 2 where Y MPG X 1 horsepower X 2 weight r This model is better because the coefficient of determination is much higher with both variables than it is with either one individually Let Y MPG; X 1 horsepower; X 2 weight The model Ŷ b 0 b 1 X 1 b 2 X 1 2 is Ŷ X X 1 2 and has r The model Ŷ b 0 b 3 X 2 b 4 X 2 2 is Ŷ X X 2 2 and has r The model Ŷ b 0 b 1 X 1 b 2 X 1 2 b 3 X 2 b 4 X 2 2 is Ŷ X X X X 2 2 and has r This model has a higher r 2 value than the model in A graph of the data would show a nonlinear relationship If SAT median score alone is used to predict the cost, we get Ŷ X 1 with r If both SAT and a dummy variable (X 2 1 for private, 0 otherwise) are used to predict the cost, we get r The model is Ŷ X X 2. This says that a private school tends to be about $9,355 more expensive than a public school when the median SAT score is used to adjust for the quality of the school. The coefficient of determination indicates that about 79% of the variability in cost can be explained by these factors. The model is significant at the level Yˆ = X There is a significant relationship between the number of victories (Y) and the payroll (X) at the level, which is marginally significant. However, r 2 = 0.24, so the relationship is not very strong. Only about 24% of the variability in victories is explained by this model a. Yˆ = X b. Yˆ = X c. The correlation coefficient for the first stock is only 0.19 while the correlation coefficient for the second is Thus, there is a much stronger correlation between stock 2 and the DJI than there is for stock 1 and the DJI. CASE STUDIES SOLUTION TO NORTH SOUTH AIRLINE CASE Northern Airline Data Airframe Cost Engine Cost Average Age Year per Aircraft per Aircraft (Hours) , , , , , , ,390 Southeast Airline Data Airframe Cost Engine Cost Average Age Year per Aircraft per Aircraft (Hours) , , , , , , ,259 Utilizing QM for Windows, we can develop the following regression equations for the variables of interest. Northern Airline airframe maintenance cost: Cost (airframe age) Coefficient of determination Coefficient of correlation

6 M04_REND6289_10_IM_C04.QXD 5/7/08 2:49 PM Page 51 CHAPTER 4 R EGRESSION M ODELS 51 Northern Airline engine maintenance cost: Cost (airframe age) Coefficient of determination Coefficient of correlation Southeast Airline airframe maintenance cost: Cost (airframe age) Coefficient of determination Coefficient of correlation Southeast Airline engine maintenance cost: Cost (airframe age) Coefficient of determination Coefficient of correlation The graphs below portray both the actual data and the regression lines for airframe and engine maintenance costs for both airlines. Note that the two graphs have been drawn to the same scale to facilitate comparisons between the two airlines. Northern Airline: There seem to be modest correlations between maintenance costs and airframe age for Northern Airline. There is certainly reason to conclude, however, that airframe age is not the only important factor. Southeast Airline: The relationships between maintenance costs and airframe age for Southeast Airline are much less well defined. It is even more obvious that airframe age is not the only important factor perhaps not even the most important factor. Overall, it would seem that: 1. Northern Airline has the smallest variance in maintenance costs, indicating that the day-to-day management of maintenance is working pretty well. 2. Maintenance costs seem to be more a function of airline than of airframe age. 3. The airframe and engine maintenance costs for Southeast Airline are not only lower but more nearly similar than those for Northern Airline, but, from the graphs at least, appear to be rising more sharply with age. 4. From an overall perspective, it appears that Southeast Airline may perform more efficiently on sporadic or emergency repairs, and Northern Airline may place more emphasis on preventive maintenance. Ms. Young s report should conclude that: 1. There is evidence to suggest that maintenance costs could be made to be a function of airframe age by implementing more effective management practices. 2. The difference between maintenance procedures of the two airlines should be investigated. 3. The data with which she is presently working do not provide conclusive results Northern Airline Southeast Airline Cost ($) Cost ($) Airframe Engine Airframe Engine Average Airframe Age (Thousands) Average Airframe Age (Thousands)

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Regression Models. Chapter 4. Introduction. Introduction. Introduction Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager

More information

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

Regression Models. Chapter 4

Regression Models. Chapter 4 Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Introduction Regression analysis

More information

Chapter 4: Regression Models

Chapter 4: Regression Models Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,

More information

LI EAR REGRESSIO A D CORRELATIO

LI EAR REGRESSIO A D CORRELATIO CHAPTER 6 LI EAR REGRESSIO A D CORRELATIO Page Contents 6.1 Introduction 10 6. Curve Fitting 10 6.3 Fitting a Simple Linear Regression Line 103 6.4 Linear Correlation Analysis 107 6.5 Spearman s Rank Correlation

More information

Chapter 16. Simple Linear Regression and dcorrelation

Chapter 16. Simple Linear Regression and dcorrelation Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Chapter 7 Student Lecture Notes 7-1

Chapter 7 Student Lecture Notes 7-1 Chapter 7 Student Lecture Notes 7- Chapter Goals QM353: Business Statistics Chapter 7 Multiple Regression Analysis and Model Building After completing this chapter, you should be able to: Explain model

More information

Bayesian Analysis LEARNING OBJECTIVES. Calculating Revised Probabilities. Calculating Revised Probabilities. Calculating Revised Probabilities

Bayesian Analysis LEARNING OBJECTIVES. Calculating Revised Probabilities. Calculating Revised Probabilities. Calculating Revised Probabilities Valua%on and pricing (November 5, 2013) LEARNING OBJECTIVES Lecture 7 Decision making (part 3) Regression theory Olivier J. de Jong, LL.M., MM., MBA, CFD, CFFA, AA www.olivierdejong.com 1. List the steps

More information

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006 Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal yuppal@ysu.edu Sampling Distribution of b 1 Expected value of b 1 : Variance of b 1 : E(b 1 ) = 1 Var(b 1 ) = σ 2 /SS x Estimate of

More information

Chapter 3 Multiple Regression Complete Example

Chapter 3 Multiple Regression Complete Example Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

Simple Linear Regression

Simple Linear Regression 9-1 l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical Method for Determining Regression 9.4 Least Square Method 9.5 Correlation Coefficient and Coefficient

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

Chapter 14 Student Lecture Notes 14-1

Chapter 14 Student Lecture Notes 14-1 Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this

More information

Econ 3790: Statistics Business and Economics. Instructor: Yogesh Uppal

Econ 3790: Statistics Business and Economics. Instructor: Yogesh Uppal Econ 3790: Statistics Business and Economics Instructor: Yogesh Uppal Email: yuppal@ysu.edu Chapter 14 Covariance and Simple Correlation Coefficient Simple Linear Regression Covariance Covariance between

More information

Chapter 15 Multiple Regression

Chapter 15 Multiple Regression Multiple Regression Learning Objectives 1. Understand how multiple regression analysis can be used to develop relationships involving one dependent variable and several independent variables. 2. Be able

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 4 4- Basic Business Statistics th Edition Chapter 4 Introduction to Multiple Regression Basic Business Statistics, e 9 Prentice-Hall, Inc. Chap 4- Learning Objectives In this chapter, you learn:

More information

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression Chapter 12 12-1 North Seattle Community College BUS21 Business Statistics Chapter 12 Learning Objectives In this chapter, you learn:! How to use regression analysis to predict the value of a dependent

More information

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

ECON 497 Midterm Spring

ECON 497 Midterm Spring ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain

More information

Regression Analysis. BUS 735: Business Decision Making and Research

Regression Analysis. BUS 735: Business Decision Making and Research Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn

More information

Exercises on Chapter 2: Linear Regression with one independent variable:

Exercises on Chapter 2: Linear Regression with one independent variable: Exercises on Chapter 2: Linear Regression with one independent variable: Summary: Simple Linear Regression Model: (distribution of error terms unspecified) (2.1) where, value of the response variable in

More information

Chapter 14 Multiple Regression Analysis

Chapter 14 Multiple Regression Analysis Chapter 14 Multiple Regression Analysis 1. a. Multiple regression equation b. the Y-intercept c. $374,748 found by Y ˆ = 64,1 +.394(796,) + 9.6(694) 11,6(6.) (LO 1) 2. a. Multiple regression equation b.

More information

The Multiple Regression Model

The Multiple Regression Model Multiple Regression The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & or more independent variables (X i ) Multiple Regression Model with k Independent Variables:

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore What is Multiple Linear Regression Several independent variables may influence the change in response variable we are trying to study. When several independent variables are included in the equation, the

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

Mrs. Poyner/Mr. Page Chapter 3 page 1

Mrs. Poyner/Mr. Page Chapter 3 page 1 Name: Date: Period: Chapter 2: Take Home TEST Bivariate Data Part 1: Multiple Choice. (2.5 points each) Hand write the letter corresponding to the best answer in space provided on page 6. 1. In a statistics

More information

Data Analysis 1 LINEAR REGRESSION. Chapter 03

Data Analysis 1 LINEAR REGRESSION. Chapter 03 Data Analysis 1 LINEAR REGRESSION Chapter 03 Data Analysis 2 Outline The Linear Regression Model Least Squares Fit Measures of Fit Inference in Regression Other Considerations in Regression Model Qualitative

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression Simple linear regression tries to fit a simple line between two variables Y and X. If X is linearly related to Y this explains some of the variability in Y. In most cases, there

More information

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics Chapter 13 Student Lecture Notes 13-1 Department of Quantitative Methods & Information Sstems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analsis QMIS 0 Dr. Mohammad

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of

More information

Section 3: Simple Linear Regression

Section 3: Simple Linear Regression Section 3: Simple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction

More information

Chapter 14 Simple Linear Regression (A)

Chapter 14 Simple Linear Regression (A) Chapter 14 Simple Linear Regression (A) 1. Characteristics Managerial decisions often are based on the relationship between two or more variables. can be used to develop an equation showing how the variables

More information

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Overview Introduction 10-1 Scatter Plots and Correlation 10- Regression 10-3 Coefficient of Determination and

More information

BNAD 276 Lecture 10 Simple Linear Regression Model

BNAD 276 Lecture 10 Simple Linear Regression Model 1 / 27 BNAD 276 Lecture 10 Simple Linear Regression Model Phuong Ho May 30, 2017 2 / 27 Outline 1 Introduction 2 3 / 27 Outline 1 Introduction 2 4 / 27 Simple Linear Regression Model Managerial decisions

More information

Overview. 4.1 Tables and Graphs for the Relationship Between Two Variables. 4.2 Introduction to Correlation. 4.3 Introduction to Regression 3.

Overview. 4.1 Tables and Graphs for the Relationship Between Two Variables. 4.2 Introduction to Correlation. 4.3 Introduction to Regression 3. 3.1-1 Overview 4.1 Tables and Graphs for the Relationship Between Two Variables 4.2 Introduction to Correlation 4.3 Introduction to Regression 3.1-2 4.1 Tables and Graphs for the Relationship Between Two

More information

Mathematics Level D: Lesson 2 Representations of a Line

Mathematics Level D: Lesson 2 Representations of a Line Mathematics Level D: Lesson 2 Representations of a Line Targeted Student Outcomes Students graph a line specified by a linear function. Students graph a line specified by an initial value and rate of change

More information

M112 Short Course In Calculus V. J. Motto Spring 2013 Applications of Derivatives Worksheet

M112 Short Course In Calculus V. J. Motto Spring 2013 Applications of Derivatives Worksheet M11 Short Course In Calculus V. J. Motto Spring 01 Applications of Derivatives Worksheet 1. A tomato is thrown from the top of a tomato cart its distance from the ground in feet is modeled by the equation

More information

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and

More information

Simple Linear Regression: One Quantitative IV

Simple Linear Regression: One Quantitative IV Simple Linear Regression: One Quantitative IV Linear regression is frequently used to explain variation observed in a dependent variable (DV) with theoretically linked independent variables (IV). For example,

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X. Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Chapter 3 Review Chapter 3: Examining Relationships 1. A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study

More information

Business Statistics (BK/IBA) Tutorial 4 Full solutions

Business Statistics (BK/IBA) Tutorial 4 Full solutions Business Statistics (BK/IBA) Tutorial 4 Full solutions Instruction In a tutorial session of 2 hours, we will obviously not be able to discuss all questions. Therefore, the following procedure applies:

More information

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Remember: Z.05 = 1.645, Z.01 = 2.33 We will only cover one-sided hypothesis testing (cases 12.3, 12.4.2, 12.5.2,

More information

Practice Questions for Exam 1

Practice Questions for Exam 1 Practice Questions for Exam 1 1. A used car lot evaluates their cars on a number of features as they arrive in the lot in order to determine their worth. Among the features looked at are miles per gallon

More information

CHAPTER 7. + ˆ δ. (1 nopc) + ˆ β1. =.157, so the new intercept is = The coefficient on nopc is.157.

CHAPTER 7. + ˆ δ. (1 nopc) + ˆ β1. =.157, so the new intercept is = The coefficient on nopc is.157. CHAPTER 7 SOLUTIONS TO PROBLEMS 7. (i) The coefficient on male is 87.75, so a man is estimated to sleep almost one and one-half hours more per week than a comparable woman. Further, t male = 87.75/34.33

More information

Section 2.5 from Precalculus was developed by OpenStax College, licensed by Rice University, and is available on the Connexions website.

Section 2.5 from Precalculus was developed by OpenStax College, licensed by Rice University, and is available on the Connexions website. Section 2.5 from Precalculus was developed by OpenStax College, licensed by Rice University, and is available on the Connexions website. It is used under a Creative Commons Attribution-NonCommercial- ShareAlike

More information

Chapter 6 Assessment. 3. Which points in the data set below are outliers? Multiple Choice. 1. The boxplot summarizes the test scores of a math class?

Chapter 6 Assessment. 3. Which points in the data set below are outliers? Multiple Choice. 1. The boxplot summarizes the test scores of a math class? Chapter Assessment Multiple Choice 1. The boxplot summarizes the test scores of a math class? Test Scores 3. Which points in the data set below are outliers? 73, 73, 7, 75, 75, 75, 77, 77, 77, 77, 7, 7,

More information

ANOVA - analysis of variance - used to compare the means of several populations.

ANOVA - analysis of variance - used to compare the means of several populations. 12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.

More information

q3_3 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question.

q3_3 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. q3_3 MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. Provide an appropriate response. 1) In 2007, the number of wins had a mean of 81.79 with a standard

More information

Quantitative Bivariate Data

Quantitative Bivariate Data Statistics 211 (L02) - Linear Regression Quantitative Bivariate Data Consider two quantitative variables, defined in the following way: X i - the observed value of Variable X from subject i, i = 1, 2,,

More information

This gives us an upper and lower bound that capture our population mean.

This gives us an upper and lower bound that capture our population mean. Confidence Intervals Critical Values Practice Problems 1 Estimation 1.1 Confidence Intervals Definition 1.1 Margin of error. The margin of error of a distribution is the amount of error we predict when

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters Objectives 10.1 Simple linear regression Statistical model for linear regression Estimating the regression parameters Confidence interval for regression parameters Significance test for the slope Confidence

More information

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM 1 REGRESSION AND CORRELATION As we learned in Chapter 9 ( Bivariate Tables ), the differential access to the Internet is real and persistent. Celeste Campos-Castillo s (015) research confirmed the impact

More information

1. The area of the surface of the Atlantic Ocean is approximately 31,830,000 square miles. How is this area written in scientific notation?

1. The area of the surface of the Atlantic Ocean is approximately 31,830,000 square miles. How is this area written in scientific notation? 1. The area of the surface of the tlantic Ocean is approximately 31,830,000 square miles. How is this area written in scientific notation? 3.183 x 10 4 B 3.183 x 10 5 C 3.183 x 10 6 D 3.183 x 10 7 2. In

More information

Regression Analysis. Regression: Methodology for studying the relationship among two or more variables

Regression Analysis. Regression: Methodology for studying the relationship among two or more variables Regression Analysis Regression: Methodology for studying the relationship among two or more variables Two major aims: Determine an appropriate model for the relationship between the variables Predict the

More information

Simple Linear Regression Using Ordinary Least Squares

Simple Linear Regression Using Ordinary Least Squares Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression

More information

ECON3150/4150 Spring 2016

ECON3150/4150 Spring 2016 ECON3150/4150 Spring 2016 Lecture 4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo Last updated: January 26, 2016 1 / 49 Overview These lecture slides covers: The linear regression

More information

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. x )

SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. x ) Midterm Review Name SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. Decide whether or not the arrow diagram defines a function. 1) Domain Range 1) Determine

More information

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

Examining Relationships. Chapter 3

Examining Relationships. Chapter 3 Examining Relationships Chapter 3 Scatterplots A scatterplot shows the relationship between two quantitative variables measured on the same individuals. The explanatory variable, if there is one, is graphed

More information

Year 10 Mathematics Semester 2 Bivariate Data Chapter 13

Year 10 Mathematics Semester 2 Bivariate Data Chapter 13 Year 10 Mathematics Semester 2 Bivariate Data Chapter 13 Why learn this? Observations of two or more variables are often recorded, for example, the heights and weights of individuals. Studying the data

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

CHAPTER 5 LINEAR REGRESSION AND CORRELATION

CHAPTER 5 LINEAR REGRESSION AND CORRELATION CHAPTER 5 LINEAR REGRESSION AND CORRELATION Expected Outcomes Able to use simple and multiple linear regression analysis, and correlation. Able to conduct hypothesis testing for simple and multiple linear

More information

Chapter 9. Correlation and Regression

Chapter 9. Correlation and Regression Chapter 9 Correlation and Regression Lesson 9-1/9-2, Part 1 Correlation Registered Florida Pleasure Crafts and Watercraft Related Manatee Deaths 100 80 60 40 20 0 1991 1993 1995 1997 1999 Year Boats in

More information

AP Statistics Unit 2 (Chapters 7-10) Warm-Ups: Part 1

AP Statistics Unit 2 (Chapters 7-10) Warm-Ups: Part 1 AP Statistics Unit 2 (Chapters 7-10) Warm-Ups: Part 1 2. A researcher is interested in determining if one could predict the score on a statistics exam from the amount of time spent studying for the exam.

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

STAT 212 Business Statistics II 1

STAT 212 Business Statistics II 1 STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb

More information

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables. Regression Analysis BUS 735: Business Decision Making and Research 1 Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn how to estimate

More information

STAT 350 Final (new Material) Review Problems Key Spring 2016

STAT 350 Final (new Material) Review Problems Key Spring 2016 1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

Linear Regression Communication, skills, and understanding Calculator Use

Linear Regression Communication, skills, and understanding Calculator Use Linear Regression Communication, skills, and understanding Title, scale and label the horizontal and vertical axes Comment on the direction, shape (form), and strength of the relationship and unusual features

More information

ECON 450 Development Economics

ECON 450 Development Economics ECON 450 Development Economics Statistics Background University of Illinois at Urbana-Champaign Summer 2017 Outline 1 Introduction 2 3 4 5 Introduction Regression analysis is one of the most important

More information

Lectures on Simple Linear Regression Stat 431, Summer 2012

Lectures on Simple Linear Regression Stat 431, Summer 2012 Lectures on Simple Linear Regression Stat 43, Summer 0 Hyunseung Kang July 6-8, 0 Last Updated: July 8, 0 :59PM Introduction Previously, we have been investigating various properties of the population

More information

AP Statistics Bivariate Data Analysis Test Review. Multiple-Choice

AP Statistics Bivariate Data Analysis Test Review. Multiple-Choice Name Period AP Statistics Bivariate Data Analysis Test Review Multiple-Choice 1. The correlation coefficient measures: (a) Whether there is a relationship between two variables (b) The strength of the

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 24, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

Homework 1 Solutions

Homework 1 Solutions Homework 1 Solutions January 18, 2012 Contents 1 Normal Probability Calculations 2 2 Stereo System (SLR) 2 3 Match Histograms 3 4 Match Scatter Plots 4 5 Housing (SLR) 4 6 Shock Absorber (SLR) 5 7 Participation

More information

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6 STA 8 Applied Linear Models: Regression Analysis Spring 011 Solution for Homework #6 6. a) = 11 1 31 41 51 1 3 4 5 11 1 31 41 51 β = β1 β β 3 b) = 1 1 1 1 1 11 1 31 41 51 1 3 4 5 β = β 0 β1 β 6.15 a) Stem-and-leaf

More information

Chapter 13. Multiple Regression and Model Building

Chapter 13. Multiple Regression and Model Building Chapter 13 Multiple Regression and Model Building Multiple Regression Models The General Multiple Regression Model y x x x 0 1 1 2 2... k k y is the dependent variable x, x,..., x 1 2 k the model are the

More information

Name Algebra 1 Midterm Review Period. = 10 4x e) x ) Solve for y: a) 6x 3y = 12 b) 4y 8x = 16

Name Algebra 1 Midterm Review Period. = 10 4x e) x ) Solve for y: a) 6x 3y = 12 b) 4y 8x = 16 Name Algebra 1 Date Midterm Review Period 1) Solve each equation: a) x 2x + 2 = 3 b) 5 5 + 9 = 13 c) 64 = 9x +1 d) x 7 2 = 10 4x e) x + 2 3 = 3x 2) Solve for y: a) 6x 3y = 12 b) 4y 8x = 16 3) Solve and

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X 1.04) =.8508. For z < 0 subtract the value from

More information

Chapter 12 - Part I: Correlation Analysis

Chapter 12 - Part I: Correlation Analysis ST coursework due Friday, April - Chapter - Part I: Correlation Analysis Textbook Assignment Page - # Page - #, Page - # Lab Assignment # (available on ST webpage) GOALS When you have completed this lecture,

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Inference about the Regression Model and Using the Regression Line Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about

More information

STA121: Applied Regression Analysis

STA121: Applied Regression Analysis STA121: Applied Regression Analysis Linear Regression Analysis - Chapters 3 and 4 in Dielman Artin Department of Statistical Science September 15, 2009 Outline 1 Simple Linear Regression Analysis 2 Using

More information

Econometrics Homework 1

Econometrics Homework 1 Econometrics Homework Due Date: March, 24. by This problem set includes questions for Lecture -4 covered before midterm exam. Question Let z be a random column vector of size 3 : z = @ (a) Write out z

More information

Lecture 4 Scatterplots, Association, and Correlation

Lecture 4 Scatterplots, Association, and Correlation Lecture 4 Scatterplots, Association, and Correlation Previously, we looked at Single variables on their own One or more categorical variable In this lecture: We shall look at two quantitative variables.

More information

Lecture 4 Scatterplots, Association, and Correlation

Lecture 4 Scatterplots, Association, and Correlation Lecture 4 Scatterplots, Association, and Correlation Previously, we looked at Single variables on their own One or more categorical variables In this lecture: We shall look at two quantitative variables.

More information

Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall)

Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall) Announcements: You can turn in homework until 6pm, slot on wall across from 2202 Bren. Make sure you use the correct slot! (Stats 8, closest to wall) We will cover Chs. 5 and 6 first, then 3 and 4. Mon,

More information

Statistics and Quantitative Analysis U4320

Statistics and Quantitative Analysis U4320 Statistics and Quantitative Analysis U3 Lecture 13: Explaining Variation Prof. Sharyn O Halloran Explaining Variation: Adjusted R (cont) Definition of Adjusted R So we'd like a measure like R, but one

More information

Correlation and Regression

Correlation and Regression Correlation and Regression Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 Learning Objectives Upon successful completion of this module, the student should

More information