Regression Models. Chapter 4. Introduction. Introduction. Introduction
|
|
- Maud Shelton
- 6 years ago
- Views:
Transcription
1 Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager There are generally two purposes for regression analysis. To understand the relationship between variables E.g. the relationship between the sales volume and the advertising spending amount, the relationship between the price of a house and the square footage, etc.. To predict the value of one variable based on the value of another variable 4 Introduction Introduction Three types of regression models will be studied Simple linear regression models have only two variables We will first develop this model Multiple regression models have more than two variables Nonlinear regression models are used when the relationships between the variables are not linear 4 3 The variable to be predicted is called the dependent variable Sometimes called the response variable The value of this variable depends on the value of the independent variable Sometimes called the explanatory or predictor variable Dependent variable Independent variable Independent variable = Prediction Relationship 4 4
2 Scatter Diagram One way to investigate the relationship between variables is by plotting the data on a graph Such a graph is often called a scatter diagram or a scatter plot The independent variable is normally plotted on the axis The dependent variable is normally plotted on the axis 4 5 Triple A Construction Example Triple A Construction renovates old homes They have found that the dollar volume of renovation work each year is dependent on the area payroll Triple A s revenues and the total wage earnings for the past six years are listed below dependent variable Table 4. TRIPLE A S SALES LOCAL PAROLL ($00,000 s) ($00,000,000 s) independent variable 4 6 Triple A Construction Example Triple A Construction Example Sales ($00 0,000) Payroll ($00 million) The graph indicates higher payroll seem to result in higher sales A line has been drawn to show the relationship between the payroll and the sales There is not a perfect relationship because not all points lie in a straight line Errors are involved if this line is used to predict sales based on payroll Many lines could be drawn through these points, but which one best represents the true relationship? Figure 4.: Scatter Diagram for Triple A Construction Company Data in Table
3 Simple Linear Regression Simple Linear Regression Regression models are used to find the relationship between variables i.e. to predict the value of one variable based on the other However there is some random error that cannot be predicted Regression models can also be used to test if a relationship exists between variables The underlying simple linear regression model is: 0 = β + β + ε where = dependent variable (response) = independent variable (predictor or explanatory) β 0 = intercept (value of when = 0) β = slope of the regression line ε = random error Simple Linear Regression Triple A Construction The random error cannot be predicted. So an approximation of the model is used where ^ ˆ = b 0 + b = predicted value of = independent variable (predictor or explanatory) b 0 = estimate of β 0 b = estimate of β Triple A Construction is trying to predict sales based on area payroll = Sales = Area payroll The line chosen in Figure 4. is the one that best fits the sample data by minimizing the sum of all errors Error = (Actual value) (Predicted value) e = ˆ 4 4
4 Triple A Construction The errors may be positive or negative large positive and negative errors may cancel each other result in very small average error thus errors are squared Error = [(Actual value) (Predicted value)] e = ( ) ˆ The best regression line is defined as the one that minimize the sum of squared errors, i.e. the total distance between the actual data points and the line Triple A Construction For the simple linear regression model, the values of the intercept and slope can be calculated from n sample data using the formulas below = = n = = n b ˆ = b + 0 b average (mean) of average (mean) of ( )( ) = b0 = b ( ) values values Triple A Construction Regression calculations ( ) ( )( ) 6 3 (3 4) = (3 4)(6 7) = 8 4 (4 4) = 0 (4 4)(8 7) = 0 Σ = 4 = 4/6 = (6 4) = 4 (6 4)(9 7) = (4 4) = 0 (4 4)(5 7) = ( 4) = 4 ( 4)(4.5 7) = (5 4) = (5 4)(9.5 7) =.5 Σ = 4 = 4/6 = 4 Σ( ) = 0 Σ( )( ) =.5 Triple A Construction Regression calculations = 6 4 = = = = = b ( )( ). 5 = = 5 ( ) 0 =. b0 = b = 7 (. 5)( 4) = Table 4. Therefore ˆ =
5 Triple A Construction Measuring the Fit of the Regression Model Regression calculations = 6 4 = = 4 sales = +.5(payroll) 6 4 If the payroll next = = = 7 year is $600 million 6 6 ( )( ˆ = ) ( 6) = 9. 5 or $ 950, 000 b = = =. 5 ( ) 0 b0 = b = 7 (. 5)( 4) = Therefore ˆ = Regression models can be developed for any variables and How do we know the model is good enough (with small errors) in predicting based on? The following measures are useful in describing the accuracy of the model Three measures of variability SST Total variability about the mean SSE Variability about the regression line SSR Total variability that is explained by the model 4 8 Measuring the Fit of the Regression Model Measuring the Fit of the Regression Model Sum of the squares total SST = ( ) ( ) ( ) ( ) 6 3 (6 7) = +.5(3) = ^ ^ ^ Sum of the squared error 8 4 (8 7) = +.5(4) = SSE = e = ( ˆ ) Sum of squares due to regression SSR = ˆ ( ) 9 6 (9 7) = 4 +.5(6) = (5 7) = 4 +.5(4) = (4.5 7) = () = (9.5 7) = (5) = An important relationship SST = SSR + SSE ( ) ^ =.5 ( ) ^ = ( ) = 5.65 = 7 SST =.5 SSE = SSR = 5.65 Table
6 Measuring the Fit of the Regression Model SST =.5 is the variability of the prediction using mean value of SSE = is the variability of the prediction using regression line Prediction using regression line has reduced the variability by = 5.65 SSR = 5.65 indicates how much of the total variability in is explained by the regression model Note: SST = SSR + SSE SSR explained variability SSE unexplained variability Sales ($00 0,000) Measuring the Fit of the Regression Model (SSR) (SSE) ^ ^ ^ = +.5 (SST) Payroll ($00 million) Figure Coefficient of Determination Correlation Coefficient The proportion of the variability in explained by regression equation is called the coefficient of determination The coefficient of determination is r r SSR SSE = = SST SST For Triple A Construction r = = About 69% of the variability in is explained by the equation based on payroll () If SSE 0, then r 00% 4 3 The correlation coefficient is an expression of the strength of the linear relationship between the variables r = ± It will always be between + and Negative slope r < 0; positive slope r > 0 The correlation coefficient is r For Triple A Construction r = r =
7 Correlation Coefficient Using Computer Software for Regression Figure 4.3 (a) Perfect Positive Correlation: r = + (c) No Correlation: r = 0 (b) Positive Correlation: 0 < r < (d) Perfect Negative Correlation: r = 4 5 Program 4.A 4 6 Using Computer Software for Regression Using Computer Software for Regression Program 4.B Program 4.C
8 Using Computer Software for Regression Correlation coefficient (r) is Multiple R in Excel Assumptions of the Regression Model If we make certain assumptions about the errors in a regression model, we can perform statistical tests to determine if the model is useful. Errors are independent. Errors are normally distributed 3. Errors have a mean of zero 4. Errors have a constant variance A plot of the residuals (errors) will often highlight any glaring violations of the assumption Program 4.D = +.5 ˆ Residual Plots Residual Plots A random plot of residuals Healthy pattern no violations Nonconstant error variance violation Errors increase as increases, violating the constant variance assumption Error Error = 0 Error Error = 0 Figure 4.4A Figure 4.4B
9 Residual Plots Nonlinear relationship violation Errors consistently increasing and then consistently decreasing indicate that the model is not linear (perhaps quadratic) Estimating the Variance Errors are assumed to have a constant variance (σ ), but we usually don t know this It can be estimated using the mean squared error (MSE), s Error Error = 0 s SSE = MSE = n k where n = number of observations in the sample k = number of independent variables Figure 4.4C Estimating the Variance Testing the Model for Significance For Triple A Construction s SSE = MSE = = = =. 788 n k 6 4 We can estimate the standard deviation, s This is also called the standard error of the estimate or the standard deviation of the regression s = MSE =. 788 =. 3 A small s or s means the actual data deviate within a small range from the predicted result Both r and the MSE (s ) provide a measure of accuracy in a regression model However when the sample size is too small, you can get good values for MSE and r even if there is no relationship between the variables Testing the model for significance helps determine if r and MSE are meaningful and if a linear relationship exists between the variables We do this by performing a statistical hypothesis test
10 Testing the Model for Significance We start with the general linear model 0 = β + β + ε If β = 0, the null hypothesis is that there is no relationship between and The alternate hypothesis is that there is a linear relationship (β 0) If the null hypothesis can be rejected, we have proven there is a linear relationship We use the F statistic for this test The F Distribution A continuous probability distribution (Fig..5) The area underneath the curve represents probability of the F statistic value falling within a particular interval. The F statistic is the ratio of two sample variances F distributions have two sets of degrees of freedom Degrees of freedom are based on sample size and used to calculate the numerator and denominator df = degrees of freedom for the numerator df = degrees of freedom for the denominator The F Distribution The F Distribution Consider the example: df = 5 df = 6 α = 0.05 (probability) From Appendix D, we get This means F α, df, df = F 0.05, 5, 6 = 4.39 P(F > 4.39) = 0.05 There is only a 5% probability that F will exceed 4.39 (see Fig..6) Figure.5 F α
11 The F Distribution Testing the Model for Significance Figure.6 F value for 0.05 probability with 5 and 6 degrees of freedom F = The F statistic for testing the model is based on the MSE (s ) and mean squared regression (MSR) SSR MSR = k where k = number of independent variables in the model The F statistic is F = MSR MSE This describes an F distribution with degrees of freedom for the numerator = df = k degrees of freedom for the denominator = df = n k Testing the Model for Significance If there is very little error, the MSE would be small and the F-statistic would be large indicating the model is useful If the F-statistic is large, the significance level (p-value) will be low, indicating it is unlikely this would have occurred by chance So when the F-value is large, we can reject the null hypothesis and accept that there is a linear relationship between and and the values of the MSE and r are meaningful Steps in a Hypothesis Test. Specify null and alternative hypotheses H0 : β = 0 H : β 0. Select the level of significance (α). Common values are 0.0 and Calculate the value of the test statistic using the formula MSR F = MSE
12 Steps in a Hypothesis Test Triple A Construction 4. Make a decision using one of the following methods a) Reject the null hypothesis if the test statistic is greater than the F-value from the table in Appendix D. Otherwise, do not reject the null hypothesis: Reject if df = k df = n k F calculated > Fα, df, df b) Reject the null hypothesis if the observed significance level, or p-value, is less than the level of significance (α). Otherwise, do not reject the null hypothesis: p - value = P( F > calculated test statistic) Reject if p - value < α Step. H 0 : β = 0 (no linear relationship between and ) H : β 0 (linear relationship exists between and ) Step. Select α = 0.05 Step 3. Calculate the value of the test statistic MSR F SSR = k MSR = MSE = = = = Triple A Construction Triple A Construction Step 4. Reject the null hypothesis if the test statistic is greater than the F-value in Appendix D df = k = df = n k = 6 = 4 The value of F associated with a 5% level of significance and with degrees of freedom and 4 is found in Appendix D We can conclude there is a statistically significant relationship between and The r value of 0.69 means about 69% of the variability in sales () is explained by local payroll () F 0.05,,4 = 7.7 F calculated = 9.09 Reject H 0 because 9.09 > 7.7 Figure F =
13 Triple A Construction The F-test determines whether or not there is a relationship between the variables r (coefficient of determination) is the best measure of the strength of the prediction relationship between the and variables Values closer to indicate a strong prediction relationship Good regression models have a low significance level for the F-test and high r value. Analysis of Variance (ANOVA) Table When software is used to develop a regression model, an ANOVA table is typically created that shows the observed significance level (p-value) for the calculated F value This can be compared to the level of significance (α) to make a decision DF SS MS F SIGNIFICANCE Regression k SSR MSR = SSR/k MSR/MSE P(F > MSR/MSE) Residual n - k - SSE MSE = SSE/(n - k - ) Total n - SST Table ANOVA for Triple A Construction Multiple Regression Analysis Multiple regression models are extensions to the simple linear model and allow the creation of models with several independent variables Program 4.D (partial) P(F > ) = Because this probability is less than 0.05, we reject the null hypothesis of no linear relationship and conclude there is a linear relationship between and = β 0 + β + β + + β k k + ε where = dependent variable (response variable) i = ith independent variable (predictor or explanatory variable) β 0 = intercept (value of when all i = 0) β I = coefficient of the ith independent variable k = number of independent variables ε = random error
14 Multiple Regression Analysis To estimate these values, samples are taken and the following equation is developed ˆ = b + b + b + + b 0... k k where Ŷ = predicted value of b 0 = sample intercept (and is an estimate of β 0 ) b i = sample coefficient of the ith variable (and is an estimate of β i ) 4 53 Jenny Wilson Realty Jenny Wilson wants to develop a model to determine the suggested listing price for houses based on the size and age of the house where ˆ = b + b + b 0 ˆ = predicted value of dependent variable (selling price) b 0 = intercept and = value of the two independent variables (square footage and age) respectively b and b = slopes for and respectively She selects a few samples of the houses sold recently and records the data shown in Table 4.5 She also saves information on house condition to be used later 4 54 Jenny Wilson Realty Jenny Wilson Realty Table 4.5 SELLING PRICE ($) SQUARE FOOTAGE AGE 95,000,96 30 Good CONDITION 9,000, Excellent 4,800,70 30 Excellent 35,000,396 5 Good 4,000,706 3 Mint 45,000, Mint 59,000,950 7 Mint 65,000,33 30 Excellent 8,000,85 6 Mint 83,000 3,75 35 Good 00,000,300 8 Good,000,55 7 Good 5,000 3, Excellent 9,000,740 Mint Program 4. ˆ =
15 Evaluating Multiple Regression Models Evaluation is similar to simple linear regression models The p-value for the F-test and r are interpreted the same The hypothesis is different because there is more than one independent variable The F-test is investigating whether all the coefficients are equal to 0 If the F-test is significant, it does not mean all independent variables are significant Evaluating Multiple Regression Models To determine which independent variables are significant, tests are performed for each variable H H 0 : β = : β The test statistic is calculated and if the p-value is lower than the level of significance (α), the null hypothesis is rejected Jenny Wilson Realty The model is statistically significant The p-value for the F-test is 0.00 r = so the model explains about 67% of the variation in selling price () But the F-test is for the entire model and we can t tell if one or both of the independent variables are significant By calculating the p-value of each variable, we can assess the significance of the individual variables Since the p-value for (square footage) and (age) are both less than the significance level of 0.05, both null hypotheses can be rejected Binary or Dummy Variables Binary (or dummy or indicator) variables are special variables created for qualitative data A binary variable is assigned a value of if a particular qualitative condition is met and a value of 0 otherwise Adding binary variables may increase the accuracy of the regression model The number of binary variables must be one less than the number of categories of the qualitative variable
16 Jenny Wilson Realty Jenny Wilson Realty Jenny believes a better model can be developed if she includes information about the condition of the property 3 = if house is in excellent condition = 0 otherwise 4 = if house is in mint (perfect) condition = 0 otherwise Two binary variables are used to describe the three categories of condition No variable is needed for good condition since if both 3 = 0 and 4 = 0, the house must be in good condition Program Jenny Wilson Realty Jenny Wilson Realty ˆ =, , 3, , Model explains about 89.8% of the variation in selling price F-value indicates significance Program Program 4.3 The two additional dummy variables result in higher r and smaller significance value. Low p-values indicate each variable is significant 4 64
17 Model Building Model Building The best model is a statistically significant model with a high r and few variables As more variables are added to the model, the r -value usually increases However more variables does not necessarily mean better model For this reason, the adjusted r value is often used to determine if additional independent variable is beneficial The adjusted r takes into account the number of independent variables in the model 4 65 The formula for r r SSR = = SST The formula for adjusted r SSE SST SSE /( n k ) Adjusted r = SST /( n ) As the number of independent variables (k) increases, n-k- decreases. This causes SSE/(n-k-) to increase and the adjusted r to decrease unless the extra variable causes a significant decrease in the SSE (and error) to offset the change in k 4 66 Model Building Model Building Note when new variables are added to the model, the value of r will never decrease; however the adjusted r may decrease In general, if a new variable increases the adjusted r, it should probably be included in the model A variable should not be added to the model if it causes the adjusted r to decrease Compare the adjusted r before and after adding the two binary variables in Jenny Wilson Realty example (0.6 vs 0.856) 4 67 In some cases, variables contain duplicate information E.g. size of the lot, # of bedrooms and # of bathrooms might be correlated with the square footage of the house When two independent variables are correlated, they are said to be collinear When more than two independent variables are correlated, multicollinearity exists The model is still good for prediction purpose when multicollinearity is present But hypothesis tests (p-values) for the individual variables and the interpretation of their coefficients are not valid 4 68
18 Nonlinear Regression In some situations relationships between variables are not linear Transformations may be used to turn a nonlinear model into a linear model to use linear regression analysis programs e.g. Excel Linear relationship Nonlinear relationship Colonel Motors The engineers want to use regression analysis to improve fuel efficiency They have been asked to study the impact of weight on miles per gallon (MPG) MPG WEIGHT (,000 LBS.) MPG WEIGHT (,000 LBS.) Table Colonel Motors Colonel Motors MPG Figure 4.6A Linear model ˆ = b + b Weight (,000 lb.) Program 4.4 A useful model with a small F-test for significance and a good r value =
19 MPG Colonel Motors A nonlinear model seams better MPG = b weight + ) 0 + b( ) b ( weight Figure 4.6B Weight (,000 lb.) Colonel Motors The nonlinear model is a quadratic model The easiest way to work with this model is to develop a new variable = ( weight) This gives us a model that can be solved with linear regression software ˆ = b b b Colonel Motors Cautions and Pitfalls Program 4.5 ˆ = A better model with a smaller F-test for significance and a larger adjusted r value Interpretation of coefficients and P-values are not valid 4 75 If the assumptions about the errors are not met, the statistical test may not be valid Correlation does not necessarily mean causation (e.g. price of automobiles and your annual salary) Multicollinearity makes interpreting coefficients problematic, but the model may still be good Using a regression model beyond the range of is questionable, the relationship may not hold outside the sample data (e.g. advertising amount and sales volume) 4 76
20 Cautions and Pitfalls t-tests for the intercept (b 0 ) may be ignored as this point (=0) is often outside the range of the model A linear relationship may not be the best relationship, even if the F-test returns an acceptable value A nonlinear relationship can exist even if a linear relationship does not Just because a relationship is statistically significant doesn't mean it has any practical value r must also be significant Homework Assignment
Chapter 4. Regression Models. Learning Objectives
Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing
More informationChapter 4: Regression Models
Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,
More informationRegression Models. Chapter 4
Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Introduction Regression analysis
More informationBayesian Analysis LEARNING OBJECTIVES. Calculating Revised Probabilities. Calculating Revised Probabilities. Calculating Revised Probabilities
Valua%on and pricing (November 5, 2013) LEARNING OBJECTIVES Lecture 7 Decision making (part 3) Regression theory Olivier J. de Jong, LL.M., MM., MBA, CFD, CFFA, AA www.olivierdejong.com 1. List the steps
More informationThe Multiple Regression Model
Multiple Regression The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & or more independent variables (X i ) Multiple Regression Model with k Independent Variables:
More informationChapter 7 Student Lecture Notes 7-1
Chapter 7 Student Lecture Notes 7- Chapter Goals QM353: Business Statistics Chapter 7 Multiple Regression Analysis and Model Building After completing this chapter, you should be able to: Explain model
More informationRegression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES
M04_REND6289_10_IM_C04.QXD 5/7/08 2:49 PM Page 46 4 C H A P T E R Regression Models TEACHING SUGGESTIONS Teaching Suggestion 4.1: Which Is the Independent Variable? We find that students are often confused
More informationStatistics for Managers using Microsoft Excel 6 th Edition
Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of
More informationChapter 3 Multiple Regression Complete Example
Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be
More informationChapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression
Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing
More informationCorrelation Analysis
Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the
More informationBusiness Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal
Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing
More informationBasic Business Statistics, 10/e
Chapter 4 4- Basic Business Statistics th Edition Chapter 4 Introduction to Multiple Regression Basic Business Statistics, e 9 Prentice-Hall, Inc. Chap 4- Learning Objectives In this chapter, you learn:
More informationChapter 14 Student Lecture Notes 14-1
Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this
More informationBasic Business Statistics 6 th Edition
Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based
More informationChapter 16. Simple Linear Regression and dcorrelation
Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will
More informationChapter 16. Simple Linear Regression and Correlation
Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationEcon 3790: Business and Economics Statistics. Instructor: Yogesh Uppal
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal yuppal@ysu.edu Sampling Distribution of b 1 Expected value of b 1 : Variance of b 1 : E(b 1 ) = 1 Var(b 1 ) = σ 2 /SS x Estimate of
More informationChapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression
Chapter 12 12-1 North Seattle Community College BUS21 Business Statistics Chapter 12 Learning Objectives In this chapter, you learn:! How to use regression analysis to predict the value of a dependent
More informationKeller: Stats for Mgmt & Econ, 7th Ed July 17, 2006
Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will
More informationCh 13 & 14 - Regression Analysis
Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more
More informationBusiness Statistics. Lecture 10: Correlation and Linear Regression
Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form
More informationChapter 14 Simple Linear Regression (A)
Chapter 14 Simple Linear Regression (A) 1. Characteristics Managerial decisions often are based on the relationship between two or more variables. can be used to develop an equation showing how the variables
More informationMathematics for Economics MA course
Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between
More informationRegression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.
Regression Analysis BUS 735: Business Decision Making and Research 1 Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn how to estimate
More informationRegression Analysis. BUS 735: Business Decision Making and Research
Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn
More informationLI EAR REGRESSIO A D CORRELATIO
CHAPTER 6 LI EAR REGRESSIO A D CORRELATIO Page Contents 6.1 Introduction 10 6. Curve Fitting 10 6.3 Fitting a Simple Linear Regression Line 103 6.4 Linear Correlation Analysis 107 6.5 Spearman s Rank Correlation
More informationChapter 13. Multiple Regression and Model Building
Chapter 13 Multiple Regression and Model Building Multiple Regression Models The General Multiple Regression Model y x x x 0 1 1 2 2... k k y is the dependent variable x, x,..., x 1 2 k the model are the
More informationRegression Analysis II
Regression Analysis II Measures of Goodness of fit Two measures of Goodness of fit Measure of the absolute fit of the sample points to the sample regression line Standard error of the estimate An index
More informationEstimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.
Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.
More informationInference for Regression Inference about the Regression Model and Using the Regression Line
Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about
More informationWhat is a Hypothesis?
What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population mean Example: The mean monthly cell phone bill in this city is μ = $42 population proportion Example:
More informationThe simple linear regression model discussed in Chapter 13 was written as
1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple
More informationSTA121: Applied Regression Analysis
STA121: Applied Regression Analysis Linear Regression Analysis - Chapters 3 and 4 in Dielman Artin Department of Statistical Science September 15, 2009 Outline 1 Simple Linear Regression Analysis 2 Using
More informationData Analysis 1 LINEAR REGRESSION. Chapter 03
Data Analysis 1 LINEAR REGRESSION Chapter 03 Data Analysis 2 Outline The Linear Regression Model Least Squares Fit Measures of Fit Inference in Regression Other Considerations in Regression Model Qualitative
More informationChapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics
Chapter 13 Student Lecture Notes 13-1 Department of Quantitative Methods & Information Sstems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analsis QMIS 0 Dr. Mohammad
More informationSimple Linear Regression
9-1 l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical Method for Determining Regression 9.4 Least Square Method 9.5 Correlation Coefficient and Coefficient
More informationCh14. Multiple Regression Analysis
Ch14. Multiple Regression Analysis 1 Goals : multiple regression analysis Model Building and Estimating More than 1 independent variables Quantitative( 量 ) independent variables Qualitative( ) independent
More informationChapte The McGraw-Hill Companies, Inc. All rights reserved.
12er12 Chapte Bivariate i Regression (Part 1) Bivariate Regression Visual Displays Begin the analysis of bivariate data (i.e., two variables) with a scatter plot. A scatter plot - displays each observed
More informationOrdinary Least Squares Regression Explained: Vartanian
Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent
More informationSIMPLE REGRESSION ANALYSIS. Business Statistics
SIMPLE REGRESSION ANALYSIS Business Statistics CONTENTS Ordinary least squares (recap for some) Statistical formulation of the regression model Assessing the regression model Testing the regression coefficients
More informationEcon 3790: Statistics Business and Economics. Instructor: Yogesh Uppal
Econ 3790: Statistics Business and Economics Instructor: Yogesh Uppal Email: yuppal@ysu.edu Chapter 14 Covariance and Simple Correlation Coefficient Simple Linear Regression Covariance Covariance between
More informationInference for Regression
Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu
More informationFinding Relationships Among Variables
Finding Relationships Among Variables BUS 230: Business and Economic Research and Communication 1 Goals Specific goals: Re-familiarize ourselves with basic statistics ideas: sampling distributions, hypothesis
More informationLinear Regression and Correlation
Linear Regression and Correlation Chapter 13 McGraw-Hill/Irwin Copyright 2010 by The McGraw-Hill Companies, Inc. All rights reserved. GOALS 1. Understand and interpret the terms dependent and independent
More informationRegression Analysis. Regression: Methodology for studying the relationship among two or more variables
Regression Analysis Regression: Methodology for studying the relationship among two or more variables Two major aims: Determine an appropriate model for the relationship between the variables Predict the
More informationregression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist
regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist sales $ (y - dependent variable) advertising $ (x - independent variable)
More informationVariance Decomposition and Goodness of Fit
Variance Decomposition and Goodness of Fit 1. Example: Monthly Earnings and Years of Education In this tutorial, we will focus on an example that explores the relationship between total monthly earnings
More informationChapter 12 - Part I: Correlation Analysis
ST coursework due Friday, April - Chapter - Part I: Correlation Analysis Textbook Assignment Page - # Page - #, Page - # Lab Assignment # (available on ST webpage) GOALS When you have completed this lecture,
More informationOrdinary Least Squares Regression Explained: Vartanian
Ordinary Least Squares Regression Eplained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent
More informationPredictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore
What is Multiple Linear Regression Several independent variables may influence the change in response variable we are trying to study. When several independent variables are included in the equation, the
More informationLinear regression. We have that the estimated mean in linear regression is. ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. The standard error of ˆµ Y X=x is.
Linear regression We have that the estimated mean in linear regression is The standard error of ˆµ Y X=x is where x = 1 n s.e.(ˆµ Y X=x ) = σ ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. 1 n + (x x)2 i (x i x) 2 i x i. The
More informationCorrelation and the Analysis of Variance Approach to Simple Linear Regression
Correlation and the Analysis of Variance Approach to Simple Linear Regression Biometry 755 Spring 2009 Correlation and the Analysis of Variance Approach to Simple Linear Regression p. 1/35 Correlation
More informationLecture 15 Multiple regression I Chapter 6 Set 2 Least Square Estimation The quadratic form to be minimized is
Lecture 15 Multiple regression I Chapter 6 Set 2 Least Square Estimation The quadratic form to be minimized is Q = (Y i β 0 β 1 X i1 β 2 X i2 β p 1 X i.p 1 ) 2, which in matrix notation is Q = (Y Xβ) (Y
More informationSingle and multiple linear regression analysis
Single and multiple linear regression analysis Marike Cockeran 2017 Introduction Outline of the session Simple linear regression analysis SPSS example of simple linear regression analysis Additional topics
More informationSTAT 350 Final (new Material) Review Problems Key Spring 2016
1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,
More informationLECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit
LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define
More information1 A Non-technical Introduction to Regression
1 A Non-technical Introduction to Regression Chapters 1 and Chapter 2 of the textbook are reviews of material you should know from your previous study (e.g. in your second year course). They cover, in
More informationF-tests and Nested Models
F-tests and Nested Models Nested Models: A core concept in statistics is comparing nested s. Consider the Y = β 0 + β 1 x 1 + β 2 x 2 + ǫ. (1) The following reduced s are special cases (nested within)
More informationMultiple Regression Methods
Chapter 1: Multiple Regression Methods Hildebrand, Ott and Gray Basic Statistical Ideas for Managers Second Edition 1 Learning Objectives for Ch. 1 The Multiple Linear Regression Model How to interpret
More informationSection 3: Simple Linear Regression
Section 3: Simple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction
More informationMultiple Regression. Peerapat Wongchaiwat, Ph.D.
Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com The Multiple Regression Model Examine the linear relationship between 1 dependent (Y) & 2 or more independent variables (X i ) Multiple Regression Model
More information(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.
FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December
More informationVariance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017
Variance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017 PDF file location: http://www.murraylax.org/rtutorials/regression_anovatable.pdf
More informationBNAD 276 Lecture 10 Simple Linear Regression Model
1 / 27 BNAD 276 Lecture 10 Simple Linear Regression Model Phuong Ho May 30, 2017 2 / 27 Outline 1 Introduction 2 3 / 27 Outline 1 Introduction 2 4 / 27 Simple Linear Regression Model Managerial decisions
More informationLecture 10 Multiple Linear Regression
Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable
More informationMULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics
MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES Business Statistics CONTENTS Multiple regression Dummy regressors Assumptions of regression analysis Predicting with regression analysis Old exam question
More informationA discussion on multiple regression models
A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value
More informationChapter 15 Multiple Regression
Multiple Regression Learning Objectives 1. Understand how multiple regression analysis can be used to develop relationships involving one dependent variable and several independent variables. 2. Be able
More informationInference for Regression Simple Linear Regression
Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating
More informationPubH 7405: REGRESSION ANALYSIS. MLR: INFERENCES, Part I
PubH 7405: REGRESSION ANALYSIS MLR: INFERENCES, Part I TESTING HYPOTHESES Once we have fitted a multiple linear regression model and obtained estimates for the various parameters of interest, we want to
More informationdf=degrees of freedom = n - 1
One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:
More informationECON 497 Midterm Spring
ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain
More informationMidterm 2 - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis February 24, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put
More information1 Correlation and Inference from Regression
1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is
More informationSimple Linear Regression. Material from Devore s book (Ed 8), and Cengagebrain.com
12 Simple Linear Regression Material from Devore s book (Ed 8), and Cengagebrain.com The Simple Linear Regression Model The simplest deterministic mathematical relationship between two variables x and
More informationSimple Linear Regression
Simple Linear Regression ST 430/514 Recall: A regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates)
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationCorrelation and Regression Analysis. Linear Regression and Correlation. Correlation and Linear Regression. Three Questions.
10/8/18 Correlation and Regression Analysis Correlation Analysis is the study of the relationship between variables. It is also defined as group of techniques to measure the association between two variables.
More informationTHE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS
THE ROYAL STATISTICAL SOCIETY 008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS The Society provides these solutions to assist candidates preparing for the examinations
More informationCorrelation & Simple Regression
Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.
More informationDraft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM
1 REGRESSION AND CORRELATION As we learned in Chapter 9 ( Bivariate Tables ), the differential access to the Internet is real and persistent. Celeste Campos-Castillo s (015) research confirmed the impact
More information3. Diagnostics and Remedial Measures
3. Diagnostics and Remedial Measures So far, we took data (X i, Y i ) and we assumed where ɛ i iid N(0, σ 2 ), Y i = β 0 + β 1 X i + ɛ i i = 1, 2,..., n, β 0, β 1 and σ 2 are unknown parameters, X i s
More informationCHAPTER EIGHT Linear Regression
7 CHAPTER EIGHT Linear Regression 8. Scatter Diagram Example 8. A chemical engineer is investigating the effect of process operating temperature ( x ) on product yield ( y ). The study results in the following
More informationSTA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.
STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory
More informationInference with Simple Regression
1 Introduction Inference with Simple Regression Alan B. Gelder 06E:071, The University of Iowa 1 Moving to infinite means: In this course we have seen one-mean problems, twomean problems, and problems
More informationMultiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company
Multiple Regression Inference for Multiple Regression and A Case Study IPS Chapters 11.1 and 11.2 2009 W.H. Freeman and Company Objectives (IPS Chapters 11.1 and 11.2) Multiple regression Data for multiple
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationST Correlation and Regression
Chapter 5 ST 370 - Correlation and Regression Readings: Chapter 11.1-11.4, 11.7.2-11.8, Chapter 12.1-12.2 Recap: So far we ve learned: Why we want a random sample and how to achieve it (Sampling Scheme)
More informationCS 5014: Research Methods in Computer Science
Computer Science Clifford A. Shaffer Department of Computer Science Virginia Tech Blacksburg, Virginia Fall 2010 Copyright c 2010 by Clifford A. Shaffer Computer Science Fall 2010 1 / 207 Correlation and
More information5. Multiple Regression (Regressioanalyysi) (Azcel Ch. 11, Milton/Arnold Ch. 12) The k-variable Multiple Regression Model
5. Multiple Regression (Regressioanalyysi) (Azcel Ch. 11, Milton/Arnold Ch. 12) The k-variable Multiple Regression Model The population regression model of a dependent variable Y on a set of k independent
More informationChapter 14 Multiple Regression Analysis
Chapter 14 Multiple Regression Analysis 1. a. Multiple regression equation b. the Y-intercept c. $374,748 found by Y ˆ = 64,1 +.394(796,) + 9.6(694) 11,6(6.) (LO 1) 2. a. Multiple regression equation b.
More informationECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47
ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with
More informationLectures on Simple Linear Regression Stat 431, Summer 2012
Lectures on Simple Linear Regression Stat 43, Summer 0 Hyunseung Kang July 6-8, 0 Last Updated: July 8, 0 :59PM Introduction Previously, we have been investigating various properties of the population
More informationCHAPTER 5 LINEAR REGRESSION AND CORRELATION
CHAPTER 5 LINEAR REGRESSION AND CORRELATION Expected Outcomes Able to use simple and multiple linear regression analysis, and correlation. Able to conduct hypothesis testing for simple and multiple linear
More informationST430 Exam 2 Solutions
ST430 Exam 2 Solutions Date: November 9, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textbook are permitted but you may use a calculator. Giving
More informationLinear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x).
Linear Regression Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). A dependent variable is a random variable whose variation
More informationCorrelation and regression
1 Correlation and regression Yongjua Laosiritaworn Introductory on Field Epidemiology 6 July 2015, Thailand Data 2 Illustrative data (Doll, 1955) 3 Scatter plot 4 Doll, 1955 5 6 Correlation coefficient,
More informationWhat Is ANOVA? Comparing Groups. One-way ANOVA. One way ANOVA (the F ratio test)
What Is ANOVA? One-way ANOVA ANOVA ANalysis Of VAriance ANOVA compares the means of several groups. The groups are sometimes called "treatments" First textbook presentation in 95. Group Group σ µ µ σ µ
More information