Regression Models. Chapter 4. Introduction. Introduction. Introduction

Size: px
Start display at page:

Download "Regression Models. Chapter 4. Introduction. Introduction. Introduction"

Transcription

1 Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager There are generally two purposes for regression analysis. To understand the relationship between variables E.g. the relationship between the sales volume and the advertising spending amount, the relationship between the price of a house and the square footage, etc.. To predict the value of one variable based on the value of another variable 4 Introduction Introduction Three types of regression models will be studied Simple linear regression models have only two variables We will first develop this model Multiple regression models have more than two variables Nonlinear regression models are used when the relationships between the variables are not linear 4 3 The variable to be predicted is called the dependent variable Sometimes called the response variable The value of this variable depends on the value of the independent variable Sometimes called the explanatory or predictor variable Dependent variable Independent variable Independent variable = Prediction Relationship 4 4

2 Scatter Diagram One way to investigate the relationship between variables is by plotting the data on a graph Such a graph is often called a scatter diagram or a scatter plot The independent variable is normally plotted on the axis The dependent variable is normally plotted on the axis 4 5 Triple A Construction Example Triple A Construction renovates old homes They have found that the dollar volume of renovation work each year is dependent on the area payroll Triple A s revenues and the total wage earnings for the past six years are listed below dependent variable Table 4. TRIPLE A S SALES LOCAL PAROLL ($00,000 s) ($00,000,000 s) independent variable 4 6 Triple A Construction Example Triple A Construction Example Sales ($00 0,000) Payroll ($00 million) The graph indicates higher payroll seem to result in higher sales A line has been drawn to show the relationship between the payroll and the sales There is not a perfect relationship because not all points lie in a straight line Errors are involved if this line is used to predict sales based on payroll Many lines could be drawn through these points, but which one best represents the true relationship? Figure 4.: Scatter Diagram for Triple A Construction Company Data in Table

3 Simple Linear Regression Simple Linear Regression Regression models are used to find the relationship between variables i.e. to predict the value of one variable based on the other However there is some random error that cannot be predicted Regression models can also be used to test if a relationship exists between variables The underlying simple linear regression model is: 0 = β + β + ε where = dependent variable (response) = independent variable (predictor or explanatory) β 0 = intercept (value of when = 0) β = slope of the regression line ε = random error Simple Linear Regression Triple A Construction The random error cannot be predicted. So an approximation of the model is used where ^ ˆ = b 0 + b = predicted value of = independent variable (predictor or explanatory) b 0 = estimate of β 0 b = estimate of β Triple A Construction is trying to predict sales based on area payroll = Sales = Area payroll The line chosen in Figure 4. is the one that best fits the sample data by minimizing the sum of all errors Error = (Actual value) (Predicted value) e = ˆ 4 4

4 Triple A Construction The errors may be positive or negative large positive and negative errors may cancel each other result in very small average error thus errors are squared Error = [(Actual value) (Predicted value)] e = ( ) ˆ The best regression line is defined as the one that minimize the sum of squared errors, i.e. the total distance between the actual data points and the line Triple A Construction For the simple linear regression model, the values of the intercept and slope can be calculated from n sample data using the formulas below = = n = = n b ˆ = b + 0 b average (mean) of average (mean) of ( )( ) = b0 = b ( ) values values Triple A Construction Regression calculations ( ) ( )( ) 6 3 (3 4) = (3 4)(6 7) = 8 4 (4 4) = 0 (4 4)(8 7) = 0 Σ = 4 = 4/6 = (6 4) = 4 (6 4)(9 7) = (4 4) = 0 (4 4)(5 7) = ( 4) = 4 ( 4)(4.5 7) = (5 4) = (5 4)(9.5 7) =.5 Σ = 4 = 4/6 = 4 Σ( ) = 0 Σ( )( ) =.5 Triple A Construction Regression calculations = 6 4 = = = = = b ( )( ). 5 = = 5 ( ) 0 =. b0 = b = 7 (. 5)( 4) = Table 4. Therefore ˆ =

5 Triple A Construction Measuring the Fit of the Regression Model Regression calculations = 6 4 = = 4 sales = +.5(payroll) 6 4 If the payroll next = = = 7 year is $600 million 6 6 ( )( ˆ = ) ( 6) = 9. 5 or $ 950, 000 b = = =. 5 ( ) 0 b0 = b = 7 (. 5)( 4) = Therefore ˆ = Regression models can be developed for any variables and How do we know the model is good enough (with small errors) in predicting based on? The following measures are useful in describing the accuracy of the model Three measures of variability SST Total variability about the mean SSE Variability about the regression line SSR Total variability that is explained by the model 4 8 Measuring the Fit of the Regression Model Measuring the Fit of the Regression Model Sum of the squares total SST = ( ) ( ) ( ) ( ) 6 3 (6 7) = +.5(3) = ^ ^ ^ Sum of the squared error 8 4 (8 7) = +.5(4) = SSE = e = ( ˆ ) Sum of squares due to regression SSR = ˆ ( ) 9 6 (9 7) = 4 +.5(6) = (5 7) = 4 +.5(4) = (4.5 7) = () = (9.5 7) = (5) = An important relationship SST = SSR + SSE ( ) ^ =.5 ( ) ^ = ( ) = 5.65 = 7 SST =.5 SSE = SSR = 5.65 Table

6 Measuring the Fit of the Regression Model SST =.5 is the variability of the prediction using mean value of SSE = is the variability of the prediction using regression line Prediction using regression line has reduced the variability by = 5.65 SSR = 5.65 indicates how much of the total variability in is explained by the regression model Note: SST = SSR + SSE SSR explained variability SSE unexplained variability Sales ($00 0,000) Measuring the Fit of the Regression Model (SSR) (SSE) ^ ^ ^ = +.5 (SST) Payroll ($00 million) Figure Coefficient of Determination Correlation Coefficient The proportion of the variability in explained by regression equation is called the coefficient of determination The coefficient of determination is r r SSR SSE = = SST SST For Triple A Construction r = = About 69% of the variability in is explained by the equation based on payroll () If SSE 0, then r 00% 4 3 The correlation coefficient is an expression of the strength of the linear relationship between the variables r = ± It will always be between + and Negative slope r < 0; positive slope r > 0 The correlation coefficient is r For Triple A Construction r = r =

7 Correlation Coefficient Using Computer Software for Regression Figure 4.3 (a) Perfect Positive Correlation: r = + (c) No Correlation: r = 0 (b) Positive Correlation: 0 < r < (d) Perfect Negative Correlation: r = 4 5 Program 4.A 4 6 Using Computer Software for Regression Using Computer Software for Regression Program 4.B Program 4.C

8 Using Computer Software for Regression Correlation coefficient (r) is Multiple R in Excel Assumptions of the Regression Model If we make certain assumptions about the errors in a regression model, we can perform statistical tests to determine if the model is useful. Errors are independent. Errors are normally distributed 3. Errors have a mean of zero 4. Errors have a constant variance A plot of the residuals (errors) will often highlight any glaring violations of the assumption Program 4.D = +.5 ˆ Residual Plots Residual Plots A random plot of residuals Healthy pattern no violations Nonconstant error variance violation Errors increase as increases, violating the constant variance assumption Error Error = 0 Error Error = 0 Figure 4.4A Figure 4.4B

9 Residual Plots Nonlinear relationship violation Errors consistently increasing and then consistently decreasing indicate that the model is not linear (perhaps quadratic) Estimating the Variance Errors are assumed to have a constant variance (σ ), but we usually don t know this It can be estimated using the mean squared error (MSE), s Error Error = 0 s SSE = MSE = n k where n = number of observations in the sample k = number of independent variables Figure 4.4C Estimating the Variance Testing the Model for Significance For Triple A Construction s SSE = MSE = = = =. 788 n k 6 4 We can estimate the standard deviation, s This is also called the standard error of the estimate or the standard deviation of the regression s = MSE =. 788 =. 3 A small s or s means the actual data deviate within a small range from the predicted result Both r and the MSE (s ) provide a measure of accuracy in a regression model However when the sample size is too small, you can get good values for MSE and r even if there is no relationship between the variables Testing the model for significance helps determine if r and MSE are meaningful and if a linear relationship exists between the variables We do this by performing a statistical hypothesis test

10 Testing the Model for Significance We start with the general linear model 0 = β + β + ε If β = 0, the null hypothesis is that there is no relationship between and The alternate hypothesis is that there is a linear relationship (β 0) If the null hypothesis can be rejected, we have proven there is a linear relationship We use the F statistic for this test The F Distribution A continuous probability distribution (Fig..5) The area underneath the curve represents probability of the F statistic value falling within a particular interval. The F statistic is the ratio of two sample variances F distributions have two sets of degrees of freedom Degrees of freedom are based on sample size and used to calculate the numerator and denominator df = degrees of freedom for the numerator df = degrees of freedom for the denominator The F Distribution The F Distribution Consider the example: df = 5 df = 6 α = 0.05 (probability) From Appendix D, we get This means F α, df, df = F 0.05, 5, 6 = 4.39 P(F > 4.39) = 0.05 There is only a 5% probability that F will exceed 4.39 (see Fig..6) Figure.5 F α

11 The F Distribution Testing the Model for Significance Figure.6 F value for 0.05 probability with 5 and 6 degrees of freedom F = The F statistic for testing the model is based on the MSE (s ) and mean squared regression (MSR) SSR MSR = k where k = number of independent variables in the model The F statistic is F = MSR MSE This describes an F distribution with degrees of freedom for the numerator = df = k degrees of freedom for the denominator = df = n k Testing the Model for Significance If there is very little error, the MSE would be small and the F-statistic would be large indicating the model is useful If the F-statistic is large, the significance level (p-value) will be low, indicating it is unlikely this would have occurred by chance So when the F-value is large, we can reject the null hypothesis and accept that there is a linear relationship between and and the values of the MSE and r are meaningful Steps in a Hypothesis Test. Specify null and alternative hypotheses H0 : β = 0 H : β 0. Select the level of significance (α). Common values are 0.0 and Calculate the value of the test statistic using the formula MSR F = MSE

12 Steps in a Hypothesis Test Triple A Construction 4. Make a decision using one of the following methods a) Reject the null hypothesis if the test statistic is greater than the F-value from the table in Appendix D. Otherwise, do not reject the null hypothesis: Reject if df = k df = n k F calculated > Fα, df, df b) Reject the null hypothesis if the observed significance level, or p-value, is less than the level of significance (α). Otherwise, do not reject the null hypothesis: p - value = P( F > calculated test statistic) Reject if p - value < α Step. H 0 : β = 0 (no linear relationship between and ) H : β 0 (linear relationship exists between and ) Step. Select α = 0.05 Step 3. Calculate the value of the test statistic MSR F SSR = k MSR = MSE = = = = Triple A Construction Triple A Construction Step 4. Reject the null hypothesis if the test statistic is greater than the F-value in Appendix D df = k = df = n k = 6 = 4 The value of F associated with a 5% level of significance and with degrees of freedom and 4 is found in Appendix D We can conclude there is a statistically significant relationship between and The r value of 0.69 means about 69% of the variability in sales () is explained by local payroll () F 0.05,,4 = 7.7 F calculated = 9.09 Reject H 0 because 9.09 > 7.7 Figure F =

13 Triple A Construction The F-test determines whether or not there is a relationship between the variables r (coefficient of determination) is the best measure of the strength of the prediction relationship between the and variables Values closer to indicate a strong prediction relationship Good regression models have a low significance level for the F-test and high r value. Analysis of Variance (ANOVA) Table When software is used to develop a regression model, an ANOVA table is typically created that shows the observed significance level (p-value) for the calculated F value This can be compared to the level of significance (α) to make a decision DF SS MS F SIGNIFICANCE Regression k SSR MSR = SSR/k MSR/MSE P(F > MSR/MSE) Residual n - k - SSE MSE = SSE/(n - k - ) Total n - SST Table ANOVA for Triple A Construction Multiple Regression Analysis Multiple regression models are extensions to the simple linear model and allow the creation of models with several independent variables Program 4.D (partial) P(F > ) = Because this probability is less than 0.05, we reject the null hypothesis of no linear relationship and conclude there is a linear relationship between and = β 0 + β + β + + β k k + ε where = dependent variable (response variable) i = ith independent variable (predictor or explanatory variable) β 0 = intercept (value of when all i = 0) β I = coefficient of the ith independent variable k = number of independent variables ε = random error

14 Multiple Regression Analysis To estimate these values, samples are taken and the following equation is developed ˆ = b + b + b + + b 0... k k where Ŷ = predicted value of b 0 = sample intercept (and is an estimate of β 0 ) b i = sample coefficient of the ith variable (and is an estimate of β i ) 4 53 Jenny Wilson Realty Jenny Wilson wants to develop a model to determine the suggested listing price for houses based on the size and age of the house where ˆ = b + b + b 0 ˆ = predicted value of dependent variable (selling price) b 0 = intercept and = value of the two independent variables (square footage and age) respectively b and b = slopes for and respectively She selects a few samples of the houses sold recently and records the data shown in Table 4.5 She also saves information on house condition to be used later 4 54 Jenny Wilson Realty Jenny Wilson Realty Table 4.5 SELLING PRICE ($) SQUARE FOOTAGE AGE 95,000,96 30 Good CONDITION 9,000, Excellent 4,800,70 30 Excellent 35,000,396 5 Good 4,000,706 3 Mint 45,000, Mint 59,000,950 7 Mint 65,000,33 30 Excellent 8,000,85 6 Mint 83,000 3,75 35 Good 00,000,300 8 Good,000,55 7 Good 5,000 3, Excellent 9,000,740 Mint Program 4. ˆ =

15 Evaluating Multiple Regression Models Evaluation is similar to simple linear regression models The p-value for the F-test and r are interpreted the same The hypothesis is different because there is more than one independent variable The F-test is investigating whether all the coefficients are equal to 0 If the F-test is significant, it does not mean all independent variables are significant Evaluating Multiple Regression Models To determine which independent variables are significant, tests are performed for each variable H H 0 : β = : β The test statistic is calculated and if the p-value is lower than the level of significance (α), the null hypothesis is rejected Jenny Wilson Realty The model is statistically significant The p-value for the F-test is 0.00 r = so the model explains about 67% of the variation in selling price () But the F-test is for the entire model and we can t tell if one or both of the independent variables are significant By calculating the p-value of each variable, we can assess the significance of the individual variables Since the p-value for (square footage) and (age) are both less than the significance level of 0.05, both null hypotheses can be rejected Binary or Dummy Variables Binary (or dummy or indicator) variables are special variables created for qualitative data A binary variable is assigned a value of if a particular qualitative condition is met and a value of 0 otherwise Adding binary variables may increase the accuracy of the regression model The number of binary variables must be one less than the number of categories of the qualitative variable

16 Jenny Wilson Realty Jenny Wilson Realty Jenny believes a better model can be developed if she includes information about the condition of the property 3 = if house is in excellent condition = 0 otherwise 4 = if house is in mint (perfect) condition = 0 otherwise Two binary variables are used to describe the three categories of condition No variable is needed for good condition since if both 3 = 0 and 4 = 0, the house must be in good condition Program Jenny Wilson Realty Jenny Wilson Realty ˆ =, , 3, , Model explains about 89.8% of the variation in selling price F-value indicates significance Program Program 4.3 The two additional dummy variables result in higher r and smaller significance value. Low p-values indicate each variable is significant 4 64

17 Model Building Model Building The best model is a statistically significant model with a high r and few variables As more variables are added to the model, the r -value usually increases However more variables does not necessarily mean better model For this reason, the adjusted r value is often used to determine if additional independent variable is beneficial The adjusted r takes into account the number of independent variables in the model 4 65 The formula for r r SSR = = SST The formula for adjusted r SSE SST SSE /( n k ) Adjusted r = SST /( n ) As the number of independent variables (k) increases, n-k- decreases. This causes SSE/(n-k-) to increase and the adjusted r to decrease unless the extra variable causes a significant decrease in the SSE (and error) to offset the change in k 4 66 Model Building Model Building Note when new variables are added to the model, the value of r will never decrease; however the adjusted r may decrease In general, if a new variable increases the adjusted r, it should probably be included in the model A variable should not be added to the model if it causes the adjusted r to decrease Compare the adjusted r before and after adding the two binary variables in Jenny Wilson Realty example (0.6 vs 0.856) 4 67 In some cases, variables contain duplicate information E.g. size of the lot, # of bedrooms and # of bathrooms might be correlated with the square footage of the house When two independent variables are correlated, they are said to be collinear When more than two independent variables are correlated, multicollinearity exists The model is still good for prediction purpose when multicollinearity is present But hypothesis tests (p-values) for the individual variables and the interpretation of their coefficients are not valid 4 68

18 Nonlinear Regression In some situations relationships between variables are not linear Transformations may be used to turn a nonlinear model into a linear model to use linear regression analysis programs e.g. Excel Linear relationship Nonlinear relationship Colonel Motors The engineers want to use regression analysis to improve fuel efficiency They have been asked to study the impact of weight on miles per gallon (MPG) MPG WEIGHT (,000 LBS.) MPG WEIGHT (,000 LBS.) Table Colonel Motors Colonel Motors MPG Figure 4.6A Linear model ˆ = b + b Weight (,000 lb.) Program 4.4 A useful model with a small F-test for significance and a good r value =

19 MPG Colonel Motors A nonlinear model seams better MPG = b weight + ) 0 + b( ) b ( weight Figure 4.6B Weight (,000 lb.) Colonel Motors The nonlinear model is a quadratic model The easiest way to work with this model is to develop a new variable = ( weight) This gives us a model that can be solved with linear regression software ˆ = b b b Colonel Motors Cautions and Pitfalls Program 4.5 ˆ = A better model with a smaller F-test for significance and a larger adjusted r value Interpretation of coefficients and P-values are not valid 4 75 If the assumptions about the errors are not met, the statistical test may not be valid Correlation does not necessarily mean causation (e.g. price of automobiles and your annual salary) Multicollinearity makes interpreting coefficients problematic, but the model may still be good Using a regression model beyond the range of is questionable, the relationship may not hold outside the sample data (e.g. advertising amount and sales volume) 4 76

20 Cautions and Pitfalls t-tests for the intercept (b 0 ) may be ignored as this point (=0) is often outside the range of the model A linear relationship may not be the best relationship, even if the F-test returns an acceptable value A nonlinear relationship can exist even if a linear relationship does not Just because a relationship is statistically significant doesn't mean it has any practical value r must also be significant Homework Assignment

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

Chapter 4: Regression Models

Chapter 4: Regression Models Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,

More information

Regression Models. Chapter 4

Regression Models. Chapter 4 Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Introduction Regression analysis

More information

Bayesian Analysis LEARNING OBJECTIVES. Calculating Revised Probabilities. Calculating Revised Probabilities. Calculating Revised Probabilities

Bayesian Analysis LEARNING OBJECTIVES. Calculating Revised Probabilities. Calculating Revised Probabilities. Calculating Revised Probabilities Valua%on and pricing (November 5, 2013) LEARNING OBJECTIVES Lecture 7 Decision making (part 3) Regression theory Olivier J. de Jong, LL.M., MM., MBA, CFD, CFFA, AA www.olivierdejong.com 1. List the steps

More information

The Multiple Regression Model

The Multiple Regression Model Multiple Regression The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & or more independent variables (X i ) Multiple Regression Model with k Independent Variables:

More information

Chapter 7 Student Lecture Notes 7-1

Chapter 7 Student Lecture Notes 7-1 Chapter 7 Student Lecture Notes 7- Chapter Goals QM353: Business Statistics Chapter 7 Multiple Regression Analysis and Model Building After completing this chapter, you should be able to: Explain model

More information

Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES

Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES M04_REND6289_10_IM_C04.QXD 5/7/08 2:49 PM Page 46 4 C H A P T E R Regression Models TEACHING SUGGESTIONS Teaching Suggestion 4.1: Which Is the Independent Variable? We find that students are often confused

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of

More information

Chapter 3 Multiple Regression Complete Example

Chapter 3 Multiple Regression Complete Example Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be

More information

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 4 4- Basic Business Statistics th Edition Chapter 4 Introduction to Multiple Regression Basic Business Statistics, e 9 Prentice-Hall, Inc. Chap 4- Learning Objectives In this chapter, you learn:

More information

Chapter 14 Student Lecture Notes 14-1

Chapter 14 Student Lecture Notes 14-1 Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

Chapter 16. Simple Linear Regression and dcorrelation

Chapter 16. Simple Linear Regression and dcorrelation Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal yuppal@ysu.edu Sampling Distribution of b 1 Expected value of b 1 : Variance of b 1 : E(b 1 ) = 1 Var(b 1 ) = σ 2 /SS x Estimate of

More information

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression Chapter 12 12-1 North Seattle Community College BUS21 Business Statistics Chapter 12 Learning Objectives In this chapter, you learn:! How to use regression analysis to predict the value of a dependent

More information

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006 Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

Chapter 14 Simple Linear Regression (A)

Chapter 14 Simple Linear Regression (A) Chapter 14 Simple Linear Regression (A) 1. Characteristics Managerial decisions often are based on the relationship between two or more variables. can be used to develop an equation showing how the variables

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables. Regression Analysis BUS 735: Business Decision Making and Research 1 Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn how to estimate

More information

Regression Analysis. BUS 735: Business Decision Making and Research

Regression Analysis. BUS 735: Business Decision Making and Research Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn

More information

LI EAR REGRESSIO A D CORRELATIO

LI EAR REGRESSIO A D CORRELATIO CHAPTER 6 LI EAR REGRESSIO A D CORRELATIO Page Contents 6.1 Introduction 10 6. Curve Fitting 10 6.3 Fitting a Simple Linear Regression Line 103 6.4 Linear Correlation Analysis 107 6.5 Spearman s Rank Correlation

More information

Chapter 13. Multiple Regression and Model Building

Chapter 13. Multiple Regression and Model Building Chapter 13 Multiple Regression and Model Building Multiple Regression Models The General Multiple Regression Model y x x x 0 1 1 2 2... k k y is the dependent variable x, x,..., x 1 2 k the model are the

More information

Regression Analysis II

Regression Analysis II Regression Analysis II Measures of Goodness of fit Two measures of Goodness of fit Measure of the absolute fit of the sample points to the sample regression line Standard error of the estimate An index

More information

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X. Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Inference about the Regression Model and Using the Regression Line Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about

More information

What is a Hypothesis?

What is a Hypothesis? What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population mean Example: The mean monthly cell phone bill in this city is μ = $42 population proportion Example:

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

STA121: Applied Regression Analysis

STA121: Applied Regression Analysis STA121: Applied Regression Analysis Linear Regression Analysis - Chapters 3 and 4 in Dielman Artin Department of Statistical Science September 15, 2009 Outline 1 Simple Linear Regression Analysis 2 Using

More information

Data Analysis 1 LINEAR REGRESSION. Chapter 03

Data Analysis 1 LINEAR REGRESSION. Chapter 03 Data Analysis 1 LINEAR REGRESSION Chapter 03 Data Analysis 2 Outline The Linear Regression Model Least Squares Fit Measures of Fit Inference in Regression Other Considerations in Regression Model Qualitative

More information

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics Chapter 13 Student Lecture Notes 13-1 Department of Quantitative Methods & Information Sstems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analsis QMIS 0 Dr. Mohammad

More information

Simple Linear Regression

Simple Linear Regression 9-1 l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical Method for Determining Regression 9.4 Least Square Method 9.5 Correlation Coefficient and Coefficient

More information

Ch14. Multiple Regression Analysis

Ch14. Multiple Regression Analysis Ch14. Multiple Regression Analysis 1 Goals : multiple regression analysis Model Building and Estimating More than 1 independent variables Quantitative( 量 ) independent variables Qualitative( ) independent

More information

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Chapte The McGraw-Hill Companies, Inc. All rights reserved. 12er12 Chapte Bivariate i Regression (Part 1) Bivariate Regression Visual Displays Begin the analysis of bivariate data (i.e., two variables) with a scatter plot. A scatter plot - displays each observed

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

SIMPLE REGRESSION ANALYSIS. Business Statistics

SIMPLE REGRESSION ANALYSIS. Business Statistics SIMPLE REGRESSION ANALYSIS Business Statistics CONTENTS Ordinary least squares (recap for some) Statistical formulation of the regression model Assessing the regression model Testing the regression coefficients

More information

Econ 3790: Statistics Business and Economics. Instructor: Yogesh Uppal

Econ 3790: Statistics Business and Economics. Instructor: Yogesh Uppal Econ 3790: Statistics Business and Economics Instructor: Yogesh Uppal Email: yuppal@ysu.edu Chapter 14 Covariance and Simple Correlation Coefficient Simple Linear Regression Covariance Covariance between

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

Finding Relationships Among Variables

Finding Relationships Among Variables Finding Relationships Among Variables BUS 230: Business and Economic Research and Communication 1 Goals Specific goals: Re-familiarize ourselves with basic statistics ideas: sampling distributions, hypothesis

More information

Linear Regression and Correlation

Linear Regression and Correlation Linear Regression and Correlation Chapter 13 McGraw-Hill/Irwin Copyright 2010 by The McGraw-Hill Companies, Inc. All rights reserved. GOALS 1. Understand and interpret the terms dependent and independent

More information

Regression Analysis. Regression: Methodology for studying the relationship among two or more variables

Regression Analysis. Regression: Methodology for studying the relationship among two or more variables Regression Analysis Regression: Methodology for studying the relationship among two or more variables Two major aims: Determine an appropriate model for the relationship between the variables Predict the

More information

regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist

regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist sales $ (y - dependent variable) advertising $ (x - independent variable)

More information

Variance Decomposition and Goodness of Fit

Variance Decomposition and Goodness of Fit Variance Decomposition and Goodness of Fit 1. Example: Monthly Earnings and Years of Education In this tutorial, we will focus on an example that explores the relationship between total monthly earnings

More information

Chapter 12 - Part I: Correlation Analysis

Chapter 12 - Part I: Correlation Analysis ST coursework due Friday, April - Chapter - Part I: Correlation Analysis Textbook Assignment Page - # Page - #, Page - # Lab Assignment # (available on ST webpage) GOALS When you have completed this lecture,

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Eplained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore What is Multiple Linear Regression Several independent variables may influence the change in response variable we are trying to study. When several independent variables are included in the equation, the

More information

Linear regression. We have that the estimated mean in linear regression is. ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. The standard error of ˆµ Y X=x is.

Linear regression. We have that the estimated mean in linear regression is. ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. The standard error of ˆµ Y X=x is. Linear regression We have that the estimated mean in linear regression is The standard error of ˆµ Y X=x is where x = 1 n s.e.(ˆµ Y X=x ) = σ ˆµ Y X=x = ˆβ 0 + ˆβ 1 x. 1 n + (x x)2 i (x i x) 2 i x i. The

More information

Correlation and the Analysis of Variance Approach to Simple Linear Regression

Correlation and the Analysis of Variance Approach to Simple Linear Regression Correlation and the Analysis of Variance Approach to Simple Linear Regression Biometry 755 Spring 2009 Correlation and the Analysis of Variance Approach to Simple Linear Regression p. 1/35 Correlation

More information

Lecture 15 Multiple regression I Chapter 6 Set 2 Least Square Estimation The quadratic form to be minimized is

Lecture 15 Multiple regression I Chapter 6 Set 2 Least Square Estimation The quadratic form to be minimized is Lecture 15 Multiple regression I Chapter 6 Set 2 Least Square Estimation The quadratic form to be minimized is Q = (Y i β 0 β 1 X i1 β 2 X i2 β p 1 X i.p 1 ) 2, which in matrix notation is Q = (Y Xβ) (Y

More information

Single and multiple linear regression analysis

Single and multiple linear regression analysis Single and multiple linear regression analysis Marike Cockeran 2017 Introduction Outline of the session Simple linear regression analysis SPSS example of simple linear regression analysis Additional topics

More information

STAT 350 Final (new Material) Review Problems Key Spring 2016

STAT 350 Final (new Material) Review Problems Key Spring 2016 1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,

More information

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define

More information

1 A Non-technical Introduction to Regression

1 A Non-technical Introduction to Regression 1 A Non-technical Introduction to Regression Chapters 1 and Chapter 2 of the textbook are reviews of material you should know from your previous study (e.g. in your second year course). They cover, in

More information

F-tests and Nested Models

F-tests and Nested Models F-tests and Nested Models Nested Models: A core concept in statistics is comparing nested s. Consider the Y = β 0 + β 1 x 1 + β 2 x 2 + ǫ. (1) The following reduced s are special cases (nested within)

More information

Multiple Regression Methods

Multiple Regression Methods Chapter 1: Multiple Regression Methods Hildebrand, Ott and Gray Basic Statistical Ideas for Managers Second Edition 1 Learning Objectives for Ch. 1 The Multiple Linear Regression Model How to interpret

More information

Section 3: Simple Linear Regression

Section 3: Simple Linear Regression Section 3: Simple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction

More information

Multiple Regression. Peerapat Wongchaiwat, Ph.D.

Multiple Regression. Peerapat Wongchaiwat, Ph.D. Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com The Multiple Regression Model Examine the linear relationship between 1 dependent (Y) & 2 or more independent variables (X i ) Multiple Regression Model

More information

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December

More information

Variance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017

Variance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017 Variance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017 PDF file location: http://www.murraylax.org/rtutorials/regression_anovatable.pdf

More information

BNAD 276 Lecture 10 Simple Linear Regression Model

BNAD 276 Lecture 10 Simple Linear Regression Model 1 / 27 BNAD 276 Lecture 10 Simple Linear Regression Model Phuong Ho May 30, 2017 2 / 27 Outline 1 Introduction 2 3 / 27 Outline 1 Introduction 2 4 / 27 Simple Linear Regression Model Managerial decisions

More information

Lecture 10 Multiple Linear Regression

Lecture 10 Multiple Linear Regression Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable

More information

MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics

MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES Business Statistics CONTENTS Multiple regression Dummy regressors Assumptions of regression analysis Predicting with regression analysis Old exam question

More information

A discussion on multiple regression models

A discussion on multiple regression models A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value

More information

Chapter 15 Multiple Regression

Chapter 15 Multiple Regression Multiple Regression Learning Objectives 1. Understand how multiple regression analysis can be used to develop relationships involving one dependent variable and several independent variables. 2. Be able

More information

Inference for Regression Simple Linear Regression

Inference for Regression Simple Linear Regression Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating

More information

PubH 7405: REGRESSION ANALYSIS. MLR: INFERENCES, Part I

PubH 7405: REGRESSION ANALYSIS. MLR: INFERENCES, Part I PubH 7405: REGRESSION ANALYSIS MLR: INFERENCES, Part I TESTING HYPOTHESES Once we have fitted a multiple linear regression model and obtained estimates for the various parameters of interest, we want to

More information

df=degrees of freedom = n - 1

df=degrees of freedom = n - 1 One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:

More information

ECON 497 Midterm Spring

ECON 497 Midterm Spring ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 24, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

1 Correlation and Inference from Regression

1 Correlation and Inference from Regression 1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is

More information

Simple Linear Regression. Material from Devore s book (Ed 8), and Cengagebrain.com

Simple Linear Regression. Material from Devore s book (Ed 8), and Cengagebrain.com 12 Simple Linear Regression Material from Devore s book (Ed 8), and Cengagebrain.com The Simple Linear Regression Model The simplest deterministic mathematical relationship between two variables x and

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression ST 430/514 Recall: A regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates)

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Correlation and Regression Analysis. Linear Regression and Correlation. Correlation and Linear Regression. Three Questions.

Correlation and Regression Analysis. Linear Regression and Correlation. Correlation and Linear Regression. Three Questions. 10/8/18 Correlation and Regression Analysis Correlation Analysis is the study of the relationship between variables. It is also defined as group of techniques to measure the association between two variables.

More information

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS THE ROYAL STATISTICAL SOCIETY 008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS The Society provides these solutions to assist candidates preparing for the examinations

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM 1 REGRESSION AND CORRELATION As we learned in Chapter 9 ( Bivariate Tables ), the differential access to the Internet is real and persistent. Celeste Campos-Castillo s (015) research confirmed the impact

More information

3. Diagnostics and Remedial Measures

3. Diagnostics and Remedial Measures 3. Diagnostics and Remedial Measures So far, we took data (X i, Y i ) and we assumed where ɛ i iid N(0, σ 2 ), Y i = β 0 + β 1 X i + ɛ i i = 1, 2,..., n, β 0, β 1 and σ 2 are unknown parameters, X i s

More information

CHAPTER EIGHT Linear Regression

CHAPTER EIGHT Linear Regression 7 CHAPTER EIGHT Linear Regression 8. Scatter Diagram Example 8. A chemical engineer is investigating the effect of process operating temperature ( x ) on product yield ( y ). The study results in the following

More information

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information. STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory

More information

Inference with Simple Regression

Inference with Simple Regression 1 Introduction Inference with Simple Regression Alan B. Gelder 06E:071, The University of Iowa 1 Moving to infinite means: In this course we have seen one-mean problems, twomean problems, and problems

More information

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company Multiple Regression Inference for Multiple Regression and A Case Study IPS Chapters 11.1 and 11.2 2009 W.H. Freeman and Company Objectives (IPS Chapters 11.1 and 11.2) Multiple regression Data for multiple

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

ST Correlation and Regression

ST Correlation and Regression Chapter 5 ST 370 - Correlation and Regression Readings: Chapter 11.1-11.4, 11.7.2-11.8, Chapter 12.1-12.2 Recap: So far we ve learned: Why we want a random sample and how to achieve it (Sampling Scheme)

More information

CS 5014: Research Methods in Computer Science

CS 5014: Research Methods in Computer Science Computer Science Clifford A. Shaffer Department of Computer Science Virginia Tech Blacksburg, Virginia Fall 2010 Copyright c 2010 by Clifford A. Shaffer Computer Science Fall 2010 1 / 207 Correlation and

More information

5. Multiple Regression (Regressioanalyysi) (Azcel Ch. 11, Milton/Arnold Ch. 12) The k-variable Multiple Regression Model

5. Multiple Regression (Regressioanalyysi) (Azcel Ch. 11, Milton/Arnold Ch. 12) The k-variable Multiple Regression Model 5. Multiple Regression (Regressioanalyysi) (Azcel Ch. 11, Milton/Arnold Ch. 12) The k-variable Multiple Regression Model The population regression model of a dependent variable Y on a set of k independent

More information

Chapter 14 Multiple Regression Analysis

Chapter 14 Multiple Regression Analysis Chapter 14 Multiple Regression Analysis 1. a. Multiple regression equation b. the Y-intercept c. $374,748 found by Y ˆ = 64,1 +.394(796,) + 9.6(694) 11,6(6.) (LO 1) 2. a. Multiple regression equation b.

More information

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47 ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with

More information

Lectures on Simple Linear Regression Stat 431, Summer 2012

Lectures on Simple Linear Regression Stat 431, Summer 2012 Lectures on Simple Linear Regression Stat 43, Summer 0 Hyunseung Kang July 6-8, 0 Last Updated: July 8, 0 :59PM Introduction Previously, we have been investigating various properties of the population

More information

CHAPTER 5 LINEAR REGRESSION AND CORRELATION

CHAPTER 5 LINEAR REGRESSION AND CORRELATION CHAPTER 5 LINEAR REGRESSION AND CORRELATION Expected Outcomes Able to use simple and multiple linear regression analysis, and correlation. Able to conduct hypothesis testing for simple and multiple linear

More information

ST430 Exam 2 Solutions

ST430 Exam 2 Solutions ST430 Exam 2 Solutions Date: November 9, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textbook are permitted but you may use a calculator. Giving

More information

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x).

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). Linear Regression Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). A dependent variable is a random variable whose variation

More information

Correlation and regression

Correlation and regression 1 Correlation and regression Yongjua Laosiritaworn Introductory on Field Epidemiology 6 July 2015, Thailand Data 2 Illustrative data (Doll, 1955) 3 Scatter plot 4 Doll, 1955 5 6 Correlation coefficient,

More information

What Is ANOVA? Comparing Groups. One-way ANOVA. One way ANOVA (the F ratio test)

What Is ANOVA? Comparing Groups. One-way ANOVA. One way ANOVA (the F ratio test) What Is ANOVA? One-way ANOVA ANOVA ANalysis Of VAriance ANOVA compares the means of several groups. The groups are sometimes called "treatments" First textbook presentation in 95. Group Group σ µ µ σ µ

More information