Chapter 3 Multiple Regression Complete Example

Size: px
Start display at page:

Download "Chapter 3 Multiple Regression Complete Example"

Transcription

1 Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal

2 Review Goals After completing this lecture, you should be able to: Formulate null and alternative hypotheses for applications involving a single population mean or proportion Formulate a decision rule for testing a hypothesis Know how to use the test statistic, critical value, and p-value approaches to test the null hypothesis Know what Type I and Type II errors are 2

3 Review Goals explain model building using multiple regression analysis apply multiple regression analysis to business decision-making situations (continued) analyze and interpret the computer output for a multiple regression model test the significance of the independent variables in a multiple regression model 3

4 Review Goals recognize potential problems in multiple regression analysis and take steps to correct the problems incorporate qualitative variables into the regression model by using dummy variables (continued) use variable transformations to model nonlinear relationships 4

5 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population mean Example: The mean monthly cell phone bill of this city is µ = $42 population proportion Example: The proportion of adults in this city with cell phones is P =.68 5

6 The Null Hypothesis, H 0 States the assumption (numerical) to be tested Example: The average number of TV sets in H 0 : μ U.S. Homes is at least three ( ) Is always about a population parameter, not about a sample statistic 3 H 0 : μ 3 : x 3 H 0 6

7 The Null Hypothesis, H 0 Begin with the assumption that the null hypothesis is true Similar to the notion of innocent until proven guilty Refers to the status quo Always contains =, or sign May or may not be rejected (continued) 7

8 The Alternative Hypothesis, H A Is the opposite of the null hypothesis e.g.: The average number of TV sets in U.S. homes is less than 3 ( H A : µ < 3 ) Challenges the status quo Never contains the =, or sign May or may not be accepted Is generally the hypothesis that is believed (or needs to be supported) by the researcher a research hypothesis 8

9 Formulating Hypotheses Example 1: Ford motor company has worked to reduce road noise inside the cab of the redesigned F150 pickup truck. It would like to report in its advertising that the truck is quieter. The average of the prior design was 68 decibels at 60 mph. What is the appropriate hypothesis test? 9

10 Formulating Hypotheses Example 1: Ford motor company has worked to reduce road noise inside the cab of the redesigned F150 pickup truck. It would like to report in its advertising that the truck is quieter. The average of the prior design was 68 decibels at 60 mph. What is the appropriate test? H 0 : µ 68 (the truck is not quieter) status quo H A : µ < 68 (the truck is quieter) wants to support If the null hypothesis is rejected, Ford has sufficient evidence to support that the truck is now quieter. 10

11 Formulating Hypotheses Example 2: The average annual income of buyers of Ford F150 pickup trucks is claimed to be $65,000 per year. An industry analyst would like to test this claim. What is the appropriate hypothesis test? 11

12 Formulating Hypotheses Example 1: The average annual income of buyers of Ford F150 pickup trucks is claimed to be $65,000 per year. An industry analyst would like to test this claim. What is the appropriate test? H 0 : µ = 65,000 (income is as claimed) status quo H A : µ 65,000 (income is different than claimed) The analyst will believe the claim unless sufficient evidence is found to discredit it. 12

13 Hypothesis Testing Process Claim: the population mean age is 50. Null Hypothesis: H 0 : µ = 50 Population Now select a random sample: Sample Suppose the sample mean age is 20: x = 20 Is x = 20 likely if µ = 50? If not likely, REJECT Null Hypothesis 13

14 Reason for Rejecting H 0 Sampling Distribution of x 20 If it is unlikely that we would get a sample mean of this value... μ = 50 If H 0 is true... if in fact this were the population mean x... then we reject the null hypothesis that μ =

15 Errors in Making Decisions Type I Error Reject a true null hypothesis Considered a serious type of error The probability of Type I Error is Called level of significance of the test Set by researcher in advance 15

16 Errors in Making Decisions (continued) Type II Error Fail to reject a false null hypothesis The probability of Type II Error is β β is a calculated value, the formula is discussed later in the chapter 16

17 Outcomes and Probabilities Possible Hypothesis Test Outcomes Key: Outcome (Probability) Decision Do Not Reject H 0 Reject H 0 State of Nature H 0 True No error (1 - ) Type I Error ( ) H 0 False Type II Error ( β ) No Error ( 1 - β ) 17

18 Type I & II Error Relationship Type I and Type II errors cannot happen at the same time Type I error can only occur if H 0 is true Type II error can only occur if H 0 is false If Type I error probability ( ) Type II error probability ( β ), then 18

19 Factors Affecting Type II Error All else equal, β when the difference between hypothesized parameter and its true value β when β when σ β when n The formula used to compute the value of β is discussed later in the chapter 19

20 Level of Significance, Defines unlikely values of sample statistic if null hypothesis is true Defines rejection region of the sampling distribution Is designated by, (level of significance) Typical values are.01,.05, or.10 Is selected by the researcher at the beginning Provides the critical value(s) of the test 20

21 Hypothesis Tests for the Mean Hypothesis Tests for σ Known σ Unknown Assume first that the population standard deviation σ is known 21

22 Process of Hypothesis Testing 1. Specify population parameter of interest 2. Formulate the null and alternative hypotheses 3. Specify the desired significance level, α 4. Define the rejection region 5. Take a random sample and determine whether or not the sample result is in the rejection region 6. Reach a decision and draw a conclusion 22

23 Level of Significance and the Rejection Region Level of significance = Lower tail test Example: H 0 : μ 3 H A : μ < 3 Upper tail test Example: H 0 : μ 3 H A : μ > 3 Two tailed test Example: H 0 : μ = 3 H A : μ 3 /2 /2 -z α 0 0 z α -z α/2 0 z α/2 Reject H Do not Do not 0 Reject H 0 Reject H Do not 0 Reject H reject H 0 0 reject H 0 reject H 0 23

24 Critical Value for Lower Tail Test The cutoff value, -z α or x α, is called a critical value H 0 : μ 3 H A : μ < 3 Reject H 0 Do not reject H 0 -z α 0 x μ z σ n x α µ=3 24

25 Critical Value for Upper Tail Test z α The cutoff value, or, is called a critical value x α H 0 : μ 3 H A : μ > 3 Do not reject H 0 0 µ=3 z α x α Reject H 0 x μ z σ n 25

26 Critical Values for Two Tailed Tests There are two cutoff values (critical values): H 0 : μ = 3 H A : μ 3 ± z α/2 or /2 /2 x α/2 Lower x α/2 Upper Reject H 0 -z α/2 x α/2 Lower Do not reject H 0 Reject H 0 z α/2 0 µ=3 x α/2 Upper x /2 μ z /2 σ n 26

27 The Rejection Region Lower tail test Example: H 0 : μ 3 H A : μ < 3 Upper tail test Example: H 0 : μ 3 H A : μ > 3 Two tailed test Example: H 0 : μ = 3 H A : μ 3 /2 /2 -z α 0 0 z α -z α/2 0 z α/2 x α Reject H Do not Do not 0 Reject H 0 Reject H Do not 0 Reject H reject H 0 0 reject H 0 reject H 0 x α xα/2(l) xα/2(u) Reject H 0 if z < -z α Reject H 0 if z > z α Reject H 0 if z < -z α/2 or z > z α/2 i.e., if x < x α i.e., if x > x α i.e., if x < x α/2(l) or x > x α/2(u) 27

28 Two Equivalent Approaches to Hypothesis Testing z-units: For given, find the critical z value(s): -z α, z α,or ±z α/2 Convert the sample mean x to a z test statistic: Reject H 0 if z is in the rejection region, z x μ σ n otherwise do not reject H 0 x units: Given, calculate the critical value(s) x α, or x α/2(l) and x α/2(u) The sample mean is the test statistic. Reject H 0 if x is in the rejection region, otherwise do not reject H 0 28

29 Hypothesis Testing Example Test the claim that the true mean # of TV sets in US homes is at least 3. (Assume σ = 0.8) 1. Specify the population value of interest The mean number of TVs in US homes 2. Formulate the appropriate null and alternative hypotheses H 0 : μ 3 H A : μ < 3 (This is a lower tail test) 3. Specify the desired level of significance Suppose that =.05 is chosen for this test 29

30 Hypothesis Testing Example 4. Determine the rejection region (continued) =.05 Reject H 0 Do not reject H 0 -z α = This is a one-tailed test with =.05. Since σ is known, the cutoff value is a z value: Reject H 0 if z < z = ; otherwise do not reject H 0 30

31 Hypothesis Testing Example 5. Obtain sample evidence and compute the test statistic Suppose a sample is taken with the following results: n = 100, x = 2.84 ( = 0.8 is assumed known) Then the test statistic is: z x σ μ n

32 Hypothesis Testing Example 6. Reach a decision and interpret the result (continued) =.05 Reject H 0 Do not reject H Since z = -2.0 < , we reject the null hypothesis that the mean number of TVs in US homes is at least 3. There is sufficient evidence that the mean is less than 3. z 32

33 Hypothesis Testing Example An alternate way of constructing rejection region: =.05 (continued) Now expressed in x, not z units x Reject H 0 Do not reject H Since x = 2.84 < , we reject the null hypothesis x α μ z α σ n

34 p-value Approach to Testing Convert Sample Statistic ( x ) to Test Statistic (a z value, if σ is known) Determine the p-value from a table or computer Compare the p-value with If p-value <, reject H 0 If p-value, do not reject H 0 34

35 p-value Approach to Testing p-value: Probability of obtaining a test statistic more extreme ( or ) than the observed sample value given H 0 is true Also called observed level of significance Smallest value of for which H 0 can be rejected (continued) 35

36 p-value example Example: How likely is it to see a sample mean of 2.84 (or something further below the mean) if the true mean is = 3.0? P(x P z 2.84 μ 3.0) P(z 2.0).0228 =.05 p-value = x z 36

37 p-value example (continued) Compare the p-value with If p-value <, reject H 0 If p-value, do not reject H 0 Here: p-value =.0228 =.05 Since.0228 <.05, we reject the null hypothesis =.05 p-value =

38 Example: Upper Tail z Test for Mean ( Known) A phone industry manager thinks that customer monthly cell phone bill have increased, and now average over $52 per month. The company wishes to test this claim. (Assume = 10 is known) Form hypothesis test: H 0 : μ 52 H A : μ > 52 the average is not over $52 per month the average is greater than $52 per month (i.e., sufficient evidence exists to support the manager s claim) 38

39 Example: Find Rejection Region Suppose that =.10 is chosen for this test (continued) Find the rejection region: Reject H 0 =.10 Do not reject H 0 0 z α =1.28 Reject H 0 Reject H 0 if z >

40 Review: Finding Critical Value - One Tail What is z given = 0.10? z Critical Value = 1.28 =.10 Standard Normal Distribution Table (Portion) Z

41 Example: Test Statistic Obtain sample evidence and compute the test statistic Suppose a sample is taken with the following results: n = 64, x = 53.1 (=10 was assumed known) Then the test statistic is: (continued) z x σ μ n 64 41

42 Example: Decision (continued) Reach a decision and interpret the result: Reject H 0 =.10 Do not reject H 0 0 z = Reject H 0 Do not reject H 0 since z = i.e.: there is not sufficient evidence that the mean bill is over $52 42

43 p -Value Solution (continued) Calculate the p-value and compare to p-value =.1894 Reject H 0 =.10 0 Do not reject H z =.88 Reject H 0 P(x 53.1 μ 52.0) Pz P(z 0.88) Do not reject H 0 since p-value =.1894 > =.10 43

44 Critical Value Approach to Testing When σ is known, convert sample statistic ( x ) to a z test statistic Hypothesis Tests for Known The test statistic is: x μ z σ n Unknown 44

45 Critical Value Approach to Testing When σ is unknown, convert sample statistic ( x ) to a t test statistic Hypothesis Tests for Known (The population must be approximately normal) Unknown The test statistic is: tn 1 x μ s n 45

46 Hypothesis Tests for μ, σ Unknown 1. Specify the population value of interest 2. Formulate the appropriate null and alternative hypotheses 3. Specify the desired level of significance 4. Determine the rejection region (critical values are from the t-distribution with n-1 d.f.) 5. Obtain sample evidence and compute the t test statistic 6. Reach a decision and interpret the result 46

47 Example: Two-Tail Test ( Unknown) The average cost of a hotel room in New York is said to be $168 per night. A random sample of 25 hotels resulted in x = $ and s = $ Test at the = 0.05 level. (Assume the population distribution is normal) H 0 : μ = 168 H A : μ

48 Example Solution: Two-Tail Test H 0 : μ = 168 H A : μ 168 /2=.025 /2=.025 = 0.05 n = 25 Critical Values: t 24 = ± is unknown, so use a t statistic tn 1 Reject H 0 Reject H 0 t α/2 Do not reject H 0 -t α/ x μ s n Do not reject H 0 : not sufficient evidence that true mean cost is different than $168 48

49 Type II Error Type II error is the probability of failing to reject a false H 0 Suppose we fail to reject H 0 : μ 52 when in fact the true mean is μ = Reject H 0 : μ Do not reject H 0 : μ 52 49

50 Type II Error (continued) Suppose we do not reject H 0 : 52 when in fact the true mean is = 50 This is the true distribution of x if = 50 This is the range of x where H 0 is not rejected 50 Reject H 0 : Do not reject H 0 : 52 50

51 Type II Error Suppose we do not reject H 0 : μ 52 when in fact the true mean is μ = 50 (continued) Here, β = P( x cutoff ) if μ = 50 β 50 Reject H 0 : μ Do not reject H 0 : μ 52 51

52 Calculating β Suppose n = 64, σ = 6, and =.05 cutoff (for H 0 : μ 52) x μ z σ n So β = P( x ) if μ = Reject H 0 : μ 52 Do not reject H 0 : μ 52 52

53 Calculating β Suppose n = 64, σ = 6, and =.05 (continued) P( x μ 50) P z P(z 1.02) Probability of type II error: β = Reject H 0 : μ Do not reject H 0 : μ 52 53

54 Hypothesis Tests in Minitab 54

55 Hypothesis Tests in Minitab 55

56 Sample Minitab Output 56

57 Hypothesis Tests Summary Addressed hypothesis testing methodology Performed z Test for the mean (σ known) Discussed p value approach to hypothesis testing Performed one-tail and two-tail tests... 57

58 Hypothesis Tests Summary (continued) Performed t test for the mean (σ unknown) Performed z test for the proportion Discussed Type II error and computed its probability 58

59 Multiple Regression Assumptions Errors (residuals) from the regression model: e = (y y) < The model errors are independent and random The errors are normally distributed The mean of the errors is zero Errors have a constant variance 59

60 Model Specification Decide what you want to do and select the dependent variable Determine the potential independent variables for your model Gather sample data (observations) for all variables 60

61 The Correlation Matrix Correlation between the dependent variable and selected independent variables can be found using Excel: Formula Tab: Data Analysis / Correlation Can check for statistical significance of correlation with a t test 61

62 Example A distributor of frozen desert pies wants to evaluate factors thought to influence demand Dependent variable: Pie sales (units per week) Independent variables: Price (in $) Data are collected for 15 weeks Advertising ($100 s) 62

63 Pie Sales Model Week Pie Sales Price ($) Advertising ($100s) Multiple regression model: Sales = b 0 + b 1 (Price) + b 2 (Advertising) Correlation matrix: Pie Sales Price Advertising Pie Sales 1 Price Advertising

64 Interpretation of Estimated Coefficients Slope (b i ) Estimates that the average value of y changes by b i units for each 1 unit increase in X i holding all other variables constant Example: if b 1 = -20, then sales (y) is expected to decrease by an estimated 20 pies per week for each $1 increase in selling price (x 1 ), net of the effects of changes due to advertising (x 2 ) y-intercept (b 0 ) The estimated average value of y when all x i = 0 (assuming all x i = 0 is within the range of observed values) 64

65 Pie Sales Correlation Matrix Pie Sales 1 Pie Sales Price Advertising Price Advertising Price vs. Sales : r = There is a negative association between price and sales Advertising vs. Sales : r = There is a positive association between advertising and sales 65

66 Scatter Diagrams Sales 600 Sales vs. Price Price Sales Sales vs. Advertising Advertising 66

67 Estimating a Multiple Linear Regression Equation Computer software is generally used to generate the coefficients and measures of goodness of fit for multiple regression Excel: Data / Data Analysis / Regression Minitab: Stat / Regression / Regression 67

68 Estimating a Multiple Linear Regression Equation Excel: 68

69 Minitab: Estimating a Multiple Linear Regression Equation 69

70 Multiple Regression Output Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 15 Sales (Price) (Advertising) ANOVA df SS MS F Significance F Regression Residual Total Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept Price Advertising

71 The Multiple Regression Equation Sales (Price) (Advertising) where Sales is in number of pies per week Price is in $ Advertising is in $100 s. b 1 = : sales will decrease, on average, by pies per week for each $1 increase in selling price, net of the effects of changes due to advertising b 2 = : sales will increase, on average, by pies per week for each $100 increase in advertising, net of the effects of changes due to price 71

72 Using The Model to Make Predictions Predict sales for a week in which the selling price is $5.50 and advertising is $350: Sales (Price) (Advertising) (5.50) (3.5) Predicted sales is pies Note that Advertising is in $100 s, so $350 means that x 2 =

73 Predictions in Minitab 73

74 Predictions in Minitab (continued) Predicted y value < Confidence interval for the mean y value, given these x s < Prediction interval for an individual y value, given these x s < 74

75 Multiple Coefficient of Determination (R 2 ) Reports the proportion of total variation in y explained by all x variables taken together R 2 SSR SST Sum of squaresregression Total sumof squares 75

76 Multiple Coefficient of Determination (continued) Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 15 R 2 SSR SST % of the variation in pie sales is explained by the variation in price and advertising ANOVA df SS MS F Significance F Regression Residual Total Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept Price Advertising

77 Adjusted R 2 R 2 never decreases when a new x variable is added to the model This can be a disadvantage when comparing models What is the net effect of adding a new variable? We lose a degree of freedom when a new x variable is added Did the new x variable add enough explanatory power to offset the loss of one degree of freedom? 77

78 Adjusted R 2 Shows the proportion of variation in y explained by all x variables adjusted for the number of x variables used R 2 A 1 (1 R ) n (where n = sample size, k = number of independent variables) Penalize excessive use of unimportant independent variables Smaller than R 2 Useful in comparing among models 2 n 1 k 1 (continued) 78

79 Multiple Coefficient of Determination (continued) Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 15 R 2 A % of the variation in pie sales is explained by the variation in price and advertising, taking into account the sample size and number of independent variables ANOVA df SS MS F Significance F Regression Residual Total Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept Price Advertising

80 Is the Model Significant? F-Test for Overall Significance of the Model Shows if there is a linear relationship between all of the x variables considered together and y Use F test statistic Hypotheses: H 0 : β 1 = β 2 = = β k = 0 (no linear relationship) H A : at least one β i 0 (at least one independent variable affects y) 80

81 F-Test for Overall Significance Test statistic: (continued) F SSR k SSE n k 1 MSR MSE where F has (numerator) D 1 = k and (denominator) D 2 = (n k 1) degrees of freedom 81

82 F-Test for Overall Significance (continued) Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 15 F MSR MSE With 2 and 12 degrees of freedom P-value for the F-Test ANOVA df SS MS F Significance F Regression Residual Total Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept Price Advertising

83 F-Test for Overall Significance 0 H 0 : β 1 = β 2 = 0 H A : β 1 and β 2 not both zero =.05 df 1 = 2 df 2 = 12 Do not reject H 0 Critical Value: F = =.05 Reject H 0 F.05 = F Test Statistic: MSR F MSE Decision: Reject H 0 at = 0.05 Conclusion: (continued) The regression model does explain a significant portion of the variation in pie sales (There is evidence that at least one independent variable affects y ) 83

84 Are Individual Variables Significant? Use t-tests of individual variable slopes Shows if there is a linear relationship between the variable x i and y Hypotheses: H 0 : β i = 0 (no linear relationship) H A : β i 0 (linear relationship does exist between x i and y) 84

85 Are Individual Variables Significant? H 0 : β i = 0 (no linear relationship) H A : β i 0 (linear relationship does exist between x i and y ) Test Statistic: (continued) t b i 0 (df = n k 1) s b i 85

86 Are Individual Variables Significant? (continued) Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 15 t-value for Price is t = , with p-value.0398 t-value for Advertising is t = 2.855, with p-value.0145 ANOVA df SS MS F Significance F Regression Residual Total Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept Price Advertising

87 Inferences about the Slope: t Test Example H 0 : β i = 0 H A : β i 0 d.f. = = 12 =.05 t /2 = /2=.025 Reject H 0 Do not reject H -t 0 α/2 t α/ From Excel output: Coefficients Standard Error t Stat P-value Price Advertising The test statistic for each variable falls in the rejection region (p-values <.05) /2=.025 Reject H 0 Decision: Reject H 0 for each variable Conclusion: There is evidence that both Price and Advertising affect pie sales at =.05 87

88 Confidence Interval Estimate for the Slope Confidence interval for the population slope β 1 (the effect of changes in price on pie sales): b i t / 2 s b i where t has (n k 1) d.f. Coefficients Standard Error Lower 95% Upper 95% Intercept Price Advertising Example: Weekly sales are estimated to be reduced by between 1.37 to pies for each increase of $1 in the selling price 88

89 Standard Deviation of the Regression Model The estimate of the standard deviation of the regression model is: s SSE n k 1 MSE Is this value large or small? Must compare to the mean size of y for comparison 89

90 Standard Deviation of the Regression Model (continued) Regression Statistics Multiple R R Square Adjusted R Square The standard deviation of the regression model is Standard Error Observations 15 ANOVA df SS MS F Significance F Regression Residual Total Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept Price Advertising

91 Standard Deviation of the Regression Model (continued) The standard deviation of the regression model is A rough prediction range for pie sales in a given week is 2(47.46) 94.2 Pie sales in the sample were in the 300 to 500 per week range, so this range is probably too large to be acceptable. The analyst may want to look for additional variables that can explain more of the variation in weekly sales 91

92 Multicollinearity Multicollinearity: High correlation exists between two independent variables This means the two variables contribute redundant information to the multiple regression model 92

93 Multicollinearity Including two highly correlated independent variables can adversely affect the regression results No new information provided Can lead to unstable coefficients (large standard error and low t-values) Coefficient signs may not match prior expectations (continued) 93

94 Some Indications of Severe Multicollinearity Incorrect signs on the coefficients Large change in the value of a previous coefficient when a new variable is added to the model A previously significant variable becomes insignificant when a new independent variable is added The estimate of the standard deviation of the model increases when a variable is added to the model 94

95 Detect Collinearity (Variance Inflationary Factor) VIF j is used to measure collinearity: VIF j 1 1 R 2 j R 2 j is the coefficient of determination when the j th independent variable is regressed against the remaining k 1 independent variables If VIF j 5, x j is highly correlated with the other explanatory variables 95

96 Detect Collinearity Regression Analysis Price and all other X Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 15 VIF Output for the pie sales example: Since there are only two explanatory variables, only one VIF is reported VIF is < 5 There is no evidence of collinearity between Price and Advertising 96

97 Qualitative (Dummy) Variables Categorical explanatory variable (dummy variable) with two or more levels: yes or no, on or off, male or female coded as 0 or 1 Regression intercepts are different if the variable is significant Assumes equal slopes for other variables The number of dummy variables needed is (number of levels 1) 97

98 Dummy-Variable Model Example (with 2 Levels) Let: ŷ b 2 0 b x b x 1 2 y = pie sales 1 x 1 = price x 2 = holiday (X 2 = 1 if a holiday occurred during the week) (X 2 = 0 if there was no holiday that week) 98

99 Dummy-Variable Model Example (with 2 Levels) (continued) ŷ b 0 b x 1 1 b 2 (1) (b 0 b 2 ) b x 1 1 Holiday ŷ b 0 b x 1 1 b 2 (0) b 0 b x 1 1 No Holiday y (sales) b 0 + b 2 b 0 Different intercept Same slope If H 0 : β 2 = 0 is rejected, then Holiday has a significant effect on pie sales x 1 (Price) 99

100 Interpreting the Dummy Variable Coefficient (with 2 Levels) Example: Sales (Price) 15(Holiday) Sales: number of pies sold per week Price: pie price in $ 1 If a holiday occurred during the week 0 If no holiday occurred b 2 = 15: on average, sales were 15 pies greater in weeks with a holiday than in weeks without a holiday, given the same price 100

101 Dummy-Variable Models (more than 2 Levels) The number of dummy variables is one less than the number of levels Example: y = house price ; x 1 = square feet The style of the house is also thought to matter: Style = ranch, split level, condo Three levels, so two dummy variables are needed 101

102 Dummy-Variable Models (more than 2 Levels) Let the default category be condo (continued) 1 if ranch x2 x3 0 if not 1 0 if splitlevel if not ŷ b 3 0 b x1 b x2 b x b 2 shows the impact on price if the house is a ranch style, compared to a condo b 3 shows the impact on price if the house is a split level style, compared to a condo 102

103 Interpreting the Dummy Variable Coefficients (with 3 Levels) Suppose the estimated equation is ŷ x x x3 For a condo: x 2 = x 3 = 0 ŷ ŷ ŷ x 1 For a ranch: x 3 = x For a split level: x 2 = x With the same square feet, a ranch will have an estimated average price of thousand dollars more than a condo With the same square feet, a ranch will have an estimated average price of thousand dollars more than a condo. 103

104 Interaction Effects Hypothesizes interaction between pairs of x variables Response to one x variable varies at different levels of another x variable Contains two-way cross product terms y β 2 0 β1x1 β2x2 β3x3 β4x1x2 β5x1x2 Basic Terms Interactive Terms 104

105 Effect of Interaction Given: y β β1x1 β2x2 β3x1x2 0 ε Without interaction term, effect of x 1 on y is measured by β 1 With interaction term, effect of x 1 on y is measured by β 1 + β 3 x 2 Effect changes as x 2 increases 105

106 Interaction Example y y = 1 + 2x 1 + 3x 2 + 4x 1 x where x 2 = 0 or 1 (dummy variable) x 2 = 1 y = 1 + 2x 1 + 3(1) + 4x 1 (1) = 4 + 6x 1 x 2 = 0 y = 1 + 2x 1 + 3(0) + 4x 1 (0) = 1 + 2x 1 Effect (slope) of x 1 on y does depend on x 2 value x 1 106

107 Interaction Regression Model Worksheet Case, i y i x 1i x 2i x 1i x 2i : : : : : multiply x 1 by x 2 to get x 1 x 2, then run regression with y, x 1, x 2, x 1 x 2 107

108 Evaluating Presence of Interaction Hypothesize interaction between pairs of independent variables y β β1x1 β2x2 β3x1x2 0 ε Hypotheses: H 0 : β 3 = 0 (no interaction between x 1 and x 2 ) H A : β 3 0 (x 1 interacts with x 2 ) 108

109 Model Building Goal is to develop a model with the best set of independent variables Easier to interpret if unimportant variables are removed Lower probability of collinearity Stepwise regression procedure Provide evaluation of alternative models as variables are added Best-subset approach Try all combinations and select the best using the highest adjusted R 2 and lowest s ε 109

110 Stepwise Regression Idea: develop the least squares regression equation in steps, either through forward selection, backward elimination, or through standard stepwise regression The coefficient of partial determination is the measure of the marginal contribution of each independent variable, given that other independent variables are in the model 110

111 Best Subsets Regression Idea: estimate all possible regression equations using all possible combinations of independent variables Choose the best fit by looking for the highest adjusted R 2 and lowest standard error s ε Stepwise regression and best subsets regression can be performed using Minitab, or other statistical software packages 111

Chapter 14 Student Lecture Notes 14-1

Chapter 14 Student Lecture Notes 14-1 Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this

More information

Chapter 7 Student Lecture Notes 7-1

Chapter 7 Student Lecture Notes 7-1 Chapter 7 Student Lecture Notes 7- Chapter Goals QM353: Business Statistics Chapter 7 Multiple Regression Analysis and Model Building After completing this chapter, you should be able to: Explain model

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 4 4- Basic Business Statistics th Edition Chapter 4 Introduction to Multiple Regression Basic Business Statistics, e 9 Prentice-Hall, Inc. Chap 4- Learning Objectives In this chapter, you learn:

More information

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing

More information

The Multiple Regression Model

The Multiple Regression Model Multiple Regression The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & or more independent variables (X i ) Multiple Regression Model with k Independent Variables:

More information

What is a Hypothesis?

What is a Hypothesis? What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population mean Example: The mean monthly cell phone bill in this city is μ = $42 population proportion Example:

More information

Econ 325: Introduction to Empirical Economics

Econ 325: Introduction to Empirical Economics Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population

More information

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

Chapter 4: Regression Models

Chapter 4: Regression Models Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,

More information

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Regression Models. Chapter 4. Introduction. Introduction. Introduction Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

Chapter 13. Multiple Regression and Model Building

Chapter 13. Multiple Regression and Model Building Chapter 13 Multiple Regression and Model Building Multiple Regression Models The General Multiple Regression Model y x x x 0 1 1 2 2... k k y is the dependent variable x, x,..., x 1 2 k the model are the

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests 1999 Prentice-Hall, Inc. Chap. 8-1 Chapter Topics Hypothesis Testing Methodology Z Test

More information

Bayesian Analysis LEARNING OBJECTIVES. Calculating Revised Probabilities. Calculating Revised Probabilities. Calculating Revised Probabilities

Bayesian Analysis LEARNING OBJECTIVES. Calculating Revised Probabilities. Calculating Revised Probabilities. Calculating Revised Probabilities Valua%on and pricing (November 5, 2013) LEARNING OBJECTIVES Lecture 7 Decision making (part 3) Regression theory Olivier J. de Jong, LL.M., MM., MBA, CFD, CFFA, AA www.olivierdejong.com 1. List the steps

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Chapter 16. Simple Linear Regression and dcorrelation

Chapter 16. Simple Linear Regression and dcorrelation Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Multiple Regression. Peerapat Wongchaiwat, Ph.D.

Multiple Regression. Peerapat Wongchaiwat, Ph.D. Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com The Multiple Regression Model Examine the linear relationship between 1 dependent (Y) & 2 or more independent variables (X i ) Multiple Regression Model

More information

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

Analisi Statistica per le Imprese

Analisi Statistica per le Imprese , Analisi Statistica per le Imprese Dip. di Economia Politica e Statistica 4.3. 1 / 33 You should be able to:, Underst model building using multiple regression analysis Apply multiple regression analysis

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore What is Multiple Linear Regression Several independent variables may influence the change in response variable we are trying to study. When several independent variables are included in the equation, the

More information

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression Chapter 12 12-1 North Seattle Community College BUS21 Business Statistics Chapter 12 Learning Objectives In this chapter, you learn:! How to use regression analysis to predict the value of a dependent

More information

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics Chapter 13 Student Lecture Notes 13-1 Department of Quantitative Methods & Information Sstems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analsis QMIS 0 Dr. Mohammad

More information

STA121: Applied Regression Analysis

STA121: Applied Regression Analysis STA121: Applied Regression Analysis Linear Regression Analysis - Chapters 3 and 4 in Dielman Artin Department of Statistical Science September 15, 2009 Outline 1 Simple Linear Regression Analysis 2 Using

More information

Multiple Regression Methods

Multiple Regression Methods Chapter 1: Multiple Regression Methods Hildebrand, Ott and Gray Basic Statistical Ideas for Managers Second Edition 1 Learning Objectives for Ch. 1 The Multiple Linear Regression Model How to interpret

More information

STAT 212 Business Statistics II 1

STAT 212 Business Statistics II 1 STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb

More information

Regression Analysis. BUS 735: Business Decision Making and Research

Regression Analysis. BUS 735: Business Decision Making and Research Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables. Regression Analysis BUS 735: Business Decision Making and Research 1 Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn how to estimate

More information

Hypothesis testing Goodness of fit Multicollinearity Prediction. Applied Statistics. Lecturer: Serena Arima

Hypothesis testing Goodness of fit Multicollinearity Prediction. Applied Statistics. Lecturer: Serena Arima Applied Statistics Lecturer: Serena Arima Hypothesis testing for the linear model Under the Gauss-Markov assumptions and the normality of the error terms, we saw that β N(β, σ 2 (X X ) 1 ) and hence s

More information

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X. Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.

More information

Ch14. Multiple Regression Analysis

Ch14. Multiple Regression Analysis Ch14. Multiple Regression Analysis 1 Goals : multiple regression analysis Model Building and Estimating More than 1 independent variables Quantitative( 量 ) independent variables Qualitative( ) independent

More information

Chapter 14 Simple Linear Regression (A)

Chapter 14 Simple Linear Regression (A) Chapter 14 Simple Linear Regression (A) 1. Characteristics Managerial decisions often are based on the relationship between two or more variables. can be used to develop an equation showing how the variables

More information

MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics

MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES Business Statistics CONTENTS Multiple regression Dummy regressors Assumptions of regression analysis Predicting with regression analysis Old exam question

More information

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December

More information

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006 Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Lecture 10 Multiple Linear Regression

Lecture 10 Multiple Linear Regression Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable

More information

Inference for the Regression Coefficient

Inference for the Regression Coefficient Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates

More information

LI EAR REGRESSIO A D CORRELATIO

LI EAR REGRESSIO A D CORRELATIO CHAPTER 6 LI EAR REGRESSIO A D CORRELATIO Page Contents 6.1 Introduction 10 6. Curve Fitting 10 6.3 Fitting a Simple Linear Regression Line 103 6.4 Linear Correlation Analysis 107 6.5 Spearman s Rank Correlation

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Inference about the Regression Model and Using the Regression Line Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about

More information

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal yuppal@ysu.edu Sampling Distribution of b 1 Expected value of b 1 : Variance of b 1 : E(b 1 ) = 1 Var(b 1 ) = σ 2 /SS x Estimate of

More information

Inference for Regression Simple Linear Regression

Inference for Regression Simple Linear Regression Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

Simple Linear Regression: One Qualitative IV

Simple Linear Regression: One Qualitative IV Simple Linear Regression: One Qualitative IV 1. Purpose As noted before regression is used both to explain and predict variation in DVs, and adding to the equation categorical variables extends regression

More information

Regression Analysis II

Regression Analysis II Regression Analysis II Measures of Goodness of fit Two measures of Goodness of fit Measure of the absolute fit of the sample points to the sample regression line Standard error of the estimate An index

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

MBA Statistics COURSE #4

MBA Statistics COURSE #4 MBA Statistics 51-651-00 COURSE #4 Simple and multiple linear regression What should be the sales of ice cream? Example: Before beginning building a movie theater, one must estimate the daily number of

More information

Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES

Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES M04_REND6289_10_IM_C04.QXD 5/7/08 2:49 PM Page 46 4 C H A P T E R Regression Models TEACHING SUGGESTIONS Teaching Suggestion 4.1: Which Is the Independent Variable? We find that students are often confused

More information

SIMPLE REGRESSION ANALYSIS. Business Statistics

SIMPLE REGRESSION ANALYSIS. Business Statistics SIMPLE REGRESSION ANALYSIS Business Statistics CONTENTS Ordinary least squares (recap for some) Statistical formulation of the regression model Assessing the regression model Testing the regression coefficients

More information

Econ 3790: Statistics Business and Economics. Instructor: Yogesh Uppal

Econ 3790: Statistics Business and Economics. Instructor: Yogesh Uppal Econ 3790: Statistics Business and Economics Instructor: Yogesh Uppal Email: yuppal@ysu.edu Chapter 14 Covariance and Simple Correlation Coefficient Simple Linear Regression Covariance Covariance between

More information

Statistics and Quantitative Analysis U4320

Statistics and Quantitative Analysis U4320 Statistics and Quantitative Analysis U3 Lecture 13: Explaining Variation Prof. Sharyn O Halloran Explaining Variation: Adjusted R (cont) Definition of Adjusted R So we'd like a measure like R, but one

More information

Regression Analysis IV... More MLR and Model Building

Regression Analysis IV... More MLR and Model Building Regression Analysis IV... More MLR and Model Building This session finishes up presenting the formal methods of inference based on the MLR model and then begins discussion of "model building" (use of regression

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

Multiple regression: Model building. Topics. Correlation Matrix. CQMS 202 Business Statistics II Prepared by Moez Hababou

Multiple regression: Model building. Topics. Correlation Matrix. CQMS 202 Business Statistics II Prepared by Moez Hababou Multiple regression: Model building CQMS 202 Business Statistics II Prepared by Moez Hababou Topics Forward versus backward model building approach Using the correlation matrix Testing for multicolinearity

More information

LECTURE 5. Introduction to Econometrics. Hypothesis testing

LECTURE 5. Introduction to Econometrics. Hypothesis testing LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will

More information

Class time (Please Circle): 11:10am-12:25pm. or 12:45pm-2:00pm

Class time (Please Circle): 11:10am-12:25pm. or 12:45pm-2:00pm Name: UIN: Class time (Please Circle): 11:10am-12:25pm. or 12:45pm-2:00pm Instructions: 1. Please provide your name and UIN. 2. Circle the correct class time. 3. To get full credit on answers to this exam,

More information

Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues

Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues Overfitting Categorical Variables Interaction Terms Non-linear Terms Linear Logarithmic y = a +

More information

Multiple linear regression

Multiple linear regression Multiple linear regression Course MF 930: Introduction to statistics June 0 Tron Anders Moger Department of biostatistics, IMB University of Oslo Aims for this lecture: Continue where we left off. Repeat

More information

A discussion on multiple regression models

A discussion on multiple regression models A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators

More information

STAT 350 Final (new Material) Review Problems Key Spring 2016

STAT 350 Final (new Material) Review Problems Key Spring 2016 1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,

More information

Lectures 5 & 6: Hypothesis Testing

Lectures 5 & 6: Hypothesis Testing Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across

More information

Single and multiple linear regression analysis

Single and multiple linear regression analysis Single and multiple linear regression analysis Marike Cockeran 2017 Introduction Outline of the session Simple linear regression analysis SPSS example of simple linear regression analysis Additional topics

More information

12.12 MODEL BUILDING, AND THE EFFECTS OF MULTICOLLINEARITY (OPTIONAL)

12.12 MODEL BUILDING, AND THE EFFECTS OF MULTICOLLINEARITY (OPTIONAL) 12.12 Model Building, and the Effects of Multicollinearity (Optional) 1 Although Excel and MegaStat are emphasized in Business Statistics in Practice, Second Canadian Edition, some examples in the additional

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,

More information

Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018

Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018 Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018 Chapter One: Data and Statistics Statistics A collection of procedures and principles

More information

BNAD 276 Lecture 10 Simple Linear Regression Model

BNAD 276 Lecture 10 Simple Linear Regression Model 1 / 27 BNAD 276 Lecture 10 Simple Linear Regression Model Phuong Ho May 30, 2017 2 / 27 Outline 1 Introduction 2 3 / 27 Outline 1 Introduction 2 4 / 27 Simple Linear Regression Model Managerial decisions

More information

Finding Relationships Among Variables

Finding Relationships Among Variables Finding Relationships Among Variables BUS 230: Business and Economic Research and Communication 1 Goals Specific goals: Re-familiarize ourselves with basic statistics ideas: sampling distributions, hypothesis

More information

Multiple Linear Regression. Chapter 12

Multiple Linear Regression. Chapter 12 13 Multiple Linear Regression Chapter 12 Multiple Regression Analysis Definition The multiple regression model equation is Y = b 0 + b 1 x 1 + b 2 x 2 +... + b p x p + ε where E(ε) = 0 and Var(ε) = s 2.

More information

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define

More information

Section 3: Simple Linear Regression

Section 3: Simple Linear Regression Section 3: Simple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

Data Analysis 1 LINEAR REGRESSION. Chapter 03

Data Analysis 1 LINEAR REGRESSION. Chapter 03 Data Analysis 1 LINEAR REGRESSION Chapter 03 Data Analysis 2 Outline The Linear Regression Model Least Squares Fit Measures of Fit Inference in Regression Other Considerations in Regression Model Qualitative

More information

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval

More information

Econometrics. 4) Statistical inference

Econometrics. 4) Statistical inference 30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution

More information

Inference with Simple Regression

Inference with Simple Regression 1 Introduction Inference with Simple Regression Alan B. Gelder 06E:071, The University of Iowa 1 Moving to infinite means: In this course we have seen one-mean problems, twomean problems, and problems

More information

x3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators

x3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators Multiple Regression Relating a response (dependent, input) y to a set of explanatory (independent, output, predictor) variables x, x 2, x 3,, x q. A technique for modeling the relationship between variables.

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

Unit 27 One-Way Analysis of Variance

Unit 27 One-Way Analysis of Variance Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied

More information

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company Multiple Regression Inference for Multiple Regression and A Case Study IPS Chapters 11.1 and 11.2 2009 W.H. Freeman and Company Objectives (IPS Chapters 11.1 and 11.2) Multiple regression Data for multiple

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression ST 430/514 Recall: A regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates)

More information

CHAPTER EIGHT Linear Regression

CHAPTER EIGHT Linear Regression 7 CHAPTER EIGHT Linear Regression 8. Scatter Diagram Example 8. A chemical engineer is investigating the effect of process operating temperature ( x ) on product yield ( y ). The study results in the following

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3 Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details Section 10.1, 2, 3 Basic components of regression setup Target of inference: linear dependency

More information

Lecture 6: Linear Regression

Lecture 6: Linear Regression Lecture 6: Linear Regression Reading: Sections 3.1-3 STATS 202: Data mining and analysis Jonathan Taylor, 10/5 Slide credits: Sergio Bacallado 1 / 30 Simple linear regression Model: y i = β 0 + β 1 x i

More information

The Hong Kong University of Science & Technology ISOM551 Introductory Statistics for Business Assignment 4 Suggested Solution

The Hong Kong University of Science & Technology ISOM551 Introductory Statistics for Business Assignment 4 Suggested Solution 8 TUNG, Yik-Man The Hong Kong University of cience & Technology IOM55 Introductory tatistics for Business Assignment 4 uggested olution Note All values of statistics are obtained by Excel Qa Theoretically,

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X 1.04) =.8508. For z < 0 subtract the value from

More information

Six Sigma Black Belt Study Guides

Six Sigma Black Belt Study Guides Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships

More information

Review of Statistics

Review of Statistics Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and

More information

Chapter 14 Multiple Regression Analysis

Chapter 14 Multiple Regression Analysis Chapter 14 Multiple Regression Analysis 1. a. Multiple regression equation b. the Y-intercept c. $374,748 found by Y ˆ = 64,1 +.394(796,) + 9.6(694) 11,6(6.) (LO 1) 2. a. Multiple regression equation b.

More information

ECO220Y Simple Regression: Testing the Slope

ECO220Y Simple Regression: Testing the Slope ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

STAT Chapter 8: Hypothesis Tests

STAT Chapter 8: Hypothesis Tests STAT 515 -- Chapter 8: Hypothesis Tests CIs are possibly the most useful forms of inference because they give a range of reasonable values for a parameter. But sometimes we want to know whether one particular

More information