Business Statistics (BK/IBA) Tutorial 4 Full solutions

Size: px
Start display at page:

Download "Business Statistics (BK/IBA) Tutorial 4 Full solutions"

Transcription

1 Business Statistics (BK/IBA) Tutorial 4 Full solutions Instruction In a tutorial session of 2 hours, we will obviously not be able to discuss all questions. Therefore, the following procedure applies: we expect students to prepare all exercises in advance; we will discuss only a selection of exercises; exercises that were not discussed during class are nevertheless part of the course; students can indicate their wish list of exercises to be discussed during the session; teachers may invite students to answer questions, orally or on the blackboard. We further understand that your time is limited, and in particular that your time between lecture and tutorial may be limited. In case you have no time to prepare everything, we kindly advise you to give priority to the exercises that are indicated with the are not relevant! 7A+B Several μs: comparison + Several μs and medians: more issues icon. This does not mean that the other questions Q1 (based on Doane & Seward, 4/E, 11.7 and 11.13) Semester GPAs are compared for seven randomly chosen students in each class level at Oxnard University. a. Does the data show a significant difference in mean GPAs? b. Which pairs of mean GPAs differ significantly (4 majors)? Excel output for ANOVA Anova: Single Factor SUMMARY Groups Count Sum Average Variance Accounting 7 19,84 2, , Finance 7 21,17 3, , Human Resources 7 22,69 3, , Marketing 7 23,6 3, , ANOVA Source of Variation SS df MS F P-value F crit Between Groups 1, , , , , Within Groups 2, , Total 3, SPSS output for post-hoc BS 1 Tutorial 4

2 Q1 a. Yes; b. the mean score for Marketing differs significantly from the mean score for Accounting. Sol a. Please use 5 steps procedure and use α = step 0 Model: Y ij = μ + α i + ε ij with ε~n(0, σ 2 ) (i) The hypotheses to be tested are: H 0 : α 1 = α 2 = α 3 = α 4 = 0 versus H 1 : not H 0 (α = 0.05) (ii).sample Statistic: F = MSA ; reject for large values MSW (iii) Distribution test statistic under H 0 : F~F 3,24 Requirement: all four populations normal; all four variances equal (which we will assume). (iv) Calculated test statistic: F calc = Reported p-value: ; p-value of this statistical problem: Critical value: F crit;upper = F 3,24;0.05 = 3.01 (from table) or (from Excel) (v) We reject the null hypothesis since the test statistic exceeds the critical value. This is a fairly close decision. The p-value of is less than The difference between the GPA means most likely did not occur by chance. From the dot plot (below), we see the GPA for Accounting below the overall mean and Human Resources and Marketing above the overall mean. BS 2 Tutorial 4

3 Extra Q2 b. At α = 0.05, the mean score for Marketing differs significantly from the mean score for Accounting (in fact: it is higher). All other pairs do not differ significantly. In a five-step procedure at the exam, you should use the critical value approach or the p-value approach. It is not needed to use both. In our solutions, we sometimes write down both approaches, to help you check your solution. The retailing manager of a supermarket chain wants to determine whether product location has any effect on the sale of pet toys. Three different aisle locations are considered: front, middle, and rear. A random sample of 18 stores is selected, with 6 stores randomly assigned to each aisle location. The size of the display area and price of the product are constant for all stores. At the end of a one-month trial period, the sales volumes (in thousands of dollars) of the product in each store were as follows (and are stored in the file Locate). Aisle Location Front Middle Rear 8,6 3,2 4,6 7,2 2,4 6,0 5,4 2,0 4,0 6,2 1,4 2,8 5,0 1,8 2,2 4,0 1,6 2,8 a. At the 0.05 level of significance, is there evidence of a significant difference in mean sales among the various aisle locations? b. If appropriate, which aisle locations appear to differ significantly in mean sales? c. At the 0.05 level of significance, is there evidence of a significant difference in the variation in sales among the various aisle locations? d. What should the retailing manager conclude? Fully describe the retailing manager s options with respect to aisle locations. Test of Homogeneity of Variances SALES Levene Statistic df1 df2 Sig. 2, ,142 BS 3 Tutorial 4

4 ANOVA SALES Between Groups Within Groups Total Sum of Mean Squares df Square F Sig. 48, ,222 14,105,000 25, ,717 74, Dependent Variable: sales Multiple Comparisons Tukey HSD (I) locate front middle rear (J) locate middle rear front rear front middle *. The mean difference is significant at the.05 level. Mean Difference (I-J) Std. Error Sig. 4,0000*,7566,0003 2,3333*,7566,0195-4,0000*,7566,0003-1,6667,7566,1031-2,3333*,7566,0195 1,6667,7566,1031 sales Subset for alpha =.05 locate N 1 2 Tukey HSD a middle 6 2,067 rear 6 3,733 front 6 6,067 Sig.,103 1,000 Means for groups in homogeneous subsets are displayed. a. Uses Harmonic Mean Sample Size = 6,000. Sol Q2 a. Yes; b. μfront differs significantly from μmiddle and μrear; c. No; d. The front aisle is best. a. Please use 5 steps procedure. Model: Y ij = μ + α i + ε ij with ε ij ~N(0, σ 2 ) To test at the 0.05 level of significance whether the average sales volumes in thousands of dollars are different across the three store aisle locations, we conduct an F test (ANOVA). (i) H 0 : α front = α middle = α rear = 0 versus H 1 : not H 0 (α = 0.05) (ii) Sample statistic: F = MSA ; reject for large values MSW (iii) Distribution (standardized) test statistic under H 0 : F~F 2,15 Required: all three populations normal; all three variances equal. To test normality: no computer output available; to test equal variances: see output Test of Homogeneity of Variance, which suggests the variances are equal (p-value of 0.142). (iv) Calculated test statistic: F calc = Critical value: F upper = F 2,15 = 3.68 Reported p-value: 0.000, p-value of this statistical problem: BS 4 Tutorial 4

5 Extra (v) Decision: Since F calc = is above the critical bound of F crit = 3.68, reject H 0. There is enough evidence to conclude that the average sales volumes in thousands of dollars are different across the three store aisle locations. b. The three means are: y middle = 2.067, y rear = and y front = To determine which of the means are significantly different from one another, we use the Tukey-Kramer procedure. SPSS gives the homogeneous subsets with α = 0.05: μ front differs significantly from μ middle and μ rear, while μ middle and μ rear do not differ significantly from each other = σ rear = σ front versus H 1 : At least one variance is different (α = 0.05) c. (i) H 0 : σ middle (ii) Use Levene s F-ratio for homogeneity of variance. (iii) Under H 0 we have F~??? (SPSS computations). We do not need further assumptions on the distributions of all three populations. Specifically, in contrast to the usual two-sample Ftest for variances, normal populations are not necessary. (iv) Use the SPSS output for Levene s test for homogeneity of variance: F calc = and p-value = P(F 2.227) = (v) Since the p-value = > 0.05, do not reject H 0. There is no evidence of a significant difference in the variation in sales among the various aisle locations. This suggests that the use of the ANOVA is justified, at least it does not lead to the conclusion that it is not justified. d. The front aisle is best for the sale of this product. The manager should evaluate the trade-off in switching the location of this product and the product that is currently intended for the front location. In a five-step procedure at the exam, you should use the critical value approach or the p-value approach. It is not needed to use both. In our solutions, we sometimes write down both approaches, to help you check your solution. Q3 (based on Doane & Seward, 4/E, 16.10) The results shown below are mean productivity measurements (average number of assemblies completed per hour) for a random sample of workers at each of three work stations. a. At α =.05, is there a difference in median productivity? b. Use one-factor ANOVA to compare the means. c. Do you reach the same conclusion? BS 5 Tutorial 4

6 Sol Q3 a. reject H0; b. reject H0; c. the same. a. Use the Kruskal-Wallis non-parametric ANOVA. (i) H 0 : M A = M B = M C ; H 1 : not H 0 ; α = 0.05 (ii) Sample statistic: use mean rank sums T A, T B, and T C to compute H-statistic; reject for large values 2 (iii) Under H 0 : H~χ 2 Requirement: n for all groups 5: OK (iv) χ calc = 9.479; χ crit = χ upper;0.05;2 = Reported p-value: (v) Reject H 0 because χ calc χ crit (or because p-value α) b. Use the parametric ANOVA. (i) H 0 : μ A = μ B = μ C ; H 1 : not H 0 ; α = 0.05 (ii) Sample statistic: F = MSA ; reject for large values MSW (iii) Under H 0 : F~F 2,22 Requirement: normal populations, all with equal variances (which we will assume) (iv) F calc = 7.718; F crit = F upper;2,22;0.05 = 3.44 Reported p-value: BS 6 Tutorial 4

7 Extra (v) Reject H 0 because F calc F crit (or because p-value α) c. The final conclusion is the same. Both tests are quite convincing. The Kruskal-Wallis is slightly more conservative. In a five-step procedure at the exam, you should use the critical value approach or the p-value approach. It is not needed to use both. In our solutions, we sometimes write down both approaches, to help you check your solution. Q4 (based on Berenson 11/E, 11.38) A student team in a business statistics course performed an experiment to investigate the time required for pain-relief tablets to dissolve in a glass of water. The factor of interest is the temperature of the water (hot or cold). The experiment consisted of 12 replicates for each of the two temperature levels. The following data show the time a tablet took to dissolve (in seconds) for the 24 tablets used in the experiment: At the 0.05 level of significance, a. Is there an effect due to water temperature? Use Output from SPSS: BS 7 Tutorial 4

8 b. Can you redo this analyses with a t-test? Q4 a. there is a temperature effect ; b. the analysis can be repeated with an independent samples t-test. Sol a. Use SPSS output, with model Y ij = μ + α i + ε ij and ε ij ~N(0, σ 2 ) (i) H 0 : α H = α C = 0; H 1 : not H 0 ; α = 0.05 (ii) Sample statistic: F = MSA ; reject for large values MSW (iii) Under H 0 : F~F 1,22 ; requirements: normal populations with equal variances (which we will assume) (iv) F calc = ; F crit = 4.30; p-value = (Mind that F calc is not to scale) (v) reject H 0 without any doubt: there is a very clear temperature effect b. The analysis can also be done with an independent two-sample t-test, putting μ H = μ + α H and μ C = μ + α C (i) H 0 : μ H = μ C ; H 1 : not H 0 ; α = 0.05 (ii) Sample statistic: t = (Y C Y H) (μ C μ H ) ; reject for large and small values S Y C Y H BS 8 Tutorial 4

9 (iii) Under H 0 : t~t n 2 ; requirements: symmetric populations (15 n < 30) with equal variances (upper row in SPSS; ignore the result of Levene s test for a minute) (iv) t calc = ; t crit = ±2.074; p-value = Extra (note: t calc not drawn to scale) (v) reject H 0 without any doubt: there is a very clear temperature effect Why ignore Levene s test? The purpose of this exercise was to do an ANOVA with a binary variable and to compare it with an independent samples t-test. For the independent samples ttest, we have a procedure to do a test (lower line) without the equal-variance assumption, but for ANOVA we had assumed equal variances. Note that the effect is so tremendous (p = 0.000), and that even without the equal-variance assumption the null-hypothesis of no temperature effect is rejected very convincingly. Levene s test in the ANOVA case by the way also rejects equality of variance (with the same F calc and p-value): The non-parametric Kruskal-Wallis tests, by the way, give a similar pattern: and so does the independent-samples Wilcoxon-Mann-Whitney: BS 9 Tutorial 4

10 8A+B Simple regression analysis + Multiple regression analysis and other issues Q1 (based on Doane & Seward, 4/E, 12.7) a. Interpret the slope of the fitted regression HomePrice = 125, SquareFeet. b. What is the prediction for HomePrice if SquareFeet = 2,000? c. Would the intercept be meaningful if this regression applies to home sales in a certain subdivision, different form the one used to find the regression equation? Q1 a. Increasing the size of a home by 1 square foot increases the price by $150. b. HomePrice = $125,000 + ($150 2,000) = $425,000. c. The intercept might be interpreted as the value of the lot without a home. But the range of values for SquareFeet does not include zero so it would be dangerous to extrapolate for SquareFeet = 0. Extra Observe the somewhat confusing habit in economic literature of writing regression equations in the form HomePrice = 125, SquareFeet, where SquareFeet is a variable, not a unit. Q2 (based on Doane & Seward, 4/E, 12.13) The regression equation HomePrice = Income was estimated from a sample of 34 cities in the eastern United States. Both variables are in thousands of dollars. HomePrice is the median selling price of homes in the city, and Income is median family income for the city. a. Interpret the slope. b. Is the intercept meaningful? Explain. c. Make a prediction of HomePrice when Income = 50 and also when Income = 100. d. Given: R 2 = What is the meaning of that? (Data are from Money Magazine 32, no. 1 [January 2004], pp ) Q2 a. Increasing the median income by $1,000 raises the median home price by $2,610; b. If median income is zero, then the model suggests that median home price is $51,300; c. $181,800 and $312,300; d. 34% of the variance of HomePrice is explained by the model. Sol a. Increasing the median income by $1,000 raises the median home price by $2,610. b. If median income is zero, then the model suggests that median home price is $51,300. While it does not seem logical that the median family income for any city is zero, it is unclear what the lower bound would be. c. prediction HomePrice = $ (2.61 $50) = $181.8 (in $1000) or $181,800 prediction Homeprice = $ (2.61 $100) = $312.3 (in $1000) or $312,300 d. 34% of the variance of HomePrice is explained by the model. That is quite low. And it might be due to chance: perhaps a lucky sample. Fortunately, the latter can be judged by BS 10 Tutorial 4

11 statistical significance. The model is significant if the slope is significantly different from zero: this seems to be the case looking at the p-value (see later). Q3 (based on Doane & Seward, 4/E, 12.26) A regression was performed using data on 16 randomly selected charities in The variables were Y = expenses (millions of dollars) and X = revenue (millions of dollars). a. Write the fitted regression equation. b. Construct a 95 percent confidence interval for the slope. c. Perform a right-tailed t test for zero slope at α =.05. State the hypotheses clearly. (Data are from Forbes 172, no. 12, p. 248, and SUMMARY OUTPUT Regression Statistics Multiple R 0, R Square 0, Adjusted R Square 0, Standard Error 14, Observations 16 ANOVA df SS MS F Significance F Regression , , , ,07289E-08 Residual , ,13245 Total Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 7, ,0403 Revenue 0,9467 0,0936 Q3 a. Y = X; b β ; c. reject H0 Sol a. Define Y: expenses ($1,000,000); X: revenue ($1,000,000) a. Y = X b. For a 95% confidence level use t 14;0.025 = The 95% confidence interval is ± ( ) or β c. Use our 5-steps procedure: Model ( step 0 ): Y i = β 0 + β 1 X i + ε i where ε i ~N(0, σ 2 ) (i) H 0 : β 1 0 versus H 1 : β > 0 (α = 0.05) (ii) Sample statistic: B 1 ; reject for large values. (iii) Distribution test statistic under H 0 : t = B 1 β 1 ~t S n 2 B 1 Requirements: see model (iv) Calculated test statistic: t calc = = Critical value: t crit = t 14;0.05 = (note: t calc not drawn to scale) (v) Because t calc > t crit, reject H 0. There is evidence that the slope is positive; increased revenue is correlated with increased expenses. BS 11 Tutorial 4

12 Q4 Use a linear regression model to explain the height (Dutch: lengte ) of female premaster students ( ) in terms of their shoe size (Dutch: schoenmaat ). Below you find some computer output, based on a random sample of these students. Predicted values for: Lengtecm 95% Confidence Interval 95% Prediction Interval Schoenmaat Predicted lower upper lower upper a. Determine the theoretical and the estimated model belonging to the given output. b. It is claimed that the slope in this model is larger than 2. Test this hypothesis (α = 1%). c. Is this a useful model in order to predict the height of female premaster students? (Perhaps you have seen a footprint in the snow; is it useful (using this model) to predict the height of the person concerned?) d. You see a footprint of size 38 in the snow and looking up you see in the distance a (female) premaster student just walking away. Give a relevant 95% interval for the height of this (female) premaster student. BS 12 Tutorial 4

13 Sol e. The next day you see another footprint of size 38. Give a relevant 95% interval for the average height of all (female) premaster students with shoe size 38. f. Calculate a 90% confidence interval for the constant in the regression model. Q4 a. Theoretical model: Yi = β0 + β1xi + εi, with εi~n(0, σ 2 ); Estimated model: Y = b0 + b1x = X; b. reject H0; c. not very useful; d. ሾ , ሿ; e. ሾ , ሿ; f. ሾ24.467,78.305ሿ a. Theoretical model: Y i = β 0 + β 1 X i + ε i, with ε i ~N(0, σ 2 ). Y = height in cm, X = shoe size (may be stated for individual observations with or without the subscript i). Estimated model: Y = b 0 + b 1 X = X b. Use the 5 steps procedure! step 0 (model): see a. (i) H 0 : β 1 2; H 1 : β 1 > 2; α = 1% (ii) Sample statistic: B 1 ; reject for large values (iii) Distribution test statistic under H 0 : t = B 1 β 1 ~t S n 2 = t 89 (n = 91) B 1 Requirements: see model. (iv) Calculated test statistic: t calc = = Critical value: t crit = t 89;0.01 = using Excel. With the table, you may take a conservative value t crit t 85;0.01 = p-value: (using Excel) (v) Decision: do reject H 0, because p-value smaller than 1% or because t calc > t crit. Conclude that the slope is larger than 2. c. It is a statistically significant model, so the question about practically relevant is meaningful. We have R 2 = which is quite low. It would have quite limited value in predicting the height of a thief if the police found a footprint in the snow. d. This is individual prediction : Y X1 = e. This is mean prediction : Y X1 = f. b 0 ± t n 2;0.05 s B0 = ± ( ), so β (Excel: t 89;0.05 = ) Q5 A consumer products company wants to measure the effectiveness of different types of advertising media in the promotion of its products. Specifically, the company is interested in the effectiveness of radio advertising and newspaper advertising (including the cost of discount coupons). A sample of 22 cities with approximately equal populations is selected for study during a test period of one month. Each city is allocated a specific expenditure level both for radio advertising and for newspaper advertising. The sales of the product (in thousands of dollars) and also the levels of media expenditure (in thousands of dollars) during the test month are recorded, with the following results: BS 13 Tutorial 4

14 SPSS results: a. State the multiple regression equation (description of the model including assumptions and the estimated model). b. Interpret the meaning of the slopes, b 1 and b 2, in this problem. c. Interpret the meaning of the regression coefficient, b 0. d. Which type of advertising is more effective? Explain. e. Determine whether there is a significant relationship between sales and the two independent variables (radio advertising and newspaper advertising) at the 0.05 level of significance. f. Interpret the meaning of the p-value. g. Compute the coefficient of multiple determination, R 2, and interpret its meaning. h. Find the adjusted R 2 and interpret its meaning. i. Is there evidence that the slope coefficient for Radio advertisements is more than 10 at α = 0.05? BS 14 Tutorial 4

15 Sol Q5 a. Y = β0 + β1x1 + β2x2 + ε with ε~n(0, σ 2 ) and Y = b0 + b1x1 + b2x2 = X X2; d. newspaper advertising is more effective; e. there is evidence of a significant linear relationship; i. there is evidence that the coefficient for radio advertisements is larger than 10. a. Statistical model: Y = β 0 + β 1 X 1 + β 2 X 2 + ε with ε~n(0, σ^2) where Y=Sales, X 1 =Radio Advertising, X 2 =Newspaper Advertising Estimated model: Y = b 0 + b 1 X 1 + b 2 X 2 = X X 2 where Y =Estimated Sales b. For a given amount of newspaper advertising, each increase of $1000 in radio advertising is estimated to result in a mean increase in sales of $13,081. For a given amount of radio advertising, each increase of $1000 in newspaper advertising is estimated to result in the mean increase in sales of $16,795. c. When there is no money spent on radio advertising and newspaper advertising, the estimated mean amount of sales is $156, d. The slope of newspaper advertising is higher than the slope of radio advertising, so newspaper advertising is more effective. e. Model: see a. (i) H 0 : β 1 = β 2 = 0; H 1 : not H 0 ; α = 0.05 (ii) Sample statistic: F = MSR ; reject for large values MSE (iii) Under H 0 : F~F 2,19 Requirement: see model formulation (error term normally distributed with constant variance) (iv) F calc = = 40.16; F crit = 3.522; p-value = (note: F calc not drawn to scale) (v) reject H 0 because F calc > F crit or equivalently because p < α. Conclude that there is evidence of a significant linear relationship. f. p-value < : the probability of obtaining an F calc of or even larger is less than if H 0 is true. g. R 2 = SSR = = , or rather directly from SPSS output. So, 80.87% of the SST variation in sales can be explained by variation in radio advertising and variation in newspaper advertising. Note: model is significant and R 2 = 0.81, so practically it is a useful model. 2 h. R adj = from computer output. This is the proportion of explained variance, but taking into account the number of variables and number of observations. i. This test is not provided by SPSS, but is not difficult to derive from it. (i) H 0 : β 1 10 against H 1 : β 1 > 10 (α = 0.05) (ii) Sample statistic: B 1 ; reject for large values. (iii) Under H 0 : B 1 β 1 ~t S 19 ; requirements: see earlier. B 1 (iv) t calc = = ; t crit = t 19;0.05 = BS 15 Tutorial 4

16 (v) Reject H 0 because t calc > t crit. There is evidence that the coefficient for radio advertisements is larger than 10. Old exam questions Q1 (i) 22 May 2017, Q1i-1k We wish to establish a demand function Q = a bp, using measurement data in 6 different years. Data are below; some part has been erased. (j) (k) (l) Write down the estimated demand function. See (i). Give the upper bound of the 95% confidence interval for the slope coefficient for price. (2 decimals) See (i). Give the value of the usual test statistic for testing the overall significance of the model. (1 decimal) See (i). Give the value of R 2. (2 decimals) Q1 i. Q = P j k or 12.8 l Sol i. Just look at the second table, keeping in mind that the regression model uses Q = β 0 + β 1 P + ε. Of course, you may also write Q = P, with the hat symbolizing estimation. BS 16 Tutorial 4

17 j. The estimate of the slope coefficient is and its standard error Use t crit = t n 2;0.025 = 2.776, and calculate = k. Overall model significance is tested with F = MSR = = For a simple MSE regression, overall significance can also be tested with t = B 0 SE B = = l. R 2 = SSR SST = = Q2 29 March 2017, Q3 The effect of alcohol and drugs on learning achievements is a subject of intense research. A group of test subjects is asked to do a test exam, with a score between 1 (low) and 10 (high). Researchers want to find out how this relates to their use of alcohol and drugs in the week before the test was taken. For instance, student #2 reported that he used 13 alcoholic beverages and 4 times drugs in the week before the test. Results are shown below (take care: these tables use a decimal point). Some parts of the output have been suppressed. In all questions, define all symbols, except when you use standard symbols (such as R 2 ). (a) (b) (c) (d) State the the theoretical model analyzed, as well as its practical relevance (numerical) and statistical significance (numerical and the null hypothesis). Interpret practical relevance in a few words. Define all non-standard symbols. Find a 95% confidence interval for the slope coefficient for alcohol. Before taking the exam, Bob wants to relax by taking either an alcoholic beverage or a drug. Given that he likes to obtain a high grade, what can he best take: alcohol or drugs? Explain why. Test, at α = 5%, if the hypothesis that the slope coefficient for drugs is equal to 0.2 can be rejected. Use the five-step procedure. BS 17 Tutorial 4

18 Sol (a) (b) (c) exam score is explained by alcohol and drugs use. (b) CIβ1;0.95 = ሾ , ሿ. (c) Bob can best take a drug, no alcohol. (d) Do not reject H0 and conclude that there is no reason for concluding that the slope factor for drugs is unequal to 0.2. = This means that 53% of the variance in the Theoretical model: Y = β 0 + β 1 X 1 + β 2 X 2 + ε, where X 1 is the use of alcohol and X 2 is the use of drugs. Statistical significance: the p-value of this model is 0.001, with H 0 : β 1 = β 2 = 0. Practical relevance: R 2 = SSR = This means that 53% of the variance in the = SST exam score is explained by alcohol and drugs use. Use b 1 = 0.156, S B1 = and df = 18. For a 95% confidence interval we need t 18;0.025 = So, CI β1 ;0.95 = ሾ , ሿ = ሾ , ሿ. Or alternatively ± 0.101, or β Bob can best take a drug, no alcohol. One drug will decrease his expected grade by 0.147, one alcohol consumption by (d) model: see question (a) with ε~n(0, σ 2 ) i) H 0 : β 2 = 0.2; H 1 : β 2 0.2;α = 5% ii) Sample statistic: B 2 ; reject for small and large values iii) Null distribution: B 2 β 2 ~t S 18 ; requirements: see model B 2 iv) t calc = ( 0.2) = 0.726; t crit = ±t 0.025;18 = ± = SST Practical relevance: R 2 = SSR Q2 (a) Theoretical model: Y = β0 + β1x1 + β2x2 + ε, where X1 is the use of alcohol and X2 is the use of drugs. Statistical significance: the p-value of this model is 0.001, with H0: β1 = β2 = 0. v) Do not reject H 0 and conclude that there is no reason for concluding that the slope factor for drugs is unequal to 0.2. Q3 29 March 2017, Q2c A study focuses on the speed of typing WhatsThat messages, split by age group. We ask random persons in three age groups to type a standard message, and observe the time required (seconds). Sample size, mean of the typing time and standard deviation of the typing time are reported below. What is the name of the test we use to find out if the differences in mean typing times of the three age groups are significant, and what is the critical value of the usual test statistic at α = 0.05? (4 points) BS 18 Tutorial 4

19 Sol Q3 Analysis of variance, the critical value is Or: Kruskal-Wallis, the critical value is We do an analysis of variance (ANOVA). The usual test statistic is F = MSA, and under H MSW 0, we have F~F 3 1, = F 2,27. At α = 0.05, the critical value is Alternatively, you may argue for Kruskal-Wallis (non-parametric ANOVA). This uses χ 3 1 with a critical value of , BS 19 Tutorial 4

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics

SEVERAL μs AND MEDIANS: MORE ISSUES. Business Statistics SEVERAL μs AND MEDIANS: MORE ISSUES Business Statistics CONTENTS Post-hoc analysis ANOVA for 2 groups The equal variances assumption The Kruskal-Wallis test Old exam question Further study POST-HOC ANALYSIS

More information

SIMPLE REGRESSION ANALYSIS. Business Statistics

SIMPLE REGRESSION ANALYSIS. Business Statistics SIMPLE REGRESSION ANALYSIS Business Statistics CONTENTS Ordinary least squares (recap for some) Statistical formulation of the regression model Assessing the regression model Testing the regression coefficients

More information

MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics

MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES Business Statistics CONTENTS Multiple regression Dummy regressors Assumptions of regression analysis Predicting with regression analysis Old exam question

More information

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Remember: Z.05 = 1.645, Z.01 = 2.33 We will only cover one-sided hypothesis testing (cases 12.3, 12.4.2, 12.5.2,

More information

Simple Linear Regression: One Quantitative IV

Simple Linear Regression: One Quantitative IV Simple Linear Regression: One Quantitative IV Linear regression is frequently used to explain variation observed in a dependent variable (DV) with theoretically linked independent variables (IV). For example,

More information

ANOVA - analysis of variance - used to compare the means of several populations.

ANOVA - analysis of variance - used to compare the means of several populations. 12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.

More information

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing

More information

LI EAR REGRESSIO A D CORRELATIO

LI EAR REGRESSIO A D CORRELATIO CHAPTER 6 LI EAR REGRESSIO A D CORRELATIO Page Contents 6.1 Introduction 10 6. Curve Fitting 10 6.3 Fitting a Simple Linear Regression Line 103 6.4 Linear Correlation Analysis 107 6.5 Spearman s Rank Correlation

More information

Statistics For Economics & Business

Statistics For Economics & Business Statistics For Economics & Business Analysis of Variance In this chapter, you learn: Learning Objectives The basic concepts of experimental design How to use one-way analysis of variance to test for differences

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

Chapter 4: Regression Models

Chapter 4: Regression Models Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,

More information

Simple Linear Regression: One Qualitative IV

Simple Linear Regression: One Qualitative IV Simple Linear Regression: One Qualitative IV 1. Purpose As noted before regression is used both to explain and predict variation in DVs, and adding to the equation categorical variables extends regression

More information

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and

More information

STAT 350 Final (new Material) Review Problems Key Spring 2016

STAT 350 Final (new Material) Review Problems Key Spring 2016 1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,

More information

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Regression Models. Chapter 4. Introduction. Introduction. Introduction Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager

More information

Chapter 3 Multiple Regression Complete Example

Chapter 3 Multiple Regression Complete Example Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be

More information

Multiple Linear Regression

Multiple Linear Regression 1. Purpose To Model Dependent Variables Multiple Linear Regression Purpose of multiple and simple regression is the same, to model a DV using one or more predictors (IVs) and perhaps also to obtain a prediction

More information

SPSS Guide For MMI 409

SPSS Guide For MMI 409 SPSS Guide For MMI 409 by John Wong March 2012 Preface Hopefully, this document can provide some guidance to MMI 409 students on how to use SPSS to solve many of the problems covered in the D Agostino

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 4 4- Basic Business Statistics th Edition Chapter 4 Introduction to Multiple Regression Basic Business Statistics, e 9 Prentice-Hall, Inc. Chap 4- Learning Objectives In this chapter, you learn:

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

Chapter 14 Student Lecture Notes 14-1

Chapter 14 Student Lecture Notes 14-1 Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

Tribhuvan University Institute of Science and Technology 2065

Tribhuvan University Institute of Science and Technology 2065 1CSc. Stat. 108-2065 Tribhuvan University Institute of Science and Technology 2065 Bachelor Level/First Year/ First Semester/ Science Full Marks: 60 Computer Science and Information Technology (Stat. 108)

More information

Correlation and Regression

Correlation and Regression Correlation and Regression Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 Learning Objectives Upon successful completion of this module, the student should

More information

Degrees of freedom df=1. Limitations OR in SPSS LIM: Knowing σ and µ is unlikely in large

Degrees of freedom df=1. Limitations OR in SPSS LIM: Knowing σ and µ is unlikely in large Z Test Comparing a group mean to a hypothesis T test (about 1 mean) T test (about 2 means) Comparing mean to sample mean. Similar means = will have same response to treatment Two unknown means are different

More information

This gives us an upper and lower bound that capture our population mean.

This gives us an upper and lower bound that capture our population mean. Confidence Intervals Critical Values Practice Problems 1 Estimation 1.1 Confidence Intervals Definition 1.1 Margin of error. The margin of error of a distribution is the amount of error we predict when

More information

Chapter 7 Student Lecture Notes 7-1

Chapter 7 Student Lecture Notes 7-1 Chapter 7 Student Lecture Notes 7- Chapter Goals QM353: Business Statistics Chapter 7 Multiple Regression Analysis and Model Building After completing this chapter, you should be able to: Explain model

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Chapter 15: Analysis of Variance

Chapter 15: Analysis of Variance Chapter 5: Analysis of Variance 5. Introduction In this chapter, we introduced the analysis of variance technique, which deals with problems whose objective is to compare two or more populations of quantitative

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of

More information

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

STAT 328 (Statistical Packages)

STAT 328 (Statistical Packages) Department of Statistics and Operations Research College of Science King Saud University Exercises STAT 328 (Statistical Packages) nashmiah r.alshammari ^-^ Excel and Minitab - 1 - Write the commands of

More information

Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES

Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES M04_REND6289_10_IM_C04.QXD 5/7/08 2:49 PM Page 46 4 C H A P T E R Regression Models TEACHING SUGGESTIONS Teaching Suggestion 4.1: Which Is the Independent Variable? We find that students are often confused

More information

Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data

Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data Statistics for Managers Using Microsoft Excel Chapter 10 ANOVA and Other C-Sample Tests With Numerical Data 1999 Prentice-Hall, Inc. Chap. 10-1 Chapter Topics The Completely Randomized Model: One-Factor

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,

More information

What is a Hypothesis?

What is a Hypothesis? What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population mean Example: The mean monthly cell phone bill in this city is μ = $42 population proportion Example:

More information

Non-parametric (Distribution-free) approaches p188 CN

Non-parametric (Distribution-free) approaches p188 CN Week 1: Introduction to some nonparametric and computer intensive (re-sampling) approaches: the sign test, Wilcoxon tests and multi-sample extensions, Spearman s rank correlation; the Bootstrap. (ch14

More information

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr.

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr. Department of Economics Business Statistics Chapter 1 Chi-square test of independence & Analysis of Variance ECON 509 Dr. Mohammad Zainal Chapter Goals After completing this chapter, you should be able

More information

4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures

4/6/16. Non-parametric Test. Overview. Stephen Opiyo. Distinguish Parametric and Nonparametric Test Procedures Non-parametric Test Stephen Opiyo Overview Distinguish Parametric and Nonparametric Test Procedures Explain commonly used Nonparametric Test Procedures Perform Hypothesis Tests Using Nonparametric Procedures

More information

The Multiple Regression Model

The Multiple Regression Model Multiple Regression The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & or more independent variables (X i ) Multiple Regression Model with k Independent Variables:

More information

SALES AND MARKETING Department MATHEMATICS. 2nd Semester. Bivariate statistics. Tutorials and exercises

SALES AND MARKETING Department MATHEMATICS. 2nd Semester. Bivariate statistics. Tutorials and exercises SALES AND MARKETING Department MATHEMATICS 2nd Semester Bivariate statistics Tutorials and exercises Online document: http://jff-dut-tc.weebly.com section DUT Maths S2. IUT de Saint-Etienne Département

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

Chapter 8 Student Lecture Notes 8-1. Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance

Chapter 8 Student Lecture Notes 8-1. Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance Chapter 8 Student Lecture Notes 8-1 Department of Economics Business Statistics Chapter 1 Chi-square test of independence & Analysis of Variance ECON 509 Dr. Mohammad Zainal Chapter Goals After completing

More information

Analysis of variance

Analysis of variance Analysis of variance Tron Anders Moger 3.0.007 Comparing more than two groups Up to now we have studied situations with One observation per subject One group Two groups Two or more observations per subject

More information

Part Possible Score Base 5 5 MC Total 50

Part Possible Score Base 5 5 MC Total 50 Stat 220 Final Exam December 16, 2004 Schafer NAME: ANDREW ID: Read This First: You have three hours to work on the exam. The other questions require you to work out answers to the questions; be sure to

More information

The independent-means t-test:

The independent-means t-test: The independent-means t-test: Answers the question: is there a "real" difference between the two conditions in my experiment? Or is the difference due to chance? Previous lecture: (a) Dependent-means t-test:

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

Analysis of variance (ANOVA) Comparing the means of more than two groups

Analysis of variance (ANOVA) Comparing the means of more than two groups Analysis of variance (ANOVA) Comparing the means of more than two groups Example: Cost of mating in male fruit flies Drosophila Treatments: place males with and without unmated (virgin) females Five treatments

More information

Independent Samples ANOVA

Independent Samples ANOVA Independent Samples ANOVA In this example students were randomly assigned to one of three mnemonics (techniques for improving memory) rehearsal (the control group; simply repeat the words), visual imagery

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Exercise I.1 I.2 I.3 I.4 II.1 II.2 III.1 III.2 III.3 IV.1 Question (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) Answer

Exercise I.1 I.2 I.3 I.4 II.1 II.2 III.1 III.2 III.3 IV.1 Question (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) Answer Solutions to Exam in 02402 December 2012 Exercise I.1 I.2 I.3 I.4 II.1 II.2 III.1 III.2 III.3 IV.1 Question (1) (2) (3) (4) (5) (6) (7) (8) (9) (10) Answer 3 1 5 2 5 2 3 5 1 3 Exercise IV.2 IV.3 IV.4 V.1

More information

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01 An Analysis of College Algebra Exam s December, 000 James D Jones Math - Section 0 An Analysis of College Algebra Exam s Introduction Students often complain about a test being too difficult. Are there

More information

Chap The McGraw-Hill Companies, Inc. All rights reserved.

Chap The McGraw-Hill Companies, Inc. All rights reserved. 11 pter11 Chap Analysis of Variance Overview of ANOVA Multiple Comparisons Tests for Homogeneity of Variances Two-Factor ANOVA Without Replication General Linear Model Experimental Design: An Overview

More information

4.1. Introduction: Comparing Means

4.1. Introduction: Comparing Means 4. Analysis of Variance (ANOVA) 4.1. Introduction: Comparing Means Consider the problem of testing H 0 : µ 1 = µ 2 against H 1 : µ 1 µ 2 in two independent samples of two different populations of possibly

More information

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests Overall Overview INFOWO Statistics lecture S3: Hypothesis testing Peter de Waal Department of Information and Computing Sciences Faculty of Science, Universiteit Utrecht 1 Descriptive statistics 2 Scores

More information

Chapter 16. Simple Linear Regression and dcorrelation

Chapter 16. Simple Linear Regression and dcorrelation Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

ECON 497 Midterm Spring

ECON 497 Midterm Spring ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain

More information

Regression Analysis. BUS 735: Business Decision Making and Research

Regression Analysis. BUS 735: Business Decision Making and Research Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X 1.04) =.8508. For z < 0 subtract the value from

More information

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December

More information

STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis

STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis STAT 135 Lab 9 Multiple Testing, One-Way ANOVA and Kruskal-Wallis Rebecca Barter April 6, 2015 Multiple Testing Multiple Testing Recall that when we were doing two sample t-tests, we were testing the equality

More information

What Is ANOVA? Comparing Groups. One-way ANOVA. One way ANOVA (the F ratio test)

What Is ANOVA? Comparing Groups. One-way ANOVA. One way ANOVA (the F ratio test) What Is ANOVA? One-way ANOVA ANOVA ANalysis Of VAriance ANOVA compares the means of several groups. The groups are sometimes called "treatments" First textbook presentation in 95. Group Group σ µ µ σ µ

More information

Lecture 10 Multiple Linear Regression

Lecture 10 Multiple Linear Regression Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable

More information

You are permitted to use your own calculator where it has been stamped as approved by the University.

You are permitted to use your own calculator where it has been stamped as approved by the University. ECONOMICS TRIPOS Part I Friday 11 June 004 9 1 Paper 3 Quantitative Methods in Economics This exam comprises four sections. Sections A and B are on Mathematics; Sections C and D are on Statistics. You

More information

Do not copy, post, or distribute. Independent-Samples t Test and Mann- C h a p t e r 13

Do not copy, post, or distribute. Independent-Samples t Test and Mann- C h a p t e r 13 C h a p t e r 13 Independent-Samples t Test and Mann- Whitney U Test 13.1 Introduction and Objectives This chapter continues the theme of hypothesis testing as an inferential statistical procedure. In

More information

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Overview Introduction 10-1 Scatter Plots and Correlation 10- Regression 10-3 Coefficient of Determination and

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables. Regression Analysis BUS 735: Business Decision Making and Research 1 Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn how to estimate

More information

1 Introduction to Minitab

1 Introduction to Minitab 1 Introduction to Minitab Minitab is a statistical analysis software package. The software is freely available to all students and is downloadable through the Technology Tab at my.calpoly.edu. When you

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

MATH 644: Regression Analysis Methods

MATH 644: Regression Analysis Methods MATH 644: Regression Analysis Methods FINAL EXAM Fall, 2012 INSTRUCTIONS TO STUDENTS: 1. This test contains SIX questions. It comprises ELEVEN printed pages. 2. Answer ALL questions for a total of 100

More information

Non-parametric tests, part A:

Non-parametric tests, part A: Two types of statistical test: Non-parametric tests, part A: Parametric tests: Based on assumption that the data have certain characteristics or "parameters": Results are only valid if (a) the data are

More information

Finding Relationships Among Variables

Finding Relationships Among Variables Finding Relationships Among Variables BUS 230: Business and Economic Research and Communication 1 Goals Specific goals: Re-familiarize ourselves with basic statistics ideas: sampling distributions, hypothesis

More information

One-Way ANOVA Source Table J - 1 SS B / J - 1 MS B /MS W. Pairwise Post-Hoc Comparisons of Means

One-Way ANOVA Source Table J - 1 SS B / J - 1 MS B /MS W. Pairwise Post-Hoc Comparisons of Means One-Way ANOVA Source Table ANOVA MODEL: ij = µ* + α j + ε ij H 0 : µ 1 = µ =... = µ j or H 0 : Σα j = 0 Source Sum of Squares df Mean Squares F Between Groups n j ( j - * ) J - 1 SS B / J - 1 MS B /MS

More information

Statistics for IT Managers

Statistics for IT Managers Statistics for IT Managers 95-796, Fall 2012 Module 2: Hypothesis Testing and Statistical Inference (5 lectures) Reading: Statistics for Business and Economics, Ch. 5-7 Confidence intervals Given the sample

More information

Regression Analysis II

Regression Analysis II Regression Analysis II Measures of Goodness of fit Two measures of Goodness of fit Measure of the absolute fit of the sample points to the sample regression line Standard error of the estimate An index

More information

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006 Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Chapter 7 Comparison of two independent samples

Chapter 7 Comparison of two independent samples Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N

More information

Multiple Regression Methods

Multiple Regression Methods Chapter 1: Multiple Regression Methods Hildebrand, Ott and Gray Basic Statistical Ideas for Managers Second Edition 1 Learning Objectives for Ch. 1 The Multiple Linear Regression Model How to interpret

More information

3. Nonparametric methods

3. Nonparametric methods 3. Nonparametric methods If the probability distributions of the statistical variables are unknown or are not as required (e.g. normality assumption violated), then we may still apply nonparametric tests

More information

Business Statistics 41000: Homework # 5

Business Statistics 41000: Homework # 5 Business Statistics 41000: Homework # 5 Drew Creal Due date: Beginning of class in week # 10 Remarks: These questions cover Lectures #7, 8, and 9. Question # 1. Condence intervals and plug-in predictive

More information

Pooled Variance t Test

Pooled Variance t Test Pooled Variance t Test Tests means of independent populations having equal variances Parametric test procedure Assumptions Both populations are normally distributed If not normal, can be approximated by

More information

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs) The One-Way Repeated-Measures ANOVA (For Within-Subjects Designs) Logic of the Repeated-Measures ANOVA The repeated-measures ANOVA extends the analysis of variance to research situations using repeated-measures

More information

STAT 212 Business Statistics II 1

STAT 212 Business Statistics II 1 STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Inference about the Regression Model and Using the Regression Line Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about

More information

1 Correlation and Inference from Regression

1 Correlation and Inference from Regression 1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is

More information

Analysis of variance

Analysis of variance Analysis of variance 1 Method If the null hypothesis is true, then the populations are the same: they are normal, and they have the same mean and the same variance. We will estimate the numerical value

More information

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000 Lecture 14 Analysis of Variance * Correlation and Regression Outline Analysis of Variance (ANOVA) 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) Outline Lecture 14 Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) 11-1 Introduction 11- Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Chapter 14 Simple Linear Regression (A)

Chapter 14 Simple Linear Regression (A) Chapter 14 Simple Linear Regression (A) 1. Characteristics Managerial decisions often are based on the relationship between two or more variables. can be used to develop an equation showing how the variables

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Econometrics Homework 1

Econometrics Homework 1 Econometrics Homework Due Date: March, 24. by This problem set includes questions for Lecture -4 covered before midterm exam. Question Let z be a random column vector of size 3 : z = @ (a) Write out z

More information

In the previous chapter, we learned how to use the method of least-squares

In the previous chapter, we learned how to use the method of least-squares 03-Kahane-45364.qxd 11/9/2007 4:40 PM Page 37 3 Model Performance and Evaluation In the previous chapter, we learned how to use the method of least-squares to find a line that best fits a scatter of points.

More information

Using SPSS for One Way Analysis of Variance

Using SPSS for One Way Analysis of Variance Using SPSS for One Way Analysis of Variance This tutorial will show you how to use SPSS version 12 to perform a one-way, between- subjects analysis of variance and related post-hoc tests. This tutorial

More information

PSYC 331 STATISTICS FOR PSYCHOLOGISTS

PSYC 331 STATISTICS FOR PSYCHOLOGISTS PSYC 331 STATISTICS FOR PSYCHOLOGISTS Session 4 A PARAMETRIC STATISTICAL TEST FOR MORE THAN TWO POPULATIONS Lecturer: Dr. Paul Narh Doku, Dept of Psychology, UG Contact Information: pndoku@ug.edu.gh College

More information

Simple Linear Regression

Simple Linear Regression 9-1 l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical Method for Determining Regression 9.4 Least Square Method 9.5 Correlation Coefficient and Coefficient

More information