Exam 2 Practice Problems

Size: px
Start display at page:

Download "Exam 2 Practice Problems"

Transcription

1 ST 312 Exam 2 Practice Problems MATERIAL COVERED ON EXAM 2 Reiland Lecture Handouts Corresponding textbook chapters 6: Infererence for the Difference Between Means Chapters 22, 23 7: Scatterplots, Correlation, Simple Linear Regression Chapters 6, 7, 8, 25 8: Transformations and Curvilinear Regression Chapter 9 9: Multiple Regression Including Indicator Variables, Interaction Chapters 28 and 29; Chapter 29 online at The above material is covered on webassign homework 5, 6, 7, and 8. NOTE: tables of the normal distribution and t-distributions will be supplied with the test. WARNING! these sample problems may not cover all topics for which you are responsible on the final exam 1. A copy machine dealer has data on the number x of copy machines at each of 89 customer locations and the number y of service calls in a month at each location. Summary calculations give x œ 8.4, s B œ 2.1, y œ 14.2, sc œ 3.8, and r œ.86. What is the slope of the least squares regression line of number of service calls on number of copiers? a b c d. none of these e. can't tell from information given 2. In the setting of the previous problem, about what percent of the variation in number of service calls is explained by the linear relation between number of service calls and number of machines? a. 86% b. 93% c. 74% d. none of these e. can't tell from information given 3. Outdoor temperature influences natural gas consumption for the purpose of heating a house. The usual measure of the need for heating is heating degree days. The number of heating degree days for a particular day is the number of degrees the average temperature for that day is below 65 F, where the average temperature for a day is the mean of the high and low temperatures for that day. An average temperature of 20 F, for example, corresponds to 45 heating degree days. A homeowner interested in switching to solar heating panels collects the following data on her natural gas use for the months October through June, where x is heating degree days per day for the month and y is gas consumption per day in hundreds of cubic feet. Month Oct Nov Dec Jan Feb Mar Apr May June x y * If ÐB BÑÐC CÑ œ *"Þ$"ß = œ "$Þ%ß = œ Þ(%, calculate the correlation coefficient r and interpret 3œ" 3 3 B C its value; draw a scatterplot of the data. 4. Each of the following statements contains a blunder. In each case explain what is wrong. a. There is a high correlation between the sex of American workers and their income." b. We found a high correlation ( r œ 1.09) between students' ratings of faculty teaching and ratings made by other faculty members." c. The correlation between planting rate and yield of corn was found to be r œ.23 bushel." 5..A study of 1,000 families gave the following results: average height of husband œbœ') 8 inches, = B œþ( in.; average height of wife œcœ'$ inches, = C œþ& in.; <œþ&. Estimate the height of a wife when her husband is ( inches tall. a. 63 inches b. 72 inches c. 64 inches d. none of these e. need more information

2 page 2 6. Outdoor temperature influences natural gas consumption for the purpose of heating a house. The usual measure of the need for heating is heating degree days. The number of heating degree days for a particular day is the number of degrees the average temperature for that day is below 65 F, where the average temperature for a day is the mean of the high and low temperatures for that day. An average temperature of 20 F, for example, corresponds to 45 heating degree days. A homeowner interested in switching to solar heating panels collects the following data on her natural gas use for the months October through June, where the explanatory variable x is heating degree days per day for the month and the response variable y is gas consumption per day in hundreds of cubic feet. Month Oct Nov Dec Jan Feb Mar Apr May June x y Summary statistics are as follows: B œ "Þ&%ß C œ &Þ&*ß = B œ "$Þ%ß = C œ Þ(%ß < œ Þ**! Calculate the least squares regression line y œ,!, " B of gas consumption y on heating degree days x. Draw the regression line on a scatterplot of the data. 7. To investigate the possible link between fluoride content of drinking water and cancer, the cancer death rates (number of deaths per 100,000 population) from in 20 selected U. S. cities - the ten largest fluoridated cities and the ten largest cities not fluoridated by were recorded. These data were used to calculate for each city the annual rate of increase in cancer death rate over this 18 year period. The data are given below: FLUORIDATED NONFLUORIDATED Annual Increase in Annual Increase in City Cancer Death Rate City Cancer Death Rate Chicago Los Angeles.8875 Philadelphia Boston Baltimore New Orleans Cleveland Seattle.4923 Washington Cincinnati Milwaukee Atlanta St. Louis Kansas City San Francisco Columbus Pittsburgh Newark Buffalo Portland a. Construct a 95% confidence interval for the difference between the mean annual increases in cancer death rates for fluoridated and nonfluoridated cities. (Use 18 degrees of freedom; do not assume equal variances). b. What assumption(s) must be satisfied for the confidence interval in part a) to be valid? 8a. We are interested in comparing the average supermarket prices of two leading colas. Our sample was taken by randomly selecting eight supermarkets and recording the price of a six-pack of each brand of cola at each supermarket. The data are shown in the following table: Supermarket Brand 1 cola $ Brand 2 cola $ Difference $ Summary statistics:. œ!þ!$(&ß =. œ!þ!$)" Find a 98% confidence interval for the difference in mean price of brand 1 and brand 2. i)!þ!$(&!þ!%!% ii)!þ!$(&!þ"$*$ iii)!þ!$(&!þ!%(" iv)!þ!$(&!þ!$%(

3 page 3 8b. Perform a hypothesis test to test if the difference in mean prices of brand 1 and brand 2 colas is different from 0. Use α =.02. Define the parameters, state hypotheses, calculate test statistic and T -value. 9. A new weight-reducing technique, consisting of a liquid protein diet, is currently undergoing tests by the Food and Drug Administration (FDA) before its introduction into the market. The weights of a random sample of five people are recorded before they are introduced to the liquid protein diet. The five individuals are then instructed to follow the liquid protein diet for 3 weeks. At the end of this period, their weights (in pounds) are again recorded. The results are listed in the table. Let. " be the true mean weight of individuals before starting the diet and let. be the true mean weight of individuals after 3 weeks on the diet Weight before diet Weight after diet Test to determine if the diet is effective at reducing weight. Use α œþ"!. 10. A simple linear regression was used to predict the score y on a final exam from the score xon the first exam. The slope of the least squares regression line is.(&. The standard error of the slope is."" and the sample size is 200. A 90% confidence interval for the true slope is (though the.0 is 8, use in the > - table) a..'% to.)' b..&$ to.*( c..&( to.*$ d..'% to.)' e. none of the above 11. Refer to the previous problem. To test the null hypothesis that the slope is zero versus the one-sided alternative that the slope is positive, we use the test statistic t œ a b..05 c..15 d..95 e If a pair of variables have a strong curvilinear (that is, nonlinear) relationship, which of the following is true? a. The correlation coefficient will be able to indicate that a nonlinear relationship is present b. A scatter plot will not be needed to indicate that a nonlinear relationship is present c. The correlation coefficient will not be able to indicate the relationship is nonlinear d. The correlation coefficient will be exactly equal to zero

4 page Sociologists are of the opinion that there has been a decrease in the difference in ages at first marriage for men and women since We want to examine data to determine if this decrease is significant. The following data summary and regression results were obtained, where the B variable is year and the C variable is the age difference (husband age wife age) at first marriage. Variable Count Mean StDev Year ( B) husband-wife age ( C) a. Interpret the value of the least squares slope, ". b. What is the value of the test statistic for testing L! À "" œ!? c. For the hypothesis test L! À" " œ! vs L+ À" "!ß select the choice below that gives the correct T- value and correct conclusion. i. The T-value is 0.68; do not reject L! À "" œ!; there is no linear relationship since 1975 between year and age difference between husband and wife at first marriage. ii. The T-value is ; reject L! À "" œ!; there is evidence that since 1975 the age difference (husband age wife age) has increased. iii The T-value is ; reject L! À "" œ!; there is evidence that since 1975 the age difference (husband age wife age) has decreased. iv The T-value is ; do not reject L! À "" œ!; there is no linear relationship since 1975 between year and age difference between husband and wife at first marriage. d. What is a 95% confidence interval for the slope? e. Find a 95% confidence interval for the mean difference (husband age wife age) at first marriage in f. Find a 95% prediction interval for the difference (husband age wife age) for a particular couple getting married for the first time in 1998.

5 page If a residual plot exhibits a curved pattern in the residuals, this means that: a. the B and C variable should be reversed so that a least squares line is appropriate for this data b. a least squares line is not appropriate because there is a nonlinear relation between B and C c. there is no significant relation between B and C d. there is a problem with the accuracy of the data Use the following Excel output to answer questions 15, 16, and 17 below. Regression Statistics Multiple R R Square Adjusted R Square Standard Error Observations 8 ANOVA df SS MS F Regression Residual Total Coefficients Standard Error t Stat P-value Intercept x Which of the following is true? a. The correlation between x and y must be approximately b. The correlation between x and y must be approximately c. The correlation between x and y must be approximately d. The correlation between x and y must be approximately Which of the following is true? a. x explains about 88.5% of the variation in y b. y explains about 88.5% of the variation in x c. x explains about 78.4% of the variation in y d. y explains about 78.4% of the variation in x 17. The predicted value of C when B œ "& is approximately a b c d

6 page 6 Use the following Excel output to answer questions 18, 19, and 20 below. 18. Given this information, what percent of the variation in the C variable is explained by differences in the explanatory variable? a. About 75 percent b. Approximately 57 percent c. Can't be determined without having the actual data available d. About 25 percent 19. Given this information, what was the sample size used in the study? a. 8 b. 18 c. 9 d If the B variable increased by 2 units, then C would change by approximately a b c d e

7 page If a least squares line were determined for the data in each scatterplot, which least squares line would result in the smallest sum of the squares of the residuals? a. A b. B c. C d. D A B C D 22. For all students at Walden University, the prediction equation for Cœcollege GPA (range 0-4.0) and B" œ high school GPA (range 0-4.0) and B œ college board score (range ) is s C œ!þ!!þ&!b"!þ!!b a. Find the predicted college GPA for students having (i) high school GPA = 4.0 and college board score =800; (ii) B œ.! and B œ!!. " b. A high school student trying to gain admission to Walden University re-takes the college board exam during the summer after his senior year in high school. If he increases his college board score by 100 points, what will be the change in his predicted college GPA? Note that his high school GPA does not change since he has already graduated from high school. c. For all students with B œ &!!, what is the predicted college GPA when high school GPA is the explanatory variable?

8 page For a study of crime in the United States, data for each of the 50 states and Washington, DC was collected on the violent crime rate (per 100,000 citizens), poverty rate (percent of the population), single parent households (percent of all state households), and urbanization (percent of state population living in urban areas). The multiple regression output is shown below where Cœviolent crime rate, B " œ poverty rate, B œ single parent households, and B œ urbanization. $ R e gre ssion Sta tistics Multiple R R Square Adjusted R Square Standard Error Observations 51 ANOVA df SS MS F Signific Regression Residual Total Coe fficie nts Sta nda rd Error t Sta t P-va lue Intercept E-08 poverty single parent E-07 urbanization E-05 a. What is the least squares prediction equation for the violent crime rate? b. What is the sum of the squares of the residuals SSE for this multiple regression model? c. What proportion of the variation in the violent crime rate is explained by poverty rate, single family households, and urbanization? d. If the poverty rate increased by 1 percent, with single parent households and urbanization unchanged, how would the violent crime rate change? e. Is this model useful for predicting the violent crime rate?

9 page Over 6 decades the Gallup Organization has periodically asked the following question: If your party nominated a generally well-qualified person for president who happened to be a woman, would you vote for that person? Below is a table showing the percentage answering yes and the year of the century (37 = 1937). % Yes Year summary statistics: B œ '(Þ%% = B œ "'Þ( C œ '"Þ)" = C œ "(Þ"* < œ Þ*(" a. Determine the estimates,! and," of the parameters "! and "" in the linear model y œ "! " " x %, where B is year and C is the percentage who respond yes. b. Use the least squares line to estimate the percentage of respondents that would say yes in c. Determine the estimate = / of the standard deviation 5 of the error component % "' (note that the sum of squares of residuals ÐC sc Ñ œ ). 3œ" 3 3 = / d. Calculate a 95% confidence interval for the slope " ". (Note that WIÐ, " Ñ œ ) 8 " = B e. Conduct an appropriate hypothesis test (use α =.05) to determine if the year of the century is useful for predicting the percentage of respondents that would answer yes to the above question. State the hypotheses, find the value of the test statistic, and state your conclusion based on the P-value or the rejection region. f. What assumptions must be (approximately) satisfied for the above statistical procedures to be valid? Perform 2 diagnostics to check these assumptions. 25. Suppose you fit the quadratic model IÐCÑ œ " " B " B to a set of 8 œ! data points and found! V œ!þ*!ß ÐC CÑ œ $(Þ"%ß and WWI œ $Þ)(Þ 3œ" 3! " 25a. Is there sufficient evidence to indicate that the model contributes information for predicting C? Test using α=0.10 i) The null hypothesis is L! : a) "! œ "" œ " œ! b) "! œ "" œ " Á! c) "" œ " œ! d) "" œ " Á! e) "! œ "" œ "! ii) The alternative hypothesis is L+ À a) at least one of "! ß "" ß "! b) at least one of "! ß "" ß " Á! c) at least one of "" ß "! d) at least one of " ß " Á! e) at least one of " ß "! " " iii) What is the value of the test statistic? a) '&Þ"$ b) "'Þ% c) ($Þ!' d) %)Þ)) e) )$Þ$ iv) If the œ!þ!!&, what is the conclusion for this test? a) Do not reject the null hypothesis. Conclude that there is not sufficient evidence that the model is useful for predicting CÞ b) Reject the null hypothesis. Conclude that there is not sufficient evidence that the model is useful for predicting CÞ c) Reject the null hypothesis. Conclude that at least one of "" ß " is nonzero and that the model is useful for predicting CÞ d) Reject the null hypothesis. Conclude that both "" and " are greater than 0 and that the model is useful for predicting C. e) Do not reject the null hypothesis. Conclude that since "" œ! and " œ!, then "! must be significantly different from 0.

10 page 10 25b. What null and alternative hypotheses would you test to determine whether upward curvature exists? a) L! À" œ!ßl+ À" Á! b) L! À" œ!ßl+ À"! c) L! À" œ!ßl+ À"! d) L À "!ß L À " Á! e) L À "!ß L À "!! +! + 25c. What null and alternative hypotheses would you test to determine whether downward curvature exists? a) L! À" œ!ßl+ À" Á! b) L! À" œ!ßl+ À"! c) L! À" œ!ßl+ À"! d) L À "!ß L À " Á! e) L À "!ß L À "!! +! Consider the following prediction equation with an interaction term: s C œ $Þ' "ÞB" Þ%B!ÞB" B Þ When B is held fixed at $ß how much does the predicted value of Cchange when B" increases by 1 unit? a) 1.8 b) 10.8 c) 11.4 d) 4.2 e) Consider the printout for an interaction regression between a response variable C and two explanatory variables B and B Þ " a. Write the prediction equation for the interaction model. b. Test the overall utility of the interaction model using the global F-test at α =.05. c. Test the hypothesis (at α =.05) that B" and B interact positively. d. Estimate the change in C for each additional 1-unit increase in B when B = *. " 28. An elections officer wants to model voter turnout C in a precinct as a function of type of election, national or state. 28a Write a model for voter turnout, C, as a function of type of election. i. Cœ"! " " B %, where Bœ" if the election is national, Bœ! if the election is local. ii. C œ "! " " B" " B % ß where B" œ " if the election is national, B" œ! if the election is not national; B œ " if the election is local, B œ! if the election is not local. iii. Cœ"! " " B %, where Bœvoter turnout. iv. C œ " " B " B %, where B œ voter turnout.! " 28b. Interpret the value of " " Þ i) the rate of increase in the voter turnout for national elections ii) the mean voter turnout for national elections iii) the difference between the mean voter turnout in national and local elections iv) the difference between the mean voter turnouts in national elections 1 year apart

11 page An elections officer wants to model voter turnout C in a precinct as a function of the type of precinct: urban, suburban, or rural. The elections officer uses the following regression model: C œ "! " " B" " B % where B" œ " if urban,! if not, B œ " if suburban,! if not. Interpret the value of " Þ i) the difference between the mean voter turnout for suburban and rural precincts ii) the rate of increase in voter turnout C for suburban precincts iii) the mean voter turnout for suburban precincts iv) the difference between the mean voter turnout for suburban and urban precincts SOLUTIONS 1. sc b. since b œ r œ œ c. since r œ (.86) œ.74 ) s ÐB BÑÐC CÑ 3 3 B 3. 3œ" r œ Ð8 "Ñ= B= C œ Þ**!Þ There is a strong positive linear relationship between heating degree days and gas consumption. SCATTERPLOT: Heat. Deg. Days (x), Gas Consump.(y) GAs Consumption Heating Degree Days GAS CONSUMPT. 4. a. The correlation we are studying measures the linear relationship between 2 quantitative variables; sex is a categorical variable. b. 1 Ÿ r Ÿ 1 is violated. c. r has no units. 5. c. The husband is 4 inches, or 4/2.7 œ 1.5 standard deviations above the mean husband height. The wife's height is predicted to be above average by.25(1.5) œ.4 standard deviations, or inches œ 1 inch. (Recall b r(s /s )) œ C B = " =! " b "! " C 6., œ < ;, œ C, Bà œ.202;, œ C, B œ 1.23 B

12 page 12 Gas Consumption/Day Heating Degree Days/Day Line Fit Plot Gas Consumption/Day Predicted Gas Consumption/Day Heating Degree Days/Day a. B , s œ 2.753, n œ 10; B œ , s œ 2.429, n œ 10, t 2.101, " œ 1 " 2.025,") œ ( ) œ œ (.5962, ). b. (i) The two samples must be independently selected (ii) The distributions of increases in cancer death rates for fluoridated and nonfluoridated cities can be approximated by normal models. (iii) The two groups we are comparing must be independent of each other.. 8a. i).. > œ!þ!$(& Þ**)! œ!þ!$(&!þ!%!%þ Þ!"ß( = 8 8b. L! À.". œ.. œ!!þ!$)" ) LEÀ.". œ.. Á! α œ Þ! where. is the mean price of brand 1 cola and. is the mean price of brand 2 cola. " Þ!$(& test statistic: > œ œ Þ()%. Þ!$)" ) œ T Ð> ( Þ()%Ñ œ Þ!(Þ Since Þ!, do not reject the null hypothesis L! À.". œ.. œ!. There appears to be no significant difference in the mean price of the 2 brands of soda. 9. L! À. ". œ.. œ!ßle À. ". œ..!àα œþ"!. œ &ß =. œ "Þ&)Þ. & > œ œ œ (Þ!)à T -value œ T Ð> (Þ!)Ñ Þ!!"Þ =. "Þ&) 8 & % Conclusion: Reject L! and conclude that the mean weight of individuals after the diet is less than the mean weight of individuals before the diet. 10. c," > WIÐ, " Ñ œ Þ(& "Þ'&&Þ"", WIÐ, Ñ Þ(& Þ"" " 11. a >œ œ 12. c " 13. a.," œ!þ!% (approximately); this means that each year since 1975 the average difference (husband age wife age) has decreased by Þ!%,"!Þ!$*&'ÞÞÞ b. > œ WIÐ, Ñ œ!þ!!&&!$þþþ œ %Þ$&$"" " c. iii. The T -value given in the Excel output is always for a 2-tail test; since we are conducting a 1-tail test!þ!!!&& L+ À ""!, the T -value is œ!þ!!!"(& d. Ð!Þ!$&$'*(ß!Þ!"&%$$&Ñ from the output; notice that the interval is entirely negative. e. use sc "**) > 8 WIÐ. s"**) Ñ; for the calculations below, note that from the output we have = / œ!þ")'''%*; C œ %*Þ*!"$!%$!Þ!$*&'&Ð"**)Ñ œ Þ!$( s "**) WIÐs. Ñ œ WI Ð, Ñ ÐB BÑ = "**) / œ ÐÞ!!&&!$$"&Ñ Ð"**) "*)'Þ&Ñ!Þ")'''%* " / 8 % œ Þ!($)'))*à > 8 WIÐ. s"**) Ñ œ Þ!(%Ð Þ!($)'))*Ñ œ Þ"&$!%à so sc "**) > 8 WIÐ. s"**) Ñ œ Þ!$( Þ"&$!% Ê ("Þ))$(*', Þ"*!!%) f. use sc > WIÐC s Ñ; "**) 8 "**)

13 page 13 for the calculations below, note that from the output we have = / œ!þ")'''%*; C œ %*Þ*!"$!%$!Þ!$*&'&Ð"**)Ñ œ Þ!$(à s "**) s "**) " / = 8 / / WIÐC Ñ œ WI Ð, Ñ ÐB BÑ = œ ÐÞ!!&&!$$"&Ñ Ð"**) "*)'Þ&Ñ!Þ")'''%* %!Þ")'''%* œ Þ!!("%à > 8 WIÐC s"**) Ñ œ Þ!(%ÐÞ!!("%Ñ œ Þ%"')à so sc > WIÐC s Ñ œ Þ!$( Þ%"') Ê "Þ'!(ß Þ%&$) "**) 8 "**) 14. b 15. b 16. c 17. a 18. b 19. c 20. e 21. a 22. a. i) sc œ!þ!!þ&!%þ!!þ!!)!! œ $Þ) ; ii) sc œ!þ!þ&!þ!!þ!!!! œ "Þ'. b. predicted college GPA will increase by!.!!"!! œ!þ. c. s C œ!þ!!þ&!b"!þ!!&!! œ!þ!!þ&!b" " œ "Þ!Þ&!B" <37/ s <+>/ œ :9@/<>C <+>/ =3816/ :+</8> ?<,+83D+>398 b. SSE œ 831, c. V œ d. the violent crime rate will increase by approx per 100,000 citizens. e. analyze the model by examining the F test statistic for the F test; the hypotheses are L! À "" œ " œ " $ œ!ß LE À at least one of the " 3's is not zero. QWI '*%"()Þ% From the regression output: J œ WWI œ "(')$Þ%' œ $*Þ&&(* ; the "Significance F" (that is, the P-value) is less than Therefore, reject and conclude that at least one of the " 's is not zero. = "(Þ"* L! 3 C 24. a.," œ < = œ Þ*(" "'Þ( œ Þ***%*à,! œ C," B œ '"Þ)" Þ***%*Ð'(Þ%%Ñ œ &Þ&*$&); B b. s C *( œ &Þ&*$&) Þ***%*Ð*(Ñ œ *"Þ$' WWI &&Þ(%) c. = / œ œ 8 "% œ %Þ(%; = / %Þ(% d. WIÐ, " Ñ œ =!''"; 8 " = œþ B "&"'Þ( confidence interval is," > 8 WIÐ, " Ñ œ Þ***%* Þ"%&ÐÞ!''"Ñ œ Þ***%* Þ"%"() Ê ÐÞ)&(("ß "Þ"%"(Ñ e. L! À" " œ! vs L+ À" " Á!à," Þ***%* test statistic > œ WIÐ, Ñ œ Þ!''" œ "&Þ"à " for α œ Þ!&ß the rejection region is > Þ"%& and > Þ"%& ( 8 œ "' œ "%.0Ñ; the T -value is 0 to nine decimal places. Conclusion: since the test statistic is in the rejection region, reject L! À "" œ! and conclude that year is useful for predicting the percentage of respondents that will answer yes to the question. f. In the model C 3 œ "! " " B 3 % 3, the error terms % 3 are assumed to be independent and normally distributed with mean 0 and standard deviation 5 for all B, that is, % µ iid RÐ!ß 5Ñ for all BÞ Below is the least squares line on the scatterplot, a plot of the residuals against the -values, and a histogram B of the residuals. The first 2 plots cast some doubt on the assumption of independent errors since the residuals appear to occur in groups of positive and negative residuals. The histogram casts some doubt on the normality of the errors since the residuals show some left-skewness. 3

14 page %Yes vs Year y = x R² = Residuals Residual Plot Year 25a i) c 25a ii) d 25a iii) c Test statistic is àsince 8œ! and 5œß Jœ QWV QWI WWI $Þ)( QWI œ œ œ!þ(( Þ To determine QWVß note that since WWX œ ÐC CÑ œ $(Þ"%, and 8 5 " "( WWX œ WWV WWI (that is, Total Sum of Squares œ Sum of Squares for Regression Sum of Squares of the! 3œ" 3

15 page 15 Residuals; see Annotated Excel Regression Output in the Lecture Handouts section of our class webpage), then WWV œ WWX WWI œ $(Þ"% $Þ)( œ $$Þ(. So QWV WWV $$Þ( œ 5 œ œ "'Þ'$& and QWV "'Þ'$& J œ œ œ ($Þ!' QWI!Þ(( 25a iv) c 25b c 25c b 26 a 27a s C œ "'Þ(! $Þ!$($B" "Þ!%'&B %Þ!("(B" B "" 27b Test the null hypothesis L! À "" œ " œ " $ œ!. The J statistic is *$*% with T -value Þ"" x "! ; since the T-value Þ!&, reject L! and conclude that the model is useful for predicting C. & 27c Test L! À " $ œ! vs L+ À " $!. The test statistic is > œ *Þ"( and the 2-tailed T -value œ *Þ%) x "! ß so & the T-value for this 1-tailed test is T-value œ %Þ(% x "!. Reject L! À " $ œ! and conclude that B" and B interact positively. 27d $$Þ'!) 28a i) 28b iii) 29 i)

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000 Lecture 14 Analysis of Variance * Correlation and Regression Outline Analysis of Variance (ANOVA) 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) Outline Lecture 14 Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) 11-1 Introduction 11- Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 17, 2010 Instructor: John Parman Final Exam - Solutions You have until 12:30pm to complete this exam. Please remember to put your

More information

This document contains 3 sets of practice problems.

This document contains 3 sets of practice problems. P RACTICE PROBLEMS This document contains 3 sets of practice problems. Correlation: 3 problems Regression: 4 problems ANOVA: 8 problems You should print a copy of these practice problems and bring them

More information

Unit 6 - Introduction to linear regression

Unit 6 - Introduction to linear regression Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,

More information

Inference for the Regression Coefficient

Inference for the Regression Coefficient Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates

More information

SECTION I Number of Questions 42 Percent of Total Grade 50

SECTION I Number of Questions 42 Percent of Total Grade 50 AP Stats Chap 7-9 Practice Test Name Pd SECTION I Number of Questions 42 Percent of Total Grade 50 Directions: Solve each of the following problems, using the available space (or extra paper) for scratchwork.

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Unit 6 - Simple linear regression

Unit 6 - Simple linear regression Sta 101: Data Analysis and Statistical Inference Dr. Çetinkaya-Rundel Unit 6 - Simple linear regression LO 1. Define the explanatory variable as the independent variable (predictor), and the response variable

More information

Q1: What is the interpretation of the number 4.1? A: There were 4.1 million visits to ER by people 85 and older, Q2: What percent of people 65-74

Q1: What is the interpretation of the number 4.1? A: There were 4.1 million visits to ER by people 85 and older, Q2: What percent of people 65-74 Lecture 4 This week lab:exam 1! Review lectures, practice labs 1 to 4 and homework 1 to 5!!!!! Need help? See me during my office hrs, or goto open lab or GS 211. Bring your picture ID and simple calculator.(note

More information

MATH 1301 (College Algebra) - Final Exam Review

MATH 1301 (College Algebra) - Final Exam Review MATH 1301 (College Algebra) - Final Exam Review This review is comprehensive but should not be the only material used to study for the final exam. It should not be considered a preview of the final exam.

More information

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1 9.1 Scatter Plots and Linear Correlation Answers 1. A high school psychologist wants to conduct a survey to answer the question: Is there a relationship between a student s athletic ability and his/her

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

Psych 230. Psychological Measurement and Statistics

Psych 230. Psychological Measurement and Statistics Psych 230 Psychological Measurement and Statistics Pedro Wolf December 9, 2009 This Time. Non-Parametric statistics Chi-Square test One-way Two-way Statistical Testing 1. Decide which test to use 2. State

More information

PART I. Multiple choice. 1. Find the slope of the line shown here. 2. Find the slope of the line with equation $ÐB CÑœ(B &.

PART I. Multiple choice. 1. Find the slope of the line shown here. 2. Find the slope of the line with equation $ÐB CÑœ(B &. Math 1301 - College Algebra Final Exam Review Sheet Version X This review, while fairly comprehensive, should not be the only material used to study for the final exam. It should not be considered a preview

More information

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Overview Introduction 10-1 Scatter Plots and Correlation 10- Regression 10-3 Coefficient of Determination and

More information

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression Chapter 12 12-1 North Seattle Community College BUS21 Business Statistics Chapter 12 Learning Objectives In this chapter, you learn:! How to use regression analysis to predict the value of a dependent

More information

AP Statistics Review Ch. 7

AP Statistics Review Ch. 7 AP Statistics Review Ch. 7 Name 1. Which of the following best describes what is meant by the term sampling variability? A. There are many different methods for selecting a sample. B. Two different samples

More information

11 Correlation and Regression

11 Correlation and Regression Chapter 11 Correlation and Regression August 21, 2017 1 11 Correlation and Regression When comparing two variables, sometimes one variable (the explanatory variable) can be used to help predict the value

More information

Mrs. Poyner/Mr. Page Chapter 3 page 1

Mrs. Poyner/Mr. Page Chapter 3 page 1 Name: Date: Period: Chapter 2: Take Home TEST Bivariate Data Part 1: Multiple Choice. (2.5 points each) Hand write the letter corresponding to the best answer in space provided on page 6. 1. In a statistics

More information

[ z = 1.48 ; accept H 0 ]

[ z = 1.48 ; accept H 0 ] CH 13 TESTING OF HYPOTHESIS EXAMPLES Example 13.1 Indicate the type of errors committed in the following cases: (i) H 0 : µ = 500; H 1 : µ 500. H 0 is rejected while H 0 is true (ii) H 0 : µ = 500; H 1

More information

Sem. 1 Review Ch. 1-3

Sem. 1 Review Ch. 1-3 AP Stats Sem. 1 Review Ch. 1-3 Name 1. You measure the age, marital status and earned income of an SRS of 1463 women. The number and type of variables you have measured is a. 1463; all quantitative. b.

More information

Inference for Regression Simple Linear Regression

Inference for Regression Simple Linear Regression Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

Final Exam. DO NOT SEPARATE the answer sheet from the rest of the test. Work and CIRCLE the answer to each problem INSIDE the test.

Final Exam. DO NOT SEPARATE the answer sheet from the rest of the test. Work and CIRCLE the answer to each problem INSIDE the test. Final Exam Math 114 - Statistics for Business Fall 2009 Name: Section: (circle one) 1 2 3 4 5 6 INSTRUCTIONS: This exam contains 24 problems. The first 16 are multiple-choice, and are 3 points each. Record

More information

σ. We further know that if the sample is from a normal distribution then the sampling STAT 2507 Assignment # 3 (Chapters 7 & 8)

σ. We further know that if the sample is from a normal distribution then the sampling STAT 2507 Assignment # 3 (Chapters 7 & 8) STAT 2507 Assignment # 3 (Chapters 7 & 8) DUE: Sections E, F Section G Section H Monday, March 16, in class Tuesday, March 17, in class Wednesday, March 18, in class Last Name Student # First Name Your

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

Chapter 16. Simple Linear Regression and dcorrelation

Chapter 16. Simple Linear Regression and dcorrelation Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

Business Statistics 41000: Homework # 5

Business Statistics 41000: Homework # 5 Business Statistics 41000: Homework # 5 Drew Creal Due date: Beginning of class in week # 10 Remarks: These questions cover Lectures #7, 8, and 9. Question # 1. Condence intervals and plug-in predictive

More information

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore

Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Math 223 Lecture Notes 3/15/04 From The Basic Practice of Statistics, bymoore Chapter 3 continued Describing distributions with numbers Measuring spread of data: Quartiles Definition 1: The interquartile

More information

their contents. If the sample mean is 15.2 oz. and the sample standard deviation is 0.50 oz., find the 95% confidence interval of the true mean.

their contents. If the sample mean is 15.2 oz. and the sample standard deviation is 0.50 oz., find the 95% confidence interval of the true mean. Math 1342 Exam 3-Review Chapters 7-9 HCCS **************************************************************************************** Name Date **********************************************************************************************

More information

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E

Salt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E Salt Lake Community College MATH 1040 Final Exam Fall Semester 011 Form E Name Instructor Time Limit: 10 minutes Any hand-held calculator may be used. Computers, cell phones, or other communication devices

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

The empirical ( ) rule

The empirical ( ) rule The empirical (68-95-99.7) rule With a bell shaped distribution, about 68% of the data fall within a distance of 1 standard deviation from the mean. 95% fall within 2 standard deviations of the mean. 99.7%

More information

STATS DOESN T SUCK! ~ CHAPTER 16

STATS DOESN T SUCK! ~ CHAPTER 16 SIMPLE LINEAR REGRESSION: STATS DOESN T SUCK! ~ CHAPTER 6 The HR manager at ACME food services wants to examine the relationship between a workers income and their years of experience on the job. He randomly

More information

Simple Linear Regression Using Ordinary Least Squares

Simple Linear Regression Using Ordinary Least Squares Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression

More information

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Chapter 10 Correlation and Regression McGraw-Hill, Bluman, 7th ed., Chapter 10 1 Example 10-2: Absences/Final Grades Please enter the data below in L1 and L2. The data appears on page 537 of your textbook.

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and

More information

Chapter 15: Other Regression Statistics and Pitfalls

Chapter 15: Other Regression Statistics and Pitfalls Chapter 15: Other Regression Statistics and Pitfalls Chapter 15 Outline Two-Tailed Confidence Intervals o Confidence Interval Approach: Which Theories Are Consistent with the Data? o A Confidence Interval

More information

Eco 391, J. Sandford, spring 2013 April 5, Midterm 3 4/5/2013

Eco 391, J. Sandford, spring 2013 April 5, Midterm 3 4/5/2013 Midterm 3 4/5/2013 Instructions: You may use a calculator, and one sheet of notes. You will never be penalized for showing work, but if what is asked for can be computed directly, points awarded will depend

More information

Chapter 14 Multiple Regression Analysis

Chapter 14 Multiple Regression Analysis Chapter 14 Multiple Regression Analysis 1. a. Multiple regression equation b. the Y-intercept c. $374,748 found by Y ˆ = 64,1 +.394(796,) + 9.6(694) 11,6(6.) (LO 1) 2. a. Multiple regression equation b.

More information

AP Statistics - Chapter 2A Extra Practice

AP Statistics - Chapter 2A Extra Practice AP Statistics - Chapter 2A Extra Practice 1. A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study is A) yield

More information

Chapter 3: Examining Relationships

Chapter 3: Examining Relationships Chapter 3 Review Chapter 3: Examining Relationships 1. A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study

More information

EX1. One way ANOVA: miles versus Plug. a) What are the hypotheses to be tested? b) What are df 1 and df 2? Verify by hand. , y 3

EX1. One way ANOVA: miles versus Plug. a) What are the hypotheses to be tested? b) What are df 1 and df 2? Verify by hand. , y 3 EX. Chapter 8 Examples In an experiment to investigate the performance of four different brands of spark plugs intended for the use on a motorcycle, plugs of each brand were tested and the number of miles

More information

STAT 512 MidTerm I (2/21/2013) Spring 2013 INSTRUCTIONS

STAT 512 MidTerm I (2/21/2013) Spring 2013 INSTRUCTIONS STAT 512 MidTerm I (2/21/2013) Spring 2013 Name: Key INSTRUCTIONS 1. This exam is open book/open notes. All papers (but no electronic devices except for calculators) are allowed. 2. There are 5 pages in

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

STAT 212 Business Statistics II 1

STAT 212 Business Statistics II 1 STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb

More information

SMAM 314 Practice Final Examination Winter 2003

SMAM 314 Practice Final Examination Winter 2003 SMAM 314 Practice Final Examination Winter 2003 You may use your textbook, one page of notes and a calculator. Please hand in the notes with your exam. 1. Mark the following statements True T or False

More information

SMAM 314 Exam 42 Name

SMAM 314 Exam 42 Name SMAM 314 Exam 42 Name Mark the following statements True (T) or False (F) (10 points) 1. F A. The line that best fits points whose X and Y values are negatively correlated should have a positive slope.

More information

Math 2000 Practice Final Exam: Homework problems to review. Problem numbers

Math 2000 Practice Final Exam: Homework problems to review. Problem numbers Math 2000 Practice Final Exam: Homework problems to review Pages: Problem numbers 52 20 65 1 181 14 189 23, 30 245 56 256 13 280 4, 15 301 21 315 18 379 14 388 13 441 13 450 10 461 1 553 13, 16 561 13,

More information

Statistics and Quantitative Analysis U4320

Statistics and Quantitative Analysis U4320 Statistics and Quantitative Analysis U3 Lecture 13: Explaining Variation Prof. Sharyn O Halloran Explaining Variation: Adjusted R (cont) Definition of Adjusted R So we'd like a measure like R, but one

More information

Simple Linear Regression: One Qualitative IV

Simple Linear Regression: One Qualitative IV Simple Linear Regression: One Qualitative IV 1. Purpose As noted before regression is used both to explain and predict variation in DVs, and adding to the equation categorical variables extends regression

More information

Chapter 3 Multiple Regression Complete Example

Chapter 3 Multiple Regression Complete Example Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be

More information

Examining Relationships. Chapter 3

Examining Relationships. Chapter 3 Examining Relationships Chapter 3 Scatterplots A scatterplot shows the relationship between two quantitative variables measured on the same individuals. The explanatory variable, if there is one, is graphed

More information

University of California, Berkeley, Statistics 131A: Statistical Inference for the Social and Life Sciences. Michael Lugo, Spring 2012

University of California, Berkeley, Statistics 131A: Statistical Inference for the Social and Life Sciences. Michael Lugo, Spring 2012 University of California, Berkeley, Statistics 3A: Statistical Inference for the Social and Life Sciences Michael Lugo, Spring 202 Solutions to Exam Friday, March 2, 202. [5: 2+2+] Consider the stemplot

More information

Sociology 593 Exam 2 Answer Key March 28, 2002

Sociology 593 Exam 2 Answer Key March 28, 2002 Sociology 59 Exam Answer Key March 8, 00 I. True-False. (0 points) Indicate whether the following statements are true or false. If false, briefly explain why.. A variable is called CATHOLIC. This probably

More information

SMAM 314 Exam 49 Name. 1.Mark the following statements true or false (10 points-2 each)

SMAM 314 Exam 49 Name. 1.Mark the following statements true or false (10 points-2 each) SMAM 314 Exam 49 Name 1.Mark the following statements true or false (10 points-2 each) _F A. When fitting a least square equation it is necessary that the observations come from a normal distribution.

More information

Intro to Linear Regression

Intro to Linear Regression Intro to Linear Regression Introduction to Regression Regression is a statistical procedure for modeling the relationship among variables to predict the value of a dependent variable from one or more predictor

More information

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December

More information

2.57 when the critical value is 1.96, what decision should be made?

2.57 when the critical value is 1.96, what decision should be made? Math 1342 Ch. 9-10 Review Name SHORT ANSWER. Write the word or phrase that best completes each statement or answers the question. 9.1 1) If the test value for the difference between the means of two large

More information

Statistics for IT Managers

Statistics for IT Managers Statistics for IT Managers 95-796, Fall 2012 Module 2: Hypothesis Testing and Statistical Inference (5 lectures) Reading: Statistics for Business and Economics, Ch. 5-7 Confidence intervals Given the sample

More information

1. Classify each number. Choose all correct answers. b. È # : (i) natural number (ii) integer (iii) rational number (iv) real number

1. Classify each number. Choose all correct answers. b. È # : (i) natural number (ii) integer (iii) rational number (iv) real number Review for Placement Test To ypass Math 1301 College Algebra Department of Computer and Mathematical Sciences University of Houston-Downtown Revised: Fall 2009 PLEASE READ THE FOLLOWING CAREFULLY: 1. The

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Saturday, May 9, 008 Examination time: 3

More information

MGEC11H3Y L01 Introduction to Regression Analysis Term Test Friday July 5, PM Instructor: Victor Yu

MGEC11H3Y L01 Introduction to Regression Analysis Term Test Friday July 5, PM Instructor: Victor Yu Last Name (Print): Solution First Name (Print): Student Number: MGECHY L Introduction to Regression Analysis Term Test Friday July, PM Instructor: Victor Yu Aids allowed: Time allowed: Calculator and one

More information

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Remember: Z.05 = 1.645, Z.01 = 2.33 We will only cover one-sided hypothesis testing (cases 12.3, 12.4.2, 12.5.2,

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of

More information

Simple Linear Regression

Simple Linear Regression CHAPTER 13 Simple Linear Regression CHAPTER OUTLINE 13.1 Simple Linear Regression Analysis 13.2 Using Excel s built-in Regression tool 13.3 Linear Correlation 13.4 Hypothesis Tests about the Linear Correlation

More information

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except in problem 1. Work neatly.

The point value of each problem is in the left-hand margin. You must show your work to receive any credit, except in problem 1. Work neatly. Introduction to Statistics Math 1040 Sample Final Exam - Chapters 1-11 6 Problem Pages Time Limit: 1 hour and 50 minutes Open Textbook Calculator Allowed: Scientific Name: The point value of each problem

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

STAT 350 Final (new Material) Review Problems Key Spring 2016

STAT 350 Final (new Material) Review Problems Key Spring 2016 1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,

More information

Correlation and Regression

Correlation and Regression Correlation and Regression October 25, 2017 STAT 151 Class 9 Slide 1 Outline of Topics 1 Associations 2 Scatter plot 3 Correlation 4 Regression 5 Testing and estimation 6 Goodness-of-fit STAT 151 Class

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3 Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details Section 10.1, 2, 3 Basic components of regression setup Target of inference: linear dependency

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 28. SIMPLE LINEAR REGRESSION III Fitted Values and Residuals To each observed x i, there corresponds a y-value on the fitted line, y = βˆ + βˆ x. The are called fitted values. ŷ i They are the values of

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 24, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

Urban Revival in America

Urban Revival in America Urban Revival in America Victor Couture 1 Jessie Handbury 2 1 University of California, Berkeley 2 University of Pennsylvania and NBER May 2016 1 / 23 Objectives 1. Document the recent revival of America

More information

Inferences About Two Proportions

Inferences About Two Proportions Inferences About Two Proportions Quantitative Methods II Plan for Today Sampling two populations Confidence intervals for differences of two proportions Testing the difference of proportions Examples 1

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

AMS 7 Correlation and Regression Lecture 8

AMS 7 Correlation and Regression Lecture 8 AMS 7 Correlation and Regression Lecture 8 Department of Applied Mathematics and Statistics, University of California, Santa Cruz Suumer 2014 1 / 18 Correlation pairs of continuous observations. Correlation

More information

Econometrics Homework 1

Econometrics Homework 1 Econometrics Homework Due Date: March, 24. by This problem set includes questions for Lecture -4 covered before midterm exam. Question Let z be a random column vector of size 3 : z = @ (a) Write out z

More information

Regression and Correlation

Regression and Correlation Regression and Correlation Raja Almukkahal Larry Ottman Danielle DeLancey Addie Evans Ellen Lawsky Brenda Meery Say Thanks to the Authors Click http://www.ck12.org/saythanks (No sign in required) To access

More information

UNIT 12 ~ More About Regression

UNIT 12 ~ More About Regression ***SECTION 15.1*** The Regression Model When a scatterplot shows a relationship between a variable x and a y, we can use the fitted to the data to predict y for a given value of x. Now we want to do tests

More information

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Correlation. A statistics method to measure the relationship between two variables. Three characteristics Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction

More information

Chapter 14 Student Lecture Notes 14-1

Chapter 14 Student Lecture Notes 14-1 Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this

More information

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Instructor Name Time Limit: 120 minutes Any calculator is okay. Necessary tables and formulas are attached to the back of the exam.

More information

Basic Statistics Exercises 66

Basic Statistics Exercises 66 Basic Statistics Exercises 66 42. Suppose we are interested in predicting a person's height from the person's length of stride (distance between footprints). The following data is recorded for a random

More information

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define

More information

y n 1 ( x i x )( y y i n 1 i y 2

y n 1 ( x i x )( y y i n 1 i y 2 STP3 Brief Class Notes Instructor: Ela Jackiewicz Chapter Regression and Correlation In this chapter we will explore the relationship between two quantitative variables, X an Y. We will consider n ordered

More information

20. Ignore the common effect question (the first one). Makes little sense in the context of this question.

20. Ignore the common effect question (the first one). Makes little sense in the context of this question. Errors & Omissions Free Response: 8. Change points to (0, 0), (1, 2), (2, 1) 14. Change place to case. 17. Delete the word other. 20. Ignore the common effect question (the first one). Makes little sense

More information

WISE International Masters

WISE International Masters WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are

More information

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).

Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,

More information

Math: Question 1 A. 4 B. 5 C. 6 D. 7

Math: Question 1 A. 4 B. 5 C. 6 D. 7 Math: Question 1 Abigail can read 200 words in one minute. If she were to read at this rate for 30 minutes each day, how many days would Abigail take to read 30,000 words of a book? A. 4 B. 5 C. 6 D. 7

More information

Intro to Linear Regression

Intro to Linear Regression Intro to Linear Regression Introduction to Regression Regression is a statistical procedure for modeling the relationship among variables to predict the value of a dependent variable from one or more predictor

More information

Chapter 3: Examining Relationships Review Sheet

Chapter 3: Examining Relationships Review Sheet Review Sheet 1. A study is conducted to determine if one can predict the yield of a crop based on the amount of yearly rainfall. The response variable in this study is A) the yield of the crop. D) either

More information