STAT-UB.0103 Exam APRIL.11 SQUARE Version Solutions
|
|
- Madlyn Barrett
- 5 years ago
- Views:
Transcription
1 STAT-UB.0103 Exam 01.APIL.11 SQUAE Version Solutions S1. Jason Harter is a professional fund raiser for charities. He s currently working with the Pets--Luv animal shelter. The operating account for Pets--Luv currently is at $80,000. It seems reasonable to assume that the daily changes in this account will be random with mean $3,000 and with standard deviation $,400. The daily changes involve both donations (positive) and expenses, including Jason s fees (negative). What is the probability that in exactly ten days of fund-raising, this account will reach or exceed $100,000? (a) [5 points] Please state any assumptions that are helpful in doing this work. (b)[15 points] Find the actual probability, using the numbers above and your assumptions in (a), that the account will reach or exceed $100,000 in ten days. SOLUTION: For (a) there are two critical assumptions. You ll need to assume that the daily values are independent of each other. You ll also need to assume that the daily changes come from a normal population. The sample of n = 10 is not officially enough to invoke the Central Limit theorem. For (b), let T be the total for 10 days. Let 0 = $80,000 be the current account, and let 10 be the account after 10 days. Observe that 10 = 0 + T. The mean of the distribution of T is 10 $3,000 = $30,000, and the standard deviation of the distribution of T is $, $7, Then P[ 10 $100,000 ] = P[ 0 + T $100,000 ] = P[ T $0,000 ] = T $30, 000 $0, 000 $30, 000 P $7, $7, P[ Z ] = P[ 0 Z ] P[ 0 Z 1.3 ] = = Software will get the slightly more precise answer This charity has a very good chance of reaching the $100,000 objective. S. The output below, based on a sample of used cars, shows the regression of mileage on year of manufacture. Commas were inserted in large numbers to improve readability. Several of the positions are blank. Please supply the missing values. 1 gs 01
2 STAT-UB.0103 Exam 01.APIL.11 SQUAE Version Solutions egression Analysis: Mileage versus Year The regression equation is Mileage = 0,063,597 + Year (a) Predictor Coef SE Coef T P Constant 0,063,597 1,046, Year -10, (b) S = -Sq = -Sq() = (c) (d) (e) Analysis of Variance Source DF SS MS F P egression 1 195,870,000, ,870,000, (f) esidual Error 536,183,115 (g) (h) Total 17 87,557,000,000 SOLUTION: (a) You can copy the slope from the listing below. It s -10,015.. (b) Coef Since t = SE Coef, this is 10, (c) The value in this position is s ε and it s recovered as MS esidual = 536,183,115 (d) (e) 3,156. The value is found as The SS egression SS Total value can be found as either = 195,870,000,000 87,557,000,000 1 s ε s Y To use the first formula, you ll need s Y = = 68.1%. n 1 or as 1 ( 1 ) SSTotal n 1 = 87,557, 000, s ε 3,156 40,888. Then 1 = 1 s Y 40, = 67.93%. To use the second formula, note that = 1 in a simple regression. This 17 gives 1 ( ) gs 01
3 STAT-UB.0103 Exam 01.APIL.11 SQUAE Version Solutions (f) Find F = MS MS egression esidual = 195,870,000, ,183, Since this is a one-predictor regression, you can also get this as t = (-19.11) (g) This is 171. (h) This is obtained directly as the difference SS Total SS egression = 87,557,000, ,870,000,000 = 91,687,000,000. You could also get this SSesidual from the relation MS esidual =. Thus SS esidual = 536,183, ,687,31,665. The complete listing follows. Some of the results differ slightly because values were reconstructed from rounded information. egression Analysis: Mileage versus Year The regression equation is Mileage = 0,063, Year Predictor Coef SE Coef T P Constant 0,063,597 1,046, Year -10, S = 3, Sq = 68.1% -Sq() = 67.9% Analysis of Variance Source DF SS MS F P egression 1 195,870,000, ,870,000, esidual Error ,687,31, ,183,115 Total 17 87,557,000,000 S3. Frank Tanner is the lab manager at TazerTek, a firm that develops games for cell phones, smart phones, ipads, and similar devices. He has been asked to estimate the solution time for a potential new game Angry Pigs. The subjects will be female high school students for whom the typical solution times for games of this style is 6.4 minutes, with a standard deviation of 1.8 minutes. How many students should Frank request if he d like his sample average to be within 1 3 of a minute (0 seconds) of the true average with a probability of at least 80%? zα/ σ SOLUTION: This is governed by the formula n E. Frank should use E = (target error limit), σ = 1.8 (plausible standard deviation), z α/ = 1.8 (corresponding to 1 α = 0.80 and α/ = 0.10). This gets to n So it looks like Frank should ask for 48 subjects. 3 gs 01
4 STAT-UB.0103 Exam 01.APIL.11 SQUAE Version Solutions It s possible that Frank may want to be a little more conservative about the assessed standard deviation. After all, the 1.8 minutes was obtained with slightly different games. Using σ =.5, say, would lead to a request for 93 subjects. On the other hand, it s very easy to find subjects and inexpensive to run the subjects through the experiment. S4. The regression of Z on H gave this Minitab output: egression Analysis: Z versus H The regression equation is Z = H Predictor Coef SE Coef T P Constant H S = Sq = 33.6% -Sq() = 3.1% Analysis of Variance Source DF SS MS F P egression esidual Error Total Please answer T (true) or F (false) to each of the following. (a) The regression slope would be regarded as not statistically significant. (b) The correlation between the variables H and Z is negative. (c) Most of the residuals are between -5 and +5. (d) The total sum of squares, namely 1,097.15, provides solid evidence of the effect of regression to mediocrity. (e) It is somewhat surprising that <. (f) The data provide convincing evidence that increasing H by one unit will cause a decrease in Z of 1.0 units. (g) The fit would be appraised as poor, because SS esidual Error > SS egression. (h) A new data point with H new = 10 would lead to a prediction of Z new = = (i) If all the data values (meaning all the H i s and all the Z i s) were doubled, the slope would remain unchanged. (j) If all the data values (meaning all the H i s and all the Z i s) were doubled, then the F statistic would be SOLUTION: (a) is false. (b) is true. The t statistic for the regression is -4.77, which is easily significant. The correlation coefficient has the same sign as b 1, the slope estimate. Since b 1 = -1.0, we know that the correlation coefficient must also be negative. 4 gs 01
5 STAT-UB.0103 Exam 01.APIL.11 SQUAE Version Solutions (c) is true. (d) is false. (e) is false. (f) is false. (g) is false. (h) is true. (i) is true. (j) is false. The residuals, as a set of numeric values, have mean zero and standard deviation s ε = Thus about 3 of the residuals will be in (-4.03, 4.03). It follows that considerably more than 3 of the residuals are in (-5, 5). The two issues here are completely unrelated. There s no surprise. This is what usually happens. There is absolutely nothing here to suggest causation. The relative sizes of these, as measured through the F statistic, is what really matters. That s exactly how a regression is to be used. The slope would not change. The F statistic would not change. S5. The following printout concerns a best subsets regression. Questions follow. esponse is ZP 139 cases used. D B e E A a p x r k C r p m e l e e o r a c r Vars -Sq -Sq() C-p S r y y t t X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X (a) What is the dependent variable? (b) Give the names of the independent variables. (c) If you had to select the best two-predictor model, which predictors would you use? (d) Which is the best set of predictors to use? Indicate your reasons. 5 gs 01
6 STAT-UB.0103 Exam 01.APIL.11 SQUAE Version Solutions SOLUTIONS: (a) The dependent variable is ZP. (b) There are five independent variables. These are Armor, Bakery, Clay, Deprect, and Expert. (c) The best two-predictor model uses Armor and Clay. (d) The best model, using the C p criterion, clearly uses Armor, Clay, Deprect, Expert (all but Bakery). If you focus on or on s ε, you could make a case for Armor, Clay, Deprect. S6. Each workday, Monday through Friday, a truckload of scrap metal is delivered from ound Two ecycling to Metals Plus. The payment is based on the weights delivered. Based on a long history, the daily weights have a mean of 14.6 tons, with a standard deviation of 1.6 tons. The mean of the previous 40 deliveries was 15.0 tons. What is the probability that, by chance alone, the mean of 40 deliveries would exceed 15.0 tons? SOLUTION: Let X 1, X,, X 40 be random variables representing the daily delivery weights. We should assume that these values are independent, each from a population with mean µ = 14.6 and σ = 1.6. It s not critical to assume that the population values follow a normal distribution. The sample average X will then have a distribution which is approximately normal with mean 14.6 and with standard deviation It follows then that X P[ X 15.0 ] = P P[ Z ] P[ Z 1.58 ] = 0.50 P[ 0 Z < 1.58 ] = = You could get a slightly more refined answer from software (or from a calculator that has a cumulative normal function); this would be For each of the following, please respond either impossible or could happen. You may use abbreviations I and CH. As illustrations, In a sample of size 15, the correlation between X and Y was 1.. The sample x 1, x,, x n produced an average x = impossible could happen (a) (b) (c) In a multiple regression with four predictors, it was found that >. In a multiple regression with four predictors, it was found that SS egression < SS esidual Error. In the regression of W on Z, the fitted regression line passed through the point ( Z, W ). 6 gs 01
7 STAT-UB.0103 Exam 01.APIL.11 SQUAE Version Solutions (d) Variables H and W have a correlation r HW = 0.48, and the linear regression of H on W produced a slope of (e) In a simple linear regression of Y on X, the value of s ε can be a larger number than the standard deviation of the independent variable X. (f) The regression of Y on {G, H, J} had an value that is less than the value from the regression of Y on {G, H}. (g) In a linear regression with n = 3, all the residuals were negative. (h) In a very strong multiple regression, the F statistic was found as F = 1,810. (i) In a regression of Y on X, the slope was b 1 = 0.8, the correlation was r = 0., and the standard deviation of the x-values was 1,80. (j) In a set of n = 59 values, the correlation was r X, Y = 0.9. Steve did the regression of Y on X and got the slope b 1 (Y on X) = 0.5, while Angela did the regression of X on Y and got the slope b 1 (X on Y) = SOLUTION: (a) Impossible. It always happens that n 1 = 1 ( 1 ) which shows that 1 < 1 <. This can be seen in the formula n 1. Just write as ( ). (b) Could happen. The statement is equivalent to < = 1 (c) Could happen. In fact, it has to happen as a simple consequence of b 0 = W - b 1 Z. (d) Impossible. The slope and correlation must have the same sign. (e) Could happen. Of course s ε and s X can only be compared in those cases in which Y and X are in the same units. (f) Impossible. As the set of predictors is enlarged, can only increase. (g) Impossible. The residuals must sum to zero. (h) Could happen. You will actually see monster values for F now and then. (i) Could happen. There are no contradictions in these statements. (j) Impossible. Observe that b 1 (Y on X) = Sxy Sxy and b 1 (X on Y) = S S. These are not reciprocals of each other, except in the very special case = But having = 1.00 means that either r = +1 or r = -1, which is not the story here. xx yy, 7 gs 01
O2. The following printout concerns a best subsets regression. Questions follow.
STAT-UB.0103 Exam 01.APIL.11 OVAL Version Solutions O1. Frank Tanner is the lab manager at BioVigor, a firm that runs studies for agricultural food supplements. He has been asked to design a protocol for
More informationApart from this page, you are not permitted to read the contents of this question paper until instructed to do so by an invigilator.
B. Sc. Examination by course unit 2014 MTH5120 Statistical Modelling I Duration: 2 hours Date and time: 16 May 2014, 1000h 1200h Apart from this page, you are not permitted to read the contents of this
More informationFinal Exam Bus 320 Spring 2000 Russell
Name Final Exam Bus 320 Spring 2000 Russell Do not turn over this page until you are told to do so. You will have 3 hours minutes to complete this exam. The exam has a total of 100 points and is divided
More informationSMAM 319 Exam 1 Name. 1.Pick the best choice for the multiple choice questions below (10 points 2 each)
SMAM 319 Exam 1 Name 1.Pick the best choice for the multiple choice questions below (10 points 2 each) A b In Metropolis there are some houses for sale. Superman and Lois Lane are interested in the average
More informationCh 13 & 14 - Regression Analysis
Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more
More informationSMAM 314 Practice Final Examination Winter 2003
SMAM 314 Practice Final Examination Winter 2003 You may use your textbook, one page of notes and a calculator. Please hand in the notes with your exam. 1. Mark the following statements True T or False
More informationAnalysis of Bivariate Data
Analysis of Bivariate Data Data Two Quantitative variables GPA and GAES Interest rates and indices Tax and fund allocation Population size and prison population Bivariate data (x,y) Case corr® 2 Independent
More informationUNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test, October 2013
UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test, October 2013 STAC67H3 Regression Analysis Duration: One hour and fifty minutes Last Name: First Name: Student
More informationSTAT 212 Business Statistics II 1
STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb
More informationChapter 26 Multiple Regression, Logistic Regression, and Indicator Variables
Chapter 26 Multiple Regression, Logistic Regression, and Indicator Variables 26.1 S 4 /IEE Application Examples: Multiple Regression An S 4 /IEE project was created to improve the 30,000-footlevel metric
More informationModel Building Chap 5 p251
Model Building Chap 5 p251 Models with one qualitative variable, 5.7 p277 Example 4 Colours : Blue, Green, Lemon Yellow and white Row Blue Green Lemon Insects trapped 1 0 0 1 45 2 0 0 1 59 3 0 0 1 48 4
More informationBusiness 320, Fall 1999, Final
Business 320, Fall 1999, Final name You may use a calculator and two cheat sheets. You have 3 hours. I pledge my honor that I have not violated the Honor Code during this examination. Obvioiusly, you may
More information(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.
FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December
More informationOregon Hill Wireless Survey Regression Model and Statistical Evaluation. Sky Huvard
Oregon Hill Wireless Survey Regression Model and Statistical Evaluation Sky Huvard Business Statistics Dr. George Canavos 4 May 2003 Overview Huvard 2 I am interested in starting a wireless broadband project
More informationAMS 315/576 Lecture Notes. Chapter 11. Simple Linear Regression
AMS 315/576 Lecture Notes Chapter 11. Simple Linear Regression 11.1 Motivation A restaurant opening on a reservations-only basis would like to use the number of advance reservations x to predict the number
More informationThis document contains 3 sets of practice problems.
P RACTICE PROBLEMS This document contains 3 sets of practice problems. Correlation: 3 problems Regression: 4 problems ANOVA: 8 problems You should print a copy of these practice problems and bring them
More informationAP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation
Scatterplots and Correlation Name Hr A scatterplot shows the relationship between two quantitative variables measured on the same individuals. variable (y) measures an outcome of a study variable (x) may
More informationPredict y from (possibly) many predictors x. Model Criticism Study the importance of columns
Lecture Week Multiple Linear Regression Predict y from (possibly) many predictors x Including extra derived variables Model Criticism Study the importance of columns Draw on Scientific framework Experiment;
More informationChapter 1. Linear Regression with One Predictor Variable
Chapter 1. Linear Regression with One Predictor Variable 1.1 Statistical Relation Between Two Variables To motivate statistical relationships, let us consider a mathematical relation between two mathematical
More informationTMA4255 Applied Statistics V2016 (5)
TMA4255 Applied Statistics V2016 (5) Part 2: Regression Simple linear regression [11.1-11.4] Sum of squares [11.5] Anna Marie Holand To be lectured: January 26, 2016 wiki.math.ntnu.no/tma4255/2016v/start
More informationMULTIPLE LINEAR REGRESSION IN MINITAB
MULTIPLE LINEAR REGRESSION IN MINITAB This document shows a complicated Minitab multiple regression. It includes descriptions of the Minitab commands, and the Minitab output is heavily annotated. Comments
More informationSMAM 314 Computer Assignment 5 due Nov 8,2012 Data Set 1. For each of the following data sets use Minitab to 1. Make a scatterplot.
SMAM 314 Computer Assignment 5 due Nov 8,2012 Data Set 1. For each of the following data sets use Minitab to 1. Make a scatterplot. 2. Fit the linear regression line. Regression Analysis: y versus x y
More information1 Least Squares Estimation - multiple regression.
Introduction to multiple regression. Fall 2010 1 Least Squares Estimation - multiple regression. Let y = {y 1,, y n } be a n 1 vector of dependent variable observations. Let β = {β 0, β 1 } be the 2 1
More informationSix Sigma Black Belt Study Guides
Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships
More informationINFERENCE FOR REGRESSION
CHAPTER 3 INFERENCE FOR REGRESSION OVERVIEW In Chapter 5 of the textbook, we first encountered regression. The assumptions that describe the regression model we use in this chapter are the following. We
More informationQuestion Possible Points Score Total 100
Midterm I NAME: Instructions: 1. For hypothesis testing, the significant level is set at α = 0.05. 2. This exam is open book. You may use textbooks, notebooks, and a calculator. 3. Do all your work in
More informationCorrelation & Simple Regression
Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.
More informationMultiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company
Multiple Regression Inference for Multiple Regression and A Case Study IPS Chapters 11.1 and 11.2 2009 W.H. Freeman and Company Objectives (IPS Chapters 11.1 and 11.2) Multiple regression Data for multiple
More informationBasic Business Statistics 6 th Edition
Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based
More information1. An article on peanut butter in Consumer reports reported the following scores for various brands
SMAM 314 Review Exam 1 1. An article on peanut butter in Consumer reports reported the following scores for various brands Creamy 56 44 62 36 39 53 50 65 45 40 56 68 41 30 40 50 50 56 65 56 45 40 Crunchy
More informationECON 497 Midterm Spring
ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain
More informationLINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises
LINEAR REGRESSION ANALYSIS MODULE XVI Lecture - 44 Exercises Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Exercise 1 The following data has been obtained on
More informationSimple Linear Regression Analysis
LINEAR REGRESSION ANALYSIS MODULE II Lecture - 6 Simple Linear Regression Analysis Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Prediction of values of study
More informationPre-Calculus Multiple Choice Questions - Chapter S8
1 If every man married a women who was exactly 3 years younger than he, what would be the correlation between the ages of married men and women? a Somewhat negative b 0 c Somewhat positive d Nearly 1 e
More information28. SIMPLE LINEAR REGRESSION III
28. SIMPLE LINEAR REGRESSION III Fitted Values and Residuals To each observed x i, there corresponds a y-value on the fitted line, y = βˆ + βˆ x. The are called fitted values. ŷ i They are the values of
More informationChapter 12 - Lecture 2 Inferences about regression coefficient
Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous
More informationMultiple Linear Regression
Andrew Lonardelli December 20, 2013 Multiple Linear Regression 1 Table Of Contents Introduction: p.3 Multiple Linear Regression Model: p.3 Least Squares Estimation of the Parameters: p.4-5 The matrix approach
More informationMultiple Regression Examples
Multiple Regression Examples Example: Tree data. we have seen that a simple linear regression of usable volume on diameter at chest height is not suitable, but that a quadratic model y = β 0 + β 1 x +
More informationInference for Regression Inference about the Regression Model and Using the Regression Line
Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about
More informationStatistics 100 Exam 2 March 8, 2017
STAT 100 EXAM 2 Spring 2017 (This page is worth 1 point. Graded on writing your name and net id clearly and circling section.) PRINT NAME (Last name) (First name) net ID CIRCLE SECTION please! L1 (MWF
More informationusing the beginning of all regression models
Estimating using the beginning of all regression models 3 examples Note about shorthand Cavendish's 29 measurements of the earth's density Heights (inches) of 14 11 year-old males from Alberta study Half-life
More information1 Introduction to Minitab
1 Introduction to Minitab Minitab is a statistical analysis software package. The software is freely available to all students and is downloadable through the Technology Tab at my.calpoly.edu. When you
More informationMULTIPLE REGRESSION METHODS
DEPARTMENT OF POLITICAL SCIENCE AND INTERNATIONAL RELATIONS Posc/Uapp 816 MULTIPLE REGRESSION METHODS I. AGENDA: A. Residuals B. Transformations 1. A useful procedure for making transformations C. Reading:
More informationMath Section MW 1-2:30pm SR 117. Bekki George 206 PGH
Math 3339 Section 21155 MW 1-2:30pm SR 117 Bekki George bekki@math.uh.edu 206 PGH Office Hours: M 11-12:30pm & T,TH 10:00 11:00 am and by appointment Linear Regression (again) Consider the relationship
More informationMultiple Regression Methods
Chapter 1: Multiple Regression Methods Hildebrand, Ott and Gray Basic Statistical Ideas for Managers Second Edition 1 Learning Objectives for Ch. 1 The Multiple Linear Regression Model How to interpret
More informationPh.D. Preliminary Examination Statistics June 2, 2014
Ph.D. Preliminary Examination Statistics June, 04 NOTES:. The exam is worth 00 points.. Partial credit may be given for partial answers if possible.. There are 5 pages in this exam paper. I have neither
More informationSTAT 212: BUSINESS STATISTICS II Third Exam Tuesday Dec 12, 6:00 PM
STAT212_E3 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICS & STATISTICS Term 171 Page 1 of 9 STAT 212: BUSINESS STATISTICS II Third Exam Tuesday Dec 12, 2017 @ 6:00 PM Name: ID #:
More informationACOVA and Interactions
Chapter 15 ACOVA and Interactions Analysis of covariance (ACOVA) incorporates one or more regression variables into an analysis of variance. As such, we can think of it as analogous to the two-way ANOVA
More informationPART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable,
Concordia University Department of Mathematics and Statistics Course Number Section Statistics 360/2 01 Examination Date Time Pages Final December 2002 3 hours 6 Instructors Course Examiner Marks Y.P.
More informationInteraction effects for continuous predictors in regression modeling
Interaction effects for continuous predictors in regression modeling Testing for interactions The linear regression model is undoubtedly the most commonly-used statistical model, and has the advantage
More informationSTAT763: Applied Regression Analysis. Multiple linear regression. 4.4 Hypothesis testing
STAT763: Applied Regression Analysis Multiple linear regression 4.4 Hypothesis testing Chunsheng Ma E-mail: cma@math.wichita.edu 4.4.1 Significance of regression Null hypothesis (Test whether all β j =
More informationExamination paper for TMA4255 Applied statistics
Department of Mathematical Sciences Examination paper for TMA4255 Applied statistics Academic contact during examination: Anna Marie Holand Phone: 951 38 038 Examination date: 16 May 2015 Examination time
More information(1) The explanatory or predictor variables may be qualitative. (We ll focus on examples where this is the case.)
Introduction to Analysis of Variance Analysis of variance models are similar to regression models, in that we re interested in learning about the relationship between a dependent variable (a response)
More informationExamination paper for TMA4255 Applied statistics
Department of Mathematical Sciences Examination paper for TMA4255 Applied statistics Academic contact during examination: Nikolai Ushakov Phone: 45128897 Examination date: 02 June 2018 Examination time
More informationStat 328 Final Exam (Regression) Summer 2002 Professor Vardeman
Stat Final Exam (Regression) Summer Professor Vardeman This exam concerns the analysis of 99 salary data for n = offensive backs in the NFL (This is a part of the larger data set that serves as the basis
More informationIntro to Linear Regression
Intro to Linear Regression Introduction to Regression Regression is a statistical procedure for modeling the relationship among variables to predict the value of a dependent variable from one or more predictor
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More informationSMAM 319 Exam1 Name. a B.The equation of a line is 3x + y =6. The slope is a. -3 b.3 c.6 d.1/3 e.-1/3
SMAM 319 Exam1 Name 1. Pick the best choice. (10 points-2 each) _c A. A data set consisting of fifteen observations has the five number summary 4 11 12 13 15.5. For this data set it is definitely true
More informationSMAM 314 Exam 42 Name
SMAM 314 Exam 42 Name Mark the following statements True (T) or False (F) (10 points) 1. F A. The line that best fits points whose X and Y values are negatively correlated should have a positive slope.
More information*Karle Laska s Sections: There is no class tomorrow and Friday! Have a good weekend! Scores will be posted in Compass early Friday morning
STATISTICS 100 EXAM 3 Spring 2016 PRINT NAME (Last name) (First name) *NETID CIRCLE SECTION: Laska MWF L1 Laska Tues/Thurs L2 Robin Tu Write answers in appropriate blanks. When no blanks are provided CIRCLE
More informationIntro to Linear Regression
Intro to Linear Regression Introduction to Regression Regression is a statistical procedure for modeling the relationship among variables to predict the value of a dependent variable from one or more predictor
More informationResearch Design - - Topic 13a Split Plot Design with Either a Continuous or Categorical Between Subjects Factor 2008 R.C. Gardner, Ph.D.
esearch Design - - Topic 3a Split Plot Design with Either a ontinuous or ategorical etween Subjects Factor 8.. Gardner Ph.D. General Description Example Purpose Two ategorical Factors Using Multiple egression
More informationSchool of Mathematical Sciences. Question 1. Best Subsets Regression
School of Mathematical Sciences MTH5120 Statistical Modelling I Practical 9 and Assignment 8 Solutions Question 1 Best Subsets Regression Response is Crime I n W c e I P a n A E P U U l e Mallows g E P
More information2.4.3 Estimatingσ Coefficient of Determination 2.4. ASSESSING THE MODEL 23
2.4. ASSESSING THE MODEL 23 2.4.3 Estimatingσ 2 Note that the sums of squares are functions of the conditional random variables Y i = (Y X = x i ). Hence, the sums of squares are random variables as well.
More informationQ1: What is the interpretation of the number 4.1? A: There were 4.1 million visits to ER by people 85 and older, Q2: What percent of people 65-74
Lecture 4 This week lab:exam 1! Review lectures, practice labs 1 to 4 and homework 1 to 5!!!!! Need help? See me during my office hrs, or goto open lab or GS 211. Bring your picture ID and simple calculator.(note
More informationQ Lecture Introduction to Regression
Q3 2009 1 Before/After Transformation 2 Construction Role of T-ratios Formally, even under Null Hyp: H : 0, ˆ, being computed from k t k SE ˆ ˆ y values themselves containing random error, will sometimes
More informationSTAT 350 Final (new Material) Review Problems Key Spring 2016
1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,
More informationTHE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS
THE ROYAL STATISTICAL SOCIETY 008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS The Society provides these solutions to assist candidates preparing for the examinations
More informationDescribing distributions with numbers
Describing distributions with numbers A large number or numerical methods are available for describing quantitative data sets. Most of these methods measure one of two data characteristics: The central
More informationSection 4: Multiple Linear Regression
Section 4: Multiple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 The Multiple Regression
More information23. Inference for regression
23. Inference for regression The Practice of Statistics in the Life Sciences Third Edition 2014 W. H. Freeman and Company Objectives (PSLS Chapter 23) Inference for regression The regression model Confidence
More informationEXAM IN TMA4255 EXPERIMENTAL DESIGN AND APPLIED STATISTICAL METHODS
Norges teknisk naturvitenskapelige universitet Institutt for matematiske fag Side 1 av 8 Contact during exam: Bo Lindqvist Tel. 975 89 418 EXAM IN TMA4255 EXPERIMENTAL DESIGN AND APPLIED STATISTICAL METHODS
More informationModels with qualitative explanatory variables p216
Models with qualitative explanatory variables p216 Example gen = 1 for female Row gpa hsm gen 1 3.32 10 0 2 2.26 6 0 3 2.35 8 0 4 2.08 9 0 5 3.38 8 0 6 3.29 10 0 7 3.21 8 0 8 2.00 3 0 9 3.18 9 0 10 2.34
More informationIT 403 Practice Problems (2-2) Answers
IT 403 Practice Problems (2-2) Answers #1. Which of the following is correct with respect to the correlation coefficient (r) and the slope of the leastsquares regression line (Choose one)? a. They will
More informationName: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm
Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam June 8 th, 2016: 9am to 1pm Instructions: 1. This is exam is to be completed independently. Do not discuss your work with
More information(4) 1. Create dummy variables for Town. Name these dummy variables A and B. These 0,1 variables now indicate the location of the house.
Exam 3 Resource Economics 312 Introductory Econometrics Please complete all questions on this exam. The data in the spreadsheet: Exam 3- Home Prices.xls are to be used for all analyses. These data are
More informationStart with review, some new definitions, and pictures on the white board. Assumptions in the Normal Linear Regression Model
Start with review, some new definitions, and pictures on the white board. Assumptions in the Normal Linear Regression Model A1: There is a linear relationship between X and Y. A2: The error terms (and
More information10 Model Checking and Regression Diagnostics
10 Model Checking and Regression Diagnostics The simple linear regression model is usually written as i = β 0 + β 1 i + ɛ i where the ɛ i s are independent normal random variables with mean 0 and variance
More informationCHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS
CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS QUESTIONS 5.1. (a) In a log-log model the dependent and all explanatory variables are in the logarithmic form. (b) In the log-lin model the dependent variable
More informationChapter 14 Multiple Regression Analysis
Chapter 14 Multiple Regression Analysis 1. a. Multiple regression equation b. the Y-intercept c. $374,748 found by Y ˆ = 64,1 +.394(796,) + 9.6(694) 11,6(6.) (LO 1) 2. a. Multiple regression equation b.
More informationCorrelation and Regression
Correlation and Regression Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 Learning Objectives Upon successful completion of this module, the student should
More informationMultiple OLS Regression
Multiple OLS Regression Ronet Bachman, Ph.D. Presented by Justice Research and Statistics Association 12/8/2016 Justice Research and Statistics Association 720 7 th Street, NW, Third Floor Washington,
More informationConditions for Regression Inference:
AP Statistics Chapter Notes. Inference for Linear Regression We can fit a least-squares line to any data relating two quantitative variables, but the results are useful only if the scatterplot shows a
More informationUnivariate analysis. Simple and Multiple Regression. Univariate analysis. Simple Regression How best to summarise the data?
Univariate analysis Example - linear regression equation: y = ax + c Least squares criteria ( yobs ycalc ) = yobs ( ax + c) = minimum Simple and + = xa xc xy xa + nc = y Solve for a and c Univariate analysis
More informationMultiple Linear Regression
Multiple Linear Regression Simple linear regression tries to fit a simple line between two variables Y and X. If X is linearly related to Y this explains some of the variability in Y. In most cases, there
More informationMODELING. Simple Linear Regression. Want More Stats??? Crickets and Temperature. Crickets and Temperature 4/16/2015. Linear Model
STAT 250 Dr. Kari Lock Morgan Simple Linear Regression SECTION 2.6 Least squares line Interpreting coefficients Cautions Want More Stats??? If you have enjoyed learning how to analyze data, and want to
More informationChapter 14. Multiple Regression Models. Multiple Regression Models. Multiple Regression Models
Chapter 14 Multiple Regression Models 1 Multiple Regression Models A general additive multiple regression model, which relates a dependent variable y to k predictor variables,,, is given by the model equation
More information10. Alternative case influence statistics
10. Alternative case influence statistics a. Alternative to D i : dffits i (and others) b. Alternative to studres i : externally-studentized residual c. Suggestion: use whatever is convenient with the
More informationApplied Regression Analysis. Section 2: Multiple Linear Regression
Applied Regression Analysis Section 2: Multiple Linear Regression 1 The Multiple Regression Model Many problems involve more than one independent variable or factor which affects the dependent or response
More informationAnalysis of Variance
Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also
More informationUnit 7: Multiple linear regression 1. Introduction to multiple linear regression
Announcements Unit 7: Multiple linear regression 1. Introduction to multiple linear regression Sta 101 - Fall 2017 Duke University, Department of Statistical Science Work on your project! Due date- Sunday
More informationSociology 593 Exam 1 February 17, 1995
Sociology 593 Exam 1 February 17, 1995 I. True-False. (25 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. A researcher regressed Y on. When he plotted
More information1. Review of Lecture level factors Homework A 2 3 experiment in 16 runs with no replicates
Lecture 3.1 1. Review of Lecture 2.2 2-level factors Homework 2.2.1 2. A 2 3 experiment 3. 2 4 in 16 runs with no replicates Lecture 3.1 1 2 k Factorial Designs Designs with k factors each at 2 levels
More informationSTAT Chapter 11: Regression
STAT 515 -- Chapter 11: Regression Mostly we have studied the behavior of a single random variable. Often, however, we gather data on two random variables. We wish to determine: Is there a relationship
More informationMathematical Notation Math Introduction to Applied Statistics
Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor
More informationInterpreting the coefficients
Lecture Week 5 Multiple Linear Regression Interpreting the coefficients Uses of Multiple Regression Predict for specified new x-vars Predict in time. Focus on one parameter Use regression to adjust variation
More informationTable of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).
Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X 1.04) =.8508. For z < 0 subtract the value from
More informationProblem Set 1 ANSWERS
Economics 20 Prof. Patricia M. Anderson Problem Set 1 ANSWERS Part I. Multiple Choice Problems 1. If X and Z are two random variables, then E[X-Z] is d. E[X] E[Z] This is just a simple application of one
More informationHomework 6. Wife Husband XY Sum Mean SS
. Homework Wife Husband X 5 7 5 7 7 3 3 9 9 5 9 5 3 3 9 Sum 5 7 Mean 7.5.375 SS.5 37.75 r = ( )( 7) - 5.5 ( )( 37.75) = 55.5 7.7 =.9 With r Crit () =.77, we would reject H : r =. Thus, it would make sense
More informationConfidence Interval for the mean response
Week 3: Prediction and Confidence Intervals at specified x. Testing lack of fit with replicates at some x's. Inference for the correlation. Introduction to regression with several explanatory variables.
More information