MIDTERM 2 èver 1è. Name. 2. You must show your work and explain your answer in order to receive credit.

Similar documents
This document contains 3 sets of practice problems.

Stat 500 Midterm 2 12 November 2009 page 0 of 11

INFERENCE FOR REGRESSION

STAT 212 Business Statistics II 1

Model Building Chap 5 p251

Practice exam questions

Ch 13 & 14 - Regression Analysis

EX1. One way ANOVA: miles versus Plug. a) What are the hypotheses to be tested? b) What are df 1 and df 2? Verify by hand. , y 3

Apart from this page, you are not permitted to read the contents of this question paper until instructed to do so by an invigilator.

PART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable,

Question Possible Points Score Total 100

Department of Mathematics & Statistics Stat 2593 Final Examination 19 April 2001

SMAM 314 Exam 42 Name

THE AUSTRALIAN NATIONAL UNIVERSITY. Second Semester Final Examination November, Econometrics II: Econometric Modelling (EMET 2008/6008)

Correlation & Simple Regression

Models with qualitative explanatory variables p216

1. An article on peanut butter in Consumer reports reported the following scores for various brands

1. What does the alternate hypothesis ask for a one-way between-subjects analysis of variance?

Multiple Regression Examples

[4+3+3] Q 1. (a) Describe the normal regression model through origin. Show that the least square estimator of the regression parameter is given by

SMAM 319 Exam 1 Name. 1.Pick the best choice for the multiple choice questions below (10 points 2 each)

(1) The explanatory or predictor variables may be qualitative. (We ll focus on examples where this is the case.)

Basic Business Statistics, 10/e

Stat 231 Final Exam. Consider first only the measurements made on housing number 1.

COGS 14B: INTRODUCTION TO STATISTICAL ANALYSIS

Chapter 15 Multiple Regression

Annotated Exam of Statistics 6C - Prof. M. Romanazzi

Department of Mathematics & Statistics STAT 2593 Final Examination 17 April, 2000

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:

Final Exam Bus 320 Spring 2000 Russell

Examination paper for TMA4255 Applied statistics

Disadvantages of using many pooled t procedures. The sampling distribution of the sample means. The variability between the sample means

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

(4) 1. Create dummy variables for Town. Name these dummy variables A and B. These 0,1 variables now indicate the location of the house.

Sociology 593 Exam 2 March 28, 2002

4/22/2010. Test 3 Review ANOVA

Business Statistics 41000: Homework # 5

Math Section MW 1-2:30pm SR 117. Bekki George 206 PGH

Multiple Regression: Chapter 13. July 24, 2015

Chapter 14 Multiple Regression Analysis

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test, October 2013

MBA Statistics COURSE #4

15.8 MULTIPLE REGRESSION WITH MANY EXPLANATORY VARIABLES

Confidence Interval for the mean response

Ch. 3 Review - LSRL AP Stats

Trends in Human Development Index of European Union

Statistics and Data Analysis

1 Introduction to Minitab

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation

STATISTICS 110/201 PRACTICE FINAL EXAM

ECON Introductory Econometrics. Lecture 13: Internal and external validity

Sociology 593 Exam 2 Answer Key March 28, 2002

SMAM 319 Exam1 Name. a B.The equation of a line is 3x + y =6. The slope is a. -3 b.3 c.6 d.1/3 e.-1/3

Chapter 12 - Lecture 2 Inferences about regression coefficient

23. Inference for regression

TAMS38 Experimental Design and Biostatistics, 4 p / 6 hp Examination on 19 April 2017, 8 12

Chapter 12: Multiple Regression

Multiple Regression Methods

STAT 360-Linear Models

Concordia University (5+5)Q 1.

STAT 512 MidTerm I (2/21/2013) Spring 2013 INSTRUCTIONS

AP Statistics Bivariate Data Analysis Test Review. Multiple-Choice

Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010

STAT FINAL EXAM

Midterm 2 - Solutions

General Linear Model (Chapter 4)

y response variable x 1, x 2,, x k -- a set of explanatory variables

ECON 497 Midterm Spring

Modelling structural change using broken sticks

Inference for the Regression Coefficient

Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page!

MATH 10 SAMPLE FINAL EXAM. Answers are on the last page at the bottom

Chapter 9 - Correlation and Regression

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

The simple linear regression model discussed in Chapter 13 was written as

Chapter 10. Correlation and Regression. McGraw-Hill, Bluman, 7th ed., Chapter 10 1

Psych 230. Psychological Measurement and Statistics

STA220H1F Term Test Oct 26, Last Name: First Name: Student #: TA s Name: or Tutorial Room:

Difference in two or more average scores in different groups

LI EAR REGRESSIO A D CORRELATIO

Introduction to Regression

SMAM 314 Practice Final Examination Winter 2003

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Business 320, Fall 1999, Final

EXAM IN TMA4255 EXPERIMENTAL DESIGN AND APPLIED STATISTICAL METHODS

Midterm Exam 1, section 1. Tuesday, February hour, 30 minutes

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α

Pre-Calculus Multiple Choice Questions - Chapter S8

1. Least squares with more than one predictor

Basic Business Statistics 6 th Edition

ACOVA and Interactions

Six Sigma Black Belt Study Guides

MEI STRUCTURED MATHEMATICS 4767

SLOW LEARNERS MATERIALS BUSINESS MATHEMATICS SIX MARKS QUESTIONS

The following formulas related to this topic are provided on the formula sheet:

Confidence Intervals, Testing and ANOVA Summary

Inference for Regression Inference about the Regression Model and Using the Regression Line

STAT 350 Final (new Material) Review Problems Key Spring 2016

Transcription:

MIDTERM 2 èver 1è STA 110E Thursday, April 11, 1996. Notes: TA: Anand Krishna Heather Brunelle 1. This is an open book and open notes exam. Name 2. You must show your work and explain your answer in order to receive credit. 3. The exam has 6 problems. 4. The exam carries 100 points. 5. The points assigned to each problem are indicated at the beginning of that problem. Use them to plan your time. You have 75minutes to ænish. Problem Speed I Singers D&S Ask Speed II Demand Total Score è15 è20 è15 è15 è20 è15

1. ë15 ptsë Comparisons of typing speeds. èaè Ten candidates are selected at random and their typing speeds are measured prior to a short typing course èx 1 è. After the course typing speeds are measured again èx 2 è. The results are given in the table below: Candidate 1 2 3 4 5 6 7 8 9 10 Before èx 1 è 50 42 70 63 58 35 46 52 60 49 After èx 2 è 55 46 78 61 52 45 47 57 71 58 æ Test the hypothesis that the course is useful at the level æ =5è: èbè Assume now that the same measurements X 1 and X 2 are obtained from the subjects selected from two independent groups. From the group of candidates that did not attend the course 10 subjects are selected at random èmeasurements given in the X 1 -rowè. From the group of candidates that took the course 10 are selected at random, as well, and the measurements given in the X 2 -row are obtained. æ Test the hypothesis that the course is useful at the level æ =5è: Assume that the population variances èfor the scoresè are the same. æ Comment on the ændings in èaè and èbè. èone paragraph max.è Help yourself: r çx 1 =52:5; X2 ç èn =57:0; s 1 =10:43; s 2 =10:79;s X2,X 1 =8:91;s p = 1,1ès 2 1 +èn 2,1ès 2 2 n 1 +n 2 =,2 10:61; t 18;0:05 =1:734; t 19;0:05 =1:729; t 18;0:025 =2:101; t 9;0:05 =1:833; t 9;0:025 =2:262: 2

ë20 ptsë Do singers of diæerent voice parts have diæerent heights? The observations were made in 1979 on heights èin inchesè of male singers of the NY Choral Society. 1 voice tenor 1: 69 72 71 66 74 74 71 66 68 67 70 65 tenor 2: 68 73 69 71 69 76 71 69 71 66 69 71 bass 1: 72 70 72 69 73 71 72 68 68 71 66 68 bass 2: 72 75 67 75 74 72 72 74 72 72 74 76 MTB é oneway c1 c2; SUBCé tukey. ANALYSIS OF VARIANCE ON C1 SOURCE DF SS MS F p C2 3 86.90 28.97 4.39 0.009 ERROR 44 290.08 6.59 TOTAL 47 376.98 INDIVIDUAL 95 PCT CI'S FOR MEAN BASED ON POOLED STDEV LEVEL N MEAN STDEV -+---------+---------+---------+----- 1 12 69.417 3.088 è------*-------è 2 12 70.250 2.563 è------*-------è 3 12 70.000 2.174 è------*------è 4 12 72.917 2.353 è-------*------è -+---------+---------+---------+----- POOLED STDEV = 2.568 68.0 70.0 72.0 74.0 Family error rate = 0.0500 Individual error rate = 0.0105 Critical value = 3.78 Intervals for ècolumn level meanè - èrow level meanè 1 2 3 2-3.635 1.968 3-3.385-2.552 2.218 3.052 4-6.302-5.468-5.718-0.698 0.135-0.115 æ What test has been performed here? What are the hypotheses H 0 and H 1? æ What are general assumptions necessary for the above procedure to be valid. æ What is your decision about H 0 and H 1 at æ =0:05? æ Give the orderings of the population means according to the tukey procedure. æ Is H 0 : ç 2 = ç 4 a contrast? If yes, test it at æ =0:05: 1 Chambers et al. è1983è Graphical Methods for Data Analysis. 3

ë15 ptsë Drinking & Smoking Alcohol and nicotine consumption during pregnancy are believed to be associated with certain unwanted characteristics in newborn children. Since drinking and smoking behaviors may be related, it is important to understand the nature of this relationship when assessing the possible eæects of these variables on children. In one study, 2 362 mothers were classiæed according to their alcohol intake prior to pregnancy recognition and their nicotine intake during pregnancy. The data are summarized in the following table. Nicotine èmgèdayè Alcohol èouncesèdayè None 1-15 16 or more None 105 7 11.01-.10 58 5 13.11-.99 84 37 42 1.00 or more 57 16 17 The minitab output table from the procedure CHISQ is given below: C1 C2 C3 Total 1 105 58 84 247 51.86 111.22 2 7 5 37 49 16.65 10.29 22.06 3 11 13 42 66 22.43 13.86 29.72 Total 123 76 163 362 èiè What kind of test has been performed here? Explain è1-2 sentencesè. èiiè Express the hypotheses H 0 and H 1 in terms of the problem. èiiiè Find the missing number in the cell è1,1è. èivè The ç 2 statistic in this output is 42.052. Perform the test èfrom èièè at æ =0:05 and state your decision. 2 Data taken from A. P. Streissguth et al., ëintrauterine alcohol and nicotine exposure: attention and reaction time in 4-year-old children," Developmental Psychology, 20 è1984è, pp. 533-541. 4

ë15 ptsë When to ask? A researcher is studying the eæect on learning of inserting questions into instructional materials. There is some doubt whether these questions would be more eæective preceding or following the passage about which the question is posed. In addition, the researcher wonders if the eæect of the position of the question is the same for the factual questions and for questions that require the learner to compose a thoughtful and original response. A group of twenty-four students is split at random into four groups of six student each. One group is assigned to each of the four combinations of factor B, ëposition of question èbefore versus after the passageè," and factor A, ëtype of question èfactual versus thoughtprovokingè." After ten hours of studying under these conditions, the twenty four students are given a æfty-item test on the content of the instructional materials. The following test scores are obtained. MTB é anova B 1 B 2 14 23 31 28 A 1 29 26 26 27 30 17 35 32 27 21 36 29 A 2 20 26 39 31 15 24 41 35 score = position type Factor Type Levels Values position fixed 2 1 2 type fixed 2 1 2 Analysis of Variance for score Source DF SS MS F position 28.17 type 580.17 position*type 60.17 2.54 Error Total 1141.83 æ Complete the ANOVA table. æ Test the signiæcance of the interaction term in the model. Use æ = 5è. æ Can you test for the signiæcance of the factors A and B? Comment. 5

ë20 ptsë Speed Limits and Traæc Fatalities. The following table gives the speed limits èslè in 14 countries and traæc fatalities ètfè per 100,000,000 driver miles 3. MTB é regress c2 1 c1; SUBCé predict 80. The regression equation is TF = - 2.9 + 0.172 SL Country SL TF Country SL TF Italy 87 6.4 France 81 8.0 Hungary 75 14.5 Belgium 75 10.5 Portugal 75 22.5 Britain 70 4.0 Spain 62 12.4 Denmark 62 4.8 Netherlands 62 6.0 Greece 62 12.9 Japan 62 4.7 Norway 56 4.2 Poland 56 12.2 US 65 3.3 Predictor Coef Stdev t-ratio p Constant -2.86 10.66-0.27 0.793 SL 0.1721 0.1557 1.11 0.291 s = 5.296 R-sq = 9.2è R-sqèadjè = 1.7è Analysis of Variance SOURCE DF SS MS F p Regression 1 34.28 34.28 1.22 0.291 Error 12 336.61 28.05 Total 13 370.90 Fit Stdev.Fit 95è C.I. 95è P.I. 10.90 2.36 è 5.76, 16.05è è -1.73, 23.54è æ Comment on the output. How good is predictability of the model? æ Test H 0 : æ 0 =,10 versus the two sided alternative. Use æ =0:05: æ What is the 98è CI for the intercept æ 1? æ What is the 90è CI for the mean response for SL=80? 3 West Germany with TF=7.9 has no speed limit! 6

ë15 ptsë Modeling the demand function. From the Economic Report of the President, 1987 the following èpartialè table is obtained: G PG I PNC PUC PPT PD PN PS YR 60 129.7 0.925 6036 1.045 0.836 0.810 0.444 0.331 0.302 61 131.3 0.914 6113 1.045 0.869 0.846 0.448 0.335 0.307 62 137.1 0.919 6271 1.041 0.948 0.874 0.457 0.338 0.314 63 141.6 0.918 6378 1.035 0.960 0.885 0.463 0.343 0.320 64 148.8 0.914 6727 1.032 1.001 0.901 0.470 0.347 0.325 65 155.9 0.949 7027 1.009 0.994 0.919 0.471 0.353 0.332... where Variable G PG I PNC PUC PPT PD PN PS YR Total gasoline consumption è10s mill 1967 gasoline-doll.è Price index for gasoline Per capita real disposable index Price index for new cars Price index for used cars Price index for public transportation Aggregate price for durable goods Aggregate prices for non-durable goods Aggregate prices for consumer services Normalized year èsince 1900è The goal is to model the total gasoline consumption ègè using other variables as predictors. æ From the breg procedure output propose your favorite model. Give an explanation for your choice è2-3 sentencesè. MTB é breg c1 9 c2-c10 Best Subsets Regression of G P P P Adj. P N U P P P P Y Vars R-sq R-sq C-p s G I C C T D N S R 1 97.6 97.5 26.3 1.2458 X 1 91.5 91.2 99.7 2.3601 X 2 99.7 99.7 10.7 0.42500 X X 2 99.7 99.7 16.4 0.46224 X X 3 99.8 99.8 6.7 0.39086 X X X 3 99.7 99.7 8.0 0.42935 X X X 4 99.8 99.8 5.1 0.36162 X X X X 4 99.8 99.8 5.6 0.36677 X X X X 5 99.8 99.8 5.2 0.35271 X X X X X 5 99.8 99.8 6.1 0.36230 X X X X X 6 99.8 99.8 6.0 0.35152 X X X X X X 6 99.8 99.8 6.3 0.35352 X X X X X X 7 99.9 99.8 6.6 0.35602 X X X X X X X 7 99.9 99.8 6.6 0.35610 X X X X X X X 8 99.9 99.8 8.0 0.35983 X X X X X X X X 8 99.9 99.8 8.4 0.36412 X X X X X X X X 9 99.9 99.8 10.0 0.37008 X X X X X X X X X 7

æ The full model is given below. Comment on the output. The regression equation is G = 35.6-0.0172 PG - 1.29 I + 0.00361 PNC - 8.23 PUC - 1.21 PPT + 1.34 PD + 30.9 PN + 6.7 PS - 2.3 YR Continue? y Predictor Coef Stdev t-ratio p VIF Constant 35.640 5.792 6.15 0.000 PG -0.01717 0.02534-0.68 0.507 23.9 I -1.287 1.186-1.09 0.293 9.0 PNC 0.0036121 0.0009875 3.66 0.002 19.2 PUC -8.225 4.542-1.81 0.088 7.4 PPT -1.2082 0.8532-1.42 0.175 13.3 PD 1.344 1.337 1.01 0.329 11.8 PN 30.85 10.16 3.04 0.007 37.6 PS 6.70 10.28 0.65 0.523 79.0 YR -2.30 17.62-0.13 0.898 99.5 s = 0.3701 R-sq = 99.9è R-sqèadjè = 99.8è Analysis of Variance SOURCE DF SS MS F p Regression 9 1635.67 181.74 1326.97 0.000 Error 17 2.33 0.14 Total 26 1638.00 æ Test H 0 : æ PUC =,20 at æ =0:01: 8